The present disclosure generally relates to classes of compounds that bind the chromo-barrel domain of AT-rich interactive domain 4B (ARID4B).
Estrogen receptor alpha (ERα) plays a major role in the development and progression of breast cancer. ERα is activated upon binding of 17-β-estradiol (E2) to the ligand binding domain (LBD), resulting in the conformational change and recruitment of crucial coactivators for transcriptional activation of estrogen response element (ERE)-containing target genes. More than 70% of breast cancers are ERα+ and are treatable with endocrine therapies that inhibit estrogen biosynthesis or ERα activity. However, not all patients with ERα+ cancer respond to endocrine therapy and nearly all ERα+ metastatic cancers that initially respond to endocrine therapy will eventually become endocrine therapy-resistant, hormone-independent cancers. There is an urgent need for new and more effective therapies.
AT-rich interactive domain 4B (ARID4B), also known as retinoblastoma binding protein 1-like 1 (RBP1L1), is a member of the ARID family and a vital component of the SIN3A/HDAC1 chromatin remodeling complex. It has also been implicated in breast cancer progression and metastasis, as well as prostate cancer. The ARID4B protein possesses three key domains: the ARID domain for putative DNA binding activity, the chromobarrel domain, and the tudor domain. The latter two bind methylated histones and play critical roles as molecular adaptors in the assembly of the chromatin remodeling complexes. These complexes are associated with endocrine resistance in breast cancer, and likewise ARID4B is highly expressed in human breast cancers. Compelling evidence has shown that ARID4B is a therapeutic target, as applied to endocrine resistance in breast cancers. Targeting the chromobarrel or tudor domain of ARID4B perturbs the chromatin remodeling complexes, in turn effectively disrupting its role in breast cancer progression and endocrine resistance, providing an alternative means of eliminating breast cancer.
The present disclosure generally relates to classes of compounds that bind the chromo-barrel domain of AT-rich interactive domain 4B (ARID4B). Large-scale genomic analyses of TCGA and other breast cancer datasets showed that ARID4B is amplified in breast cancer (up to 22%). Genome-wide transcriptome analysis and IHC confirmed that expression of ARID4B is elevated in breast cancer cohorts. Clinical data clearly showed that elevated ARID4B expression is associated with high grade tumors and unfavorable clinical outcomes in patients with ERα+ breast cancer treated with systemic endocrine therapies. It has also been identified that ARID4B interacts with and activates not only the wild-type ERα, but also the constitutively active ERα Y537S and D538G mutants in a ligand-independent manner. Results from a novel mammary gland-specific Arid4b knockout mouse model show that knockout of Arid4b inhibisd mammary gland tumorigenesis, suggesting a causative role of Arid4b in breast cancer. Collectively, these data provide compelling evidence to support a critical role for ARID4B in the tumorigenesis and therapy resistance of breast cancer, and a strong rationale for developing inhibitors for pharmacological inhibition of ARID4B.
The rational development of a small molecule inhibitor (SMI) of a given protein-protein pathway involves a multidisciplinary approach: computational studies, chemical synthesis, characterization of compounds, and biological characterization. ARID4B lacks a defined crystal structure which requires a heavier emphasis on in silico studies to develop SMIs for this pathway. This multidisciplinary approach is used herein to rationally design small molecule inhibitors of ARID4B protein, ARD150, which targets the chromodomain as seen in
In some aspects, the present disclosure concerns derivatives from the base compound ARD150, the structure of which is set forth in Formula I:
As set forth in
In developing more potent analogs of ARD150, two structural modifications were considered: modification of the aromatic core and the solubilizing tail. The aromatic core modification seeks to enhance pi-pi interaction with amino acid residues in the chromobarrel domain of the ARID4B. Modification of the solubilizing tail seeks to increase the number of hydrogen bond donors on the tail, thereby increasing the number of hydrogen bond interactions in the protein pocket of the chromobarrel domain.
In some aspects, the derivatives may include modifications to the benzene ring of the head group identified in
In some aspects, the head group is selected from one of Formula Ia, Ib, Ic, and Id:
In some aspects, the present disclosure concerns a first class or grouping of ARD150 derivatives that comprise spacer between the benzene ring and the amide of the head group as identified in
With respect to Formula II, in some aspects, the spacer is null or —OCH2. In some aspects, R1 is null, H, or CH3. In some aspects, R2 is H or CH3. In some aspects, R0 is selected from:
or a piperazine with the nitrogen of the depicted amide.
In some aspects, the compound of the first class or grouping of compounds (Class I) is selected from compounds 1a-1e as set forth in
In some aspects, Class I analogs are characterized by modifications of the “tail” section (e.g., 1a-1e in
In some aspects, the compound may have the structure as follows:
In some aspects, the compound is 1d as set forth in
In some aspects, the compound may have the structure as follows:
In some aspects, the compound is referred to as ARD153 herein.
In some aspects, the composition of Formula II bookends a diamine with a basic structure as set forth in Formula III:
In some aspects, each R3 is independently null, C6H5OC6H5CH2, or CH3. In some aspects, X is null, O, or S—S. In certain aspects, the diamine is selected from the following structures:
In some aspects, the bookended or symmetrical compounds are set forth in Formula IV:
With respect to Forumla IV, in some aspects, each spacer is independently null or OCH2. In some aspects, each R1 is independently null, H, or CH3. In some aspects, each R2 is independently H or CH3. In some aspects, each R3 is independently null, C6H5OC6H5CH2, or CH3. In some aspects, X is null, O, or S—S. The overall scheme for the compounds of the second class or grouping (Class II) is set forth below:
In some aspects, the second grouping (Class II) of compounds are C2-symmetric compounds based on the same scaffolds as Class I analogs (
In some aspects, the present disclosure concerns a Class I compound reacted with a diamine. In other aspects, the present disclosure concerns one of Formulas Ia, Ib, Ic, or Id linked through an amide to a diamine as set forth in Formula III. In some aspects, the present disclosure concerns one of Formulas Ia, Ib, Ic, or Id linked through an amide to a diamine as set forth in IIIl, IIIb, IIIc, IIId, IIle, or IIIf.
In certain aspects, the present disclosure concerns the ARD150 derivatives of the structures 2a, 2b, 2c, 2d, 2e, 2f, 2g, 2h, 2i, and 2j as set forth in
In some aspects, the compound has the following structure:
In some aspects, the compound is referred to as 2b as set forth in
It should be understood that the present disclosure not only includes the compounds as described herein, but also salts or deivatives thereof. For example, for solubility and/or administration purposes, it can be desirable to produce a salt form. Such can include ionization of the compound and pairing with a cation/anion to provide the salt. Cations may include aluminum, arginine, lysine, benzathine, magnesium, histidine, lithium, meglumine, potassium, sodium, procaine, triethylaminje, zinc, ethylenediamine, ethanolamine, diethanolamine, choline, chloroprocaine, and calcium. Anions may include, acetate, chloride, aspartate, lactobionate, malate, maleate, mandelate, mesylate, benzenesulfonate, benzoate, besylate, bicarbonate, bitartrate, bromide, methylsulfate, napsylate, nitrate, octanoate, oleate, pamoate, pantothenate, citrate, camsylate, carbonate, decanoate, edetate, esylate, phosphate, polygalacturonate, propionate, fumarate, gluceptate, gluconate, glutamate, glycolate, salicylate, stearate, succinate, sulfate, tartrate, teoclate, tosylate, lactate, isethionate, hexanoate, hydroxynaphthoate, and iodide.
It will also be appreciated that the present disclosure also includes pharmacuetical compositions that include the compounds as set forth herein. Such may include a crystal or amorphous form of the compound or a salt thereof. Such may include additional materials, such as excipients, carriers, surfactants, other active compounds, flavoring agents, vitamins, minerals, and the like. Compositions may comprise other ingredients, known per se by one of ordinary skill in the art, such as pharmaceutically acceptable carriers, excipients, diluents, adjuvants, freeze drying stabilizers, wetting or emulsifying agents, pH buffering agents, gelling or viscosity enhancing additives, and preservatives, depending on the route of administration. Such are described in further detail in Remington: The Science and Practice of Pharmacy, Lippincott Williams & Wilkins, 22nd Ed., 2012; and Ansel's Pharmaceutical Dosage Forms and Drug Delivery Systems, 10th Ed., Philadelphia, PA, 2013.
Examples of pharmaceutically acceptable carriers, excipients or diluents include, but are not limited to demineralised or distilled water; saline solution; vegetable based oils such as peanut oil, arachis oil, safflower oil, olive oil, cottonseed oil, maize oil, sesame oil, or coconut oil; silicone oils, including polysiloxanes, such as methyl polysiloxane, phenyl polysiloxane and methylphenyl polysolpoxane; volatile silicones; mineral oils such as light liquid paraffin oil, or heavy liquid paraffin oil; squalene; cellulose derivatives such as methylcellulose, ethylcellulose, carboxymethylcellulose, carboxymethylcellulose sodium salt, or hydroxypropyl methylcellulose; lower alkanols, for example ethanol or isopropanol; lower aralkanols; lower polyalkylene glycols or lower alkylene glycols, for example polyethylene glycol, polypropylene glycol, ethylene glycol, propylene glycol, 1,3-butylene glycol or glycerin; fatty acid esters such as isopropyl palmitate, isopropyl myristate or ethyl oleate; polyvinylpyrrolidone; agar; carrageenan; gum tragacanth or gum acacia; and petroleum jelly. Typically, the carrier or carriers will form from 10% to 99.9% by weight of the vaccine composition and may be buffered by conventional methods using reagents known in the art, such as sodium hydrogen phosphate, sodium dihydrogen phosphate, potassium hydrogen phosphate, potassium dihydrogen phosphate, a mixture thereof, and the like.
The route of administration can be oral, sublingual, intranasal, transdermal (i.e., applied on or at the skin surface for systemic absorption), ocular, percutaneous, via mucosal administration, or via a parenteral route (intradermal, intramuscular, subcutaneous, intravenous, or intraperitoneal).
The terms “pharmaceutically-acceptable,” “physiologically-tolerable,” and grammatical variations thereof, as they refer to compositions, carriers, excipients, and reagents, are used interchangeably and represent that the materials are capable of administration to or upon a subject without the production of undesirable physiological effects to a degree that would prohibit administration of the composition. For example, “pharmaceutically-acceptable excipient” means, for example, an excipient that is useful in preparing a pharmaceutical composition that is generally safe, non-toxic, and desirable, and includes excipients that are acceptable for veterinary use as well as for human pharmaceutical use. Such excipients can be solid, liquid, semisolid, or, in the case of an aerosol composition, gaseous. A person of ordinary skill in the art would be able to determine the appropriate timing, sequence and dosages of administration for particular compositions of the present disclosure.
In aspects, preferred examples of such carriers or diluents include, but are not limited to, water, saline, Ringer's solutions, dextrose solution, and 5% human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils can also be used. The use of such media and compounds for pharmaceutically active substances is well known in the art.
Solutions or suspensions used for parenteral, intradermal, or subcutaneous application can include, but are not limited to, the following components: a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial compounds such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating compounds such as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, and compounds for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. Examples of excipients can include starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica gel, water, ethanol, DMSO, glycol, propylene, dried skim milk, and the like. The composition can also contain pH buffering reagents, and wetting or emulsifying agents.
As set forth in the working examples herein, it is demonstrated that ARID4B can be responsible for activation of estrogen receptor (ER) signaling, primarily through the ERα. As is also demonstrated herein, targeting the ARID4B gene or expressed gene or the activity thereof can affect ER activity. Accordingly, it is an aspect of the disclosure that methods of reducing or inhibiting ARID4B activity also negatively impact ER activity. Such can be achieved with techniques such as with the administration of siRNA, dsRNA, antibody or actrive fragment(s) thereof or similar.
It is also an aspect of the present disclosure that administration of the compounds as set forth herein impacts ARID4B activity and/or ER activity and/or ERα activity.
In some aspects, the present disclosure concerns administering one or more of the ARD150 derived compounds as set forth herein to a cell. In some aspects, the cell is in vitro. In some aspects, the cell is in vivo. In some aspects, one or more of the ARD150 derived compounds as set forth herein is administered to a subject. As identified herein, ARD150 binds to and inhibits ARDI4B and/or the chromobarrel thereof. Through the identification of derivatives of ARD150 as set forth herein, their application as improved inhibitors and/or more potent inhibitors through their administration to a cell and/or a subject is also contemplated. In some aspects, the present disclosure concerns administration of an effective amount of an ARD150 derivative, wherein an effective amount include an effective dose (ED) or effective concentration (EC) such that the desired response of ARID4B inhibition is achieved. In some aspects, the amount may be of about the ED50 or EC50 or that amount wherein 50% of a studied population demonstrates the desired ARID4B inhibition. In some aspect, the amount administered is above the minimum effective dose, but below the maximum tolerated dose. Such amounts may very between derivatives, but each can be readily determined through straightforward analysis.
In some aspects, the present disclosure concerns administration of at least one of compounds 1a, 1b, 1c, 1d, 1e, 2a, 2b, 2c, 2d, 2e, 2f, 2g, 2h, 2i, 2j, or a combination thereof to a cell, such as a cell within a subject. In some aspects, the compounds can be administered in their native form or as a salt thereof. Administration may include the additional presence of a pharmaceutically effective carrier. Routes of administration in vivo may include oral, sublingual, subdermal, intravenous, intramuscular, inhalation, or other as understood in such endeavors. Examples of acceptable carriers and the like can be found at Remington: The Science and Practice of Pharmacy, 23rd Edition, A. Adejare ed., 2020.
The methods may include administration or providing one or more of the compoisitions as set forth herein to a cell or a subject. In some aspects, the compositions of the present disclosure are provided to a human subject.
In aspects, the compositions of the present disclosure may be administered alone or as a pharmaceutical compoisition. The compositions may be administered by routes such as oral, sublingual, intranasal, transdermal (i.e., applied on or at the skin surface for systemic absorption), ocular, percutaneous, via mucosal administration, or via a parenteral route (intradermal, intramuscular, subcutaneous, intravenous, or intraperitoneal).
As also disclosed herein, it is an aspect that ARID4B is upregulated or overexpressed or overactive in some types of cellular dysplasia, including oncogenesic cells and cancerous cells. Administration of the compounds and compositions described herein can negatively impact hyperplasia, dysplasia, oncogenic growth or tumor growth. The compounds of the present disclosure can be administered alone or in combination with one or more other therapeutic compounds. It will be apparent that as the present disclosure identifies ARID4B aberrant activity in breast tiossue, combining with other active agents for breast cancers can be of a benefit. It should also be apparent that ocmbination with one or more pro-apoptotic agents and/or chemotherapeutic agents and/or antibodies/fragments thereof and/or radiation treamtments. Such can be administered together, independently, over synchronized time courses, and/or over individualized time courses.
A 1st aspect, either alone or in combination with any other aspect herein concerns an ARD150 derivative compound comprising the structure as set forth in Formula (II),
or a salt thereof wherein the spacer is null or —OCH2; R1 is null, H, or CH3; R2 is H or CH3; and, R0 is selected from:
or a piperazine with the nitrogen of the depicted amide.
A 2nd aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 1st aspect, wherein the compound is selected from 1a-1e:
or a salt thereof.
A 3rd aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 1st or 2nd aspect, wherein the compound comprises
or a salt thereof.
A 4th aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 1st aspect, wherein the composition of Formula II bookends a diamine with a basic structure as set forth in Formula III:
A 5th aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 4th aspect, wherein each R3 is independently null, C6H5OC6H5CH2, or CH3.
A 6th aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 4th aspect, wherein X is null, O, or S—S.
A 7th aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 4th aspect, wherein the diamine is selected from the following structures:
A 8th aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 7th aspect, wherein the bookended or symmetrical compounds are set forth in Formula IV:
wherein each spacer is independently null or OCH2; each R1 is independently null, H, or CH3; each R2 is independently H or CH3; each R3 is independently null, C6H5OC6H5CH2, or CH3; and, X is null, O, or S—S.
A 9th aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 7th aspect, wherein the compounds is selected from compounds 2a-2f as follow:
or a salt thereof.
A 10th aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 1st or 7th aspect, wherein the compounds comprises
or a salt thereof.
An 11th aspect, either alone or in combination with any other aspect herein concerns the ARD150 derivative compound of the 1st aspect, wherein the compounds comprises
or a salt thereof.
A 12th aspect, either alone or in combination with any other aspect herein concerns a pharmecutical composition comprising the compouind of any of apects 1 to 11 and a pharmaceutically acceptable carrier.
A 13th aspect, either alone or in combination with any other aspect herein concerns the pharmaceutical composition of the 12th aspect, further compriusing an excipient.
A 14th aspect, either alone or in combination with any other aspect herein concerns a method for targeting AT-rich interactive domain 4B (ARID4B) in a cell, comprising administering the ARD150 derivative compound of any one of aspects 1-13 to a cell.
A 15th aspect, either alone or in combination with any other aspect herein concerns the method of the 14th aspect, wherein the cell is a cancer cell.
A 16th aspect, either alone or in combination with any other aspect herein concerns the method of the 14th aspect, wherein the cell is in vivo.
A 17th aspect, either alone or in combination with any other aspect herein concerns a method for treating cellular dysplasia in breast tissue of a subject comprising administering the compound of any of aspects 1 to 12 to the subject.
A 18th aspect, either alone or in combination with any other aspect herein concerns the method of the 17th aspect, wherein the compound is administered by a route selected from parenteral, topical, intravenous, oral, subcutaneous, sublingual, intraarterial, intradermal, transdermal, rectal, intracranial, intrathecal, intraperitoneal, intranasal; vaginally; intramuscular route or as inhalants.
ARID4B is amplified and expression is elevated in breast cancer.
Analysis of breast cancer genomes in The Cancer Genome Atlas (TCGA) and other breast cancer datasets showed that ARID4B is amplified in breast cancer (up to 22%) (
Kaplan-Meier survival analysis revealed that elevated ARID4B expression is associated with decreased recurrence-free survival (RFS), particularly evident in patients with ERα+ and not ERα− breast cancers (
To determine whether ARID4B is involved in ERα signaling, RNA-seq and gene set enrichment analysis (GSEA) were performed. The results showed the ER□ signaling pathway to be the top pathway affected by knockdown of ARID4B in MCF7 cells (
It was next determined whether there exists a causal role of ARID4B in mammary gland tumorigenesis in vivo, using MMTV-Cre; MMTV-Erbb2 mice (JAX) that express Erbb2 (Erbb2MGOE) and develop mammary gland tumors. Erbb2MGOE mice were crossed with Arid4bflox/flox mice to generate mice expressing Erbb2 but carrying a mammary gland-specific Arid4b deletion (Erbb2MGOEArid4bMG−/−) (
These results identified ARID4B as a novel coactivator for ERα and show that mammary gland-specific ablation of Arid4b in mice inhibits tumorigenesis. Interestingly, ARID4B not only activated the wild-type ERα, but also interacted with constitutively active ERα mutants in a ligand-independent manner and stimulated their activity to the full extent as in the presence of E2. Collectively, these results strongly suggest that activation of ERα by ARID4B may drive breast cancer development and progression to antiestrogen resistant cancer.
ARID4B knockout (ARID4B KO) cells were generated in ERα+ breast cancer cells, including the E2-dependent MCF7 and T47D (
ARID4B knock-in mice that overexpress ARID4B in a mammary gland-specific manner (ARID4BMGOE) (
ARID4B contains a Tudor domain, a RBBP1 N-terminal domain (RBB1NT, also known as PWWP domain for the conserved Pro-Trp-Trp-Pro motif), an ARID domain (a putative DNA binding domain), and a chromo domain at its N-terminus. Tudor and chromo domain bind methylated histone. The function of RBB1NT domain remains unclear, but it was proposed to mediate protein-protein interaction and binding of methylated histone. To determine which domain(s) is important for the biological function of ARID4B, mutants devoid of the Tudor domain (ΔT, aa58-113), RBB1NT (ΔR, aa170-262,), ARID domain (ΔA, aa311-394), and chromo domain (ΔC, aa571-624) were created by site-directed mutagenesis (confirmed by Sanger sequencing,
It was then undertaken to identify compounds that inhibit ARID4B, focusing on the chromodomain therein. First, the structure of ARID4B was created using a homology model based on an optimized computational platform that combines the iterative threading assembly refinement (TASSER) and SWISSMODEL (
These data summarize different approaches to synthesize and characterize the two classes (Class I and II) of novel, potent analogs of ARD150. It will be appreciated that through the use of the identified sets of “head” and “tail” groups, a wide variety of analogs can be created.
To a doubly purged Schlenk flask equipped with a stir bar was added the acid (1 equiv.), diamine (1 equiv.), EDC (2.5 equiv.), HOBt (2.5 equiv.), DIPEA (2.5 equiv.), and DMF (5 mL). This process formed an off-white heterogeneous mixture, which was then allowed to continually stir at room temperature under inert conditions for 24h.
The mixture was diluted with water, centrifuged, and extracted with DCM (5 mL×2), then the combined organic layers were dried over anhydrous magnesium sulfate and filtered. The resultant filtrate was removed in vacuo to afford the crude amide, which was then purified via chromatography (silica; 1-5% methanol in DCM (mobile phase)) to yield the N-Boc intermediate compound.
To a solution of the starting material (1 equiv.) in dichloromethane (5 mL) was added trifluoroacetic acid (5 mL, approx. 10 eq), and the mixture was stirred at room temperature for one hour. After this time, the reaction was concentrated under reduced pressure to remove most of the excess TFA.
The resulting viscous oil was dissolved in dichloromethane (12 mL), washed several times with diethyl ether (ca. 30 mL), and concentrated to yield an off-white precipitate. To this precipitate was added 20 mL of saturated sodium bicarbonate solution, then extracted with DCM, dried with magnesium sulfate, filtered, and concentrated under reduced pressure to yield the target compound as an off-white solid.
A doubly purged Schlenk flask equipped with a stir bar was added the acid (2 equiv.), diamine (1 equiv.), EDC (2.5 equiv.), HOBt (2.5 equiv.), DIPEA (2.5 equiv.), and DMF (5 mL). This process formed an off-white heterogeneous mixture, which was then allowed to continually stir at room temperature under inert conditions for 24h.
The mixture was diluted with water, centrifuged, and extracted with DCM (5 mL×2), then the combined organic layers were dried over anhydrous magnesium sulfate and filtered. The resultant filtrate was removed in vacuo to afford the crude amide, which was then purified via chromatography (silica; 1-5% methanol in DCM (mobile phase)) to yield the target compound.
To a round bottom flask equipped with a stir bar was added the acid (2 equiv.), aryl diamine (1 equiv.), HATU (2.5 equiv.), and DMF (5 mL) This formed an off-white heterogeneous mixture, which was then allowed to stir at room temperature for 24h continually.
The mixture was diluted with water and extracted with DCM (5 mL×2), and the combined organic layers were dried over anhydrous magnesium sulfate and filtered. The resultant filtrate was removed in vacuo to afford the crude amide, which was then purified via chromatography (silica; 30% ethyl acetate (mobile phase)) to yield the target compound.
Reactions were monitored throughout via (silica) TLC with a workable solvent system as the mobile phase (often either 30-50% ethyl acetate in hexane or 1-5% methanol in DCM seemed to work). Likewise, compounds were purified using corresponding column chromatographic methods using the same solvent system for its TLC. A given compound was deemed synthesized if 1H and 13C NMR was on point. Each compound was deemed pure and ready for biological studies if its purity (determined via HPLC) was greater than or equal to 97%.
As identified herein, an iterative scoring process was used to develop new variants of ARD150 to minimize the computational resources required. Analogs of ARD150 were generated by optimizing the critical interactions within the chromobarrel domain pocket, which were then re-docked and scored. The highest 1% of these compounds were further studied to become scaffolds. Molecular modeling of a library of compounds against the ARID4B protein revealed three novel classes of compounds better at targeting the protein than ARD150. In addition, docking results and structure-docking score relationships have shown that certain modifications could improve affinity for the ARID4B protein, as shown in
N-Boc Intermediate for Compound 1a:
1H NMR (400 MHZ, Chloroform-d) δ 7.46 (s, 1H), 7.33 (d, J=7.9 Hz, 1H), 6.85 (d, J=8.4 Hz, 1H), 3.91 (d, J=5.9 Hz, 6H), 3.66 (s, 8H), 3.54 (t, J=5.3 Hz, 2H), 3.29 (t, J=5.2 Hz, 2H), 1.42 (s, 9H).
13C NMR (101 MHz, CDCl3) δ 167.21, 156.07, 151.74, 148.94, 127.06, 119.55, 110.78, 110.25, 79.55, 70.21, 56.01, 39.90, 28.40.
Compound 1a:
1H NMR (400 MHZ, Chloroform-d) 7.44-7.31 (m, 3H), 6.77 (d, J=8.2 Hz, 1H), 6.12 (s, 1H), 3.81 (s, 6H), 3.66-3.48 (m, 10H), 3.03 (dd, J=7.5, 4.3 Hz, 2H).
13C NMR (101 MHz, CDCl3) δ 168.28, 152.01, 148.87, 126.20, 120.25, 110.45, 70.08, 70.03, 69.93, 66.55, 55.99, 54.10, 42.29, 39.90, 39.62
N-Boc Intermediate of Compound 1b:
1H NMR (400 MHZ, Chloroform-d) δ 7.35 (d, J=11.6 Hz, 2H), 6.90 (d, J=8.4 Hz, 1H), 6.60 (s, 1H), 5.04 (s, 1H), 4.30 (d, J=2.3 Hz, 4H), 3.66 (d, J=6.7 Hz, 8H), 3.58 (t, J=5.2 Hz, 2H), 3.32 (s, 2H), 1.46 (s, 9H).
13C NMR (101 MHz, CDCl3) δ 166.75, 156.01, 146.40, 143.29, 127.89, 120.51, 117.21, 116.47, 79.41, 70.28, 64.54, 64.21, 40.40, 39.65, 28.41.
Compound 1b:
1H NMR (400 MHZ, DMSO-d6) δ 8.36 (s, 1H), 7.87 (s, 3H), 7.37 (d, J=14.2 Hz, 2H), 6.91 (d, J=8.3 Hz, 1H), 4.28 (s, 4H), 3.62-3.49 (m, 8H), 3.39 (q, J=5.9 Hz, 2H), 2.96 (d, J=5.3 Hz, 2H).
13C NMR (101 MHz, DMSO) δ 159.03, 158.68, 146.42, 143.35, 127.87, 121.06, 117.17, 116.68, 70.15, 69.37, 67.14, 64.78, 64.45.
Compound 1c:
1H NMR (400 MHZ, Chloroform-d) δ 7.34 (s, 1H), 7.31 (s, 1H), 6.88 (d, J=8.4 Hz, 1H), 6.64 (s, 1H), 4.29-4.25 (m, 4H), 3.71-3.54 (m, 12H), 3.41 (t, J=5.1 Hz, 2H), 3.34-3.25 (m, 4H), 3.15 (s, 1H), 1.54 (d, J=9.2 Hz, 9H).
13C NMR (101 MHz, CDCl3) δ 166.74, 157.52, 146.44, 143.33, 127.82, 120.48, 117.23, 116.48, 70.52, 70.23, 69.82, 64.54, 64.22, 47.96, 45.15, 40.65, 39.68, 25.78, 25.45, 24.77, 24.27.
Compound 1d:
1H NMR (400 MHZ, Chloroform-d) δ 7.37-7.22 (m, 5H), 6.81 (s, 4H), 4.94 (s, 2H), 4.57 (s, 2H), 3.57-3.45 (m, 4H), 3.34 (d, J=11.3 Hz, 4H), 1.39 (s, 9H).
13C NMR (101 MHZ, CDCl3) δ 166.96, 154.51, 153.73, 152.00, 137.10, 128.58, 127.96, 127.49, 115.97, 115.53, 80.39, 76.73, 70.64, 68.54, 45.34, 42.00, 28.38.
Compound 1e:
1H NMR (400 MHZ, DMSO-d6) δ 8.91 (s, 1H), 7.49-7.31 (m, 5H), 6.98-6.84 (m, 4H), 5.04 (s, 2H), 4.79 (s, 2H), 3.66 (s, 5H), 3.13 (d, J=25.9 Hz, 4H).
13C NMR (101 MHz, DMSO) δ 166.87, 158.92, 158.57, 153.17, 152.54, 137.79, 128.87, 128.21, 128.06, 116.02, 70.09, 66.77.
Compound 2a:
1H NMR (400 MHZ, Chloroform-d) δ 8.11 (d, J=8.4 Hz, 2H), 7.98 (dd, J=8.5, 2.1 Hz, 2H), 7.69 (d, J=2.0 Hz, 2H), 7.58-7.53 (m, 2H), 7.50-7.43 (m, 4H), 7.03 (d, J=8.5 Hz, 2H), 4.02 (s, 6H), 3.98 (s, 6H).
13C NMR (101 MHz, CDCl3) δ 162.46, 155.22, 149.32, 143.61, 128.93, 128.71, 125.64, 124.83, 120.58, 116.70, 112.55, 110.88, 108.47, 56.24.
Compound 2b:
1H NMR (400 MHZ, DMSO-d6) δ 9.67 (s, 2H), 7.69 (d, J=8.4 Hz, 2H), 7.61 (s, 2H), 7.47 (s, 4H), 7.30 (s, 1H), 7.10 (d, J=8.6 Hz, 2H), 3.85 (s, 12H), 2.26 (s, 12H).
13C NMR (101 MHz, DMSO) δ 164.97, 152.03, 138.40, 136.57, 135.39, 126.96, 126.34, 121.25, 111.49, 56.07, 18.75.
Compound 2c:
1H NMR (400 MHZ, Chloroform-d) δ 8.10 (d, J=8.4 Hz, 2H), 7.98 (dd, J=8.5, 2.1 Hz, 2H), 7.69 (d, J=2.0 Hz, 2H), 7.58-7.53 (m, 2H), 7.50-7.42 (m, 4H), 7.03 (d, J=8.5 Hz, 2H), 4.02 (s, 6H), 3.98 (s, 6H).
13C NMR (101 MHz, CDCl3) δ 162.47, 155.22, 149.32, 143.59, 128.71, 125.64, 124.84, 120.57, 116.70, 112.55, 110.88, 108.47, 56.32.
Compound 2d:
1H NMR (400 MHZ, DMSO-d6) δ 10.09 (s, 2H), 7.75 (d, J=9.1 Hz, 4H), 7.64-7.58 (m, 2H), 7.53 (d, J=2.0 Hz, 2H), 7.08 (d, J=8.6 Hz, 2H), 7.01 (d, J=9.0 Hz, 4H), 3.84 (s, 12H).
13C NMR (101 MHZ, DMSO) δ 165.23, 153.25, 152.07, 148.77, 135.22, 127.39, 122.67, 121.44, 119.04, 111.45, 56.10.
Compound 2e:
1H NMR (400 MHZ, DMSO-d6) δ 8.77 (s, 1H), 8.55 (s, 1H), 7.52 (dd, J=8.4, 4.4 Hz, 4H), 7.45 (d, J=2.0 Hz, 2H), 7.43 (d, J=2.1 Hz, 2H), 7.38 (d, J=2.0 Hz, 3H), 6.94 (d, J=8.4 Hz, 3H), 4.31 (dd, J=4.4, 3.0 Hz, 8H).
13C NMR (101 MHz, DMSO) δ 167.19, 151.60, 147.92, 143.47, 140.08, 135.11, 129.34, 124.21, 123.41, 121.20, 118.60, 117.52, 64.89, 64.36.
Compound 2f:
1H NMR (400 MHZ, DMSO-d6) δ 8.78 (s, 1H), 8.55 (s, 1H), 7.52 (dd, J=8.4, 4.4 Hz, 4H), 7.44 (dd, J=8.4, 2.1 Hz, 5H), 7.38 (d, J=2.0 Hz, 4H), 6.94 (d, J=8.4 Hz, 5H), 4.33-4.30 (m, 8H).
13C NMR (101 MHz, DMSO) δ 167.19, 151.61, 147.92, 143.47, 140.09, 135.11, 129.34, 124.21, 123.41, 121.20, 118.60, 117.52, 64.89, 64.36.
Compound 2g:
1H NMR (400 MHZ, DMSO-d6) δ 9.58 (d, J=33.7 Hz, 2H), 7.59-7.54 (m, 3H), 7.44 (s, 3H), 7.29 (s, 1H), 7.17 (s, 1H), 7.00 (d, J=8.3 Hz, 2H), 4.32 (s, 8H), 2.24 (s, 12H).
13C NMR (101 MHZ, DMSO) δ 167.20, 164.63, 146.77, 143.49, 138.46, 136.55, 136.18, 135.33, 127.71, 126.33, 125.17, 123.41, 121.56, 121.40, 118.60, 117.52, 117.37, 117.05, 64.88, 64.51, 18.70.
Compound 2h:
1H NMR (400 MHZ, DMSO-d6) δ 9.49 (s, 2H), 7.45-7.34 (m, 14H), 7.00 (s, 8H), 5.07 (s, 4H), 4.67 (s, 4H), 2.16 (s, 12H).
13C NMR (101 MHz, DMSO) δ 167.31, 153.35, 152.46, 138.49, 137.78, 136.28, 128.86, 128.10, 126.31, 116.15, 70.10, 18.68.
Compound 2i:
1H NMR (400 MHZ, Chloroform-d) δ 7.37-7.22 (m, 14H), 6.95-6.77 (m, 12H), 4.94 (d, J=2.4 Hz, 4H), 4.49 (d, J=3.4 Hz, 4H).
13C NMR (101 MHz, CDCl3) δ 166.48, 153.63, 152.32, 151.75, 137.19, 128.62, 128.59, 127.97, 127.93, 127.48, 121.91, 119.30, 116.13, 115.95, 115.87, 115.82, 115.71, 70.64, 68.31, 36.66, 35.76.
Compound 2j:
1H NMR (400 MHZ, DMSO-d6) δ 10.19 (s, 2H), 7.68 (d, J=8.8 Hz, 4H), 7.52-7.30 (m, 15H), 7.00-6.91 (m, 8H), 5.04 (s, 4H), 4.63 (s, 4H).
13C NMR (101 MHZ, DMSO) δ 167.50, 153.37, 152.41, 138.99, 137.75, 130.61, 130.32, 128.86, 128.10, 120.95, 116.14, 70.09, 68.32.
The synthesis of the 15 final compounds required an emphasis on speed, efficiency, and simplicity. As shown in
Purification efforts are ongoing in those compounds determined to be >97% pure via HPLC. These efforts primarily involve column chromatographic methods or simple separation to remove impurities, byproducts, and leftover starting materials. Once 1H/13C NMR and HPLC have fully characterized a compound, it can be sent for further biological study. These studies include cell-based assays and in vitro/in vivo testing to quantify a given analog's specificity, toxicity, and potency when targeting ARID4B. Biological characterization is ongoing, and its exact methods and results are beyond the scope of this paper.
The overarching goal of this project is to synthesize potent inhibitors of the ARID4B pathway. More broadly, the iterative in silico-aided nature of this project's small molecule design process can be expanded upon and applied to other fields. Furthermore, such a process allowed for the rapid creation of novel, first-in-class small molecule inhibitors such as ARD150 in the absence of a crystal structure of the target protein.
Research into this family of compounds is an ongoing and collaborative effort. As the synthesis and characterization of new analogs are completed, they are characterized biologically in varied mouse models. The theoretical Structure-in-silico Activity Relationship (SiAR) and biological characterization results can be combined to determine which structural elements contribute most heavily towards potency. This cycle can be repeatedly iterated to develop increasingly potent and effective analogs of ARD150.
These products could be used as lead compounds to develop drugs that target the ARID4B pathway. An iterative SiAR-synthesis cycle can be used in future stages of development, as well, to optimize further aspects of these analogs for optimal pharmacokinetics. Successful creation of a potent and effective small molecule inhibitor for ARID4B will be vital in treating certain aggressive breast cancers and may find applications as a therapeutic intervention for related diseases.
Four distinct structural scaffolds (ARD150, ARD153, RWR-10, and RWR-18) that target the chromodomain of ARID4B with strong interactions with amino acids in their respective binding pockets (
Using CETSA (cellular thermal shift assay) to examine whether ARD150 or RWR-18 binds to ARID4B, MCF7 cells treated or not with ARD150 or RWR-18 for 2 h were collected, followed by heating at 46, 49, 52, 55, and 58° C. to denature and precipitate ARID4B protein, cell lysis, removal of cell debris and aggregates by centrifugation, and finally detection of remaining thermostable ARID4B protein by western blot. The ARID4B aggregation temperatures without ARD150 were determined to be 49-52° C., but shifted to 52-55° C. after treatment with ARD150 and even more so with RWR-18 (
Isothermal titration calorimetry (ITC) was used to confirm the binding of ARD150 and its derivative RWR18 (derived from ARD150 with slight structural modifications) to ARID4B chromodomain and to determine the Kd, reaction stoichiometry (n), enthalpy (ΔH) and entropy (ΔS). The results showed the Kd of ARD150 and RWR18 for ARID4B chromodomain are 3.5 μM and 1 μM, respectively (
To determine the EC50 of ARD150 and two of its derivatives in breast cancer, MCF7, ZR75-1 (wild-type ERα+), and MCF7 Y537S (constitutively activate ERα mutant) and their ARID4B KO counterparts were seeded in 24 well plate and treated with the various concentrations (0, 0.1, 1, 5, 10, 20, 40, 60, and 100 μM) of ARD150 and RWR-18 for 14 days. The results showed that ARID4B inhibitors inhibited proliferation of MCF7 control cells in a dose-dependent manner as determined by MTT assays. The EC50 of ARD150 and RWR-18 for MCF7 are 13.8 and 7.9 μM, respectively (
Consistent with knockout of ARID4B inhibited E2-induced and constitutive expression of ERα target genes, pre-treatment with RWR-18 (5 and 10 μM) abolished the induction of PGR and GREB1 genes by E2 in MCF7 and ZR75-1 cells (
Since RWR-18 is more potent and has lower Kd than ARD150. The potential toxicity of RWR-18 was tested using 9-week-old Balb/c female mice. The mice were injected (i.p.) with vehicle or 10 mg/kg, 25 mg/kg, and 50 mg/kg of RWR-18 (daily for 7 days, 3 mice/group). The body weight was measured daily for 21 days. The liver, spleen, heart, lung, and kidney were collected for macroscopic and histological examination. The results showed that control mice and mice treated with RWR-18 at doses tested exhibited similar body weight gain (
Histone peptide binding assays were used to determine that ARID4B binds to which methylated lysine. Full-length His-ARID4B and deletion mutants expressed in Sf9 were purified by Ni-NTA affinity resin (
To provide a better understanding into the ARID4B binding landscape, ChIP-seq using ARID4B antibody was carried out in MCF7 cells. ChIP-seq analysis identified ARID4B enrichment on the promoters of ERα target genes, including PGR, GREB1, CCND1, SLC7A5 (
ChIP assays were performed to determine the recruitment of ERα to the enhancers in MCF7 cells with or without knockout of ARID4B. Consistently, ERα was recruited to the enhancers and not promoters of PGR and GREB1 upon treatment with E2, and knockout of ARID4B significantly reduced the E2-induced ERα recruitment to the enhancers of PGR and GREB1 (
Since RWR-18 targets the chromodomain of ARID4B, which is important for binding to H3K4me3 (
The effects of RWR18 on tumor growth was determined using MCF7 xenograft model. When tumor in SCID mice reached 150-200 mm3, mice were randomized to receive vehicle or RWR18 (25 mg/kg) once every two days for 24 days. The graphs showed tumor growth in mice receiving vehicle (
While particular aspects have been illustrated and described herein, it should be understood that various other changes and modifications may be made without departing from the spirit and scope of the claimed subject matter. Moreover, although various aspects of the claimed subject matter have been described herein, such aspects need not be utilized in combination. It is therefore intended that the appended claims cover all such changes and modifications that are within the scope of the claimed subject matter.
It is appreciated that all reagents are obtainable by sources known in the art unless otherwise specified.
It is also to be understood that this disclosure is not limited to the specific aspects and methods described herein, as specific components and/or conditions may, of course, vary. Furthermore, the terminology used herein is used only for the purpose of describing particular aspects of the present disclosure and is not intended to be limiting in any way. It will be also understood that, although the terms “first,” “second,” “third” etc. may be used herein to describe various elements, components, regions, layers, and/or sections, these elements, components, regions, layers, and/or sections should not be limited by these terms. These terms are only used to distinguish one element, component, region, layer, or section from another element, component, region, layer, or section. Thus, “a first element,” “component,” “region,” “layer,” or “section” discussed below could be termed a second (or other) element, component, region, layer, or section without departing from the teachings herein. Similarly, as used herein, the singular forms “a,” “an,” and “the” are intended to include the plural forms, including “at least one,” unless the content clearly indicates otherwise. “Or” means “and/or.” As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items. It will be further understood that the terms “comprises” and/or “comprising,” or “includes” and/or “including” when used in this specification, specify the presence of stated features, regions, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, regions, integers, steps, operations, elements, components, and/or groups thereof. The term “or a combination thereof” means a combination including at least one of the foregoing elements.
Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. It will be further understood that terms such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and the present disclosure, and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
Reference is made in detail to exemplary compositions, aspects and methods of the present disclosure, which constitute the best modes of practicing the disclosure presently known to the inventors. The Figures are not necessarily to scale. However, it is to be understood that the disclosed aspects are merely exemplary of the disclosure that may be embodied in various and alternative forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but merely as a representative basis for any aspect of the disclosure and/or as a representative basis for teaching one skilled in the art to variously employ the present disclosure.
Patents, publications, and applications mentioned in the specification are indicative of the levels of those skilled in the art to which the disclosure pertains. These patents, publications, and applications are incorporated herein by reference to the same extent as if each individual patent, publication, or application was specifically and individually incorporated herein by reference.
The foregoing description is illustrative of particular embodiments of the disclosure, but is not meant to be a limitation upon the practice thereof. The following claims, including all equivalents thereof, are intended to define the scope of the disclosure.
This application claims priority to U.S. Provisional Patent Application 63/300,455, filed Jan. 18, 2022, the content of which is hereby incorporated by reference in its entirety.
The present disclosure was supported by a grant from the National Institutes for Health, Center for Pharmaceutical Research and Innovation (Grant No. P20 GM130456) and the National Cancer Institute (Grant No. R01CA258421-01). The Government may have rights to the invention.
| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/US2023/060841 | 1/18/2023 | WO |
| Number | Date | Country | |
|---|---|---|---|
| 63300455 | Jan 2022 | US |