The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jun. 17, 2021, is named R2103-7017WO_SL.txt and is 1,386,978 bytes in size.
Alternative splicing is a major source of protein diversity in higher eukaryotes, and is frequently regulated in a tissue-specific or development stage-specific manner. Disease associated alternative splicing patterns in pre-mRNAs are often mapped to changes in splice site signals or sequence motifs and regulatory splicing factors (Faustino and Cooper (2003), Genes Dev 17(4):419-37). Current therapies to modulate RNA expression involve oligonucleotide targeting and gene therapy; however, each of these modalities exhibit unique challenges as currently presented. As such, there is a need for new technologies to modulate RNA expression, including the development of small molecule compounds that target splicing.
The present disclosure features compounds and related compositions that, inter alia, modulate nucleic acid splicing, e.g., splicing of a pre-mRNA, as well as methods of use thereof. In an embodiment, the compounds described herein are compounds of Formulas (I) or (II) and pharmaceutically acceptable salts, solvates, hydrates, tautomers, or stereoisomers thereof. The present disclosure additionally provides methods of using the compounds of the disclosure (e.g., compounds of Formulas (I) or (II) and pharmaceutically acceptable salts, solvates, hydrates, tautomers, stereoisomers thereof), and compositions thereof, e.g., to target, and in embodiments bind or form a complex with, a nucleic acid (e.g., a pre-mRNA or nucleic acid component of a small nuclear ribonucleoprotein (snRNP) or spliceosome), a protein (e.g., a protein component of an snRNP or spliceosome, e.g., a member of the splicing machinery, e.g., one or more of the U1, U2, U4, U5, U6, U11, U12, U4atac, U6atac snRNPs), or a combination thereof. In another aspect, the compounds described herein may be used to alter the composition or structure of a nucleic acid (e.g., a pre-mRNA or mRNA (e.g., a pre-mRNA and the mRNA which arises from the pre-mRNA), e.g., by increasing or decreasing splicing at a splice site. In some embodiments, increasing or decreasing splicing results in modulating the level of a gene product (e.g., an RNA or protein) produced.
In another aspect, the compounds described herein may be used for the prevention and/or treatment of a disease, disorder, or condition, e.g., a disease, disorder or condition associated with splicing, e.g., alternative splicing. In some embodiments, the compounds described herein (e.g., compounds of Formulas (I) or (II), and pharmaceutically acceptable salts, solvates, hydrates, tautomers, stereoisomers thereof) and compositions thereof are used for the prevention and/or treatment of a proliferative disease, disorder, or condition (e.g., a disease, disorder, or condition characterized by unwanted cell proliferation, e.g., a cancer or a benign neoplasm) in a subject. In some embodiments, the compounds described herein (e.g., compounds of Formulas (I) or (II), and pharmaceutically acceptable salts, solvates, hydrates, tautomers, stereoisomers thereof) and compositions thereof are used for the prevention and/or treatment of a non-proliferative disease, disorder, or condition. In some embodiments, the compounds described herein (e.g., compounds of Formulas (I) or (II), and pharmaceutically acceptable salts, solvates, hydrates, tautomers, stereoisomers thereof) and compositions thereof are used for the prevention and/or treatment of a neurological disease or disorder, an autoimmune disease or disorder, immunodeficiency disease or disorder, a lysosomal storage disease or disorder, a cardiovascular disease or disorder, a metabolic disease or disorder, a respiratory disease or disorder, a renal disease or disorder, or an infectious disease in a subject.
In another aspect, the present disclosure provides compounds of Formula (I):
or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, wherein each of A, B, L, X, Y, Z, R2, R7, and subvariables thereof are defined as described herein.
In another aspect, the present disclosure provides compounds of Formula (II):
or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, wherein each of A, B, L, X, Y, Z, R2, and subvariables thereof are defined as described herein.
In another aspect, the present invention provides pharmaceutical compositions comprising a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, and optionally a pharmaceutically acceptable excipient. In an embodiment, the pharmaceutical compositions described herein include an effective amount (e.g., a therapeutically effective amount) of a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In another aspect, the present disclosure provides methods for modulating splicing, e.g., splicing of a nucleic acid (e.g., a DNA or RNA, e.g., a pre-mRNA) with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof. In another aspect, the present disclosure provides compositions for use in modulating splicing, e.g., splicing of a nucleic acid (e.g., a DNA or RNA, e.g., a pre-mRNA) with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof. Modulation of splicing may comprise impacting any step involved in splicing and may include an event upstream or downstream of a splicing event. For example, in some embodiments, the compound of Formulas (I) or (II) binds to a target, e.g., a target nucleic acid (e.g., DNA or RNA, e.g., a precursor RNA, e.g., a pre-mRNA), a target protein, or combination thereof (e.g., an snRNP and a pre-mRNA). A target may include a splice site in a pre-mRNA or a component of the splicing machinery, such as the U1 snRNP. In some embodiments, the compound of Formulas (I) or (II) alters a target nucleic acid (e.g., DNA or RNA, e.g., a precursor RNA, e.g., a pre-mRNA), target protein, or combination thereof. In some embodiments, the compound of Formulas (I) or (II) increases or decreases splicing at a splice site on a target nucleic acid (e.g., an RNA, e.g., a precursor RNA, e.g., a pre-mRNA) by about 0.5% or more (e.g., about 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 75%, 90%, 95%, or more), relative to a reference (e.g., the absence of a compound of Formulas (I) or (II), e.g., in a healthy or diseased cell or tissue). In some embodiments, the presence of a compound of Formulas (I) or (II) results an increase or decrease of transcription of a target nucleic acid (e.g., an RNA) by about 0.5% or more (e.g., about 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50%, 75%, 90%, 95%, or more), relative to a reference (e.g., the absence of a compound of Formulas (I) or (II), e.g., in a healthy or diseased cell or tissue).
In another aspect, the present disclosure provides methods for preventing and/or treating a disease, disorder, or condition in a subject by administering a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, or related compositions. In some embodiments, the disease or disorder entails unwanted or aberrant splicing. In some embodiments, the disease or disorder is a proliferative disease, disorder, or condition. Exemplary proliferative diseases include cancer, a benign neoplasm, or angiogenesis. In other embodiments, the present disclosure provides methods for treating and/or preventing a non-proliferative disease, disorder, or condition. In still other embodiments, the present disclosure provides methods for treating and/or preventing a neurological disease or disorder, autoimmune disease or disorder, immunodeficiency disease or disorder, lysosomal storage disease or disorder, cardiovascular disease or disorder, metabolic disease or disorder, respiratory disease or disorder, renal disease or disorder, or infectious disease.
In another aspect, the present disclosure provides methods of down-regulating the expression of (e.g., the level of or the rate of production of) a target protein with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof in a biological sample or subject. In another aspect, the present disclosure provides methods of up-regulating the expression of (e.g., the level of or the rate of production of) a target protein with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof in a biological sample or subject. In another aspect, the present disclosure provides methods of altering the isoform of a target protein with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof in a biological sample or subject. Another aspect of the disclosure relates to methods of inhibiting the activity of a target protein in a biological sample or subject. In some embodiments, administration of a compound of Formulas (I) or (II) to a biological sample, a cell, or a subject comprises inhibition of cell growth or induction of cell death.
In another aspect, the present disclosure provides compositions for use in preventing and/or treating a disease, disorder, or condition in a subject by administering a compound of Formulas (I) or (II) or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, or related compositions. In some embodiments, the disease or disorder entails unwanted or aberrant splicing. In some embodiments, the disease or disorder is a proliferative disease, disorder, or condition. Exemplary proliferative diseases include cancer, a benign neoplasm, or angiogenesis. In other embodiments, the present disclosure provides methods for treating and/or preventing a non-proliferative disease, disorder, or condition. In still other embodiments, the present disclosure provides compositions for use in treating and/or preventing a neurological disease or disorder, autoimmune disease or disorder, immunodeficiency disease or disorder, lysosomal storage disease or disorder, cardiovascular disease or disorder, metabolic disease or disorder, respiratory disease or disorder, renal disease or disorder, or infectious disease.
In another aspect, the present disclosure provides compositions for use in down-regulating the expression of (e.g., the level of or the rate of production of) a target protein with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof in a biological sample or subject. In another aspect, the present disclosure provides compositions for use in up-regulating the expression of (e.g., the level of or the rate of production of) a target protein with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof in a biological sample or subject. In another aspect, the present disclosure provides compositions for use in altering the isoform of a target protein with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof in a biological sample or subject. Another aspect of the disclosure relates to compositions for use in inhibiting the activity of a target protein in a biological sample or subject. In some embodiments, administration of a compound of Formulas (I) or (II) to a biological sample, a cell, or a subject comprises inhibition of cell growth or induction of cell death.
In another aspect, the present disclosure features kits comprising a container with a compound of Formulas (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, stereoisomer thereof, or a pharmaceutical composition thereof. In certain embodiments, the kits described herein further include instructions for administering the compound of Formulas (I) or (II), or the pharmaceutically acceptable salt, solvate, hydrate, tautomer, stereoisomer thereof, or the pharmaceutical composition thereof.
In any and all aspects of the present disclosure, in some embodiments, the compound, target nucleic acid (e.g., DNA, RNA, e.g., pre-mRNA), or target protein described herein is a compound, target nucleic acid (e.g., DNA, RNA, e.g., pre-mRNA), or target protein other than a compound, target nucleic acid (e.g., DNA, RNA, e.g., pre-mRNA), or target protein described one of U.S. Pat. No. 8,729,263, U.S. Publication No. 2015/0005289, WO 2014/028459, WO 2016/128343, WO 2016/196386, WO 2017/100726, WO 2018/232039, WO 2018/098446, WO 2019/028440, WO 2019/060917, WO 2019/199972, and WO 2020/004594. In some embodiments, the compound, target nucleic acid (e.g., DNA, RNA, e.g., pre-mRNA), or target protein described herein is a compound, target nucleic acid (e.g., DNA, RNA, e.g., pre-mRNA), or target protein described one of U.S. Pat. No. 8,729,263, U.S. Publication No. ¶2015/0005289, WO 2014/028459, WO 2016/128343, WO 2016/196386, WO 2017/100726, WO 2018/232039, WO 2018/098446, WO 2019/028440, WO 2019/060917, WO 2019/199972, and WO 2020/004594, each of which is incorporated herein by reference in its entirety.
The details of one or more embodiments of the invention are set forth herein. Other features, objects, and advantages of the invention will be apparent from the Detailed Description, the Examples, and the Claims.
Definitions of specific functional groups and chemical terms are described in more detail below. The chemical elements are identified in accordance with the Periodic Table of the Elements, CAS version, Handbook of Chemistry and Physics, 75th Ed., inside cover, and specific functional groups are generally defined as described therein. Additionally, general principles of organic chemistry, as well as specific functional moieties and reactivity, are described in Thomas Sorrell, Organic Chemistry, University Science Books, Sausalito, 1999; Smith and March, March's Advanced Organic Chemistry, 5th Edition, John Wiley & Sons, Inc., New York, 2001; Larock, Comprehensive Organic Transformations, VCH Publishers, Inc., New York, 1989; and Carruthers, Some Modern Methods of Organic Synthesis, 3rd Edition, Cambridge University Press, Cambridge, 1987.
The abbreviations used herein have their conventional meaning within the chemical and biological arts. The chemical structures and formulae set forth herein are constructed according to the standard rules of chemical valency known in the chemical arts.
When a range of values is listed, it is intended to encompass each value and sub-range within the range. For example “C1-C6 alkyl” is intended to encompass, C1, C2, C3, C4, C5, C6, C1-C6, C1-C5, C1-C4, C1-C3, C1-C2, C2-C6, C2-C5, C2-C4, C2-C3, C3-C6, C3-C5, C3-C4, C4-C6, C4-C5, and C5-C6 alkyl.
The following terms are intended to have the meanings presented therewith below and are useful in understanding the description and intended scope of the present invention.
As used herein, “alkyl” refers to a radical of a straight-chain or branched saturated hydrocarbon group having from 1 to 24 carbon atoms (“C1-C24 alkyl”). In some embodiments, an alkyl group has 1 to 12 carbon atoms (“C1-C12 alkyl”). In some embodiments, an alkyl group has 1 to 8 carbon atoms (“C1-C8 alkyl”). In some embodiments, an alkyl group has 1 to 6 carbon atoms (“C1-C6 alkyl”). In some embodiments, an alkyl group has 2 to 6 carbon atoms (“C2-C6 alkyl”). In some embodiments, an alkyl group has 1 carbon atom (“C1 alkyl”). Examples of C1-C6alkyl groups include methyl (C1), ethyl (C2), n-propyl (C3), isopropyl (C3), n-butyl (C4), tert butyl (C4), sec-butyl (C4), iso-butyl (C4), n-pentyl (C5), 3-pentanyl (C5), amyl (C5), neopentyl (C5), 3-methyl-2-butanyl (C5), tertiary amyl (C5), and n-hexyl (C6). Additional examples of alkyl groups include n-heptyl (C7), n-octyl (C8) and the like. Each instance of an alkyl group may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted alkyl”) or substituted (a “substituted alkyl”) with one or more substituents; e.g., for instance from 1 to 5 substituents, 1 to 3 substituents, or 1 substituent. In certain embodiments, the alkyl group is unsubstituted C1-C10 alkyl (e.g., —CH3). In certain embodiments, the alkyl group is substituted C1-C6 alkyl.
As used herein, “alkenyl” refers to a radical of a straight-chain or branched hydrocarbon group having from 2 to 24 carbon atoms, one or more carbon-carbon double bonds, and no triple bonds (“C2-C24 alkenyl”). In some embodiments, an alkenyl group has 2 to 10 carbon atoms (“C2-C10 alkenyl”). In some embodiments, an alkenyl group has 2 to 8 carbon atoms (“C2-C8 alkenyl”). In some embodiments, an alkenyl group has 2 to 6 carbon atoms (“C2-C6 alkenyl”). In some embodiments, an alkenyl group has 2 carbon atoms (“C2 alkenyl”). The one or more carbon-carbon double bonds can be internal (such as in 2-butenyl) or terminal (such as in 1-butenyl). Examples of C2-C4 alkenyl groups include ethenyl (C2), 1-propenyl (C3), 2-propenyl (C3), 1-butenyl (C4), 2-butenyl (C4), butadienyl (C4), and the like. Examples of C2-C6 alkenyl groups include the aforementioned C2 i alkenyl groups as well as pentenyl (C5), pentadienyl (C5), hexenyl (C6), and the like. Additional examples of alkenyl include heptenyl (C7), octenyl (C8), octatrienyl (C8), and the like. Each instance of an alkenyl group may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted alkenyl”) or substituted (a “substituted alkenyl”) with one or more substituents e.g., for instance from 1 to 5 substituents, 1 to 3 substituents, or 1 substituent. In certain embodiments, the alkenyl group is unsubstituted C1-C10 alkenyl. In certain embodiments, the alkenyl group is substituted C2-C6 alkenyl.
As used herein, the term “alkynyl” refers to a radical of a straight-chain or branched hydrocarbon group having from 2 to 24 carbon atoms, one or more carbon-carbon triple bonds (“C2-C24 alkenyl”). In some embodiments, an alkynyl group has 2 to 10 carbon atoms (“C2-C10 alkynyl”). In some embodiments, an alkynyl group has 2 to 8 carbon atoms (“C2-C8 alkynyl”). In some embodiments, an alkynyl group has 2 to 6 carbon atoms (“C2-C6 alkynyl”). In some embodiments, an alkynyl group has 2 carbon atoms (“C2 alkynyl”). The one or more carbon-carbon triple bonds can be internal (such as in 2-butynyl) or terminal (such as in 1-butynyl). Examples of C2-C4 alkynyl groups include ethynyl (C2), 1-propynyl (C3), 2-propynyl (C3), 1-butynyl (C4), 2-butynyl (C4), and the like. Each instance of an alkynyl group may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted alkynyl”) or substituted (a “substituted alkynyl”) with one or more substituents e.g., for instance from 1 to 5 substituents, 1 to 3 substituents, or 1 substituent. In certain embodiments, the alkynyl group is unsubstituted C2-10 alkynyl. In certain embodiments, the alkynyl group is substituted C2-6 alkynyl.
As used herein, the term “haloalkyl,” refers to a non-cyclic stable straight or branched chain, or combinations thereof, including at least one carbon atom and at least one halogen selected from the group consisting of F, Cl, Br, and I. The halogen(s) F, Cl, Br, and I may be placed at any position of the haloalkyl group. Exemplary haloalkyl groups include, but are not limited to: —CF3, —CCl3, —CH2—CF3, —CH2—CCl3, —CH2—CBr3, —CH2—CI3, —CH2—CH2—CH(CF3)—CH3, —CH2-CH2—CH(Br)—CH3, and —CH2-CH═CH-CH2—CF3. Each instance of a haloalkyl group may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted haloalkyl”) or substituted (a “substituted haloalkyl”) with one or more substituents e.g., for instance from 1 to 5 substituents, 1 to 3 substituents, or 1 substituent.
As used herein, the term “heteroalkyl,” refers to a non-cyclic stable straight or branched chain, or combinations thereof, including at least one carbon atom and at least one heteroatom selected from the group consisting of O, N, P, Si, and S, and wherein the nitrogen and sulfur atoms may optionally be oxidized, and the nitrogen heteroatom may optionally be quaternized. The heteroatom(s) O, N, P, S, and Si may be placed at any position of the heteroalkyl group. Exemplary heteroalkyl groups include, but are not limited to: —CH2-CH2—O—CH3, —CH2—CH2—NH—CH3, —CH2-CH2—N(CH3)-CH3, —CH2—S—CH2-CH3, —CH2—CH2, —S(O)—CH3, —CH2-CH2—S(O)2—CH3, —CH═CH—O—CH3, —Si(CH3)3, —CH2-CH═N—OCH3, —CH═CH—N(CH3)-CH3, —O—CH3, and —O—CH2-CH3. Up to two or three heteroatoms may be consecutive, such as, for example, —CH2—NH—OCH3 and —CH2—O—Si(CH3)3. Where “heteroalkyl” is recited, followed by recitations of specific heteroalkyl groups, such as —CH2O, —NRCRD, or the like, it will be understood that the terms heteroalkyl and —CH2O or —NRCRD are not redundant or mutually exclusive. Rather, the specific heteroalkyl groups are recited to add clarity. Thus, the term “heteroalkyl” should not be interpreted herein as excluding specific heteroalkyl groups, such as —CH2O, —NRCBD, or the like. Each instance of a heteroalkyl group may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted heteroalkyl”) or substituted (a “substituted heteroalkyl”) with one or more substituents e.g., for instance from 1 to 5 substituents, 1 to 3 substituents, or 1 substituent.
As used herein, “aryl” refers to a radical of a monocyclic or polycyclic (e.g., bicyclic or tricyclic) 4n+2 aromatic ring system (e.g., having 6, 10, or 14 it electrons shared in a cyclic array) having 6-14 ring carbon atoms and zero heteroatoms provided in the aromatic ring system (“C6-C14 aryl”). In some embodiments, an aryl group has six ring carbon atoms (“C6 aryl”; e.g., phenyl). In some embodiments, an aryl group has ten ring carbon atoms (“C10 aryl”; e.g., naphthyl such as 1-naphthyl and 2-naphthyl). In some embodiments, an aryl group has fourteen ring carbon atoms (“C14 aryl”; e.g., anthracyl). An aryl group may be described as, e.g., a C6-C10-membered aryl, wherein the term “membered” refers to the non-hydrogen ring atoms within the moiety. Aryl groups include phenyl, naphthyl, indenyl, and tetrahydronaphthyl. Each instance of an aryl group may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted aryl”) or substituted (a “substituted aryl”) with one or more substituents. In certain embodiments, the aryl group is unsubstituted C6-C14 aryl. In certain embodiments, the aryl group is substituted C6-C14 aryl.
As used herein, “heteroaryl” refers to a radical of a 5-10 membered monocyclic or bicyclic 4n+2 aromatic ring system (e.g., having 6 or 10 it electrons shared in a cyclic array) having ring carbon atoms and 1-4 ring heteroatoms provided in the aromatic ring system, wherein each heteroatom is independently selected from nitrogen, oxygen and sulfur (“5-10 membered heteroaryl”). In heteroaryl groups that contain one or more nitrogen atoms, the point of attachment can be a carbon or nitrogen atom, as valency permits. Heteroaryl bicyclic ring systems can include one or more heteroatoms in one or both rings. “Heteroaryl” also includes ring systems wherein the heteroaryl ring, as defined above, is fused with one or more aryl groups wherein the point of attachment is either on the aryl or heteroaryl ring, and in such instances, the number of ring members designates the number of ring members in the fused (aryl/heteroaryl) ring system. Bicyclic heteroaryl groups wherein one ring does not contain a heteroatom (e.g., indolyl, quinolinyl, carbazolyl, and the like) the point of attachment can be on either ring, i.e., either the ring bearing a heteroatom (e.g., 2-indolyl) or the ring that does not contain a heteroatom (e.g., 5-indolyl). A heteroaryl group may be described as, e.g., a 6-10-membered heteroaryl, wherein the term “membered” refers to the non-hydrogen ring atoms within the moiety. Each instance of a heteroaryl group may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted heteroaryl”) or substituted (a “substituted heteroaryl”) with one or more substituents e.g., for instance from 1 to 5 substituents, 1 to 3 substituents, or 1 substituent.
Exemplary 5-membered heteroaryl groups containing one heteroatom include, without limitation, pyrrolyl, furanyl and thiophenyl. Exemplary 5-membered heteroaryl groups containing two heteroatoms include, without limitation, imidazolyl, pyrazolyl, oxazolyl, isoxazolyl, thiazolyl, and isothiazolyl. Exemplary 5-membered heteroaryl groups containing three heteroatoms include, without limitation, triazolyl, oxadiazolyl, and thiadiazolyl. Exemplary 5-membered heteroaryl groups containing four heteroatoms include, without limitation, tetrazolyl. Exemplary 6-membered heteroaryl groups containing one heteroatom include, without limitation, pyridinyl. Exemplary 6-membered heteroaryl groups containing two heteroatoms include, without limitation, pyridazinyl, pyrimidinyl, and pyrazinyl. Exemplary 6-membered heteroaryl groups containing three or four heteroatoms include, without limitation, triazinyl and tetrazinyl, respectively. Exemplary 7-membered heteroaryl groups containing one heteroatom include, without limitation, azepinyl, oxepinyl, and thiepinyl. Exemplary 5,6-bicyclic heteroaryl groups include, without limitation, indolyl, isoindolyl, indazolyl, benzotriazolyl, benzothiophenyl, isobenzothiophenyl, benzofuranyl, benzoisofuranyl, benzimidazolyl, benzoxazolyl, benzisoxazolyl, benzoxadiazolyl, benzthiazolyl, benzisothiazolyl, benzthiadiazolyl, indolizinyl, and purinyl. Exemplary 6,6-bicyclic heteroaryl groups include, without limitation, naphthyridinyl, pteridinyl, quinolinyl, isoquinolinyl, cinnolinyl, quinoxalinyl, phthalazinyl, and quinazolinyl. Other exemplary heteroaryl groups include heme and heme derivatives.
As used herein, “cycloalkyl” refers to a radical of a non-aromatic cyclic hydrocarbon group having from 3 to 10 ring carbon atoms (“C3-C10 cycloalkyl”) and zero heteroatoms in the non-aromatic ring system. In some embodiments, a cycloalkyl group has 3 to 8 ring carbon atoms (“C3-C8 cycloalkyl”). In some embodiments, a cycloalkyl group has 3 to 6 ring carbon atoms (“C3-C6 cycloalkyl”). In some embodiments, a cycloalkyl group has 3 to 6 ring carbon atoms (“C3-C6 cycloalkyl”). In some embodiments, a cycloalkyl group has 5 to 10 ring carbon atoms (“C5-C10 cycloalkyl”). A cycloalkyl group may be described as, e.g., a C4-C7-membered cycloalkyl, wherein the term “membered” refers to the non-hydrogen ring atoms within the moiety. Exemplary C3-C6 cycloalkyl groups include, without limitation, cyclopropyl (C3), cyclopropenyl (C3), cyclobutyl (C4), cyclobutenyl (C4), cyclopentyl (C5), cyclopentenyl (C5), cyclohexyl (C6), cyclohexenyl (C6), cyclohexadienyl (C6), and the like. Exemplary C3-C8 cycloalkyl groups include, without limitation, the aforementioned C3-C6 cycloalkyl groups as well as cycloheptyl (C7), cycloheptenyl (C7), cycloheptadienyl (C7), cycloheptatrienyl (C7), cyclooctyl (C8), cyclooctenyl (C8), cubanyl (C8), bicyclo[1.1.1]pentanyl (C5), bicyclo[2.2.2]octanyl (C8), bicyclo[2.1.1]hexanyl (C6), bicyclo[3.1.1]heptanyl (C7), and the like. Exemplary C3-C10 cycloalkyl groups include, without limitation, the aforementioned C3-C8 cycloalkyl groups as well as cyclononyl (C9), cyclononenyl (C9), cyclodecyl (C10), cyclodecenyl (C10), octahydro-1H-indenyl (C9), decahydronaphthalenyl (C10), spiro[4.5]decanyl (C10), and the like. As the foregoing examples illustrate, in certain embodiments, the cycloalkyl group is either monocyclic (“monocyclic cycloalkyl”) or contain a fused, bridged or spiro ring system such as a bicyclic system (“bicyclic cycloalkyl”) and can be saturated or can be partially unsaturated. “Cycloalkyl” also includes ring systems wherein the cycloalkyl ring, as defined above, is fused with one or more aryl groups wherein the point of attachment is on the cycloalkyl ring, and in such instances, the number of carbons continue to designate the number of carbons in the cycloalkyl ring system. Each instance of a cycloalkyl group may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted cycloalkyl”) or substituted (a “substituted cycloalkyl”) with one or more substituents. In certain embodiments, the cycloalkyl group is unsubstituted C3-C10 cycloalkyl. In certain embodiments, the cycloalkyl group is a substituted C3-C10 cycloalkyl.
“Heterocyclyl” as used herein refers to a radical of a 3- to 10-membered non-aromatic ring system having ring carbon atoms and 1 to 4 ring heteroatoms, wherein each heteroatom is independently selected from nitrogen, oxygen, sulfur, boron, phosphorus, and silicon (“3-10 membered heterocyclyl”). In heterocyclyl groups that contain one or more nitrogen atoms, the point of attachment can be a carbon or nitrogen atom, as valency permits. A heterocyclyl group can either be monocyclic (“monocyclic heterocyclyl”) or a fused, bridged or spiro ring system such as a bicyclic system (“bicyclic heterocyclyl”), and can be saturated or can be partially unsaturated. Heterocyclyl bicyclic ring systems can include one or more heteroatoms in one or both rings. “Heterocyclyl” also includes ring systems wherein the heterocyclyl ring, as defined above, is fused with one or more cycloalkyl groups wherein the point of attachment is either on the cycloalkyl or heterocyclyl ring, or ring systems wherein the heterocyclyl ring, as defined above, is fused with one or more aryl or heteroaryl groups, wherein the point of attachment is on the heterocyclyl ring, and in such instances, the number of ring members continue to designate the number of ring members in the heterocyclyl ring system. A heterocyclyl group may be described as, e.g., a 3-7-membered heterocyclyl, wherein the term “membered” refers to the non-hydrogen ring atoms, i.e., carbon, nitrogen, oxygen, sulfur, boron, phosphorus, and silicon, within the moiety. Each instance of heterocyclyl may be independently optionally substituted, i.e., unsubstituted (an “unsubstituted heterocyclyl”) or substituted (a “substituted heterocyclyl”) with one or more substituents. In certain embodiments, the heterocyclyl group is unsubstituted 3-10 membered heterocyclyl. In certain embodiments, the heterocyclyl group is substituted 3-10 membered heterocyclyl.
Exemplary 3-membered heterocyclyl groups containing one heteroatom include, without limitation, azirdinyl, oxiranyl, thiorenyl. Exemplary 4-membered heterocyclyl groups containing one heteroatom include, without limitation, azetidinyl, oxetanyl and thietanyl. Exemplary 5-membered heterocyclyl groups containing one heteroatom include, without limitation, tetrahydrofuranyl, dihydrofuranyl, tetrahydrothiophenyl, dihydrothiophenyl, pyrrolidinyl, dihydropyrrolyl and pyrrolyl-2,5-dione. Exemplary 5-membered heterocyclyl groups containing two heteroatoms include, without limitation, dioxolanyl, oxasulfuranyl, disulfuranyl, and oxazolidin-2-one. Exemplary 5-membered heterocyclyl groups containing three heteroatoms include, without limitation, triazolinyl, oxadiazolinyl, and thiadiazolinyl. Exemplary 6-membered heterocyclyl groups containing one heteroatom include, without limitation, piperidinyl (e.g., 2,2,6,6-tetramethylpiperidinyl), tetrahydropyranyl, dihydropyridinyl, pyridinonyl (e.g., 1-methylpyridin2-onyl), and thianyl. Exemplary 6-membered heterocyclyl groups containing two heteroatoms include, without limitation, piperazinyl, morpholinyl, pyridazinonyl (2-methylpyridazin-3-onyl), pyrimidinonyl (e.g., 1-methylpyrimidin-2-onyl, 3-methylpyrimidin-4-onyl), dithianyl, dioxanyl. Exemplary 6-membered heterocyclyl groups containing two heteroatoms include, without limitation, triazinanyl. Exemplary 7-membered heterocyclyl groups containing one heteroatom include, without limitation, azepanyl, oxepanyl and thiepanyl. Exemplary 8-membered heterocyclyl groups containing one heteroatom include, without limitation, azocanyl, oxecanyl and thiocanyl. Exemplary 5-membered heterocyclyl groups fused to a C6 aryl ring (also referred to herein as a 5,6-bicyclic heterocyclyl ring) include, without limitation, indolinyl, isoindolinyl, dihydrobenzofuranyl, dihydrobenzothienyl, benzoxazolinonyl, and the like. Exemplary 5-membered heterocyclyl groups fused to a heterocyclyl ring (also referred to herein as a 5,5-bicyclic heterocyclyl ring) include, without limitation, octahydropyrrolopyrrolyl (e.g., octahydropyrrolo[3,4-c]pyrrolyl), and the like. Exemplary 6-membered heterocyclyl groups fused to a heterocyclyl ring (also referred to as a 4,6-membered heterocyclyl ring) include, without limitation, diazaspirononanyl (e.g., 2,7-diazaspiro[3.5]nonanyl). Exemplary 6-membered heterocyclyl groups fused to an aryl ring (also referred to herein as a 6,6-bicyclic heterocyclyl ring) include, without limitation, tetrahydroquinolinyl, tetrahydroisoquinolinyl, and the like. Exemplary 6-membered heterocyclyl groups fused to a cycloalkyl ring (also referred to herein as a 6,7-bicyclic heterocyclyl ring) include, without limitation, azabicyclooctanyl (e.g., (1, 5)-8-azabicyclo[3.2.1]octanyl). Exemplary 6-membered heterocyclyl groups fused to a cycloalkyl ring (also referred to herein as a 6,8-bicyclic heterocyclyl ring) include, without limitation, azabicyclononanyl (e.g., 9-azabicyclo[3.3.1]nonanyl).
The terms “alkylene,” “alkenylene,” “alkynylene,” “haloalkylene,” “heteroalkylene,” “cycloalkylene,” or “heterocyclylene,” alone or as part of another substituent, mean, unless otherwise stated, a divalent radical derived from an alkyl, alkenyl, alkynyl, haloalkylene, heteroalkylene, cycloalkyl, or heterocyclyl respectively. For example, the term “alkenylene,” by itself or as part of another substituent, means, unless otherwise stated, a divalent radical derived from an alkene. An alkylene, alkenylene, alkynylene, haloalkylene, heteroalkylene, cycloalkylene, or heterocyclylene group may be described as, e.g., a C1-C6-membered alkylene, C2-C6-membered alkenylene, C2-C6-membered alkynylene, C1-C6-membered haloalkylene, C1-C6-membered heteroalkylene, C3-C8-membered cycloalkylene, or C3-C8-membered heterocyclylene, wherein the term “membered” refers to the non-hydrogen atoms within the moiety. In the case of heteroalkylene and heterocyclylene groups, heteroatoms can also occupy either or both of the chain termini (e.g., alkyleneoxy, alkylenedioxy, alkyleneamino, alkylenediamino, and the like). Still further, no orientation of the linking group is implied by the direction in which the formula of the linking group is written. For example, the formula —C(O)2R′— may represent both —C(O)2R′— and —R′C(O)2—.
As used herein, the terms “cyano” or “—CN” refer to a substituent having a carbon atom joined to a nitrogen atom by a triple bond, e.g., C≡N.
As used herein, the terms “halogen” or “halo” refer to fluorine, chlorine, bromine or iodine.
As used herein, the term “hydroxy” refers to —OH.
As used herein, the term “nitro” refers to a substitutent having two oxygen atoms bound to a nitrogen atom, e.g., —NO2.
As used herein, the term “nucleobase” as used herein, is a nitrogen-containing biological compounds found linked to a sugar within a nucleoside—the basic building blocks of deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). The primary, or naturally occurring, nucleobases are cytosine (DNA and RNA), guanine (DNA and RNA), adenine (DNA and RNA), thymine (DNA) and uracil (RNA), abbreviated as C, G, A, T, and U, respectively. Because A, G, C, and T appear in the DNA, these molecules are called DNA-bases; A, G, C, and U are called RNA-bases. Adenine and guanine belong to the double-ringed class of molecules called purines (abbreviated as R). Cytosine, thymine, and uracil are all pyrimidines. Other nucleobases that do not function as normal parts of the genetic code, are termed non-naturally occurring. In an embodiment, a nucleobase may be chemically modified, for example, with an alkyl (e.g., methyl), halo, —O-alkyl, or other modification.
As used herein, the term “nucleic acid” refers to deoxyribonucleic acids (DNA) or ribonucleic acids (RNA) and polymers thereof in either single- or double-stranded form. The term “nucleic acid” includes a gene, cDNA, pre-mRNA, or an mRNA. In one embodiment, the nucleic acid molecule is synthetic (e.g., chemically synthesized) or recombinant. Unless specifically limited, the term encompasses nucleic acids containing analogues or derivatives of natural nucleotides that have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions), alleles, orthologs, SNPs, and complementarity sequences as well as the sequence explicitly indicated.
As used herein, “oxo” refers to a carbonyl, i.e., —C(O)—.
The symbol “” as used herein in relation to a compound of Formula (I) or (II) refers to an attachment point to another moiety or functional group within the compound.
Alkyl, alkenyl, alkynyl, haloalkyl, heteroalkyl, cycloalkyl, heterocyclyl, aryl, and heteroaryl groups, as defined herein, are optionally substituted. In general, the term “substituted”, whether preceded by the term “optionally” or not, means that at least one hydrogen present on a group (e.g., a carbon or nitrogen atom) is replaced with a permissible substituent, e.g., a substituent which upon substitution results in a stable compound, e.g., a compound which does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, or other reaction. Unless otherwise indicated, a “substituted” group has a substituent at one or more substitutable positions of the group, and when more than one position in any given structure is substituted, the substituent is either the same or different at each position. The term “substituted” is contemplated to include substitution with all permissible substituents of organic compounds, such as any of the substituents described herein that result in the formation of a stable compound. The present disclosure contemplates any and all such combinations in order to arrive at a stable compound. For purposes of this invention, heteroatoms such as nitrogen may have hydrogen substituents and/or any suitable substituent as described herein which satisfy the valencies of the heteroatoms and results in the formation of a stable moiety.
Two or more substituents may optionally be joined to form aryl, heteroaryl, cycloalkyl, or heterocyclyl groups. Such so-called ring-forming substituents are typically, though not necessarily, found attached to a cyclic base structure. In one embodiment, the ring-forming substituents are attached to adjacent members of the base structure. For example, two ring-forming substituents attached to adjacent members of a cyclic base structure create a fused ring structure. In another embodiment, the ring-forming substituents are attached to a single member of the base structure. For example, two ring-forming substituents attached to a single member of a cyclic base structure create a spirocyclic structure. In yet another embodiment, the ring-forming substituents are attached to non-adjacent members of the base structure.
The compounds provided herein may exist in one or more particular geometric, optical, enantiomeric, diasteriomeric, epimeric, stereoisomeric, tautomeric, conformational, or anomeric forms, including but not limited to: cis- and trans-forms; E- and Z-forms; endo- and exo-forms; R-, S-, and meso-forms; D- and L-forms; d- and 1-forms; (+) and (−) forms; keto-, enol-, and enolate-forms; syn- and anti-forms; synclinal- and anticlinal-forms; α- and β-forms; axial and equatorial forms; boat-, chair-, twist-, envelope-, and half chair-forms; and combinations thereof, hereinafter collectively referred to as “isomers” (or “isomeric forms”).
Compounds described herein can comprise one or more asymmetric centers, and thus can exist in various isomeric forms, e.g., enantiomers and/or diastereomers. For example, the compounds described herein can be in the form of an individual enantiomer, diastereomer or geometric isomer, or can be in the form of a mixture of stereoisomers, including racemic mixtures and mixtures enriched in one or more stereoisomer. In an embodiment, the stereochemistry depicted in a compound is relative rather than absolute. Isomers can be isolated from mixtures by methods known to those skilled in the art, including chiral high-pressure liquid chromatography (HPLC) and the formation and crystallization of chiral salts; or preferred isomers can be prepared by asymmetric syntheses. See, for example, Jacques et al., Enantiomers, Racemates and Resolutions (Wiley Interscience, New York, 1981); Wilen et al., Tetrahedron 33:2725 (1977); Eliel, Stereochemistry of Carbon Compounds (McGraw-Hill, N Y, 1962); and Wilen, Tables of Resolving Agents and Optical Resolutions p. 268 (E. L. Eliel, Ed., Univ. of Notre Dame Press, Notre Dame, IN 1972). This disclosure additionally encompasses compounds described herein as individual isomers substantially free of other isomers, and alternatively, as mixtures of various isomers.
As used herein, a pure enantiomeric compound is substantially free from other enantiomers or stereoisomers of the compound (i.e., in enantiomeric excess). In other words, an “S” form of the compound is substantially free from the “R” form of the compound and is, thus, in enantiomeric excess of the “R” form. The term “enantiomerically pure” or “pure enantiomer” denotes that the compound comprises more than 75% by weight, more than 80% by weight, more than 85% by weight, more than 90% by weight, more than 91% by weight, more than 92% by weight, more than 93% by weight, more than 94% by weight, more than 95% by weight, more than 96% by weight, more than 97% by weight, more than 98% by weight, more than 99% by weight, more than 99.5% by weight, or more than 99.9% by weight, of the enantiomer. In certain embodiments, the weights are based upon total weight of all enantiomers or stereoisomers of the compound.
In the compositions provided herein, an enantiomerically pure compound can be present with other active or inactive ingredients. For example, a pharmaceutical composition comprising an enantiomerically pure R-compound can comprise, for example, about 90% excipient and about 10% enantiomerically pure R-compound. In certain embodiments, the enantiomerically pure R-compound in such compositions can, for example, comprise, at least about 95% by weight R-compound and at most about 5% by weight S-compound, by total weight of the compound. For example, a pharmaceutical composition comprising an enantiomerically pure S-compound can comprise, for example, about 90% excipient and about 10% enantiomerically pure S-compound. In certain embodiments, the enantiomerically pure S-compound in such compositions can, for example, comprise, at least about 95% by weight S-compound and at most about 5% by weight R-compound, by total weight of the compound.
In some embodiments, a diastereomerically pure compound can be present with other active or inactive ingredients. For example, a pharmaceutical composition comprising a diastereometerically pure exo compound can comprise, for example, about 90% excipient and about 10% diastereometerically pure exo compound. In certain embodiments, the diastereometerically pure exo compound in such compositions can, for example, comprise, at least about 95% by weight exo compound and at most about 5% by weight endo compound, by total weight of the compound. For example, a pharmaceutical composition comprising a diastereometerically pure endo compound can comprise, for example, about 90% excipient and about 10% diastereometerically pure endo compound. In certain embodiments, the diastereometerically pure endo compound in such compositions can, for example, comprise, at least about 95% by weight endo compound and at most about 5% by weight exo compound, by total weight of the compound.
In some embodiments, an isomerically pure compound can be present with other active or inactive ingredients. For example, a pharmaceutical composition comprising a isomerically pure exo compound can comprise, for example, about 90% excipient and about 10% isomerically pure exo compound. In certain embodiments, the isomerically pure exo compound in such compositions can, for example, comprise, at least about 95% by weight exo compound and at most about 5% by weight endo compound, by total weight of the compound. For example, a pharmaceutical composition comprising an isomerically pure endo compound can comprise, for example, about 90% excipient and about 10% isomerically pure endo compound. In certain embodiments, the isomerically pure endo compound in such compositions can, for example, comprise, at least about 95% by weight endo compound and at most about 5% by weight exo compound, by total weight of the compound.
In certain embodiments, the active ingredient can be formulated with little or no excipient or carrier.
Compound described herein may also comprise one or more isotopic substitutions. For example, H may be in any isotopic form, including 1H, 2H (D or deuterium), and 3H (T or tritium); C may be in any isotopic form, including 12C, 13C, and 14C; O may be in any isotopic form, including 16O and 18O; N may be in any isotopic form, including 14N and 15N; F may be in any isotopic form, including 18F, 19F, and the like.
The term “pharmaceutically acceptable salt” is meant to include salts of the active compounds that are prepared with relatively nontoxic acids or bases, depending on the particular substituents found on the compounds described herein. When compounds of the present disclosure contain relatively acidic functionalities, base addition salts can be obtained by contacting the neutral form of such compounds with a sufficient amount of the desired base, either neat or in a suitable inert solvent. Examples of pharmaceutically acceptable base addition salts include sodium, potassium, calcium, ammonium, organic amino, or magnesium salt, or a similar salt. When compounds of the present invention contain relatively basic functionalities, acid addition salts can be obtained by contacting the neutral form of such compounds with a sufficient amount of the desired acid, either neat or in a suitable inert solvent. Examples of pharmaceutically acceptable acid addition salts include those derived from inorganic acids like hydrochloric, hydrobromic, nitric, carbonic, monohydrogencarbonic, phosphoric, monohydrogenphosphoric, dihydrogenphosphoric, sulfuric, monohydrogensulfuric, hydriodic, or phosphorous acids and the like, as well as the salts derived from organic acids like acetic, propionic, isobutyric, maleic, malonic, benzoic, succinic, suberic, fumaric, lactic, mandelic, phthalic, benzenesulfonic, p-tolylsulfonic, citric, tartaric, methanesulfonic, and the like. Also included are salts of amino acids such as arginate and the like, and salts of organic acids like glucuronic or galactunoric acids and the like (see, e.g., Berge et al, Journal of Pharmaceutical Science 66: 1-19 (1977)). Certain specific compounds of the present invention contain both basic and acidic functionalities that allow the compounds to be converted into either base or acid addition salts. These salts may be prepared by methods known to those skilled in the art. Other pharmaceutically acceptable carriers known to those of skill in the art are suitable for the present invention.
In addition to salt forms, the present disclosure provides compounds in a prodrug form. Prodrugs of the compounds described herein are those compounds that readily undergo chemical changes under physiological conditions to provide the compounds of the present invention. Additionally, prodrugs can be converted to the compounds of the present invention by chemical or biochemical methods in an ex vivo environment. For example, prodrugs can be slowly converted to the compounds of the present invention when placed in a transdermal patch reservoir with a suitable enzyme or chemical reagent.
The term “solvate” refers to forms of the compound that are associated with a solvent, usually by a solvolysis reaction. This physical association may include hydrogen bonding. Conventional solvents include water, methanol, ethanol, acetic acid, DMSO, THF, diethyl ether, and the like. The compounds of Formulas (I) or (II) may be prepared, e.g., in crystalline form, and may be solvated. Suitable solvates include pharmaceutically acceptable solvates and further include both stoichiometric solvates and non-stoichiometric solvates. In certain instances, the solvate will be capable of isolation, for example, when one or more solvent molecules are incorporated in the crystal lattice of a crystalline solid. “Solvate” encompasses both solution-phase and isolable solvates. Representative solvates include hydrates, ethanolates, and methanolates.
The term “hydrate” refers to a compound which is associated with water. Typically, the number of the water molecules contained in a hydrate of a compound is in a definite ratio to the number of the compound molecules in the hydrate. Therefore, a hydrate of a compound may be represented, for example, by the general formula R·x H2O, wherein R is the compound and wherein x is a number greater than 0. A given compound may form more than one type of hydrates, including, e.g., monohydrates (x is 1), lower hydrates (x is a number greater than 0 and smaller than 1, e.g., hemihydrates (R·0.5 H2O)), and polyhydrates (x is a number greater than 1, e.g., dihydrates (R·2 H2O) and hexahydrates (R·6 H2O)).
The term “tautomer” refers to compounds that are interchangeable forms of a particular compound structure, and that vary in the displacement of hydrogen atoms and electrons. Thus, two structures may be in equilibrium through the movement of 71 electrons and an atom (usually H). For example, enols and ketones are tautomers because they are rapidly interconverted by treatment with either acid or base. Another example of tautomerism is the aci- and nitro-forms of phenylnitromethane that are likewise formed by treatment with acid or base. Tautomeric forms may be relevant to the attainment of the optimal chemical reactivity and biological activity of a compound of interest.
The following definitions are more general terms used throughout the present disclosure.
The articles “a” and “an” refer to one or more than one (e.g., to at least one) of the grammatical object of the article. By way of example, “an element” means one element or more than one element. The term “and/or” means either “and” or “or” unless indicated otherwise.
The term “about” is used herein to mean within the typical ranges of tolerances in the art. For example, “about” can be understood as about 2 standard deviations from the mean. In certain embodiments, about means ±10%. In certain embodiments, about means ±5%. When about is present before a series of numbers or a range, it is understood that “about” can modify each of the numbers in the series or range.
“Acquire” or “acquiring” as used herein, refer to obtaining possession of a value, e.g., a numerical value, or image, or a physical entity (e.g., a sample), by “directly acquiring” or “indirectly acquiring” the value or physical entity. “Directly acquiring” means performing a process (e.g., performing an analytical method or protocol) to obtain the value or physical entity. “Indirectly acquiring” refers to receiving the value or physical entity from another party or source (e.g., a third-party laboratory that directly acquired the physical entity or value). Directly acquiring a value or physical entity includes performing a process that includes a physical change in a physical substance or the use of a machine or device. Examples of directly acquiring a value include obtaining a sample from a human subject. Directly acquiring a value includes performing a process that uses a machine or device, e.g., mass spectrometer to acquire mass spectrometry data.
The terms “administer,” “administering,” or “administration,” as used herein refers to implanting, absorbing, ingesting, injecting, inhaling, or otherwise introducing an inventive compound, or a pharmaceutical composition thereof.
As used herein, the terms “condition,” “disease,” and “disorder” are used interchangeably.
An “effective amount” of a compound of Formulas (I) or (II) refers to an amount sufficient to elicit the desired biological response, i.e., treating the condition. As will be appreciated by those of ordinary skill in this art, the effective amount of a compound of Formulas (I) or (II) may vary depending on such factors as the desired biological endpoint, the pharmacokinetics of the compound, the condition being treated, the mode of administration, and the age and health of the subject. An effective amount encompasses therapeutic and prophylactic treatment. For example, in treating cancer, an effective amount of an inventive compound may reduce the tumor burden or stop the growth or spread of a tumor.
A “therapeutically effective amount” of a compound of Formulas (I) or (II) is an amount sufficient to provide a therapeutic benefit in the treatment of a condition or to delay or minimize one or more symptoms associated with the condition. In some embodiments, a therapeutically effective amount is an amount sufficient to provide a therapeutic benefit in the treatment of a condition or to minimize one or more symptoms associated with the condition. A therapeutically effective amount of a compound means an amount of therapeutic agent, alone or in combination with other therapies, which provides a therapeutic benefit in the treatment of the condition. The term “therapeutically effective amount” can encompass an amount that improves overall therapy, reduces or avoids symptoms or causes of the condition, or enhances the therapeutic efficacy of another therapeutic agent.
The terms “peptide,” “polypeptide,” and “protein” are used interchangeably, and refer to a compound comprised of amino acid residues covalently linked by peptide bonds. A protein or peptide must contain at least two amino acids, and no limitation is placed on the maximum number of amino acids that can comprised therein. Polypeptides include any peptide or protein comprising two or more amino acids joined to each other by peptide bonds. As used herein, the term refers to both short chains, which also commonly are referred to in the art as peptides, oligopeptides and oligomers, for example, and to longer chains, which generally are referred to in the art as proteins, of which there are many types.
“Prevention,” “prevent,” and “preventing” as used herein refers to a treatment that comprises administering a therapy, e.g., administering a compound described herein (e.g., a compound of Formulas (I) or (II)) prior to the onset of a disease, disorder, or condition in order to preclude the physical manifestation of said disease, disorder, or condition. In some embodiments, “prevention,” “prevent,” and “preventing” require that signs or symptoms of the disease, disorder, or condition have not yet developed or have not yet been observed. In some embodiments, treatment comprises prevention and in other embodiments it does not.
A “subject” to which administration is contemplated includes, but is not limited to, humans (i.e., a male or female of any age group, e.g., a pediatric subject (e.g., infant, child, adolescent) or adult subject (e.g., young adult, middle-aged adult, or senior adult)) and/or other non-human animals, for example, mammals (e.g., primates (e.g., cynomolgus monkeys, rhesus monkeys); commercially relevant mammals such as cattle, pigs, horses, sheep, goats, cats, and/or dogs) and birds (e.g., commercially relevant birds such as chickens, ducks, geese, and/or turkeys). In certain embodiments, the animal is a mammal. The animal may be a male or female and at any stage of development. A non-human animal may be a transgenic animal.
As used herein, the terms “treatment,” “treat,” and “treating” refer to reversing, alleviating, delaying the onset of, or inhibiting the progress of one or more of a symptom, manifestation, or underlying cause of a disease, disorder, or condition (e.g., as described herein), e.g., by administering a therapy, e.g., administering a compound described herein (e.g., a compound of Formulas (I) or (II)). In an embodiment, treating comprises reducing, reversing, alleviating, delaying the onset of, or inhibiting the progress of a symptom of a disease, disorder, or condition. In an embodiment, treating comprises reducing, reversing, alleviating, delaying the onset of, or inhibiting the progress of a manifestation of a disease, disorder, or condition. In an embodiment, treating comprises reducing, reversing, alleviating, reducing, or delaying the onset of, an underlying cause of a disease, disorder, or condition. In some embodiments, “treatment,” “treat,” and “treating” require that signs or symptoms of the disease, disorder, or condition have developed or have been observed. In other embodiments, treatment may be administered in the absence of signs or symptoms of the disease or condition, e.g., in preventive treatment. For example, treatment may be administered to a susceptible individual prior to the onset of symptoms (e.g., in light of a history of symptoms and/or in light of genetic or other susceptibility factors). Treatment may also be continued after symptoms have resolved, for example, to delay or prevent recurrence. Treatment may also be continued after symptoms have resolved, for example, to delay or prevent recurrence. In some embodiments, treatment comprises prevention and in other embodiments it does not.
A “proliferative disease” refers to a disease that occurs due to abnormal extension by the multiplication of cells (Walker, Cambridge Dictionary of Biology; Cambridge University Press: Cambridge, UK, 1990). A proliferative disease may be associated with: 1) the pathological proliferation of normally quiescent cells; 2) the pathological migration of cells from their normal location (e.g., metastasis of neoplastic cells); 3) the pathological expression of proteolytic enzymes such as the matrix metalloproteinases (e.g., collagenases, gelatinases, and elastases); 4) the pathological angiogenesis as in proliferative retinopathy and tumor metastasis; or 5) evasion of host immune surveillance and elimination of neoplastic cells. Exemplary proliferative diseases include cancers (i.e., “malignant neoplasms”), benign neoplasms, and angiogenesis.
A “non-proliferative disease” refers to a disease that does not primarily extend through the abnormal multiplication of cells. A non-proliferative disease may be associated with any cell type or tissue type in a subject. Exemplary non-proliferative diseases include neurological diseases or disorders (e.g., a repeat expansion disease); autoimmune disease or disorders; immunodeficiency diseases or disorders; lysosomal storage diseases or disorders; inflammatory diseases or disorders; cardiovascular conditions, diseases, or disorders; metabolic diseases or disorders; respiratory conditions, diseases, or disorders; renal diseases or disorders; and infectious diseases.
In one aspect, the present disclosure features a compound of Formula (I):
In another aspect, the present disclosure features a compound of Formula (II):
As generally described herein for compounds of Formula (I) and (II), each of A or B are independently cycloalkyl, heterocyclyl, aryl, or heteroaryl, each of which is optionally substituted with one or more R1.
In some embodiments, each of A and B are independently a monocyclic ring, e.g., monocyclic cycloalkyl, monocyclic heterocyclyl, monocyclic aryl, or monocyclic heteroaryl. The monocyclic ring may be saturated, partially unsaturated, or fully unsaturated (e.g., aromatic). In some embodiments, A or B are independently a monocyclic ring comprising between 3 and 10 ring atoms (e.g., 3, 4, 5, 6, 7, 8, 9, or 10 ring atoms). In some embodiments, A is a 4-membered monocyclic ring. In some embodiments, B is a 4-membered monocyclic ring. In some embodiments, A is a 5-membered monocyclic ring. In some embodiments, B is a 5-membered monocyclic ring. In some embodiments, A is a 6-membered monocyclic ring. In some embodiments, B is a 6-membered monocyclic ring. In some embodiments, A is a 7-membered monocyclic ring. In some embodiments, B is a 7-membered monocyclic ring. In some embodiments, A is an 8-membered monocyclic ring. In some embodiments, B is an 8-membered monocyclic ring. In some embodiments, A or B are independently a monocyclic ring optionally substituted with one or more R1.
In some embodiments, A or B are independently a bicyclic ring, e.g., bicyclic cycloalkyl, bicyclic heterocyclyl, bicyclic aryl, or bicyclic heteroaryl. The bicyclic ring may be saturated, partially unsaturated, or fully unsaturated (e.g., aromatic). In some embodiments, A or B are independently a bicyclic ring comprising a fused, bridged, or spiro ring system. In some embodiments, A or B are independently a bicyclic ring comprising between 4 and 18 ring atoms (e.g., 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, or 18 ring atoms). In some embodiments, A is a 6-membered bicyclic ring. In some embodiments, B is a 6-membered bicyclic ring. In some embodiments, A is a 7-membered bicyclic ring. In some embodiments, B is a 7-membered bicyclic ring. In some embodiments, A is an 8-membered bicyclic ring. In some embodiments, B is an 8-membered bicyclic ring. In some embodiments, A is a 9-membered bicyclic ring. In some embodiments, B is a 9-membered bicyclic ring. In some embodiments, A is a 10-membered bicyclic ring. In some embodiments, B is a 10-membered bicyclic ring. In some embodiments, A is an 11-membered bicyclic ring. In some embodiments, B is an 11-membered bicyclic ring. In some embodiments, A is a 12-membered bicyclic ring. In some embodiments, B is a 12-membered bicyclic ring. In some embodiments, A or B are independently a bicyclic ring optionally substituted with one or more R1.
In some embodiments, A or B are independently a tricyclic ring, e.g., tricyclic cycloalkyl, tricyclic heterocyclyl, tricyclic aryl, or tricyclic heteroaryl. The tricyclic ring may be saturated, partially unsaturated, or fully unsaturated (e.g., aromatic). In some embodiments, A or B are independently a tricyclic ring that comprises a fused, bridged, or spiro ring system, or a combination thereof. In some embodiments, A or B are independently a tricyclic ring comprising between 6 and 24 ring atoms (e.g., 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 ring atoms). In some embodiments, A is an 8-membered tricyclic ring. In some embodiments, B is an 8-membered tricyclic ring. In some embodiments, A is a 9-membered tricyclic ring. In some embodiments, B is a 9-membered tricyclic ring. In some embodiments, A is a 10-membered tricyclic ring. In some embodiments, B is a 10-membered tricyclic ring. In some embodiments, A or B are independently a tricyclic ring optionally substituted with one or more R1.
In some embodiments, A or B are independently monocyclic cycloalkyl, monocyclic heterocyclyl, monocyclic aryl, or monocyclic heteroaryl. In some embodiments, A or B are independently bicyclic cycloalkyl, bicyclic heterocyclyl, bicyclic aryl, or bicyclic heteroaryl. In some embodiments, A or B are independently tricyclic cycloalkyl, tricyclic heterocyclyl, tricyclic aryl, or tricyclic heteroaryl. In some embodiments, A is monocyclic heterocyclyl. In some embodiments, B is monocyclic heterocyclyl. In some embodiments, A is bicyclic heterocyclyl. In some embodiments, B is bicyclic heterocyclyl. In some embodiments, A is monocyclic heteroaryl. In some embodiments, B is monocyclic heteroaryl. In some embodiments, A is bicyclic heteroaryl. In some embodiments, B is bicyclic heteroaryl.
In some embodiments, A or B are independently a nitrogen-containing heterocyclyl, e.g., heterocyclyl comprising one or more nitrogen atom. The one or more nitrogen atom of the nitrogen-containing heterocyclyl may be at any position of the ring. In some embodiments, the nitrogen-containing heterocyclyl is monocyclic, bicyclic, or tricyclic. In some embodiments, A or B are independently heterocyclyl comprising at least 1, at least 2, at least 3, at least 4, at least 5, or at least 6 nitrogen atoms. In some embodiments, A is heterocyclyl comprising 1 nitrogen atom. In some embodiments, B is heterocyclyl comprising 1 nitrogen atom. In some embodiments, A is heterocyclyl comprising 2 nitrogen atoms. In some embodiments, B is heterocyclyl comprising 2 nitrogen atoms. In some embodiments, A is heterocyclyl comprising 3 nitrogen atoms. In some embodiments, B is heterocyclyl comprising 3 nitrogen atoms. In some embodiments, A is heterocyclyl comprising 4 nitrogen atoms. In some embodiments, B is heterocyclyl comprising 4 nitrogen atoms. In some embodiments, A or B are independently a nitrogen-containing heterocyclyl comprising one or more additional heteroatoms, e.g., one or more of oxygen, sulfur, boron, silicon, or phosphorus. In some embodiments, the one or more nitrogen of the nitrogen-containing heterocyclyl is substituted, e.g., with R1.
In some embodiments, A or B are independently a nitrogen-containing heteroaryl, e.g., heteroaryl comprising one or more nitrogen atom. The one or more nitrogen atom of the nitrogen-containing heteroaryl may be at any position of the ring. In some embodiments, the nitrogen-containing heteroaryl is monocyclic, bicyclic, or tricyclic. In some embodiments, A or B are independently heteroaryl comprising at least 1, at least 2, at least 3, at least 4, at least 5, or at least 6 nitrogen atoms. In some embodiments, A is heteroaryl comprising 1 nitrogen atom. In some embodiments, B is heteroaryl comprising 1 nitrogen atom. In some embodiments, A is heteroaryl comprising 2 nitrogen atoms. In some embodiments, B is heteroaryl comprising 2 nitrogen atoms. In some embodiments, A is heteroaryl comprising 3 nitrogen atoms. In some embodiments, B is heteroaryl comprising 3 nitrogen atoms. In some embodiments, A is heteroaryl comprising 4 nitrogen atoms. In some embodiments, B is heteroaryl comprising 4 nitrogen atoms. In some embodiments, A or B are independently a nitrogen-containing heteroaryl comprising one or more additional heteroatoms, e.g., one or more of oxygen, sulfur, boron, silicon, or phosphorus. In some embodiments, the one or more nitrogen of the nitrogen-containing heteroaryl is substituted, e.g., with R1.
In some embodiments, A is a 6-membered nitrogen-containing heterocyclyl, e.g., a 6-membered heterocyclyl comprising one or more nitrogen. In some embodiments, A is a 6-membered heterocyclyl comprising 1 nitrogen atom. In some embodiments, A is a 6-membered heterocyclyl comprising 2 nitrogen atoms. In some embodiments, A is a 6-membered heterocyclyl comprising 3 nitrogen atoms. In some embodiments, A is a 6-membered heterocyclyl comprising 4 nitrogen atoms. The one or more nitrogen atom of the 6-membered nitrogen-containing heterocyclyl may be at any position of the ring. In some embodiments, A is a 6-membered nitrogen-containing heterocyclyl optionally substituted with one or more R1. In some embodiments, the one or more nitrogen of the 6-membered nitrogen-containing heterocyclyl is substituted, e.g., with R1. In some embodiments, A is a 6-membered nitrogen-containing heterocyclyl comprising one or more additional heteroatoms, e.g., one or more of oxygen, sulfur, boron, silicon, or phosphorus.
In some embodiments, B is a 5-membered nitrogen-containing heterocyclyl or heteroaryl, e.g., a 5-membered heterocyclyl or heteroaryl comprising one or more nitrogen. In some embodiments, B is a 5-membered heterocyclyl comprising 1 nitrogen atom. In some embodiments, B is a 5-membered heteroaryl comprising 1 nitrogen atom. In some embodiments, B is a 5-membered heterocyclyl comprising 2 nitrogen atoms. In some embodiments, B is a 5-membered heteroaryl comprising 2 nitrogen atoms. In some embodiments, B is a 5-membered heterocyclyl comprising 3 nitrogen atoms. In some embodiments, B is a 5-membered heteroaryl comprising 3 nitrogen atoms. The one or more nitrogen atom of the 5-membered nitrogen-containing heterocyclyl or heteroaryl may be at any position of the ring. In some embodiments, B is a 5-membered nitrogen-containing heterocyclyl optionally substituted with one or more R1. In some embodiments, B is a 5-membered nitrogen-containing heteroaryl optionally substituted with one or more R1. In some embodiments, the one or more nitrogen of the 5-membered nitrogen-containing heterocyclyl or heteroaryl is substituted, e.g., with R1. In some embodiments, B is a 5-membered nitrogen-containing heterocyclyl or heteroaryl comprising one or more additional heteroatoms, e.g., one or more of oxygen, sulfur, boron, silicon, or phosphorus.
In some embodiments, B is a nitrogen-containing bicyclic heteroaryl (e.g., a 9-membered nitrogen-containing bicyclic heteroaryl), that is optionally substituted with one or more R1. In some embodiments, B is a 9-membered bicyclic heteroaryl comprising 1 nitrogen atom. In some embodiments, B is a 9-membered bicyclic heteroaryl comprising 2 nitrogen atoms. In some embodiments, B is a 9-membered bicyclic heteroaryl comprising 3 nitrogen atoms. In some embodiments, B is a 9-membered bicyclic heteroaryl comprising 4 nitrogen atoms. The one or more nitrogen atom of the 9-membered bicyclic heteroaryl may be at any position of the ring. In some embodiments, B is a 9-membered bicyclic heteroaryl substituted with one or more R1.
In some embodiments, each of A and B are independently selected from:
wherein each R1 is as defined herein. In an embodiment, A and B are each independently a saturated, partially saturated, or unsaturated (e.g., aromatic) derivative of one of the rings described above. In an embodiment, A and B are each independently a stereoisomer of one of the rings described above.
In some embodiments, each of A and B are independently selected from:
wherein each R1 is as defined herein. In an embodiment, A and B are each independently a saturated, partially saturated, or unsaturated (e.g., aromatic) derivative of one of the rings described above. In an embodiment, A and B are each independently a stereoisomer of one of the rings described above.
In some embodiments, A is selected from
In some embodiments, A is selected from
wherein R1 is as defined herein.
In some embodiments, A is selected from
and wherein R1 is as defined herein.
In some embodiments, A is selected from,
In some embodiments, A is selected from
wherein R1 is as defined herein.
In some embodiments, A is selected from
In some embodiments, A is selected from
In some embodiments, A is
wherein R1 is as defined herein.
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, B is selected from
wherein R is as defined herein.
In some embodiments, B is selected from
In some embodiments, B is selected from
In some embodiments, B is selected from
In some embodiments, B is selected from
In some embodiments, B is selected from
wherein R1 is as defined herein.
In some embodiments, B is selected from
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
As generally described herein for Formulas (I) and (II), L may be absent or refer to a C1-C6-alkylene, C1-C6-heteroalkylene, —O—, —C(O)—, —N(R3)—, —N(R3)C(O)—, or —C(O)N(R3)— group, wherein each alkylene and heteroalkylene is optionally substituted with one or more R4. In some embodiments, L is absent or C1-C6-heteroalkylene. In some embodiments, L is absent. In some embodiments, L is C1-C6-heteroalkylene (e.g., —N(CH3)—) As generally described herein for Formula (I) or (II), X may be N or C. In some embodiments, X is N. In some embodiments, X is C.
As generally described herein for Formula (I) or (II), Y may be N, C, or C(R5b), wherein the bonds in the ring comprising X and Y may be single or double bonds as valency permits. In some embodiment, Y is N or C. In some embodiments, Y is N. In some embodiments, Y is C. In some embodiments, Y is C(R5b) (e.g., CH). As generally described herein for Formula (II), Y may be N, N(R5a), C(R5b), or C(R5b)(R5c) wherein the bonds in the ring comprising X and Y may be single or double bonds as valency permits. In some embodiments, Y is N(R5a) or C(R5b) In some embodiments, Y is N(R5a) (e.g., NH). In some embodiments, Y is C(R5b) (e.g., CH).
In some embodiments, X is C and Y is N. In some embodiments, X is C and Y is C. In some embodiments, X is N and Y is N. In some embodiments, X is N and Y is C.
As generally described herein for Formulas (I) and (II), Z may be N or C(R6). In some embodiments, Z is N. In some embodiments, Z is C(R6) (e.g., CH).
In some embodiments, X is C and Z is N. In some embodiments, X is N and Z is N. In some embodiments, X is N and Z is C(R6). In some embodiments, X is N and Z is CH. In some embodiments, Y is N and Z is N. In some embodiments, Y is C and Z is N. In some embodiments, Y is C and Z is C(R6).
In some embodiments, X is C, Y is N, and Z is N. In some embodiments, X is N, Y is C(R5b), and Z is N. In some embodiments, X is N, Y is CH and Z is N. In some embodiments, X is N, Y is C, and Z is C(R6). In some embodiments, X is N, Y is CH and Z is CH.
In some embodiments, R2 is absent.
In some embodiments, R1 is C1-C6-alkyl. In some embodiments, R1 is CH3. In some embodiments, A is substituted with 0 or 1 R1. In some embodiments, B is substituted with 0, 1, or 2 R1.
In some embodiments, the compound of Formula (I) is a compound of Formula (I-a):
or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, wherein A and B are each independently cycloalkyl, heterocyclyl, aryl, or heteroaryl, each of which is optionally substituted with one or more R1; L is absent, C1-C6-alkylene, C1-C6-heteroalkylene, —O—, —C(O)—, —N(R3)—, —N(R3)C(O)—, or —C(O)N(R3)—, wherein each alkylene and heteroalkylene is optionally substituted with one or more R4; X is N or C; Y is N, C, or C(R5b) wherein the dashed lines representing bonds in the ring comprising X and Y may be single or double bonds as valency permits; each R1 is independently hydrogen, C1-C6-alkyl, C2-C6-alkenyl, C2-C6-alkynyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, C1-C6 alkylene-aryl, C1-C6 alkenylene-aryl, C1-C6 alkylene-heteroaryl, heteroaryl, halo, cyano, oxo, —ORA, —NRBRC, —NBC(O)RD, —NO2, —C(O)NRBRC, —C(O)R1, —C(O)OR1, or —S(O)R1, wherein each alkyl, alkylene, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R8; or two R1 groups, together with the atoms to which they are attached, form a 3-7-membered cycloalkyl, heterocyclyl, aryl, or heteroaryl, wherein each cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R8; R2 is absent, hydrogen, or C1-C6-alkyl; each R3 is independently hydrogen, C1-C6-alkyl, or C1-C6-haloalkyl; each R4 is independently C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, halo, cyano, oxo, —ORA, —NRBRC, —C(O)RD, or —C(O)ORD; R5b is hydrogen, C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, halo, or —ORA; R7 is hydrogen, C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, halo, or —ORA; each R8 is independently C1-C6-alkyl, C2-C6-alkenyl, C2-C6-alkynyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, halo, oxo, cyano, —ORA, —NRBRC, —NRBC(O)RD, —NO2, —C(O)NRBRC, —C(O)RD, —C(O)ORD, or —S(O)xRD, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R9; each R9 is independently C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, halo, cyano, oxo, or —ORA; each RA is independently hydrogen, C1-C6 alkyl, C1-C6 haloalkyl, aryl, heteroaryl, C1-C6 alkylene-aryl, C1-C6 alkylene-heteroaryl, —C(O)RD, or —S(O)xRD; each RB and RC is independently hydrogen, C1-C6 alkyl, C1-C6 heteroalkyl, cycloalkyl, heterocyclyl, —ORA; or RB and RC together with the atom to which they are attached form a 3-7-membered heterocyclyl ring optionally substituted with one or more R10; each R10 is independently hydrogen, C1-C6 alkyl, C2-C6 alkenyl, C2-C6 alkynyl, C1-C6 heteroalkyl, C1-C6 haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, C1-C6 alkylene-aryl, or C1-C6 alkylene-heteroaryl; each R10 is independently C1-C6-alkyl or halo; and x is 0, 1, or 2.
In some embodiments, A is heterocyclyl optionally substituted with one or more R1. In some embodiments, A is monocyclic nitrogen-containing heterocyclyl. In some embodiments, A is optionally substituted piperidinyl. In some embodiments, A is optionally substituted piperazinyl. In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is heteroaryl optionally substituted with one or more R1. In some embodiments, A is bicyclic nitrogen-containing heteroaryl. In some embodiments, A is optionally substituted indazolyl. In some embodiments, A is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
In some embodiments, B is heteroaryl optionally substituted with one or more R1. In some embodiments, B is bicyclic nitrogen-containing heteroaryl. In some embodiments, B is optionally substituted indazolyl. In some embodiments, B is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, B is selected from
In some embodiments, B is selected from
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is heterocyclyl optionally substituted with one or more R1. In some embodiments, B is monocyclic nitrogen-containing heterocyclyl. In some embodiments, B is optionally substituted piperidinyl. In some embodiments, B is
wherein R1 is as defined herein. In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, L is absent or C1-C6-heteroalkylene. In some embodiments, L is absent. In some embodiments, L is C1-C6-heteroalkylene (e.g., —N(CH3)—).
In some embodiments, X is N. In some embodiments, X is C. In some embodiments, Y is N. In some embodiments, Y is C. In some embodiments, X is C and Y is N. In some embodiments, X is N and Y is C. In some embodiments, R7 is hydrogen.
In some embodiments, the compound of Formula (I) is a compound of Formula (I-b):
or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, A and B are each independently cycloalkyl, heterocyclyl, aryl, or heteroaryl, each of which is optionally substituted with one or more R1; L is absent, C1-C6-alkylene, C1-C6-heteroalkylene, —O—, —C(O)—, —N(R3)—, —N(R3)C(O)—, or —C(O)N(R3)—, wherein each alkylene and heteroalkylene is optionally substituted with one or more R4; Z is N or C(R6); each R1 is independently hydrogen, C1-C6-alkyl, C2-C6-alkenyl, C2-C6-alkynyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, C1-C6 alkylene-aryl, C1-C6 alkenylene-aryl, C1-C6 alkylene-heteroaryl, heteroaryl, halo, cyano, oxo, —ORA, —NRBRC, —NRBC(O)RD, —NO2, —C(O)NRBRC, —C(O)RD, —C(O)ORD, or —S(O)xRD, wherein each alkyl, alkylene, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R1; or two R1 groups, together with the atoms to which they are attached, form a 3-7-membered cycloalkyl, heterocyclyl, aryl, or heteroaryl, wherein each cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R8; each R3 is independently hydrogen, C1-C6-alkyl, or C1-C6-haloalkyl; each R4 is independently C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, halo, cyano, oxo, —ORA, —NRBRC, —C(O)RD, or —C(O)ORD; R6 is hydrogen, C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, or halo; R7 is hydrogen, C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, halo, or —ORA; each R8 is independently C1-C6-alkyl, C2-C6-alkenyl, C2-C6-alkynyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, halo, oxo, cyano, —ORA, —NRBRC, —NRBC(O)RD, —NO2, —C(O)NRBRC, —C(O)RD, —C(O)ORD, or —S(O)xRD, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R9; each R9 is independently C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, halo, cyano, oxo, or —ORA; each RA is independently hydrogen, C1-C6 alkyl, C1-C6 haloalkyl, aryl, heteroaryl, C1-C6 alkylene-aryl, C1-C6 alkylene-heteroaryl, —C(O)RD, or —S(O)xRD; each RB and RC is independently hydrogen, C1-C6 alkyl, C1-C6 heteroalkyl, cycloalkyl, heterocyclyl, —ORA; or RB and RC together with the atom to which they are attached form a 3-7-membered heterocyclyl ring optionally substituted with one or more R10; each R10 is independently hydrogen, C1-C6 alkyl, C2-C6 alkenyl, C2-C6 alkynyl, C1-C6 heteroalkyl, C1-C6 haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, C1-C6 alkylene-aryl, or C1-C6 alkylene-heteroaryl; each R10 is independently C1-C6-alkyl or halo; and x is 0, 1, or 2.
In some embodiments, A is heterocyclyl optionally substituted with one or more R1. In some embodiments, A is monocyclic nitrogen-containing heterocyclyl. In some embodiments, A is optionally substituted piperidinyl. In some embodiments, A is optionally substituted piperazinyl. In some embodiments A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments A is R, wherein each
is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is heteroaryl optionally substituted with one or more R1. In some embodiments, A is bicyclic nitrogen-containing heteroaryl. In some embodiments, A is optionally substituted indazolyl. In some embodiments, A is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
In some embodiments, B is heteroaryl optionally substituted with one or more R1. In some embodiments, B is bicyclic nitrogen-containing heteroaryl. In some embodiments, B is optionally substituted indazolyl. In some embodiments, B is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, B is selected from
In some embodiments B is selected from
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is heterocyclyl optionally substituted with one or more R1. In some embodiments, B is monocyclic nitrogen-containing heterocyclyl. In some embodiments, B is optionally substituted piperidinyl. In some embodiments, B is
wherein R1 is as defined herein. In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, L is absent or C1-C6-heteroalkylene. In some embodiments, L is absent. In some embodiments, L is C1-C6-heteroalkylene (e.g., —N(CH3)—)
In some embodiments, Z is N. In some embodiments, Z is C(R6) (e.g., CH). In some embodiments, R7 is hydrogen.
In some embodiments, the compound of Formula (I) is a compound of Formula (I-c):
In some embodiments, A is heterocyclyl optionally substituted with one or more R1. In some embodiments, A is monocyclic nitrogen-containing heterocyclyl. In some embodiments, A is optionally substituted piperidinyl. In some embodiments, A is optionally substituted piperazinyl. In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is heteroaryl optionally substituted with one or more R1. In some embodiments, A is bicyclic nitrogen-containing heteroaryl. In some embodiments, A is optionally substituted indazolyl. In some embodiments, A is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
In some embodiments, B is heteroaryl optionally substituted with one or more R1. In some embodiments, B is bicyclic nitrogen-containing heteroaryl. In some embodiments, B is optionally substituted indazolyl. In some embodiments, B is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, B is selected from
In some embodiments, B is selected from
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is heterocyclyl optionally substituted with one or more R1. In some embodiments, B is monocyclic nitrogen-containing heterocyclyl. In some embodiments, B is optionally substituted piperidinyl. In some embodiments, B is
wherein R1 is as defined herein. In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, L is absent or C1-C6-heteroalkylene. In some embodiments, L is absent. In some embodiments, L is C1-C6-heteroalkylene (e.g., —N(CH3)—)
In some embodiments, Z is N. In some embodiments, Z is C(R6) (e.g., CH). In some embodiments, R7 is hydrogen.
In some embodiments, the compound of Formula (I) is selected from a compound in Table 1, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); B is bicyclic heterocyclyl (e.g., 2,7-dimethyl-2H-indazolyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 173, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); B is bicyclic heterocyclyl (e.g., 2,7-dimethyl-2H-indazolyl); L is absent; X is N; Y is C; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-c) is Compound 174, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is N; Y is C; Z is C(R6) (e.g., CH); R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I) and (I-c) is Compound 175, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 176, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is N; Y is C; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-c) is Compound 177, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); B is bicyclic heterocyclyl (e.g., 2,7-dimethyl-2H-indazolyl); L is absent; X is N; Y is C; Z is C(R6) (e.g., CH); R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I) and (I-c) is Compound 178, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is bicyclic heterocyclyl (e.g., 2,7-dimethyl-2H-indazolyl); B is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 179, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperazyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 181, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., piperazyl) substituted with R1; R1 is C1-C6 alkyl (e.g. C1 alkyl) substituted with R8; R8 is aryl (e.g., phenyl) substituted with R9; R9 is halo (e.g. F); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 185, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is bicyclic heterocyclyl (e.g., 8-fluoro-2-methylimidazo[1,2-a]pyridinyl); B is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 186, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is bicyclic heterocyclyl (e.g., 2,7-dimethyl-2H-indazolyl); B is monocyclic heterocyclyl (e.g., 1-carbaldehyde-2,2,6,6-tetramethylpiperidinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 187, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., 1-methyl-1,2,3,6-tetrahydropyridinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 188, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is —N(R3)— (e.g., —NH—); X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 189, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is —N(R3)— (e.g., —N(CH3)—); X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 190, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 197, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is N; Y is C; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-c) is Compound 198, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is N; Y is C; Z is C(R6) (e.g., CH); R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I) and (I-c) is Compound 199, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); B is monocyclic heterocyclyl (e.g., piperidinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 200, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); B is monocyclic heterocyclyl (e.g., piperidinyl); L is absent; X is N; Y is C; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (II-a), and (I-c) is Compound 201, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); B is monocyclic heterocyclyl (e.g., piperidinyl); L is absent; X is N; Y is C; Z is C(R6) (e.g., CH); R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I) and (I-c) is Compound 202, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., piperazyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 219, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., piperazyl); B is bicyclic heterocyclyl (e.g., 8-fluoro-2-methylimidazo[1,2-a]pyridinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 220, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperazyl); B is bicyclic heterocyclyl (e.g., 8-fluoro-2-methylimidazo[1,2-a]pyridinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 221, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., 1-methyl-1H-pyrazolyl); B is monocyclic heterocyclyl (e.g., piperidinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 222, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., piperazyl); B is bicyclic heterocyclyl (e.g., 2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 223, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (I), A is monocyclic heterocyclyl (e.g., N-methyl piperazyl); B is bicyclic heterocyclyl (e.g., 2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridinyl); L is absent; X is C; Y is N; Z is N; R2 is absent; and R7 is hydrogen. In some embodiments, the compound of Formulas (I), (I-a), and (I-b) is Compound 224, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, the compound of Formula (II) is a compound of Formula (II-a):
In some embodiments, A is heterocyclyl optionally substituted with one or more R1. In some embodiments, A is monocyclic nitrogen-containing heterocyclyl. In some embodiments, A is optionally substituted piperidinyl. In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is heteroaryl optionally substituted with one or more R1. In some embodiments, A is bicyclic nitrogen-containing heteroaryl. In some embodiments, A is optionally substituted indazolyl. In some embodiments, A is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is N
In some embodiments, B is heteroaryl optionally substituted with one or more R1. In some embodiments, B is bicyclic nitrogen-containing heteroaryl. In some embodiments, B is optionally substituted indazolyl. In some embodiments, B is selected from
In some embodiments, B is selected from
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is heterocyclyl optionally substituted with one or more R1. In some embodiments, B is monocyclic nitrogen-containing heterocyclyl. In some embodiments, B is optionally substituted piperazinyl. In some embodiments, B is
wherein R1 is as defined herein. In some embodiments, B is
In some embodiments, B is
In some embodiments, L is absent or C1-C6-heteroalkylene. In some embodiments, L is absent. In some embodiments, L is C1-C6-heteroalkylene (e.g., —N(CH3)—)
In some embodiments, Y is N(R5a) or C(R5b). In some embodiments, Y is N(R5a) (e.g., NH). In some embodiments, Y is C(R5b) (e.g., CH).
In some embodiments, Z is N. In some embodiments, Z is C(R6) (e.g., CH).
In some embodiments, Y is N(R5a) (e.g., NH) and Z is C(R6) (e.g., CH). In some embodiments, Y is C(R5b) (e.g., CH) and Z is C(R6) (e.g., CH). In some embodiments, Y is C(R5b) (e.g., CH) and Z is N.
In some embodiments, the compound of Formula (II) is a compound of Formula (II-b):
or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, wherein A and B are each independently cycloalkyl, heterocyclyl, aryl, or heteroaryl, each of which is optionally substituted with one or more R1; L is absent, C1-C6-alkylene, C1-C6-heteroalkylene, —O—, —C(O)—, —N(R3)—, —N(R3)C(O)—, or —C(O)N(R3)—, wherein each alkylene and heteroalkylene is optionally substituted with one or more R4; Z is N or C(R6); each R1 is independently hydrogen, C1-C6-alkyl, C2-C6-alkenyl, C2-C6-alkynyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, C1-C6 alkylene-aryl, C1-C6 alkenylene-aryl, C1-C6 alkylene-heteroaryl, heteroaryl, halo, cyano, oxo, —ORA, —NRBRC, —NRBC(O)RD, —NO2, —C(O)NRBRC, —C(O)RD, —C(O)ORD, or —S(O)xRD, wherein each alkyl, alkylene, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R7; or two R1 groups, together with the atoms to which they are attached, form a 3-7-membered cycloalkyl, heterocyclyl, aryl, or heteroaryl, wherein each cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R7; each R3 is independently hydrogen, C1-C6-alkyl, or C1-C6-haloalkyl; each R4 is independently C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, halo, cyano, oxo, —ORA, —NRBRC, —C(O)RD, or —C(O)ORD; R6 is hydrogen, C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, or halo; each R7 is independently C1-C6-alkyl, C2-C6-alkenyl, C2-C6-alkynyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, halo, oxo, cyano, —ORA, —NRBRC, —NRBC(O)RD, —NO2, —C(O)NRBRC, —C(O)RD, —C(O)ORD, or —S(O)xRD, wherein each alkyl, alkenyl, alkynyl, heteroalkyl, haloalkyl, cycloalkyl, heterocyclyl, aryl, and heteroaryl is optionally substituted with one or more R8; each R8 is independently C1-C6-alkyl, C1-C6-heteroalkyl, C1-C6-haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, halo, cyano, oxo, or —ORA; each RA is independently hydrogen, C1-C6 alkyl, C1-C6 haloalkyl, aryl, heteroaryl, C1-C6 alkylene-aryl, C1-C6 alkylene-heteroaryl, —C(O)RD, or —S(O)xRD; each RB and RC is independently hydrogen, C1-C6 alkyl, C1-C6 heteroalkyl, cycloalkyl, heterocyclyl, —ORA; or RB and RC together with the atom to which they are attached form a 3-7-membered heterocyclyl ring optionally substituted with one or more R9; each RD is independently hydrogen, C1-C6 alkyl, C2-C6 alkenyl, C2-C6 alkynyl, C1-C6 heteroalkyl, C1-C6 haloalkyl, cycloalkyl, heterocyclyl, aryl, heteroaryl, C1-C6 alkylene-aryl, or C1-C6 alkylene-heteroaryl; each R9 is independently C1-C6-alkyl or halo; and x is 0, 1, or 2.
In some embodiments, A is heterocyclyl optionally substituted with one or more R1. In some embodiments, A is monocyclic nitrogen-containing heterocyclyl. In some embodiments, A is optionally substituted piperidinyl. In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl some embodiments, A is
In some embodiments, A is
In some embodiments, A is heteroaryl optionally substituted with one or more R1. In some embodiments, A is bicyclic nitrogen-containing heteroaryl. In some embodiments, A is optionally substituted indazolyl. In some embodiments, A is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
In some embodiments, B is heteroaryl optionally substituted with one or more R1. In some embodiments, B is bicyclic nitrogen-containing heteroaryl. In some embodiments, B is optionally substituted indazolyl. In some embodiments, B is selected from
In some embodiments, B is selected from
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is heterocyclyl optionally substituted with one or more R1. In some embodiments, B is monocyclic nitrogen-containing heterocyclyl. In some embodiments, B is optionally substituted piperazinyl. In some embodiments, B is
wherein R1 is as defined herein. In some embodiments, B is
In some embodiments, B is
In some embodiments, L is absent or C1-C6-heteroalkylene. In some embodiments, L is absent. In some embodiments, L is C1-C6-heteroalkylene (e.g., —N(CH3)—)
In some embodiments, Z is N. In some embodiments, Z is C(R6) (e.g., CH).
In some embodiments, the compound of Formula (II) is a compound of Formula (II-c):
In some embodiments, A is heterocyclyl optionally substituted with one or more R1. In some embodiments, A is monocyclic nitrogen-containing heterocyclyl. In some embodiments, A is optionally substituted piperidinyl. In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is
wherein each R1 is independently hydrogen or C1-C6-alkyl. In some embodiments, A is
In some embodiments, A is
In some embodiments, A is heteroaryl optionally substituted with one or more R1. In some embodiments, A is bicyclic nitrogen-containing heteroaryl. In some embodiments, A is optionally substituted indazolyl. In some embodiments, A is optionally substituted imidazo[1,2-a]pyridinyl. In some embodiments, A is
wherein each R1 is as defined herein. In some embodiments, A is
In some embodiments, A is
wherein each R is as defined herein. In some embodiments, A is
In some embodiments, A is
In some embodiments, B is heteroaryl optionally substituted with one or more R1. In some embodiments, B is bicyclic nitrogen-containing heteroaryl. In some embodiments, B is optionally substituted indazolyl. In some embodiments, B is selected from
In some embodiments, B is selected from
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is
In some embodiments, B is heterocyclyl optionally substituted with one or more R1. In some embodiments, B is monocyclic nitrogen-containing heterocyclyl. In some embodiments, B is optionally substituted piperazinyl. In some embodiments B is
wherein R1 is as defined herein. In some embodiments, B is
In some embodiments, B is
In some embodiments, L is absent or C1-C6-heteroalkylene. In some embodiments, L is absent. In some embodiments, L is C1-C6-heteroalkylene (e.g., —N(CH3)—)
In some embodiments, Z is N. In some embodiments, Z is C(R6) (e.g., CH).
In some embodiments, the compound of Formula (II) is selected from a compound in Table 2, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (II), A is monocyclic heterocyclyl (e.g., N-methyl piperazyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; Y is N(R5a) (e.g., NH); Z is N; and R1 is absent. In some embodiments, the compound of Formulas (II), (II-a), and (II-b) is Compound 172, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (II), A is monocyclic heterocyclyl (e.g, piperidinyl); B is bicyclic heterocyclyl (e.g., 2-methyl-2H-indazolyl); L is absent; Y is N(R5a) (e.g., NH); Z is N; and R1 is absent. In some embodiments, the compound of Formulas (II), (II-a), and (II-b) is Compound 195, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
In some embodiments, for Formula (II), A is bicyclic heterocyclyl (e.g, 2-methyl-2H-indazolyl); B is monocyclic heterocyclyl (e.g., piperidinyl); L is absent; Y is N(R5a) (e.g., NH); Z is N; and R1 is absent. In some embodiments, the compound of Formulas (II), (II-a), and (II-b) is Compound 196, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof.
The present invention provides pharmaceutical compositions comprising a compound of Formula (I) or (II), e.g., a compound of Formula (I) or (II) or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer, as described herein, and optionally a pharmaceutically acceptable excipient. In certain embodiments, the pharmaceutical composition described herein comprises a compound of Formula (I) or (II) or a pharmaceutically acceptable salt thereof, and optionally a pharmaceutically acceptable excipient. In certain embodiments, the compound of Formula (I) or (II) or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, is provided in an effective amount in the pharmaceutical composition. In certain embodiments, the effective amount is a therapeutically effective amount. In certain embodiments, the effective amount is a prophylactically effective amount.
Pharmaceutical compositions described herein can be prepared by any method known in the art of pharmacology. In general, such preparatory methods include the steps of bringing the compound of Formula (I) or (II) (the “active ingredient”) into association with a carrier and/or one or more other accessory ingredients, and then, if necessary and/or desirable, shaping and/or packaging the product into a desired single- or multi-dose unit.
Pharmaceutical compositions can be prepared, packaged, and/or sold in bulk, as a single unit dose, and/or as a plurality of single unit doses. As used herein, a “unit dose” is a discrete amount of the pharmaceutical composition comprising a predetermined amount of the active ingredient. The amount of the active ingredient is generally equal to the dosage of the active ingredient which would be administered to a subject and/or a convenient fraction of such a dosage such as, for example, one-half or one-third of such a dosage.
Relative amounts of the active ingredient, the pharmaceutically acceptable excipient, and/or any additional ingredients in a pharmaceutical composition of the invention will vary, depending upon the identity, size, and/or condition of the subject treated and further depending upon the route by which the composition is to be administered. By way of example, the composition may comprise between 0.1% and 100% (w/w) active ingredient.
The term “pharmaceutically acceptable excipient” refers to a non-toxic carrier, adjuvant, diluent, or vehicle that does not destroy the pharmacological activity of the compound with which it is formulated. Pharmaceutically acceptable excipients useful in the manufacture of the pharmaceutical compositions of the invention are any of those that are well known in the art of pharmaceutical formulation and include inert diluents, dispersing and/or granulating agents, surface active agents and/or emulsifiers, disintegrating agents, binding agents, preservatives, buffering agents, lubricating agents, and/or oils. Pharmaceutically acceptable excipients useful in the manufacture of the pharmaceutical compositions of the invention include, but are not limited to, ion exchangers, alumina, aluminum stearate, lecithin, serum proteins, such as human serum albumin, buffer substances such as phosphates, glycine, sorbic acid, potassium sorbate, partial glyceride mixtures of saturated vegetable fatty acids, water, salts or electrolytes, such as protamine sulfate, disodium hydrogen phosphate, potassium hydrogen phosphate, sodium chloride, zinc salts, colloidal silica, magnesium trisilicate, polyvinyl pyrrolidone, cellulose-based substances, polyethylene glycol, sodium carboxymethylcellulose, polyacrylates, waxes, polyethylene-polyoxypropylene-block polymers, polyethylene glycol and wool fat.
Compositions of the present invention may be administered orally, parenterally (including subcutaneous, intramuscular, intravenous and intradermal), by inhalation spray, topically, rectally, nasally, buccally, vaginally or via an implanted reservoir. In some embodiments, provided compounds or compositions are administrable intravenously and/or orally.
The term “parenteral” as used herein includes subcutaneous, intravenous, intramuscular, intraocular, intravitreal, intra-articular, intra-synovial, intrasternal, intrathecal, intrahepatic, intraperitoneal intralesional and intracranial injection or infusion techniques. Preferably, the compositions are administered orally, subcutaneously, intraperitoneally, or intravenously. Sterile injectable forms of the compositions of this invention may be aqueous or oleaginous suspension. These suspensions may be formulated according to techniques known in the art using suitable dispersing or wetting agents and suspending agents. The sterile injectable preparation may also be a sterile injectable solution or suspension in a non-toxic parenterally acceptable diluent or solvent, for example as a solution in 1,3-butanediol. Among the acceptable vehicles and solvents that may be employed are water, Ringer's solution and isotonic sodium chloride solution. In addition, sterile, fixed oils are conventionally employed as a solvent or suspending medium.
Pharmaceutically acceptable compositions of this invention may be orally administered in any orally acceptable dosage form including, but not limited to, capsules, tablets, aqueous suspensions or solutions. In the case of tablets for oral use, carriers commonly used include lactose and corn starch. Lubricating agents, such as magnesium stearate, are also typically added. For oral administration in a capsule form, useful diluents include lactose and dried cornstarch. When aqueous suspensions are required for oral use, the active ingredient is combined with emulsifying and suspending agents. If desired, certain sweetening, flavoring or coloring agents may also be added. In some embodiments, a provided oral formulation is formulated for immediate release or sustained/delayed release. In some embodiments, the composition is suitable for buccal or sublingual administration, including tablets, lozenges and pastilles. A provided compound can also be in micro-encapsulated form.
Alternatively, pharmaceutically acceptable compositions of this invention may be administered in the form of suppositories for rectal administration. Pharmaceutically acceptable compositions of this invention may also be administered topically, especially when the target of treatment includes areas or organs readily accessible by topical application, including diseases of the eye, the skin, or the lower intestinal tract. Suitable topical formulations are readily prepared for each of these areas or organs.
For ophthalmic use, provided pharmaceutically acceptable compositions may be formulated as micronized suspensions or in an ointment such as petrolatum.
In order to prolong the effect of a drug, it is often desirable to slow the absorption of the drug from subcutaneous or intramuscular injection. This can be accomplished by the use of a liquid suspension of crystalline or amorphous material with poor water solubility. The rate of absorption of the drug then depends upon its rate of dissolution which, in turn, may depend upon crystal size and crystalline form. Alternatively, delayed absorption of a parenterally administered drug form is accomplished by dissolving or suspending the drug in an oil vehicle.
Although the descriptions of pharmaceutical compositions provided herein are principally directed to pharmaceutical compositions which are suitable for administration to humans, it will be understood by the skilled artisan that such compositions are generally suitable for administration to animals of all sorts. Modification of pharmaceutical compositions suitable for administration to humans in order to render the compositions suitable for administration to various animals is well understood, and the ordinarily skilled veterinary pharmacologist can design and/or perform such modification with ordinary experimentation.
Compounds provided herein are typically formulated in dosage unit form, e.g., single unit dosage form, for ease of administration and uniformity of dosage. It will be understood, however, that the total daily usage of the compositions of the present invention will be decided by the attending physician within the scope of sound medical judgment. The specific therapeutically effective dose level for any particular subject or organism will depend upon a variety of factors including the disease being treated and the severity of the disorder; the activity of the specific active ingredient employed; the specific composition employed; the age, body weight, general health, sex and diet of the subject; the time of administration, route of administration, and rate of excretion of the specific active ingredient employed; the duration of the treatment; drugs used in combination or coincidental with the specific active ingredient employed; and like factors well known in the medical arts.
The exact amount of a compound required to achieve an effective amount will vary from subject to subject, depending, for example, on species, age, and general condition of a subject, severity of the side effects or disorder, identity of the particular compound(s), mode of administration, and the like. The desired dosage can be delivered three times a day, two times a day, once a day, every other day, every third day, every week, every two weeks, every three weeks, or every four weeks. In certain embodiments, the desired dosage can be delivered using multiple administrations (e.g., two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, or more administrations).
In certain embodiments, an effective amount of a compound for administration one or more times a day to a 70 kg adult human may comprise about 0.0001 mg to about 3000 mg, about 0.0001 mg to about 2000 mg, about 0.0001 mg to about 1000 mg, about 0.001 mg to about 1000 mg, about 0.01 mg to about 1000 mg, about 0.1 mg to about 1000 mg, about 1 mg to about 1000 mg, about 1 mg to about 100 mg, about 10 mg to about 1000 mg, or about 100 mg to about 1000 mg, of a compound per unit dosage form.
In certain embodiments, the compounds of Formula (I) or (II) may be at dosage levels sufficient to deliver from about 0.001 mg/kg to about 100 mg/kg, from about 0.01 mg/kg to about 50 mg/kg, preferably from about 0.1 mg/kg to about 40 mg/kg, preferably from about 0.5 mg/kg to about 30 mg/kg, from about 0.01 mg/kg to about 10 mg/kg, from about 0.1 mg/kg to about 10 mg/kg, and more preferably from about 1 mg/kg to about 25 mg/kg, of subject body weight per day, one or more times a day, to obtain the desired therapeutic effect.
It will be appreciated that dose ranges as described herein provide guidance for the administration of provided pharmaceutical compositions to an adult. The amount to be administered to, for example, a child or an adolescent can be determined by a medical practitioner or person skilled in the art and can be lower or the same as that administered to an adult.
It will be also appreciated that a compound or composition, as described herein, can be administered in combination with one or more additional pharmaceutical agents. The compounds or compositions can be administered in combination with additional pharmaceutical agents that improve their bioavailability, reduce and/or modify their metabolism, inhibit their excretion, and/or modify their distribution within the body. It will also be appreciated that the therapy employed may achieve a desired effect for the same disorder, and/or it may achieve different effects.
The compound or composition can be administered concurrently with, prior to, or subsequent to, one or more additional pharmaceutical agents, which may be useful as, e.g., combination therapies. Pharmaceutical agents include therapeutically active agents. Pharmaceutical agents also include prophylactically active agents. Each additional pharmaceutical agent may be administered at a dose and/or on a time schedule determined for that pharmaceutical agent. The additional pharmaceutical agents may also be administered together with each other and/or with the compound or composition described herein in a single dose or administered separately in different doses. The particular combination to employ in a regimen will take into account compatibility of the inventive compound with the additional pharmaceutical agents and/or the desired therapeutic and/or prophylactic effect to be achieved. In general, it is expected that the additional pharmaceutical agents utilized in combination be utilized at levels that do not exceed the levels at which they are utilized individually. In some embodiments, the levels utilized in combination will be lower than those utilized individually.
Exemplary additional pharmaceutical agents include, but are not limited to, anti-proliferative agents, anti-cancer agents, anti-diabetic agents, anti-inflammatory agents, immunosuppressant agents, and a pain-relieving agent Pharmaceutical agents include small organic molecules such as drug compounds (e.g., compounds approved by the U.S. Food and Drug Administration as provided in the Code of Federal Regulations (CFR)), peptides, proteins, carbohydrates, monosaccharides, oligosaccharides, polysaccharides, nucleoproteins, mucoproteins, lipoproteins, synthetic polypeptides or proteins, small molecules linked to proteins, glycoproteins, steroids, nucleic acids, DNAs, RNAs, nucleotides, nucleosides, oligonucleotides, antisense oligonucleotides, lipids, hormones, vitamins, and cells.
Also encompassed by the invention are kits (e.g., pharmaceutical packs). The inventive kits may be useful for preventing and/or treating a proliferative disease or a non-proliferative disease, e.g., as described herein. The kits provided may comprise an inventive pharmaceutical composition or compound and a container (e.g., a vial, ampule, bottle, syringe, and/or dispenser package, or other suitable container). In some embodiments, provided kits may optionally further include a second container comprising a pharmaceutical excipient for dilution or suspension of an inventive pharmaceutical composition or compound. In some embodiments, the inventive pharmaceutical composition or compound provided in the container and the second container are combined to form one-unit dosage form.
Thus, in one aspect, provided are kits including a first container comprising a compound described herein, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, or a pharmaceutical composition thereof. In certain embodiments, the kit of the disclosure includes a first container comprising a compound described herein, or a pharmaceutically acceptable salt thereof, or a pharmaceutical composition thereof. In certain embodiments, the kits are useful in preventing and/or treating a disease, disorder, or condition described herein in a subject (e.g., a proliferative disease or a non-proliferative disease). In certain embodiments, the kits further include instructions for administering the compound, or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, or stereoisomer thereof, or a pharmaceutical composition thereof, to a subject to prevent and/or treat a proliferative disease or a non-proliferative disease.
Described herein are compounds useful for modulating splicing. In some embodiments, a compound of Formula (I) or (II) may be used to alter the amount, structure, or composition of a nucleic acid (e.g., a precursor RNA, e.g., a pre-mRNA, or the resulting mRNA) by increasing or decreasing splicing at a splice site. In some embodiments, increasing or decreasing splicing results in modulating the level or structure of a gene product (e.g., an RNA or protein) produced. In some embodiments, a compound of Formula (I) or (II) may modulate a component of the splicing machinery, e.g., by modulating the interaction with a component of the splicing machinery with another entity (e.g., nucleic acid, protein, or a combination thereof). The splicing machinery as referred to herein comprises one or more spliceosome components. Spliceosome components may comprise, for example, one or more of major spliceosome members (U1, U2, U4, U5, U6 snRNPs), or minor spliceosome members (U11, U12, U4atac, U6atac snRNPs) and their accessory splicing factors.
In another aspect, the present disclosure features a method of modifying of a target (e.g., a precursor RNA, e.g., a pre-mRNA) through inclusion of a splice site in the target, wherein the method comprises providing a compound of Formula (I) or (II). In some embodiments, inclusion of a splice site in a target (e.g., a precursor RNA, e.g., a pre-mRNA, or the resulting mRNA) results in addition or deletion of one or more nucleic acids to the target (e.g., a new exon, e.g. a skipped exon). Addition or deletion of one or more nucleic acids to the target may result in an increase in the levels of a gene product (e.g., RNA, e.g., mRNA, or protein).
In another aspect, the present disclosure features a method of modifying a target (e.g., a precursor RNA, e.g., a pre-mRNA, or the resulting mRNA) through exclusion of a splice site in the target, wherein the method comprises providing a compound of Formula (I) or (II). In some embodiments, exclusion of a splice site in a target (e.g., a precursor RNA, e.g., a pre-mRNA) results in deletion or addition of one or more nucleic acids from the target (e.g., a skipped exon, e.g. a new exon). Deletion or addition of one or more nucleic acids from the target may result in a decrease in the levels of a gene product (e.g., RNA, e.g., mRNA, or protein). In other embodiments, the methods of modifying a target (e.g., a precursor RNA, e.g., a pre-mRNA, or the resulting mRNA) comprise suppression of splicing at a splice site or enhancement of splicing at a splice site (e.g., by more than about 0.5%, e.g., 1%, 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99%, or more), e.g., as compared to a reference (e.g., the absence of a compound of Formula (I) or (II), or in a healthy or diseased cell or tissue).
The methods described herein can be used to modulate splicing, e.g., of a nucleic acid comprising a particular sequence (e.g., a target sequence). Exemplary genes encoding a target sequence (e.g., a target sequence comprising DNA or RNA, e.g., pre-mRNA) include, inter al/a, ABCA4, ABCA9, ABCB1, ABCB5, ABCC9, ABCD1, ACADL, ACADM, ACADSB, ACSS2, ACTB, ACTG2, ADA, ADAL, ADAM10, ADAM15, ADAM22, ADAM32, ADAMTS12, ADAMTS13, ADAMTS20, ADAMTS6, ADAMTS9, ADAR, ADCY3, ADCY10, ADCY8, ADNP, ADRBK2, AFP, AGL, AGT, AHCTF1, AHR, AKAP10, AKAP3, AKNA, ALAS1, ALS2CL, ALB, ALDH3A2, ALG6, AMBRA1, ANK3, ANTXR2, ANXA10, ANXA11, ANGPTL3, AP2A2, AP4E1, APC, APOA1, APOB, APOC3, APOH, AR, ARID2, ARID3A, ARID3B, ARFGEF1, ARFGEF2, ARHGAP1, ARHGAP8, ARHGAP18, ARHGAP26, ARHGEF18, ARHGEF2, ARPC3, ARS2, ASH1L, ASH1L-IT1, ASNSD1, ASPM, ATADS, ATF1, ATG4A, ATG16L2, ATM, ATN1, ATP11C, ATP6V1 G3, ATP13A5, ATP7A, ATP7B, ATR, ATXN2, ATXN3, ATXN7, ATXN10, AXIN1, B2M, B4GALNT3, BBS4, BCL2, BCL2L1, BCL2-like 11 (BIM), BCL11B, BBOXI, BCSIL, BEAN1, BHLHE40, BMPR2, BMP2K, BPTF, BRAF, BRCA1, BRCA2, BRCC3, BRSK1, BRSK2, BTAF1, BTK, C2orf55, C4orf29, C6orf118, C9orf43, C9orf72, C10orf137, C11orf30, C11orf65, C11orf70, C11orf87, C12orf51, C13orf1, C13orf15, C14orf101, C14orf118, C15orf29, C15orf42, C15orf60, C16orf33, C16orf38, C16orf48, C18orf8, C19orf42, C1orf107, C1orf114, C1orf130, C1orf149, C1orf27, C1orf71, C1orf94, C1R, C20orf74, C21 orf70, C3orf23, C4orf18, C5orf34, C8B, C8orf33, C9orf114, C9orf86, C9orf98, C3, CA11, CAB39, CACHD1, CACNA1A, CACNA1B, CACNA1C CACNA2D1, CACNA1G, CACNA1H, CALCA, CALCOCO2, CAMK1D, CAMKK1, CAPN3, CAPN9, CAPSL, CARD11, CARKD, CASZ1, CAT, CBLB, CBX1, CBX3, CCDC102B, CCDC11, CCDC15, CCDC18, CCDC5, CCDC81, CCDC131, CCDC146, CD4, CD274, CD1B, CDC14A, CDC16, CDC2L5, CDC42BPB, CDCA8, CDH10, CDH11, CDH24, CDH8, CDH9, CDK5RAP2, CDK6, CDK8, CDK11B, CD33, CD46, CDH1, CDH23, CDK6, CDK11B, CDK13, CEBPZ, CEL, CELSR3, CENPA, CENPI, CENPT, CENTB2, CENTG2, CEP110, CEP170, CEP192, CETP, CFB, CFTR, CFH, CGN, CGNL1, CHAF1A, CHD9, CHIC2, CHL1, CHN1, CHM, CLEC16A, CL1C2, CLCN1, CLINT1, CLK1, CLPB, CLPTM1, CMIP, CMYA5, CNGA3, CNOT1, CNOT7, CNTN6, COG3, COL11A1, COL11A2, COL12A1, COL14A1, COL15A1, COL17A1, COL19A1, COLIA1, COLIA2, COL2A1, COL3A1, COL4A1, COL4A2, COL4A5, COL4A6, COL5A2, COL6A1, COL7A1, COL9A1, COL9A2, COL22A1, COL24A1, COL25A1, COL29A1, COLQ, COMTD1, COPA, COPB2, COPS7B, COPZ2, CPSF2, CPXM2, CR1, CRBN, CRYZ, CREBBP, CRKRS, CSE1L, CSTB, CSTF3, CT45-6, CTNNB1, CUBN, CUL4B, CUL5, CXorf41, CXXC1, CYBB, CYFIP2, CYP3A4, CYP3A43, CYP3A5, CYP4F2, CYP4F3, CYP17, CYP19, CYP24A1, CYP27A1, DAB1, DAZ2, DCBLD1, DCC, DCTN3, DCUN1D4, DDA1, DDEF1, DDX1, DDX24, DDX4, DENND2D, DEPDC2, DES, DGAT2, DHFR, DHRS7, DHRS9, DHX8, DIP2A, DMD, DMTF1, DNAH3, DNAH8, DNAI1, DNAJA4, DNAJC13, DNAJC7, DNMT1, DNTTIP2, DOCK4, DOCK5, DOCK10, DOCK11, DOT1L, DPP3, DPP4, DPY19L2P2, DR1, DSCC1, DVL3, DUX4, DYNCIH1, DYSF, E2F1, E2F3, E2F8, E4F1, EBF1, EBF3, ECM2, EDEM3, EFCAB3, EFCAB4B, EFNA4, EFTUD2, EGFR, EIF3A, ELA1, ELA2A, ELF2, ELF3, ELF4, EMCN, EMD, EMLS, ENO3, ENPP3, EP300, EPAS1, EPB41L5, EPHA3, EPHA4, EPHB1, EPHB2, EPHB3, EPS15, ERBB4, ERCC1, ERCC8, ERGIC3, ERMN, ERMP1, ERN1, ERN2, ESR1, ESRRG, ETS2, ETV3, ETV4, ETV5, ETV6, EVC2, EWSR1, EXO1, EXOC4, F3, F11, F13A1, F5, F7, F8, FAH, FAM13A1, FAM13B1, FAM13C1, FAM134A, FAM161A, FAM176B, FAM184A, FAM19A1, FAM20A, FAM23B, FAM65C, FANCA, FANCC, FANCG, FANCM, FANK1, FAR2, FBN1, FBXO15, FBXO18, FBXO38, FCGBP, FECH, FEZ2, FGA, FGD6, FGFR2, FGFR1OP, FGFR1OP2, FGFR2, FGG, FGR, FIX, FKBP3, FLI1, FLJ35848, FLJ36070, FLNA, FN1, FNBP1L, FOLH1, FOSL1, FOSL2, FOXK1, FOXM1, FOXO1, FOXP4, FRAS1, FUT9, FXN, FZD3, FZD6, GAB1, GABPA, GALC, GALNT3, GAPDH, GART, GAS2L3, GATA3, GATAD2A, GBA, GBGT1, GCG, GCGR, GCK, GFI1, GFM1, GH1, GHR, GHV, GJA1, GLA, GLT8D1, GNA11, GNAQ, GNAS, GNB5, GOLGB1, GOLT1A, GOLT1B, GPATCH1, GPR158, GPR160, GPX4, GRAMD3, GRHL1, GRHL2, GRHPR, GRIA1, GRIA3, GRIA4, GRIN2B, GRM3, GRM4, GRN, GSDMB, GSTCD, GSTO2, GTF2I, GTPBP4, HADHA, HAND2, HBA2, HBB, HCK, HDAC3, HDAC5, HDX, HEPACAM2, HERC1, HES7, HEXA, HEXB, HHEX, HIPK3, HLA-DPB1, HLA-G, HLCS, HLTF, HMBS, HMGA1, HMGCL, HNF1A, HNF1B, HNF4A, HNF4G, HNRNPH1, HOXC10, HP1BP3, HPGD, HPRT1, HPRT2, HSF1, HSF4, HSF2BP, HSPA9, HSPG2, HTT, HXA, ICA1, IDH1, IDS, IFI44L, IKBKAP, IKZF1, IKZF3, ILIR2, IL5RA, IL7RA, IMMT, INPP5D, INSR, INTS3, INTU, IP04, IP08, IQGAP2, IRF2, IRF4, IRF8, IRX3, ISL1, ISL2, ITFG1, ITGA6, ITGAL, ITGB1, ITGB2, 1TGB3, ITGB4, ITIH1, ITPR2, IWS1, JAK1, JAK2, JAG1, JMJD1C, JPH3, KALRN, KAT6A, KATNAL2, KCNN2, KCNT2, KDM2A, KIAA0256, KIAA0528, KIAA0564, KIAA0586, KIAA1033, KIAA1166, KIAA1219, KIAA1409, KIAA1622, KIAA1787, KIF3B, KIF15, KIF16B, KJF5A, KJF5B, KJF9, KIN, KIR2DL5B, KIR3DL2, KJR3DL3, KIT, KLF3, KLF5, KLF7, KLF10, KLF12, KLF16, KLHL20, KLK12, KLKBJ, KMT2A, KMT2B, KPNA5, KRAS, KREMEN1, KRIT1, KRT5, KRTCAP2, KYNU, LICAM, L3MBTL, L3MBTL2, LACE1, LAMA1, LAMA2, LAMA3, LAMB1, LARP7, LDLR, LEF1, LENG1, LGALS3, LGMN, LHCGR, LHX3, LHX6, LIMCH1, LIMK2, LIN28B, LIN54, LMBRD1, LMBRD2, LMLN, LMNA, LMO2, LMO7, LOC389634, LOC390110, LPA, LPCAT2, LPL, LRP4, LRPPRC, LRRK2, LRRC19, LRRC42, LRWD1, LUM, LVRN, LYN, LYST, MADD, MAG11, MAGT1, MALT1, MAP2K1, MAP4K4, MAPK8IP3, MAPK9, MAPT, MARC1, MARCH5, MATN2, MBD3, MCF2L2, MCM6, MDGA2, MDM4, ASXL1, FUS, SPR54, MECOM, MEF2C, MEF2D, MEGF10, MEGF11, MEMO1, MET, MGA, MGAM, MGAT4A, MGAT5, MGC16169, MGC34774, MKKS, MIB1, MIER2, MITF, MKL2, MLANA, MLH1, MLL5, MLX, MME, MPDZ, MPI, MRAP2, MRPL11, MRPL39, MRPS28, MRPS35, MS4A13, MSH2, MSH3, MSMB, MST1R, MTDH, MTERF3, MTF1, MTF2, MTIF2, MTHFR, MUC2, MUT, MVK, MYB, MYBL2, MYC, MYCBP2, MYH2, MYRF, MYT1, MY019, MY03A, MY09B, MYOM2, MYOM3, NAG, NARG1, NARG2, NCOA1, NDC80, NDFIP2, NEB, NEDD4, NEK1, NEK5, NEK11, NF1, NF2, NFATC2, NFE2L2, NFIA, NFIB, NFIX, NFKB1, NFKB2, NFKBIL2, NFRKB, NFYA, NFYB, NIPA2, NKAIN2, NKAP, NLRC3, NLRC5, NLRP3, NLRP7, NLRP8, NLRP13, NME1, NME1-NME2, NME2, NME7, NOL10, NOP561, NOS1, NOS2A, NOTCH1, NPAS4, NPM1, NRID1, NRIH3, NRIH4, NR4A3, NRSA1, NRXN1, NSMAF, NSMCE2, NT5C, NT5C2, NT5C3, NUBP1, NUBPL, NUDT5, NUMA1, NUP88, NUP98, NUP160, NUPL1, OAT, OAZ1, OBFC2A, OBFC2B, OLIG2, OMA1, OPA1, OPN4, OPTN, OSBPL11, OSBPL8, OSGEPL1, OTC, OTX2, OVOL2, OXT, PA2G4, PADI4, PAH, PAN2, PAOX, PAPOLG, PARD3, PARP1, PARVB, PAWR, PAX3, PAX8, PBGD, PBRM1, PBX2, PCBP4, PCCA, PCGF2, PCNX, PCOTH, PDCD4, PDE4D, PDE8B, PDE10A, PD1A3, PDH1, PDLIM5, PDXK, PDZRN3, PEL12, PDK4, PDS5A, PDS5B, PGK1, PGM2, PHACTR4, PHEX, PHKB, PHLDB2, PHOX2B, PHTF1, PIAS1, PIEZO1, PIGF, PIGN, PIGT, PIK3C2G, PIK3CA, PIK3CD, PIK3CG, PIK3RI, PIPSK1A, PITRM1, PIWIL3, PKD1, PKHDIL1, PKD2, PKJB, PKLR, PKM1, PKM2, PLAGL2, PLCB1, PLCB4, PLCG1, PLD1, PLEKHA5, PLEKHA7, PLEKHM1, PLKR, PLXNC1, PMFBP1, POLN, POLR3D, POMT2, POSTN, POU2AF1, POU2F2, POU2F3, PPARA, PPFIA2, PPPIR12A, PPP3CB, PPP4C, PPP4RIL, PPP4R2, PRAME, PRC1, PRDM1, PREX1, PREX2, PRIM1, PRIM2, PRKAR1A, PRKCA, PRKG1, PRMT7, PROC, PROCR, PROSC, PRODH, PROX1, PRPF40B, PRPF4B, PRRG2, PRUNE2, PSD3, PSEN1, PSMAL, PTCH1, PTEN, PTK2, PTK2B, PTPN2, PTPN3, PTPN4, PTPN11, PTPN22, PTPRD, PTPRK, PTPRM, PTPRN2, PTPRT, PUS10, PVRL2, PYGM, QRSL1, RAB11FIP2, RAB23, RAF1, RALBP1, RALGDS, RBICC1, RBL2, RBM39, RBM45, RBPJ, RBSN, REC8, RELB, RFC4, RFT1, RFTN1, RHOA, RHPN2, RIF1, PIT1, RLN3, RMND5B, RNF11, RNF32, RNFT1, RNGTT, ROCK1, ROCK2, RORA, RP1, RP6KA3, RP11-265F1, RP13-36C9, RPAP3, RPN1, RPGR, RPL22, RPL22L1, RPS6KA6, RREB1, RRM1, RRP1B, RSK2, RTEL1, RTF1, RUFYM1, RUNX1, RUNX2, RXRA, RYR3, SAAL1, SAE1, SALL4, SAT1, SATB2, SBCAD, SCN1A, SCN2A, SCN3A, SCN4A, SCN5A, SCN8A, SCNA, SCN11A, SCO1, SCYL3, SDC1, SDK1, SDK2, SEC24A, SEC24D, SEC31A, SEL1L, SENP3, SENP6, SENP7, SERPINA1, SETD3, SETD4, SETDB1, SEZ6, SFRS12, SGCE, SGOL2, SGPL1, SH2D1A, SH3BGRL2, SH3PXD2A, SH3PXD2B, SH3RF2, SH3TC2, SHOC2, SIPAIL2, SIPAIL3, SIVA1, SKAP1, SKIV2L2, SLC6A11, SLC6A13, SLC6A6, SLC7A2, SLC12A3, SLC13A1, SLC22A17, SLC25A14, SLC28A3, SLC33A1, SLC35F6, SLC38A1, SLC38A4, SLC39A10, SLC4A2, SLC6A8, SMARCA1, SMARCA2, SMARCA5, SMARCC2, SMC5, SMN2, SMOX, SMS, SMTN, SNCAIP, SNORD86, SNRK, SNRP70, SNX5, SNX6, SOD1, SOD10, SOS, SOS2, SOX5, SOX6, SOX8, SP1, SP2, SP3, SP110, SPAG9, SPATA13, SPATA4, SPATS1, SPECC1L, SPDEF, SP11, SPINK5, SPP2, SPTA3, SRF, SRM, SRP72, SSX3, SSX5, SSX9, STAG1, STAG2, STAMBPL1, STARD6, STAT1, STAT3, STAT5A, STAT5B, STAT6, STK17B, STX3, STXBP1, SUCLG2, SULF2, SUPT6H, SUPT16H, SV2C, SYCP2, SYT6, SYCPI, SYTL3, SYTL5, TAF2, TARDBP, TBC1D3G, TBC1D8B, TBC1D26, TBC1D29, TBCEL, TBK1, TBP, TBPL1, TBR1, TBX, TCEB3, TCF3, TCF4, TCF7L2, TCFL5, TCF12, TCP11L2, TDRD3, TEAD1, TEAD3, TEAD4, TECTB, TEK, TERF1, TERF2, TET2, TFAP2A, TFAP2B, TFAP2C, TFAP4, TFDP1, TFRC, TG, TGM7, TGS1, THAP7, THAP12, THOC2, TIAL1, TIAM2, TIMM50, TLK2, TM4SF20, TM6SF1, TMEM27, TMEM77, TMEM156, TMEM194A, TMF1, TMPRSS6, TNFRSF10A, TNFRSF10B, TNFRSF8, TNK2, TNKS, TNKS2, TOMIL1, TOMIL2, TOP2B, TP53, TP53INP1, TP53BP2, TP53I3, TP63, TRAF3IP3, TRAPPC2, TRIM44, TRIM65, TRIML1, TRIML2, TRPM3, TRPM5, TRPM7, TRPS1, TSC2, TSC2, TSHB, TSPAN7, TTC17, TTF1, TTLL5, TTLL9, TTN, TTPAL, TTR, TUSC3, TXNDC10, UBE3A, UCK1, UGTIA1, UHRFIBP1, UNC45B, UNC5C, USH2A, USF2, USP1, USP6, USP18, USP38, USP39, UTP20, UTP15, UTP18, UTRN, UTX, UTY, UVRAG, UXT, VAPA, VEGFA, VPS29, VPS35, VPS39, VT11A, VT11B, VWA3B, WDFY2, WDR16, WDR17, WDR26, WDR44, WDR67, WDTC1, WRN, WRNIP1, WT1, WWC3, XBP1, XRN1, XRN2, XX-FW88277, YAP1, YAR5, YBX1, YGM, YY1, ZBTB18, ZBTB20, ZC3HAV1, ZC3HC1, ZC3H7A, ZDHHC9, ZEB, ZEB2, ZFPM1, ZFYVE1, ZFX, ZIC2, ZNF37A, ZNF91, ZNF114, ZNF155, ZNF169, ZNF205, ZNF236, ZNF317, ZNF320, ZNF326, ZNF335, ZNF365, ZNF367, ZNF407, ZNF468, ZNF506, ZNF511, ZNF511-PRAP1, ZNF519, ZNF521, ZNF592, ZNF618, ZNF763, and ZWINT.
Additional exemplary genes encoding a target sequence (e.g., a target sequence comprising DNA or RNA, e.g., pre-mRNA) include genes include A1CF, A4GALT, AAR2, ABAT, ABCA11P, ZNF721, ABCA5, ABHD10, ABHD13, ABHD2, ABHD6, AC000120.3, KRIT1, AC004076.1, ZNF772, AC004076.9, ZNF772, AC004223.3, RAD51D, AC004381.6, AC006486.1, ERF, AC007390.5, AC007780.1, PRKAR1A, AC007998.2, INO80C, AC009070.1, CMC2, AC009879.2, AC009879.3, ADHFE1, AC010487.3, ZNF816-ZNF321P, ZNF816, AC010328.3, AC010522.1, ZNF587B, AC010547.4, ZNF19, AC012313.3, ZNF497, AC012651.1, CAPN3, AC013489.1, DET1, AC016747.4, C2orJ74, ACO20907.6, FXYD3, ACO21087.5, PDCD6, AHRR, ACO22137.3, ZNF761, ACO25283.3, NAA60, ACO27644.4, RABGEF1, AC055811.2, FLCN, AC069368.3, ANKDD1A, AC073610.3, ARF3, AC074091.1, GPN1, AC079447.1, LIPT1, AC092587.1, AC079594.2, TRIM59, AC091060.1, C18orf21, AC092143.3, MC1R, AC093227.2, ZNF607, AC093512.2, ALDOA, AC098588.1, ANAPC10, AC107871.1, CALML4, AC114490.2, ZMYM6, AC138649.1, NIPA1, AC138894.1, CLN3, AC139768.1, AC242426.2, CHD1L, ACADM, ACAP3, ACKR2,RP11-141M3.5, KRBOX1, ACMSD, ACOT9, ACP5, ACPL2, ACSBG1, ACSF2, ACSF3, ACSL1, ACSL3, ACVR1, ADAL, ADAM29, ADAMTS10, ADAMTSL5, ADARB1, ADAT2, ADCK3, ADD3, ADGRG1, ADGRG2, ADH1B, ADIPOR1, ADNP, ADPRH, AGBL5, AGPAT1, AGPAT3, AGR2, AGTR1, AHDC1, AHI1, AHNAK, AIFM1, AIFM3, AIMP2, AK4, AKAP1, AKNAD1, CLCC1, AKRIA1, AKT1, AKTIS1, AKT2, AL139011.2, PEX19, AL157935.2, ST6GALNAC6, AL358113.1, TJP2, AL441992.2, KYAT1, AL449266.1, CLCC1, AL590556.3, LINC00339, CDC42, ALAS1, ALB, ALDH16A1, ALDH1B1, ALDH3A2, ALDH3B2, ALDOA, ALKBH2, ALPL, AMD1, AMICA1, AMN1, AMOTL2, AMY1B, AMY2B, ANAPC10, ANAPC11, ANAPC15, ANG, RNASE4, AL163636.2, ANGEL2, ANGPTL1, ANKMY1, ANKRD11, ANKRD28, ANKRD46, ANKRD9, ANKS3, ANKS3, RP11-127120.7, ANKS6, ANKZF1, ANPEP, ANXA11, ANXA2, ANXA8L2, AL603965.1, AOC3, AP000304.12, CRYZL1, AP000311.1, CRYZL1, AP000893.2, RAB30, AP001267. S, ATPSMG, AP002495.2, AP003175.1, OR2AT4, AP003419.1, CLCF1, AP005263.1, ANKRD12, AP006621. S, AP006621.1, AP1G1, AP3M1, AP3M2, APBA2, APBB1, APLP2, APOA2, APOL1, APOL3, APTX, ARAP1, STARD10, ARF4, ARFIP1, ARFIP2, ARFRP1, ARHGAP11A, ARHGAP33, ARHGAP4, ARHGEF10, ARHGEF3, ARHGEF35, OR2AJ-AS1, ARHGEF35, OR2A1-AS1, ARHGEF34P, ARID1B, ARHGEF35, OR2A20P, OR2A1 AS1, ARHGEF9, ARL1, ARL13B, ARL16, ARL6, ARMC6, ARMC8, ARMCX2, ARMCX5, RP4-769N13.6, ARMCX5-GPRASP2, BHLHB9, ARMCX5-GPRASP2, GPRASP1, ARMCX5-GPRASP2, GPRASP2, ARMCX6, ARNT2, ARPP19, ARRB2, ARSA, ART3, ASB3, GPR75-A 5B3, ASCC2, ASNS, ASNS, AC079781.5, ASPSCR1, ASS1, ASUN, ATE1, ATF1, ATF7IP2, ATG13, ATG4D, ATG7, ATG9A, ATM, ATOX1, ATP1B3, ATP2C1, ATPSF1A, ATPSG2, ATP5J, ATP5MD, ATP5PF, ATP6AP2, ATP6V0B, ATP6V1C1, ATP6VID, ATP7B, ATXN1, ATXNIL,IST1, ATXN3, ATXN7L1, AURKA, AURKB, AXDND1, B3GALNT1, B3GALT5, AF064860.1, B3GALT5,AF064860.5, B3GNT5, B4GALT3, B4GALT4, B9D1, BACH1, BAIAP2, BANF1, BANF2, BAX, BAZ2A, BBIP1, BCHE, BCL2L14, BCL6, BCL9L, BCS1L, BDH1, BDKRB2, AL355102.2, BEST1, BEST3, BEX4, BHLHB9, BID, BIN3, BIRC2, BIVM, BIVM-ERCC5, BIVM, BLCAP, BLK, BLOC1S1, RP11-644F5.10, BLOCIS6, AC090527.2, BLOCIS6, RP11-96020.4, BLVRA, BMF, BOLA1, BORCS8-MEF2B, BORCS8, BRCA1, BRD1, BRDT, BRINP3, BROX, BTBD10, BTBD3, BTBD9, BTD, BTF3L4, BTNL9, BUB1B-PAK6, PAK6, BUB3, C10orf68, C11orf1, C11orf48, C11orf54, C11orf54, AP001273.2, C11orf57, C11orf63, C11orf82, C12orf23, C12orf4, C12orf65, C12orf79, C14orf159, C14orf93, C17orf62, C18orf21, C19orf12, C19orf40, C19orf47, C19orf48, C19orf54, CID, C1GALT1, C1QB, C1QTNF1, C1S, C1orf101, C1orf112, C1orf116, C1orf159, C1orf63, C2, C2, CFB, C20orf27, C21orf58, C2CD4D, C2orf15, LIPT1, MRPL30, C2orf80, C2orf81, C3orf14, C3orf17, C3orf18, C3orf22, C3orf33,AC104472.3, C4orf33, C5orf28, C5orf34, C6orf118, C6orf203, C6orf211, C6orf48, C7orf50, C7orf55, C7orf55-LUC7L2, LUC7L2, C8orf44-SGK3, C8orf44, C8orf59, C9,DAB2, C9orf153, C9orf9, CASBP1, CA5B, CABYR, CALCA, CALCOCO1, CALCOCO2, CALM1, CALM3, CALML4, RP11-315D16.2, CALN1, CALU, CANT1, CANX, CAP1, CAPN12, CAPS2, CARD8, CARHSP1, CARNS1, CASC1, CASP3, CASP7, CBFA2T2, CBS, CBY1, CCBL1, CCBL2, RBMXL1, CCDC12, CCDC126, CCDC14, CCDC149, CCDC150, CCDC169-SOHLH2, CCDC169, CCDC171, CCDC37, CCDC41, CCDC57, CCDC63, CCDC7, CCDC74B, CCDC77, CCDC82, CCDC90B, CCDC91, CCDC92, CCNE1, CCHCR1, CCL28, CCNB1IP1, CCNC, CCND3, CCNG1, CCP110, CCR9, CCT7, CCT8, CD151, CD1D, CD200, CD22, CD226, CD276, CD36, CD59, CDC26, CDC42, CDC42SE1, CDC42SE2, CDHR3, CDK10, CDK16, CDK4, CDKAL1, CDKL3, CTD-2410N18.4, CDKN1A, CDKN2A, CDNF, CEBPZOS, CELF1, CFMIP, CENPK, CEP170B, CEP250, CEP57, CEP57L1, CEP63, CERS4, CFL1, CFL2, CFLAR, CGNL1, CHCHD7, CHD1L, CHD8, CHFR,ZNF605, CHIA, CHID1, CHL1, CHM, CHMP1A, CHMP3, RNF103-CHMP3, CHRNA2, CIDEC, CIRBP, CITED1, CKLF-CMTM1, CMTM1, CKMT1B, CLDN12,CTB-13L3.1, CLDND1,AC021660.3, CLDND1,CPDX, CLHC1, CLIP1, CLUL1, CMC4, MTCP1, CNDP2, CNFN, CNOT1, CNOT6, CNOT7, CNOT8, CNR1, CNR2, CNTFR, CNTRL, COA1, COASY, COCH, COL8A1, COLCA1, COLEC11, COMMD3-BMI1, BMI1, COPS5, COPS7B, COQ8A, CORO6, COTL1, COX14,RP4-60503.4, COX7A2, COX7A2L, COX7B2, CPA4, CPA5, CPEB1, CPNE1, AL109827.1, RBM12, CPNE1, RP1-309K20.6, RBM12, CPNE3, CPSF3L, CPTIC, CREB3L2, CREM, CRP, CRYZ, CS,AC073896.1, CS, RP11-977G19.10, CSAD, CSDE1, CSF2RA, CSGALNACT1, CSK, CSNK2A1, CSRNP2, CT45A4, CT45A4,CT45A5, CT45A6, CTBP2, CTCFL, CTD-2116N17.1, KIAA0101, CTD-2349B8.1, SYT17, CTD-2528L19.4, ZNF607, CTD-2619J13.8, ZNF497, CTNNA1, CTNNBIP1, CTNND1, CTPS2, CTSB, CTSL, CTTN, CUL2, CUL9, CWC15, CXorf4OB, CYB561A3, CYBC1, CYLD, CYP11A1, CYP2R1, CYP4B1, CYP4F22, DAG1, DAGLB,KDELR2, DARS, DBNL, DCAF11, DCAF8,PEX19, DCLRE1C, DCTD, DCTN1, DCTN4, DCUNID2, DDR1, DDX11, DDX19B, AC012184.2, DDX19B, RP11-529K1.3, DDX25, DDX39B, ATP6VIG2-DDX39B, SNORD84, DDX42, DDX60L, DEDD, DEDD2, DEFA1, DEFA1B, DEFA1B, DEFA3, DENND1C DENND2A, DENND4B, DET1, DGKA, DGKZ, DGLUCY, DHRS4L2, DHRS9, DHX40, DIABLO, AC048338.1, DIAPH1, DICER1, DKKL1, DLG1, DLG3, DLST, DMC1, DMKN, DMTF1, DMTN, DNAJC14, DNAJC19, DNAL1, DNASEIL1, DNMT3A, DOC2A, DOCK8, DOK1, DOPEY1, DPAGT1, DPP8, DRAM2, DRD2, DROSHA, DSN1, DTNA, DTX2, DTX3, DUOX1, DUOXA1, DUS2, DUSP10, DUSP13, DUSP18, DUSP22, DYDC1, DYDC2, DYNLL1, DYNLT1, DYRK1A, DYRK2, DYRK4, RP11-500M8.7, DZIP1L, E2F6, ECHDC1, ECSIT, ECT2, EDC3, EDEM1, EDEM2, MMP24-AS1, RP4-61404.11, EEFIAKNMT, EEF1D, EFEMP1, EFHC1, EGFL7, EHF, E124, EIF1AD, EIF2B5, EIF4G1, EIF2B5, POLR2H, EIF3E, EIF3K, EIF4E3, EIF4G1, ELF1, ELMO2, ELMOD1, AP000889.3, ELMOD3, ELOC, ELOF1, ELOVL1, ELOVL7, ELP1, ELP6, EML3, EMP3, ENC1, ENDOV, ENO1, ENPP5, ENTHD2, ENTPD6, EP400NL, EPB41L1, EPDR1,NME8, EPHX1, EPM2A, EPN1, EPN2, EPN3, EPS8L2, ERBB3, ERC1, ERCC1, ERG, ERI2, ERI2, DCUNID3, ERLIN2, ERMARD, ERRF11, ESR2,RP11-544120.2, ESRRA, ESRRB, ESRRG, ETFA, ETFRF1, ETV1, ETV4, ETV7, EVA1A, EVC2, EVX1, EXD2, EXO5, EXOC1, EXOC2, FAAP24, FABP6, FADS1, FADS2, FAHD2B, FAM107B, FAM111A, FAM111B, FAM114A1, FAM114A2, FAM115C, FAM115C, FAM115D, FAM120B, FAM133B, FAM135A, FAM153A, FAM153B, FAM154B, FAM156A, FAM156B, FAM168B, FAM172A, FAM182B, FAM192A, FAM19A2, FAM200B, FAM220A, FAM220A, AC009412.1, FAM222B, FAM227B, FAM234A, AC004754.1, FAM3C, FAM45A, FAM49B, FAM60A, FAM63A, FAM81A, FAM86B1, FAM86B2, FANCI, FANK1, FAR2, FAXC, FAXDC2, FBF1, FBH1, FBXL4, FBXO18, FBXO22, FBXO31, FBXO41, FBXO44, FBXO45, FBXW9, FCHO1, FCHSD2, FDFT1, FDPS, FER, FETUB, FGD4, FGF1, FGFR1, FGFRL1, FGL1, FHL2, FIBCD1, FIGNL1, FIGNL1,DDC, FKBP5, FKRP, FLRT2, FLRT3, FMC1, LUC7L2, FMC1-LUC7L2, FNDC3B, FOLH1, FOLR1, FOXP1, FOXK1, FOXM1, FOXO1, FOXP4, AC097634.4, FOXRED1, FPR1, FPR2, FRG1B, FRS2, FTO, FTSJ1, FUK, FUT10, FUT3, FUT6, FXYD3, FZD3, G2E3, GAA, GABARAPL1, GABPB1, GABRA5, GAL3ST1, GALE, GALNT11, GALNT14, GALNT6, GAPVD1, GARNL3, GAS2L3, GAS8, GA TA1, GATA2, GATA4, GBA, GCNT1, GDPD2, GDPD5, GEMIN7,MARK4, GEMIN8, GGA3, GGACT, AL356966.1, GGPS1, GHRL, GID8, GIGYF2, GIMAP8, GIPC1, GJB1, GJB6, GLB1L, GLI1, GLT8D1, GMFG, GMPR2, GNAI2, GNAQ, GNB1, GNB2, GNE, GNG2, GNGT2, GNPDA1, GNPDA2, GOLGA3,CHFR, GOLGA4, GOLPH3L, GOLT1B, GPBP1L1, GPER1, GPR116, GPR141,EPDR1, GPR155, GPR161, GPR56, GPR63, GPR75-ASB3,ASB3, GPR85, GPSM2, GRAMD1B, GRB10, GRB7, GREM2, GRIA2, GSDMB, GSE1, GSN, GSTA4, GSTZ1, GTDC1, GTF2H1, GTF2H4, VARS2, GTF3C2, GUCY1A3, GUCY1B3, GUK1, GULP1, GYPC, GYS1, GZF1, HAGH, HAO2, HAPLN3, HAVCR1, HAX1, HBG2, AC104389.4, HBG2, AC104389.4, HBE1, HBG2, AC104389.4, HBE1,OR51B5, HBG2,HBE1, AC104389.28, HBS1L, HCFC1R1, HCK, HDAC2, HDAC6, HDAC7, HDLBP, HEATR4, HECTD4, HEXIM2, HHAT, HHATL, CCDC13, HINFP, HIRA, C22orf39, HIVEP3, HJV, HKR1, HLF, HMBOX1, HMGA1, HMGB3, HMGCR, HMGN4, HMOX2, HNRNPC, HNRNPD, HNRNPH1, HNRNPH3, HNRNPR, HOMER3, HOPX, HOXA3, HOXB3, HOXB3,HOXB4, HOXC4, HOXD3, HOXD3,HOXD4, HPCAL1, HPS4, HPS5, HRH1, HS3ST3A1, HSH2D, HSP90AA1, HSPD1, HTT, HUWE1, HYOU1, IAH1, ICA1L, ICAM2, ICE2, ICK, IDH2, IDH3G, IDS, IFI27, IFI44, IFT20, IFT22, IFT88, IGF2, INS-IGF2, IGF2BP3, IGFBP6, IKBKAP, IKBKB, IL11, IL18BP, IL18RAP, IL1RAP, ILIRL1, IL18R1, IL1RN, IL32, IL4I1,NUP62,AC011452.1, IL4I1,NUP62,CTC-326K19.6, IL6ST, ILVBL, IMMP1L, IMPDH1, INCA1, ING1, INIP, INPP1, INPP5J, INPP5K, INSIG2, INTS11, INTS12, INTS14, IP6K2, IP6K3, IPOH, LRRC70, IQCE, IQGAP3, IRAK4, IRF3, IRF5, IRF6, ISG20, IST1, ISYNA1, ITFG2, ITGB1BP1, ITGB7, ITIH4, RP5-966M1.6, ITPRIPL1, JADE1, JAK2, JARID2, JDP2, KANK1, KANK1,RP11-31F19.1, KANK2, KANSL1L, KAT6A, KBTBD2, KBTBD3, KCNAB2, KCNE3, KCNG1, KCNJ16, KCNJ9, KCNMB2,AC117457.1,LINC01014, KCTD20, KCTD7,RABGEF1, KDM1B, KDM4A,AL451062.3, KHNYN, KIAA0040, KIAA0125, KIAA0196, KIAA0226L, PPPIR2P4, KIAA0391, KIAA0391, AL121594.1, KIAA0391, PSMA6, KIAA0753, KIAA0895, KIAA0895L, KIAA1191, KIAA1407, KIAA1841, C2orJ74, KIF12, KIF14, KIF27, KIF9, KIFC3, KIN, KIRREL1, KITLG, KLC1, APOPT1, AL139300.1, KLC4, KLHDC4, KLHDC8A, KLHL13, KLHL18, KLHL2, KLHL24, KLHL7, KLK11, KLK2, KLK5, KLK6, KLK7, KNOP1, KRBA2, AC135178.2, KRBA2, RP11-849F2.7, KRIT1, KRT15, KRT8, KTN1, KXD1, KYAT3, RBMXL1, KYNU, L3MBTL1, LACC1, LARGE, LARP4, LARP7, LAT2, LBHD1, LCA5, LCA5L, LCTL, LEPROTL1, LGALS8, LGALS9C, LGMN, LHFPL2, LIG4, LIMCH1, LIMK2, LIMS2, LINC00921, ZNF263, LIPF, LLGL2, LMAN2L, LMCD1, IMF1, RP11-161M6.2, LMO1, LMO3, LOXHD1, LPAR1, LPAR2, LPAR4, LPAR5, LPAR6, LPHN1, LPIN2, LPIN3, LPP, LRFN5, LRIF1, LRMP, LRRC14, LRRC20, LRRC24, C8orJ82, LRRC39, LRRC42, LRRC48, LRRC4C, LRRC8A, LRRC8B, LRRD1, LRTOMT, LRTOMT, AP000812.5, LSM7, LTB4R, LTBP3, LUC7L2, FMC1-LUC7L2, LUC7L3, LUZP1, LYG1, LYL1, LYPD4, LYPD6B, LYRM1, LYRM5, LYSMD4, MACC1, MADIL1, MADIL1, AC069288.1, MAEA, MAFF, MAFG, MAFK, MAGEA12, CSAG4, MAGEA2, MAGEA2B, MAGEA4, MAGEB1, MAGOHB, MAN2A2, MANBAL, MAOB, MAP2K3, MAP3K7CL, MAP3K8, MAP7, MAP9, MAPK6, MAPK7, MAPK8, MAPKAP1, 10-Mar, 7-Mar, 8-Mar, MARK2, MASP1, MATK, MATR3, MATR3,SNHG4, MB, MBD5, MBNL1, MBOAT7, MCC, MCFD2, MCM9, MCOLN3, MCRS1, MDC1, MDGA2, MDH2, MDM2, ME1, MEAK7, MECR, MED4, MEF2A, MEF2B,BORCS8 MEF2B, MEF2BNB-MEF2B, MEF2B, MEF2BNB, MEF2C MEF2D, MEGF10, MEI1, MEIS2, MELK, MET, METTL13, METTL23, MFF, MFN2, MFSD2A, MGST3, MIB2, MICAL1, MICAL3, MICOS10, NBL1,MICOS10-NBL1, MID1, MINA, MINOS1-NBL1,MINOS1, MIOS, MIPOL1, MIS12, MKLN1, MKNK1, MKNK1,MOB3C, MLF2, MLH1, MMP17, MOBP, MOCS1, MOGS, MOK, MORF4L1, MPC1, MPC2, MPG, MPI, MPP1, MPP2, MPPE1, MPST, MRAS, MRO, MROH1, MROH7-TTC4, MROH7, MRPL14, MRPL24, MRPL33,BABAM2, MRPL33, BRE, MRPL47, MRPL48, MRPL55, MRRF, MRTFA, MRTFB, MRVI1, MS4A1, MS4A15, MS4A3, MS4A6E,MS4A7,MS4A14, MSANTD3, MSANTD4, MSH5,MSH5-SAPCD1, MSL2, MSRB3, MSS51, MTCP1, CMC4, MTERF, MTERF1, MTERF3, MTERFD2, MTERFD3, MTF2, MTG2, MTHFD2, MTHFD2L, MTIF2, MTIF3, MTMR10, MTRF1, MTRR, MTUS2, MUTYH, MVK, MX1, MX2, MYH10, MYL12A, MYB, MYD88, MYL5, MYLIP, MYNN, MYO15A, MYO1B, MYOM2, MZF1, N4BP2L2, NAA60, NAB1, NAE1, NAGK, NAPIL1, NAPIL4, NARFL, NARG2, NAT1, NAT10, NBPF11, W12-3658N16.1, NBPF12, NBPF15, NBPF24, NBPF6, NBPF9, NBR1, NCAPG2, NCBP2, NCEH1, NCOA1, NCOA4, NDC1, NDRG1, NDRG2, NDRG4, NDST1, NDUFAF6, NDUFB2, NDUFC1, NDUFS1, NDUFS8, NDUFV1, NEDD1, NEIL1, NEIL2, NEK10, NEK11, NEK6, NEK9, NELFA, NEU4, NFAT5, NFE2, NFE2L2, ACO19080.1, NFRKB, NFYA, NFYC, NIF3L1, NIPA2, NKIRAS1, NKX2-1, NLRC3, NME1,NME1-NME2,NME2, NME1-NME2, NME2, NME4, NME6, NME9, NOD1, NOL10, NOL8, NONO, NPAS1, NPIPA8, RP11-1212A22.1, NPIPB3, NPIPB4, NPIPB9, NPL, NPM1, NPPA, NQO2, NRIH3, NR2C2, NR2F2, NR4A1, NRDC, NREP, NRF1, NRG4, NRIP1, NSD2, NSDHL, NSG1, NSMCE2, NSRP1, NT5C2, NTF4, NTMT1, NTNG2, NUBP2, NUCB2, NUDT1, NUDT2, NUDT4, NUF2, NUMBL, NUP50, NUP54, NUP85, NVL, NXF1, NXPE1, NXPE3, OARD1, OAT, OAZ2, OCIAD1, OCLN, ODF2, OGDHL, OGFOD2, ACO26362.1, OGFOD2, RP11-197N18.2, OLA1, OPRL1, OPTN, OR2H1, ORAI2, ORMDL1, ORMDL2, ORMDL3, OSBPL2, OSBPL3, OSBPL5, OSBPL9, OSER1, OSGIN1, OSR2, P2RX4, P2RY2, P2RY6, P4HA2, PABPC1, PACRGL, PACSIN3, PADI1, PAIP2, PAK1, PAK3, PAK4, PAK7, PALB2, PANK2, PAQR6, PARP11, PARVG, PASK, PAX6, PBRM1, PBXIP1, PCBP3, PCBP4,AC115284.1, PCBP4, RP11-155D18.14, RP11-155D18.12, PCGF3, PCGF5, PCNP, PCSK9, PDCD10, PDCD6, AHRR, PDDC1, PDGFRB, PDIA6, PDIKIL, PDLIM7, PDP1, PDPK1, PDPN, PDZD11, PEA15, PEX2, PEX5, PEX5L, PFKM, PFN4, PGAP2, PGAP2, AC090587.2, PGAP3, PGM3, PGPEP1, PHB, PHC2, PHF20, PHF21A, PHF23, PHKB, PHLDB1, PHOSPHO1, PHOSPHO2, KLHL23, PI4 KB, PIAS2, PICALM, PIF1, PIGN, PIGO, PIGT, PIK3CD, PILRB, STAG3L5P-PVRIG2P-PILRB, PIPSKIB, PIR, PISD, PIWIL4,FUT4, PKD2, PKIA, PKIG, PKM, PKN2, PLA1A, PLA2G2A, PLA2G5, PLA2G7, PLAC8, PLAGL1, PLD1, PLD3, PLEKHA1, PLEKHA2, PLEKHA6, PLEKHG5, PLIN1, PLS1, PLS3, PLSCR1, PLSCR2, PLSCR4, PLXNB1, PLXNB2, PMP22, PMS1, PNISR, PNKP,AKTIS1, PNMT, PNPLA4, PNPLA8, PNPO, PNRC1, POC1B, POFUT1, POLB, POLD1, POLH, POLI, POLL, POLR1B, POM121, POM121C,AC006014.7, POM121C, AC211429.1, POMC, POMT1, POP1, PORCH, POU5F1, PSORS1C3, PPARD, PPARG, PPHLN1, PPIL3, PPIL4, PPM1A, PPM1B,AC013717.1, PPP1CB, PPP1R11, PPP1R13L, PPPIR26, PPPIR9A, PPP2R2B, PPP3CA, PPP6R1, PPP6R3, PPT2,PPT2-EGFL8, EGFL8, PPWD1, PRDM2, PRDM8, PRELID3A, PREPL, PRICKLE1, PRKAG1, PRMT2, PRMT5, PRMT7, PROM1, PRPS1, PRPSAP2, PRR14L, PRR15L, PRR5,PRR5-ARHGAP8, PRR5L, PRR7, PRRC2B, PRRT4, PRSS50, PRSS45, PRSS44, PRUNE, PRUNE1, PSEN1, PSMA2, PSMF1, PSORS1C1, PSPH, PSRC1, PTBP3, PTHLH, PTK2, PTPDC1, PTPRM, PUF60, PUM2, PUS1, PUS10, PXN, PXYLP1, PYCR1, QRICH1, R3HCC1L, R3HDM2, RAB17, RAB23, RAB3A, RAB3D,TMEM205, RAB4B-EGLN2, EGLN2, AC008537.1, RAB5B, RAB7L1, RABL2A, RABL2B, RABL5, RACGAP1, RAD17, RAD51L3-RFFL, RAD5 D, RAD52, RAE1, RAI14, RAI2, RALBP1, RAN, RANGAP1, RAP1A, RAP1B, RAP1GAP, RAPGEF4, RAPGEFL1, RASGRP2, RASSF1, RBCK1, RBM12B, RBM14, RBM4, RBM14 RBM4, RBM23, RBM4, RBM14-RBM4, RBM47, RBM7,AP002373.1, RBM7, RP11-212D19.4, RBMS2, RBMY1E, RBPJ, RBPMS, RBSN, RCBTB2, RCC1, RCC1, SNHG3, RCCD1, RECQL, RELL2, REPIN1, AC073111.3, REPIN1, ZNF775, RER1, RERE, RFWD3, RFX3, RGL2, RGMB, RGS11, RGS3, RGS5, AL592435.1, RHBDD1, RHNO1, TULP3, RHOC, AL603832.3, RHOC,RP11-426L16.10, RHOH, RIC8B, RIMKLB, RIN1, RIPK2, PIT1, RLIM, RNASE4,ANG,AL163636.6, RNASEK, RNASEK-C17orf49, RNF111, RNF123, RNF13, RNF14, RNF185, RNF216, RNF24, RNF32, RNF34, RNF38, RNF4, RNF44, RNH1, RNMT, RNPS1, R060, ROPN1, ROPN1B, ROR2, RP1-102H19.8, C6orf163, RP1-283E3.8,CDK11A, RP11-120M18.2,PRKAR1A, RP11-133K1.2, PAK6, RP11-164J13.1,CAPN3, RP11-21JJ8.1, ANKRD12, RP11-322E11.6,INO80C, RP11-337C18.10,CHD1L, RP11-432B6.3, TRIM59, RP11-468E2.4,IRF9, RP11-484M3.5, UPK1B, RP11-517H2.6, CCR6, RP11-613M10.9, SLC25A51, RP11-659G9.3, RAB30, RP11-691N7.6,CTNND1, RP11-849H4.2, RP11-896J10.3, NKX2-1, RP11-96020.4, SQRDL, RP11-986E7.7, SERPINA3, RP4-769N13.6, GPRASP1, RP4-769N13.6, GPRASP2, RP4-798P15.3, SEC16B, RP5-1021I20.4, ZNF410, RP6-109B7.3, FLJ27365, RPE, RPH3AL, RPL15, RPL17, RPL17-C18orf32,RPL17, RPL23A, RPL36,HSD11B1L, RPP38, RPS20, RPS27A, RPS3A, RPS6KA3, RPS6KC1, RPS6KL1, RPUSD1, RRAGD, RRAS2, RRBP1, RSLID1, RSRC2, RSRP1, RUBCNL, RUNX1T1, RUVBL2, RWDD1, RWDD4, S100A3,AL162258.1, S100A13,RP1-178F15.5, S100A16, S100A4, S100A3, S100A6, S100PBP, SAA1, SACM1L, SAMD4B, SAR1A, SARAF, SARNP, RP11-76217.5, SCAMP5, SCAP, SCAPER, SCFD1, SCGB3A2, SCIN, SCML1, SCNN1D, SCO2, SCOC, SCRN1, SDC2, SDC4, SEC13, SEC14L1, SEC14L2, SEC22C, SEC23B, SEC24C, SEC61 G, SEMA4A, SEMA4C, SEMA4D, SEMA6C, SENP7, SEPP1, 11-Sep, 2-Sep, SERGEF, AC055860.1, SERP1, SERPINA1, SERPINA5, SERPINB6, SERPING1, SERPINH1, SERTAD3, SETD5, SFMBT1, AC096887.1, SFTPA1, SFTPA2, SFXN2, SGCD, SGCE, SGK3, SGK3,C8orf44, SH2B1, SH2D6, SH3BP1,Z83844.3, SH3BP2, SH3BP5, SH3D19, SH3YL1, SHC1, SHISA5, SHMT1, SHMT2, SHOC2, SHROOM1, SIGLEC5,SIGLEC14, SIL1, SIN3A, SIRT2, SIRT6, SKP1, STAT4, AC104109.3, SLAIN1, SLC10A3, SLC12A9, SLC14A1, SLC16A6, SLC1A2, SLC1A6, SLC2OA2, SLC25A18, SLC25A19, SLC25A22, SLC25A25, SLC25A29, SLC25A30, SLC25A32, SLC25A39, SLC25A44, SLC25A45, SLC25A53, SLC26A11, SLC26A4, SLC28A1, SLC29A1, SLC2A14, SLC2A5, SLC2A8, SLC35B2, SLC35B3, SLC35C2, SLC37A1, SLC38A1, SLC38A11, SLC39A13, SLC39A14, SLC41A3, SLC44A3, SLC4A7, SLC4A8, SLC5A10, SLC5A11, SLC6A1, SLC6A12, SLC6A9, SLC7A2, SLC7A6, SLC7A7, SLCO1A2, SLCO1C1, SLCO2B1, SLFN11, SLFN12, SLFNL1, SLMO1, SLTM, SLU7, SMAD2, SMAP2, SMARCA2, SMARCE1, AC073508.2, SMARCE1, KRT222, SMC6, SMG7, SMIM22, SMOX, SMPDL3A, SMTN, SMU1, SMUG1, SNAP25, SNCA, SNRK, SNRPC, SNRPD1, SNRPD2, SNRPN, SNRPN, SNURF, SNUPN, SNX11, SNX16, SNX17, SOAT1, SOHLH2, CCDC169-SOHLH2, CCDC169, SORBS1, SORBS2, SOX5, SP2, SPART, SPATA20, SPATA21, SPA T52, SPATS2L, SPDYE2, SPECC1, SPECC1L,SPECC1L ADORA2A, SPECC1L ADORA2A, ADORA2A, SPED, SPG20, SPG21, SPIDR, SPIN1, SPOCD1, SPOP, SPRR2A, SPRR2B, SPRR2E, SPRR2B, SPRR2F, SPRR2D, SPRR3, SPRY1, SPRY4, SPTBN2, SRC, SRGAP1, SRP68, SRSF11, SSX1, SSX2IP, ST3GAL4, ST3GAL6, ST5, ST6GALNAC6, ST7L, STAC3, STAG1, STAG2, STAMBP, STAMBPL1, STARD3NL, STATE, STAU1, STAU2, ACO22826.2, STAU2, RP11-463D19.2, STEAP2, STEAP3, STIL, STK25, STK33, STK38L, STK40, STMN1, STON1, STON1-GTF2A1L, STRAP, STRBP, STRC, AC011330.5, STRC, CATSPER2, STRC, CATSPER2, AC011330.5, STRCSTRCP1, STT3A, STX16-NPEPL1, NPEPL1, STX5, STX6, STX8, STXBP6, STYK1, SULT1A1, SULT1A2, SUMF2, SUN1, SUN2, SUN2, DNAL4, SUOX, SUPT6H, SUV39H2, SV2B, SYBU, SYNCRIP, SYNJ2, SYT1, SYTL4, TAB2, TACC1, TADA2B, TAF1C, TAF6,AC073842.2, TAF6, RP11-506M12.1, TAF9, TAGLN, TANK, TAPSAR1,PSMB9, TAPT1, TATDN1, TAZ, TBC1D1, TBC1D12, HELLS, TBC1D15, TBC1D3H, TBC1D3G, TBC1D5, TBC1D5,SATB1, TBCA, TBCEL, TBCEL, AP000646.1, TBL1XR1, TBP, TBX5, TBXAS1, TCAF1, TCEA2, TCEAL4, TCEAL8, TCEAL9, TCEANC, TCEB1, TCF19, TCF25, TCF4, TCP1, TCP10L, AP000275.65, TCP11, TCP11L2, TCTN1, TDG, TDP1, TDRD7, TEAD2, TECR, TENC1, TENT4A, TEX264, TEX30, TEX37, TFDP1, TFDP2, TFEB, TFG, TFP1, TF, TFPI, TGIF1, THAP6, THBS3, THOC5, THRAP3, THUMPD3, TIAL1, TIMM9, TIMP1, TIRAP, TJAP1, TJP2, TK2, TLDC1, TLE3, TLE6, TLN1, TLR10, TM9SF1, TMBIM1, TMBIM4, TMBIM6, TMC6, TMCC1, TMCO4, TMEM126A, TMEM139, TMEM150B, TMEM155, TMEM161B, TMEM164, TMEM168, TMEM169, TMEM175, TMEM176B, TMEM182, TMEM199, CTB-96E2.3, TMEM216, TMEM218, TMEM230, TMEM263, TMEM45A, TMEM45B, TMEM62, TMEM63B, TMEM66, TMEM68, TMEM98, TMEM9B, TMPRSS11D, TMPRSS5, TMSB15B, TMTC4, TMUB2, TMX2-CTNND1, RP11-691N7.6,CTNND1, TNFAIP2, TNFAIP8L2, SCNM1, TNFRSF10C, TNFRSF19, TNFRSF8, TNFSF12-TNFSF13, TNFSF12, TNFSF13, TNFSF12-TNFSF13, TNFSF13, TNIP1, TNK2, TNNT1, TNRC18, TNS3, TOB2, TOM1L1, TOP1MT, TOP3B, TOX2, TP53,RP11-199F11.2, TP53I11, TP53INP2, TPCN1, TPM3P9,ACO22137.3, TPT1, TRA2B, TRAF2, TRAF3, TRAPPC12, TRAPPC3, TREH, TREX1, TREX2, TRIB2, TRIM3, TRIM36, TRIM39, TRIM46, TRIM6, TRIM6-TRIM34, TRIM6-TRIM34, TRIM34, TRIM66, TRIM73, TRIT1, TRMT10B, TRMT2B, TRMT2B AS1, TRNT1, TRO, TROVE2, TRPS1, TRPT1, TSC2, TSGA10, TSPANJ4, TSPAN3, TSPAN4, TSPAN5, TSPAN6, TSPAN9, TSPO, TTC12, TTC23, TTC3, TTC39A, TTC39C, TTLL1, TTLL7, TTPAL, TUBD1, TWNK, TXNL4A, TXNL4B, TXNRD1, TYK2, U2AF1, UBA2, UBA52, UBAP2, UBE2D2, UBE2D3, UBE2E3, UBE2I, UBE2J2, UBE3A, UBL7, UBXN11, UBXN7, UGDH, UGGT1, UGP2, UMAD1,AC007161.3, UNC45A, UQCC1, URGCP-MRPS24, URGCP, USMG5, USP16, USP21, USP28, USP3, USP33, USP35, USP54, USP9Y, USPL1, UTP15, VARS2, VASH2, VAV3, VDAC1, VDAC2, VDR, VEZT, VGF, VIL1, VILL, VIPR1, VPS29, VPS37C, VPS8, VPS9D1, VRK2, VWA1, VWA5A, WARS, WASF1, WASHC5, WBP5, WDHD1, WDPCP, WDR37, WDR53, WDR6, WDR72, WDR74, WDR81, WDR86, WDYHV1, WFDC3, WHSC1, WIPF1, WSCD2, WWP2, XAGE1A, XAGE1B, XKR9, XPNPEP1, XRCC3, XRN2, XXYLT1, YIF1A, YIF1B, YIPF1, YIPF5, YPEL5, YWHAB, YWHAZ, YY1AP1, ZBTB1, ZBTB14, ZBTB18, ZBTB20, ZBTB21, ZBTB25, ZBTB33, ZBTB34, ZBTB38, ZBTB43, ZBTB49, ZBTB7B, ZBTB7C, ZBTB8OS, ZC3H11A, ZBED6, ZC3H13, ZCCHC17, ZCCHC7, ZDHHC11, ZDHHC13, ZEB2, ZFAND5, ZFAND6, ZFP1, ZFP62, ZFX, ZFYVE16, ZFYVE19, ZFYVE20, ZFYVE27, ZHX2, AC016405.1, ZHX3, ZIK1, ZIM2,PEG3, ZKSCAN1, ZKSCAN3, ZKSCAN8, ZMAT3, ZMAT5, ZMIZ2, ZMYM6, ZMYMS11, ZNF10,AC026786.1, ZNF133, ZNF146, ZNF16, ZNF177, ZNF18, ZNF200, ZNF202, ZNF211, ZNF219, ZNF226, ZNF227, ZNF23, AC010547.4, ZNF23, AC010547.9, ZNF239, ZNF248, ZNF25, ZNF253, ZNF254, ZNF254, AC092279.1, ZNF263, ZNF274, ZNF275, ZNF28,ZNF468, ZNF283, ZNF287, ZNF3, ZNF320, ZNF322, ZNF324B, ZNF331, ZNF334, ZNF34, ZNF350, ZNF385A, ZNF395, FBXO16, ZNF415, ZNF418, ZNF43, ZNF433 AS1, AC008770.4, ZNF438, ZNF444, ZNF445, ZNF467, ZNF480, ZNF493, ZNF493, CTD-2561J22.3, ZNF502, ZNF507, ZNF512, AC074091.1, ZNF512,RP11-158113.2, ZNF512B, ZNF512B, SAMD10, ZNF521, ZNF532, ZNF544, ACO20915.5, ZNF544, CTD-3138B18.4, ZNF559,ZNF177, ZNF562, ZNF567, ZNF569, ZNF570, ZNF571 AS1,ZNF540, ZNF577, ZNF580,ZNF581, ZNF580, ZNF581,CCDC106, ZNF600, ZNF611, ZNF613, ZNF615, ZNF619,ZNF620, ZNF639, ZNF652, ZNF665, ZNF667, ZNF668, ZNF671, ZNF682, ZNF687, ZNF691, ZNF696, ZNF701, ZNF706, ZNF707, ZNF714, ZNF717, ZNF718, ZNF720, ZNF721, ZNF730, ZNF763, ZNF780B,AC005614.5, ZNF782, ZNF786, ZNF79, ZNF791, ZNF81, ZNF83, ZNF837, ZNF839, ZNF84, ZNF845, ZNF846, ZNF865, ZNF91, ZNF92, ZNHIT3, ZSCAN21, ZSCAN25, ZSCAN30, and ZSCAN32.
In some embodiments, the gene encoding a target sequence comprises the HTT gene. In some embodiments, the gene encoding a target sequence comprises the SMN2 gene.
Exemplary genes that may be modulated by the compounds of Formula (I) or (II) described herein may also include, inter al/a, AC005258.1, AC005943.1, AC007849.1, AC008770.2, AC010487.3, AC011477.4, AC012651.1, AC012531.3, AC034102.2, AC073896.4, AC104472.3, AL109811.3, AL133342.1, AL137782.1, AL157871.5, AF241726.2, AL355336.1, AL358113.1, AL360181.3, AL445423.2, AL691482.3, AP001267.5, RF01169, and RF02271.
The compounds described herein may further be used to modulate a sequence comprising a particular splice site sequence, e.g., an RNA sequence (e.g., a pre-mRNA sequence). In some embodiments, the splice site sequence comprises a 5′ splice site sequence. In some embodiments, the splice site sequence comprises a 3′ splice site sequence. Exemplary gene sequences and splice site sequences (e.g., 5′ splice site sequences) include AAAgcaaguu, AAAguaaaaa, AAAguaaaau, AAAguaaagu, AAAguaaaua, AAAguaaaug, AAAguaaauu, AAAguaacac, AAAguaacca, AAAguaacuu, AAAguaagaa, AAAguaagac, AAAguaagag, AAAguaagau, AAAguaagca, AAAguaagcc, AAAguaagcu, AAAguaagga, AAAguaaggg, AAAguaaggu, AAAguaagua, AAAguaaguc, AAAguaagug, AAAguaaguu, AAAguaaucu, AAAguaauua, AAAguacaaa, AAAguaccgg, AAAguacuag, AAAguacugg, AAAguacuuc, AAAguacuug, AAAguagcuu, AAAguaggag, AAAguaggau, AAAguagggg, AAAguaggua, AAAguaguaa, AAAguauauu, AAAguauccu, AAAguaucuc, AAAguaugga, AAAguaugua, AAAguaugug, AAAguauguu, AAAguauugg, AAAguauuuu, AAAgucagau, AAAgucugag, AAAgugaaua, AAAgugagaa, AAAgugagac, AAAgugagag, AAAgugagau, AAAgugagca, AAAgugagcu, AAAgugaggg, AAAgugagua, AAAgugaguc, AAAgugagug, AAAgugaguu, AAAgugcguc, AAAgugcuga, AAAguggguc, AAAguggguu, AAAgugguaa, AAAguguaug, AAAgugugug, AAAguguguu, AAAguuaagu, AAAguuacuu, AAAguuagug, AAAguuaugu, AAAguugagu, AAAguuugua, AACguaaaac, AACguaaagc, AACguaaagg, AACguaagca, AACguaaggg, AACguaaguc, AACguaagug, AACguaaugg, AACguaguga, AACguaugua, AACguauguu, AACgugagca, AACgugagga, AACgugauuu, AACgugggau, AACgugggua, AACguguguu, AACguuggua, AAGgcaaauu, AAGgcaagag, AAGgcaagau, AAGgcaagcc, AAGgcaagga, AAGgcaaggg, AAGgcaagug, AAGgcaaguu, AAGgcacugc, AAGgcagaaa, AAGgcaggau, AAGgcaggca, AAGgcaggga, AAGgcagggg, AAGgcaggua, AAGgcaggug, AAGgcaucuc, AAGgcaugcu, AAGgcaugga, AAGgcauguu, AAGgcauuau, AAGgcgagcu, AAGgcgaguc, AAGgcgaguu, AAGgcuagcc, AAGguaaaaa, AAGguaaaac, AAGguaaaag, AAGguaaaau, AAGguaaaca, AAGguaaacc, AAGguaaacu, AAGguaaaga, AAGguaaagc, AAGguaaagg, AAGguaaagu, AAGguaaaua, AAGguaaauc, AAGguaaaug, AAGguaaauu, AAGguaacaa, AAGguaacau, AAGguaaccc, AAGguaacua, AAGguaacuc, AAGguaacug, AAGguaacuu, AAGguaagaa, AAGguaagac, AAGguaagag, AAGguaagau, AAGguaagca, AAGguaagcc, AAGguaagcg, AAGguaagcu, AAGguaagga, AAGguaaggc, AAGguaaggg, AAGguaaggu, AAGguaagua, AAGguaaguc, AAGguaagug, AAGguaaguu, AAGguaauaa, AAGguaauac, AAGguaauag, AAGguaauau, AAGguaauca, AAGguaaucc, AAGguaaucu, AAGguaauga, AAGguaaugc, AAGguaaugg, AAGguaaugu, AAGguaauua, AAGguaauuc, AAGguaauug, AAGguaauuu, AAGguacaaa, AAGguacaag, AAGguacaau, AAGguacacc, AAGguacacu, AAGguacagg, AAGguacagu, AAGguacaua, AAGguacaug, AAGguacauu, AAGguaccaa, AAGguaccag, AAGguaccca, AAGguacccu, AAGguaccuc, AAGguaccug, AAGguaccuu, AAGguacgaa, AAGguacggg, AAGguacggu, AAGguacguc, AAGguacguu, AAGguacuaa, AAGguacuau, AAGguacucu, AAGguacuga, AAGguacugc, AAGguacugu, AAGguacuuc, AAGguacuug, AAGguacuuu, AAGguagaaa, AAGguagaac, AAGguagaca, AAGguagacc, AAGguagacu, AAGguagagu, AAGguagaua, AAGguagcaa, AAGguagcag, AAGguagcca, AAGguagccu, AAGguagcua, AAGguagcug, AAGguagcuu, AAGguaggaa, AAGguaggag, AAGguaggau, AAGguaggca, AAGguaggcc, AAGguaggcu, AAGguaggga, AAGguagggc, AAGguagggg, AAGguagggu, AAGguaggua, AAGguagguc, AAGguaggug, AAGguagguu, AAGguaguaa, AAGguaguag, AAGguagucu, AAGguagugc, AAGguagugg, AAGguaguuc, AAGguaguuu, AAGguauaaa, AAGguauaau, AAGguauaca, AAGguauacu, AAGguauaua, AAGguauauc, AAGguauaug, AAGguauauu, AAGguaucac, AAGguaucag, AAGguauccc, AAGguauccu, AAGguaucuc, AAGguaucug, AAGguaucuu, AAGguaugaa, AAGguaugac, AAGguaugag, AAGguaugau, AAGguaugca, AAGguaugcc, AAGguaugcu, AAGguaugga, AAGguauggc, AAGguauggg, AAGguaugua, AAGguauguc, AAGguaugug, AAGguauguu, AAGguauuaa, AAGguauuac, AAGguauuag, AAGguauuau, AAGguauucc, AAGguauuga, AAGguauugu, AAGguauuua, AAGguauuuc, AAGguauuug, AAGguauuuu, AAGgucaaau, AAGgucaaga, AAGgucaagu, AAGgucacag, AAGgucagaa, AAGgucagac, AAGgucagag, AAGgucagca, AAGgucagcc, AAGgucagcg, AAGgucagcu, AAGgucagga, AAGgucaggc, AAGgucaggg, AAGgucaggu, AAGgucagua, AAGgucaguc, AAGgucagug, AAGgucaguu, AAGgucauag, AAGgucaucu, AAGguccaca, AAGguccaga, AAGguccaua, AAGgucccag, AAGgucccuc, AAGguccuuc, AAGgucgagg, AAGgucuaau, AAGgucuacc, AAGgucuaua, AAGgucuccu, AAGgucucug, AAGgucucuu, AAGgucugaa, AAGgucugag, AAGgucugga, AAGgucuggg, AAGgucugua, AAGgucuguu, AAGgucuucu, AAGgucuuuu, AAGgugaaac, AAGgugaaag, AAGgugaaau, AAGgugaacu, AAGgugaagc, AAGgugaagg, AAGgugaagu, AAGgugaaua, AAGgugaaug, AAGgugaauu, AAGgugacaa, AAGgugacag, AAGgugacau, AAGgugacug, AAGgugacuu, AAGgugagaa, AAGgugagac, AAGgugagag, AAGgugagau, AAGgugagca, AAGgugagcc, AAGgugagcg, AAGgugagcu, AAGgugagga, AAGgugaggc, AAGgugaggg, AAGgugaggu, AAGgugagua, AAGgugaguc, AAGgugagug, AAGgugaguu, AAGgugauaa, AAGgugauca, AAGgugaucc, AAGgugauga, AAGgugaugc, AAGgugaugu, AAGgugauua, AAGgugauug, AAGgugauuu, AAGgugcaca, AAGgugcauc, AAGgugcccu, AAGgugccug, AAGgugcgug, AAGgugcguu, AAGgugcucc, AAGgugcuga, AAGgugcugc, AAGgugcugg, AAGgugcuua, AAGgugcuuu, AAGguggaua, AAGguggcua, AAGguggcug, AAGguggcuu, AAGgugggaa, AAGgugggag, AAGgugggau, AAGgugggca, AAGgugggcc, AAGgugggcg, AAGgugggga, AAGguggggu, AAGgugggua, AAGgugggug, AAGguggguu, AAGgugguaa, AAGgugguac, AAGgugguau, AAGguggugg, AAGgugguua, AAGgugguuc, AAGgugguuu, AAGguguaag, AAGgugucaa, AAGgugucag, AAGgugucug, AAGgugugaa, AAGgugugag, AAGgugugca, AAGgugugga, AAGguguggu, AAGgugugua, AAGguguguc, AAGgugugug, AAGguguguu, AAGguguucu, AAGguguugc, AAGguguugg, AAGguguuug, AAGguuaaaa, AAGguuaaca, AAGguuaagc, AAGguuaauu, AAGguuacau, AAGguuagaa, AAGguuagau, AAGguuagca, AAGguuagcc, AAGguuagga, AAGguuaggc, AAGguuagua, AAGguuaguc, AAGguuagug, AAGguuaguu, AAGguuauag, AAGguuauga, AAGguucaaa, AAGguucaag, AAGguuccuu, AAGguucggc, AAGguucguu, AAGguucuaa, AAGguucuga, AAGguucuua, AAGguugaau, AAGguugacu, AAGguugagg, AAGguugagu, AAGguugaua, AAGguugcac, AAGguugcug, AAGguuggaa, AAGguuggca, AAGguuggga, AAGguugggg, AAGguuggua, AAGguugguc, AAGguuggug, AAGguugguu, AAGguuguaa, AAGguugucc, AAGguugugc, AAGguuguua, AAGguuuacc, AAGguuuaua, AAGguuuauu, AAGguuuccu, AAGguuucgu, AAGguuugag, AAGguuugca, AAGguuugcc, AAGguuugcu, AAGguuugga, AAGguuuggu, AAGguuugua, AAGguuuguc, AAGguuugug, AAGguuuuaa, AAGguuuuca, AAGguuuucg, AAGguuuugc, AAGguuuugu, AAGguuuuuu, AAUgcaagua, AAUgcaaguc, AAUguaaaca, AAUguaaaua, AAUguaaauc, AAUguaaaug, AAUguaaauu, AAUguaacua, AAUguaagaa, AAUguaagag, AAUguaagau, AAUguaagcc, AAUguaagcu, AAUguaagga, AAUguaagua, AAUguaaguc, AAUguaagug, AAUguaaguu, AAUguaauca, AAUguaauga, AAUguaaugu, AAUguacauc, AAUguacaug, AAUguacgau, AAUguacgua, AAUguacguc, AAUguacgug, AAUguacucu, AAUguaggca, AAUguagguu, AAUguaucua, AAUguaugaa, AAUguaugua, AAUguaugug, AAUguauguu, AAUgucagag, AAUgucagau, AAUgucagcu, AAUgucagua, AAUgucaguc, AAUgucagug, AAUgucaguu, AAUgucggua, AAUgucuguu, AAUgugagaa, AAUgugagca, AAUgugagcc, AAUgugagga, AAUgugagua, AAUgugaguc, AAUgugagug, AAUgugaguu, AAUgugauau, AAUgugcaua, AAUgugcgua, AAUgugcguc, AAUgugggac, AAUguggguc, AAUgugggug, AAUgugguuu, AAUgugugua, AAUguuaagu, AAUguuagaa, AAUguuagau, AAUguuagua, AAUguuggug, ACAgcaagua, ACAguaaaua, ACAguaaaug, ACAguaagaa, ACAguaagca, ACAguaagua, ACAguaaguc, ACAguaagug, ACAguaaguu, ACAguacgua, ACAguaggug, ACAguauaac, ACAguaugua, ACAgucaguu, ACAgugagaa, ACAgugagcc, ACAgugagcu, ACAgugagga, ACAgugaggu, ACAgugagua, ACAgugaguc, ACAgugagug, ACAgugaguu, ACAgugggua, ACAguggguu, ACAguguaaa, ACAguuaagc, ACAguuaagu, ACAguuaugu, ACAguugagu, ACAguuguga, ACCguaagua, ACCgugagaa, ACCgugagca, ACCgugaguu, ACCgugggug, ACGguaaaac, ACGguaacua, ACGguaagua, ACGguaagug, ACGguaaguu, ACGguaauua, ACGguaauuu, ACGguacaau, ACGguacagu, ACGguaccag, ACGguacggu, ACGguacgua, ACGguaggaa, ACGguaggag, ACGguaggug, ACGguaguaa, ACGguauaau, ACGguaugac, ACGguaugcg, ACGguaugua, ACGguauguc, ACGgugaaac, ACGgugaagu, ACGgugaauc, ACGgugacag, ACGgugacca, ACGgugagaa, ACGgugagau, ACGgugagcc, ACGgugagua, ACGgugagug, ACGgugaguu, ACGgugcgug, ACGguggcac, ACGguggggc, ACGgugggug, ACGguguagu, ACGgugucac, ACGgugugua, ACGguguguu, ACGguuagug, ACGguuaguu, ACGguucaau, ACUguaaaua, ACUguaagaa, ACUguaagac, ACUguaagca, ACUguaagcu, ACUguaagua, ACUguaaguc, ACUguaaguu, ACUguacguu, ACUguacugc, ACUguaggcu, ACUguaggua, ACUguauauu, ACUguaugaa, ACUguaugcu, ACUguaugug, ACUguauucc, ACUgucagcu, ACUgucagug, ACUgugaacg, ACUgugagca, ACUgugagcg, ACUgugagcu, ACUgugagua, ACUgugaguc, ACUgugagug, ACUgugaguu, ACUgugggua, ACUgugugug, ACUguuaagu, AGAgcaagua, AGAguaaaac, AGAguaaacg, AGAguaaaga, AGAguaaagu, AGAguaaauc, AGAguaaaug, AGAguaacau, AGAguaacua, AGAguaagaa, AGAguaagac, AGAguaagag, AGAguaagau, AGAguaagca, AGAguaagcu, AGAguaagga, AGAguaaggc, AGAguaaggg, AGAguaaggu, AGAguaaguc, AGAguaagug, AGAguaaguu, AGAguaauaa, AGAguaaugu, AGAguaauuc, AGAguaauuu, AGAguacacc, AGAguaccug, AGAguacgug, AGAguacucu, AGAguacuga, AGAguacuuu, AGAguagcug, AGAguaggaa, AGAguaggga, AGAguagggu, AGAguagguc, AGAguaggug, AGAguagguu, AGAguauaua, AGAguauauu, AGAguaugaa, AGAguaugac, AGAguaugau, AGAguauguc, AGAguaugug, AGAguauguu, AGAguauuaa, AGAguauuau, AGAgucagug, AGAgugagac, AGAgugagag, AGAgugagau, AGAgugagca, AGAgugagua, AGAgugaguc, AGAgugagug, AGAgugaguu, AGAgugcguc, AGAgugggga, AGAgugggug, AGAgugugug, AGAguguuuc, AGAguuagua, AGAguugaga, AGAguugagu, AGAguugguu, AGAguuugau, AGCguaagcu, AGCguaagug, AGCgugagcc, AGCgugagug, AGCguuguuc, AGGgcagagu, AGGgcagccu, AGGgcuagua, AGGguaaaga, AGGguaaaua, AGGguaaauc, AGGguaaauu, AGGguaacca, AGGguaacug, AGGguaacuu, AGGguaagaa, AGGguaagag, AGGguaagau, AGGguaagca, AGGguaagga, AGGguaaggc, AGGguaaggg, AGGguaagua, AGGguaaguc, AGGguaagug, AGGguaaguu, AGGguaauac, AGGguaauga, AGGguaauua, AGGguaauuu, AGGguacacc, AGGguacagu, AGGguacggu, AGGguaggac, AGGguaggag, AGGguaggca, AGGguaggcc, AGGguaggga, AGGguagggu, AGGguagguc, AGGguaggug, AGGguagguu, AGGguauaua, AGGguaugac, AGGguaugag, AGGguaugau, AGGguaugca, AGGguaugcu, AGGguauggg, AGGguauggu, AGGguaugua, AGGguauguc, AGGguaugug, AGGguauuac, AGGguauucu, AGGguauuuc, AGGgucagag, AGGgucagca, AGGgucagga, AGGgucaggg, AGGgucagug, AGGgucaguu, AGGguccccu, AGGgucggga, AGGgucugca, AGGgucuguu, AGGgugaaga, AGGgugacua, AGGgugagaa, AGGgugagac, AGGgugagag, AGGgugagca, AGGgugagcc, AGGgugagcu, AGGgugagga, AGGgugaggg, AGGgugaggu, AGGgugagua, AGGgugaguc, AGGgugagug, AGGgugaguu, AGGgugggga, AGGguggggu, AGGgugggua, AGGgugggug, AGGgugugua, AGGgugugug, AGGguuaaug, AGGguuagaa, AGGguuaguu, AGGguuggug, AGGguuugug, AGGguuuguu, AGUguaaaag, AGUguaaaua, AGUguaaauu, AGUguaagaa, AGUguaagag, AGUguaagau, AGUguaagca, AGUguaagcc, AGUguaagua, AGUguaagug, AGUguaaguu, AGUguaauug, AGUguaggac, AGUguagguc, AGUguaugag, AGUguaugua, AGUguauguu, AGUguauugu, AGUguauuua, AGUgucaguc, AGUgugagag, AGUgugagca, AGUgugagcc, AGUgugagcu, AGUgugagua, AGUgugaguc, AGUgugagug, AGUgugaguu, AGUgugggua, AGUgugggug, AGUgugugua, AGUguuccua, AGUguugggg, AGUguuucag, AUAguaaaua, AUAguaagac, AUAguaagau, AUAguaagca, AUAguaagua, AUAguaagug, AUAguaaguu, AUAguaggua, AUAguauguu, AUAgucucac, AUAgugagac, AUAgugagag, AUAgugagau, AUAgugagcc, AUAgugaggc, AUAgugagua, AUAgugaguc, AUAgugagug, AUAgugcguc, AUAgugugua, AUAguucagu, AUCguaagcc, AUCguaaguu, AUCguauucc, AUCgugagua, AUGgcaagcg, AUGgcaagga, AUGgcaaguu, AUGgcaggua, AUGgcaugug, AUGgcgccau, AUGgcuugug, AUGguaaaac, AUGguaaaau, AUGguaaacc, AUGguaaaga, AUGguaaaua, AUGguaaaug, AUGguaaauu, AUGguaacag, AUGguaacau, AUGguaacua, AUGguaacuc, AUGguaacuu, AUGguaagaa, AUGguaagac, AUGguaagag, AUGguaagau, AUGguaagca, AUGguaagcc, AUGguaagcu, AUGguaagga, AUGguaaggg, AUGguaagua, AUGguaaguc, AUGguaagug, AUGguaaguu, AUGguaauaa, AUGguaauau, AUGguaauga, AUGguaaugg, AUGguaauug, AUGguaauuu, AUGguacagc, AUGguacauc, AUGguaccag, AUGguaccug, AUGguacgag, AUGguacggu, AUGguagauc, AUGguagcag, AUGguagcug, AUGguaggaa, AUGguaggau, AUGguaggca, AUGguaggcu, AUGguagggg, AUGguagggu, AUGguaggua, AUGguaggug, AUGguaguuu, AUGguauagu, AUGguauaua, AUGguaucag, AUGguaucuu, AUGguaugau, AUGguaugca, AUGguaugcc, AUGguaugcg, AUGguaugcu, AUGguaugga, AUGguauggc, AUGguaugug, AUGguauguu, AUGguauuau, AUGguauuga, AUGguauuug, AUGgucaggg, AUGgucaguc, AUGgucagug, AUGgucauuu, AUGgugaaaa, AUGgugaaac, AUGgugaaau, AUGgugaacu, AUGgugaaga, AUGgugacgu, AUGgugagaa, AUGgugagac, AUGgugagag, AUGgugagca, AUGgugagcc, AUGgugagcg, AUGgugagcu, AUGgugaggc, AUGgugaggg, AUGgugagua, AUGgugaguc, AUGgugagug, AUGgugaguu, AUGgugauuu, AUGgugcgau, AUGgugcgug, AUGgugggua, AUGgugggug, AUGguggguu, AUGgugguua, AUGguguaag, AUGgugugaa, AUGgugugua, AUGgugugug, AUGguuacuc, AUGguuagca, AUGguuaguc, AUGguuagug, AUGguuaguu, AUGguucagu, AUGguucguc, AUGguuggua, AUGguugguc, AUGguugguu, AUGguuguuu, AUGguuugca, AUGguuugua, AUUgcaagua, AUUguaaaua, AUUguaagau, AUUguaagca, AUUguaagga, AUUguaaggc, AUUguaagua, AUUguaaguc, AUUguaaguu, AUUguaauua, AUUguaauuu, AUUguacaaa, AUUguaccuc, AUUguacgug, AUUguacuug, AUUguaggua, AUUguaugag, AUUguaugua, AUUgucuguu, AUUgugagcu, AUUgugagua, AUUgugaguc, AUUgugaguu, AUUgugcgug, AUUgugggug, AUUguuagug, CAAguaaaaa, CAAguaaaua, CAAguaaauc, CAAguaaaug, CAAguaaccc, CAAguaacua, CAAguaacug, CAAguaagaa, CAAguaagac, CAAguaagau, CAAguaaggu, CAAguaagua, CAAguaaguc, CAAguaagug, CAAguaaguu, CAAguaaucc, CAAguaaucu, CAAguaauua, CAAguaauuc, CAAguaauug, CAAguaauuu, CAAguacaca, CAAguacguu, CAAguacuuu, CAAguagcug, CAAguaggau, CAAguaggua, CAAguagguc, CAAguaggug, CAAguagguu, CAAguaguuu, CAAguauaac, CAAguauaug, CAAguaucuu, CAAguaugag, CAAguaugua, CAAguauguc, CAAguaugug, CAAguauguu, CAAguauuga, CAAguauuuc, CAAgucagac, CAAgucagua, CAAgucuaua, CAAgucugau, CAAgugacuu, CAAgugagaa, CAAgugagac, CAAgugagca, CAAgugaggc, CAAgugaggg, CAAgugagua, CAAgugaguc, CAAgugagug, CAAgugaucc, CAAgugaucu, CAAgugauuc, CAAgugauug, CAAgugauuu, CAAgugccuu, CAAgugggua, CAAguggguc, CAAgugggug, CAAgugugag, CAAguuaaaa, CAAguuaagu, CAAguuaauc, CAAguuagaa, CAAguuaguu, CAAguucaag, CAAguuccgu, CAAguuggua, CAAguuuagu, CAAguuucca, CAAguuuguu, CACguaagag, CACguaagca, CACguaauug, CACguaggac, CACguaucga, CACgucaguu, CACgugagcu, CACgugaguc, CACgugagug, CAGgcaagaa, CAGgcaagac, CAGgcaagag, CAGgcaagga, CAGgcaagua, CAGgcaagug, CAGgcaaguu, CAGgcacgca, CAGgcagagg, CAGgcaggug, CAGgcaucau, CAGgcaugaa, CAGgcaugag, CAGgcaugca, CAGgcaugcg, CAGgcaugug, CAGgcgagag, CAGgcgccug, CAGgcgugug, CAGguaaaaa, CAGguaaaag, CAGguaaaca, CAGguaaacc, CAGguaaaga, CAGguaaagc, CAGguaaagu, CAGguaaaua, CAGguaaauc, CAGguaaaug, CAGguaaauu, CAGguaacag, CAGguaacau, CAGguaacca, CAGguaaccg, CAGguaacgu, CAGguaacua, CAGguaacuc, CAGguaacug, CAGguaacuu, CAGguaagaa, CAGguaagac, CAGguaagag, CAGguaagau, CAGguaagcc, CAGguaagga, CAGguaaggc, CAGguaaggg, CAGguaaggu, CAGguaagua, CAGguaagug, CAGguaaguu, CAGguaauaa, CAGguaauau, CAGguaaucc, CAGguaaugc, CAGguaaugg, CAGguaaugu, CAGguaauua, CAGguaauuc, CAGguaauug, CAGguaauuu, CAGguacaaa, CAGguacaag, CAGguacaau, CAGguacaca, CAGguacacg, CAGguacaga, CAGguacagg, CAGguacagu, CAGguacaua, CAGguacaug, CAGguacauu, CAGguaccac, CAGguaccca, CAGguacccg, CAGguacccu, CAGguaccgc, CAGguaccgg, CAGguaccuc, CAGguaccug, CAGguaccuu, CAGguacgag, CAGguacgca, CAGguacgcc, CAGguacggu, CAGguacgua, CAGguacgug, CAGguacuaa, CAGguacuag, CAGguacuau, CAGguacucc, CAGguacucu, CAGguacuga, CAGguacugc, CAGguacugu, CAGguacuua, CAGguacuuu, CAGguagaaa, CAGguagaac, CAGguagaag, CAGguagaca, CAGguagacc, CAGguagaga, CAGguagauu, CAGguagcaa, CAGguagcac, CAGguagcag, CAGguagcca, CAGguagcgu, CAGguagcua, CAGguagcuc, CAGguagcug, CAGguagcuu, CAGguaggaa, CAGguaggac, CAGguaggag, CAGguaggca, CAGguaggga, CAGguagggc, CAGguagggg, CAGguagggu, CAGguaggua, CAGguagguc, CAGguaggug, CAGguagguu, CAGguaguaa, CAGguaguau, CAGguaguca, CAGguagucc, CAGguaguga, CAGguagugu, CAGguaguuc, CAGguaguug, CAGguaguuu, CAGguauaag, CAGguauaca, CAGguauaga, CAGguauauc, CAGguauaug, CAGguauauu, CAGguaucag, CAGguaucau, CAGguauccu, CAGguaucga, CAGguaucgc, CAGguaucua, CAGguaucug, CAGguaucuu, CAGguaugaa, CAGguaugac, CAGguaugag, CAGguaugau, CAGguaugca, CAGguaugcc, CAGguaugcg, CAGguaugcu, CAGguaugga, CAGguauggg, CAGguauggu, CAGguaugua, CAGguauguc, CAGguaugug, CAGguauguu, CAGguauuau, CAGguauuca, CAGguauucu, CAGguauuga, CAGguauugg, CAGguauugu, CAGguauuua, CAGguauuuc, CAGguauuug, CAGguauuuu, CAGgucaaca, CAGgucaaug, CAGgucacgu, CAGgucagaa, CAGgucagac, CAGgucagca, CAGgucagcc, CAGgucagcg, CAGgucagga, CAGgucagua, CAGgucaguc, CAGgucagug, CAGgucaguu, CAGgucaucc, CAGgucaugc, CAGgucauua, CAGgucauuu, CAGguccacc, CAGguccacu, CAGguccagu, CAGguccauc, CAGguccauu, CAGgucccag, CAGgucccug, CAGguccuga, CAGguccugc, CAGguccugg, CAGgucggcc, CAGgucggug, CAGgucguug, CAGgucucuc, CAGgucucuu, CAGgucugag, CAGgucugcc, CAGgucugcg, CAGgucugga, CAGgucuggu, CAGgucugua, CAGgucuguc, CAGgucugug, CAGgucuguu, CAGgucuucc, CAGgucuuuc, CAGgugaaag, CAGgugaaau, CAGgugaaca, CAGgugaaga, CAGgugaagg, CAGgugaaua, CAGgugaauc, CAGgugaauu, CAGgugacaa, CAGgugacau, CAGgugacca, CAGgugaccc, CAGgugaccg, CAGgugaccu, CAGgugacgg, CAGgugacua, CAGgugacuc, CAGgugacug, CAGgugagaa, CAGgugagac, CAGgugagag, CAGgugagau, CAGgugagca, CAGgugagcc, CAGgugagcg, CAGgugagcu, CAGgugagga, CAGgugaggc, CAGgugaggg, CAGgugaggu, CAGgugagua, CAGgugaguc, CAGgugagug, CAGgugaguu, CAGgugauaa, CAGgugaucc, CAGgugaucu, CAGgugaugc, CAGgugaugg, CAGgugaugu, CAGgugauua, CAGgugauuc, CAGgugauug, CAGgugauuu, CAGgugcaaa, CAGgugcaag, CAGgugcaca, CAGgugcacg, CAGgugcaga, CAGgugcagg, CAGgugcaua, CAGgugcauc, CAGgugcaug, CAGgugccaa, CAGgugccca, CAGgugcccc, CAGgugcccg, CAGgugccua, CAGgugccug, CAGgugcgaa, CAGgugcgca, CAGgugcgcc, CAGgugcgcg, CAGgugcgga, CAGgugcggu, CAGgugcgua, CAGgugcguc, CAGgugcgug, CAGgugcuag, CAGgugcuau, CAGgugcuca, CAGgugcucc, CAGgugcucg, CAGgugcugc, CAGgugcugg, CAGgugcuua, CAGgugcuuc, CAGgugcuug, CAGguggaac, CAGguggaag, CAGguggaau, CAGguggaga, CAGguggagu, CAGguggauu, CAGguggcca, CAGguggcuc, CAGguggcug, CAGgugggaa, CAGgugggac, CAGgugggag, CAGgugggau, CAGgugggca, CAGgugggcc, CAGgugggcu, CAGgugggga, CAGguggggc, CAGguggggg, CAGguggggu, CAGgugggua, CAGguggguc, CAGgugggug, CAGguggguu, CAGguggucu, CAGguggugg, CAGgugguug, CAGguguaca, CAGguguagg, CAGguguauc, CAGgugucac, CAGgugucag, CAGgugucca, CAGguguccu, CAGgugucua, CAGgugucuc, CAGgugucug, CAGgugugaa, CAGgugugac, CAGgugugag, CAGgugugau, CAGgugugca, CAGgugugcc, CAGgugugcg, CAGgugugcu, CAGgugugga, CAGguguggc, CAGgugugua, CAGguguguc, CAGgugugug, CAGguguguu, CAGguguuua, CAGguuaaaa, CAGguuaaua, CAGguuaauc, CAGguuaccu, CAGguuagaa, CAGguuagag, CAGguuagau, CAGguuagcc, CAGguuaggg, CAGguuaggu, CAGguuagua, CAGguuaguc, CAGguuagug, CAGguuaguu, CAGguuauca, CAGguuaugu, CAGguuauua, CAGguuauug, CAGguucaaa, CAGguucaac, CAGguucaag, CAGguucaca, CAGguucacg, CAGguucagg, CAGguucaug, CAGguuccag, CAGguuccca, CAGguucccg, CAGguucgaa, CAGguucgag, CAGguucuau, CAGguucugc, CAGguucuua, CAGguucuuc, CAGguucuuu, CAGguugaac, CAGguugaag, CAGguugagu, CAGguugaua, CAGguuggag, CAGguuggca, CAGguuggcc, CAGguugguc, CAGguuggug, CAGguugguu, CAGguuguaa, CAGguuguac, CAGguuguau, CAGguuguca, CAGguuguga, CAGguuguug, CAGguuuaag, CAGguuuacc, CAGguuuagc, CAGguuuagu, CAGguuucuu, CAGguuugaa, CAGguuugag, CAGguuugau, CAGguuugcc, CAGguuugcu, CAGguuuggg, CAGguuuggu, CAGguuugua, CAGguuugug, CAGguuuguu, CAGguuuucu, CAGguuuugg, CAGguuuuuc, CAGguuuuuu, CAUgcagguu, CAUguaaaac, CAUguaacua, CAUguaagaa, CAUguaagag, CAUguaagau, CAUguaagcc, CAUguaagua, CAUguaagug, CAUguaaguu, CAUguaauua, CAUguacaua, CAUguaccac, CAUguacguu, CAUguaggua, CAUguaggug, CAUguagguu, CAUguaugaa, CAUguaugua, CAUguaugug, CAUguauguu, CAUgugagaa, CAUgugagca, CAUgugagcu, CAUgugagua, CAUgugaguc, CAUgugagug, CAUgugaguu, CAUgugcgua, CAUgugggaa, CAUguggguu, CAUgugugug, CAUguguguu, CAUguuaaua, CAUguuagcc, CCAguaagau, CCAguaagca, CCAguaagcc, CCAguaagcu, CCAguaagga, CCAguaagua, CCAguaaguc, CCAguaagug, CCAguaaguu, CCAguaauug, CCAguacggg, CCAguagguc, CCAguauugu, CCAgugaggc, CCAgugagua, CCAgugagug, CCAguggguc, CCAguuaguu, CCAguugagu, CCCguaagau, CCCguauguc, CCCguauguu, CCCguccugc, CCCgugagug, CCGguaaaga, CCGguaagau, CCGguaagcc, CCGguaagga, CCGguaaggc, CCGguaaugg, CCGguacagu, CCGguacuga, CCGguauucc, CCGgucagug, CCGgugaaaa, CCGgugagaa, CCGgugaggg, CCGgugagug, CCGgugaguu, CCGgugcgcg, CCGgugggcg, CCGguugguc, CCUguaaaug, CCUguaaauu, CCUguaagaa, CCUguaagac, CCUguaagag, CCUguaagca, CCUguaagcg, CCUguaagga, CCUguaaguu, CCUguaggua, CCUguaggug, CCUguaucuu, CCUguauggu, CCUguaugug, CCUgugagaa, CCUgugagca, CCUgugaggg, CCUgugaguc, CCUgugagug, CCUgugaguu, CCUguggcuc, CCUgugggua, CCUgugugua, CCUguuagaa, CGAguaaggg, CGAguaaggu, CGAguagcug, CGAguaggug, CGAguagguu, CGAgugagca, CGCguaagag, CGGgcaggca, CGGguaagcc, CGGguaagcu, CGGguaaguu, CGGguaauuc, CGGguaauuu, CGGguacagu, CGGguacggg, CGGguaggag, CGGguaggcc, CGGguaggug, CGGguauuua, CGGgucugag, CGGgugaccg, CGGgugacuc, CGGgugagaa, CGGgugaggg, CGGgugaggu, CGGgugagua, CGGgugagug, CGGgugaguu, CGGgugauuu, CGGgugccuu, CGGgugggag, CGGgugggug, CGGguggguu, CGGguguguc, CGGgugugug, CGGguguguu, CGGguucaag, CGGguucaug, CGGguuugcu, CGUguagggu, CGUguaugca, CGUguaugua, CGUgucugua, CGUgugagug, CGUguuuucu, CUAguaaaug, CUAguaagcg, CUAguaagcu, CUAguaagua, CUAguaaguc, CUAguaagug, CUAguaaguu, CUAguaauuu, CUAguaggua, CUAguagguu, CUAguaugua, CUAguauguu, CUAgugagua, CUCguaagca, CUCguaagug, CUCguaaguu, CUCguaucug, CUCgucugug, CUCgugaaua, CUCgugagua, CUCgugauua, CUGguaaaaa, CUGguaaaau, CUGguaaacc, CUGguaaacg, CUGguaaagc, CUGguaaaua, CUGguaaauc, CUGguaaaug, CUGguaaauu, CUGguaacac, CUGguaacag, CUGguaaccc, CUGguaaccg, CUGguaacug, CUGguaacuu, CUGguaagaa, CUGguaagag, CUGguaagau, CUGguaagca, CUGguaagcc, CUGguaagcu, CUGguaagga, CUGguaaggc, CUGguaaggg, CUGguaaggu, CUGguaagua, CUGguaagug, CUGguaaguu, CUGguaauga, CUGguaaugc, CUGguaauuc, CUGguaauuu, CUGguacaac, CUGguacaau, CUGguacaga, CUGguacaua, CUGguacauu, CUGguaccau, CUGguacguu, CUGguacuaa, CUGguacuug, CUGguacuuu, CUGguagaga, CUGguagaua, CUGguagcgu, CUGguaggau, CUGguaggca, CUGguaggua, CUGguagguc, CUGguaggug, CUGguaucaa, CUGguaugau, CUGguauggc, CUGguauggu, CUGguaugua, CUGguaugug, CUGguauguu, CUGguauuga, CUGguauuuc, CUGguauuuu, CUGgucaaca, CUGgucagag, CUGgucccgc, CUGgucggua, CUGgucuggg, CUGgugaagu, CUGgugaaua, CUGgugaauu, CUGgugacua, CUGgugagaa, CUGgugagac, CUGgugagca, CUGgugagcu, CUGgugagga, CUGgugaggc, CUGgugaggg, CUGgugaggu, CUGgugagua, CUGgugaguc, CUGgugagug, CUGgugaguu, CUGgugauua, CUGgugauuu, CUGgugcaga, CUGgugcgcu, CUGgugcgug, CUGgugcuga, CUGgugggag, CUGgugggga, CUGgugggua, CUGguggguc, CUGgugggug, CUGguggguu, CUGgugugaa, CUGgugugca, CUGgugugcu, CUGguguggu, CUGgugugug, CUGguguguu, CUGguuagcu, CUGguuagug, CUGguucgug, CUGguuggcu, CUGguuguuu, CUGguuugua, CUGguuuguc, CUGguuugug, CUUguaaaug, CUUguaagcu, CUUguaagga, CUUguaaggc, CUUguaagua, CUUguaagug, CUUguaaguu, CUUguacguc, CUUguacgug, CUUguaggua, CUUguagugc, CUUguauagg, CUUgucagua, CUUgugagua, CUUgugaguc, CUUgugaguu, CUUguggguu, CUUgugugua, CUUguuagug, CUUguuugag, GAAguaaaac, GAAguaaagc, GAAguaaagu, GAAguaaaua, GAAguaaauu, GAAguaagaa, GAAguaagcc, GAAguaagcu, GAAguaagga, GAAguaagua, GAAguaagug, GAAguaaguu, GAAguaauau, GAAguaaugc, GAAguaauua, GAAguaauuu, GAAguaccau, GAAguacgua, GAAguacguc, GAAguaggca, GAAguagguc, GAAguauaaa, GAAguaugcu, GAAguaugug, GAAguauguu, GAAguauuaa, GAAgucagug, GAAgugagag, GAAgugagcg, GAAgugaggu, GAAgugaguc, GAAgugagug, GAAgugaguu, GAAgugauaa, GAAgugauuc, GAAgugcgug, GAAguguggg, GAAguguguc, GAAguuggug, GACguaaagu, GACguaagcu, GACguaagua, GACguaaugg, GACguaugcc, GACguauguu, GACgugagcc, GACgugagug, GAGgcaaaug, GAGgcaagag, GAGgcaagua, GAGgcaagug, GAGgcaaguu, GAGgcacgag, GAGgcaggga, GAGgcaugug, GAGgcgaagg, GAGguaaaaa, GAGguaaaac, GAGguaaaag, GAGguaaaau, GAGguaaacc, GAGguaaaga, GAGguaaagc, GAGguaaagu, GAGguaaaua, GAGguaaauc, GAGguaaaug, GAGguaaauu, GAGguaacaa, GAGguaacag, GAGguaacca, GAGguaaccu, GAGguaacuu, GAGguaagaa, GAGguaagag, GAGguaagau, GAGguaagca, GAGguaagcc, GAGguaagcg, GAGguaagcu, GAGguaagga, GAGguaaggc, GAGguaaggg, GAGguaaggu, GAGguaagua, GAGguaaguc, GAGguaauaa, GAGguaauac, GAGguaauau, GAGguaauca, GAGguaaucu, GAGguaaugg, GAGguaaugu, GAGguaauug, GAGguaauuu, GAGguacaaa, GAGguacaac, GAGguacaga, GAGguacagc, GAGguacagu, GAGguacaua, GAGguacauu, GAGguaccag, GAGguaccga, GAGguaccug, GAGguaccuu, GAGguacuag, GAGguacuau, GAGguacucc, GAGguacugc, GAGguacugg, GAGguacugu, GAGguacuug, GAGguacuuu, GAGguagaag, GAGguagaga, GAGguagagg, GAGguagagu, GAGguagauc, GAGguagcua, GAGguagcug, GAGguaggaa, GAGguaggag, GAGguaggca, GAGguaggcu, GAGguaggga, GAGguagggc, GAGguagggg, GAGguaggua, GAGguaggug, GAGguagguu, GAGguaguaa, GAGguaguag, GAGguaguau, GAGguagucu, GAGguagugc, GAGguagugg, GAGguaguua, GAGguaguug, GAGguauaag, GAGguauacu, GAGguauagc, GAGguauaug, GAGguauauu, GAGguaucau, GAGguaucug, GAGguaucuu, GAGguaugaa, GAGguaugac, GAGguaugag, GAGguaugcc, GAGguaugcg, GAGguaugcu, GAGguaugga, GAGguauggg, GAGguauggu, GAGguaugua, GAGguauguc, GAGguaugug, GAGguauguu, GAGguauucc, GAGguauuga, GAGguauugu, GAGguauuua, GAGguauuuc, GAGguauuug, GAGguauuuu, GAGgucaaca, GAGgucaagg, GAGgucaaug, GAGgucacug, GAGgucagaa, GAGgucagag, GAGgucagcu, GAGgucagga, GAGgucaggc, GAGgucaggg, GAGgucaggu, GAGgucagua, GAGgucauau, GAGgucaugu, GAGgucauuu, GAGguccaua, GAGguccauc, GAGguccggg, GAGguccggu, GAGguccuug, GAGgucgggg, GAGgucucgu, GAGgucugag, GAGgucuggu, GAGgucuguc, GAGgucuguu, GAGgucuuuu, GAGgugaaaa, GAGgugaaau, GAGgugaaca, GAGgugaagg, GAGgugaaua, GAGgugaauu, GAGgugacau, GAGgugacca, GAGgugaccu, GAGgugacua, GAGgugacuu, GAGgugagaa, GAGgugagac, GAGgugagag, GAGgugagau, GAGgugagca, GAGgugagcc, GAGgugagcg, GAGgugagcu, GAGgugagga, GAGgugaggc, GAGgugaggg, GAGgugagua, GAGgugagug, GAGgugaguu, GAGgugauau, GAGgugaucc, GAGgugaucu, GAGgugauga, GAGgugaugg, GAGgugaugu, GAGgugauuc, GAGgugcaca, GAGgugcaga, GAGgugcagc, GAGgugcagg, GAGgugccag, GAGgugccca, GAGgugccuu, GAGgugcggg, GAGgugcgug, GAGgugcucc, GAGgugcugg, GAGgugcuua, GAGgugcuug, GAGguggaaa, GAGguggaau, GAGguggacc, GAGguggacg, GAGguggagg, GAGguggcug, GAGgugggaa, GAGgugggag, GAGgugggau, GAGgugggca, GAGgugggcg, GAGgugggcu, GAGgugggga, GAGguggggc, GAGguggggg, GAGgugggua, GAGguggguc, GAGgugggug, GAGguggguu, GAGgugguau, GAGgugguuc, GAGgugucau, GAGgugugag, GAGgugugau, GAGgugugca, GAGgugugcu, GAGgugugga, GAGguguggg, GAGguguggu, GAGgugugua, GAGgugugug, GAGguuaaau, GAGguuaaga, GAGguuaaua, GAGguuaccg, GAGguuagaa, GAGguuagac, GAGguuagag, GAGguuaggu, GAGguuagua, GAGguuaguc, GAGguuagug, GAGguuaguu, GAGguuaugu, GAGguuauuc, GAGguucaaa, GAGguucaua, GAGguucuga, GAGguugaag, GAGguugcag, GAGguugcug, GAGguuggaa, GAGguuggag, GAGguuggau, GAGguuggua, GAGguugguc, GAGguugguu, GAGguuguag, GAGguuucug, GAGguuugag, GAGguuugga, GAGguuuggg, GAGguuugua, GAGguuuguu, GAGguuuuca, GAGguuuuga, GAGguuuugg, GAGguuuuua, GAGguuuuuc, GAUguaaaau, GAUguaagca, GAUguaagcc, GAUguaaggu, GAUguaagua, GAUguaagug, GAUguaaguu, GAUguacauc, GAUguaggua, GAUguauggc, GAUguaugua, GAUguauguu, GAUgucagug, GAUgugagag, GAUgugagcc, GAUgugagcu, GAUgugagga, GAUgugaguc, GAUgugagug, GAUgugaguu, GAUgugggua, GAUgugggug, GAUguguguu, GAUguuagcu, GAUguucagu, GAUguucgug, GAUguuuguu, GCAguaaagg, GCAguaagaa, GCAguaagga, GCAguaagua, GCAguaaguc, GCAguaaguu, GCAguagaug, GCAguaggua, GCAguaugug, GCAguauguu, GCAgucagua, GCAgucagug, GCAguccggu, GCAgugacuu, GCAgugagcc, GCAgugagcg, GCAgugagcu, GCAgugagua, GCAgugagug, GCAgugaguu, GCAgugggua, GCAguuaagu, GCAguugagu, GCCguaaguc, GCCgugagua, GCGguaaagc, GCGguaaaua, GCGguaagcu, GCGguaaggg, GCGguaagug, GCGguaauca, GCGguacgua, GCGguacuug, GCGguagggu, GCGguagugu, GCGgugagca, GCGgugagcu, GCGgugaguu, GCGguggcuc, GCGgugugca, GCGguguguu, GCGguuaagu, GCGguuugca, GCUgcuguaa, GCUguaaaua, GCUguaagac, GCUguaagag, GCUguaagca, GCUguaagga, GCUguaagua, GCUguaaguc, GCUguaagug, GCUguaaguu, GCUguaggug, GCUguauggu, GCUgucagug, GCUguccuug, GCUgugagaa, GCUgugagcc, GCUgugagga, GCUgugagua, GCUgugaguc, GCUgugagug, GCUgugaguu, GCUguggguu, GGAguaagag, GGAguaagca, GGAguaagcc, GGAguaagcu, GGAguaagga, GGAguaagug, GGAguaaguu, GGAguaauuu, GGAguacugu, GGAguaggaa, GGAguaggua, GGAguagguu, GGAguaguau, GGAguaugac, GGAguauggu, GGAgucaagu, GGAgugaggg, GGAgugagua, GGAgugaguc, GGAgugagug, GGAgugaguu, GGAgugcuuu, GGAgugggca, GGAgugggug, GGAguuaagg, GGAguugaga, GGCguaagcc, GGCguaggua, GGCguaggug, GGCgugagcc, GGCgugaguc, GGGguaaaca, GGGguaaacc, GGGguaaacu, GGGguaagaa, GGGguaagag, GGGguaagau, GGGguaagca, GGGguaagcc, GGGguaagcu, GGGguaagga, GGGguaaggg, GGGguaagua, GGGguaagug, GGGguaaguu, GGGguagaca, GGGguaggag, GGGguaggcc, GGGguaggga, GGGguaggua, GGGguaggug, GGGguagguu, GGGguagugc, GGGguaucug, GGGguaugac, GGGguaugga, GGGguaugua, GGGguauguc, GGGguaugug, GGGguauguu, GGGgucagua, GGGguccgug, GGGgucggag, GGGgucugug, GGGgugaaca, GGGgugaaga, GGGgugagaa, GGGgugagau, GGGgugagcc, GGGgugagcg, GGGgugagcu, GGGgugagga, GGGgugaggc, GGGgugaggg, GGGgugaguc, GGGgugagug, GGGgugaguu, GGGgugcgua, GGGguggggu, GGGgugggua, GGGgugggug, GGGguggguu, GGGgugugcg, GGGgugugua, GGGguguguc, GGGgugugug, GGGguuacag, GGGguuggac, GGGguuggga, GGGguuugcc, GGGguuugua, GGUguaagaa, GGUguaagau, GGUguaagca, GGUguaagcc, GGUguaagcg, GGUguaaguc, GGUguaagug, GGUguagguc, GGUguaggug, GGUguagguu, GGUguccgua, GGUgugagag, GGUgugagcc, GGUgugagcu, GGUgugagua, GGUgugaguc, GGUgugcuuc, GGUguggcug, GGUgugguga, GGUgugucug, GGUguugaaa, GGUguugcug, GUAguaagau, GUAguaagua, GUAguaagug, GUAguagcuu, GUAguaggua, GUAgucagua, GUAgugagua, GUAguggugg, GUAguuaagu, GUAguuucug, GUCguaagug, GUCgugagug, GUCgugaguu, GUGgcaagua, GUGgcuugua, GUGguaaaau, GUGguaaaga, GUGguaaauu, GUGguaacau, GUGguaacua, GUGguaagaa, GUGguaagac, GUGguaagag, GUGguaagau, GUGguaagca, GUGguaagcg, GUGguaagcu, GUGguaagga, GUGguaaggc, GUGguaagua, GUGguaaguc, GUGguaagug, GUGguaaguu, GUGguaauga, GUGguaauuc, GUGguaauuu, GUGguacaug, GUGguacgau, GUGguacuau, GUGguacuug, GUGguagaua, GUGguagcgc, GUGguaggga, GUGguagguc, GUGguaggug, GUGguagguu, GUGguauaaa, GUGguaucuc, GUGguaugaa, GUGguaugau, GUGguaugca, GUGguaugua, GUGguauguu, GUGguccgug, GUGgucuggc, GUGgugaaac, GUGgugagaa, GUGgugagau, GUGgugagca, GUGgugagcu, GUGgugagga, GUGgugaggc, GUGgugagug, GUGgugaguu, GUGgugauua, GUGgugauuc, GUGgugcgau, GUGgugcuua, GUGgugggaa, GUGgugggua, GUGguggguc, GUGguguccg, GUGguuagca, GUGguuaggu, GUGguuagug, GUGguuugca, GUGguuugua, GUUguaaggu, GUUguaagua, GUUguaaguc, GUUguaaguu, GUUguaccac, GUUguagcgu, GUUguaugug, GUUguauguu, GUUgucugug, GUUgugagcu, GUUgugagug, GUUgugaguu, GUUgugggua, GUUguggguu, UAAguaaaug, UAAguaacua, UAAguaagaa, UAAguaagag, UAAguaagau, UAAguaagca, UAAguaagcu, UAAguaagga, UAAguaaggu, UAAguaagua, UAAguaaguc, UAAguaagug, UAAguaaguu, UAAguaauaa, UAAguacuag, UAAguaguuu, UAAguauaaa, UAAguauaca, UAAguaugua, UAAguauuau, UAAguauuuu, UAAgucuuuu, UAAgugagac, UAAgugagga, UAAgugaggg, UAAgugagua, UAAgugaguc, UAAgugagug, UAAgugaguu, UAAgugaucc, UAAgugauuc, UAAgugcgug, UAAguuaagu, UAAguuccag, UAAguucuuu, UAAguuguaa, UAAguuguau, UAAguuuguu, UACguaacug, UACguaagaa, UACguaagau, UACguaagua, UACguaagug, UACguauccu, UACgucuggc, UACgugacca, UAGgcaagac, UAGgcaaguc, UAGgcagguc, UAGgcgugug, UAGguaaaaa, UAGguaaaac, UAGguaaaag, UAGguaaaau, UAGguaaaca, UAGguaaaga, UAGguaaaua, UAGguaaauc, UAGguaaaug, UAGguaaauu, UAGguaacac, UAGguaacag, UAGguaacau, UAGguaacca, UAGguaacgg, UAGguaacua, UAGguaacuc, UAGguaacug, UAGguaacuu, UAGguaagac, UAGguaagag, UAGguaagau, UAGguaagca, UAGguaagcc, UAGguaagcu, UAGguaagga, UAGguaaggc, UAGguaaggg, UAGguaagua, UAGguaaguc, UAGguaagug, UAGguaaguu, UAGguaauag, UAGguaauau, UAGguaaucu, UAGguaauga, UAGguaaugg, UAGguaaugu, UAGguaauua, UAGguaauuc, UAGguaauuu, UAGguacagc, UAGguacagu, UAGguacauu, UAGguaccag, UAGguaccua, UAGguaccuu, UAGguacgag, UAGguacgua, UAGguacguu, UAGguacuau, UAGguacuga, UAGguacugg, UAGguacuuc, UAGguacuuu, UAGguagcgg, UAGguaggaa, UAGguaggac, UAGguaggau, UAGguaggga, UAGguagggg, UAGguaggua, UAGguagguc, UAGguaggug, UAGguagguu, UAGguaguaa, UAGguagucu, UAGguagugg, UAGguagugu, UAGguaguuu, UAGguauaaa, UAGguauaac, UAGguauaag, UAGguauaau, UAGguauaca, UAGguauacu, UAGguauaua, UAGguauauc, UAGguauauu, UAGguaucag, UAGguaucua, UAGguaucuc, UAGguaugaa, UAGguaugag, UAGguaugca, UAGguaugga, UAGguauggc, UAGguauggu, UAGguaugua, UAGguauguc, UAGguaugug, UAGguauguu, UAGguauuaa, UAGguauuac, UAGguauuau, UAGguauuca, UAGguauucc, UAGguauucu, UAGguauuga, UAGguauuua, UAGguauuuc, UAGguauuuu, UAGgucacuc, UAGgucagcu, UAGgucaggu, UAGgucagua, UAGgucagug, UAGgucaguu, UAGgucaucu, UAGgucauug, UAGguccaau, UAGguccugu, UAGgucucaa, UAGgucucgc, UAGgucuggc, UAGgucuguc, UAGgucugug, UAGgugaagu, UAGgugaaua, UAGgugaaug, UAGgugaauu, UAGgugacau, UAGgugacca, UAGgugacua, UAGgugagaa, UAGgugagac, UAGgugagag, UAGgugagau, UAGgugagcc, UAGgugagcu, UAGgugagga, UAGgugaggc, UAGgugaggu, UAGgugagua, UAGgugaguc, UAGgugagug, UAGgugauca, UAGgugauuc, UAGgugauuu, UAGgugcaua, UAGgugcauc, UAGgugccgu, UAGgugccug, UAGgugcgca, UAGgugcgua, UAGgugcgug, UAGgugcuga, UAGguggaua, UAGgugggaa, UAGgugggac, UAGgugggag, UAGgugggau, UAGgugggcc, UAGgugggcu, UAGguggguu, UAGguggugu, UAGguguaaa, UAGgugugaa, UAGgugugag, UAGgugugca, UAGgugugcc, UAGgugugcg, UAGguguggu, UAGgugugua, UAGgugugug, UAGguguugg, UAGguuaagc, UAGguuagac, UAGguuagcc, UAGguuaggc, UAGguuagua, UAGguuaguc, UAGguuagug, UAGguucccc, UAGguucuac, UAGguuggua, UAGguugguu, UAGguugucc, UAGguuuauu, UAGguuugcc, UAGguuugua, UAGguuuguc, UAGguuugug, UAGguuuguu, UAGguuuuuc, UAGguuuuug, UAUguaagaa, UAUguaagau, UAUguaagca, UAUguaagcc, UAUguaagua, UAUguaaguc, UAUguaagug, UAUguaaguu, UAUguacgug, UAUguacguu, UAUguagguc, UAUguagguu, UAUguauccu, UAUguaucuc, UAUguaugua, UAUguauguc, UAUguaugug, UAUguauuau, UAUgucagaa, UAUgucugua, UAUgugaaua, UAUgugacag, UAUgugagua, UAUgugagug, UAUgugaguu, UAUgugggca, UAUgugugua, UAUguguuua, UAUguuuugu, UCAgcgacau, UCAguaaaau, UCAguaaaua, UCAguaacug, UCAguaagaa, UCAguaagag, UCAguaagau, UCAguaagca, UCAguaagcc, UCAguaagcu, UCAguaaggg, UCAguaagua, UCAguaaguc, UCAguaagug, UCAguaaguu, UCAguaucuu, UCAguaugga, UCAguauggu, UCAgucccca, UCAgugagca, UCAgugagcu, UCAgugagua, UCAgugagug, UCAgugaguu, UCAgugauug, UCAgugggug, UCAguugagc, UCAguugauu, UCAguuuagu, UCCguaagca, UCCguaagcu, UCCguaaguc, UCCguaagug, UCCguaauag, UCCguacuua, UCCguaugua, UCCguauguu, UCCgugagau, UCCgugaguc, UCGguaaauu, UCGguaagag, UCGguaagcu, UCGguacauc, UCGguacucc, UCGguagacc, UCGguagguu, UCGguaguaa, UCGguaugug, UCGguauguu, UCGguauuga, UCGgucagua, UCGgucuuag, UCGgugaagu, UCGgugagaa, UCGgugagca, UCGgugaggc, UCGgugagua, UCGgugcgcu, UCGgugcuuu, UCGgugguuu, UCGguuagcu, UCUguaaaag, UCUguaagaa, UCUguaagau, UCUguaagca, UCUguaagcu, UCUguaagua, UCUguaaguc, UCUguaagug, UCUguaaguu, UCUguaauaa, UCUguaauga, UCUguaaugu, UCUguaggua, UCUguagguu, UCUguauaua, UCUguaugac, UCUguaugua, UCUguccucg, UCUgugagag, UCUgugagcu, UCUgugagga, UCUgugagua, UCUgugaguc, UCUgugagug, UCUgugaguu, UCUgugcgua, UCUgugugag, UGAguaacuu, UGAguaagau, UGAguaagca, UGAguaagcu, UGAguaaggc, UGAguaaggu, UGAguaagua, UGAguaaguc, UGAguaagug, UGAguaaguu, UGAguaaucc, UGAguaauua, UGAguacagu, UGAguacgua, UGAguacguu, UGAguacugu, UGAguagcug, UGAguaggua, UGAguauaaa, UGAguaugcu, UGAguaugga, UGAguaugua, UGAguauguc, UGAguauguu, UGAgucagag, UGAgucuacg, UGAgugaaua, UGAgugaauu, UGAgugagaa, UGAgugagau, UGAgugagca, UGAgugagcc, UGAgugagga, UGAgugagua, UGAgugagug, UGAgugaguu, UGAgugggaa, UGAguuaaga, UGAguuaaug, UGAguuacgg, UGAguuaggu, UGAguucuau, UGAguugguu, UGAguuguag, UGAguuuauc, UGCguaaguc, UGCguaagug, UGCguacggc, UGCguacggg, UGCguaugua, UGGgcaaguc, UGGgcaagug, UGGgcacauc, UGGgccacgu, UGGgccccgg, UGGguaaaau, UGGguaaagc, UGGguaaagg, UGGguaaagu, UGGguaaaua, UGGguaaaug, UGGguaaauu, UGGguaacag, UGGguaacau, UGGguaacua, UGGguaacuu, UGGguaagaa, UGGguaagac, UGGguaagag, UGGguaagau, UGGguaagca, UGGguaagcc, UGGguaagcu, UGGguaaggg, UGGguaaggu, UGGguaagua, UGGguaaguc, UGGguaagug, UGGguaaguu, UGGguaaugu, UGGguaauua, UGGguaauuu, UGGguacaaa, UGGguacagu, UGGguacuac, UGGguaggga, UGGguagguc, UGGguaggug, UGGguagguu, UGGguaguua, UGGguauagu, UGGguaugaa, UGGguaugac, UGGguaugag, UGGguaugua, UGGguauguc, UGGguaugug, UGGguauguu, UGGguauuug, UGGgucuuug, UGGgugaccu, UGGgugacua, UGGgugagac, UGGgugagag, UGGgugagca, UGGgugagcc, UGGgugagga, UGGgugaggc, UGGgugaggg, UGGgugagua, UGGgugaguc, UGGgugagug, UGGgugaguu, UGGgugcgug, UGGguggagg, UGGguggcuu, UGGguggggg, UGGgugggua, UGGguggguc, UGGgugggug, UGGguggguu, UGGgugugga, UGGguguguc, UGGgugugug, UGGguguguu, UGGguguuua, UGGguuaaug, UGGguuaguc, UGGguuagug, UGGguuaguu, UGGguucaag, UGGguucgua, UGGguuggug, UGGguuuaag, UGGguuugua, UGUgcaagua, UGUguaaaua, UGUguaagaa, UGUguaagac, UGUguaagag, UGUguaaggu, UGUguaagua, UGUguaaguc, UGUguaaguu, UGUguacuuc, UGUguaggcg, UGUguaggua, UGUguaguua, UGUguaugug, UGUgucagua, UGUgucugua, UGUgucuguc, UGUgugaccc, UGUgugagau, UGUgugagca, UGUgugagcc, UGUgugagua, UGUgugaguc, UGUgugagug, UGUgugcgug, UGUgugggug, UGUguggguu, UGUgugugag, UGUguguucu, UGUguuuaga, UUAguaaaua, UUAguaagaa, UUAguaagua, UUAguaagug, UUAguaaguu, UUAguaggug, UUAgugagca, UUAgugaguu, UUAguuaagu, UUCguaaguc, UUCguaaguu, UUCguaauua, UUCgugagua, UUCgugaguu, UUGgcaagug, UUGgccgagu, UUGguaaaaa, UUGguaaaau, UUGguaaaga, UUGguaaagg, UUGguaaagu, UUGguaaauc, UUGguaaaug, UUGguaaauu, UUGguaacug, UUGguaacuu, UUGguaagaa, UUGguaagag, UUGguaagcu, UUGguaagga, UUGguaaggg, UUGguaagua, UUGguaagug, UUGguaaguu, UUGguaauac, UUGguaauca, UUGguaaugc, UUGguaaugu, UUGguaauug, UUGguaauuu, UUGguacaua, UUGguacgug, UUGguagagg, UUGguaggac, UUGguaggcg, UUGguaggcu, UUGguaggga, UUGguaggua, UUGguagguc, UUGguaggug, UUGguauaaa, UUGguauaca, UUGguauauu, UUGguaucua, UUGguaucuc, UUGguaugca, UUGguaugua, UUGguaugug, UUGguauguu, UUGguauugu, UUGguauuua, UUGguauuuu, UUGgucagaa, UUGgucagua, UUGgucucug, UUGgucugca, UUGgugaaaa, UUGgugacug, UUGgugagac, UUGgugagau, UUGgugagca, UUGgugagga, UUGgugaggg, UUGgugagua, UUGgugaguc, UUGgugagug, UUGgugaguu, UUGgugaugg, UUGgugauua, UUGgugauug, UUGgugcaca, UUGgugggaa, UUGguggggc, UUGgugggua, UUGguggguc, UUGgugggug, UUGguggguu, UUGguguggu, UUGguguguc, UUGgugugug, UUGguguguu, UUGguuaagu, UUGguuagca, UUGguuagug, UUGguuaguu, UUGguuggga, UUGguugguu, UUGguuugua, UUGguuuguc, UUUgcaagug, UUUguaaaua, UUUguaaaug, UUUguaagaa, UUUguaagac, UUUguaagag, UUUguaagca, UUUguaaggu, UUUguaagua, UUUguaaguc, UUUguaagug, UUUguaaguu, UUUguaauuu, UUUguacagg, UUUguacgug, UUUguacuag, UUUguacugu, UUUguagguu, UUUguauccu, UUUguauguu, UUUgugagca, UUUgugagug, UUUgugcguc, UUUguguguc, and uGGguaccug.
Additional exemplary gene sequences and splice site sequences (e.g., 5′ splice site sequences) include AAGgcaagau, AUGguaugug, GGGgugaggc, CAGguaggug, AAGgucagua, AAGguuagag, AUGgcacuua, UAAguaaguc, UGGgugagcu, CGAgcugggc, AAAgcacccc, UAGguggggg, AGAguaacgu, UCGgugaugu, AAUgucaguu, AGGgucugag, GAGgugacug, AUGguagguu, GAGgucuguc, CAGguaugug, CAAguacugc, CACgugcgua, CCGgugagcu, CAGguacuuc, CAGgcgagag, GAAgcaagua, AGGgugagca, CAGgcaaguc, AAGgugaggc, CAGguaagua, CCAguugggu, AAGguguggg, CAGguuggag, CCGguaugaa, UGGguaaugu, CAGgugaggu, AGAguaauag, CAGguaugag, AUGguaaguu, UUGguggguc, UUUguaagca, CUCguaugcc, UAGguaagag, UAGgcaaguu, GGAguuaagu, GAGguaugcc, AAGguguggu, CAGgugggug, UUAguaagua, AAGguuggcu, UGAguaugug, CCAgccuucc, CCUguacgug, CCUguaggua, CAGguacgcu, GAGguucuuc, AAGguugccu, CGUguucacu, CGGgugggga, UAGgugggau, CGGguaagga, AAGguacuau, GGGguaagcu, ACGguagagc, CAGgugaaga, GCGguaagag, CAGguguugu, GAAguuugug, AUGgugagca, CGGguucgug, AUUguccggc, GAUgugugug, AUGgucuguu, AAGguaggau, CCGguaagau, AAGguaaaga, GGGgugaguu, AGGguuggug, GGAgugagug, AGUguaagga, UAGguaacug, AAGgugaaga, UGGguaagug, CAGguaagag, UAGgugagcg, GAGguaaaaa, GCCguaaguu, AAGguuuugu, CAGgugagga, ACAgcccaug, GCGgugagcc, CAGguaugca, AUGguaccua, CAAguaugua, AUGguggugc, UAAguggcag, UAGguauagu, CUGguauuua, AGGguaaacg, AUAguaagug, UUGguacuga, GGUguaagcc, GAGguggaua, GAUguaagaa, ACGgucaguu, UAAguaaaca, AAGguaucug, AGGguauuug, AAGgugaaug, CUGgugaauu, CAGguuuuuu, CAUguaugug, UUGguagagg, AAGguaugcc, CAGgugccac, UCGguauuga, AAGguuugug, AAUguacagg, CAUguggguu, CAUgugaguu, UUGguaaugu, AGUguaggug, GAGguaacuc, GAGguggcgc, CUGguaauug, GAGguuugcu, UGUguacgug, UAGguaaaga, CUAguaggca, UCUgugaguc, UCUguaaggc, CAGguuugug, GAGguagggc, AAGguaacca, ACUgugaguu, UAGguaauag, AAAguaagcu, AUGgugagug, UAGguuugug, AACguaggac, GUAgcaggua, GAGgucagac, AGGguaugaa, GAGguuagug, CAGgcacgug, GGGgcaagac, CAGguguguc, CAGguauuga, CAGguauguc, AAGgcaaggu, UUGgugagaa, AAGguaaaau, GGGguaagua, AAGguaucuu, GACgugaguc, UAUguaugcu, AAGguacugu, CAGgugaacu, CACguaaaug, AAGgugugau, GAAguauuug, AAGgucugug, AAGguggagg, AAGguauaug, CAGguucuua, AGGguaacca, CAGgugucac, AAAguucugu, UUGgugaguu, CAAgugaguc, UAGguagguc, GCGgugagcu, AUUgugagga, CAGgugcaca, CAGguuggaa, CUGgucacuu, GGAguaagug, GAGgugggcu, AAGguacuug, AGGguaggau, AAUguguguu, ACAguuaagu, GAGgugugug, AAGgcgggcu, AUAgcaagua, AAGguuguua, CAAgcaaggc, GUGguaauua, UCUguucagu, AGGguaggcc, AAGguaucau, UAGguaccuu, AAGguaugac, GGAguaggua, UAAguuggca, AGUgugaggc, GAGguuugug, UGGgucugcu, CAGgugaucc, CAGgucagug, AAGguaaggg, CAGgugcagu, GAGguggguc, GCUgugagug, AAGguggagu, GGGgucaguu, AGCguaagug, AGAguaugaa, GGGguagggu, AAGgccagca, CGAguaugcc, GUGgugagcg, AAUguaaauu, CAGgugcgca, GGUguaugaa, CUUgugaguu, AAGguaucuc, AGAguaagga, UAGguaagac, GAGgugagug, CAGguguguu, UUGgugagua, AGGgcgaguu, CAGguuuugc, UUUgugaguu, AGGguaagca, GAGguccucu, CCAgcaggua, GAGguucgcg, CAGgugaucu, ACUguaagua, AAGguaaauc, CAGgcaaaua, GUGguaagca, CAGguuaaau, UUGguaauaa, UAUguaggua, CAGguaguau, AAGgugugcc, UGGguaagag, CAGgcaagca, UUGguaaggg, AAGgcaggug, ACGguaaaug, GCUgugagca, AUGguacaca, GUAguguguu, ACUguaagag, CCCgcagguc, GAGgugagcc, GAGgugcugu, UAAguaugcu, GAGgccaucu, UCAgugagug, CAGgugcuac, AAUgugggug, GAGgugugaa, CUGguagguc, GUGgcgcgcg, CAGgugcaaa, UAAguggagg, CAUgugggua, GAGguagggu, AAAgugaguu, AGGguucuag, UGUgugagcu, AGGgugaauc, CAGgucaggg, AAGgucccug, CUGguagagu, UAGgucaguu, AAAguaaggg, CAAguaugug, CAGgugcuuu, AAGguaauuc, GGGgugcacg, ACUgugcuac, CAGguaccua, CAGguagcuu, UGGgugaggc, CUGguacauu, AGGguaaucu, CAGguacaag, CAGguaauuc, AGGgcacuug, UAGgugagaa, GAGguaaugc, CCAgugaguu, AAAguaugug, CUGgugaauc, UAUguaugua, CCUgcaggug, CAGguaucug, GAGgugaggu, CUGguaaaac, UGUgugugcu, CAGguuaagu, CAGguaaucc, UAGguauuug, UGGguagguc, CAGguaacag, AGCgugcgug, AAGgucagga, GGUgugagcc, CUGguaagua, GGGgugggca, AAGgugggaa, CAGgugagug, CUGguuguua, CAGguaauag, UAGgugaguu, AGAguaaguu, UAGguaaucc, CCGgugacug, GUCgugauua, CUUguaagug, UAGguaguca, CUGguaaguc, AGGgugagcg, CAGguaugga, AUUgugacca, GUUgugggua, AAGguacaag, CUAgcaagug, CUGgugagau, CAGgugggca, AUGgcucgag, CUGguacguu, UUGgugugua, GAGgugucug, GAGgugggac, GGGgugggag, GCAgcgugag, GAGguaaaga, GAGguaugua, AAGgugagac, AAGguacaau, CUGguaugag, AACguaaaau, GUGguaggga, CUGguaugug, CUUguaagca, AAGguaggga, AUUguaagcc, AUGguaagcu, CAGgugaauu, UAGgugaaua, CAAguaugga, AUGguauggc, GAGgucaugc, CAGguacccu, ACAgugagac, CAGgucugau, GAAguugggu, CUGgugcgug, CAGguacgag, ACAgugagcc, AAGguaagua, GGAguaaggc, GAGgugugua, AAGgucauuu, CAGguagucu, AUGguaucug, AAGguaaacu, GAGguaggug, CUGguaagca, AGGguaagag, AAAguaaagc, CAGguuugag, GAGgcgggua, CGAguacgau, CAGguuguug, AAAguauggg, UAGgcugguc, AAGguaagga, AAGguuuccu, UUGguaaaac, GAGguaagua, CAGguucaag, UGGguuaugu, GAGgugaguu, ACGgugaaac, GAUguaacca, AAGgugcggg, CCGguacgug, GAUgugagaa, GUGgcgguga, CAGguauuag, GAGguuggga, AAGgcuagua, AAGgugggcg, CAGgcaggga, AAUguuaguu, GAGguaaagg, CAGgugugcu, CUGguaugau, AUGguuaguc, CUGgugagaa, CAGgccggcg, CAGgugacug, AAAguaaggu, UAAguacuug, AAGguaaagc, UCGguagggg, CAGguaggaa, AGUguaagca, CCCgugagau, GUGguuguuu, CAGguuugcc, AGGguauggg, UAAguaagug, GAGguaagac, GAUguagguc, CAAguaggug, AUAguaaaua, GAGguugggg, GAGgcgagua, CAGguagugu, GUGguaggug, CAAgugagug, AAGgugacaa, CCAgcguaau, ACGgugaggu, GGGguauauu, CAGgugagua, AAGgugcgug, UAUguaaauu, CAGgucagua, ACGguacuua, GAGgucagca, UAAguaugua, GGGgucagac, AAUgugugag, UCCgucagua, CAGgugcuuc, CCAguuagug, CCGgugggcg, AGGgugcaug, GGGguaggau, UAGgugggcc, GAGguguucg, UUGgcaagaa, UCCguaagua, CAGguguaag, CUCgugagua, GAGguguuuu, GAGgugagca, GAGguaaagu, AAGguacguu, CAGguccagu, AUGgugaaac, GUAgugagcu, CAGgugaaaa, AGGguacagg, AAGguaacgc, AAGguauacc, CCUgugagau, GGGguacgug, GAGguauggu, UAGguauuau, GAAguaggag, UCGguaaggg, CCGguaagcg, GAAguaauua, CAGgugaguc, AAGgucaaga, AUGguaaguc, CAGgugagcu, CCAguuuuug, CAGgugggag, AAGguauuau, AAGguaaaua, AAGgugcugu, AAAguacacc, CUGguucgug, UCAguaaguc, GAAguacgug, CAGgugacaa, UGGguaagaa, UGUguagggg, GAGguaggca, UUGgugaggc, AUGgugugua, CAGguccucc, UUGguaaaug, GCUgugaguu, AUGgucugua, CAUgcaggug, CUGguacacc, CAGguccuua, CAAguaaucu, AUGgcagccu, AAGgucagaa, AACgugaggc, CAGgcacgca, ACGguccagg, UCUguacaua, GAGgugauua, ACGguaaaua, AUGguaacug, CAGgcgcguu, CAGguauaga, AAGguuuguu, CAGguaugaa, UAGguuggua, CUGgugagac, CAGguuagga, AUGgugacug, UUGguauccc, CUUguaggac, AAAguguguu, CAGguuucuu, GGGguauggc, GGGguaggac, ACUguaaguc, AUCguaagcu, UAGguucccc, GGUgugagca, CUGguuggua, GGGguuaggg, UGAguaagaa, GAGguauucc, UGGguuaguc, CAGgcucgug, UAGguagagu, UAGgugcccu, AAAgugagua, GAGguucaua, UUGguaagag, ACCgugugua, UAUguaguau, UGGguaauag, CAGgucugaa, AAAguauaaa, GUGgugaguc, AGUgugauua, UUGgugugug, CAGgugaugg, GCUgugagua, CAGguacaug, AAGguacagu, GAAguuguag, CAGgugauua, UAGgugaauu, GGUguuaaua, CAGguauuua, CAAguacucg, CAAguaagaa, AAGguaccuu, ACGgugaggg, UGAgcaggca, GGGgugaccg, GAGguaaaug, CGGguuugug, AAGgugagcg, GUGguaugga, CUGguaagga, GAGguaccag, CCGgugagug, AAGguuagaa, GAGguacuug, AGAguaaaac, UCUgugagua, AAGgcgggaa, CAGguaugcg, AGGguaaaac, AAGgugacug, AGGguauguu, AAGguaugua, CAGgucucuc, CAGgcaugua, CUGguaggua, AAGgucaugc, CAGguacaca, GAUguacguu, ACAguacgug, ACGguaccca, CAGguagugc, ACAguaagag, GGUgcacacc, GAGguguaac, AAGgugugua, UAGguacuua, GCGguacugc, UGGguaaguc, CAUguaggua, CAGguaggau, CAGgucuggc, GUGguuuuaa, CAGgugggaa, UGGgugagua, CGAgugagcc, AAGguauggc, AGUguuguca, CAGgugauuu, UAGguaucuc, UAAguauguu, AAGguugagc, AGAguaaaga, GGUguaagua, GGGgugagcu, CAGguauaau, GAGguacaaa, AUGguaccaa, UAGguagggg, UGAgucagaa, AAGgcaauua, UUGguaagau, CAGguacaga, AGAguuagag, CAGgugcguc, GAGguauuac, ACGguacaga, CAGgucuucc, AAGguaaggu, GAGguaauuu, AGUguaggcu, AAAguaagcg, CCUguaagcc, AGGgugauuu, UGUguaugaa, CUGguacaca, AGGguagaga, AUAguaagca, AGAguaugua, UUGgucagca, CAGgcaaguu, AAGguauaua, AAGgucugga, CAGguacgca, AGGgugcggg, AUGguaagug, AAAgugauga, UGCgugagua, AGAguaggga, UGUguaggua, UAGguaggau, UAAgugagug, GCUguaagua, GAAguaagaa, UCGgugaggc, UAGguauuuu, AAGguacaca, AAGguaggua, UGGguagguu, ACAgcaagua, GAGguaggag, UGGgugaguu, GCGgugagau, CCUguagguu, CAGgugugua, CUGguaagcc, AAGgugauuc, CAGguagcua, GUUguaagug, AUGguaagca, AUAguaggga, GGGguucgcu, CCGgucagag, GUAguaugag, CGUguaagau, UGAguaggca, UCAguaugua, GAGguaucug, AGAguauuuu, AAGguuguag, AGUguaaguu, CGGguaaguu, UCGgugcgga, UAGguaagua, GAAguuagau, GCUgugagac, CAGgcaggua, CAGguagggg, UAAguuaaga, AUGguggguu, UAGguaaguu, CUGguaaauu, CCGguaagga, GAGgcaggca, CAUguaagug, AAGgugccua, UUGguaggga, AAGguaaaca, CGGgugugag, GGGgugugag, UCCguggguc, ACGguaaauc, UCAguaggua, CAGgucagcc, CAGgcggugg, CGAguaagcu, CCCgugagca, AAAguaauga, CUGguaagcu, CGGguaacca, CAGgucgcac, GAGguaggcc, UAGgugagcc, UAGguaggca, GCGgugcgug, AUGgugagua, GGGgugaggg, GAGgucacac, CAGguaggcc, CAAgugcuga, GUCgucuuca, CAUguaagaa, GUAguaagga, UAGguuugua, CAAguuagag, AAGguagagu, AAGgugagau, AAAguaggua, ACAgugaauc, CAGgugugcg, CAGgucggcc, AAGguaguau, ACUgucaguc, UCUgcagccu, CGAguaagug, AGAguaauua, AGUgugagug, CCGgugagcg, AAGguaaccu, AAGguugugg, AAGgcauggg, AAGgucagag, ACGguaaggu, GGGgugagca, GAGguugcuu, AAGguaucgc, CCGguaaagg, AAAguuaaug, UAGguacgag, ACCguaauua, GGGguaagga, CCGguaacgc, CAGgucagaa, AAGguacuga, GAGgugacca, GGGgugagcc, AAGguacagg, AUGguaauua, CAGgugagag, AAGgugacuc, AUAguaagua, GAGguaaacc, CAGgugggau, CAGgugagaa, AGGguaaaaa, GAGgugugac, CACguaagcu, CAGguccccc, CAGgucaggu, CGGguaaguc, ACGguauggg, GAUguaaguu, CAAguaauau, CAGguugggg, CCUgugcugg, AAGguaugau, AGGguagagg, AAGguggguu, CAGgugugaa, UUGguaugug, UUGguaucuc, GGGgugagug, CUGgugugug, AGGguagggc, GUGgugagua, CAGguaugua, AAGguacauu, UUAguaagug, AAUguauauc, CUUguaagua, GAGguuagua, CAGguaaggu, CAGguaaugu, AGGgugaggc, CAGguauuuc, CAGgucugga, GGGgugugcu, UAGgugagug, AAUguaaccu, UAAgugaguc, CAGgugcacu, ACGguaagua, GAGguauccu, UCUguaaguc, CAGguauuca, UGUguaagug, CCAgcaaggc, GAGgugaagg, AAUguggggu, UCGgugcgug, UUGguaaggc, GAGguaagug, AAAguaagau, UAGgucuuuu, GAGgucugau, CCAguuagag, UGGgugaaaa, AGAguaagau, CAGguaauug, CAGgccgguc, CCGguaagag, GAGgugagcu, CUGguaagac, CAGgugagau, CUGguuuguu, UGGguaggua, CAGguuagug, CAGguguucg, CGGguagguc, GUGguacaua, AAGguacuaa, GAUgugagua, UGUguaagac, GAGguagccg, UAGgugaucu, CAGguacgug, CUUgucaguc, GAGguaucac, GAGguaauga, AAGguaacac, CAGguaaagc, AAGgcaagua, CGCgugagcc, AGUgugcguu, GAUguaagca, AAGguaauag, GGAgcaguug, AGCguaagau, AAGgucaggc, GAGguauuca, AAUguaaagu, CAGguaacaa, UCGguaggug, AAAguaaguc, CGGgugcagu, GGUgugugca, UGAgugagaa, CACguguaag, GUGguuggua, GCAgccuuga, CGAgugugau, CAGguauaua, UAUguaugug, CCCgugguca, AUGguaagac, GAGgugugga, AGUguauccu, UGAguguguc, UGGguaaucu, AUGgcagguu, GAGguaagau, UCAgcagcgu, AAGgugggau, CGGgugcgcu, CAGgugucug, AGCgugguaa, AAUgugaaug, UCGgugagac, UAGguaaagc, CUGguaaaag, CCGgugcgga, CAGguacuca, CAGguagcaa, GAAguugagu, GAGguggagg, AGGguaugag, UAGguaugcu, UAGgugagac, CAGguaauua, CGUguaagcc, CUUguaaguu, AAGguaacuu, UCGgcaaggc, GAGguucucg, GAGgugggcg, AAGgcaugug, CUGguauguu, UAAgucauuu, CAUguaauua, AAUguaaaga, UAGgugcuca, AAGguaaugg, GAGguacuga, UGGguaagua, UGGguaaaaa, AAGgugagcu, UACgugaguu, AGGgugagcc, CGGgugagga, UGGgugagag, GGUguaagcu, CGGguggguu, CCAgcuaagu, AAGguuuguc, GAGguuagac, GAGguaccuc, UUUguaaguu, GAGguuagga, CAGguaggga, AGGguaauac, UGCgugugua, CCAguaacca, AGGgucuguc, UGGguaugua, GUGguaagcu, CAGguaaccu, AAGgugaguu, UAGguucgug, AAAguuagua, UGGgcaaguc, AAGgcacagu, GUUguaaguc, AAGguuugcc, CUUgcauggg, GCGgugagua, GGGguaagcg, GCCguaagaa, GAGgucggga, UUGguauugu, AGUgugagac, CUGgugggga, AGAguaaggu, CCGguggguc, CAGguauucu, UGGguaacgu, UUGgugagag, UAGguacccu, GGGgugcguc, AAGgcaggag, ACGguacauu, GAGguaguua, CAGguauggg, UUUguguguc, CAGguacuua, AUGguauacu, AGUgugagcc, ACAguaacga, CUGguaccca, CAGguaaccc, GGAguaagua, GAGgugggug, ACUguauguc, ACGgugagua, CUGguaaugu, AAGguaucag, CAGgugcccc, AGUgucagug, AAGguaggag, GGAguaugug, UUGguauuuu, CCUguuguga, UUUguaagaa, UAGguaacau, CAGguaagca, CAGgucacag, CAGgugugag, UAGguuugcg, CUGguaagaa, ACGguuguau, AAGguugggg, AAGgugaauu, GGGguuaguu, ACGguaaggc, CAGguuuaag, CUGguaaguu, GGGgugagag, UGGguggguu, GAGguuuguu, UGGguaaaug, CAGgcaggcc, CACgugcagg, AAGgugagcc, CAAguaagug, CAGgucaguc, GCGguauaau, UAGguaaagu, UAGguggauu, GAGgucugga, UCGgucaguu, UGGguaacug, AAGguuugau, UGUgcuggug, UGUguaccuc, UGGguacagu, AUCgucagcg, CAGgucuugg, GAAguuggua, GAAguaaaga, UUGguaagcu, UAGguaccag, AGGguaucau, CAGguaaaaa, ACGguaauuu, AUUguaaguu, GAGguacagu, CAGgugaaag, UGGguuguuu, GGGguaggug, CAGgugccca, AGCgugagau, CCAgugagug, AGGguagaug, UGGguguguc, AUCgcgugag, AGGguaagcc, AGGguagcag, UUCguuuccg, AAGguaagcg, UGGguaagcc, CAGguauggc, UGUguaagua, AAGguagaga, ACGguaauaa, CUGguacggu, GAGgucacag, UAUguaaguu, CUGguacgcc, CAAguaagau, CUAgugagua, CCGguaaccg, CUUguaaguc, GUGgugagaa, ACCguaugua, GUAguaagug, UUGgugggua, CGGguacuuu, UGGguaaaua, AGAgugagua, AAGguagguu, AAGguaugcg, CCUguaggcu, ACAguagaaa, CCGguuagua, CGGguaggcg, GCAgugagug, GAGgugaguc, CUGguagccu, CAUguaugua, GAAguaacuu, GAAguaagau, AAGguuagau, AAGguaauca, AAUguaugua, UGAguaagau, AGAgugagca, GUAguucuau, GAGguaauca, UAGguaugga, UAGgugggac, GAGguacaug, UGGguaaggc, CAGguacgcc, CCAguuacgc, ACUgugguga, GAGguaaguc, AUUguaggug, ACCgucagug, AAUgugaggg, ACUgugagug, UGGguguggu, AAGguuggga, AAGguuugga, UCCgugagug, CGGgugagug, AGAguaagcu, CAGgcaagcu, UAGguauauu, AAAguagcag, GAGguaaccu, AAGgugggca, AGGgugagua, UGGguaaggu, CUUgucagug, UAGgugcgcu, GAGgcaaauu, AGGguaccuc, CAAgugcgua, AGAguaagac, GUGguaaaua, GAUguaagcg, GAGguaaagc, UAGgugagua, CAGguaacau, CCUguacggc, UAGguauguc, UAGguccaua, GAGgugaaaa, AAAguacuga, UUGguaagcg, CAGgcaagcg, UUUgcagguu, CAGguuuaua, CUGguaaagc, AUGgugagcu, CAGgugguug, GUAguaaguu, CAGguaauac, CAGgcaaggc, AAGguaauuu, UUUguccgug, GAGguagguu, ACCgugagug, CAAguaagcu, ACAgugagua, UUGgugagau, AAGguagucu, CAGguaaagg, GGGguaugga, UUUguaagug, GUGguaagag, AGUgugaguu, AAGgcaagcg, UAAgugagua, AGGgugagug, AGUguacgug, AGGgugcgua, GGCgugagcc, CGAguuauga, CAGguaaaga, UUGgugaaga, AGGguaaugg, AAGguccaga, AGUgugaguc, CAGguaauuu, CAGguaacgc, CUGguacacu, CUGguuagug, CAGguacuug, CACguaagua, GUGgugcggc, GAGgucaguu, AUGguaugcc, AAGgugugug, CUGguggguc, CAGgugaggc, AAGguuaguc, AAGguagcug, GAGgucagga, GUUguaggua, UGGguacaag, AUGguaggug, GAGguaagcc, AUGgcaagua, AAGguauauu, GCGgugagag, AAGgugcuuc, UAGguacauc, ACUgugguaa, GAGguaggcu, GAGguaugca, AGGguaguuc, CAGguauccu, AGGguaaguc, AGGgucaguu, CAGguuggga, CAGguggaua, GGAguagguu, GAGguaggau, GGGguuugug, UAGguaauug, AAGguaaccc, ACGguaagaa, GAGguagggg, CGAguaggug, UCCguaagug, UCGguacagg, CAAguaagcg, AAGguccgcg, AAUgugagua, CAGgugaaug, GUGguaaggc, AGAgugagug, UCUguauguc, UGGgugaguc, UCGguuagua, GAUguaugca, GAGguuggug, GAGguggggc, UGGgucaguc, GCAgugagua, CAGguugcuu, AGGguagagu, UAGgucaggu, CGCguaugua, GAGguauuaa, CAGguaaacu, AAAguaaguu, GGGgucuggc, GCUguggggu, UUGguaaguc, AAGguagaag, AAUgugaguc, AAGgucagcu, AAGguaagag, AUGgugagga, AAGguacuuc, AAGguaagaa, CCGguacagc, GCGgugcgga, CAGguacaua, CUGgugagga, CUGguaggug, AACguagguu, AUGgugugug, UUGguacuau, CAGgucggug, CAGgcauggg, AUGguaucuu, AAGguaacua, CAGgugggcg, CACgugagga, AAGgugguuc, UGGgcauucu, AUGguaagcc, AGGgucagug, AGAguacgua, AAGguaggca, AAGguauuca, CAGguagauu, GAGguauuua, GAGgucuaca, GUUguagguc, CAGguacucg, GUCguauguu, AAGguacuuu, AGAgugagau, AGUguuggua, AAUgugagug, AAGguagauu, AUGguuugua, GAGgccccag, AUGgucaguu, UCUguaagga, CAGgucgggc, CAGguaagcc, UAGgucagug, AGAguaggaa, CUGguacuuc, CUCguaagca, CAGguaacua, CAGguggcug, UGGguccgua, GAGguugugc, CAGgugcgcg, AAAguauggc, UGAguacgua, CUGguacgga, CAAgugaccu, AAGgugaugu, AAGgucugca, AAAguuugua, AAGgugagca, GAUguaagcc, CAAguaauuu, CAGgugugug, UGGgugaggg, AAGgugaccu, UAGgugugag, CAGgcagguc, UCAguaaguu, UCAgcaguga, AAGguaccac, UAAguaggug, AAGgucagcc, CAGguaacuc, AAAguaagag, AAGguagaua, AAGgcaaggg, CAGgugucgg, CAGguggcua, GAGguugcca, CAGgccgugg, UUGguauaug, GAGguugagu, GAGguagguc, GUGguaagac, UAGguccuuc, GAGgcaaguc, GAGguaacau, CAGguauauc, UCGguugguu, CAGgugaacc, CAGgucuuuu, CAGgcauggc, AAAguacuug, CAGgugauuc, UUGguagguu, UAUgugagca, CAGgugagcg, AAUguaauaa, AAAguaaggc, UAGguuuguc, UAGgugggag, GAGguaaguu, AAGguagccg, CAGguggugc, UGAgucaguu, CUGguaggcc, CAAguaagga, CGGguaaggc, AAGgcgagga, CAGguaguuc, CAGguaagga, CCUgugagug, AAGguaaaug, CCGguaauua, CAGguaaguu, AAGgugguca, CAGguaccuc, AUCguaagua, CCGguacaua, GCGgugagug, GAGgugguau, CUGgugugga, GAGguaauuc, CAAguacgua, UCUguaagug, AAUguaagug, AGGgucuguu, GAGguacugc, AGGguaaggc, AAGgcaagag, CAGguggguu, UAGguuagga, UGAguaagcu, AGAguaagag, AUGgcaggug, UAGgcaagua, AUGguaggua, GCAgcccgca, ACGguaaacu, AGGgugaguu, GUAguagucu, GUGgcugaaa, CAGguuaguc, CUGgugagca, UCAguaagug, AAAgugauug, UAGgucugga, GAGguguuuc, AAGguaaauu, CAUguacauc, AAGguuugaa, CCAgcaagug, UAGguaauaa, GAGgcaagug, CAAgugauuc, CAGgucgugg, GAAguaugcc, UCGgugcccu, GAGgucaguc, CAGgugagac, UUUgucugua, CAGguagaua, UGGguaucag, UAGgugggcu, AUGgugagau, CAGguaacac, CCGguauccu, UAGguaagcu, UCAguacauc, UAGguuugcc, AUGguaagaa, UUGguaagac, CCGguuaguc, GAGguaagaa, UGGguaaguu, CCGgugagaa, CCUgugaggg, ACGguaggag, ACAguauguc, CAGguauuaa, CAGguggauc, AGAgugcgua, AAGgugaccg, AGAguaggug, ACUguaugua, UAGgucaauu, AGUguguaag, CGGguaccuu, CUAgugaguu, CUAguaagug, CAGguacaac, UAGgugugug, CAUguacggc, AUGgugugag, AGGguggaag, CAGgugcgag, UAGgugcucc, AAGguggugg, AAGgucuguu, CAGgugggcc, AAGgucaguc, CAGguuuuua, AACgugaggu, CGGguaagag, UUUgucggua, UAGguuaagu, GUGguaagaa, CAGguauugg, GCUguaaguu, CUAguaagua, UCGguaaaua, CAGguaacuu, CCUgugagua, CAGguuauau, CUGgugaaca, AAGguauaaa, GAGguaagca, AAGgugaagc, CAGgugaguu, UUUgugagua, CUUguacgcc, AGAguaagug, UGGguaggug, UGAgcccugc, UGUguaugua, AAGguagagg, GAGguggggg, UAGguaauuc, AAGgcauggu, AGAguaagca, AAGguaggaa, CAAguaagua, ACUguaauug, CAGgucugug, UCGguaccga, CUGgugagag, AAGguuugcu, AUGguaccac, UAAguuaguu, CAGguaggac, AGAgugaggc, CGAgucagua, CAGgucugag, GAGguggugg, ACGguauugg, GCUgcgagua, CUGguaagug, GUGgugagau, GGGguuugau, UCUgugagug, CUUgucagua, GAGguaaaac, UCUguaagau, CCAguaaguu, CAGguaaagu, GCGgugagca, UAAguaagag, CUGgcaggug, GAGguaaggg, UGAguaaguu, GAGgugagac, GCUgucuguu, AAGguaacaa, GAGguaacgg, CUGguauucu, CAAguaacug, AAGguggggu, UAGguauggc, CAGguauuuu, GUGguaaacu, GAGgucugag, CUGguaaggu, CAAguaaguu, AAGguagacc, GAGgcgagcg, CUGguaaaua, UGUguaagcg, CAGguuaggg, GGGgugagga, ACAguaugug, CCGgugggga, GAGgucagug, AGGguaaggu, ACAguaagua, GGUguaaggu, GAGguaauaa, CAGguauucc, CUGguauaaa, CCGgucugug, CAGguaacug, GCAguaagua, AAGguagggg, CAAguccacc, CAAguuggug, CAGgugcggu, CAGguaaaau, ACGguaagga, UGGguaauaa, UAGguaagug, CCGguagguu, AGAguaugga, CUCgugaguc, AAAgccggug, UUGguaauuu, GAGguaaaag, CCUgugugag, AAAguaagga, UGAgugagug, AAGguacaug, CCGguaaaug, CAGgugaagc, CAGguacccg, GAGguaaggc, UUUguauguu, CAGgugcucc, UCGguagguc, CGGgugaggc, AAGguaauua, ACUgugaguc, AAGgucagca, GUGgugagug, CAUguccacc, AAGgugaccc, CGGguuagua, GCGguaguaa, GCUguaggua, CCUguugagu, UAGgucuggc, GAUgugagcc, CUUgugagua, CUGguguguu, GAGgcaugug, CAGgcaagag, UUGguaagaa, GAGguguggg, GAGguauuuu, CAGguaguaa, AGGguaagac, UUUguaggca, AGGgugagau, GAGguuugua, AAGgugagug, GAGgugggag, AAGgugagaa, CUGguaagag, AUAguaaaga, GAUgugaguc, AAGgugcagg, CAGgucuguc, GAGgugauuu, CAGguuggcu, CGGguauggg, AUGguccauc, CCGguuggug, GGAguaaguc, AAUguaagga, CAGguuuguu, UAGgugugua, UAUgucuuug, ACGguacuuc, AAGgcacgcg, CUGguaaacc, CUUgugggua, UGAguaaguc, CUGgugggug, GAGguggaga, GUGguggcug, GUGguaagug, AACgugagua, GAAgcuguaa, CGGguaucuu, CAGgugucag, AAUguacgca, CCGgugggua, UGGgugaggu, AAGguauguu, CAGguauguu, CAGguuugcu, UUGguaaguu, CAGguaguug, CCUgugaaua, GCUgugugug, CAAguaauuc, AGGguaaugu, GCUgugaguc, ACCguaaguu, CGUguaagua, GGGguaaguc, AAUguaugau, AAUgugauua, UCAguaagaa, CAGguccguc, GAAguauuga, UUGguaagga, CAGgucgguu, UAGguuagug, ACGguaaaac, AAGguagguc, UACgugagua, UUGguaagca, GCGgugaguc, GAAguaaggg, CGCgugaguu, CAGguacccc, UCUguaagac, GAGgugggca, AAUguaagac, CAGgcaaggg, CAAguaacua, AAAguuuguc, CAGguacugu, AAGgucccuc, UCGguaaguc, UGGgugagug, CUUgugagau, AGAgugagcu, UAAgugggga, UAGguaggga, CAGguuagcc, AGGguaauca, AAGguucagc, UGGgugggug, CAGguuguga, AAGguaagug, CAUgugcgua, CCGguauauu, ACCguaugug, CAGguauagu, CAGguauuac, CAGgugcagg, GUGgugagcu, AAGguaacau, CUGgugaugg, AUGguaaaug, CCGgugagca, AAGguaaacc, AAGguacugg, GCGgucagga, CUGgucaggg, AAAguacguu, AGAguagguu, AGGguaagcu, AUUgugagua, CCGgccacca, GAGguaacuu, GAGguaugaa, CAGgucagac, UAGgcgugug, AGGguaaguu, CAGgcaugag, CAGguaacgu, CAGgcgagca, UAGguauggu, AGAguaggau, CUGguuucaa, GAGguaaacu, CAGgcaugca, UUGguaaucu, AGGgcagaau, AUGguaaaac, GCUgcaggug, GAAgcacgug, CAUguaaaca, UGGguaagau, AGGguagcua, AGGguggggu, CCUguaaguu, UGAgugaguu, GGAguaugua, CAGgugaccu, AAAguacgga, GAGguacaga, GAUguaggua, GGGguaauug, UAGguggguu, GUGguacgua, AAGguacagc, GAGgugaaga, GGGguaagca, UGAguagguc, GGGguaaguu, AUUgugaguu, UCAguaagac, AGUgugagcu, AAGgcaaaac, CUGgugaguc, AAGgucucug, GAGgcugugc, AGAgugagac, GAGgugaugu, AGAguauggu, UGGguggguc, GCUgcugagc, CAGguagcug, UAGgucagaa, CCGguaggug, GCAguaugau, CAGguuucag, GAGguuugcc, GGGguggggg, AAGguacaua, UGGguguguu, AGAguaaggc, GCGguuagug, AAGgugacuu, AUGguaagau, AUGguaguug, CAUguaagac, CUGguaugua, UUCguaagga, GAAguaugac, CGGguaauuc, UGGguaacuu, CAGgugccua, CAUguagggc, ACCgucagga, CGUguucgau, GAGgcaggac, UAGguaauau, UCGguauacu, UAGguugugc, CCGgugaguc, CAGgugccaa, CAGgugaugc, AAGgugagga, GUGgugaggg, UGGgucagua, GAGgucaggg, UAGguacgua, GAGgcaagag, CCUguuggua, GAGguaucca, UAAguaagcu, AAGgucaguu, AAAguuaaag, GAGgugcuau, ACGguaaguu, CUGgugaggg, GAGguuaugu, CUUgugugca, UGAgcugggg, AAGguauagu, UAGguaaaac, GGGgugaggu, GAGgcaagca, GGAguaacgu, AGAguaagua, AAAguaagua, GAGgcaacca, UGUguaaguu, UAGgugaggc, ACAguaagaa, UGAguaagug, CAAgucagua, AGGguaaaug, AAGguaugca, GCUgugcgug, GAGguucgcc, AAGgcuugca, CAGgcaagug, AUAguaaguc, UUGguaggua, GCAgcaggua, AAGguauauc, AGCguaagcc, CUGguucgaa, ACGgugggug, CUGgucauug, CAGgucagga, CAAgugagac, GAGguacugg, GAGguguagu, GAGguguccu, CAGgugcgua, AGUgcccuga, AUGgugaguc, UGUgugugua, CAGguaugcu, CUGguacagu, UUGguacgua, UCUguacgua, UAAguaauuc, CACguaugug, CAGgcaagua, UCGgugagug, GGUgugaguc, UCUguaagcu, AAGguucaga, AGGguacuuc, GCGgcagguu, GAGgcccgug, CAGguauaaa, AUGgucaagu, AAGgugagua, GUGguuuguu, AGAgugagga, GAGguaugac, UAGgcgugag, AAGguacucc, UGAgugagga, GAGguaugau, GGGgucggua, ACGguaugca, CAGguaccac, UAAguaccug, AGGgugggcu, CUGgucuguu, UAGgucagag, AAGguguguu, CUGgucagug, AAGgugggac, GUGguaguag, CUAguuuagg, CCCgccccau, GCUguacugc, GAGguaauau, UAGguuggug, AAGguccaac, UAGgugagga, GUGguaaguu, AGUgugagag, AAUguacaug, UUGgcaggug, UAGguuauug, CAGguacuga, GCGguggguc, UGUguaagau, GAGgugagua, GCAgccccgg, CAGgugcuaa, AGUguaagag, CAGguacauc, CAGgugggac, AGGguaaaua, UAAguaauua, CAGguaaccg, AAGguuugca, UAGgugguuu, CAGgugaccg, UGUguaagcu, GGAgugaguc, AGGguaggag, AGGgugggug, AAGgucugag, GAUguaauau, GGGguaauua, UAGguaggua, GAGgcaagua, GAGguaagga, UAGguacuac, UCGgugggug, AAGgugugga, CAGgucugcc, UAAgugagcc, GAAguaaguu, GAAguaagcc, UAGgugcgac, GAGguauggc, GCAguaagaa, CAGgugugga, UUGguaacgu, GCUguaaaaa, UUGguuagua, AUAguaaggg, UUGguacuag, CGGgcagccg, CAGgugcugg, UAUgugaguu, CAGgucuggg, UAAguaagaa, AAGguuauua, AGAguaaagc, AGAgugugag, UAGgugcgag, CAAguaaacg, AAGguacgua, CUGgugagua, CCAguaugua, UUGgugagug, UGAguaagua, GAGguuagca, GUGguaagcc, CUGguauggc, AAAguaacac, CAGguacuaa, UCUguaaguu, GAGgugaggg, ACUgugggua, GAUguuugug, CAGgugucaa, CAGgucacca, CCGgugagua, UUGguaaaua, CAGguggggg, ACUgcaggug, UAGguauguu, GGAgcaagug, UCGgugccuc, CAAguaacuu, GAGguaacca, CAGguaauau, GGAguaagaa, GAGguaccuu, AGGguaagga, CCUgugaguc, GAGguaaugg, AUGguguguc, GGGgugagua, AGGgucaggu, UGGguaaggg, AGGguagguu, AUAgugaguu, CCCguaggcu, ACAguaugua, GACgugugua, GCGgugagga, CAGgugaccc, UAAguuuagu, ACAguugagu, CGGgugaggg, CAGguggauu, CGGguagagg, UAGgugcgug, GGGguaagaa, GAGguggggu, CACguggguu, ACGguaauug, AGAgugaguc, UUGgcuccaa, AAGgugaugc, AAGguugguc, AGCguaaguu, AUUguaugua, UCAguuaagu, CAAguacgug, CAGgugcgug, CAGguaggua, AUGguggggu, AUGgugaguu, CAGguaauca, AAGguagggu, CAGgccaagg, GUGgugagag, AAGguuggug, CAGguacucu, UAGgcaugug, UUGguaccuu, CUGgugugcc, ACAguugcca, UUGguaauau, GAGgugcaug, UUGguuugua, UUGguaagug, UGUgugugug, GUGguuugua, GCGguacaca, AGAguaugcu, UUUguaagua, UCUgugcggg, AAGgucagug, GAGguaggaa, GCGguuagca, AGGgugaggg, GAAgugagua, CAGgugacag, AAGgugauua, GAGgccagcc, GAGgucuccu, UAGguauuac, CAUguaagag, CUGguagggc, GAAguaagua, CGGguaagug, CAGguaaucu, GUGguaggua, CAGgugggua, AAGgccagug, AAAgugaauc, ACGguuacgu, AUGguaggaa, CGGgugagac, GAGguuggaa, UGGgugagcc, CCAgugagua, CUAguacgag, CAGguaugac, GCUgugaggu, CUGguaugaa, GGUguacgac, CUUgugagug, GUGgugagca, CUGguaacuu, CAGguacuau, AGGguaaggg, UUGguuaguu, GGUguaagca, UCGgugagga, UGGguaaaca, UCGguacgug, UAGguagcag, CUGguaaggc, GUGguaagga, UAAguaagca, GAGguuccaa, CUGguaugga, GGGgugggua, CAGguuuccc, CAGgucucug, GAGgugagga, CUUguggguu, AUGgugagac, CAGgugaagg, GCGguagggg, GUUguuuccc, AAAgcaucca, GUGguagguu, AAGgugugaa, CAGguacagu, AAGguaccaa, UUGguaauug, AAGgugcuca, AAGguucaac, CAGguuuaca, GCUguaagug, AGGguauguc, GAGgucgggg, AAGgugccug, AAGguaaaaa, GUGgugaguu, UAGguaagaa, AGGguauccu, GUGguaauau, UCUguaagua, UGGguaugga, AUGguaugga, GACgugagcc, CUGguuuggc, AUGguauauc, AAAguaaacu, AGCgugagug, CUGguauaga, CAGgugggga, AGAguauguu, UAGguacuug, GCAguaggug, AGUguauguc, AAGguuaagc, CUGguggccu, GAAgugaguc, UUGguguaag, CAGguaagaa, CGGgucucgg, GAGgugcaca, CUCguuaguu, AAGgugauca, UAUguaagaa, GAGgugcuug, CAGgugguca, ACGguaaguc, ACAguaaugu, CCUguaaggu, GAGguuaagu, UCGguaugug, UGGguauguu, AAGguauuac, CAGgugaggg, UUGguaaaca, AAGguagugu, GAGguguggc, CAGguacgga, AAGgucauca, CAAguaggca, CAGgugaaac, CAGguacugc, AAUgcaagug, CAUguaauuc, AAGguaugcu, CUGgugaguu, CAGgugguuu, UGUgugagua, AAGgucggug, AUGguaaauu, AGGguauuac, AGUguaugga, AACguaagau, GUGguaaggu, ACUguuagua, CAGguaucag, AAGguuaguu, CUGgugagcu, UUGgugagcu, UGUguacgua, GAGgucagcc, GAGguagaau, AAGguaugag, UAGguauuuc, UGUguaacac, AGUguaaggc, GAGgucugcu, AAGguuagca, CAGguaaaug, AACguaagcu, CAGgucugca, CAGguauugu, GUGguaauuc, GAGguauaug, GCCgugagcc, GAGguaagag, UGAguaugua, CAGguaaggg, GAGguaaauu, CAGgcaacuu, UGUguaaguc, CAGgugcgcu, CGGguaaacc, CCGgucaguc, UAGgugggcg, GCGgucaguu, GGGguggguc, AGCguaauag, ACGgugaguc, CUGguacuug, CAGguuggua, AGAguaugug, CUGgugggua, GAGguggcuu, AUAguauuga, UGAgucgucc, CAGgugcucu, UACguaauau, GCUguccuga, CAGgcugcac, CUGgugcgcu, GCGguaagaa, UAAguuacuu, GAAgugagug, UAGgcaaguc, UAAguaaaua, ACGgugagug, CAGguagguu, GGGguauaac, GUUgugaguu, CAUgugagua, GAGgugcauu, AAGguuugua, UCGguaaugu, CGAguaaggg, GAGgcacgga, AGGgugugga, CAGguauggu, AAGguagaaa, CAGgugccug, UGGguauaug, UGAgugagac, UGGguaauuu, AUGguaaaua, AAGgcaaagg, AGUguuuguu, AUGguauugg, CUGgugaggc, UUGguaaaau, ACAgugaguu, CAGgugcugu, GAGguuaaga, AGAguaagaa, GAGguccgcg, GUGgugagga, CAGgugagcc, CAGgugacau, AUGgcaagcu, UCGguaauau, CAGgcaacaa, GGGguaggga, CUGgucucgc, UAGguaacga, CGGguaaggu, UAGguaaugc, CAGgcaagaa, ACAguaggua, CAAguaugag, GCUguucgaa, AAGguuaugc, GAUgugaguu, CAGguggaga, AGAguuaguu, UGAgugugcg, GAGguacagc, CAGguaagac, CAUgugcuuu, AGGguguguu, ACAguuaagg, ACAgugaggg, GAUguauacc, UUAguaagcu, CAGguaagau, AGAgcugcgu, GAGgcaaguu, GAAguaagug, AAGgugaaaa, AAGguaccua, GAGguaucag, AUGguaugua, AAGguaugaa, UUGgugagcc, AAGguuagga, AGGguaugua, CAGguaccga, AGAguaaacu, AAGgugcaua, AAGguaaugu, CCGgugugug, AGGguaaauu, GGGguuuggc, CAGguacacg, UUGguaacca, GAGgucaggu, UCUguuggua, CAGguuaguu, UUGguauguc, AAGgugcguc, AGGguaagaa, UUUguaagcc, AAGgucaggu, CUGguaaacu, UCGguaauuu, CUGguaggcu, GAGgucugua, GAGguacuuu, CUGguaaagg, CGGgugugug, CAGguguggu, UCGguacguc, CAGgugccag, GGGgugagaa, ACAgcuagua, AAGguauagc, CUGguaggag, GCUguacgua, AAGguaaagg, CAAgcacgag, CUAguaagac, CCCguaagcg, CAAgugugag, AUGguaaggg, AAGgugaggg, CAAguaggua, GGUguugcug, GAGguacugu, UAGguaagau, CAGgugcgaa, GAGguccagg, UUGguauaca, GGAgugagua, GAGgugagau, AAGguggggc, CAGguaaacg, UCGguaacuu, CAGguaaauu, GAGgugcgca, ACUgugagua, ACGgugugac, GUGguaaguc, CAGguaggca, CAGgucagca, GUGguaugug, AAAguaucug, CGGguaugua, AAGguaauaa, GAGgugggga, GCUguaggug, GAAgugaguu, AAAguauuua, UAUguaagua, ACGguaugag, CUGgugagug, AGAguaaaau, GCUguauggc, AUGguaaacc, GCAguaauaa, UAAguauuua, AAUgucagug, AUUgcaggag, CCGguaagaa, AAGgcaaguu, GAGguuuguc, AAGguaacug, AAAguaugag, GAUguuagua, CAGguggguc, AAGguaccga, CCAguaauua, GUGguaugcg, AUGgugcgcu, CAGgucuaug, AAGguauuua, CUAguaagau, AGAguaauuu, GAGguaacgu, AAGguagcca, CUGgucccgg, GAGguccuuc, ACGgucaccc, AAGguaauac, CAGgugcaug, AUGguaauag, UUUguaacac, UGGguaugau, CAGgcccccc, AGAguaguaa, AGUguaagaa, GAAguauguu, CAGgugugca, UUGgugaggg, UGGguugguu, CAGguacgua, GAGgugcggc, UCUguacggg, CGGgugcgug, UACguaagug, CAUguaagga, CAGgugacgg, GAUguaugcu, UCUgcaauuc, UGAguaaggc, GAGguauauu, AGAgugaguu, AAGguaagcu, UAGgugaagu, CAGguuagua, UAUguaagug, UUGguggggg, UGAgcucaaa, UCGguaugua, UAAguaugcc, AAUguaagua, CAGguuugca, ACGgugagag, CAGguguuuu, GUGgugagcc, AGGguacaua, UAGguaaccc, GUGgucagua, CUGgugagcc, CAGgugcuua, AUAgucguga, AUAgugagug, GAGgucaaaa, CGUguagcuu, CAGguguuug, CAGguuggac, CAGguaagcu, AGGgucagaa, CACguauguc, CACgugagug, GGGguacgga, AAGgcaggac, GAGgugaagc, GAGguuugaa, CAGguaagug, CAGguaacca, CAGguacucc, AAGgugcuuu, GAGguaaaua, GAGgcaggug, GAGguucgga, CAGguauuug, CAGguaaaua, CAGgugaugu, CAGgugauac, GAGgugaggc, AGGguggggg, UAAguaaguu, UGGgugaaca, UAGguacugc, CAGgcuccug, AGGguaggca, CAGgugcccg, GAGguacauc, AGGgugugug, AAGguaguaa, UGGguaugag, GGGgugugug, CUAguaggug, GAGgcaagga, AAGgcaagac, AAAgugcggu, AAGguugguu, GAGguuaaug, UUGgugaguc, UCGguuagcu, GCAguaagca, AAGgcaagca, ACAguaagcu, GAGguaacag, AAAguacgua, GAGguaauac, UUGguaggug, CUGguuaguc, GAGgugacgc, ACAguaagga, AAUguacuua, GGGguacagu, CGUguaugug, UCCguagguu, GAGguggucg, UCAgugaguc, AAAguaagca, GAGgucuggu, GAGguaauua, GUAguaagua, AAGgugggga, UCUgugagca, GAAguucgug, ACGgugaggc, UCAgugagua, UAGguaguug, GGUgucuggg, GGGguaagug, GAGguggguu, UGUgugaguu, CAUguaagua, AAGguaggug, AAUguaggag, GAGgcacguc, CAAguacauu, UUGguacaga, GAGguaguag, AAAgugaggg, UUGgucagug, AGGgugaguc, CAGgugaaca, GGUgugggcc, CGGgugagcu, GGGgugaguc, ACAgugagag, AGGgugaggu, GCUguaaguc, AUAguagguu, CAGgcaugug, AAGguaaguu, CAGguccgug, GAGgcaggua, AUGguggaag, AUGgugggcg, GAGgugagaa, AGUgugagca, UUGguaagua, CAAguaagca, GGUgugagcu, CCCgugggua, CAGguagaau, CAGgcugagc, CUGguggccc, UGAguaagag, CACguuagcu, AAGgugaguc, AAGguagcuc, UCGgugaguu, GAGgcccuuc, CAGguuaugc, CCUguaagcu, CAGgucuccu, UAGguaggcu, GGGguagggg, AAGguaguga, GAGguuguug, CAGguugguu, AAAguaagcc, ACAgugagug, UGGgugugau, CCCguaacua, AAGguguugc, AAAgcuggug, GAGguauagu, ACGguaagag, AUGguacggu, GAGgccaguu, GAGguaugcg, UCGgugggag, AAGguggaua, CCAguguggc, AGGguaagug, UCUguagguc, CAGgcaagga, CGGguaauuu, AUUgugaguc, CAGguaaacc, AAGgucaauu, AAGgugaaua, GUCguaagaa, GCGguaaguc, CUGguagagc, GAGgucgguc, CAGguaaaca, AAGgcaagga, CAGgucgucu, GGGguagggc, CUGguacuaa, GAGguagcug, CUUgucagcu, UAGguaaggc, CUGguauuac, UAAguacguc, AAGguaagcc, ACGgugaaag, CCAgccaaua, CAGguuuguc, AAGguauaau, AAGgucuuag, AGGgugagcu, AAGguuaggg, CGGguaaauu, CAGguaacgg, AGAgugugua, ACAguaaguu, GAUguaauuu, GAGguaggga, UUGgcaagug, AAAgugagga, AAGguagugc, AGAguaauuc, GGAguaaaua, GUGguaccca, CAGguauugc, GAUgugaggg, CAAguaaauc, CAGgugucuc, AAGguaacag, UUGguaaaag, CAGguaucau, ACGgugagac, CUGguaugac, CAGguucacu, GAGgugauca, AGUguaaguc, AACguaagua, AAAgugagug, GAGguacagg, CAAguaauga, GAUguaagga, UCAguucccc, GCGguaagga, UAGguacuaa, AAGgugaaag, ACUguaagug, UGGguaugug, AUGguaacag, CAGguagggu, ACAguaagug, AAGgugcucc, AAGgugugcu, AAGgugguga, ACGgugcgcc, AAGguauugc, GGGguaugug, CAGgugggcu, GAGguauguu, AACgugaaua, CAGguaaugg, UAGguaugau, CAGgcaggug, GGGguugguc, AAGguauggg, UAAgugaggc, CAAgugaucg, AAAguacggg, AGAgcuacag, GAGgugggaa, CAGguacuuu, GAGgugagag, CAGguagguc, UGGguacagc, AAGgugucag, AAGgcaagaa, GAGguaaaca, AAGguaaagu, AAGguaguca, CUGguauguc, GAGguauggg, AAGguauugu, CUGguacuga, GAGguaagcu, UGGgugggua, CAGguucgug, AAGguauggu, CAGgugagca, UGGguaaauu, UGUguaggug, UGUgugagcc, CUGguaauau, AAAguauguu, UGUguaagaa, CUAgugagaa, AGGguagguc, AAGgugggug, UCGguaagug, AGUguaaaua, GAUguaagug, AAGguuagug, UAGguaagca, CAAgugagaa, AGUguaagua, CAGgugaauc, UGGgugagac, AAGguagggc, CUGguuugug, GCGguagggc, GAGguaaucc, AUUguaauaa, CUGgugaaua, AAGguuuaaa, CCUguacugu, GCGgugagcg, AAGguaaucc, UAUgugagua, CCCgugagug, CAGgugcaga, CAGgucaguu, CAGguaggcu, AAAguaagug, UAGguugguc, CAGguugccu, AAGguaugga, GGUguggacg, AAAgugagaa, AGGgugagag, GAUguggcau, UCGguaaggu, GAGgugcguc, CGGgugaguc, AAGguacggg, GAGguucuug, AAGgugcuug, UAGguaugua, AUGgucagca, CGGguacuca, AGGgugagga, AUCgugagua, UCAguaagua, UAGguaaaua, AAGguaauug, GAAgucagug, CAGguacaaa, AAAguuaauc, AGCgugagcg, CCGgcuggug, AGUguaauuu, UGAgccacuc, GGGgucugua, AUGgcauguc, CGGguaaaga, AGGguagcau, CGGguaggag, GAGguucgug, UAAguuauuc, UAUguaagau, AAGguaguuu, CAGgugguau, GUGguaauga, AAGgugauuu, CAGgugaagu, GUAguaauua, AUGguuggug, CCAguaagug, UAGgugagag, AUGgugaggc, AAAguuagug, AAGgugccuu, UAGguaugag, CAGgugugac, CUGguggguu, AUGguaagga, UCUguaagaa, UCCgugaguu, AAAgcaggua, UAUgugagug, CAGguggagg, CAGguuagac, AUAguaagac, AAGguguugu, GAGgucugug, AAGguaagau, CAUguaaguu, CUGguaauua, CAGguaggcg, AGAguaaguc, UGGgugagga, AAUguaggua, UAGguuagca, GGGguaggua, GAGguauugc, AUUguacaca, GAAguaggua, GGAguaagcu, UAGguaugug, GAGgugaaua, GAGgugggau, AAGguaaucu, GGUgugaguu, AACgugaguu, GAGguaaccg, UAGguaagga, AUUguaagaa, UGGgugagca, AAGguaaggc, CCAguaucgu, CCGgugggug, GAGguagugu, ACGgugggaa, GAGgugaccu, CACguaugua, AGGgugggga, AAUguaaguc, AAAguuaagu, CAUgugagug, AGAguauguc, GCGguaugac, CGGgugaguu, CCGguauuuu, GAGguagaac, UAGguaugaa, CAGgcgcgug, CAAguaaguc, AGUguaagau, AAGguucuac, CCAguaagua, GAGguagcag, CAGgucuguu, CAGguacaau, CCGguaaaga, UAAgugcugu, AGGgugagaa, CUCguaaggu, CAGgucagcu, CAGguaaggc, AGGgugcagg, GAGgugaaac, AGGguaagua, AAUguaugcc, AAGguaagca, ACGguacggu, AAGguaauga, UCUgcucaau, ACGguaaugu, AAGguaguug, ACGguaagug, CAGgugauga, GAGguaacac, GAGguaggua, CAGguaccuu, CAGguaauaa, UUGgugggug, CUGguaauga, UAGguaaguc, AGGgugugac, GAGgcaauaa, GUGguaaagc, CUGgugggcg, GAUguauguu, AGGgugagac, UCGgucagca, AUGgugauua, CGAgugugua, CAGguuggug, AGCgcaagua, UGGguacguu, GAGguauuug, AGUguacaua, AUGguaagua, ACAguagguu, AAGgugagag, UUGgugaagu, AAAguaugua, UGGguaagga, UAGgugccuu, and CCUgugggug.
Additional exemplary gene sequences and splice site sequences (e.g., 5′ splice site sequences) include UCCguaaguu, GUGguaaacg, CGGgugcggu, CAUguacuuc, AGAguaaagg, CGCgugagua, AGAgugggca, AGAguaagcc, AGAguaaaca, GUGguuauga, AGGguaauaa, UGAguaagac, AGAguuuguu, CGGgucugca, CAGguaaguc, AAGguagaau, CAGgucccuc, AGAguaaugg, GAGgucuaag, AGAguagagu, AUGgucagua, GAGgccuggg, AAGguguggc, AGAgugaucu, AAGguaucca, UUCguaagua, UAAgugggug, GCCgugaacg, GAGguugugg, UAUguaugca, UGUguaacaa, AGGguauuag, UGAguauauc, AGAguuugug, GAGgucgcug, GAGgucaucg, ACGguaaagc, UGAguacuug, CGAgucgccg, CUGguacguc, AGGguauugc, GAAgugaaug, CAGaugaguc, UGGguauugg, UGAguaaaga, GUGguuccug, UGAgcaagua, UAUguaagag, AAGgucuugc, AAAgcaugug, AGAguacagu, GUGguaaucc, CAGguagagg, AAGguacaac, UGGgcagcau, CCGgucauca, CCGguuugua, UGAguaaggg, GAAguaugua, GGGguagcuc, GCUguacaua, CUGgucucuu, GUGguaaaug, AUCguaagug, GAGgcaugua, AAGgucuccc, UGGgugcguu, UGUguagguu, GAAgugagca, GGUguaauuu, CUGgugaaau, AUCguaaguc, AGAguaaucc, GGAguagguc, GAGguaccaa, CUUguaggug, AAGguauaag, AGAguuggua, AUGguuugug, UGGgucagau, AGAguaggac, AGAguagugu, AGAguaggag, CAGgucucua, AAGguggaug, UGGguaucaa, GAUguaugga, AAGguguuuc, GCAguguaaa, UUAguaugua, UCUguaugca, AAUguaaaau, AGAguaaauu, GGGguacuuu, GAAguuugau, AAAguagauu, UGUguagagu, UGGguaagcg, CGGguucagg, AGGguacgac, UCGguaagaa, AGGguuggca, AAAguacagu, UAAguuaagg, AUGguaaugu, GUGguuuuac, AGAguaacaa, AAGguagccc, GCGgugaggc, AUGguucagc, AAGguacuua, AAGguccgug, UAGguaagcg, AUGguaccuu, GCCguggugg, CUGgugcguc, CAGguggaaa, AAAgucugua, GAGguaaccc, AGAguauggg, UAUgccccug, AAGgugccag, ACGgugcggc, AGGguacuga, AGAguaagcg, CUGgcaaggg, CCAgugugug, GAGguagacg, CGGgugcggg, GAUguaagcu, AUUguauuua, UGCgugagug, CUGgucuaua, GAGgugcuag, GAGgugccau, CAGguacguc, GAGguucagc, AACguaagaa, AGAguaguac, AAGguaacgg, UAGgugugac, CCGguaauag, CAGguaccag, UUUguaauug, AAUguacgaa, CAGguaauga, AUCgucaagg, CUGguagaug, GGGgugcagu, AGUgugagaa, GGGguuuuau, CCUguccccu, AUUgugaagu, AAGguaaacg, UACgucgugg, AAGgugccau, GGGgucccag, UAUguauggu, CGGguaauua, CGGguacucc, CAGgugacuu, AGUguggguu, AGAguauggc, AAGgccaaca, AAAgcaagua, UCAguagguc, GUGguggcgg, CAUguauccu, UCGgugagcc, AUAguugggu, AAUguuagcu, AUGgugaaug, CGGguaaugu, UCUguaggug, CCGgugaggc, UGAguccacu, CUAguaagag, CGGguggggc, CGAguaagca, UGUgccaauu, UCGguaagcc, UAUguaggug, UUGgugggcc, GAGgcugggc, AGAguaacuu, ACGguagguc, CAGgcccaga, CCGguggguu, AAGgugacgg, GGGguacagc, CAUguaaguc, AUUgugagaa, UGUguaagga, UUUguaagau, AGGgucauuu, UGGguuuguu, CGAguaagcc, GUGgugugua, AUGguauaac, UGGguacgua, AAAguagagu, UCGguaacug, AGAguaauga, AUGguggguc, AGAguaauau, CAGguacugg, UAAgucaguu, GCGguagaga, AAGgugaugg, ACAguauguu, GAUguacguc, UAGguuucuc, GAGgcauggg, AUAgcuaagu, GUAgucugua, AAGgugaacg, GUGguggucg, GAGguugauc, UGAguggguu, ACUguacgug, CUGgugacug, CAAguuaagc, GAGguaccca, AACguaacuu, CAGguuacua, AGAguuaguc, UGGgcacguc, AGUguauggu, AAGguugcaa, CAGguuguua, AAGgcauccc, GAUguaaggc, AGGguacggg, GAGgucaaag, CAAgugagcg, AGAguaaucu, UCGguagcug, AAAguaguag, CAGguucguc, CGUguaugaa, AGUguaaaaa, AAGgucucac, UAGguggagc, UGAguaggug, AGAguaugcc, GAGguugcau, CAAguaagag, UCUgugugcc, GAGgugaugc, GGGgugauaa, CCCgugagcc, AGAguaacug, GCGguaagua, AGAguacauc, UCGgucuggg, UAAguaucuc, GGCguagguu, AGAguacgcc, GAUgucuucu, AGGgcaaggu, CGAguaugau, AUGguagagu, CAAguacgag, UCGguaugau, CCGguguguu, AGGgucugug, GGAguaggcu, AAGgucuaug, GCAgugcgug, UGGgugagaa, AGGguaaagu, GAGguaggac, CUAguaagca, UUAguaggcu, CUGgugggau, CUGguuagua, AAGguacgug, CGGgugagau, AAGgugcaug, AAUgugggcu, CAGguugacu, CAGguuacag, GCGguaacau, AUUgucaguc, CAAguauaca, GAUguccgcc, AAGgugcgga, AACguaagag, UGGguuggua, CAAguguaag, GUGguaacgu, CUGgugauca, AGGguggggc, UCGguaaaga, CAGguacacc, CGGguaaggg, CAAguuugcu, ACAgugcgug, UUGguauggg, GAGgcucauc, CUGguaauag, AUGguggaua, UCAgugaauu, AAUguaauua, GCAgucuaaa, AAGguauucu, GAGgucauca, UGGguccaug, AGAguuugua, AGGguagacu, AAGguaggac, UGUguguuga, UCAguacgug, AUGgucucuc, UGAguuagua, UGAguaaagu, GAGgugaccg, GAGguauauc, CAGgugccau, AGAgugguga, GUUguaagaa, AGAguaaaua, AGGgugaagg, CUGguagauu, GAGguucagg, AGGgucuuca, CUGguaaccu, ACAguacuga, AGAguggguc, AUGguaugag, AAGguuauau, AGAguauagu, AAAguaugaa, UAGguggcua, ACCguauggg, AAAguauaau, UUUguauggc, GGGgucgcgu, GUGgugguuu, CAGguuugac, GGAguaggcg, GAGguacccu, AUGgugugca, GUGguuggug, AAAguaugcu, UAAguuacau, ACAguaugag, GGAguauguu, UUUgugagaa, AAUgugcguu, CAGguagagu, AUGguguuaa, CAUgugcguc, AUAguuggau, GAGguacgua, GUUgugagaa, CAAguacauc, GAGguaguuu, ACUguacaga, CCGguuguga, UGGgucagug, GUAguaagaa, GACguacuuu, AGAgucaguc, UAGguuaguu, AGGgcagcag, AAGguccuac, AAUguaauug, CAGgugcggg, CUGguaaugg, CAAguagccc, GAAgucaguu, ACAguaauug, UUAguuagua, CCUguauuuu, AUCguaagaa, CCAgugagca, GAAguaaggc, UGAgugggua, UCAgugguag, UCUguacagg, CGAgugagug, UCCguaugug, CAUgccguuu, AAAgugacuu, AGAguaggca, GAAguaagag, CAGgcagguu, UUGguagagc, AAGguggaaa, GAGgcagguc, AUGguacgac, AGGguaggaa, AGGguaggua, UUGguaaggu, AUGguacaga, CAGguagagc, UAGguaaggu, GGGguuagag, AAGguaucaa, GAGguagccc, CAGgugccuc, GCAguaagag, ACGguagagu, UGGguaaugg, CUGgucaguu, GUGguacauu, AAAguagguu, AAGgccaaga, CGGgugggca, ACGguccggg, CGAguaugag, CUGguaugcc, GAGguggaug, CAGgccuuuc, AAAguacauc, AAAguaauca, GAGguaacug, CUGguaaaga, CGUguaagca, UGGgcaagua, GCGguggcga, GAGguggccg, AUUgcaugca, ACGgugacug, CAGgucagau, AGAguaacuc, UGAguaacag, AAGguacccg, AGGguaggcu, GGGgcaggac, CCUguaagug, AUUguaagug, ACUguacgag, GUAguagugu, AGAguaugag, UCAguguggg, UGGguauaua, UAGguagcua, GGGguaaaga, AGGguuacuu, CAUguaaaug, GGAguaguaa, CAGgucaauc, CGGguuagug, UAGguacaug, UAGguuaaga, UGGguaccuu, CGGguggaca, CAGgucuuac, AAGguggagc, AUGguaacca, UCGguaaguu, UAUguacaaa, AAUguagauu, GUAgcuagua, AAGguauugg, GAGgucuuug, GAAguucagg, UGGguaucac, AGAguacugg, CAGguuaaug, AGGguacgug, AGGgcacagg, CUGguuaguu, UUGguacgag, ACGgugauca, CCUgugagag, GAGgugaagu, AAGguacauc, UCUguaugug, UUGguggaag, UGGgcagguu, GAAguggagc, ACAguaagac, CGGguaccaa, CAAguacguc, AGAgugaggg, CGGguaagaa, AAUguaggug, AUCgugugcu, UAGgucaugg, CAGguuuuga, AAGgcaugca, GAGgugcugc, AAGguuaaua, CAGguucauc, GCGguaggug, GACgugagua, CAGgucuacu, UUGguaugag, AGCgugggca, AUGguaaggu, AUGguaccuc, UUGguauggu, UAUguaugaa, UGGguauggg, GAUguaaaua, CCGguaaguu, GAGgucugaa, GAGgugcgag, CUGgucagcc, CAGguuuugu, CGGguggugu, UAAguuagua, UUUgugugug, CAGguuaacc, UUGguacuuu, GCUguaaggc, AGGguggcug, GAUguaaaaa, AAGgucaaaa, CAGguagcgc, CAGguuuggc, GAGgugguuu, CGGguaaaua, CUGguucggu, GGAgugagcc, AAGgugcgcg, GAAguacauc, AGUgucugua, CCCgugagcu, GAGguucaca, CUAgugggua, GAGguaacua, UCGguauguc, UAAguauuug, CAGguaagcg, GAGgugguaa, CGAguaagag, CCGguaagcu, GAGgucuugu, AAGguggguc, CACguaagug, AGUguaauga, AAAgugugua, GGAgugccaa, CACgugaguu, AAGguuggau, UAUguaaaua, CUGguaggaa, UAUguaaacu, AAUguauuuu, CUGgcaagug, UGUgugguau, UAUguauguu, UUGgugacuc, GGAguaaggu, AAGguagaug, UGGguagggu, AAUguaauuc, GUGguauggc, GGAguggguu, AGGguaccac, UAGgugacag, ACAguaggca, AUGguuugaa, GCAguaacua, CCGguaggua, AGAguaggcc, AAGguugaca, CUGgugugua, GAAgucuguc, UGGgcucgga, CAGguagccu, AGAguaggua, UAAguauguc, CUGguauauc, GAGguguguu, AUGgugcaug, AAGguacgcc, UGAguaacua, GAGgugacag, GUUguccugu, UUGgugucuu, AAUgugaagg, UUGguggaua, UAGguguguu, CUGgcaaguu, GCAguaagau, GCGguggaaa, UGCguccagc, AAAguggagu, CGUgugagcc, AGAguacugu, CAGguauagc, UACguaagga, AAGgucuuua, AAGguggucu, GGGguaaauu, UCAgugagga, AGAguacguu, GAGgucguca, UAGguuugau, CAUguaaacc, AAGguggcac, CAGguagaug, AACguaaaag, UAGgucucug, AUAguaggug, UAGgcaagag, UAGgcacggc, AAGgucuuca, CCAguaugcu, CAAgugaguu, CAGgucucaa, CAGguuacau, GGAgugagca, AGAguacgca, CUGguguugg, AAGguacuca, CUAguaaggg, AGAguaaaag, AAGguaacga, CUGguccccg, UAAguauggg, GAGgucgagc, UUGguauaua, AAAgucaagg, AAGgucuagg, CGAguagguc, AGGguucguu, GAGgcaggcc, CUAguauuac, ACGguaugug, UAGgugguuc, AGAguauaac, UUGgugcguc, ACCguuaucu, CCAgugauga, GAAguaugca, GAAguauggc, CCGguaggac, AAUguaagca, AGAguaauug, AGGguugguu, GUGguaggag, AAGgcaguuu, CAAguaagcc, CUGgcaagua, CAGgcaugau, AGGguaauug, GGGguaaccu, AAAguaacua, UAGgucugcc, ACGguaugaa, AGUguauggg, UGGguuggca, UAGguaaacu, AGAgugggua, AGAguauuug, AGUguaggaa, CUUguacgua, GAUgugagau, CAGgcagcca, AAGgucacug, AAGgucugac, UAGguuccuu, CUGgugcuuu, UGAguuggug, UUGgugggau, UGAguagggu, UCGgugaggu, AAAguaaaga, AAGgcaaguc, CGGguaaagc, AAAguuaguu, UUAguaagca, GAGgucacau, UAAgugguau, UAGgugcuuu, GGAguaggca, UGAguaagga, CAGguggagc, GAUguagaag, AAUgccugcc, AUGguaaggc, UGGguaauau, CUGguaccuc, CACgugagcc, UGAguuugug, CCGguagugu, AAAgugacaa, GAAguggguu, CAGgugcagc, GAGgugggcc, UAUgugcguc, GGGguacugg, CUGguagguu, UUGgcauguu, AAUguaauac, UAGgccggug, AGAgucagua, UAAguaaauc, CAGguuccuc, UAGguacgau, AGAguuagug, GCAguaagug, AGGgugguag, GGAguaaugu, GAUguaaguc, CCAguuucgu, AAGguucggg, AUGguggagu, AAGguaccgg, GAAgugcgaa, UGGgucaguu, AAGguguaga, UGGguaggcc, CCAgugaguc, AAGgucacuu, AGCgugaggc, UCCgugguaa, AGAguacuua, GGGgucagau, AAGguggacc, AGAgugagcg, AGAgucagau, UAAguauuac, AGAguauuuc, AGAguucagc, AUGgugaagu, UAGgugaucc, GGAguaagau, UAGguaccaa, AGAguugguc, GAAgugagac, AUCguagguu, GAGguacgcu, ACGguaaggg, CAGgcauguc, UUAguaagau, UGAguagguu, AGGguacgaa, ACGguauguu, AGGguacugu, UUGguaugga, UAAguaacug, GCGgucagcc, UUUgugaguc, GUGgucagug, CUGgucugua, GAGguucuua, AUGguacuga, AAUgugcuuu, AGGguggcgu, CCGgcaggaa, CAUguggguc, UUGguuuguu, CAGguucugu, ACGguaagcg, CUGgucagua, UCAguaggcu, UGAguaggac, CAGguuuuaa, GAGguguccc, AGGguggguu, GUGgugagac, CACguaggga, GUGguauuuu, GAGauauccu, AAGgugaaca, UAAguagggc, CUGgugcggg, CUGgucaaua, AGAguaaaaa, AAGgugcagu, CGGguaagca, AAAgugagcc, AUGguaauca, GCAguacgug, AUGguacaug, AAGguuaaga, CGGguaaaug, GAGguucgca, GAGgcucugg, AUGgugggac, AACgugguag, AAGgugauag, GGGguuugca, CAUguaaggg, UCAguugagu, AAAgugcggc, AGAgugagcc, AUGgcaagaa, ACAguaaggu, AAGgucucua, GUGguaaaaa, AAAguaggug, UAGgugcacu, GUCgugguau, CAGguauagg, UGAgugagag, ACUgugagcc, AUCguuaguu, UUUguaccaa, UGGgugagau, AGAgugagaa, AGAguagggg, AGGgcaagua, CGGgucagua, UUGguaugcc, CGGguuagau, GGGgugaagu, CCCgugugaa, GCAguuugga, UGCguaagac, AGAgucugua, CACgugagca, AGGguaaaag, CAGgcugggu, GAAgucuuca, AAGgcaaaaa, GUAguaaaua, CUAgugagag, GAAguuucug, CCUguacgua, GAGgugcgcg, AAGguguaaa, CCAguauguu, CCGgucagcu, AUGguuccug, CAAguuaaau, AGAguaggcu, AUGgugggca, GGAguaagac, AGGgucacga, UAGgugauau, GAAguaaguc, CGGguaagau, CAAguagcua, UGAguaaaau, GUCguacgug, AUGguacgua, CAGgucucgg, GAGgcauguc, AGAgugggau, GUGguuagag, UGGgugguga, AAGguuaaac, CUUguuagcu, AAAguaggaa, UAGguuguau, AGGgugcgcc, AAGgugggcu, UAAguaucug, AAGguaacgu, AUGguggggc, CAAguacacg, GGCguaagug, AUAguaggac, AGAgugaggu, UUUguaaaaa, GAAguuugua, CUAguaaucu, AAGguuuuua, GAGgugcguu, UAGgcgagua, ACCgugagua, CAGgucccga, AUGguacugg, UGAguucagu, AAUguguggu, UCCguugguu, CAGgucagag, CAGgucccua, UAGguagacu, CAAguuaagg, GAGgugugcg, GAAgcugccc, CGAguacgug, CGGguaggua, UUGguauuga, AUUguaugau, UUGguaugaa, GAGgugguca, GCUguaugaa, CAGguguugc, CAGguaaaac, AUAguaaggu, CUGguuagag, AGCgugugag, AAGguuaucu, CACgugagua, AGGgucagua, GAGguauaau, CAGguuauuu, AGGguggacu, AUUguaauuc, UUUguggguu, AUGguacgug, AAGguguucc, CAGgugacgc, GAGguacuaa, ACAguucagu, GAGgucacgg, CAAguaaggc, AAGguuuggg, AAAgugggcu, GCGguucuug, GAGguggagc, UGAgucagug, CAGgucaagg, AGUguaagcu, GAGgcagaaa, AAGgucacac, GAAguagguu, GUCguaaguu, AGAguaugca, CCUgugcaaa, ACGgugaaaa, CAGguacgaa, CAUgugagga, AGCgugagua, GGUguguagg, AACgugagcu, GAGgugaacu, AGAguucagu, AACgugugua, CAGguugugg, AAGguacuag, UCAgugaaaa, AAUgucuggu, ACGguaaaau, CUGguguaag, GAGgugcgaa, AGGguuucuc, CAGguagccc, AUUguauugg, AUGguacuua, GAGgcccgac, UCGguaagac, CGGgcuguag, UAUgugugug, UAGguagaaa, GUGgucauua, UAGgugaaag, ACUguaauuc, GCAguacagg, UCGgugaguc, UAUguaggga, AUGguauguc, GUGgugugug, CUGgugaccu, AAUgugaaua, UAGgucucac, GAGguuauug, UGAguaggcu, CGGgcacgua, GCAguaaaua, CCGgugagag, UAAguugguc, CCGgugagcc, AAGguuguca, CUGguauuau, GGGguauggg, AAAgucagua, UUUguaugua, UAAguacugc, CAGguaccaa, GAAguucaga, AUGgugcggu, GUGgugaggu, UGAguaagcc, UAUguaaggg, GUGguggaaa, GAGgugauug, GGAguuugua, AAGgucacga, GUGguagagg, UAAguauauc, AAGgugucca, UAUgugguau, GAGguacaau, AAGguggggg, GGAguaggug, and UAGgugacuu.
In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AGA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AAA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AAC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AAU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AAG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises ACA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AUA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AUU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AUG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AUC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CAA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CAU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CAC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CAG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GAA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GAC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GAU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GAG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GGA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GCA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GGG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GGC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GUU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GGU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GUC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GUA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GUG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UCU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UCC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UCA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UCG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UUU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UUC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UUA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UUG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UGU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UAU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises GGA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CUU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CUC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CUA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CUG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CCU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CCC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CCA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CCG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises ACU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises ACC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises ACG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AGC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AGU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises AGG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CGU. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UAC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UAA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises UAG. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CGC. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CGA. In some embodiments, the splice site sequence (e.g., 5′ splice site sequence) comprises CGG. In some embodiments, the splice site sequence comprises AGAguaaggg.
In an embodiment, a gene sequence or splice site sequence provided herein is related to a proliferative disease, disorder, or condition (e.g., cancer, benign neoplasm, or inflammatory disease). In an embodiment, a gene sequence or splice site sequence provided herein is related to a non-proliferative disease, disorder, or condition. In an embodiment, a gene sequence or splice site sequence provided herein is related to a neurological disease or disorder; autoimmune disease or disorder; immunodeficiency disease or disorder; lysosomal storage disease or disorder; cardiovascular condition, disease or disorder; metabolic disease or disorder; respiratory condition, disease, or disorder; renal disease or disorder; or infectious disease in a subject. In an embodiment, a gene sequence or splice site sequence provided herein is related to a neurological disease or disorder (e.g., Huntington's disease). In an embodiment, a gene sequence or splice site sequence provided herein is related to an immunodeficiency disease or disorder. In an embodiment, a gene sequence or splice site sequence provided herein is related to a lysosomal storage disease or disorder. In an embodiment, a gene sequence or splice site sequence provided herein is related to a cardiovascular condition, disease or disorder. In an embodiment, a gene sequence or splice site sequence provided herein is related to a metabolic disease or disorder. In an embodiment, a gene sequence or splice site sequence provided herein is related to a respiratory condition, disease, or disorder. In an embodiment, a gene sequence or splice site sequence provided herein is related to a renal disease or disorder. In an embodiment, a gene sequence or splice site sequence provided herein is related to an infectious disease.
In an embodiment, a gene sequence or splice site sequence provided herein is related to a mental retardation disorder. In an embodiment, a gene sequence or splice site sequence provided herein is related to a mutation in the SETD5 gene. In an embodiment, a gene sequence or splice site sequence provided herein is related to an immunodeficiency disorder. In an embodiment, a gene sequence and splice site sequence provided herein is related to a mutation in the GATA2 gene.
In some embodiments, a compound of Formula (I) or (II) described herein interacts with (e.g., binds to) a splicing complex component (e.g., a nucleic acid (e.g., an RNA) or a protein). In some embodiments, the splicing complex component is selected from 9G8, A1 hnRNP, A2 hnRNP, ASD-1, ASD-2b, ASF, BRR2, B1 hnRNP, C1 hnRNP, C2 hnRNP, CBP20, CBP80, CELF, F hnRNP, FBP11, Fox-1, Fox-2, G hnRNP, H hnRNP, hnRNP 1, hnRNP 3, hnRNP C, hnRNP G, hnRNP K, hnRNP M, hnRNP U, Hu, HUR, I hnRNP, K hnRNP, KH-type splicing regulatory protein (KSRP), L hnRNP, LUC7L, M hnRNP, mBBP, muscle-blind like (MBNL), NF45, NFAR, Nova-1, Nova-2, nPTB, P54/SFRS11, polypyrimidine tract binding protein (PTB), a PRP protein (e.g., PRP8, PRP6, PRP31, PRP4, PRP3, PRP28, PRP5, PRP2, PRP19), PRP 19 complex proteins, RBM42, R hnRNP, RNPC 1, SAD 1, SAM68, SC35, SF, SF1BBP, SF2, SF3A complex, SF3B complex, SFRS10, an Sm protein (such as B, D1, D2, D3, F, E, G), SNU17, SNU66, SNU114, an SR protein, SRm300, SRp20, SRp3Oc, SRP35C, SRP36, SRP38, SRp40, SRp55, SRp75, SRSF, STAR, GSG, SUP-12, TASR-1, TASR-2, TIA, TIAR, TRA2, TRA2a/b, U hnRNP, U1 snRNP, U11 snRNP, U12 snRNP, U1-70K, U1-A, U1-C, U2 snRNP, U2AF1-R52, U2AF35, U2AF65, U4 snRNP, U5 snRNP, U6 snRNP, Urp, and YB1.
In some embodiments, the splicing complex component comprises RNA (e.g., snRNA). In some embodiments, a compound described herein binds to a splicing complex component comprising snRNA. The snRNA may be selected from, e.g., U1 snRNA, U2 snRNA, U4 snRNA, U5 snRNA, U6 snRNA, U11 snRNA, U12 snRNA, U4atac snRNA, and any combination thereof.
In some embodiments, the splicing complex component comprises a protein, e.g., a protein associated with an snRNA. In some embodiments, the protein comprises SC35, SRp55, SRp40, SRm300, SFRS10, TASR-1, TASR-2, SF2/ASF, 9G8, SRp75, SRp30c, SRp20 and P54/SFRS11. In some embodiments, the splicing complex component comprises a U2 snRNA auxiliary factor (e.g., U2AF65, U2AF35), Urp/U2AF1-RS2, SFIBBP, CBP80, CBP 20, SF1 or PTB/hnRNP1. In some embodiments, the splicing complex component comprises a heterogenous ribonucleoprotein particle (hnRNP), e.g., an hnRNP protein. In some embodiments, the hnRNP protein comprises A1, A2/B1, L, M, K, U, F, H, G, R, I or C1/C2. Human genes encoding hnRNPs include HNRNPA0, HNRNPA1, HNRNPAIL1, HNRNPA1L2, HNRNPA3, HNRNPA2B1, HNRNPAB, HNRNPB1, HNRNPC, HNRNPCL1, HNRNPD, HNRPDL, HNRNPF, HNRNPH1, HNRNPH2, HNRNPH3, HNRNPK, HNRNPL, HNRPLL, HNRNPM, HNRNPR, HNRNPU, HNRNPUL1, HNRNPUL2, HNRNPUL3, and FMR1.
In one aspect, the compounds of Formula (I) or (II) and pharmaceutically acceptable salts, solvates, hydrates, tautomers, stereoisomers, and compositions thereof, may modulate (e.g., increase or decrease) a splicing event of a target nucleic acid sequence (e.g., DNA, RNA, or a pre-mRNA), for example, a nucleic acid encoding a gene described herein, or a nucleic acid encoding a protein described herein, or a nucleic acid comprising a splice site described herein. In an embodiment, the splicing event is an alternative splicing event.
In an embodiment, the compound of Formula (I) or (II) or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, stereoisomer, and compositions thereof increases splicing at splice site on a target nucleic acid (e.g., an RNA, e.g., a pre-mRNA), by about 0.5%, 1%, 2%, 3% 4% 5%, 6% 7% 8%, 9% 10% 15% 20% 25% 30% 35% 40% 45% 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or more, e.g., as determined by a known method in the art, e.g., qPCR. In an embodiment, the compound of Formula (I) or (II) or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, stereoisomer, and compositions thereof decreases splicing at splice site on a target nucleic acid (e.g., an RNA, e.g., a pre-mRNA), by about 0.5%, 1%, 2% 3% 4% 5% 6% 7% 8% 9% 10% 15% 20% 25% 30% 35% 40% 45% 50% 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or more, e.g., as determined by a known method in the art, e.g., qPCR.
In another aspect, the present disclosure features a method of forming a complex comprising a component of a spliceosome (e.g., a major spliceosome component or a minor spliceosome component), a nucleic acid (e.g., a DNA, RNA, e.g., a pre-mRNA), and a compound of Formula (I) or (II) or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, stereoisomer, or composition thereof, comprising contacting the nucleic acid (e.g., a DNA, RNA, e.g., a pre-mRNA) with said compound of Formula (I) or (II). In an embodiment, the component of a spliceosome is selected from the U1, U2, U4, U5, U6, U11, U12, U4atac, U6atac small nuclear ribonucleoproteins (snRNPs), or a related accessory factor. In an embodiment, the component of a spliceosome is recruited to the nucleic acid in the presence of the compound of Formula (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, stereoisomer, or composition thereof.
In another aspect, the present disclosure features a method of altering the structure or conformation of a nucleic acid (e.g., a DNA, RNA, e.g., a pre-mRNA) comprising contacting the nucleic acid with a compound of Formula (I) or (II) or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, stereoisomer, or composition thereof. In an embodiment, the altering comprises forming a bulge or kink in the nucleic acid (e.g., a DNA, RNA, e.g., a pre-mRNA). In an embodiment, the altering comprises stabilizing a bulge or a kink in the nucleic acid (e.g., a DNA, RNA, e.g., a pre-mRNA). In an embodiment, the altering comprises reducing a bulge or a kink in the nucleic acid (e.g., a DNA, RNA, e.g., a pre-mRNA). In an embodiment, the nucleic acid (e.g., a DNA, RNA, e.g., a pre-mRNA) comprises a splice site. In an embodiment, the compound of Formula (I) or (II) interacts with a nucleobase, ribose, or phosphate moiety of a nucleic acid (e.g., a DNA, RNA, e.g., pre-mRNA).
The present disclosure also provides methods for the treatment or prevention of a disease, disorder, or condition. In an embodiment, the disease, disorder or condition is related to (e.g., caused by) a splicing event, such as an unwanted, aberrant, or alternative splicing event. In an embodiment, the disease, disorder or condition comprises a proliferative disease (e.g., cancer, benign neoplasm, or inflammatory disease) or non-proliferative disease. In an embodiment, the disease, disorder, or condition comprises a neurological disease, autoimmune disorder, immunodeficiency disorder, cardiovascular condition, metabolic disorder, lysosomal storage disease, respiratory condition, renal disease, or infectious disease in a subject. In another embodiment, the disease, disorder, or condition comprises a haploinsufficiency disease, an autosomal recessive disease (e.g., with residual function), or a paralogue activation disorder. In another embodiment, the disease, disorder, or condition comprises an autosomal dominant disorder (e.g., with residual function). Such methods comprise the step of administering to the subject in need thereof an effective amount of a compound of Formula (I) or (II), or a pharmaceutically acceptable salt, solvate, hydrate, tautomer, stereoisomer thereof, or a pharmaceutical composition thereof. In certain embodiments, the methods described herein include administering to a subject an effective amount of a compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or a pharmaceutical composition thereof.
In certain embodiments, the subject being treated is a mammal. In certain embodiments, the subject is a human. In certain embodiments, the subject is a domesticated animal, such as a dog, cat, cow, pig, horse, sheep, or goat. In certain embodiments, the subject is a companion animal such as a dog or cat. In certain embodiments, the subject is a livestock animal such as a cow, pig, horse, sheep, or goat. In certain embodiments, the subject is a zoo animal. In another embodiment, the subject is a research animal such as a rodent, dog, or non-human primate. In certain embodiments, the subject is a non-human transgenic animal such as a transgenic mouse or transgenic pig.
A proliferative disease, disorder, or condition may also be associated with inhibition of apoptosis of a cell in a biological sample or subject. All types of biological samples described herein or known in the art are contemplated as being within the scope of the disclosure. The compounds of Formula (I) or (II) and pharmaceutically acceptable salts, solvates, hydrates, tautomers, stereoisomers, and compositions thereof, may induce apoptosis, and therefore, be useful in treating and/or preventing proliferative diseases, disorders, or conditions.
In certain embodiments, the proliferative disease to be treated or prevented using the compounds of Formula (I) or (II) is cancer. As used herein, the term “cancer” refers to a malignant neoplasm (Stedman's Medical Dictionary, 25th ed.; Hensyl ed.; Williams & Wilkins: Philadelphia, 1990). All types of cancers disclosed herein or known in the art are contemplated as being within the scope of the disclosure. Exemplary cancers include, but are not limited to, acoustic neuroma; adenocarcinoma; adrenal gland cancer; anal cancer; angiosarcoma (e.g., lymphangiosarcoma, lymphangioendotheliosarcoma, hemangiosarcoma); appendix cancer; benign monoclonal gammopathy; biliary cancer (e.g., cholangiocarcinoma); bladder cancer; breast cancer (e.g., adenocarcinoma of the breast, papillary carcinoma of the breast, mammary cancer, medullary carcinoma of the breast); brain cancer (e.g., meningioma, glioblastomas, glioma (e.g., astrocytoma, oligodendroglioma), medulloblastoma); bronchus cancer; carcinoid tumor; cervical cancer (e.g., cervical adenocarcinoma); choriocarcinoma; chordoma; craniopharyngioma; colorectal cancer (e.g., colon cancer, rectal cancer, colorectal adenocarcinoma); connective tissue cancer; epithelial carcinoma; ependymoma; endotheliosarcoma (e.g., Kaposi's sarcoma, multiple idiopathic hemorrhagic sarcoma); endometrial cancer (e.g., uterine cancer, uterine sarcoma); esophageal cancer (e.g., adenocarcinoma of the esophagus, Barrett's adenocarcinoma); Ewing's sarcoma; eye cancer (e.g., intraocular melanoma, retinoblastoma); familiar hypereosinophilia; gall bladder cancer; gastric cancer (e.g., stomach adenocarcinoma); gastrointestinal stromal tumor (GIST); germ cell cancer; head and neck cancer (e.g., head and neck squamous cell carcinoma, oral cancer (e.g., oral squamous cell carcinoma), throat cancer (e.g., laryngeal cancer, pharyngeal cancer, nasopharyngeal cancer, oropharyngeal cancer)); hematopoietic cancers (e.g., leukemia such as acute lymphocytic leukemia (ALL) (e.g., B-cell ALL, T-cell ALL), acute myelocytic leukemia (AML) (e.g., B-cell AML, T-cell AML), chronic myelocytic leukemia (CML) (e.g., B-cell CML, T-cell CML), and chronic lymphocytic leukemia (CLL) (e.g., B-cell CLL, T-cell CLL)); lymphoma such as Hodgkin lymphoma (HL) (e.g., B-cell HL, T-cell HL) and non-Hodgkin lymphoma (NHL) (e.g., B-cell NHL such as diffuse large cell lymphoma (DLCL) (e.g., diffuse large B-cell lymphoma), follicular lymphoma, chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL), mantle cell lymphoma (MCL), marginal zone B-cell lymphomas (e.g., mucosa-associated lymphoid tissue (MALT) lymphomas, nodal marginal zone B-cell lymphoma, splenic marginal zone B-cell lymphoma), primary mediastinal B-cell lymphoma, Burkitt lymphoma, lymphoplasmacytic lymphoma (i.e., Waldenstrom's macroglobulinemia), hairy cell leukemia (HCL), immunoblastic large cell lymphoma, precursor B-lymphoblastic lymphoma and primary central nervous system (CNS) lymphoma; and T-cell NHL such as precursor T-lymphoblastic lymphoma/leukemia, peripheral T-cell lymphoma (PTCL) (e.g., cutaneous T-cell lymphoma (CTCL) (e.g., mycosis fungoides, Sezary syndrome), angioimmunoblastic T-cell lymphoma, extranodal natural killer T-cell lymphoma, enteropathy type T-cell lymphoma, subcutaneous panniculitis-like T-cell lymphoma, and anaplastic large cell lymphoma); a mixture of one or more leukemia/lymphoma as described above; and multiple myeloma (MM)), heavy chain disease (e.g., alpha chain disease, gamma chain disease, mu chain disease); hemangioblastoma; hypopharynx cancer; inflammatory myofibroblastic tumors; immunocytic amyloidosis; kidney cancer (e.g., nephroblastoma a.k.a. Wilms' tumor, renal cell carcinoma); liver cancer (e.g., hepatocellular cancer (HCC), malignant hepatoma); lung cancer (e.g., bronchogenic carcinoma, small cell lung cancer (SCLC), non-small cell lung cancer (NSCLC), adenocarcinoma of the lung); leiomyosarcoma (LMS); mastocytosis (e.g., systemic mastocytosis); muscle cancer; myelodysplastic syndrome (MDS); mesothelioma; myeloproliferative disorder (MPD) (e.g., polycythemia vera (PV), essential thrombocytosis (ET), agnogenic myeloid metaplasia (AMM) a.k.a. myelofibrosis (MF), chronic idiopathic myelofibrosis, chronic myelocytic leukemia (CML), chronic neutrophilic leukemia (CNL), hypereosinophilic syndrome (HES)); neuroblastoma; neurofibroma (e.g., neurofibromatosis (NF) type 1 or type 2, schwannomatosis); neuroendocrine cancer (e.g., gastroenteropancreatic neuroendocrine tumor (GEP-NET), carcinoid tumor); osteosarcoma (e.g., bone cancer); ovarian cancer (e.g., cystadenocarcinoma, ovarian embryonal carcinoma, ovarian adenocarcinoma); papillary adenocarcinoma; pancreatic cancer (e.g., pancreatic adenocarcinoma, intraductal papillary mucinous neoplasm (IPMN), Islet cell tumors); penile cancer (e.g., Paget's disease of the penis and scrotum); pinealoma; primitive neuroectodermal tumor (PNT); plasma cell neoplasia; paraneoplastic syndromes; intraepithelial neoplasms; prostate cancer (e.g., prostate adenocarcinoma); rectal cancer; rhabdomyosarcoma; salivary gland cancer; skin cancer (e.g., squamous cell carcinoma (SCC), keratoacanthoma (KA), melanoma, basal cell carcinoma (BCC)); small bowel cancer (e.g., appendix cancer); soft tissue sarcoma (e.g., malignant fibrous histiocytoma (MFH), liposarcoma, malignant peripheral nerve sheath tumor (MPNST), chondrosarcoma, fibrosarcoma, myxosarcoma); sebaceous gland carcinoma; small intestine cancer; sweat gland carcinoma; synovioma; testicular cancer (e.g., seminoma, testicular embryonal carcinoma); thyroid cancer (e.g., papillary carcinoma of the thyroid, papillary thyroid carcinoma (PTC), medullary thyroid cancer); urethral cancer; vaginal cancer; and vulvar cancer (e.g., Paget's disease of the vulva).
In some embodiments, the proliferative disease is associated with a benign neoplasm. For example, a benign neoplasm may include adenoma, fibroma, hemangioma, tuberous sclerosis, and lipoma. All types of benign neoplasms disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In some embodiments, the proliferative disease is associated with angiogenesis. All types of angiogenesis disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In some embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat a non-proliferative disease. Exemplary non-proliferative diseases include a neurological disease, autoimmune disorder, immunodeficiency disorder, lysosomal storage disease, cardiovascular condition, metabolic disorder, respiratory condition, inflammatory disease, renal disease, or infectious disease.
In certain embodiments, the non-proliferative disease is a neurological disease. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat a neurological disease, disorder, or condition. A neurological disease, disorder, or condition may include a neurodegenerative disease, a psychiatric condition, or a musculoskeletal disease. A neurological disease may further include a repeat expansion disease, e.g., which may be characterized by the expansion of a nucleic acid sequence in the genome. For example, a repeat expansion disease includes myotonic dystrophy, amyotrophic lateral sclerosis, Huntington's disease, a trinucleotide repeat disease, or a polyglutamine disorder (e.g., ataxia, fragile X syndrome). In some embodiments, the neurological disease comprises a repeat expansion disease, e.g., Huntington's disease. Additional neurological diseases, disorders, and conditions include Alzheimer's disease, Huntington's chorea, a prion disease (e.g., Creutzfeld-Jacob disease, bovine spongiform encephalopathy, Kuru, or scrapie), a mental retardation disorder (e.g., a disorder caused by a SETD5 gene mutation, e.g., intellectual disability-facial dysmorphism syndrome, autism spectrum disorder), Lewy Body disease, diffuse Lewy body disease (DLBD), dementia, progressive supranuclear palsy (PSP), progressive bulbar palsy (PBP), psuedobulbar palsy, spinal and bulbar muscular atrophy (SBMA), primary lateral sclerosis, Pick's disease, primary progressive aphasia, corticobasal dementia, Parkinson's disease, Down's syndrome, multiple system atrophy, spinal muscular atrophy (SMA), progressive spinobulbar muscular atrophy (e.g., Kennedy disease), post-polio syndrome (PPS), spinocerebellar ataxia, pantothenate kinase-associated neurodegeneration (PANK), spinal degenerative disease/motor neuron degenerative diseases, upper motor neuron disorder, lower motor neuron disorder, Hallervorden-Spatz syndrome, cerebral infarction, cerebral trauma, chronic traumatic encephalopathy, transient ischemic attack, Lytigo-bodig (amyotrophic lateral sclerosis-parkinsonism dementia), Guam-Parkinsonism dementia, hippocampal sclerosis, corticobasal degeneration, Alexander disease, Apler's disease, Krabbe's disease, neuroborreliosis, neurosyphilis, Sandhoff disease, Tay-Sachs disease, Schilder's disease, Batten disease, Cockayne syndrome, Kearns-Sayre syndrome, Gerstmann-Strauss er-Scheinker syndrome and other transmissible spongiform encephalopathies, hereditary spastic paraparesis, Leigh's syndrome, a demyelinating diseases, neuronal ceroid lipofuscinoses, epilepsy, tremors, depression, mania, anxiety and anxiety disorders, sleep disorders (e.g., narcolepsy, fatal familial insomnia), acute brain injuries (e.g., stroke, head injury), autism, Machado-Joseph disease, or a combination thereof. In some embodiments, the neurological disease comprises Friedrich's ataxia or Sturge Weber syndrome. In some embodiments, the neurological disease comprises Huntington's disease. In some embodiments, the neurological disease comprises spinal muscular atrophy (SMA). All types of neurological diseases disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In certain embodiments, the non-proliferative disease is an autoimmune disorder or an immunodeficiency disorder. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat an autoimmune disease, disorder, or condition, or an immunodeficiency disease, disorder, or condition. Exemplary autoimmune and immunodeficiency diseases, disorders, and conditions include arthritis (e.g., rheumatoid arthritis, osteoarthritis, gout), Chagas disease, chronic obstructive pulmonary disease (COPD), dermatomyositis, diabetes mellitus type 1, endometriosis, Goodpasture's syndrome, Graves' disease, Guillain-Barre syndrome (GBS), Hashiomoto's disease, Hidradenitis suppurativa, Kawasaki disease, ankylosing spondylitis, IgA nephropathy, idiopathic thrombocytopenic purpura, inflammatory bowel disease, Crohn's disease, ulcerative colitis, collagenous colitis, lymphocytic colitis, ischemic colitis, diversion colitis, Behcet's syndrome, infective colitis, indeterminate colitisinterstitial cystitis, lupus (e.g., systemic lupus erythematosus, discoid lupus, drug-induced lupus, neonatal lupus), mixed connective tissue disease, morphea, multiple sclerosis, myasthenia gravis, narcolepsy, neuromyotonia, pemphigus vulgaris, pernicious anemia, psoriasis, psoriatic arthritis, polymyositis, primary biliary cirrhosis, relapsing polychondritis, scleroderma, Sjögren's syndrome, Stiff person syndrome, vasculitis, vitiligo, a disorder caused by a GATA2 mutation (e.g., GATA2 deficiency; GATA2 haploinsufficiency; Emberger syndrome; monocytopenia and Mycobacterium avium complex/dendritic cell, monocyte, B and NK lymphocyte deficiency; familial myelodysplastic syndrome; acute myeloid leukemia; chronic myelomonocytic leukemia), neutropenia, aplastic anemia, and Wegener's granulomatosis. In some embodiments, the autoimmune or immunodeficiency disorder comprises chronic mucocutaneous candidiasis. All types of autoimmune disorders and immunodeficiency disorders disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In certain embodiments, the non-proliferative disease is a cardiovascular condition. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat a cardiovascular disease, disorder, or condition. A cardiovascular disease, disorder, or condition may include a condition relating to the heart or vascular system, such as the arteries, veins, or blood. Exemplary cardiovascular diseases, disorders, or conditions include angina, arrhythmias (atrial or ventricular or both), heart failure, arteriosclerosis, atheroma, atherosclerosis, cardiac hypertrophy, cardiac or vascular aneurysm, cardiac myocyte dysfunction, carotid obstructive disease, endothelial damage after PTCA (percutaneous transluminal coronary angioplasty), hypertension including essential hypertension, pulmonary hypertension and secondary hypertension (renovascular hypertension, chronic glomerulonephritis), myocardial infarction, myocardial ischemia, peripheral obstructive arteriopathy of a limb, an organ, or a tissue; peripheral artery occlusive disease (PAOD), reperfusion injury following ischemia of the brain, heart or other organ or tissue, restenosis, stroke, thrombosis, transient ischemic attack (TIA), vascular occlusion, vasculitis, and vasoconstriction. All types of cardiovascular diseases, disorders, or conditions disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In certain embodiments, the non-proliferative disease is a metabolic disorder. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat a metabolic disease, disorder, or condition. A metabolic disease, disorder, or condition may include a disorder or condition that is characterized by abnormal metabolism, such as those disorders relating to the consumption of food and water, digestion, nutrient processing, and waste removal. A metabolic disease, disorder, or condition may include an acid-base imbalance, a mitochondrial disease, a wasting syndrome, a malabsorption disorder, an iron metabolism disorder, a calcium metabolism disorder, a DNA repair deficiency disorder, a glucose metabolism disorder, hyperlactatemia, a disorder of the gut microbiota. Exemplary metabolic conditions include obesity, diabetes (Type I or Type II), insulin resistance, glucose intolerance, lactose intolerance, eczema, hypertension, Hunter syndrome, Krabbe disease, sickle cell anemia, maple syrup urine disease, Pompe disease, and metachromatic leukodystrophy. All types of metabolic diseases, disorders, or conditions disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In certain embodiments, the non-proliferative disease is a respiratory condition. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat a respiratory disease, disorder, or condition. A respiratory disease, disorder, or condition can include a disorder or condition relating to any part of the respiratory system, such as the lungs, alveoli, trachea, bronchi, nasal passages, or nose. Exemplary respiratory diseases, disorders, or conditions include asthma, allergies, bronchitis, allergic rhinitis, chronic obstructive pulmonary disease (COPD), lung cancer, oxygen toxicity, emphysema, chronic bronchitis, and acute respiratory distress syndrome. All types of respiratory diseases, disorders, or conditions disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In certain embodiments, the non-proliferative disease is a renal disease. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat a renal disease, disorder, or condition. A renal disease, disorder, or condition can include a disease, disorder, or condition relating to any part of the waste production, storage, and removal system, including the kidneys, ureter, bladder, urethra, adrenal gland, and pelvis. Exemplary renal diseases include acute kidney failure, amyloidosis, Alport syndrome, adenovirus nephritis, acute lobar nephronia, tubular necrosis, glomerulonephritis, kidney stones, urinary tract infections, chronic kidney disease, polycystic kidney disease, and focal segmental glomerulosclerosis (FSGS). In some embodiments, the renal disease, disorder, or condition comprises HIV-associated nephropathy or hypertensive nephropathy. All types of renal diseases, disorders, or conditions disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In certain embodiments, the non-proliferative disease is an infectious disease. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat an infectious disease, disorder, or condition. An infectious disease may be caused by a pathogen such as a virus or bacteria. Exemplary infectious diseases include human immunodeficiency syndrome (HIV), acquired immunodeficiency syndrome (AIDS), meningitis, African sleeping sickness, actinomycosis, pneumonia, botulism, chlamydia, Chagas disease, Colorado tick fever, cholera, typhus, giardiasis, food poisoning, ebola hemorrhagic fever, diphtheria, Dengue fever, gonorrhea, streptococcal infection (e.g., Group A or Group B), hepatitis A, hepatitis B, hepatitis C, herpes simplex, hookworm infection, influenza, Epstein-Barr infection, Kawasaki disease, kuru, leprosy, leishmaniasis, measles, mumps, norovirus, meningococcal disease, malaria, Lyme disease, listeriosis, rabies, rhinovirus, rubella, tetanus, shingles, scarlet fever, scabies, Zika fever, yellow fever, tuberculosis, toxoplasmosis, or tularemia. In some embodiments, the infectious disease comprises cytomegalovirus. All types of infectious diseases, disorders, or conditions disclosed herein or known in the art are contemplated as being within the scope of the disclosure.
In certain embodiments, the disease, disorder, or condition is a haploinsufficiency disease. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat a haploinsufficiency disease, disorder, or condition. A haploinsufficiency disease, disorder, or condition may refer to a monogenic disease in which an allele of a gene has a loss-of-function lesion, e.g., a total loss of function lesion. In an embodiment, the loss-of-function lesion is present in an autosomal dominant inheritance pattern or is derived from a sporadic event. In an embodiment, the reduction of gene product function due to the altered allele drives the disease phenotype despite the remaining functional allele (i.e. said disease is haploinsufficient with regard to the gene in question). In an embodiment, a compound of Formula (I) or (II) increases expression of the haploinsufficient gene locus. In an embodiment, a compound of Formula (I) or (II) increases one or both alleles at the haploinsufficient gene locus. Exemplary haploinsufficiency diseases, disorders, and conditions include Robinow syndrome, cardiomyopathy, cerebellar ataxia, pheochromocytoma, Charcot-Marie-Tooth disease, neuropathy, Takenouchi-Kosaki syndrome, Coffin-Sins syndrome 2, chromosome 1p35 deletion syndrome, spinocerebellar ataxia 47, deafness, seizures, dystonia 9, GLUT1 deficiency syndrome 1, GLUT1 deficiency syndrome 2, stomatin-deficient cryohydrocytosis, basal cell carcinoma, basal cell nevus syndrome, medulloblastoma, somatic, brain malformations, macular degeneration, cone-rod dystrophy, Dejerine-Sottas disease, hypomyelinating neuropathy, Roussy-Levy syndrome, glaucoma, autoimmune lymphoproliferative syndrome, pituitary hormone deficiency, epileptic encephalopathy, early infantile, popliteal pterygium syndrome, van der Woude syndrome, Loeys-Dietz syndrome, Skraban-Deardorff syndrome, erythrocytosis, megalencephaly-polymicrogyria-polydactyly-hydrocephalus syndrome, mental retardation, CINCA syndrome, familial cold inflammatory syndrome 1, keratoendothelitis fugax hereditaria, Muckle-Wells syndrome, Feingold syndrome 1, Acute myeloid leukemia, Heyn-Sproul-Jackson syndrome, Tatton-Brown-Rahman syndrome, Shashi-Pena syndrome, Spastic paraplegia, autosomal dominant, macrophthalmia, colobomatous, with microcornea, holoprosencephaly, schizencephaly, endometrial cancer, familial, colorectal cancer, hereditary nonpolyposis, intellectual developmental disorder with dysmorphic facies and behavioral abnormalities, ovarian hyperstimulation syndrome, schizophrenia, Dias-Logan syndrome, premature ovarian failure, dystonia, dopa-responsive, due to sepiapterin reductase deficiency, Beck-Fahrner syndrome, chromosome 2p12-p11.2 deletion syndrome, neuronopathy, spastic paraplegia, familial adult myoclonic, colorectal cancer, hypothyroidism, Culler-Jones syndrome, holoprosencephaly, myelokathexis, WHIM syndrome, Mowat-Wilson syndrome, mental retardation, an intellectual developmental disorder, autism spectrum disorder, epilepsy, epileptic encephalopathy, Dravet syndrome, migraines, a mental retardation disorder (e.g., a disorder caused by a SETD5 gene mutation, e.g., intellectual disability-facial dysmorphism syndrome, autism spectrum disorder), a disorder caused by a GATA2 mutation (e.g., GATA2 deficiency; GATA2 haploinsufficiency; Emberger syndrome; monocytopenia and Mycobacterium avium complex/dendritic cell, monocyte, B and NK lymphocyte deficiency; familial myelodysplastic syndrome; acute myeloid leukemia; chronic myelomonocytic leukemia), and febrile seizures.
In certain embodiments, the disease, disorder, or condition is an autosomal recessive disease, e.g., with residual function. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat an autosomal recessive disease, disorder, or condition. An autosomal recessive disease with residual function may refer to a monogenic disease with either homozygous recessive or compound heterozygous heritability. These diseases may also be characterized by insufficient gene product activity (e.g., a level of gene product greater than 0%). In an embodiment, a compound of Formula (I) or (II) may increase the expression of a target (e.g., a gene) related to an autosomal recessive disease with residual function. Exemplary autosomal recessive diseases with residual function include Friedreich's ataxia, Stargardt disease, Usher syndrome, chlorioderma, fragile X syndrome, achromatopsia 3, Hurler syndrome, hemophilia B, alpha-1-antitrypsin deficiency, Gaucher disease, X-linked retinoschisis, Wiskott-Aldrich syndrome, mucopolysaccharidosis (Sanfilippo B), DDC deficiency, epidermolysis bullosa dystrophica, Fabry disease, metachromatic leukodystrophy, and odontochondrodysplasia.
In certain embodiments, the disease, disorder, or condition is an autosomal dominant disease. In certain embodiments, the compound of Formula (I), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat an autosomal dominant disease, disorder, or condition. An autosomal dominant disease may refer to a monogenic disease in which the mutated gene is a dominant gene. These diseases may also be characterized by insufficient gene product activity (e.g., a level of gene product greater than 0%). In an embodiment, a compound of Formula (I) may increase the expression of a target (e.g., a gene) related to an autosomal dominant disease. Exemplary autosomal dominant diseases include Huntington's disease, achondroplasia, antithrombin III deficiency, Gilbert's disease, Ehlers-Danlos syndrome, hereditary hemorrhagic telangiectasia, intestinal polyposis, hereditary elliptosis, hereditary spherocytosis, marble bone disease, Marfan's syndrome, protein C deficiency, Treacher Collins syndrome, Von Willebrand's disease, tuberous sclerosis, osteogenesis imperfecta, polycystic kidney disease, neurofibromatosis, and idiopathic hypoparathyroidism.
In certain embodiments, the disease, disorder, or condition is a paralogue activation disorder. In certain embodiments, the compound of Formula (I) or (II), or a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof, is used to prevent or treat a paralogue activation disease, disorder, or condition. A paralogue activation disorder may comprise a homozygous mutation of genetic locus leading to loss-of-function for the gene product. In these disorders, there may exist a separate genetic locus encoding a protein with overlapping function (e.g. developmental paralogue), which is otherwise not expressed sufficiently to compensate for the mutated gene. In an embodiment, a compound of Formula (I) or (II) activates a gene connected with a paralogue activation disorder (e.g., a paralogue gene).
The cell described herein may be an abnormal cell. The cell may be in vitro or in vivo. In certain embodiments, the cell is a proliferative cell. In certain embodiments, the cell is a cancer cell. In certain embodiments, the cell is a non-proliferative cell. In certain embodiments, the cell is a blood cell. In certain embodiments, the cell is a lymphocyte. In certain embodiments, the cell is a benign neoplastic cell. In certain embodiments, the cell is an endothelial cell. In certain embodiments, the cell is an immune cell. In certain embodiments, the cell is a neuronal cell. In certain embodiments, the cell is a glial cell. In certain embodiments, the cell is a brain cell. In certain embodiments, the cell is a fibroblast. In certain embodiment, the cell is a primary cell, e.g., a cell isolated from a subject (e.g., a human subject).
In certain embodiments, the methods described herein comprise the additional step of administering one or more additional pharmaceutical agents in combination with the compound of Formula (I), a pharmaceutically acceptable salt thereof, or compositions comprising such compound or pharmaceutically acceptable salt thereof. Such additional pharmaceutical agents include, but are not limited to, anti-proliferative agents, anti-cancer agents, anti-diabetic agents, anti-inflammatory agents, immunosuppressant agents, and a pain-relieving agent. The additional pharmaceutical agent(s) may synergistically augment the modulation of splicing induced by the inventive compounds or compositions of this disclosure in the biological sample or subject. Thus, the combination of the inventive compounds or compositions and the additional pharmaceutical agent(s) may be useful in treating, for example, a cancer or other disease, disorder, or condition resistant to a treatment using the additional pharmaceutical agent(s) without the inventive compounds or compositions.
In order that the invention described herein may be more fully understood, the following examples are set forth. The examples described in this application are offered to illustrate the compounds, pharmaceutical compositions, and methods provided herein and are not to be construed in any way as limiting their scope.
The compounds provided herein can be prepared from readily available starting materials using modifications to the specific synthesis protocols set forth below that would be well known to those of skill in the art. It will be appreciated that where typical or preferred process conditions (i.e., reaction temperatures, times, mole ratios of reactants, solvents, pressures, etc.) are given, other process conditions can also be used unless otherwise stated. Optimum reaction conditions may vary with the particular reactants or solvents used, but such conditions can be determined by those skilled in the art by routine optimization procedures.
Additionally, as will be apparent to those skilled in the art, conventional protecting groups may be necessary to prevent certain functional groups from undergoing undesired reactions. The choice of a suitable protecting group for a particular functional group as well as suitable conditions for protection and deprotection are well known in the art. For example, numerous protecting groups, and their introduction and removal, are described in Greene et al., Protecting Groups in Organic Synthesis, Second Edition, Wiley, New York, 1991, and references cited therein.
Reactions can be purified or analyzed according to any suitable method known in the art. For example, product formation can be monitored by spectroscopic means, such as nuclear magnetic resonance (NMR) spectroscopy (e.g., 1H or 13C), infrared (IR) spectroscopy, spectrophotometry (e.g., UV-visible), mass spectrometry (MS), or by chromatographic methods such as high performance liquid chromatography (HPLC) or thin layer chromatography (TLC).
Proton NMR: 1H NMR spectra were recorded in CDCl3 solution in 5-mm o.d. tubes (Wildmad) at 24° C. and were collected on a BRUKER AVANCE NEO 400 at 400 MHz for 1H. The chemical shifts ((δ) are reported relative to tetramethylsilane (TMS=0.00 ppm) and expressed in ppm.
LC/MS: Liquid chromatography-mass spectrometry (LC/MS) was performed on Shimadzu-2020EV using column: Shim-pack XR-ODS (C18, Ø4.6×50 mm, 3 μm, 120 Å, 40° C.) operating in ESI(+) ionization mode; flow rate=1.2 mL/min. Mobile phase=0.05% TFA in water or CH3CN; or on Shimadzu-2020EV using column: Poroshell HPH-C18 (C18, Ø4.6×50 mm, 3 μm, 120 Å, 40° C.) operating in ESI(+) ionization mode; flow rate=1.2 mL/min. Mobile phase A: Water/5 mM NH4HCO3, Mobile phase B: CH3CN.)
Analytical chiral HPLC: Analytical chiral HPLC was performed on a Agilent 1260 using column: CHIRALPAK IG-3, CHIRALPA IC-3 or CHIRALPAK OJ-3, with flow rate=1.2 mL/min. Mobile phase=MTBE(DEA):EtOH=50:50). Preparative HPLC purification: prep-HPLC purification was performed using one of the following HPLC conditions:
Condition 1: BioBasic C-18 column (length: 250, diameter: 21.2 mm, particle size: 5 μm); Flow rate=5 mL/min); Mobile Phase A: water (containing 0.1% TFA); Mobile Phase B: acetonitrile; Gradient of 10-90% B over 90 min.
Preparative chiral HPLC: purification by chiral HPLC was performed on a Gilson-GX 281 using column: CHIRALPAK IG-3, CHIRALPAK 1C-3 or CHIRALPAK OJ-3.
Compounds of the present disclosure may be prepared using one of the synthetic protocols illustrated below in Schemes A-B.
An exemplary method of preparing a compound of Formula (I-I) is provided in Scheme A. In this scheme, B-1 and B-2 are coupled to provide B-3 in Step 1. The reaction of Step 1 involves incubating B-1 and B-2 with hexafluorophosphate azabenzotriazole tetramethyl uranium (HATU), or a similar coupling agent, in the presence of diisopropylethylamine (DIPEA), in dimethylformamide (DMF). Suitable alternatives to DIPEA and DMF may also be used in the reaction. In Step 2, B-3 is cyclized to provide B-4, which entails treating B-3 with triethyl orthoformate and tosic acid, or a similar combination of reagents. Finally, a compound of Formula (I-I) is prepared in Step 3 by coupling B-4 with B-5 in the presence of cesium carbonate and DIPEA. Alternatives reagents to cesium carbonate and/or DIPEA can also be used in this reaction. The reaction of Step 3 may be conducted in dioxane or a similar solvent, with heating to 80° C. or a temperature sufficient to provide the compound of Formula (I-I). Each starting material and/or intermediate in Scheme B may be protected and deprotected using standard protecting group methods. In addition, purification and characterization of each intermediate as well as the final compound of Formula (I) may be afforded by any accepted procedure.
An exemplary method of preparing a compound of Formula (II-I) is provided in Scheme B. Step 1 involves coupling C-1 with C-2 to provide the intermediate C-3. The reaction of Step 1 may be carried out using cesium carbonate, or a similar reagent, and any suitable solvent such as dioxane. The reaction of Step 1 may be carried out at 80° C. or a temperature sufficient to provide C-3. In Step 2, C-3 is incubated with C-4 in the presence of diisopropylethylamine (DIPEA), using dichloromethane or a similar solvent, in order to afford C-5. Alternative reagents to DIPEA may also be used in the reaction of Step 2. In Step 3, C-5 is cyclized by first treating C-5 with ammonia and methanol, or a similar mixture, over an extended period (e.g., overnight) at room temperature. The intermediate may then be heated to a suitable temperature to provide a compound of Formula (II-I). Each starting material and/or intermediate in Scheme C may be protected and deprotected using standard protecting group methods. In addition, purification and characterization of each intermediate as well as the final compound of Formula (II) may be afforded by any accepted procedure.
A mixture of ethyl 5-bromothiazole-4-carboxylate (B1; 2 g, 8.47 mmol), tert-butyl carbamate (B2; 1.2 g, 10.1 mmol), Cs2CO3 (3.86 g, 11.8 mmol), Pd(OAc)2 (0.19 g, 0.85 mmol) and (±)-BINAP (1.58 g, 2.53 mmol) in degassed dioxane (85 mL) was heated to 100° C. for 12 h, and then cooled to room temperature. The reaction mixture was filtered through Celite and concentrated under reduced pressure, and then purified by column chromatography on silica gel eluting with a gradient of 0-40% ethyl acetate in hexanes, to afford ethyl 5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B3; 1.8 g) as a solid. LCMS (ES, m/z): 273.1 [M+H]+.
N-Bromosuccinimide (3 g, 16.9 mmol) was added to a solution of ethyl 5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B3; 2.07 g, 7.61 mmol) in dichloromethane (55 mL), and the reaction mixture was stirred at room temperature for 3 h. The volatiles were then removed under reduced pressure, and the residue was purified by column chromatography on silica gel eluting with a gradient of 0-30% ethyl acetate in hexanes, to afford ethyl 2-bromo-5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B4; 1.6 g) as a solid. LCMS (ES, m/z): 351.0 [M+H]+.
A mixture of ethyl 2-bromo-5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B4; 250 mg, 0.71 mmol), 2,7-dimethyl-5-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-2H-indazole (B5; 213 mg, 0.78 mmol), PdCl2(dppf)-CH2Cl2 (58.1 mg, 0.071 mmol) and Cs2CO3 (464 mg, 1.42 mmol) in dioxane (2.5 mL) and H2O (0.2 mL) was heated to 95° C. for 4 h, and then cooled to room temperature. The mixture was filtered through Celite using ethyl acetate as an eluent, and further concentrated under reduced pressure. The residue was purified by column chromatography on silica gel eluting with a gradient of 0-2% methanol in ethyl acetate to afford ethyl 5-((tert-butoxycarbonyl)amino)-2-(2,7-dimethyl-2H-indazol-5-yl)thiazole-4-carboxylate (B6; 220 mg) as a solid. LCMS (ES, m/z): 417.1 [M+H]+.
Step 1: A 2M solution of sodium hydroxide (1.35 mL, 2.7 mmol) was added to a solution of ethyl 5-((tert-butoxycarbonyl)amino)-2-(2,7-dimethyl-2H-indazol-5-yl)thiazole-4-carboxylate (B6; 219 mg, 0.53 mmol) in methanol/tetrahydrofuran (10 mL, 2:1 ratio) at room temperature, and the reaction mixture was heated to 50° C. for 12 h, and then cooled to room temperature. Next, a 6M solution of HCl was added until a pH of 3-4 was achieved. The mixture was then concentrated under reduced pressure, and water (10 mL) and dichloromethane (10 mL) were added, the layers were separated, and the aqueous phase was further extracted with dichloromethane (3×15 ml). The combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude product of Step 1 was used in Step 2 without further purification.
Step 2: Hexafluorophosphate azabenzotriazole tetramethyl uronium (HATU; 200 mg, 0.53 mmol) and diisopropylethylamine (147 μL, 0.84 mmol) were sequentially added to a mixture of the crude product of Step 1 (163 mg, 0.42 mmol) and 1-methylpiperidin-4-amine (B7; 79.0 μL, 0.63 mmol) in dimethylformamide (6 mL), and the resulting mixture was stirred at 0° C. for 1 h, and then at room temperature for an additional 1 h. The volatiles were then removed under reduced pressure, and the residue was purified by column chromatography on silica gel eluting with a gradient of 0-15% methanol in dichloromethane to afford tert-butyl (2-(2,7-dimethyl-2H-indazol-5-yl)-4-((1-methylpiperidin-4-yl)carbamoyl)thiazol-5-yl)carbamate (B8; 122 mg) as an oil. LCMS (ES, m/z): 484.9 [M+H]+.
A 4M solution of HCl in dioxane (12 mL, 48 mmol) was added to a solution of tert-butyl (2-(2,7-dimethyl-2H-indazol-5-yl)-4-((1-methylpiperidin-4-yl)carbamoyl)thiazol-5-yl)carbamate (B8; 175 mg, 0.36 mmol) in methanol (12 mL), and the reaction mixture was stirred at room temperature for 5 h. The volatiles were then removed under reduced pressure, and the crude material was triturated with diethyl ether, and the resulting precipitate was collected by filtration and dried under vacuum. The solid was then dissolved in water (5 mL) and the pH of the solution was adjusted to ˜8 with saturated solution of sodium carbonate. The aqueous solution was then extracted with ethyl acetate (3×20 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure to afford 5-amino-2-(2,7-dimethyl-2H-indazol-5-yl)-N-(1-methylpiperidin-4-yl)thiazole-4 carboxamide (B9; 90.0 mg) as a solid. LCMS (ES, m/z): 385.2 [M+H]+.
Ethyl chloroformate (0.25 mL, 2.62 mmol) was added to a solution of 5-amino-2-(2,7-dimethyl-2H-indazol-5-yl)-N-(1-methylpiperidin-4-yl)thiazole-4-carboxamide (B9; 40 mg, 0.1 mmol) in dimethylformamide (0.5 mL), and the reaction mixture was stirred at room temperature for 10 min, and then at 150° C. for an additional 12 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (15 mL) were added, and the layers were separated. The aqueous phase was further extracted with dichloromethane (3×15 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The residue was purified by semi-preparative HPLC (Condition 1) to afford 2-(2,7-dimethyl-2H-indazol-5-yl)-6-(1-methylpiperidin-4-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 179; 12 mg) as a trifluoroacetate salt. LCMS (ES, m/z): 395.2 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 9.53 (1H, br s), 8.47 (1H, s), 8.38 (1H, s), 8.24 (1H, s), 7.66 (1H, s), 4.81 (1H, t, J=11.9 Hz), 4.17 (3H, s), 3.55 (2H, m), 3.18 (2H, m), 2.78 (3H, d, J=3.9 Hz), 2.54 (3H, s), 2.28-2.38 (2H, m), 2.12 (2H, m).
Hexafluorophosphate azabenzotriazole tetramethyl uronium (HATU; 1.47 g, 3.86 mmol) and diisopropylethylamine (1.08 mL, 6.18 mmol) were sequentially added to a mixture of 5-((tert-butoxycarbonyl)amino)-2-(2,7-dimethyl-2H-indazol-5-yl)thiazole-4-carboxylic acid (B10; prepared by the method outlined in Example 1; 1 g, 3.09 mmol) and 2-methyl-2H-indazol-5-amine (B11; (0.6 g, 4.07 mmol) in dimethylformamide (44 mL), and the resulting mixture was stirred at 0° C. for 1 h, and then at room temperature for an additional 1 h. The volatiles were then removed under reduced pressure, and the residue was purified by column chromatography on silica gel eluting with a gradient of 0-100% ethyl acetate in hexanes, to afford tert-butyl (2-bromo-4-((2-methyl-2H-indazol-6-yl)carbamoyl)thiazol-5-yl)carbamate (B12; 1.35 g) as a solid. LCMS (ES, m/z): 452.1 [M+H]+.
A mixture of tert-butyl (2-bromo-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (B12; 100 mg, 0.22 mmol), N-methyl piperazine (B14; 0.12 mL, 1.11 mmol) and diisopropylethylamine (0.06 mL, 0.34 mmol) in dioxane (0.5 mL) was heated to 100° C. for 48 h, and then cooled to room temperature. The volatiles were removed under reduced pressure, and the residue was purified by column chromatography on silica gel eluting with a gradient of 0-20% methanol in dichloromethane, to afford tert-butyl (4-((2-methyl-2H-indazol-5-yl)carbamoyl)-2-(4-methylpiperazin-1-yl)thiazol-5-yl)carbamate (B15; 67 mg) as a solid. LCMS (ES, m/z): 472.2 [M+H]+.
Step 1: Trifluoroacetic acid (1.2 mL, 15.6 mmol) was added to a solution of tert-butyl (4-((2-methyl-2H-indazol-5-yl)carbamoyl)-2-(4-methylpiperazin-1-yl)thiazol-5-yl)carbamate (B15; 101 mg, 0.21 mmol) in a mixture of dichloromethane (3 mL) and water (0.1 mL), and the reaction mixture was stirred at room temperature for 2 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (15 mL) were added, and the layers were separated. The aqueous phase was further extracted with dichloromethane (3×10 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude product from Step 1 was used directly in Step 2 without further purification.
Step 2: Ethyl chloroformate (0.14 mL, 1.51 mmol) was added to a solution of the crude product from Step 1 (75 mg, 0.2 mmol) in dimethylformamide (2 mL), and the reaction mixture was stirred at room temperature for 10 min, and then at 150° C. for an additional 1 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (15 mL) were added, and the layers were separated. The aqueous phase was further extracted with dichloromethane (3×15 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The residue was purified by column chromatography on silica gel eluting with a gradient of 0-15% methanol in dichloromethane, to afford 6-(2-methyl-2H-indazol-5-yl)-2-(4-methylpiperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 181; 32 mg) as a solid. LCMS (ES, m/z): 382.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 8.49 (1H, s), 8.28 (1H, s), 7.84 (1H, s), 7.71 (1H, d, J=9.1 Hz), 7.26 (1H, dd, J=9.1, 2.0 Hz), 4.22 (3H, s), 3.54 (4H, t, J=4.7 Hz), 2.44 (4H, t, J=4.8 Hz), 2.23 (3H, s).
Diisopropylethylamine (41 μL, 0.24 mmol) was added to a solution of tert-butyl (2-bromo-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (B12 from Example 2; 50 mg, 0.16 mmol) and 1-(4-fluorobenzyl)piperazine (B26; 150 mg, 0.77 mmol) in dioxane (0.5 mL), and the reaction mixture was heated to 100° C. for 36 h, and then cooled to room temperature. The mixture was then concentrated under reduced pressure and purified by column chromatography on silica gel eluting with a gradient of 0-10% methanol in dichloromethane, to afford tert-butyl (2-(4-(4-fluorobenzyl)piperazin-1-yl)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (B27; 52 mg) as a solid. LCMS (ES, m/z): 566.0 [M+H]+.
Step 1: Trifluoroacetic acid (0.36 mL, 4.7 mmol) was added to a solution of tert-butyl (2-(4-(4-fluorobenzyl)piperazin-1-yl)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (B27; 54.0 mg, 0.095 mmol) in dichloromethane (2.5 mL) and H2O (50 μL), and the mixture was stirred at room temperature for 2 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (10 mL) were added and the layers were separated. The aqueous layer was further extracted with dichloromethane (3×15 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude product from Step 1 was used in Step 2 without further purification. Step 2: Ethyl chloroformate (0.11 mL, 1.2 mmol) was added to a solution of the crude product from Step 1 (44 mg, 0.12 mmol) in dimethylformamide (2 mL), and the reaction mixture was stirred at room temperature for 10 min, and then at 130° C. for an additional 1 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (15 mL) were added, and the layers were separated. The aqueous phase was further extracted with dichloromethane (3×15 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude product was purified by column chromatography on silica gel eluting with a gradient of 0-10% methanol in dichloromethane, to afford 2-(4-(4-fluorobenzyl)piperazin-1-yl)-6-(2-methyl-2H-indazol-5-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 185; 25 mg) as a solid. LCMS (ES, m/z): 476.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 8.47 (1H, s), 8.26 (1H, s), 7.82 (1H, s), 7.69 (1H, d, J=9.1 Hz), 7.36 (2H, t, J=6.7 Hz), 7.24 (1H, d, J=9.2 Hz), 7.15 (2H, t, J=8.6 Hz), 4.20 (3H, s), 3.52 (6H, s). Note: Signal of four hydrogen atoms are overlapping with solvent peak of residual DMSO-ds.
A mixture of ethyl 2-bromo-5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B10, prepared by the method outlined in Example 1; 350 mg, 1 mmol), 5-fluoro-2-methyl-7-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)imidazo[1,2-a]pyridine (B25; 358 mg, 1.29 mmol), PdCl2(dppf).CH2Cl2 (81 mg, 0.1 mmol) and Cs2CO3 (649 mg, 1.99 mmol) in dioxane (5 mL) and H2O (0.3 mL) was heated to 95° C. for 4 h, and then cooled to room temperature. The reaction mixture was filtered through Celite using ethyl acetate as an eluent. The filtrate was concentrated under reduced pressure and purified by column chromatography on silica gel eluting with a gradient of 0-60% ethyl acetate in hexanes, to afford ethyl 5-((tert-butoxycarbonyl)amino)-2-(5-fluoro-2-methylimidazo[1,2-a]pyridin-7-yl)thiazole-4-carboxyl ate (B28; 170 mg) as a solid. LCMS (ES, m/z): 421.2 [M+H]+.
Step 1: A 2M solution of sodium hydroxide (1 mL, 2 mmol) was added to a solution of ethyl 5-((tert-butoxycarbonyl)amino)-2-(5-fluoro-2-methylimidazo[1,2-a]pyridin-7-yl)thiazole-4-carboxylate (B28; 170 mg, 0.4 mmol) in methanol/tetrahydrofuran (7 mL, 3:1 ratio) at room temperature, and the mixture was heated to 55° C. for 24 h, and then cooled to room temperature. A 6M solution of HCl was then added until a pH of 3-4 was achieved, and the volatiles were removed under reduced pressure. Water (10 mL) and dichloromethane (10 mL) were then added and the layers were separated. The aqueous phase was further extracted with dichloromethane (3×15 ml), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude product from Step 1 was used in Step 2 without further purification.
Step 2: Hexafluorophosphate azabenzotriazole tetramethyl uronium (HATU; 164 mg, 0.43 mmol) and diisopropylethylamine (0.12 mL, 0.69 mmol) were sequentially added to a mixture of the crude product from Step 1 (135 mg, 0.334 mmol) and 1-methylpiperidin-4-amine (B7; 51 mg, 0.45 mmol) in dimethylformamide (10 mL), and the reaction mixture was stirred at 0° C. for 1 h, and then at room temperature for an additional 1 h. The volatiles were then removed under reduced pressure, and the residue was purified by column chromatography on silica gel eluting with a gradient of 0-15% methanol in dichloromethane, to afford tert-butyl (2-(8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)-4-((1-methylpiperidin-4-yl)carbamoyl)thiazol-5-yl)carbamate (B29; 123 mg) as an oil. LCMS (ES, m/z): 488.9 [M+H]+.
Step 1: Trifluoroacetic acid (1.9 mL, 25 mmol) was added to a solution of tert-butyl (2-(4-(4-fluorobenzyl)piperazin-1-yl)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (B29; 110 mg, 0.23 mmol) in dichloromethane (15 mL) and H2O (0.6 mL), and the reaction mixture was stirred at room temperature for 3 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (20 mL) were added and the layers were separated. The aqueous layer was further extracted with dichloromethane (3×20 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude product from Step 1 was used in directly in Step 2 without further purification.
Step 2: Ethyl chloroformate (0.11 mL, 1.18 mmol) was added to a solution of the crude product from Step 1 (42 mg, 0.11 mmol) in dimethylformamide (2 mL), and the reaction mixture was stirred at room temperature for 10 min, and then at 150° C. for an additional 12 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and ethyl acetate (15 mL) were added, and the layers were separated. The aqueous phase was further extracted with ethyl acetate (3×15 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The residue was purified by column chromatography on silica gel eluting with a gradient of 0-20% methanol in dichloromethane, to afford 2-(8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)-6-(1-methylpiperidin-4-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 186; 5.6 mg) as a solid. LCMS (ES, m/z): 399.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 9.28 (1H, s), 8.64 (1H, s), 7.96 (1H, s), 7.67 (1H, d, J=11.7 Hz), 4.66 (1H, m), 2.95 (2H, m), 2.39 (3H, s), 2.25 (3H, s), 2.10 (4H, br s), 1.85 (2H, br s).
Step 1: A 2M solution of sodium hydroxide (1.3 mL, 2.6 mmol) was added to a solution of methyl 5-((tert-butoxycarbonyl)amino)-2-(2,7-dimethyl-2H-indazol-5-yl)thiazole-4-carboxylate (B6 from Example 1; 210 mg, 0.5 mmol) in a mixture of methanol/tetrahydrofuran (9 mL, 2:1 ratio) at room temperature, and the reaction mixture was heated to 55° C. for 12 h, and then cooled to room temperature. A 6M solution of HCl was then added to achieve a pH of 3-4, and the volatiles were removed under reduced pressure. Water (10 mL) and dichloromethane (10 mL) were then added and the layers were separated. The aqueous phase was further extracted with dichloromethane (3×15 ml), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude product from Step 1 was used in directly in Step 2 without further purification.
Step 2: Hexafluorophosphate azabenzotriazole tetramethyl uronium (HATU; 240 mg, 0.63 mmol) and diisopropylethylamine (176 μL, 1 mmol) were sequentially added to a mixture of the crude product from Step 1 (196 mg, 0.51 mmol) and 2,2,6,6-tetramethylpiperidin-4-amine (B30; 103 mg, 0.66 mmol) in dimethylformamide (10 mL), and the reaction mixture was stirred at 0° C. for 1 h. The volatiles were then removed under reduced pressure, and the residue was purified by reverse phase chromatography using a C18 column, eluting with a gradient of 0-100% acetonitrile in neutral water, followed by 100% MeOH, to afford the tert-butyl (2-(2,7-dimethyl-2H-indazol-5-yl)-4-((2,2,6,6-tetramethylpiperidin-4-yl)carbamoyl)thiazol-5-yl)carbamate (B31; 105 mg) as an oil. LCMS (ES, m/z): 527.3 [M+H]+.
Step 1: Trifluoroacetic acid (3.9 mL, 51 mmol) was added to a solution of tert-butyl (2-(2,7-dimethyl-2H-indazol-5-yl)-4-((2,2,6,6-tetramethylpiperidin-4-yl)carbamoyl)thiazol-5-yl)carbamate (B31; 100 mg, 0.17 mmol) in dichloromethane (15 mL) and H2O (0.2 mL), and the reaction mixture was stirred at room temperature for 2 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (20 mL) and dichloromethane (20 mL) were added and the layers were separated. The aqueous layer was further extracted with dichloromethane (3×20 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude material from Step 1 was used directly in Step 2 without further purification.
Step 2: A mixture of dimethylformamide (0.6 mL) and POC13 (0.61 mL, 6.54 mmol) was stirred at room temperature for 40 min, and then a solution of the crude product from Step 1 (93 mg, 0.218 mmol) in dimethylformamide (0.5 mL) was then added, and the reaction mixture was stirred at room temperature for 40 min, and then at 80° C. for an additional 12 h. The volatiles were removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (10 mL) were added and the layers were separated. The aqueous layer was further extracted with dichloromethane (3×10 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The residue was first purified by column chromatography on silica gel eluting with a gradient of 0-20% methanol in dichloromethane, and further purified by semi-preparative HPLC (Condition 1) to afford 4-(2-(2,7-dimethyl-2H-indazol-5-yl)-7-oxothiazolo[5,4-d]pyrimidin-6(7H)-yl)-2,2,6,6-tetramethylpiperidine-1-carbaldehyde (Compound 187; 22 mg) as a trifluoroacetate salt. LCMS (ES, m/z): 465.2 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 10.39 (1H, s), 8.81 (1H, d, J=12.4 Hz), 8.61 (2H, br s), 7.95 (1H, d, J=12.4 Hz), 7.84 (1H, s), 5.03 (1H, t, J=12.2 Hz), 4.52 (3H, s), 2.67 (3H, s), 2.43 (2H, t, J=14.3 Hz), 2.06 (2H, d, J=13.1 Hz), 1.51 (6H, s), 1.42 (6H, s).
A mixture of tert-butyl (2-bromo-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (B12 from Example 2; 350 mg, 0.77 mmol), 1-methyl-4-(4,4,5,5-tetramethyl-1,3,2-dioxaborolan-2-yl)-1,2,3,6-tetrahydropyridine (B32; 224 mg, 1 mmol), PdCl2(dppf).DCM (63 mg, 0.077 mmol) and Cs2CO3 (504 mg, 1.54 mmol) in a mixture of dioxane (5 mL) and H2O (0.1 mL) was heated to 95° C. for 4 h and then cooled to room temperature. The reaction mixture was then filtered through Celite using ethyl acetate/methanol (9:1) as an eluent. The volatiles were removed under reduced pressure, and the residue was purified by column chromatography on silica gel eluting with a gradient of 0-60% ethyl acetate in hexanes, to afford tert-butyl (2-(1-methyl-1,2,3,6-tetrahydropyridin-4-yl)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (B33; 240 mg) as a solid. LCMS (ES, m/z): 469.2 [M+H]+.
Step 1: Trifluoroacetic acid (1 mL, 13 mmol) was added to a solution of tert-butyl (2-(1-methyl-1,2,3,6-tetrahydropyridin-4-yl)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (B33; 120 mg, 0.26 mmol) in a mixture of dichloromethane (6 mL) and H2O (0.1 mL), and the reaction mixture was stirred at room temperature for 2 h. The reaction was quenched with a saturated solution of NaHCO3, and then dichloromethane (20 mL) was added and the layers were separated. The aqueous layer was further extracted with dichloromethane (3×15 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The crude product from Step 1 was used directly in Step 2 without further purification. Step 2: Ethyl chloroformate (0.18 mL, 1.89 mmol) was added to a solution of the crude product from Step 1 (70 mg, 0.19 mmol) in dimethylformamide (3 mL), and the reaction mixture was stirred at room temperature for 30 min, and then at 130° C. for an additional 3 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (20 mL) and dichloromethane (20 mL) were added, and the layers were separated. The aqueous phase was further extracted with dichloromethane (3×20 mL), and the combined organic layers were dried over Na2SO4, filtered and concentrated under reduced pressure. The residue was purified by column chromatography on silica gel eluting with a gradient of 0-15% methanol in dichloromethane, to afford 2-(1-methyl-1,2,3,6-tetrahydropyridin-4-yl)-6-(2-methyl-2H-indazol-5-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 188; 21 mg) as a solid. LCMS (ES, m/z): 379.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 8.52 (2H, d, J=7.7 Hz), 7.89 (1H, s), 7.73 (1H, d, J=9.1 Hz), 7.31 (1H, d, J=9.1 Hz), 6.75 (1H, s), 4.22 (3H, s), 3.11 (2H, s), 2.61 (4H, m), 2.30 (3H, s).
Di-tert-butyl dicarbonate (4.8 g, 22 mmol), triethylamine (6 mL, 43.5 mmol) and 4-dimethylaminopyridine (0.24 g, 2 mmol) were sequentially added to a solution of ethyl 2-amino-5-bromothiazole-4-carboxylate (B34; 5 g, 20 mmol) in anhydrous tetrahydrofuran (85 mL) at 0° C. The reaction mixture was then slowly warmed to room temperature and stirred for 12 h. The volatiles were then removed under reduced pressure, and water (100 mL) and dichloromethane (100 mL) were added, and the layers were separated. The aqueous layer was extracted with dichloromethane (3×50 mL), and the combined organic layers were dried over sodium sulfate, filtered and concentrated under reduced pressure. The crude product was purified by column chromatography on silica gel eluting with ethyl acetate (0-30%) in hexanes, to afford ethyl 5-bromo-2-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B35; 3.9 g) as a solid. LCMS (ES, m/z): 373.0 [M+Na]+.
Triphenylphosphine (11.7 g, 44 mmol) was added to a solution of 4-hydroxy-N-methylpiperidine (B36; 4.5 g, 39 mmol) in tetrahydrofuran (40 mL) at 0° C., followed by the dropwise addition of diethylazodicarboxylate (7.8 mL, 46 mmol). Ethyl 5-bromo-2-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B35; 3.9 g, 11 mmol) was then added, and the reaction mixture was warmed to room temperature and stirred for 12 h. The volatiles were then removed under reduced pressure, and water (50 mL) and dichloromethane (50 mL) were added, and the layers were separated. The aqueous layer was extracted with dichloromethane (3×50 mL), and the combined organic layers were dried over sodium sulfate, filtered, and concentrated under reduced pressure. The crude material was purified by column chromatography on silica gel eluting with methanol (0-15%) in dichloromethane, to afford the ethyl 5-bromo-2-((tert-butoxycarbonyl)(1-methylpiperidin-4-yl)amino)thiazole-4-carboxylate (B37; 3.2 g) as a solid. LCMS (ES, m/z): 448.1 [M+H]+.
A mixture of ethyl 5-bromo-2-((tert-butoxycarbonyl)(1-methylpiperidin-4-yl)amino)thiazole-4-carboxylate (B37; 1 g, 2.2 mmol), tert-butyl carbamate (B2; 0.31 g, 2.7 mmol), cesium carbonate (1 g, 3.1 mmol), palladium acetate (50 mg, 0.22 mmol) and (±)-BINAP (0.42 g, 0.67 mmol) in degassed dioxane (22 mL) was heated to 100° C. for 12 h, and then cooled to room temperature, filtered through Celite, and concentrated under reduced pressure. The crude material was purified by column chromatography on silica gel eluting with methanol (0-15%) in dichloromethane, to afford ethyl 2-((tert-butoxycarbonyl)(1-methylpiperidin-4-yl)amino)-5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B38; 0.87 g) as a solid. LCMS (ES, m/z): 485.3 [M+H]+.
A solution of sodium hydroxide (2M, 4.5 mL, 9 mmol) was added to a solution of ethyl 2-((tert-butoxycarbonyl)(1-methylpiperidin-4-yl)amino)-5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (B38; 0.87 g, 1.8 mmol) in methanol/tetrahydrofuran (32 mL, 2.5:1 ratio) at room temperature, and the mixture was heated to 55° C. for 36 h, and then cooled to room temperature. A solution of hydrochloric acid (6M) was then added to achieve a pH of 3-4. The volatiles were removed under reduced pressure, and water (50 mL) and dichloromethane (50 mL) were added and the layers were separated. The aqueous layer was extracted with dichloromethane (3×30 mL), and the combined organic layers were dried over sodium sulfate, filtered, and concentrated under reduced pressure, to afford 2-((tert-butoxycarbonyl)(1-methylpiperidin-4-yl)amino)-5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylic acid (B39; 0.7 g) as a solid. LCMS (ES, m/z): 457.2 [M+H]+.
Hexafluorophosphate azabenzotriazole tetramethyl uronium (0.36 g, 0.95 mmol) and diisopropylethylamine (0.27 mL, 1.5 mmol) were sequentially added to a solution of 5-((tert-butoxycarbonyl)amino)-2-(2,7-dimethyl-2H-indazol-5-yl)thiazole-4-carboxylic acid (B39; 0.35 g, 0.77 mmol) and 2-methyl-2H-indazol-5-amine (B11; 0.15 g, 1 mmol) in dimethylformamide (10 mL), and the reaction mixture was stirred at 0° C. for 1 h, then warmed to room temperature and stirred for an additional 1 h. The volatiles were removed under reduced pressure, and water (15 mL) and dichloromethane (10 mL) were added and the layers were separated. The aqueous layer was extracted with dichloromethane (3×10 mL), and the combined organic layers were dried over sodium sulfate, filtered and concentrated under reduced pressure. The residue was purified by reverse phase chromatography on a C18 column, eluting with acetonitrile (0-100%) in 0.1% aqueous formic acid. The fractions containing product were collected and concentrated under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (10 mL) were added and the layers were separated. The aqueous phase was extracted with dichloromethane (3×10 mL), and the combined organic layers were dried over sodium sulfate, filtered and concentrated under reduced pressure, to afford tert-butyl (5-((tert-butoxycarbonyl)amino)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-2-yl)(1-methylpiperidin-4-yl)carbamate (B40; 0.34 g) as a solid. LCMS (ES, m/z): 585.9 [M+H]+.
Step 1: Hydrochloric acid in dioxane (4M, 21 mL) was added to a solution of tert-butyl (5-((tert-butoxycarbonyl)amino)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-2-yl)(1-methylpiperidin-4-yl)carbamate (B40; 0.31 g, 0.63 mmol) in methanol (21 mL), and the reaction mixture was stirred at room temperature for 48 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (15 mL) and dichloromethane (20 mL) were added and the layers were separated. The aqueous solution was extracted with dichloromethane (3×20 mL), and the combined organic layers were dried over sodium sulfate, filtered, and concentrated under reduced pressure to provide an amine intermediate.
Step 2: Ethyl chloroformate (0.17 mL, 1.82 mmol) was added to a solution of the amine intermediate from Step 1 (0.10 g, 0.26 mmol) in dimethylformamide (5 mL), and the reaction mixture was stirred at room temperature for 30 min, and then at 90° C. for 2 h. The volatiles were removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (10 mL) were added and the layers were separated. The aqueous phase was extracted with dichloromethane (3×10 mL), and the combined organic layers were dried over sodium sulfate, filtered, and concentrated under reduced pressure. The material was purified by reverse phase chromatography on a C18 column eluting with acetonitrile (0-100%) in 0.1% aqueous formic acid. The fractions containing product were then collected and concentrated under reduced pressure, and a saturated solution of NaHCO3 (5 mL) and dichloromethane (10 mL) were added and the layers were separated. The aqueous phase was extracted with dichloromethane (2×10 mL), and the combined organic layers were dried over sodium sulfate and concentrated under reduced pressure, to afford 6-(2-methyl-2H-indazol-5-yl)-2-((1-methylpiperidin-4-yl)amino)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 189; 21 mg) as a solid. LCMS (ES, m/z): 396.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 8.47 (1H, s), 8.25 (1H, s), 8.17 (1H, d, J=7.3 Hz), 7.81 (1H, s), 7.68 (1H, d, J=9.1 Hz), 7.23 (1H, d, J=9.2 Hz), 4.20 (3H, s), 3.64-3.67 (1H, m), 2.71 (2H, br s), 2.17 (3H, s), 1.99 (4H, m), 1.48 (2H, m).
Sodium hydride (6 mg, 0.15 mmol) was added to a solution of 6-(2-methyl-2H-indazol-5-yl)-2-((1-methylpiperidin-4-yl)amino)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 189; 48 mg, 0.12 mmol) in anhydrous dimethylformamide (2.5 mL) at 0° C., and the mixture was stirred for 15 min. Methyl iodide (8 μL, 1.28 mmol) in anhydrous dimethylformamide (2.5 mL) was then added, and the reaction mixture was stirred for 30 min, and then slowly warmed to room temperature and stirred for 12 h. Water (two drops) was then added, and the volatiles were removed under reduced pressure, and water (10 mL) and ethyl acetate (10 mL) were added and the layers were separated. The aqueous layer was extracted with ethyl acetate (3×10 mL), and the combined organic layers were dried over sodium sulfate, filtered, and concentrated under reduced pressure. The crude product was purified by reverse phase chromatography on a C18 column eluting with acetonitrile (0-100%) in 0.1% aqueous formic acid, to afford 2-(methyl(1-methylpiperidin-4-yl)amino)-6-(2-methyl-2H-indazol-5-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 190; 11 mg) as a solid. LCMS (ES, m/z): 410.1 [M+H]+. 1H NMR (CHCl3-d, 400 MHz): δH 8.00 (1H, s), 7.96 (1H, s), 7.81 (1H, d, J=9.1 Hz), 7.66 (1H, s), 7.25 (1H, m), 4.27 (3H, s), 4.25 (1H, br s) 3.05 (3H, s), 2.98 (2H, d, J=11.5 Hz), 2.33 (3H, s), 2.20 (2H, t, J=11.8 Hz), 1.89-1.98 (2H, m), 1.82 (2H, m).
Ethyl chloroformate (0.1 mL, 1.1 mmol) was added to a solution of 5-amino-N-(2-methyl-2H-indazol-5-yl)-2-(1-methylpiperidin-4-yl)thiazole-4-carboxamide (Compound 191; 25 mg, 0.07 mmol) in dimethylformamide, and the reaction mixture was stirred at room temperature for 30 min, and then heated to 130° C. for 2 h. The volatiles were then removed under reduced pressure, and a saturated solution of NaHCO3 (10 mL) and dichloromethane (10 mL) were added, and the layers were separated. The aqueous phase was extracted with dichloromethane (3×10 mL), and the combined organic layers were dried over sodium sulfate, filtered, and concentrated under reduced pressure. The crude product was purified by column chromatography on silica gel eluting with methanol (0-15%) in dichloromethane, to afford 6-(2-methyl-2H-indazol-5-yl)-2-(1-methylpiperidin-4-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (Compound 176; 12 mg) as a solid. LCMS (ES, m/z): 381.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 8.51 (2H, s), 7.87 (1H, s), 7.72 (1H, d, J=9.1 Hz), 7.30 (1H, d, J=9.1 Hz), 4.22 (3H, s), 3.06 (1H, t, J=12.0 Hz), 2.85 (2H, d, J=10.5 Hz), 2.20 (3H, s), 2.05 (4H, m), 1.79 (2H, m).
A mixture of ethyl 2-bromo-5-((tert-butoxycarbonyl)amino)thiazole-4-carboxylate (0.60 g, 1.7 mmol), 1-(4-methoxybenzyl)piperazine (1.7 g, 8.5 mmol), and DIPEA (0.45 mL, 2.6 mmol) in dioxane (5.5 mL) was heated to 100° C. for 36 h, then cooled to room temperature. The reaction mixture of concentrated in vacuo and purified by column chromatography on silica gel using a gradient of 0-5% MeOH in DCM to afford ethyl 5-((tert-butoxycarbonyl)amino)-2-(4-(4-methoxybenzyl)piperazin-1-yl)thiazole-4-carboxyl ate (0.55 g, 87%) as solid. LCMS (ES, m/z): 476.9 [M+H]+
To a solution of ethyl 5-((tert-butoxycarbonyl)amino)-2-(4-(4-methoxybenzyl)piperazin-1-yl)thiazole-4-carboxylate (0.50 g, 1.1 mmol) in a mixture of MeOH/THF (18 mL, 2.6:1 ratio) was added 2 M NaOH (2.6 mL, 5.2 mmol) at room temperature. The reaction mixture was stirred at 55° C. for 12 h, then cooled to room temperature, and pH adjusted to 5 with 6 M HCl. The reaction mixture was concentrated in vacuo to afford a residue, water (30 mL) was added to the residue, and the mixture was sonicated and filtered to collect the solid. The solid was washed with water and dried under vacuum to afford acid intermediate as a solid. LCMS (ES, m/z): 448.9 [M+H]+.
To a solution of acid intermediate and 2-methyl-2H-indazol-5-amine (0.21 g, 1.4 mmol) in DMF (18 mL) were added HATU (0.55 g, 1.5 mmol) and DIPEA (0.25 mL, 1.5 mmol) sequentially. The reaction mixture was stirred at 0° C. for 1 h, then stirred at room temperature for 1 h. The reaction mixture was concentrated in vacuo to a residue, water (20 mL) and DCM (20 mL) were added to the residue, and the layers were separated. The aqueous phase was extracted with DCM (3×20 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to a residue. The residue was purified by reverse phase chromatography using a gradient of 0-100% acetonitrile in water (containing 0.1 FA). The fractions containing product were collected and concentrated in vacuo to a residue. The residue was partitioned between saturated solution of NaHCO3 (15 mL) and DCM (20 mL) were added, and the layers were separated. The aqueous phase was extracted with DCM (3×20 mL). The organic layers were combined, dried over Na2SO4, filtered and concentrated in vacuo to afford tert-butyl (2-(4-(4-methoxybenzyl)piperazin-1-yl)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (0.31 g, 52%) as a solid. LCMS (ES, m/z): 578.2 [M+H]+.
To a solution of tert-butyl (2-(4-(4-methoxybenzyl)piperazin-1-yl)-4-((2-methyl-2H-indazol-5-yl)carbamoyl)thiazol-5-yl)carbamate (0.26 g, 0.45 mmol) in DCM (10 mL) was added TFA (4 mL, 54 mmol). The reaction mixture was stirred at room temperature for 2 h. A saturated solution of NaHCO3 was added until pH ˜8. DCM (15 mL) was added, and the layers were separated. The aqueous layer was extracted with DCM (3×15 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to afford amine intermediate as a solid. The solid was used in the next step without further purification. LCMS (ES, m/z): 478.0 [M+H]+.
To a solution of the amine intermediate in DMF (8 mL) was added ethyl chloroformate (0.57 mL, 5.9 mmol). The reaction mixture was stirred at room temperature for 10 min, then at 80° C. for 2 h. The reaction mixture is concentrated in vacuo to a residue. The residue was partitioned between a saturated solution of NaHCO3 (20 mL) and DCM (25 mL), and the layers were separated. The aqueous layer was extracted with DCM (3×25 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to give a residue. The residue was purified by column chromatography on silica gel using a gradient of 0-10% MeOH in ethyl acetate to afford 2-(4-(4-methoxybenzyl)piperazin-1-yl)-6-(2-methyl-2H-indazol-5-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (70 mg, 33%) as a solid. LCMS (ES, m/z): 487.9 [M+H]+.
A solution of 2-(4-(4-methoxybenzyl)piperazin-1-yl)-6-(2-methyl-2H-indazol-5-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (30 mg, 0.062 mmol) in TFA (2 mL) was heated to 135° C. for 5 h. The reaction mixture was diluted with DCM (10 mL), basified with a saturated solution of NaHCO3 until pH ˜8, and the layers were separated. The aqueous phase was extracted with DCM (3×10 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to a residue. The residue was purified by column chromatography on silica gel using a gradient of 0-20% MeOH in DCM to afford 6-(2-methyl-2H-indazol-5-yl)-2-(piperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (24 mg, 91%) as a solid. LCMS (ES, m/z): 368.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 8.49 (1H, s), 8.26 (1H, s), 7.83 (1H, s), 7.70 (1H, d, J=9.1 Hz), 7.25 (1H, d, J=9.1 Hz), 4.22 (3H, s), 3.47 (4H, t, J=4.8 Hz), 2.78 (4H, t, J=4.8 Hz).
To a solution of ethyl 5-((tert-butoxycarbonyl)amino)-2-(4-(4-methoxybenzyl)piperazin-1-yl)thiazole-4-carboxylate (0.50 g, 1.1 mmol) in a mixture of methanol and THE (18 mL, 2.6:1 ratio) was added 2 M NaOH (2.6 mL, 5.2 mmol) at room temperature. The reaction mixture was stirred at 55° C. for 12 h, then cooled to room temperature. pH adjusted to 5 with 6 M HCl, and concentrated in vacuo to give a residue. Water (30.0 mL) was added to the residue, then the mixture was sonicated. The solid was collected by filtration, washed with water, and dried under vacuum to afford an acid intermediate. LCMS (ES, m/z): 448.9 [M+H]+.
To a mixture of the acid intermediate and 8-fluoro-2-methylimidazo[1,2-a]pyridin-6-amine hydrochloride (0.36 g, 2.2 mmol) in DMF (28 mL) were added HATU (0.84 g, 2.2 mmol) and DIPEA (0.38 mL, 2.2 mmol), sequentially. The reaction mixture was stirred at 0° C. for 1 h, then at room temperature for 1 h. The reaction mixture was concentrated in vacuo to give a residue, then partitioned between water (20 mL) and DCM (20 mL). The aqueous phase was extracted with DCM (3×20 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to give a residue. The residue was purified by reverse phase chromatography using a gradient of 0-100% acetonitrile in water (containing 0.1% formic acid). The fractions containing product were collected and concentrated in vacuo to a residue. The residue was partitioned between a saturated solution of NaHCO3 (15 mL) and DCM (20 mL). The aqueous phase was extracted with DCM (3×20 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to afford tert-butyl (4-((8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)carbamoyl)-2-(4-(4-methoxybenzyl)piperazin-1-yl)thiazol-5-yl)carbamate (0.31 g, 48%) as a solid. LCMS (ES, m/z): 595.9 [M+H]+.
To a solution of tert-butyl (4-((8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)carbamoyl)-2-(4-(4-methoxybenzyl)piperazin-1-yl)thiazol-5-yl)carbamate (0.31 g, 0.54 mmol) in DCM (10 mL) was added TFA (4.0 mL, 54 mmol). The reaction mixture was stirred at room temperature for 2 h, basified with a saturated solution of NaHCO3 until pH ˜8, diluted with DCM (15 mL), and the layers were separated. The aqueous layer was extracted with DCM (3×15 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated under reduced pressure to afford an amine intermediate as a solid, which was used without further purification. LCMS (ES, m/z): 496.2 [M+H]+.
To a solution of the amine intermediate in DMF (8.0 mL) was added ethyl chloroformate (0.72 mL, 7.6 mmol). The reaction mixture was stirred at room temperature for 10 min, then at 80° C. for 3 h. The reaction mixture was concentrated in vacuo to a residue. The residue was partitioned between a saturated solution of NaHCO3 (20 mL) and DCM (25 mL), and the layers were separated. The aqueous layer was extracted with DCM (3×25 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to a reside. The residue was purified by column chromatography on silica gel using a gradient of 0-10% MeOH in EtOAc to afford 6-(8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)-2-(4-(4-methoxybenzyl)piperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (103 mg, 37%) as a solid. LCMS (ES, m/z): 505.9 [M+H]+.
A solution of 6-(8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)-2-(4-(4-methoxybenzyl)piperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (0.10 g, 0.19 mmol) in TFA (7.0 mL) was heated to 135° C. for 5 h. The reaction mixture was diluted with DCM (10 mL), then basified with a saturated solution of NaHCO3 until pH ˜8, and the layers were separated. The aqueous phase was extracted with DCM (3×10 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to give a residue. The residue was purified by column chromatography on silica gel using a gradient of 0-20% MeOH in DCM to afford 6-(8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)-2-(piperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (44 mg, 58%) as a solid. LCMS (ES, m/z): 386.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 8.73 (1H, s), 8.33 (1H, s), 7.93 (1H, d, J=2.9 Hz), 7.39 (1H, d, J=11.5 Hz), 3.49 (4H, t, J=4.9 Hz), 2.85 (4H, t, J=4.8 Hz), 2.40 (3H, s).
To a solution of 6-(8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)-2-(piperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (20 mg, 0.052 mmol) in THE (2.0 mL) in EtOH (1.0 mL) were added formaldehyde (37% in H2O, 26 μL, 0.26 mmol) and NaBH(OAc)3 (68 mg, 0.31 mmol), sequentially. The reaction mixture was stirred at room temperature for 12 h, diluted with water (2.0 mL), and the volatiles were evaporated under reduced pressure. Water (5.0 mL) and DCM (10 mL) were added, and the layers were separated. The aqueous phase was extracted with DCM (3×10 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to give a residue. The residue was purified by column chromatography on silica gel using a gradient of 0-20% MeOH in DCM to afford 6-(8-fluoro-2-methylimidazo[1,2-a]pyridin-6-yl)-2-(4-methylpiperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (16 mg, 77%) as a solid. LCMS (ES, m/z): 400.1 [M+H]+. 1H NMR (DMSO-d6, 400 MHz): δH 8.73 (1H, s), 8.33 (1H, s), 7.93 (1H, s), 7.39 (1H, d, J=11.5 Hz), 3.53-3.54 (4H, m), 2.43-2.44 (4H, m), 2.40 (3H, s), 2.24 (3H, s).
To a solution of ethyl 5-((tert-butoxycarbonyl)amino)-2-(4-(4-methoxybenzyl)piperazin-1-yl)thiazole-4-carboxylate (0.58 g, 1.2 mmol) in a mixture of methanol and THE (18 mL, 2.6:1 ratio) was added 2 M NaOH (3.1 mL, 6.1 mmol) at room temperature. The reaction mixture was stirred at 55° C. for 12 h and then cooled to room temperature. The reaction mixture was pH adjusted to ˜5 with 6 M HCl, and concentrated in vacuo to give a residue. To the residue was added water (30 mL), and the mixture was sonicated and the solid collected by filtration. The solid was washed with water and dried under vacuum to afford an acid intermediate as a solid. LCMS (ES, m/z): 448.9 [M+H]+.
To a mixture of acid intermediate and 2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridin-6-amine hydrochloride (0.56 g, 2.2 mmol) in DMF (30 mL) were added HATU (0.85 g, 2.2 mmol) and DIPEA (0.39 mL, 2.2 mmol), sequentially. The reaction mixture was stirred at 0° C. for 1 h, then at room temperature for 1 h. The reaction mixture was concentrated in vacuo to give a residue. The residue was partitioned between water (20 mL) and DCM (20 mL), and the layers were separated. The aqueous phase was extracted with DCM (3×20 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to give a residue. The residue was purified by reverse phase chromatography using a gradient of 0-100% acetonitrile in water (containing 0.1% formic acid). The fractions containing product were collected and concentrated in vacuo to give a residue. The residue was partitioned between a saturated solution of NaHCO3 (15 mL) and DCM (20 mL), and the layers were separated. The aqueous phase was extracted with DCM (3×20 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to afford tert-butyl (2-(4-(4-methoxybenzyl)piperazin-1-yl)-4-((2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridin-6-yl)carbamoyl)thiazol-5-yl)carbamate (0.47 g, 60%) as a solid. LCMS (ES, m/z): 646.2 [M+H]+.
To a solution of tert-butyl (2-(4-(4-methoxybenzyl)piperazin-1-yl)-4-((2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridin-6-yl)carbamoyl)thiazol-5-yl)carbamate (0.47 g, 0.74 mmol) in DCM (10 mL) was added TFA (5.6 mL, 74.0 mmol). The reaction mixture was stirred at room temperature for 2 h, basified with a saturated solution of NaHCO3 to pH ˜8, diluted with DCM (15 mL), and the layers were separated. The aqueous layer was extracted with DCM (3×15 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to afford an amine intermediate as a solid. LCMS (ES, m/z): 546.1 [M+H]+. To a solution of amine intermediate in DMF (10 mL) was added ethyl chloroformate (1.4 mL, 14.8 mmol). The reaction mixture was heated to 110° C. for 3 h, then cooled to room temperature, and concentrated in vacuo to give a residue. The residue was partitioned between a saturated solution of NaHCO3 (20 mL) and DCM (20 mL), and the layers were separated. The aqueous layer was extracted with DCM (3×20 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to give a residue. The residue was purified by column chromatography on silica gel using a gradient of 0-10% MeOH in DCM to afford 2-(4-(4-methoxybenzyl)piperazin-1-yl)-6-(2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridin-6-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (0.18 g, 44% over two steps) as a solid. LCMS (ES, m/z): 555.9 [M+H]+.
A solution of 2-(4-(4-methoxybenzyl)piperazin-1-yl)-6-(2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridin-6-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (0.18 g, 0.32 mmol) in TFA (10.8 mL, 141 mmol) was heated to 135° C. for 5 h. The reaction mixture was diluted with DCM (30 mL), basified with a saturated solution of NaHCO3 to pH ˜8, and the layers were separated. The aqueous phase was extracted with DCM (3×30 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to give a residue. The residue was purified by column chromatography on silica gel using a gradient of 0-20% MeOH in DCM to afford 6-(2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridin-6-yl)-2-(piperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (75 mg, 53%) as a solid. LCMS (ES, m/z): 435.7 [M+H]+. 1H NMR (CHCl3-d, 400 MHz): δH 8.42 (1H, s), 7.92 (1H, s), 7.54 (1H, s), 7.50 (1H, s), 3.64 (4H, t, J=4.9 Hz), 3.01 (4H, t, J=4.9 Hz), 2.55 (3H, s).
To a solution of 6-(2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridin-6-yl)-2-(piperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (38 mg, 0.087 mmol) in a mixture of DCM (0.90 mL) and EtOH (0.30 mL) were added formaldehyde (37% in H2O, 43 μL, 0.44 mmol) and NaBH(OAc)3 (0.11 g, 0.52 mmol), sequentially. The reaction mixture was stirred at room temperature for 12 h, then water (5 mL) was added, and the volatiles were evaporated under reduced pressure. Water (5 mL) and DCM (10 mL) were added, and the layers were separated. The aqueous layer was extracted with DCM (3×10 mL). The organic layers were combined, dried over Na2SO4, filtered, and concentrated in vacuo to give a residue. The residue was purified by column chromatography on silica gel using a gradient of 0-10% MeOH/Et3N (2:1) in DCM to afford 6-(2-methyl-8-(trifluoromethyl)imidazo[1,2-a]pyridin-6-yl)-2-(4-methylpiperazin-1-yl)thiazolo[5,4-d]pyrimidin-7(6H)-one (32.0 mg, 81%) as a solid. LCMS (ES, m/z): 449.8 [M+H]+. 1H NMR (CHCl3-d, 400 MHz): δH 8.43 (1H, s), 7.93 (1H, s), 7.55 (1H, s), 7.50 (1H, s), 3.79 (4H, s), 2.70 (4H, br s), 2.56 (3H, s), 2.47 (3H, s).
Compounds described herein were used to modulate RNA transcript abundance in cells. The expression of a target mRNA was measured by detecting the formation of an exon-exon junction in the canonical transcript (CJ). A compound mediated exon-inclusion event was detected by observing an increase in formation of a new junction with an alternative exon (AJ). Real-time qPCR assays were used to detect these splicing switches and interrogate the potency of various compounds towards different target genes. A high-throughput real time quantitative PCR (RT-qPCR) assay was developed to measure these two isoforms of the mRNA (CJ and AJ) for an exemplary gene, HTT, together with a control housekeeping gene, GAPDH or GUSB or PPIA, used for normalization. Briefly, the A673 or K562 cell line was treated with various compounds described herein (e.g., compounds of Formula (I)). After treatment, the levels of the HTT mRNA targets were determined from each sample of cell lysate by cDNA synthesis followed by qPCR.
The A673 cell line was cultured in DMEM with 10% FBS. Cells were diluted with full growth media and plated in a 96-well plate (15,000 cells in 100 ul media per well). The plate was incubated at 37° C. with 5% CO2 for 24 hours to allow cells to adhere. An 11-point 3-fold serial dilution of the compounds was made in DMSO then diluted in media in an intermediate plate. Compounds were transferred from the intermediate plate to the cell plate with the top dose at a final concentration of 10 uM in the well. Final DMSO concentration was kept at or below 0.25%. The cell plate was returned to the incubator at 37° C. with 5% CO2 for an additional 24 hours.
The K562 cell line was cultured in IMDM with 10% FBS. For K562, cells were diluted with full growth media and plated in either a 96-well plate (50,000 cells in 50 uL media per well) or a 384-well plate (8,000-40,000 cells in 45 uL media per well). An 11-point 3-fold serial dilution of the compounds were made in DMSO then diluted in media in an intermediate plate. Compound was transferred from the intermediate plate to the cell plate with the top dose at a final concentration of 10 uM in the well. Final DMSO concentration was kept at or below 0.25%. Final volume was 100 uL for 96-well plate and 50 uL for 384-well plate. The cell plate was then placed in an incubator at 37° C. with 5% CO2 for 24 hours.
The cells were then gently washed with 50 uL-100 uL cold PBS before proceeding to addition of lysis buffer. 30 uL-50 uL of room temperature lysis buffer with DNAse I (and optionally RNAsin) was added to each well. Cells were shaken/mixed thoroughly at room temperature for 5-10 minutes for lysis to take place and then 3 uL-5 uL of room temperature stop solution was added and wells were shaken/mixed again. After 2-5 minutes, the cell lysate plate was transferred to ice for RT-qPCR reaction setup. The lysates could also be frozen at ˜80° C. for later use.
In some cases, a direct lysis buffer was used. An appropriate volume of 3×lysis buffer (10 mM Tris, 150 mM NaCl, 1.5%-2.5% Igepal and 0.1-1 U/uL RNAsin, pH 7.4) was directly added to either K562 or A673 cells in media and mixed by pipetting 3 times. The plates were then incubated at room temperature with shaking/rocking for 20-50 minutes to allow for lysis to take place. After this time, the cell lysate plate was transferred to ice to set up for the RT-qPCR reactions. The lysates could also be frozen at −80° C. for later use.
To set up 10 uL RT-qPCR reactions, cell lysates were transferred to 384-well qPCR plates containing the master mix according to the table below. The plates were sealed, gently vortexed, and spun down before the run. The volumes were adjusted accordingly in some instances where the reaction was carried in 20 uL. The table below summarizes the components of the RT-qPCR reactions:
The RT-qPCR reaction was performed using a QuantStudio (ThermoFisher) under the following fast cycling conditions. All samples and standards were analyzed at least in duplicate. In some instances, bulk room temperature (RT) step of 5-10 minutes was completed for all plates before proceeding with qPCR. The table below summarizes the PCR cycle:
The data analysis was performed by first determining the ΔCt vs the housekeeper gene. This ΔCt was then normalized against the DMSO control (ΔΔCt) and converted to RQ (relative quantification) using the 2{circumflex over ( )}(−ΔΔCt) equation. The RQ were then converted to a percentage response by arbitrarily setting an assay window of 3.5 ΔCt for HTT-CJ and an assay window of 9 ΔCt for HTT-AJ. These assay windows correspond to the maximal modulation observed at high concentration of the most active compounds. The percentage response was then fitted to the 4 parametric logistic equation to evaluate the concentration dependence of compound treatment. The increase in AJ mRNA is reported as AC50 (compound concentration having 50% response in AJ increase) while the decrease in CJ mRNA levels is reported as IC50 (compound concentration having 50% response in CJ decrease).
A summary of these results is illustrated in Table 3, wherein “A” represents an AC50/IC50 of less than 100 nM; “B” represents an AC50/IC50 of between 100 nM and 1 μM; and “C” represents an AC50/IC50 of between 1 μM and 10 μM; and “D” represents an AC50/IC50 of greater than 10 μM.
Additional studies were carried out for a larger panel of genes using the protocol provided above. The junction between flanking upstream and downstream exons was used to design canonical junction qPCR assays. At least one of the forward primer, reverse primer or the CY5-labeled 5′ nuclease probe (with 3′ quencher such as ZEN/Iowa Black FQ) was designed to overlap with the exon junction to capture the CJ mRNA transcript. BLAST was used to confirm the specificity of the probeset and parameters such as melting temperature, GC content, amplicon size, and primer dimer formation are considered during their design. Data for the decrease in CJ mRNA levels for three exemplary genes (HTT, SMN2, and Target C) analyzed in this panel are reported as IC50 (compound concentration having 50% response in CJ decrease).
A summary of the results from the panel is illustrated in Table 4, wherein “A” represents an IC50 of less than 100 nM; “B” represents an IC50 of between 100 nM and 1 μM; and “C” represents an IC50 of between 1 μM and 10 μM; and “D” represents an IC50 of greater than 10 μM.
This application refers to various issued patents, published patent applications, journal articles, and other publications, all of which are incorporated herein by reference. If there is a conflict between any of the incorporated references and the instant specification, the specification shall control. In addition, any particular embodiment of the present invention that falls within the prior art may be explicitly excluded from any one or more of the claims. Because such embodiments are deemed to be known to one of ordinary skill in the art, they may be excluded even if the exclusion is not set forth explicitly herein. Any particular embodiment of the invention can be excluded from any claim, for any reason, whether or not related to the existence of prior art.
Those skilled in the art will recognize or be able to ascertain using no more than routine experimentation many equivalents to the specific embodiments described herein. The scope of the present embodiments described herein is not intended to be limited to the above Description, Figures, or Examples but rather is as set forth in the appended claims. Those of ordinary skill in the art will appreciate that various changes and modifications to this description may be made without departing from the spirit or scope of the present invention, as defined in the following claims.
This application is a U.S. National Phase Application under 35 U.S.C. § 371 of International Application PCT/US2021/026435, filed Apr. 8, 2021, which claims priority to U.S. Application No. 63/007,327, filed Apr. 8, 2020; U.S. Application No. 63/043,920, filed Jun. 25, 2020; U.S. Application No. 63/072,873, filed Aug. 31, 2020; and U.S. Application No. 63/126,493, filed Dec. 16, 2020. The disclosure of each of the foregoing applications is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/026435 | 4/8/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63126493 | Dec 2020 | US | |
63072873 | Aug 2020 | US | |
63043920 | Jun 2020 | US | |
63007327 | Apr 2020 | US |