The invention relates to the elucidation of the different forms of the Rel enzyme crystal structure, to screening methods to identify Rel modulators binding into the catalytic site of said crystal structures and to the molecules identified thereby. The invention is of particular interest to the field of molecular biology, more particular in the development of anti-persister drugs against microorganisms such as antibiotic resistant bacteria.
The overuse and misuse of antibiotics combined with a lack of progress in the development of new antibacterial drugs have led to the emergence of pathogenic antibiotic resistant bacteria. The incidence of these bacteria (also known as “superbugs”) is increasing at an alarming rate, and thus bacterial infections are resurging as a prominent threat to human health (Ventola, The antibiotic resistance crisis, Pharmacy and therapeutics, 2015). In the last years, multiple health instances have repeatedly warned about these pathogenic antibiotic (multi)resistant bacteria and the threats they pose to human health (Michael et al., The antimicrobial resistance crisis: causes, consequences, and management, Frontiers in public health, 2013).
One mechanism that bacteria use to survive in presence of antibiotics is by the phenomenon of bacterial persistence. Whereas the majority of a bacterial population will proliferate quickly in an infected host organism, a smaller fraction of this population will actively suppress growth. Since the majority of all clinically used antibiotics target rapidly dividing bacteria, the small population of bacteria in the persistence state will not be affected by these drugs and are able to switch back to their normal, non-persistent state post-antibiotic treatment(s). Hence, persister cells are a primary source of chronic infections because they are difficult or often even impossible to eradicate using conventional antibiotics. Additionally, these persisters provide a viable cell reservoir wherefrom resistant mutants can arise, because mutations increasing antibiotic tolerance favor selection of resistance mutations (Windels et al., Bacterial persistence promotes the evolution of antibiotic resistance, 2019).
A growing body of experimental evidence supports the notion that the stringent response, a bacterial phenotypic resetting that is crucial to cope with adverse environmental changes, is intricately involved in the formation of persister cells. A crucial mediator of this stringent response is the alarmone guanosine polyphosphate (guanosine 3′,5′-bisdiphosphate and guanosine 5′-triphosphate-3′-diphosphate), abbreviated as (p)ppGpp. The levels of (p)ppGpp are tightly regulated by the concerted opposing activities of RelA/SpoT homologue (RSH) enzymes that can both transfer a pyrophosphate group of ATP to the 3′ position of GDP (or GTP) or remove the 3′ pyrophosphate moiety from (p)ppGpp (Geiger et al., Role of the (p)ppGpp Synthase RSH, a RelA/SpoT Homolog, in Stringent Response and Virulence of Staphylococcus aureus, Infection and immunity, 2010). While RSH enzymes are universally conserved in bacteria, they are not present in humans which mark them as a very promising drug target. While progress is being made, these bifunctional RSH enzymes have proven to be difficult to structurally characterize since they display poor stability upon crystallization and have a tendency to aggregate. Despite a considerable amount of research, it remains unclear how the two opposing activities of Rel are controlled at the molecular level.
Taken together, there is an unmet need to develop innovative strategies that are effective in counteracting pathogenic antibiotic resistant bacteria and bacterial persistence in general, including new classes of antimicrobials, vaccines, and treatment strategies. Approaches which allow screening for compounds that are able to modulate one or both activities of RSH enzymes entail great value to generate these novel antimicrobials.
The present inventors have identified the structural changes that Rel, an RSH enzyme, undergoes upon ligand binding and which are necessary to perform its biological functions (i.e. Rel hydrolase and synthetase activity). Furthermore, the inventors have developed new screening methods to identify compounds that interfere with these conformational changes. Finally, promising Rel interacting compounds have been identified by these methods and are a potent manner of steering Rel activity, and thus counteracting persister cells.
As evidenced in the Examples section, by elucidation of the three dimensional structure of the complete Rel enzyme at an unprecedented resolution, distinct conformations of the protein could be observed which are each connected to different activity states of Rel. This information evidences the presence of an allosteric mechanism which acts as a Rel activity switch. The binding of GDP/ATP stretches apart the N-terminal catalytic domains of RelTf (RelTtNTD) activating the synthetase domain and allosterically blocking the hydrolase active site. Conversely, binding of ppGpp unlocks the hydrolase domain and triggers recoil of both NTDs, which partially buries the synthetase active site and precludes the binding of synthesis precursors. By further extensive structural analyses, key atomic coordinates of the three dimensional structures of Rel were identified that are able to discriminate the different conformational states. In addition, key amino acid residues were discovered in the protein structure that are of high value for candidate compounds to interact with and to modulate Rel synthetase and/or hydrolase activity. Based on this newly obtained structural information, screening methods were developed to identify such compounds. By using this novel screening approach, the inventors have found Rel interacting compounds that are capable of steering Rel activity, and are thus ultimately capable of counteracting persister cells.
The invention therefore relates to the following aspects:
As used herein, the singular forms “a”, “an”, and “the” include both singular and plural referents unless the context clearly dictates otherwise.
The terms “comprising”, “comprises” and “comprised of” as used herein are synonymous with “including”, “includes” or “containing”, “contains”, and are inclusive or open-ended and do not exclude additional, non-recited members, elements or method steps. The terms also encompass “consisting of” and “consisting essentially of”, which enjoy well-established meanings in patent terminology.
The recitation of numerical ranges by endpoints includes all numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.
The terms “about” or “approximately” as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, are meant to encompass variations of and from the specified value, such as variations of ±10% or less, preferably ±5% or less, more preferably ±1% or less, and still more preferably ±0.1% or less of and from the specified value, insofar such variations are appropriate to perform in the disclosed invention. It is to be understood that the value to which the modifier “about” refers is itself also specifically, and preferably, disclosed.
Whereas the terms “one or more” or “at least one”, such as one or more members or at least one member of a group of members, is clear per se, by means of further exemplification, the term encompasses inter alia a reference to any one of said members, or to any two or more of said members, such as, e.g., any 3 or more, 4 or more, 5 or more, 6 or more, or 7 or more etc. of said members, and up to all said members. In another example, “one or more” or “at least one” may refer to 1, 2, 3, 4, 5, 6, 7 or more.
The discussion of the background to the invention herein is included to explain the context of the invention. This is not to be taken as an admission that any of the material referred to was published, known, or part of the common general knowledge in any country as of the priority date of any of the claims.
Throughout this disclosure, various publications, patents and published patent specifications are referenced by an identifying citation. All documents cited in the present specification are hereby incorporated by reference in their entirety. In particular, the teachings or sections of such documents herein specifically referred to are incorporated by reference.
Unless otherwise defined, all terms used in disclosing the invention, including technical and scientific terms, have the meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. By means of further guidance, term definitions are included to better appreciate the teaching of the invention. When specific terms are defined in connection with a particular aspect of the invention or a particular embodiment of the invention, such connotation is meant to apply throughout this specification, i.e., also in the context of other aspects or embodiments of the invention, unless otherwise defined.
In the following passages, different aspects or embodiments of the invention are defined in more detail. Each aspect or embodiment so defined may be combined with any other aspect(s) or embodiment(s) unless clearly indicated to the contrary. In particular, any feature indicated as being preferred or advantageous may be combined with any other feature or features indicated as being preferred or advantageous.
Reference throughout this specification to “one embodiment”, “an embodiment” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment, but may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to a person skilled in the art from this disclosure, in one or more embodiments. Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention, and form different embodiments, as would be understood by those in the art. For example, in the appended claims, any of the claimed embodiments can be used in any combination.
Amino acids are referred to herein with their full name, their three-letter abbreviation or their one letter abbreviation.
The following detailed description is not to be taken in a limiting sense. The scope of the present invention is defined by the appended claims. It is evident that disclosed embodiments may relate to both RSH enzyme modulators such as Rel modulators and methods to identify Rel and/or RSH enzyme modulators. Certain embodiments directed to the Rel and/or RSH modulators may apply to the methods or uses described herein. It is evident that the terms such as “(candidate) compound”, “(candidate) binding compound”, and “(candidate) ligand” may be used interchangeably to describe the invention.
A skilled person is aware of standard molecular biology techniques that are available in the art (Sambrook et al., Molecular cloning: a laboratory manual, Cold Spring Harbor laboratory press, 1989; Ausubel et al., Current protocols in molecular biology, John Wiley and Sons, 1989; Perbal, A Practical Guide to Molecular Cloning, John Wiley & Sons, 1988; Watson et al., Recombinant DNA, Scientific American Books, New York; Birren et al. Genome Analysis: A Laboratory Manual Series, Vols. 1-4 Cold Spring Harbor laboratory press, New York, 1998).
The term “RSH enzymes” as used herein is an abbreviation for the group of RelA/SpoT homolog enzymes. RSH enzymes derive their name from the sequence similarity to the RelA and SpoT enzymes of Escherichia coli. RSH enzymes comprise a family of enzymes that synthesize and/or hydrolyze the alarmone ppGpp and play a central role in the bacterial stringent response. So-called “Long” RSH enzymes that comprise a hydrolase and synthetase domain have been identified in a vast and diverse amount of bacteria and plant chloroplasts, while specific RSH enzymes that only synthesize or hydrolyze (p)ppGpp have also been discovered in disparate bacteria and animals respectively. In the art, RSH enzymes are stratified into three groups based on their activity: long RSH enzymes, small alarmone synthetases (SASs), and small alarmone hydrolases (SAHs). These initial groups have been further classified in a plethora of subgroups (Atkinson et al., The RelA/SpoT Homolog (RSH) Superfamily: Distribution and functional evolution of ppGpp synthetases and hydrolases across the tree of life, Plos One, 2011). Long RSHs comprise two catalytic domains (the (p)ppGpp hydrolase (HD) domain and the (p)ppGpp synthetase (SYN) domain) and a C-terminal protein domain that is involved in regulation of the enzyme. In contrast, both SASs and SAHs lack the conserved C-terminal regulatory domain. According to the art, long RSHs are most broadly distributed and often further comprise TGS (ThrRS, GTPase, and SpoT) and ACT (Aspartokinase, Chorismate mutase and TyrA) domains in their C-terminal domain, which may play a role in sensing stress signals such as starvation signals and transducing said signal to the catalytic domain. The “Rel protein” or “Rel enzyme” as referred to herein is an example of a bifunctional RelA/SpoT homolog that is able to both synthesize and hydrolyse (p)ppGpp. Hence, Rel is able to control (p)ppGpp levels by its opposing activities. Structural similarities have been described between the Rel hydrolase domain and the 3′,5′-cyclic-nucleotide phosphodiesterase superfamily (Enzyme commission number (E.C. number) 3.1.4.17), as well as between the Rel synthetase domain and the nucleotidyltransferase superfamily (E.C. 2.7.7.-), particularly DNA polymerase β (Hogg et al., Conformational Antagonism between Opposing Active Sites in a Bifunctional RelA/SpoT Homolog Modulates (p)ppGpp Metabolism during the Stringent Response, Cell, 2004).
The term “small” as used as used herein, e.g. in terms such as “small molecule” or “small compound” or “small candidate (binding) compound” refers to a low molecular weight compound that is organic, anorganic or organometallic and has a molecular weight of less than 1000 Da, and for instance has a molecular weight of less than 900 Da, or less than 750 Da, or even less than 600 Da. Small compounds used in the methods herein may be naturally occurring or solely occurring due to chemical synthesis.
The term “stringent response”, used interchangeably in the art with “stringent control” is indicative for a stress response mediated by RSH enzymes in response to various stress conditions including the non-limiting examples of amino acid starvation, fatty acid limitation, iron limitation, and heat shock. In such stress conditions, the stringent response mediates a profound shift in gene expression from a program focused on growth to a gene expression profile that allows prolonged survival in a stationary phase following failure of aminoacyl-tRNA pools to support protein synthesis. Hence, the stringent response is a key mediator in the process of bacterial persister cell formation. The stringent response has been extensively described in the art (inter alia in Traxler et al., The global, ppGpp-mediated stringent response to amino acid starvation in Escherichia coli, Molecular microbiology, 2013). The stringent response is governed by the alarmones guanosine 5′,3′ bispyrophosphate and guanosine pentaphosphate (ppGpp and pppGpp respectively). (p)ppGpp accumulation will actively inhibit resource intensive cellular processes including replication, transcription and translation. (p)ppGpp has been demonstrated to bind to RNA polymerase proximal to its active site which causes a cessation of transcription of stable RNAs. Furthermore, (p)ppGpp decreases the half-life of the open complex at most promoters that have been tested in the art, hereby mediating a strong down regulation of promoters with intrinsically short half-lives, such as those of stable RNA genes. Taken together, the stringent response includes a large-scale down regulation of the translation apparatus (Barker et al., Mechanism of regulation of transcription initiation by ppGpp. Effects of ppGpp on transcription initiation in vivo and in vitro, Journal of molecular biology, 2001). Additionally, (p)ppGpp has been shown to upregulate transcription of promoters that act on amino acid biosynthesis genes together with RNA-polymerase binding transcription factor DksA (Paul et al., DksA potentiates direct activation of amino acid promoters by ppGpp, PNAS USA, 2005).
“Persister cells”, or short “persisters” as used herein is used to describe a population of bacterial cells that are in or going into a metabolically inactive (i.e. dormant) or near dormant state characterized by no growth or very slow growth, also called a stationary phase (Lewis, Persister cells, dormancy and infectious disease, Nature reviews microbiology, 2007). Typically, in an infected organism which is optionally being treated with antibiotics, persister cells amount to a small fraction of the total bacterial population present in said infected organism. Upon termination of antibiotics treatment, persister cells can leave their dormant state and return to a growth-focused gene expression signature, and expand to a full size bacterial infection. Persister cells are often described to constitute a subpopulation of bacteria that, due their slow growth rate, become highly tolerant to antibiotics. Persistent bacterial cells are not per definition originating from genetic mutation, although a skilled person is aware that persistence of a bacterial cell is associated with the emergence of antibiotic resistance (Windels et al., Bacterial persistence promotes the evolution of antibiotic resistance, 2019). Links between (p)ppGpp production and formation of bacterial persister cells have been described (inter alia in Korch et al., Characterization of the hipA7 allele of Escherichia coli and evidence that high persistence is governed by (p)ppGpp synthesis, Molecular microbiology, 2003). Persister cells may form within biofilms.
The term “biofilm” is commonly used in the art and is indicative for a collection or aggregate of (syntrophic) microorganisms such as bacteria wherein the different cells adhere to each other, and optionally the surface contacting the cells, or a portion of the cells. Biofilms are further characterized by a viscous extracellular matrix comprising extracellular polymeric substance (EPS) produced by microorganisms of the biofilm, wherein the microorganisms are embedded by the EPS. Biofilms may be formed both in or on organisms and on non-living surfaces in a wide array of different settings. Biofilms are complex microbiological systems wherein the microorganism comprised in said biofilm may be organized into a functional unit or functional community (Lopez et al., Biofilms, Cold Spring Harbor perspectives in biology, 2010).
The term “alarmones” is known to a skilled person and refers to intracellular signal molecules that are produced as a consequence of and in response to environmental cues. The main function of alarmones is to regulate gene expression. Typically, the concentration of alarmones rises when a cell experiences stressful environmental factors. (p)ppGpp is considered a textbook example of an alarmone (Hauryliuk et al., Recent functional insights into the role of (p)ppGpp in bacterial physiology, Nature reviews microbiology, 2015).
“Modulator” as used herein indicates a molecule that influences one or more (enzymatic) activities of one or more proteins upon interaction with (and/or binding of) said protein. As used herein, the modulating effect of the modulators described herein is intended to act on the hydrolase activity, or the synthetase activity, or both the hydrolase and synthetase activity of the Rel protein as defined herein. The principal binding site of a modulator is commonly termed the orthosteric site, which may be for example the active site of an enzyme where it engages in a binding with (a) substrate(s). Additionally, modulators may exert their activity by binding to a second binding site, commonly referred to as an allosteric binding site. In instances where a modulator impacts both the hydrolase and the synthetase activity, the direction of activity modulation is not limited to upregulation or downregulation of both enzymatic activities, but can also entail an upregulation of one enzymatic activity and downregulation of the other (e.g. upregulation of hydrolase activity and downregulation of synthetase activity by a single modulator, or vice versa). Hence, a modulator as discussed herein can refer to a molecule that is a hydrolase activator, hydrolase inhibitor, synthetase activator, or synthetase inhibitor, or any one or more of these. In certain embodiments, the molecule modulates the activity level of one enzymatic domain or enzymatic moiety but induces no significant effect on the activity level of the other enzymatic domain or enzymatic moiety.
“Synthetase” as used herein refers to an enzyme, or enzyme domain that catalyses a synthesis process. In the context of the invention, “synthetase activity” refers to the transfer of pyrophosphate from ATP to the 3′ position of the ribose of GDP or GTP. The term “hydrolase” used herein is indicative for a class of enzymes or enzyme domains that utilise water to disrupt, or break a chemical bond, generating two distinct molecules from one molecule. Hence, it is evident that hydrolase refers to an enzyme capable of conducting hydrolysis. Unless explicitly mentioned, by hydrolase activity herein is meant the hydrolysis of (p)ppGpp, i.e. removal of the 3′ pyrophosphate moiety from (p)ppGpp. Long RSH enzymes such as Rel are known to comprise both the synthetase and hydrolase activity described above and are thus able to both synthesize and degrade alarmones (i.e. (p)ppGpp).
In the context of the invention, a modulator is said to be an “inhibitor” when a consequence of interaction between the modulator and the target protein, here RSH enzymes such as the Rel protein, is that at least one activity of said target protein is reduced, either partially (i.e. to a certain degree) or completely. In the latter case it is understood that due to interaction with the modulator an enzymatic activity of the target protein is diminished to 0%, or below an activity level that can be measured by methods available in the art (such as in Gratani et al., Regulation of the opposing (p)ppGpp synthetase and hydrolase activities in a bifunctional RelA/SpoT homologue from Staphylococcus aureus, PLoS genetics, 2018). “Inhibition” as used herein refers to the inhibition of a process, herein a molecular process, more particularly either the RSH or Rel enzyme hydrolase activity, synthetase activity, or both. It is evident to a skilled person that inhibition can be used interchangeably with the term “attenuation”. In certain embodiments, the inhibitor selectively inhibits Rel hydrolase activity. In alternative embodiments, the inhibitor selectively inhibits Rel synthetase activity. In yet alternative embodiments, the inhibitor selectively inhibits both Rel hydrolase and synthetase activity. Both reversible and irreversible inhibitors are envisaged herein. “Reversible inhibition” and “irreversible inhibition” are known terms to person skilled in the art and are commonly used to further specify an enzyme inhibitor. Binding of an inhibitor to an enzyme is either reversible or irreversible. Irreversible inhibitors usually react with the enzyme and induce a chemical change or modification (e.g. via covalent bond formation). These inhibitors typically modify key amino acid residues needed for enzymatic activity. In contrast, reversible inhibitors bind non-covalently and different types of inhibition have been described depending on whether these inhibitors bind to the enzyme, the enzyme-substrate complex, or both. Methods to measure the dissociation constant (Kd) of a reversible inhibitor are well known to a skilled person (Pollard, A guide to simple and informative binding assays, Molecular biology of the cell, 2010).
The term “dissociation constant”, or “Kd” used herein is an equilibrium constant that quantitatively expresses the propensity of a larger object to separate or dissociate reversibly into smaller components. It is known to a person skilled in the art that the dissociation constant is routinely used to quantify the affinity between a ligand and a drug and is therefore indicative for how tightly or strongly a ligand binds to its target protein. The affinity of a ligand for a protein is associated with the amount of non-covalent intermolecular interactions between the ligand and the protein such as hydrogen bonds, electrostatic interactions, hydrophobic interactions and Van der Waals forces. In addition, the concentration of other molecules present in the proximal environment the ligand-protein interaction takes place in can also affect affinities. This observation is known to a skilled person as molecular crowding (Rivas et al., Macromolecular crowding in vitro, in vivo, and in between, Trends in biochemical sciences, 2016).
“In silico analysis” as defined herein is indicative for an analysis performed on a computing system or by use of a computer simulation system that is guided by a set of specific instructions such as a molecular docking computer program or tool. “Molecular docking” indicates a method that allows prediction of a binding and/or preferred orientation of one molecule to a second molecule when bound to each other to form a stable complex. Hence, it is understood that molecular docking software predicts the behaviour of molecules in binding sites of target proteins. Molecular docking software tools and programs that allow assessing of specificity of a candidate molecule or candidate compound against a particular target have been described in the art. Molecular docking software allows searching for complementarities between shape and/or electrostatics of binding sites surfaces and ligands. A molecular docking process can be separated into two major steps: searching and scoring. Numerous examples of different docking tools and programs have been described and are thus known to a skilled person (Pagadala et al., Software for molecular docking: a review, Biophysical Reviews, 2017). Two main popular molecular docking approaches have been described, a first being molecular docking relying on shape complementarity or geometric matching, and a second one relying on simulating the docking process whereby ligand-protein pairwise interaction energies are calculated.
A “conformational change” as described herein is to be understood as a change in the three-dimensional shape of a molecule, here an RSH enzyme such as Rel. A conformational change may be induced by numerous factors including the non-limiting examples of temperature, pH, voltage, light, ion concentration, post translational modification or binding to a second molecule. The conformational change as described in the current application is a consequence, either directly or indirectly, of binding to a modulator molecule. A protein may display different functions and/or engage in distinct interactions depending on its conformation. In light of the current invention, the conformational state may impact the hydrolase and/or synthetase activity levels. In certain embodiments, specific conformations partially or even completely inhibit hydrolase and/or synthetase activity. In alternative embodiments, specific conformations cause an upregulation of the hydrolase and/or synthetase activity. When “stabilization” of a conformational state is described in the context of the current invention upon binding a Rel modulator, it is intended that the Rel protein adopts a particular state such as but not limited to an open or closed state for at least the time window wherein candidate compound-Rel interaction is occurring.
The term “crystal structure” as used herein is a three-dimensional description of ordered arrangements or structures of elements such as atoms, ions, or molecules in a crystalline material. Crystal structure refers to a protein crystal structure obtained by protein crystallography, the process of forming a protein crystal by experimentation, unless stated otherwise. In a typical protein crystallization process, proteins are dissolved in an aqueous environment comprising a sample solution until supersaturation is obtained. Different approaches have been described in detail in the art and include as non-limiting examples vapor diffusion, batch, microdialysis and liquid-liquid diffusion. Once a protein crystal is obtained, different techniques such as X-ray diffraction, cryo-electron microscopy, or nuclear magnetic resonance are suitable to determine the protein crystal structure. The term “supersaturation” refers to a condition of a solution that contains more of a dissolved material than can be dissolved by the solvent under normal conditions and has been defined in the art as a non-equilibrium condition in which some quantity of the macromolecule in excess of the solubility limit, under specific chemical and physical conditions, is nonetheless present in solution (McPherson and Gavira, Introduction to protein crystallization, Structural biology communications, 2014). Protein crystals thus also compose a large amount of solvent molecules such as the non-limiting example of water. Due to the different methodologies for preparing a protein crystal, these crystals further comprise a varying range of buffers, salts, small binding proteins, and precipitation agents which can vary substantially in concentration.
Typical crystals have a size of between 20 μm to multiple mm. A crystal optimal for X-ray diffraction analysis is ideally free of cracks and other defects.
In certain embodiments, the amino acid sequence of Rel as used by the (screening) methods described herein has at least 70%, preferably at least 80% sequence identity to the amino acid sequence of the Thermus thermophilus RelA/SpoT (P)ppGpp synthetase I as defined in SEQ ID NO: 1:
In certain embodiments, the amino acid sequence of a Rel as used by the (screening) methods described herein has preferably at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, at least about 99.5% sequence identity to the amino acid sequence defined in SEQ ID NO: 1. In certain embodiments, the amino acid sequence of a Rel as used by the (screening) methods described herein is as defined in SEQ ID NO: 1.
In a first aspect, the invention relates to a method for identifying compounds that modulate Rel hydrolase and/or Rel synthetase activity comprising the step of employing a three dimensional structure represented by a set of atomic coordinates presented in Table 1, 2, 3, or 4 or a subset thereof, or atomic coordinates which deviate from those in Table 1, 2, 3, or 4, or a subset thereof, by RMSD over protein backbone atoms by no more than 3 Å and assessing the degree of fit to the three-dimensional protein structure of Rel of said candidate compound. “RMSD”, “root-mean-square deviation”, or “root-mean-square deviation of atomic positions” as used herein is indicative for a quantitative measurement of similarity between two or more protein structures, more specifically the measure of the average distance between the (backbone) atoms of superimposed proteins. The RMSD value is commonly calculated by the formula:
wherein δ is the distance between atom i and the mean position of the N equivalent atoms, or alternatively a reference structure. When calculating the RMSD for backbone, heavy atoms values are calculated for C, N, O, and Cα or solely for Cα. As a RMSD value represents a distance, the value is commonly expressed in the art in Å (Angstrom). 1 Å corresponds to 10−10 m, or 0.1 nanometer. A skilled person appreciates that a lower RMSD indicates smaller structural differences between the compared structures, or between a structure and a reference structure. In certain embodiments, the atomic coordinates used in the method deviate by no more than 2.5 Å, preferably no more than 2 Å, more preferable no more than 1.5 Å, even more preferably no more than 1 Å from the atomic coordinates of Table 1, 2, 3, or 4. In certain embodiments where a subset of atomic coordinates is used, the subset of coordinates is uniquely present in Table 1, 2, 3, or 4. In alternative embodiments where a subset of atomic coordinates is used, the subset of coordinates is present in each of Table 1, 2, 3, and 4.
The term “atomic coordinates” as used herein refers to a position of an atom in space, typically expressed by a set of X, Y, and Z Cartesian coordinates and the chemical element each atom represents. Atomic coordinates for a certain protein structure are typically combined in atomic coordinate data files, which can have various data formats, including the formats of Tables 1, 2, 3, or 4 as enclosed in this specification. Other non-limiting data formats include Protein Data Bank (PDB) format or various text formats. Minor variations in the atomic coordinates are envisaged, and the claims have been formulated with the intent of encompassing such variations. In certain embodiments, the atomic coordinates further contain additional information. It is evident to a skilled person that a three-dimensional rigid body rotation or a translation of said atomic coordinates does not alter the structure of the molecule. It is evident that, since the atomic coordinates disclosed herein are a relative collection of points delineating a three-dimensional structure, a distinct set of coordinates may define a similar or identical three-dimensional structure. In view hereof, multiple computer analysis tools and programs have been developed to assess whether a molecular structure bears similarity to the structured defined by the atomic coordinates, or a subset of atomic coordinates described herein in Table 1, 2, 3, or 4. By means of illustration and not limitation, a suitable software application for conducting such analyses is the Molecular Similarity program of QUANTA (Molecular Simulations Inc., San Diego, Calif.). The Molecular Similarity program and consorts permit extensive comparison between different structures, different conformations of the same structure, and different parts of the same structure. The method of comparison typically involves a step of calculating one or more optimal translations and rotations required such that the RMSD of the fit over the specified pairs of equivalent atoms is an absolute minimum. Therefore, atomic coordinates of a Rel protein or fragments leading to the atomic coordinates in any of Tables 1, 2, 3, or 4 by translations and/or rotations are within the scope of the present invention.
“Degree of fit”, or alternatively “goodness of fit” in the art, is an expression to indicate the likelihood that a certain candidate binding mode represents a favourable binding interaction and allows ranking of different ligands relative to each other. In certain embodiments, the degree of fit between the three-dimensional Rel structure and the candidate Rel modulator is expressed with a numerical value. In alternative embodiments, the degree of fit is expressed by an illustration of the superimposed Rel structure and the compound structure. In certain embodiments, the degree of fit of a ligand is expressed relative to the fit of a known ligand of the Rel protein. A degree of fit may be expressed as an absolute or relative value, depending on the template used for calculating the quantitative score. When the degree(s) of fit are expressed as absolute values, this absolute value corresponds to a score given to a candidate compound based on the number of interactions in silico predicted to occur with a set of atomic coordinates as described in Tables 1 to 4 herein, and/or with a set of amino acid residues in said region on the surface of the protein as described herein. Said number of interaction can be one or more such as two, three, four, five, six, seven, eight, nine, ten, more than ten, or all amino acid residues in said region on the surface of the protein as defined herein. In certain embodiments, the atomic coordinates described in Tables 1 to 4, and/or the amino acid residues cited herein to constitute a surface region of the protein are further abstracted to a pharmacophore, i.e. a set of molecular features required for molecular recognition of a ligand by a biological macromolecule, herein the candidate compound and the Rel protein, Rel hydrolase, or Rel synthetase domain. In certain embodiments, a degree of fit (i.e. a fitting score) of 2.4 is used as threshold for candidate compounds to be considered for further examination and/or validation. In alternative embodiments, a fitting score of 3.0 is used. In alternative embodiments, a fitting score of between 2.4 and 3.0 is used, preferably between 2.5 and 3.0, between 2.7 and 3.0, between 2.9 and 3.0. In alternative embodiments a fitting score of between 2.4 and 2.9 is used, preferably between 2.4 and 2.7, between 2.4 and 2.5. In certain embodiments, a variable fitting score threshold is used depending on the molecular weight of candidate compounds. In further embodiments, candidate compounds of 301 Da to 330 Da have a fitting score threshold of 2.4, candidate compounds of 331 Da to 380 Da a fitting score threshold of 2.5, candidate compounds of 381 Da to 420 Da a fitting score threshold of 2.7, candidate compounds of 421 Da to 490 Da a fitting score threshold of 2.9, and candidate compounds of 491 Da to 540 Da a fitting score threshold of 3.0. When the degree of fit is a relative value, this degree of fit may be expressed relative to a reference compound known to modulate the activity of the Rel protein. In such embodiments, a candidate compound is considered a bona fide modulator of Rel when the degree of fit is at least 50%, preferably at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, most preferably at least 95% to a reference compound known to modulate the activity of Rel. It is evident that a direct comparison between the degrees of fit of multiple ligands may be derived from this initial score. Numerous scoring functions or mechanisms have been described in the art (inter alia in Fu and Zhang, An overview of scoring functions used for protein-ligand interactions in molecular docking, Interdisciplinary Science: Computational Life Sciences, 2019), and it is evident that different such scoring functions are suitable for generating a degree of fit between a candidate compound and the Rel protein. For example, when using the AMBER scoring function (Wang et al., Development and testing of a general amber force field, Journal of Computational Chemistry, 2004), a candidate compound is considered to be a candidate bona fide modulator when a docking score threshold is met. In certain embodiments a docking score threshold of −8.9 kcal/mol is used. In certain embodiments a docking score threshold of between −8.9 kcal/mol and −10.5 kcal/mol is used. In further embodiments a docking score threshold of between −9.4 kcal/mol and −10.5 kcal/mol is used. In yet further embodiments a docking score threshold of between −9.7 kcal and −10.5 kcal/mol is used. In alternative further embodiments a docking score threshold of between −8.9 kcal/mol and −10.3 kcal/mol is used. In further embodiments a docking score threshold of between −8.9 kcal/mol and −9.7 kcal/mol is used. In further further embodiments a docking score threshold of between −8.9 kcal/mol and −9.4 kcal/mol is used. In alternative embodiments, a docking score threshold of −10.5 kcal/mol was used. In yet alternative embodiments, a variable docking score threshold was used, preferably based on the molecular weight of the candidate compounds. In further embodiments, compounds with a molecular weight of 301 Da to 330 Da are assigned a docking score threshold of −8.9 kcal/mol, compounds with a molecular weight of 331 Da to 380 Da are assigned a docking score of −9.4 kcal/mol, compounds with a molecular weight of 381 Da to 420 Da are assigned a docking score of −9.7 kcal/mol, compounds with a molecular weight of 421 Da to 490 Da are assigned a docking score of −10.3 kcal/mol, and compounds with a molecular weight of 491 to 540 Da are assigned a docking score threshold of −10.5 kcal/mol.
In certain embodiments, the method further comprises assessing interactions of said candidate compound to one or more amino acid residues of a region on the surface of the protein defined by amino acid residues: Arg43, Ser45, His156, Thr153, Met157, Asn150, Leu154, Lys161, Arg147, Lys143, Glu168, and 11e165 of the Rel amino acid sequence as defined in SEQ ID NO: 1, wherein an interaction indicates the candidate compound is a modulator of Rel hydrolase activity, or of Rel hydrolase and synthetase activity. In further embodiments, the method comprises assessing whether the candidate compound interacts with at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, or all amino acid residues of the group consisting of Arg43, Ser45, His156, Thr153, Met157, Asn150, Leu154, Lys161, Arg147, Lys143, Glu168, and 11e165 of the Rel amino acid sequence as defined in SEQ ID NO: 1. In certain embodiments, the candidate compound is an inhibitor of Rel hydrolase activity.
The term “region on the surface of the protein” as used herein intends to refer to a surface patch that defines a binding site which involves the residues that are listed with respect to said region.
In certain embodiments, the method further comprises assessing interactions of said candidate compound to one or more amino acid residues of a region on the surface of the protein defined by amino acid residues: Asn327, Tyr329, Lys325, His333, Arg277, Arg349, Gln347, Glu345, Asp272, Arg316, Lys251, Arg249, Ala275, Arg355, Ser255, and Lys186 of the Rel amino acid sequence as defined in SEQ ID NO: 1, wherein an interaction indicates the candidate compound is a modulator of Rel synthetase activity or of Rel synthetase and hydrolase activity. In certain embodiments, the method comprises assessing whether the candidate compound interacts with at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, at least 15, or all amino acid residues of the group consisting of Asn327, Tyr329, Lys325, His333, Arg277, Arg349, Gln347, Glu345, Asp272, Arg316, Lys251, Arg249, Ala275, Arg355, Ser255, and Lys186 of the Rel amino acid sequence as defined in SEQ ID NO: 1. In certain embodiments, the method further comprises a step to assess whether a candidate compound is an inhibitor of Rel synthetase activity based on interaction with a specific amino acid residue or a specific subset of amino residues of the group described above. In alternative embodiments, the method further comprises a step to assess whether a candidate compound is an inhibitor or Rel synthetase and hydrolase activity based on interaction with a specific amino acid residue or a specific subset of amino residues of the group described above.
In certain embodiments, the method further comprises assessing interactions of said candidate compound to one or more amino acid residues of a region on the surface of the protein defined by amino acid residues: Lys164, Asp200, Tyr201, Arg204, Tyr211, Lys212, His219, Arg221, Arg222, Arg225 of the Rel amino acid sequence as defined in SEQ ID NO: 1, wherein an interaction indicates the candidate compound is an allosteric compound or an effector of the Rel synthetase and/or hydrolase activity. In certain embodiments, the method comprises assessing whether the candidate allosteric compound interacts with at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or all amino acid residues of the group consisting of: Lys164, Asp200, Tyr201, Arg204, Tyr211, Lys212, His219, Arg221, Arg222, Arg225 of the Rel amino acid sequence as defined in SEQ ID NO: 1. In certain embodiments, the candidate allosteric compound is an allosteric hydrolase and/or synthetase inhibitor. In alternative embodiments, the candidate allosteric compound is an allosteric hydrolase and/or synthetase activator. In certain embodiments, the allosteric compound is a Rel hydrolase inhibitor and a Rel synthetase activator. In certain embodiments, the allosteric compound is a Rel hydrolase activator and a Rel synthetase inhibitor. In certain embodiments, the candidate allosteric compound inhibits the synthetase and/or hydrolase activity by at least 50%, preferably at least 60%, preferably at least 75%, more preferably at least 90%, most preferably at least 95% compared to the Rel synthetase activity and/or hydrolase activity in absence of said allosteric compound. In certain embodiments, the method further comprises comparing the conformational state of Rel with or without said candidate compound binds to the allosteric site of Rel, wherein a change in conformational state is indicative for the candidate compound to be a bona fide effector of the Rel hydrolase and/or synthetase activity, preferably wherein the conformational state of Rel without candidate binding is the conformational state characterized by the atomic coordinates of Table 4.
An “allosteric site” of an enzyme as used herein refers to a site not part of an active site of said enzyme discussed, thus a site other than the enzyme's active site(s). It is said that the regulatory site of an allosteric protein is physically distinct from its active (catalytic or enzymatic) site (Kirschner, Allosteric regulation of enzyme activity; an introduction to the molecular basis of and the experimental approaches to the problem, Current topics in microbiology and immunology, 1968). “Allosteric modulators” or “allosteric modulators” in the context of the invention are therefore modulators that bind to a site different to the enzyme's active site(s) but nonetheless have an effect on the enzymatic activity of said enzyme. Allosteric modulators that enhance the activity of an enzyme are referred to as allosteric activators. The latter can initiate and/or maintain a hyperactive enzyme. In contrast, modulators that decrease the enzymatic activity are called allosteric inhibitors. In the context of the current invention, allosteric modulators may be allosteric activators for a first enzymatic (hydrolase or synthetase) activity and optionally also be an allosteric inhibitor for a second enzymatic (hydrolase or synthetase) activity of the target protein. Since allosteric regulation is often induced by an effect of the allosteric modulator on the conformation of the target enzyme or affected enzymatic domain, it is therefore said that allosteric modulators may induce a conformational change on the protein they bind to.
In certain embodiments, the method further comprises determining a score of a candidate compound to modulate Rel hydrolase and/or Rel synthetase activity based on the number of interactions with said amino acid residues. In certain embodiments, the score is directly proportional to the amount of interactions with said residues. A skilled person understands that in these embodiments the score for Rel hydrolase modulators is dependent on the amount of interactions with amino acid residues of the group consisting of Arg43, Ser45, His156, Thr153, Met157, Asn150, Leu154, Lys161, Arg147, Lys143, Glu168, and Ile165 of the Rel amino acid sequence as defined in SEQ ID NO: 1, the score for Rel hydrolase modulators is dependent on the amount of interactions with amino acid residues of the group consisting of Asn327, Tyr329, Lys325, His333, Arg277, Arg349, Gln347, Glu345, Asp272, Arg316, Lys251, Arg249, Ala275, Arg355, Ser255, and Lys186 of the Rel amino acid sequence as defined in SEQ ID NO: 1, and the score for allosteric Rel synthetase and/or Rel hydrolase modulators is dependent on the amount of interactions with amino acid residues of the group consisting of Lys164, Asp200, Tyr201, Arg204, Tyr211, Lys212, His219, Arg221, Arg222, Arg225 of the Rel amino acid sequence as defined in SEQ ID NO: 1. It is evident that a single candidate compound may be characterized by two separate scores indicative for Rel hydrolase and Rel synthetase modulation respectively when compared to the hydrolase and synthetase activity of an identical Rel protein in absence of any hydrolase and/or synthetase modulator. The score may be expressed as an absolute value and/or as a relative value compared to one or more reference Rel modulator molecules. In an illustrative embodiment, the score may be a positive integer that is a sum of the number of interactions between the amino acid residues described herein and the candidate compound. In an alternative illustrative embodiment, the score may be a percentage, wherein 0% indicates no interaction(s) between the candidate compound and the Rel protein, and 100% indicates an interaction with each of the amino acid residues described herein that are indicated to form, or be part of, the relevant portion of the Rel surface region as defined herein. It is evident that a candidate compound with a higher score, said score being linearly correlated to the amount of interactions, indicates a higher likelihood of a candidate compound to be a strong modulator (e.g. inhibitor) of the Rel protein when compared to a candidate compound with a lower score.
In certain embodiments, the method further comprises comparing the conformational state of Rel before and after said candidate compound binds to Rel, wherein a change in conformational state is indicative for the candidate compound to be a bona fide modulator of Rel hydrolase and/or Rel synthetase activity, preferably wherein the conformational state of Rel before candidate binding is the resting conformational state characterized by the atomic coordinates of Table 1. In certain embodiments, the method comprises detection of any atomic coordinates that are different after binding of the candidate Rel modulator from the atomic coordinates characterizing the resting conformational state of Rel shown in Table 1. By a conformational change as used herein it is intended a structural change, or structural transition, in the three dimensional structure of the Rel before and after binding of the candidate compound to Rel. A conformational change can be any transition from the following Rel conformations: open conformation, closed conformation, intermediate conformation (indicative for a structurally folded Rel protein that is distinct from the open and closed conformation), and an (partially) unfolded Rel conformation.
In certain embodiments, the method further comprises comparing the conformational state of Rel with or without said candidate compound binds to Rel, wherein a change in conformational state is indicative for the candidate compound to be a bona fide modulator of Rel hydrolase or Rel hydrolase and synthetase activity, preferably wherein the conformational state of Rel without candidate binding is the (p)ppGpp bound conformational state characterized by the atomic coordinates of Table 3. In further embodiments, the candidate compound is considered a Rel hydrolase inhibitor by the methods described herein when upon binding with one or more of said Rel amino acid residues, Rel is stabilized in an open state. In further embodiments, the candidate compound (completely) inhibits Rel hydrolase activity and partially inhibits Rel synthetase activity when Rel is stabilized in an open state. In certain embodiments, the candidate inhibitor reduces Rel hydrolase activity by at least 50%, preferably at least 60%, preferably at least 75%, preferably at least 90%, more preferably at least 95%, most preferably at least 99% compared to the Rel hydrolase activity in absence of said inhibitor.
In certain embodiments, the method further comprises comparing the conformational state of Rel with or without said candidate compound binds to Rel, wherein a change in conformational state is indicative for the candidate compound to be a bona fide modulator of Rel synthetase or Rel synthetase and hydrolase activity, preferably wherein the conformational state of Rel without candidate binding is the AMP-G4P bound conformational state characterized by the atomic coordinates of Table 2. In further embodiments, the candidate compound is considered a Rel synthetase inhibitor when upon binding with one or more of said Rel amino acid residues, Rel is stabilized in a closed state. In further embodiments, the candidate compound (completely) inhibits Rel synthetase activity and partially inhibits Rel hydrolase activity when Rel is stabilized in an open state. In certain embodiments, the candidate inhibitor reduces Rel synthetase activity by at least 50%, preferably at least 60%, preferably at least 75%, preferably at least 90%, more preferably at least 95%, most preferably at least 99% compared to the Rel hydrolase activity in absence of said inhibitor.
In certain embodiments, the method further comprises testing of the ability of the candidate compounds for modulating Rel synthetase and/or Rel hydrolase activity. In certain embodiments, the method comprises in vitro and/or in vivo testing of the ability of the candidate compounds for Rel modulation Rel synthetase and/or Rel hydrolase activity. In certain embodiments, the testing of the candidate compounds involves testing of said compound in competition with one or more natural Rel substrates including but not limited to (p)ppGpp and/or AMP-G4P.
By means of an illustrative example, in vitro testing of the hydrolase activity of Rel in presence of an candidate Rel synthetase-modulating compound can comprise contacting said candidate compound with recombinant Rel protein and measuring removal of the 3′ pyrophosphate moiety from (p)ppGpp (i.e. monitoring the hydrolysis reaction mediated by Rel). In vitro synthetase activity testing can be performed in a similar assay, whereby synthesis of (p)ppGpp can be monitored (i.e. transfer of the pyrophosphate group of ATP onto the 3′ of GDP or GTP). Similar experimental conditions can be devised for in vivo activity testing. Methods for assessing a plethora of different enzymatic activities are known in the art (Ou et al., Methods of measuring enzyme activity ex vivo and in vivo, Annual review of analytical chemistry, 2018).
In certain embodiments, the methods described herein are computer-implemented methods. In further embodiments, the computer comprising an inputting device, a processor, a user interface, and an outputting device, wherein said method comprises the steps of:
a) generating a three-dimensional structure of said atomic coordinates, or said subset thereof;
b) fitting the structure of step a) with the structure of a candidate compound by computational modeling;
c) selecting a ligand that possesses energetically favorable interactions with the structure of step a).
In certain embodiments, the method further comprises selection of ligands that possess multiple energetically favorable interactions with said three-dimensional structure in favor of ligands that possess one energetically favorable interaction with said three-dimensional structure. In certain embodiments, the three-dimensional structure is generated using the atomic coordinates from at least one list of atomic coordinates of Table 2, 3, or 4. In alternative embodiments, the three-dimensional structure is generated using a subset of atomic coordinates from Table 2, 3, or 4. In further embodiments, the three-dimensional structure is generated using a subset of atomic coordinates that are unique for Table 2, 3, or 4. By the term “energetically favorable interaction” as used herein is envisaged any interaction with interaction energies <0 kJ/mol. Alternatively an energetically favorable interaction may be expressed as an interaction having a negative Gibbs free energy (ΔG) value. Since a protein-ligand association extent is correlated to the magnitude of a negative ΔG, ΔG can be regarded as determinant for the stability of the protein-ligand complex under investigation, or, alternatively, the binding affinity of a ligand to a given acceptor, in the context of the current specification the RSH enzyme Rel. Free energy is a function of the states of a system and, as thus, ΔG values are defined by the initial and final thermodynamic state, regardless of any intermediates states. The concept of energetically favorable interactions is known to a person skilled in the art (Du et al., Insights into Protein-Ligand Interactions: Mechanisms, Models, and Methods, International journal of molecular sciences, 2016).
In certain embodiments, the method comprises superimposing the generated three-dimensional structure with the structure of the candidate compound. In further embodiments, the method comprises selecting from a collection of distinct structure-candidate compound superimposed orientations a most favorable orientation of said structure with said candidate compound. Hence, in certain embodiments, the method comprises docking modeling or molecular docking. In certain embodiments, the method comprises a computer-implemented step of proposing candidate structure modifications to further increasing the number of favorable interactions with the generated three-dimensional structure. In yet further embodiments, the method comprises ranking an obtained collection of candidate compounds based on the number of favorable interactions they engage in with the generated three-dimensional structure, wherein candidate compounds with a higher number of favorable interactions are ranked higher than candidate compounds with fewer favorable interactions.
The terms “docking modeling” and “molecular docking” are indicative for one or more quantitative and/or qualitative analyses of a molecular structure based on structural information and interaction models. Modeling may refer to any one of numeric-based molecular dynamic models, interactive computer graphic models, energy minimization models, distance geometry, molecular mechanics models, or any structure-based constraints model. These illustrative molecular modeling approaches may be employed to the atomic coordinates or a subset of atomic coordinates as described herein in any one of Tables 1, 2, 3, or 4 to obtain a range of three-dimensional models and to investigate the structure of any binding sites, such as the binding sites of candidate Rel modulators. Modeling methods and tools have been developed to design or select chemical molecules that have a complementarity to particular target regions, in the context of the invention a particular target region of Rel. In certain embodiments, the chemical molecule, i.e. the candidate compound has a stereochemical complementarity to said target regions. Stereochemical complementarity refers to a scenario wherein there are a number of energetically favorable contacts between the candidate compound and (the target region of) Rel. A skilled person appreciates that if a certain number of energetically favorable interactions are sufficient to modulate Rel activity, and that it is thereby not a precondition that all the key amino acid residues as described herein are engaged in an energetically favorable interaction. Non-limiting examples of software programs suitable for conducting molecular docking analysis have been described in detail in the art (Pagadala et al., Software for molecular docking: a review, Biophysical Reviews, 2017).
Any computer system or any computer-implemented method relying on a computer system described herein may further comprise means for machine learning of said device to predict candidate Rel modulators and/or score candidate Rel modulators based on input of a reference set of candidate compounds by a user, or based on date generated from earlier fitting and/or selection steps of candidate modulators. The combination of machine learning models for in silico screening and prediction of enzyme binding molecules or modulators is known in the art, and therefore also envisaged by the current invention (Li, et al., Machine learning models combined with virtual screening and molecular docking to predict human Topoisomerase I inhibitors, Molecules, 2019). Non-limiting examples of machine learning models, i.e. machine learning algorithms include Linear regression, logistic regression, decision trees, support vector machines, naive Bayes, k-nearest neighbors (kNN), k-means, random forest, dimensionality reduction algorithms, and gradient boosting algorithms such as gradient boosting machine (GBM), XGBoost, LightGBM, and CatBoost.
In certain embodiments, the method comprises selecting a candidate compound that can bind to at least 1 amino acid residue, preferably more than 1 amino acid residue of the generated three-dimensional structure without steric interference. The terms “steric interference”, “steric hindrance”, and “steric effects” are known to a person skilled in the art. Steric interference or alternatively referred to as steric hindrance is a consequence of a steric effect, and indicates the slowing of chemical reactions due to steric bulk.
Further aspects herein relate to an in vitro method for identifying a compound which modulates Rel hydrolase and/or synthetase activity comprising the steps of:
a) providing a candidate compound;
b) providing the Rel polypeptide;
c) contacting said candidate compound with said Rel polypeptide;
d) determining the hydrolase and/or synthetase activity of Rel in the presence and absence of said candidate compound; and
e) identifying said candidate compound as a compound which modulates Rel hydrolase and/or synthetase activity if a change in activity is detected.
An illustrative method to assess hydrolase and/or synthetase activity is described above. In certain embodiments, the method comprises further selecting additional candidate compounds based on common structural features from a database. In certain embodiments, recombinant Rel protein is used in the methods described herein. Means and method to produce and purify recombinant protein have been described in detail in the art (inter alia in Grasslund et al., Protein production and purification, Nature methods, 2011). In certain embodiments, the complete Rel amino acid sequence is provided (i.e. SEQ ID NO: 1) or an amino acid sequence with at least 70%, preferably at least 80% sequence identity to SEQ ID NO: 1, while in alternative embodiments a functional fragment of Rel is provided, for example the hydrolase domain or the synthetase domain. In certain embodiments, the method may express a difference in hydrolase and/or synthetase activity or Rel in presence of absence of a candidate compound by a quantitative indication, such as a ratio. In certain embodiments, the method further comprises immobilization of the Rel protein or the candidate compound on a solid surface. In further embodiments, the method comprises a step of washing away excess Rel protein or excess candidate compound prior to determining the hydrolase and/or synthetase activity. In certain embodiments, the method comprises detecting a change in hydrolase and/or synthetase activity by colorimetry or spectrophotometry. In certain embodiments, a change of activity is considered as an increase of hydrolase and/or synthetase activity of the Rel protein by at least 10%, preferably 25%, preferably 50%, preferably 75%, preferably 100% in presence of said candidate compound when compared to the respective hydrolase and/or synthetase activity when the enzymatic activity of said Rel protein is assessed in absence of any (candidate) compound. In certain embodiments, a change of activity is considered as a decrease of hydrolase and/or synthetase activity of the Rel protein by at least 10%, preferably 25%, preferably 50%, preferably 75%, preferably 100% in presence of said candidate compound when compared to the respective hydrolase and/or synthetase when the enzymatic activity of said Rel protein is assessed in absence of any (candidate) compound. In certain embodiments, the method identifies candidate compounds capable of inhibiting the hydrolase and/or synthetase activity. In alternative embodiments, the method identifies candidate compounds capable of stimulating the hydrolase and/or synthetase activity.
Using the crystal structure as defined herein, the inventors have identified a number of candidate compounds that fit within said model and have subsequently confirmed their modulatory effect on the Rel enzyme activity.
Thus, a further aspect of the invention relates to Rel modulators obtained by any of the methods described herein. In certain embodiments, the present invention relates to a modulator of Rel hydrolase and/or synthetase activity obtained by the methods as defined herein. In certain embodiments, the present invention relates to an inhibitor of Rel hydrolase and/or synthetase activity obtained by the methods as defined herein. In certain embodiments, the present invention relates to an activator of Rel hydrolase and/or synthetase activity obtained by the methods as defined herein. In certain embodiments, the present invention relates to an effector of Rel hydrolase and/or synthetase activity obtained by the methods as defined herein.
The present invention thus also relates to modulators of Rel hydrolase and/or synthetase activity. In an embodiment, a compound as identified herein as “modulator of Rel hydrolase and/or synthetase activity” is a compound having a general formula selected from the group comprising formula (I), formula (II), formula (III) and formula (IV) as defined herein. The term “a compound of formula (I)”, or “a compound of formula (II)”, or “a compound of formula (III)”, or “a compound of formula (IV)”, as described herein, intends to also encompass an isomer, preferably a stereo-isomer or a tautomer, a solvate, a salt, preferably a pharmaceutically acceptable salt, or a prodrug of said (respective) compound, preferably a pharmaceutically acceptable salt, solvate, hydrate, polymorph, tautomer, stereoisomer, or prodrug of said (respective) compound.
In an embodiment, a compound as identified herein as modulator of Rel hydrolase and/or synthetase activity is
Preferred statements and embodiments of the compounds as identified and defined herein, as Rel modulators are set herein below. Each statement or embodiment of a compound (Rel modulator) so defined may be combined with any other statement and/or embodiment, unless clearly indicated to the contrary. In particular, any feature indicated as being preferred or advantageous may be combined with any other features or statements indicated as being preferred or advantageous. Hereto, embodiments of compounds as identified herein as Rel modulators are in particular captured by any one or any combination of one or more of the below numbered statements and embodiments, with any other aspect and/or embodiment.
Statement 1: Modulator of Rel hydrolase and/or synthetase activity, wherein said modulator is a compound of formula (I),
Statement 2: The modulator according to statement 1, wherein said modulator is a compound of formula (I),
Statement 3: The modulator according to any of statements 1 to 2, wherein said modulator is a compound of formula (I),
Statement 4: The modulator according to any of statements 1 to 3 wherein said modulator is a compound of formula (I),
Statement 5: The modulator according to any of statements 1 to 4, wherein said modulator is any of compounds 1 to 24 as selected from Table A. Yes/No in the 5th and 6th columns indicates whether there is, or respectively is not an inhibition of the synthetase (ST) and/or hydrolase (HD) activity.
Statement 6: The modulator according to any of statements 1 to 3, wherein said modulator is a compound of formula (I),
wherein m is an integer selected from 1, 2, 3, 4, or 5;
wherein each R1 is independently selected from halogen, ═O, nitro, or a group comprising hydroxyl, heterocyclyl, heteroaryl, C1-6alkyl, C2-6alkenyl, C5-12aryl, heteroC1-6alkyl, heteroC2-6alkenyl, C5-12arylC1-6alkyl, C5-12arylC2-6alkenyl-, C5-12aryl-heteroC1-6alkyl-, C5-12aryl-heteroC2-6alkenyl-, heterocyclyl-C1-6alkyl-, heterocyclyl-heteroC1-6alkyl-, heterocyclyl-C2-6alkenyl-, heterocyclyl-heteroC2-6alkenyl-, heteroaryl-C1-6alkyl-, heteroaryl-C1-6alkenyl-, heteroaryl-heteroC1-6alkyl-, heteroaryl-heteroC2-6alkenyl-, C1-6alkyl-SO2-, heteroC1-6alkyl-SO2-, heterocyclyl-SO2-, Heteroaryl-heteroC1-6alkyl-heteroaryl-, Heteroaryl-heteroC2-6alkenyl-heteroaryl-, Heteroaryl-C2-6alkenyl-heteroaryl-, C5-12Aryl-heteroaryl-C1-6alkyl-, C5-12aryl-heteroaryl-heteroC2-6alkenyl-, C1-6Alkyloxy-C5-12aryl-C2-6alkenyl-, C1-6Alkyloxy-heterocyclyl-heteroC1-6alkyl-, C5-12Aryl-heteroC1-6alkyl-heteroaryl-C1-6alkyl-, C2-6alkenyl-C5-12aryl-heteroC2-6alkenyl, C5-12aryl-imino-, heteroC1-6alkyl-C5-12aryl-imino-, C2-6alkenyl-C5-12aryl-imino-, and wherein each of said groups can be unsubstituted or substituted with one or more substituents each independently selected from the group comprising halogen, nitro, oxo, C1-6alkyloxy, —C(O)OH, hydroxyl, hydroxycarbonylC1-6alkyl-, hydroxycarbonylC2-6alkenyl-, C1-6alkyl, C2-6alkenyl, and C5-12aryl;
wherein cycle A is selected from the group represented by formula (Ib);
Statement 7: The modulator according to any of statements 1 to 3 and 6, wherein said modulator is a compound of formula (I),
wherein m is an integer selected from 1, 2, 3, 4, or 5; and preferably from 1, 2, 3 or 4
wherein each R1 is independently selected from halogen, ═O, nitro, or a group comprising hydroxyl, heteroaryl, heterocyclyl, heteroC1-6alkyl, heteroC2-6alkenyl, C1-6alkyl, C2-6alkenyl, C5-12aryl, C5-12arylC1-6alkyl, C5-12arylC2-6alkenyl-, C5-12aryl-heteroC1-6alkyl-, C5-12aryl-heteroC2-6alkenyl-, heterocyclyl-C1-6alkyl-, heterocyclyl-heteroC1-6alkyl-, heterocyclyl-C2-6alkenyl-, heterocyclyl-heteroC2-6alkenyl-, heteroaryl-C1-6alkyl-, heteroaryl-C1-6alkenyl-, heteroaryl-heteroC1-6alkyl-, heteroaryl-heteroC2-6alkenyl-, SO2-, heteroC1-6alkyl-SO2-, heterocyclyl-SO2-, Heteroaryl-heteroC1-6alkyl-hetero Heteroaryl-heteroC2-6alkenyl-heteroaryl-, Heteroaryl-C2-6alkenyl-heteroaryl-, C5-12Aryl-heteroaryl-C1-6alkyl-, C5-12aryl-heteroaryl-heteroC2-6alkenyl-, C1-6Alkyloxy-C5-12aryl-C2-6alkenyl-, C1-6Alkyloxy-heterocyclyl-heteroC1-6alkyl-, C5-1 2Aryl-heteroC1-6alkyl-heteroaryl-C1-6alkyl-, C2-6alkenyl-C5-12aryl-heteroC2-6alkenyl, C5-12aryl-imino-, heteroC1-6alkyl-C5-12-aryl-imino-, C2-6alkenyl-C5-12aryl-imino-, and wherein each of said groups can be unsubstituted or substituted with one or more substituents each independently selected from the group comprising halogen, nitro, oxo, C1-6alkyloxy, —C(O)OH, hydroxyl, hydroxycarbonylC1-6alkyl-, hydroxycarbonylC2-6alkenyl-, C1-6alkyl, C2-6alkenyl, and C5-12aryl, and
wherein cycle A is selected from the group represented by formula (Ib);
Statement 8: The modulator according to any of statements 1 to 3 and 6 to 7, wherein said modulator is any of compounds 25 to 46 as selected from Table B. Yes/No in the 5th and 6th columns indicates whether there is, or respectively is not an inhibition of the synthetase (ST) and/or hydrolase (HD) activity.
Statement 9: Modulator of Rel hydrolase and/or synthetase activity, wherein said modulator is a compound of formula (II), or an isomer, preferably a stereo-isomer or a tautomer, a solvate, a salt, preferably a pharmaceutically acceptable salt, or a prodrug thereof,
Statement 10: The modulator according to statement 9, wherein said modulator is a compound of formula (II),
Statement 11: The modulator according any of statements 9 or 10, wherein said modulator is a compound of formula (II),
Statement 12: The modulator according to any of statements 9 to 11, wherein said modulator is any of compounds 47-49 as selected from Table C. Yes/No in the 5th and 6th columns indicates whether there is, or respectively is not an inhibition of the synthetase (ST) and/or hydrolase (HD) activity.
Statement 13: The modulator according to statement 9 or 10, wherein said modulator is a compound of formula (II),
Statement 14: The modulator according to any of statements 9, 10, and 13, wherein said modulator is any of compounds 50-51 as selected from Table D. Yes/No in the 5th and 6th columns indicates whether there is, or respectively is not an inhibition of the synthetase (ST) and/or hydrolase (HD) activity.
Statement 15: Modulator of Rel hydrolase and/or synthetase activity, wherein said modulator is a compound of formula (III), or an isomer, preferably a stereo-isomer or a tautomer, a solvate, a salt, preferably a pharmaceutically acceptable salt, or a prodrug thereof,
Statement 16: Modulator according to statement 15, wherein said modulator is a compound of formula (III),
Statement 17: Modulator according to statement 15 or 16, wherein said modulator is a compound of formula (III),
Statement 18: Modulator according to any of statement 15 to 17, wherein said modulator is a compound of formula (III), and wherein cycle C is selected from the group comprising
Statement 19: Modulator according to any of statements 15 to 18 wherein said modulator is any of compounds 52-56 as selected from Table E. Yes/No in the 5th and 6th columns indicates whether there is, or respectively is not an inhibition of the synthetase (ST) and/or hydrolase (HD) activity.
Statement 20: Modulator of Rel hydrolase and/or synthetase activity, wherein said modulator is a compound of formula (IV),
Statement 21: The modulator according to statement 20,
Statement 22: The modulator according to statement 20 or 21, wherein said modulator is a compound of formula (IV),
Statement 23: The modulator according to any of statements 20 to 22,
wherein cycle D is a heteroaryl, and preferably a heteroaryl selected from the group comprising triazol-2-yl, pyridinyl, 1H-pyrazol-5-yl, pyrrolyl, furanyl, thiophenyl, pyrazolyl, imidazolyl, oxazolyl, isoxazolyl, thiazolyl, isothiazolyl, triazolyl, oxadiazolyl, thiadiazolyl, tetrazolyl, oxatriazolyl, thiatriazolyl, pyrimidyl, pyrazinyl, pyridazinyl, oxazinyl, dioxinyl, thiazinyl, triazinyl, imidazo[2,1-b][1,3]thiazolyl, thieno[3,2-b]furanyl, thieno[3,2-b]thiophenyl, thieno[2,3-d][1,3]thiazolyl, thieno[2,3-d]imidazolyl, tetrazolo[1,5-a]pyridinyl, indolyl, indolizinyl, isoindolyl, benzofuranyl, isobenzofuranyl, benzothiophenyl, isobenzothiophenyl, indazolyl, benzimidazolyl, 1,3-benzoxazolyl, 1,2-benzisoxazolyl, 2,1-benzisoxazolyl, 1,3-benzothiazolyl, 1,2-benzoisothiazolyl, 2,1-benzoisothiazolyl, benzotriazolyl, 1,2,3-benzoxadiazolyl, 2,1,3-benzoxadiazolyl, 1,2,3-benzothiadiazolyl, 2,1,3-benzothiadiazolyl, benzo[d]oxazol-2(3H)-one, 2,3-dihydro-benzofuranyl, thienopyridinyl, purinyl, imidazo[1,2-a]pyridinyl, 6-oxo-pyridazin-1(6H)-yl, 2-oxopyridin-1(2H)-yl, 6-oxo-pyridazin-1(6H)-yl, 2-oxopyridin-1(2H)-yl, 1,3-benzodioxolyl, quinolinyl, isoquinolinyl, cinnolinyl, quinazolinyl, and quinoxalinyl,
wherein said cycle D is an aryl, and preferably an aryl selected from the group comprising phenyl, biphenyl, naphthyl, 5,6,7,8-tetrahydronaphthalenyl, 1,2,6,7,8,8a-hexahydroacenaphthylenyl, 1,2-dihydroacenaphthylenyl, and 2,3-dihydro-1H-indenyl.
Statement 24: The modulator according to any of statements 20 to 23, wherein said modulator is any of the compounds 57-74 as selected from Table F. Yes/No in the 5th and 6th columns indicates whether there is, or respectively is not an inhibition of the synthetase (ST) and/or hydrolase (HD) activity.
Statement 25: Modulator of Rel hydrolase and/or synthetase activity, wherein said modulator is a compound selected from the group of compounds as given in Table A, Table B, Table C, Table D, Table E, and Table F, or an isomer, preferably a stereo-isomer or a tautomer, a solvate, a salt, preferably a pharmaceutically acceptable salt, or a prodrug thereof.
In Tables A to F the identified compounds were indicated by their respective Molport ID's (https://www.molport.com) or Zinc ID's (https://www.zinc.docking.org).
As indicated above, the “Yes/No” in the 5th and 6th columns of TABLES A to F indicates whether there is, or respectively is not an inhibition of the synthetase (ST) and/or hydrolase (HD) activity. Activities of all the above listed compounds (or other compounds as encompassed by the present invention) were confirmed via in vitro biochemical testing. The action of a candidate compound is evaluated based on its effect on the production of (p)ppGpp. Therefore, in order to assess the potential influence of a candidate compound on synthesis activity, the enzyme was mixed with GDP or GTP and radioactive ATP to produce (p)ppGpp in absence or presence of the candidate compound. Subsequently, the reaction is developed by thin layer chromatography (TLC). In an alternative assay that enables determining the effect of a compound on the hydrolysis activity, radioactive (p)ppGpp is incubated with the enzyme and any decrease of the (p)ppGpp spot on TLC is evaluated when the assay is conducted in presence of the compound and compared to control conditions where no candidate compound is added during incubation. Hence, the inhibitory activity of compounds is assessed based on the effect on the hydrolysis reaction.
The term “alkyl” or “C1-18alkyl” as used herein means C1-C18 normal, secondary, or tertiary, linear, branched or straight hydrocarbon with no site of unsaturation. Examples are methyl, ethyl, 1-propyl (n-propyl), 2-propyl (iPr), 1-butyl, 2-methyl-1-propyl(i-Bu), 2-butyl (s-Bu), 2-dimethyl-2-propyl (t-Bu), 1-pentyl (n-pentyl), 2-pentyl, 3-pentyl, 2-methyl-2-butyl, 3-methyl-2-butyl, 3-methyl-1-butyl, 2-methyl-1-butyl, 1-hexyl, 2-hexyl, 3-hexyl, 2-methyl-2-pentyl, 3-methyl-2-pentyl, 4-methyl-2-pentyl, 3-methyl-3-pentyl, 2-methyl-3-pentyl, 2,3-dimethyl-2-butyl, 3,3-dimethyl-2-butyl, n-heptyl, n-octyl, n-nonyl, n-decyl, n-undecyl, n-dodecyl, n-tridecyl, n-tetradecyl, n-pentadecyl, n-hexadecyl, n-heptadecyl, n-octadecyl, n-nonadecyl, and n-icosyl. In particular embodiments, the term alkyl refers to C1-18alkyl (C1-18 hydrocarbons), for instance C1-12alkyl (C1-12 hydrocarbons), yet more in particular to C1-9alkyl (C1-9 hydrocarbons), yet more in particular to C1-6alkyl (C1-6 hydrocarbons) as further defined herein above.
The term “haloalkyl” as a group or part of a group, refers to an alkyl group having the meaning as defined above wherein one, two, or three hydrogen atoms are each replaced with a halogen as defined herein. Non-limiting examples of such haloalkyl groups include chloromethyl, 1-bromoethyl, fluoromethyl, difluoromethyl, trifluoromethyl, 1,1,1-trifluoroethyl and the like.
The term “alkoxy” or “alkyloxy”, as a group or part of a group, refers to a group having the formula —ORb wherein Rb is C1-6alkyl as defined herein above. Non-limiting examples of suitable C1-6alkoxy include methoxy, ethoxy, propoxy, isopropoxy, butoxy, isobutoxy, sec-butoxy, tert-butoxy, pentyloxy and hexyloxy. The term “haloalkoxy”, as a group or part of a group, refers to a group of formula —O—Rc, wherein Rc is haloalkyl as defined herein. Non-limiting examples of suitable haloalkoxy include fluoromethoxy, difluoromethoxy, trifluoromethoxy, 2,2,2-trifluoroethoxy, 1,1,2,2-tetrafluoroethoxy, 2-fluoroethoxy, 2-chloroethoxy, 2,2-difluoroethoxy, 2,2,2-trichloroethoxy, trichloromethoxy, 2-bromoethoxy, pentafluoroethyl, 3,3,3-trichloropropoxy, 4,4,4-trichlorobutoxy.
The term “cycloalkyl” or “C3-18cycloalkyl” as used herein and unless otherwise stated means a saturated hydrocarbon monovalent group having from 3 to 18 carbon atoms consisting of or comprising a C3-10 monocyclic or C7-18 polycyclic saturated hydrocarbon, such as for instance cyclopropyl, cyclobutyl, cyclopentyl, cyclopropylethylene, methylcyclopropylene, cyclohexyl, cycloheptyl, cyclooctyl, cyclooctylmethylene, norbornyl, fenchyl, trimethyltricycloheptyl, decalinyl, adamantyl and the like. In particular embodiments, the term cycloalkyl refers to C3-10cycloalkyl (saturated cyclic C3-10hydrocarbons), yet more in particular to C3-9cycloalkyl (saturated cyclic C3-9hydrocarbons), still more in particular to C3-6cycloalkyl (saturated cyclic C3-6hydrocarbons) as further defined herein above.
The term “alkenyl” or “C2-18alkenyl” as used herein is C2-C18 normal, secondary or tertiary, linear, branched or straight hydrocarbon with at least one site (usually 1 to 3, preferably 1) of unsaturation, namely a carbon-carbon, sp2 double bond. Examples include, but are not limited to: ethylene or vinyl (—CH═CH2), allyl (—CH2CH═CH2), and 5-hexenyl (—CH2CH2CH2CH2CH═CH2). The double bond may be in the cis or trans configuration. In particular embodiments, the term alkenyl refers to C2-12alkenyl (C2-12hydrocarbons), yet more in particular to C2-9 alkenyl (C2-9 hydrocarbons), still more in particular to C2-6 alkenyl (C2-6hydrocarbons) as further defined herein above with at least one site (usually 1 to 3, preferably 1) of unsaturation, namely a carbon-carbon, sp2 double bond.
The term “alkenyloxy”, as a group or part of a group, refers to a group having the formula —ORd wherein Rd is alkenyl as defined herein above.
The term “cycloalkenyl” as used herein refers to a non-aromatic hydrocarbon group having from 5 to 18 carbon atoms with at least one site (usually 1 to 3, preferably 1) of unsaturation, namely a carbon-carbon, sp2 double bond and consisting of or comprising a C5-10 monocyclic or C7-18 polycyclic hydrocarbon. Examples include, but are not limited to: cyclopentenyl (—C5H7), cyclopentenylpropylene, methylcyclohexenylene and cyclohexenyl (—C6H9). The double bond may be in the cis or trans configuration. In particular embodiments, the term cycloalkenyl refers to C5-12cycloalkenyl (cyclic C5-12 hydrocarbons), yet more in particular to C5-9cycloalkenyl (cyclic C5-9 hydrocarbons), still more in particular to C5-6cycloalkenyl (cyclic C5-6hydrocarbons) as further defined herein above with at least one site of unsaturation, namely a carbon-carbon, sp2 double bond.
The term “alkylene” as used herein each refer to a saturated, branched or straight chain hydrocarbon group of 1-18 carbon atoms (more in particular C1-12, C1-9 or C1-6 carbon atoms), and having two monovalent group centers derived by the removal of two hydrogen atoms from the same or two different carbon atoms of a parent alkane. Typical alkylene include, but are not limited to: methylene (—CH2—), 1,2-ethyl (—CH2CH2—), 1,3-propyl (—CH2CH2CH2—), 1,4-butyl (—CH2CH2CH2CH2—), and the like.
The term “alkenylene” as used herein each refer to a branched or straight chain hydrocarbon of 2-18 carbon atoms (more in particular C2-12, C2-9 or C2-6 carbon atoms) with at least one site (usually 1 to 3, preferably 1) of unsaturation, namely a carbon-carbon, sp2 double bond, and having two monovalent centers derived by the removal of two hydrogen atoms from the same or two different carbon atoms of a parent alkene.
The term “heteroalkyl” as used herein refers to an alkyl wherein one or more carbon atoms are replaced by one or more atoms independently selected from the group comprising oxygen, nitrogen and sulphur atom with the proviso that said chain may not contain two adjacent O atoms or two adjacent S atoms. Said one or more atoms replacing said carbon atoms may be positioned at the beginning of the hydrocarbon chain, in the hydrocarbon chain or at the end of the hydrocarbon chain. This means that one or more —CH3 of said alkyl can be replaced by —NH2 and/or that one or more —CH2— of said alkyl can be replaced by —NH—, —O— or —S—. In some embodiments the term heteroalkyl encompasses an alkyl which comprises one or more heteroatoms in the hydrocarbon chain, said heteroatoms being selected from the atoms consisting of O, S, and N, whereas the heteroatoms may be positioned at the beginning of the hydrocarbon chain, in the hydrocarbon chain or at the end of the hydrocarbon chain. The S atoms in said chains may be optionally oxidized with one or two oxygen atoms, to afford sulfoxides and sulfones, respectively. Furthermore, the heteroalkyl groups in the compounds of the present invention can contain an oxo or thio group at any carbon or heteroatom that will result in a stable compound. Exemplary heteroalkyl groups include, but are not limited to, alcohols, alkyl ethers, primary, secondary, and tertiary alkyl amines, amides, ketones, esters, alkyl sulfides, and alkyl sulfones. The term heteroalkyl thus comprises but is not limited to —Ra—S—; —Ra—O—, —Ra—N(Ro)2—O—Rb, —NRo—Rb, —Ra—O—Rb, —O—Ra—S—Rb, —S—Ra, —O—Ra—NR0Rb, —NRo—Ra—S—Rb, —Ra—NRo—Rb, —NR0Ra—S—Rb, —S—Rb, wherein Ra is alkylene, Rb is alkyl, and Ro is hydrogen or alkyl as defined herein. In particular embodiments, the term encompasses heteroC1-12alkyl, heteroC1-9alkyl and heteroC1-6alkyl. In some embodiments heteroalkyl is selected from the group comprising alkyloxy, alkyl-oxy-alkyl, (mono or di)alkylamino, (mono or di-)alkyl-amino-alkyl, alkylthio, and alkyl-thio-alkyl.
The term “heteroalkenyl” as used herein refers to an alkenyl wherein one or more carbon atoms are replaced by one or more atoms independently selected from oxygen, nitrogen and sulphur atom, with the proviso that said chain may not contain two adjacent O atoms or two adjacent S atoms. Said one or more atoms replacing said carbon atoms may be positioned at the beginning of the hydrocarbon chain, in the hydrocarbon chain or at the end of the hydrocarbon chain. This means that one or more —CH3 of said alkenyl can be replaced by —NH2, that one or more —CH2— of said alkenyl can be replaced by —NH—, —O— or —S— and/or that one or more —CH═ of said alkenyl can be replaced by —N═. In some embodiments the term heteroalkenyl encompasses an alkenyl which comprises one or more heteroatoms in the hydrocarbon chain, said heteroatoms being selected from the atoms consisting of O, S, and N, whereas the heteroatoms may be positioned at the beginning of the hydrocarbon chain, in the hydrocarbon chain or at the end of the hydrocarbon chain. The S atoms in said chains may be optionally oxidized with one or two oxygen atoms, to afford sulfoxides and sulfones, respectively. Furthermore, the heteroalkyl groups in the compounds of the present invention can contain an oxo or thio group at any carbon or heteroatom that will result in a stable compound. The term heteroalkenyl thus comprises imines, —O-alkenyl, —NH-alkenyl, —N(alkenyl)2, —N(alkyl)(alkenyl), and —S-alkenyl. The term heteroalkenyl thus comprises but is not limited to —Rd—O—, —O—Rd, —NH—(Rd), —N═Rd, —N(Rd))2, —N(Rb)(Rd), —NH—NH—Rd, —Rd═N—N═Rd, —Rd═N—N═, —Rd—S—, —S—Rd wherein Rb is alkyl and Rd is alkenyl as defined herein. In particular embodiments, the term heteroalkenyl encompasses heteroC2-18alkenyl, heteroC2-12alkenyl, heteroC2-9alkenyl and heteroC2-6alkenyl. In some embodiments heteroalkenyl is selected from the group comprising alkenyloxy, alkenyl-oxy-alkenyl, (mono or di-)alkenylamino, (mono or di-)alkenyl-amino-alkenyl, alkenylthio, and alkenyl-thio-alkenyl.
The term “heteroalkylene” as used herein refers to an alkylene wherein one or more carbon atoms are replaced by one or more oxygen, nitrogen or sulphur atoms, with the proviso that said chain may not contain two adjacent O atoms or two adjacent S atoms. This means that one or more —CH3 of said alkylene can be replaced by —NH2 and/or that one or more —CH2— of said alkylene can be replaced by —NH—, —O— or —S—. The S atoms in said chains may be optionally oxidized with one or two oxygen atoms, to afford sulfoxides and sulfones, respectively. Furthermore, the heteroalkylene groups in the compounds of the present invention can contain an oxo or thio group at any carbon or heteroatom that will result in a stable compound.
The term “heteroalkenylene” as used herein refers to an alkenylene wherein one or more carbon atoms are replaced by one or more oxygen, nitrogen or sulphur atoms, with the proviso that said chain may not contain two adjacent O atoms or two adjacent S atoms. This means that one or more —CH3 of said alkenylene can be replaced by —NH2, that one or more —CH2— of said alkenylene can be replaced by —NH—, —O— or —S— and/or that one or more —CH═ of said alkenylene can be replaced by —N═. The S atoms in said chains may be optionally oxidized with one or two oxygen atoms, to afford sulfoxides and sulfones, respectively. Furthermore, the heteroalkenylene groups in the compounds of the present invention can contain an oxo or thio group at any carbon or heteroatom that will result in a stable compound.
The term “aryl” as used herein means an aromatic hydrocarbon of 5-20 carbon atoms derived by the removal of hydrogen from a carbon atom of an aromatic ring system. Examples of aryl groups include, but are not limited to 1 ring, or 2 or 3 rings fused together, of which at least one ring is aromatic. Such ring can be derived from benzene, naphthalene, anthracene, biphenyl, 2,3-dihydro-1H-indenyl, 5,6,7,8-tetrahydronaphthalenyl, 1,2,6,7,8,8a-hexahydroacenaphthylenyl, 1,2-dihydroacenaphthylenyl, and the like. Particular aryl groups are phenyl and naphthyl, especially phenyl.
The term “aryloxy”, as a group or part of a group, refers to a group having the formula —ORg wherein Rg is aryl as defined herein above.
The term “arylalkyl” or “arylalkyl-” as used herein refers to an alkyl in which one of the hydrogen atoms bonded to a carbon atom, typically a terminal or sp3 carbon atom, is replaced with an aryl. Typical arylalkyl groups include, but are not limited to, benzyl, 2-phenylethan-1-yl, 2-phenylethen-1-yl, naphthylmethyl, 2-naphthylethyl, and the like. The arylalkyl group can comprise 6 to 20 carbon atoms, e.g. the alkyl moiety of the arylalkyl group is 1 to 6 carbon atoms and the aryl moiety is 5 to 14 carbon atoms.
The term “arylalkyloxy”, as a group or part of a group, refers to a group having the formula —O—Ra—Rg wherein Rg is aryl, and Ra is alkylene as defined herein above.
The term “arylalkenyl” or “arylalkenyl-” as used herein refers to an alkenyl in which one of the hydrogen atoms bonded to a carbon atom, is replaced with an aryl.
The term “aryl-alkenyl” as a group or part of a group refers to an alkenyl in which one of the hydrogen atoms bonded to a carbon atom, is replaced with an aryl. The aryl-alkenyl group can comprise 6 to 20 atoms, e.g. the alkenyl moiety of the aryl-alkenyl group can comprise 1 to 6 carbon atoms and the aryl moiety can comprise 5 to 14 atoms, such as ═CH—Rg, wherein Rg is aryl as defined herein above.
The term “arylheteroalkyl” or “arylheteroalkyl-” as used herein refers to a heteroalkyl in which one of the hydrogen atoms bonded to a carbon atom, typically a terminal or sp3 carbon atom, is replaced with an aryl. The arylheteroalkyl group can comprise 6 to 20 carbon atoms, e.g. the heteroalkyl moiety of the arylheteroalkyl group is 1 to 6 carbon atoms and the aryl moiety is 5 to 14 carbon atoms. In some embodiments arylheteroalkyl is selected from the group comprising aryl-O-alkyl, arylalkyl-O-alkyl, aryl-NH-alkyl, aryl-N(alkyl)2, arylalkyl-NH-alkyl, arylalkyl-N-(alkyl)2, aryl-S-alkyl, and arylalkyl-S-alkyl.
The term “arylheteroalkenyl” or “arylheteroalkenyl-” as used herein refers to a heteroalkenyl in which one of the hydrogen atoms bonded to a carbon atom, is replaced with an aryl. The arylheteroalkenyl group can comprise 6 to 20 carbon atoms, e.g. the heteroalkenyl moiety of the arylheteroalkenyl group is 1 to 6 carbon atoms and the aryl moiety is 5 to 14 carbon atoms. In some embodiments arylheteroalkenyl is selected from the group comprising aryl-O-alkenyl, arylalkenyl-O-alkenyl, aryl-NH-alkenyl, arylalkenyl-NH-alkenyl, aryl-S-alkenyl, and arylalkenyl-S-alkenyl.
The term “heterocyclyl” as used herein refer to non-aromatic, fully saturated or partially unsaturated ring system of 3 to 18 atoms including at least one N, O, S, or P (for example, 3 to 7 member monocyclic, 7 to 11 member bicyclic, or comprising a total of 3 to 10 ring atoms) wherein at least one ring is a heterocyclyl and wherein said ring may be fused to an aryl, cycloalkyl, heteroaryl and/or heterocyclyl ring. Each ring of the heterocyclyl may have 1, 2, 3 or 4 heteroatoms selected from N, O and/or S, where the N and S heteroatoms may optionally be oxidized and the N heteroatoms may optionally be quaternized; and wherein at least one carbon atom of heterocyclyl can be oxidized to form at least one C═O. The heterocyclic may be attached at any heteroatom or carbon atom of the ring or ring system, where valence allows. The rings of multi-ring heterocyclyls may be fused, bridged and/or joined through one or more spiro atoms.
Non limiting exemplary heterocyclic groups include piperidinyl, piperazinyl, homopiperazinyl, morpholinyl, tetrahydropyranyl, tetrahydrofuranyl, pyrrolidinyl, aziridinyl, oxiranyl, thiiranyl, azetidinyl, oxetanyl, thietanyl, 2-imidazolinyl, pyrazolidinyl imidazolidinyl, isoxazolinyl, oxazolidinyl, isoxazolidinyl, thiazolidinyl, isothiazolidinyl, succinimidyl, 3H-indolyl, indolinyl, isoindolinyl, chromanyl (also known as 3,4-dihydrobenzo[b]pyranyl), 2H-pyrrolyl, 1-pyrrolinyl, 2-pyrrolinyl, 3-pyrrolinyl, 4H-quinolizinyl, 2-oxopiperazinyl, 2-pyrazolinyl, 3-pyrazolinyl, tetrahydro-2H-pyranyl, 2H-pyranyl, 4H-pyranyl, 3,4-dihydro-2H-pyranyl, 3-dioxolanyl, 1,4-dioxanyl, 2,5-dioximidazolidinyl, 2-oxopiperidinyl, 2-oxopyrrolodinyl, indolinyl, tetrahydrothiophenyl, tetrahydroquinolinyl, tetrahydroisoquinolin-1-yl, tetrahydroisoquinolin-2-yl, tetrahydroisoquinolin-3-yl, tetrahydroisoquinolin-4-yl, thiomorpholin-4-yl, thiomorpholin-4-ylsulfoxide, thiomorpholin-4-ylsulfone, 1,3-dioxolanyl, 1,4-oxathianyl, 1,4-dithianyl, 1,3,5-trioxanyl, 1H-pyrrolizinyl, tetrahydro-1,1-dioxothiophenyl, N-formylpiperazinyl, and morpholin-4-yl. The term “aziridinyl” as used herein includes aziridin-1-yl and aziridin-2-yl. The term “oxyranyl” as used herein includes oxyranyl-2-yl. The term “thiiranyl” as used herein includes thiiran-2-yl. The term “azetidinyl” as used herein includes azetidin-1-yl, azetidin-2-yl and azetidin-3-yl. The term “oxetanyl” as used herein includes oxetan-2-yl and oxetan-3-yl. The term “thietanyl” as used herein includes thietan-2-yl and thietan-3-yl. The term “pyrrolidinyl” as used herein includes pyrrolidin-1-yl, pyrrolidin-2-yl and pyrrolidin-3-yl. The term “tetrahydrofuranyl” as used herein includes tetrahydrofuran-2-yl and tetrahydrofuran-3-yl. The term “tetrahydrothiophenyl” as used herein includes tetrahydrothiophen-2-yl and tetrahydrothiophen-3-yl. The term “succinimidyl” as used herein includes succinimid-1-yl and succininmid-3-yl. The term “dihydropyrrolyl” as used herein includes 2,3-dihydropyrrol-1-yl, 2,3-dihydro-1H-pyrrol-2-yl, 2,3-dihydro-1H-pyrrol-3-yl, 2,5-dihydropyrrol-1-yl, 2,5-dihydro-1H-pyrrol-3-yl and 2,5-dihydropyrrol-5-yl. The term “2H-pyrrolyl” as used herein includes 2H-pyrrol-2-yl, 2H-pyrrol-3-yl, 2H-pyrrol-4-yl and 2H-pyrrol-5-yl. The term “3H-pyrrolyl” as used herein includes 3H-pyrrol-2-yl, 3H-pyrrol-3-yl, 3H-pyrrol-4-yl and 3H-pyrrol-5-yl. The term “dihydrofuranyl” as used herein includes 2,3-dihydrofuran-2-yl, 2,3-dihydrofuran-3-yl, 2,3-dihydrofuran-4-yl, 2,3-dihydrofuran-5-yl, 2,5-dihydrofuran-2-yl, 2,5-dihydrofuran-3-yl, 2,5-dihydrofuran-4-yl and 2,5-dihydrofuran-5-yl. The term “dihydrothiophenyl” as used herein includes 2,3-dihydrothiophen-2-yl, 2,3-dihydrothiophen-3-yl, 2,3-dihydrothiophen-4-yl, 2,3-dihydrothiophen-5-yl, 2,5-dihydrothiophen-2-yl, 2,5-dihydrothiophen-3-yl, 2,5-dihydrothiophen-4-yl and 2,5-dihydrothiophen-5-yl. The term “imidazolidinyl” as used herein includes imidazolidin-1-yl, imidazolidin-2-yl and imidazolidin-4-yl. The term “pyrazolidinyl” as used herein includes pyrazolidin-1-yl, pyrazolidin-3-yl and pyrazolidin-4-yl. The term “imidazolinyl” as used herein includes imidazolin-1-yl, imidazolin-2-yl, imidazolin-4-yl and imidazolin-5-yl. The term “pyrazolinyl” as used herein includes 1-pyrazolin-3-yl, 1-pyrazolin-4-yl, 2-pyrazolin-1-yl, 2-pyrazolin-3-yl, 2-pyrazolin-4-yl, 2-pyrazolin-5-yl, 3-pyrazolin-1-yl, 3-pyrazolin-2-yl, 3-pyrazolin-3-yl, 3-pyrazolin-4-yl and 3-pyrazolin-5-yl. The term “dioxolanyl” also known as “1,3-dioxolanyl” as used herein includes dioxolan-2-yl, dioxolan-4-yl and dioxolan-5-yl. The term “dioxolyl” also known as “1,3-dioxolyl” as used herein includes dioxol-2-yl, dioxol-4-yl and dioxol-5-yl. The term “oxazolidinyl” as used herein includes oxazolidin-2-yl, oxazolidin-3-yl, oxazolidin-4-yl and oxazolidin-5-yl. The term “isoxazolidinyl” as used herein includes isoxazolidin-2-yl, isoxazolidin-3-yl, isoxazolidin-4-yl and isoxazolidin-5-yl. The term “oxazolinyl” as used herein includes 2-oxazolinyl-2-yl, 2-oxazolinyl-4-yl, 2-oxazolinyl-5-yl, 3-oxazolinyl-2-yl, 3-oxazolinyl-4-yl, 3-oxazolinyl-5-yl, 4-oxazolinyl-2-yl, 4-oxazolinyl-3-yl, 4-oxazolinyl-4-yl and 4-oxazolinyl-5-yl. The term “isoxazolinyl” as used herein includes 2-isoxazolinyl-3-yl, 2-isoxazolinyl-4-yl, 2-isoxazolinyl-5-yl, 3-isoxazolinyl-3-yl, 3-isoxazolinyl-4-yl, 3-isoxazolinyl-5-yl, 4-isoxazolinyl-2-yl, 4-isoxazolinyl-3-yl, 4-isoxazolinyl-4-yl and 4-isoxazolinyl-5-yl. The term “thiazolidinyl” as used herein includes thiazolidin-2-yl, thiazolidin-3-yl, thiazolidin-4-yl and thiazolidin-5-yl. The term “isothiazolidinyl” as used herein includes isothiazolidin-2-yl, isothiazolidin-3-yl, isothiazolidin-4-yl and isothiazolidin-5-yl. The term “thiazolinyl” as used herein includes 2-thiazolinyl-2-yl, 2-thiazolinyl-4-yl, 2-thiazolinyl-5-yl, 3-thiazolinyl-2-yl, 3-thiazolinyl-4-yl, 3-thiazolinyl-5-yl, 4-thiazolinyl-2-yl, 4-thiazolinyl-3-yl, 4-thiazolinyl-4-yl and 4-thiazolinyl-5-yl. The term “isothiazolinyl” as used herein includes 2-isothiazolinyl-3-yl, 2-isothiazolinyl-4-yl, 2-isothiazolinyl-5-yl, 3-isothiazolinyl-3-yl, 3-isothiazolinyl-4-yl, 3-isothiazolinyl-5-yl, 4-isothiazolinyl-2-yl, 4-isothiazolinyl-3-yl, 4-isothiazolinyl-4-yl and 4-isothiazolinyl-5-yl. The term “piperidyl” also known as “piperidinyl” as used herein includes piperid-1-yl, piperid-2-yl, piperid-3-yl and piperid-4-yl. The term “dihydropyridinyl” as used herein includes 1,2-dihydropyridin-1-yl, 1,2-dihydropyridin-2-yl, 1,2-dihydropyridin-3-yl, 1,2-dihydropyridin-4-yl, 1,2-dihydropyridin-5-yl, 1,2-dihydropyridin-6-yl, 1,4-dihydropyridin-1-yl, 1,4-dihydropyridin-2-yl, 1,4-dihydropyridin-3-yl, 1,4-dihydropyridin-4-yl, 2,3-dihydropyridin-2-yl, 2,3-dihydropyridin-3-yl, 2,3-dihydropyridin-4-yl, 2,3-dihydropyridin-5-yl, 2,3-dihydropyridin-6-yl, 2,5-dihydropyridin-2-yl, 2,5-dihydropyridin-3-yl, 2,5-dihydropyridin-4-yl, 2,5-dihydropyridin-5-yl, 2,5-dihydropyridin-6-yl, 3,4-dihydropyridin-2-yl, 3,4-dihydropyridin-3-yl, 3,4-dihydropyridin-4-yl, 3,4-dihydropyridin-5-yl and 3,4-dihydropyridin-6-yl. The term “tetrahydropyridinyl” as used herein includes 1,2,3,4-tetrahydropyridin-1-yl, 1,2,3,4-tetrahydropyridin-2-yl, 1,2,3,4-tetrahydropyridin-3-yl, 1,2,3,4-tetrahydropyridin-4-yl, 1,2,3,4-tetrahydropyridin-5-yl, 1,2,3,4-tetrahydropyridin-6-yl, 1,2,3,6-tetrahydropyridin-1-yl, 1,2,3,6-tetrahydropyridin-2-yl, 1,2,3,6-tetrahydropyridin-3-yl, 1,2,3,6-tetrahydropyridin-4-yl, 1,2,3,6-tetrahydropyridin-5-yl, 1,2,3,6-tetrahydropyridin-6-yl, 2,3,4,5-tetrahydropyridin-2-yl, 2,3,4,5-tetrahydropyridin-3-yl, 2,3,4,5-tetrahydropyridin-3-yl, 2,3,4,5-tetrahydropyridin-4-yl, 2,3,4,5-tetrahydropyridin-5-yl and 2,3,4,5-tetrahydropyridin-6-yl. The term “tetrahydropyranyl” also known as “oxanyl” or “tetrahydro-2H-pyranyl”, as used herein includes tetrahydropyran-2-yl, tetrahydropyran-3-yl and tetrahydropyran-4-yl. The term “2H-pyranyl” as used herein includes 2H-pyran-2-yl, 2H-pyran-3-yl, 2H-pyran-4-yl, 2H-pyran-5-yl and 2H-pyran-6-yl. The term “4H-pyranyl” as used herein includes 4H-pyran-2-yl, 4H-pyran-3-yl and 4H-pyran-4-yl. The term “3,4-dihydro-2H-pyranyl” as used herein includes 3,4-dihydro-2H-pyran-2-yl, 3,4-dihydro-2H-pyran-3-yl, 3,4-dihydro-2H-pyran-4-yl, 3,4-dihydro-2H-pyran-5-yl and 3,4-dihydro-2H-pyran-6-yl. The term “3,6-dihydro-2H-pyranyl” as used herein includes 3,6-dihydro-2H-pyran-2-yl, 3,6-dihydro-2H-pyran-3-yl, 3,6-dihydro-2H-pyran-4-yl, 3,6-dihydro-2H-pyran-5-yl and 3,6-dihydro-2H-pyran-6-yl. The term “tetrahydrothiophenyl”, as used herein includes tetrahydrothiophen-2-yl, tetrahydrothiophenyl-3-yl and tetrahydrothiophenyl-4-yl. The term “2H-thiopyranyl” as used herein includes 2H-thiopyran-2-yl, 2H-thiopyran-3-yl, 2H-thiopyran-4-yl, 2H-thiopyran-5-yl and 2H-thiopyran-6-yl. The term “4H-thiopyranyl” as used herein includes 4H-thiopyran-2-yl, 4H-thiopyran-3-yl and 4H-thiopyran-4-yl. The term “3,4-dihydro-2H-thiopyranyl” as used herein includes 3,4-dihydro-2H-thiopyran-2-yl, 3,4-dihydro-2H-thiopyran-3-yl, 3,4-dihydro-2H-thiopyran-4-yl, 3,4-dihydro-2H-thiopyran-5-yl and 3,4-dihydro-2H-thiopyran-6-yl. The term “3,6-dihydro-2H-thiopyranyl” as used herein includes 3,6-dihydro-2H-thiopyran-2-yl, 3,6-dihydro-2H-thiopyran-3-yl, 3,6-dihydro-2H-thiopyran-4-yl, 3,6-dihydro-2H-thiopyran-5-yl and 3,6-dihydro-2H-thiopyran-6-yl. The term “piperazinyl” also known as “piperazidinyl” as used herein includes piperazin-1-yl and piperazin-2-yl. The term “morpholinyl” as used herein includes morpholin-2-yl, morpholin-3-yl and morpholin-4-yl. The term “thiomorpholinyl” as used herein includes thiomorpholin-2-yl, thiomorpholin-3-yl and thiomorpholin-4-yl. The term “dioxanyl” as used herein includes 1,2-dioxan-3-yl, 1,2-dioxan-4-yl, 1,3-dioxan-2-yl, 1,3-dioxan-4-yl, 1,3-dioxan-5-yl and 1,4-dioxan-2-yl. The term “dithianyl” as used herein includes 1,2-dithian-3-yl, 1,2-dithian-4-yl, 1,3-dithian-2-yl, 1,3-dithian-4-yl, 1,3-dithian-5-yl and 1,4-dithian-2-yl. The term “oxathianyl” as used herein includes oxathian-2-yl and oxathian-3-yl. The term “trioxanyl” as used herein includes 1,2,3-trioxan-4-yl, 1,2,3-trioxay-5-yl, 1,2,4-trioxay-3-yl, 1,2,4-trioxay-5-yl, 1,2,4-trioxay-6-yl and 1,3,4-trioxay-2-yl. The term “azepanyl” as used herein includes azepan-1-yl, azepan-2-yl, azepan-1-yl, azepan-3-yl and azepan-4-yl. The term “homopiperazinyl” as used herein includes homopiperazin-1-yl, homopiperazin-2-yl, homopiperazin-3-yl and homopiperazin-4-yl. The term “indolinyl” as used herein includes indolin-1-yl, indolin-2-yl, indolin-3-yl, indolin-4-yl, indolin-5-yl, indolin-6-yl, and indolin-7-yl. The term “quinolizinyl” as used herein includes quinolizidin yl, quinolizidin-2-yl, quinolizidin-3-yl and quinolizidin-4-yl. The term “isoindolinyl” as used herein includes isoindolin-1-yl, isoindolin-2-yl, isoindolin-3-yl, isoindolin-4-yl, isoindolin-5-yl, isoindolin-6-yl, and isoindolin-7-yl. The term “3H-indolyl” as used herein includes 3H-indol-2-yl, 3H-indol-3-yl, 3H-indol-4-yl, 3H-indol-5-yl, 3H-indol-6-yl, and 3H-indol-7-yl. The term “quinolizinyl” as used herein includes quinolizidin-1-yl, quinolizidin-2-yl, quinolizidin-3-yl and quinolizidin-4-yl. The term “quinolizinyl” as used herein includes quinolizidin-1-yl, quinolizidin-2-yl, quinolizidin-3-yl and quinolizidin-4-yl. The term “tetrahydroquinolinyl” as used herein includes tetrahydroquinolin-1-yl, tetrahydroquinolin-2-yl, tetrahydroquinolin-3-yl, tetrahydroquinolin-4-yl, tetrahydro quinolin-5-yl, tetrahydroquinolin-6-yl, tetrahydroquinolin-7-yl and tetrahydroquinolin-8-yl. The term “tetrahydroisoquinolinyl” as used herein includes tetrahydroisoquinolin-1-yl, tetrahydroisoquinolin-2-yl, tetrahydroisoquinolin-3-yl, tetrahydroisoquinolin-4-yl, tetrahydroisoquinolin-5-yl, tetrahydroisoquinolin-6-yl, tetrahydroisoquinolin-7-yl and tetrahydroisoquinolin-8-yl. The term “chromanyl” as used herein includes chroman-2-yl, chroman-3-yl, chroman-4-yl, chroman-5-yl, chroman-6-yl, chroman-7-yl and chroman-8-yl. The term “1H-pyrrolizine” as used herein includes 1H-pyrrolizin-1-yl, 1H-pyrrolizin-2-yl, 1H-pyrrolizin-3-yl, 1H-pyrrolizin-5-yl, 1H-pyrrolizin-6-yl and 1H-pyrrolizin-7-yl. The term “3H-pyrrolizine” as used herein includes 3H-pyrrolizin-1-yl, 3H-pyrrolizin-2-yl, 3H-pyrrolizin-3-yl, 3H-pyrrolizin-5-yl, 3H-pyrrolizin-6-yl and 3H-pyrrolizin-7-yl.
The term “heteroaryl” refers but is not limited to an aromatic ring system of 5 to 18 atoms including at least one N, O, S, or P, containing 1 or more rings, such as 1 or 2 or 3 or 4 rings, which can be fused together or linked covalently, each ring typically containing 5 to 6 atoms; at least one of said ring is aromatic, where the N and S heteroatoms may optionally be oxidized and the N heteroatoms may optionally be quaternized, and wherein at least one carbon atom of said heteroaryl can be oxidized to form at least one C═O. Such rings may be fused to an aryl, cycloalkyl, heteroaryl and/or heterocyclyl ring.
Non-limiting examples of such heteroaryl, include: triazol-2-yl, pyridinyl, 1H-pyrazol-5-yl, pyrrolyl, furanyl, thiophenyl, pyrazolyl, imidazolyl, oxazolyl, isoxazolyl, thiazolyl, isothiazolyl, triazolyl, oxadiazolyl, thiadiazolyl, tetrazolyl, oxatriazolyl, thiatriazolyl, pyrimidyl, pyrazinyl, pyridazinyl, oxazinyl, dioxinyl, thiazinyl, triazinyl, imidazo[2,1-b][1,3]thiazolyl, thieno[3,2-b]furanyl, thieno[3,2-b]thiophenyl, thieno[2,3-d][1,3]thiazolyl, thieno[2,3-d]imidazolyl, tetrazolo[1,5-a]pyridinyl, indolyl, indolizinyl, isoindolyl, benzofuranyl, isobenzofuranyl, benzothiophenyl, isobenzothiophenyl, indazolyl, benzimidazolyl, 1,3-benzoxazolyl, 1,2-benzisoxazolyl, 2,1-benzisoxazolyl, 1,3-benzothiazolyl, 1,2-benzoisothiazolyl, 2,1-benzoisothiazolyl, benzotriazolyl, 1,2,3-benzoxadiazolyl, 2,1,3-benzoxadiazolyl, 1,2,3-benzothiadiazolyl, 2,1,3-benzothiadiazolyl, benzo[d]oxazol-2(3H)-one, 2,3-dihydro-benzofuranyl, thienopyridinyl, purinyl, imidazo[1,2-a]pyridinyl, 6-oxo-pyridazin-1(6H)-yl, 2-oxopyridin-1(2H)-yl, 6-oxo-pyridazin-1(6H)-yl, 2-oxopyridin-1(2H)-yl, 1,3-benzodioxolyl, quinolinyl, isoquinolinyl, cinnolinyl, quinazolinyl, quinoxalinyl; preferably said heteroaryl group is selected from the group comprising pyridyl, 1,3-benzodioxolyl, benzo[d]oxazol-2(3H)-one, 2,3-dihydro-benzofuranyl, pyrazinyl, pyrazolyl, pyrrolyl, isoxazolyl, thiophenyl, imidazolyl, benzimidazolyl, pyrimidinyl, s-triazinyl, oxazolyl, isothiazolyl, furyl, thienyl, triazolyl thiazolyl, 5H-[1,2,4]triazino[5,6-b]indole, and 3,5,6,8,10,11-hexaazatricyclo[7.3.0.02,6]dodeca-1(9), 2,4,7,11-pentaenyl.
The term “pyrrolyl” (also called azolyl) as used herein includes pyrrol-1-yl, pyrrol-2-yl and pyrrol-3-yl. The term “furanyl” (also called “furyl”) as used herein includes furan-2-yl and furan-3-yl (also called furan-2-yl and furan-3-yl). The term “thiophenyl” (also called “thienyl”) as used herein includes thiophen-2-yl and thiophen-3-yl (also called thien-2-yl and thien-3-yl). The term “pyrazolyl” (also called 1H-pyrazolyl and 1,2-diazolyl) as used herein includes pyrazol-1-yl, pyrazol-3-yl or 1H-pyrazol-5-yl, pyrazol-4-yl and pyrazol-5-yl. The term “imidazolyl” as used herein includes imidazol-1-yl, imidazol-2-yl, imidazol-4-yl and imidazol-5-yl. The term “oxazolyl” (also called 1,3-oxazolyl) as used herein includes oxazol-2-yl, oxazol-4-yl and oxazol-5-yl. The term “isoxazolyl” (also called 1,2-oxazolyl), as used herein includes isoxazol-3-yl, isoxazol-4-yl, and isoxazol-5-yl. The term “thiazolyl” (also called 1,3-thiazolyl),as used herein includes thiazol-2-yl, thiazol-4-yl and thiazol-5-yl (also called 2-thiazolyl, 4-thiazolyl and 5-thiazolyl). The term “isothiazolyl” (also called 1,2-thiazolyl) as used herein includes isothiazol-3-yl, isothiazol-4-yl, and isothiazol-5-yl. The term “triazolyl” as used herein includes triazol-2-yl, 1H-triazolyl and 4H-1,2,4-triazolyl, “1H-triazolyl” includes 1H-1,2,3-triazol-1-yl, 1H-1,2,3-triazol-4-yl, 1H-1,2,3-triazol-5-yl, 1H-1,2,4-triazol-1-yl, 1H-1,2,4-triazol-3-yl and 1H-1,2,4-triazol-5-yl. “4H-1,2,4-triazolyl” includes 4H-1,2,4-triazol-4-yl, and 4H-1,2,4-triazol-3-yl. The term “oxadiazolyl” as used herein includes 1,2,3-oxadiazol-4-yl, 1,2,3-oxadiazol-5-yl, 1,2,4-oxadiazol-3-yl, 1,2,4-oxadiazol-5-yl, 1,2,5-oxadiazol-3-yl and 1,3,4-oxadiazol-2-yl. The term “thiadiazolyl” as used herein includes 1,2,3-thiadiazol-4-yl, 1,2,3-thiadiazol-5-yl, 1,2,4-thiadiazol-3-yl, 1,2,4-thiadiazol-5-yl, 1,2,5-thiadiazol-3-yl (also called furazan-3-yl) and 1,3,4-thiadiazol-2-yl. The term “tetrazolyl” as used herein includes 1H-tetrazol-1-yl, 1H-tetrazol-5-yl, 2H-tetrazol-2-yl, and 2H-tetrazol-5-yl. The term “oxatriazolyl” as used herein includes 1,2,3,4-oxatriazol-5-yl and 1,2,3,5-oxatriazol-4-yl. The term “thiatriazolyl” as used herein includes 1,2,3,4-thiatriazol-5-yl and 1,2,3,5-thiatriazol-4-yl. The term “pyridinyl” (also called “pyridyl”) as used herein includes pyridin-2-yl, pyridin-3-yl and pyridin-4-yl (also called 2-pyridyl, 3-pyridyl and 4-pyridyl). The term “pyrimidyl” as used herein includes pyrimid-2-yl, pyrimid-4-yl, pyrimid-5-yl and pyrimid-6-yl. The term “pyrazinyl” as used herein includes pyrazin yl and pyrazin-3-yl. The term “pyridazinyl as used herein includes pyridazin-3-yl and pyridazin-4-yl. The term “oxazinyl” (also called “1,4-oxazinyl”) as used herein includes 1,4-oxazin-4-yl and 1,4-oxazin-5-yl. The term “dioxinyl” (also called “1,4-dioxinyl”) as used herein includes 1,4-dioxin-2-yl and 1,4-dioxin-3-yl. The term “thiazinyl” (also called “1,4-thiazinyl”) as used herein includes 1,4-thiazin-2-yl, 1,4-thiazin-3-yl, 1,4-thiazin-4-yl, 1,4-thiazin-5-yl and 1,4-thiazin-6-yl. The term “triazinyl” as used herein includes 1,3,5-triazin-2-yl, 1,2,4-triazin-3-yl, 1,2,4-triazin-5-yl, 1,2,4-triazin-6-yl, 1,2,3-triazin-4-yl and 1,2,3-triazin-5-yl. The term “imidazo[2,1-b][1,3]thiazolyl” as used herein includes imidazo[2,1-b][1,3]thiazol-2-yl, imidazo[2,1-b][1,3]thiazol-3-yl, imidazo[2,1-b][1,3]thiazol-5-yl and imidazo[2,1-b][1,3]thiazol-6-yl. The term “thieno[3,2-b]furanyl” as used herein includes thieno[3,2-b]furan-2-yl, thieno[3,2-b]furan-3-yl, thieno[3,2-b]furan-4-yl, and thieno[3,2-b]furan-5-yl. The term “thieno[3,2-b]thiophenyl” as used herein includes thieno[3,2-b]thien-2-yl, thieno[3,2-b]thien-3-yl, thieno[3,2-b]thien-5-yl and thieno[3,2-b]thien-6-yl. The term “thieno[2,3-d][1,3]thiazolyl” as used herein includes thieno[2,3-d][1,3]thiazol-2-yl, thieno[2,3-d][1,3]thiazol-5-yl and thieno[2,3-d][1,3]thiazol-6-yl. The term “thieno[2,3-d]imidazolyl” as used herein includes thieno[2,3-d]imidazol-2-yl, thieno[2,3-d]imidazol-4-yl and thieno[2,3-d]imidazol-5-yl. The term “tetrazolo[1,5-a]pyridinyl” as used herein includes tetrazolo[1,5-a]pyridine-5-yl, tetrazolo[1,5-a]pyridine-6-yl, tetrazolo[1,5-a]pyridine-7-yl, and tetrazolo[1,5-a]pyridine-8-yl. The term “indolyl” as used herein includes indol-1-yl, indol-2-yl, indol-3-yl,-indol-4-yl, indol-5-yl, indol-6-yl and indol-7-yl. The term “indolizinyl” as used herein includes indolizin-1-yl, indolizin-2-yl, indolizin-3-yl, indolizin-5-yl, indolizin-6-yl, indolizin-7-yl, and indolizin-8-yl. The term “isoindolyl” as used herein includes isoindol-1-yl, isoindol-2-yl, isoindol-3-yl, isoindol-4-yl, isoindol-5-yl, isoindol-6-yl and isoindol-7-yl. The term “benzofuranyl” (also called benzo[b]furanyl) as used herein includes benzofuran-2-yl, benzofuran-3-yl, benzofuran-4-yl, benzofuran-5-yl, benzofuran-6-yl and benzofuran-7-yl. The term “isobenzofuranyl” (also called benzo[c]furanyl) as used herein includes isobenzofuran-1-yl, isobenzofuran-3-yl, isobenzofuran-4-yl, isobenzofuran-5-yl, isobenzofuran-6-yl and isobenzofuran-7-yl. The term “benzothiophenyl” (also called benzo[b]thienyl) as used herein includes 2-benzo[b]thiophenyl, 3-benzo[b]thiophenyl, 4-benzo[b]thiophenyl, 5-benzo[b]thiophenyl, 6-benzo[b]thiophenyl and -7-benzo[b]thiophenyl (also called benzothien-2-yl, benzothien-3-yl, benzothien-4-yl, benzothien-5-yl, benzothien-6-yl and benzothien-7-yl). The term “isobenzothiophenyl” (also called benzo[c]thienyl) as used herein includes isobenzothien-1-yl, isobenzothien-3-yl, isobenzothien-4-yl, isobenzothien-5-yl, isobenzothien-6-yl and isobenzothien-7-yl. The term “indazolyl” (also called 1H-indazolyl or 2-azaindolyl) as used herein includes 1H-indazol-1-yl, 1H-indazol-3-yl, 1H-indazol-4-yl, 1H-indazol-5-yl, 1H-indazol-6-yl, 1H-indazol-7-yl, 2H-indazol-2-yl, 2H-indazol-3-yl, 2H-indazol-4-yl, 2H-indazol-5-yl, 2H-indazol-6-yl, and 2H-indazol-7-yl. The term “benzimidazolyl” as used herein includes benzimidazol-1-yl, benzimidazol-2-yl, benzimidazol-4-yl, benzimidazol-5-yl, benzimidazol-6-yl and benzimidazol-7-yl. The term “1,3-benzoxazolyl” as used herein includes 1,3-benzoxazol-2-yl, 1,3-benzoxazol-4-yl, 1,3-benzoxazol-5-yl, 1,3-benzoxazol-6-yl and 1,3-benzoxazol-7-yl. The term “1,2-benzisoxazolyl” as used herein includes 1,2-benzisoxazol-3-yl, 1,2-benzisoxazol-4-yl, 1,2-benzisoxazol-5-yl, 1,2-benzisoxazol-6-yl and 1,2-benzisoxazol-7-yl. The term “2,1-benzisoxazolyl” as used herein includes 2,1-benzisoxazol-3-yl, 2,1-benzisoxazol-4-yl, 2,1-benzisoxazol-5-yl, 2,1-benzisoxazol-6-yl and 2,1-benzisoxazol-7-yl. The term “1,3-benzothiazolyl” as used herein includes 1,3-benzothiazol-2-yl, 1,3-benzothiazol-4-yl, 1,3-benzothiazol-5-yl, 1,3-benzothiazol-6-yl and 1,3-benzothiazol-7-yl. The term “1,2-benzoisothiazolyl” as used herein includes 1,2-benzisothiazol-3-yl, 1,2-benzisothiazol-4-yl, 1,2-benzisothiazol-5-yl, 1,2-benzisothiazol-6-yl and 1,2-benzisothiazol-7-yl. The term “2,1-benzoisothiazolyl” as used herein includes 2,1-benzisothiazol-3-yl, 2,1-benzisothiazol-4-yl, 2,1-benzisothiazol-5-yl, 2,1-benzisothiazol-6-yl and 2,1-benzisothiazol-7-yl. The term “benzotriazolyl” as used herein includes benzotriazol-1-yl, benzotriazol-4-yl, benzotriazol-5-yl, benzotriazol-6-yl and benzotriazol-7-yl. The term “1,2,3-benzoxadiazolyl” as used herein includes 1,2,3-benzoxadiazol-4-yl, 1,2,3-benzoxadiazol-5-yl, 1,2,3-benzoxadiazol-6-yl and 1,2,3-benzoxadiazol-7-yl. The term “2,1,3-benzoxadiazolyl” as used herein includes 2,1,3-benzoxadiazol-4-yl, 2,1,3-benzoxadiazol-5-yl, 2,1,3-benzoxadiazol-6-yl and 2,1,3-benzoxadiazol-7-yl. The term “1,2,3-benzothiadiazolyl” as used herein includes 1,2,3-benzothiadiazol-4-yl, 1,2,3-benzothiadiazol-5-yl, 1,2,3-benzothiadiazol-6-yl and 1,2,3-benzothiadiazol-7-yl. The term “2,1,3-benzothiadiazolyl” as used herein includes 2,1,3-benzothiadiazol-4-yl, 2,1,3-benzothiadiazol-5-yl, 2,1,3-benzothiadiazol-6-yl and 2,1,3-benzothiadiazol-7-yl. The term “thienopyridinyl” as used herein includes thieno[2,3-b]pyridinyl, thieno[2,3-c]pyridinyl, thieno[3,2-c]pyridinyl and thieno[3,2-b]pyridinyl. The term “purinyl” as used herein includes purin-2-yl, purin-6-yl, purin-7-yl and purin-8-yl. The term “imidazo[1,2-a]pyridinyl”, as used herein includes imidazo[1,2-a]pyridin-2-yl, imidazo[1,2-a]pyridin-3-yl, imidazo[1,2-a]pyridin-4-yl, imidazo[1,2-a]pyridin-5-yl, imidazo[1,2-a]pyridin-6-yl and imidazo[1,2-a]pyridin-7-yl. The term “1,3-benzodioxolyl”, as used herein includes 1,3-benzodioxol-4-yl, 1,3-benzodioxol-5-yl, 1,3-benzodioxol-6-yl, and 1,3-benzodioxol-7-yl. The term “quinolinyl” as used herein includes quinolin-2-yl, quinolin-3-yl, quinolin-4-yl, quinolin-5-yl, quinolin-6-yl, quinolin-7-yl and quinolin-8-yl. The term “isoquinolinyl” as used herein includes isoquinolin-1-yl, isoquinolin-3-yl, isoquinolin-4-yl, isoquinolin-5-yl, isoquinolin-6-yl, isoquinolin-7-yl and isoquinolin-8-yl. The term “cinnolinyl” as used herein includes cinnolin-3-yl, cinnolin-4-yl, cinnolin-5-yl, cinnolin-6-yl, cinnolin-7-yl and cinnolin-8-yl. The term “quinazolinyl” as used herein includes quinazolin-2-yl, quinazolin-4-yl, quinazolin-5-yl, quinazolin-6-yl, quinazolin-7-yl and quinazolin-8-yl. The term “quinoxalinyl” as used herein includes quinoxalin-2-yl, quinoxalin-5-yl, and quinoxalin-6-yl.
The term “heterocyclyloxy”, as a group or part of a group, refers to a group having the formula —O—Ri wherein Ri is heterocyclyl as defined herein above.
The term “heterocyclylalkyloxy”, as a group or part of a group, refers to a group having the formula —O—Ra—Ri wherein Ri is heterocyclyl, and Ra is alkyl as defined herein above.
The term “heterocyclylalkyl”, as a group or part of a group, means an alkyl as defined herein, wherein at least one hydrogen atom is replaced by at least one heterocyclyl as defined herein. A non-limiting example of a heterocyclyl-alkyl group is 2-tetrahydrofuranyl-methyl.
The term “heteroaryloxy”, as a group or part of a group, refers to a group having the formula —O—Rk wherein Rk is heteroaryl as defined herein above.
The term “heteroarylalkyloxy”, as a group or part of a group, refers to a group having the formula —O—Ra—Ri wherein Ri is heteroaryl, and Ra is alkyl as defined herein above.
The term “heterocyclyl-alkyl” as a group or part of a group, refers to an alkyl in which one of the hydrogen atoms bonded to a carbon atom, typically a terminal or sp3 carbon atom, is replaced with a heterocyclyl. A non-limiting example of a heterocyclyl-alkyl group is 2-piperidinyl-methylene. The heterocyclyl-alkyl group can comprise 6 to 20 atoms, e.g. the alkyl moiety of the heterocycle-alkyl group is 1 to 6 carbon atoms and the heterocyclyl moiety is 5 to 14 atoms.
The term “heterocyclyl-alkenyl” as a group or part of a group refers to an alkenyl in which one of the hydrogen atoms bonded to a carbon atom, is replaced with an heterocyclyl. The heterocyclyl-alkenyl group can comprise 6 to 20 atoms, e.g. the alkenyl moiety of the heterocyclyl-alkenyl group is 1 to 6 carbon atoms and the heterocyclyl moiety is 5 to 14 atoms, such as ═CH—Ri, wherein Ri is heterocyclyl as defined herein above.
The term “heterocyclyl-heteroalkyl” as a group or part of a group refers to a heteroalkyl in which one of the hydrogen atoms bonded to a carbon atom, typically a terminal or sp3 carbon atom, is replaced with a heterocyclyl. The heterocyclyl-heteroalkyl group can comprise 6 to 20 atoms, e.g. the heteroalkyl moiety of the heterocyclyl-heteroalkyl group can comprise 1 to 6 carbon atoms and the heterocyclyl moiety can comprise 5 to 14 atoms. In some embodiments heterocyclyl-heteroalkyl is selected from the group comprising heterocyclyl-O-alkyl, heterocyclylalkyl-O-alkyl, heterocyclyl-NH-alkyl, heterocyclyl-N(alkyl)2, heterocyclylalkyl-NH-alkyl, heterocyclylalkyl-N-(alkyl)2, heterocyclyl-S-alkyl, and heterocyclylalkyl-S-alkyl.
The term “heterocyclyl-heteroalkenyl” as a group or part of a group refers to a heteroalkenyl in which one of the hydrogen atoms bonded to a carbon atom, is replaced with an heterocyclyl. The heterocyclyl-heteroalkenyl group can comprise 6 to 20 atoms, e.g. the heteroalkenyl moiety of the heterocyclyl-heteroalkenyl group can comprise 1 to 6 carbon atoms and the heterocyclyl moiety can comprise 5 to 14 atoms. In some embodiments heterocyclyl-heteroalkenyl is selected from the group comprising heterocyclyl-O-alkenyl, heterocyclylalkyl-O-alkenyl, heterocyclyl-NH-alkenyl, heterocyclyl-N(alkenyl)2, heterocyclylalkyl-NH-alkenyl, heterocyclylalkyl-N-(alkenyl)2, heterocyclyl-S-alkenyl, and heterocyclylalkenyl-S-alkenyl.
The term “heteroaryl-alkyl” as a group or part of a group refers to an alkyl in which one of the hydrogen atoms bonded to a carbon atom, typically a terminal or sp3 carbon atom, is replaced with a heteroaryl. An example of a heteroaryl-alkyl group is 2-pyridyl-methylene. The heteroaryl-alkyl group can comprise 6 to 20 atoms, e.g. the alkyl moiety of the heteroaryl-alkyl group can comprise 1 to 6 carbon atoms and the heteroaryl moiety can comprise 5 to 14 atoms.
The term “heteroaryl-alkenyl” as a group or part of a group refers to an alkenyl in which one of the hydrogen atoms bonded to a carbon atom, is replaced with an heteroaryl. The heteroaryl-alkenyl group can comprise 6 to 20 atoms, e.g. the alkenyl moiety of the heteroaryl-alkenyl group can comprise 1 to 6 carbon atoms and the heteroaryl moiety can comprise 5 to 14 atoms, such as ═CH—Rk, wherein Rk is heteroaryl as defined herein above.
The term “heteroaryl-heteroalkyl” as a group or part of a group as used herein refers to a heteroalkyl in which one of the hydrogen atoms bonded to a carbon atom, typically a terminal or sp3 carbon atom, is replaced with a heteroaryl. The heteroaryl-heteroalkyl group comprises 6 to 20 atoms, e.g. the heteroalkyl moiety of the heteroaryl-heteroalkyl group is 1 to 6 carbon atoms and the heteroaryl moiety is 5 to 14 atoms. In some embodiments heteroaryl-heteroalkyl is selected from the group comprising heteroaryl-O-alkyl, heteroarylalkyl-O-alkyl, heteroaryl-NH-alkyl, heteroaryl-N(alkyl)2, heteroarylalkyl-NH-alkyl, heteroarylalkyl-N-(alkyl)2, heteroaryl-S-alkyl, and heteroarylalkyl-S-alkyl.
The term “heteroaryl-heteroalkenyl” as a group or part of a group as used herein refers to a heteroalkenyl in which one of the hydrogen atoms bonded to a carbon atom, is replaced with an heteroaryl. The heteroaryl-heteroalkenyl group comprises 6 to 20 atoms, e.g. the heteroalkenyl moiety of the heteroaryl-heteroalkenyl group is 1 to 6 carbon atoms and the heteroaryl moiety is 5 to 14 atoms. In some embodiments heteroaryl-heteroalkenyl is selected from the group comprising heteroaryl-O-alkenyl, heteroarylalkenyl-O-alkenyl, heteroaryl-NH-alkenyl, heteroaryl-N(alkenyl)2, heteroarylalkenyl-NH-alkenyl, heteroarylalkenyl-N-(alkenyl)2, heteroaryl-S-alkenyl, and heteroarylalkenyl-S-alkenyl.
By way of example, carbon bonded heteroaryl or heterocyclic rings can be bonded at position 2, 3, 4, 5, or 6 of a pyridine, position 3, 4, 5, or 6 of a pyridazine, position 2, 4, 5, or 6 of a pyrimidine, position 2, 3, 5, or 6 of a pyrazine, position 2, 3, 4, or 5 of a furan, tetrahydrofuran, thiophene, pyrrole or tetrahydropyrrole, position 2, 4, or 5 of an oxazole, imidazole or thiazole, position 3, 4, or 5 of an isoxazole, pyrazole, or isothiazole, position 2 or 3 of an aziridine, position 2, 3, or 4 of an azetidine, position 2, 3, 4, 5, 6, 7, or 8 of a quinoline or position 1, 3, 4, 5, 6, 7, or 8 of an isoquinoline. Still more typically, carbon bonded heteroaryls and heterocyclyls include 2-pyridyl, 3-pyridyl, 4-pyridyl, 5-pyridyl, 6-pyridyl, 3-pyridazinyl, 4-pyridazinyl, 5-pyridazinyl, 6-pyridazinyl, 2-pyrimidinyl, 4-pyrimidinyl, 5-pyrimidinyl, 6-pyrimidinyl, 2-pyrazinyl, 3-pyrazinyl, 5-pyrazinyl, 6-pyrazinyl, 2-thiazolyl, 4-thiazolyl, or 5-thiazolyl. By way of example, nitrogen bonded heterocyclic rings are bonded at position 1 of an aziridine, azetidine, pyrrole, pyrrolidine, 2-pyrroline, 3-pyrroline, imidazole, imidazolidine, 2-imidazoline, 3-imidazoline, pyrazole, pyrazoline, 2-pyrazoline, 3-pyrazoline, piperidine, piperazine, indole, indoline, 1H-indazole, position 2 of a isoindole, or isoindoline, position 4 of a morpholine, and position 9 of a carbazole, or B-carboline. Still more typically, nitrogen bonded heteroaryls or heterocyclyls include 1-aziridyl, 1-azetedyl, 1-pyrrolyl, 1-imidazolyl, 1-pyrazolyl, and 1-piperidinyl.
As used herein and unless otherwise stated, the terms “alkoxy”, “cyclo-alkoxy”, “aryloxy”, “arylalkyloxy”, “heteroaryloxy” “heterocyclyloxy”, “alkylthio”, “cycloalkylthio”, “arylthio”, “arylalkylthio”, “heteroarylthio” and “heterocyclylthio” refer to substituents wherein an alkyl group, respectively a cycloalkyl, aryl, arylalkyl heteroaryl, or heterocyclyl (each of them such as defined herein), are attached to an oxygen atom or a sulfur atom through a single bond, such as but not limited to methoxy, ethoxy, propoxy, butoxy, thioethyl, thiomethyl, phenyloxy, benzyloxy, mercaptobenzyl and the like. The same definitions will apply for alkenyl and alkynyl instead of alkyl.
The term “alkylthio”, as a group or part of a group, refers to a group having the formula —S—Rb wherein Rb is alkyl as defined herein above. Non-limiting examples of alkylthio groups include methylthio (—SCH3), ethylthio (—SCH2CH3), n-propylthio, isopropylthio, n-butylthio, isobutylthio, sec-butylthio, tert-butylthio and the like.
The term “alkenylthio”, as a group or part of a group, refers to a group having the formula —S—Rd wherein Rd is alkenyl as defined herein above.
The term “arylthio”, as a group or part of a group, refers to a group having the formula —S—Rg wherein Rg is aryl as defined herein above.
The term “arylalkylthio”, as a group or part of a group, refers to a group having the formula —S—Ra—Rg wherein Ra is alkylene and Rg is aryl as defined herein above.
The term “heterocyclylthio”, as a group or part of a group, refers to a group having the formula —S—Ri wherein Ri is heterocyclyl as defined herein above.
The term “heteroarylthio”, as a group or part of a group, refers to a group having the formula —S—Rk wherein Rk is heteroaryl as defined herein above.
The term “heterocyclylalkylthio”, as a group or part of a group, refers to a group having the formula —S—Ra—Ri wherein Ra is alkylene and Ri is heterocyclyl as defined herein above.
The term “heteroarylalkylthio”, as a group or part of a group, refers to a group having the formula —S—Ra—Rk wherein Ra is alkylene and Rk is heteroaryl as defined herein above.
The term “alkyl-SO2”, as a group or part of a group, refers to a group having the formula —SO2-Rb wherein Rb is alkyl as defined herein above. Non-limiting examples of alkyl-SO2 groups include methyl-SO2, ethyl-SO2 and the like.
The term “heteroalkyl-SO2”, as a group or part of a group, refers to a group having the formula —SO2-Re wherein Re is heteroalkyl as defined herein above.
The term “aryl-SO2”, as a group or part of a group, refers to a group having the formula —SO2-Rg wherein Rg is aryl as defined herein above.
The term “heteroaryl-SO2”, as a group or part of a group, refers to a group having the formula —SO2-Rk wherein Rk is heteroaryl as defined herein above.
The term “heterocyclyl-SO2”, as a group or part of a group, refers to a group having the formula —SO2-Ri wherein Ri is heterocyclyl as defined herein above.
The term “mono- or di-alkylamino”, as a group or part of a group, refers to a group of formula —N(Ro)(Rb) wherein Ro is hydrogen, or alkyl, Rb is alkyl. Thus, alkylamino include mono-alkyl amino group (e.g. mono-alkylamino group such as methylamino and ethylamino), and di-alkylamino group (e.g. di-alkylamino group such as dimethylamino and diethylamino). Non-limiting examples of suitable mono- or di-alkylamino groups include n-propylamino, isopropylamino, n-butylamino, i-butylamino, sec-butylamino, t-butylamino, pentylamino, n-hexylamino, di-n-propylamino, di-i-propylamino, ethylmethylamino, methyl-n-propylamino, methyl-i-propylamino, n-butylmethylamino, i-butylmethylamino, t-butylmethylamino, ethyl-n-propylamino, ethyl-i-propylamino, n-butylethylamino, i-butylethylamino, t-butylethylamino, di-n-butylamino, di-i-butylamino, methylpentylamino, methylhexylamino, ethylpentylamino, ethylhexylamino, propylpentylamino, propylhexylamino, and the like.
The term “mono- or di-arylamino”, as a group or part of a group, refers to a group of formula —N(Rq)(Rr) wherein Rq and Rr are each independently selected from hydrogen, aryl, or alkyl, wherein at least one of Rq or Rr is aryl.
The term “mono- or di-heteroarylamino”, as a group or part of a group, refers to a group of formula —N(Ru)(Rv) wherein Ru and Rv are each independently selected from hydrogen, heteroaryl, or alkyl, wherein at least one of Ru or Rv is heteroaryl as defined herein.
The term “mono- or di-alkylamino-SO2”, as a group or part of a group, refers to a group of formula —SO2-N(Ro)(Rb) wherein Ro is hydrogen, or alkyl, Rb is alkyl. “alkylamino” includes mono-alkyl amino group (e.g. mono-alkylamino group such as methylamino and ethylamino), and di-alkylamino group (e.g. di-alkylamino group such as dimethylamino and diethylamino).
The term “mono- or di-arylamino-SO2”, as a group or part of a group, refers to a group of formula —SO2-N(Rq)(Rr) wherein Rq and Rr are each independently selected from hydrogen, aryl, or alkyl, wherein at least one of Rq or Rr is aryl.
The term “mono- or di-heteroarylamino-SO2”, as a group or part of a group, refers to a group of formula —SO2-N(Ru)(Rv) wherein Ru and Rv are each independently selected from hydrogen, heteroaryl, or alkyl, wherein at least one of Ru or Rv is heteroaryl as defined herein.
The term “mono- or di-heterocyclylamino”, as a group or part of a group, refers to a group of formula —N(Rw)(Rx) wherein Rw and Rx are each independently selected from hydrogen, heterocyclyl, or alkyl, wherein at least one of Rw or Rx is heterocyclyl as defined herein.
As used herein and unless otherwise stated, the term halogen means any atom selected from the group consisting of fluorine (F), chlorine (Cl), bromine (Br) and iodine (I).
The terminology regarding a chemical group “which optionally includes one or more heteroatoms, said heteroatoms being selected from the atoms consisting of O, S, and N” as used herein, refers to a group where one or more carbon atoms are replaced by an oxygen, nitrogen or sulphur atom and thus includes, depending on the group to which is referred, heteroalkyl, heteroalkenyl, heteroalkynyl, heteroalkyl, heteroalkenyl, heteroalkynyl, cycloheteroalkyl, cycloheteroalkenyl, cycloheteroalkynyl, heteroaryl, arylheteroalkyl, heteroarylalkyl, heteroarylheteroalkyl, arylheteroalkenyl, heteroarylalkenyl, heteroarylheteroalkenyl, heteroarylheteroalkenyl, arylheteroalkynyl, heteroarylalkynyl, heteroarylheteroalkynyl, among others. This term therefore comprises, depending on the group to which is referred, as an example alkoxy, alkenyloxy, alkynyloxy, alkyl-O-alkylene, alkenyl-O-alkylene, arylalkoxy, benzyloxy, heteroaryl-heteroalkyl, heterocyclyl-heteroalkyl, heteroaryl-alkoxy, heterocyclyl-alkoxy, among others. As an example, the terminology “alkyl which optionally includes one or more heteroatoms, said heteroatoms being selected from the atoms consisting of O, S, and N” therefore refers to heteroalkyl, meaning an alkyl which comprises one or more heteroatoms in the hydrocarbon chain, whereas the heteroatoms may be positioned at the beginning of the hydrocarbon chain, in the hydrocarbon chain or at the end of the hydrocarbon chain. Examples of heteroalkyl include methoxy, methylthio, ethoxy, propoxy, CH3—O—CH2—, CH3—S—CH2—, CH3—CH2—O—CH2—, CH3—NH—, (CH3)2—N—, (CH3)2—CH2—NH—CH2—CH2—, among many other examples. As an example, the terminology “arylalkylene which optionally includes one or more heteroatoms in the alkylene chain, said heteroatoms being selected from the atoms consisting of O, S, and N” therefore refers to arylheteroalkylene, meaning an arylalkylene which comprises one or more heteroatoms in the hydrocarbon chain, whereas the heteroatoms may be positioned at the beginning of the hydrocarbon chain, in the hydrocarbon chain or at the end of the hydrocarbon chain. “Arylheteroalkylene” thus includes aryloxy, arylalkoxy, aryl-alkyl-NH— and the like and examples are phenyloxy, benzyloxy, aryl-CH2—S—CH2—, aryl-CH2—O—CH2—, aryl-NH—CH2— among many other examples. The same counts for “heteroalkenylene”, and other terms used herein when referred to “which optionally includes one or more heteroatoms, said heteroatoms being selected from the atoms consisting of O, S, and N”.
The term “single bond” as used herein for a linking group i.e. in a way that a certain linking group is selected from a single bond, etc. in the formulas herein, refers to a molecule wherein the linking group is not present and therefore refers to compounds with a direct linkage via a single bond between the two moieties being linked by the linking group.
As used herein with respect to a substituting group, and unless otherwise stated, the terms “substituted” such as in “substituted alkyl”, “substituted alkenyl”, substituted alkynyl”, “substituted aryl”, “substituted heteroaryl”, “substituted heterocyclyl”, “substituted arylalkyl”, “substituted heteroaryl-alkyl”, “substituted heterocyclyl-alkyl” and the like refer to the chemical structures defined herein, and wherein the said alkyl, alkenyl, alkynyl, group and/or the said aryl, heteroaryl, or heterocyclyl may be optionally substituted with one or more substituents (preferable 1, 2, 3, 4, 5 or 6), meaning that one or more hydrogen atoms are each independently replaced with at least one substituent. Typical substituents include, but are not limited to and in a particular embodiment said substituents are being independently selected from the group consisting of halogen, amino, hydroxyl, sulfhydryl, alkyl, alkoxy, alkenyl, alkenyloxy, alkynyl, alkynyloxy, cycloalkyl, cycloalkenyl, cycloalkynyl, heteroalkyl, heteroalkenyl, heteroalkynyl, aryl, heteroaryl, heterocyclyl, arylalkyl, arylalkenyl, arylalkynyl, cycloalkyl-alkyl, cycloalkylalkenyl, cycloalkylalkynyl, heteroaryl-alkyl, heterocyclyl-alkyl, heteroaryl-alkenyl, heterocyclyl-alkenyl and heteroaryl-alkynyl, heterocyclyl-alkynyl, —X, -Z, —O−, —OZ, ═O, —SZ, —S−, ═S, —NZ2, —N+Z3, ═NZ, ═N—OZ, —CX3 (e.g. trifluoromethyl), —CN, —OCN, —SCN, —N═C═O, —N═C═S, —NO, —NO2, ═N2, —N3, —NZC(O)Z, —NZC(S)Z, —NZC(O)O−, —NZC(O)OZ, —NZC(S)OZ, —NZC(O)NZZ, NZC(NZ)Z, NZC(NZ)NZZ, —C(O)NZZ, —C(NZ)Z, —S(O)2O−, —S(O)2OZ, —S(O)2Z, —OS(O)2OZ, —OS(O)2Z, —OS(O)2O−, —S(O)2NZ, —S(O)Z, —OP(O)(OZ)2, —P(O)(OZ)2, —P(O)(O−)2, —P(O)(OZ)(O−), —P(O)(OH)2, —C(O)Z, —C(O)X, —C(S)Z, —C(O)OZ, —C(O)O−, —C(S)OZ, —C(O)SZ, —C(S)SZ, —C(O)NZZ, —C(S)NZZ, —C(NZ)NZZ, —OC(O)Z, —OC(S)Z, —OC(O)O−, —OC(O)OZ, —OC(S)OZ, wherein each X is independently a halogen selected from F, Cl, Br, or I; and each Z is independently —H, alkyl, alkenyl, alkynyl, heteroalkyl, heteroalkenyl, heteroalkynyl, aryl, heteroaryl, heterocyclyl, cycloalkyl, protecting group or prodrug moiety, while two Z bonded to a nitrogen atom can be taken together with the nitrogen atom to which they are bonded to form a heteroaryl, or heterocyclyl. Alkyl(ene), alkenyl(ene), and alkynyl(ene) groups may also be similarly substituted.
Any substituent designation that is found in more than one site in a compound of this invention shall be independently selected.
Substituents optionally are designated with or without bonds. Regardless of bond indications, if a substituent is polyvalent (based on its position in the structure referred to), then any and all possible orientations of the substituent are intended.
The term “heteroatom(s)” as used herein means an atom selected from nitrogen, which can be quaternized; oxygen; and sulfur, including sulfoxide and sulfone.
The term “hydroxyl” as used herein means —OH.
The term “carbonyl” as used herein means carbon atom bonded to oxygen with a double bond, i.e., C═O.
The term “amino” as used herein means the —NH2 group.
The term “imino” as used herein means the ═NH group.
The term “alkyl-imino”, as a group or part of a group, refers to a group having the formula ═N—Rb wherein Rb is alkyl as defined herein above.
The term “aryl-imino”, as a group or part of a group, refers to a group having the formula ═N—Rg wherein Rg is aryl as defined herein above.
The term “carboxyl” as used herein means —C(O)OH.
The term “hydroxycarbonylalkyl” as a group or part of a group, refers to a group having the formula —Rb—C(O)OH wherein Rb is alkyl as defined herein above. Non-limiting example of a hydroxycarbonylalkyl group includes e.g. hydroxycarbonylmethyl.
The term “hydroxycarbonylalkenyl” as a group or part of a group, refers to a group having the formula —Rd—C(O)OH wherein Rd is alkenyl as defined herein above. Non-limiting example of a hydroxycarbonylalkenyl group includes e.g hydroxycarbonylmethylene, hydroxycarbonylpropylene.
The compounds as defined herein can be prepared while using a series of chemical reactions well known to those skilled in the art. The compounds of interest having a structure according to the general formula (I), or general formula (II), or general formula (III), or general formula (IV), and all other formulas described herein and embodiments thereof can be prepared using a series of chemical reactions well known to those skilled in the art.
As used herein and unless otherwise stated, the term “enantiomer” means each individual optically active form of a compound as defined herein, having an optical purity or enantiomeric excess (as determined by methods standard in the art) of at least 80% (e.g. at least 90% of one enantiomer and at most 10% of the other enantiomer), preferably at least 90% and more preferably at least 98%.
The term “isomers” as used herein means all possible isomeric forms, including tautomeric and stereochemical forms, which the compounds of formulae herein may possess, but not including position isomers. Typically, the structures shown herein exemplify only one tautomeric or resonance form of the compounds, but the corresponding alternative configurations are contemplated as well. Unless otherwise stated, the chemical designation of compounds denotes the mixture of all possible stereochemically isomeric forms, said mixtures containing all diastereomers and enantiomers (since the compounds of formulae herein may have at least one chiral center) of the basic molecular structure, as well as the stereochemically pure or enriched compounds. More particularly, stereogenic centers may have either the R- or S-configuration, and multiple bonds may have either cis- or trans-configuration.
Pure isomeric forms of the said compounds are defined as isomers substantially free of other enantiomeric or diastereomeric forms of the same basic molecular structure. In particular, the term “stereoisomerically pure” or “chirally pure” relates to compounds having a stereoisomeric excess of at least about 80% (e.g. at least 90% of one isomer and at most 10% of the other possible isomers), preferably at least 90%, more preferably at least 94% and most preferably at least 97%. The terms “enantiomerically pure” and “diastereomerically pure” should be understood in a similar way, having regard to the enantiomeric excess, respectively the diastereomeric excess, of the mixture in question.
Separation of stereoisomers is accomplished by standard methods known to those skilled in the art. One enantiomer of a compound as defined herein can be separated substantially free of its opposing enantiomer by a method such as formation of diastereomers using optically active resolving agents. Separation of isomers in a mixture can be accomplished by any suitable method well known to those skilled in the art, including: (1) formation of ionic, diastereomeric salts with chiral compounds and separation by fractional crystallization or other methods, (2) formation of diastereomeric compounds with chiral derivatizing reagents, separation of the diastereomers, and conversion to the pure enantiomers, or (3) enantiomers can be separated directly under chiral conditions. Under method (1), diastereomeric salts can be formed by reaction of enantiomerically pure chiral bases such as brucine, quinine, ephedrine, strychnine, a-methyl-b-phenylethylamine (amphetamine), and the like with asymmetric compounds bearing acidic functionality, such as carboxylic acid and sulfonic acid. The diastereomeric salts may be induced to separate by fractional crystallization or ionic chromatography. For separation of the optical isomers of amino compounds, addition of chiral carboxylic or sulfonic acids, such as camphorsulfonic acid, tartaric acid, mandelic acid, or lactic acid can result in formation of the diastereomeric salts. Alternatively, by method (2), the substrate to be resolved may be reacted with one enantiomer of a chiral compound to form a diastereomeric pair. Diastereomeric compounds can be formed by reacting asymmetric compounds with enantiomerically pure chiral derivatizing reagents, such as menthyl derivatives, followed by separation of the diastereomers and hydrolysis to yield the free, enantiomerically enriched compound. A method of determining optical purity involves making chiral esters, such as a menthyl ester or Mosher ester, a-methoxy-a-(trifluoromethyl)phenyl acetate (Jacob III. (1982) J. Org. Chem. 47:4165), of the racemic mixture, and analyzing the NMR spectrum for the presence of the two atropisomeric diastereomers. Stable diastereomers can be separated and isolated by normal- and reverse-phase chromatography following methods for separation of atropisomeric naphthyl-isoquinolines (see e.g. WO 96/15111). Under method (3), a racemic mixture of two asymmetric enantiomers is separated by chromatography using a chiral stationary phase. Suitable chiral stationary phases are, for example, polysaccharides, in particular cellulose or amylose derivatives. Appropriate eluents or mobile phases for use in combination with said polysaccharide chiral stationary phases are hexane and the like, modified with an alcohol such as ethanol, isopropanol and the like.
The terms cis and trans are used herein in accordance with Chemical Abstracts nomenclature and include reference to the position of the substituents on a ring moiety. The absolute stereochemical configuration of the compounds of the formulae described herein may easily be determined by those skilled in the art while using well-known methods such as, for example, X-ray diffraction.
Also included within the scope of this invention are salts of compounds as defined herein with one or more amino acids, especially the naturally-occurring amino acids found as protein components. The amino acid typically is one bearing a side chain with a basic or acidic group, e.g., lysine, arginine or glutamic acid, or a neutral group such as glycine, serine, threonine, alanine, isoleucine, or leucine.
The term “pharmaceutically acceptable salts” as used herein means the therapeutically active non-toxic salt forms which the compounds as defined herein are able to form. Therefore, the compounds of this invention optionally comprise salts of the compounds herein, especially pharmaceutically acceptable non-toxic salts containing, for example, Na+, Li+, K+, Ca2+ and Mg2+. Such salts may include those derived by combination of appropriate cations such as alkali and alkaline earth metal ions or ammonium and quaternary amino ions with an acid anion moiety, typically a carboxylic acid. The compounds as defined herein may bear multiple positive or negative charges. The net charge of the compounds as defined herein may be either positive or negative. Any associated counter ions are typically dictated by the synthesis and/or isolation methods by which the compounds are obtained. Typical counter ions include, but are not limited to ammonium, sodium, potassium, lithium, halides, acetate, trifluoroacetate, etc., and mixtures thereof. It will be understood that the identity of any associated counter ion is not a critical feature as defined herein, and that the invention encompasses the compounds in association with any type of counter ion. Moreover, as the compounds can exist in a variety of different forms, the invention is intended to encompass not only forms of the compounds that are in association with counter ions (e.g., dry salts), but also forms that are not in association with counter ions (e.g., aqueous or organic solutions). Metal salts typically are prepared by reacting the metal hydroxide with a compound of this invention. Examples of metal salts which are prepared in this way are salts containing Li+, Na+, and K+. A less soluble metal salt can be precipitated from the solution of a more soluble salt by addition of the suitable metal compound. In addition, salts may be formed from acid addition of certain organic and inorganic acids to basic centers, typically amines, or to acidic groups. Examples of such appropriate acids include, for instance, inorganic acids such as hydrohalogen acids, e.g. hydrochloric or hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid and the like; or organic acids such as, for example, acetic, propanoic, hydroxyacetic, 2-hydroxypropanoic, 2-oxopropanoic, lactic, pyruvic, oxalic (i.e. ethanedioic), malonic, succinic (i.e. butanedioic acid), maleic, fumaric, malic, tartaric, citric, methanesulfonic, ethanesulfonic, benzenesulfonic, p-toluenesulfonic, cyclohexanesulfamic, salicylic (i.e. 2-hydroxybenzoic), p-aminosalicylic and the like. Furthermore, this term also includes the solvates which the compounds of formulae herein as well as their salts are able to form, such as for example hydrates, alcoholates and the like. Finally, it is to be understood that the compositions herein comprise compounds as defined herein in their unionized, as well as zwitterionic form, and combinations with stoichiometric amounts of water as in hydrates.
The compounds as defined herein also include physiologically acceptable salts thereof. Examples of physiologically acceptable salts of the compounds as defined herein include salts derived from an appropriate base, such as an alkali metal (for example, sodium), an alkaline earth (for example, magnesium), ammonium and NX4+ (wherein X is C1-C4 alkyl). Physiologically acceptable salts of an hydrogen atom or an amino group include salts of organic carboxylic acids such as acetic, benzoic, lactic, fumaric, tartaric, maleic, malonic, malic, isethionic, lactobionic and succinic acids; organic sulfonic acids, such as methanesulfonic, ethanesulfonic, benzenesulfonic and p-toluenesulfonic acids; and inorganic acids, such as hydrochloric, sulfuric, phosphoric and sulfamic acids. Physiologically acceptable salts of a compound containing a hydroxy group include the anion of said compound in combination with a suitable cation such as Na+ and NX4+ (wherein X typically is independently selected from H or a C1-C4 alkyl group). However, salts of acids or bases which are not physiologically acceptable may also find use, for example, in the preparation or purification of a physiologically acceptable compound. All salts, whether or not derived from a physiologically acceptable acid or base, are within the scope of the present invention.
Another embodiment of this invention relates to “pro-drug” forms of the compounds as defined herein. It may be desirable to formulate the compounds of the present invention in the form of a chemical species which itself is not significantly biologically-active, but which when delivered to the animal, mammal or human will undergo a chemical reaction catalyzed by the normal function of the body, said chemical reaction having the effect of releasing a compound as defined herein. The term “pro-drug” thus relates to these species which are converted in vivo into the active pharmaceutical ingredient. For the purpose of the present invention the term “prodrug”, as used herein, relates to an inactive or significantly less active derivative of a compound such as represented by the structural formulae herein described, which undergoes spontaneous or enzymatic transformation within the body in order to release the pharmacologically active form of the compound.
In certain embodiments, the present invention relates to a compound of formula (I), or of formula (II) or of formula (III) or of formula (IV) as defined herein, or as represented in Tables A to F, for use as a medicament. In certain embodiments, the present invention relates to a compound of formula (I), or of formula (II) or of formula (III) or of formula (IV) as defined herein, or as represented in Tables A to F, for use as a modulator of Rel hydrolase and/or synthetase activity. In certain embodiments, the present invention relates to a compound of formula (I), or of formula (II) or of formula (III) or of formula (IV) as defined herein, for use as an inhibitor of the Rel hydrolase and/or synthetase activity or as represented in Tables A to F,. In certain embodiments, the present invention relates to a compound of formula (I), or of formula (II) or of formula (III) or of formula (IV) as defined herein, or as represented in any of Tables A to F, for use as an activator of Rel hydrolase and/or synthetase activity. In certain embodiments, the present invention relates to a compound of formula (I), or of formula (II) or of formula (III) or of formula (IV) as defined herein, or as represented in any of Tables A to F, for use as an effector of the Rel hydrolase and/or synthetase activity.
Compounds identified by the methods as described herein such as the herein mentioned compounds of general formula (I), or (II), or (III) or (IV) may be included in a pharmaceutical formulation. Techniques regarding the formulation and administration of pharmaceutical compositions are known to a skilled person and have been described in the art (e.g. the reference book: Remington: The Science and Practice of Pharmacy, periodically revised).
In certain embodiments, the present invention relates to a compound of formula (I), or of formula (II) or of formula (III), or of formula (IV) as defined herein, or as represented in any of Tables A to F, for use in treating infections with antibiotic (multi)resistant bacteria. In certain embodiments, the present invention relates to a compound of formula (I), or of formula (II) or of formula (III) or of formula (IV) as defined herein, or as represented in any of Tables A to F, for use in treating infections with dormant, latent or persistent bacteria.
Therefore, the invention further relates to a method of treating or preventing infections with antibiotic (multi)resistant bacteria in a subject comprising a Rel modulator as described in any embodiment herein, or a pharmaceutical composition comprising a Rel modulator as described herein. A skilled person appreciates that in order to be effective, the Rel modulator has to be administered in a therapeutically effective amount to achieve a biological or medical response in a subject. Method and practices to determine therapeutically effective doses of a pharmaceutical active ingredient, in the context of the current specification the Rel modulator are known to a person skilled in the art. Evidently, the required dosage or amount that is needed to arrive at a therapeutically effective dose needs to be determined on a case-by-case and subject-to-subject basis. It is standard practice to adapt a dosage to a certain individual to obtain an optimal, i.e. ideal effect or response. Optimally, a considerable number of distinct parameters need to be assessed when determining an optimal dosage, or dosage schedule and include but are by no means limited to the nature and degree of the disease to be treated, gender of the subject, subject age, body weight, other medical indications, nutrition, mode of administration, metabolic state, interference or influence of efficacy by other pharmaceutically active ingredients, etc. Furthermore each antibiotic (multi)resistant bacteria or bacterial infection may have a certain intrinsic degree of responsiveness to the used Rel modulator. The pharmaceutical compositions as referred to herein may comprise at least one additional pharmaceutical active ingredient. In certain embodiments, the pharmaceutical formulation further comprises one or more non-active pharmaceutical ingredients or inactive ingredients, also known in the art as excipients. Furthermore, the formulation may comprise pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions, such as pH adjusting and buffering agents, preservatives, complexing agents, tonicity adjusting agents, wetting agents and the like.
Terms such as “subject”, “patient”, or “individual” may be used interchangeably herein and refer to animals, preferably warm-blooded animals, more preferably vertebrates, and even more preferably mammals. Preferred subjects are human subjects (Homo sapiens) including all genders and all age categories thereof. Adult subjects, elder subjects, newborn subjects, and foetuses are intended to be covered by the term “subject”.
The terms “treatment” or “treat” indicate the therapeutic treatment of an already developed disease or condition, such as the therapy of an (multi)resistant bacterial infection. Beneficial or desired clinical results may include, without limitation, alleviation of one or more symptoms or one or more biological markers, diminishment of extent of disease, stabilized (i.e., not worsening) state of disease, delay or slowing of disease progression, amelioration or palliation of the disease state, and the like. While the methods, uses, and modulators described herein act as a new class of therapeutics for (multi)resistant bacteria, it is evident that they may also be applied on bacteria or bacterial infections that do not show any antibiotic resistance, or only a limited degree of antibiotic resistance.
In certain embodiments, the Rel modulator used in a method of treatment is a Rel hydrolase and/or Rel synthetase inhibitor. In alternative embodiments, the Rel modulator used in the method of treatment is a Rel hydrolase and/or Rel synthetase activator. In certain embodiments, the method is directed to treatment of bacterial infections characterized by the presence of dormant, latent, or persistent bacteria. Also intended is treatment of a subject for treating or preventing infections with antibiotic (multi)resistant bacteria comprising at least two distinct Rel modulators as described herein. In certain embodiments, the Rel modulators as disclosed herein are used in conjunction with other distinct antibacterial molecules or compositions known in the art. In certain embodiments, the methods of treatment disclosed herein comprise use of at least one Rel modulator as disclosed herein and a distinct traditional antibacterial molecule or composition known in the art. In further embodiments, the traditional antibacterial molecule or composition acts on the ribosomal machinery of the bacteria. In certain further embodiments, the Rel modulator and the traditional antibacterial molecule or composition are used at distinct time points in therapy. In further embodiments, the traditional antibacterial molecule or composition and Rel modulator are used in an alternating manner. In certain embodiments, the subject is a subject diagnosed with a (multi)resistant bacterial infection. In certain embodiments, the bacterial infection is characterized by biofilm formation and/or deposition in said subject.
In certain aspects, the invention relates to the use of a Rel modulator as described herein, or a pharmaceutical composition comprising a Rel modulator as described in herein, for the manufacture of a medicament for the prevention or treatment of an antibiotic (multi)resistant bacterial infection. In certain embodiments, the use of a Rel modulator as described herein, or a pharmaceutical composition comprising a Rel modulator as described herein for the manufacture of a medicament for modulating the function of Rel is intended. Preferably, the activity of Rel is modulated by said Rel modulator or pharmaceutical composition in such a way that Rel hydrolase and/or Rel synthetase activity is inhibited or reduced. In alternative preferred embodiments, the activity of Rel is modulated by said Rel modulator or pharmaceutical composition in such a way that Rel hydrolase and/or Rel synthetase activity is upregulated. Furthermore, the use of a Rel modulator as disclosed herein for the manufacture of a medicament for the prevention or treatment of infections with dormant, latent, or persistent bacteria is also envisaged. Also intended is the use of a Rel modulator as described herein for the manufacture of a medicament for the prevention or treatment of infections characterized by biofilm formation.
In certain embodiments, the modulator of Rel as described herein is an inhibitor or Rel hydrolase and/or synthetase activity. In certain embodiments, the modulator of Rel is an inhibitor of Rel synthetase activity. In certain embodiments, the modulator of Rel is an effector of Rel hydrolase and/or synthetase activity. In certain embodiments, the modulator of Rel is an activator of Rel hydrolase activity. In certain embodiments, the modulator of Rel is an activator of Rel synthetase activity. In certain embodiments, the Rel modulator increases or decreases the Rel hydrolase activity with at least 30%, preferably at least 50%, at least 75%, most preferably at least 100% compared to Rel hydrolase activity in absence of said Rel modulator. In certain embodiments, the Rel modulator increases or decreases the Rel synthetase activity with at least 30%, preferably at least 50%, at least 75%, at least 100% compared to Rel synthetase activity in absence of said Rel modulator. In embodiments where the Rel modulator increases or upregulates Rel hydrolase or synthetase activity, said activity is upregulated by at least 1.5-fold, at least 2-fold, at least 3-fold, at least 5-fold, at least 10-fold, or more, when compared to the hydrolase or synthetase activity of Rel in identical conditions in absence of the Rel modulator.
“Effector” as used herein indicates a molecule which increases or decreases the activity of an enzyme by binding to the enzyme at a regulatory site which is optionally distinct from the catalytic site that binds the substrate. Hence, a skilled person appreciates that an effector molecule is a molecule that regulates the biological activity of a target protein by binding to said protein. Preferably, an effector is a “small” molecule as defined herein.
Therefore, a further aspect of the invention is directed to Rel modulators as described herein for use in treating infections with antibiotic (multi)resistant bacteria. In certain embodiments, the (multi)resistant bacteria of the bacterial infection are Gram-negative bacteria, Gram-positive bacteria, or a combination thereof. Non-limiting examples of (multi)resistant bacteria are found in Staphylococci, Enterococci, Gonococci, Streptococci, Salmonella, Mycobacteria, and numerous Gram-negative bacteria. In certain embodiments, the antibiotic (multi)resistant bacteria further are resistant to bacteriophages. By means of illustration and not limitation, specific examples of (multi)resistant bacteria are multiresistant tuberculosis bacterial strains, Common multidrug-resistant organisms are usually bacteria part of the following group of bacteria: Vancomycin-Resistant Enterococci, Methicillin-Resistant Staphylococcus aureus, Extended-spectrum β-lactamase producing Gram-negative bacteria, Klebsiella pneumoniae carbapenemase (KPC) producing Gram-negatives, Multidrug-Resistant Gram Negative (MDR GN) bacteria such as Enterobacter species, E. coli, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa. In certain embodiments, the bacterial infection is an infection caused by a bacteria or a combination of bacteria commonly annotated in the art as the ESKAPE group of bacteria (Boucher et al., Bad buds, no drugs: no ESKAPE! An update from the Infectious Diseases Society of America, Clinical infectious diseases, 2009). It is understood that the term “ESKAPE group” refers to a group of bacteria comprising Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa and Enterobacter species. A skilled person appreciates that the collection of bacterial strains and species that are annotated in the art as antibiotic (multi)resistant bacteria will evolve and likely expand over time, and that these organisms are also envisaged in this specification. It is evident that the Rel modulators as described herein are suitable for use in any bacterial infection that expresses a Rel protein as described herein.
Means and method to detect drug resistance and classify drug resistant bacteria have been described in the art and are therefore known to a skilled person (Fluit et al., Molecular detection of antimicrobial resistance, Clinical microbiology reviews, 2001).
In certain embodiments, (a portion of) antibiotic (multi)resistant bacteria in bacterial infections disclosed herein are dormant, latent, or persistent bacteria. In certain embodiments, a portion of the antibiotic (multi)resistant bacteria in bacterial are a combination of dormant, latent, persistent bacteria. In certain embodiments, a portion of the antibiotic (multi)resistant bacteria are metabolically active, i.e. displaying a normal metabolic state represented by a standard growth rate, and a distinct portion of said bacteria are characterized by a metabolically attenuated, or metabolically reduced, or metabolically inactive state. The terms dormant, latent and persistent are well known and characterized in detail in the art (e.g. in Cohen et al., Microbial persistence and the road to drug resistance, Cell host and microbe, 2013). It is evident that a bacterium may change one or more of its properties changing its state and is therefore to be classified as a dormant, latent or persistent bacterium at a later point in time, or vice versa no longer be classified as dormant, latent, or persistent bacterium.
Another aspect of the invention is directed to the use of the crystal structure of the Rel polypeptide as defined by the atomic coordinates presented in any one of Tables 1 to 4, or a subset thereof, or atomic coordinates which deviate from those in any one of Tables 1 to 4, or a subset thereof, by RMSD over protein backbone atoms by no more than 3 Å for designing and/or identifying a compound which modulates Rel hydrolase and/or synthetase activity. In certain embodiments, the use is directed to the design and/or identification of compounds which inhibit Rel hydrolase and/or Rel synthetase activity. In alternative embodiments, the use is directed to the design and/or identification of compounds which upregulate or enhance Rel hydrolase and/or Rel synthetase activity. In certain embodiments, the use is directed to designing and/or identifying allosteric Rel modulators. In certain embodiments, the use is directed to designing and/or identifying Rel modulators that bind to two distinct domain or regions of the Rel protein.
A different aspect of the invention regards a computer system comprising a database containing the atomic coordinates, or a subset thereof as defined in any one of Tables 1 to 4, stored on a computer readable storage medium, and a user interface to view the information. Also intended are data processing apparatuses, devices, and systems comprising a database containing the atomic coordinates, or a subset thereof as defined in any one of Tables 1 to 4, stored on a computer readable storage medium, and a user interface to view the information. Models and atomic coordinates as disclosed herein are typically stored on a machine-readable, or computer-readable medium which are known in the art and include as non-limiting examples magnetic or optical media and random-access or read-only memory, including tapes, diskettes, hard disks, CD-ROMs and DVDs, flash drives or chips, servers and the internet. In certain embodiments, the computer system comprises means for carrying out the methods as described herein. In certain embodiments, the computer system further comprises an input device to receive instructions from an operator. In certain embodiments, the computer system comprises and/or is connected to a remote data storage system, wherein the remote data storage system is located at a geographic location different from the location of the user interface to view the information. Said data storage system may be located in a network storage medium such as the internet, providing remote accessibility. In certain embodiments, the database comprised in the computer system is encrypted. In certain embodiments, the computer system has access to at least one database of compound structures, and a user can by appropriately instructing said computer system access said at least one database of compound structures. In certain embodiments, the compound, list of compounds, or compound database (also known as compound library) is loaded into the computer system by the operator. In alternative embodiments, the compound, list of compounds, or compound database is accessible by the computer system from a medium different than said computer system. In certain embodiments, the computer system comprises a processing unit to assess the degree of fit between any compound molecule loaded into the computer system and Rel. Also intended is a computer-readable storage medium comprising instructions which, when executed by a computer, causes the computer to carry out any one of the methods disclosed herein.
A further aspect relates to the use of a computer system as described herein for designing and/or identifying a compound (ligand) which modulates Rel activity. In certain embodiments, the use of said computer system is achieved by user input commands. In certain embodiments, the computer system comprises means to select candidate Rel modulators from a list of compounds, or a compound library. In certain embodiments, the computer system comprises means to select (a) candidate compound(s) and proposing structural changes to the at least one candidate compound to further increase the number of energetically favorable interactions between said compound and Rel and/or means to select (a) candidate compound(s) and proposing structural changes to the at least one candidate compound to reduce or eliminate structural interference between said candidate modulator and one or more Residues of Rel defined by the atomic coordinates in any one of Tables 1 to 4. When using a computer system as described herein, the user searching for Rel modulators, which may or may not be the operator of the computer is provided by an optionally printed list of candidate Rel modulator. The computer system provides the user with one or more candidate Rel modulators. In certain embodiments, the computer system can be used to only provide the uses with candidate compounds that inhibit Rel hydrolase and/or synthetase activity. In alternative embodiments, the computer system can be used to only provide the user with candidate compounds that upregulate Rel hydrolase and/or synthetase activity. In alternative embodiments, the computer system is used for designing and/or identifying an allosteric Rel modulator. In certain embodiments, the computer system is used to provide a visual representation, i.e. an image of the three-dimensional structure of Rel, optionally during interaction with the candidate Rel compound. In certain embodiments, a list of candidate Rel modulators is generated and stored, optionally sorted according to a scoring system as described herein, in an electronic file.
Further intended herein are the crystals comprising a Rel protein in one or more of its three-dimensional conformations. In a certain embodiment, a crystal of Rel in its unbound resting state is intended, comprising a structure characterized by the atomic coordinates or a subset thereof as defined in Table 1. It is understood that “unbound resting state” indicates the conformation the Rel protein adopts in absence of binding any Rel modulator or Rel substrate. In an alternative embodiment, a crystal of Rel in its synthetase active form is intended, comprising a structure characterized by the atomic coordinates or a subset thereof as defined in Table 2. In an alternative embodiment, a crystal structure of Rel in its hydrolase active form is intended, comprising a structure characterized by the atomic coordinates or a subset thereof as defined in Table 3. In yet an alternative embodiment, a crystal structure of Rel in its allosteric state is intended, comprising a structure characterized by the atomic coordinates or a subset thereof as defined in Table 4. The precise composition of the Rel crystals envisaged herein will depend on the method used to generate said crystal, and the prevalence of these different compositions are expected by a skilled person.
Any crystal structure disclosed herein is said to be characterized by, or conform to, or substantially conform to, a set or subset of atomic coordinates when a structure, or a substantial fragment of a structure has or falls within the limit RMSD value as disclosed herein. In a certain embodiment, at least 75%, preferably at least 80%, more preferably at least 90% of the crystal structure has the recited RMSD value. In certain embodiments, “substantially conform to” further refers to atoms of amino acid side chains. In this context, common amino acid side chains are side chains that are common between the structure substantially conform to a structure with particular atomic coordinates and structures being defined by said atomic coordinates of Tables 1, 2, 3, or 4.
Another aspect of the invention relates to a method for producing a medicament, pharmaceutical composition or drug, the process comprising providing a compound as described herein and preparing a medicament, pharmaceutical composition or drug containing said compound. In certain embodiments, the pharmaceutical composition is formulated into a unit dosage form, including but not limited to hard capsules, soft capsules, tablets, coated tablets such as lacquered tablets or sugar-coated tablets, granules, aqueous or oily solutions, syrups, emulsions, suspensions, ointments, pastes, lotions, gels, inhalants or suppositories. In certain embodiments, the pharmaceutical composition is administered systemically, however in alternative embodiments the pharmaceutical composition is administered locally. In further embodiments, the pharmaceutical composition is suitable for oral, rectal, bronchial, nasal, topical, buccal, sublingual, transdermal, vaginal or parenteral administration, or in a form suitable for administration by inhalation. In certain embodiments, depending on the specific administration route said pharmaceutical composition is suited for, the process further comprises addition of ingredients not considered an active pharmaceutical ingredient to improve administration of the pharmaceutical composition. The unit dosage form comprising the pharmaceutical composition may be characterized by an immediate release pattern, a delayed release pattern, or a sustained release pattern, which are each terms standardly used in the technical field of pharmacy and have been defined in Pharmacopeias published by government authorities or medical or pharmaceutical societies. A skilled person appreciates that these bodies of work are reference works in the field of pharmacy.
A further aspect of the invention is directed to a computer system, intended to generate three dimensional structural representations of a Rel enzyme, Rel enzyme homologues or analogues, complexes of Rel enzyme with binding compounds or modulators, or complexes of Rel enzyme homologues or analogues with compounds or modulators, or, to analyze or optimize binding of compounds or modulators to said Rel enzyme or homologues or analogues, or complexes thereof, the system containing computer-readable data comprising one or more of:
(a) the coordinates of the Rel enzyme structure, listed in any one of Tables 1 to 4, optionally varied by a root mean square deviation of residue backbone atoms of not more than 3 Å, or selected coordinates thereof;
(b) the coordinates of a Rel enzyme homologue or analogue generated by homology modeling of the target based on the data in (a);
(c) the coordinates of a candidate binding compound or modulator generated by interpreting X-ray crystallographic data or NMR data by reference to the coordinates of the Rel enzyme structure, listed in any one of Tables 1 to 4, optionally varied by a root mean square deviation of residue backbone atoms of not more than 3 Å, or selected coordinates thereof, and
(d) structure factor data derivable from the coordinates of (a), (b) or (c).
In certain embodiments, the computer system comprises data comprising any combination of (a), (b), (c), or (d). In further embodiments, the user is able to adjust, remove, or add further data to the computer system. In certain embodiments, the computer system is able to receive additional data, adjust data, or remove data pertaining to (a), (b), (c), or (d). In certain embodiments, the user is able to access synthesis protocols of compounds or modulators through the computer system. In certain embodiments, the computer system directs the user to a synthesis protocol.
The term ‘homolog’ used herein indicates a pair of genes, in the context of the invention a gene encoding Rel that are said or evidenced to have a shared ancestry. Homology as used in the art is typically indicative for a similar nucleotide sequence, or a nucleotide sequence encoding at least a similar amino acid sequence for both genes. A further sub classification of homology can be established which is then commonly based on orthology and paralogy. Genes are said to be orthologous if they share a common ancestral sequence and have been diverging from each other by at least one speciation event. Thus, orthologs arise when a species diverges into two separate species. In contrast, paralogous genes are genes that arise through duplication events in the last common ancestor of the species under investigation. “Analog” as used herein refers to at least two genes each present in distinct taxa that do not share a common ancestor but nevertheless have the same function, or share at least one common function. Sequence similarity of the gene or the gene product is not a prerequisite for two genes to be analogs.
A different aspect of the invention relates to a computer-readable storage medium, comprising a data storage material encoded with computer readable data, wherein the data comprises one or more of
(a) the coordinates of the Rel enzyme structure, listed in any one of Tables 1 to 4, optionally varied by a root mean square deviation of residue backbone atoms of not more than 3 Å, or selected coordinates thereof;
(b) the coordinates of a Rel enzyme homologue or analogue generated by homology modeling of the target based on the data in (a);
(c) the coordinates of a candidate binding compound or modulator generated by interpreting X-ray crystallographic data or NMR data by reference to the coordinates of the Rel enzyme structure, listed in any one of Tables 1 to 4, optionally varied by a root mean square deviation of residue backbone atoms of not more than 3 Å, or selected coordinates thereof, and
(d) structure factor data derivable from the coordinates of (a), (b) or (c).
In certain embodiments, the computer readable data is encrypted and requires authentication or authorization credentials from a user or second computer-readable storage system for a computer system to be able to access said data. In certain embodiments, the computer-readable storage medium is a physical storage medium. In alternative embodiments, the computer-readable storage medium is a non-physical storage medium or a storage medium perceived to be a non-physical storage medium (i.e. a cloud based storage medium).
In a different aspect the invention relates to a computer-readable storage medium comprising a data storage material encoded with a first set of computer-readable data comprising a Fourier transform of at least a portion of the structural coordinates of the Rel enzyme listed in any one of Tables 1 to 4, optionally varied by a root mean square deviation of residue backbone atoms of not more than 3 Å, or selected coordinates thereof; which data, when combined with a second set of machine readable data comprising an X-ray diffraction pattern of a molecule or molecular complex of unknown structure, using a machine programmed with the instructions for using said first set of data and said second set of data, can determine at least a portion of the structure coordinates corresponding to the second set of machine readable data. Fourier transformation in the context of the invention is to be interpreted as the application of a molecular-replacement approach.
As envisaged herein by the term “Fourier transform”, the three-dimensional transformation of a molecular model is calculated in a first step. Subsequently, the weighed reciprocal lattice is rotated according to the calculated transformation. Fourier transformation in molecular biology, and more specifically structure biology, has been described in the art (Rabinovich et al., Molecular replacement: the revival of the molecular Fourier transform method, Acta crystallographica section D biological crystallography, 1998). In certain embodiments, the X-ray diffraction pattern of a molecule or molecular complex of unknown structure is obtained by an apparatus operably coupled to said computer storage medium. In alternative embodiments, the X-ray diffraction pattern of a molecule or molecular complex of unknown structure is inputted to said computer-readable storage medium by user instructions. In yet alternative embodiments, the X-ray diffraction pattern of a molecule or molecular complex of unknown structure is retrieved by a computer system comprising the computer-readable storage medium from a public (accessible) database.
In certain embodiments, the computer system or computer-readable storage medium as described herein further comprises a database containing information on the three dimensional structure of candidate compounds or modulators which are small molecules. In certain embodiments, the computer system or computer-readable storage medium further comprises a means to retrieve information from public information databases on the three dimensional structure of candidate compounds or modulators, which are “small” molecules as defined herein, including the non-limiting examples of PubChem (https://pubchem.ncbi.nlm.nih.gov), the Zinc database (https://www.zinc.docking.org), and/or MolPort (https://www.molport.com). In certain embodiments, the computer system further generates information indicating which list or subset of atomic coordinates of any one of Tables 1, 2, 3, or 4 shows or is predicted to show the highest number of energetically favorable interactions with any candidate modulator assessed by said computer system. In certain embodiments the user receives an automatically generated list of candidate compounds ranked according to the number of energetically favorable interactions with the Rel protein as defined by each list or subset of atomic coordinates of any one of Tables 1 to 4. In further embodiments the computer system provides the user with a number of common structural groups any combination of candidate modulator may be differentiated by.
While the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications, and variations will be apparent to those skilled in the art in light of the foregoing description. Accordingly, it is intended to embrace all such alternatives, modifications, and variations as follows in the spirit and broad scope of the appended claims. The herein disclosed aspects and embodiments of the invention are further supported by the following non-limiting examples.
1. Structure Analysis of Rel Enzyme with Catalytically Engaged Synthetase or Nuclease Domains.
The paradigm of RSH catalysis is based on the structure of N-terminal region of S. dysgalactiae Rel (RelSeqNTD) solved more than a decade ago. RelSeqNTD is formed by two catalytic domains with opposing activities—ppGpp hydrolase (HD) and ppGpp synthetase (SYN) (Hogg et al., Conformational antagonism between opposing active sites in a bifunctional RelA/SpoT homolog modulates (p)ppGpp metabolism during the stringent response Cell, 2004). Serendipitously, the two RelSeqNTD molecules observed in the same crystal lattice were locked in contrasting conformations leading to the hypothesis of reciprocal regulation of SYN and HD domains in archetypical RSH enzymes. However, the enzyme contained a nucleotide bound in each active site in one of the conformations and only one active site occupied in the other one. This contradicts earlier observations that suggested catalysis was incompatible with the simultaneous activation of synthetase and hydrolase function (Avarbock et al., Cloning and characterization of a bifunctional RelA/SpoT homologue from Mycobacterium tuberculosis, Gene, 1999, and Mechold et al., Differential regulation by ppGpp versus pppGpp in Escheria coli, Nucleic acids research, 2016). Thus, to directly test this hypothesis it is essential to solve the structures of Rel enzymes with catalytically engaged SYN or HD domains.
To understand how nucleotide binding stimulates the enzymatic capacity of RSH enzymes, we took advantage of T. thermophilus Rel NTD (RelTtNTD, amino acid positions 1-355) as an experimental system. RelTtNTD hydrolysis activity is virtually undetectable at 4° C. (
From the structure of the RelTtNTD-ppGpp complex (
2. Characterization of Conformational Rearrangements During Substrate Binding.
The hydrolase active site has a remarkable distribution of surface electrostatics. The site consists of a deep and wide cavity with one half of the site positively charged and involved in the stabilization of the 5′ poly-phosphate groups of the substrate and the other predominately acidic and more directly involved in the 3′-pyrophosphate hydrolysis (
The 5′-pyrophosphate group of ppGpp is stabilized by the damping effect of K112, K143, R147 and K161 and projects towards the α-helix α6 (
To understand how substrates control the ppGpp synthesis by the SYN domain, we solved the structure of RelTtNTD in a post-catalytic (PC) state (
The PC active site of RelTtNTD resembles the pre-catalytic state observed in the structure of the RelP SAS enzyme from S. aureus, a single-domain (p)ppGpp synthetase-only RSH enzyme that lacks additional catalytic or regulatory domains. The overall interactions of ppGpNp with the synthetase active site are similar to those observed in the RelP-GTP-APCPP (PDBID 6EWZ) and RelP-pppGpp (PDBID 6EX0) complexes (
3. Investigation of Structure-Function Relationship by smFRET.
Our structural data suggest that the presence of nucleotides in either the hydrolase or the synthetase domain would prime the enzyme for that particular function, switching off the other catalytic site. Such an allosteric effect would manifest itself in the form of changes in the width of the conformational landscape that would tilt the dynamic equilibrium of the population ensemble towards the favoured state as a function of the concentration of the nucleotides in solution. We directly challenge this hypothesis using single-molecule fluorescence resonance energy transfer (smFRET). For this, we constructed RelTtNTD6/287, a variant of RelTtNTD that allows fluorescent labels to be attached at cysteine residues introduced at positions 6 and 287 (RDA
E predicted for RelTtNTD6/287 based on our crystal structures is 75 Å for the open form (RelTtNTD-AMP-ppGpNp complex) and 57 Å for the closed form (RelTtNTD-ppGpNp complex).
While in the presence of ppGpp, RelTtNTD6/287 shows a homogenous population with RDA
E of 64 Å (
RDA
E of 72 Å consistent with the opening of the enzyme (
RDA
E of 70 Å) (
To directly address the sequential binding hypothesis, we monitored the binding of these nucleotides using Isothermal Titration Calorimetry (ITC). GDP binds RelTtNTD with an affinity of 23 μM (
We hypothesized that the motions of the α6-α7 loop were coupled to the allosteric switching of the enzyme, constituting a crucial element of the intramolecular crosstalk between domains. In the hydrolase-ON (RelTtNTD-ppGpp complex, close state), this loop was projected away from the hydrolase active site allowing the binding of ppGpp whereas in the synthetase-ON state (RelTtNTD post-catalytic open state) α6-α7 moved towards the hydrolase active site precluding the binding of ppGpp, effectively switching off the hydrolase function. We used RelTtNTD6/124, a RelTtNTD variant fluorescently labeled via cysteine residues introduced at residue positions 6 and 124 (RDA
E 62 Å, indicating displacement of the loop away from the active site (
RDA
E of 55 Å) indicative of the loop movement towards the active site (
RDA
E estimates based on the structural data that predicts a distance between dyes of 51 Å for the open state and 60 Å for the closed. The removal of the Mn2+ ion from the hydrolase site by incubation with EDTA precluded the close conformation even in the presence of ppGpp (
4. Biological Significance of the Stretch/Recoil Mechanism.
The dynamic modulation of the cellular alarmone levels is paramount to maintenance of cellular homeostasis. Here we showed that Rel enzymes possessing active hydrolase and synthetase domains rely on additional levels of allosteric regulation besides the control that the C-terminal regulatory domains exert, which prevent the occurrence of futile catalytic cycles. The activation of one of the catalytic domains entails the physical blockade and active site misalignment of the other. Our results support the view that the allosteric motion of α-helices α6 and α7 is coupled to the catalytic cycle precluding or allowing the access of substrate to the hydrolase active site whereas the relative conformational state between domains regulates the synthetase function, hindering access to the synthetase site in the close state. This allosteric control provides a bona fide on/off switch that renders one domain completely blocked while the other is active (
The regulation of catalysis of bifunctional enzymes is usually dominated by allosteric transitions preventing the occurrence of futile cycles (Okar et al., PFK-2/FBPase-2: maker and breaker of the essential biofactor fructose-2,6-bisphosphate, Trends in biochemical sciences, 2001). The stretch/recoil mechanism we propose provides the basis for the macromolecular control of the enzyme by its opposing substrates. The synchronous action of this additional regulatory layer, together with the spatial control of the ribosome as the docking platform of the enzyme for (p)ppGpp synthesis (Arenz et al., Nucleic acids research, 2016, Brown et al., Nature 2016, and Loveland et al., Elife, 2016) or the PTS for ppGpp hydrolysis (Ronneau et al., Regulation of (p)ppGpp hydrolysis by a conserved archetypal regulatory domain, Nucleic acids research, 2019), denote a tightly regulated mechanism that prevents the occurrence of futile catalytic cycles and facilitates the action of the alarmone as a bacterial phenotypic switch.
5. Discovery of Novel ppGpp Allosteric Sites.
The alarmone nucleotides guanosine pentaphosphate (pppGpp) and tetraphosphate (ppGpp)—collectively referred to as (p)ppGpp—are central regulators of bacterial metabolism and stress responses, with effects on antibiotic tolerance and virulence (Gaca et al., Many means to a common end: the intricacies of (p)ppGpp metabolism and its control of bacterial homeostasis, J Bacteriology, 2015; Hauryliuk et al., Recent functional insights into the role of (p)ppGpp in bacterial physiology, Nat Rev Microbiol, 2015; Liu et al., Diversity in (p)ppGpp metabolism and effectors, Curr Opin Microbiol, 2015). The concentration of (p)ppGpp in the cell is controlled by enzymes belonging to the RelA/SpoT Homologue (RSH) protein family (Atkinson et al., The RelA/SpoT homolog (RSH) superfamily: distribution and functional evolution of ppGpp synthetases and hydrolases across the tree of life, PLoS One, 2011). RSH family members synthesize (p)ppGpp by transferring the pyrophosphate group of ATP onto either GTP or GDP, and/or degrade it by removing the diphosphate and converting the alarmone back to GTP or GDP. The pentaphosphate alarmone pppGpp is a much more potent activator than the tetraphosphate ppGpp (Kudrin et al., The ribosomal A-site finger is crucial for binding and activation of the stringent factor RelA, Nucleic Acids Res, 2018). However, the molecular details of this allosteric regulatory mechanism and the location of RelA's (p)ppGpp binding site have remained elusive due to the challenging nature of RelA (low solubility and stability) combined with its considerable structural complexity. Long RSHs are comprised of an N-terminal enzymatic half (N terminal domain region, NTD) and C-terminal regulatory half (C terminal domain region, CTD). The NTD contains two domains: the (p)ppGpp hydrolysis (HD) and (p)ppGpp synthesis (SYNTH) domains, while the CTD is comprised of the TGS (ThrRS, GTPase and SpoT), Helical, ZFD (Zinc Finger Domain; equivalent to CC, conserved cysteine as per (Atkinson et al., The RelA/SpoT homolog (RSH) superfamily: distribution and functional evolution of ppGpp synthetases and hydrolases across the tree of life, PLoS One, 2011)) and RRM (RNA recognition motif; equivalent to ACT, aspartokinase, chorismate mutase and TyrA, as per (Atkinson et al., The RelA/SpoT homolog (RSH) superfamily: distribution and functional evolution of ppGpp synthetases and hydrolases across the tree of life, PLoS One, 2011)).
Investigations of E. coli RelA (Agirrezabala et al., The ribosome triggers the stringent response by RelA via a highly distorted tRNA. EMBO Rep, 2013; Arenz et al., The stringent factor RelA adopts an open conformation on the ribosome to stimulate ppGpp synthesis. Nucleic Acids Res, 2016; Brown et al., Ribosome-dependent activation of stringent control. Nature, 2016; Gropp et al., Regulation of Escherichia coli RelA requires oligomerization of the C-terminal domain. 2001; Loveland et al., Ribosome*RelA structures reveal the mechanism of stringent response activation, Elife, 2016; Turnbull et al., Intramolecular Interactions Dominate the Autoregulation of Escherichia coli Stringent Factor RelA, Frontiers in Microbiology, 2019) as well as Rel enzymes from Bacillus subtilis by us, Mycobacterium tuberculosis (Avarbock et al., Functional regulation of the opposing (p)ppGpp synthetase/hydrolase activities of RelMtb from Mycobacterium tuberculosis, Biochemistry, 2005; Jain et al., Molecular dissection of the mycobacterial stringent response protein Rel, Protein Sci, 2006), Thermus thermophilus (Tamman et al., Nucleotide-mediated allosteric regulation of bifunctional Rel enzymes, BioRxiv, 2019) and Streptococcus equisimilis (Hogg et al., Conformational antagonism between opposing active sites in a bifunctional RelA/SpoT homolog modulates (p)ppGpp metabolism during the stringent response [corrected], Cell, 2004; Mechold et al., Intramolecular regulation of the opposing (p)ppGpp catalytic activities of Rel(Seq), the Rel/Spo enzyme from Streptococcus equisimilis, J Bacteriol, 2002) have been instrumental for our understanding of the molecular regulation of long ribosome-associated RSHs. Collectively these studies have established that i) the CTD mediates contact with the rRNA and a highly distorted A-site tRNA—so-called A/R tRNA—of the ‘starved’ ribosomal complex, ii) the CTD transmits the allosteric signal to regulate the synthetic activity of the NTD and, iii) the synthetic and hydrolytic activities of SYNTH and HD domains of the NTD allosterically oppose each other's enzymatic activities. This network of intramolecular allosteric regulation is likely to be exploited by (p)ppGpp in its positive regulation of RelA. We have identified a “hot spot” using hydrogen-deuterium exchange mass spectrometry (HDX-MS). This site a located in the region that connects the hydrolase and synthetase domains of RelANTD and consists of α-helices α8, α9, α10, α11′ and α12, involving residues K164, D200, Y201, R204, Y211, K212, H219, R221, R222 and R225. Y211 is particularly important for the interaction and substitutions different from F or H are not permissive decreasing significantly the turnover of the enzyme and its affinity for pppGpp.
6. Putative Novel Site that Stabilizes the Enzyme in a Resting-Like Inactive State (
The resting apo state of the catalytic NTD region of Rel/RelA enzymes is structurally conserved (
The structure of RelSaNTD in a catalytically resting apo-state resembles that of other nucleotidefree Rel and RelA enzymes, both in terms of the overall fold of its catalytic domains, and their relative conformational state. While in the apo-state the hydrolase (HD) domain coordinates the Mn2+ ion that is essential for catalysis, the active site is not properly organized. R51 is misaligned and hydrogen-bonds D85 instead of the conserved T158(9.5 Å in the apo-state). The H-bond to T158 is crucial to orient the guanidine group for coordinating the guanine of (p)ppGpp. In addition this free state includes the increased flexibility of the region involving residues 117-130 of α6-α7 region for which we could not observe any density in the crystal structure. Recent structural studies of T. thermophilus Rel NTD (RelTtNTD) have identified the α6-α7 structural element as an allosteric switch linking the activation of one of the catalytic domains with the inactivation of the other. The aforementioned disorder in in the active site of apo-RelSaNTD, suggests that the apo-state is not compatible with active hydrolysis as observed in the case of apo-RelTtNTD. Therefore substrate binding to the HD site is likely required to trigger a conformational change in Rel that aligns the active site and allows (p)ppGpp hydrolysis. In the other catalytic domain, the dimensions of the SYNTH active site of RelSaNTD (with a volume of around 700 Å3) also resemble more that of RelTtNTD in the SYNTH-OFF conformation (around 780 Å3) with a completely buried ATP binding site and only the GDP binding site being exposed, than that of the active site of RelTtNTD in the SYNTH-ON which is approximately 1800 Å3 (
Substrate binding is required but not sufficient to lock a particular active catalytic state to gain insight into the intra-molecular regulation of the Rel catalytic domains by nucleotide substrates, we solved structures of RelSaNTD in a bound to either GDP or pppGpp. The structure of the RelSaNTD:GDP complex is similar to that of GDP-bound S. equisimilis Rel NTD (RelSeqNTD). GDP binding triggers only minor conformational changes compared to the unbound resting enzyme. These changes observed in the RelSaNTD:GDP complex are not sufficient to stabilize a conformation that is fully compatible with the active synthetase state. Such conformation was observed in for T. thermophilus Rel bound to both ppGpp and AMP. In this post-catalytic state, the enzyme was in an open conformation that involved the rotation of the SYNTH domain with respect to the central linker region that connects both domains, which exposed the ATP binding site. The inefficacy of GDP to trigger this open conformation could have regulatory implications in the outcome of catalysis. Thus the preference for GDP or GTP would be decided by the molecule that is more effective in stabilizing the active SYNTH conformation, with the extra phosphate of GTP likely the crucial group to trigger the open state. The structure of T. thermophilus Rel bound with ppGpp bound in the HD active site shows the enzyme undergoes a conformational change that results in a more compact NTD. We solved the structure of RelSaNTD in complex with pppGpp (
The overall structure of the RelSaNTD:pppGpp complex is very similar to that of the resting state and the RelSaNTD:GDP complex. The presence of pppGpp in the HD active site triggers some notable conformational changes that are however not sufficient to induce a fully active hydrolase state as observed in the RelTtNTD:ppGpp complex. In general, Rel HD active sites are defined by a hydrophobic region that engages the base of the nucleotide and two opposing acid and basic sites that accommodate the 3′ and 5′ groups. In the RelSaNTD:pppGpp complex, the guanosine base of the alarmone is stacked between R51 (that now is within 3.3 Å of T158—compared to 9.5 Å in the apo-state and 9.0 Å in the GDP complex—) and M162 and makes additional Van der Waals interactions with K52, Y57, K59 and N155. As observed in RelSeq, the majority of hydrogen bond interactions of the base with the enzyme are via the backbone amide and carbonyl groups of α-helices α3 and α8. At the 5′ end the tri-phosphate group is exposed to the bulk solvent with only the α-phosphate interacting with R169 (
The lattice constrains and SYNTH-domain occupancy, combined with the observed HD active site disorder, likely resulted in a slow rate of hydrolysis that allowed us to capture a leaving product of the hydrolase reaction at the interface the between the two molecules of the asymmetric unit.
Interestingly the reaction product is the unusual GTP derivative, guanosine 5′-triphosphate-2′:3′-cyclic mono-phosphate (pppG2′:3′p) (
Taken together, our structural results underscore the complexity of the activation of the catalytic domains of Rel enzymes. While substrate binding has been shown to be crucial for priming the enzyme for a particular catalytic function (Tamman et al., Nucleotide-mediated allosteric regulation of bifunctional Rel enzymes, BioRxiv, 2019; Hogg et al., Conformational antagonism between opposing active sites in a bifunctional RelA/SpoT homolog modulates (p)ppGpp metabolism during the stringent response [corrected]. Cell, 2004), this event on its own is not sufficient to guarantee the complete stabilization of an active catalytic state. This leaves the enzyme vulnerable to inhibition due to the serendipitous binding of nucleotides in both catalytic domains. In RelSeq, the simultaneous occupancy of both active sites precludes the enzyme from adopting a fully active catalytic conformation. This seems to be the case for the RelSaNTD:pppGpp as well. Considering the high efficiency of the enzyme in vivo, it is likely that additional layers of regulation are in place to fully stabilize of one conformation depending on a particular cellular state and prevent the occupancy of both sites at the same time.
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
351A
| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/EP2020/059005 | 3/30/2020 | WO |