Broad spectrum inhibitors

Abstract
The invention features a method of designing broad spectrum inhibitors using structural data, compositions having broad spectrum activity, and methods for treating disease using those compositions.
Description


BACKGROUND OF THE INVENTION

[0002] The present invention relates to the field of inhibitors and methods for identifying or designing broad spectrum therapeutics for use in the treatment of infectious diseases and cancers, particularly where drug resistance is, or could reasonably predicted to be, an obstacle to successful long term therapy.


[0003] The development of drug resistance is one of the most common causes of drug failure in the treatment of diseases involving replicating biological entities (i.e., cancer and infectious diseases). Drug resistance often results from a reduction in drug-binding affinity and can be quantified by the ratio of drug binding affinity (Kd) for variant and wild type target proteins. Administration of a drug introduces a selective pressure upon the replicating biological entity. The result is the emergence of drug resistant strains.


[0004] Drug resistance is a major obstacle to the successful treatment of many cancers and infections, both bacterial and viral. For example, increased resistance of bacterial infections to antibiotic treatment has been extensively documented and has now become a generally recognized medical problem, particularly with nosocomial infections. See, for example, Jones et al., Diagn. Microbiol. Infect. Dis. 31:379 (1998); Murray, Adv. Intern. Med. 42:339 (1997); and Nakae, Microbiologia 13:273 (1997).


[0005] Drug resistance has complicated the treatment for HIV as new mutant strains of HIV have emerged that are resistant to multiple, structurally diverse, experimental and chemotherapeutic antiretrovirals, including HIV protease inhibitors (PIs), nucleoside and non-nucleoside HIV reverse transcriptase inhibitors (NRTIs and NNRTIs), and HIV fusion inhibitors (FIs).


[0006] More than 60 million people have been infected by HIV in the last two decades, and 20 million people have died from HIV/AIDS. While the development of highly active antiretrovirals to treat HIV/AIDS has led to significant reductions in the mortality and morbidity of AIDS, the rapid emergence and spread of drug-resistant mutant strains of HIV is rendering current drugs ineffective, and is the major cause of treatment failure. Recent estimates are that nearly 50% of drug-experienced patients in North America harbor HIV that is resistant to one or more of the 16 FDA-approved antiretroviral agents used in multi-drug ‘cocktails’ (Ref. dont have this ref). Moreover, it has been estimated that drug-resistant HIV accounts for up to 12% of new infections (Little et al., N. Engl. J. Med., 347:385 (2002)).


[0007] Accordingly, drug resistant HIV strains represent distinct infectious entities from a therapeutic viewpoint, and pose new challenges for drug design as well as drug treatment of existing infections. Substitutions have been documented in over 45 of the 99 amino acids of the HIV protease monomer in response to protease inhibitor treatment (Mellors, et al., International Antiviral News, 3:8 (1995); Eastman, et al., J. Virol., 72:5154 (1998); Kozal, et al., Nat. Med., 2:753 (1996)). The particular sequence and pattern of mutations selected by PIs is believed to be somewhat drug-specific and often patient-specific, but high level resistance is typified by multiple mutations in the protease gene which give rise to cross-resistance to all of the PIs.


[0008] In view of the foregoing problems, there exists a need for inhibitors against drug resistant and mdrHIV strains. Further, there exists a need for inhibitors against drug resistant and multi-drug resistant HIV proteases (mdrPR). Further still, there exists a need for inhibitors of HIV that can prevent or slow the emergence of drug resistant and mdrHIV strains in infected individuals.


[0009] Inhibitors with the ability to inhibit mdrHIV strains, and to slow the emergence of drug resistant strains in wild type HIV infections, are defined as “resistance-repellent” inhibitors.


[0010] There also exists a need for robust methods that can be used to design “resistance-repellent” inhibitors.


[0011] More generally, there is a need for therapeutic regimens that minimize the likelihood that resistance will occur in a disease involving a replicating biological entity. In one approach, drugs may be designed which have similar activity against both the wild type and mutant forms of their target. Such drugs minimize the probability of a mutant population emerging by reducing the selective pressure introduced by the drug when used to treat wild type infections. Such drugs also can be used to treat mutant infections and can be used for salvage therapy.


[0012] There is also an urgent need to develop potent, broad-spectrum, and mechanistically-novel antimicrobials suitable for tackling the growing problem of antibiotic-resistant bacteria strains, and for treating and/or preventing outbreaks of infectious diseases, including diseases caused by bioterrorism agents like anthrax, plague, cholera, gastroenteritis, multidrug-resistant tuberculosis (MDR TB). The recent anthrax attack of 2001 underscored the reality of large-scale aerosol bioweapons attack by terrorist groups. It also revealed that there is an urgent and pressing need to discover and develop novel and potent antimicrobials that can be used therapeutically and prophylactically for biodefense against new bioattacks. The NIH and CDC have identified a number of High Priority pathogens based on their likelihood of causing widespread contagious disease and/or death to the general population. Research on methods of protection against potential agents of bioterrorism has been a priority for several years at the NIH. A recent analysis suggested the existence of ongoing offensive biological weapons programs in at least 13 countries (Inglesby, T. V., et al., JAMA, 287:2236, (2002)).


[0013] The widespread use of antibiotics in human medical as well as in agricultural applications has promoted the emergence and spread of drug resistant bacteria that are no longer sensitive to existing drugs. The ease with which drug resistant microorganisms can be selected in a simple laboratory setting is a further concern when contemplating pharmaceutical-based strategies for biodefense. There is an urgent need to discover and develop novel therapeutic agents to combat pathogens that are likely to be used in a bioterrorist scenario.


[0014] A list of selected agents rated by likelihood to cause the greatest harm in a bioterrorist attack has been compiled by the CDC and NIAID (Lane, H. C., et al., Nat Med., 7:1271 (2001)). B. anthracis; the bacterium that causes anthrax, is one of the most serious of the group A pathogens. Dissemination of B. anthracis spores via the US Postal Service in 2001 established the feasibility of large-scale aerosol bioweapons attack. It has been estimated that between 130,000 and 3 million deaths would follow the release of 100 kg of B. anthracis, a lethality matching that of a hydrogen bomb (Inglesby, T. V., et al., JAMA, 287:2236, (2002)). Penicillin, doxycycline and ciprofloxacin are currently approved by the FDA for the treatment of inhalation anthrax infections. However, it was advised that antibiotic resistance to penicillin- and tetracycline-class antibiotics should be assumed following a terrorist attack (Inglesby, T. V., et al., JAMA, 281:1735-45 (1999)). Moreover, in vitro selection of a B. anthracis strain that is resistant to ofloxacin (a fluoroquinilone closely related to ciprofloxacin) has been reported (Choe, C. H., et al., Antimicrob. Agents. Chemother., 44:1766 (2000)). Following the anthrax attacks of 2001, the CDC advocated the use of a combination of 2-3 antibiotics. As a post-exposure prophylaxis, 60 days of treatment with ciprofloxacin is currently recommended. Strict compliance to this drug regimen is complicated by moderate to severe gastrointestinal tract intolerance.


[0015] Another group A pathogen, Y. pestis, is the causative agent of plague. If 50 kg of Y. pestis were released as an aerosol over a city of 5 million, pneumonic plague would afflict an estimated 150,000 individuals and result in 36,000 deaths (Inglesby, T. V., et al., JAMA, 283: 2281, (2000)). Streptomycin, tetracycline and doxycycline are the FDA-approved treatment for plague. Wide spread use of these antibiotics in the US raises concerns about possible resistance. A US-licensed, formaldehyde-killed whole bacilli vaccine was discontinued by its manufacturers in 1999 and is no longer available.


[0016]

C. jejuni
and V. cholerae are category B pathogens which can present a significant threat to the safety of food and water supplies. C. jejuni infections are one of the most commonly identified causes of acute bacterial gastroenteritis worldwide and area frequent cause of Traveler's diarrhea (Allos, B. M., Clin Infect Dis, 32:1201 (2001)). Currently, the CDC estimates that 2.4 million cases of Campylobacter infection occur in the United States each year, affecting almost 1% of the entire population. In the past few years, a rapidly increasing proportion of Campylobacter strains all over the world have been found to be fluoroquinolone-resistant. High rates of resistance make tetracycline, amoxicillin, ampicillin, metronidazole, and cephalasporins poor choices for the treatment of C. jejuni infections. All Campylobacter species are inherently resistant to vancomycin, rifampin, and trimethoprim. V. cholerae, a causative agent of cholera, is responsible for 120,000 deaths annually (Faruque, S. M., et al., Microbiol Mol Biol Rev, 62:1301 (1998)) and is characterized by a rapidly changing pattern of antibiotic resistance.


[0017] TB is one of the most common and devastating infectious diseases known to man. An estimated one third of the global population is infected with Mycobacteria tuberculosis. Eight million people develop an active infection and 2 million victims die yearly (Dye, C., et al., JAMA, 282:677 (1999.)). Currently, a combination of four drugs is recommended for TB treatment: isoniazid, rifampicin, pyrazinamide and ethambutole. The treatment course lasts 6 months. Such a multidrug combination together with the lengthy duration of treatment is prone to side-effects and adherence problems, which in turn can often lead to the development of drug resistance. The current drugs used to treat TB infections were introduced into clinical practice more than 30 years ago, in the absence of any knowledge of molecular mechanism. There is an urgent need to identify novel, effective, non-toxic and specific drugs that can shorten the duration of treatment, reduce side-effects, combat latent infection and reduce the spread of MDR TB strains. In addition, it is important to recognize the need for mechanistically novel drugs, i.e., antimicrobial agents that target biochemical pathways distinct from those of existing TB drugs, in order to be effective against MDR TB strains.


[0018] In summary, there is a clear need for the discovery of novel, non-toxic, broad spectrum antibiotics that can be used to (1) treat drug-resistant bacterial infections, and (2) protect civilians and military personnel in case of bioterrorist attacks. In one approach, drugs may be designed which have similar activity against both the wild type and variant forms of their target. Such drugs should minimize the probability of the emergence of mutant populations by reducing the selective pressure introduced by the drug when used to treat wild type infections. Such drugs also can be used to treat mutant infections and can be used for salvage therapy. In another approach, drugs may be designed which have similar activity against various isotypes of a homologous target. Such drugs can be used to treat multiple species of pathogenic microorganisms since they will be active against the target of each species. In a third approach, drugs can be designed that combine the properties and the uses of both of the above approaches.


[0019] There also exists a need for robust methods that can be used to design such broad spectrum antibiotics.



SUMMARY OF THE INVENTION

[0020] In a first aspect the invention features a method for the structure-based design of a drug that can act as an inhibitor of at least two different biological entities, the method comprising the steps of: (a) providing at least one structure of a wild type target protein or an inhibitor-wild type target protein complex; (b) providing at least one structure of a variant target protein or an inhibitor-variant target protein complex; (c) comparing at least one structure from step (a) with at least one structure from step (b) to determine whether there exists a common three-dimensionally conserved substructure comprising the atomic coordinates of the structurally conserved atoms of the inhibitors and structurally conserved atoms of the target proteins; and (d) if a conserved substructure exists, using the atomic coordinates of the conserved substructure to select a compound having atoms matching those of the structurally conserved atoms of the inhibitors, wherein the selection of the compound is performed using computer modeling.


[0021] The invention also features a method for the structure-based drug design of a broad spectrum compound, the method comprising the steps of: (a) providing at least one structure of a wild type target protein or an inhibitor-wild type target protein complex; (b) providing at least one structure of a variant target protein or an inhibitor-variant target protein complex; (c) comparing at least one structure from step (a) with at least one structure from step (b) to determine whether there exists a common three-dimensionally conserved substructure comprising the atomic coordinates of the structurally conserved atoms of the target proteins or a common three-dimensionally conserved substructure comprising the atomic coordinates of the structurally conserved atoms of the inhibitors and structurally conserved atoms of the target proteins; and (d) if a conserved substructure exists, using the atomic coordinates of the conserved substructure to select a compound having atoms matching those of the structurally conserved atoms of the inhibitors or to design a compound that binds to the target protein, wherein the selection of the compound is performed using computer modeling.


[0022] Desirably, the above method further comprises the steps of: (e) comparing at least one structure from step (a) with at least one structure from step (b) to determine whether there exists a three-dimensionally non-conserved substructure comprising the atomic coordinates of the structurally non-conserved atoms of the inhibitors and structurally non-conserved atoms of the target proteins; and (f) if a non-conserved substructure exists, using the atomic coordinates of the non-conserved substructure to reject a compound having atoms matching those of the structurally non-conserved atoms of the inhibitors, wherein the rejection of the compound is performed in conjunction with computer modeling.


[0023] In any of the above methods, at least two, four, six, or eight structures from step b can be used in step c. The methods can be applied using several structures, including at least two, four, six, or eight variant forms of the target protein.


[0024] The inhibitors used in the inhibitor-wild type target protein complex and the inhibitor-variant target protein complex can be the same or different. The inhibitors can be selected from competitive or noncompetitive inhibitors. Furthermore, the inhibitors can be selected from reversible, or irreversible inhibitors.


[0025] In any of the above methods, the variant target protein can be a homologous protein or a mutant protein.


[0026] In any of the above methods, the structures can be selected from crystal structures, NMR structures, computer models, any acceptable experimental, theoretical or computational method of deriving a three-dimensional representation of a structure, or a combination thereof.


[0027] Target proteins for use in the present invention include any therapeutically relevant protein. The target protein can be a viral, bacterial, protozoan, or fungal protein. In some instances, the target protein is one that is expressed in a neoplasm.


[0028] Preferably, the target protein can be an enzyme, a receptor, a structural protein, a component of a macromolecular complex, a component of a metabolic pathway, or an assembly of biological molecules. Desirably, the target protein is necessary for the survival of the replicating biological entity. For example, the target protein can be an enzyme selected from the group consisting of reverse transcriptases, proteases, DNA and RNA polymerases, methylases, oxidases, esterases, acyl transferases, helicases, topoisomerases, and kinases. The target protein can be a component of a metabolic pathway, such as the shikimate pathway. Desirable target proteins include HIV protease or 3-dehydroquinate dehydratase, among others.


[0029] Where the target protein is HIV protease, suitable inhibitors for use in the methods of the invention include those selected from the group consisting of indinavir, nelfinavir, ritonavir, saquinavir, amprenavir, lopinavir, and UIC-94003.


[0030] A broad spectrum protease inhibitor can be designed using the susbstructure of structurally conserved atoms described by the atomic coordinates in Table 8, which includes the structurally conserved atoms of the inhibitor and structurally conserved atoms of the protease. A broad spectrum protease inhibitor can also be designed using the structurally conserved atoms of the inhibitor alone. These are described by the atomic coordinates in Table 8.


[0031] A broad spectrum 3-dehydroquinate dehydratase inhibitor can be designed using the susbstructure of structurally conserved atoms described by the atomic coordinates in Table 12, which includes the structurally conserved atoms of the 3-dehydroquinate dehydratase. A broad spectrum 3-dehydroquinate dehydratase inhibitor can also be designed using the structurally conserved atoms of the inhibitor alone. These are described by the atomic coordinates in Table 12.


[0032] The invention also features compounds having a chemical structure selected using any of the methods above. Such compounds are broad spectrum inhibitors and have broad spectrum activity against replicating biological entities expressing a particular target protein. Thus, if the target protein is expressed by a microbe or a neoplasm, the compound will have broad spectrum activity against the microbe or neoplasm, respectively.


[0033] The invention features a compound having broad spectrum activity against HIV protease wherein the compound has a chemical structure selected using the methods above, including those methods utilizing the atomic coordinates of Table 8.


[0034] The invention features a compound having broad spectrum activity against 3-dehydroquinate dehydratase wherein the compound has a chemical structure selected using the methods above, including those methods utilizing the atomic coordinates of Table 12.


[0035] The compounds of the invention exclude bis-THF compounds (e.g., analogs of compounds 1 and 3) as described in J. Med. Chem. 39:3278-3290 (1996) (compounds 49-52 and 58-60), Bioorg. Med. Chem. Lett. 8:979-982 (1998), WO99/65870, U.S. Pat. No. 6,319,946, WO02/08657, WO02/092595, WO99/67417, EP00/9917, and WO00/76961; and also exclude fused ring THF structures as described in Bioorg. Med. Chem. Lett. 8:687-690 (1998) and U.S. Pat. No. 5,990,155.


[0036] For any of the broad spectrum inhibitors of the invention, broad spectrum activity can be measured by the ratio of the inhibitory concentrations of the broad spectrum inhibitor for the variant and wild type biological entities (IC50, variant/IC50, wild type). Desirably, the IC50, variant/IC50, wild type ratio for a broad spectrum inhibitor is less than 100, 80, 60, 40, 30, 20, 10, 8, 6, or, most desirably, less than 3.


[0037] A broad spectrum inhibitor can be active against several different mutant biological entities. Desirably, the inhibitor will have broad spectrum activity against at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 mutant biological entities.


[0038] A broad spectrum inhibitor can also be active against different organisms or neoplastic cell types expressing homologous target proteins that possess sufficient structural similarity. Desirably, the inhibitor will have broad spectrum activity against at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 18, or 20 different organisms or neoplastic cell types expressing homologous target proteins.


[0039] The invention also features a pharmaceutical composition that includes a broad spectrum inhibitor described herein in any pharmaceutically acceptable form, including isomers such as diastereomers and enantiomers, salts, solvates, and polymorphs thereof. The composition can include an inhibitor of the invention along with a pharmaceutically acceptable carrier or diluent.


[0040] The invention also features methods of treating disease in a patient in need thereof, which includes the administration of a pharmaceutical composition of the invention to the patient in an amount sufficient to treat the disease. The pharmaceutical composition includes any broad spectrum inhibitor described herein. Such broad spectrum inhibitors have broad spectrum activity against replicating biological entities expressing a particular target protein. Thus, if the target protein is expressed by a microbe or a neoplasm, the disease to be treated will be a microbial infection or neoplasm, respectively.


[0041] The invention features a method of treating an HIV infection in a patient in need thereof, the method including the step of administering to the patient a pharmaceutical composition including a broad spectrum protease inhibitor described herein in amounts effective to treat the HIV infection.


[0042] The invention features a method of treating a bacterial infection in a patient in need thereof, the method including the step of administering to the patient a pharmaceutical composition including a broad spectrum 3-dehydroquinate dehydratase inhibitor described herein in amounts effective to treat the bacterial infection. The bacterial infection to be treated using the above method can be caused by a bacterium selected from the group consisting of C. jejuni, V. cholerae, Y. pestis, B. anthracis, P. putidas, and M. tuberculosis. Furthermore, this method can be used to treat infections by any microbe the utilizes 3-dehydroquinate dehydratase.


[0043] The invention also features the use of a pharmaceutical composition described herein in the manufacture of a medicament for the treatment of a disease. The pharmaceutical composition includes any broad spectrum inhibitor described herein. Such broad spectrum inhibitors and have broad spectrum activity against replicating biological entities expressing a particular target protein. Thus, if the target protein is HIV protease or 3-dehydroquinate dehydratase, the disease to be treated will be an HIV infection or bacterial infection, respectively.


[0044] The term “replicating biological entity” includes, for example, bacteria, fungi, yeasts, viruses, protozoa, prions and neoplasms


[0045] Neoplasms include, for example, carcinomas of the bladder, breast, colon, kidney, liver, lung, head and neck, gall-bladder, ovary, pancreas, stomach, cervix, thyroid, prostate, or skin; a hematopoietic tumor of lymphoid lineage; a hematopoietic tumor of myeloid lineage; a tumor of mesenchymal origin; a tumor of the central or peripheral nervous system; melanoma; seminoma; teratocarcinoma; osteosarcoma; thyroid follicular cancer; and Kaposi's sarcoma. Hematopoietic tumors of lymphoid lineage can be leukemia, acute lymphocytic leukemia, acute lymphoblastic leukemia, B-cell lymphoma, T-cell-lymphoma, Hodgkin's lymphoma, non-Hodgkin's lymphoma, hairy cell lymphoma and Burkett's lymphoma.


[0046] By “wild type target protein” is meant a protein obtained from a replicating biological entity that has not been subjected to drug selection pressure, and could include polymorphisms or isoforms thereof. A replicating biological entity that expresses wild type target protein is referred to herein as a wild type biological entity.


[0047] By “variant target protein” is meant a mutant target protein or a homologous target protein. A replicating biological entity that expresses variant target protein is referred to herein as a variant biological entity.


[0048] By “mutant target protein” is meant a target protein that contains one or more amino acid substitutions with respect to the wild type target protein, including proteins from the same organism that have evolved under drug selection pressure. In general, mutant target proteins will have one or more amino acid substitutions and should be readily identified as related to the cognate wild type protein using standard sequence comparison methods. A replicating biological entity that expresses mutant target protein is referred to herein as a mutant biological entity.


[0049] By “homologous target protein” is meant a variant target protein that is expressed in a different species or neoplastic cell type than the wild type target protein, but has the same, or similar, function.


[0050] By “structurally conserved target substructure”, and by “structurally” or “three-dimensionally conserved substructure” as applied to target proteins, is meant the regions of the target protein structure which are not significantly affected by amino acid mutations or substitutions. Such regions can be defined using standard methods of comparative analysis of three-dimensional structures of proteins, such as superposition analysis, for example. In the case of HIV protease, these regions were identified using a pair wise superposition analysis of wild type and mutant protease structures complexed with inhibitors. The superposition of structures can be performed using the iterative procedure described herein. In the case of DHQase, these regions were identified using a pair wise superposition analysis of wild type and homologous DHQase structures from different bacterial species with and without inhibitors. It is apparent that the overall compositions of structurally conserved target substructures will likely differ for different, non-homologous target proteins, especially when the frequency of amino acid substitutions in high. However, a quantitative definition can be derived from the superposition analysis, which provides both the identities and the positions of the atoms that comprise these substructures. The regions that comprise structurally conserved target substructures contain atoms whose superimposed pairs have three-dimensional atomic coordinates that match to within a distance of 1 Å, 0.6 Å, 0.4 Å, or 0.2 Å.


[0051] By “broad spectrum inhibitor” is meant a compound having broad spectrum activity, i.e., an inhibitor that is active against two different-biological entities, e.g., both a wild type biological entity and one or more variants of that biological entity. Thus, broad spectrum activity can be described by the inhibitor's action against a particular target protein (e.g., broad spectrum activity against protease) or a particular target organism (e.g., broad spectrum activity against HIV). Broad spectrum inhibitors will have medically insignificant interactions with non-conserved regions. Broad spectrum inhibitors can be useful for the treatment and/or prevention of infectious diseases caused by multiple infectious agents, as well as for decreasing the development of drug-resistance by these organisms.


[0052] As used herein, the term “treating” refers to administering a pharmaceutical composition for prophylactic and/or therapeutic purposes. To “prevent disease” refers to prophylactic treatment of a patient who is not yet ill, but who is susceptible to, or otherwise at risk of, a particular disease. To “treat disease” or use for “therapeutic treatment” refers to administering treatment to a patient already suffering from a disease to ameliorate the disease and improve the patient's condition. Thus, in the claims and embodiments, treating is the administration to a patient either for therapeutic or prophylactic purposes.


[0053] The term “microbial infection” refers to the invasion of the host patient by pathogenic microbes (e.g., bacteria, fungi, yeasts, viruses, protozoa). This includes the excessive growth of microbes that are normally present in or on the body of a patient. More generally, a microbial infection can be any situation in which the presence of a microbial population(s) is damaging to a host patient. Thus, a patient is “suffering” from a microbial infection when excessive numbers of a microbial population are present in or on a patient's body, or when-the presence of a microbial population(s) is damaging the cells or other tissue of a patient.


[0054] The term “microbes” includes, for example, bacteria, fungi, yeasts, viruses and protozoa. The term “administration” or “administering” refers to a method of giving a dosage of a pharmaceutical composition to a patient, where the method is, e.g., topical, oral, intravenous, intraperitoneal, or intramuscular. The preferred method of administration can vary depending on various factors, e.g., the components of the pharmaceutical composition, site of the potential or actual disease and severity of disease.


[0055] The term “patient” includes humans, cattle, pigs, sheep, horses, dogs, and cats, and also includes other vertebrate, most preferably, mammalian species.


[0056] Where “atomic coordinates” are provided, or otherwise referred to, these coordinates define a three dimensional structure. That such a structure may be defined by more than one different coordinate system, e.g., by translation or rotation of the coordinates, does not change the relative positions of the atoms in the structure. Accordingly, any reference to atomic coordinates herein is intended to include any equivalent three dimensional structure defined by the coordinates.


[0057] By “computer modeling” is meant the use of a computer to visualize or compute a compound, a portion of a compound, a target protein, a portion of a target protein, a complex between a compound and a target protein, or a portion of a complex between a compound and a target protein.


[0058] Other features and advantages of the invention will be apparent from the following detailed description and from the claims.







BRIEF DESCRIPTION OF THE DRAWINGS

[0059]
FIG. 1 is a table depicting the structures of compounds 1-7, gt33, and qxa.


[0060]
FIG. 2 illustrates the amino acid alignment of type II DHQases. Fully conserved residues are framed. Catalytically important amino acids are marked by stars. Arrows denote amino acids that make hydrogen bonds and ionic interactions in the structure of M. tuberculosis DHQase complexed with the inhibitor, 3-dehydroquinic acid oxime.


[0061]
FIG. 3 illustrates the key interactions of the substrate-based inhibitor, DHQO, with the active site residues for the Type II DHQase from M. tuberculosis.







DETAILED DESCRIPTION

[0062] We have discovered that the comparative analysis of the structures of complexes of inhibitors bound to wild type and variant forms of a target protein can be used to design compounds that are broad spectrum inhibitors.


[0063] The methods of the invention entail the design of compounds having a particular structure. The methods rely upon the use of structural information to arrive at these compounds. The structural data define a three dimensional array-of the important contact atoms in an inhibitor that bind to the target protein in a fashion that results in broad spectrum activity against biological entities expressing variants of the target protein.


[0064] Inhibitor-Target Protein Structures


[0065] Atomic structural coordinates can be selected from crystal structures, NMR structures, computer models, any acceptable experimental, theoretical or computational method of deriving a three-dimensional representation of a structure, or a combination thereof. Atomic coordinates for use in the methods of the invention can be obtained from publicly available sources, e.g. from the Protein Data Bank, or obtained using known experimental or computational methods.


[0066] Atomic structural coordinates for use in the methods of the invention include crystal structures of HIV protease/inhibitor complexes derived from wild type and drug-resistant mutant proteases, and of DHQase and DHQase inhibitor complexes derived from two or more bacterial species, among others. In examples 1-3, the methods of the invention are applied using the coordinates of wild type HIV protease complexed with amprenavir, wild type HIV protease complexed with UIC-94003, and V82F/184V mutant HIV protease complexed with UIC-94003. In example 4, the methods of the invention are applied using the coordinates of wild type DHQase from M. tuberculosis and from Pseudomonas putidas. a complex between a compound and a target protein. The coordinates of other representative structures of HIV protease and DHQase should be useful for performing the methods of the present invention.


[0067] Conserved Substructures


[0068] Conserved substructures can be identified for target proteins, for target protein-inhibitor complexes, and/or for inhibitors, depending on the nature of the structures that are used in the comparative superposition analysis. In one approach, at least one structure of a wild type target protein is compared to at least one structure of a mutant or homologous target protein to determine whether a common three-dimensionally conserved substructure is present among the wild type protein and the mutant or homologous proteins, respectively. In another approach, at least one structure of an inhibitor-wild type target protein complex and at least one structure of an inhibitor-mutant target protein complex are compared to determine whether a common three-dimensionally conserved substructure is present among the mutant and wild type complexes. In a third approach, at least one structure of an inhibitor-wild type target protein complex and at least one structure of a mutant or homologous target protein without inhibitor are compared to determine whether a common three-dimensionally conserved substructure is present among the respective mutant or homologous protein and the wild type complexes. Variations of the approached described above can also be used. In each case, such a comparison can be made by means of (a) an overall superposition of the atoms of the protein structures; and, where feasible, (b) a study of the distances from atoms of the inhibitors to atoms of the protein. This analysis requires three-dimensional atomic coordinates of the protein structures and of the bound inhibitor.


[0069] The superposition of the protein structures can be performed in a two step process: 1) the distance between all pairs of corresponding Cá atoms (Cá atom of residue number 1 in one protein to Cá atom of residue number 1 in the second protein; Cá atom of residue number 2 in one protein to Cá atom of residue number 2 in the second protein; and so on) of the polypeptide chains is minimized by means of a least-square algorithm; 2) the superposition is refined by minimizing, in an iterative process, the distances between corresponding Cá atoms that are closer than a given distance (0.25 A for example), thus eliminating regions of the structures having large conformational differences to compute the superposition parameters. Furthermore, where a partial structure is provided (e.g., from NMR data) the available coordinates are superimposed.


[0070] The conserved substructure identifies the relevant portion of the target protein that is the active site, or binding region, defined by that part of the target protein interacting with inhibitor. Important interactions between the target protein and inhibitor are identified by mapping the contacts between the two. Structurally conserved regions of the target protein not near the binding site are generally not relevant to the design of the broad spectrum inhibitor. Accordingly, the selection of the meaningful substructure is identified using the above mentioned contacts.


[0071] Design of a Broad Spectrum Inhibitor


[0072] The coordinates of the conserved inhibitor substructure are used to design an inhibitor having atoms matching those of the three-dimensionally structurally conserved atoms of the inhibitors. The result is an inhibitor for which IC50, variant and IC50, wild type are similar, minimizing the selective pressure introduced by the drug.


[0073] The methods of the invention can employ computer-based methods for designing broad spectrum inhibitors. These computer-based methods fall into two broad classes: database methods and de novo design methods. In database methods the compound of interest is compared to all compounds present in a database of chemical structures and compounds whose structure is in some way similar to the compound of interest are identified. The structures in the database are based on either experimental data, generated by NMR or x-ray crystallography, or modeled three-dimensional structures based on two-dimensional (i.e., sequence) data. In de novo design methods, models of compounds whose structure is in some way similar to the compound of interest are generated by a computer program using information derived from known structures, e.g., data generated by x-ray crystallography and/or theoretical rules. Such design methods can build a compound having a desired structure in either an atom-by-atom manner or by assembling stored small molecular fragments.


[0074] The success of both database and de novo methods in identifying compounds having the desired activity depends on the identification of the functionally relevant portion of the compound of interest. The functionally relevant portion of the compound, the pharmacophore, is defined by the structurally conserved substructure. A pharmacophore then is an arrangement of structural features and functional groups important for obtaining an inhibitor having broad spectrum activity.


[0075] Not all identified compounds having the desired pharmacophore will act as broad spectrum inhibitors. The actual activity can be finally determined only by measuring the activity of the compound in relevant biological assays. However, the methods of the invention are extremely valuable because they can be used to greatly reduce the number of compounds which must be tested to identify those likely to exhibit broad spectrum activity.


[0076] Programs suitable for generating predicted three-dimensional structures from two-dimensional data include: Concord (Tripos Associated, St. Louis, Mo.), 3-D Builder (Chemical Design Ltd., Oxford, U.K.), Catalyst (Bio-CAD Corp., Mountain View, Calif.), and Daylight (Abbott Laboratories, Abbott Park, Ill.).


[0077] Programs suitable for searching three-dimensional databases to identify molecules bearing a desired pharmacophore include: MACCS-3D and ISIS/3D (Molecular Design Ltd., San Leandro, Calif.), ChemDBS-3D (Chemical Design Ltd., Oxford, U.K.), and Sybyl/3DB Unity (Tripos Associates, St. Louis, Mo.).


[0078] Programs suitable for pharmacophore selection and design include: DISCO (Abbott Laboratories, Abbott Park, Ill.), Catalyst (Bio-CAD Corp., Mountain View, Calif.), and ChemDBS-3D (Chemical Design Ltd., Oxford, U.K.).


[0079] Databases of chemical structures are available from Cambridge Crystallographic Data Centre (Cambridge, U.K.) and Chemical Abstracts Service (Columbus, Ohio).


[0080] De novo design programs include Ludi (Biosym Technologies Inc., San Diego, Calif.) and Aladdin (Daylight Chemical Information Systems, Irvine Calif.).


[0081] One skilled in the art may use one of several methods to screen chemical entities for their ability to match the conserved substructure. This process may begin by visual inspection of, for example, the active site on the computer screen based on the atomic coordinates for the target protein. Docking may be accomplished using software such as Quanta and Sybyl, followed by energy minimization and molecular dynamics with standard molecular mechanics forcefields, such as CHARMM and AMBER.


[0082] Specialized computer programs may also assist in the process of selecting chemical entities. These include:


[0083] 1. GRID (Goodford, P. J., “A Computational Procedure for Determining Energetically Favorable Binding Sites on Biologically Important Macromolecules,” J. Med. Chem., 28:849 (1985)). GRID is available from Oxford University, Oxford, UK.


[0084] 2. MCSS (Miranker, A. and M. Karplus, “Functionality Maps of Binding Sites: A Multiple Copy Simultaneous Search Method.” Proteins: Structure, Function, and Genetics, 11:29 (1991)). MCSS is available from Molecular Simulations, Burlington, Mass.


[0085] 3. AUTODOCK (Goodsell, D. S. and A. J. Olsen, “Automated Docking of Substrates to Proteins by Simulated Annealing,” Proteins: Structure, Function, and Genetics, 8:195 (1990)). AUTODOCK is available from Scripps Research Institute, La Jolla, Calif.


[0086] 4. DOCK (Kuntz, L. D. et al., “A Geometric Approach to Macromolecule-Ligand Interactions,” J. Mol. Biol., 161:269 (1982)). DOCK is available from University of California, San Francisco, Calif.


[0087] Once the conserved substructure for the inhibitor has been identified, the conserved atoms of the inhibitor can be selected for assembly into a single inhibitor. Assembly may be proceed by visual inspection of the relationship of the fragments to each other on the three-dimensional image displayed on a computer screen in relation to the structure coordinates of the target protein. This may be followed by manual model building using software such as Quanta or Sybyl.


[0088] Useful programs to aid one of skill in the art in assembly of the individual chemical entities or fragments include:


[0089] 1. CAVEAT (Bartlett, P. A. et al, “CAVEAT: A Program to Facilitate the Structure-Derived Design of Biologically Active Molecules”. In “Molecular Recognition in Chemical and Biological Problems,” Special Pub., Royal Chem. Soc., 78:182 (1989)). CAVEAT is available from the University of Calif., Berkeley, Calif.


[0090] 2. 3D Database systems such as MACCS-3D (MDL Information Systems, San Leandro, Calif.). This area is reviewed in Martin, Y. C., “3D Database Searching in Drug Design,” J. Med. Chem., 35:2145 (1992)).


[0091] 3. HOOK (available from Molecular Simulations, Burlington, Mass.).


[0092] Other molecular modeling techniques may also be employed in accordance with this invention. See, e.g., Cohen, N. C. et al., “Molecular Modeling Software and Methods for Medicinal Chemistry,” J. Med. Chem., 33:883 (1990). See also, Navia, M. A. and M. A. Murcko, “The Use of Structural Information in Drug Design,” Current Opinions in Structural Biology, 2:202 (1992).


[0093] Once a broad spectrum inhibitor has been optimally designed, as described above, substitutions may then be made in some of its atoms or side groups in order to improve or modify its binding properties. Generally, initial substitutions are conservative, i.e., the replacement group will have approximately the same size, shape, hydrophobicity and charge as the original group. It should, of course, be understood that components known in the art to alter conformation should be avoided.


[0094] In general, inhibitors designed using the methods of the invention can be tested for broad spectrum activity using any of the to in vitro and/or in vivo methods described below, among others.


[0095] Broad Spectrum Inhibitors


[0096] Broad spectrum inhibitors match the pharmacophore defined by the structurally conserved substructure. The pharmacophore is the arrangement of structural features and functional groups important for obtaining an inhibitor having broad spectrum activity. This pharmacophore is derived using structural data for known inhibitors complexed to a target protein. Accordingly, broad spectrum inhibitors will often be structurally related to known compounds lacking broad spectrum activity, but useful in the design of broad spectrum inhibitors using the methods disclosed herein. These known inhibitors serve as lead compounds for both the design and synthesis of a broad spectrum inhibitor. Using the synthetic methods for making the lead compounds and standard synthetic methods as described by, for example, J. March, Advanced Organic Chemistry: Reactions, Mechanisms and Structure,” John Wiley & Sons, Inc., 1992; T. W. Green and P. G. M. Wuts, “Protective Groups in Organic Synthesis” (2nd Ed.), John Wiley & Sons, 1991; and P. J. Kocienski, “Protecting Groups,” Georg Thieme Verlag, 1994, one can synthesize the broad spectrum inhibitors described herein.


[0097] Typically the lead compounds bear varied functional groups which are present in the pharmacophore, including hydrogen-bond donors, hydrogen-bond acceptors, ionic moieties, polar moieties, hydrophobic moieties, aromatic centers, and electron-donors and acceptors. These are linked by a structural scaffold which imparts the appropriate a three dimension arrangement of the functional groups.


[0098] Numerous modifications of the lead compound can be made using techniques known in the art. These include changing a functional group by replacing it with another moiety of the same group. For example, one hydrogen-bond donor may be substituted by another. A good hydrogen bond donor has an H atom bonded to a very electronegative atom (e.g., O—H or N—H). Examples of hydrogen-bond donors include alcohols, carboxylic acids, oximes, and amides, among others. Similarly, one hydrogen-bond acceptor may be substituted by another. A good hydrogen bond acceptor has an electronegative element with lone pairs (e.g., O, N, or F). Examples of hydrogen bond acceptors include water, halogen atoms, alcohols, amines, carbonyls, ethers, and amides, among others. It may also be desirable to alter the distance between functional groups in a lead compound. This is achieved by employing synthetic methods analogous to those used to prepare the lead compound, but replacing the scaffold with a structurally related scaffold that provides the desired distance (e.g., a scaffold that incorporates more or fewer atoms linking the relevant functional groups). In some instances it may also be desirable to alter the stereochemistry in a lead compound. This can be accomplished by employing racemic starting materials, or by employing reaction conditions that result in racemization of the relevant chiral center, followed by separation of the enantiomeric or diastereomeric mixture.


[0099] Assays


[0100] Inhibitors designed using the methods disclosed herein may be further assayed, using standard in vitro models or animal models, to evaluate therapeutic activity and toxicity. These assays are described in the literature and are familiar to those skilled in the art. These include but are not limited to assays for monitoring or measuring efficacy against HIV, bacteria, and neoplasms.


[0101] One skilled in the art will be familiar with methods of measuring the IC50's of a broad spectrum inhibitor described herein. The IC50 value is determined by plotting percent activity versus inhibitor concentration in the assay and identifying the concentration at which 50% of the activity (e.g., growth, enzymatic activity, protein production, etc.) remains. Inhibitors can be tested for antimicrobial activity against a panel of organisms according to standard procedures described by the National Committee for Clinical Laboratory Standards (NCCLS document 7-A3, Vol. 13, No. 25, 1993/NCCLS document M27-P, Vol. 12, No. 25, 1992). Inhibitors can be dissolved (0.1 μg/ml-500 μg/ml) in microbial growth media, diluted, and added to wells of a microtiter plate containing bacteria or fungal cells in a final volume of an appropriate media (Mueller-Hinton Broth; Haemophilus Test Media; Mueller-Hinton Broth+5% Sheep Blood; or RPMI 1690). Typically, the plates are incubated overnight at an appropriate temperature (30° C. to 37° C.) and optical densities (measure of cell growth) are measured using a commercial plate reader.


[0102] IC50 's for broad spectrum protease inhibitors can be measured against wild type HIV and clinically isolated mutant HIV isolates, utilizing the PHA-PBMC exposed to HIV-1 (50 TCID50 dose/X06 PBMC) as target cells and using the inhibition of p24 Gag protein production as an endpoint. The amounts of p24 antigen produced by the cells can be determined on day 7 in culture using a commercially available radioimmunoassay kit. Drug concentrations resulting in 50% inhibition (IC50's) of p24 antigen production can be determined by comparison with the p24 production level in drug-free control cell cultures.


[0103] Therapy


[0104] The invention features a method of identifying a compound having broad spectrum activity. Broad spectrum inhibitors of the present invention may be administered by any appropriate route for treatment or prevention of a disease or condition associated with a bacterial infection, viral infection, or neoplastic disorder, among others. These may be administered to humans, domestic pets, livestock, or other animals with a pharmaceutically acceptable diluent, carrier, or excipient, in unit dosage form. Administration may be topical, parenteral, intravenous, intra-arterial, subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, intraventricular, intracapsular, intraspinal, intracistemal, intraperitoneal, intranasal, aerosol, by suppositories, or oral administration.


[0105] Therapeutic formulations may be in the form of liquid solutions or suspensions; for oral administration, formulations may be in the form of tablets or capsules; and for intranasal formulations, in the form of powders, nasal drops, or aerosols.


[0106] Methods well known in the art for making formulations are found, for example, in “Remington: The Science and Practice of Pharmacy” (20th ed., ed. A. R. Gennaro AR., 2000, Lippincott Williams & Wilkins). Formulations for parenteral administration may, for example, contain excipients, sterile water, or saline, polyalkylene glycols such as polyethylene glycol, oils of vegetable origin, or hydrogenated napthalenes. Biocompatible, biodegradable lactide polymer, lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene copolymers may be used to control the release of the compounds. Nanoparticulate formulations (e.g., biodegradable nanoparticles, solid lipid nanoparticles, liposomes) may be used to control the biodistribution of the compounds. Other potentially useful parenteral delivery systems include ethylene-vinyl acetate copolymer particles, osmotic pumps, implantable infusion systems, and liposomes. Formulations for inhalation may contain excipients, for example, lactose, or may be aqueous solutions containing, for example, polyoxyethylene-9-lauryl ether, glycholate and deoxycholate, or may be oily solutions for administration in the form of nasal drops, or as a gel. The concentration of the broad spectrum inhibitor in the formulation will vary depending upon a number of factors, including the dosage of the drug to be administered, and the route of administration.


[0107] The broad spectrum inhibitor may be optionally administered as a pharmaceutically acceptable salt, such as a non-toxic acid addition salts or metal complexes that are commonly used in the pharmaceutical industry. Examples of acid addition salts include organic acids such as acetic, lactic, pamoic, maleic, citric, malic, ascorbic, succinic, benzoic, palmitic, suberic, salicylic, tartaric, methanesulfonic, toluenesulfonic, or trifluoroacetic acids or the like; polymeric acids such as tannic acid, carboxymethyl cellulose, or the like; and inorganic acid such as hydrochloric acid, hydrobromic acid, sulfuric acid phosphoric acid, or the like. Metal complexes include zinc, iron, and calcium, among others.


[0108] Administration of compounds in controlled release formulations is useful where the broad spectrum inhibitor has (i) a narrow therapeutic index (e.g., the difference between the plasma concentration leading to harmful side effects or toxic reactions and the plasma concentration leading to a therapeutic effect is small; generally, the therapeutic index, TI, is defined as the ratio of median lethal dose (LD50) to median effective dose (ED50)); (ii) a narrow absorption window in the gastro-intestinal tract; or (iii) a short biological half-life, so that frequent dosing during a day is required in order to sustain the plasma level at a therapeutic level.


[0109] Many strategies can be pursued to obtain controlled release in which the rate of release outweighs the rate of metabolism of the broad spectrum inhibitor. For example, controlled release can be obtained by the appropriate selection of formulation parameters and ingredients, including, e.g., appropriate controlled release compositions and coatings. Examples include single or multiple unit tablet or capsule compositions, oil solutions, suspensions, emulsions, microcapsules, microspheres, nanoparticles, patches, and liposomes.


[0110] Formulations for oral use include tablets containing the active ingredient(s) in a mixture with non-toxic pharmaceutically acceptable excipients. These excipients may be, for example, inert diluents or fillers (e.g., sucrose and sorbitol), lubricating agents, glidants, and antiadhesives (e.g., magnesium stearate, zinc stearate, stearic acid, silicas, hydrogenated vegetable oils, or talc).


[0111] Formulations for oral use may also be provided as chewable tablets, or as hard gelatin capsules wherein the active ingredient is mixed with an inert solid diluent, or as soft gelatin capsules wherein the active ingredient is mixed with water or an oil medium.


[0112] Pharmaceutical formulations of broad spectrum inhibitor described herein include isomers such as diastereomers and enantiomers, mixtures of isomers, including racemic mixtures, salts, solvates, and polymorphs thereof.


[0113] The formulations can be administered to human patients in therapeutically effective amounts. For example, when the broad spectrum inhibitor is an antimicrobial drug, an amount is administered which prevents, stabilizes, eliminates, or reduces a microbial infection. Typical dose ranges are from about 0.01 μg/kg to about 2 mg/kg of body weight per day. The exemplary dosage of drug to be administered is likely to depend on such variables as the type and extent of the disorder, the overall health status of the particular patient, the formulation of the compound excipients, and its route of administration. Standard clinical trials maybe used to optimize the dose and dosing frequency for any particular broad spectrum inhibitor.


[0114] The following examples are meant to illustrate, but in no way limit, the claimed invention.



EXAMPLE I

[0115] This example illustrates the method by which experimentally-determined crystal structures of the same inhibitor in complex with wild type and mutant species of HIV protease can be compared and analyzed for the existence of a three-dimensionally conserved substructure.


[0116] The structures of wild type HIV-1 protease and a mutant, V82F/184V, HIV-1 protease, both in complexes with the inhibitor shown in FIG. 1 were determined using conventional x-ray crystallography techniques. The structures were analyzed by means of (a) an overall superposition of the atoms of the protein structures; and, (b) a study of the distances from atoms of the inhibitors to atoms of the protein. This analysis requires three dimensional atomic coordinates of the protein structures and of the bound inhibitor.


[0117] The superposition of the protein structures was performed in a two step process: 1) the distance between all pairs of corresponding Cá atoms (Cá atom of residue number 1 in one protein to Cá atom of residue number 1 in the second protein; Cá atom of residue number 2 in one protein to Cá atom of residue number 2 in the second protein; and so on) of the polypeptide chains is minimized by means of a least-square algorithm; 2) the superposition is refined by minimizing, in an iterative process, the distances between corresponding Cá atoms that are closer than a given distance (0.25 Å in this example), thus eliminating regions of the structures having large conformational differences to compute the superposition parameters. The distances between equivalenced Cá atoms after the minimization procedure are shown in Table 4.
1TABLE 4Distances between equivalent Cá atomsMolecule 1: HIV-1 PR wt: 1Molecule 2: HIV-1 PR V82F/I84V mutant: 1Molecule 1Molecule 2distance [Å]CA PRO 1CA PRO 10.455CA GLN 2CA GLN 20.434CA ILE 3CA ILE 30.418CA THR 4CA THR 40.317CA LEU 5CA LEU 50.172CA TRP 6CA TRP 60.228CA GLN 7CA GLN 70.364CA ARG 8CA ARG 80.166CA PRO 9CA PRO 90.057CA LEU 10CA LEU 100.183CA VAL 11CA VAL 110.194CA THR 12CA THR 120.168CA ILE 13CA ILE 130.146CA LYS 14CA LYS 140.229CA ILE 15CA ILE 150.266CA GLY 16CA GLY 160.662CA GLY 17CA GLY 170.491CA GLN 18CA GLN 180.267CA LEU 19CA LEU 190.112CA LYS 20CA LYS 200.128CA GLU 21CA GLU 210.190CA ALA 22CA ALA 220.169CA LEU 23CA LEU 230.218CA LEU 24CA LEU 240.233CA ASP 25CA ASP 250.160CA THR 26CA THR 260.200CA GLY 27CA GLY 270.303CA ALA 28CA ALA 280.169CA ASP 29CA ASP 290.150CA ASP 30CA ASP 300.038CA THR 31CA THR 310.047CA VAL 32CA VAL 320.173CA LEU 33CA LEU 330.194CA GLU 34CA GLU 340.310CA GLU 35CA GLU 350.260CA MET 36CA MET 360.136CA SER 37CA SER 370.494CA LEU 38CA LEU 380.607CA PRO 39CA PRO 390.094CA GLY 40CA GLY 400.774CA ARG 41CA ARG 410.448CA TRP 42CA TRP 420.204CA LYS 43CA LYS 430.596CA PRO 44CA PRO 440.625CA LYS 45CA LYS 450.541CA MET 46CA MET 460.643CA ILE 47CA ILE 470.361CA GLY 48CA GLY 480.240CA GLY 49CA GLY 490.182CA ILE 50CA ILE 500.110CA GLY 51CA GLY 510.243CA GLY 52CA GLY 520.200CA PHE 53CA PHE 530.119CA ILE 54CA ILE 540.255CA LYS 55CA LYS 550.295CA VAL 56CA VAL 560.108CA ARG 57CA ARG 570.129CA GLN 58CA GLN 580.074CA TYR 59CA TYR 590.372CA ASP 60CA ASP 600.496CA GLN 61CA GLN 610.780CA ILE 62CA ILE 620.406CA LEU 63CA LEU 630.211CA ILE 64CA ILE 640.260CA GLU 65CA GLU 650.193CA ILE 66CA ILE 660.181CA CYS 67CA CYS 670.518CA GLY 68CA GLY 680.641CA HIS 69CA HIS 690.319CA LYS 70CA LYS 700.179CA ALA 71CA ALA 710.265CA ILE 72CA ILE 720.350CA GLY 73CA GLY 730.253CA THR 74CA THR 740.301CA VAL 75CA VAL 750.187CA LEU 76CA LEU 760.186CA VAL 77CA VAL 770.070CA GLY 78CA GLY 780.306CA PRO 79CA PRO 790.047CA THR 80CA THR 800.470CA PRO 81CA PRO 810.404CA VAL 82CA PHE 820.556CA ASN 83CA ASN 830.146CA ILE 84CA VAL 840.196CA ILE 85CA ILE 850.163CA GLY 86CA GLY 860.224CA ARG 87CA ARG 870.127CA ASN 88CA ASN 880.048CA LEU 89CA LEU 890.081CA LEU 90CA LEU 900.197CA THR 91CA THR 910.226CA GLN 92CA GLN 920.176CA ILE 93CA ILE 930.151CA GLY 94CA GLY 940.338CA CYS 95CA CYS 950.233CA THR 96CA THR 960.305CA LEU 97CA LEU 970.089CA ASN 98CA ASN 980.260CA PHE 99CA PHE 990.250CA PRO 101CA PRO 1010.227CA GLN 102CA GLN 1020.108CA ILE 103CA ILE 1030.206CA THR 104CA THR 1040.169CA LEU 105CA LEU 1050.125CA TRP 106CA TRP 1060.363CA GLN 107CA GLN 1070.296CA ARG 108CA ARG 1080.400CA PRO 109CA PRO 1090.173CA LEU 110CA LEU 1100.182CA VAL 111CA VAL 1110.085CA THR 112CA THR 1120.123CA ILE 113CA ILE 1130.107CA LYS 114CA LYS 1140.368CA ILE 115CA ILE 1150.226CA GLY 116CA GLY 1160.638CA GLY 117CA GLY 1170.516CA GLN 118CA GLN 1180.414CA LEU 119CA LEU 1190.102CA LYS 120CA LYS 1200.191CA GLU 121CA GLU 1210.206CA ALA 122CA ALA 1220.197CA LEU 123CA LEU 1230.231CA LEU 124CA LEU 1240.145CA ASP 125CA ASP 1250.235CA THR 126CA THR 1260.311CA GLY 127CA GLY 1270.200CA ALA 128CA ALA 1280.102CA ASP 129CA ASP 1290.143CA ASP 130CA ASP 1300.261CA THR 131CA THR 1310.172CA VAL 132CA VAL 1320.232CA LEU 133CA LEU 1330.103CA GLU 134CA GLU 1340.175CA GLU 135CA GLU 1350.190CA MET 136CA MET 1360.220CA SER 137CA SER 1370.739CA LEU 138CA LEU 1380.277CA PRO 139CA PRO 1390.325CA GLY 140CA GLY 1400.390CA ARG 141CA ARG 1410.174CA TRP 142CA TRP 1420.168CA LYS 143CA LYS 1430.304CA PRO 144CA PRO 1440.194CA LYS 145CA LYS 1450.456CA MET 146CA MET 1460.362CA ILE 147CA ILE 1470.178CA GLY 148CA GLY 1480.390CA GLY 149CA GLY 1490.434CA ILE 150CA ILE 1500.050CA GLY 151CA GLY 1510.199CA GLY 152CA GLY 1520.152CA PHE 153CA PHE 1530.455CA ILE 154CA ILE 1540.198CA LYS 155CA LYS 1550.470CA VAL 156CA VAL 1560.590CA ARG 157CA ARG 1570.607CA GLN 158CA GLN 1580.465CA TYR 159CA TYR 1590.301CA ASP 160CA ASP 1600.294CA GLN 161CA GLN 1610.308CA ILE 162CA ILE 1620.274CA LEU 163CA LEU 1630.235CA ILE 164CA ILE 1640.367CA GLU 165CA GLU 1650.410CA ILE 166CA ILE 1660.201CA CYS 167CA CYS 1670.409CA GLY 168CA GLY 1680.406CA HIS 169CA HIS 1690.410CA LYS 170CA LYS 1700.282CA ALA 171CA ALA 1710.273CA ILE 172CA ILE 1720.317CA GLY 173CA GLY 1730.563CA THR 174CA THR 1740.129CA VAL 175CA VAL 1750.237CA LEU 176CA LEU 1760.155CA VAL 177CA VAL 1770.240CA GLY 178CA GLY 1780.386CA PRO 179CA PRO 1790.340CA THR 180CA THR 1800.335CA PRO 181CA PRO 1810.446CA VAL 182CA PHE 1820.343CA ASN 183CA ASN 1830.205CA ILE 184CA VAL 1840.262CA ILE 185CA ILE 1850.096CA GLY 186CA GLY 1860.118CA ARG 187CA ARG 1870.202CA ASN 188CA ASN 1880.073CA LEU 189CA LEU 1890.108CA LEU 190CA LEU 1900.127CA THR 191CA THR 1910.177CA GLN 192CA GLN 1920.175CA ILE 193CA ILE 1930.241CA GLY 194CA GLY 1940.118CA CYS 195CA CYS 1950.375CA THR 196CA THR 1960.437CA LEU 197CA LEU 1970.167CA ASN 198CA ASN 1980.178


[0118] Table 4 shows that the 184V, V82F mutations induce structural changes relative to the wild type structure in some parts of the enzyme, but that other regions are less affected. The regions of the protein structure which are not significantly affected by the amino acid mutations are defined as structurally conserved regions. In the present example, the mutations result in localized structural changes in the backbone of HIV protease over a wide range, from 0.038-0.774 Å.


[0119] The distances between the strongly interacting atoms of the inhibitor to atoms of the wild type and mutant protein, that is hydrogen-bond donors and acceptors, were computed and they are displayed in Table 5.
2TABLE 5Distances between atoms of the inhibitor and atoms ofthe proteinHIV PR wt: 1V82F/I84V: 1O2-Wat3012.922.89N1-0273.363.46O6-N303.303.6106-N293.193.55O7-N292.842.87O7-OD1 293.423.54O7-O13.313.19O3-OD 25 (out)2.502.94O3-OD 25 (in)2.652.67O3-OD125 (out)3.273.21O3-OD125 (in)2.802.67O5-Wat3012.702.79O8-N1303.162.96


[0120] Table 5 shows that the atoms of the inhibitor interact with the same atoms of the two different proteins, in this case the wild type and V82F/184V mutant HIV proteases. From Table 5, it can be seen that the atoms of the enzymes with which the inhibitor interacts belong to the structurally conserved regions. The effects of mutations on the protein-inhibitor interactions can be quantified in terms of the distances between interacting pairs of atoms from the inhibitor and from atoms of the three-dimensionally conserved substructure of the protein. These distances are similar in the wild type and in the mutant complexes; the average of their differences is only 0.07 Å. The range of the differences is 0.02-0.36 Å.



EXAMPLE 2

[0121] This example illustrates the method by which experimentally-determined crystal structures of two different inhibitors in complexes with wild type HIV protease can be compared and analyzed for the existence of a three-dimensionally conserved substructure. The structures of wild type HIV-1 protease in complexes with inhibitor 1 and with Amprenavir (inhibitor 2) were analyzed by means of (a) an overall superposition of the protein structures; and (b) a study of the distances from atoms of the inhibitors to atoms of the protein.


[0122] The superposition of the protein structures is performed in a two step process: 1) the distance between all pairs of corresponding Cá atoms (Cá atom of residue number 1 in one protein to Cá atom of residue number 1 in the second protein; Cá atom of residue number 2 in one protein to Cá atom of residue number 2 in the second protein; and so on) of the polypeptide chains is minimized by means of a least-square algorithm; 2) the superposition is refined by minimizing, in an iterative process, the distances between corresponding Cá atoms that are closer than a given distance (0.25 Å in this example), thus eliminating regions of the structures having large conformational differences to compute the superposition parameters. The distances between equivalenced Cá atoms after the minimization procedure are shown in Table 6.
3TABLE 6Distances between equivalent Cá atomsMolecule 1: HIV-1 PR wt: 1Molecule 2: HIV-1 PR wt: 2Molecule 1Molecule 2distance [Å]CA PRO 1CA PRO 10.200CA GLN 2CA GLN 20.320CA ILE 3CA ILE 30.147CA THR 4CA THR 40.405CA LEU 5CA LEU 50.225CA TRP 6CA TRP 60.296CA GLN 7CA GLN 70.317CA ARG 8CA ARG 80.154CA PRO 9CA PRO 90.143CA LEU 10CA LEU 100.259CA VAL 11CA VAL 110.275CA THR 12CA THR 120.307CA ILE 13CA ILE 130.207CA LYS 14CA LYS 140.273CA ILE 15CA ILE 150.434CA GLY 16CA GLY 160.469CA GLY 17CA GLY 170.414CA GLN 18CA GLN 180.319CA LEU 19CA LEU 190.161CA LYS 20CA LYS 200.155CA GLU 21CA GLU 210.196CA ALA 22CA ALA 220.338CA LEU 23CA LEU 230.246CA LEU 24CA LEU 240.292CA ASP 25CA ASP 250.142CA THR 26CA THR 260.109CA GLY 27CA GLY 270.176CA ALA 28CA ALA 280.193CA ASP 29CA ASP 290.087CA ASP 30CA ASP 300.118CA THR 31CA THR 310.111CA VAL 32CA VAL 320.087CA LEU 33CA LEU 330.306CA GLU 34CA GLU 340.333CA GLU 35CA GLU 350.399CA MET 36CA MET 360.296CA SER 37CA SER 370.454CA LEU 38CA LEU 380.451CA PRO 39CA PRO 390.397CA GLY 40CA GLY 400.444CA ARG 41CA ARG 410.535CA TRP 42CA TRP 420.346CA LYS 43CA LYS 430.442CA PRO 44CA PRO 440.548CA LYS 45CA LYS 450.307CA MET 46CA MET 460.320CA ILE 47CA ILE 470.403CA GLY 48CA GLY 480.237CA GLY 49CA GLY 490.280CA ILE 50CA ILE 500.206CA GLY 51CA GLY 510.368CA GLY 52CA GLY 520.315CA PHE 53CA PHE 530.378CA ILE 54CA ILE 540.180CA LYS 55CA LYS 550.149CA VAL 56CA VAL 560.302CA ARG 57CA ARG 570.098CA GLN 58CA GLN 580.219CA TYR 59CA TYR 590.279CA ASP 60CA ASP 600.385CA GLN 61CA GLN 610.431CA ILE 62CA ILE 620.343CA LEU 63CA LEU 630.473CA ILE 64CA ILE 640.344CA GLU 65CA GLU 650.456CA ILE 66CA ILE 660.481CA CYS 67CA CYS 670.920CA GLY 68CA CLY 680.999CA HIS 69CA HIS 690.295CA LYS 70CA LYS 700.406CA ALA 71CA ALA 710.446CA ILE 72CA ILE 720.374CA GLY 73CA GLY 730.259CA THR 74CA THR 740.276CA VAL 75CA VAL 750.165CA LEU 76CA LEU 760.220CA VAL 77CA VAL 770.202CA GLY 78CA GLY 780.231CA PRO 79CA PRO 790.131CA THR 80CA THR 800.374CA PRO 81CA PRO 810.472CA VAL 82CA VAL 820.554CA ASN 83CA ASN 830.149CA ILE 84CA ILE 840.261CA ILE 85CA ILE 850.223CA GLY 86CA GLY 860.130CA ARG 87CA ARG 870.165CA ASN 88CA ASN 880.103CA LEU 89CA LEU 890.072CA LEU 90CA LEU 900.076CA THR 91CA THR 910.114CA GLN 92CA GLN 920.115CA ILE 93CA ILE 930.204CA GLY 94CA GLY 940.220CA CYS 95CA CYS 950.068CA THR 96CA THR 960.185CA LEU 97CA LEU 970.095CA ASN 98CA ASN 980.311CA PHE 99CA PHE 990.216CA PRO 101CA PRO 1010.455CA GLN 102CA GLN 1020.121CA ILE 103CA ILE 1030.120CA THR 104CA THR 1040.109CA LEU 105CA LEU 1050.128CA TRP 106CA TRP 1060.205CA GLN 107CA GLN 1070.229CA ARG 108CA ARG 1080.211CA PRO 109CA PRO 1090.195CA LEU 110CA LEU 1100.135CA VAL 111CA VAL 1110.086CA THR 112CA THR 1120.166CA ILE 113CA ILE 1130.199CA LYS 114CA LYS 1140.333CA ILE 115CA ILE 1150.356CA GLY 116CA GLY 1160.671CA GLY 117CA GLY 1170.709CA GLN 118CA GLN 1180.370CA LEU 119CA LEU 1190.258CA LYS 120CA LYS 1200.156CA GLU 121CA GLU 1210.250CA ALA 122CA ALA 1220.276CA LEU 123CA LEU 1230.103CA LEU 124CA LEU 1240.112CA ASP 125CA ASP 1250.078CA THR 126CA THR 1260.057CA GLY 127CA GLY 1270.121CA ALA 128CA ALA 1280.098CA ASP 129CA ASP 1290.190CA ASP 130CA ASP 1300.302CA THR 131CA THR 1310.073CA VAL 132CA VAL 1320.178CA LEU 133CA LEU 1330.147CA GLU 134CA GLU 1340.239CA GLU 135CA GLU 1350.101CA MET 136CA MET 1360.235CA SER 137CA SER 1370.391CA LEU 138CA LEU 1380.364CA PRO 139CA PRO 1390.532CA GLY 140CA GLY 1400.213CA ARG 141CA ARG 1410.448CA TRP 142CA TRP 1420.133CA LYS 143CA LYS 1430.195CA PRO 144CA PRO 1440.082CA LYS 145CA LYS 1450.359CA MET 146CA MET 1460.306CA ILE 147CA ILE 1470.076CA GLY 148CA GLY 1480.214CA GLY 149CA GLY 1490.205CA ILE 150CA ILE 1500.163CA GLY 151CA GLY 1510.287CA GLY 152CA GLY 1520.318CA PHE 153CA PHE 1530.125CA ILE 154CA ILE 1540.189CA LYS 155CA LYS 1550.384CA VAL 156CA VAL 1560.510CA ARG 157CA ARG 1570.405CA GLN 158CA GLN 1580.139CA TYR 159CA TYR 1590.361CA ASP 160CA ASP 1600.252CA GLN 161CA GLN 1610.414CA ILE 162CA ILE 1620.337CA LEU 163CA LEU 1630.202CA ILE 164CA ILE 1640.359CA GLU 165CA GLU 1650.463CA ILE 166CA ILE 1660.347CA CYS 167CA CYS 1670.256CA GLY 168CA GLY 1680.471CA HIS 169CA HIS 1690.658CA LYS 170CA LYS 1700.489CA ALA 171CA ALA 1710.445CA ILE 172CA ILE 1720.396CA GLY 173CA GLY 1730.523CA THR 174CA THR 1740.130CA VAL 175CA VAL 1750.156CA LEU 176CA LEU 1760.077CA VAL 177CA VAL 1770.129CA GLY 178CA GLY 1780.276CA PRO 179CA PRO 1790.272CA THR 180CA THR 1800.580CA PRO 181CA PRO 1810.436CA VAL 182CA VAL 1820.328CA ASN 183CA ASN 1830.180CA ILE 184CA ILE 1840.151CA ILE 185CA ILE 1850.104CA GLY 186CA GLY 1860.059CA ARG 187CA ARG 1870.058CA ASN 188CA ASN 1880.183CA LEU 189CA LEU 1890.164CA LEU 190CA LEU 1900.051CA THR 191CA THR 1910.216CA GLW 192CA GLN 1920.162CA ILE 193CA ILE 1930.158CA GLY 194CA GLY 1940.047CA CYS 195CA CYS 1950.050CA THR 196CA THR 1960.200CA LEU 197CA LEU 1970.165CA ASN 198CA ASN 1980.074


[0123] The distances between the atoms of the inhibitors 1 and 2 to atoms of the protein, that is, hydrogen-bond donors and acceptors, were computed and are shown in Table 7.
4TABLE 7Distances between atoms of inhibitors and atoms ofthe proteinsWt: 1 complexWt: 2 complexO2-Wat3012.923.02N1-0273.363.58O6-N303.303.5006-N293.193.51O7-N292.84O7-OD1 293.42O7-O13.31O3-OD 25 (out) A2.502.80O3-OD 25 (in) A2.652.66O3-OD 25 (out) B3.273.07O3-OD 25 (in) B2.802.68O5-Wat3012.702.77O8-N 303.16N3-N 303.17N3-OD2 303.15


[0124] Inhibitors 1 (FIG. 1) and 2 (Amprenavir) have similar structural elements, in particular their core, i.e. groups at the P1-P1′60 positions. However, 2 has a THF group while 1 has a bis-THF group at the P2′ position. The P2 groups are identical except for the substitution of an ether oxygen atom in 1 as compared to an amine nitrogen atom at the same position in 2. Table 7 shows that 1 forms more interactions with the atoms of the protein that were previously identified as belonging to the structurally conserved substructure than does compound 2. For example, the O7 oxygen atom in compound 1, that forms an interaction with N29 nitrogen of the protease, has no counterpart in compound 2. Instead, the O6 oxygen atom of 2 forms longer (and presumably weaker) hydrogen bonds with both N30 (3.50 Å) and N29 (3.51 Å). In contrast, the O6 oxygen of compound 1 forms a shorter (and presumably stronger) hydrogen bond with N29 (3.19 Å). Additionally, as can be seen in Table 7, where both compounds 1 and 2 form interactions with atoms in the structurally conserved substructure of HIV protease, the distances between interacting atoms are consistently shorter for compound 1, indicative of presumably stronger binding interactions.


[0125] Examples 1 and 2 were used to identify a three dimensionally-conserved substructure of HIV protease that is involved in the binding of HIV protease inhibitors and, in particular, to identify atoms of these substructural elements that are involved in forming interactions with atoms of HIV protease inhibitors. This substructure is defined by the set of atomic coordinates (in orthogonal coordinates) provided in Table 8 and any equivalent set derived by applying arbitrary rotations and translations to the set of atomic coordinates in Table 8. The values of the coordinates (X,Y,Z) of the atoms defining the substructure are affected by a standard error σ. Therefore (X,Y,Z) values for each atom are those defined in the intervals (X−σ, X+σ) for coordinate X, (Y−σ, Y+σ) for coordinate Y, and (Z−σ, Z+σ) for coordinate Z.
5TABLE 8Three dimensionally-conserved substructure of HIV proteaseAtomX [Å]Y [Å]Z [Å]σ [Å]DescriptionSubstructure of the protein atomsOxygen−7.913.627.40.5Oxygen atom of water moleculecoordinated to main chain amidenitrogen atoms of amino acidGly 49 and Gly 149O27−13.817.730.40.5Main Chain carbonyl oxygenatom of amino acid Gly 27N29−13.418.234.50.5Main chain amide nitrogenatom of amino acid Asp 29N30−11.918.636.70.5Main chain amide nitrogenatom of amino acid Asp 30OD1 25−11.321.228.70.5Carboxylate oxygen atom ofaminoacid Asp 25OD2 25−9.420.429.30.5Carboxylate oxygen atom ofaminoacid Asp 25OD1 125−12.720.326.40.5Carboxylate oxygen atom ofaminoacid Asp 125OD2 125−12.720.326.40.5Carboxylate oxygen atom ofaminoacid Asp 125N129−8.920.520.70.5Main chain amide nitrogen atomof amino acid Asp 129N130−10.119.518.60.5Main chain amide nitrogen atomof amino acid Asp 130Substructure of the inhibitor atomsHydrogen−8.817.525.70.5Interacting with main chainBondcarbonyl oxygen atom of aminodonoracid Gly 27AtomHydrogen−8.515.325.10.5Interacting with Oxygen atom ofBondwater molecule coordinated toacceptormain chain amide nitrogenAtomatoms of amino acid Gly 49 andGly 149Hydrogen−10.419.127.40.5Interacting with carboxylateBondoxygen atoms of aminoacidsdonor-Asp 25 and Asp 125acceptorAtomHydrogen−8.914.029.80.5Interacting with Oxygen atom ofBondwater molecule coordinated toacceptormain chain amide nitrogenAtomatoms of amino acid Gly 49 andGly 149Hydrogen−8.617.320.70.5Main chain amide nitrogen atomBondof amino acid Asp 30acceptorAtomHydrogen−6.918.721.40.5Interacting with main chainBondamide nitrogen atom of aminoacceptoracid Asp 29AtomO8−10.715.835.80.5Interacting with main chainamide nitrogen atom of aminoacid Asp 130



EXAMPLE 3

[0126] The following example demonstrates that a protease inhibitor that contains atoms that can make favorable interactions with the atoms of the substructure may exhibit broad spectrum activity.


[0127] Compounds 1 and 3 contain a Bis-THF group at the P2 position that contains two atoms, in particular, hydrogen bond acceptor oxygen atoms, that can form hydrogen bonds with the two hydrogen atoms attached to the backbone amide nitrogen atoms on the protein at residues 29 and 30. Compound 2 is similar to 1 except that 2 contains a THF group at P2 with only a single hydrogen bond acceptor oxygen atom. All three compounds differ in the P2′ substituent. Compounds 1 and 3 both are unaffected by the two active site mutations, V82F and 184V, and Ki values for wild type and mutant enzymes are similar for both compounds. In contrast, compound 2, which contains only a single hydrogen bond acceptor atom in the P2 substitutent, is dramatically affected by the active site mutations, which demonstrate high level resistance to 2.


[0128] The antiviral activity of compounds 1 and 3 against HIV derived from patient isolates that contain multiple mutations are equivalent to their activity against wild type HIV strains. In contrast, compound 2 is much less effective against the same mutant viruses. None of the patients from whom virus was isolated had ever been exposed to any of the compounds tested herein. Nonetheless, compound 2 exhibited cross resistance to these virus strains that is typically seen with all clinically useful HIV protease inhibitors −4 (Saquinavir), 5 (Ritonavir), 6 (Indinavir) and 7 (Nelfinavir). Compounds 2, 4, 5, 6, and 7 have very different chemical structures, but nonetheless behave as a single class with respect to their antiviral behavior against wild type and multidrug resistant HIV strains. All compounds are dramatically less potent against the multidrug resistant strains of HIV.


[0129] In sharp contrast, compounds 1 and 3, which closely resemble each other as well as compound 2, exhibit broad spectrum activity in that they are equally effective against wild type and mutant HIV strains that exhibit high level multidrug resistance towards compounds 2, 4, 5, 6, and 7. The broad spectrum activity of compound 1 was completely unexpected and contrasts with the common and typical loss of antiviral potency experienced with compounds like 2, 4, 5, 6, 7, and indeed most other HIV protease inhibitors represented as similar or different structures that have been reported.


[0130] The development and application of the 3D motif method described above successfully revealed the presence of a unique, three dimensionally-conserved substructure of HIV protease that is useful in the design of broad spectrum inhibitors. Based on this method, compound 3 was predicted, on the basis of comparative molecular modeling using the coordinates of the complexes of compound 1 with wild type and V82F/184V mutant HIV proteases, to be able to make the same key interaction as compound 1 and thereby to exhibit broad spectrum activity. Based on these data, it is feasible to design protease inhibitors that are predicted to have broad spectrum activity, and are predicted to be useful for the treatment of both wild type (first line therapy) and drug resistant (salvage therapy) HIV infections.



EXAMPLE 4

[0131] This example illustrates the method by which experimentally-determined crystal structures of two different target proteins, DHQases, from two different bacterial species can be compared and analyzed for the existence of a three-dimensionally conserved substructure even in the absence of readily discernible or statistically significant sequence similarity. DHQases from different bacterial species typically exhibit less than 30% sequence identity (FIG. 2). A schematic map showing the key interactions of the substrate-based inhibitor, DHQO, with the active site residues for the Type II DHQase from M. tuberculosis is provided in FIG. 3.


[0132] The structures of wild type DHQase from M. tuberculosis and a homologous DHQase from Pseudomonas putidas were determined using conventional x-ray crystallography techniques. The structures were analyzed by means of (a) an overall superposition of the atoms of the protein structures. This analysis requires three dimensional atomic coordinates of the protein structures.


[0133] The superposition of the protein structures was performed in a two step process: 1) the distance between all pairs of corresponding Cá atoms (Cá atom of residue number 1 in one protein to Cá atom of residue number 1 in the second protein; Cá atom of residue number 2 in one protein to Cá atom of residue number 2 in the second protein; and so on) of the polypeptide chains is minimized by means of a least-square algorithm; 2) the superposition is refined by minimizing, in an iterative process, the distances between corresponding Cá atoms that are closer than a given distance (0.4 Å in this example), thus eliminating regions of the structures having large conformational differences to compute the superposition parameters. The distances between equivalenced Cá atoms after the minimization procedure are shown in Table 9.
6TABLE 9Distances between equivalent Cá atomsMolecule 1: DHQase P. putida wt: qxaMolecule 2: DHQase M. tuberculosis wt: gt33Molecule 1Molecule 2distance [Å]CA MET 2CA GLU 21.078CA ALA 3CA LEU 31.504CA THR 4CA ILE 41.800CA LEU 5CA VAL 51.283CA LEU 6CA ASN 60.911CA VAL 7CA VAL 70.715CA LEU 8CA ILE 80.298CA HIS 9CA ASN 90.211CA GLY 10CA GLY 100.591CA PRO 11CA PRO 110.599CA ASN 12CA ASN 120.487CA LEU 13CA LEU 130.428CA ASN 14CA GLY 140.229CA LEU 15CA ARG 150.685CA LEU 16CA LEU 160.541CA GLY 17CA GLY 171.693CA THR 18CA ARG 182.287CA ARG 19CA ARG 192.956CA GLN 20CA GLN 203.475CA PRO 21CA PRO 213.390CA GLY 22CA ALA 224.037CA THR 23CA VAL 233.770CA TYR 24CA TYR 242.521CA GLY 25CA GLY 251.170CA SER 26CA GLY 261.642CA THR 27CA THR 271.454CA THR 28CA THR 281.532CA LEU 29CA HIS 291.471CA GLY 30CA ASP 301.632CA GLN 31CA GLU 311.966CA ILE 32CA LEU 321.586CA ASN 33CA VAL 331.875CA GLN 34CA ALA 342.230CA ASP 35CA LEU 352.343CA LEU 36CA ILE 361.927CA GLU 37CA GLU 372.284CA ARG 38CA ARG 382.980CA ARG 39CA GLU 392.917CA ALA 40CA ALA 402.719CA ARG 41CA ALA 413.367CA GLU 42CA GLU 423.534CA ALA 43CA LEU 433.281CA GLY 44CA GLY 443.161CA HIS 45CA LEU 452.899CA HIS 46CA LYS 461.844CA LEU 47CA ALA 471.599CA LEU 48CA VAL 481.201CA HIS 49CA VAL 492.053CA LEU 50CA ARG 501.045CA GLN 51CA GLN 510.266CA SER 52CA SER 520.300CA ASN 53CA ASP 530.282CA ALA 54CA SER 540.348CA GLU 55CA GLU 550.326CA TYR 56CA ALA 560.238CA GLU 57CA GLN 570.380CA LEU 58CA LEU 580.455CA ILE 59CA LEU 590.413CA ASP 60CA ASP 600.984CA ARG 61CA TRP 611.452CA ILE 62CA ILE 621.338CA HIS 63CA HIS 631.310CA ALA 64CA GLN 642.327CA ALA 65CA ALA 652.526CA ARG 66CA ALA 663.063CA ASP 67CA ASP 673.449CA GLU 68CACA GLY 69CA ALA 682.318CA VAL 70CA ALA 691.691CA ASP 71CA GLU 700.812CA PHE 72CA PRO 710.515CA ILE 73CA VAL 720.561CA ILE 74CA ILE 730.547CA LEU 75CA LEU 740.380CA ASN 76CA ASN 750.277CA PRO 77CA ALA 760.369CA ALA 78CA GLY 770.952CA ALA 79CA GLY 780.421CA PHE 80CA LEU 790.714CA THR 81CA THR 800.575CA HIS 82CA HIS 810.142CA THR 83CA THR 820.222CA SER 84CA SER 830.741CA VAL 85CA VAL 840.719CA ALA 86CA ALA 850.415CA LEU 87CA LEU 860.667CA ARG 88CA ARG 870.660CA ASP 89CA ASP 880.426CA ALA 90CA ALA 890.697CA LEU 91CA CYS 901.233CA LEU 92CA ALA 911.319CA ALA 93CA GLU 922.852CA VAL 94CA LEU 934.165CA SER 95CA SER 943.605CA ILE 96CA ALA 953.840CA PRO 97CA PRO 962.414CA PHE 98CA LEU 970.314CA ILE 99CA ILE 980.251CA GLU 100CA GLU 990.095CA VAL 101CA VAL 1000.131CA HIS 102CA HIS 1010.318CA ILE 103CA ILE 1020.117CA SER 104CA SER 1030.229CA ASN 105CA ASN 1040.203CA VAL 106CA VAL 1050.193CA HIS 107CA HIS 1060.499CA LYS 108CA ALA 1070.498CA ARG 109CA ARG 1080.292CA GLU 110CA GLU 1090.333CA PRO 111CA GLU 1100.377CA PHE 112CA PHE 1110.651CA ARG 113CA ARG 1120.611CA ARG 114CA ARG 1130.469CA HIS 115CA HIS 1140.467CA SER 116CA SER 1150.293CA TYR 117CA TYR 1160.483CA PHE 118CA LEU 1170.468CA SER 119CA SER 1180.367CA ASP 120CA PRO 1190.676CA VAL 121CA ILE 1200.445CA ALA 122CA ALA 1210.334CA VAL 123CA THR 1220.405CA GLY 124CA GLY 1230.372CA VAL 125CA VAL 1240.375CA ILE 126CA ILE 1250.250CA CYS 127CA VAL 1260.328CA GLY 128CA GLY 1270.332CA LEU 129CA LEU 1280.473CA GLY 130CA GLY 1290.272CA ALA 131CA ILE 1300.551CA THR 132CA GLN 1310.564CA GLY 133CA GLY 1320.289CA TYR 134CA TYR 1330.276CA ARG 135CA LEU 1340.476CA LEU 136CA LEU 1350.556CA ALA 137CA ALA 1360.677CA LEU 138CA LEU 1370.703CA GLU 139CA ARG 1380.861CA SER 140CA TYR 1390.876CA ALA 141CA LEU 1401.330CA LEU 142CA ALA 1411.529CA GLU 143CA GLU 1421.492CA GLN 144CA HIS 1431.738CA LEU 145CA VAL 1443.487


[0134] Table 9 shows that the two structures are remarkably similar overall despite their low level sequence identity. However, the structures exhibit very large deviations in some regions, and are highly conserved in others. In particular, this analysis reveals that regions of the enzyme are minimally affected by the large number of amino acid sequence substitutions. The regions of the protein structure which are not significantly affected by the amino acid substitutions are defined as structurally conserved regions. In the present example, the substitutions result in localized structural changes in the backbone of DHQase over a wide range, from 0.095-4.165 Å.


[0135] The distances between the strongly interacting atoms of the inhibitor to atoms of the homologous DHQase proteins, that is P. putida wt: qxa and M. tuberculosis wt: gt33 complexes, were computed and they are displayed in Tables 10 and 11, respectively.
7TABLE 10Distances between atoms of theinhibitor and atoms of the proteinDHQase P. putida wt: gxadistance [Å]C6-PRO11 (O)3.31C6-ASN12 (CB)3.10N7-TYR24 (OH)2.45O12-ASN76 (HD2)2.96O13-HIS102 (CB)3.38O12-SER104 (N)3.35C6-ASN12 (CB)3.10


[0136]

8





TABLE 11










Distances between atoms of the


inhibitor and atoms of the protein


DHQase M. tuberculosis wt: gt33









distance [Å]














N14-PRO11 (O)
3.35



O15-PRO11 (O)
3.01



O15-LEU 13 (CG)
3.36



O15-ARG 19 (NH1)
3.26



O15-ARG 19 (NH2)
3.32



O7-ASN 75 (OD1)
2.50



O13-ASN 75 (ND2)
2.98



N14-GLY 77 (CA)
3.01



O9-HIS 81 (NE2)
2.82



O7-HIS 101 (ND1)
3.25



O11-ILE 102 (N)
3.33



O13-ILE 102 (N)
2.77



O11-SER 103 (N)
2.96



O11-SER 103 (OG)
2.68



O9-ARG 112 (NH2)
3.09



O10-ARG 112 (NH2)
3.02











[0137] The methods of Examples 1-3 were applied to the DHQase data to identify a three dimensionally-conserved substructure of DHQase that is involved in the binding of DHQase inhibitors, in particular, to identify the relevant target substructure for developing broad spectrum inhibitors. This substructure is defined by the set of atomic coordinates (in orthogonal coordinates) provided in Table 12 and any equivalent set derived by applying arbitrary rotations and translations to the set of atomic coordinates in Table 12. The values of the coordinates (X,Y,Z) of the atoms defining the substructure are affected by a standard error σ. Therefore (X,Y,Z) values for each atom are those defined in the intervals (X−σ, X+σ) for coordinate X, (Y−σ, Y+σ) for coordinate Y, and (Z−σ, Z+σ) for coordinate Z.
9TABLE 12Three dimensionally-conserved substructure of DHQase, M. tuberculosisAtomX [Å]Y [Å]Z [Å]σ [Å]DescriptionSubstructure of the protein atomsOD126.26568.91221.2190.5Side chain carbonylASN75oxygen atom of aminoacid ASN 75ND227.33666.96020.9510.5Side chain nitrogenASN 75atom of amino acidASN 75NE228.34376.42522.1110.5Side chain nitrogenHIS 81atom of amino acidHIS 81ND128.07970.60423.6620.5Side chain nitrogenHIS 101atom of amino acidHIS 101N ILE31.22767.16722.1680.5Main chain amide102nitrogen atom ofatom acid ILE 102N SER33.75468.31521.5580.5Main chain amide103nitrogen atom ofamino acid Ser 103OG SER33.94671.05920.7350.5Side chain hydroxyl103oxygen atom of aminoacid SER 103Substructure of the inhibitorHydrogen29.60068.55420.2980.5Interacting with mainbondchain nitrogen atom ofacceptorILE 102 and side chainatomnitrogen atom of ASN75Hydrogen28.03170.73920.4220.5Interacting with sidebondchain oxygen atom ofdonor-ASN75 and side chainacceptornitrogen atom ofatomHIS 101Hydrogen29.66474.65820.4930.5Interacting side chainbondnitrogen atom of HISdonor-81acceptoratomHydrogen31.45169.83520.5310.5Interacting with mainbondchain nitrogen atomacceptorand side chain oxygenatomatom of SER 103



Other Embodiments

[0138] All publications and patent applications, and patents mentioned in this specification are herein incorporated by reference.


[0139] While the invention has been described in connection with specific embodiments, it will be understood that it is capable of further modifications. Therefore, this application is intended to cover any variations, uses, or adaptations of the invention that follow, in general, the principles of the invention, including departures from the present disclosure that come within known or customary practice within the art.


Claims
  • 1. A method for the structure-based design of a drug that can act as an inhibitor of at least two different biological entities, said method comprising the steps of: (a) providing at least one structure of a wild type target protein or an inhibitor-wild type target protein complex; (b) providing at least one structure of a variant target protein or an inhibitor-variant target protein complex; (c) comparing at least one structure from step (a) with at least one structure from step (b) to determine whether there exists a common three-dimensionally conserved substructure comprising the atomic coordinates of the structurally conserved atoms of the inhibitors and structurally conserved atoms of the target proteins; and (d) if a conserved substructure exists, using said atomic coordinates of said conserved substructure to select a compound having atoms matching those of said structurally conserved atoms of the inhibitors, wherein the selection of said compound is performed using computer modeling.
  • 2. A method for the structure-based drug design of a broad spectrum inhibitor, said method comprising the steps of: (a) providing at least one structure of a wild type target protein or an inhibitor-wild type target protein complex; (b) providing at least one structure of a variant target protein or an inhibitor-variant target protein complex; (c) comparing at least one structure from step (a) with at least one structure from step (b) to determine whether there exists a common three-dimensionally conserved substructure comprising the atomic coordinates of the structurally conserved atoms the target proteins or a common three-dimensionally conserved substructure comprising the atomic coordinates of the structurally conserved atoms of the inhibitors and structurally conserved atoms of the target proteins; and (d) if a conserved substructure exists, using said atomic coordinates of said conserved substructure to select a compound having atoms matching those of said structurally conserved atoms of the inhibitors or to design a compound that binds to said target protein, wherein the selection of said compound is performed using computer modeling.
  • 3. The method of claim 1, further comprising the steps of: (e) comparing at least one structure from step (a) with at least one structure from step (b) to determine whether there exists a three-dimensionally non-conserved substructure comprising the atomic coordinates of the structurally non-conserved atoms of the inhibitors and structurally non-conserved atoms of the target proteins; and (f) if a non-conserved substructure exists, using said atomic coordinates of said non-conserved substructure to reject a compound having atoms matching those of said structurally non-conserved atoms of the inhibitors, wherein the rejection of said compound is performed in conjunction with computer modeling.
  • 4. The method of claim 1, wherein at least two structures from step b are used in step c.
  • 5. The method of claim 4, wherein at least four structures from step b are used in step c.
  • 6. The method of claim 4, wherein said target proteins comprise at least two variant forms.
  • 7. The method of claim 6, wherein said target proteins comprise at least four variant forms
  • 8. The method of claim 1, wherein the inhibitors in said inhibitor-wild type target protein complex and said inhibitor-variant target protein complex are the same.
  • 9. The method of claim 1, wherein the inhibitors in said inhibitor-wild type target protein complex and said inhibitor-variant target protein complex are different.
  • 10. The method of claim 1, wherein said inhibitors are competitive inhibitors.
  • 11. The method of claim 1, wherein said inhibitors are noncompetitive inhibitors.
  • 12. The method of claim 1, wherein said inhibitors are reversible inhibitors.
  • 13. The method of claim 1, wherein said inhibitors are irreversible inhibitors.
  • 14. The method of claim 1, wherein said variant target protein is a homologous target protein.
  • 15. The method of claim 1, wherein said variant target protein is a mutant target protein.
  • 16. The method of claim 1, wherein at least one of said structures is a crystal structure.
  • 17. The method of claim 1, wherein at least one of said structures is an nmr structure.
  • 18. The method of claim 1, wherein at least one of said structures is derived using computational methods.
  • 19. The method of claim 1, wherein said target protein is expressed in a microbe and said microbe is selected from the group consisting of viruses, bacteria, protozoa, or fungi.
  • 20. The method of claim 1, wherein said target protein is expressed in a neoplasm.
  • 21. The method of claims 19 or 20, wherein said target protein is selected from the group consisting of an enzyme, a receptor, a structural protein, a component of a macromolecular complex, a component of a metabolic pathway, or an assembly of biological molecules.
  • 22. The method of claim 21, wherein said enzyme is selected from the group consisting of reverse transcriptases, proteases, DNA and RNA polymerases, methylases, oxidases, hydratases, esterases, acyl transferases, helicases, topoisomerases, and kinases.
  • 23. The method of claim 22, wherein said enzyme is HIV protease.
  • 24. The method of claim 23, wherein said inhibitors are selected from the group consisting of indinavir, nelfinavir, ritonavir, saquinavir, amprenavir, lopinavir, and UIC-94003.
  • 25. The method of claim 23, wherein said structurally conserved atoms of the inhibitor and structurally conserved atoms of the protease have the atomic structural coordinates as provided in Table 8.
  • 26. The method of claim 22, wherein said enzyme is 3-dehydroquinate dehydratase.
  • 27. The method of claim 26, wherein said structurally conserved atoms of the 3-dehydroquinate dehydratase have the atomic structural coordinates as provided in Table 12.
  • 28. A compound having a chemical structure selected using the method of claim 19, wherein said compound has broad spectrum activity against wild type and variant microbes.
  • 29. A compound having a chemical structure selected using the method of claim 20, wherein said compound has broad spectrum activity against wild type and variant neoplasms.
  • 30. The compound of claims 28 or 29, wherein said compound has an IC50, variant/IC50, wild type ratio of less than 20.
  • 31. The compound of claim 30, wherein said IC50, variant/IC50, wildtype ratio is less than 6.
  • 32. The compound of claims 28 or 29, wherein said compound has broad spectrum activity against at least 3 mutant biological entities.
  • 33. The compound of claim 28, wherein said compound has broad spectrum activity against at least 3 different organisms expressing homologous target proteins.
  • 34. A pharmaceutical composition comprising a compound of claim 28 and a pharmaceutically acceptable carrier or diluent.
  • 35. A pharmaceutical composition comprising a compound of claim 29 and a pharmaceutically acceptable carrier or diluent.
  • 36. A compound having a chemical structure selected using the method of any one of claims 23-25, wherein said compound has broad spectrum activity against HIV protease.
  • 37. The compound of claim 36, wherein said compound has an IC50, variant/IC50, wild type ratio of less than 10.
  • 38. The compound of claim 37, wherein said IC50, variant/IC50, wild type ratio is less than 6.
  • 39. The compound of claim 36, wherein said compound has broad spectrum activity against at least 3 mutant biological entities.
  • 40. A pharmaceutical composition comprising a compound of claim 36 and a pharmaceutically acceptable carrier or diluent.
  • 41. A compound having a chemical structure selected using the method of claims 26 or 27, wherein said compound has broad spectrum activity against 3-dehydroquinate dehydratase.
  • 42. The compound of claim 41, wherein said compound has an IC50, variant/IC50, wild type ratio of less than 20.
  • 43. The compound of claim 42, wherein said IC50, variant/IC50, wild type ratio is less than 10.
  • 44. The compound of claim 43, wherein said compound has broad spectrum activity against at least 3 mutant biological entities.
  • 45. The compound of claim 41, wherein said compound has broad spectrum activity against at least 3 different organisms expressing homologous target proteins.
  • 46. A pharmaceutical composition comprising a compound of claim 41 and a pharmaceutically acceptable carrier or diluent.
  • 47. A method of treating a microbial infection in a patient, said method comprising the step of administering to said patient a pharmaceutical composition of claim 34 in an amount effective to prevent or treat said infection.
  • 48. A method of treating a neoplasm in a patient in need thereof, said method comprising the step of administering to said patient a pharmaceutical composition of claim 35 in amounts effective to treat said neoplasm.
  • 49. A method of treating an HIV infection in a patient in need thereof, said method comprising the step of administering to said patient a pharmaceutical composition of claim 40 in amounts effective to treat said infection.
  • 50. A method of treating a bacterial infection in a patient in need thereof, said method comprising the step of administering to said patient a pharmaceutical composition of claim 46 in amounts effective to treat said infection.
  • 51. The method of claim 50, wherein said bacterial infection is caused by a bacterium selected from the group consisting of C jejuni, V. cholerae, Y pestis, B. anthracis, P. putidas, and M. tuberculosis.
CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] The application claims benefit of U.S. Provisional Application No. 0/344,788, filed Jan. 7, 2002, and No. 60/383,575, filed May 29, 2002, each of which is hereby incorporated by reference.

Provisional Applications (2)
Number Date Country
60344788 Jan 2002 US
60383575 May 2002 US