The instant invention relates generally to the field of isolated multiprotein assemblies. In various aspects, these multiprotein assemblies are implicated in one or more disease state and, as such, represent a druggable target for therapeutic agents. In various aspects, the isolated multiprotein assemblies are components of assays for determining whether a candidate chemotherapeutic is useful in preventing, curing or retarding the advancement of a disease. More particularly, this invention relates to multiprotein assemblies, compositions incorporating such assemblies and methods of using the assemblies in identifying novel therapeutics effective against various diseases. Still further, the present invention relates to screening assays using a multiprotein assembly-ribosome complex for the screening and identification of novel therapeutics.
Protein synthesis is carried out by an elaborate translation complex, which is composed of a ribosome, accessory protein factors as well as mRNA and charged tRNA molecules. Like DNA and RNA synthesis, protein synthesis can be divided into initiation, chain elongation and termination stages. Initiation involves the assembly of the translation complex at the start codon in the mRNA. During polypeptide-chain elongation, the ribosome and associated components move in the 5′ to 3′ direction along the template mRNA. The polypeptide is synthesized from the N-terminus to the C-terminus. Finally, when synthesis of the protein is complete, the translation complex disassembles in a separate termination step. An important part of this disassembly is the release of the ribosome from the mRNA, which is signaled by a stop codon.
Catalysis of peptide bond formation requires the precise juxtaposition by the ribosome of the acceptor ends of the amino acid-charged tRNA's bound in the peptidyl site (i.e., P site) and aminoacyl site (i.e., A site) of its “active site”. This activity represents the essential enzymatic activity of the ribosome and is referred to as the “peptidyl transferase activity,” an integral component of the large subunit of all ribosomes characterized to date. Studies of bacterial ribosomes have identified the essential active site constituents of the peptidyl transferase activity as a few ribosomal protein subunits and the 23S rRNA. As the integrity of the latter is essential for enzymatic activity, it is assumed that it plays a direct role in the catalysis of peptide bond formation acting as a so-called ribozyme.
Many diseases involve foreign or aberrant host proteins, for example, viral infections involve the synthesis of viral proteins, e.g., capsid proteins. A variety of agents are presently used to combat viral infection. These agents include interferon, which is a naturally-occurring protein having some efficacy in combat of certain selected viral diseases. In addition, agents such as AZT are used in the combat of an immunodeficiency disease, referred to commonly as AIDS, caused by the virus HIV-1.
Given the large number of drugs available for treating infections caused by more complex organisms such as bacteria, it is remarkable how few drugs are available for treating the relatively simple organisms known as viruses. Indeed, most viral diseases remain essentially untreatable. The development of new antiviral chemotherapeutics is resource- and time-intensive. The difficulties encountered in drug treatment of most infections pale when compared to viral infections. For example, it is at least theoretically (and often in practice) possible to attack a bacterium without harming the host. Unlike bacteria however, viruses replicate inside cells and utilize cellular machinery of the host for replication. As a result, development of antiviral therapeutics often represents a compromise between preferable killing, or at least arresting replication of, the virus, and not harming the host, or at worst, doing only minimal damage which can be justified by the potential gain (Drug and Market Development, Vol 3. No. 9, pp. 174-180 (Feb. 15, 1993)).
It is now generally recognized that an important challenge for small molecule drug discovery is the identification of novel druggable targets (Hopkins A-L, Groom C-R (2002) The druggable genome. Nat Rev Drug Discov. 1:727-730). Conventional targets appear to have largely been exhausted, and it can be argued that various highly anticipated methods in recent years have disappointed, in that many of the targets they are identifying are of questionable druggability (Goff S-P (2008) Knockdown screens to knockout HIV-1. Cell 135:417-420). How then does one find the likely highly unconventional novel druggable targets of the future?
The present invention provides an approach orthogonal to conventional drug discovery. The method provides compositions and methods for investigating protein-protein interactions in disease states as diverse as viral and bacterial infections, cancer, degenerative neurological disease and psychiatric disorders. In various embodiments, the invention provides multiprotein assemblies implicated in one or more disease state, which assemblies occur during protein biogenesis and maturation, as a novel starting point for identification of therapeutic small molecules. These protein-protein interactions are critical for the later function of proteins and, counter-intuitively, subtle disruption of a subset of these early interactions is sufficient to functionally impact later events in assembly of proteins, in a substrate selective manner.
Thus, in an exemplary embodiment, the invention provides an isolated multiprotein assembly, comprising two or more host proteins interacting to form said assembly, wherein said assembly is implicated in a mammalian disease state. In various embodiments, the invention provides an isolated multiprotein complex comprising two or more host proteins interacting with each other or with an organic ligand. The organic ligand binds to a recognition site for the organic ligand on at least one of the proteins of the multiprotein assembly. In an exemplary embodiment, the organic ligand interacts with a recognition site for the ligand on two or more of the proteins of the multiprotein complex. In various embodiments, the interaction of the two or more proteins leads to the formation of a recognition site for the ligand located within or proximate to the surfaces of the proteins which are interacting to form the multiprotein assembly. In various embodiments, the organic ligand is immobilized on a solid support and the interaction between the organic ligand and at least one of the two or more proteins of the multiprotein assembly results in the immobilization of the multiprotein assembly on the solid support through the attractive interaction between the at least one protein and the immobilized ligand. In various embodiments the surfaces of the two or more proteins interacting with the ligand or with another protein are allosteric surfaces and are not surfaces recognized in the art as corresponding to an active site of the particular protein.
The compositions and methods of the invention allow the identification of therapeutics that function in a novel manner, targeting host multiprotein assemblies that are highly unconventional drug targets. Remarkably, these host-targeted compounds have robust efficacy against proteins implicated in disease states (e.g., viral capsid proteins) at concentrations avoiding significant toxicity to cultured mammalian cells. Moreover, these compounds display improved selectivity indices (toxicity/efficacy) with structure-activity relationship (SAR) optimization. The present invention provides a new approach to therapeutic agents that function by disrupting protein-protein interactions generally, and those implicated in viral diseases in particular.
In an exemplary embodiment, the invention provides a method of discovering a drug that is therapeutically active to modulate a disease state. In this embodiment, many highly druggable targets can be identified. These targets have not been identified earlier because conventional drug discovery typically starts with target identification. The drug can function to mitigate a disease or to prevent the progression of the disease. The isolated multiprotein complexes of the invention are versatile tools of use in the drug discovery methods of the invention. Quite surprisingly, the isolated multiprotein complexes can be used to probe the therapeutic efficacy of a library of compounds of diverse structure (e.g., obeying Lipinski's Rule of Five), without foreknowledge of the structure of the target for the drug. In various embodiments, a lead organic ligand including within its structure a putative pharmacophore with therapeutic activity against a disease of interest is identified with a cell free protein synthesis system (“CFPS”) (e.g., wheat germ extract). In an exemplary embodiment, the CFPS includes a pathway in which at least a portion of a disease-related pathway In various embodiments a library of organic ligands is contacted with the CFPS under conditions appropriate for the system to synthesize proteins implicated in the disease (e.g., progression of the disease). An exemplary screen according to this embodiment is performed in a manner amenable to deconvolution and identification of organic ligands having a desired effect on protein synthesis (e.g., a multi-well format).
In various embodiments, the multiprotein assembly includes at least one, at least two, at least three, or at least four or more host proteins within the assembly. In an exemplary embodiment, the multiprotein assembly has a catalytic role in the manufacture of a protein complex related to disease progression (e.g., viral capsid assembly).
An exemplary CFPS includes a detectable element (e.g., green-, red-fluorescent protein co-expressed with disease-related proteins) that indicates the status of protein synthesis in the CFPS in contact with an organic ligand. For example, in one embodiment, the detectable element is a detectable protein. Decrease of synthesis of this protein in a CFPS in a well including an organic ligand is indicative that the organic ligand is decreasing the synthesis of the detectable protein and the synthesis of the disease-related protein.
In an exemplary embodiment, at least one disease-related protein product synthesized in the CFPS is detected. An exemplary mode of detection of the protein product includes capture of the at least one product by an anti-protein product antibody, which can be immobilized on a solid support (e.g., a plate). The immobilized at least one product is then detected. In various embodiments detection utilizes a detectable label. In various embodiments, the detectable label is an antibody against the at least one product that includes a fluorescent or profluorescent label.
In an exemplary embodiment, the at least one disease related protein is a protein that undergoes an assembly (e.g., mulitmerization) event that is implicated in disease progression, e.g., assembly of a viral capsid, aggregation of proteins in a tauopathy, etc. In an exemplary embodiment, the therapeutic efficacy of the organic ligand is in its ability to prevent the assembly of the disease-related protein into the structure implicated in the disease. In an exemplary embodiment, this prevention or diminishment of assembly is detectable because the number of epitopes available on a single protein molecule or an incompletely assembled multimeric structure, which are available for binding by a detectably labeled anti-disease-related protein antibody is fewer than the number of such epitopes available on the fully assembled multimeric assembly. In this embodiment, the amount of detectable label bound to the single disease-related protein or the partially assembled multimer is less, which results in a diminution of detectable signal when compared to the signal from the fully assembled multimer.
In an exemplary embodiment, organic ligands exhibiting therapeutic activity in the CFPS are advanced and are tested against a model of the disease state of interest. Thus, for example, in some embodiments, lead organic ligands identified in the CFPS are tested in an infectious virus screen, a bacteria screen, or another assay for probing the effects of the lead organic ligand on one or more events implicated in disease progression.
In various embodiments, the lead organic ligand is immobilized on a solid support and is used as an affinity ligand to isolate the multiprotein assembly from either the CFPS or the screen for the integrity of disease related events (e.g., capsid assembly in an infectious virus screen). Thus, in various embodiments, the invention provides a method of isolating the multiprotein assembly that comprises contacting the multiprotein assembly with the immobilized organic ligand under conditions appropriate to at least transiently immobilize the multiprotein assembly on the immobilized organic ligand. In an exemplary embodiment, the assembly is immobilized for a time sufficient to allow essentially all proteins not immobilized on the solid support bound ligand to be washed away from the immobilized multiprotein assembly.
In various embodiments, the organic ligand is not a protein, e.g., a protein that binds to one or more of the proteins of the multiprotein assembly. In a preferred embodiment, the organic ligand is not a protein that selectively or specifically binds to a protein component of a multiprotein assembly. In an exemplary embodiment, the organic ligand is not an antibody or a fragment of an antibody, which binds to a protein or more than one protein of a multiprotein assembly. In various embodiments, the organic ligand does not include two or more naturally occurring amino acids linked through a peptide bond. In an exemplary embodiment, the organic ligand does not include a naturally ocurring amino acid structure. In various embodiments, the organic ligand is a small organic molecule with a molecular weight of less than about 2,000 Daltons, e.g., less than about 1500 Dalton, e.g., less than about 1,000 Daltons. In various embodiments, the organic ligand has no appreciable binding to an active site of a protein of a multiprotein assembly. In an exemplary emobdiment, the principal binding site for the organic ligand is located at an allosteric site of a protein or at an interface formed between two or more allosteric regions or two or more proteins of the multiprotein assembly. Exemplary organic ligands of use in the present invention include those disclosed in copending, commonly-owned U.S. patent application Ser. No. 13/099,006, filed on May 2, 2011.
In an exemplary embodiment, the multiprotein assembly is washed from the solid support after removing the non-immobilized proteins from the CFPS or other screen. Any method that leads to the elution of the assembly from the solid support is useful. An exemplary method involves elution of the assembly from the solid support by washing the solid support with several equivalents of the unbound analog of the immobilized organic ligand. The organic ligand can then be dialyzed away from the proteins to provide a mixture comprising proteins that form the multiprotein assembly implicated in disease progression.
The relevance of the isolated proteins to disease progression can be readily confirmed. For example, in one embodiment, the ability of the mixture eluted from the solid support to which the multiprotein assembly is bound can be submitted to an assay to determine whether the mixture depleted in the constituents of the multiprotein assembly can execute the step implicated in disease progression without the proteins that were removed by the organic ligand bound to the solid support. In an exemplary embodiment, the depleted mixture does not execute the disease-related event. In another exemplary embodiment, when the proteins of the multiprotein assembly are reintroduced to the depleted mixture, activity in the disease related event is reconstituted, thereby verfiying the essential nature of the multiprotein assembly in disease progression.
In an exemplary embodiment, the invention provides a complex between a messenger RNA (m-RNA) sequence and a ribosome. In various embodiments, the m-RNA is a truncated m-RNA, truncated in the sense that it does not include a stop codon at its 3′-terminus. In an exemplary embodiment, the truncated m-RNA is complexed to the P-site of the ribosome. In various embodiments, the A-site of the ribosome is essentially free of complexed truncated m-RNA. In various embodiments, one or both of the ribosome, and the m-RNA is associated with the isolated multiprotein assembly. As used herein, an exemplary “isolated multiprotein assembly” includes an assembly associated with a ribosome and/or a m-RNA complexed to a ribosome. In this embodiment, the isolated multiprotein assembly (complexed with the ribosome and/or m-RNA) is otherwise “isolated” as that term is defined herein. In various embodiments, the “isolated multiprotein complex is associated with an organic ligand, which, in some embodiments, is immobilized on a solid support.
In an exemplary embodiment, the invention provides a method of assaying a candidate compound for its ability to disrupt the assembling proteins (e.g., non-viral proteins) in a multiprotein assembly by querying an isolated multiprotein assembly. In various embodiments, the invention provides a method of assaying a candidate compound for its ability to affect the function of proteins (e.g., non-viral proteins) in a multiprotein assembly by querying an isolated multiprotein assembly. In various embodiments, the multiprotein assembly is queried using a candidate compound with the ability to disrupt protein assembly or function. In selected embodiments, the multiprotein assembly is implicated in the origin or advancement of one or more disease, e.g., viral infections, bacterial infections, central nervous system disorders, psychiatric disorders, metabolic disorders, oncologic disorders, or immunologic disorders. In an exemplary embodiment, the methods assemble a multiprotein assembly of bacterial or parasitic proteins which is a novel target for drug development and determine the ability of a candidated compound to disrupt the assembly or function of the assembly.
In various embodiments, there is provided a method of testing whether a candidate compound modulates the assembly of or function of a multiprotein assembly. An exemplary method includes introducing the compound to an isolated multiprotein assembly of the invention, and determining whether the assembly of or function of the multiprotein assembly is modulated. In various embodiments, disruption of the assembly process is determined by detection of one or more smaller assemblies or individual proteins emerging from the isolated multiprotein assembly. In a further exemplary embodiment, the multiprotein assembly is isolated through its association with an organic ligand binding to the assembly. The organic ligand is immobilized on a solid support. The isolated multiprotein assembly is eluted off of the organic ligand and off of the solid support and, in some embodiments, purified from the organic ligand by chromatography, dialysis and/or another art-recognized method. The multiprotein assembly is introduced into a translation system depleted in the proteins of the multiprotein assembly, which does not execute the disease-related event. If, upon reconstitution of the translation system by addition of the proteins of the multiprotein assembly oringally isolated by their binding to the organic ligand, the translation system is able to execute the disease-related event, this is an indication that the candidate compound (e.g., the organic ligand) modulated the formation or activity of a disease-related multiprotein assembly.
The invention also provides a method of testing whether a candidate compound modulates viral capsid assembly due to an effect on host proteins or on the viral capsid proteins themselves. The method includes introducing the compound to an isolated multiprotein assembly of the invention and determining whether the isolated multiprotein assembly, viral capsid (or other protein) assembly is modulated. An exemplary end point for determining the modulation is whether complete viral capsids are formed and/or whether complete viral capsids are formed in the same amounts as are formed in the system in the absence of the candidate compound.
These and additional objects, advantages and embodiments of the invention are described in the detailed description that follows.
ATP binding cassette family E1 (ABCE1); Cell-free protein synthesis (cfps); Dimethylsulfoxide (DMSO); Direct fluorescent antibody assay (DFA); ER (endoplasmic reticulum); Horseradish peroxidase (HRP); Human immunodeficiency virus (HIV); Hepatitis B virus (HBV); Hepatitis C virus (HCV); Immobilized apyrase (iapy); Influenza virus (FLUV); Polyacrylamide gel electrophoresis in sodium dodecyl sulfate (SDS-PAGE); RABV (rabies virus); Brain post mitochondrial supernatant (brpmis); Signal Recognition Particle (SRP); Structure activity relationship (SAR); Sucrose step gradient (ssg); and Western blot (WB).
Unless otherwise noted, the technical terms used herein are according to conventional usage as understood by persons skilled in the art. Definitions of common terms in molecular biology may be found in standard texts (e.g. Benjamin Lewin, Genes V, published by Oxford University Press, 1994 (ISBN 0-19854287-9); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd, 1994 (ISBN 0-632-02182-9); and Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8)).
A “multiprotein assembly”, as used herein refers to a functional assembly of proteins, which participates in the assembly of a “multiprotein structure” implicated in a disease (e.g., a viral capsid”) or in the folding of a protein which is misfolded (e.g., amyloid protein) or in the mislocation of a protein. The “multiprotein assembly” is a functional component of a disease state. Exemplary protein components of a multiprotein assembly include, without limitation, ABCE1, Macrophage migration inhibitory factor (MIF), Glutathione transferase, SAM Synthetase, Peroxiredoxin-2 (PRX-2) and Cyclophilin A. In various embodiments two or more of these proteins are independently, and in any combination, incorporated into a multiprotein assembly.
A “multiprotein structure” is a structure that is present in a disease state, for example a viral capsid or a protein aggregate (e.g., beta-amyloid plaque) but is not otherwise present in a host or is present at a level that does not cause clinical disease.
Translation system: The term “translation system” refers to the components necessary to incorporate a naturally occurring amino acid into a growing polypeptide chain (protein). Components of a translation system can include, e.g., ribosomes, tRNAs, synthetases, mRNA and the like. The components of the present invention can be added to a translation system, in vivo or in vitro. A translation system can be a cell, either prokaryotic, e.g., an E. coli cell, or eukaryotic, e.g., a yeast, mammalian, plant, or insect cell. The cells of the translation system can be non-diseased (“normal”) or diseased. Similarly, the cells can be stressed in some manner, e.g., oxidatively or environmentally.
The term “messenger RNA (mRNA)” refers to the RNA that is without introns and that can be translated into protein by the cell. “cDNA” refers to a double-stranded DNA that is complementary to and derived from mRNA. “Sense” RNA refers to RNA transcript that includes the mRNA. “Antisense RNA” refers to a RNA transcript that is complementary to all or part of a target primary transcript or mRNA and that blocks the expression of a target gene by interfering with the processing, transport and/or translation of its primary transcript or mRNA. The complementarity of an antisense RNA may be with any part of the specific gene transcript, i.e., at the 5′ non-coding sequence, 3′ non-coding sequence, introns, or the coding sequence. In addition, as used herein, antisense RNA may contain regions of ribozyme sequences that increase the efficacy of antisense RNA to block gene expression.
The term “stop (or “termination”) codon” refers to a unit of three adjacent nucleotides in a polynucleotide coding sequence that specifies translational termination of protein synthesis (i.e., mRNA translation) by the ribosomal complex.
The phrase “cell-free translation system,” as used herein, refers to any type of system capable of synthesizing proteins in vitro in the absence of viable cells. An exemplary system is a cell-free protein synthesis system derived from wheat germ extract.
The term “expressing” and “expression,” as used herein, refer to the production of a protein, peptide, or nucleotide sequence, and include transcription into an RNA product, post-transcriptional modification and/or translation into a protein product or polypeptide from a DNA encoding that product, as well as possible post-translational modifications.
The terms “polypeptide” or “peptide” or “protein” are used interchangeably herein, to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers and non-naturally occurring amino acid polymer. Macromolecular structures such as polypeptide structures can be described in terms of various levels of organization. For a general discussion of this organization (see, e.g., Alberts et al., Molecular Biology of the Cell (3rd ed., 1994) and Cantor and Schimmel, Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980)). “Primary structure” refers to the amino acid sequence of a particular peptide. “Secondary structure” refers to locally ordered, three dimensional structures within a polypeptide. These structures are commonly known as domains, e.g., enzymatic domains, extracellular domains, transmembrane domains, pore domains, and cytoplasmic tail domains. Domains are portions of a polypeptide that form a compact unit of the polypeptide and are typically 15 to 350 amino acids long. Exemplary domains include domains with enzymatic or other functional activity. Typical domains are made up of sections of lesser organization such as stretches of 3-sheet and a-helices. “Tertiary structure” refers to the complete three dimensional structure of a polypeptide monomer. “Quaternary structure” refers to the three dimensional structure formed by the noncovalent association of independent tertiary units.
A “protein implicated in disease,” or “target protein,” as used herein, are interchangeable and refer to a whole protein molecule, including but not limited to, viral capsid proteins, or a portion thereof, i.e., cytoplasmic domain or other domain of a protein. Also included are aberrant host proteins, e.g., misfolded (e.g., “conformational disease”) or mislocated proteins. Proteins implicated in disease include those implicated in neurological disorders, cancer and pathological infections. In an exemplary embodiment, the protein is a protein that is not implicated in a viral disease.
A “disease-related event” refers to a dysfunction in a host cell implicating one or more multiprotein assemblies in which two or more host proteins are assembled into a multiprotein assembly. Exemplary disease related events include, without limitation, assembly of viral capsid proteins into an intact capsid, and misfolding and/or aggregation of proteins implicated in neurological or neurodegenerateive disease or disorder (e.g.,a tauopathy), e.g., Alzheimer's disease, Amyotrophic lateral sclerosis, schizophrenia (DISC 1), etc. In an exemplary embodiment, the disease related event is not an event related to viral infection or replication.
The term “virus” or “viral,” as used herein, refers to minute infectious agents, which, with certain exceptions, are not observable by light microscopy, lack independent metabolism, and are able to replicate only within a living host cell. Some exceptions include, but are not limited to, the cell-free translation system described herein. These individual particles (i.e., for example, virions) typically comprise nucleic acid and a protein shell or coat; some virions also have a lipid containing membrane. The term virus encompasses all types of viruses, including animal, plant, phage, and other viruses. In an exemplary embodiment, the virus is a pathogen detrimental to aquaculture, e.g., Taura Syndrome Virus (TSV), Yellow Head Virus (YHV) and White Spot Syndrome Baculovirus (WSBV). Shrimp are also prone to diseases caused by bacteria such as Vibrio parahaemolyticus.
The term “viral capsid” or “capsid,” as used herein, refers to the protein coat that surrounds the viral nucleic acid. Viral capsids have interior surfaces and exterior surfaces. The interior surface of a viral capsid is the surface that is normally exposed to the viral nucleic acid. The exterior surface of a viral capsid is the surface that is generally exposed to the environment. The phrase “viral capsid assembly” refers to the process of arranging viral capsid proteins in a manner sufficient to generate a viral capsid.
The term “capsid interacting protein,” as used herein, refers to protein that interacts with a viral capsid either during or after its assembly. The capsid interacting protein may be endogenous to a virus, may be exogenously added, or present in the cell-free extract. Capsid interacting proteins can include, but are not limited to, capsid chaperones and proteins that have catalytic actions favoring capsid formation.
Neurological diseases and disorders, as used herein, are used in the broadest sense and includes neurodegenerative diseases and disorders. As defined herein, a neurodegenerative disease or disorder may be characterized by the manifestation of gross physical dysfunction, not otherwise determinable as having emotional or psychiatric origins, typically resulting from progressive and irreversible loss of neurons. Such neurodegenerative diseases and disorders are defined in The Diagnostic and Statistical Manual of Mental Disorders-IV (DSM-IV) (American Psychiatric Association (1995)) and include, but are not limited to, Primary Lateral Sclerosis (PLS), Progressive Muscular Atrophy (PMA), Amyotrophic Lateral Sclerosis (ALS), Alzheimer's disease, Pick's disease, Huntington's disease, and Parkinson's disease. Of particular interest in the present invention are those diseases or disorders resulting from an alteration of normal SMN-associated processes including, but not limited to, SMA1 (Spinal Muscular Atrophy I, Werdnig-Hoffmann Disease, Infantile Muscular Atrophy), SMA2 (Spinal Muscular Atrophy II, Spinal Muscular Atrophy, Mild Child and Adolescent Form), SMA3 (Spinal Muscular Atrophy III, Juvenile Spinal Muscular Atrophy, Kugelberg-Welander Disease), and SMA4 (Spinal Muscular Atrophy IV).
A psychiatric disease or disorder, as used herein, may be characterized as one which is of emotional or psychiatric origin and is typically not associated with a loss of neurons. Exemplary psychiatric diseases and disorders include, but are not limited to, eating disorders, such as anorexia nervosa, bulimia nervosa, and atypical eating disorder; mood disorders, such as recurrent depressive disorder, bipolar affective disorder, persistent affective disorder, and secondary mood disorder; drug dependency such as alcoholism; neuroses, including anxiety, obsessional disorder, somatoform disorder, and dissociative disorder; grief; post-partum depression; psychosis such as hallucinations and delusions; dementia; paranoia; Tourette's syndrome; attention deficit disorder; psychosexual disorders, schizophrenia; and sleeping disorders. An exemplary protein associated with psychiatric disease is DISC1, which is implicated in schizophrenia.
An exemplary disease which can be investigated using the methods and compositions of the invention and for which new therapeutic agents can be identified is an affective disorder. These are disorders of mood, causing recurrent depression and/or recurrent episodes of mood elevation, resulting in mania or hypomania. Current treatment regimens include the use of lithium carbonate, carbamazepine, or anti-psychotic medication. Proteins such as inflammatory cytokines are involved in the regulation of sleep and mood.
An exemplary disease which can be investigated using the methods and compositions of the invention and for which new therapeutic agents can be identified is depression. Clinical depression is characterized by depressed mood, often accompanied by additional clinical manifestations, such as sleep disturbance, weight loss, loss of appetite, apathy, anhedonia, and when severe, can be associated with suicidal ideation. It is currently treated, when indicated, with antidepressant medication, most commonly selective serotonin reuptake inhibitors (SSRIs) or tricyclic antidepressants. Post-partum depression can be especially serious, occurring after childbirth. Depression, even when treated, is associated with an increased suicide risk. These patients have a disturbance in cytokine patterns.
A “neurodegenerative disease”, as used in the current context, is readily understood by one of ordinary skill in the art to include any abnormal physical or mental behavior or experience where the death or dysfunction of neuronal cells is involved in the etiology of the disorder, or is affected by the disorder. As used herein, neurodegenerative diseases encompass disorders affecting the central and peripheral nervous systems, and include such afflictions as memory loss, stroke, dementia, personality disorders, gradual, permanent or episodic loss of muscle control. Examples of neurodegenerative diseases for which the current invention can be used preferably include, but are not limited to, Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Dementia with Lewy Body, amyotrophic lateral sclerosis, epilepsy, myasthenia gravis, neuropathy, ataxia, dementia, chronic axonal neuropathy and stroke.
As used herein “Parkinson's disease” or “PD” refer to a condition of disturbance of voluntary movement in which muscles become stiff and sluggish, movement becomes clumsy and difficult and uncontrollable rhythmic twitching of groups of muscles produces characteristic shaking or tremor. The condition is believed to be caused by a degeneration of pre-synaptic dopaminergic neurons in the brain. The absence of adequate release of the chemical transmitter dopamine during neuronal activity thereby leads to the Parkinsonian symptomotology.
As used herein “Alzheimer's disease” or “AD” refers to a condition characterized by the abnormal deposition of amyloid in the brain of a patient in the form of extra-cellular plaques and intra-cellular neurofibrillary tangles. The rate of amyloid accumulation is a combination of the rates of formation, aggregation and egress from the brain. It is generally accepted that the main constituent of amyloid plaques is the 4 kD amyloid protein (beta-A4, also referred to as A-beta, beta-protein and beta-AP) which is a proteolytic product of a precursor protein of much larger size. The symptoms of Alzheimer's disease are similar to those of other dementias. They include memory loss, changes in personality, problems using language, disorientation, difficulty doing daily activities, and disruptive behavior.
As used herein “dementia with Lewy body” or “DLB” refers to a condition characterized by widespread neurodegeneration with formation of Lewy bodies not only in the dopaminergic system but also in other brain regions. The major symptoms of DLB are fluctuating cognition, visual hallucinations and parkinsonian signs. This is a disease considered by some as a collision between AD and PD; its clinical diagnosis is extremely challenging.
The term “parasite” refers to any parasite including, without limitation, Plasmodium falciparum, Plasmodium vivax, Plasmodium malariae, Plasmodium ovale, Trypanosoma cruzi or a parasite of the Leishmania genus such as, for example, Leishmania donovani.
The term “components,” as used herein, refers to constituents of the cell-free translation system necessary to incorporate a naturally occurring or non-natural amino acid into a growing polypeptide chain. For example, components can include, but are not limited to, buffers, amino acids, nucleic acid transcripts, ATP, GTP, creatine phosphate, labeled amino acids, myristoyl CoA lithium salts, RNase inhibitors, creatine kinases, and tRNAs. Components are described in greater detail herein.
The phrase “detectable moiety” or “conjugate” refers to any atom, molecule or a portion thereof, the presence, absence or level of which is directly or indirectly monitorable. A variety of detectable moieties are well known to those skilled in the art, and can be any material detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. Such detectable labels can include, but are not limited to, magnetic beads, fluorescent dyes, radiolabels, enzymes, and colorimetric labels such as colloidal gold or colored glass or plastic beads. In various embodiments a detectable moiety is conjugated to a protein implicated in a disease process. An exemplary detectable moiety is a fluorescent protein, e.g., green fluorescent protein. Other detectable moieties include those conjugated to an antibody, e.g., an antibody that reacts selectively with a protein expressed in a disease state. In an exemplary embodiment, a dectectable moiety conjugated to an antibody, or a fragment thereof, binding selectively to one or more protein of the multiprotein assembly is used to detect or identify the one or more protein, however, the antibody is not used to “capture” the multiprotein assembly or to immobilize it on a solid support in the manner and at the stage that the organic ligand is used to immobilize it on a solid support.
The term “amino acid” refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code. Amino acids can be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Nucleotides, likewise, can be referred to by their commonly accepted single-letter codes. Amino acid substitutions, deletions or additions to individual or a small percentage of amino acids in the encoded sequence is a conservatively modified variant, where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the invention. The following eight groups each contain amino acids that are conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins (1984)). Amino acids can also include one or more radioactive isotopes, e.g., 35S methionine.
The term “nucleic acid” refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form, and complements thereof. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions can be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). The term nucleic acid is used interchangeably with gene, cDNA, mRNA, oligonucleotide, and polynucleotide. A particular nucleotide sequence also implicitly encompasses “splice variants,” which as the name suggests, are products of alternative splicing of a gene. After transcription, an initial nucleic acid transcript can be spliced such that different (alternate) nucleic acid splice products encode different polypeptides. Mechanisms for the production of splice variants vary, but include alternate splicing of exons. Alternate polypeptides derived from the same nucleic acid by read-through transcription are also encompassed by this definition. Any products of a splicing reaction, including recombinant forms of the splice products, are included in this definition.
The phrase “in vitro transcription reaction,” as used herein, refers to a transcription reaction that takes place in a cell-free environment using largely purified components, for example, purified DNA template and purified DNA-dependent RNA polymerase.
The term “modulates” or “modulated,” as used herein, means to interact with a target protein or multiprotein assembly either directly or indirectly so as to alter the activity of the target protein (or assembly), including, for example, to inhibit the activity of the target protein (or assembly), or to limit or reduce the activity of the target protein (or assembly). Accordingly, the phrase “modulates a cellular function” means to alter the function of a way, which can include, but is not limited to, inhibition of protein synthesis or inhibition of protein assembly into molecular structures such as viral capsids. Exemplary candidate compounds of the invention modulate the formation or activity of a multiprotein assembly. In various embodiments, the assembly is an assembly of host proteins and is implicated in a disease-related event. In various embodiments, the candidate compound modulates the formation or activity of the multiprotein assembly by interacting with one or more protein of the assembly at a site other than an art-recognized active site of the one or more proteins.
The term “candidate compound” or “compound” or “drug candidate” or “modulator” or “organic ligand” or grammatical equivalents, as used herein, describes any molecule, either naturally occurring or synthetic, e.g., protein, oligopeptide (e.g., from about 5 to about 25 amino acids in length, preferably from about 10 to 20 or 12 to 18 amino acids in length, preferably 12, 15, or 18 amino acids in length), small organic molecule, polysaccharide, lipid, fatty acid, polynucleotide, oligonucleotide, etc., to be tested for the capacity to directly or indirectly modulate assembly or function of a multiprotein assembly. The candidate compound can be in the form of a library of candidate compounds, such as a combinatorial or randomized library that provides a sufficient range of diversity. Test compounds are optionally linked to a fusion partner, e.g., targeting compounds, rescue compounds, dimerization compounds, stabilizing compounds, addressable compounds, and other functional moieties. Conventionally, new chemical entities with useful properties are generated by identifying a test compound (called a “lead compound” or “candidate”) with some desirable property or activity, e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property and activity of those variant compounds. Often, high throughput screening (HTS) methods are employed for such an analysis. Compounds can be inhibitors, activators, or modulators of, for example, of the assembly or function of a multiprotein assembly. Inhibitors are compounds that, e.g., bind to, partially or totally block assembly or action by activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate activity or expression of, for example, nucleic acids or polypeptides derived from the cell-free system described herein, e.g., antagonists. Activators are compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or up regulate, for example, nucleic acids, polypeptides or multiprotein assemblies derived from the cell-free system described herein, e.g., agonists. Inhibitors, activators, or modulators also include genetically modified versions of the proteins derived from the cell-free system, e.g., versions with altered activity, as well as naturally occurring and synthetic ligands, substrates, antagonists, agonists, antibodies, peptides, cyclic peptides, nucleic acids, antisense molecules, ribozymes, or small chemical molecules, for example.
The phrases “small organic molecule” and “organic ligand”, are used interchangeably and refer to a candidate compound which is an organic molecule, either naturally occurring or synthetic, that has a molecular weight of from about 50 to about 2500 daltons, e.g., less than about 2000 daltons, e.g., between about 100 and about 1000 daltons, e.g., between about 200 and about 500 daltons. In various embodiments, the organic ligand is not a protein, e.g., a protein that binds to one or more of the proteins of the multiprotein assembly. In a preferred embodiment, the organic ligand is not a protein that selectively or specifically binds to a protein component of a multiprotein assembly. In an exemplary embodiment, the organic ligand is not an antibody or a fragment of an antibody, which binds to a protein or more than one protein of a multiprotein assembly. In various embodiments, the organic ligand does not include two or more naturally occurring amino acids linked through a peptide bond. In an exemplary embodiment, the organic ligand does not include a naturally occurring amino acid structure. In various embodiments, the organic ligand is a small organic molecule with a molecular weight of less than about 2,000 Daltons, e.g., less than about 1500 Dalton, e.g., less than about 1,000 Daltons. In various embodiments, the organic ligand has no appreciable binding to an active site of a protein of a multiprotein assembly. In an exemplary embodiment, the principal binding site for the organic ligand is located at an allosteric site of a protein or at an interface formed between two or more allosteric regions or two or more proteins of the multiprotein assembly. Exemplary organic ligands of use in the present invention include those disclosed in copending, commonly-owned U.S. patent application Ser. No. 13/099,006, filed on May 2, 2011.
The term “biopharmaceutical” or “biopharmaceutical compound,” as used herein, refers to any candidate compound such as a protein, peptide, polypeptide, antibody, or the like, which can be expressed endogenously in a biological system under genetic control and which confers biological activity toward pharmaceutical or therapeutic use. The biopharmaceutical or biopharmaceutical compound can be constitutively or inducibly expressed. The biological system can be an in vivo biological system and/or an in vitro biological system.
As used herein, the terms “isolated” and “substantially purified” refers to a multiprotein assembly that is removed from its natural environment and is usually at least 60% free, preferably 75% free, and most preferably 90% free from other components with which it is naturally associated. Thus, for example, a composition containing A is “substantially free of” B when at least 85% by weight of the total A+B in the composition is A. Preferably, A comprises at least about 90% by weight of the total of A+B in the composition, more preferably at least about 95% or even 99% by weight. In an exemplary embodiment, the multiprotein assembly is “isolated” when it is bound to a stationary support, e.g., through interaction with a candidate compound (e.g., organic ligand) immobilized thereon.
In an exemplary embodiment, the present invention provides an isolated multiprotein assembly, which is implicated in one or more disease states, e.g., mammalian disease states (e.g., viral, bacterial, cancer, neurodegenerative and psychological). In an exemplary embodiment, the multiprotein assembly includes two or more proteins interacting with each other. The isolated multiprotein assembly can be isolated from any source. In an exemplary embodiment, the assembly is isolated from a host cell, e.g., a host cell extract. In various embodiments, the assembly includes two or more host cell proteins interacting with each other. In various embodiments, the multiprotein assembly is associated with an organic ligand. In an exemplary embodiment, the organic ligand is bound to a solid support and the interaction between the organic ligand and the multiprotein assembly immobilizes the multiprotein assembly on the solid support. In an exemplary embodiment, the protein component of a multiprotein assembly is ABCE1. In an exemplary embodiment, the protein component of a multiprotein assembly is SAM Synthetase. In an exemplary embodiment, the protein component of a multiprotein assembly is Peroxiredoxin-2 (PRX-2). In various embodiments two or more of these proteins are independently, and in any combination, incorporated into a multiprotein assembly.
In an exemplary embodiment, the assembly is isolated from a cell free translation system, e.g., a wheat germ extract preparation. The translation system can include any amount wheat germ extract, e.g., about 10% wheat germ extract, e.g., about 5% wheat germ extract. In an exemplary embodiment, the cell free translation system is expressing one or more proteins implicated in a disease state (e.g., viral capsid proteins).
The multiprotein assembly can also be isolated from an animal host cell, e.g., a mammalian cell, e.g, from a tissue or organ affected by the disease state of interest. Exemplary mammalian cells and organs include stem cells, and cells of the eye, brain, lung, liver, skin, spleen, immune system and blood. In an exemplary embodiment, the multiprotein assembly is isolated from a mammalian host cell that is in a diseased or disordered state (e.g., viral or bacterial infection, having degenerative changes, or cancer). In various embodiments, the host cell is not affected by a viral disease, or the virus is not actively replicating in the host cell in a manner that gives rise to the disease state.
The isolated multiprotein assembly can be isolated from cells that are diseased and/or are stressed by one or more stressor, e.g., oxidative stress and environmental stress. Exemplary environmental stressors include, extreme heat, extreme cold, dehydration, hyper- and hypo-salinity, In an exemplary embodiment, the system in which the effects of the stressor is modeled for candidate compound selection is a plant system and the stressor is an environmental stressor.
In various embodiments, the multiprotein assembly is isolated at a selected point in the developmental cycle of a component of a cell-free system or a cell.
The multiprotein assembly can exist in one or more form. In an exemplary embodiment the isolated multiprotein assembly is isolated in a first form, which is implicated in a disease state. The first form is distinct from a second form of the assembly existing in a non-disease state. The first form can differ from the second form in the number of proteins in the assembly, the identity of the proteins in the assembly, the conformation of one or more protein in the assembly and a combination thereof. In various embodiments, the first form is an active form associated with a disease state and said second form is a quiescent form associated with a non-disease state. In an exemplary embodiment, the first form of the isolated multiprotein assembly is an active form associated with an energized state and the second form is a quiescent form associated with an energy-depleted state. In various embodiments, the second form converts to the first form upon addition of an energy source (e.g., ATP or other biologically relevant source). In various embodiments, the second form is convertible to the first form in the presence of an energy source and m-RNA encoding a protein implicated in a disease state.
In various embodiments, the isolated multiprotein assembly is bound to a candidate compound (e.g., therapeutic agent) for treating said mammalian disease state. The interaction between the assembly and the candidate may be in the solution phase, but in an exemplary embodiment occurs with the candidate compound immobilized on a substrate (e.g., a stationary phase, e.g., chromatographic support). In an exemplary embodiment, the complex between the multiprotein assembly and the candidate compound is isolated.
The isolated multiprotein assembly can be isolated before, after or during translation of an exogenous m-RNA in the host cell. In various embodiments, the isolated assembly has bound thereto one or more m-RNA molecule. In various embodiments, the isolated assembly has bound thereto one or more m-RNA molecule complexed with one or more ribosome. In an exemplary embodiment, the m-RNA encodes at least a portion of a protein implicated in a disease state (e.g., viral capsid, toxin, beta-4A, etc.). In an exemplary embodiment, the m-RNA does not include a stop codon.
The isolated multiprotein assembly can be one that is implicated in any disease of interest. In various embodiments, the disease is a member selected from viral infection, bacterial infection, parasitic infection, neoplasia, Alzheimer's, depression, affective disorders, Amyotrophic Lateral Sclerosis (ALS), Parkinson's Disease, Huntington's Disease and Schizophrenia.
In an exemplary embodiment, the multiprotein assembly is functional with respect to disease progression. In other words, when the multiprotein assembly is introduced into a translation system (cell free or cell-based) depleted in the proteins of the multiprotein assembly, the status of the disease state alters from quiescent to progression.
In various embodiments, isolated multiprotein assemblies are derived from across various subpopulations of cells and cell components (e.g., mitochondria, organelles, endoplasmic reticulum, nucleus, and cytosol) without separation of the machines particular to one or more of these subpopulations. In an exemplary embodiment, one or more subpopulation is first isolated and the multiprotein assemblies particular to this one or more subpopulation are isolated.
In various embodiments, the invention provides an assay to determine whether a candidate compound is effective at treating a disease that involves the assembly of multiprotein structures such as viral capsids, or misfolded proteins, e.g., amyloid fibrils. In various embodiments, the assay is performed using an isolated multiprotein assembly of the invention. In an exemplary embodiment, the screen is based upon confirming the ability of a candidate compound to interrupt the formation of a multiprotein assembly or the interaction between a multiprotein assembly and a second protein translated in a translation system. The interruption of the interaction between the multiprotein assembly and the second protein is an indication that the candidate compound is an effective agent (or starting point for development of an effective agent) for treating the disease state of interest.
Thus, in one embodiment, the invention provides a method for assaying a candidate compound for its ability to interfere with the function or assembly of a multiprotein assembly implicated in a disease. The multiprotein assembly can play one or more role in the initiation, advancement or treatment of the disease state. For example, the multiprotein assembly can participate in folding of a second protein, or the formation of multiprotein structures incorporating the second protein or both. The second protein is preferably a protein that is implicated in the disease of interest. An exemplary method includes the use of a cell-free translation system including a ribosome, a truncated m-RNA which is missing a stop codon at its 3′-terminus and the components of the translation system necessary for it to perform m-RNA translation. The truncated m-RNA encodes a host protein that is known or is thought to play a role in a multiprotein assembly (“protein substrate”) that interacts with one or more second protein implicated in the disease of interest. Because of the lack of a stop codon, the newly synthesized protein, the truncated m-RNA, or both are not released from the ribosome upon completing the synthesis of the host protein. In various embodiments, the protein is complexed to the ribosome at a site that is accessible to an A site. In an exemplary embodiment, the protein substrate is not complexed to a t-RNA. In various embodiments, one or more of the truncated m-RNA and the second protein are released from the ribosome upon treatment of the ribosome complex with puromycin.
The second protein is also expressed in the translation system and the translation system is contacted with the candidate compound prior to, during and/or after synthesis of the host protein, the second protein or both. In various embodiments, the ability of the candidate compound to disrupt the interaction of a multiprotein assembly and a second protein is determined and the presence of this ability is interpreted as an indication that the candidate compound is a useful therapeutic or lead compound for treating or preventing the disease of interest. The methods of determining whether the second protein has been incorporated into a multiprotein structure (e.g., a viral capsid) or is misfolded (e.g., an amyloid fibril) are well-known in the art. Methods of determining whether a second protein is incorporated into a viral capsid are set forth in the Examples appended hereto.
In operation, the candidate compound interferes with the assembly or function of the multiprotein assembly by interacting with any one or more proteins in the multiprotein assembly or at any location on the second protein. For example, in some embodiments, the candidate compound binds to site on a single protein, one or more site at an interface between two proteins interacting with each other, or at one or more site at an interface between three, or more than three, proteins that are interacting with each other to form the multiprotein assembly.
In various embodiments, the candidate compound interferes with the assembly or function of the multiprotein assembly by binding to one or more active site (e.g., enzymatic activity) of a single protein, one or more active site of each of two proteins or one or more active site of each of three, or each of more than three, proteins that are interacting with each other to form the multiprotein assembly.
In an exemplary embodiment, the candidate compound interferes with the assembly or function of the multiprotein assembly by binding to one or more allosteric site of one protein, one or more allosteric site of each of two proteins or one or more allosteric site of each of three, or each of more than three, proteins that are interacting with each other to form the multiprotein assembly.
Exemplary diseases for which candidate therapeutic compounds can be investigated include, without limitation, viral infections, bacterial infections, cancer, and diseases in which proteins are misfolded, e.g., Alzheimer's disease.
When the disease of interest is a viral infection, the infection can be by any one or more member of the group Flaviviridae, Togaviridae, Bunyaviridae, Arenaviridae, Filoviridae, Poxviridae, Orthomyxoviridae, Rhabdoviridae, Herpesviridae, Coronaviridae, Paramyxoviridae, Hepadnaviridae, Bornaviridae, Picornaviridae, Retroviridae, Reoviridae, Papillomaviridae, Adenoviridae, Astroviridae, and Polyomaviridae.
In another exemplary embodiment, there is provided an assay for determining whether a candidate compound interferes with a target that is a host target or a target encoded by a pathogenic organism. The method is based on the removal of host proteins that bind to the candidate compound from the translation medium by contacting an initial preparation of the medium, which is uncharged by coding genetic material, with an affinity chromatography device that includes the candidate compound immobilized thereon. The material that is not captured by the affinity chromatography device is collected in a flow through fraction. The device is then eluted with an eluent that includes the candidate compound to displace the proteins bound to the immobilized candidate compound from the affinity chromatography device. The material that is eluted off is collected and is exhaustively dialyzed to remove both free and bound candidate compound to form an eluent fraction. Translation of a host protein and a second protein is performed in a medium including the flow through fraction in the presence of the candidate compound. If there is no evidence of the assembly of a multiprotein structure incorporating the second protein or of a misfolded protein, this is an indication that protein substrate for the multiprotein assembly is a host protein and that it was removed from the medium by affinity chromatography on the candidate compound. In various embodiments, the flow through fraction and the eluent fraction are combined and translation of a host protein and a second protein is performed in the resulting medium. If there is evidence of the assembly of a multiprotein structure incorporating the second protein or of a misfolded protein, this is an indication that the substrate protein for formation of the multiprotein assembly is a host protein that interacts specifically with the candidate compound.
Thus, in various embodiments, the invention provides a method of verifying that a target for a candidate compound which interferes with the assembly or function of a multiprotein assembly implicated in a disease is a host target. An exemplary multiprotein assembly participates in folding of a second protein, formation of a multiprotein structure comprising the second protein or a combination thereof. An exemplary method includes: (a) contacting an initial medium for a cell-free translation system including one or more host protein with an affinity chromatography device having said candidate compound immobilized thereon. The candidate compound is immobilized on the chromatography device either directly or through a linker. In various embodiments, the contacting is performed under conditions appropriate to bind at least one member of the multiprotein assembly to the candidate compound. In step (b), the chromatography device is washed with a first eluent, removing species not bound to the immobilized candidate compound. The eluent from the device is collected and is termed a flow through fraction. The device is then washed with a second eluent, which preferably includes the candidate compound and which displaces the bound proteins from the device. This fraction is termed an eluent fraction.
In various embodiments the candidate compound is combined with the flow through fraction and this resulting first mixture is used for cell-free translation of an m-RNA sequence. The m-RNA sequence encodes a protein substrate of the multiprotein assembly. In various embodiments, the m-RNA sequence is a truncated m-RNA sequence lacking a stop codon at its 3′-terminus. In a preferred embodiment, upon completion of translation of the truncated m-RNA sequence, the protein substrate remains complexed to the ribosome. An exemplary site for protein complexation is a site that is accessible to an A site. It is generally preferred that the protein substrate is not complexed to a t-RNA. The cell-free translation system is also used to synthesize the second protein, generally essentially simultaneously with synthesis of the first protein. The cell-free system is assayed to determine whether the second protein was misfolded, the multiprotein structure comprising the second protein was formed or both. If there is no evidence of the assembly of a multiprotein structure incorporating the second protein or of a misfolded protein, this is an indication that protein substrate for the multiprotein assembly is a host protein and that it was removed from the medium by affinity chromatography on the candidate compound.
In some embodiments, it is desired to confirm or augment the results from the first portion of the assay. In exemplary embodiments, the candidate compound is combined with the flow through fraction and the eluent fraction. The second mixture is used in the cell-free translation of the truncated m-RNA sequence lacking a stop codon at its 3′-terminus, and the synthesis of the second protein. The cell-free system is assayed to determine whether the second protein was misfolded, the multiprotein structure comprising the second protein was formed or both. If there is no evidence of the assembly of a multiprotein structure incorporating the second protein or of a misfolded second protein, this confirms that said initial medium does not include a host target for the candidate compound. Alternatively, it confirms that the host target for the candidate compound is removed by contacting the medium with the immobilized candidate compound, and confirmation of folding of the second protein or formation of the multiprotein structure including said second protein confirms that said host target for said candidate compound is removed by contacting with said immobilized candidate compound and, therefore, interacts with the candidate compound.
The invention also provides methods of preparing the isolated multiprotein assembly. An exemplary method includes: (a) contacting a host extract with a candidate therapeutic agent immobilized on a solid support, thereby immobilizing the multiprotein assembly on the solid support; and (b) separating components of the host extract other than the immobilized multiprotein assembly from the immobilized multiprotein assembly, thereby isolating the multiprotein assembly. In an exemplary embodiment, the isolated multiprotein assembly is produced by a method in which, prior to step (a), an endogenous m-RNA is translated in the host system.
In an exemplary embodiment, the invention provides a method for identifying and/or isolating a multiprotein assembly that does not require the multiprotein assembly to be bound to a candidate compound at any point. This method results in a significant savings in time and labor by eliminating exhaustive dialysis procedures. In an exemplary method, a first mixture of proteins thought to include a multiprotein assembly is submitted to a fractionation process, e.g., size exclusion chromatography or glycerol gradient fractionation. Any useful number of fractions are collected. The fraction containing the multiprotein assembly of iterest is then selected. In an exemplary embodiment, the fractionation process is calibrated to confirm the presence of the multiprotein assembly of interest in the selected fraction before the fractionation is performed on the first protein mixture. In an exemplary embodiment, the fractionation process is calibrated by the method of the invention set forth above in which a mixture of proteins is contacted with a candidate compound immobilized on a solid support to capture the assembly, washing the column to remove proteins not associated with the assembly, eluting the assembly from the column (e.g., with candidate compound), dialyzing away from the eluted assembly all compound used to elute the assembly, and reconstituting the translation system with the dialyzed assembly to confirm re-initiation of translation. Thus, in this embodiment, a second mixture of proteins prepared by a procedure essentially identical to the procedure used for preparing the first mixture is submitted to the capture, elution, dialysis, confirmation process and the fraction containing the assembly of interest is identified. When the first mixture is fractionated using essentially the same procedure used to calibrate the method with the second mixture, the fraction derived from the first mixture containing the assembly of interest will be pre-identified.
In an exemplary embodiment, fractions from the second mixture are identified by a technique specific for one or more protein of the assembly. In an exemplary embodiment, the technique is Western Blot.
In various embodiments, the protein identified is a member selected from ABCE1, Macrophage Migration Inhibitory Factor (MIF), Glutathione Transferase, SAM Synthetase, Peroxiredoxin-2 (PRX-2) and Cyclophilin A. In various embodiments two or more of these proteins are independently, and in any combination, identified.
As will be apparent to those of skill in the art, the activity in a translation system of the assembly isolated by the method set forth immediately above is readily verifiable by reconstituting a translation system depleted in the multiprotein assembly with the isolated multiprotein assembly.
In various embodiments, a kit for cell-free assay for determining whether a candidate compound is effective against a target protein or multiprotein assembly is provided.
In another embodiment, a kit for assaying the effect of a candidate compound on the assembly of a multiprotein assembly is provided. The kit comprises a cell-free mixture comprising not more than about 5% wheat germ extract, components necessary for expression of proteins required for viral capsid assembly and instructions sufficient for use of the kit in a cell-free expression experiment.
In another embodiment, a kit for a cell-free assay to determine whether a compound modulates assembly or function of a multiprotein assembly is provided. An exemplary kit comprises one or more of a cell-free mixture comprising one or more components of a cell-free assay system, components necessary for expression of the protein, and instructions sufficient for use of the kit in a cell-free expression experiment. In an exemplary embodiment the cell-free assay system is a wheat germ assay system. In an exemplary embodiment, the wheat germ assay system includes not more than about 5% wheat germ.
In another embodiment, a kit for determining whether a compound modulates viral capsid assembly is provided. An exemplary kit comprises one or more of a cell-free mixture comprising one or more components of a cell-free assay system, components necessary for expression of the protein, and instructions sufficient for use of the kit in a cell-free expression experiment. In an exemplary embodiment the cell-free assay system is a wheat germ assay system. In an exemplary embodiment, the wheat germ assay system includes not more than about 5% wheat germ.
In another embodiment, a kit for determining whether a compound modulates multiprotein assembly is provided. An exemplary kit comprises one or more of a cell-free mixture comprising one or more components of a cell-free assay system, components necessary for expression of the protein, and instructions sufficient for use of the kit in a cell-free expression experiment. In an exemplary embodiment the cell-free assay system is a wheat germ assay system. In an exemplary embodiment, the wheat germ assay system includes not more than about 5% wheat germ.
In each of the embodiments set forth above, the kit may provide an isolated multiprotein assembly of the invention in addition to the other recited elements or instead of one or more elements.
The invention provides compositions, kits and integrated systems for practicing the assays described herein. For example, an assay composition having a cell-free translation system including not more than about 10%, e.g., not more than about 8%, e.g., not more than about 6%, e.g., not more than about 5% wheat germ extract, and nucleic acid apparatus for expressing one or more host protein (e.g., a truncated m-RNA) a second protein and instructions for using these components in a drug screening assay are provided. Additional assay components as described above are also provided. For instance and affinity chromatography device, e.g., a solid support or substrate to which a candidate compound can be bound can also be included. Such solid supports include membranes (e.g., nitrocellulose or nylon), a microtiter dish (e.g., PVC, polypropylene, or polystyrene), a test tube (glass or plastic), a dipstick (e.g., glass, PVC, polypropylene, polystyrene, latex, and the like), a microcentrifuge tube, or a glass, silica, plastic, metallic or polymer bead or other substrate such as paper is provided. In various embodiments, the assay will use microtiter, e.g., 96, 384 or 1536 well microtiter plates. In various embodiments the affinity chromatography device is a resin column having a candidate compound immobilized thereon or a reactive group to which a candidate compound can be immobilized.
The invention also provides kits for practicing the candidate screening assays described above. The kits can include any of the materials noted above, and optionally further include additional components such as instructions to practice a high-throughput method of screening for a candidate compound, one or more containers or compartments (e.g., to hold candidate compounds, affinity chromatography resins, cell-free translation systems), a control activity modulator, a robotic armature for mixing kit components, and the like.
The invention also provides integrated systems for high throughput screening of potential candidate compounds. Such systems typically include a robotic armature which transfers fluid from a source to a destination, a controller which controls the robotic armature, a label detector, a data storage unit which records label detection, and an assay component such as a microtiter dish comprising a well having a capture moiety for a protein affixed to the well.
A number of well-known robotic systems have also been developed for solution phase chemistries. These systems include automated workstations like the automated synthesis apparatus developed by Takeda Chemical Industries, LTD (Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, Mass; Orca, Hewlett-Packard, Palo Alto, Calif.) which mimic the manual synthetic operations performed by a chemist. Any of the above devices are suitable for use with the present invention. The nature and implementation of modifications to these devices (if any) so that they can operate as discussed herein will be apparent to persons skilled in the relevant art.
The structure of the ribosome and the mechanism of translation, as have been revealed by recent work, are reviewed herein (Alberts, B., Johnson, A., Lewis, J., Raff. M., Roberts, K., and Walter, P., Molecular Biology of the Cell, 4th ed, 2002, Garland Science, N.Y.; Ramakrishnan, V., Ribosome Structure and the Mechanism of Translation, 2002, Cell 108 557-572; Schlunzen, F. et al., Structural basis for the interaction of antibiotics with the petidyl transferase center in eubacteria, 2001, Nature 413 814-821; Sytnik, A. et al., Peptidyl Transferase Center Activity Observed in Single Ribosomes, 1999, J Mol. Biol. 285, 49-54; Nyborg, J., and Liljas, A., Protein biosynthesis: structural studies of the elongation cycle, 1998, FEBS letters 430, 95-99).
The ribosome itself is composed of two subunits, termed 30S and 505 (there are differences between bacterial and eukaryotic ribosomes--henceforth in this discussion the ribosome is presumed to come from E. coli, although this assumption is made for the purposes of description only and without any intention of being limiting in any way). The large unit is composed of a pair of large RNA molecules (5S and 23S), the small subunit of a single RNA molecule (30S). Each unit has several dozen small proteins attached to it (Alberts, B., Johnson, A., Lewis, J., Raff. M., Roberts, K., and Walter, P., Molecular Biology of the Cell, 4.sup.th ed, 2002, Garland Science, N.Y.). The ribosome reads the code on mRNA molecules and synthesizes the encoded protein through the mediation of tRNA molecules. The process is performed in three stages: initiation, elongation and termination.
The ribosome uses an adaptor molecule—transfer RNA, or tRNA. These molecules are a special type of RNA. At one end, they have the anticodon part that binds to the RNA codon. At the other end, they carry the amino acid corresponding to that codon.
When tRNA is tagged (as for example with a fluorescent label), the tRNA should continue to function normally during the processes of becoming charged with an amino acid, attaching to the elongation factors, and traveling through the ribosome. Several tagging schemes have made use of the shoulder 10 of the molecule in order to create fluorescent labeling schemes that are efficient on the one hand and result in a fully functional tRNA molecule on the other hand. Several studies have shown that E. coli tRNAs (tRNA molecules) can be efficiently labeled at position 8, which has in many cases a 4-thiouridine base, and at position 47, which has in several cases an amine-reactive X-base (see table below; it should be noted that these position numbers are given according to a standard numbering system for tRNA molecules). tRNA functionality requires that the molecule interact properly with the aminoacyl synthetases on the one hand, and with the ribosomal machinery (including the elongation factors) on the other. tRNA recognition by aminoacyl synthetases is known to be particularly dependent on the anticodon part and the amino acid arm locus.
There are three important stages in translation: initiation, elongation and termination. For monitoring protein synthesis, where protein identification is a preferred motivation, the important stage is elongation.
Three docked tRNAs are seen in
The main stages of elongation are as follows. Stage 1: Codon recognition. A tRNA molecule carrying an amino acid binds to a vacant A-site, while the nascent polypeptide is attached to the P-site. Stage 2: Peptide bond creation. A new peptide bond is created and the polypeptide chain is moved to the A-site. Stage 3: Translocation. The ribosome translocates a distance of 3 nucleotides with respect to the mRNA, the two tRNA units and the polypeptide chain. Stage 4: the cycle repeats itself until a stop codon is reached.
This cycle is shown as schematic diagrams in
The majority of existing anti-viral drugs are nucleoside analogs or other agents that exert their effects through an enzyme involved in producing new copies of the viral genetic material, such as a nucleoside kinase or a polymerase or reverse transcriptase or replicase. These analogs are typically metabolized into nucleotide analogs that inhibit production of viral nucleic acid, for example by inhibiting a polymerase or by causing premature chain termination of growing viral nucleic acids. The efficacy of such drugs depends on two key factors. The first is that the target virus utilized at least one virus-specific enzyme, encoded by the virus and used only by the virus, in the pathways which result in the copying of its genetic material. The second is that this enzyme is more sensitive to the drug or more efficient in utilizing it than any corresponding enzyme in the host. However, because viral and cellular nucleic acid metabolism are so similar, it is difficult to find anti-viral agents that are not used to some extent by host cell enzymes. This limits the dose of anti-viral drug that can be tolerated, which in turn may limit the utility of the drug.
Even in the case where a drug is tolerated at an effective dose, its effectiveness can be reduced markedly by the ability of a virus to mutate relatively rapidly, evolving new versions of the viral enzyme which do not utilize the drug as efficiently or which are less inhibited by the drug. Furthermore, antiviral drugs that function at the level of multiprotein assemblies of host proteins, rather than viral proteins, are anticipated to show an increased resistance to drug resistance.
The present invention provides novel methods for discovering such drugs and for treating illnesses with the drugs discovered. The methods of this invention are based in the observation that assembly of viral multiprotein complexes require one or more host-derived protein substrate. This phenomenon is illustrated herein by reference to synthesis of the rabies viral capsid, however, this is an illustration of a broader general principle and the use of rabies capsid assembly as an example of this principle is not limiting.
Such drugs have significant advantages over current anti-viral agents. As noted above, the targets for the majority of the latter are enzymes involved in the synthesis of viral nucleic acids, and because host cells also contain enzymes active in the synthesis of nucleic acids it is difficult to hit the viral enzymes without also hitting the host ones. Similar problems are likely to occur for any drug target which is an active catalyst in the synthesis of a material required by both the virus and the host cell. In the methods of the present invention, these problems are avoided because the drug targets are not active catalysts in a synthetic pathway: they are devices used by a virus to secure preferential access to a synthetic pathway (protein synthesis), rather than catalysts in such a pathway. As weapons used by the virus in its attack on the host, these devices do not have any parallels within the host. Drugs which interfere with these devices therefore have minimal side effects on the host.
Such drugs are more effective than current drugs, for two reasons. First, their minimal side effects allow them to be used at higher doses. Second, it is possible for these drugs to be intrinsically more injurious to their targets than is tolerable for drugs whose targets have host homologues, because if the latter drugs are intrinsically too injurious they may harm the host homologues to some extent.
Assays for screening modulators of assembly or activity of a multiprotein assembly, (e.g., antiviral compounds) that are based on biochemical approaches typically involve testing compounds for activities that limit or inhibit proteins that are essential for disease progression. For example key components of viral replication complex are ideal targets for antiviral screening. Further, three-dimensional structures of viral proteins, if available, can afford the possibility for rational design of drugs that will inhibit their activity. Although biochemical approaches are capable of identifying potential viral inhibitors, they are limited in their overall efficiency since only a single enzyme or protein can be tested for any potential assay. Thus, individual assays would be required to screen for inhibitors of each given viral target protein. The present invention provides a significant advance over this state of the art.
In some embodiments, modulation of proteins, e.g., viral proteins, can be assessed by determining the effect of a compound on expression, folding, and assembly of the protein in a cell-free system (e.g., with about 5% wheat germ extract). In various embodiments, modulation of viral capsid assembly can be assessed by determining the effect of a compound on capsid assembly using a cell-free system. In some embodiments, modulation of capsid interacting proteins, including but not limited to capsid assembly chaperone proteins, can be assessed by determining the effect of a compound on expression of the protein in the cell-free system of the invention (e.g., using 5% wheat germ extract). Modulation can further include, but is not limited to, modulation of infection, replication, receptor binding, cell entry, particle formation, and the like.
An advantage of using a cell-free system in the present invention is that the process of capsid formation is slowed down, thus allowing for the targeting of capsid assembly processes for modulators of capsid assembly. An advantage of using cell-free systems in the invention, such as those having about 5% wheat germ extract, is an increased sensitivity for detecting compounds that otherwise would not be detected at higher wheat germ concentrations.
Measurement of modulation of a viral protein and/or viral capsid assembly can be performed using a variety of assays, in vitro, in vivo, and ex vivo. The assays described herein can use a full length viral protein, a variant, a mutant or a fragment thereof. A suitable physical, chemical (e.g., detectable moiety) or phenotypic change that affects activity, e.g., enzymatic activity, cell surface marker expression, viral replication and proliferation can be used to assess the influence of a test compound on the proteins expressed. The assay can also make use of one or more drug designed to block or alter protein activity, capsid assembly, or the associations of chaperones with viral proteins. The assay can also identify modulators of viral capsid assembly intermediates. Moreover, genomic nucleic acid can also be encapsidated into capsids, which can be used to design drugs that interfere with encapsidation and with the design of assay systems that examine the mechanism of action of drugs that inhibit encapsidation.
A high throughput binding assay can be performed in which the translation system is contacted with a candidate compound and incubated for a suitable amount of time. A wide variety of modulators can be used, including, but not limited to, small organic molecule, or a biopharmaceutical or biological entity, such as a protein, e.g., an antibody or peptide, a sugar, a nucleic acid, e.g., an antisense oligonucleotide or a ribozyme or siRNA, or a lipid.
In high throughput assays, either soluble or solid state, it is possible to screen up to several thousand different modulators or ligands in a single day. In particular, each well of a microtiter plate can be used to run a separate assay against a selected potential modulator, or, if concentration or incubation time effects are to be observed, every 5-10 wells can test a single modulator. Thus, a single standard microtiter plate can assay about 350 (e.g., 384) modulators. If 1536 well plates are used, then a single plate can easily assay from about 100- about 1500 different compounds. It is possible to assay many plates per day; assay screens for up to about 6,000, 20,000, 50,000, or more than 100,000 different compounds are possible using integrated systems.
High throughput screening methods involve providing a combinatorial small organic molecule or peptide library containing a large number of potential therapeutic compounds (potential modulator or ligand compounds) can be used. Such “combinatorial chemical libraries” or “ligand libraries” are then screened in one or more assays to identify those library members (particular chemical species or subclasses) that display a desired characteristic activity. The compounds thus identified can serve as conventional “lead compounds” or can themselves be used as potential or actual therapeutics.
A combinatorial chemical library is a collection of diverse chemical compounds generated by either chemical synthesis or biological synthesis, by combining a number of chemical “building blocks” such as reagents. For example, a linear combinatorial chemical library such as a polypeptide library is formed by combining a set of chemical building blocks (amino acids) in every possible way for a given compound length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical compounds can be synthesized through such combinatorial mixing of chemical building blocks.
Preparation and screening of combinatorial chemical libraries is well known to those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, peptide libraries (see, e.g., U.S. Pat. No. 5,010,175, Furka, Int. J. Pept. Prot. Res. 37:487-493 (1991) and Houghton et al., Nature 354:84-88 (1991)). Other chemistries for generating chemical diversity libraries can also be used. Such chemistries include, but are not limited to: peptoids (e.g., PCT Publication No. WO 91/19735), encoded peptides (e.g., PCT Publication No. WO 93/20242), random bio-oligomers (e.g., PCT Publication No. WO 92/00091), benzodiazepines (e.g., U.S. Pat. No. 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. USA 90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al., Amer. Chem. Soc. 114:6568 (1992)), nonpeptidal peptidomimetics with glucose scaffolding (Hirschmann et al., J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses of small compound libraries (Chen et al., I Amer. Chem. Soc. 116:2661 (1994)), oligocarbamates (Cho et al., Science 261:1303 (1993)), and/or peptidyl phosphonates (Campbell et al., J Org. Chem. 59:658 (1994)), nucleic acid libraries (see Ausubel, Berger and Sambrook, supra), peptide nucleic acid libraries (see, e.g., U.S. Pat. No. 5,539,083), antibody libraries (see, e.g., Vaughn et al., Nature Biotechnology, 14(3):309-314 (1996) and PCT/US96/10287), carbohydrate libraries (see, e.g., Liang et al., Science, 274:1520-1522 (1996) and U.S. Pat. No. 5,593,853), small organic molecule libraries (see, e.g., benzodiazepines, Baum C&EN, January 18, page 33 (1993); isoprenoids, U.S. Pat. No. 5,569,588; thiazolidinones and metathiazanones, U.S. Pat. No. 5,549,974; pyrrolidines, U.S. Pat. No. 5,525,735 and 5,519,134; morpholino compounds, U.S. Pat. No. 5,506,337; benzodiazepines, U.S. Pat. No. 5,288,514, and the like).
In an exemplary embodiment, a cell-free system for expressing a protein of interest is utilized as a component of the assay. The system comprises components necessary for expression of the protein of interest. In an exemplary embodiment, the protein of interest is a viral protein or a misfolded protein implicated in a disease. In various embodiments, the viral protein is a viral capsid protein. In a selected embodiment, the protein of interest is a capsid interacting protein. In a selected embodiment, the protein of interest is a host protein which undergoes assembly in a manner analogous to capsid proteins, i.e. most likely due to catalysis of formation of specific multi-protein complexes by other proteins in the cytoplasm.
In an exemplary embodiment, the protein is a non-viral protein that is a compound of a multiprotein assembly. In selected embodiments, the multiprotein assembly is implicated in diseases comprising central nervous system disorders, metabolic disorders, oncologic disorders, parasitic diseases or immunologic disorders. In an exemplary embodiment, the protein of interest is a bacterial or parasitic protein.
The viral protein can be from any virus or family of viruses. In an exemplary embodiment, the viral protein is from a virus which is a member of a viral family selected from the group consisting of Flaviviridae, Togaviridae, Bunyaviridae, Arenaviridae, Filoviridae, Poxviridae, Orthomyxoviridae, Rhabdoviridae, Herpesviridae, Coronaviridae, Paramyxoviridae, Hepadnaviridae, Bornaviridae, Picornaviridae, Retroviridae, Reoviridae, Papillomaviridae, Adenoviridae, Astroviridae, Polyomaviridae.
Cell-free translation systems provide cytosolic factors critical for translation, and support the in vitro translation of a wide variety of mRNAs into protein. The translation mechanism is sufficiently conserved so that a cell-free system can translate both prokaryotic and eukaryotic mRNAs with high efficiency. The optimal concentration of wheat germ extract can be determined for each RNA sequence to be translated.
Cell free translation systems are well known. Recently a synthetic system, built entirely from purified recombinant factors, and that has a high protein synthesis yield, was described (Shimizu et al., Cell-free translation reconstituted with purified components. Nat Biotechnol. 2001 August; 19(8):751-5). Kits and detailed instructions can be obtained from vendors such as Promega (Madison, Wis.). These systems are used for several applications, such as ORF validation and functional analysis of gene products. The systems contain ribosome-rich media with the required tRNAs and amino acids, and little or no mRNA. When mRNA is introduced, the ribosomes begin translation and proteins are produced. Often the proteins are produced radiolabeled. This enables the researcher to verify that the required proteins were in fact produced. The optional, exemplary system disclosed here is easier to assemble in vitro than in vivo, since labeling techniques are more readily available and easier to implement.
In an exemplary embodiment, a wheat germ extract cell-free system is utilized. Wheat germ extract is commonly used for cell-free translation reactions, and was initially described by various investigators (e.g., Roberts, B. E. and Paterson, B. M. (1973, PNAS 70, p. 2330)). Wheat germ extract is desirable because is supports translation of prokaryotic, eukaryotic, and viral RNAs. Wheat germ extract has been further shown to be useful in cell-free systems designed to assemble viral capsids (Lingappa et al. J Cell Bio 125: 99-111 (1994); Lingappa et al. J Cell Bio 136:567-581 (1997); Singh et al. Virology 279:257-270 (2001); Zimmerman et al. Nature 415:88-92 (2002); Lingappa and Thielen, Methods Mol Biol. 485:185-95 (2009)). Viral capsids are the protein shell of a virus that protects the viral genome, and its assembly is a process, catalyzed by host factors, that can be targeted to develop anti-viral drugs.
Proper wheat germ extract concentration is required for efficient protein expression. Optimal wheat germ extract concentration for robust protein expression has generally been shown to be around 40 to 50% of the total translation mix (Erikson and Blobel, Methods Enzymology 96:38-50 (1983)). Reports have specified that wheat germ extract is optimally used at 20% of the final reaction volume for proper capsid assembly (Lingappa and Thielen, Methods Mol Biol. 485:185-95 (2009)). In an exemplary embodiment, the cell free system used in the assay of the present invention includes not more than about 5% wheat germ extract. An exemplary system is set forth in commonly owned copending U.S. Provisional Patent Application No. 61/514,825 filed Aug. 3, 2011.
A cell-free expression system of use in the invention includes one or more components necessary or useful to ensure that the expression system expresses the desired protein in the desired amount and/or form. In addition to the translation apparatus, an exemplary embodiment, the further components of the system include one or more of a buffer, an amino acid, and a nucleic acid transcript. In various embodiments the composition further comprises one or more of a detectable moiety, ATP, GTP, creatine phosphate, a labeled amino acid, myristoyl CoA lithium salt, an RNase inhibitor, creatine kinase, and a tRNA. In an exemplary embodiment, the labeled amino acid comprises [35S] methionine. In an exemplary embodiment, the nucleic acid transcript is derived from an in vitro transcription reaction. In an exemplary embodiment, the nucleic acid transcript encodes a viral protein, e.g., a viral capsid protein. In an exemplary embodiment, the nucleic acid transcript encodes a viral capsid interacting protein. In an embodiment, the buffer comprises a member selected from potassium acetate, spermine, and dithiothreitol or other reducing agents and a combination thereof.
Cell-free translation systems can utilize a wide variety of components in addition to the translation apparatus (e.g., wheat germ extract). Described herein are exemplary basic components useful for efficient translation of proteins (e.g., viral proteins) using a cell-free translation system (e.g., wheat germ extract). However, one skilled in the art recognizes that additional components might be useful for translation and/or capsid assembly depending on the desired properties of the proteins and/or capsids produced the desired production conditions and other variables. The choice of the proper components for a cell-free system of the invention is well within the capabilities of those of skill in the art. For example, when the proteins of interest are implicated in viral infections, the use of wheat germ extract for capsid assembly is recognized known in the art for this application, and is described in more detail, for example, in Lingappa and Thielen, Methods Mol Biol. 485:185-95 (2009), and U.S. Pat. No. 7,638,269.
In an exemplary embodiment, the composition of the invention further includes a buffer. Cell-free translation systems generally require an appropriate compensating buffer. Potassium and magnesium concentrations of the wheat germ translation system can have dramatic effects on the efficiency of translation, and the compensating buffer is used to adjust the ion concentration of the total translation reaction to an optimum that can be determined for each mRNA being translated. Buffers can include further components for efficient protein expression, which include, but are not limited to, potassium acetate, amines (e.g., spermine), and sulfur compounds (e.g., dithiothreitol).
The composition of the invention also optionally includes a nucleic acid encoding a protein or a portion thereof. In an exemplary embodiment, the translation mixture contains transcript nucleic acid or a fragment thereof that encodes one or more protein implicated in a disease state, e.g., a viral protein. In this embodiment, the cell-free translation system involves two linked reactions: in vitro transcription and cell-free translation. RNA can be obtained by any method known in the art including, but not limited to, isolating mRNA or by making in vitro mRNA transcripts from DNA cloned into a vector containing an mRNA polymerase promoter. RNA molecules can also be generated in the same reaction vessel used for the translation reaction. In an exemplary embodiment, the mRNA is generated in situ by addition of, for example, SP6 polymerase to the reaction mixture along with the viral protein coding region or cDNA.
In an exemplary embodiment, a sample containing a virus of interest or a bodily fluid of an individual infected with a virus of interest, or infected cells from an individual, is used a source of viral nucleic acid encoding the protein for the virus. This can then be engineered behind an appropriate promoter (e.g. for SP6 polymerase), amplified by PCR and purified for transcription-linked translation. The fluid may be any bodily fluid including, without limitation, blood, serum, plasma, lymphatic fluid, urine, sputum, cerebrospinal fluid, and the like.
The endogenous mRNA present in the wheat germ extract can compete with the exogenous RNA for ribosomes and factors required for translation. It is therefore optionally advantageous to reduce the concentration of endogenous RNA by treating the prepared extract with nuclease. Such nucleases are well known in the art, and can include, but are not limited to, micrococcal nuclease from Staphylococcus aureus.
Methods known in the art are used to maintain energy levels sufficient to maintain protein synthesis. In an exemplary embodiment, additional nucleotide energy sources are added during the reaction. In various embodiments, energy is maintained by addition of an energy source such as creatine phosphate/creatine phosphokinase. In an exemplary embodiment, ATP and GTP concentrations present in a standard translation mixture known in the art are sufficient to support both protein synthesis and capsid formation.
In an exemplary embodiment, the virus has a myristolated intermediary. For such viruses, one can add sufficient myristoyl coenzyme A (MCoA) with or without acceptable salts to the system to enable capsid assembly. The concentration required may vary according to the particular experimental conditions, and can therefore be determined empirically.
In various embodiments, the cell-free translation systems of the invention comprise further components including, but not limited to, RNase inhibitors, ribonuclease inhibitors, protease inhibitors, microsomal membranes, and tRNAs, either alone or in combination.
With the present invention, cell-free translation systems can optionally produce one protein or many proteins, and their identification and production rates could be measured, controlled, and optimized in real time.
Proteins expressed in the cell-free system optionally include a detectable moiety. This may be a primary label or a secondary label. In an exemplary embodiment, the detectable moiety is a primary label. A primary label is one that can be directly detected, such as a fluorophore. In general, labels fall into three classes: a) isotopic labels, which may be radioactive or heavy isotopes; b) magnetic, electrical, thermal labels; and c) colored or luminescent dyes. Labels can also include enzymes (horseradish peroxidase, etc.) and magnetic particles. Preferred labels include chromophores or phosphors but are preferably fluorescent dyes. Suitable dyes for use in the invention include, but are not limited to, fluorescent lanthanide complexes, including those of Europium and Terbium, fluorescein, rhodamine, tetramethylrhodamine, eosin, erythrosin, coumarin, methyl-coumarins, pyrene, Malacite green, stilbene, Lucifer Yellow, Cascade Blue™, Texas Red, alexa dyes, phycoerythin, bodipy, and others described in the 6th Edition of the Molecular Probes Handbook by Richard P. Haugland, hereby expressly incorporated by reference.
In a preferred embodiment, a secondary detectable label is used. Accordingly, detectable moieties may be primary labels (i.e. directly detectable) or secondary labels (indirectly detectable). A secondary label is one that is indirectly detected; for example, a secondary label can bind or react with a primary label for detection, or may allow the separation of the compound comprising the secondary label from unlabeled materials, etc. Secondary labels find particular use in systems requiring separation of labeled and unlabeled proteins. Secondary labels include, but are not limited to, one of a binding partner pair; chemically modifiable moieties; nuclease inhibitors, etc.
In a preferred embodiment, the secondary label is a binding partner pair. For example, the label may be a hapten or antigen, which will bind its binding partner. In a preferred embodiment, the binding partner can be attached to a solid support to allow separation of extended and non-proteins. For example, suitable binding partner pairs include, but are not limited to: antigens (such as proteins (including peptides)) and antibodies (including fragments thereof (FAbs, etc.)); proteins and small molecules, including biotin/streptavidin; enzymes and substrates or inhibitors; other protein-protein interacting pairs; receptor-ligands; and carbohydrates and their binding partners.
In a preferred embodiment, the binding partner pair comprises biotin or imino-biotin and streptavidin. Imino-biotin is particularly preferred as imino-biotin disassociates from streptavidin in pH 4.0 buffer while biotin requires harsh denaturants (e.g. 6 M guanidinium HC1, pH 1.5 or 90% formamide at 95° C.).
In a preferred embodiment, the binding partner pair comprises a primary detectable moiety and an antibody that will specifically bind to the primary detectable moiety. By “specifically bind” herein is meant that the partners bind with specificity sufficient to differentiate between the pair and other components or contaminants of the system. The binding should be sufficient to remain bound under the conditions of the assay, including wash steps to remove non-specific binding moieties.
For immobilization of proteins, it is preferred that the other half of the binding pair is attached to a solid support. In this embodiment, the solid support may be any as described herein for substrates and microspheres, and the form is preferably microspheres as well; for example, a preferred embodiment utilizes magnetic beads that can be easily introduced to the sample and easily removed, although any affinity chromatography formats may be used as well. Standard methods are used to attach the binding partner to the solid support, and can include direct or indirect attachment methods. For example, biotin labeled antibodies to fluorophores can be attached to streptavidin coated magnetic beads.
Thus, in this embodiment, the expressed proteins comprise a binding partner that is contacted with its binding partner under conditions wherein the proteins are separated from the unproteins. These proteins can then be added to the array comprising capture probes as described herein.
In a preferred embodiment, the secondary label is a chemically modifiable moiety. In this embodiment, labels comprising reactive functional groups are incorporated into the nucleic acid. The functional group can then be subsequently labeled with a primary label. Suitable functional groups include, but are not limited to, amino groups, carboxy groups, maleimide groups, oxo groups and thiol groups, with amino groups and thiol groups being particularly preferred.
In various embodiments, the detectable moiety is a component of a fusion protein expressed by the cell-free translation system. In one emobdiment, a fusion nucleic acid is introduced to the system. The fusion nucleic acid comprises nucleic acid encoding a protein of interest and a nucleic acid encoding a detectable moiety. Thus, by linking a protein of interest to a detectable molecule, for example green fluorescent protein, the presence or absence of the detectable molecule can serve to identify the protein of interest.
The nucleic acid encoding the protein of interest is operably linked to nucleic acid encoding a detectable molecule. The fusion proteins are constructed by methods known in the art. For example, the nucleic acids encoding the protein of interest is ligated to a nucleic acid encoding a detectable moiety. An exemplary detectable moiety is a fluorescent moiety. Preferred fluorescent molecules include but are not limited to green fluorescent protein (GFP), blue fluorescent protein (BFP), yellow fluorescent protein (YFP), red fluorescent protein (RFP), and enzymes including luciferase and .beta.-galactosidase. In a preferred embodiment, green fluorescent protein (GFP) or any of its derivatives including, but not limited to, EGFP (Haas et al., Curr. Biol. 6:315-324 (1996)), d2EGFP (Clontech), EBFP (Clontech), GFPuv (Crameri et al., Nature Biotechnol. 14:315-319 (1996)), BFP (blue fluorescent protein), YFP (yellow fluorescent protein) and RFP (red fluorescent protein) is used as a detectable moiety. GFP expression and loss of GFP expression can be monitored noninvasively in vivo in individual cells.
When single molecule detection is required in in vitro translation systems, the molecules are preferably immobilized. There are several approaches to immobilization of biomolecules. Biomolecules can be attached specifically or non-specifically, and in either case, either ribosomes or mRNA templates can be immobilized.
For non-specific immobilization, the protein can be attached to a charged surface such as an aminopropylsilane-coated surface via electrostatic interaction, as described in 8. Ha, T. et al., (1996) Proc. Natl. Acad. Sci. USA 93, 6264-6268. Another nonspecific immobilization method successfully used for single-molecule fluorescence study is trapping molecules inside polyacrylamide pores (Dickson, R. M., et al., (1996) Science 274, 966-969) or agarose gel (Lu, H. P., et al., (1998) Science 282, 1877-1882., Dickson, R. M., et al., (1997) Nature 388, 355-358.). While gel immobilization has the merit of not requiring any special modification of the biomolecule, it has some disadvantages. First, the concentration of other small molecules such as enzyme substrates and ions is difficult to change in a short time. Sudden changes in the buffer conditions are necessary for a certain type of single-molecule studies. Second, because of limited molecular diffusion, it is not easy to study interactions between macromolecules in gel.
Specific immobilization requires a well-defined modification of the biological molecule. For instance, a biotin or a digoxigenin can be attached to an mRNA, rRNA or ribosomal protein, to immobilize them to streptavidin- or antidigoxigenin-coated surfaces respectively. Alternatively, histidine tags that are typically introduced to help the purification of recombination proteins can be used to immobilize a ribosomal protein on a Ni-NTA-coated surface. A detailed procedure for preparing a mini-flow cell to immobilize biotinylated nucleic acids is described in Ha, T., Methods 25, 78-86 (2001).
A surface can be densely coated by polyethylene glycol (PEG). PEG is known to reject protein adsorption to a surface if it forms a dense coating. Bifunctional PEG can be used immobilize nucleic acids specifically to a surface while rejecting protein adsorption. mRNA can be optionally immobilized on a polyethylene glycol (PEG) coated surface with biotin-streptavidin linker, and the ribosomes allowed to process the immobilized mRNA. The mRNA preferably features 3′-end biotin labeling. Since protein synthesis may not end normally because of the linked 3′ end, it is advisable to ensure that the template mRNA continues for at least 20 codons beyond the stop codon. In another approach, a ribosomal protein can be labeled with biotin and immobilized on a fused glass slide. The other ribosomal components can then be reconstituted around the immobilized protein. Ribosomal complexes can also be bound to a mica surface, which is transparent and flat on a molecular size scale. Ribosomes, either labeled or unlabeled, undergo binding to mica in a few seconds, allowing the detection of single fluorescence images in aqueous buffer. A large excess of ribosomes and a short incubation period are employed for single molecule detection. The mica-bound ribosomes retain their activities, as shown in Sytnik et al., J. Mol. Biol. (1999), 285, 49-54, where detailed protocols are provided. Preparation of the mica cells and adsorption of ribosomes to these cells is also described in Vanzi et al., Protein synthesis by single ribosomes, RNA (2003), 9:1174-1179.
In one embodiment, the invention translates in a cell-free translation system a m-RNA encoding a protein or a portion of a protein (e.g., a host protein or a second protein, e.g., a protein implicated in a disease). In various embodiments, the m-RNA has been engineered such that it is missing a stop codon (“truncated m-RNA”) relative to its full length sequence (
In an exemplary embodiment, the truncated m-RNA (
The present invention also provides a method of expressing a host protein and a second protein, which may be a protein of interest (e.g., a protein implicated in a disease). The host protein and/or the second protein can be labeled with a detectable moiety when expressed using a composition of the invention. The second protein can be a protein from an infective agent, e.g., virus, bacterium a misfolded protein, etc. In an exemplary embodiment, the cell-free system is used to mimic capsid biogenesis and assembly and the second protein is one or more capsid protein. The focus on this embodiment is for purposes of illustration only and is not limiting.
In the cell-free system, viral capsid transcripts are translated in the presence of wheat germ extract that contains soluble factors necessary for capsid protein translation and subsequent capsid assembly (Lingappa et al. J Cell Biol 136:567-581 (1997)). Both cytosolic and membrane proteins present in the wheat germ extract may be involved in capsid assembly and/or viral replication. Integral membrane proteins can include transmembrane proteins. In those embodiments utilizing viruses requiring membrane proteins for capsid assembly, appropriate membranes can be added to the cell-free translation mixture. It is further possible to supplement the cell-free translation mixture with other exogenous proteins, such as chaperone proteins that can for example, facilitate the assembly of capsid intermediates. Assembly of capsids in the cell-free system minimally requires expression of only the particular viral protein(s) that are involved in capsid assembly. Once expressed, polypeptides proceed to assemble into capsids that are catalyzed by host factors.
After incubation for a time sufficient to produce capsids, products of the cell-free reaction can be analyzed to determine sedimentation value, buoyant density, and electron microscopy appearance. Together these form a sensitive set of measurements for integrity of capsid formation.
Synthesized viral proteins can be detected in any manner known in the art. In an exemplary embodiment, the second protein or the host protein is labeled with a detectable moiety. In an exemplary embodiment, radiolabeled capsid polypeptides using labeled amino acids are used. In one embodiment using this approach, 35S methionine is added to the translation mixture. Following in vitro expression, velocity sedimentation gradients can generate fractions that are aliquoted into loading buffers and run on a standard SDS-PAGE gel. The gel can be exposed to film that generates autoradiographs showing the amount of 35S labeled viral protein in different fractions of the velocity sedimentation gradients. Alternatively, as is known in the art, a phosphoimager can be used to visualize radiolabeled viral proteins.
In various embodiments, antibodies, e.g., commercially available antibodies, are used to detect successful protein expression or capsid assembly. Alternatively, antibodies can be specifically raised against proteins of interest, as is well known in the art. Other detectable moieties, such as those set forth herein are of use as well.
Viral protein expression and capsid assembly in the cell-free system of the invention is useful for viruses from any family including, without limitation, Flaviviridae, Togaviridae, Bunyaviridae, Arenaviridae, Filoviridae, Poxviridae, Orthomyxoviridae, Rhabdoviridae, Herpesviridae, Coronaviridae, Paramyxoviridae, Hepadnaviridae, Bornaviridae, Picornaviridae, Retroviridae, Reoviridae, Papillomaviridae, Adenoviridae, Astroviridae, Polyomaviridae.
The present invention also provides a method of expressing non-viral proteins using a composition of the invention. In an exemplary embodiment, the cell-free system is used to mimic a pathway of assembling a multi-protein assembly for which the viral capsid serves as an analogous model. Because host proteins involved in viral capsid assembly also exist in the host for other purposes, the non-viral proteins that are the substrates for those endogenous assembly pathways are equally effectively usable for drug screening by the present invention. Thus, the present invention is applicable, not only to viral disease, but also to proteins implicated in other disorders including, but not limited to, metabolic, nervous system, oncologic, and immunologic diseases. In one embodiment, the invention is used to screen drugs effective in treating diseases in which amyloid fibrils are implicated (e.g., Alzheimer's and Creutzfeldt-Jakob disease). The present invention can be further used to mimic a bacterial or parasitic protein that forms a multiprotein complex that when disrupted, ameliorates bacterial or parasitic disease. What is required for applicability of the present invention is that the newly synthesized proteins in question share the ability to use other proteins in the extract to assist or facilitate their assembly into distinct multiprotein complexes.
The following examples serve to further illustrate the present invention in a non-limiting manner.
Compound resin conjugates are synthesized as follows:
11 (64 mgs,0.15 mmol) was obtained by the boc-deprotection of 10 employing 4M hydrochloric acid in dioxane and 1,2-dichloroethane. Upon completion, the reaction mixture was concentrated to dryness. To the residue obtained was added, DMF (1.5mL), Et3N (103 μL 0.75 mmol), AFFI-GEL® 10 (affinity media, AFFI-GEL is a trademark of Bio-Rad Laboratories, Inc.) (CHEM CAP: 0.015 mmol/mL, 20 mL, 0.30 mmol) and agitated for 18 hrs. Ethanolamine (10 mL) was added to the bluish mixture, agitated for an additional 10 hrs. The reaction mixture was filtered with the aid of a disposable filter funnel with polyethylene fritted disc, the resin was washed sequentially with DMSO(5X) solution of urea, isopropanol and stored in PBS, producing affinity chromotography device 12.
Cell-free translation was carried out essentially as described previously (Lingappa J-R, et al. (1994) A eukaryotic cytosolic chaperonin is associated with a high molecular weight intermediate in the assembly of hepatitis B virus capsid, a multimeric particle. J Cell Biol 125:99-111; Lingappa J-R, Hill R-L, Wong M-L, Hegde R-S (1997) A multistep, ATP-dependent pathway for assembly of human immunodeficiency virus capsids in a cell-free system. J Cell Biol 136:567-581), but at lower percentage cell extracts.
Moderate throughput small molecule screening was carried out in 384 well format by translation of RABV N mRNA supplemented with mRNA for RABV P and M, and eGFP in 20 μl reactions per well in the presence of small molecules from the Prosetta compound collection, for 1 hr at 26° C. for synthesis, followed by assembly at 34° C./1 h. Products were captured on a second 384 well plate precoated with affinity purified antibody followed by washing with phosphate buffered saline containing 1% TRITON™ x-100 (detergent, TRITON is a trademark of Dow Chemical Company), and then decorated with biotinylated affinity purified antibody, neutravidin HRP, further washing and incubation with a fluorogenic substrate, QUANTABLU™, (fluorescence kit, QUANTABLU is a trademark of Life Technologies) that generates a fluorescent readout proportional to the degree of biotinylated antibody binding which in turn is a function of degree of assembly upon measurement of fluorescence at 330/425 nm (excitation/emission) after 1 hr.
Sucrose step gradients were performed essentially as described previously (Lingappa J-R, et al. (1994) A eukaryotic cytosolic chaperonin is associated with a high molecular weight intermediate in the assembly of hepatitis B virus capsid, a multimeric particle. J Cell Biol 125:99-111; Lingappa J-R, Hill R-L, Wong M-L, Hegde R-S (1997) A multistep, ATP-dependent pathway for assembly of human immunodeficiency virus capsids in a cell-free system. J Cell Biol 136:567-581).
Glycerol gradients were poured using a linear gradient former from 5 to 35% glycerol in Tea 10 mM pH 7.6, 10 mM NaCl, 1 mM MgAc and 0.2 mM EDTA. Gradients were chilled, samples up to 200 ul loaded and centrifuged in the TLS-55 rotor at 50K rpm/55 minutes with slow acceleration and deceleration. Gradients were then fractionated into 200u1 aliquots 1-11 and aliquots analyzed by SDS-PAGE.
SDS-PAGE was carried out as previously described (Lingappa J-R, et al. (1994) A eukaryotic cytosolic chaperonin is associated with a high molecular weight intermediate in the assembly of hepatitis B virus capsid, a multimeric particle. J Cell Biol 125:99-111; Lingappa J-R, Hill R-L, Wong M-L, Hegde R-S (1997) A multistep, ATP-dependent pathway for assembly of human immunodeficiency virus capsids in a cell-free system. J Cell Biol 136:567-581). Gels were transferred in Towbin buffer overnight, blocked in 1% BSA, and incubated at room temperature in primary antibody at 1:1000 dilution of approximately 100μg/m1 affinity purified IgG for 1 hr, washed 3× in PBS with 0.1% TWEEN™-20 (detergent, TWEEN is a trademark of Croda International) and incubated with secondary anti-rabbit antibody coupled to alkaline phosphatase at 1:5000 dilution for 1 hr, followed by washing to various degrees of stringency by elevation of salt, followed by Tris-buffered saline wash and incubation in developer solution prepared as follows. BCIP [5-Bromo-4-chloro-3-indolyl phosphate] 225 mg was dissolved in 18 ml dimethyl formamide [DMF] with 12 ml water to give 7.5 mg/ml in 60% DMF. NBT [Nitro blue tetrazolium] 450 mg dissolved in 21 ml dimethyl formamide with 9 ml water was prepared (15 mg/ml 70% DMF). (100 μl of each solution (stored at −20° C.) was adjusted to 50 ml with 0.1M Tris pH 9.5/0.1 mM MgCl2 to prepare working stock of developer that was applied to washed blots.
Street Rabies virus (RABV; TxFX A11-1198) was derived from the salivary glands of a rabid gray fox (Urocyon cinereoargenteus), associated with enzootics in carnivores throughout the southwest United States. Mouse neuroblastoma (MNA) cells were propagated in Eagle minimal essential medium (MEM) supplemented with 10% fetal bovine serum.
In multiple wells of a 96-well plate, MNA cells were infected with RABV at a multiplicity of infection (m.o.i) of 0.1 per cell (except cell control), and incubated at 37° C. for 48 hrs in the presence of MEM supplemented with 10% fetal calf serum. After incubation, 100 μl of supernatant was removed from each well, and replaced with an anti-viral compound at described concentrations, and incubated for an additional 48 hrs.
After incubation, 100 μl of supernatant was removed and titrated on MNA. Micro titer plates were washed twice in PBS (phosphate buffered saline, pH 7.2-7.4) and fixed with 80% acetone at −20° C. RABV antigens were detected by direct fluorescent antibody staining using fluorescein isothiocyanate (FITC)-labeled monoclonal antibody conjugate (Fujirebio Diagnostics, Inc., Malvern, Pa.), titers for infectious virus released into the supernatant were calculated by the Reed and Muench method. The BSR cells (a clone of BHK) were grown in Dulbecco's minimal essential medium (DMEM) supplemented with 10% fetal bovine serum (Atlanta Biologicals, Lawrenceville, GA) at 37° C., in a 5% CO2 incubator. The rabies virus ERA strain was obtained from ATCC and maintained at CDC, Atlanta. For virus titration and antiviral compound treatment, the confluent BSR cells in T75 flasks were split and seeded to the 24-well-plate (Fisher Scientific, Becton-Dickinson, Suwanee, GA). Twenty four hours post-incubation, the confluent BSR cells in plate were infected with 1 m.o.i. of rabies virus ERA either pre- or post-antiviral compound treatment at the indicated time course. Virus titer in the treated cell supernatants was calculated in focus forming unit (ffu) per ml. In brief, 20 μl of cell supernatants, mixed with freshly prepared 180 μl of BSR cell suspension, was seeded into a LAB-TEK® Chamber Slide (laboratory equipment, LAB-TEK is a trademark of Nalge Nunc International Corp.) (Fisher Scientific, Nunc, Rochester, N.Y.). A serial 10-fold dilution of the virus- cell supernatants was made similarly with BSR cell suspensions in the same slide. The cells were incubated at 37° C., in a 5% CO2 incubator for 24 hrs before titration using the DFA assay. A standard DFA protocol (www.cdc.gov/rabies/pdf/rabiesdfaspv2.pdf) was followed for virus titration or the effect of antiviral compound treatment against the original cells grown in the 24-well-plate.
Materials were general purchased from Sigma Chemical Co, St Louis, Mo. or Thermo Fisher. Affinity purified antibody to RABV N and ABCE1 can be purchased from www.pbpl.com.
As a first step to application of this novel approach, we analyzed authentic RABV, that had been irradiated to render the virus non-infectious, on sucrose step gradients (ssg) that have previously been the used to dissect the capsid assembly pathways of other viral families in cell-free systems (Lingappa J-R, et al. (1994) A eukaryotic cytosolic chaperonin is associated with a high molecular weight intermediate in the assembly of hepatitis B virus capsid, a multimeric particle. J Cell Biol 125:99-111; Lingappa J-R, Hill R-L, Wong M-L, Hegde R-S (1997) A multistep, ATP-dependent pathway for assembly of human immunodeficiency virus capsids in a cell-free system. J Cell Biol 136:567-581; Klein K-C, Dellos S-R, Lingappa J-R (2005) Identification of residues in the hepatitis C virus core protein that are critical for capsid assembly in a cell-free system. J Virol. 79:6814-6826; Klein K-C, Polyak S-J, Lingappa J-R (2004) Unique features of hepatitis C virus capsid formation revealed by de novo cell-free assembly. J Virol. 78:9257-9269). As shown in
Our approach to expression through cfps is to first engineer the coding region of interest, in this case the RABV N gene initially, and later the matrix (M) and phosphoprotein (P) genes, behind the SP6 bacteriophage promoter and the Xenopus globin 5′ untranslated region, as described previously (Krieg P-A, Melton D-A (1984) Functional messenger RNAs are produced by SP6 in vitro transcription of cloned cDNAs. Nucleic Acids Res 12:7057-7070). PCR is then used to generate messenger RNA (mRNA) encoding each full-length protein of interest.
When mRNA encoding N was translated at 26° C./1 hr and applied to ssg as described, the newly synthesized (as evidenced by radiolabelling) protein is released from the ribosome and found at the top of the ssg with peak in fraction 1 (
The rabies literature suggests that the M and P proteins are likely to play roles in assembly of progeny virus (Fu Z-F, Zheng Y, Wunner W-H, Koprowski H, Dietzschold B (1994) Both the N- and the C-terminal domains of the nominal phosphoprotein of rabies virus are involved in binding to the nucleoprotein. Virology 200:590-597; Yang J, et al. (1998) The specificity of rabies virus RNA encapsidation by nucleoprotein. Virology 242:107-117; Waibler Z, Detje C-N, Bell J-C, Kalinke U (2007) Matrix protein mediated shutdown of host cell metabolism limits vesicular stomatitis virus-induced interferon-alpha responses to plasmacytoid dendritic cells. Immunobiology 212:887-894; Gerard F-C, et al. (2009) Modular organization of rabies virus phosphoprotein. J Mol Biol 388:978-996). In view of this possibility, we co-translated mRNA encoding N with those encoding P and M as in
Compared to parallel experiments with N translation alone, the results for N co-translated with M and P were notable for a striking slowing of N progression to fractions 5-7 (see
To better catalogue the high molecular weight complexes observed for RABV N, gradient fractions were analyzed under conditions that generated a distribution of N between fractions 1-6 (e.g. as shown in
In this context, we addressed in more detail for the specific case of RABV, the critical issue of energy-dependence, a key discriminator between spontaneous, thermodynamically driven self-assembly and (host) catalyzed models of capsid formation, as has previously been done for HIV (Lingappa J-R, Hill R-L, Wong M-L, Hegde R-S (1997) A multistep, ATP-dependent pathway for assembly of human immunodeficiency virus capsids in a cell-free system. J Cell Biol 136:567-581). Apyrase was immobilized on sepharose beads and the immobilized apyrase (iapy) demonstrated to be active and to fully deplete the ATP in 25 bed volumes of translation reaction, and that no apyrase leached from the beads since it could be quantitatively removable by centrifugation with restoration of translation upon addition of fresh ATP (data not shown). Translation products of mRNAs encoding each of RABV proteins N, M, and P, were synthesized separately or together and immediately post-translationally, iapy was added and incubated at 4° C. for 1 hr, and then the iapy removed by centrifugation, with careful transfer of the supernatant. An aliquot of the supernatant was analyzed without further manipulation as for
To assess the composition and significance of these putative assembly intermediates, and to provide another set of properties by which they could be distinguished, individual fractions from ssgs containing RABV N, as shown in
In
We wished to establish a screen to identify small molecules capable of blocking or altering progression through this newly identified pathway, through their effect on the catalytic protein-protein interactions implied by the data in
The screen was validated by demonstrating that authentic RABV shown in
We sought hits by screening a portion of a library of drug-like small molecules largely conforming to Lipinski's rule of five (Albertini A-A, et al. (2006) Crystal structure of the rabies virus nucleoprotein-RNA complex. Science 313:360-3). Hits were defined as compounds demonstrating a dose-dependent diminution of fluorescence in the whole pathway screen where the diminution of fluorescence could not be accounted for by diminution of protein synthesis (as measured by co-translation of green fluorescent protein [eGFP] or by WB for RABV N). A number of compounds were observed to have modest dose-dependent inhibition of fluorescence in the RABV screen. A potent compound emerging from a comparable screen for influenza nucleoprotein assembly had no effect in the RABV screen (data not shown). Two hits and one negative compound from the RABV screen are shown in
A set of analogs of one of the two active compounds were synthesized and assessed in both the cfps screen (
It was hypothesized that the most robust hits from this screen are likely to be targeting those host factors that act catalytically, as the consequence of inhibition of such a target would be expected to have the greatest effect on RABV N assembly in cells. The potent compounds identified made it possible to test this hypothesis. First compared was the effect of time of compound addition on the titer of infectious RABV generated in infected cells. In the experiments shown in
To better understand the time of action and target of these drugs, studies were performed in the cfps system. By generating mRNA for RABV N from which a stop codon is lacking (Perara E, Rothman R-E, Lingappa V-R(1986) Uncoupling translocation from translation: implications for transport of proteins across membranes. Science 232:348-352), termed truncated N and abbreviated NR, it is possible to retain a significant fraction of chains in polyribosomes, such that they migrate in the middle of the ssg (
This data strongly suggests that the target protein action blocked by the compound occurs very early in the pathway, possibly before release of newly synthesized RABV N from the ribosome. And yet, the compound effect requires the later events of assembly (incubation at 34° C.), to be manifest. Indeed, a time course of A addition at 1, 5, and 15 minutes after puromycin release, strongly suggests that the target is engaged and has completed its action prior to release of the nascent chain, because even 1 minute after treatment with puromycin, the drug effect is largely lost (data not shown).
As an approach to better understanding this unconventional target, A was coupled to a resin, and the resulting resin conjugate used for affinity chromatography (Blobel G, Sabatini D (1971) Dissociation of mammalian polyribosomes into subunits by puromycin. Proc Natl Acad Sci USA 68:390-394). The resin conjugate corresponding to A was termed resin 1 and blocked resin alone (lacking a compound and therefore serving as a negative control for binding specificity) was termed resin 2. In
The most direct test of a host target would be to demonstrate binding of specific protein(s) from the cfps extract prior to its programming with RABV N encoding mRNA—and to show the bound proteins are essential for the activity of A. All the more because A, the active anti-RABV compound, was identified through the cfps screen and used as the affinity ligand for identification of the putative compound target. Towards this end, the starting WG extract (prior to programming with RABV mRNA or use in the cfps screen), was applied to columns of resin 1 or 2, with similar wash and elution steps as described above. The free compound eluate was subjected to exhaustive dialysis to remove both free and bound compound. A flow through extract was prepared (termed depleted extract because it is missing resin conjugate bound proteins), along with an exhaustively dialyzed free compound (A) eluate. The cfps programmed with RABV N was carried out separately in: i) the starting WG extract, ii) the resin column 1 flow through extract (depleted WG) and iii) the dialyzed eluate added to the depleted WG. These three translation reactions were carried out in the absence of compound (with DMSO vehicle control) and in the presence of a titration of two active anti-RABV pharmacophores, including A. As shown previously (see
Another approach to corroboration of the mechanism of action of A is to assess the effect of the compound on assembly of newly synthesized RABV N and P by cfps as judged by ssg. As shown in
Analysis of the 1 eluate revealed a striking pattern of approximately a dozen major protein bands by silver stain that were not found in the column 2 eluate (
We were initially surprised to see such a large number of distinct proteins specifically bound to the active compound conjugate (resin 1) and eluted with free A from the column 1 but not from the control column (resin 2). To better understand the meaning of this result, the WG and brpmis starting material and A eluate from column 1 were analyzed by glycerol gradient ultracentrifugation, and the gradient profiles probed by silver stain to assess the entire set of proteins observed in
These results suggest that the eluted proteins from the compound resin conjugate 1 likely comprise a multiprotein complex that is both functional for RABV N assembly, is the basis for compound sensitivity of assembly (
The goal of the present study was to explore the possibility that cfps could identify novel, unconventional but druggable host protein targets that catalyze the progression of RABV N through assembly intermediates, based on the successful reconstitution of capsid assembly pathways with these characteristics for other viral families (Lingappa J-R, et al. (1994) A eukaryotic cytosolic chaperonin is associated with a high molecular weight intermediate in the assembly of hepatitis B virus capsid, a multimeric particle. J Cell Biol 125:99-111; Lingappa J-R, Hill R-L, Wong M-L, Hegde R-S (1997) A multistep, ATP-dependent pathway for assembly of human immunodeficiency virus capsids in a cell-free system. J Cell Biol 136:567-581; Klein K-C, Dellos S-R, Lingappa J-R (2005) Identification of residues in the hepatitis C virus core protein that are critical for capsid assembly in a cell-free system. J Virol. 79:6814-6826; Klein K-C, Polyak S-J, Lingappa J-R (2004) Unique features of hepatitis C virus capsid formation revealed by de novo cell-free assembly. J Virol. 78:9257-9269). Corroboration of authenticity of RABV capsid formation (e.g. through electron microscopy, as done previously [Lingappa J-R, et al. (1994) A eukaryotic cytosolic chaperonin is associated with a high molecular weight intermediate in the assembly of hepatitis B virus capsid, a multimeric particle. J Cell Biol 125:99-111; Lingappa J-R, Hill R-L, Wong M-L, Hegde R-S (1997) A multistep, ATP-dependent pathway for assembly of human immunodeficiency virus capsids in a cell-free system. J Cell Biol 136:567-581; Klein K-C, Dellos S-R, Lingappa J-R (2005) Identification of residues in the hepatitis C virus core protein that are critical for capsid assembly in a cell-free system. J Virol. 79:6814-6826]), was not pursued in the present studies. Instead, we chose validation against infectious RABV of the compounds emerging from a RABV-specific screen of this pathway. That success provides emphatic support for the proposition that the protein-protein interactions disrupted by A are relevant for RABV, presumably because of the power of i) the new paradigm of host-catalyzed capsid formation, ii) heterologous cfps as an analytical tool, and iii) these highly unconventional and labile multiprotein complexes as druggable antiviral targets.
Two observations are particularly worthy of mention. First, the active anti-RABV compound chosen for focus, A, works extremely early in the capsid assembly process, specifically while the chain is still on the ribosome. A working hypothesis outlined at the outset was that interfering with early events in protein biogenesis might have a substantial yet substrate-selective impact on later events. This notion, while conventionally counterintuitive, is strongly supported by these studies: the consequence of this very early alteration in protein-protein interactions manifests much later in the viral lifecycle as substantial reduction in infectivity of RABV.
A second fundamental observation from this study is that the protein set bound to the compound resin conjugate and specifically eluted by free compound, comprises a labile and functional multiprotein complex (based on the reconstitution demonstrated in
It is provocative that ABCE1, a protein implicated in HIV capsid assembly, should also be identified as a component of the host protein complex implicated in RABV capsid assembly. We note that ABCE1 itself is extremely heterogeneous, and only a small portion of the total ABCE1 was observed to be depleted from the extract despite the use of excess drug resin. This suggests that only a small subset of the ABCE1 present is in a conformation or assembly configuration relevant for RABV capsid formation. Presumably the other forms do other things for the host and, by corollary, may be taken advantage of by other families of viruses. This suggests the intriguing possibility that the putative assembly machines are both numerous and highly heterogeneous or highly adaptable to the multiprotein complexes they are called upon to assemble, or both.
It is also notable that the compounds shown here have activity against other viral families. However, the SAR profile for different viruses on which any given compound is active is quite different. Thus, while A is also active against FLUV, a member of family Orthomyxoviridae, analog F, which is even more potent against infectious RABV (see
The evidence for transient and energy-dependent action of host proteins suggests a role for an enzymatic host-derived assembly machine. If its action were blocked or altered, one can envision two classes of phenotypes: the viral particle might either be sufficiently malformed or blocked in its formation so as to not be released at all, or might be released but with a sufficiently aberrant capsid structure as to be rendered non-infectious. Elsewhere we will demonstrate both of these phenotypes for compounds with similar targets in the case of members of another viral family, the Togaviridae (Kelley-Clarke et al submitted).
Finally, data from co-immunoprecipitation (
Loss of neurons by a degenerative process is a pathological feature of many human neurological disorders. Neuronal cell death is a normal part of tissue development and maintenance. Abnormal neuronal cell death, however, occurs as a result of a variety of conditions including, but not limited to, traumatic injury or trauma, ischemia, and neurodegenerative diseases such as Parkinson's disease, Huntington's disease, Alzheimer's disease, amyotrophic lateral sclerosis (ALS), and stroke.
The invention provides a method of identifying neuroprotective compounds, also referred to herein as agents, by screening a library of agents with an assay of the invention. Such libraries may comprise either collections of pure agents or collections of agent mixtures. Examples of pure agents include, but are not limited to, proteins, polypeptides, peptides, nucleic acids, oligonucleotides, carbohydrates, lipids, synthetic or semi-synthetic chemicals, and purified natural products. Examples of agent mixtures include, but are not limited to, extracts of prokaryotic or eukaryotic cells and tissues, as well as fermentation broths and cell or tissue culture supernates.
Psychiatric disorders are pathological conditions of the brain characterized by identifiable symptoms that result in abnormalities in cognition, emotion or mood, or the highest integrative aspects of behavior. These disorders may vary in severity of symptoms, duration, and functional impairment. Psychiatric disorders afflict millions of people worldwide resulting in tremendous human suffering and economic burden due to lost productivity.
The invention provides a method of identifying psychopharmacological compounds, also referred to herein as agents, by screening a library of agents with an assay of the invention. Such libraries may comprise either collections of pure agents or collections of agent mixtures. Examples of pure agents include, but are not limited to, proteins, polypeptides, peptides, nucleic acids, oligonucleotides, carbohydrates, lipids, synthetic or semi-synthetic chemicals, and purified natural products. Examples of agent mixtures include, but are not limited to, extracts of prokaryotic or eukaryotic cells and tissues, as well as fermentation broths and cell or tissue culture supernates.
Peroxiredoxin 2 western blot of Glycerol Gradient fractions were obtained as follows. Pig Brain post mitochondrial supernatant was loaded onto glycerol gradient (5-35%) and centrifuged in the TLS-55 rotor for 50 k RPM for 5 hrs, fractionated equally to eleven total fractions. Each fraction was individually loaded on either control resin (74) or PAV-866 resin (124). The fractions were incubated for 1 hr at 4° C. and then washed extensively with 100 X buffer and eluted with free PAV-866. Eluted fractions were loaded onto 15% SDS PAGE (including the total input fractions from Glycerol gradient, top panel) and probed with goat polyclonal Anti-Peroxiredoxin-2 antibody (R& D Systems, Minneapolis, Minn. 55413). The primary antibody bound to PRX-2 was detected by alkaline phosphatase conjugated secondary antibody with some chromogenic substrate.
Results of this experimental are provided in
Mouse Neuroblastoma (MNA) cells (a gift from the Centers for Disease Control [CDC]) were propagated in DMEM media with 10% serum. They were treated with either control media or MNA cells were treated with 25 μM tBHP (tert-Butyl Hydroperoxide) for 4 hrs just before harvest. They were washed with ice cold PBS, cell extract (post mitochondrial supernatant) was prepared and loaded onto control and PAV-866 drug resin, incubated at 4° C. for 1 hr, washed and eluted with free drug. Western blot with polyclonal rabbit ant-SAM Synthetase (from Sigma-Aldrich, St Louis, Mo. 63103). The primary antibody bound to SAM Synthetase was detected by alkaline phosphatase conjugated secondary antibody with chromogenic substrate.
Similarly PRX-2 antibody was used to detect PRX-2 binding. Goat polyclonal Anti-Peroxiredoxin-2 antibody (R& D Systems, Minneapolis, Minn. 55413) was used for this detection.
Results of this experimental are provided in
The foregoing descriptions of specific embodiments of the present invention have been presented for purposes of illustration and description. They are not intended to be exhaustive or to limit the invention to the precise forms disclosed, and obviously many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the Claims appended hereto and their equivalents.
This application is a continuation of U.S. patent application Ser. No. 13/950,232, filed Jul. 24, 2013, which claims the benefit of U.S. Provisional Pat. App. No. 61/822,273, filed May 10, 2013, U.S. Provisional Pat. App. No. 61/800,690, filed Mar. 15, 2013, U.S. Provisional Pat. App. No. 61/800,596, filed Mar. 15, 2013, U.S. Provisional Pat. App. No. 61/758,234, filed Jan. 29, 2013 and U.S. Provisional Pat. App. No. 61/675,300, filed Jul. 24, 2012, each of which is incorporated by reference in its entirety for all purposes.
Number | Date | Country | |
---|---|---|---|
61822273 | May 2013 | US | |
61800690 | Mar 2013 | US | |
61800596 | Mar 2013 | US | |
61758234 | Jan 2013 | US | |
61675300 | Jul 2012 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13950232 | Jul 2013 | US |
Child | 15429027 | US |