To initiate infection, viruses bind to one or more receptors on a target cell. The second step is entry. Many viruses are enveloped with a lipid membrane derived from the cell in which they were produced. Following attachment, these enveloped viruses fuse their membrane with a target cell membrane to allow the contents of the virion, including the viral genome, to enter the cell.
Paramyxoviruses are viruses of the Paramyxoviridae family of the Mononegavirales order. They are negative-sense single-stranded RNA viruses responsible for a number of human and animal diseases.
The Paramyxovirus family includes 2 subfamilies: (i) Paramyxovirus: including parainfluenza virus (PIV) 1-4, Newcastle disease virus (NDV), Nipah virus, measles virus and mumps virus; (ii) Pneumovirus: including human respiratory syncytial virus (RSV), bovine RSV and human metapneumovirus (hMPV). Parainfluenza viruses and RSV produce acute respiratory diseases of the upper and lower respiratory tracts, whereas measles and mumps viruses cause systemic disease.
RSV causes respiratory tract infections in patients of all ages. It is the major cause of lower respiratory tract infection during infancy and childhood. In temperate climates there is an annual epidemic during the winter months. In tropical climates, infection is most common during the rainy season. In the United States, 60% of infants are infected during their first RSV season, and nearly all children will have been infected with the virus by 2-3 years of age. Natural infection with RSV does not induce protective immunity, and thus people can be infected multiple times. Sometimes an infant can become symptomatically infected more than once even within a single RSV season. More recently, RSV infections have been found to be frequent among elderly patients as well. As the virus is ubiquitous, avoidance of infection is not possible. There is currently no vaccine or specific treatments against RSV. The failure in developing a vaccine has led to renewed interest in the pathogenesis of the disease.
There is a need, generally, for methods to identify antiviral agents that inhibit the activity of fusion proteins, or reduce the infectivity of paramyxoviruses, such as RSV (human and bovine), hMPV, PIV1 and 3 and NDV.
Provided herein is a pre-triggered soluble fusion (F) protein of a virus in the paramyxovirus family, wherein the soluble fusion protein lacks a transmembrane domain and a cytoplasmic tail domain and includes a CRAC1 domain. The soluble fusion protein is in a pre-triggered conformation and can be triggered when exposed to a triggering event.
Also provided is a functional fragment of an RSV soluble fusion protein, comprising a first and a second peptide linked to form a dimer peptide. The first and second peptides include, respectively, a sequence that is at least 90% identical to amino acids 37-69 and 156-440 of SEQ ID NO: 1, and the second peptide includes a CRAC1 domain.
Also contemplated are methods of screening for a candidate paramyxovirus antiviral agent, including the steps of: (i) contacting a test agent with a soluble F protein of a paramyxovirus described above and (ii) detecting a structural indicator of the soluble pre-triggered F protein. A change in the structural indicator of the soluble pre-triggered F protein in the presence of the test agent as compared to the absence of the test agent indicates that the agent is a candidate antiviral agent against the paramyxovirus.
Another method contemplated herein is a method of screening for a candidate paramyxovirus antiviral agent that includes the steps of: (i) contacting a test agent with a soluble F protein of the paramyxovirus, described above, to form a test sF protein; (ii) exposing the test sF protein to a triggering event; and (iii) assessing a structural indicator of the test sF protein before and after exposure to the triggering event. In this method, an absence of a change in the structural indicator of the test sF protein after exposure to the triggering event indicates that the agent is a candidate antiviral agent against the paramyxovirus.
Also provided is a method of screening for a candidate antiviral agent against RSV, including the steps of: (i) contacting a test agent with a functional fragment of a soluble pre-triggered F protein of RSV, described above; and (ii) detecting a structural indicator of the functional fragment. A change in the structural indicator of the functional fragment in the presence of the test agent as compared to the absence of the test agent indicates that the agent is a candidate antiviral agent against RSV.
Also included is a method of screening for a candidate antiviral agent against RSV, comprising the steps of: (i) contacting a test agent with a functional fragment of a soluble pre-triggered F protein of RSV, as described above, to form a test sF protein; (ii) exposing the test sF protein to a triggering event; and (iii) assessing a structural indicator of the test sF protein before and after exposure to the triggering event. In this method, the absence of a change in the structural indicator of the test sF protein after exposure to the triggering event indicates that the agent is a candidate antiviral agent against RSV.
a shows effects of single amino acid changes within the RSV F protein CRAC1 domain on cell-cell fusion. Human embryonic kidney 293T cells were co-transfected with pcDNA3.1 plasmids (Invitrogen) expressing the RSV F protein and the green fluorescent protein (A) Optimized wild-type strain A, D46 RSV F protein was express from plasmid MP340. (B-L) Single point mutations in MP340 that changed the individual amino acids as indicated were also expressed. Cells were photographed at 48 hours post-transfection.
b shows the effects of both central tryosines were changed of the CRAC3 domain to alanine Cells infected with wild-type D46 F protein (A) was compared to a CRAC3 mutant (B). In this experiment, pictures were taken 23 hours after transfection.
Unless otherwise defined herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. As used in the description, the singular forms “a,” “an,” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety.
Unless otherwise indicated, all numbers expressing quantities of ingredients, reaction conditions, and so forth used in the specification are to be understood as being modified in all instances by the term “about.” Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification are approximations that may vary depending upon the desired properties sought to be obtained by the present disclosure. At the very least, and not as an attempt to limit the application of the doctrine of equivalents, each numerical parameter should be construed in light of the number of significant digits and ordinary rounding approaches.
Notwithstanding that the numerical ranges and parameters setting forth the broad scope of the disclosure are approximations, the numerical values set forth in the specific examples are reported as precisely as possible. Any numerical value, however, inherently contains certain errors necessarily resulting from the standard deviation found in their respective testing measurements. Every numerical range given throughout this disclosure will include every narrower numerical range that falls within such broader numerical range, as if such narrower numerical ranges were all expressly written herein.
Provided herein are compositions and screening methods for identifying candidate antiviral agents. In particular, disclosed herein is a pre-triggered, fusion (F) protein of a paramyxovirus, or functional fragments thereof, which contain one or more cholesterol binding motifs in a location that is away from the transmembrane domain, referred to herein as CRAC (Cholesterol Recognition/interaction Amino acid Consensus) domains. Also provided is a computer model of the structure of the pre-triggered F protein. Compositions that directly or indirectly bind and interfere with the normal activity or binding of the pre-triggered F proteins, or the CRAC domains, are useful as antiviral agents in the treatment of paramyxovirus infections. Thus, disclosed herein are methods of screening for antiviral agents, using the pre-triggered F protein, or fragments thereof.
Paramyxovirus Fusion Mechanism:
To accomplish attachment and fusion, members of the Paramyxoviridae family express two glycoproteins, one to attach to the target cell (the attachment protein) and one to fuse the virion membrane with the target cell membrane (the fusion protein).
In all of these paramyxoviruses, the fusion (F) protein is a trimer composed of three copies of the F protein monomer. As the F trimer passes through the Golgi on its way to the cell surface it is cleaved by a protease to generate F2, the small N-terminal fragment, and F1, the large transmembrane fragment (
In most paramyxoviruses, the conventional wisdom is that the viral attachment protein binds to its receptor, then nudges the F protein in some way that results in F protein triggering. However, RSV is unique in that its F protein is able to fuse membranes without the aid of an attachment protein, suggesting that the RSV F protein expresses both attachment and fusion activities. The RSV attachment glycoprotein (G) does enhance this process by binding the virion to target cells more efficiently, but otherwise seems to play no role in fusion.
The RSV fusion protein precursor, F0, is cleaved twice, releasing a 27 amino acid peptide “pep27” and the F1 and F2 proteins, which are covalently linked by two disulfide bonds (
An appreciation of the movements involved in assembling the HR1 α-helix are recent and have come from the crystal structures of other paramyxoviruses. The steps in fusion initiation are shown in cartoon form in
We have computer modeled the pre- and post-triggered structures of the RSV F protein, and used these models to suggest a candidate triggering domain. (EXAMPLE 1) (
The model for the pre and post-triggered form of the RSV F protein is presented in
The CRAC Domains
We have discovered that a cholesterol-binding protein motif (CRAC; Cholesterol Recognition/interaction Amino acid Consensus) is present near the tip of the pre-triggered F protein in a potential triggering domain (
As used herein, a CRAC “motif” refers to the sequences V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID No. 40) or V/L/I-X1-5-Y/F/W-X1-5-D/E (SEQ ID NO: 41). A CRAC “domain” refers to a CRAC motif that is present in a position away from the virion membrane. “CRAC1 domain” refers to a CRAC motif present in the HR1 region of the F protein in a location N-terminal to the first cysteine that links the F1 to the F2 region. “CRAC3 domain” refers to a CRAC motif present in the F1 fragment, N-terminal to HR2.
Without wishing to be bound by theory, we believe that the CRAC domain(s) on the F protein interact with cholesterol in the target cell membrane and that this interaction causes triggering of the F protein, resulting in fusion.
The CRAC1 domain: In the three-dimensional structure of the pre-triggered F protein, the CRAC1 domain is a short α-helix, designated α-helix 2 in
The CRAC1 helix is highlighted in ball-and-stick form in both pre- and post-triggered form in
Without wishing to be bound by theory, one of our hypotheses is that when this CRAC1 domain approaches a membrane, it is attracted by the cholesterol in the membrane, and is pulled into the membrane, initiating F protein triggering. The action of pulling the CRAC1 domain upward and onto the target cell membrane would: 1) straighten the region between the CRAC helix (2 in
The result would be assembly of the complete long, HR1 α-helix in the post-triggered form (
Our alternative hypothesis is that cholesterol is pre-loaded in the F protein trimer crown. A trimer could pick up cholesterol molecule(s) during monomer or trimer formation, or as the trimer is transported from the endoplasmic reticulum through the Golgi to the cell surface. If cholesterol is stored in the F protein crown, as the crown approaches a target cell membrane the hydrophobic forces in the target cell membrane would pull the cholesterol molecules out of the crown and into the membrane, dragging the CRAC domains with them. This movement would initiate formation of the long α-helix (6) and insertion of the attached fusion peptide into the target cell membrane, as described above.
If cholesterol is pre-loaded in the F trimer, it would only be energetically favorable if it were held in the crown with its only hydrophilic portion, its hydroxyl group, facing the solvent (upward). The orientation of cholesterol when it is associated with a CRAC domain is known. The position of the CRAC1 domain in the crown would indeed hold cholesterol with its hydroxyl group facing upward. To the best of our knowledge, the presence of a non-membrane associated lipid within a viral protein, as suggested here, and the function of the lipid in activating a fusion protein, as discussed herein, has not been previously reported.
The CRAC1 domain is conserved among several paramyxoviruses. It is found in all pneumovirus subfamily members, including human RSV, bovine RSV, and human metapneumovirus (
The CRAC3 domain: The post-triggered form of the F protein contains the signature 6-helix bundle (
The forces that bring the 6-helix bundle together are completely unknown. Formation of the long HR1 helix more than doubles the length of the pre-triggered F protein, making it much too long to fit between the virion and the cell (
There is no obvious “motor” in the head of the trimer (top of the post-triggered form,
We have identified several other CRAC domains, including CRAC3 in the head region of the F protein (light gray balls (10) in
We believe that the CRAC1 domain is a membrane contact point for the HR1 helix, enabling it to bind to the target cell membrane at a second point, the first being the fusion peptide anchored in the target cell membrane. Since the HR1 helix is rigid and long, two contact points, one at the end and one near the middle would keep this half of the protein parallel with the target cell membrane, preventing the virion from moving further from the cell. If both halves of the F protein are forced to lie parallel to the membranes into which they are inserted, the two membranes would be forced together, allowing contact between the helices and formation of the 6-helix bundle. While this hypothesis may require Brownian motion, it adds direction from the F protein in the form of additional contacts with each of the membranes that should enable the 6-helix bundle to line up and lock in much more rapidly. If both sides of the molecule are attached to the membranes at two (or more) points, the membrane curvature would be very sharp where the transmembrane and fusion peptides are brought together (at the ends of the red and blue helices, respectively) enhancing the likelihood of initiating fusion pore formation and subsequent membrane fusion.
Consistent with the hypothesis that these CRAC domains interact with the cell or virion membrane, we showed that both the CRAC1 and CRAC3 domains face outward, making such interactions possible.
We have also found that the CRAC3 domain of one F protein monomer cradles the fusion peptide of the next monomer in the pre-triggered trimer form. Cholesterol might be included in this complex. Whether or not it is, a compound that is capable of binding to the CRAC3 domain will displace the fusion peptide and cause the F protein to trigger pre-maturely. A compound that is capable of binding to the CRAC3 domain would also prevent the CRAC3 domain from forming the second contact to guide the HR2 a-helix to the HR1 trimer of helices and would prevent fusion in that way.
Other CRAC domains: As can be seen from
For all the reasons stated above, a compound that blocks the activity or binding of CRAC1 or CRAC3 domains to the virion membrane would reduce the efficiency of fusion, thereby reducing infection. Similarly, a compound that blocks the interaction of other F protein CRAC domains would reduce the efficiency of bringing HR1 and HR2 together, the final step in fusion initiation, thereby reducing infection.
Since cholesterol is a natural ligand for the CRAC domain, the antiviral compound can be a cholesterol mimic and/or a cholesterol precursor or derivative.
Contemplated herein is an isolated soluble fusion (sF) protein of a member of the paramyxovirus family in its pre-triggered form. The isolated sF protein includes a portion of a fusion protein that contains at least one CRAC1 domain having the sequence V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID NO: 40) or V/L/I-X1-5-Y/F/W-X1-5-D/E (SEQ ID NO: 41).
Members of the paramyxovirus family whose F protein's include a CRAC1 domain include: RSV (human and bovine), human metapneumovirus (hMPV), para-influenza virus 1 (PIV1), PIV3, and Newcastle disease virus (NDV).
A “soluble” F protein, as used herein, refers to a truncated fusion protein that is not membrane-bound, i.e. the F protein is released form the cell into media. Thus, the soluble F protein lacks the transmembrane (TM) and cytoplasmic tail (CT) domains. In some embodiments, the pre-triggered sF protein also lacks the pep27 region.
A “soluble F protein of a member of the paramyxovirus family that includes a CRAC1 domain” refers to any soluble fusion protein that includes a CRAC1 domain, and whose sequence is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the sequence of a truncated F protein of: human RSV, bovine RSV, hMPV, PIV1, PIV3 and NDV.
In one embodiment, the CRAC domain has the sequence VLDLKNYIDK, SEQ ID NO: 20. In another embodiment, the CRAC domain has the sequence VLDLKNYIDR, SEQ ID NO: 42. In another embodiment, the CRAC domain has the sequence VLDIKNYIDK, SEQ ID NO: 43. In another embodiment, the CRAC domain has the sequence ILDLKNYIDK, SEQ ID NO: 44. In another embodiment, the CRAC domain has the sequence VLDLKNYINNR, SEQ ID NO: 45. In another embodiment, the CRAC domain has the sequence VRELKDFVSK, SEQ ID NO: 46. In another embodiment, the CRAC domain has the sequence LKTLQDFVNDEIR, SEQ ID NO: 47. In another embodiment, the CRAC domain has the sequence VQDYVNK, SEQ ID NO: 48. In another embodiment, the CRAC domain has the sequence VNDQFNK, SEQ ID NO: 49.
SEQ ID NO: 1 represents the full length amino acid sequence of the A2 strain RSV F protein (
In one embodiment, each monomer of the sF protein trimer includes an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to: amino acid 27-109 and 137-522 of SEQ ID NO: 1. In another embodiment, each monomer of the sF protein trimer includes an amino acid sequence that is at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to amino acid 27-522 of SEQ ID NO: 1. Amino acids 523 and 524 of SEQ ID NO: 1 may be deleted or changed to other amino acids. Therefore, in another embodiment, the sF protein comprises amino acid 27-524 of SEQ ID NO: 1.
The signal peptide (amino acids 1-25 in SEQ ID NO: 1) is used to start the translocation of the protein across the ER membrane during synthesis. In some embodiments, the constructs that are used to prepare a pre-triggered sF protein also include a sequence encoding a signal peptide. In one example, the signal peptide encoded by the construct comprises amino acids 1-25 in SEQ ID NO: 1. In other examples, the signal peptide encoding sequence may be exchanged for other signal peptide encoding sequences that are capable of starting the in vivo translocation of the protein across the ER membrane during synthesis. Examples of other suitable signal peptides include, but are not limited to, the signal peptide of another polypeptide naturally expressed by the expression host cell, the Campath leader sequence (Page, M. J. et al., BioTechnology 9:64-68 (1991)), the signal peptide and the pre-pro region of the alkaline extracellular protease (AEP) (Nicaud et al. 1989. J Biotechnol. 12: 285-298), secretion signal of the extracellular lipase encoded by the LIP2 gene (Pignede et al., 2000 Appi Environ. Microbiol. 66: 3283-3289.), the 22 amino acid signal peptide of the endoglucanase I coding sequence from T. reesei (Park, C. S., (1997). J. Biol. Chem. 272: 6876-6881), the rice ct-amylase signal peptide (Chen et al., 2004 Plant Physiol. 135: 1363-1377), the signal peptide for pre-proinsulin, immunoglobulin kappa chain, or any type I glycoprotein or protein that is normally secreted from mammalian cells. A type I glycoprotein is a protein that has its N terminus outside the cell plasma membrane and its C terminus inside.
In some embodiments, the sF protein is also fused to a detection tag that is useful for identification or purification. Examples of commonly used detection tags include, but are not limited to, a maltose-binding protein (MBP), glutathione S-transferase (GST), tandem affinity purification (TAP) tag, calcium modulating protein (calmodulin) tag, covalent yet dissociable (CYD) NorpD peptide, Strep II, FLAG tag, heavy chain of protein C(HPC) peptide tag, green fluorescent protein (GFP), metal affinity tag (MAT), HA (hemagglutinin) tag, 6HIS tag (SEQ ID NO: 21), myc tag, and/or herpes simplex virus (HSV) tag. In some embodiments, the tag is a FLAG tag or a 6HIS tag (SEQ ID NO: 21). In one embodiment, the protein comprised both a FLAG tag and a 6HIS tag (SEQ ID NO: 21). In some embodiments, the polypeptide further comprises a cleavage domain to facilitate the removal of the tag from the polypeptide, for example, after isolation of the protein. In some embodiments, the tag is fused to the C terminus of the sF protein. The tag or tags can also be placed at the N terminus of the F2 protein, C terminal to the signal peptide. For example, we have placed a 6HIS tag (SEQ ID NO: 21) in this position and rescued fully functional RSV from cDNA that contains this tag on the F protein, indicating that the tag did not negatively impact production or function of the F protein. The tag or tags can also be placed in other positions in the protein as additional or replacement amino acids, generally in external loops of the protein where the amino acids comprising the tag would not affect protein folding or function.
In some embodiments, the sF protein contains a C terminal “clamp” to hold the C terminus of the protein in position. The clamp holds the C termini of the three monomers in the molecule together, preventing them from separating or moving upward and triggering the molecule. In one example, the C terminal clamp is a trimerization domain, such as GCNt. The sF protein with the GCNt clamp that we produced, sMP340-A, is secreted efficiently from transfected cells but it is not recognized efficiently by MAbs against the F protein, may be partially aggregated, and is not triggered by treatment at 50 C for one hour. Minor modifications to this construct, however, will likely result in a pre-triggered sF protein. Those modifications include removal of the glycine that we had inserted between the sF protein C terminus and the GCNt clamp to add flexibility, removal of residues or insertion of residues such as alanine, that will not disturb the helical nature of this region but which can bring the HR2 helix and the GCNt helix into phase with each other. In another example, the clamp contains a trimerization domain comprising two cysteines that will covalently link the three monomers. In this example, two amino acids at or near the C terminus of the HR2 helix in each soluble F protein monomer are replaced with two cysteines. The cysteines are either consecutive or have one or more amino acids separating them. The 6 cysteines in the trimer will form 3 disulfide bonds, linking the C termini of the three monomers.
For example, the sF protein stabilized at its C terminus by either the addition of a GCNt clamp or cysteines are useful tools for assessing the first step of triggering, i.e., unfolding of the HR1 domain, without the second step of forming the 6-helix bundle. Because the HR2 helices are linked in this protein, they will not be able to fit into the grooves provided by the HR1 trimer to produce the 6-helix bundle. On the other hand, the sF protein without the cysteines will be able to perform both unfolding of the HR1 domain and formation of the 6-helix bundle because its C terminus is not cross-linked to the other monomers in the trimer. So, the clamp or the Cys linkage would probably stabilize the sF protein making it easier to store and to use since more of it would remain in the pre-triggered form. For example, SC-2 begins to decay as soon as it is made, with a t½ of about 3 weeks.
In addition, several strategies are available to produce and maintain and/or stabilize the isolated sF protein or its fragments in the pre-triggered state, i.e. to prevent the triggering of the protein during synthesis and storage. These strategies include: using freshly prepared sF protein in the assays described below; storing the sF protein at 4° C. under which conditions the pre-triggered sF protein slowly triggers, with a half-life of approximately 3 weeks; snap freezing the isolated sF protein on dry ice or liquid nitrogen; and thawing at 37° C. To maintain the sF protein in its pre-triggered form, it is desirable to avoid harsh treatments or treatments which allow triggering to occur. For example, freezing the protein slowly by placing it in a −20° C. freezer or maintaining it at 37° C. or higher for any appreciable amount of time may allow the protein to trigger. In another example, extremes of pH, such as the low pH needed to remove an isolated sF protein from an antibody affinity column should likely be avoided. As described above, the sF protein may also be physically stabilized by adding a GCNt segment to clamp the C terminus, or by adding cysteines that will cross-link the trimer C termini.
Any isolated sF protein that has less than 100% identity with the reference amino acid sequence of the F protein (e.g. SEQ ID NO: 1) is a variant protein. A variant protein has an altered sequence in which one or more of the amino acids in the reference sequence, other than the amino acids that constitute the CRAC domains, is deleted or substituted, or one or more amino acids are inserted into the sequence of the reference amino acid sequence (as described above). A variant can have any combination of deletions, substitutions, or insertions.
With regard to amino acid substitutions, a variety of amino acid substitutions can be made. As used herein, amino acids generally can be grouped as follows: (1) amino acids with non-polar or hydrophobic side groups (A, V, L, I, P, F, W, and M); (2) amino acids with uncharged polar side groups (G, S, T, C, Y, N, and Q); (3) polar acidic amino acids, negatively charged at pH 6.0-7.0 (D and E); and (4) polar basic amino acids, positively charged at pH 6.0-7.0 (K, R, and H). Generally, “conservative” substitutions, i.e., those in which an amino acid from one group is replaced with an amino acid from the same group, can be made without an expectation of impact on activity. Further, some non-conservative substitutions may also be made without affecting activity. Those of ordinary skill in the art will understand what substitutions can be made without impacting activity.
It should be noted that proteins disclosed herein may also comprise amino acids linked to either end, or both. These additional sequences may facilitate expression, purification, identification, solubility, membrane transport, stability, activity, localization, toxicity, and/or specificity of the resulting polypeptide, or may be added for some other reason. The proteins disclosed herein may be linked directly or via a spacer sequence. The spacer sequence may or may not comprise a protease recognition site to allow for the removal of amino acids.
It should be further noted that proteins disclosed herein may also comprise non-amino acid tags linked anywhere along the protein. These additional non-amino acid tags may facilitate expression, purification, identification, solubility, membrane transport, stability, activity, localization, toxicity, and/or specificity of the resulting polypeptide, or it may be added for some other reason. The proteins disclosed herein may be linked directly or via a spacer to the non-amino acid tag. Examples of non-amino acid tags include, but are not limited to, biotin, carbohydrate moieties, lipid moieties, fluorescence groups, and/or quenching groups. The proteins disclosed herein may or may not require chemical, biological, or some other type of modification in order to facilitate linkage to additional groups.
Also provided herein are functional fragments of the isolated sF protein. The terms “fragment” and “functional fragment” are used interchangeably and refer to an isolated peptide that is a truncated from of the pre-triggered soluble F protein and that can successfully function in any of the screening tests described below. The functional fragments comprise some or most of the amino acid sequence of the pre-triggered sF protein, and include a CRAC1 domain. Several regions of the sF protein may be deleted or modified to form a functional fragment.
In one embodiment, the CRAC domain has the sequence VLDLKNYIDK, SEQ ID NO: 20. In another embodiment, the CRAC domain has the sequence VLDLKNYIDR, SEQ ID NO: 42. In another embodiment, the CRAC domain has the sequence VLDIKNYIDK, SEQ ID NO: 43. In another embodiment, the CRAC domain has the sequence ILDLKNYIDK, SEQ ID NO: 44. In another embodiment, the CRAC domain has the sequence VLDLKNYINNR, SEQ ID NO: 45. In another embodiment, the CRAC domain has the sequence VRELKDFVSK, SEQ ID NO: 46. In another embodiment, the CRAC domain has the sequence LKTLQDFVNDEIR, SEQ ID NO: 47. In another embodiment, the CRAC domain has the sequence VQDYVNK, SEQ ID NO: 48. In another embodiment, the CRAC domain has the sequence VNDQFNK, SEQ ID NO: 49.
In one embodiment, the functional fragment is a fragment of RSV F protein. In some embodiments of the RSV functional fragment, all or some of the amino acids N terminal to Cys37 are deleted or replaced.
In other embodiments, all or a portion of the amino acid sequence between and including Asn70 and S155 is removed or replaced. In some other embodiments, all or a portion of the fusion peptide (a.a. 137-155) is removed. In yet other embodiments, all or a portion of the amino acid sequence from Asn70 and R136 is removed or replaced. In some embodiments, pep27 (a.a. 110-136) is removed or replaced with alanines and glycines without destroying the function of the F protein.
In some embodiments, part, or all, of the HR2 region is removed. In some embodiments, the C terminus is truncated, up to and including D440. In some embodiments, a tryptophan or phenylalanine replaces the tyrosine Y198, an arginine replaces R201, an isoleucine, leucine or valine replaces V192, L193, or L195.
In some embodiments, cysteines C37, C69, C212 and C439 link the F1 and F2 fragments together. In other embodiments, these cysteines are replaced by amino acids that interact in a non-covalent manner to hold the F1 and F2 fragments together. Although no residues substitute for cysteines in terms of creating covalent cross-linked bonds, there are many hydrogen-bonding/salt-bridge networks, and hydrophobic-packing networks that can functionally substitute for the stability provided by cysteine residue disulfide bonds. For instance, cysteine residues can coordinate Zinc, rather than link covalently, as in the lid domain of adenylate kinase. The structure of the adenylate kinase lid domain is stabilized by either 4 cysteine residues which coordinate a zinc ion rather than covalently link through disulfide bonds, or by a variable set of 6 residues that engage in salt-bridges, polar interactions, and hydrogen bonding. These 4 cysteine residues can be replaced by several combinations of charged/polar residues at these 6 partially overlapping positions on the structure. Another example would be a leucine zipper that is used in many proteins as a mechanism to dimerize. Another example is found where there is a valine-alanine interaction that substitutes for a disulfide bonded cysteine pair, e.g. in the PIV5 structure (387-410 in the 2B9B PDB structure).
In one embodiment, the fragment is a “dimer peptide” comprising two peptides, each of which comprise, respectively, an amino acid sequence that is at least 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to amino acids 37-69 (F2 fragment) and 156-440 (F1 fragment, including the CRAC1 domain) of SEQ ID NO: 1, linked together.
In other embodiments, any number of amino acids can be added to either end of the dimer peptide. In some embodiments, the additional one or more amino acids that are added to the “dimer peptide” are identical to, or are conservative substitutions for, the amino acids found between amino acids 26-36, 70-155 and/or 441-522 of SEQ ID NO: 1.
Different fragments may be used in different screening methods, as described below.
Method of producing the pre-triggered, soluble F protein and fragments thereof.
Also provided herein are methods of producing the isolated pre-triggered, soluble (s) F protein of paramyxoviruses. In general, any suitable method known in the art for the production of glycoproteins can be used for the purpose of producing the pre-triggered sF protein and fragments thereof.
In some embodiments, the method comprises using a nucleic acid molecule (e.g. RNA) encoding the truncated F protein in a cell-free translation system to prepare the soluble F protein, or functional fragments thereof. Alternatively, a nucleic acid molecule (e.g. DNA) encoding the truncated F protein, or functional fragments thereof, is introduced into an expression vector and used to transform cells. In the expression vector, the sequence which encodes the truncated F protein is operatively linked to an expression control sequence, i.e., a promoter, which directs mRNA synthesis.
Suitable expression vectors include for example chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40, bacterial plasmids, phage DNAs; yeast plasmids, vectors derived from combinations of plasmids and phage DNAs, viral DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. The DNA sequence is introduced into the expression vector by conventional procedures.
Sequences of the paramyxovirus F proteins are publicly available. For example, in some embodiments, the F protein has the sequence SEQ ID NO: 1. Other examples of RSV F protein sequences are presented in Table 1.
Promoters vary in their “strength” (i.e. their ability to promote transcription). For the purposes of expressing a cloned gene, it is desirable to use strong promoters in order to obtain a high level of transcription and, hence, expression of the gene. Depending upon the host cell system utilized, any one of a number of suitable promoters may be used. For instance, when cloning in E. coli, its bacteriophages, or plasmids, promoters such as the T7 phage promoter, lac promoter, trp promoter, recA promoter, ribosomal RNA promoter, the PR and PL promoters of coliphage lambda and others, including but not limited, to lacUV5, ompF, bla, lpp, and the like, may be used to direct high levels of transcription of adjacent DNA segments. Additionally, a hybrid trp-lacUV5 (tac) promoter or other E. coli promoters produced by recombinant DNA or other synthetic DNA techniques may be used to provide for transcription of the inserted gene. Examples of constitutive promoters for use in mammalian cells include the RSV promoter derived from Rous sarcoma virus, the CMV promoter derived from cytomegalovirus, β-actin and other actin promoters, and the EF1α promoter derived from the cellular elongation factor 1α gene. Other examples of some constitutive promoters that are widely used for inducing expression of transgenes include the nopoline synthase (NOS) gene promoter, from those derived from any of the several actin genes, which are known to be expressed in most cells types, and the ubiquitin promoter, which is a gene product known to accumulate in many cell types. Other promoters include the SV40 promoter, or the or murine leukemia virus long terminal repeat (LTR) promoters.
Examples of host cells include a variety of eukaryotic cells. Suitable mammalian cells for use in the present invention include, but are not limited to Chinese hamster ovary (CHO) cells, Vero (African kidney), baby hamster kidney (BHK) cells, human HeLa cells, A549 (human type II pneumocyte), HEp-2 (human neck epithelial) cells, monkey COS-1 cell, human embryonic kidney 293T cells, mouse myeloma NSO and human HKB cells. Other suitable host cells include insect cell lines, including for example, Spodoptera frugiperda cells (Sf9, Sf21), Trichoplusia ni cells, and Drosophila Schneider Line 1 (SL1) cells.
In another embodiment, the method of production includes the same steps but in a cell line capable of high density growth without serum. Examples include, but are not limited to mammalian cells including HKB11 (a hybrid cell line from human embryonic kidney 293 and a human B cell line), CHO (Chinese hamster ovary cells, NS0 (mouse myeloma), and SP2/0 Ag14 (mouse myeloma).
Alternative methods include using insect or yeast cells infected by a viral vector to deliver and express the sF gene. Examples of viral vectors include, but are not limited to: Sindbis virus, adenovirus or vaccinia virus in mammalian cells, or baculovirus in insect, or mammalian, cells.
In some embodiment, the RSV sF protein gene sequence is derived by reverse transcription as cDNA and inserted into a plasmid behind a promoter such as the bacteriophage T7, SP6 or other similar promoter. The plasmid is transfected into cells along with a plasmid expressing the corresponding T7, SP6 or other polymerase, or a viral vector producing this polymerase. In these systems, the sF protein will be expressed in the cytoplasm of a cell, resulting in sF protein production and secretion.
The cDNA sequence derived from the RSV genome or mRNA cannot be inserted into a plasmid and expressed from the nucleus. Since RSV replicates in the cytoplasm, its mRNA is not exposed to the nuclear splicing and polyadenylation machinery. The RSV F protein contains 4 nuclear polyadenylation sites (Ternette, et al. 2007. Vaccine. 2007 25(41):7271-9).
In other embodiments, the sF gene sequence (e.g. in a plasmid) can be designed with optimized mammalian codons to remove cryptic splice sites and cryptic polyadenylation sites. Optimization also enhances translation by choosing codons that are used most frequently in the host cell being used. This type of “optimized” gene sequence can be expressed in the nucleus of the host cell. We have also optimized the F gene from the RSV Long strain, enabling us to produce the Long strain F protein from a plasmid in the nucleus. Many other examples of optimized genes can be found in the literature, including the first description of the human immunodeficiency virus gp160 gene (Haas et al. 1996 Curr. Bio 6:315-24). Such optimized genes can also be obtained commercially, where a company can synthesize genes for a fee, optimizing them as described to avoid cryptic splice sites and cryptic polyadenylation sites.
In one embodiment, the optimized F gene sequence is at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to the sequence in
Using the computational structure of the F protein to design and/or screen potential anti-viral agents
Contemplated herein are methods of identifying a potential paramyxovirus antiviral agent that can bind a CRAC domain of a viral fusion (F) protein, including the step of using a three-dimensional structural representation as defined by the coordinates in Table 4 of a any one of the soluble or full-length pre- or post-triggered RSV F-protein, or a fragment thereof, which contains a cholesterol-binding CRAC pocket to computationally screen candidate compounds for an ability to bind the CRAC pocket.
This disclosure also contemplates a method of selecting a potential paramyxovirus antiviral agent, comprising the steps of providing a computer-generated model of the three-dimensional structure of any one of the soluble or full-length pre- or post-triggered RSV F-protein as defined by the atomic coordinates of RSV F-protein according to Table 4 and selecting chemical structures capable of associating with a CRAC domain having the sequence V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID NO: 40) in any one of the soluble or full-length pre- or post-triggered RSV F-protein computer-generated models.
Also contemplated herein is a method for selecting a paramyxovirus antiviral agent comprising generating a three-dimensional model of any one of the soluble or full-length pre- or post-triggered RSV F-protein as defined by the atomic coordinates of RSV F-protein according to Table 4 based at least in part on a predetermined sequence, selecting a CRAC domain defined by the atomic coordinates of RSV F-protein according to Table 4 for receiving the agent, and selecting at least one chemical structure compatible with the CRAC domain to define the agent. In some embodiments, the predetermined sequence is V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID NO: 40).
Also contemplated herein is a method comprising selecting a CRAC domain in a three-dimensional model of any one of the soluble or full-length pre- or post-triggered RSV F-protein as defined by the atomic coordinates of RSV F-protein according to Table 4 for receiving a paramyxovirus antiviral agent, and selecting at least one chemical structure compatible with the CRAC domain to define the agent. In some embodiments, the three-dimensional model of the protein is based at least in part on a predetermined sequence. In some embodiments, the predetermined sequence is V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID NO: 40).
Another embodiment contemplated herein is a method for assembling a potential paramyxovirus antiviral agent, comprising the steps of providing a computer-generated model of the three-dimensional structure of any one of the soluble or full-length pre- or post-triggered RSV F-protein as defined by the atomic coordinates of RSV F-protein according to Table 4, identifying a portion of at least one chemical structure, wherein the portion is capable of associating with a CRAC domain of any one of the soluble or full-length pre- or post-triggered RSV F-protein having the sequence V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID NO: 40), and assembling the identified portions into a single molecule to provide the chemical structure of the potential paramyxovirus antiviral agent.
Another embodiment contemplated herein is a method for assembling a paramyxovirus antiviral agent comprising generating a three-dimensional model of any one of the soluble or full-length pre- or post-triggered RSV F-protein as defined by the atomic coordinates of RSV F-protein according to Table 4 based at least in part on a predetermined sequence, selecting a CRAC domain defined by the atomic coordinates in Table 4 for receiving the agent and identifying at least a portion of at least one chemical structure compatible with the CRAC domain and assembling portions of chemical structures identified above into a molecule defining a chemical structure for the agent.
Also contemplated herein is a method for selecting a paramyxovirus antiviral agent comprising processing three-dimensional coordinates of a CRAC domain of a three-dimensional model of any one of the soluble or full-length pre- or post-triggered RSV F-protein to generate a criteria data set, comparing the criteria data set to one or more chemical structures of potential agents, and selecting the chemical structure from the comparing above that binds to the criteria data set to define the agent.
Another embodiment contemplated herein is a method for selecting a paramyxovirus antiviral agent comprising processing three-dimensional coordinates of a CRAC domain of a three-dimensional model of any one of the soluble or full-length pre- or post-triggered RSV F-protein to generate a criteria data set, comparing the criteria data set to at least one portion of one or more chemical structures of potential agents; and selecting at least one or more portions of chemical structures from the comparing above that bind to the criteria data set to define the agent.
Also contemplated herein are methods of identifying a compound that can bind a CRAC domain of a viral fusion (F) protein, comprising the step of using a three-dimensional structural representation of a pre-triggered soluble F protein, or a fragment thereof, which contains a cholesterol binding CRAC pocket to computationally design a synthesizable candidate compound that binds the CRAC pocket.
The computational design can include the steps of: identifying chemical entities or fragments capable of associating with the CRAC binding site; and assembling the chemical entities or fragments into a single molecule to provide the structure of the candidate compound. Also contemplated are methods of synthesizing any such candidate compound, and screening the candidate compound for F protein binding activity. Examples of such compounds include cholesterol derivatives or mimics. Cholesterol mimics include molecules that have similar contact points as cholesterol, but may be very different structurally.
Another example of such compounds includes compounds that are capable of displacing a preloaded cholesterol molecule in a CRAC pocket, causing the F protein to trigger prematurely.
In one example, the CRAC domain may comprise three CRAC1 motifs located in a pit at the top of the F protein trimer crown. Each CRAC1 motif has the sequence V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID NO: 40), or V/L/I-X1-5-Y/F/W-X1-5-D/E (SEQ ID NO: 41).
In one example, the CRAC containing virus is a paramyxovirus. In another example, the virus belongs to the pneumovirus subfamily virus. In yet another example, the virus is human RSV.
The three-dimensional structure model of a CRAC containing protein and a potential ligand may be examined through the use of computer modeling using a docking program such as FLEX X, DOCK, or AUTODOCK (see, Dunbrack et al., Folding & Design, 2:R27-42 (1997); incorporated by reference herein), to identify potential ligands and/or inhibitors. This procedure can include computer fitting of potential ligands to the ligand binding site to ascertain how well the shape and the chemical structure of the potential ligand will complement the binding site. [Bugg et al., Scientific American, December:92-98 (1993); West et al., TIBS, 16:67-74 (1995); incorporated by reference herein]. Computer programs can also be employed to estimate the attraction, repulsion, and steric hindrance of the two binding partners (i.e., the ligand-binding site and the potential ligand). Generally the tighter the fit, the lower the steric hindrances, and the greater the attractive forces, the more potent the potential drug since these properties are consistent with a tighter binding constant. Furthermore, the more specificity in the design of a potential drug, the more likely that the drug will not interact as well with other proteins. This will minimize potential side-effects due to unwanted interactions with other proteins.
A variety of methods are available to one skilled in the art for evaluating and virtually screening molecules or chemical fragments appropriate for associating with a protein. Such association may be in a variety of forms including, for example, steric interactions, van der Waals interactions, electrostatic interactions, solvation interactions, charge interactions, covalent bonding interactions, non-covalent bonding interactions (e.g., hydrogen-bonding interactions), entropically or enthalpically favorable interactions, and the like.
Numerous computer programs are available and suitable for rational drug design and the processes of computer modeling, model building, and computationally identifying, selecting and evaluating potential inhibitors in the methods described herein. These include, for example, GRID (available form Oxford University, UK), MCSS (available from Molecular Simulations Inc., Burlington, Mass.), AUTODOCK (available from Oxford Molecular Group), FLEX X (available from Tripos, St. Louis. Mo.), DOCK (available from University of California, San Francisco), CAVEAT (available from University of California, Berkeley), HOOK (available from Molecular Simulations Inc., Burlington, Mass.), and 3D database systems such as MACCS-3D (available from MDL Information Systems, San Leandro, Calif.), UNITY (available from Tripos, St. Louis. Mo.), and CATALYST (available from Molecular Simulations Inc., Burlington, Mass.). Potential inhibitors may also be computationally designed “de novo” using such software packages as LUDI (available from Biosym Technologies, San Diego, Calif.), LEGEND (available from Molecular Simulations Inc., Burlington, Mass.), and LEAPFROG (Tripos Associates, St. Louis, Mo.). Compound deformation energy and electrostatic repulsion, may be evaluated using programs such as GAUSSIAN 92, AMBER, QUANTA/CHARMM, AND INSIGHT II/DISCOVER. These computer evaluation and modeling techniques may be performed on any suitable hardware including for example, workstations available from Silicon Graphics, Sun Microsystems, and the like. These techniques, methods, hardware and software packages are representative and are not intended to be comprehensive listing. Other modeling techniques known in the art may also be employed in accordance with embodiments disclosed herein. See for example, N.C. Cohen, Molecular Modeling in Drug Design, Academic Press (1996) (and references therein), and software identified at internet sites including the CAOS/CAMM Center Cheminformatics Suite at www.caos.kun.nl, and the NIH Molecular Modeling Home Page at www.fi.muni.cz/usr/mejzlik/mirrors/molbio.info.nih.gov/modeling/software list/.
A potential ligand may be obtained from commercial sources or synthesized from readily available starting materials using standard synthetic techniques and methodologies known to those of ordinary skill in the art. The potential ligand may then be assayed to determine its ability to inhibit the target protein as described above. Synthetic chemistry transformations and protecting group methodologies (protection and deprotection) useful in synthesizing ligand compounds are known in the art and include, for example, those such as described in R. Larock, Comprehensive Organic Transformations, VCH Publishers (1989); T. W. Greene and P. G. M. Wuts, Protective Groups in Organic Synthesis, 2d. Ed., John Wiley and Sons (1991); L. Fieser and M. Fieser, Fieser and Fieser's Reagents for Organic Synthesis, John Wiley and Sons (1994); and L. Paquette, ed., Encyclopedia of Reagents for Organic Synthesis, John Wiley and Sons (1995); incorporated by reference herein.
The ligands described herein may contain one or more asymmetric centers and thus occur as racemates and racemic mixtures, single enantiomers, individual diastereomers and diastereomeric mixtures. All such isomeric forms of these compounds are expressly included in the present disclosure. The ligands described herein may also be represented in multiple tautomeric forms, all of which are included herein. The ligands may also occur in cis- or trans- or E- or Z-double bond isomeric forms. All such isomeric forms of such ligands are expressly included in the present disclosure. All crystal forms of the ligands described herein are expressly included in the present disclosure.
Whether a CRAC domain is empty or loaded with cholesterol, a compound that would “cap” or stabilize the trimer, blocking access to the cholesterol binding site, can prevent triggering. This stabilization can be temporary, or even permanent if the affinity is high enough. For example, if the F protein is pre-loaded with cholesterol, such a compound could bind to the three cholesterol hydroxyl groups that would be exposed at the top of the F protein trimer.
Therefore, contemplated herein are methods of identifying a compound that can stabilize the crown of a fusion protein. Such methods include the step of using a three-dimensional structural representation of a pre-triggered soluble F protein, or a fragment thereof, which contains a CRAC domain to computationally screen a candidate compound that is capable of stabilizing the crown of a fusion protein.
Also contemplated herein are methods of identifying a compound that can stabilize the crown of a fusion protein, comprising the step of using a three-dimensional structural representation of a pre-triggered soluble F protein, or a fragment thereof, which contains a CRAC domain to computationally design a synthesizable candidate compound that binds and is capable of stabilizing the crown of a fusion protein.
The computational design can include the steps of: identifying chemical entities or fragments capable of associating with the CRAC1 binding site; and assembling the chemical entities or fragments into a single molecule to provide the structure of the candidate compound.
In one example, the CRAC domain may comprise three CRAC1 motifs located in a pit at the top of the F protein trimer crown. Each CRAC1 motif has the sequence V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID NO: 40), or V/L/I-X1-5-Y/F/W-X1-5-D/E (SEQ ID NO: 41).
Compounds that stabilize the F protein, preventing triggering can be detected by their ability to inhibit changes in the structural indicator of sF proteins, or functional fragments thereof (e.g. circular dichroism or spectrofluorimetric spectrum) as discussed below.
Screening for candidate antiviral agents using soluble, pre-triggered F protein and fragments thereof.
Without wishing to be bound by theory, it is believed that the triggering mechanism works in one of two ways. If CRAC1 is empty, a compound that binds to CRAC1 will cause the F protein to either (i) trigger prematurely, leaving it spent and inactive and destroying the infectivity of the virion in whose membrane the F protein sits, or (ii) not trigger at all when it contacts a target cell membrane. If, on the other hand, the CRAC1 is pre-loaded with cholesterol, a compound that binds to CRAC1 more strongly than cholesterol, and so is capable of displacing cholesterol, would also reduce the infectivity of the virion by causing either (i) or (ii) above. In either case, such a compound can inhibit the biological activity of the fusion protein and reduce the infectivity of the virus.
Accordingly, contemplated herein are methods of screening for a candidate paramyxovirus antiviral agent using a soluble, pre-triggered F protein of a paramyxovirus, or fragments thereof, that comprise a CRAC1 domain having the sequence V/L/I-X1-5-Y/F/W-X1-5-R/K (SEQ ID NO:40) or V/L/I-X1-5-Y/F/W-X1-5-D/E (SEQ ID NO: 41). The method includes the steps of: (i) contacting a test agent with the soluble pre-triggered F protein or a functional fragment thereof; (ii) detecting a structural indicator of the soluble pre-triggered protein, or the fragment thereof, wherein a change in the structural indicator in the presence of the test agent as compared to the absence of the test agent indicates that the agent is a candidate antiviral agent for the paramyxovirus. In this method, the test agent would prematurely trigger the F protein, thereby reducing infectivity of the virus.
Alternative methods include screening for compounds that prevent RSV F protein triggering. The sF protein will likely be triggered by the addition of stimuli such as by incubation with lipid membranes, including liposomes, or by the addition of heat. Compounds that stabilize the sF protein can be detected by their ability to inhibit sF triggering when the sF protein is exposed to a triggering event. A structural indicator, as described above, can be used to detect conformational change in the F protein. The positive control for these screening assays can be sF protein heated or exposed to liposomes in the absence of any test compound. These assays could easily be adapted for high throughput to identify compounds that stabilize the sF protein, as described above for compounds that trigger the sF protein.
Thus, in another embodiment, the method of screening includes: (i) contacting a test agent with the soluble pre-triggered F protein of a paramyxovirus, or a fragment thereof, to form a test sF protein; (ii) exposing the test sF protein to a triggering event; and (iii) assessing a structural indicator of the test sF protein before and after exposure to the triggering event, wherein an absence of change in the structural indicator of the test sF protein after exposure to the triggering event indicates that the agent is a candidate antiviral agent for RSV. The absence of change in any individual sF protein indicates that the sF protein did not trigger, i.e. is incapable of triggering after contact with the test compound. Therefore, this screening method would identify compounds that can block the activity of the F protein, thereby reducing or blocking the infectivity of the virus.
The control in this method would be an sF protein that has not been contacted with the test agent but has been contacted with a control substance similar to but lacking the test agent. In this case, the sF protein would exhibit a change in the structural indicator after the triggering event.
A candidate antiviral agent is a compound that is capable of reducing the infectivity of the paramyxovirus when administered to a subject infected, or at risk of being infected, with the paramyxovirus. In some embodiments, the antiviral agent is an anti-RSV agent.
As used herein, “triggering” refers to the conformational change when an isolated soluble F protein, or functional fragment thereof, goes from a pre-triggered conformation to a post-triggered conformation, as shown in
In some embodiments, the steps of either of the methods described above are performed in the absence of an attachment protein.
The “structural indicator” as used herein refers to a parameter that is capable of detection and that indicates whether the F protein, or functional fragment thereof, has or has not undergone a conformational change as a result of being triggered. Detecting a difference between the structural indicator of an F protein, or functional fragment thereof, before as compared to after exposure to a test agent is indicative of a conformational change in the F protein (i.e. indicates that the test agent has triggered the F protein). Alternatively, the absence of change in the structural indicator after the F protein, or functional fragment thereof, has been exposed to both the test agent and a triggering event indicates that the F protein is not capable of changing its conformation, (i.e., the test agent has locked the F protein in its pre-triggered form.
The methods use a pre-triggered, soluble F (sF) protein or a functional fragment thereof, as described above. Described below, and in Table 2, is a non-limiting list of examples of screening methods, as well as examples of fragments that can be used in such screening methods.
Any of the following assays could easily be adapted to a 96-well or 384-well or similar format for high throughput screening. In this way, many compounds can be simultaneously and quickly assayed for their abilities to trigger or block the sF protein. A library of compounds related to cholesterol or cholesterol mimics, or any other library of chemical compounds can be rapidly tested in this way to identify lead compounds.
The screening methods described above can use one or more structural indicators as follows:
Circular dichroism (CD). In one example, the structural indicator is circular dichroism (CD) spectrum of the protein.
In general, triggering converts the three short helices with their intervening non-helical regions into the long HR1 α-helix. The CD spectrum of a protein is highly sensitive to the secondary structure of the protein backbone. α-helical structure, β-sheet structure, and random coil have distinct, signature spectra. The conformational change upon triggering of the F protein converts several unstructured regions, and 2 β-sheets into a continuous α-helix. This increase in α-helicity and corresponding decrease in other structural components can be detected by change in the CD spectrum.
Fluorescence Emission. In another example, the structural indicator is the fluorescence emission of the sF protein as determined by, for example, spectro fluonimetory. Tryptophan (Trp) residues are responsible for the majority of a protein's fluorescent emission spectrum. When a solvent (polar environment) exposed Trp is excited in the range of 280 nm, the wavelength of maximum Trp emission is approximately 350 nm. When the same Trp is exposed to a hydrophobic environment, instead, the maximum emission is blue shifted.
The F protein contains 3 Trp residues (for a total of 9 in the trimer). Trp1 (W52) and Trp2 (W 314) are situated on the inside of the head, in the vicinity predicted for the fusion peptide, post-cleavage. Trp 3 (W 341) is situated on the exterior face of the F protein, pointing into the inter-domain interface that is also occupied by the N-terminus of the HR1 domain in the pre-triggered form.
If the structural changes of the sF protein alters the hydrophobicity of the environment of any of the three tryptophan residues, one or more spectral peaks will change their fluorescence value. Therefore, in another example, the structural indicator includes environmental monitoring of one or more tryptophan residues, Trp1, Trp2, or Trp 3, within the F protein. The environmental monitoring can include detecting a fluorescence emission shift effect and/or intensity change shown by one or more of the tryptophan residues.
In the case of sF protein, upon triggering, the hydrophobic fusion peptide is removed thereby changing the local environment of Trp1 and Trp2, exposing them to the solvent in the interior cavity of the F protein head. This polar environment will cause a shift in the emission spectrum that can be detected by a spectrofluorimeter as a measure of triggering. The local environment of Trp3 does not change significantly, as it remains on the solvent-exposed face of the protein in the post-triggered form. Therefore, Trp3 fluorescence could be used as a control.
In addition, we have found that tryptophan can replace the central tyrosine in the CRAC motif without loss of fusion activity. If the sF protein releases the bound cholesterol molecule when it is triggered, an sF molecule with tryptophan in this position will dramatically change its fluorescence.
In addition to the fluorescence emission shift effect shown by Trp residues in hydrophobic environments, Trp fluorescence is significantly quenched by contact with Asp and Glu residues. Trp 1 and Trp2 are near several Glu and Asp residues in the pre-triggered form, but are shielded from others by the interposing fusion peptide. When the fusion peptide is removed during triggering, Trp1 and Trp2 are exposed to these additional Asp and Glu contacts, resulting in significant quenching of the Trp1/Trp2 emission spectra.
Trp3 does not have any nearby Asp or Glu residues in either the pre or post-triggered F protein. In some embodiments, as an additional triggering monitor, an Asp or Glu residue could be engineered into HR1 at the point of contact with Trp3 in the pre-triggered form. Upon triggering, HR1 is dramatically removed from the neighborhood of Trp3, thereby removing the quenching effect of such an engineered quenching partner, and greatly increasing the intensity of the Trp3 emission spectrum.
Resonance Raman (RR) spectroscopy. Another example of structural indicator that involves environmental monitoring includes resonance Raman (RR) spectroscopy of the tryptophan residues.
Resonance Raman (RR) spectroscopy may be used for monitoring the microenvironment of specific amino acids. RR spectroscopy is based on scattering rather than emission. Generally, a monochromatic laser is used to excite the sample. Light from the laser interacts with vibrational, electronic or other transitions of the system, resulting in the energy of some photons being changed. The particular changes observed are indicative of the available excitation states in the sample. The excitation states of some amino acids (including Trp and Tyr) are sufficiently distinct that they may be excited, and therefore monitored, separate from each other and from the bulk of the protein. Because each residue's microenvironment affects its available excitation states, RR spectroscopy is another method that can used to selectively monitor the environment of Trp1/2/3 thereby detecting sF triggering.
Monitoring the environment of Trp 1/2/3 provides several assay mechanisms for observing the conformational change involved in triggering (extension of the HR1 helix). For example, the triggering initiation event, removal of the cholesterol from the CRAC domain, would effect a dramatic change in the local environment of the CRAC Tyr. Monitoring this Tyr therefore provides an assay mechanism for the triggering initiation event, rather than the triggering conformational change monitored by Trp1/2/3. Therefore, in yet another example, the structural indicator includes environmental monitoring of the CRAC region's central tyrosine residue. The environmental monitoring can be resonance Raman (RR) spectroscopy of the tyrosine residue. Alternatively, the tyrosine residue can be replaced by a tryptophan (Trp 4) and the environmental monitoring can be detecting a fluorescence emission shift effect shown by such Trp4 residue upon removal of cholesterol from the neighborhood of the CRAC domain.
Hydrophobic dye binding. Yet another example involves exposing the test F protein to a hydrophobic dye wherein the structural indicator is fluorescence of the hydrophobic dye. Examples of hydrophobic dyes include 8-anilinonaphthalene sulfonate (ANS), Sypro Orange, or a similar dye. Hydrophobic dyes are transparent in an aqueous environment, but display increasing fluorescence as the character of their environment becomes more hydrophobic. These dyes are commonly used to monitor the denaturation temperature of soluble proteins, as the loss of tertiary structure exposes hydrophobic regions of the proteins that would usually be buried and inaccessible to the dye. Upon binding to the hydrophobic regions, such a dye will fluoresce, signaling the change in structure. Hydrophobic dyes such as ANS or Sypro Orange can be used to monitor the onset of availability of these hydrophobic regions, thereby monitoring the conformational change caused by triggering. During the F protein triggering event the highly hydrophobic fusion peptide will become exposed, a hydrophobic dye will bind and fluoresce.
Liposome association. In another example, the structural indicator involves binding of the test F protein with a liposome membrane. Triggering of the sF protein exposes its fusion peptide. The highly hydrophobic fusion peptide will insert itself into the hydrophobic core of any available membrane. If liposomes are available, the fusion peptides will insert into these artificial membranes causing the sF protein to associate with the liposomes. The liposomes can be separated from the unbound sF protein by flotation centrifugation, by column chromatography, or other methods. The sF protein may also be triggered at some unknown rate by contact with lipid membranes, such as liposomes. For this reason, a test compound would most likely need to be added to the sF protein before exposing sF to the liposomes. Exposure of the pre-triggered F protein to liposomes can also cause some of the F molecules to trigger and could be used as an assay to identify compounds that block triggering.
Hydrophobic association. In another example, the structural indicator involves hydrophobic association. The surface of the pre-triggered sF protein is hydrophilic, like the surface of most proteins. However, when the sF protein is triggered, its fusion peptide is exposed. The fusion peptide is highly hydrophobic and hydrophobic surfaces have a strong attraction for other hydrophobic surfaces. Therefore, a structural indicator assay can use plates or beads with a hydrophobic surface, to which the post-triggered sF protein, but not the pre-triggered sF protein will bind. In one example of this assay an aliquot of pre-triggered sF protein in solution will be added to each well or bead. A test compound will be added and mixed. If the sF protein is triggered, it will expose its fusion peptide and bind to the hydrophobic surface of the well or bead. Unbound protein will be washed off and the bound protein can be detected. Various methods of detection are possible, including, but not limited to, detection by 6HIS (SEQ ID NO: 21) or FLAG M2 antibodies, or by antibodies that react specifically with the post-triggered sF protein. These antibodies can either be directly labeled with a detection molecule or detected by a secondary antibody labeled with a detection molecule. The detection molecule could be, for example but not limited to, a fluorescent molecule, such as fluorescene or rhodamine, or an enzyme. Binding of the fluorescent molecule can be detected by a fluorimeter. An enzyme, such as horseradish peroxidase or alkaline phosphatase can be detected by incubation with a corresponding substrate that is altered by the enzyme in a predictable manner, for example by turning color or by fluorescing, which can be detected in a spectrophotometer or fluorimeter, respectively. The sF protein could also be directly fused to a fluorescent moiety, such as a green fluorescent protein (GFP), or it can be chemically linked to a fluorescent molecule like fluoroscene or rhodamine, or fused to or chemically linked to an enzyme such as horseradish peroxidase or alkaline phosphatase.
Split GFP. In another embodiment, the structural indicator is the split GFP (Cabantous, S., et al. 2005. Nat Biotechnol 23:102-7) detection of the post-triggered sF protein. In the pre-triggered sF protein, the N and C termini of sF1 are far apart but they are brought together in the post-triggered form. In this method, one portion of GFP is fused to the sF protein fusion peptide sequence via a flexible linker, replacing both the furin cleavage site N terminal to the fusion peptide and pep27. Pep27 is the peptide between the two natural F protein furin cleavage sites that is normally removed during processing in the Golgi (
FRET In another assay, the structural indicator comprises Forster Resonance Energy Transfer (FRET) (Piston, D. W., and G. J. Kremers. 2007. Trends Biochem Sci 32:407-14) detection of the post-triggered sF protein in which the N and C termini of the sF protein are brought together. The sF construct is similar to the construct described above for the split GFP approach, except that two complete or nearly complete fluorescent proteins are fused to the sF1 protein: one N terminal to the fusion peptide and replacing the pep27 sequence, the furin cleavage site and possibly the fusion peptide sequence; and the other at the C terminus of the sF protein (
Enzyme immunoassay (EIA). In one example, the structural indicator is loss of antibody binding. Triggering the sF protein (for example by heat treatment) causes the sF protein to dramatically alter its conformation, as indicated by the loss of sF binding to neutralizing MAbs (
The ability of a compound to prevent sF protein triggering is tested in the same manner, i.e., following the addition of test compound, the plate is exposed to a triggering event (e.g. heat). Detection is performed in the same manner. If the neutralizing MAb binds to the sF protein, the test compound prevented triggering.
Alternatively, a 96-well assay plate is coated with a MAb to the post-triggered form of the sF protein. A solution of pre-triggered sF protein is added to the well along with a test compound. If the test compound causes the sF protein to trigger, the resulting post-triggered sF protein will bind to the MAb on the well. Unbound sF protein is washed off and the EIA is developed with a second MAb that also recognizes the post-triggered F protein but at a different antigenic site. The second MAb is directly labeled with a fluorescent molecule or an enzyme followed by its substrate.
The ability of a compound to prevent sF protein triggering is tested in the same manner, but following the addition of the sF protein solution and the test compound, the plate is exposed to a triggering event (e.g. heat). Detection is performed in the same manner. If the second, post-triggering specific, MAb detects the sF protein, the test compound did not prevent triggering. If this MAb does not detect the sF protein, the test compound prevented triggering.
Functional Assays
The primary assays, as described above, can be followed by functional assays that use the membrane-bound F protein to assess cell-cell fusion or viral infection of cells.
Cell-cell fusion. Expression of the complete F protein (with or without pep27) in cultured cells that are sensitive to viral infection causes the cells to fuse. The F protein can be expressed either by infecting with a virus or by transfecting transiently or stably with the F gene alone. Stable transfection with the F gene would likely require control with an inducible promoter to prevent fusion during cell growth and before addition of the test compounds. This assay can also be developed as a high throughput assay. The read-out can be by microscopic counting of syncytia.
This assay could include a gene for a protein whose presence is relatively simple to detect, such as luciferase, driven by a promoter which is normally switched off, in one cell line. A second set of cells containing the molecule needed to activate transcription of the detection gene can be added to the wells and incubated, usually for 4 to 12 hours to allow the F protein to cause fusion. At that point, the cells are lysed and the amount of enzyme generated is determined by the addition of substrate (see, for example, Nussbaum, O., et al. (1994). J Virol 68(9), 5411-22.). Such a cell-cell fusion assay could be used to screen for compounds that inhibit fusion.
Virus infection. Additional proof that a compound has antiviral activity is the demonstration that it inhibits infection of cultured cells. In another embodiment, compounds that are able to trigger the sF protein, or to prevent sF protein triggering, identified by the primary screening methods above, can be tested for their ability to prevent viral infection in a secondary screen. For example, in a high throughput assay, multi-well tissue culture plates such as 96-well or 386-well plates are seeded with cultured cells sensitive to paramyro virus infection and inoculated with a fixed number of infectious viruses, usually 30 to 100 plaque-forming units (pfu). Compounds are added before, with, or after virus addition. After a period of time, usually 1 to 3 days, the cells are fixed with a reagent such as methanol, stained with a dye such as methylene blue, and examined by microscope for small syncytia, the fused cells that result from infection. Alternatively, the cells can be stained with an antibody to one or more of the viral proteins. The antibody can be either directly labeled with a fluorochrome or with an enzyme whose substrate precipitates at the site, or can be detected by a secondary antibody that is linked to a fluorochrome or an enzyme.
Alternatively, a recombinant virus expressing a marker protein such as an enzyme, luciferase, β-galactosidase, or other, or a fluorescent protein, such as a green fluorescent protein, red fluorescent protein, or other can be used. In that case the number of infected cells can be counted with a microscope after an appropriate passage of time, e.g., the following day.
In an alternative embodiment, the inoculum can be a much higher amount of virus, usually averaging one or more pfu/cell. In this case, the plate can be analyzed the following day or later by a plate reader. Compounds that have no effect on virus infection result in bright fluorescence or large amounts of enzyme production detected by the addition of substrate, but compounds that inhibit viral replication will prevent the virus from expressing its fluorescent protein and the wells will be less bright or turn over less substrate. Detergent may be added to each well to enhance the accuracy of the reading by homogenizing the signal across each well.
When used as a secondary assay, these infectivity assays will be able to assess the antiviral activity of compounds identified in the sF protein triggering, and triggering inhibition, assays described above.
All of the screening methods described herein can be used for members of the paramyxovirus family whose F proteins contain CRAC domains, including, pneumoviruses or human RSV.
Also contemplated herein are compounds identified using the screening methods described above. Focused libraries of compounds representing the precursors to cholesterol or derivatives of cholesterol in their natural state or derivatized at any possible site or sites with formyl, acetyl, hydroxyl, or any other R group can be used to screen for active compounds. Likewise, focused libraries of compounds that make contacts with the CRAC domain that are similar to cholesterol and those that are derivatized at any possible site or sites with formyl, acetyl, hydroxyl, or any other R group can be used to screen for active compounds.
Compounds that inhibit the synthesis of cholesterol
Since cholesterol in a liposome membrane as a model of the target cell membrane enhances the ability of the F protein to trigger, blocking its synthesis in an infected cell would reduce or prevent infection of that cell. If cholesterol is incorporated into the F protein, blocking its synthesis in an infected cell would prevent incorporation into the F protein. For example, if incorporation into the F protein is necessary for the F protein to trigger when it contacts a target cell, the F protein that is produced in that cell and incorporated into virions would be unable to trigger and the virion would not be infectious. Alternatively, if incorporation of cholesterol stabilizes the F protein in its pre-triggered form, without cholesterol the F protein would be unstable and may trigger prematurely, preventing virion formation or allowing the formation of virions that are non-infectious.
Therefore any compound that can reduce or inhibit cholesterol synthesis can be a candidate antiviral compound capable of reducing or inhibiting the biological activity of the fusion protein. Accordingly, contemplated herein are compounds that can reduce, inhibit, or block cholesterol synthesis in infected cells, thereby reducing the biological activity or infectivity of the virus. Such a compound will have antiviral activity against a paramyxovirus that contains a CRAC domain. In one example, the paramyxovirus belongs to the pneumovirus subfamily. In another example, the paramyxovirus is human RSV. In another example, the paramyxovirus is PIV3, PIV1, or NDV.
In order that the embodiments disclosed herein may be more readily understood, the following examples are set forth. It should be understood that these examples are for illustrative purposes only and are not to be construed as limiting the embodiments disclosed in any manner.
We used the following strategy to model the pre-triggered and post-triggered forms of the RSV sF protein based on the X-ray crystallographic structures of the PIV5 and PIV3 sF structures (Yin, et al. 2005. Proc Natl Acad Sci USA 102(26), 9288-93; Yin, et al. 2006 Nature 439(7072), 38-44 respectively (
We generated three versions of the RSV sF protein from the full-length F protein gene of an RSV subgroup A strain virus. The virus gene was cloned as part of the complete RSV genomic cDNA clone, D46. The D46 F protein sequence is identical to the A2 strain of RSV (GenBank: X02221) with the exception of three amino acids. The F protein of A2 differs from D46 at E66K, P101Q, and F342Y, where the first letter represents the A2 amino acid, at the numbered position, and the final letter represents the D46 amino acid at that position.
We used a codon-optimized, synthetic version of the D46 RSV A2 F protein gene (from Peter Collins
We removed the optimized F gene from the optiF plasmid by digestion with SacII and XhoI and inserted it into the same restriction sites in the expression plasmid, MP319, a modified version of pcDNA3.1+ (Invitrogen, Inc.) that we prepared by inserting these restriction sites into its multiple cloning site. A cytomegalovirus promoter preceded the F gene in this plasmid to drive its expression. The final cDNA clone, MP340, expressed the RSV F protein from the nucleus when transfected into mammalian HeLa or human embryonic kidney 293T cells.
The three versions of the RSV sF protein (cartoon in
SC-1, the original sF gene that contains a FLAG tag followed by a 6HIS tag (SEQ ID NO: 21) at the C terminus of the F sequence, was constructed from the optimized F protein gene in pUC19 vector using inverse PCR mutagenesis (Byrappa, Gavin, and Gupta, 1995). The entire plasmid was amplified using two oligonucleotide primers (SEQ ID NO. 7 and 8) that include the sequence to be added (FLAG and 6HIS tags (SEQ ID NO: 21)), and set apart on the target plasmid to exclude the sequence to be deleted (transmembrane and cytoplasmic domains of the protein). The PCR product was purified, ligated, transformed into E. coli, and plated on bacterial medium-containing agar with ampicillin. Surviving clones were analyzed for plasmids containing the mutant sequence.
The first plasmid expressing sF protein, SC-2, was generated by digesting SC-1 with SacII and XhoI then inserting into the similarly digested MP340, pcDNA3.1 based expression vector (Invitrogen).
HC-1 was generated directly from SC-2 by inverse PCR mutagenesis with two oligonucleotide primers (SEQ ID NO. 9 and 10) to introduce two cysteine residues in place of the two C terminal amino acids of the F sequence in SC-2, in order to covalently link the three monomers within the trimer.
The sMP340-A construct was more complex to generate because it included a large stretch of novel sequence and there was no convenient restriction sites near the site of insertion of this new sequence. We assembled the new sequence as a series of 4 overlapping oligonucleotide sequences (SEQ ID NO. 11, 12, 13, and 14), amplified them through 7 cycles in a thermocycler by PCR, then took a small portion of this reaction and added two primers (SEQ ID NO. 15 and 16) from the extreme ends of the new segment and amplified the complete novel sequence. The primer (SEQ ID NO. 16) at the 3′ (right) end contains an XhoI site for insertion into the plasmid. Because there were no convenient restriction sites in the C terminus of the F gene, we PCR amplified a segment of the F gene that contained a ClaI site at its 5′ end and overlapped with the novel synthetic sequence at its 3′ end. We mixed this PCR product with the novel sequence and PCR amplified with primers (SEQ ID NO. 16 and 17) at the extreme ends of this final product. This final product was digested with ClaI and XhoI and inserted into the similarly digested MP340 to generate sMP340-A.
The sF proteins were produced by transfecting human embryonic kidney 293T cells that had been passaged twice over the previous two days and grown in medium lacking antibiotics. Cells were transfected with each DNA construct mixed with the transfection reagent TransIT (Mims, Corp.), as described in the manufacturer's instructions. After 48 hours of incubation at 37° C. in 5% CO2, the medium was harvested, centrifuged at low speed (2,000×g) to remove cell debris, and the supernatant and cell lysate were analyzed by western blot (
To generate a larger amount of purified sF protein we repeated the protocol described above in 3 150 mm tissue culture dishes. At 48 hr post-transfection we collected the medium and purified the protein on a Nickel column (Qiagen), according to the manufacturer's instructions. The sF protein binds to the nickel column because of the 6HIS tag (SEQ ID NO: 21) and can be specifically eluted with imidizol. The purified sF protein was easily detected by SDS-PAGE and Coomassie blue staining (
To determine whether the partially purified sF proteins that we had produced were in the pre-triggered or post-triggered form, we analyzed them by ultracentrifugation through linear sucrose gradients. The pre-triggered form should not migrate very far into the gradient, but the post-triggered form will aggregate via the hydrophobic fusion peptide that is exposed upon triggering and migrate further into the gradient, as found for the PIV5 sF protein (Connolly et al., 2006. Proc Natl Acad Sci USA 103(47): 17903-8). The 15-55% linear sucrose gradients were ultracentrifuged for 18 hours at 41,000 rpm in an SW41 rotor (Beckman). In our initial experiments, both SC-2 and sMP340-A migrated further into the sucrose gradients than expected (
In the next attempt, we performed the complete experiment without freezing the sF proteins. The sMP340-A sF protein again did not remain at the top of the sucrose gradient nor did it move further into the gradient after treatment at 50° C., suggesting that this protein is not in the pre-triggered form to begin with and could not be triggered (
However, when we performed the sucrose gradient analysis without freezing the SC-2 sF protein, it remained near the top of the gradient (4° C. in
In an attempt to trigger the SC-2 sF protein, we heated it at 50° C. for 1 hour and then analyzed it by velocity linear sucrose gradient centrifugation as described above. The heated sF migrated much further into the gradient (50° C. in
The HC-1 sF protein behaved just like the SC-2 sF protein (
We produced two sF protein constructs, SC-2 and HC-1, that are in a pre-triggered form and can subsequently be triggered by the simplest known method, the addition of mild heat or cold. In some embodiments, the heat applied to induce triggering is from 37° C. to 55° C., including, for example, 42° C., 46° C. and 50° C., for a period of between 15 minutes to 4 hours, including, for example, 30 minutes, 45 minutes, 1 hour, 2 hours, and 3 hours. In one embodiment heating is performed at 50° C. for 1 hour. In another embodiment, triggering is caused by slow cooling, for example by placing the protein in an ice bath until it reaches 4° C. In other embodiments, triggering is obtained by placing the protein in a refrigerated environment, for example of 0° C., −4° C., −10° C., −15° C. or −20° C. until frozen.
Confirming the pre-triggered state by reactivity with neutralizing antibodies. We confirmed the pre-triggered status of the constructs by performing antibody reactivity studies. According to our hypothesis, any mouse monoclonal antibody (MAb) against the F protein that neutralizes RSV infectivity in cell culture would bind to the virion form of the F protein and probably to the pre-triggered form of the F protein. If the SC-2 or sMP340-A protein represents the pre-triggered form of the sF protein, neutralizing MAbs should recognize it. To test this possibility, cells were transfected with plasmids expressing the SC-2 and sMP340-A sF proteins and metabolically labeled with 35S-Metionine/Cysteine. Medium from these radiolabeled cells was immunoprecipitated with all 11 neutralizing MAbs (Crowe et al. 1998. Virology 252:373-5; Walsh, 1998 J Gen Virol. 1998 March; 79 (Pt 3):479-87; Walsh, E. E., and J. Hruska. 1983. J Virol 47:171-7) available to us (
MATERIALS & METHODS: Mouse MAbs A6, A8 and L4 against the RSV F protein were provided by Edward Walsh. Each of these MAbs binds to a different site on the RSV F protein. MAb L4 has been shown to neutralized RSV in the absence of complement, but it is 4-fold more effective in the presence of complement (Walsh, et al. 1986. J Gen Virol 67:505-13; Walsh, E. E., and J. Hruska. 1983. J Virol 47:171-7). The ability of a MAb to neutralize RSV indicates that it binds to the RSV virion, most likely the pre-triggered form, and blocks its ability to function in membrane fusion.
The remaining MAbs listed in
To determine the temperature at which the sF protein is triggered, the SC-2 sF protein was incubated at 4° C., 37° C., 42° C., 46° C., and 50° C. for an hour (
The shape of the pre-triggered PIV5 sF protein changes upon triggering with mild heat, as determined by others, using electron microscopy. If mild heat also causes the SC-2 sF protein to trigger, as suggested above by its more rapid migration in velocity sucrose gradients (
Because the SC-2 sF protein, lacking the transmembrane and cytoplasmic domains of the RSV F protein, is secreted in the pre-triggered form, detectable with 11 neutralizing MAbs, and can be triggered with mild heating to aggregate and at the same time to lose its MAb reactivity, the SC-2 sF protein is in the pre-triggered form. Therefore, the membrane anchor is not necessary to maintain the RSV sF protein in its pre-triggered form.
Triggering and stable association of the RSV sF protein with pure lipids in the form of liposomes. The classic method for detecting viral fusion protein triggering is to mix the protein with liposomes, artificial vesicles composed of pure lipids, and add a triggering agent. Triggering the viral protein exposes the fusion peptide which inserts into the nearest hydrophobic environment, the liposome membrane. Association is demonstrated by co-floatation in a sucrose gradient: the sF protein/liposome mixture is placed at the bottom of the tube in dense sucrose with progressively less dense sucrose layered above it, and centrifuged. Because of the low density of lipids, the liposomes float up through the gradient carrying associated proteins with them, while proteins that did not associate with the liposomes remain at the bottom of the gradient.
We used liposomes containing three types of lipids: POPC:POPE:Cholesterol=8:2:5 molar ratio (POPC is 1-Palmitoyl-2-Oleoyl-sn-Glycero-3-Phosphocholine; POPE is 1-Palmitoyl-2-Oleoyl-sn-Glycero-3-Phosphoethanol amine) to roughly model the content of the plasma membrane. Surprisingly, we found that the mere addition of the RSV sF protein to these liposomes and incubation at 37° C. caused liposome association and co-flotation (
RSV virion fusion with pure lipids in the form of liposomes. We have examined the ability of sucrose density gradient-purified, recombinant green fluorescent protein-expressing virions with the F protein as their only glycoprotein (rgRSV-F), labeled with self-quenching amounts of R18 lipid dye, to fuse with liposomes prepared from pure lipids, leading to R18 dilution and fluorescence. 100% is defined as the fluorescence of the same amount of virions treated with the detergent Triton-X100 to dissociate and therefore dequench all of the R18. Over the course of 18 min, 1.2% of the rgRSV-F virions fused with POPC (1-Palmitoyl-2-Oleoyl-sn-Glycero-3-Phosphocholine) liposomes (
In this experiment, fusion of the virions with liposomes was not as rapid as sF protein triggering by liposomes (
The rgRSV-F virions fused more efficiently with liposomes containing 30% cholesterol in addition to POPC lipids: 1.7% over 18 min as compared to 1.2% of the liposomes lacking cholesterol (
Experimental test of the role of the CRAC1 domain in fusion. Our discovery of the cholesterol-binding CRAC domain (CRAC1) lining the central pore in the crown of the RSV F protein in combination with our recognition of stabilizing interactions that would be disrupted if the CRAC1 domain were pulled away led us to discover that cholesterol is involved in the triggering mechanism. Both the CRAC1 domain and the interacting peptide are located within the region of the F protein that must reform into a long α-helix as the F protein is triggered. According to this discovery, mutation of any one of the three signature CRAC domain amino acids (V/L, X1-5, Y, X1-5, R/K) would have a major negative effect on fusion. The CRAC1 sequence in the RSV F protein is: 192VLDLKNYIDK. (SEQ ID NO. 20) The residue numbering corresponds to the amino acid sequence of SEQ ID NO: 1. The amino acids that compose the minimal CRAC domain are L195, Y198 and K201 (underlined). We hypothesized that mutating these signature amino acids to alanine, the simplest amino acid, should reduce the fusion activity of the F protein without changing the secondary structure, the α-helix. On the other hand, mutation of these amino acids to the alternate acceptable amino acid (i.e. L to V, or K to R) should have minimal effect on fusion.
We made these mutations in the RSV F protein and tested their effects on cell-cell fusion (
Alternatively, it is possible that either V192 or L193 may be the critical CRAC amino acid in the first position, instead of L195. Mutations that change V192 or L193 to alanine destroyed the fusion activity of the F protein (
Conservation of the trigger domain in the F1 protein: the role of CRAC1 in other viruses: CRAC1 domain is conserved among several paramyxoviruses, including human RSV, bovine RSV, and human metapneumovirus (
To directly test whether phenylalanine can substitute for tyrosine in the central CRAC1 position, we mutated Y198 to phenylalanine. This mutation did not greatly affect cell-cell fusion (
CRAC1 is also present at the same position in the F protein of parainfluenzavirus type 1 and parainfluenzavirus type 3, and shifted 5 amino acids toward the C terminus in the F protein of Newcastle disease virus. Nipah virus has a CRAC domain immediately at the base of the fusion peptide, a more N terminal position than the others, that might perform a similar function. Both parainfluenza virus type 1 and Newcastle disease virus have phenylalanine as the central amino acid in their CRAC1 domains. We have shown above that phenylalanine can substitute for tyrosine in the central position, so these two CRAC1 domains are likely functional.
Measles virus has sequences similar to the CRAC domain in the position of the RSV CRAC1, but this domain ends with an acidic amino acid instead of a basic amino acid. We propose that such a domain also binds cholesterol since a charge may be the important aspect of this amino acid rather than the type of charge, positive or negative. Furthermore, in
There are other paramyxoviruses, such as mumps virus, parainfluenzavirus types 2 and 4, and SV5, that do not have a CRAC domain in the CRAC1 position (
Experimental Test of the Role of CRAC3 in Fusion
We have shown that a single mutation in the central tyrosine of the triggering CRAC1 domain inhibits cell-to-cell fusion (
The entire contents of the following references are incorporated herein by reference:
Yin, H. S., Wen, X., Paterson, R. G., Lamb, R. A., and Jardetzky, T. S. (2006). Structure of the parainfluenza virus 5 F protein in its metastable, prefusion conformation. Nature 439(7072), 38-44.
This application is the national stage of International Application No. PCT/US 08/66223, filed Jun. 6, 2008, which claims the benefit of U.S. Provisional Application No. 60/942,456, filed Jun. 6, 2007, the entire contents of which are incorporated herein by reference.
This invention was made, at least in part, with government support under National Institutes of Health Grant No: AI047213. The U.S. government may have certain rights in the invention.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US08/66223 | 6/6/2008 | WO | 00 | 6/4/2010 |
Number | Date | Country | |
---|---|---|---|
60942456 | Jun 2007 | US |