Influenza has a long history of pandemics, epidemics, resurgences and outbreaks. Avian influenza, including the H5N1 strain, is a highly contagious and potentially fatal pathogen, but it currently has only a limited ability to infect humans. However, avian flu viruses have historically been observed to accumulate mutations that alter their host specificity and allow them to readily infect humans. In fact, two of the major flu pandemics of the last century originated from avian flu viruses that changed their genetic makeup to allow for human infection.
There is a significant concern that the current H5N1, H7N7, H9N2 and H2N2 avian influenza strains might accumulate mutations that alter their host specificity and allow them to readily infect humans. Therefore, there is a need to assess whether the HA protein in these strains can, in fact, convert to a form that can readily infect humans, and a further need to identify HA variants with such ability. There is a further need to understand the characteristics of HA proteins generally that allow or prohibit infection of different subjects, particularly humans. There is also a need for vaccines and therapeutic strategies for effective treatment or delay of onset of disease caused by influenza virus.
The present invention provides hemagglutinin polypeptides with particular glycan binding characteristics. In particular, the present invention provides hemagglutinin polypeptides that bind to sialylated glycans having an umbrella-like topology. In certain embodiments, inventive HA polypeptides bind to umbrella glycans with high affinity and/or specificity. In some embodiments, inventive HA polypeptides show a binding preference for umbrella glycans as compared with cone-topology glycans.
The present invention also provides diagnostic and therapeutic reagents and methods associated with provided hemagglutinin polypeptides, including vaccines.
HA Sequence Element 1 is a sequence element corresponding approximately to residues 97-185 (where residue positions are assigned using H3 HA as reference) of many HA proteins found in natural influenza isolates. This sequence element has the basic structure:
C(Y/F)PX1CX2WX3WX4HHP, wherein:
In some embodiments, X1 is about 35-45, or about 35-43, or about 35, 36, 37, 38, 39, 40, 41, 42, or 43 amino acids long. In some embodiments, X2 is about 9-15, or about 9-14, or about 9, 10, 11, 12, 13, or 14 amino acids long. In some embodiments, X3 is about 26-28, or about 26, 27, or 28 amino acids long. In some embodiments, X4 has the sequence (G/A) (I/V). In some embodiments, X4 has the sequence GI; in some embodiments, X4 has the sequence GV; in some embodiments, X4 has the sequence AI; in some embodiments, X4 has the sequence AV. In some embodiments, HA Sequence Element 1 comprises a disulfide bond. In some embodiments, this disulfide bond bridges residues corresponding to positions 97 and 139 (based on the canonical H3 numbering system utilized herein).
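By way of illustration only, the basic structure of HA Sequence Element 1 can be expressed as a regular expression over one-letter amino acid codes. The following is a minimal sketch, assuming that the "about" ranges recited above are treated as hard bounds and that any residue may occupy a spacer position; the names `ELEMENT_1` and `find_element_1` and the placeholder sequence are hypothetical and do not appear elsewhere herein.

```python
import re

# Assumption: the "about" length ranges are used as strict bounds,
# and "." (any residue) stands in for each spacer position.
ELEMENT_1 = re.compile(
    r"C[YF]P"             # C(Y/F)P
    r"(?P<X1>.{35,45})"   # X1: about 35-45 residues
    r"C"
    r"(?P<X2>.{9,15})"    # X2: about 9-15 residues
    r"W"
    r"(?P<X3>.{26,28})"   # X3: about 26-28 residues
    r"W"
    r"(?P<X4>[GA][IV])"   # X4: (G/A)(I/V)
    r"HHP"
)


def find_element_1(sequence):
    """Return the spacer lengths of the first match, or None if absent."""
    match = ELEMENT_1.search(sequence)
    if match is None:
        return None
    return {name: len(match.group(name)) for name in ("X1", "X2", "X3", "X4")}


# Synthetic placeholder (not a real HA sequence): spacers are filled with "S".
synthetic = (
    "MK" + "CYP" + "S" * 39 + "C" + "S" * 13 + "W" + "S" * 26 + "W" + "GV" + "HHP" + "TT"
)
print(find_element_1(synthetic))
```

Named groups are used so that the observed lengths of X1-X4 can be compared against the subtype-specific lengths recited for H1, H3, and H5 polypeptides.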
In some embodiments, and particularly in H1 polypeptides, X1 is about 43 amino acids long, and/or X2 is about 13 amino acids long, and/or X3 is about 26 amino acids long. In some embodiments, and particularly in H1 polypeptides, HA Sequence Element 1 has the structure:
CYPX1AT(A/T)(A/S)CX2WX3WX4HHP, wherein:
X1A is approximately 27-42, or approximately 32-42, or approximately 32-40, or approximately 26-41, or approximately 31-41, or approximately 31-39, or approximately 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 amino acids long, and X2-X4 are as above.
In some embodiments, and particularly in H1 polypeptides, HA Sequence Element 1 has the structure:
CYPX1AT(A/T)(A/S)CX2W(I/L)(T/V)X3AWX4HHP, wherein:
X1A is approximately 27-42, or approximately 32-42, or approximately 32-40, or approximately 32, 33, 34, 35, 36, 37, 38, 39, or 40 amino acids long,
X3A is approximately 23-28, or approximately 24-26, or approximately 24, 25, or 26 amino acids long, and X2 and X4 are as above.
In some embodiments, and particularly in H1 polypeptides, HA Sequence Element 1 includes the sequence:
QLSSISSFK,
typically within X1 (including within X1A), and especially beginning about residue 12 of X1 (as illustrated, for example, in
In some embodiments, and particularly in H3 polypeptides, X1 is about 39 amino acids long, and/or X2 is about 13 amino acids long, and/or X3 is about 26 amino acids long.
In some embodiments, and particularly in H3 polypeptides, HA Sequence Element 1 has the structure:
CYPX1AS(S/N)(A/S)CX2WX3WX4HHP, wherein:
X1A is approximately 27-42, or approximately 32-42, or approximately 32-40, or approximately 23-38, or approximately 28-38, or approximately 28-36, or approximately 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 amino acids long, and X2-X4 are as above.
In some embodiments, and particularly in H3 polypeptides, HA Sequence Element 1 has the structure:
CYPX1AS(S/N)(A/S)CX2WL(T/H)X3AWX4HHP, wherein:
X1A is approximately 27-42, or approximately 32-42, or approximately 32-40, or approximately 32, 33, 34, 35, 36, 37, 38, 39, or 40 amino acids long,
X3A is approximately 23-28, or approximately 24-26, or approximately 24, 25, or 26 amino acids long, and X2 and X4 are as above.
In some embodiments, and particularly in H3 polypeptides, HA Sequence Element 1 includes the sequence:
(L/I)(V/I)ASSGTLEF,
typically within X1 (including within X1A), and especially beginning about residue 12 of X1 (as illustrated, for example, in
In some embodiments, and particularly in H5 polypeptides, X1 is about 42 amino acids long, and/or X2 is about 13 amino acids long, and/or X3 is about 26 amino acids long.
In some embodiments, and particularly in H5 polypeptides, HA Sequence Element 1 has the structure:
CYPX1ASSACX2WX3WX4HHP, wherein:
X1A is approximately 27-42, or approximately 32-42, or approximately 32-40, or approximately 23-38, or approximately 28-38, or approximately 28-36, or approximately 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 amino acids long, and X2-X4 are as above.
In some embodiments, and particularly in H5 polypeptides, HA Sequence Element 1 has the structure:
CYPX1ASSACX2WLIX3AWX4HHP, wherein:
X1A is approximately 27-42, or approximately 32-42, or approximately 32-40, or approximately 32, 33, 34, 35, 36, 37, 38, 39, or 40 amino acids long, and
X3A is approximately 23-28, or approximately 24-26, or approximately 24, 25, or 26 amino acids long, and X2 and X4 are as above.
In some embodiments, and particularly in H5 polypeptides, HA Sequence Element 1 is extended (i.e., at a position corresponding to residues 186-193) by the sequence:
NDAAEXX(K/R)
In some embodiments, and particularly in H5 polypeptides, HA Sequence Element 1 includes the sequence:
YEELKHLXSXXNHFEK,
typically within X1, and especially beginning about residue 6 of X1 (as illustrated, for example, in
HA Sequence Element 2 is a sequence element corresponding approximately to residues 324-340 (again using a numbering system based on H3 HA) of many HA proteins found in natural influenza isolates. This sequence element has the basic structure:
GAIAGFIE
In some embodiments, HA Sequence Element 2 has the sequence:
PX1GAIAGFIE, wherein:
X1 is approximately 4-14 amino acids long, or about 8-12 amino acids long, or about 12, 11, 10, 9 or 8 amino acids long. In some embodiments, this sequence element provides the HA0 cleavage site, allowing production of HA1 and HA2.
In some embodiments, and particularly in H1 polypeptides, HA Sequence Element 2 has the structure:
PS(I/V)QSRX1AGAIAGFIE, wherein:
X1A is approximately 3 amino acids long; in some embodiments, X1A is G (L/I) F.
In some embodiments, and particularly in H3 polypeptides, HA Sequence Element 2 has the structure:
PXKXTRX1AGAIAGFIE, wherein:
X1A is approximately 3 amino acids long; in some embodiments, X1A is G (L/I) F.
In some embodiments, and particularly in H5 polypeptides, HA Sequence Element 2 has the structure:
PQRXXXRXXRX1AGAIAGFIE, wherein:
X1A is approximately 3 amino acids long; in some embodiments, X1A is G (L/I) F.
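The subtype-specific forms of HA Sequence Element 2 recited above lend themselves to a simple pattern-matching sketch. The example below is illustrative only and assumes that each "X" denotes any single residue and that "approximately 3 amino acids" for X1A means exactly three; the names `ELEMENT_2_PATTERNS` and `classify_element_2`, and the short test strings, are hypothetical and were constructed solely to fit the recited patterns.

```python
import re

# Assumptions: "X" is any single residue; X1A is taken as exactly 3 residues.
ELEMENT_2_PATTERNS = {
    "H1": re.compile(r"PS[IV]QSR(?P<X1A>.{3})GAIAGFIE"),      # PS(I/V)QSR X1A GAIAGFIE
    "H3": re.compile(r"P.K.TR(?P<X1A>.{3})GAIAGFIE"),         # PXKXTR X1A GAIAGFIE
    "H5": re.compile(r"PQR...R..R(?P<X1A>.{3})GAIAGFIE"),     # PQRXXXRXXR X1A GAIAGFIE
}


def classify_element_2(sequence):
    """Return (subtype label, X1A) for every subtype pattern found in the sequence."""
    hits = []
    for subtype, pattern in ELEMENT_2_PATTERNS.items():
        match = pattern.search(sequence)
        if match is not None:
            hits.append((subtype, match.group("X1A")))
    return hits


# Illustrative strings built to satisfy each pattern (not database entries).
print(classify_element_2("AAPSIQSRGLFGAIAGFIEAA"))
print(classify_element_2("AAPEKQTRGIFGAIAGFIEAA"))
print(classify_element_2("AAPQRERRRKKRGLFGAIAGFIEAA"))
```

A match indicates only that the recited sequence element is present; it is not by itself a determination of HA subtype.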
Affinity: As is known in the art, “affinity” is a measure of the tightness with which a particular ligand (e.g., an HA polypeptide) binds to its partner (e.g., an HA receptor). Affinities can be measured in different ways.
Biologically active: As used herein, the phrase “biologically active” refers to a characteristic of any agent that has activity in a biological system, and particularly in an organism. For instance, an agent that, when administered to an organism, has a biological effect on that organism, is considered to be biologically active. In particular embodiments, where a protein or polypeptide is biologically active, a portion of that protein or polypeptide that shares at least one biological activity of the protein or polypeptide is typically referred to as a “biologically active” portion.
Broad spectrum human-binding (BSHB) H5 HA polypeptides: As used herein, the phrase “broad spectrum human-binding H5 HA” refers to a version of an H5 HA polypeptide that binds to HA receptors found in human epithelial tissues, and particularly to human HA receptors having α2-6 sialylated glycans. Moreover, inventive BSHB H5 HAs bind to a plurality of different α2-6 sialylated glycans. In some embodiments, BSHB H5 HAs bind to a sufficient number of different α2-6 sialylated glycans found in human samples that viruses containing them have a broad ability to infect human populations, and particularly to bind to upper respiratory tract receptors in those populations. In some embodiments, BSHB H5 HAs bind to umbrella glycans (e.g., long α2-6 sialylated glycans) as described herein.
Characteristic portion: As used herein, a “characteristic portion” of a protein or polypeptide is one that contains a continuous stretch of amino acids, or a collection of continuous stretches of amino acids, that together are characteristic of a protein or polypeptide. Each such continuous stretch generally will contain at least two amino acids. Furthermore, those of ordinary skill in the art will appreciate that typically at least 5, 10, 15, 20 or more amino acids are required to be characteristic of a protein. In general, a characteristic portion is one that, in addition to the sequence identity specified above, shares at least one functional characteristic with the relevant intact protein.
Characteristic sequence: A “characteristic sequence” is a sequence that is found in all members of a family of polypeptides or nucleic acids, and therefore can be used by those of ordinary skill in the art to define members of the family.
Cone topology: The phrase “cone topology” is used herein to refer to a 3-dimensional arrangement adopted by certain glycans and in particular by glycans on HA receptors. As illustrated in
Corresponding to: As used herein, the term “corresponding to” is often used to designate the position/identity of an amino acid residue in an HA polypeptide. Those of ordinary skill will appreciate that, for purposes of simplicity, a canonical numbering system (based on wild type H3 HA) is utilized herein (as illustrated, for example, in
Degree of separation removed: As used herein, amino acids that are a “degree of separation removed” are HA amino acids that have indirect effects on glycan binding. For example, one-degree-of-separation-removed amino acids may either: (1) interact with the direct-binding amino acids; and/or (2) otherwise affect the ability of direct-binding amino acids to interact with glycan that is associated with host cell HA receptors; such one-degree-of-separation-removed amino acids may or may not directly bind to glycan themselves. Two-degree-of-separation-removed amino acids either (1) interact with one-degree-of-separation-removed amino acids; and/or (2) otherwise affect the ability of the one-degree-of-separation-removed amino acids to interact with direct-binding amino acids, etc.
Direct-binding amino acids: As used herein, the phrase “direct-binding amino acids” refers to HA polypeptide amino acids which interact directly with one or more glycans that are associated with host cell HA receptors.
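The relationship between direct-binding amino acids and amino acids that are one or more degrees of separation removed can be pictured as a breadth-first traversal of a residue interaction graph. The sketch below is illustrative only. It assumes that a symmetric residue contact map is available and, as a simplification, models only the "interact with" prong of the definition (not residues that "otherwise affect" binding); the function name and the residue labels `D1`, `D2`, `N1`, and `N2` are hypothetical placeholders.

```python
from collections import deque


def degrees_of_separation(contacts, direct_binding):
    """Assign each reachable residue its degree of separation from glycan.

    contacts: dict mapping a residue label to the labels it interacts with
              (assumed symmetric; hypothetical input).
    direct_binding: labels of residues that contact glycan directly (degree 0).
    """
    degree = {residue: 0 for residue in direct_binding}
    queue = deque(direct_binding)
    while queue:
        current = queue.popleft()
        for neighbor in contacts.get(current, ()):
            if neighbor not in degree:
                # First visit in a breadth-first search gives the shortest path.
                degree[neighbor] = degree[current] + 1
                queue.append(neighbor)
    return degree


# Placeholder contact map: N1 touches a direct-binding residue, N2 touches N1.
contacts = {"D1": ["N1"], "N1": ["D1", "N2"], "N2": ["N1"], "D2": []}
print(degrees_of_separation(contacts, ["D1", "D2"]))
```

Residues that never appear in the returned mapping have no interaction path to a direct-binding amino acid in the supplied contact map.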
Engineered: The term “engineered”, as used herein, describes a polypeptide whose amino acid sequence has been selected by man. For example, an engineered HA polypeptide has an amino acid sequence that differs from the amino acid sequences of HA polypeptides found in natural influenza isolates. In some embodiments, an engineered HA polypeptide has an amino acid sequence that differs from the amino acid sequence of HA polypeptides included in the NCBI database.
H1 polypeptide: An “H1 polypeptide”, as that term is used herein, is an HA polypeptide whose amino acid sequence includes at least one sequence element that is characteristic of H1 and distinguishes H1 from other HA subtypes. Representative such sequence elements can be determined by alignments such as, for example, those illustrated in
H3 polypeptide: An “H3 polypeptide”, as that term is used herein, is an HA polypeptide whose amino acid sequence includes at least one sequence element that is characteristic of H3 and distinguishes H3 from other HA subtypes. Representative such sequence elements can be determined by alignments such as, for example, those illustrated in
H5 polypeptide: An “H5 polypeptide”, as that term is used herein, is an HA polypeptide whose amino acid sequence includes at least one sequence element that is characteristic of H5 and distinguishes H5 from other HA subtypes. Representative such sequence elements can be determined by alignments such as, for example, those illustrated in
Hemagglutinin (HA) polypeptide: As used herein, the term “hemagglutinin polypeptide” (or “HA polypeptide”) refers to a polypeptide whose amino acid sequence includes at least one characteristic sequence of HA. A wide variety of HA sequences from influenza isolates are known in the art; indeed, the National Center for Biotechnology Information (NCBI) maintains a database (www.ncbi.nlm.nih.gov/genomes/FLU/flu.html) that, as of the filing of the present application, included 9796 HA sequences. Those of ordinary skill in the art, referring to this database, can readily identify sequences that are characteristic of HA polypeptides generally, and/or of particular HA polypeptides (e.g., H1, H2, H3, H4, H5, H6, H7, H8, H9, H10, H11, H12, H13, H14, H15, or H16 polypeptides; or of HAs that mediate infection of particular hosts, e.g., avian, camel, canine, cat, civet, environment, equine, human, leopard, mink, mouse, seal, stone marten, swine, tiger, whale, etc.). For example, in some embodiments, an HA polypeptide includes one or more characteristic sequence elements found between about residues 97 and 185, 324 and 340, 96 and 100, and/or 130 and 230 of an HA protein found in a natural isolate of an influenza virus. In some embodiments, an HA polypeptide has an amino acid sequence comprising at least one of HA Sequence Elements 1 and 2, as defined herein. In some embodiments, an HA polypeptide has an amino acid sequence comprising HA Sequence Elements 1 and 2, in some embodiments separated from one another by about 100-200, or by about 125-175, or about 125-160, or about 125-150, or about 129-139, or about 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, or 139 amino acids. In some embodiments, an HA polypeptide has an amino acid sequence that includes residues at positions within the regions 96-100 and/or 130-230 that participate in glycan binding. For example, many HA polypeptides include one or more of the following residues: Tyr98, Ser/Thr136, Trp153, His183, and Leu/Ile194.
In some embodiments, an HA polypeptide includes at least 2, 3, 4, or all 5 of these residues.
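For illustration, the count of the five residues named above (Tyr98, Ser/Thr136, Trp153, His183, and Leu/Ile194) can be computed once a candidate sequence has been mapped onto the canonical H3 numbering. The following sketch assumes that such a mapping has already been obtained from an alignment and is supplied as a dictionary of position to one-letter residue code; the names `KEY_RESIDUES` and `count_key_residues` are hypothetical.

```python
# Residues named in the text, keyed by H3-numbered position (one-letter codes).
KEY_RESIDUES = {
    98: {"Y"},         # Tyr98
    136: {"S", "T"},   # Ser/Thr136
    153: {"W"},        # Trp153
    183: {"H"},        # His183
    194: {"L", "I"},   # Leu/Ile194
}


def count_key_residues(residue_at):
    """Count how many of the five named positions carry a listed residue.

    residue_at: dict mapping H3-numbered positions to one-letter residues
                (assumed to come from a prior alignment; hypothetical input).
    """
    return sum(
        1 for position, allowed in KEY_RESIDUES.items()
        if residue_at.get(position) in allowed
    )


print(count_key_residues({98: "Y", 136: "T", 153: "W", 183: "H", 194: "L"}))
print(count_key_residues({98: "Y", 136: "A", 153: "W"}))
```

Positions missing from the supplied mapping are simply treated as not matching.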
Isolated: The term “isolated”, as used herein, refers to an agent or entity that has either (i) been separated from at least some of the components with which it was associated when initially produced (whether in nature or in an experimental setting); or (ii) been produced by the hand of man. Isolated agents or entities may be separated from at least about 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or more of the other components with which they were initially associated. In some embodiments, isolated agents are more than 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% pure.
Long oligosaccharide: For purposes of the present disclosure, an oligosaccharide is typically considered to be “long” if it includes at least one linear chain that has at least four saccharide residues.
Non-natural amino acid: The phrase “non-natural amino acid” refers to an entity having the chemical structure of an amino acid (i.e.,:
and therefore being capable of participating in at least two peptide bonds, but having an R group that differs from those found in nature. In some embodiments, non-natural amino acids may also have a second R group rather than a hydrogen, and/or may have one or more other substitutions on the amino or carboxylic acid moieties.
Polypeptide: A “polypeptide”, generally speaking, is a string of at least two amino acids attached to one another by a peptide bond. In some embodiments, a polypeptide may include at least 3-5 amino acids, each of which is attached to others by way of at least one peptide bond. Those of ordinary skill in the art will appreciate that polypeptides sometimes include “non-natural” amino acids or other entities that nonetheless are capable of integrating into a polypeptide chain, optionally.
Pure: As used herein, an agent or entity is “pure” if it is substantially free of other components. For example, a preparation that contains more than about 90% of a particular agent or entity is typically considered to be a pure preparation. In some embodiments, an agent or entity is at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% pure.
Short oligosaccharide: For purposes of the present disclosure, an oligosaccharide is typically considered to be “short” if it has fewer than 4, or certainly fewer than 3, residues in any linear chain.
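The "long" and "short" oligosaccharide definitions herein reduce to a test on the longest linear chain. The sketch below is illustrative only and assumes that the residue count of each linear chain has already been determined from the glycan structure; the function name `classify_oligosaccharide` is hypothetical.

```python
def classify_oligosaccharide(linear_chain_lengths):
    """Classify a glycan from the residue counts of its linear chains.

    "long": at least one linear chain has at least four saccharide residues.
    "short": every linear chain has fewer than four residues.
    """
    if not linear_chain_lengths:
        raise ValueError("at least one linear chain length is required")
    return "long" if max(linear_chain_lengths) >= 4 else "short"


# A single linear chain of five saccharide residues is classified as long.
print(classify_oligosaccharide([5]))
# Two branches of three and two residues are classified as short.
print(classify_oligosaccharide([3, 2]))
```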
Specificity: As is known in the art, “specificity” is a measure of the ability of a particular ligand (e.g., an HA polypeptide) to distinguish its binding partner (e.g., a human HA receptor, and particularly a human upper respiratory tract HA receptor) from other potential binding partners (e.g., an avian HA receptor).
Therapeutic agent: As used herein, the phrase “therapeutic agent” refers to any agent that elicits a desired biological or pharmacological effect.
Treatment: As used herein, the term “treatment” refers to any method used to alleviate, delay onset, reduce severity or incidence, or yield prophylaxis of one or more symptoms or aspects of a disease, disorder, or condition. For the purposes of the present invention, treatment can be administered before, during, and/or after the onset of symptoms.
Umbrella topology: The phrase “umbrella topology” is used herein to refer to a 3-dimensional arrangement adopted by certain glycans and in particular by glycans on HA receptors. The present invention encompasses the recognition that binding to umbrella topology glycans is characteristic of HA proteins that mediate infection of human hosts. As illustrated in
Vaccination: As used herein, the term “vaccination” refers to the administration of a composition intended to generate an immune response, for example to a disease-causing agent. For the purposes of the present invention, vaccination can be administered before, during, and/or after exposure to a disease-causing agent, and in certain embodiments, before, during, and/or shortly after exposure to the agent. In some embodiments, vaccination includes multiple administrations, appropriately spaced in time, of a vaccinating composition.
Variant: As used herein, the term “variant” is a relative term that describes the relationship between a particular HA polypeptide of interest and a “parent” HA polypeptide to which its sequence is being compared. An HA polypeptide of interest is considered to be a “variant” of a parent HA polypeptide if the HA polypeptide of interest has an amino acid sequence that is identical to that of the parent but for a small number of sequence alterations at particular positions. Typically, fewer than 20%, 15%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, or 2% of the residues in the variant are substituted as compared with the parent. In some embodiments, a variant has 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 substituted residue as compared with a parent. Often, a variant has a very small number (e.g., fewer than 5, 4, 3, 2, or 1) of substituted functional residues (i.e., residues that participate in a particular biological activity). Furthermore, a variant typically has not more than 5, 4, 3, 2, or 1 additions or deletions, and often has no additions or deletions, as compared with the parent. Moreover, any additions or deletions are typically fewer than about 25, 20, 19, 18, 17, 16, 15, 14, 13, 10, 9, 8, 7, or 6 residues, and commonly are fewer than about 5, 4, 3, or 2 residues. In some embodiments, the parent HA polypeptide is one found in a natural isolate of an influenza virus (e.g., a wild type HA).
Vector: As used herein, “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. In some embodiments, vectors are capable of extra-chromosomal replication and/or expression of nucleic acids to which they are linked in a host cell such as a eukaryotic or prokaryotic cell. Vectors capable of directing the expression of operatively linked genes are referred to herein as “expression vectors.”
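The substitution thresholds recited in the definition of “variant” can be checked with a short calculation. The following sketch is illustrative only and assumes that the parent and candidate sequences have the same length (i.e., no additions or deletions) and are already aligned position by position; the function names and the ten-residue example strings are hypothetical.

```python
def substitution_fraction(parent, candidate):
    """Fraction of positions at which the candidate differs from the parent.

    Assumption: sequences are pre-aligned and of equal, non-zero length.
    """
    if len(parent) != len(candidate) or not parent:
        raise ValueError("sequences must be non-empty and of equal length")
    substitutions = sum(1 for a, b in zip(parent, candidate) if a != b)
    return substitutions / len(parent)


def is_variant(parent, candidate, max_fraction=0.20):
    """True if fewer than max_fraction of the residues are substituted."""
    return substitution_fraction(parent, candidate) < max_fraction


parent = "ACDEFGHIKL"
print(is_variant(parent, "ACDEFGHIKV"))   # one substitution in ten residues
print(is_variant(parent, "ACDEFGHIVV"))   # two substitutions in ten residues
```

The default of 20% corresponds to the broadest threshold recited; a stricter value (e.g., 0.05) can be passed as `max_fraction`.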
Wild type: As is understood in the art, the phrase “wild type” generally refers to a normal form of a protein or nucleic acid, as is found in nature. For example, wild type HA polypeptides are found in natural isolates of influenza virus. A variety of different wild type HA sequences can be found in the NCBI influenza virus sequence database, http://www.ncbi.nlm.nih.gov/genomes/FLU/FLU.html.
The present invention provides HA polypeptides that bind to umbrella topology glycans. In some embodiments, the present invention provides HA polypeptides that bind to umbrella topology glycans found on HA receptors of a particular target species. For example, in some embodiments, the present invention provides HA polypeptides that bind to umbrella topology glycans found on human HA receptors, e.g., HA receptors found on human epithelial cells, and particularly HA polypeptides that bind to umbrella topology glycans found on human HA receptors in the upper respiratory tract.
The present invention provides HA polypeptides that bind to HA receptors found on cells in the human upper respiratory tract, and in particular provides HA polypeptides that bind to such receptors (and/or to their glycans, particularly to their umbrella glycans) with a designated affinity and/or specificity.
The present invention encompasses the recognition that gaining an ability to bind umbrella topology glycans (e.g., long α2-6 sialylated glycans), and particularly an ability to bind with high affinity, may confer upon an HA polypeptide variant the ability to infect humans (where its parent HA polypeptide cannot). Without wishing to be bound by any particular theory, the present inventors propose that binding to umbrella topology glycans may be paramount, and in particular that loss of binding to other glycan types may not be required.
The present invention further provides various reagents and methods associated with inventive HA polypeptides including, for example, systems for identifying them, strategies for preparing them, antibodies that bind to them, and various diagnostic and therapeutic methods relating to them. Further description of certain embodiments of these aspects, and others, of the present invention, is presented below.
Influenza viruses are RNA viruses which are characterized by a lipid membrane envelope containing two glycoproteins, hemagglutinin (HA) and neuraminidase (NA), embedded in the membrane of the virus particle. There are 16 known HA subtypes and 9 NA subtypes, and different influenza strains are named based on the number of the strain's HA and NA subtypes. Based on comparisons of amino acid sequence identity and of crystal structures, the HA subtypes have been divided into two main groups and four smaller clades. The different HA subtypes do not necessarily share strong amino acid sequence identity, but the overall 3D structures of the different HA subtypes are similar to one another, with several subtle differences that can be used for classification purposes. For example, the particular orientation of the membrane-distal subdomains in relation to a central α-helix is one structural characteristic commonly used to determine HA subtype (Russell et al., Virology, 325:287, 2004).
HA exists in the membrane as a homotrimer of one of 16 subtypes, termed H1-H16. Only three of these subtypes (H1, H2, and H3) have thus far become adapted for human infection. One reported characteristic of HAs that have adapted to infect humans (e.g., of HAs from the pandemic H1N1 (1918) and H3N2 (1967-68) influenza subtypes) is their ability to preferentially bind to α2-6 sialylated glycans in comparison with their avian progenitors that preferentially bind to α2-3 sialylated glycans (Skehel & Wiley, Annu Rev Biochem, 69:531, 2000; Rogers & Paulson, Virology, 127:361, 1983; Rogers et al., Nature, 304:76, 1983; Sauter et al., Biochemistry, 31:9609, 1992; Connor et al., Virology, 205:17, 1994; Tumpey et al., Science, 310:77, 2005). The present invention, however, encompasses the recognition that the ability to infect human hosts correlates less with binding to glycans of a particular linkage, and more with binding to glycans of a particular topology. Thus, the present invention demonstrates that HAs that mediate infection of humans bind to umbrella topology glycans, often showing preference for umbrella topology glycans over cone topology glycans (even though cone-topology glycans may be α2-6 sialylated glycans).
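The naming convention described above, in which a strain designation combines the HA and NA subtype numbers (e.g., H5N1), can be parsed and range-checked as follows. This is a minimal sketch that assumes the 16 HA and 9 NA subtypes stated above as the valid ranges; the names `SUBTYPE` and `parse_subtype` are hypothetical.

```python
import re

# Designations of the form H<number>N<number>, e.g., "H5N1".
SUBTYPE = re.compile(r"H(\d{1,2})N(\d)")


def parse_subtype(name):
    """Return (HA subtype, NA subtype) as integers, or None if out of range.

    Assumption: valid ranges are H1-H16 and N1-N9, as stated in the text.
    """
    match = SUBTYPE.fullmatch(name)
    if match is None:
        return None
    ha, na = int(match.group(1)), int(match.group(2))
    if 1 <= ha <= 16 and 1 <= na <= 9:
        return ha, na
    return None


print(parse_subtype("H5N1"))
print(parse_subtype("H17N1"))
```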
Several crystal structures of HAs from H1 (human and swine), H3 (avian) and H5 (avian) subtypes bound to sialylated oligosaccharides (of both α2-3 and α2-6 linkages) are available and provide molecular insights into the specific amino acids that are involved in distinct interactions of the HAs with these glycans (Eisen et al., Virology, 232:19, 1997; Ha et al., Proc Natl Acad Sci USA, 98:11181, 2001; Ha et al., Virology, 309:209, 2003; Gamblin et al., Science, 303:1838, 2004; Stevens et al., Science, 303:1866, 2004; Russell et al., Glycoconj J 23:85, 2006; Stevens et al., Science, 312:404, 2006).
For example, the crystal structures of H5 (A/duck/Singapore/3/97) alone or bound to an α2-3 or an α2-6 sialylated oligosaccharide identify certain amino acids that interact directly with bound glycans, and also amino acids that are one or more degrees of separation removed (Ha et al., Proc Natl Acad Sci USA 98:11181, 2001). In some cases, conformation of these residues is different in bound versus unbound states. For instance, Glu190, Lys193 and Gln226 all participate in direct-binding interactions and have different conformations in the bound versus the unbound state. The conformation of Asn186, which is proximal to Glu190, is also significantly different in the bound versus the unbound state.
As noted above, the present invention encompasses the finding that binding to umbrella topology glycans correlates with ability to mediate infection of particular hosts, including for example, humans. Accordingly, the present invention provides HA polypeptides that bind to umbrella glycans. In certain embodiments, inventive HA polypeptides bind to umbrella glycans with high affinity. In certain embodiments, inventive HA polypeptides bind to a plurality of different umbrella topology glycans, often with high affinity and/or specificity.
In some embodiments, inventive HA polypeptides bind to umbrella topology glycans (e.g., long α2-6 sialylated glycans such as, for example, Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4GlcNAc-) with high affinity. For example, in some embodiments, inventive HA polypeptides bind to umbrella topology glycans with an affinity comparable to that observed for a wild type HA that mediates infection of humans (e.g., H1N1 HA or H3N2 HA). In some embodiments, inventive HA polypeptides bind to umbrella glycans with an affinity that is at least 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% of that observed under comparable conditions for a wild type HA that mediates infection of humans. In some embodiments, inventive HA polypeptides bind to umbrella glycans with an affinity that is greater than that observed under comparable conditions for a wild type HA that mediates infection of humans.
In certain embodiments, binding affinity of inventive HA polypeptides is assessed over a range of concentrations. Such a strategy provides significantly more information, particularly in multivalent binding assays, than do single-concentration analyses. In some embodiments, for example, binding affinities of inventive HA polypeptides are assessed over concentrations ranging over at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more fold.
In certain embodiments, inventive HA polypeptides show high affinity if they show a saturating signal in a multivalent glycan array binding assay such as those described herein. In some embodiments, inventive HA polypeptides show high affinity if they show a signal above about 400000 or more (e.g., above about 500000, 600000, 700000, 800000, etc.) in such studies. In some embodiments, HA polypeptides show saturating binding to umbrella glycans over a concentration range of at least 2 fold, 3 fold, 4 fold, 5 fold or more, and in some embodiments over a concentration range as large as 10 fold or more.
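As an illustration of assessing binding over a range of concentrations, the fold range over which an array signal remains above a chosen threshold can be computed from a dose series. The sketch below is not a description of any assay protocol. It assumes, as a simplification, that "saturating" can be approximated by a signal above a fixed threshold (400000 by default, following the figure recited above) and that concentrations are positive numbers in arbitrary units; the function name and the example readings are hypothetical.

```python
def saturating_fold_range(readings, threshold=400_000):
    """Largest fold range of consecutive concentrations with signal above threshold.

    readings: iterable of (concentration, signal) pairs; concentrations must be
              positive (hypothetical input, arbitrary units).
    Returns 0.0 if no reading exceeds the threshold, and 1.0 if only isolated
    single concentrations do.
    """
    best = 0.0
    run_start = None
    for concentration, signal in sorted(readings):
        if signal > threshold:
            if run_start is None:
                run_start = concentration   # start of a new above-threshold run
            best = max(best, concentration / run_start)
        else:
            run_start = None                # run is broken by a low signal
    return best


# Made-up dose series for illustration only.
example = [(5, 100_000), (10, 450_000), (20, 500_000), (40, 520_000)]
print(saturating_fold_range(example))
```

In this made-up series the signal stays above the threshold from 10 to 40 concentration units, i.e., over a 4 fold range.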
Furthermore, in some embodiments, inventive HA polypeptides bind to umbrella topology glycans more strongly than they bind to cone topology glycans. In some embodiments, inventive HA polypeptides show a relative affinity for umbrella glycans vs cone glycans that is about 10, 9, 8, 7, 6, 5, 4, 3, or 2.
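The comparisons described in this passage (affinity relative to a wild type HA, and relative affinity for umbrella versus cone glycans) are simple ratios. The sketch below is illustrative only and assumes that affinity is expressed as a quantity for which larger values mean tighter binding (e.g., a binding signal measured under comparable conditions), rather than as a dissociation constant; the function names and numeric inputs are hypothetical.

```python
def percent_of_wild_type(candidate_affinity, wild_type_affinity):
    """Candidate affinity as a percentage of the wild type affinity."""
    if wild_type_affinity <= 0:
        raise ValueError("wild type affinity must be positive")
    return 100.0 * candidate_affinity / wild_type_affinity


def umbrella_to_cone_ratio(umbrella_affinity, cone_affinity):
    """Relative affinity for umbrella glycans versus cone glycans."""
    if cone_affinity <= 0:
        raise ValueError("cone affinity must be positive")
    return umbrella_affinity / cone_affinity


# Made-up values for illustration only.
print(percent_of_wild_type(300_000, 400_000))
print(umbrella_to_cone_ratio(600_000, 150_000))
```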
In some embodiments, inventive HA polypeptides bind to α2-6 sialylated glycans; in some embodiments, inventive HA polypeptides bind preferentially to α2-6 sialylated glycans. In certain embodiments, inventive HA polypeptides bind to a plurality of different α2-6 sialylated glycans. In some embodiments, inventive HA polypeptides are not able to bind to α2-3 sialylated glycans, and in other embodiments inventive HA polypeptides are able to bind to α2-3 sialylated glycans.
In some embodiments, inventive HA polypeptides bind to receptors found on human upper respiratory epithelial cells. In certain embodiments, inventive HA polypeptides bind to HA receptors in the bronchus and/or trachea. In some embodiments, inventive HA polypeptides are not able to bind receptors in the deep lung, and in other embodiments, inventive HA polypeptides are able to bind receptors in the deep lung.
In some embodiments, inventive HA polypeptides bind to at least about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or more of the glycans found on HA receptors in human upper respiratory tract tissues (e.g., epithelial cells).
In some embodiments, inventive HA polypeptides bind to one or more of the glycans illustrated in
The present invention provides isolated HA polypeptides with designated binding specificity, and also provides engineered HA polypeptides with designated binding characteristics with respect to umbrella glycans.
In some embodiments, inventive HA polypeptides with designated binding characteristics are H1 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H2 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H3 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H4 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H5 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H6 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H7 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H8 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H9 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H10 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H11 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H12 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H13 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H14 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H15 polypeptides. In some embodiments, inventive HA polypeptides with designated binding characteristics are H16 polypeptides.
In some embodiments, inventive HA polypeptides with designated binding characteristics are not H1 polypeptides, are not H2 polypeptides, and/or are not H3 polypeptides.
In some embodiments, inventive HA polypeptides do not include the H1 protein from any of the strains: A/South Carolina/1/1918; A/Puerto Rico/8/1934; A/Taiwan/1/1986; A/Texas/36/1991; A/Beijing/262/1995; A/Johannesburg/92/1996; A/New Caledonia/20/1999; A/Solomon Islands/3/2006.
In some embodiments, inventive HA polypeptides are not the H2 protein from any of the strains of the Asian flu epidemic of 1957-58. In some embodiments, inventive HA polypeptides do not include the H2 protein from any of the strains: A/Japan/305+/1957; A/Singapore/1/1957; A/Taiwan/1/1964; A/Taiwan/1/1967.
In some embodiments, inventive HA polypeptides do not include the H3 protein from any of the strains: A/Aichi/2/1968; A/Philippines/2/1982; A/Mississippi/1/1985; A/Leningrad/360/1986; A/Sichuan/2/1987; A/Shanghai/11/1987; A/Beijing/353/1989; A/Shandong/9/1993; A/Johannesburg/33/1994; A/Nanchang/813/1995; A/Sydney/5/1997; A/Moscow/10/1999; A/Panama/2007/1999; A/Wyoming/3/2003; A/Oklahoma/323/2003; A/California/7/2004; A/Wisconsin/65/2005.
In certain embodiments, an HA polypeptide is a variant of a parent HA polypeptide in that its amino acid sequence is identical to that of the parent HA but for a small number of particular sequence alterations. In some embodiments, the parent HA is an HA polypeptide found in a natural isolate of an influenza virus (e.g., a wild type HA polypeptide).
In some embodiments, inventive HA polypeptide variants have different glycan binding characteristics than their corresponding parent HA polypeptides. In some embodiments, inventive HA variant polypeptides have greater affinity and/or specificity for umbrella glycans (e.g., as compared with cone glycans) than do their cognate parent HA polypeptides. In certain embodiments, such HA polypeptide variants are engineered variants.
In some embodiments, HA polypeptide variants with altered glycan binding characteristics have sequence alterations in residues within or affecting the glycan binding site. In some embodiments, such substitutions are of amino acids that interact directly with bound glycan; in other embodiments, such substitutions are of amino acids that are one degree of separation removed from those that interact with bound glycan, in that the one degree of separation removed-amino acids either (1) interact with the direct-binding amino acids; (2) otherwise affect the ability of the direct-binding amino acids to interact with glycan, but do not interact directly with glycan themselves; or (3) otherwise affect the ability of the direct-binding amino acids to interact with glycan, and also interact directly with glycan themselves. Inventive HA polypeptide variants contain substitutions of one or more direct-binding amino acids, one or more first degree of separation-amino acids, one or more second degree of separation-amino acids, or any combination of these. In some embodiments, inventive HA polypeptide variants may contain substitutions of one or more amino acids with even higher degrees of separation.
In some embodiments, HA polypeptide variants with altered glycan binding characteristics have sequence alterations in residues that make contact with sugars beyond Neu5Ac and Gal (see, for example,
In some embodiments, HA polypeptide variants have at least one amino acid substitution, as compared with a wild type parent HA. In certain embodiments, inventive HA polypeptide variants have at least two, three, four, five or more amino acid substitutions as compared with a cognate wild type parent HA; in some embodiments inventive HA polypeptide variants have two, three, or four amino acid substitutions. In some embodiments, all such amino acid substitutions are located within the glycan binding site.
In some embodiments, HA polypeptide variants have sequence substitutions at positions corresponding to one or more of residues 137, 145, 156, 159, 186, 187, 189, 190, 192, 193, 196, 222, 225, 226, and 228. In some embodiments, HA polypeptide variants have sequence substitutions at positions corresponding to one or more of residues 156, 159, 189, 192, 193, and 196; and/or at positions corresponding to one or more of residues 186, 187, 189, and 190; and/or at positions corresponding to one or more of residues 190, 222, 225, and 226; and/or at positions corresponding to one or more of residues 137, 145, 190, 226 and 228. In some embodiments, HA polypeptide variants have sequence substitutions at positions corresponding to one or more of residues 190, 225, 226, and 228.
In certain embodiments, HA polypeptide variants, and particularly H5 polypeptide variants, have one or more amino acid substitutions relative to a wild type parent HA (e.g., H5) at residues selected from the group consisting of residues 98, 136, 138, 153, 155, 159, 183, 186, 187, 190, 193, 194, 195, 222, 225, 226, 227, and 228. In other embodiments, HA polypeptide variants, and particularly H5 polypeptide variants, have one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids located in the region of the receptor that directly binds to the glycan, including but not limited to residues 98, 136, 153, 155, 183, 190, 193, 194, 222, 225, 226, 227, and 228. In further embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids located adjacent to the region of the receptor that directly binds the glycan, including but not limited to residues 98, 138, 186, 187, 195, and 228.
In some embodiments, an inventive HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from the group consisting of residues 138, 186, 187, 190, 193, 222, 225, 226, 227 and 228. In other embodiments, an inventive HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids located in the region of the receptor that directly binds to the glycan, including but not limited to residues 190, 193, 222, 225, 226, 227, and 228. In further embodiments, an inventive HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids located adjacent to the region of the receptor that directly binds the glycan, including but not limited to residues 138, 186, 187, and 228.
In further embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from the group consisting of residues 98, 136, 153, 155, 183, 194, and 195. In other embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids located in the region of the receptor that directly binds to the glycan, including but not limited to residues 98, 136, 153, 155, 183, and 194. In further embodiments, an inventive HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids located adjacent to the region of the receptor that directly binds the glycan, including but not limited to residues 98 and 195.
In certain embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids that are one degree of separation removed from those that interact with bound glycan, in that the one degree of separation removed-amino acids either (1) interact with the direct-binding amino acids; (2) otherwise affect the ability of the direct-binding amino acids to interact with glycan, but do not interact directly with glycan themselves; or (3) otherwise affect the ability of the direct-binding amino acids to interact with glycan, and also interact directly with glycan themselves, including but not limited to residues 98, 138, 186, 187, 195, and 228.
In further embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids that are one degree of separation removed from those that interact with bound glycan, in that the one degree of separation removed-amino acids either (1) interact with the direct-binding amino acids; (2) otherwise affect the ability of the direct-binding amino acids to interact with glycan, but do not interact directly with glycan themselves; or (3) otherwise affect the ability of the direct-binding amino acids to interact with glycan, and also interact directly with glycan themselves, including but not limited to residues 138, 186, 187, and 228.
In further embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from amino acids that are one degree of separation removed from those that interact with bound glycan, in that the one degree of separation removed-amino acids either (1) interact with the direct-binding amino acids; (2) otherwise affect the ability of the direct-binding amino acids to interact with glycan, but do not interact directly with glycan themselves; or (3) otherwise affect the ability of the direct-binding amino acids to interact with glycan, and also interact directly with glycan themselves, including but not limited to residues 98 and 195.
In certain embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has an amino acid substitution relative to a wild type parent HA at residue 159.
In other embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from 190, 193, 225, and 226. In some embodiments, an HA polypeptide variant, and particularly an H5 polypeptide variant, has one or more amino acid substitutions relative to a wild type parent HA at residues selected from 190, 193, 226, and 228.
In some embodiments, an inventive HA polypeptide variant, and particularly an H5 variant, has one or more of the following amino acid substitutions: Ser137Ala, Lys156Glu, Asn186Pro, Asp187Ser, Asp187Thr, Ala189Gln, Ala189Lys, Ala189Thr, Glu190Asp, Glu190Thr, Lys193Arg, Lys193Asn, Lys193His, Lys193Ser, Gly225Asp, Gln226Ile, Gln226Leu, Gln226Val, Ser227Ala, Gly228Ser.
In some embodiments, an inventive HA polypeptide variant, and particularly an H5 variant has one or more of the following sets of amino acid substitutions:
Glu190Asp, Lys193Ser, Gly225Asp and Gln226Leu;
Glu190Asp, Lys193Ser, Gln226Leu and Gly228Ser;
Ala189Gln, Lys193Ser, Gln226Leu, Gly228Ser;
Asp187Ser/Thr, Ala189Gln, Lys193Ser, Gln226Leu, Gly228Ser;
Ala189Lys, Lys193Asn, Gln226Leu, Gly228Ser;
Asp187Ser/Thr, Ala189Lys, Lys193Asn, Gln226Leu, Gly228Ser;
Lys156Glu, Ala189Lys, Lys193Asn, Gln226Leu, Gly228Ser;
Lys193His, Gln226Leu/Ile/Val, Gly228Ser;
Lys193Arg, Gln226Leu/Ile/Val, Gly228Ser;
Ala189Lys, Lys193Asn, Gly225Asp;
Lys156Glu, Ala189Lys, Lys193Asn, Gly225Asp;
Ser137Ala, Lys156Glu, Ala189Lys, Lys193Asn, Gly225Asp;
Glu190Thr, Lys193Ser, Gly225Asp;
Asp187Thr, Ala189Thr, Glu190Asp, Lys193Ser, Gly225Asp;
Asn186Pro, Asp187Thr, Ala189Thr, Glu190Asp, Lys193Ser, Gly225Asp;
Asn186Pro, Asp187Thr, Ala189Thr, Glu190Asp, Lys193Ser, Gly225Asp, Ser227Ala.
In some such embodiments, the HA polypeptide has at least one further substitution as compared with a wild type HA, such that affinity and/or specificity of the variant for umbrella glycans is increased.
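For purposes of illustration only, substitution sets written in the notation used above (e.g., Glu190Asp, or Gln226Leu/Ile/Val where alternative replacement residues are contemplated) can be converted into a machine-readable form. The following Python sketch is not part of the original disclosure; the function names are hypothetical, and the sketch only parses the notation. It does not map H3-referenced positions onto any particular HA sequence, which would require a sequence alignment.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Parent residue, position, then one or more slash-separated replacements.
_PATTERN = re.compile(
    r"^([A-Z][a-z]{2})(\d+)((?:[A-Z][a-z]{2})(?:/[A-Z][a-z]{2})*)$"
)


def parse_substitution(text):
    """Return (parent residue, position, list of replacement residues)."""
    match = _PATTERN.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized substitution: {text!r}")
    parent, position, replacements = match.groups()
    return (
        THREE_TO_ONE[parent],
        int(position),
        [THREE_TO_ONE[r] for r in replacements.split("/")],
    )


def parse_substitution_set(text):
    """Parse one listed set, e.g. 'Glu190Asp, Lys193Ser, Gly225Asp and Gln226Leu;'."""
    cleaned = text.strip().rstrip(";.")
    # Items are separated by commas or by the word "and".
    parts = re.split(r",\s*|\s+and\s+", cleaned)
    return [parse_substitution(p) for p in parts if p.strip()]


print(parse_substitution_set("Lys193His, Gln226Leu/Ile/Val, Gly228Ser;"))
```

Keeping the alternative replacement residues as a list preserves the "Leu/Ile/Val" shorthand rather than expanding it into separate sets.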
In some embodiments, inventive HA polypeptides (including HA polypeptide variants) have sequences that include D190, D225, L226, and/or S228. In some embodiments, inventive HA polypeptides have sequences that include D190 and D225; in some embodiments, inventive HA polypeptides have sequences that include L226 and S228.
In some embodiments, inventive HA polypeptide variants have an open binding site as compared with a parent HA, and particularly with a parent wild type HA.
The present invention further provides characteristic portions of inventive HA polypeptides and nucleic acids that encode them. In general, a characteristic portion is one that contains a continuous stretch of amino acids, or a collection of continuous stretches of amino acids, that together are characteristic of the HA polypeptide. Each such continuous stretch generally will contain at least two amino acids. Furthermore, those of ordinary skill in the art will appreciate that typically at least 5, 10, 15, 20 or more amino acids are required to be characteristic of an H5 HA polypeptide. In general, a characteristic portion is one that, in addition to the sequence identity specified above, shares at least one functional characteristic with the relevant intact HA polypeptide. In some embodiments, inventive characteristic portions of HA polypeptides share glycan binding characteristics with the relevant full-length HA polypeptides.
Inventive HA polypeptides, and/or characteristic portions thereof, or nucleic acids encoding them, may be produced by any available means.
Inventive HA polypeptides (or characteristic portions) may be produced, for example, by utilizing a host cell system engineered to express an inventive HA-polypeptide-encoding nucleic acid.
Any system can be used to produce HA polypeptides (or characteristic portions), such as egg, baculovirus, plant, yeast, Madin-Darby Canine Kidney cells (MDCK), or Vero (African green monkey kidney) cells. Alternatively or additionally, HA polypeptides (or characteristic portions) can be expressed in cells using recombinant techniques, such as through the use of an expression vector (Sambrook et al., Molecular Cloning: A Laboratory Manual, CSHL Press, 1989).
Alternatively or additionally, inventive HA polypeptides (or characteristic portions thereof) can be produced by synthetic means.
Alternatively or additionally, inventive HA polypeptides (or characteristic portions thereof) may be produced in the context of intact virus, whether otherwise wild type, attenuated, killed, etc. Inventive HA polypeptides, or characteristic portions thereof, may also be produced in the context of virus like particles.
In some embodiments, HA polypeptides (or characteristic portions thereof) can be isolated and/or purified from influenza virus. For example, virus may be grown in eggs, such as embryonated hen eggs, in which case the harvested material is typically allantoic fluid. Alternatively or additionally, influenza virus may be derived from any method using tissue culture to grow the virus. Suitable cell substrates for growing the virus include, for example, dog kidney cells such as MDCK or cells from a clone of MDCK, MDCK-like cells, monkey kidney cells such as AGMK cells including Vero cells, cultured epithelial cells as continuous cell lines, 293T cells, BK-21 cells, CV-1 cells, or any other mammalian cell type suitable for the production of influenza virus for vaccine purposes, readily available from commercial sources (e.g., ATCC, Rockville, Md.). Suitable cell substrates also include human cells such as MRC-5 cells. Suitable cell substrates are not limited to cell lines; for example primary cells such as chicken embryo fibroblasts are also included.
Also, it will be appreciated by those of ordinary skill in the art that HA polypeptides, and particularly variant HA polypeptides as described herein, may be generated, identified, isolated, and/or produced by culturing cells or organisms that produce the HA (whether alone or as part of a complex, including as part of a virus particle or virus), under conditions that allow ready screening and/or selection of HA polypeptides capable of binding to umbrella-topology glycans. To give but one example, in some embodiments, it may be useful to produce and/or study a collection of HA variants under conditions that reveal and/or favor those variants that bind to umbrella topology glycans (e.g., with particular specificity and/or affinity). In some embodiments, such a collection of HA variants results from evolution in nature. In some embodiments, such a collection of HA variants results from engineering. In some embodiments, such a collection of HA variants results from a combination of engineering and natural evolution.
HA interacts with the surface of cells by binding to a glycoprotein receptor. Binding of HA to HA receptors is predominantly mediated by N-linked glycans on the HA receptors. Specifically, HA on the surface of flu virus particles recognizes sialylated glycans that are associated with HA receptors on the surface of the cellular host. After recognition and binding, the host cell engulfs the virus particle and the virus is able to replicate and produce many more virus particles to be distributed to neighboring cells.
HA receptors are modified by either α2-3 or α2-6 sialylated glycans near the receptor's HA-binding site, and the type of linkage of the receptor-bound glycan affects the conformation of the receptor's HA-binding site, thus affecting the receptor's specificity for different HAs.
For example, the glycan binding pocket of avian HA is narrow. According to the present invention, this pocket binds to the trans conformation of α2-3 sialylated glycans, and/or to cone-topology glycans, whether α2-3 or α2-6 linked.
HA receptors in avian tissues, and also in human deep lung and gastrointestinal (GI) tract tissues are characterized by α2-3 sialylated glycan linkages, and furthermore (according to the present invention), are characterized by glycans, including α2-3 sialylated and/or α2-6 sialylated glycans, which predominantly adopt cone topologies.
By contrast, human HA receptors in the bronchus and trachea of the upper respiratory tract are modified by α2-6 sialylated glycans. Unlike the α2-3 motif, the α2-6 motif has an additional degree of conformational freedom due to the C6-C5 bond (Russell et al., Glycoconj J 23:85, 2006). HAs that bind to such α2-6 sialylated glycans have a more open binding pocket to accommodate the diversity of structures arising from this conformational freedom. Moreover, according to the present invention, HAs may need to bind to glycans (e.g., α2-6 sialylated glycans) in an umbrella topology, and particularly may need to bind to such umbrella topology glycans with strong affinity and/or specificity, in order to effectively mediate infection of human upper respiratory tract tissues.
As a result of these spatially restricted glycosylation profiles, humans are not usually infected by viruses containing many wild type avian HAs (e.g., avian H5). Specifically, because the portions of the human respiratory tract that are most likely to encounter virus (i.e., the trachea and bronchi) lack receptors with cone glycans (e.g., α2-3 sialylated glycans, and/or short glycans) and wild type avian HAs typically bind primarily or exclusively to receptors associated with cone glycans (e.g., α2-3 sialylated glycans, and/or short glycans), humans rarely become infected with avian viruses. Only when in sufficiently close contact with virus that it can access the deep lung and/or gastrointestinal tract receptors having cone glycans (e.g., α2-3 sialylated glycans, and/or short glycans) do humans become infected.
To rapidly expand the current knowledge of specific interactions between glycans and glycan binding proteins (GBPs), the Consortium for Functional Glycomics (CFG; www.functionalglycomics.org), an international collaborative research initiative, has developed glycan arrays comprising several glycan structures that have enabled high throughput screening of GBPs for novel glycan ligand specificities. The glycan arrays comprise both monovalent and polyvalent glycan motifs (i.e., attached to a polyacrylamide backbone), and each array comprises 264 glycans at low (10 µM) and high (100 µM) concentrations, with six spots for each concentration (see http://www.functionalglycomics.org/static/consortium/resources/resourcecoreh5.shtml).
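By way of a non-limiting worked example, the array layout just described implies the number of discrete spots per array, and the six replicate spots lend themselves to a simple summary statistic. The following Python sketch assumes that the six spots are replicates of each glycan at each concentration; the function name and the use of mean and standard deviation are illustrative assumptions rather than a description of the CFG analysis pipeline.

```python
from statistics import mean, stdev

N_GLYCANS = 264                 # glycans per array, as stated above
CONCENTRATIONS_UM = (10, 100)   # low and high concentrations, in micromolar
REPLICATES = 6                  # spots per glycan per concentration (assumed replicates)

# Total number of discrete spots implied by this layout.
total_spots = N_GLYCANS * len(CONCENTRATIONS_UM) * REPLICATES


def summarize_replicates(signals):
    """Return (mean, sample standard deviation) for one glycan at one concentration."""
    if len(signals) != REPLICATES:
        raise ValueError(f"expected {REPLICATES} replicate signals, got {len(signals)}")
    return mean(signals), stdev(signals)


print(total_spots)
```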
The arrays predominantly comprise synthetic glycans that capture the physiological diversity of N- and O-linked glycans. In addition to the synthetic glycans, N-linked glycan mixtures derived from different mammalian glycoproteins are also represented on the array.
As used herein, a glycan “array” refers to a set of one or more glycans, optionally immobilized on a solid support. In some embodiments, an “array” is a collection of glycans present as an organized arrangement or pattern at two or more locations that are physically separated in space. Typically, a glycan array will have at least 4, 8, 16, 24, 48, 96 or several hundred or thousand discrete locations. In general, inventive glycan arrays may have any of a variety of formats. Various different array formats applicable to biomolecules are known in the art. For example, a huge number of protein and/or nucleic acid arrays are well known. Those of ordinary skill in the art will immediately appreciate standard array formats appropriate for glycan arrays of the present invention.
In some embodiments, inventive glycan arrays are present in “microarray” formats. A microarray may typically have sample locations separated by a distance of 50-200 microns or less and immobilized sample in the nano to micromolar range or nano to picogram range. Array formats known in the art include, for example, those in which each discrete sample location has a scale of, for example, ten microns.
In some embodiments, inventive glycan arrays comprise a plurality of glycans spatially immobilized on a support. The present invention provides glycan molecules arrayed on a support. As used herein, “support” refers to any material which is suitable to be used to array glycan molecules. As will be appreciated by those of ordinary skill in the art, any of a wide variety of materials may be employed. To give but a few examples, support materials which may be of use in the invention include hydrophobic membranes, for example, nitrocellulose, PVDF or nylon membranes. Such membranes are well known in the art and can be obtained from, for example, Bio-Rad, Hemel Hempstead, UK.
In further embodiments, the support on which glycans are arrayed may comprise a metal oxide. Suitable metal oxides include, but are not limited to, titanium oxide, tantalum oxide, and aluminium oxide. Examples of such materials may be obtained from Sigma-Aldrich Company Ltd, Fancy Road, Poole, Dorset. BH12 4QH UK.
In yet further embodiments, such a support is or comprises a metal oxide gel. A metal oxide gel is considered to provide a large surface area within a given macroscopic area to aid immobilization of the carbohydrate-containing molecules.
Additional or alternative support materials which may be used in accordance with the present invention include gels, for example silica gels or aluminum oxide gels. Examples of such materials may be obtained from, for example, Merck KGaA, Darmstadt, Germany.
In some embodiments of the invention, glycan arrays are immobilized on a support that can resist change in size or shape during normal use. For example, a support may be a glass slide coated with a component material suitable to be used to array glycans. Also, some composite materials can desirably provide solidity to a support.
As demonstrated herein, inventive arrays are useful for the identification and/or characterization of different HA polypeptides and their binding characteristics. In certain embodiments, inventive HA polypeptides are tested on such arrays to assess their ability to bind to umbrella topology glycans (e.g., to α2-6 sialylated glycans, and particularly to long α2-6 sialylated glycans arranged in an umbrella topology).
Indeed, the present invention provides arrays of α2-6 sialylated glycans, and optionally α2-3 sialylated glycans, that can be used to characterize HA polypeptide binding capabilities and/or as a diagnostic to detect, for example, human-binding HA polypeptides. In some embodiments, inventive arrays contain glycans (e.g., α2-6 sialylated glycans, and particularly long α2-6 sialylated glycans) in an umbrella topology. As will be clear to those of ordinary skill in the art, such arrays are useful for characterizing or detecting any HA polypeptides, including for example, those found in natural influenza isolates in addition to those designed and/or prepared by researchers.
In some embodiments, such arrays include glycans representative of about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or more of the glycans (e.g., the umbrella glycans, which will often be α2-6 sialylated glycans, particularly long α2-6 sialylated glycans) found on human HA receptors, and particularly on human upper respiratory tract HA receptors. In some embodiments, inventive arrays include some or all of the glycan structures depicted in
The present invention provides methods for identifying or characterizing HA proteins using glycan arrays. In some embodiments, for example, such methods comprise steps of (1) providing a sample containing HA polypeptide, (2) contacting the sample with a glycan array, and (3) detecting binding of HA polypeptide to one or more glycans on the array.
Suitable sources for samples containing HA polypeptides to be contacted with glycan arrays according to the present invention include, but are not limited to, pathological samples, such as blood, serum/plasma, peripheral blood mononuclear cells/peripheral blood lymphocytes (PBMC/PBL), sputum, urine, feces, throat swabs, dermal lesion swabs, cerebrospinal fluids, cervical smears, pus samples, food matrices, and tissues from various parts of the body such as brain, spleen, and liver. Alternatively or additionally, other suitable sources for samples containing HA polypeptides include, but are not limited to, environmental samples such as soil, water, and flora. Yet other samples include laboratory samples, for example of engineered HA polypeptides designed and/or prepared by researchers. Other samples that have not been listed may also be applicable.
A wide variety of detection systems suitable for assaying HA polypeptide binding to inventive glycan arrays are known in the art. For example, HA polypeptides can be detectably labeled (directly or indirectly) prior to or after being contacted with the array; binding can then be detected by detection of localized label. In some embodiments, scanning devices can be utilized to examine particular locations on an array.
Alternatively or additionally, binding to arrayed glycans can be measured using, for example, colorimetric, fluorescence, or radioactive detection systems, or other labeling methods, or other methods that do not require labeling. In general, fluorescent detection typically involves directly probing the array with a fluorescent molecule and monitoring fluorescent signals. Alternatively or additionally, arrays can be probed with a molecule that is tagged (for example, with biotin) for indirect fluorescence detection (in this case, by testing for binding of fluorescently-labeled streptavidin). Alternatively or additionally, fluorescence quenching methods can be utilized in which the arrayed glycans are fluorescently labeled and probed with a test molecule (which may or may not be labeled with a different fluorophore). In such embodiments, binding to the array acts to quench the fluorescence emitted from the arrayed glycan, so that binding is detected by loss of fluorescent emission. Alternatively or additionally, arrayed glycans can be probed with a live tissue sample that has been grown in the presence of a radioactive substance, yielding a radioactively labeled probe. Binding in such embodiments can be detected by measuring radioactive emission.
Such methods are useful to determine the fact of binding and/or the extent of binding by HA polypeptides to inventive glycan arrays. In some embodiments of the invention, such methods can further be used to identify and/or characterize agents that interfere with or otherwise alter glycan-HA polypeptide interactions.
Methods described below may be of particular use in, for example, identifying whether a molecule thought to be capable of interacting with a carbohydrate can actually do so, or to identify whether a molecule unexpectedly has the capability of interacting with a carbohydrate.
The present invention also provides methods of using inventive arrays, for example, to detect a particular agent in a test sample. For instance, such methods may comprise steps of (1) contacting a glycan array with a test sample (e.g., with a sample thought to contain an HA polypeptide); and, (2) detecting the binding of any agent in the test sample to the array.
Yet further, binding to inventive arrays may be utilized, for example, to determine kinetics of interaction between binding agent and glycan. For example, inventive methods for determining interaction kinetics may include steps of (1) contacting a glycan array with the molecule being tested; and, (2) measuring kinetics of interaction between the binding agent and arrayed glycan(s).
The kinetics of interaction of a binding agent with any of the glycans in an inventive array can be measured by real time changes in, for example, colorimetric or fluorescent signals, as detailed above. Such methods may be of particular use in, for example, determining whether a particular binding agent is able to interact with a specific carbohydrate with a higher degree of binding than does a different binding agent interacting with the same carbohydrate.
It will be appreciated, of course, that glycan binding by inventive HA polypeptides can be evaluated on glycan samples or sources not present in an array format per se. For example, inventive HA polypeptides can be bound to tissue samples and/or cell lines to assess their glycan binding characteristics. Appropriate cell lines include, for example, any of a variety of mammalian cell lines, particularly those expressing HA receptors containing umbrella topology glycans (e.g., at least some of which may be α2-6 sialylated glycans, and particularly long α2-6 sialylated glycans). In some embodiments, utilized cell lines express individual glycans with umbrella topology. In some embodiments, utilized cell lines express a diversity of glycans. In some embodiments, cell lines are obtained from clinical isolates; in some they are maintained or manipulated to have a desired glycan distribution and/or prevalence. In some embodiments, tissue samples and/or cell lines express glycans characteristic of mammalian upper respiratory epithelial cells.
As discussed here, according to the present invention, HA polypeptides can be identified and/or characterized by mining data from glycan binding studies, structural information (e.g., HA crystal structures), and/or protein structure prediction programs.
The main steps involved in the particular data mining process utilized by the present inventors (and exemplified herein) are illustrated in
The data mining platform utilized herein comprised software modules that interact with each other (
Representative features extracted from the glycans on the glycan array are listed in Table 1.
The rationale behind choosing these particular features shown was that glycan binding sites on GBPs typically accommodate di-tetra-saccharides. A tree based representation was used to capture the information on monosaccharides and linkages in the glycan structures (root of the tree at the reducing end). This representation facilitated the abstraction of various features including higher order features such as connected set of monosaccharide triplets, etc (
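To illustrate the tree based representation described above, the following Python sketch stores a glycan as a tree rooted at the reducing end and extracts connected monosaccharide pairs and triplets from it. It is a minimal sketch under stated assumptions and is not the data mining platform utilized by the present inventors: the class name, function names, and feature string format (non-reducing end written first, branches in square brackets) are hypothetical choices made for illustration.

```python
from dataclasses import dataclass, field
from itertools import combinations


@dataclass
class Node:
    sugar: str                  # monosaccharide name, e.g. "GlcNAc"
    linkage: str = ""           # linkage to the parent, e.g. "b4"; empty at the root
    children: list = field(default_factory=list)


def walk(node):
    """Yield every node in the tree, starting at the reducing-end root."""
    yield node
    for child in node.children:
        yield from walk(child)


def pairs(root):
    """Connected disaccharide features (child linked to parent)."""
    return sorted(
        f"{c.sugar}{c.linkage}{p.sugar}"
        for p in walk(root)
        for c in p.children
    )


def triplets(root):
    """Connected sets of three monosaccharides: linear chains and branch points."""
    found = []
    for p in walk(root):
        # Linear chains: grandchild -> child -> parent.
        for c in p.children:
            for g in c.children:
                found.append(f"{g.sugar}{g.linkage}{c.sugar}{c.linkage}{p.sugar}")
        # Branch points: two children attached to the same parent.
        for a, b in combinations(p.children, 2):
            found.append(f"{a.sugar}{a.linkage}[{b.sugar}{b.linkage}]{p.sugar}")
    return sorted(found)


# Hypothetical example: Gal(b4) and Fuc(a3) both attached to GlcNAc, which is b3-linked to Gal.
example = Node("Gal", children=[
    Node("GlcNAc", "b3", [Node("Gal", "b4"), Node("Fuc", "a3")]),
])

print(pairs(example))
print(triplets(example))
```

Rooting the tree at the reducing end means that every feature can be read off by following parent-to-child links, so higher order features follow the same traversal as pairs.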
Different types of classifiers have been developed and used in many applications. They fall primarily into three main categories: Mathematical Methods, Distance Methods and Logic Methods. These different methods and their advantages and disadvantages are discussed in detail in Weiss & Indurkhya (Predictive Data Mining: A Practical Guide. Morgan Kaufmann, San Francisco, 1998). For this specific application we chose a method called Rule Induction, which falls under Logic Methods. The Rule Induction classifier generates patterns in the form of IF-THEN rules.
One of the main advantages of the Logic Methods, and specifically of classifiers such as the Rule Induction method that generate IF-THEN rules, is that the results of the classifiers can be explained more easily than those of other statistical or mathematical methods. This allows one to explore the structural and biological significance of the rule or pattern discovered. An example rule generated using the features described earlier (Table 1) is: IF a Glycan contains “Galb4GlcNAcb3Gal[B]” and DOES NOT contain “Fuca3GlcNAc[B]”, THEN the Glycan will bind with higher affinity to Galectin 3. The specific Rule Induction algorithm that was used in this case is the one developed by Weiss & Indurkhya (Predictive Data Mining: A Practical Guide. Morgan Kaufmann, San Francisco, 1998).
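The form of such a rule can be sketched in a few lines. The sketch below is illustrative only and is not the Weiss & Indurkhya algorithm: it shows how an already-induced IF-THEN rule might be evaluated, not how rules are learned. It assumes each glycan is described simply by the set of feature labels it contains; the feature strings are treated as opaque labels, and the function name `rule_fires` and the three example glycans are hypothetical.

```python
def rule_fires(features, must_contain, must_not_contain):
    """IF every label in `must_contain` is present AND no label in
    `must_not_contain` is present, THEN the rule fires (returns True)."""
    return must_contain <= features and not (must_not_contain & features)


# The example rule quoted in the text, written as two sets of feature labels.
GALECTIN3_RULE = {
    "must_contain": {"Galb4GlcNAcb3Gal[B]"},
    "must_not_contain": {"Fuca3GlcNAc[B]"},
}

# Hypothetical glycans, each described only by the feature labels it contains.
glycans = {
    "glycan_1": {"Galb4GlcNAcb3Gal[B]"},
    "glycan_2": {"Galb4GlcNAcb3Gal[B]", "Fuca3GlcNAc[B]"},
    "glycan_3": {"Fuca3GlcNAc[B]"},
}

predicted = {name: rule_fires(f, **GALECTIN3_RULE) for name, f in glycans.items()}
for name, fires in predicted.items():
    print(name, "predicted higher affinity" if fires else "no prediction")
```

Keeping the rule as two plain sets is what makes it easy to read back as a structural statement about the glycan, which is the interpretability advantage noted above.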
A threshold that distinguished low affinity and high affinity binding was defined for each of the glycan array screening data sets.
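As a rough illustration of this thresholding step, the sketch below converts array signals into high and low affinity labels and then scores a rule's predictions against those labels. All names, signal values and the threshold of 1000 are hypothetical and chosen only for the example; the text does not state how thresholds were selected for each data set.

```python
def label_binders(signals, threshold):
    """Label each glycan 'high' if its signal meets the threshold, else 'low'."""
    return {g: ("high" if s >= threshold else "low") for g, s in signals.items()}


def rule_precision(predicted_high, labels):
    """Fraction of glycans predicted as high-affinity that are labelled 'high'."""
    if not predicted_high:
        return 0.0
    hits = [g for g in predicted_high if labels[g] == "high"]
    return len(hits) / len(predicted_high)


# Hypothetical binding signals (arbitrary units) for four glycans on one array.
signals = {"g1": 12000.0, "g2": 300.0, "g3": 8500.0, "g4": 950.0}
labels = label_binders(signals, threshold=1000.0)

# Hypothetical set of glycans for which some rule predicted high affinity.
predicted_high = {"g1", "g4"}
print(labels)
print(rule_precision(predicted_high, labels))
```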
In certain embodiments, the present invention provides nucleic acids which encode an HA polypeptide or a characteristic or biologically active portion of an HA polypeptide. In other embodiments, the invention provides nucleic acids which are complementary to nucleic acids which encode an HA polypeptide or a characteristic or biologically active portion of an HA polypeptide.
In other embodiments, the invention provides nucleic acid molecules which hybridize to nucleic acids encoding an HA polypeptide or a characteristic or biologically active portion of an HA polypeptide. Such nucleic acids can be used, for example, as primers or as probes. To give but a few examples, such nucleic acids can be used as primers in polymerase chain reaction (PCR), as probes for hybridization (including in situ hybridization), and/or as primers for reverse transcription-PCR (RT-PCR).
In certain embodiments, nucleic acids can be DNA or RNA, and can be single stranded or double-stranded. In some embodiments, inventive nucleic acids may include one or more non-natural nucleotides; in other embodiments, inventive nucleic acids include only natural nucleotides.
The present invention provides antibodies to inventive HA polypeptides. These may be monoclonal or polyclonal and may be prepared by any of a variety of techniques known to those of ordinary skill in the art (e.g., see Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988). For example, antibodies can be produced by cell culture techniques, including the generation of monoclonal antibodies, or via transfection of antibody genes into suitable bacterial or mammalian cell hosts, in order to allow for the production of recombinant antibodies.
In some embodiments, the present invention provides for pharmaceutical compositions including HA polypeptide(s), nucleic acids encoding such polypeptides, characteristic or biologically active fragments of such polypeptides or nucleic acids, antibodies that bind to such polypeptides or fragments, small molecules that interact with such polypeptides or with glycans that bind to them, etc.
The invention encompasses treatment of influenza infections by administration of such inventive pharmaceutical compositions. In some embodiments, treatment is accomplished by administration of a vaccine. To date, although significant accomplishments have been made in the development of influenza vaccines, there is room for further improvement. The present invention provides vaccines comprising inventive HA polypeptides, and particularly comprising HA polypeptides that bind to umbrella glycans (e.g., α2-6 linked umbrella glycans such as, for example, long α2-6 sialylated glycans).
To give but one example, attempts to generate vaccines specific for the H5N1 strain in humans have generally not been successful due, at least in part, to low immunogenicity of H5 HAs. In one study, a vaccine directed at the H5N1 strain was shown to yield antibody titers of 1:40, which is not a titer high enough to guarantee protection from infection. Furthermore, the dosage required to generate even a modest 1:40 antibody titer (two doses of 90 μg of purified killed virus or antigen) was 12-times that normally used in the case of the common seasonal influenza virus vaccine (Treanor et al., N Eng J Med, 354:1343, 2006). Other studies have similarly shown that current H5 vaccines are not highly immunogenic (Bresson et al., Lancet, 367:1657, 2006). In some embodiments, inventive vaccines are formulated utilizing one or more strategies (see, for example, Enserink, Science, 309:996, 2005) intended to allow use of lower dose of H5 HA protein, and/or to achieve higher immunogenicity. For example, in some embodiments, multivalency is improved (e.g., via use of dendrimers); in some embodiments, one or more adjuvants is utilized, etc.
In some embodiments, the present invention provides for vaccines and the administration of these vaccines to a human subject. In certain embodiments, vaccines are compositions comprising one or more of the following: (1) inactivated virus, (2) live attenuated influenza virus, for example, replication-defective virus, (3) inventive HA polypeptide or characteristic or biologically active portion thereof, (4) nucleic acid encoding HA polypeptide or characteristic or biologically active portion thereof, (5) DNA vector that encodes HA polypeptide or characteristic or biologically active portion thereof, and/or (6) expression system, for example, cells expressing one or more influenza proteins to be used as antigens.
Thus, in some embodiments, the present invention provides inactivated flu vaccines. In certain embodiments, inactivated flu vaccines comprise one of three types of antigen preparation: inactivated whole virus, sub-virions where purified virus particles are disrupted with detergents or other reagents to solubilize the lipid envelope (“split” vaccine) or purified HA polypeptide (“subunit” vaccine). In certain embodiments, virus can be inactivated by treatment with formaldehyde, beta-propiolactone, ether, ether with detergent (such as Tween-80), cetyl trimethyl ammonium bromide (CTAB) and Triton N101, sodium deoxycholate and tri(n-butyl) phosphate. Inactivation can occur after or prior to clarification of allantoic fluid (from virus produced in eggs); the virions are isolated and purified by centrifugation (Nicholson et al., eds., Textbook of Influenza, Blackwell Science, Malden, Mass., 1998). To assess the potency of the vaccine, the single radial immunodiffusion (SRD) test can be used (Schild et al., Bull. World Health Organ., 52:43-50 & 223-31, 1975; Mostow et al., J. Clin. Microbiol., 2:531, 1975).
The present invention also provides live, attenuated flu vaccines, and methods for attenuation are well known in the art. In certain embodiments, attenuation is achieved through the use of reverse genetics, such as site-directed mutagenesis.
In some embodiments, influenza virus for use in vaccines is grown in eggs, for example, in embryonated hen eggs, in which case the harvested material is allantoic fluid. Alternatively or additionally, influenza virus may be derived from any method using tissue culture to grow the virus. Suitable cell substrates for growing the virus include, for example, dog kidney cells such as MDCK or cells from a clone of MDCK, MDCK-like cells, monkey kidney cells such as AGMK cells including Vero cells, cultured epithelial cells as continuous cell lines, 293T cells, BK-21 cells, CV-1 cells, or any other mammalian cell type suitable for the production of influenza virus (including upper airway epithelial cells) for vaccine purposes, readily available from commercial sources (e.g., ATCC, Rockville, Md.). Suitable cell substrates also include human cells such as MRC-5 cells. Suitable cell substrates are not limited to cell lines; for example primary cells such as chicken embryo fibroblasts are also included.
In some embodiments, inventive vaccines further comprise one or more adjuvants. For example, aluminum salts (Baylor et al., Vaccine, 20:S18, 2002) and monophosphoryl lipid A (MPL; Ribi et al., Immunology and Immunopharmacology of Bacterial Endotoxins, Plenum Publ. Corp., NY, p 407, 1986) can be used as adjuvants in human vaccines. Alternatively or additionally, new compounds are currently being tested as adjuvants in human vaccines, such as MF59 (Chiron Corp., http://www.chiron.com/investors/pressreleases/2005/051028.html), CPG 7909 (Cooper et al., Vaccine, 22:3136, 2004), and saponins, such as QS21 (Ghochikyan et al., Vaccine, 24:2275, 2006).
Additionally, some adjuvants are known in the art to enhance the immunogenicity of influenza vaccines, such as poly[di(carboxylatophenoxy)phosphazene] (PCPP; Payne et al., Vaccine, 16:92, 1998), interferon-γ (Cao et al., Vaccine, 10:238, 1992), block copolymer P1205 (CRL1005; Katz et al., Vaccine, 18:2177, 2000), interleukin-2 (IL-2; Mbwuike et al., Vaccine, 8:347, 1990), and polymethyl methacrylate (PMMA; Kreuter et al., J. Pharm. Sci., 70:367, 1981).
In addition to vaccines, the present invention provides other therapeutic compositions useful in the treatment of viral infections. For example, in some embodiments, treatment is accomplished by administration of an agent that interferes with expression or activity of an inventive HA polypeptide. For example, treatment can be accomplished with a composition comprising antibodies (such as antibodies that recognize virus particles containing a particular HA polypeptide, e.g., an HA polypeptide that binds to umbrella glycans), nucleic acids (such as nucleic acid sequences complementary to HA sequences, which can be used for RNAi), glycans that compete for binding to HA receptors, small molecules or glycomimetics that compete with the glycan-HA polypeptide interaction, or any combination thereof. In some embodiments, collections of different agents having diverse structures are utilized. In some embodiments, therapeutic compositions comprise one or more multivalent agents. In some embodiments, treatment comprises urgent administration shortly after exposure or suspicion of exposure.
In general, a pharmaceutical composition will include a therapeutic agent in addition to one or more inactive agents such as a sterile, biocompatible carrier including, but not limited to, sterile water, saline, buffered saline, or dextrose solution. Alternatively or additionally, the composition can contain any of a variety of additives, such as stabilizers, buffers, excipients, or preservatives. In certain embodiments, a pharmaceutical composition will include a therapeutic agent that is encapsulated, trapped, or bound within a lipid vesicle, a bioavailable and/or biocompatible and/or biodegradable matrix, or other microparticle.
The pharmaceutical compositions of the present invention may be administered either alone or in combination with one or more other therapeutic agents including, but not limited to, vaccines and/or antibodies. By “in combination with,” it is not intended to imply that the agents must be administered at the same time or formulated for delivery together, although these methods of delivery are within the scope of the invention. In general, each agent will be administered at a dose and on a time schedule determined for that agent. Additionally, the invention encompasses the delivery of the inventive pharmaceutical compositions in combination with agents that may improve their bioavailability, reduce or modify their metabolism, inhibit their excretion, or modify their distribution within the body. Although the pharmaceutical compositions of the present invention can be used for treatment of any subject (e.g., any animal) in need thereof, they are most preferably used in the treatment of humans.
The pharmaceutical compositions of the present invention can be administered by a variety of routes, including oral, intravenous, intramuscular, intra-arterial, subcutaneous, intraventricular, transdermal, intradermal, rectal, intravaginal, intraperitoneal, topical (as by powders, ointments, creams, or drops), mucosal, buccal, or as an oral or nasal spray or aerosol. In general, the most appropriate route of administration will depend upon a variety of factors including the nature of the agent (e.g., its stability in the environment of the gastrointestinal tract), the condition of the patient (e.g., whether the patient is able to tolerate oral administration), etc. At present the oral or nasal spray or aerosol route is most commonly used to deliver therapeutic agents directly to the lungs and respiratory system. However, the invention encompasses the delivery of the inventive pharmaceutical composition by any appropriate route, taking into consideration likely advances in the sciences of drug delivery.
Suitable devices for use in delivering intradermal pharmaceutical compositions described herein include short needle devices such as those described in U.S. Pat. No. 4,886,499, U.S. Pat. No. 5,190,521, U.S. Pat. No. 5,328,483, U.S. Pat. No. 5,527,288, U.S. Pat. No. 4,270,537, U.S. Pat. No. 5,015,235, U.S. Pat. No. 5,141,496, U.S. Pat. No. 5,417,662. Intradermal compositions may also be administered by devices which limit the effective penetration length of a needle into the skin, such as those described in WO99/34850, incorporated herein by reference, and functional equivalents thereof. Also suitable are jet injection devices which deliver liquid vaccines to the dermis via a liquid jet injector or via a needle which pierces the stratum corneum and produces a jet which reaches the dermis. Jet injection devices are described for example in U.S. Pat. No. 5,480,381, U.S. Pat. No. 5,599,302, U.S. Pat. No. 5,334,144, U.S. Pat. No. 5,993,412, U.S. Pat. No. 5,649,912, U.S. Pat. No. 5,569,189, U.S. Pat. No. 5,704,911, U.S. Pat. No. 5,383,851, U.S. Pat. No. 5,893,397, U.S. Pat. No. 5,466,220, U.S. Pat. No. 5,339,163, U.S. Pat. No. 5,312,335, U.S. Pat. No. 5,503,627, U.S. Pat. No. 5,064,413, U.S. Pat. No. 5,520,639, U.S. Pat. No. 4,596,556, U.S. Pat. No. 4,790,824, U.S. Pat. No. 4,941,880, U.S. Pat. No. 4,940,460, WO 97/37705 and WO 97/13537. Also suitable are ballistic powder/particle delivery devices which use compressed gas to accelerate vaccine in powder form through the outer layers of the skin to the dermis. Additionally, conventional syringes may be used in the classical mantoux method of intradermal administration.
General considerations in the formulation and manufacture of pharmaceutical agents may be found, for example, in Remington's Pharmaceutical Sciences, 19th ed., Mack Publishing Co., Easton, Pa., 1995.
The present invention provides kits for detecting HA polypeptides, and in particular for detecting HA polypeptides with particular glycan binding characteristics (e.g., binding to umbrella glycans, to α2-6 sialylated glycans, to long α2-6 sialylated glycans, etc.) in pathological samples, including, but not limited to, blood, serum/plasma, peripheral blood mononuclear cells/peripheral blood lymphocytes (PBMC/PBL), sputum, urine, feces, throat swabs, dermal lesion swabs, cerebrospinal fluids, cervical smears, pus samples, food matrices, and tissues from various parts of the body such as brain, spleen, and liver. The present invention also provides kits for detecting HA polypeptides of interest in environmental samples, including, but not limited to, soil, water, and flora. Other samples that have not been listed may also be applicable.
In certain embodiments, inventive kits may include one or more agents that specifically detect HA polypeptides with particular glycan binding characteristics. Such agents may include, for example, antibodies that specifically recognize certain HA polypeptides (e.g., HA polypeptides that bind to umbrella glycans and/or to α2-6 sialylated glycans and/or to long α2-6 sialylated glycans), which can be used to specifically detect such HA polypeptides by ELISA, immunofluorescence, and/or immunoblotting. These antibodies can also be used in virus neutralization tests, in which a sample is treated with antibody specific to HA polypeptides of interest, and tested for its ability to infect cultured cells relative to untreated sample. If the virus in that sample contains such HA polypeptides, the antibody will neutralize the virus and prevent it from infecting the cultured cells. Alternatively or additionally, such antibodies can also be used in HA-inhibition tests, in which the HA protein is isolated from a given sample, treated with antibody specific to a particular HA polypeptide or set of HA polypeptides, and tested for its ability to agglutinate erythrocytes relative to untreated sample. If the virus in the sample contains such an HA polypeptide, the antibody will neutralize the activity of the HA polypeptide and prevent it from agglutinating erythrocytes (Harlow & Lane, Antibodies: A Laboratory Manual, CSHL Press, 1988; http://www.who.int/csr/resources/publications/influenza/WHO_CDS_CSR_NCS_2002_5/en/index.html; http://www.who.int/csr/disease/avian_influenza/guidelines/labtests/en/index.html).
In other embodiments, such agents may include nucleic acids that specifically bind to nucleotides that encode particular HA polypeptides and that can be used to specifically detect such HA polypeptides by RT-PCR or in situ hybridization (http://www.who.int/csr/resources/publications/influenza/WHO_CDS_CSR_NCS_2002_5/en/index.html; http://www.who.int/csr/disease/avian_influenza/guidelines/labtests/en/index.html). In certain embodiments, nucleic acids which have been isolated from a sample are amplified prior to detection. In certain embodiments, diagnostic reagents can be detectably labeled.
The present invention also provides kits containing reagents according to the invention for the generation of influenza viruses and vaccines. Contents of the kits include, but are not limited to, expression plasmids containing the HA nucleotides (or characteristic or biologically active portions) encoding HA polypeptides of interest (or characteristic or biologically active portions). Alternatively or additionally, kits may contain expression plasmids that express HA polypeptides of interest (or characteristic or biologically active portions). Expression plasmids containing no virus genes may also be included so that users are capable of incorporating HA nucleotides from any influenza virus of interest. Mammalian cell lines may also be included with the kits, including but not limited to, Vero and MDCK cell lines. In certain embodiments, diagnostic reagents can be detectably labeled.
In certain embodiments, kits for use in accordance with the present invention may include a reference sample, instructions for processing samples and performing the test, instructions for interpreting the results, and buffers and/or other reagents necessary for performing the test. In certain embodiments the kit can comprise a panel of antibodies.
In some embodiments of the present invention, glycan arrays, as discussed above, may be utilized as diagnostics and/or kits.
In certain embodiments, inventive glycan arrays and/or kits are used to perform dose response studies to assess binding of HA polypeptides to umbrella glycans at multiple doses (e.g., as described herein). Such studies give particularly valuable insight into the binding characteristics of tested HA polypeptides, and are particularly useful to assess specific binding. Dose response binding studies of this type find many useful applications. To give but one example, they can be helpful in tracking the evolution of binding characteristics in a related series of HA polypeptide variants, whether the series is generated through natural evolution, intentional engineering, or a combination of the two.
In certain embodiments, inventive glycan arrays and/or kits are used to induce, identify, and/or select HA polypeptides, and/or HA polypeptide variants having desired binding characteristics. For instance, in some embodiments, inventive glycan arrays and/or kits are used to exert evolutionary (e.g., screening and/or selection) pressure on a population of HA polypeptides.
Crystal structures of HAs from H1 (PDB IDs: 1RD8, 1RU7, 1RUY, 1RV0, 1RVT, 1RVX, 1RVZ), H3 (PDB IDs: 1MQL, 1MQM, 1MQN) and H5 (PDB IDs: 1JSN, 1JSO, 2FK0) and their complexes with α2-3 and/or α2-6 sialylated oligosaccharides have provided molecular insights into residues involved in specific HA-glycan interactions. More recently, the glycan receptor specificity of avian and human H1 and H3 subtypes has been elaborated by screening the wild type and mutants on glycan arrays comprising a variety of α2-3 and α2-6 sialylated glycans.
The Asp190Glu mutation in the HA of the 1918 human pandemic virus reversed its specificity from α2-6 to α2-3 sialylated glycans (Stevens et al., J. Mol. Biol., 355:1143, 2006; Glaser et al., J. Virol., 79:11533, 2005). On the other hand, the double mutation Glu190Asp and Gly225Asp on an avian H1 (A/Duck/Alberta/35/1976) reversed its specificity from α2-3 to α2-6 sialylated glycans. In the case of the H3 subtype, the amino acid changes from Gln226 to Leu and Gly228 to Ser between the 1963 avian H3N8 strain and the 1967-68 pandemic human H3N2 strain correlate with the change in their preference from α2-3 to α2-6 sialylated glycans (Rogers et al., Nature, 304:76, 1983). The relationship between the HA glycan binding specificity and transmission efficiency was demonstrated in a ferret model using the highly pathogenic and virulent 1918 H1N1 viruses (Tumpey, T. M. et al. Science 315: 655, 2007).
Switching the receptor binding specificity from the parental human α2-6 sialylated glycan receptor preference (SC18) to an avian α2-3 sialylated receptor preference (AV18) resulted in a virus that was unable to transmit. On the other hand, one virus with mixed α2-3/α2-6 sialylated glycan specificity (A/New York/1/18 (NY18)) showed no transmission; surprisingly, the A/Texas/36/91 (Tx91) virus, which also has mixed α2-3/α2-6 sialylated glycan specificity, was able to transmit efficiently. Furthermore, as stated above, various strains of the highly pathogenic H5N1 viruses also show mixed α2-3/α2-6 sialylated glycan specificity (Yamada, S. et al. Nature 444:378, 2006), and have not yet been able to transmit from human to human. The confounding results with respect to HA's sialylated glycan specificity and transmission posed the following questions. First, is there diversity in the sialylated glycans found in the upper airways in humans, and could that account for the specificity and tissue tropism of the virus? Second, are there nuances of glycan conformation that might play a role in how α2-3 and/or α2-6 sialylated glycans bind to the HA glycan binding pocket? Taken together, what are the glycan binding requirements of the Influenza A virus HA for human adaptation?
Analysis of all the HA-glycan co-crystal structures indicates that the orientation of the Neu5Ac sugar (SA) is fixed relative to the HA glycan binding site. A highly conserved set of amino acids Phe95, Ser/Thr136, Trp153, His183, Leu/Ile194 across different HA subtypes are involved in anchoring the SA. Therefore, the specificity of HA to α2-3 or α2-6 is governed by interactions of the HA glycan binding site with the glycosidic oxygen atom and sugars beyond SA.
The conformation of the Neu5Acα2-3Gal linkage is such that the positioning of Gal and sugars beyond Gal in α2-3 fall in a cone-like region governed by the glycosidic torsion angles at this linkage (
In addition to the conserved anchor points for sialic acid binding, two critical residues, Gln226 and Glu190, are involved in binding to the Neu5Acα2-3Gal motif. Gln226, located at the base of the binding site, interacts with the glycosidic oxygen atom of the Neu5Acα2-3Gal linkage (
Superimposition of the glycan binding site in the crystal structures of AAI68_H3_23, ADU67_H3_23 and APR34_H1_23 gave additional insights into the positioning of the Glu190 side chain and its effect on HA binding to α2-3 sialylated glycans. The side chain of Glu190 in H1 HA extends further (around 1 Å) into the binding site in comparison with that of Glu190 in H3 HA. This could be due to the amino acid difference at position 186 (Pro186 in H1 HA as against Ser186 in H3 HA), which is proximal to the Glu190 residue. This change in side chain conformation of Glu190 could correlate with the binding of avian H1 (and not avian H3) with moderate affinity to some of the α2-6 sialylated glycans as shown by the data mining analysis of the glycan microarray data (Table 3). Further, substitution of Gly228 to Ser, a hallmark change between avian and human H3 subtypes, alters the conformation of Glu190 and interferes with the interaction of human H3 HA with Neu5Acα2-3Gal in the trans conformation. This is further elaborated by the distinct conformation (that is not trans) of the Neu5Acα2-3Gal motif observed in the human AAI68_H3_23 co-crystal structure. The Neu5Acα2-3Gal motif in this conformation provides less optimal contacts with the human H3 HA binding site compared to those provided by this motif in the trans conformation with the avian H3 HA (
How do the structural variations around the Neu5Acα2-3Gal influence HA-glycan interactions? Lys193, which is highly conserved in the avian H5 (
Thus, for binding to α2-3 sialylated glycans, apart from the residues that anchor Neu5Ac, Glu190 and Gln226, which are highly conserved in all avian H1, H3 and H5 subtypes, are critical for binding to the Neu5Acα2-3Gal motif. The contacts with GlcNAc or GalNAc and substitutions such as sulfation and fucosylation in the α2-3 motif involve amino acids at positions 137, 186, 187, 193 and 222. HAs from H1, H3 and H5 exhibit differential binding specificity to the diverse α2-3 sialylated glycans present in the glycan microarray. The amino acid residues in these positions are not conserved across the different HAs, and this accounts for the different binding specificities.
In the case of the Neu5Acα2-6Gal linkage, the presence of the additional C6-C5 bond provides added conformational flexibility. The position of Gal and subsequent sugars in α2-6 would span a much larger umbrella-like region as compared to the cone-like region in the case of α2-3 (
In H1 HA, superimposition of the glycan binding domain of HA from a human H1N1 (A/South Carolina/1/1918) subtype with that of ASI30_H1_26 and APR34_H1_26 provided insights into the amino acids involved in providing specificity to the α2-6 sialylated glycan. Lys222 and Asp225 are positioned to interact with the oxygen atoms of the Gal in the Neu5Acα2-6Gal motif. Asp190 and Ser/Asn193 are positioned to interact with additional monosaccharides GlcNAcβ1-3Gal of the Neu5Acα2-6Galβ1-4GlcNAcβ1-3Gal motif (
Asp190, Lys222 and Asp225 are highly conserved among the H1 HAs from the 1918 human pandemic strains. Although the amino acid Gln226 is highly conserved in all the avian and human H1 subtypes, it does not appear to be as involved in binding to α2-6 sialylated glycans (in human H1 subtypes) compared to its role in binding to α2-3 sialylated glycans (in the avian H1 subtypes). The data mining analysis of the glycan array results for wild type and mutant form of the avian and human H1 HAs further substantiates the role of the above amino acids in binding to α2-6 sialylated glycans (Table 3). The Glu190Asp/Gly225Asp double mutant of the avian H1 HA reverses its binding to α2-6 sialylated glycans (Table 3). Further, the Lys222Leu mutant of human ANY18_H1 removes its binding to all the sialylated glycans on the array consistent with the essential role of Lys222 in glycan binding.
In order to identify amino acids that provide specificity for H3N2 HA binding to α2-6 sialylated glycans, the glycan binding domains of HA from human H3N2 (AAI68_H3), ADU63_H3_26 and ASI30_H1_26 were superimposed. Analysis of these superimposed structures showed that Leu226 is positioned to provide optimal van der Waals contact with the C6 atom of the Neu5Acα2-6Gal motif and Ser228 is positioned to interact with O9 of the sialic acid. Ser228 in the human H3 also interacts with Glu190 (unlike Gly228 in avian ADU63_H3, which does not), thereby affecting its side chain conformation. The side chain of Glu190 in human H3 HA is displaced slightly into the binding site, by about 0.7 Å, in comparison with that of Glu190 in avian H3 HA. These differences limit the ability of human H3 HA to bind to α2-3 sialylated glycans and correlate with its preferential binding to α2-6 sialylated glycans. Thus, the Gln226Leu and Gly228Ser mutations caused a reversal of the glycan receptor specificity from that of the avian H3 to that of the human H3 subtype during the 1967 pandemic.
Comparison of HAs from the 1967-68 pandemic H3N2 and those from more recent H3 subtypes (after 1990) shows that Glu190 is mutated to Asp in the recent subtypes. This mutation further enhances the binding of human H3 to α2-6 sialylated glycans since Asp190 in human H3 is positioned to interact favorably with these glycans. This structural implication is further corroborated by the data mining analysis of the glycan array data on a human H3 subtype (A/Moscow/10/1999). This HA comprises Asp190, Leu226 and Ser228 (
The above observations highlight both the similarities as well as differences between H1 and H3 HA binding to α2-6 sialylated glycans. In both H1 and H3 HA, Asp190 and Ser/Asn193 are positioned to make favorable contacts with monosaccharides beyond Neu5Acα2-6Gal motif (
The interactions with α2-6 sialylated glycans provided by the different amino acids in H1 and H3 HA suggested that the current avian H5N1 HA could mutate into an H1-like or H3-like glycan binding site in order to reverse its glycan receptor specificity. Based on the above framework, the hypothesized H1-like and H3-like mutations for H5 HA are further elaborated and tested as discussed below.
Analysis of the superimposed ASI30_H1_26, APR34_H1_26, ADS97_H5_26 and Viet04_H5 structures provided insights into the H1-like binding of H5 HA to α2-6 sialylated glycans. Since the H1 and H5 HAs belong to the same structural clade, their glycan binding sites share a similar topology and distribution of amino acids (Russell et al., Virology, 325:287, 2004). Lys222, which is highly conserved in avian H5 HAs, is positioned to provide optimal contacts with the Gal of the Neu5Acα2-6Gal motif, similar to the analogous Lys in H1 HA. Glu190 and Gly225 in Viet04_H5 (in the place of Asp190 and Asp225 in H1) do not provide the necessary contacts with the Neu5Acα2-6Galβ1-4GlcNAc motif that are seen in H1. Therefore Glu190Asp and Gly225Asp mutations in H5 HA could potentially improve the contacts with α2-6 sialylated glycans.
Analysis of the interactions beyond GlcNAc in the Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4Glc oligosaccharide and the glycan binding pocket of H1 and H5 HAs showed that while Ser/Asn193 in H1 HA provides favorable contacts with the penultimate Gal, the analogous Lys193 in H5 has unfavorable steric overlaps with the GlcNAcβ1-3Gal motif. Thus, the Lys193Ser mutation can provide additional favorable contacts (along with Glu190Asp and Gly225Asp mutations) with α2-6 sialylated glycans.
The highly conserved Gln226 in H1 HA is also conserved in the avian H5 HA. Given that Gln226 plays a less active role in H1 HA binding to α2-6 sialylated glycans (as discussed above), mutation of this amino acid to a hydrophobic amino acid such as Leu could potentially enhance its van der Waals contact with C6 atom of Gal in Neu5Acα2-6Gal motif.
The superimposition of ADU63_H3_26, AAI68_H3, ADS97_H5_26 and Viet04_H5 provides insights into the H3-like binding of H5 HA to α2-6 sialylated glycans. While this superimposition structurally aligned the glycan binding sites of H5 and H3 HA, the alignment was not as good as the structural alignment between H5 and H1. The favorable van der Waals contact and ionic contact with the Neu5Acα2-6Gal motif provided by Leu226 and Ser228, respectively, in H3 HA were absent in H5 HA (with Gln226 and Gly228). Given that Leu226 and Ser228 are critical for binding to α2-6 sialylated glycans in human H3 HA, the Gln226Leu and Gly228Ser mutations in H5 HA could potentially provide optimal contacts with α2-6 sialylated glycans. Further, even in the comparison between H3 and H5, Lys193 is positioned such that it would have unfavorable steric contacts with the monosaccharides beyond the Neu5Acα2-6Gal motif, as against Ser193 in human H3 HA, which is positioned to provide favorable contacts. Although the HA from the 1967-68 pandemic H3N2 comprises Glu190, Asp190 in H5 HA would be positioned to provide better ionic contacts with the Neu5Acα2-6Gal motif in longer oligosaccharides.
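For bookkeeping purposes, the hypothesized substitutions can be written down and checked mechanically. The following is a minimal sketch under stated assumptions: the residue map lists only the Viet04_H5 positions named in the text (H3 numbering), the grouping into `H1_LIKE` and `H3_LIKE` lists is an illustrative convenience, and the helper `apply_mutations` is a hypothetical name. It only tracks residue identities and says nothing about binding.

```python
import re

# Residues at the positions discussed in the text for Viet04_H5 (H3 numbering).
VIET04_H5_SITE = {190: "Glu", 193: "Lys", 222: "Lys", 225: "Gly", 226: "Gln", 228: "Gly"}

# Three-letter mutation notation such as "Glu190Asp".
MUTATION = re.compile(r"^([A-Z][a-z]{2})(\d+)([A-Z][a-z]{2})$")


def apply_mutations(site, mutations):
    """Return a copy of `site` with each mutation applied.

    Raises ValueError if a mutation is malformed or if its wild-type residue
    does not match the residue currently recorded at that position."""
    mutated = dict(site)
    for m in mutations:
        match = MUTATION.match(m)
        if match is None:
            raise ValueError(f"unrecognized mutation: {m}")
        old, pos, new = match.group(1), int(match.group(2)), match.group(3)
        if mutated.get(pos) != old:
            raise ValueError(f"{m}: position {pos} is {mutated.get(pos)}, not {old}")
        mutated[pos] = new
    return mutated


# Illustrative groupings of the substitutions named in the text.
H1_LIKE = ["Glu190Asp", "Lys193Ser", "Gly225Asp"]
H3_LIKE = ["Gln226Leu", "Gly228Ser"]

print(apply_mutations(VIET04_H5_SITE, H1_LIKE))
print(apply_mutations(VIET04_H5_SITE, H3_LIKE))
```

The wild-type check guards against applying a substitution written for one subtype (for example, Asp190Glu in H1) to a background where that position holds a different residue.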
The roles of the above mentioned residues were further corroborated by data mining analysis of glycan array data for wild type and mutant forms of Viet04_H5 (Table 3). The double mutant, Glu190Asp/Gly225Asp, does not bind to any glycan structure since it loses the amino acid Glu190 for binding α2-3 sialylated glycans and has the steric interference from Lys193 for binding to α2-6 sialylated glycans. Similarly the double mutant, Gln226Leu/Gly228Ser binds to some of the α2-3 sialylated glycans (α2-3 Type B classifier) but only to a single biantennary α2-6 sialylated glycan (α2-6 Type A classifier).
Analysis of this binding to the biantennary α2-6 sialylated glycan showed that the Neu5Acα2-6Gal linkage in this glycan can potentially bind in an extended conformation to the double mutant albeit with lesser contacts (
Without wishing to be bound by any particular theory, the present inventors propose that a necessary condition for human adaptation of influenza A virus HAs is to gain the ability to bind with high affinity to long α2-6 sialylated glycans (predominantly expressed in the human upper airway). For example, an aspect of glycan diversity is the length of the lactosamine branch that is capped with the sialic acid. This is captured by the two distinct features of α2-6 sialylated glycans derived from the data mining analysis (Table 3). One feature is characterized by the Neu5Acα2-6Galβ1-4GlcNAc motif linked to the Man of the N-linked core, and the other is characterized by this motif linked to another lactosamine unit, forming a longer branch (which typically adopts umbrella topology). Thus, the extensive binding of the mutant H5 HAs to the upper airways may only be possible if these mutants bind with high affinity to the glycans with long α2-6 adopting the umbrella topology. For example, according to the present invention, desirable binding patterns include binding to umbrella glycans depicted in
By contrast, we note that a recent report of modified H5 HA proteins (containing Gly228Ser and Gln226Leu/Gly228Ser substitutions) showed binding to only a single biantennary α2-6 sialyl-lactosamine glycan structure on the glycan array (Stevens et al., Science 312:404, 2006). Such modified H5 HA proteins are therefore not BSHB H5 HAs, as described herein.
Hemagglutinin in viruses is present as a trimer and is anchored to the membrane. The full length construct of HA has an N-terminal signal peptide and a C-terminal transmembrane sequence. For recombinant expression of HA, a shortened construct of HA is often used, which allows the protein to be secreted. This shortened soluble construct is created by replacing the HA's N-terminal signal peptide with a Gp67 signal peptide sequence and replacing the C-terminal transmembrane region with a ‘foldon’ sequence followed by a tryptic cleavage site and a 6×-His tag (Stevens et al., J. Mol. Biol., 355:1143, 2006). Both the full length and the soluble form of HA were expressed in insect cells. Suspension cultures of Sf-9 cells in Sf900 II SFM medium (Invitrogen) were infected with baculoviruses containing either the full length or the soluble form of HA. The cells were harvested 72-96 hours post infection.
Hemagglutinin (HA) from A/Vietnam/1203/2004 was a kind gift from Adolfo García-Sastre. This “wild type” (WT) HA was used as template to create two different mutant constructs, DSLS and DSDL, using QuikChange II XL Site-Directed Mutagenesis Kit (Stratagene) and QuikChange Multi Site-Directed Mutagenesis Kit (Stratagene). The primers used for mutagenesis were designed using the web based program, PrimerX (http://bioinformatics.org/primerx/), and synthesized by Invitrogen. The WT and mutant HA genes were sub-cloned into the entry vector pENTR-D-TOPO (Invitrogen) using TOPO ligation. The entry vectors containing the WT and mutant genes were recombined with BaculoDirect linear DNA (Invitrogen) using Gateway cloning technology. DNA sequencing was performed at each sub-cloning step to confirm the accuracy of the sequences. The recombinant baculovirus DNA produced was used to transfect Spodoptera frugiperda Sf-9 cells (Invitrogen) to yield primary stock of virus.
The full length HA was purified from the membrane fraction of the infected cells by a method modified from Wang et al. (2006) Vaccine, 24:2176. Briefly, the cells from the 150 ml culture were harvested by centrifugation and the cell pellet was extracted with 30 ml of 1% Tergitol NP-9 in buffer A (20 mM sodium phosphate, 1.0 mM EDTA, 0.01% Tergitol-NP9, 5% glycerol, pH 5.93) at 4° C. for 30 min. The extract was then subjected to centrifugation at 6,000 g for 15 min. The supernatant was filtered using a 0.45 micron filter and loaded on Q/SP columns (GE healthcare, Piscataway, N.J.) that were previously equilibrated with Buffer A. After loading, the columns were washed with 20 ml of Buffer A. Then, the anion exchange column Q was disconnected and the SP column was used for elution of protein using five 5 ml fractions of buffer B (20 mM sodium phosphate, 0.03% Tergitol, 5% glycerol, pH 8.2) and two 5 ml fractions of buffer C (20 mM sodium phosphate, 150 mM NaCl, 0.03% Tergitol, 5% glycerol, pH 8.2). The fractions containing the protein of interest were pooled together and subjected to ultrafiltration using Amicon Ultra 100 K NMWL membrane filters (Millipore). The protein was concentrated and reconstituted in PBS.
The soluble form of HA was purified from the supernatant of the infected cells using the protocol described in Stevens et al. (2004). Briefly, the supernatant was concentrated and the soluble HA was recovered from the concentrated cell supernatant by performing affinity chromatography using Ni-NTA beads (Qiagen). Eluting fractions containing HA were pooled and dialyzed against 10 mM Tris-HCl, 50 mM NaCl; pH 8.0. Ion exchange chromatography was performed on the dialyzed samples using a Mono-Q HR10/10 column (Pharmacia). The fractions containing HA were pooled together and subjected to ultrafiltration using Amicon Ultra 100 K NMWL membrane filters (Millipore). The protein was concentrated and reconstituted in PBS.
The presence of the protein in the samples was verified by western blot analysis with an anti-avian H5N1 HA antibody. The protein concentrations of WT and mutant HAs were determined by dot-blot immunoassay (using WT H5 HA obtained from Protein Sciences Inc as the reference). In the various experiments that were performed, the protein concentration of the H5 HA (WT and mutants) was typically found to be in the 20-50 microgram/ml range. Based on the protein concentration for a given lot, appropriate serial dilutions in the range of 1:10-1:100 were used (see
A framework for the binding of H5N1 subtype to α2-3/6 sialylated glycans was developed (
This analysis provides important insights into the interactions of an HA glycan binding site with a variety of α2-3/6 sialylated glycans, including glycans of either umbrella or cone topology. The second involves a data mining approach to analyze the glycan array data on the different H1, H3 and H5 HAs. This data mining analysis correlates the strong, weak and non-binders of the different wild type and mutant HAs to the structural features of the glycans in the microarray (Table 3).
Importantly, these correlations (classifiers) capture the effect of subtle structural variations of the α2-3/6 sialylated linkages and/or of different topologies on binding to the different HAs. The correlations of glycan features obtained from the data mining analysis are mapped onto the HA glycan binding site, providing a framework to systematically investigate the binding of H1, H3 and H5 HAs to α2-3 and α2-6 sialylated glycans, including glycans of different topologies, as discussed below.
To give but one example, application of this framework to H5 HA according to the present invention illustrates how length of an α2-6 oligosaccharide chain becomes more important, especially in the context of degree of branching, than the nuances of structural variations around the glycan. For example, a triantennary structure with a single α2-6 motif versus a biantennary structure with a longer α2-6 motif will influence HA-glycan binding as against structural variations around the individual α2-6 motif. This is confirmed by the distinct length dependent classifiers for the α2-6 motif obtained herein from data mining (Table 3).
In some particular embodiments of the present invention, HA polypeptides are H5 polypeptides. In some such embodiments, inventive H5 polypeptides show binding (e.g., high affinity and/or specificity binding) to umbrella glycans. In some embodiments, inventive H5 polypeptides are termed “broad spectrum human binding” (BSHB) H5 polypeptides.
The phrase “broad spectrum human binding” (BSHB) was originally coined to refer to H5 polypeptides that bind to HA receptors found in human epithelial tissues, and particularly to human HA receptors characterized by α2-6 sialylated glycans. As discussed above, with regard to HA polypeptides generally, in some embodiments, inventive BSHB H5 HA polypeptides bind to receptors found on human upper respiratory epithelial cells. Furthermore, inventive BSHB H5 HA polypeptides bind to a plurality of different α2-6 sialylated glycans. In certain embodiments, BSHB H5 HA polypeptides bind to umbrella glycans.
In certain embodiments, inventive BSHB H5 HA polypeptides bind to HA receptors in the bronchus and/or trachea. In some embodiments, BSHB H5 HA polypeptides are not able to bind receptors in the deep lung, and in other embodiments, BSHB H5 HA polypeptides are able to bind receptors in the deep lung. In further embodiments, BSHB H5 HA polypeptides are not able to bind to α2-3 sialylated glycans, and in other embodiments BSHB H5 HA polypeptides are able to bind to α2-3 sialylated glycans.
In certain embodiments, inventive BSHB H5 HA polypeptides are variants of a parent H5 HA (e.g., an H5 HA found in a natural influenza isolate). For example, in some embodiments, inventive BSHB H5 HA polypeptides have at least one amino acid substitution, as compared with wild type H5 HA, within or affecting the glycan binding site. In some embodiments, such substitutions are of amino acids that interact directly with bound glycan; in other embodiments, such substitutions are of amino acids that are one degree of separation removed from those that interact with bound glycan, in that the one degree of separation removed-amino acids either (1) interact with the direct-binding amino acids; (2) otherwise affect the ability of the direct-binding amino acids to interact with glycan, but do not interact directly with glycan themselves; or (3) otherwise affect the ability of the direct-binding amino acids to interact with glycan, and also interact directly with glycan themselves. Inventive BSHB H5 HA polypeptides contain substitutions of one or more direct-binding amino acids, one or more first degree of separation-amino acids, one or more second degree of separation-amino acids, or any combination of these. In some embodiments, inventive BSHB H5 HA polypeptides may contain substitutions of one or more amino acids with even higher degrees of separation.
In certain embodiments, inventive BSHB H5 HA polypeptides have at least two, three, four, five or more amino acid substitutions as compared with wild type H5 HA; in some embodiments inventive BSHB H5 HA polypeptides have two, three, or four amino acid substitutions. In some embodiments, all such amino acid substitutions are located within the glycan binding site.
In certain embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from the group consisting of residues 98, 136, 138, 153, 155, 159, 183, 186, 187, 190, 193, 194, 195, 222, 225, 226, 227, and 228. In other embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids located in the region of the receptor that directly binds to the glycan, including but not limited to residues 98, 136, 153, 155, 183, 190, 193, 194, 222, 225, 226, 227, and 228. In further embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids located adjacent to the region of the receptor that directly binds the glycan, including but not limited to residues 98, 138, 186, 187, 195, and 228.
In further embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from the group consisting of residues 138, 186, 187, 190, 193, 222, 225, 226, 227 and 228. In other embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids located in the region of the receptor that directly binds to the glycan, including but not limited to residues 190, 193, 222, 225, 226, 227, and 228. In further embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids located adjacent to the region of the receptor that directly binds the glycan, including but not limited to residues 138, 186, 187, and 228.
In further embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from the group consisting of residues 98, 136, 153, 155, 183, 194, and 195. In other embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids located in the region of the receptor that directly binds to the glycan, including but not limited to residues 98, 136, 153, 155, 183, and 194. In further embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids located adjacent to the region of the receptor that directly binds the glycan, including but not limited to residues 98 and 195.
In certain embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids that are one degree of separation removed from those that interact with bound glycan, in that the one degree of separation removed-amino acids either (1) interact with the direct-binding amino acids; (2) otherwise affect the ability of the direct-binding amino acids to interact with glycan, but do not interact directly with glycan themselves; or (3) otherwise affect the ability of the direct-binding amino acids to interact with glycan, and also interact directly with glycan themselves, including but not limited to residues 98, 138, 186, 187, 195, and 228.
In further embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids that are one degree of separation removed from those that interact with bound glycan, in that the one degree of separation removed-amino acids either (1) interact with the direct-binding amino acids; (2) otherwise affect the ability of the direct-binding amino acids to interact with glycan, but do not interact directly with glycan themselves; or (3) otherwise affect the ability of the direct-binding amino acids to interact with glycan, and also interact directly with glycan themselves, including but not limited to residues 138, 186, 187, and 228.
In further embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from amino acids that are one degree of separation removed from those that interact with bound glycan, in that the one degree of separation removed-amino acids either (1) interact with the direct-binding amino acids; (2) otherwise affect the ability of the direct-binding amino acids to interact with glycan, but do not interact directly with glycan themselves; or (3) otherwise affect the ability of the direct-binding amino acids to interact with glycan, and also interact directly with glycan themselves, including but not limited to residues 98 and 195.
In certain embodiments, a BSHB H5 HA polypeptide has an amino acid substitution relative to wild type H5 HA at residue 159.
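As an illustrative cross-check only, and not part of the described embodiments, the residue positions recited in the preceding paragraphs can be compared as sets. The short Python sketch below copies those lists verbatim; the set names and the `roles` helper are hypothetical labels chosen for this example.

```python
# Residue positions copied from the lists recited above (H3 numbering).
# All names below are hypothetical labels introduced for this sketch.
ALL_SITES = {98, 136, 138, 153, 155, 159, 183, 186, 187,
             190, 193, 194, 195, 222, 225, 226, 227, 228}
DIRECT = {98, 136, 153, 155, 183, 190, 193, 194, 222, 225, 226, 227, 228}
ADJACENT = {98, 138, 186, 187, 195, 228}
SUBGROUP_1 = {138, 186, 187, 190, 193, 222, 225, 226, 227, 228}
SUBGROUP_2 = {98, 136, 153, 155, 183, 194, 195}


def roles(position):
    """Return the recited role(s) of a residue position as a list."""
    found = []
    if position in DIRECT:
        found.append("direct")
    if position in ADJACENT:
        found.append("adjacent")
    return found


if __name__ == "__main__":
    # Positions in the full list that neither subgroup covers.
    print(sorted(ALL_SITES - (SUBGROUP_1 | SUBGROUP_2)))
    # Positions recited as both direct-binding and adjacent.
    print(sorted(DIRECT & ADJACENT))
```

On these lists, the two subgroups are disjoint and together cover every recited position except residue 159, which is recited separately, while residues 98 and 228 appear in both the direct-binding and the adjacent lists.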
In other embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from 190, 193, 225, and 226. In some embodiments, a BSHB H5 HA polypeptide has one or more amino acid substitutions relative to wild type H5 HA at residues selected from 190, 193, 226, and 228. In some embodiments, an inventive HA polypeptide variant, and particularly an H5 variant has one or more of the following amino acid substitutions: Ser137Ala, Lys156Glu, Asn186Pro, Asp187Ser, Asp187Thr, Ala189Gln, Ala189Lys, Ala189Thr, Glu190Asp, Glu190Thr, Lys193Arg, Lys193Asn, Lys193His, Lys193Ser, Gly225Asp, Gln226Ile, Gln226Leu, Gln226Val, Ser227Ala, Gly228Ser.
In some embodiments, an inventive HA polypeptide variant, and particularly an H5 variant has one or more of the following sets of amino acid substitutions:
Glu190Asp, Lys193Ser, Gly225Asp and Gln226Leu;
Glu190Asp, Lys193Ser, Gln226Leu and Gly228Ser;
Ala189Gln, Lys193Ser, Gln226Leu, Gly228Ser;
Asp187Ser/Thr, Ala189Gln, Lys193Ser, Gln226Leu, Gly228Ser;
Ala189Lys, Lys193Asn, Gln226Leu, Gly228Ser;
Asp187Ser/Thr, Ala189Lys, Lys193Asn, Gln226Leu, Gly228Ser;
Lys156Glu, Ala189Lys, Lys193Asn, Gln226Leu, Gly228Ser;
Lys193His, Gln226Leu/Ile/Val, Gly228Ser;
Lys193Arg, Gln226Leu/Ile/Val, Gly228Ser;
Ala189Lys, Lys193Asn, Gly225Asp;
Lys156Glu, Ala189Lys, Lys193Asn, Gly225Asp;
Ser137Ala, Lys156Glu, Ala189Lys, Lys193Asn, Gly225Asp;
Glu190Thr, Lys193Ser, Gly225Asp;
Asp187Thr, Ala189Thr, Glu190Asp, Lys193Ser, Gly225Asp;
Asn186Pro, Asp187Thr, Ala189Thr, Glu190Asp, Lys193Ser, Gly225Asp;
Asn186Pro, Asp187Thr, Ala189Thr, Glu190Asp, Lys193Ser, Gly225Asp, Ser227Ala.
In some such embodiments, the HA polypeptide has at least one further substitution as compared with a wild type HA, such that affinity and/or specificity of the variant for umbrella glycans is increased.
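The substitution sets listed above use a compact notation in which a slash separates alternative replacement residues (e.g., Gln226Leu/Ile/Val). As a minimal sketch, assuming only that notation and standard three-letter amino acid codes, the following Python code parses one entry and expands it into every concrete combination. The function names `parse_substitution` and `expand_set` are hypothetical and introduced purely for illustration.

```python
import re
from itertools import product

# Standard three-letter codes for the 20 common amino acids.
THREE_LETTER = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
}

_TOKEN = re.compile(r"^([A-Z][a-z]{2})(\d+)([A-Z][a-z]{2}(?:/[A-Z][a-z]{2})*)$")


def parse_substitution(token):
    """Parse e.g. 'Gln226Leu/Ile/Val' into ('Gln', 226, ['Leu', 'Ile', 'Val'])."""
    match = _TOKEN.match(token.replace(" ", ""))  # tolerate 'Lys 193 Ser'
    if match is None:
        raise ValueError(f"unrecognized substitution: {token!r}")
    wild_type, position = match.group(1), int(match.group(2))
    alternatives = match.group(3).split("/")
    for residue in [wild_type, *alternatives]:
        if residue not in THREE_LETTER:
            raise ValueError(f"unknown residue code: {residue!r}")
    return wild_type, position, alternatives


def expand_set(text):
    """Expand one listed set into all concrete substitution combinations."""
    cleaned = text.strip().rstrip(";.")
    tokens = [t for t in re.split(r"\s*(?:,|\band\b)\s*", cleaned) if t]
    parsed = [parse_substitution(t) for t in tokens]
    combos = []
    for choice in product(*(alts for _, _, alts in parsed)):
        combos.append(tuple(f"{wt}{pos}{alt}"
                            for (wt, pos, _), alt in zip(parsed, choice)))
    return combos


if __name__ == "__main__":
    for combo in expand_set("Lys193His, Gln226Leu/Ile/Val, Gly228Ser;"):
        print(", ".join(combo))
```

Expanding the alternatives this way makes explicit that an entry such as "Lys193His, Gln226Leu/Ile/Val, Gly228Ser" stands for three distinct triple substitutions.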
In certain embodiments, inventive BSHB H5 HA polypeptides have amino acid sequences characteristic of H1 HAs. For example, in some embodiments, such H1-like BSHB H5 HA polypeptides have substitutions Glu190Asp, Lys193Ser, Gly225Asp and Gln226Leu.
In certain embodiments, inventive BSHB H5 HA polypeptides have amino acid sequences characteristic of H3 HAs. For example, in some embodiments, such H3-like BSHB H5 HAs contain substitutions Glu190Asp, Lys193Ser, Gln226Leu and Gly228Ser.
In some embodiments, inventive BSHB H5 HA polypeptides have an open binding site as compared with wild type H5 HAs. In some embodiments, inventive BSHB H5 HA polypeptides bind to the following α2-6 sialylated glycans:
combinations thereof. In some embodiments, inventive BSHB H5 HA polypeptides bind to glycans of the structure:
and combinations thereof; and/or
and combinations thereof. In some embodiments, inventive BSHB H5 HA polypeptides bind to
in some embodiments to
in some embodiments to
and in some embodiments to
In some embodiments, inventive BSHB H5 HA polypeptides bind to umbrella topology glycans. In some embodiments, inventive BSHB H5 HA polypeptides bind to at least some of the glycans (e.g., α2-6 sialylated glycans) depicted in
In some embodiments, inventive BSHB H5 HA polypeptides bind to at least about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or more of the glycans found on HA receptors in human upper respiratory tract tissues (e.g., epithelial cells).
Lectin binding studies showed diversity in the distribution of α2-3 and α2-6 in the upper respiratory tissues. Staining studies indicate predominant distribution of α2-6 sialylated glycans as a part of both N-linked (ciliated cells) and O-linked glycans (in the goblet cells) on the apical side of the tracheal epithelium (
MALDI-MS glycan profiling analyses showed a substantial diversity (
Data in
Data in
The apical side of tracheal tissue predominantly expresses α2-6 glycans with long branch topology. The alveolar tissue on the other hand predominantly expresses α2-3 glycans. H1 HA binds significantly to the apical surface of the trachea and its binding reduces gradually with dilution from 40 to 10 μg/ml (
The data in
As described herein, the present invention encompasses the recognition that binding by HA polypeptides to glycans having a particular topology, herein termed “umbrella” topology, correlates with ability of the HA polypeptides to mediate infection of human hosts. The present Example describes results of direct binding studies with different HA polypeptides that mediate infection of different hosts, and illustrates the correlation between human infection and umbrella glycan binding.
Direct binding assays typically utilize glycan arrays in which defined glycan structures (e.g., monovalent or multivalent) are presented on a support (e.g., glass slides or well plates), often using a polymer backbone. In so-called “sequential” assays, trimeric HA polypeptide is bound to the array and is then detected, for example using labeled (e.g., with FITC or horseradish peroxidase) primary and secondary antibodies. In “multivalent” assays, trimeric HA is first complexed with primary and secondary antibodies (typically in a 4:2:1 HA:primary:secondary ratio), such that there are 12 glycan binding sites per pre-complexed HA, and is then contacted with the array. Binding assays are typically carried out over a range of HA concentrations, so that information is obtained regarding relative affinities for different glycans in the array.
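The figure of 12 glycan binding sites per pre-complexed HA follows from simple counting, which the sketch below spells out. It assumes that the 4:2:1 ratio is read in units of whole HA trimers and antibody molecules, and that each trimer contributes three binding sites (one per monomer). The `Precomplex` class is a hypothetical name used only for this illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Precomplex:
    """Composition of one pre-complexed unit (assumed 4:2:1 stoichiometry)."""
    ha_trimers: int = 4
    primary_antibodies: int = 2
    secondary_antibodies: int = 1

    def binding_sites(self, sites_per_trimer=3):
        # Assumption: one glycan binding site per HA monomer, three per trimer.
        return self.ha_trimers * sites_per_trimer


if __name__ == "__main__":
    multivalent = Precomplex()
    sequential = Precomplex(ha_trimers=1, primary_antibodies=0,
                            secondary_antibodies=0)
    print("multivalent sites:", multivalent.binding_sites())
    print("single trimer sites:", sequential.binding_sites())
```

Under these assumptions, a pre-complexed unit presents four times as many binding sites as a single trimer in a sequential assay.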
For example, direct binding studies were performed with arrays having different glycans such as 3′SLN, 6′SLN, 3′SLN-LN, 6′SLN-LN, and 3′SLN-LN-LN (where LN represents Galβ1-4GlcNAc, 3′ represents Neu5Acα2-3, and 6′ represents Neu5Acα2-6). Specifically, biotinylated glycans (50 μl of 120 pmol/ml) were incubated overnight (in PBS at 4° C.) with a streptavidin-coated High Binding Capacity 384-well plate that was previously rinsed three times with PBS. The plate was then washed three times with PBS to remove excess glycan, and was used without further processing.
Appropriate amounts of His-tagged HA protein, primary antibody (mouse anti-6×His tag) and secondary antibody (HRP-conjugated goat anti-mouse IgG) were incubated in a ratio of 4:2:1 HA:primary:secondary for 15 minutes on ice. The mixture (i.e., precomplexed HA) was then made up to a final volume of 250 μl with 1% BSA in PBS. 50 μl of the precomplexed HA was then added to the glycan-coated wells in the 384-well plate and incubated at room temperature for 2 hours. The wells were subsequently washed three times with PBS containing 0.05% TWEEN-20, and then three times with PBS. HRP activity was estimated using the Amplex Red Peroxidase Kit (Invitrogen, CA) according to the manufacturer's instructions. Serial dilutions of HA precomplexes were studied. Appropriate negative (non-sialylated glycans) and background (no glycans or no HA) controls were included, and all assays were done in triplicate. Results are presented in
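To make the glycan shorthand concrete, the following Python sketch expands names such as 6′SLN-LN into linear structure strings using the definitions given above. The β1-3 linkage between successive lactosamine units is an assumption taken from the Neu5Acα2-6Galβ1-4GlcNAcβ1-3Galβ1-4Glc oligosaccharide discussed earlier in this description, and the function names are hypothetical.

```python
# Definitions stated in the text: LN, 3' and 6'.
SIALIC_ACID = {"3'": "Neu5Acα2-3", "6'": "Neu5Acα2-6"}
LN = "Galβ1-4GlcNAc"
# Assumption: consecutive lactosamine units are joined by a β1-3 linkage.
INTER_UNIT_LINKAGE = "β1-3"


def lactosamine_units(shorthand):
    """Count the lactosamine (LN) units in a name such as 6'SLN-LN."""
    name = shorthand.replace("′", "'")  # accept prime or apostrophe
    prefix, rest = name[:2], name[2:]
    if prefix not in SIALIC_ACID:
        raise ValueError(f"unknown sialic acid prefix in {shorthand!r}")
    units = rest.split("-")
    if units[0] != "SLN" or any(unit != "LN" for unit in units[1:]):
        raise ValueError(f"unrecognized shorthand: {shorthand!r}")
    return len(units)


def expand(shorthand):
    """Expand shorthand into a linear structure string."""
    prefix = shorthand.replace("′", "'")[:2]
    count = lactosamine_units(shorthand)
    return SIALIC_ACID[prefix] + INTER_UNIT_LINKAGE.join([LN] * count)


if __name__ == "__main__":
    for name in ["3′SLN", "6′SLN", "3′SLN-LN", "6′SLN-LN", "3′SLN-LN-LN"]:
        print(name, "->", expand(name), f"({lactosamine_units(name)} LN)")
```

Counting lactosamine units in this way gives a simple handle on the distinction drawn in this description between short α2-6 glycans (one unit, e.g., 6′SLN) and long α2-6 glycans (two or more units, e.g., 6′SLN-LN).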
One characteristic of the binding pattern of known human adapted H1 and H3 HAs is their binding at saturating levels to the long α2-6 (6′SLN-LN) over a range of dilution from 40 down to 5 μg/ml (
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. The scope of the present invention is not intended to be limited to the above Description, but rather is as set forth in the following claims:
1Border line high binder;
2Sulfated GlcNAc[6S]/Gal[6S] high binders;
3Border line high binders to α2-6 Type B. Only sulfated GlcNAc[6S]/Gal[6S] are high binders;
4Binds to several non-sialylated glycans;
5Border line high to α2-3 sialylated glycans;
6Few border line high binders to sulfated GlcNAc on Neu5Acα3Galβ3/4GlcNAc;
7High binders are Neu5Acα6Galβ4GlcNAcβ3Gal & !GlcNAcα6Man;
The present application claims priority under 35 USC 119(e) to co-pending U.S. Provisional patent application Ser. No. 60/837,868, filed on Aug. 14, 2006, and to co-pending U.S. provisional patent application Ser. No. 60/837,869, filed on Aug. 14, 2006. The entire contents of each of these prior applications is incorporated herein by reference.
This invention was made with United States government support awarded by the National Institute of General Medical Sciences under contract number U54 GM62116 and by the National Institutes of Health under contract number GM57073. The United States Government has certain rights in the invention.
Number | Date | Country
---|---|---
60837868 | Aug 2006 | US
60837869 | Aug 2006 | US