The present invention relates to a method of synthesizing a rhamnose polysaccharide. The invention also relates to a synthetic streptococcal polysaccharide, a streptococcal glycoconjugate, an immunogenic composition or vaccine comprising the streptococcal polysaccharide or glycoconjugate and the polysaccharide, glycoconjugate, immunogenic composition or vaccine for use in raising an immune response in an animal or for use in treating or preventing a disease, condition or infection with a streptococcal aetiology.
The Streptococci genera of bacteria is a group of versatile gram-positive bacteria that infect a wide range of hosts and are responsible for a remarkable number of illnesses.
Streptococcus pyogenes (Group A Streptococcus, GAS) is a human-exclusive pathogenic Gram-positive bacterium that causes a variety of illnesses. A probably underestimated appraisal of the epidemical power of this organism suggests that over 700 million individuals are afflicted per year worldwide, causing diseases as varied as impetigo, pharyngitis, scarlet fever, necrotising fasciitis, meningitis and toxic shock syndrome, amongst other illnesses. Moreover, autoimmune post-infection sequelae, such as acute rheumatic fever, acute glomerulonephritis or rheumatic heart disease can affect individuals that had previously suffered from GAS infections, extending the list of clinical manifestations caused by this pathogen. The Group A Carbohydrate (GAC) is a peptidoglycan-anchored rhamnose-polysaccharide (RhaPS) from Streptococcus pyogenes that is essential to bacterial survival and contributes to Streptococcus pyogenes' ability to infect the human host.
Streptococcus agalactiae (Group B Streptococcus, GBS), is a (pathogenic) commensal bacterium which is carried by 20-40% of all adult humans. 25% of women carry GBS in the vagina, where it normally resides without symptoms. However, in pregnant women, GBS is a recognised cause for preterm delivery, maternal infections, stillbirths and late miscarriages. Despite current prevention strategies, 1 in every 1000 babies born in the UK develop GBS infections. Preterm babies are known to be at particular risk of GBS infection as their immune systems are not as well developed. This results in one baby per week dying in the UK from GBS infection and one baby surviving with long-term disabilities.
Group C Streptococcus (GCS) can cause epidemic pharyngitis and cellulitis clinically indistinguishable from GAS disease in humans. It is also known to cause septicaemia, endocarditis, septic arthritis and necrotizing infections in patients with predisposing conditions such as diabetes, cancer or in elderly patients. In equine animals, GCS is the cause of the highly contagious and serious upper respiratory tract infection known as strangles, which is enzootic in a worldwide distribution.
Group G Streptococcus (GGS) are significant human pathogens that cause cutaneous infections, for example of the human skin. GGS also infect the oropharynx, gastrointestinal regions and female genital tracts. Other infections associated with GGS include several potentially life-threatening infections such as septicaemia, endocarditis, meningitis, peritonitis, pneumonitis, empyema, and septic arthritis.
Antimicrobial options for effectively controlling, treating and preventing GAS infections are becoming more limited. This is due to emerging antibiotic resistance, pandemic development and the spread of hyper virulent strains. There is thus a clear need for the development of a safe and effective vaccine candidate. For a vaccine to be capable of targeting most of the over 120 different GAS serotypes, it will need to be based on a ubiquitous, conserved and essential GAS target. One such target is the GAC, which is not only an essential structural component to the pathogen but is also a virulence determinant.
Current forms of vaccine development are limited to chemical and enzymatic extraction methods from native bacteria as well as chemical conjugation to any acceptor compound, for example a protein or peptide. This is labour-intensive and results in a limited yield and quality of product. There is a clear need for a method of producing a GAS polysaccharide which is less labour-intensive and results in a homogenous, pure and high yield of polysaccharide. The present invention is devised with these issues in mind.
In its broadest sense, the present disclosure relates to a method of synthesizing a polysaccharide, specifically a rhamnose polysaccharide.
According to a first aspect there is provided a method of synthesizing a rhamnose polysaccharide, the method comprising:
(i) transferring a rhamnose moiety to a hexose monosaccharide, disaccharide, or trisaccharide using a hexose-β-1,4-rhamnosyltransferase, a hexose-α-1,2-rhamnosyltransferase and/or a hexose-α-1,3-rhamnosyltransferase, or an enzymatically active fragment or variant thereof to form a disaccharide, trisaccharide or tetrasaccharide comprising a rhamnose moiety at a non-reducing end of the disaccharide, trisaccharide or tetrasaccharide; and
(ii) generating the rhamnose polysaccharide by extending from the rhamnose moiety at the non-reducing end of the disaccharide, trisaccharide or tetrasaccharide using a heterologous bacterial enzyme Streptococcus pyogenes Group A carbohydrate enzyme C (GacC) and/or Streptococcus pyogenes Group A carbohydrate enzyme G (GacG) or an enzymatically active homologue, variant or fragment thereof.
The bacterial species from which the enzyme GacC and/or the enzyme GacG or an enzymatically active homologue, variant or fragment thereof is derived is heterologous to the bacterial species from which the hexose-β-1,4-rhamnosyltransferase, the hexose-α-1,2-rhamnosyltransferase, the hexose-α-1,3-rhamnosyltransferase or enzymatically active fragment or variant thereof used in step (i) is derived.
The present inventor has discovered for the first time that the Streptococcus pyogenes enzyme GacB, which initiates the synthesis of the GAC rhamnose polysaccharide, is a α-D-GlcNAc-β-1,4-L rhamnosyl-transferase. Entirely surprisingly, the inventor has found that these rhamnose polysaccharides can be synthesized using rhamnosyltransferases from bacterial species different to those from which the GacB is derived. In other words, the inventors have found that rhamnose polysaccharides can be synthesized using rhamnosyltransferases from bacterial species other than S. pyogenes. This is entirely unexpected given that the function of GacB was previously unknown. It is also surprising that enzymes from different species can work together to synthesize a rhamnose polysaccharide.
In some embodiments, step (ii) comprises generating the rhamnose polysaccharide by extending from the rhamnose moiety at the non-reducing end of the disaccharide, trisaccharide or tetrasaccharide using the heterologous bacterial enzyme GacC or an enzymatically active homologue, variant or fragment thereof.
Polysaccharide is a known term of the art used to denote a molecule comprising a plurality of identical or different monosaccharides, typically more than four monosaccharides. The term rhamnose polysaccharide, as used herein, will thus be understood to refer to a molecule comprising a plurality, typically more than four, rhamnose moieties, optionally attached to one or more other monosaccharide moieties. Conveniently, the rhamnose polysaccharide may be a single straight chain of repeating units comprising rhamnose, bound to each other by alpha 1,3, or alpha 1,2 bonds. Each repeating unit may consist only of rhamnose, or each repeating unit may comprise rhamnose and one or more different monosaccharides. An exemplary repeating unit which comprises rhamnose is a rhamnose-galactose disaccharide repeating unit. Each/any repeating unit and/or rhamnose moiety may or may not include any side-group. In one embodiment no side groups are present and in another embodiment one or more side groups, such as a sugar, with or without additional modifications, such as glycerol-phosphate; or phosphate, may be present.
In embodiments, the method is performed in a bacterium.
In such embodiments, the method will be understood to be a microbiological method. Embodiments other than those carried out in a bacterium will be understood to be in vitro methods. By “bacterium”, this will be understood to refer to a bacterial cell. It will be appreciated that the invention also encompasses the method being performed in bacteria. Such microbiological methods are ideal for the production of large and homogenous quantities of a particular product, in this instance a rhamnose polysaccharide.
The rhamnose polysaccharide produced by the method will be understood to be a synthetic rhamnose polysaccharide. A synthetic rhamnose polysaccharide, as the skilled person will appreciate, will be understood to refer to a rhamnose polysaccharide, which is not the result of a naturally occurring process. This is because the method of the first aspect uses enzymes, the combination of which is not naturally occurring. In one embodiment, the bacterium is a Streptococcus species other than Streptococcus pyogenes, Escherichia species, such as E. coli, or a Shigella species, such as Shigella dysenteriae or Shigella flexneri.
Typically, the rhamnose polysaccharide produced by the method is a streptococcal polysaccharide. For example, the polysaccharide may comprise a polysaccharide or a fragment or variant thereof selected from the group consisting of a Group A, Group B, Group C and Group G carbohydrate.
By rhamnose moiety, this will be understood to refer to a rhamnose monosaccharide or a derivative thereof. It will be appreciated that derivatives of rhamnose refer to a rhamnose monosaccharide(s) which has been modified by the addition or replacement of one or more groups or elements in the rhamnose monosaccharide, provided that at least one carbon of the rhamnose monosaccharide is still capable of forming a glycosidic bond with at least one other rhamnose monosaccharide or rhamnose moiety. Derivatives of rhamnose may encompass acetyl or methyl forms of rhamnose, amino-rhamnose, carboxylethyl-rhamnose, halogenated rhamnose and rhamnose phosphate. Unless context otherwise dictates, herein after reference will generally be made to a rhamnose moiety, but this should not be construed as limiting. Halogenated rhamnose will be understood to refer to a rhamnose monosaccharide wherein one or more groups of the rhamnose, for example one or more OH groups is replaced with a halogen, for example fluoride or chloride to form a fluorinated or chlorinated rhamnose, respectively.
Amino-rhamnose will be understood to refer to a rhamnose monosaccharide where one or more groups of the rhamnose is replaced by an amine group.
An example acetyl-rhamnose may comprise 2-O-acetyl-α-L-rhamnose, while an example methyl-rhamnose may comprise 3-O-methyl-L-rhamnose. Another exemplary derivative of rhamnose may comprise carboxylethyl-rhamnose, for example 4-O-(1-carboxyethyl)-L-rhamnose.
By enzymatically active fragment or variant, we include that the sequence of the relevant enzyme can vary from the naturally occurring sequence with the proviso that the fragment or variant substantially retains the enzymatic activity of the enzyme. By retain the enzymatic activity of the enzyme it is meant that the fragment and/or variant retains at least a portion of the enzymatic activity as compared to the native enzyme. Typically, the fragment and/or variant retains at least 50%, such as 60%, 70%, 80%, 90%, 95%, 97%, 98% or 99% activity.
In some instances, the fragment and/or variant may have a greater enzymatic activity than the native enzyme. In some embodiments, the fragment and/or variant may display an increase in another physiological feature as compared to the native enzyme. For example, the fragment and/or variant may possess a greater half-life in vitro and/or in vivo, as compared to the native enzyme. The test for determining the half-life of an enzyme, or a fragment or variant thereof, will be known to the skilled person. Briefly, an in vitro test may involve incubating the enzyme at a particular temperature and pH for different time periods. At the end of each time period, the activity of the enzyme, or fragment or variant thereof, can be measured using an enzymatic assay, which is well known to the skilled person.
The enzyme GacC, as used herein, will be understood to refer to the Streptococcus pyogenes Group A carbohydrate enzyme C (UniProtKB—Q9A0G4 (Q9A0G4_STRP1)). An exemplary amino acid sequence encoding GacC is provided by SEQ ID NO:1.
The enzyme GacG, as used herein, will be understood to refer to the Streptococcus pyogenes Group A carbohydrate enzyme G (UniProtKB—Q9A0G0 (Q9A0G0_STRP1)). In some embodiments, the enzyme GacG comprises or consists of SEQ ID NO:2, or an enzymatically active fragment or variant thereof.
GacG (or an enzymatically active homologue, variant or fragment thereof) is used instead of or in addition to GacC in the method of the invention. GacC is a rhamnose-1,3 α rhamnosyltransferase, while GacG is a predicted dual function glycosyltransferase, that synthesizes the repeating unit for the GAC (alpha 1,3-alpha1,2).
“Homologue” may encompass enzymes which exhibit at least about 20%, 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to a GacC or GacG amino acid sequence.
In some embodiments, the enzymatically active homologue is a homologue of GacC.
The degree of (or percentage) “homology” between two or more amino acid sequences may be calculated by aligning the sequences and determining the number of aligned residues which are identical and adding this to the number of conservative amino acid substitutions. The combined total is then divided by the total number of residues compared and the resulting figure is multiplied by 100—this yields the percentage homology between aligned sequences.
Typically, a homologue of GacC or GacG encompasses an enzyme which substantially retains the enzymatic activity of GacC or GacG.
In some embodiments, the homologue of GacC comprises or consists of rfbG. RfbG is an alpha-1-3 rhamnosyltransferase derived from Shigella flexneri which has 30% identity to GacC. Thus, in the context of the present invention, rfbG is an enzymatically active homologue of GacC. In some embodiments, rfbG comprises or consists of SEQ ID NO: 3. RfbG may be identified using the UniProtKB—A0A2D0WWB9 (A0A2D0WWB9_9ENTR).
The homologue of GacC or GacG may comprise or consist of rfbG, an enzyme derived from a Lancefield group species other than S. pyogenes and/or from a non-Lancefield group Streptococcus species other than S. pneumoniae.
In some embodiments, the homologue of GacC or GacG is an enzyme derived from a Lancefield group species other than S. pyogenes and/or from a non-Lancefield group Streptococcus species other than S. pneumoniae.
As the skilled person will be aware, the Lancefield group of bacteria refers to a group of different bacterial species, primarily Streptococcus species, which are catalase-negative and coagulase-negative. The grouping is based on the carbohydrate composition of the cell wall antigens.
Lancefield group bacteria include:
The non-Lancefield group Streptococcus species may comprise Streptococcus mutans or S. uberis. In some embodiments, the non-Lancefield group Streptococcus species may comprise or consist of S. mutans.
The enzymatically active homologue of GacC or GacG may be selected from a homologue from the Streptococcus Group B, Group C, Group G, S. mutans, S. uberis or an enzymatically active fragment or variant thereof.
In some embodiments, the enzymatically active homologue of GacC or GacG may be selected from a homologue from the Streptococcus Group B, Group C, Group G, S. mutans, or an enzymatically active fragment or variant thereof.
In some embodiments, the enzymatically active homologue of GacC is selected from a homologue of GacC from the Streptococcus Group B, Group C, Group G, S. mutans, S. uberis or an enzymatically active fragment or variant thereof. The skilled person will be aware of Streptococcal homologues to GacC. For example, the Group B homologue of GacC may be GbcC (UniProtKB—Q8DYQ2 (Q8DYQ2_STRA5)). The Group C homologue of GacC may be GccC (UniProtKB—M4YWQ3 (M4YWQ3_STREQ)). The Group G homologue of GacC may be GgcC (UniProtKB—C5WFT8 (C5WFT8_STRDG)), while the S. mutans homologue of GacC may be SccC (UniProtKB—A0A0E2EN43 (A0A0E2EN43_STRMG). The S. uberis homologue of GacC may be SucC (UniProtKB—B9DU25 (B9DU25_STRU0)).
The amino acid sequence of GbcC may comprise or consist of SEQ ID NO:4. The amino acid sequence of GccC may comprise of consist of SEQ ID NO:5, while the amino acid sequence of GgcC may comprise of consist of SEQ ID NO:6. In some embodiments, SccC comprises or consists of SEQ ID NO:7. The amino acid sequence of SucC may comprise or consist of SEQ ID NO:8.
In some embodiments, the enzymatically active homologue of GacG is selected from a homologue of GacG from the Streptococcus Group C, Group G, S. mutans, S. uberis or an enzymatically active fragment or variant thereof. Suitable enzymatically active homologues of GacG include, but are not limited to, the Group C homologue of GacG, GccG, the Group G homologue of GacG, GgcG, the S. uberis homologue of GacG, SucG, and the S. mutans homologue of GacG, SccG.
In some embodiments, GccG comprises and consists of SEQ ID NO:9. In some embodiments, GccG comprises or consists of two proteins. The two proteins may comprise or consist SEQ ID Nos 10 and 11.
GgcG may comprise or consist of two proteins. The two proteins may have the UniProtKBs C5WFU2 (C5WFU2_STRDG) and C5WFU3 (C5WFU3_STRDG), respectively. In some embodiments, GgcG may comprise or consist of SEQ ID Nos 12 and 13.
SucG may comprise or consist of the amino acid sequence identified by the UniProtKB—B9DU29 (B9DU29_STRU0). For example, SucG may comprise or consist of the amino acid sequence SEQ ID NO:14.
SccG may comprise or consist of the amino acid sequence identified by the UniProtKB—082878 (082878_STRMG). In some embodiments, SccG comprises or consists of the amino acid sequence SEQ ID NO:15.
The enzymatically active homologue of GacC or GacG may be selected from a homologue from, S. mutans, S. uberis or a fragment or variant thereof.
In some embodiments, step (ii) comprises generating the rhamnose polysaccharide by extending from the rhamnose moiety at the non-reducing end of the disaccharide, trisaccharide or tetrasaccharide using an enzymatically active homologue of GacC and/or GacG from S. mutans, or an enzymatically active variant or fragment thereof.
The invention also encompasses nucleic acid sequences encoding the enzymes (and/or enzymatically active fragments, variants or homologues) of the present invention.
As used herein, when an enzyme is “derived from” a particular bacterial species, this means that the enzyme is naturally occurring in the particular bacterial species. In the context of the present invention, an enzyme “derived from” a particular bacterial species may include an enzyme endogenous to the bacterium in which the method may be performed, an enzyme or a nucleic acid encoding the enzyme isolated from the particular bacterial species, or variants or fragments thereof. In embodiments where the method is performed in a bacterium, the enzyme or nucleic acid encoding the enzyme isolated from the particular bacterial species may be transferred into the bacterium in which the method is performed.
In embodiments where the method is performed in a bacterium, the enzyme(s) of step (i) and/or the enzymes(s) of step (ii) may be overexpressed in the bacterium. By “overexpressed”, this will be understood to refer to a level of expression of the enzyme higher than that which would be observed for the naturally occurring enzyme when endogenously expressed in its native bacterium. Various techniques for overexpression are known to those skilled in the art. Further information regarding overexpression techniques may be found in Current Protocols in Molecular Biology (2019) which is incorporated herein by reference.
In the context of the present invention, heterologous is used to refer to different. A heterologous bacterial species will be understood to mean a bacterial species different to another, or bacterial genera different to another bacterial genera.
It will be appreciated that in the context of the present invention, heterologous does not encompass a bacterial strain being different to another bacterial strain (i.e., two strains, for example, of S. mutans).
By “variants” of an enzyme we include insertions, deletions and substitutions of the amino acid sequence, either conservative or non-conservative wherein the physio-chemical properties of the respective amino acid(s) are not substantially changed (for example, conservative substitutions such as Gly, Ala; Val, lie, Leu; Asp, Glu; Asn, Gln; Ser, Thr; Lys, Arg; and Phe, Tyr). The skilled person will appreciate that such conservative substitutions should not affect the functionality of the respective enzyme. Moreover, small deletions within non-functional regions of the enzyme can also be tolerated and hence are considered “variants” for the purpose of the present invention. “Variants” also include recombinant enzyme proteins in which the amino acids have been post-translationally modified, by for example, glycosylation, or disulphide bond formation. The experimental procedures described herein can be readily adopted by the skilled person to determine whether a “variant” can still function as an enzyme.
It is preferred if the variant has an amino acid sequence which has at least 75%, yet still more preferably at least 80%, in further preference at least 85%, in still further preference at least 90% and most preferably at least 95%, 97%, 98% or 99% identity with the “naturally occurring” amino acid sequence of the enzyme.
It will be appreciated that variants also encompass variants of the nucleic acid sequence encoding the enzyme. In particular, we include variants of the nucleotide sequence where such changes do not substantially alter the enzymatic activity of the enzyme which it encodes.
A skilled person would know that such sequences can be altered without the loss of enzymatic activity. In particular, single changes in the nucleotide sequence may not result in an altered amino acid sequence following expression of the sequence.
In some embodiments, the method is performed in a bacterium species heterologous to the bacterium species or genera from which the enzyme GacC and/or GacG or an enzymatically active homologue, variant or fragment thereof is derived. In some embodiments, the method is performed in a gram-positive bacterium. The method may be performed in a gram-negative bacterium. For example, the method may be performed in a gram-negative bacterium such as E. coli or Campylobacter species. Other suitable gram-negative bacteria will be known to the skilled person. In embodiments, the bacterium species may be heterologous to the bacterium species or genera from which the hexose-β-1,4-rhamnosyltransferase, hexose-α-1,2-rhamnosyltransferase or hexose-α-1,3-rhamnosyltransferase is derived.
In some embodiments, the method is performed in E. coli.
Step ii) of the method may comprise using one or more additional enzymes from the Gac cluster of bacterial enzymes, or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof.
As the skilled person will appreciate, GacB is one of a number of enzymes encoded by one gene cluster in S. pyogenes. This gene cluster, which may otherwise be referred to as the Gac gene cluster, (gacA-gacL, MGAS5005_Spy_0602-0613) is understood to encode 12 different enzymes, as defined by van Sorge et al., 2014. The 12 enzymes are GacA, GacB, GacC, GacD, GacE, GacF, GacG, GacH, Gacl, GacJ, GacK and GacL. Thus, step ii) of the method may further comprise using one or more additional enzymes from the Gac cluster of bacterial enzymes, or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof. Thus, In some embodiments, step ii) of the method comprises using one or more additional enzymes selected from GacA, GacC, GacD, GacE, GacF, GacG, GacH, Gacl, GacJ, GacK, GacL or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof.
In some embodiments, step ii) of the method further comprises using one or more enzymatically active homologue(s), or enzymatically active variant(s) or fragment(s) thereof, of one or more of GacA, GacC, GacD, GacE, GacF, GacG, GacH, Gacl, GacJ, GacK, GacL.
The one or more enzymatically active homologue(s) may be derived from S. mutans and/or S. uberis.
In some embodiments, the one or more enzymatically active homologue(s) is derived from S. mutans.
Step ii) may further comprise using the enzyme GacA or an enzymatically active homologue, fragment or variant thereof. In some embodiments, step ii) may comprise using the enzymes GacC and GacG, or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof.
In some embodiments, step ii) comprises using the enzymes GacC, GacA and GacG, or one or more enzymatically active homologues, variants or fragments thereof. Step ii) may further comprise using the enzymes GacD, GacE, and GacF or one or more enzymatically active homologue(s), fragment(s) or variant(s) thereof.
Step ii) may comprise using the enzymes GacC, GacA, GacG, GacD, GacE, and Gac F or one or more enzymatically active homologue(s), fragment(s) or variant(s) thereof.
In some embodiments, step ii) comprises using the enzymes GacA, GacC, GacD, GacE, GacF, GacG, GacH, Gacl, GacJ, GacK and GacL, or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof.
Step ii) may comprise using the enzymatically active homologues from S. mutans and/or S. uberis of GacA, GacC, GacD, GacE, GacF, GacG and GacH.
In some embodiments, step ii) comprises using the enzymatically active homologues from S. mutans of GacA, GacC, GacD, GacE, GacF, GacG and GacH.
GacA may comprise or consist of SEQ ID NO:16. Without wishing to be bound by theory, GacA is believed to function to synthesize the rhamnose moieties required for the generation of the rhamnose polysaccharide. GacG is believed to be involved in the generation of the rhamnose polysaccharide by extending from the rhamnose moiety at the reducing end.
GacD and GacE may function to form an ATP-dependent ABC transporter. As the skilled person will appreciate, an ATP-dependent ABC transporter translocates substrates across membranes. Thus, without wishing to be bound by theory, GacD and GacE may assist in transporting the rhamnose polysaccharide across the bacterial membrane such that it can then be presented on the bacterial cell wall.
GacH may comprise or consist of SEQ ID NO:17. GacH can also be identified using UniProtKB—J7M7C2 (J7M7C2_STRP1).
In some embodiments, step ii) further comprises using the enzymes GacH, Gacl, GacJ, GacK and GacL, or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof.
It is thought that Gacl and/or GacJ may enhance the catalytic efficiency of the method of synthesizing the rhamnose polysaccharide.
Enzymatically active homologues of GacA may be selected from a homologue of GacA from the Streptococcus Group B, Group C, Group G, S. mutans, S. uberis or an enzymatically active fragment or variant thereof. For example, the Streptococcus Group B homologue of GacA is RmlD. The Streptococcus Group C homologue of GacA is RmlD, as is the Streptococcus Group G homologue of GacA.
The Streptococcus Group B homologue of GacA, RmlD may have the UniProtKB—A0A0E1EP43 (A0A0E1EP43_STRAG). In some embodiments, the Streptococcus Group B homologue of GacA, RmlD comprises or consists of SEQ ID NO:18.
The Streptococcus Group C homologue of GacA, RmlD may have the UniProtKB—K4Q921 (K4Q921_STREQ). In some embodiments, the Streptococcus Group C homologue of GacA, RmlD comprises or consists of SEQ ID NO:19.
The Streptococcus Group G homologue of GacA, RmlD may have the UniProt—KB AOA2X3AIL5 (AOA2X3AIL5_STRDY). The Streptococcus Group G homologue of GacA may comprise or consist of SEQ ID NO:20.
The S. mutans homologue of GacA may be identified using the UniProtKB—033664 (033664_STRMG). In some embodiments, the S. mutans homologue of GacA may comprise or consist of SEQ ID NO:21.
The S. uberis homologue of GacA may be identified using the UniProtKB—B9DU23 (B9DU23_STRU0). In some embodiments, the S. uberis homologue of GacA may comprise or consist of SEQ ID NO:22.
Enzymatically active homologues of GacD, GacE and/or GacF, may be selected from homologues from the Streptococcus Group C, Group G, S. mutans, S. uberis or an enzymatically active fragment or variant thereof. Suitable homologues of GacD include, but are not limited to, the Streptococcus Group C enzyme GccD, the Streptococcus Group G enzyme GgcD and the S. mutans enzyme SccD. Suitable homologues of GacE include, but are not limited to, the Streptococcus Group C enzyme GccE, the Streptococcus Group G enzyme GgcE and the S. mutans enzyme SccE. Suitable homologues of GacF include, but are not limited to, the Streptococcus Group C enzyme GccF, the Streptococcus Group G enzyme GgcF, the S. mutans enzyme SccF and the S. uberis enzyme SucF.
In some embodiments, GccD comprises or consists of the amino acid sequence SEQ ID NO:23. GccE may be identified using the UniProtKB—AOA380KIL0 (AOA380KIL0_STREQ).
In some embodiments, GccE comprises or consists of the amino acid sequence SEQ ID NO:24. GccF may be identified using the UniProtKB—A0A3S4QIR3 (A0A3S4QIR3_STREQ). Optionally, GccF comprises or consists of SEQ ID NO:25.
In some embodiments, GgcD comprises or consists of the amino acid sequence SEQ ID NO:26. GgcD may be identified using the UniProtKB—C5WFT9 (C5WFT9_STRDG).
In some embodiments, GgcE is identified by the UniProtKB—M4YXS7 (M4YXS7_STREQ). Optionally, GgcE comprises or consists of SEQ ID NO:27. GgcF may be identified by the UniProtKB—C5WFU1 (C5WFU1_STRDG). In some embodiments, GgcF comprises or consists of SEQ ID NO:28.
SccD may comprise or consist of SEQ ID NO:29. Optionally, SccD is identified using the UniProtKB—I6L8Z4 (I6L8Z4_STRMU).
SccE may comprise or consist of SEQ ID NO:30. Optionally, SccE is identified using the UniProtKB—I6L8X8 (I6L8X8_STRMU).
SccF may be identified using the UniProtKB—082877 (082877_STRMG). Optionally, SccF comprises or consists of SEQ ID NO:31.
SucD may be identified using the UniProtKB—B9DU26 (B9DU26_STRU0). In some embodiments, SucD comprises or consists of SEQ ID NO:32.
SucE may be identified using the UniProtKB—B9DU27 (B9DU27_STRU0). In some embodiments, SucE comprises or consists of SEQ ID NO:33.
SucF may be identified using the UniProtKB—B9DU28 (B9DU28_STRU0). In some embodiments, SucF comprises or consists of the amino acid sequence SEQ ID NO:34.
An enzymatically active homologue of GacH may comprise or consist of the S. mutans enzyme SccH, or an enzymatically active fragment or variant thereof. The enzyme SccH may be identified using the UniProtKB—Q8DUS0 (Q8DUS0_STRMU).
In some embodiments, SccH comprises or consists of SEQ ID NO:35.
In some embodiments, the hexose-β-1,4-rhamnosyltransferase is not a N-acetylglucosamine (GlcNAc)-β-1,4-rhamnosyltransferase. In some embodiments, the hexose-β-1,4-rhamnosyltransferase is not GacB.
By “hexose-β-1,4-rhamnosyltransferase”, this will be understood to be an enzyme capable of transferring a rhamnose moiety to a hexose such that a β-1,4 linkage is formed between the hexose and the rhamnose moiety. Once the rhamnose moiety is transferred, it will be understood that the hexose is at the reducing end and the rhamnose moiety is at the non-reducing end, i.e., the end from which is extended from to generate the rhamnose polysaccharide.
The hexose-β-1,4-rhamnosyltransferase may comprise or consist of an allose-β-1,4-rhamnosyltransferase, an altrose-β-1,4-rhamnosyltransferase, a glucose-β-1,4-rhamnosyltransferase, a mannose-β-1,4-rhamnosyltransferase, a xylose-β-1,4-rhamnosyltransferase, a idose-β-1,4-rhamnosyltransferase, a galactose-β-1,4-rhamnosyltransferase a talose-β-1,4-rhamnosyltransferase, a diacetylbacillosamine-β-1,4-rhamnosyltransferase or an enzymatically active fragment or variant thereof.
In some embodiments, the hexose-β-1,4-rhamnosyltransferase comprises a glucose (Glc)-β-1,4-rhamnosyltransferase or an enzymatically active fragment or variant thereof. As the skilled person will appreciate, a glucose (Glc)-β-1,4-rhamnosyltransferase is an enzyme capable of transferring a rhamnose moiety to a glucose, thereby forming a β-1,4 linkage between the glucose and the rhamnose moiety. The hexose-β-1,4-rhamnosyltransferase may comprise a WchF enzyme, or an enzymatically active fragment or variant thereof. The WchF enzyme will be understood to be derived from S. pneumoniae and is a glucose (Glc)-β-1,4-rhamnosyltransferase.
In some embodiments, the WchF enzyme comprises SEQ ID NO:36, or an enzymatically active fragment or variant thereof.
The enzymatically active fragment or variant of WchF may have at least 30% amino acid sequence identity to the WchF enzyme.
In some embodiments, the enzymatically active fragment or variant of WchF has at least 80%, at least 85%, at least 90%, at least 95%, at least 97% or at least 99% amino acid identity to the WchF enzyme. For example, homologues of WchF from S. mitis, S. oralis, S. pseudopneumoniae and S. perosis share 87%, 93%, 87% and 81% amino acid identity to WchF, respectively. In the context of the present invention, these particular homologues will thus be understood to be enzymatically active variants of WchF.
The hexose-α-1,2-rhamnosyltransferase may comprise or consist of an allose-α-1,2-rhamnosyltransferase, an altrose-α-1,2-rhamnosyltransferase, a glucose-α-1,2-rhamnosyltransferase, a mannose-α-1,2-rhamnosyltransferase, a xylose-α-1,2-rhamnosyltransferase, a idose-α-1,2-rhamnosyltransferase, a-galactose α-1,2-rhamnosyltransferase a talose-α-1,2-rhamnosyltransferase, a diacetylbacillosamine-α-1,2-rhamnosyltransferase, a GlcNAc-α-1,2-rhamnosyltransferase or an enzymatically active fragment or variant thereof.
In some embodiments, the hexose-α-1,2-rhamnosyltransferase comprises or consists of a galactose-α-1,2-rhamnosyltransferase or an enzymatically active fragment or variant thereof. The hexose-α-1,2-rhamnosyltransferase may comprise a WbbR enzyme, or an enzymatically active fragment or variant thereof. As the skilled person will appreciate, the WbbR enzyme (WP_001045977.1—UniProtKB—Q32EG0 (Q32EG0_SHIDS) is derived from Shigella dysenterica and is a galactose-α-1,2-rhamnosyltransferase.
The WbbR enzyme may comprise or consist of SEQ ID NO:37.
The hexose-α-1,3-rhamnosyltransferase may comprise or consist of an allose-α-1,3-rhamnosyltransferase, an altrose-α-1,3-rhamnosyltransferase, a glucose-α-1,3-rhamnosyltransferase, a mannose-α-1,3-rhamnosyltransferase, a xylose-α-1,3-rhamnosyltransferase, a idose-α-1,3-rhamnosyltransferase, a galactose-α-1,3-rhamnosyltransferase a talose-α-1,3-rhamnosyltransferase, a diacetylbacillosamine-α-1,3-rhamnosyltransferase, a GlcNAc-α-1,3-rhamnosyltransferase or an enzymatically active fragment or variant thereof
In some embodiments, the hexose-α-1,3-rhamnosyltransferase comprises or consists of a GlcNAc-α-1,3-rhamnosyltransferase, a diNAcBac-α-1,3-rhamnosyltransferase, a Glc-α-1,3-rhamnosyltransferase, a galactose-α-1,3-rhamnosyltransferase or a fragment or variant thereof. The hexose-α-1,3-rhamnosyltransferase may comprise or consist of a GlcNAc-α-1,3-rhamnosyltransferase or a galactose-α-1,3-rhamnosyltransferase or an enzymatically active fragment or variant thereof.
The GlcNAc-α-1,3-rhamnosyltransferase may comprise a WbbL enzyme, or an enzymatically active fragment or variant thereof. The WbbL enzyme is derived from E. coli. The WbbL enzyme may comprise or consist of SEQ ID NO:38, or an enzymatically active fragment or variant thereof.
The enzymatically active fragment or variant of WbbL may have at least 20% or at least 25% amino acid sequence identity to the WchF enzyme. For example, a homologous enzyme of WbbL having 27% amino acid identity to WbbL has been identified in Mycobacterium tuberculosis, also known as WbbL. Thus, in the context of the present invention, this homologue will be understood to be an enzymatically active variant of WbbL. This homologous enzyme to WbbL, derived from Mycobacterium tuberculosis may comprise or consist of SEQ ID NO: 39. Another suitable homologue of WbbL comprises or consists of the enzyme rfbF, derived from Shigella flexneri. RfbF may comprise or consist of SEQ ID NO:40. RfbF can be identified using the UniProtKB—A0A2Y2Z310 (A0A2Y2Z310_SHIFL).
The galactose-α-1,3-rhamnosyltransferase may comprise a WsaD enzyme, or an enzymatically active fragment or variant thereof. The WsaD enzyme is derived from Geobacillus stearothermophilus. In some embodiments, the WsaD enzyme comprises or consists of SEQ ID NO:41.
Enzymatically active fragments or variants of WsaD may be derived from other Bacilli strains, for example Brevibacillus species and Paenibacillus species. The enzymatically active fragments or variants of WsaD may have at least 20%, 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98% or at least 99% amino acid identity to WsaD.
The inventors have surprisingly found that a chimera of the hexose-β-1,4-rhamnosyltransferase, the hexose-α-1,2-rhamnosyltransferase the hexose-α-1,3-rhamnosyltransferase, or an enzymatically active fragment or variant with GacB or an enzymatically active variant, fragment or homologue thereof is capable of transferring the rhamnose moiety to a hexose monosaccharide, disaccharide or trisaccharide. Thus, in some embodiments, transferring a rhamnose moiety to a hexose monosaccharide, disaccharide or trisaccharide uses a GacB/hexose-β-1,4-rhamnosyltransferase, hexose-α-1,2-rhamnosyltransferase, hexose-α-1,3-rhamnosyltransferase or enzymatically active fragments or variants thereof chimera. It will be appreciated that in such embodiments the hexose-β-1,4-rhamnosyltransferase is not GacB.
The chimera may comprise at least the C terminus region of GacB linked to the N terminus region of the hexose-β-1,4-rhamnosyltransferase, the hexose-α-1,2-rhamnosyltransferase the hexose-α-1,3-rhamnosyltransferase, or an enzymatically active fragment or variant thereof. In some embodiments, the chimera comprises the C terminus region of GacB linked to the N terminus region of WchF.
In some embodiments, the chimera comprises the full amino acid sequence of GacB except for the initial 50, 100, 150, 160, 170, 180, 190 or 200 amino acids, which are replaced with the corresponding hexose-β-1,4-rhamnosyltransferase, hexose-α-1,2-rhamnosyltransferase hexose-α-1,3-rhamnosyltransferase, or an enzymatically active fragment or variant thereof amino acids. An example chimera may comprise the amino acid sequence of GacB except that the first 178 amino acids of GacB are replaced with the corresponding WchF amino acids (1-186 amino acids).
The hexose monosaccharide, disaccharide or trisaccharide to which the rhamnose moiety is transferred can be any hexose. In embodiments, the hexose monosaccharide is not a rhamnose moiety.
In embodiments wherein the rhamnose moiety is transferred to a hexose disaccharide or trisaccharide, the monosaccharides of the di or trisaccharide may be the same or different to each other. For example, the disaccharide may comprise two galactose monosaccharides. Alternatively, the disaccharide may comprise a GlcNAc and a galactose. The GlcNAc may be at the reducing end of the disaccharide, and the galactose at the non-reducing end.
The disaccharide may comprise one rhamnose moiety. The trisaccharide may comprise one or two rhamnose moieties.
In some embodiments, the monosaccharide at the reducing end of the hexose monosaccharide, disaccharide or trisaccharide to which the rhamnose moiety is transferred (so the hexose monosaccharide or first monosaccharide of the disaccharide or trisaccharide) is a glucose or a glucose derivative.
In the context of the present invention, glucose derivative will be understood to refer to GlcNAc or diNAcBac. In some embodiments, the hexose monosaccharide, disaccharide or trisaccharide does not comprise GlcNAc.
It will be appreciated that the monosaccharide at the non-reducing end of the hexose monosaccharide, disaccharide or trisaccharide determines the specificity of the rhamnosyltransferase. This is because the rhamnosyltransferase transfers the rhamnose moiety to the monosaccharide at the non-reducing end of the hexose monosaccharide, disaccharide or trisaccharide. Thus, when the monosaccharide at the non-reducing end is galactose, the hexose rhamnosyltransferase will be a galactose rhamnosyltransferase.
The disaccharide or trisaccharide may comprise a rhamnose moiety at its non-reducing end.
An exemplary disaccharide may comprise a glucose at the reducing end linked to a rhamnose moiety at the non-reducing end. Other exemplary disaccharides include, but are not limited to, a diNAcBac at the reducing end linked to a rhamnose moiety at the non-reducing end, or a galactose at the reducing end linked to a rhamnose moiety at the non-reducing end.
Exemplary trisaccharides include, but are not limited to, a glucose at the reducing end linked to a hexose which is linked to a rhamnose moiety at the non-reducing end, a diNAcBac at the reducing end linked to a hexose which is linked to a rhamnose moiety at the non-reducing end, or a GlcNAc at the reducing end linked to a hexose which is linked to a rhamnose moiety at the non-reducing end. Optionally, the hexose of the trisaccharide may be a rhamnose moiety or a galactose.
When reference is made to a “link” between hexoses, this will be understood to refer to a glycosidic bond. In the di or trisaccharide, the glycosidic bond between two hexoses in the di- or trisaccharide may be an alpha (α) or a beta (β) glycosidic bond. The alpha bond may be an alpha 1,3 or an alpha 1,2 bond. The beta bond may be a beta 1,4 bond.
The features of the hexose monosaccharide, disaccharide and trisaccharide as described herein are also applicable to the hexose monosaccharide, disaccharide and trisaccharide, as appropriate of the streptococcal polysaccharide of the invention.
Further examples of monosaccharides, disaccharides and trisaccharides to which the rhamnose moiety can be transferred in step i) of the method and/or which comprise or consist of the hexose monosaccharide, disaccharide or trisaccharide of the streptococcal polysaccharide of the invention are provided in Example 2.
In embodiments wherein step (i) comprises transferring a rhamnose moiety to a hexose disaccharide or trisaccharide, the method may further comprise forming the hexose disaccharide or trisaccharide. The hexose disaccharide or trisaccharide may be formed using a hexosyltransferase, i.e., an enzyme capable of transferring a hexose to another hexose. For the hexose trisaccharide, if each monosaccharide of the trisaccharide is the same (for example the trisaccharide is formed of three glucoses), then one hexosyltransferase can be used to transfer each hexose to the other to form the trisaccharide. However, in embodiments where the hexose trisaccharide is formed of at least two different hexoses, then two different hexosyltransferases will be required to form the hexose trisaccharide.
When the method further comprises forming the hexose disaccharide, the hexose disaccharide may be formed using a hexose-α-1,3-hexosyltransferase or an enzymatically active fragment or variant thereof. A hexose-α-1,3-hexosyltransferase will be understood to refer to an enzyme which is capable of transferring a hexose to another hexose to form a α-1,3 bond. In the context of the present invention, bond may otherwise be used to refer to linkage. In some embodiments, the hexose disaccharide is formed using a hexose-α-1,3-galactosyltransferase. The hexose-α-1,3-galactosyltransferase may comprise or consist of a GlcNAc-α-1,3-galactosyltransferase, optionally the enzyme WbbP, or an enzymatically active fragment or variant thereof. The enzyme WbbP may be identified using the UniProt KB—Q53982 (Q53982_SHIDY). In some embodiments, WbbP may comprise or consist of the amino acid sequence SEQ ID NO:42. Thus, in some embodiments, the disaccharide consists of a GlcNAc at its reducing end and a galactose at its non-reducing end, the two hexoses linked via a α-1,3 bond.
In some embodiments, the method comprises forming the hexose disaccharide using the enzyme WbbP, or an enzymatically active fragment or variant thereof, followed by transferring a rhamnose moiety to the hexose disaccharide using the enzyme WbbR, or an enzymatically active fragment or variant thereof.
The hexose disaccharide may be formed using a hexose-α-1,3-rhamnosyltransferase or an enzymatically active fragment or variant thereof. For example, the hexose disaccharide may be formed using a galactose-α-1,3-rhamnosyltransferase, for example WsaD or an enzymatically active fragment or variant thereof. It will be appreciated in such embodiments that the hexose disaccharide is formed of a galactose at the reducing end and a rhamnose moiety at the non-reducing end. When the hexose disaccharide is formed using a galactose-α-1,3-rhamnosyltransferase, the enzyme WsaP optionally may also be used in the formation of the disaccharide, for example to attach a lipid to the galactose. The enzyme WsaP is derived from Geobacillus stearothermophilus. WsaP may be identified using the UniprotKB—Q7BG44 (Q7BG44_GEOSE). In some embodiments, the WsaP enzyme comprises or consists of SEQ ID NO:43.
Enzymatically active fragments or variants of WsaP may be derived from other Bacilli strains, for example Brevibacillus species and Paenibacillus species. The enzymatically active fragments or variants of WsaP may have at least 20%, 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98% or at least 99% amino acid identity to WsaP.
The hexose disaccharide may be extended using a hexose-α-1,2-hexosyltransferase or an enzymatically active fragment or variant thereof to form a trisaccharide or tetrasaccharide prior to further extension from the rhamnose moiety at the non-reducing end of the trisaccharide or tetrasaccharide using a heterologous bacterial enzyme GacC and/or GacG or an enzymatically active homologue, variant or fragment thereof. Exemplary hexose-α-1,2-hexosyltransferases may include, but not be limited to WsaC and WsaE. WsaC may be identified by the UniProtKB—Q7BG54 (Q7BG54_GEOSE). Optionally, WsaC comprises or consists of SEQ ID NO: 44. WsaE may be identified by the UniProtKB—Q7BG51 (Q7BG51_GEOSE). Optionally, WsaE may comprise or consist of SEQ ID NO:45.
When the method further comprises forming the hexose trisaccharide, two monosaccharides may be linked together as described for the disaccharide, followed by the transfer of a further hexose to the non-reducing end of the disaccharide using an additional hexosyltransferase. The additional hexosyltransferase may comprise hexose-rhamnosyltransferases, such that a rhamnose moiety is transferred to the non-reducing end. Suitable hexose-rhamnosyltransferases may include any of the hexose-rhamnosyltransferases described herein. Suitable hexose-rhamnosyltransferases may include a rhamnose-α-1,3-rhamnosyltransferase, for example the enzyme WbbQ or WsaC, or an enzymatically active variant or fragment thereof. WbbQ may be identified using the UniProtKB—AOA090NIC3 (AOA090NIC3_SHIDY). In some embodiments, WbbQ comprises or consists of SEQ ID NO:46.
In some embodiments, the hexose trisaccharide is formed using a rhamnose-α-1,3-rhamnosyltransferase which is not GacC.
Further information regarding exemplary hexosyltransferases for use in the present invention are provided in the Examples.
The hexose monosaccharide, disaccharide or trisaccharide to which the rhamnose moiety is transferred may be linked to a lipid. Thus, step i) may comprise transferring a rhamnose moiety to a lipid-linked hexose monosaccharide, disaccharide or trisaccharide. The link between the hexose monosaccharide, disaccharide or trisaccharide may comprise an undecaprenyl-diphosphate.
The method may further comprise a step (step (iii)) of conjugating the rhamnose polysaccharide to an acceptor molecule using an O-oligosaccharyltransferase capable of recognising the hexose monosaccharide at the reducing end of the rhamnose polysaccharide to form a rhamnose glycoconjugate.
O-oligosaccharyltransferases are enzymes used to catalyse the transfer of a carbohydrate moiety to a target protein, in a process known as protein glycosylation. Protein glycosylation is the process of covalently attaching carbohydrate moieties, i.e., a polysaccharide, to a protein substrate. O-oligosaccharyltransferases function by cleaving a phosphate-monosaccharide bond at a reducing end of a polysaccharide. To be capable of interacting with the substrate, the O-oligosaccharyltransferase must be capable of recognising the first two monosaccharides after the phosphate bond. The substrate may otherwise be referred to as an acceptor. Thus, the acceptor molecule may comprise a peptide or a protein. This results in the formation of a glyconjugate comprising the rhamnose polysaccharide of the invention. Such glyconjugates are particularly useful as antigens, which can be used in immunogenic compositions or vaccines. In addition, when the method is performed in a bacterium, the process of glycosylation leads to the presentation of the glycoconjugate on the surface of the bacterium. This enables the glycoconjugate to be isolated from the bacterium for further use, or alternatively enables the whole bacterium to be used as an antigen, which can be used in an immunogenic composition or vaccine.
In some embodiments, the O-oligosaccharyltransferase is capable of recognising a glucose or glucose derivative. In such embodiments, the hexose monosaccharide at the reducing end of the rhamnose polysaccharide will be a glucose or a glucose derivative, such as N-acetyl glucosamine (GlcNAc).
The O-oligosaccharyltransferase may comprise PglB, PglL, PglS or WsaB or a enzymatically active homologue, fragment or variant thereof.
The PglB enzyme may be derived from a Campylobacter species, for example Campylobacter jejuni or Campylobacter lari. Without wishing to be bound by theory, it is believed that the PglB enzyme is capable of recognising any hexose except for glucose.
The PglL enzyme may derived from Neisseria meningitides. It is believed that the PglL enzyme is capable of recognising any hexose except for glucose.
The PglS enzyme may be derived from Acinetobacter species. It is believed that the PglS enzyme is capable of recognising glucose.
The WsaB enzyme is derived from Geobacillus stearothermophilus. Enzymatically active variants of the WsaB enzyme can be derived from other Geobacillus species.
In some embodiments, the O-oligosaccharyltransferase is derived from a bacterial species heterologous to the bacteria in which the method is performed.
The method may further comprise an additional step of purifying the rhamnose glycoconjugate. Purifying may comprise high performance liquid chromatography (HPLC), for example recycling-HPLC, affinity or size exclusion chromatography. Other suitable methods of purification will be known to the skilled person.
It will be appreciated that the method can be carried out at an industrial scale. As the skilled person will be aware, the bacteria in which the method can be performed are grown in liquid media. Such liquid media comprising the bacteria can be used to fill an industrial scale bioreactor, for example at a volume of at least 50, 100 or 1000 litres. This advantageously results in the synthesis of a substantial amount of the polysaccharide product of the invention.
A commonly used liquid media is Luria Broth, which may otherwise be referred to as Lysogeny Broth. Other liquid media will be known to the skilled person.
When the method is performed in bacteria, the method may be a fed-batch method. “Fed batch” is a term familiar to a person skilled in the art. Nevertheless, for the purposes of clarity, “fed batch” will be understood to refer to a method of synthesis in which nutrients are supplied to the bacteria via the liquid media during cultivation.
Suitable nutrients will be known to the skilled person. Some exemplary, but non-limiting nutrients may include a rhamnose moiety, a hexose other than a rhamnose moiety and/or divalent cations including, but not limited to, magnesium and/or manganese.
In some embodiments, the rhamnose moiety comprises rhamnose. Rhamnose may be supplied to the liquid media in the D or the L isoform, preferably the L isoform.
Which hexose other than a rhamnose moiety is supplied to the liquid media depends on the composition of the rhamnose polysaccharide produced by the method. If the hexose monosaccharide, disaccharide or trisaccharide to which the rhamnose moiety is transferred comprises glucose, then the skilled person will appreciate that a suitable nutrient to be supplied to the liquid media would be glucose. If the hexose monosaccharide, disaccharide or trisaccharide comprises galactose, then the skilled person will appreciate that a suitable nutrient to be supplied to the liquid media would be galactose. Thus, the hexose for supply to the liquid media may be selected from one or more of allose, altrose, glucose, mannose, xylose, idose, galactose, talose, diacetylbacillosamine, GalNAc or GlcNAc, as appropriate.
The rhamnose moiety and/or other hexose may (each) be supplied to the liquid media at a final concentration in the liquid media of 0.1, 0.25, 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or 15 g/L. In some embodiments, the rhamnose moiety and/or other hexose is (each) supplied to the liquid media at a final concentration in the liquid media of about 4 g/L.
The rhamnose moiety and/or other hexose may (each) be supplied to the liquid media at a final concentration in the liquid media of 0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 or 1.0 mg/ml.
In embodiments, the rhamnose moiety is supplied to the liquid media as L-rhamnose. L-rhamnose may be supplied to the liquid media at a final concentration in the liquid media of 0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 or 1.0 mg/mL When magnesium is fed to the liquid media, this may be supplied in the form of MgSO4 or MgCl2. The MgSO4 or MgCl2 may be supplied to the liquid media to form a final concentration in the media of between 0 and 10 mM.
Prior to step i), when the method is performed in a bacterium the method may further comprise the introduction of one or more nucleic acids encoding one or more of the enzymes described herein into the bacterium. For example, the method may further comprise the introduction of a nucleic acid encoding the O-oligosaccharyltransferase and/or a nucleic acid encoding the hexose-β-1,4-rhamnosyltransferase, the hexose-α 1,2-rhamnosyltransferase, the hexose-α-1,3-rhamnosyltransferase, or an enzymatically active fragment or variant thereof into the bacterium. In some embodiments, the method further comprises the introduction of a nucleic acid encoding the bacterial enzyme GacC and/or the bacterial enzyme GacG or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof into the bacterium. The enzyme can then be expressed from its respective nucleic acid. The nucleic acid(s) encoding the one or more enzymes may further comprise a nucleic acid sequence encoding an endogenous or constitutive promoter and/or an artificial ribosome binding site.
Methods for the introduction of one or more nucleic acids into a bacterium are well known to those skilled in the art. One commonly used method is that of transformation. As used herein, transforming or transformation (which may otherwise be referred to as transfecting or transfection) refers to the process of introducing free nucleic acid into a cell by allowing the nucleic acid to cross the plasma membrane of the cell. By free nucleic acid, this will be understood to refer to nucleic acid which is not contained within a virus, virus-like particle or other organism; i.e., the nucleic acid is independent of an organism (although it will be appreciated that the nucleic acid may be derived or isolated from the nucleic acid sequence of an organism).
Methods of transfection typically involve altering the plasma membrane such that free nucleic acid can cross the plasma membrane (for example, electroporation methods) or complexing the free nucleic acid with a reagent that enables the free nucleic acid to cross the plasma membrane.
It will be appreciated that the nucleic acid for transfection may be in the form of a plasmid, this being a circular strand of nucleic acid. Hence, a plasmid may comprise one or more nucleic acid(s) encoding the one or more enzymes.
The nucleic acid is typically DNA, although RNA may also or alternatively be envisaged.
Transfecting may comprise polyethylenimine, poly-L-lysine, calcium phosphate, electroporation or liposomal-based methods. In embodiments, transfecting may comprise polyethylenimine, calcium phosphate or liposomal-based methods.
It will be appreciated that a variety of liposomal-based reagents are available commercially for liposomal-based methods of transfection. Liposomal methods may include, but may not be limited to, lipofectamine-based transfection or FuGENE®HD (Promega Corporation, Wisconsin, USA)-based transfection.
Further information regarding transformation/transfection techniques may be found in Current Protocols in Molecular Biology (2019) which is incorporated herein by reference.
The plasmid may further comprise appropriate regulatory sequences, including promoter sequences, terminator fragments, enhancer sequences, marker genes and/or other sequences. For further details see, for example, Sambrook & Russell, Molecular Cloning: A Laboratory Manual: 3rd edition.
The plasmid may be further engineered to contain regulatory sequences that act as enhancer and promoter regions and lead to efficient transcription of the fusion protein sequence carried on the construct. Many parts of the regulatory unit are located upstream of the coding sequence of the heterologous gene and are operably linked thereto. The regulatory sequences can direct constitutive or inducible expression of the heterologous coding sequence. Such regulatory sequences are especially suitable if expression is wanted to occur in a time specific manner. Expression may be induced by supplying the liquid media with an inducer. The inducer may comprise or consist of arabinose, IPTG or rhamnose. Regulatory sequences which can direct inducible expression when exposed to arabinose, IPTG or rhamnose will be known to the skilled person.
Arabinose may be supplied to the liquid media at a final concentration in the liquid media of 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 g/L. Optionally, arabinose is supplied to the liquid media at a concentration of about 2 g/L.
IPTG may be supplied to the liquid media at a final concentration in the liquid media of 0.1 to 5 mM. In some embodiments, IPTG is supplied to the liquid media at a final concentration in the liquid media of 0.1 to 2 mM, preferably at a concentration of about 1 mM.
L-rhamnose may be supplied to the liquid media at a final concentration of 0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0 mg/mL as an inducer.
Also provided is a product obtainable using the method according to the first aspect. A product obtainable by the method according to the first aspect is especially pure and homogenous due to its synthetic method of production. The product of this invention is therefore ideally suited to commercial use, for example for the production on a large scale for use as an antigen or for use in research applications.
According to a third aspect there is provided a synthetic streptoccocal polysaccharide, the polysaccharide having a non-reducing end comprising a linear chain of rhamnose moieties and a reducing end comprising a hexose monosaccharide, disaccharide or trisaccharide, the hexose monosaccharide, disaccharide or trisaccharide being as described in relation to the method aspect. The polysaccharide comprises an α-1,3 bond or a an α-1,2 bond between the hexose monosaccharide, disaccharide or trisaccharide and the linear chain of rhamnose moieties, or the polysaccharide comprises an β-1,4 bond between the hexose monosaccharide, disaccharide or trisaccharide and the linear chain of rhamnose moieties and the hexose monosaccharide, disaccharide or trisaccharide does not comprise N-acetylglucosamine.
As the inventors have found, the naturally occurring GAC from S. pyogenes comprises a GlcNAc (N-acetylglucosamine) monosaccharide linked by a β-1,4 glycosidic bond to a linear chain of rhamnose monosaccharides. By altering this natural composition of the reducing end sugars, the inventors have generated a synthetic polysaccharide which retains the chemical composition and antigenic capacity of the alpha-1,2-alpha-1,3 rhamnose disaccharide repeat units of GAC, while enabling production of the polysaccharide at an industrial scale and at high levels of purity and tightly regulated size distribution to increase product length homogeneity.
Thus, typically, the polysaccharide comprises a polysaccharide or a fragment or variant thereof selected from the group consisting of a Group A, Group B, Group C and Group G carbohydrate.
In some embodiments, the polysaccharide comprises an α-1,3 bond between the hexose monosaccharide, disaccharide or trisaccharide and the linear chain of rhamnose moieties. The hexose monosaccharide disaccharide or trisaccharide may comprise N-acetylglucosamine, N,N′-diacetylbacillosamine, glucose or galactose.
In some embodiments, the polysaccharide comprises an α-1,2 bond between the hexose monosaccharide, disaccharide or trisaccharide and the linear chain of rhamnose moieties. The hexose may comprise galactose.
In some embodiments, the polysaccharide comprises a β-1,4 bond between the hexose monosaccharide, disaccharide or trisaccharide and the linear chain of rhamnose moieties and the hexose comprises glucose.
According to a fourth aspect, there is provided a streptococcal rhamnose glycoconjugate comprising the streptococcal polysaccharide according to the third aspect conjugated to an acceptor. Glyconjugates have strong antigenic potential and so rhamnose glyconjugates of the invention have particular utility in raising an immune response for example as part of or as an immunogenic composition or vaccine.
In embodiments, the polysaccharide is conjugated to the acceptor at the reducing end of the polysaccharide. The acceptor may comprise a peptide or a protein.
In some embodiments, the streptococcal rhamnose glycoconjugate is expressed on the surface of a bacterial host cell, optionally a gram negative bacterium such as E. coli. Thus, the invention also encompasses a bacterial host cell comprising the streptococcal rhamnose glycoconjugate of the fourth aspect on its cell surface. Conveniently, expression on the cell surface of the bacterial host cell enables ease of isolation of the glycoconjugate. Even more conveniently, this means that the bacterial host cell which comprises the streptococcal rhamnose glycoconjugate on its cell surface can be used as a component of or an immunogenic composition or vaccine without requiring isolation of the glyconjugate from the bacterial host cell. This reduces the time and cost necessary to produce the glyconjugate for downstream use as an immunogenic composition or vaccine.
Thus, according to a fifth aspect there is provided a bacterial host cell comprising a hexose-β-1,4-rhamnosyltransferase, a hexose-α-1,2-rhamnosyltransferase or a hexose-α-1,3-rhamnosyltransferase, or an enzymatically active fragment or variant thereof and the heterologous bacterial enzyme GacC and/or GacG or an enzymatically active homologue, variant or fragment thereof as described herein.
The bacterial host cell may be heterologous to the species from which the hexose-β-1,4-rhamnosyltransferase, a hexose-α-1,2-rhamnosyltransferase or a hexose-α-1,3-rhamnosyltransferase, or an enzymatically active fragment or variant thereof is derived.
Optionally, the bacterial host cell is a gram-negative bacterium such as E. coli. The bacterial host cell may comprise the enzymes described herein and/or the nucleic acid sequences encoding the enzymes.
According to a sixth aspect, there is provided an immunogenic composition or vaccine comprising the rhamnose polysaccharide of the second or third aspect or the streptococcal glycoconjugate according to the fourth aspect. The immunogenic composition or vaccine may further comprise a pharmaceutically acceptable and/or sterile excipient, carrier and/or diluent.
In some embodiments, the immunogenic composition or vaccine further comprises an antigen, polypeptide and/or adjuvant.
The composition may further comprise a pharmaceutically acceptable carrier, diluent or excipient. A “pharmaceutically acceptable carrier” as referred to herein is any physiological vehicle known to those of ordinary skill in the art useful in formulating pharmaceutical compositions. A “diluent” as referred to herein is any substance known to those of ordinary skill in the art useful in diluting agents for use in pharmaceutical compositions. The agent may be mixed with, or dissolved, suspended or dispersed in the carrier, diluent or excipient.
The composition may be in the form of a capsule, tablet, liquid, ointment, cream, gel, hydrogel, aerosol, spray, micelle, transdermal patch, liposome or any other suitable form that may be administered to an animal suffering from, or at risk of developing a disease, condition or infection with a streptococcal aetiology.
The compositions and/or vaccines of this invention may be formulated for oral, topical (including dermal and sublingual), intramammary, parenteral (including subcutaneous, intradermal, intramuscular and intravenous), transdermal and/or mucosal administration. In embodiments the compositions and vaccines of this invention may be formulated for parenteral administration, optionally subcutaneous, intradermal, intramuscular and/or intravenous administration.
There is also provided the rhamnose polysaccharide of the second or third aspect, the streptococcal glycoconjugate according to the fourth aspect, or the immunogenic composition or vaccine according to the sixth aspect for use in raising an immune response in an animal or for use in treating or preventing a disease, condition or infection with a streptococcal aetiology.
The animal may be any mammalian subject, for example a dog, cat, rat, mouse, human, sheep, goat, donkey, horse, cow, pig and/or chicken.
In embodiments, the animal is an ovine animal, a caprine animal, an equine animal, a porcine animal, a bovine animal or a human. In embodiments, the animal is an ovine animal. By “ovine animal”, this will be understood to include sheep.
The skilled person will appreciate that the term “caprine” includes goats, while “bovine” includes cattle. Equine is a term that will be understood to include horses. As used herein, the term “porcine” includes pigs.
An immune response which contributes to an animal's ability to resolve an infection/infestation and/or which helps reduce the symptoms associated with an infection/infestation may be a referred to as a “protective response”. In the context of this invention, the immune responses raised through exploitation of the rhamnose polysaccharides described herein may be referred to as “protective” immune responses. The term “protective” immune response may embrace any immune response which: (i) facilitates or effects a reduction in host pathogen burden; (ii) reduces one or more of the effects or symptoms of an infection/infestation; and/or (iii) prevents, reduces or limits the occurrence of further (subsequent/secondary) infections.
Thus, a protective immune response may prevent an animal from becoming infected/infested with a particular pathogen and/or from developing a particular disease or condition.
An “immune response” may be regarded as any response which elicits antibody (for example IgA, IgM and/or IgG or any other relevant isotype) responses and/or cytokine or cell mediated immune responses. The immune response may be targeted to the rhamnose polysaccharide of the invention. For example, the immune response may comprise antibodies which have affinity for epitopes of or the entire rhamnose polysaccharide.
Also provided is a method of treating an animal having a disease, condition or infection with a streptococcal aetiology, the method comprising administering the animal a therapeutically effective amount of the rhamnose polysaccharide of the second or third aspect, the streptococcal glycoconjugate according to the fourth aspect, or the immunogenic composition or vaccine according to the sixth aspect.
A therapeutically effective amount will be understood to refer to an amount sufficient to eliminate, reduce or prevent a disease, condition or infection with a streptococcal aetiology.
The rhamnose polysaccharide, glyconjugate or the immunogenic composition or vaccine may be administered as a single dose or as multiple doses. Multiple doses may be administered in a single day (e.g., 2, 3 or 4 doses at intervals of e.g., 3, 6 or 8 hours). The agent may be administered on a regular basis (e.g., daily, every other day, or weekly) over a period of days, weeks or months, as appropriate.
It will be appreciated that optimal doses to be administered can be determined by those skilled in the art and will vary depending on the particular agent in use, the strength of the preparation, the mode of administration and the advancement or severity of the disease, condition or infection with a streptococcal aetiology. Additional factors depending on the particular subject being treated will result in a need to adjust dosages, including subject age, weight, gender, diet, and time of administration. Known procedures, such as those conventionally employed by the pharmaceutical industry (e.g., in vivo experimentation, clinical trials, etc.), may be used to establish specific formulations for use according to the invention and precise therapeutic dosage regimes.
Also provided is a kit of parts, the kit comprising:
Suitable nucleic acid sequences for the kit of parts are as described herein in relation to the method of the invention.
In some embodiments, the kit further comprises one or more nucleic acid sequences encoding an O-oligosaccharyltransferase as described herein.
Further nucleic acid sequences which the kit may comprise may include one or more nucleic acid sequences encoding one or more of the following 12 enzymes GacA, GacD, GacE, GacF, GacH, Gacl, GacJ, GacK and GacL, or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof.
In some embodiments, the kit further comprises a nucleic acid sequence encoding GacA, or an enzymatically active homologue, variant or fragment thereof. In some embodiments, the kit comprises a nucleic acid sequence encoding GacG, or an enzymatically active homologue, variant or fragment thereof.
In some embodiments, the kit comprises nucleic acid sequences encoding GacG and GacC, or one or more enzymatically active homologue(s), variant(s) or fragment(s) thereof.
In some embodiments, the kit further comprises nucleic acid sequences encoding the enzymes GacA, GacD, GacE, and GacF or one or more enzymatically active homologues, fragments or variants thereof.
The kit may further comprise one or more nucleic acid sequences encoding a reporter gene.
The reporter sequence may encode a gene or peptide/protein, the expression of which can be detected by some means. Suitable reporter sequences may encode genes and/or proteins, the expression of which can be detected by, for example, optical, immunological or molecular means. Exemplary reporter sequences may encode, for example, fluorescent and/or luminescent proteins. Examples may include sequences encoding firefly luciferase (Luc: including codon-optimised forms), green fluorescent protein (GFP), red fluorescent protein (dsRed). One or both of the nucleic acid sequences described in (i) and (ii) of the kit may comprise the reporter sequence.
The kit may optionally further comprise bacteria, for example gram-negative bacteria such as E. coli. The bacteria may be heterologous to the bacterial species from which the hexose-β-1,4-rhamnosyltransferase, the hexose-α-1,2-rhamnosyltransferase, the hexose-α-1,3-rhamnosyltransferase or enzymatically active fragment or variant thereof is derived.
It will be appreciated that the plurality of nucleic acid sequences may be provided in one or a plurality of plasmids.
All of the features described herein (including any accompanying claims, abstract and drawings) may be combined with any of the above aspects in any combination, unless otherwise indicated.
The invention will now be described by way of example with reference to the following figures, which show:
S. pyogenes relies on different mechanisms to withstand the host's defences (1-5). These mechanisms are supported by the synthesis of a wide array of virulence factors, amongst which is the Group A Carbohydrate (GAC), a surface polysaccharide that constitutes between 40% and 60% of the bacterial cell wall (6-9). GAC is composed of a [→3)α-Rha(1→2)α-Rha(1→] rhamnose polysaccharide (RhaPS) backbone with a β-D-GlcNAc (1→3) side chain modifications on every α-1,2-linked rhamnose (9-11). Recent structural examinations and composition analysis of the GAC also suggest the presence of glycerol phosphate (GroP) (12), an observation that remained unnoticed for over fifty years (13,14). Further, Edgar et al. demonstrated that approximately 25% of GAC side chain GlcNAcs are decorated with GroP, imparting a negative charge to this polymer that has implications on S. pyogenes biology and defence mechanisms (12, 13, 15). This feature, previously identified in other surface glycans (16,17), provided new insight into the structural composition, biosynthesis and function of GAC.
GAC is proposed to be synthesised by twelve proteins, GacABCDEFGHIJKL, encoded in one gene cluster (i.e.: MGAS5005_spy0602-0613) that has been found in all S. pyogenes species identified so far (1, 18). Through sequencing of transposon mutant libraries, Le Breton et al. discovered that eight of these genes, gacABCDEFG and gacL are essential for S. pyogenes survival (4, 19). This information supports the observation by van Sorge et al., who identified via insertional mutagenesis that the first three genes of the cluster (gacABC) are essential (1).
It is currently hypothesized that the GAC is formed in five consecutive steps: (i) lipid-linked acceptor initiation, (ii) [-→3)α-Rha(1→2)α-Rha(1→] RhaPS backbone synthesis, (iii) membrane translocation, (iv) post-translocational chain modifications in the extracellular environment and (v) linkage to the peptidoglycan (9). The cytoplasmic pool of dTDP-rhamnose is supplied by the enzymes encoded in two separate gene clusters rm/ABC and gacA/rm/D (16).
Despite the recent findings, some pressing questions remain unanswered regarding the biosynthesis of the GAC. For example, the products of six of the twelve genes that constitute the GAC cluster (gacBCDEFG) have not yet been characterised, leaving the GAC initiation, RhaPS backbone biosynthesis and translocation steps unknown.
As a means of attaining more information on the GAC initiation step, we conducted an in-depth examination of the second enzyme encoded in the GAC gene cluster. Here we demonstrate that GacB, in disagreement with its preliminary genetic annotation and currently proposed action (8), is the first retaining rhamnosyltransferase that catalyses the transfer of L-rhamnose from dTDP-β6-L-rhamnose. GacB forms a β-1,4 glycosidic bond with the lipid-linked GlcNAc-diphosphate through a metal-independent mechanism. More importantly, our research on phylogenetically-related homologs from other important human pathogenic streptococci, in particular from the Lancefield groups B, C and G streptococci, reveal that the role of GacB is well conserved within the Streptococcus genus, suggesting a common first committed step for the production of RhaPS from all Lancefield groups.
Experimental Procedures
Bioinformatics Analysis
Alignment of protein sequences was performed using NCBI Blast Global align (https://goo.gl/vB9zmD) and ClustalOmega (https://goo.gl/8FbvYP) (49). Molecular weight predictions were obtained using the ProtParam tool at the Expasy server (http://www.expasy.org/). Topological predictions were generated using both SpOctopus (http://octopus.cbr.su.se/) and the TMHMM algorithms (www.cbs.dtu.dk/services/TMHMM/).
Secondary structure predictions were generated using either Phyre2 (https://goo.gl/zrGKJ7) or RaptorX (raptorx.uchicago.edu) homology recognition engines, and these structures were viewed and analysed using the PyMOL Molecular Graphics System (educational version 1.8 Schrödinger, LLC). The Carbohydrate Active Enzymes database (CAZy) (http://www.cazy.org/) (50) was examined to obtain information about the classification and characterization of carbohydrate active enzymes. Phylogeny relationships were established using Clustal Omega, Clustal X and the interactive tree of life iTOL (22).
Bacterial Strains and Growth Conditions
E. coli strains DH5a and MC1061 were used indistinctively as host strains for the propagation of recombinant plasmids and plasmid integration. E. coli CS2775, a strain lacking the Rha modification on the lipopolysaccharide, was used as the host strain to evaluate the production of RhaPS. E. coli 21548 is an Und-PP-GlcNAc deficient strain that contains a wecA deletion, serving as a negative control for the production of RhaPS. E. coli strain C43 (DE3) was used for the production of recombinant protein. All E. coli strains were grown in LB media. Unless otherwise indicated, all bacterial cultures were incubated at 37° C. in a shaking incubator at 200 rpm. Where necessary, media were supplemented with one or more antibiotics to the following final concentration: carbenicillin (Amp) at 100 μg/μL, erythromycin (Erm) at 300 μg/μL or kanamycin (Kan) at 50 μg/mL.
Molecular Genetic Techniques
Table 1 shows the DNA sequence of the forward and reverse oligonucleotide primer pairs used to amplify, delete, or mutagenise the genes of interest. All primers were obtained from Integrated DNA Technologies (IDT). All PCR reactions were performed using a SimpliAmp™ Thermal Cycler from ThermoFisher Scientific with standard procedures. Constructs were cloned using standard molecular biology procedures, including restriction enzyme digest and ligation. All constructs were validated with DNA sequencing.
S. mutans
S. mutans
S. mutans
S. mutans
S. mutans
S. mutans
S. mutans
S. mutans
S. pyogenes
S. pyogenes
S. pyogenes
S. pyogenes
S. dysgalactiae
equisimilis
Streptococcus -
S. agalactiae
Streptococcus -
S. dysgalactiae
equisimilis
Streoptococcus -
S. pneumonia
S. pneumoniae
Determination of RhaPS Production
50 μL of OD600-normalised overnight cultures grown at 37° C. were mixed with 50 μL of 6×SDS-loading buffer and resolved in 20% Tricine-SDS gels (29). Assessment of the RhaPS production was performed via immunoblotting on PVDF membranes following the traditional immunoblotting technique. Primary antibody: rabbit-raised anti-Streptococcus pyogenes Group A carbohydrate polyclonal antibody (Abcam, ab21034). Secondary antibody: goat-raised anti-rabbit IgG HRP conjugate (Biorad, 170-6515). Immunoreactive signals were captured using GENESYS™ 10S UV-Vis Spectrophotometer (Thermo Scientific) after exposure to the Clarity Western ECL (Biorad).
Extraction and Radiolabelling of Lipid-Linked Oligosaccharides
Radiolabelled lipid-linked saccharides (LLS) of induced E. coli CS2775 cells bearing the selected plasmids were extracted using 1:1 CHCl3/CH3OH and water-saturated butan-1-ol (1:1 v/v) solution to determine the addition of sugar residues in vivo after glucose D[6s3H] (N) (Perkin Elmer) supplementation (1 mCi/mL). The incorporated radioactivity was measured in a Beckman Coulter® LS6000SE scintillation counter. The organic phase containing the LLSs were normalised to 0.05 μCi/μL. The samples were separated via thin layer chromatography (TLC) on a HPTLC Silica Gel 60 plate (Merck) using a C:M:AC:A:W mobile phase (180 mL chloroform+140 mL methanol+9 mL 1M ammonium acetate+9 mL 13 M ammonia solution, 23 mL distilled water), then dried and sprayed with En 3 Hance™ liquid (Perkin Elmer). Radioautography images were obtained from Carestream® Kodak® BioMax® XAR Film and MS Intensifying Screens after 5 to 10 days.
Purification of Recombinantly Expressed Membrane Associated Proteins
The purification was conducted following the established protocol from Waldo, et. al. (30) with the following modifications. Overnight cultures of E. coli C43 (DE3) cells expressing C-terminal GFP-fusion proteins were diluted 1:100, incubated for 3 hours until OD600=0.6, induced with 0.5 mM IPTG and shifted to room temperature overnight, all at 200 rpm shaking. GPF expression was detected through in-gel fluorescence using a Fujifilm FLA-5000 laser scanner. Cloning, expression and purification of GacB-WT, GacB-D160N-GFP and GacB-Y182-GFP: plasmids containing GFP-Hiss-tagged recombinant proteins were constructed as described in Table 1 into the vector pWaldo-E (30). For protein production and purification purposes, the vectors were transformed into E. coli C43 (DE3) cells and expressed as described above. The cells were fractionated using an Avestin C3 High-Pressure Homogenisator (Biopharma, UK) and spun down at 4000×g. Further centrifugation of the supernatant at 200 000×g for 2 h rendered 2-3 g of membrane containing the GacB-GFP proteins. Membranes were solubilised in Buffer 1 (500 mM NaCl, 10 mM Na2HPO4, 1.8 mM KH2PO4 2.7 mM KCl, pH of 7.4, 20 mM imidazole, 0.44 mM TCEP) with the addition of 1% DDM (Anatrace) for 2 hr at 4° C. and bound to a 1 mL Ni-Sepharose 6 Fast Flow (GE healthcare) column, prewashed with buffer 1 plus 0.03% DDM. Elution was conducted using Buffer 1 supplemented with 250 mM imidazole and 0.03% DDM. Imidazole was removed using a HiPrep 26/10 desalting column (GE Healthcare) equilibrated with Buffer (PBS, 0.03% DDM, 0.4 mM TCEP). The GFP-His tag was removed with PreScission Protease cleavage in a 1:100 ratio overnight at 4° C. Cleaved GacB proteins were collected after negative IMAC. Protein identity and purity was determined by tryptic peptide mass fingerprinting, matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry (MALDI-TOF), respectively (University of Dundee ‘Fingerprints’ Proteomics Facility).
Synthesis of Acceptor Acceptor 1 and 2
Acceptor 2 (P1-(11-phenoxyundecyl)-P2-(2-acetamido-2-deoxy-α-D-glucopyranosyl) diphosphate) was synthesised as sodium salt from phenoxyundecyl dihydrogen phosphate and 2-acetamido-2-deoxy-3,4,6-tri-O-acetyl-α-D-glucopyranosyl dihydrogen phosphate according to the procedure by T. N. Druzhinina et al. 2010 (94). Acceptor 1 (P1-tridecyl-P2-(2-acetamido-2-deoxy-α-D-glucopyranosyl) diphosphate) was synthesised from tridecyl dihydrogen phosphate (obtained similarly to phenoxyundecyl dihydrogen phosphate) by the same procedure as described for acceptor 2.
GacB In Vitro Enzymatic Reaction
Purified GacB-WT-GFP, GacB-D160N-GFP, GacB-Y182F-GFP and the GacB (tag-less) protein (0.15 mg/ml final concentration) were mixed in a 100 μl TBS buffer supplemented with 1 mM TDP-Rha as sugar donor and 1 mM acceptor-1 (C13—PP-GlcNAc) or 1 mM acceptor-2 (Phenol-O—C11H22—PP-GlcNAc) as acceptor substrate. The reaction was incubated for 3 h to 24 h at 30° C. The assay mixture was adjusted with the exchange of the nucleotide sugar donor to UDP-Rha or UDP-GlcNAc and with the addition of either 1 mM MgCl2, 1 mM MnCl2, or 1 mM EDTA to define the essentiality of metal dependency.
Mass Spectrometry Analysis
Matrix-assisted Laser Desorption Ionization Time-of-Flight (MALDI-TOF) was used to analyse the acceptors and products of the GacB in vitro assay. 100 μl reaction samples were purified over a 100 μL Sep-Pak C18 cartridges (Waters, UK), pre-equilibrated with 5% EtOH. The bound samples were washed with 800 μl H2O and 800 μl 15% EtOH, eluted in two fractions with a) 800 μl 30% and b) 800 μl 60% EtOH. The two elution fractions were dried in a speed vac and resuspended in 20 μl 50% MeOH. 1 μl of sample was mixed with 1 μl 2,5-dihydroxybenzoic (DHB) acid matrix (15 mg/mL in 30:70 acetonitrile: 0.1% TFA) and 1 μl was added to the MALDI grid. Samples were analyzed by MALDI in an Autoflex speed mass spectrometer set up in reflection positive ion mode (Bruker, Germany).
NMR Analysis
The purified GacB in vitro assay products (0.5-2 mg) were dissolved in D20 (550 μL) and measured at 300 K. The spectra were acquired on a 4-channel Avance III 800 MHz Bruker NMR spectrometer equipped with a 5 mm TCl CryoProbe™ with automated matching and tuning. 1D spectra were acquired using the relaxation and acquisition times of 5 and 1.8 s, respectively. Between 32 and 512 scans were acquired using the spectral width of 11 ppm. J connectivities were established in a series of 1D and 2D TOCSY experiments with mixing times between 20 and 120 ms. Selective 1D TOCSY spectra (32) were acquired using a 40 ms Gaussian pulses and DIPSI-2 sequence (33) (γB1/2π=10 kHz) for spin lock of between 20 and 120 ms. The following parameters were used to acquire 2D TOCSY and ROESY experiments: 2048 and 768 complex points in t2 and t1, respectively, spectral widths of 11 and 8 ppm in F2 and F1, yielding t2 and t1 acquisition times of 116 and 60 ms, respectively. Sixteen scans were acquired for each t1 increments using a relaxation time of 1.5 s. The overall acquisition time was 6-7 hours per experiment. A forward linear prediction to 4096 points was applied in F1. A zero filling to 4096 was applied in F2. A cosine square window function was used for apodization prior to Fourier transformation in both dimensions. The ROESY mixing time was applied in the form of a 250 ms rectangular pulse at γB1/2π=4167 Hz. DIPSI-2 sequence (γB1/2π=10 kHz) was applied for a 20, 80 and 120 ms spin lock. 2D magnitude mode HMBC experiments: 2048 and 128 complex points in t2 and t1, respectively, spectral widths of 6 and 500 ppm in F2 and F1, yielding t2 and t1 acquisition times of 0.35 s and 0.6 ms, respectively. Two scans were acquired for each of 128 t1 increments using a relaxation time of 1.2 s. The overall acquisition time was 8 minutes. A forward linear prediction to 512 points was applied in F1; zero filling to 4096 was applied in F2. A sine square window function was used for apodization prior to Fourier transformation in both dimensions.
GacC/Homologous Enzymes Protein Purification
For production of recombinant proteins, target genes (GacC, GbcC, Cps2F, SccC) were synthesized using IDT's gBlock gene fragment synthesis service. Wild-type sequences for GacC and its' homologs were PCR amplified with overhangs designed for cloning into pOPINF1, which contains an N-terminal 6× Histidine tag for affinity purification. Cloning into pOPINF was carried out using In-Fusion™ cloning technology (Clontech). The resulting plasmids were then transformed into DH5α: competent cells for propagation and extraction (miniprep kit; Qiagen). Positively transformed plasmids were identified by size comparison to a non-transformed control pOPINF plasmid using gel electrophoresis, which were subsequently confirmed by DNA sequencing. For insertion of point mutants, wild-type plasmids were used as templates to PCR amplify 2 overlapping fragments containing the desired point mutant. Fragments were designed to contain a minimum of a 15 bp overlap and were cloned into pOPINF and sequence verified as for wild type plasmids. A full list of primers used for both wild-type and mutant cloning can be found in Table A.
Sequence verified plasmids were then transformed into C43 cells for protein expression. For activity assays, 1 L of E. coli culture typically yielded enough protein for >50 assays (1 mg L−1). Cultures were grown at 37° C. and shaking at 200 RPM to an OD of 0.6-1, at which point they were transferred to 18° C. for 1 hour before induction with 0.5 mM isopropyl β-D-thiogalactopyranoside (IPTG). Cultures were left shaking at 18° C. overnight. Following centrifugation of the culture at 3000×g, proteins were extracted in Buffer A0 (50 mM HEPES pH 7.5, 300 mM NaCl, 10% glycerol, 2 mM TCEP) supplemented with protease inhibitors, using an Avestin C3 cell disruptor according to the manufacturer's instructions. Lysed cultures were then subject to ultracentrifugation at 200,000×g and the supernatant was collected. The supernatant containing the soluble proteins of interest was then purified over a Nickel-affinity (Thermo Fisher) column using wash Buffer A (50 mM HEPES pH 7.5, 300 mM NaCl, 10% glycerol, 2 mM TCEP, 20 mM imidazole) and elution Buffer B (50 mM HEPES pH 7.5, 300 mM NaCl, 10% glycerol, 2 mM TCEP, 400 mM imidazole) according to manufacturer's instructions. Elution fractions containing the target proteins were then passed over a desalting column, preequilibrated with Buffer A0, to remove imidazole. Protein samples were concentrated to 0.5-1 mg/ml and snap frozen in liquid nitrogen until use.
1Berrow NS, Alderton D, Sainsbury S, Nettleship J, Assenberg R, Rahman N, Stuart DI, Owens RJ. A versatile ligation-independent cloning method suitable for high-throughput expression screening applications. Nucleic acids research. 2007 Mar 1; 35(6): e45.
HPLC Assay
For in vitro enzyme analyses, 50 μl reactions were set up to include 2.5 mM synthetic lipid acceptor PH—O—C11H22—PP-alpha-NAG, 12.5 mM TDP-L-rhamnose, 0.5-1.5 μM GacB-GFP, and 1.25-2.5 μM GacC or homolog/mutant of interest, topped up to 50 μl with TBS Buffer supplemented with 2 mM MnCl2. Reactions were incubated at 30° C. and when desired timepoints were met, quenched with 50 μl acetonitrile and left on ice for 15 minutes. Reactions were spin filtered at 14,000 RPM in a benchtop centrifuge to remove precipitated protein before being injected onto a Xbridge BEH Amide OBS Prep column (130 Å, 5 μM, 10×250 mm) connected to an HPLC system fitted with a UV detector set to 270 nm (Ultimate 3000, Thermo). Samples were applied to the column at 4 ml/min using Running Buffer A (95% acetonitrile, 10 mM ammonium acetate, pH 8) and Running Buffer B (50% acetonitrile, 10 mM ammonium acetate, pH 8) over a gradient of increasing concentration of B. Increasingly polar products with additional sugar residues eluted later into the gradient, with the triple rhamnosylated GacC product typically eluting ˜14 min into a 36 min run. Products purified from the HPLC were dried in a speed vacuum to remove excess acetonitrile, before being freeze-dried to remove residual water and ammonium acetate. Samples could be stored at −20° C. for structural analysis.
NMR Analysis GacC Product
For NMR analysis at the University of Dundee, HPLC purified products (0.5-2 mg) were resuspended in 600 μl of D20 and NMR spectra were recorded at 293 K. The spectra were acquired on a Bruker AVANCE III HD 500 MHz NMR Spectrometer equipped with a 5-mm QCPI cryoprobe. NMR spectra were recorded as described for the GacB reaction product. Spectra were analysed using Bruker Topsin (4.0.7).
Results
GacB is Required for the Biosynthesis of the GAC RhaPS Chain
To investigate the GacB function and to identify potential catalytic residues, we used E. coli as a heterologous expression system to study the GAC RhaPS backbone biosynthesis. We constructed two vectors carrying the homologous genes from S. pyogenes, gacACDEFG (gacA-G; ΔgacB) and gacB (
The RhaPS chain is presumed to be translocated to the outer membrane in E. coli, which naturally contains rhamnose attached to the lipopolysaccharides. Thus, to avoid unspecific binding of the anti-GAC antibody, all transformations were made using a rfaS-deficient strain (20). The interruption of the rfaS gene impedes the attachment of rhamnose to the LPS on the bacterial outer membrane, rendering a strain that lacks endogenous rhamnose on its surface (20). The role of GacB was investigated using the traditional complementation strategy depicted in
We investigated the production of RhaPS by gacA-G from our complementation approach using immunoblots of total cells lysates (
In order to investigate if GacB and SccB are catalysing the same reaction, we tested the ability of GacB to functionally substitute SccB and vice versa by co-transforming ΔsccB+gacB and ΔgacB+sccB. In all cases, SccB and GacB were interchangeable (
GacB Extends a Lipid-Linked Precursor
We investigated whether GacB is a GT that uses GlcNAc-PP-Und as an acceptor. We performed an in vivo experiment generating radiolabelled lipid-linked oligosaccharides (LLO), which were isolated from the bacterial membrane and separated via thin-layer chromatography (TLC). Based on the annotation as a rhamnosyltransferase, radiolabelled dTDP-β-L-rhamnose would be the preferred sugar donor for GacB. However, this compound is not commercially available, therefore tritiated glucose was chosen as an alternative. Inside the bacterial cell, glucose is used as a substrate to synthesise a wide array of organic components, including dTDP-L-rhamnose (25).
We hypothesised that GacB transfers an activated sugar from a (radiolabelled) nucleotide sugar donor to a membrane-bound acceptor monosaccharide-PP-Und, e.g. GlcNAc-PP-Und. Therefore, we expected a change in size of the membrane bound acceptor, compared to the signal of the monosaccharide lipid-linked acceptor after running the samples in a TLC plate. As negative control, we used E. coli CS2775 (ArfaS) transformed with the empty vector. This transformant showed a signal consistent with the generation of monosaccharide-PP-Und (
GacB is a Rhamnosyltransferase that Transfers Rhamnose from TDP-β-I-Rha onto GlcNAc-PP-Lipid Acceptors
The observed band shift suggested that GacB adds a monosaccharide to a lipid-linked precursor, most likely GlcNAc-PP-Und. We investigated this hypothesis using recombinantly produced and purified GacB WT and amino acid mutants (mutants D160N and Y182F). We established an in vitro assay using the predicted nucleotide sugar donor, TDP-β-L-rhamnose and a synthetic acceptor substrate. We tested two of these synthetic substrates designed to mimic the native lipid-linked acceptor: C13H27—PP-GlcNAc (acceptor 1) or phenyl-O—C11H22—PP-GlcNAc (acceptor 2) (
The MALDI-MS spectra of the enzymatic reaction (
We further investigated GacB's specificity towards the sugar-nucleotide donor. In particular, we tested if GacB is selective for thymidine-based nucleotides and tolerates uridine-based nucleotides such as UDP-Glc, UDP-GlcNAc and UDP-Rha. As shown before, in the presence of TDP-β-L-Rha, two products consistent with the incorporation of rhamnose plus either two or three sodium cations were observed in the spectrum (
Finally, we assessed metal ion dependency in vitro. Compared to the control reaction (
Together, these data confirmed our previous conclusions drawn from the LLSs radiolabelled assay (
Investigation of GacB's Catalytic Residues
We were unable to obtain diffraction-quality crystals from the detergent-extracted protein, which would ultimately have revealed detailed insights into the catalytic region. We constructed a GacB structural model based on two enzymes that belong to the GT-4 family of GTs: Bacillus anthracis' BaBshA (PDB entry 3mbo) (72) and Corynebacterium glutamicum's MshA (PDB ID: 3c4v) (24). BaBshA shares 15% identity in 64 out of 424 amino acids. MshA is a ‘homologous’ GT that shares 16% identical residues in a sequence stretch of 71 residues out of 446. Based on the scarce information provided by the structural models and the multiple sequence alignment described in detail below, we mutated several residues that are highly conserved in over forty pathogenic streptococci species.
Our in vitro E. coli system is the first one that enables the study of GacB mutant proteins, allowing the identification of those mutants that abrogate or reduce the production of RhaPS backbone. Conducting this in S. pyogenes is not possible since deletion of the gacB gene renders inviable cells (1, 20). We used the information available from the GT models mentioned above and the sequence alignment of multiple streptococci to select residues that might be involved in substrate binding, which tends to be conserved among GT. Through in-situ mutagenesis, we constructed nine recombinant versions of GacB containing the following amino acid substitutions: D126A, D126N, E222A, E222Q, D160A, D160N, Y182A, Y182F and K131R. The latter mutation was included as a negative control since it is a conserved predicted surface residue that presumably is not engaged in the catalytic activity or could inactivate the enzyme otherwise.
We found that substitution of D160 with an asparagine led to a drastic reduction in the production of the RhaPS chain, while an alanine residue did not cause such significant effect. This suggests that the D160 carboxyl group might be required for catalysis, which potentially can be replaced in the alanine mutant by a water molecule. A more severe effect was observed with mutations of Y182. The alanine substitution of Y182 (Y182A) impeded the RhaPS backbone biosynthesis significantly, while Y182F completely inactivated GacB, suggesting an essential role for the Y182 hydroxyl group in GacB's enzymatic activity.
We further investigated the mutants D160N and Y182F in an in vitro assay using recombinantly expressed and purified GacB-GFP-fusions. The MALDI-MS analysis of the reaction products from GacB-D160N-GFP and GacB-Y182F-GFP revealed that both mutants lacked an enzymatic activity in vitro (
Finally, we created three truncated versions of GacB at the N-terminal end as an attempt to determine whether the enzyme remains active in the absence of the residues predicted to be associated with the membrane. Our results showed that truncations of the first 22 (GacB23-385), 75 (GacB76-385) and 118 residues (GacB119-385) led to inactivation of the enzyme when assessed through the complementation assay. Their inability to complement ΔgacB suggest that the N-terminal domain is required for activity and supports the hypothesis that GacB is a membrane-associated rhamnosyltransferase.
GacB is a Retaining β-1,4-Rhamnosyl-Transferase
The current gene annotation suggests that GacB is an inverting α-1,2 rhamnosyltransferase (1, 8). This annotation is incompatible with the acceptor sugar GlcNAc since its carbon at position C2 is already decorated with the N-acetyl group. Therefore, GacB can only transfer the rhamnose onto the available hydroxyl groups on C3, C4 or C6. In addition, the GAC backbone is composed of repeating units of rhamnose connected via an α-1,3-1,2 linkage (9, 12) suggesting that GacB would be the only rhamnosyltransferase of this pathway using a retaining mechanism of action. According to the CAZy database, the GacB sequence is classified as a GT-4 family member, which are classified as retaining GTs (27). If that classification is correct for GacB, the stereochemical configuration at the anomeric centre of the sugar donor, TDP-β-L-rhamnose, should be retained in the final product.
In order to elucidate whether GacB is an inverting or a retaining rhamnosyltransferase, we conducted nuclear magnetic resonance (NMR) spectroscopy on the purified reaction products 1 and 2. 1H NMR spectra were collected at 800 MHz to both establish the structural integrity of acceptors 1 and 2 (
The addition of rhamnose to both acceptor substrates was accompanied by the appearance of a characteristic signal in the anomeric region of the spectrum (4.88 ppm, H1) next to the water signal. The anomeric configuration of this monosaccharide was established in several ways. The measured 3J(H1,H2) coupling constant of 1.0 Hz indicated a β-L configuration (1.1 and 1.8 Hz reported) for β-L and α-L-Rha, respectively). A rotating-frame nuclear Overhauser effect (ROESY) spectrum (
In conclusion, 1H NMR spectroscopy revealed the formation of a R-L-Rha (1-4) D-GlcNAc moiety and the integrity of the product.
Group a, B, C and G Streptococcus Share a Common RhaPS Initiation Step
In addition to S. mutans SccB, GacB homologs with a high degree of sequence identity are found in other streptococcal species of clinical importance, such as the Streptococcus species from Group B (GBS), Group C (GCS) and Group G (GGS). All homologous enzymes are situated in the corresponding gene clusters encoding the biosynthesis of their Lancefield antigens, i.e., the Group B, C and G carbohydrate (15). The homologous gene products share 67%, 89% and 89% amino acid identity to GacB, respectively (Table 2,
Based on the high-sequence identity to GacB, we hypothesised that the carbohydrate biosynthesis of the Group A, Group B, Group C and Group G Streptococcus possess a conserved initiation step, in which the first rhamnose residue is transferred onto the lipid-linked acceptor forming Rha-β-1,4-GlcNAc-PP-Und. We tested the ability of the homologs from GBS, GCS and GGS (GbsB, GcsB and GgsB, respectively) to functionally substitute GacB in the production of the RhaPS chain (
We showed that GacB requires GlcNAc-PP-Und as acceptor, but it is possible that the enzymes from GBS, GCS and GGS use a different lipid-linked acceptor substrate, such as Glc-PP-Und. Thus, to determine whether the GacB homologs require GlcNAc-PP-Und as lipid acceptor, we conducted the complementation assay using E. coli ΔwecA cells, which lack GlcNAc-PP-Und (23). As a positive control we identified S. pneumoniae WchF, a Glc-1,4-β-rhamnosyltransferase that uses exclusively Glc-PP-Und as substrate (28). As expected, GacB was unable to restore the RhaPS chain when co-transformed with the ΔgacB vector in the absence of the GlcNAc-PP-Und (
Most Streptococcal Pathogens are Predicted to have a GlcNAc-1,4-β-Rhamnosyl-Transferase
S. pneumoniae wchF encodes a Glc-β-1,4-rhamnosyltransferase that requires Glc-PP-Und as acceptor (28). It shares 51% amino acid identity to GacB, compared to 67-89% for the homologous enzymes from GBS, GCS, GGS and S. mutans. Towards a better understanding of the conservation of GacB in the Streptococcus genus, we extended our bioinformatics analysis to search for other strains that harbour GacB homologous genes. We found 48 human/veterinary pathogenic Streptococcus species with a single GacB homolog, sharing 50 to 94% sequence identity (Table 2,
The sequence analysis paired with the complementation assay led us to hypothesise that all GacB homologs encompassed in the ‘high identity’ subgroup possess GlcNAc-β-1,4-rhamnosyltransferase activity. In contrast, the ‘low identity’ subgroup contains S. pneumoniae WchF, a known Glc-1,4-β-rhamnosyltransferase (28). All five members of the ‘low identity subgroup’ exhibit very high sequence identity (>90%) when compared to WchF.
GacO from S. pyogenes, the WecA homolog, was shown to be responsible for the biosynthesis of the GlcNAc-PP-Und (8,9), the substrate for GacB. We therefore hypothesised that the ‘low’ and ‘high identity’ subgroups utilise different substrates, and therefore investigated whether an equivalent discrepancy should be observed when comparing the sequence identity of the GacO homologs. Within the 48 pathogenic streptococci genomes (Table 2,
The degree of phylogenetic conservation of GacB in the Streptococcus genus highlights the importance of this gene, for survival and pathogenesis of streptococcal pathogens. Overall, these results lead us to propose that those streptococcal species that have GacB homologs with a high degree of identity (>65%) are GlcNAc-β-1,4-rhamnosyltransferases that catalyse the first committed step in the biosynthesis of their surface RhaPS by transferring rhamnose from TDP-β-L-rhamnose to the membrane-bound GlcNAc-PP-Und. In contrast, we postulate that the species within the ‘low identity’ subgroup, in accordance with the function of S. pneumoniae serotype 2 WchF, contains a rhamnosyltransferase that acts on lipid-linked Glc-PP-Und.
S. pyogenes
S. canis
S. dysgalactiae subsp.
equisimilis
S. phocae
S. equi subsp. zooepidermicus
S. equi subsp. equi
S. ictaluri
S. bovimastitidis
S. iniae
S. hongkongensis
S. panaeicida
S. uberis
S. porcinus
S. henryi
S. orisasini
S. orisratti
S. parasanguinis
S. ratti
S. vestibularis
S. australis
S. equinus
S. porci
S. sanguinis
S. sinensis
S. sobrinus
S. thoraltensis
S. anginosus
S. caballi
S. downei
S. gordonii
S. intermedius
S. constellatus
S. gallolyticus
S. hyovaginalis
S. mutans
S. salivarius
S. urinalis
S. agalactiae
S. entericus
S. infantarius
S. plurextorum
S. suis
S. lutetiensis
S. oralis subsp. tigurimus
S. mitis
S. peroris
S. pneumoniae
S. pseudopneumoniae
GacB's N-Terminal Domain Encodes Specificity for the GlcNAc Acceptor
We performed a multiple sequence alignment of the GacB homologs from all 48 streptococcal pathogens to identify the most variable and conserved regions in the protein sequence. We observed a higher discrepancy between the ‘high identity’ and the ‘low identity’ subgroups in their N-terminal domains (Table 2). More precisely, a low sequence conservation region is identifiable between the GacB amino acid residues 40 and 80, suggesting that this section of the domain is either involved in the GlcNAc acceptor sugar recognition or in essential protein-protein interactions.
We knew from our previous experiment that GacB cannot initiate the RhaPS biosynthesis on a wecA deletion background (
Therefore, we investigated a structural model based on the GacB homolog from Bacillus anthracis, BaBshA (PDB entry 3mbo), which suggested that residues L128, R131, GNT100 may potentially be involved in sugar acceptor recognition. We mutated these residues to mimic those found in WchF. Complementation assays using GacB L128H_R131L, failed to complement ΔgacB in a ΔwecA background (
Discussion
This work sheds light on the first committed step of the GAC biosynthesis and provides insight into the function of GacB, the first metal-independent, retaining and non-processive α-D-GlcNAc β-1,4-L-rhamnosyltransferase reported. This insight is depicted schematically in
We have also shown that streptococcal RhaPS can be synthesized in a recombinant expression system, namely E. coli, onto a different acceptor, Und-PP-Glu using the enzyme WchF. This is depicted schematically in
This results in the formation of a synthetic streptococcal polysaccharide having a non-reducing end comprising a linear chain of rhamnose moieties and a reducing end comprising a glucose monosaccharide, the polysaccharide comprising a β-1,4 bond between the glucose and the linear chain of rhamnose moieties. As the skilled person will appreciate, this differs from the naturally occurring GAC (which is shown in
To further illustrate the invention, this Example is directed to further exemplary methods of synthesis and the rhamnose polysaccharide of the invention.
In this particular example, the method is performed in E. coli, although other bacteria can be envisaged for this purpose. Thus, in this particular embodiment, WbbL can be endogenous to the E. coli or it can be overexpressed in the E. coli.
This method, as
Other methods of synthesis are within the scope of the present invention.
The method of
An alternative embodiment, as also depicted by the top and penultimate schematics of
To the left of
WbbP can alternatively be used to form a disaccharide (i.e., a galactose monosaccharide at its non-reducing end linked by an α-1,3 glycosidic bond to a GlcNAc at its reducing end, following which the rhamnose polysaccharide is generated by extended from the rhamnose moiety at the non-reducing end of the disaccharide (see bottom schematic of
Optionally, one or two additional rhamnose moieties can be transferred to the rhamnose moiety linked to the galactose to form a tetra or pentasaccharide, prior to the step of extension as detailed above. The one or two additional rhamnose moieties can be transferred using the enzyme WbbQ, followed by further extension using GacC using GbcC, as shown in the third schematic of
For the tri, tetra or pentasaccharide methods, these methods result in the generation of a synthetic Streptococcal polysaccharide having a reducing end comprising a linear chain of rhamnose moieties and a non-reducing end comprising a GlcNac and a galactose, the polysaccharide comprising a α-1,2 bond between the linear chain of rhamnose moieties and the galactose and a α-1,3 bond between the galactose and the GlcNAc.
In embodiments wherein a rhamnose moiety is transferred to a disaccharide or trisaccharide, it is envisaged that any combination of hexoses may be used to form the di or trisaccharide using alpha or beta bonds as described herein. This is depicted in
Homologues to GacC can function in a similar manner.
Similarly, the inventor has shown that the GacC enzyme function is conserved amongst Streptococci and is able to complement SccC enzyme of E. coli.
Complementation study confirms that GacC enzyme function is conserved amongst Streptococci from Group B, C, G and S. mutans.
Phylogenetic analysis of GacO, GacB and GacC enzymes show the high degree of similarity and hence function is conserved in Streptococci-Pathogenic strains are all expected to produce RhaPS with identical adapter/stem and as such, all are suitable for use in accordance with the present invention.
Further studies, not all data shown, by the inventors using NMR and mass spectrometry techniques confirm that GacC can add up to 4 rhamnose sugars and that GacC is an inverting alpha-1,3 rhamnosyltransferase.
The inventor has further shown that the rhamnose polysaccharide in accordance with the present invention may be generated using different enzyme combinations.
Based on the above evidence, it is expected that Shigella spp. can be further used in order to provide the adaptor/stem and GAC repeat units, as shown schematically in
Replacement of the GacB/C enzymes (replacement of the GlcNAc-beta1,4-rhamnose-alpha1,3-rhamnose adaptor/stem) to generate NewRhaPS, provides an alternative to maintain the immunogenic repeat unit (proposed to be introduced by GacG enzyme activity). Replacing the adaptor region (green box) with a O-Otase compatible polysaccharide/oligosaccharide is sufficient to build the immunogenic polysaccharide (alpha1,2-alpha1,3 rhamnose).
As described herein, the rhamnose polysaccharides of the present invention may be conjugated with a suitable protein and presented on the surface of a bacterium.
Lanes 5 and 7 show that two different expression conditions for NewRhaPS system are positive for NanA-NewRhaPS glycosylation.
Lane 9: GAC chemically extracted from S. pyogenes (positive control for GAC antibody).
This description should not be construed as limiting and it will be appreciated that other variants and embodiments thereof fall within the scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
1908528.1 | Jun 2019 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/066314 | 6/12/2020 | WO |