Escherichia coli Nissle 1917 (EcN) is a probiotic bacterium originally isolated from a particularly healthy soldier from World War I by the physician Alfred Nissle 1. Since then, this bacterium has found significant use as a probiotic therapy, outcompeting pathogens in the gut 2 and thus protecting the host from infection. EcN has been at the forefront of probiotic genetic engineering 3, benefitting from the well-understood nature of E. coli biology, and from the many tools available to manipulate the organism. There are many projects working with engineered EcN 3, developing living therapeutics for diseases like hyperammonemia 4, and diagnostic tools for cancer detection 5.
In recent years, the gut microbiome has emerged as a critical factor for human health 6; however, the gut ecosystem remains poorly understood. One important approach to probing the gut microbiome is the development of engineered microbes that can sense and report on conditions in the gut 7 and deliver therapeutic molecules into the gut environment 8. Additionally, synthetic systems can provide insight into the behaviour of engineered bacteria in the gut environment 9, aiding further engineering efforts. As such, it is important to develop genetic tools that simplify bacterial engineering, both to study gut health and to accelerate the development of sophisticated probiotic bacteria capable of sensing and treating gut disorders.
Synthetic biology projects typically utilize plasmid vectors, circular DNA elements that can replicate within cells independently of the genome. Plasmids have many benefits: they are simple to manipulate, can be reliably transformed into E. coli cells, and can achieve high levels of gene expression due, in part, to a higher copy number than genomic DNA. Furthermore, several plasmids can be used in concert, allowing for modular assembly of complex synthetic genetic systems, as well as the simple independent testing of each plasmid in the system. An integral part of developing synthetic genetic systems is the iteration of prototypes in a design-build-test cycle 10, where during each cycle variants are tested to inform successive design iterations. Rapid and reliable genetic circuit construction and implementation is key for developing synthetic genetic systems, and plasmids offer an essential tool for this process. However, plasmid vectors also present a serious experimental limitation by requiring an antibiotic for selection and plasmid maintenance. In the context of in vivo therapeutic use in the gut, administration of an antibiotic is often incompatible with treatment and severely limits experiments, as it induces drastic changes in the host microbiome 11.
Synthetic plasmids have been employed to engineer bacteria for in vivo use; however, plasmid loss has been observed in the absence of antibiotic selection 12, although this issue has not been extensively investigated. Strategies have been proposed to overcome this plasmid loss 12, but these are still in early stages of development and come with their own drawbacks. Given the limitations of plasmids, EcN engineering projects that require stable transformants often insert DNA directly into the chromosome. However, genomic manipulations are typically limited by poor transformation efficiencies in EcN, and involve time-consuming and cumbersome protocols, impeding the iteration of genetic circuit designs. Furthermore, common genomic incorporation protocols such as Lambda Red-based methods can be inefficient and have limitations on insert length 13,14, further slowing or outright preventing the development of large multi-component synthetic genetic systems. Additionally, genomic incorporation limits recombinant DNA copy number to genomic copy number, making the achievement of high gene expression rates more difficult. Given the importance of rapid prototyping for the development of synthetic genetic systems, new paradigms are required to host synthetic DNA to facilitate the engineering of probiotic organisms.
The invention comprises an engineered strain of non-pathogenic E. coli harboring two pieces of modified plasmid DNA that enable it to secrete proteins inside the mammalian gastrointestinal (GI) tract. Diseases of the GI tract (e.g. Crohn's, ulcerative colitis) are hard to treat because it is difficult to maintain a steady amount of drug at the site of disease, due to the constant flow of material through the gut. One solution to this problem is to use a living bacterium as the drug delivery vehicle. The bacterium can multiply inside the gut, maintaining a relatively steady concentration over time, all while secreting a therapeutic directly at the site of disease. In order to accomplish this, one needs to genetically modify the bacterium so that it behaves appropriately. Genetic engineering of this type in E. coli Nissle, the most common starting point because of its safety profile and genetic tractability, can occur either through insertions into the chromosome or with smaller pieces of circular DNA called plasmids. Chromosomal modification is time-consuming and has other limitations. Plasmids are much easier to work with and modify, but they typically require constant antibiotic selection in order to stay associated with the bacterial host, something that is undesirable for use in a human patient. The plasmids disclosed herein were engineered to remain stably associated with E. coli Nissle in the absence of antibiotic selection, facilitating their use in vivo. Overall, the technology is a tool that one could use to engineer bacteria to perform better as a living therapeutic, such as a gut therapeutic.
In some aspects of the invention, disclosed herein are methods for producing a genetically modified bacterium, comprising introducing into a bacterium at least one engineered cryptic plasmid comprising a heterologous nucleic acid, wherein the heterologous nucleic acid comprises a nucleic acid sequence encoding a recombinant protein and a polypeptide secretion system for directing the recombinant protein to the outer membrane for secretion, wherein the bacterium does not comprise any native cryptic plasmids.
In certain aspects of the invention, provided herein is an engineered bacterium, comprising at least one engineered cryptic plasmid comprising a heterologous nucleic acid, wherein the heterologous nucleic acid comprises a nucleic acid sequence encoding a recombinant protein and a polypeptide secretion system for directing the recombinant protein to the outer membrane for secretion, wherein the bacterium does not comprise any native cryptic plasmids.
Bacteria isolated from clinical samples often contain plasmids, including small cryptic plasmids that are maintained at high copy number despite containing little genetic information and conferring no apparent phenotype 15. Many of these plasmids have no known function, although one study linked the presence of such small cryptic plasmids to phage resistance 16. EcN contains two such cryptic plasmids, pMUT1 and pMUT2, which are stable within the bacteria, survive passage through the gut, and are used as targets to detect EcN in clinical PCR assays 17. The pMUT plasmids do not confer any detectable phenotype, are not essential to EcN and do little to affect growth 18. Furthermore, the pMUT plasmids do not present a metabolic burden to EcN, at least under laboratory conditions 19. Whilst several projects have used pMUT plasmids to carry synthetic circuits 3, no systematic attempt has been made to domesticate engineered pMUT plasmids and characterize their efficacy in vivo.
Disclosed herein is the systematic engineering of the E. coli Nissle 1917 cryptic plasmids pMUT1 and pMUT2 to create a series of plasmid vectors for use in the gut. Several sites on each plasmid were tested for insertion of recombinant DNA cassettes containing selection and fluorescent markers, and gene expression was characterized in each case. It was found that the native plasmids were not lost upon transformation with an engineered variant; thus, a technique was developed to remove the native plasmids through a CRISPR-Cas9 mechanism. Further functionality was added to these plasmid vectors by adapting and expanding a temperature-sensitive expression system and by adding curli-based protein secretion to export proteins into the extracellular space. In vivo testing then demonstrated that EcN retained the engineered pMUT plasmids during passage through the mouse GI tract, and that EcN carrying the plasmids was capable of secreting recombinant protein into the extracellular space of the gut.
The invention disclosed herein, for the first time:
This invention also provides:
The invention as contemplated herein may be useful for:
In some aspects, the invention provided herein may be used as a tool for rapid design/testing of living therapeutics.
The disclosed invention is much more cost-efficient than biologics. For example, and without limitation, manufacturing involves much less downstream processing, and the manufactured product may be taken orally. Accordingly, the invention disclosed herein is sufficiently cost-effective to be of particular use in treating diseases in developing countries (e.g., diseases caused by enteric pathogens).
There is no living therapeutic currently on the market capable of protein secretion in the gut. The invention provided herein enables the production of such living therapeutics and their therapeutic use against a range of indications.
As used herein, the term “engineered bacterium” or “engineered bacterial cell” refers to a bacterial cell that has been genetically modified from its native state. For instance, an engineered bacterial cell may have nucleotide insertions, nucleotide deletions, nucleotide rearrangements, and nucleotide modifications introduced into its DNA. These genetic modifications may be present in the chromosome of the bacterium or bacterial cell, or on a plasmid in the bacterium or bacterial cell. Engineered bacterial cells of the disclosure may comprise exogenous nucleotide sequences on plasmids. Alternatively, engineered bacterial cells may comprise exogenous nucleotide sequences stably incorporated into their chromosome. In some embodiments, the engineered bacterium is non-pathogenic. In some embodiments, the engineered bacterium is pathogenic.
“Probiotic”, as used herein, refers to a live, non-pathogenic microorganism, e.g., a bacterium, which can confer health benefits to a host organism. In some embodiments, the host organism is a mammal. In some embodiments, the host organism is a human. Some species, strains, and/or subtypes of non-pathogenic bacteria are currently recognized as probiotic bacteria. Examples of probiotic bacteria include, but are not limited to, Bacteroides (e.g., Bacteroides fragilis, Bacteroides subtilis, and Bacteroides thetaiotaomicron) and Escherichia coli. In some embodiments, the probiotic is a Gram-negative bacterium. The probiotic may be a variant or a mutant strain of a bacterium. Non-pathogenic bacteria may be genetically engineered to enhance or improve desired biological properties, e.g., survivability. Non-pathogenic bacteria may be genetically engineered to provide probiotic properties. Probiotic bacteria may be genetically engineered to enhance or improve probiotic properties.
The term “antibody”, as used herein, refers to any immunoglobulin (Ig) molecule comprised of four polypeptide chains, two heavy (H) chains and two light (L) chains, or any functional fragment, mutant, variant, or derivative thereof. Such mutant, variant, or derivative antibody formats are known in the art. In a full-length antibody, each heavy chain is comprised of a heavy chain variable region (abbreviated herein as HCVR or VH) and a heavy chain constant region. The heavy chain constant region is comprised of three domains, CH1, CH2 and CH3. Each light chain is comprised of a light chain variable region (abbreviated herein as LCVR or VL) and a light chain constant region. The light chain constant region is comprised of one domain, CL. The VH and VL regions can be further subdivided into regions of hypervariability, termed complementarity determining regions (CDR), interspersed with regions that are more conserved, termed framework regions (FR). Each VH and VL is composed of three CDRs and four FRs, arranged from amino-terminus to carboxy-terminus in the following order: FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4. Immunoglobulin molecules can be of any type (e.g., IgG, IgE, IgM, IgD, IgA and IgY), class (e.g., IgG1, IgG2, IgG3, IgG4, IgA1 and IgA2) or subclass. In some embodiments, the antibody is a full-length antibody. In some embodiments, the antibody is a murine antibody. In some embodiments, the antibody is a human antibody. In some embodiments, the antibody is a humanized antibody. In other embodiments, the antibody is a chimeric antibody. Chimeric and humanized antibodies may be prepared by methods well known to those of skill in the art including CDR grafting approaches (see, e.g., U.S. Pat. Nos. 5,843,708; 6,180,370; 5,693,762; 5,585,089; and 5,530,101), chain shuffling strategies (see, e.g., U.S. Pat. No. 5,565,332; Rader et al. (1998) PROC. NAT'L. ACAD. SCI. USA 95: 8910-8915), molecular modeling strategies (U.S. Pat. No. 5,639,641), and the like.
In some embodiments, the antibody is a donkey antibody. In some embodiments, the antibody is a rat antibody. In some embodiments, the antibody is a horse antibody. In some embodiments, the antibody is a camel antibody. In some embodiments, the antibody is a shark antibody.
The term “antigen-binding portion” of an antibody (or simply “antibody fragment”), as used herein, refers to one or more fragments of an antibody that retain the ability to specifically bind to an antigen. It has been shown that the antigen-binding function of an antibody can be performed by fragments of a full-length antibody. Such antibody embodiments may also be bispecific, dual specific, or multi-specific formats, specifically binding to two or more different antigens. Examples of antibody fragments encompassed within the term “antigen-binding portion” of an antibody include (i) a Fab fragment, a monovalent fragment consisting of the VL, VH, CL and CH1 domains; (ii) a F(ab′)2 fragment, a bivalent fragment comprising two Fab fragments linked by a disulfide bridge at the hinge region; (iii) a Fd fragment consisting of the VH and CH1 domains; (iv) a Fv fragment consisting of the VL and VH domains of a single arm of an antibody; (v) a dAb fragment (Ward et al. (1989) Nature 341: 544-546; and PCT Publication No. WO 90/05144 A1, the contents of which are herein incorporated by reference), which comprises a single variable domain; and (vi) an isolated complementarity determining region (CDR). Furthermore, although the two domains of the Fv fragment, VL and VH, are coded for by separate genes, they can be joined, using recombinant methods, by a synthetic linker that enables them to be made as a single protein chain in which the VL and VH regions pair to form monovalent molecules (known as single chain Fv (scFv); see, e.g., Bird et al. (1988) Science 242:423-426; and Huston et al. (1988) Proc. Nat'l. Acad. Sci. USA 85:5879-5883). Such single chain antibodies are also intended to be encompassed within the term “antigen-binding portion” of an antibody. Other forms of single chain antibodies, such as diabodies, are also encompassed.
Antibody fragments also include single domain antibodies, maxibodies, minibodies, nanobodies, intrabodies, diabodies, triabodies, tetrabodies, v-NAR and bis-scFv (see, e.g., Hollinger and Hudson (2005) Nature Biotechnology 23:1126-1136).
A “single domain antibody”, as used herein, refers to the heavy chain variable domain (“VH”) of an antibody, i.e., a heavy chain variable domain without a light chain variable domain. Single domain antibodies are described, for example, in Hamers-Casterman et al. (1993) Nature 363:446-48, and Dumoulin et al. (2002) Protein Science 11:500-15. Single domain antibodies can be derived from multiple animals, including, for example, llama, alpaca, camel (i.e., camelid single domain antibodies), and shark.
As used herein, the term “gene” refers to a nucleic acid fragment that encodes a protein or fragment thereof, optionally including regulatory sequences preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence. In one embodiment, a “gene” does not include regulatory sequences preceding and following the coding sequence.
As used herein, a “heterologous” gene, “heterologous sequence”, or “heterologous nucleic acid” refers to a nucleic acid sequence that is not normally found in a given cell in nature. As used herein, a heterologous sequence encompasses a nucleic acid sequence that is exogenously introduced into a given cell. “Heterologous gene” includes a native gene, or fragment thereof, that has been introduced into the host cell in a form that is different from the corresponding native gene. A heterologous gene may include a native gene, or fragment thereof, introduced into a non-native host cell. Thus, a heterologous gene may be foreign or native to the recipient cell; a nucleic acid sequence that is naturally found in a given cell but expresses an unnatural amount of the nucleic acid and/or the polypeptide which it encodes; and/or two or more nucleic acid sequences that are not found in the same relationship to each other in nature.
As used herein, the term “endogenous gene” refers to a native gene in its natural location in the genome of an organism.
As used herein, the term “transgene” refers to a gene that has been introduced into the host organism, e.g., into the genome of a host bacterial cell.
As used herein, a “SecA-dependent secretion signal” refers to a polypeptide sequence which, when present on a polypeptide, e.g., at the N-terminus of a polypeptide, can cause the polypeptide to be exported from the cytoplasm of a bacterium across the inner membrane as mediated by a bacterial Sec system. In some embodiments, the SecA-dependent secretion signal is the polypeptide having the sequence of the E. coli CsgA SecA-dependent secretion signal and homologs and/or variants, including conservative substitution variants, thereof.
As used herein, a “signal recognition particle (SRP) pathway signal sequence” refers to a polypeptide sequence which, when present on a polypeptide (e.g., at the N-terminus of a polypeptide), can cause the polypeptide to be exported from the cytoplasm of a bacterial cell across the inner membrane as mediated by the signal recognition particle (SRP) pathway proteins. In some embodiments, the polypeptide is translated and transported across the inner membrane concurrently, thus guiding the nascent polypeptide into the periplasm. In some embodiments, the SRP pathway signal sequence is the SRP signal sequence from CcmH, DsbA, FocC, NikA, SfmC, TolB, TorT, YraI, or homologs and/or variants, including conservative substitution variants, thereof.
As used herein, a “CsgGE export signal sequence” refers to a polypeptide sequence which, when present at the N-terminus of a polypeptide, can cause the polypeptide to be targeted by CsgE and exported across the outer membrane of the cell via the CsgG oligomeric transport complex of a curli export system, or by an orthologous export system. In some embodiments, the CsgG targeting sequence is the last 22 amino acids of the bipartite curli signal sequence of an endogenous polypeptide exported by the curli export system. In some embodiments, the CsgG targeting sequence can be a polypeptide having the sequence of an E. coli CsgA CsgGE export signal sequence and homologs and/or variants, including conservative substitution variants, thereof.
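By way of illustration only, taking “the last 22 amino acids of the bipartite curli signal sequence” can be sketched in Python as follows. The function name `split_bipartite_signal` and the placeholder sequence are hypothetical and are not part of the disclosure; in particular, the placeholder is not a real CsgA sequence, and the length chosen for its leading portion is arbitrary.

```python
from typing import Tuple


def split_bipartite_signal(signal: str, targeting_len: int = 22) -> Tuple[str, str]:
    """Split a bipartite signal sequence (one-letter amino acid code) into
    its leading portion and its C-terminal targeting portion.

    `targeting_len` defaults to 22, following the definition above.
    """
    if targeting_len < 1:
        raise ValueError("targeting_len must be at least 1")
    if len(signal) < targeting_len:
        raise ValueError("signal is shorter than the targeting portion")
    return signal[:-targeting_len], signal[-targeting_len:]


# Hypothetical placeholder: 'S' marks the leading portion and 'N' marks the
# last 22 residues. This is NOT a real curli signal sequence.
placeholder_signal = "S" * 20 + "N" * 22
leading, targeting = split_bipartite_signal(placeholder_signal)
```

With a real signal sequence supplied by the user, `targeting` would hold the candidate CsgG targeting portion as defined in this paragraph.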
A “promoter” as used herein, refers to a nucleotide sequence that is capable of controlling the expression of a coding sequence or gene. Promoters are generally located 5′ of the sequence that they regulate. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from promoters found in nature, and/or comprise synthetic nucleotide segments. Those skilled in the art will readily ascertain that different promoters may regulate expression of a coding sequence or gene in response to a particular stimulus, e.g., in a cell-specific or tissue-specific manner, in response to different environmental or physiological conditions, or in response to specific compounds. Prokaryotic promoters are typically classified into two classes: inducible and constitutive.
“Constitutive promoter” refers to a promoter that is capable of facilitating continuous transcription of a coding sequence or gene under its control and/or to which it is operably linked. Constitutive promoters and variants are well known in the art and include, but are not limited to, a constitutive Escherichia coli σs promoter, a constitutive Escherichia coli σ32 promoter, a constitutive Escherichia coli σ70 promoter, a constitutive Bacillus subtilis σA promoter, a constitutive Bacillus subtilis σB promoter, and a bacteriophage T7 promoter.
An “inducible promoter” refers to a promoter that initiates increased levels of transcription of the coding sequence or gene under its control in response to a stimulus or an exogenous environmental condition. A “directly inducible promoter” refers to a regulatory region, wherein the regulatory region is operably linked to a gene encoding a protein or polypeptide, where, in the presence of an inducer of said regulatory region, the protein or polypeptide is expressed. An “indirectly inducible promoter” refers to a regulatory system comprising two or more regulatory regions, for example, a first regulatory region that is operably linked to a first gene encoding a first protein, polypeptide, or factor, e.g., a transcriptional regulator, which is capable of regulating a second regulatory region that is operably linked to a second gene; the second regulatory region may be activated or repressed, thereby activating or repressing expression of the second gene. Both a directly inducible promoter and an indirectly inducible promoter are encompassed by “inducible promoter.” For example, and without limitation, chemical agents, temperature, and light may be used for induction of the promoters contemplated herein. Preferably, the promoter is a temperature sensitive promoter.
As used herein, the term “expression” refers to the transcription and stable accumulation of sense (mRNA) or anti-sense RNA derived from a nucleic acid, and/or to translation of an mRNA into a polypeptide.
The term “genetic modification,” as used herein, refers to any genetic change. Exemplary genetic modifications include those that increase, decrease, or abolish the expression of a gene, including, for example, modifications of native chromosomal or extrachromosomal genetic material. Exemplary genetic modifications also include the introduction of at least one plasmid, modification, mutation, base deletion, base addition, and/or codon modification of chromosomal or extrachromosomal genetic sequence(s), gene over-expression, gene amplification, gene suppression, promoter modification or substitution, gene addition (either single or multi-copy), antisense expression or suppression, or any other change to the genetic elements of a host cell, whether the change produces a change in phenotype or not. Genetic modification can include the introduction of a plasmid, e.g., a plasmid comprising a gene encoding at least one amino acid catabolism enzyme operably linked to a promoter, into a bacterial cell. Genetic modification can also involve a targeted replacement in the chromosome, e.g., to replace a native gene promoter with an inducible promoter, regulated promoter, strong promoter, or constitutive promoter. Genetic modification can also involve gene amplification, e.g., introduction of at least one additional copy of a native gene into the chromosome of the cell. Alternatively, chromosomal genetic modification can involve a genetic mutation.
The term “isolated” or “partially purified” as used herein refers, in the case of a nucleic acid or polypeptide, to a nucleic acid or polypeptide separated from at least one other component (e.g., nucleic acid or polypeptide) that is present with the nucleic acid or polypeptide as found in its natural source and/or that would be present with the nucleic acid or polypeptide when expressed by a cell, or secreted in the case of secreted polypeptides. A chemically synthesized nucleic acid or polypeptide or one synthesized using in vitro transcription/translation is considered “isolated.”
As used herein, the term “exogenous” refers to a substance (e.g., a nucleic acid or polypeptide) present in a cell other than its native source. The term exogenous can refer to a nucleic acid or a protein that has been introduced by a process involving the hand of man into a biological system such as a cell or organism in which it is not normally found or in which it is found in undetectable amounts. A substance can be considered exogenous if it is introduced into a cell or an ancestor of the cell that inherits the substance. In contrast, the term “endogenous” refers to a substance that is native to the biological system or cell.
A “non-amyloid polypeptide”, as used herein, refers to a polypeptide that does not form amyloid aggregates in a cell (e.g., a bacterial cell). An “amyloid polypeptide” refers to a polypeptide that forms amyloid aggregates in a bacterial cell. An “amyloidogenic polypeptide” refers to a polypeptide that either forms or increases the formation of amyloid aggregates in a cell. In some embodiments, the therapeutic polypeptide is a non-amyloid polypeptide. In some embodiments, the polypeptide is a non-amyloidogenic polypeptide. In some embodiments, the therapeutic polypeptide is an amyloid polypeptide. In some embodiments, the therapeutic polypeptide is an amyloidogenic polypeptide.
A “pharmaceutical composition,” as used herein, refers to a composition comprising an active ingredient (e.g., a bacterial cell, an inducer, a drug, or a detectable compound) with other components such as a physiologically suitable carrier and/or excipient.
As used herein, the term “pharmaceutically acceptable” or “pharmacologically acceptable” refers to those compounds, materials, compositions, and/or dosage forms which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of human beings and animals without excessive toxicity, irritation, allergic response, or other problem or complication, commensurate with a reasonable benefit/risk ratio. Moreover, for animal (e.g., human) administration, it will be understood that compositions should meet sterility, pyrogenicity, general safety and purity standards as required by the FDA Office of Biological Standards.
As used herein, the term “pharmaceutically acceptable excipient” means a pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, manufacturing aid (e.g., lubricant, talc, magnesium, calcium or zinc stearate, or stearic acid), or solvent encapsulating material, involved in carrying or transporting the subject compound from one organ, or portion of the body, to another organ, or portion of the body. Each carrier must be “acceptable” in the sense of being compatible with the other ingredients of the formulation and not injurious to the patient. Some examples of materials which can serve as pharmaceutically-acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches, such as corn starch and potato starch; (3) cellulose, and its derivatives, such as sodium carboxymethyl cellulose, methylcellulose, ethyl cellulose, microcrystalline cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) lubricating agents, such as magnesium stearate, sodium lauryl sulfate and talc; (8) excipients, such as cocoa butter and suppository waxes; (9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin, sorbitol, mannitol and polyethylene glycol (PEG); (12) esters, such as ethyl oleate and ethyl laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's solution; (19) ethyl alcohol; (20) pH buffered solutions; (21) polyesters, polycarbonates and/or polyanhydrides; (22) bulking agents, such as polypeptides and amino acids; (23) serum components, such as serum albumin, HDL and LDL; (24) C2-C12 alcohols, such as ethanol; and (25) other non-toxic compatible substances employed in pharmaceutical formulations.
Wetting agents, coloring agents, release agents, coating agents, disintegrating agents, binders, sweetening agents, flavoring agents, perfuming agents, protease inhibitors, plasticizers, emulsifiers, stabilizing agents, viscosity increasing agents, film forming agents, solubilizing agents, surfactants, preservatives and antioxidants can also be present in the formulation. Terms such as “excipient”, “carrier”, “pharmaceutically acceptable excipient” and the like are used interchangeably herein.
A “plasmid” or “vector” includes a nucleic acid construct designed for delivery to a host cell or transfer between different host cells. An “expression plasmid” or “expression vector” can be a plasmid that has the ability to incorporate and express heterologous nucleic acid fragments in a cell. An expression plasmid may comprise additional elements; for example, the expression vector may have two replication systems, thus allowing it to be maintained in two organisms. The nucleic acid incorporated into the plasmid can be operatively linked to an expression control sequence when the expression control sequence controls and regulates the transcription and translation of that polynucleotide sequence. In some embodiments of the invention disclosed herein, the plasmid or vector is derived from a cryptic plasmid, such as, but not limited to, pMUT1 and/or pMUT2.
As used herein, the terms “protein” and “polypeptide” are used interchangeably to designate a series of amino acid residues, connected to each other by peptide bonds between the alpha-amino and carboxy groups of adjacent residues. The terms “protein” and “polypeptide” refer to a polymer of amino acids, including modified amino acids (e.g., phosphorylated, glycated, glycosylated, etc.) and amino acid analogs, regardless of its size or function. The terms “protein” and “polypeptide” as used herein refer to both large polypeptides and small peptides. The terms “protein” and “polypeptide” are used interchangeably herein when referring to a gene product and fragments thereof. Thus, exemplary polypeptides or proteins include gene products, naturally occurring proteins, homologs, orthologs, paralogs, fragments and other equivalents, variants, and analogs of the foregoing.
As used herein, the term “therapeutic polypeptide” refers to any polypeptide that has a therapeutic effect or may be used for diagnostic purposes when introduced into a eukaryotic organism (e.g., a mammalian subject such as human). In some embodiments, the therapeutic polypeptide is an antibody. In some embodiments, the therapeutic polypeptide is a single domain antibody. In some embodiments, the therapeutic polypeptide is a fusion protein, a hormone, an antigen, a thrombolytic agent, a cytokine or a growth factor. In some embodiments, the therapeutic polypeptide is an immunotoxin (e.g., an antibody fused to a cellular toxin).
The term “operatively linked” includes having an appropriate transcription start signal (e.g., promoter) in front of the polynucleotide sequence to be expressed, and having an appropriate translation start signal (e.g., a Shine Delgarno sequence and a start codon (ATG)) in front of the polypeptide coding sequence and maintaining the correct reading frame to permit expression of the polynucleotide sequence under the control of the expression control sequence, and, optionally, production of the desired polypeptide encoded by the polynucleotide sequence. In some examples, transcription of a gene encoding a recombinant polypeptide as described herein is under the control of a promoter sequence (or other transcriptional regulatory sequence) which controls the expression of the nucleic acid in a cell-type in which expression is intended. It will also be understood that the gene encoding a recombinant polypeptide as described herein can be under the control of transcriptional regulatory sequences which are the same or which are different from those sequences which control transcription of the naturally-occurring form of a protein.
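As a non-limiting illustration of the translation start signal and reading frame requirements described above, the following Python sketch checks a DNA string for a Shine-Dalgarno-like motif followed by an ATG start codon that opens a reading frame ending in an in-frame stop codon. The function names, the motif `AGGAGG`, and the 5 to 10 nucleotide spacer window are assumptions chosen for illustration and are not taken from the disclosure. The sketch does not look for a promoter and does not predict expression levels.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_in_frame_stop(dna: str, start: int) -> bool:
    """True if a stop codon occurs in frame with the codon beginning at `start`."""
    for i in range(start + 3, len(dna) - 2, 3):
        if dna[i:i + 3] in STOP_CODONS:
            return True
    return False


def has_translation_start(dna: str, sd_motif: str = "AGGAGG",
                          min_spacer: int = 5, max_spacer: int = 10) -> bool:
    """True if `sd_motif` is followed, within the spacer window, by an ATG
    that begins a reading frame terminated by an in-frame stop codon.

    The motif and spacer window are illustrative assumptions only.
    """
    dna = dna.upper()
    sd_pos = dna.find(sd_motif)
    while sd_pos != -1:
        sd_end = sd_pos + len(sd_motif)
        for spacer in range(min_spacer, max_spacer + 1):
            start = sd_end + spacer
            if dna[start:start + 3] == "ATG" and has_in_frame_stop(dna, start):
                return True
        sd_pos = dna.find(sd_motif, sd_pos + 1)
    return False


# Toy sequences (hypothetical, not from the disclosure).
in_frame = "AGGAGG" + "AAAAAAA" + "ATG" + "GCTGCA" + "TAA"
frameshifted = "AGGAGG" + "AAAAAAA" + "ATG" + "GCTGC" + "TAA"
```

In the toy sequences, `in_frame` keeps the stop codon in the same frame as the ATG, whereas deleting a single base in `frameshifted` moves the stop codon out of frame, so the check fails.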
The terms “overexpression” or “overexpress”, as used herein, refer to the expression of a functional nucleic acid, polypeptide or protein encoded by DNA in a host cell, wherein the nucleic acid, polypeptide or protein is either not normally present in the host cell, or wherein the nucleic acid, polypeptide or protein is present in the host cell at a higher level than that normally expressed from the endogenous gene encoding the nucleic acid, polypeptide or protein.
A “nucleic acid” or “nucleic acid sequence” may be any molecule, preferably a polymeric molecule, incorporating units of ribonucleic acid, deoxyribonucleic acid or an analog thereof. The nucleic acid can be either single-stranded or double-stranded. A single-stranded nucleic acid can be one nucleic acid strand of a denatured double-stranded DNA. Alternatively, it can be a single-stranded nucleic acid not derived from any double-stranded DNA. In one aspect, the nucleic acid can be DNA. In another aspect, the nucleic acid can be RNA. Suitable nucleic acid molecules are DNA, including genomic DNA or cDNA. Other suitable nucleic acid molecules are RNA, including mRNA.
The terms “decrease”, “reduced”, “reduction”, or “inhibit” are all used herein to mean a decrease by a statistically significant amount. In some embodiments, the terms “reduced”, “reduction”, “decrease”, or “inhibit” can mean a decrease by at least 10% as compared to a reference level, for example a decrease by at least about 20%, or at least about 30%, or at least about 40%, or at least about 50%, or at least about 60%, or at least about 70%, or at least about 80%, or at least about 90% or more or any decrease of at least 10% as compared to a reference level. In some embodiments, the terms can represent a 100% decrease, i.e. a non-detectable level as compared to a reference level. In the context of a marker or symptom, a “decrease” is a statistically significant decrease in such level. The decrease can be, for example, at least 10%, at least 20%, at least 30%, at least 40% or more, and is preferably down to a level accepted as within the range of normal for an individual without such disorder.
The terms “increased”, “increase”, “enhance”, or “activate” are all used herein to mean an increase by a statistically significant amount. In some embodiments, the terms “increased”, “increase”, “enhance”, or “activate” can mean an increase of at least 10% as compared to a reference level, for example an increase of at least about 20%, or at least about 30%, or at least about 40%, or at least about 50%, or at least about 60%, or at least about 70%, or at least about 80%, or at least about 90% or up to and including a 100% increase or any increase between 10-100% as compared to a reference level, or at least about a 2-fold, or at least about a 3-fold, or at least about a 4-fold, or at least about a 5-fold or at least about a 10-fold increase, or any increase between 2-fold and 10-fold or greater as compared to a reference level. In the context of a marker or symptom, an “increase” is a statistically significant increase in such level.
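By way of non-limiting illustration, the percentage and fold thresholds recited for a “decrease” and an “increase” can be computed relative to a reference level as sketched below. The function names and the default 10% threshold argument are assumptions chosen for this example; the sketch tests only the magnitude of the change and does not establish statistical significance, which the definitions also require.

```python
def percent_change(value: float, reference: float) -> float:
    """Signed percent change of value relative to a non-zero reference level."""
    if reference == 0:
        raise ValueError("reference level must be non-zero")
    return (value - reference) / reference * 100.0


def fold_change(value: float, reference: float) -> float:
    """Ratio of value to a non-zero reference level (2.0 means 2-fold)."""
    if reference == 0:
        raise ValueError("reference level must be non-zero")
    return value / reference


def is_decrease(value: float, reference: float, threshold_pct: float = 10.0) -> bool:
    """True if value is lower than the reference by at least threshold_pct percent."""
    return -percent_change(value, reference) >= threshold_pct


def is_increase(value: float, reference: float, threshold_pct: float = 10.0) -> bool:
    """True if value is higher than the reference by at least threshold_pct percent."""
    return percent_change(value, reference) >= threshold_pct
```

For example, a measured level of 80 against a reference level of 100 is a 20% decrease, and a measured level of 300 against the same reference is a 3-fold level, i.e. a 200% increase.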
The term “non-pathogenic” as used herein to refer to bacteria refers to bacteria that are not capable of causing disease or harmful responses in a host. In some embodiments, non-pathogenic bacteria are commensal bacteria. Examples of non-pathogenic bacteria include, but are not limited to Bacteroides and Escherichia coli, e.g., Escherichia coli Nissle 1917, Bacteroides fragilis, Bacteroides subtilis, and Bacteroides thetaiotaomicron. Naturally pathogenic bacteria may be genetically engineered to reduce or eliminate pathogenicity.
As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid and retains the desired activity of the polypeptide. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles consistent with the disclosure. A given amino acid can be replaced by a residue having similar physicochemical characteristics, e.g., substituting one aliphatic residue for another (such as Ile, Val, Leu, or Ala for one another), or substitution of one polar residue for another (such as between Lys and Arg; Glu and Asp; or Gln and Asn). Other such conservative substitutions, e.g., substitutions of entire regions having similar hydrophobicity characteristics, are well known. Polypeptides comprising conservative amino acid substitutions can be tested in any one of the assays described herein to confirm that a desired activity is retained. Amino acids can be grouped according to similarities in the properties of their side chains (in A. L. Lehninger, in Biochemistry, second ed., pp. 73-75, Worth Publishers, New York (1975)): (1) non-polar: Ala (A), Val (V), Leu (L), Ile (I), Pro (P), Phe (F), Trp (W), Met (M); (2) uncharged polar: Gly (G), Ser (S), Thr (T), Cys (C), Tyr (Y), Asn (N), Gln (Q); (3) acidic: Asp (D), Glu (E); (4) basic: Lys (K), Arg (R), His (H).
Alternatively, naturally occurring residues can be divided into groups based on common side-chain properties: (1) hydrophobic: Norleucine, Met, Ala, Val, Leu, Ile; (2) neutral hydrophilic: Cys, Ser, Thr, Asn, Gln; (3) acidic: Asp, Glu; (4) basic: His, Lys, Arg; (5) residues that influence chain orientation: Gly, Pro; (6) aromatic: Trp, Tyr, Phe. Non-conservative substitutions will entail exchanging a member of one of these classes for another class. Particular conservative substitutions include, for example; Ala into Gly or into Ser; Arg into Lys; Asn into Gln or into His; Asp into Glu; Cys into Ser; Gln into Asn; Glu into Asp; Gly into Ala or into Pro; His into Asn or into Gln; Ile into Leu or into Val; Leu into Ile or into Val; Lys into Arg, into Gln or into Glu; Met into Leu, into Tyr or into Ile; Phe into Met, into Leu or into Tyr; Ser into Thr; Thr into Ser; Trp into Tyr; Tyr into Trp; and/or Phe into Val, into Ile or into Leu.
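By way of non-limiting illustration, the six side-chain classes recited above can be encoded as a lookup table to test whether a substitution stays within a class. The names `SIDE_CHAIN_CLASSES` and `is_within_class`, and the abbreviation Nle for norleucine, are assumptions made for this example. The class test is a simplification: some of the particular conservative substitutions listed above (for example Ala into Gly) cross these class boundaries, so a result of False does not by itself exclude a substitution from being conservative.

```python
# Side-chain classes as recited in the text, keyed by three-letter code.
SIDE_CHAIN_CLASSES = {
    "hydrophobic": {"Nle", "Met", "Ala", "Val", "Leu", "Ile"},
    "neutral hydrophilic": {"Cys", "Ser", "Thr", "Asn", "Gln"},
    "acidic": {"Asp", "Glu"},
    "basic": {"His", "Lys", "Arg"},
    "chain orientation": {"Gly", "Pro"},
    "aromatic": {"Trp", "Tyr", "Phe"},
}


def side_chain_class(residue: str) -> str:
    """Return the name of the class containing the given residue."""
    for name, members in SIDE_CHAIN_CLASSES.items():
        if residue in members:
            return name
    raise ValueError(f"unknown residue: {residue}")


def is_within_class(original: str, replacement: str) -> bool:
    """True if both residues belong to the same side-chain class."""
    return side_chain_class(original) == side_chain_class(replacement)
```

Under this grouping, Leu into Ile and Trp into Tyr remain within a class, whereas Asp into Lys exchanges an acidic residue for a basic residue.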
In some embodiments, polypeptides described herein can be a variant of a sequence described herein. In some embodiments, the variant is a conservatively modified variant. Conservative substitution variants can be obtained by mutations of native nucleotide sequences, for example. A “variant,” as referred to herein, is a polypeptide substantially homologous to a native or reference polypeptide, but which has an amino acid sequence different from that of the native or reference polypeptide because of one or a plurality of deletions, insertions or substitutions. Variant polypeptide-encoding DNA sequences encompass sequences that comprise one or more additions, deletions, or substitutions of nucleotides when compared to a native or reference DNA sequence, but that encode a variant protein or fragment thereof that retains activity, e.g. ability to target a polypeptide for export via the curli export system. A wide variety of PCR-based site-specific mutagenesis approaches are also known in the art and can be applied by the ordinarily skilled artisan.
A variant amino acid or DNA sequence can be at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or more, identical to a native or reference sequence. The degree of homology (percent identity) between a native and a mutant sequence can be determined, for example, by comparing the two sequences using freely available computer programs commonly employed for this purpose on the world wide web (e.g., BLASTp or BLASTn with default settings).
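By way of non-limiting illustration, percent identity between a reference sequence and a variant sequence can be computed from a pairwise alignment as sketched below. The sketch assumes that the two sequences have already been aligned to equal length with `-` as the gap character; it is not the BLAST algorithm, which performs its own local alignment and reports identity over the aligned region. The function names are chosen for this example only.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Columns in which both sequences have a gap are ignored; a gap opposite
    a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if the aligned sequences are at least threshold percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two aligned ten-residue sequences that differ at one position are 90% identical and therefore satisfy the lowest threshold recited above.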
Alterations of the native amino acid sequence can be accomplished by any of a number of techniques known to one of skill in the art. Mutations can be introduced, for example, at particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked by restriction sites enabling ligation to fragments of the native sequence. Following ligation, the resulting reconstructed sequence encodes an analog having the desired amino acid insertion, substitution, or deletion. Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can be employed to provide an altered nucleotide sequence having particular codons altered according to the substitution, deletion, or insertion required. Techniques for making such alterations are very well established and include, for example, those disclosed by Walder et al. (Gene 42:133, 1986); Bauer et al. (Gene 37:73, 1985); Craik (BioTechniques, January 1985, 12-19); Smith et al. (Genetic Engineering: Principles and Methods, Plenum Press, 1981); and U.S. Pat. Nos. 4,518,584 and 4,737,462, which are herein incorporated by reference in their entireties. Any cysteine residue not involved in maintaining the proper conformation of the polypeptide also can be substituted, generally with serine, to improve the oxidative stability of the molecule and prevent aberrant crosslinking. Conversely, cysteine bond(s) can be added to the polypeptide to improve its stability or facilitate oligomerization.
The term “statistically significant” or “significantly” refers to statistical significance and generally means a two standard deviation (2SD) or greater difference.
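By way of non-limiting illustration, the two standard deviation criterion can be applied to a set of reference measurements as sketched below. The use of the sample standard deviation (`statistics.stdev`) and the function name `differs_by_2sd` are assumptions made for this example; the definition above does not specify how the standard deviation is estimated.

```python
import statistics


def differs_by_2sd(value: float, reference_values: list[float], n_sd: float = 2.0) -> bool:
    """True if value lies at least n_sd sample standard deviations from the
    mean of the reference measurements (at least two measurements required)."""
    mean = statistics.mean(reference_values)
    sd = statistics.stdev(reference_values)
    return abs(value - mean) >= n_sd * sd
```

With reference measurements of 9, 10, 11, 12 and 13, the mean is 11 and the sample standard deviation is approximately 1.58, so a value of 15 satisfies the criterion while a value of 13 does not.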
Other than in the operating examples, or where otherwise indicated, all numbers expressing quantities of ingredients or reaction conditions used herein should be understood as modified in all instances by the term “about.” The term “about” when used in connection with percentages can mean ±1%.
The articles “a” and “an,” as used herein, should be understood to mean “at least one,” unless clearly indicated to the contrary.
The phrase “and/or,” when used between elements in a list, is intended to mean either (1) that only a single listed element is present, or (2) that more than one element of the list is present. For example, “A, B, and/or C” indicates that the selection may be A alone; B alone; C alone; A and B; A and C; B and C; or A, B, and C. The phrase “and/or” may be used interchangeably with “at least one of” or “one or more of” the elements in a list.
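By way of non-limiting illustration, the selections covered by “and/or” correspond to every non-empty combination of the listed elements, which can be enumerated as sketched below. The function name `and_or_selections` is chosen for this example only.

```python
from itertools import combinations


def and_or_selections(elements: list[str]) -> list[tuple[str, ...]]:
    """Every non-empty combination of the listed elements, smallest first."""
    return [
        combo
        for size in range(1, len(elements) + 1)
        for combo in combinations(elements, size)
    ]
```

Applied to the elements A, B and C, the function yields the seven selections recited above, in the same order.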
Ranges provided herein are understood to be shorthand for all of the values within the range. For example, a range of 1 to 50 is understood to include any number, combination of numbers, or sub-range from the group consisting of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, and 50.
In some aspects of the invention, disclosed herein are methods for producing a genetically modified bacterium, comprising introducing into a bacterium at least one engineered cryptic plasmid comprising a heterologous nucleic acid, wherein the heterologous nucleic acid comprises a nucleic acid sequence encoding a recombinant protein and a polypeptide secretion system for directing the recombinant protein to the outer membrane for secretion, wherein the bacterium does not comprise any native cryptic plasmids.
In some embodiments, the at least one engineered cryptic plasmid is an engineered pMUT1 or pMUT2. In some such embodiments, the nucleic acid sequence encoding the recombinant protein and polypeptide secretion system (e.g., the expression cassette) is inserted into the pMUT1 backbone or pMUT2 backbone at insertion sites as indicated in Table 3. In some embodiments, the nucleic acid sequence encoding the recombinant protein and polypeptide secretion system (e.g., the expression cassette) is inserted within a site amplified by a primer pair comprising the sequences set forth in SEQ ID NOs: 21 and 22; SEQ ID NOs: 23 and 24; SEQ ID NOs: 25 and 26; or SEQ ID NOs: 27 and 28. In some embodiments, the nucleic acid sequence encoding the recombinant protein and polypeptide secretion system comprises a curli fiber secretion system. For example, and without limitation, the nucleic acid sequence encoding the recombinant protein and polypeptide secretion system comprises a synthetic csgBACEFG operon.
In some preferred embodiments, the heterologous nucleic acid sequence encodes a recombinant protein fused to a CsgA monomer, e.g., an engineered CsgA as contemplated herein. In some embodiments, the recombinant protein comprises a therapeutic polypeptide selected from the group consisting of an antibody, an antibody fragment, an enzyme, a fusion protein, a hormone, an antigen, a thrombolytic agent, a cytokine, an immunotoxin, and a growth factor. In some such embodiments, the therapeutic polypeptide is an antibody fragment; and the antibody fragment is a single chain antibody, such as a nanobody. In some embodiments, the single chain antibody is specific for an antigen selected from the group consisting of: carcinoembryonic antigen (CEA), glucose transporter 1 (GLUT1), green fluorescent protein (GFP), beta-lactamase, Clostridium difficile Toxin A, Clostridium difficile Toxin B, botulinum toxin (BoTox), cholera toxin (CTX), norovirus capsid protein, rotavirus capsid protein, and Plasmodium membrane protein. Contemplated embodiments also include the therapeutic polypeptide fused to an amyloid polypeptide. As a non-limiting example, the amyloid polypeptide may comprise at least one curli subunit.
In some embodiments, the engineered cryptic plasmid lacks a selectable marker gene. In some embodiments, the heterologous nucleic acid is operably linked to an inducible promoter. Such inducible promoters may be responsive to an inducer selected from the group consisting of IPTG, arabinose, tetracycline, and permissive temperature change. In preferred embodiments, the inducible promoter is a temperature sensitive promoter.
In some embodiments, the bacterium of the invention retains the engineered cryptic plasmid in the absence of a selectable marker. In some embodiments, the methods provided herein further comprise plasmid-curing the bacterium prior to introduction of the engineered cryptic plasmid.
In some aspects, described herein are engineered microbial cells comprising an engineered CsgA polypeptide and/or comprising a vector (e.g., an engineered cryptic vector) or nucleic acid encoding such a polypeptide. In certain aspects of the invention, provided herein is an engineered bacterium comprising at least one engineered cryptic plasmid comprising a heterologous nucleic acid, wherein the heterologous nucleic acid comprises a nucleic acid sequence encoding a recombinant protein and a polypeptide secretion system for directing the recombinant protein to the outer membrane for secretion, wherein the bacterium does not comprise any native cryptic plasmids. In some embodiments, the at least one engineered cryptic plasmid is an engineered pMUT1 or pMUT2.
In some embodiments, the nucleic acid sequence encoding the recombinant protein and polypeptide secretion system (e.g., an expression cassette) is inserted into the pMUT1 backbone or pMUT2 backbone at insertion sites as indicated in Table 3. In some embodiments, the nucleic acid sequence encoding the recombinant protein and polypeptide secretion system (e.g., the expression cassette) is inserted within a site amplified by a primer pair comprising the sequences set forth in SEQ ID NOs: 21 and 22; SEQ ID NOs: 23 and 24; SEQ ID NOs: 25 and 26; or SEQ ID NOs: 27 and 28. In some such embodiments, the nucleic acid sequence encoding the recombinant protein and polypeptide secretion system comprises a curli fiber secretion system. In some preferred embodiments, the nucleic acid sequence encoding the recombinant protein and polypeptide secretion system comprises a synthetic csgBACEFG operon. In some embodiments, the heterologous nucleic acid sequence encodes a recombinant protein fused to a CsgA monomer, e.g., an engineered CsgA.
In some embodiments of the engineered bacterium provided herein, the recombinant protein comprises a therapeutic polypeptide selected from the group consisting of an antibody, an antibody fragment, an enzyme, a fusion protein, a hormone, an antigen, a thrombolytic agent, a cytokine, an immunotoxin, and a growth factor. In some such embodiments, the therapeutic polypeptide is an antibody fragment; and the antibody fragment is a single chain antibody, such as a nanobody. The single chain antibodies provided herein may be specific for an antigen selected from the group consisting of: carcinoembryonic antigen (CEA), glucose transporter 1 (GLUT1), green fluorescent protein (GFP), beta-lactamase, Clostridium difficile Toxin A, Clostridium difficile Toxin B, botulinum toxin (BoTox), cholera toxin (CTX), norovirus capsid protein, rotavirus capsid protein, and Plasmodium membrane protein.
In some embodiments, the therapeutic polypeptide is fused to an amyloid polypeptide. In some such embodiments, the amyloid polypeptide comprises at least one curli subunit. Accordingly, in some preferred embodiments the therapeutic polypeptide may be fused to a CsgA subunit, e.g., an engineered CsgA.
In some embodiments, the engineered bacterium and/or the engineered cryptic plasmid lacks a selectable marker gene.
In some embodiments, the heterologous nucleic acid, e.g., expression cassette, is operably linked to an inducible promoter. In some such embodiments, the inducible promoter is responsive to an inducer selected from the group consisting of IPTG, arabinose, tetracycline, and permissive temperature change. Preferably, the inducible promoter is a temperature sensitive promoter.
In some embodiments of the invention, the bacterium retains the engineered cryptic plasmid in the absence of a selectable marker.
In certain embodiments, the heterologous nucleic acid further comprises a nucleic acid sequence encoding a polypeptide tag. Such polypeptide tags may be selected from the group consisting of a poly-histidine tag, a myc tag, a FLAG tag, a hemagglutinin (HA) tag, and a V5 tag.
In some embodiments, the engineered bacterium is a non-pathogenic bacterium. In some embodiments, the engineered bacterium is a bacterium of the genus Bacteroides or Escherichia. The engineered bacterium may be a probiotic bacterium. In some preferred embodiments, the engineered bacterium is Escherichia coli, such as Escherichia coli strain Nissle 1917. In some such embodiments, the engineered bacterium does not comprise a native csgBACEFG operon.
In some embodiments, the engineered CsgA polypeptide can comprise a functional polypeptide. In some embodiments, the engineered CsgA polypeptide can comprise a functional polypeptide comprising a conjugation domain. In some embodiments, a cell encoding and/or comprising an engineered CsgA polypeptide can comprise an activity polypeptide. In some embodiments, a cell encoding and/or comprising an engineered CsgA polypeptide comprising an activity polypeptide comprising a conjugation domain can further encode and/or comprise a second engineered polypeptide comprising a partner conjugation domain and a functionalizing polypeptide. In some embodiments, described herein is a population of cells comprising two cell types, the first cell type encoding and/or comprising an engineered CsgA polypeptide comprising an activity polypeptide comprising a conjugation domain and the second cell type encoding and/or comprising a second engineered polypeptide comprising a partner conjugation domain and a functionalizing polypeptide. That is, a single cell can comprise a CsgA polypeptide with a conjugation domain and also comprise the polypeptide which will bind to and/or be bound by that CsgA polypeptide, or a first cell can comprise a CsgA polypeptide with a conjugation domain and a second cell can comprise the polypeptide which will bind to and/or be bound by that CsgA polypeptide. It is further contemplated that an engineered CsgA polypeptide with a conjugation domain can be contacted with a second polypeptide comprising a partner conjugation domain and a functionalizing polypeptide, e.g. the second polypeptide can be produced (e.g. by a bacterium or eukaryotic cell) and/or synthesized (and optionally isolated or purified) and then brought in contact with the engineered CsgA polypeptide, e.g. when the CsgA polypeptide is present on a cell surface and/or present in a biofilm.
Functional polypeptides within the scope of the present disclosure include peptides or proteins having a desired function. Such functions include catalytic function, recognition function or structural function. Exemplary functional polypeptides include targeting domains. Exemplary functional polypeptides include therapeutic polypeptides. Exemplary functional polypeptides include diagnostic polypeptides. Exemplary functional polypeptides include anticancer polypeptides. Exemplary functional polypeptides include antimicrobial polypeptides. Exemplary functional polypeptides include anti-inflammatory polypeptides. Exemplary functional polypeptides include polymer binding polypeptides. Exemplary functional polypeptides include metabolite binding polypeptides. Exemplary functional polypeptides include targeting polypeptides. Exemplary functional polypeptides include functional polypeptides that bind to tissues or cells or substrates. Exemplary functional polypeptides include a first member of a known binding pair. When expressed, the first member of the binding pair is available for binding to a second member of the binding pair which may have attached to it a functional polypeptide, such as for therapeutic or diagnostic purposes. In this manner, the functional polypeptide with the second member of the binding pair may be contacted to the biofilm to add the functional polypeptide to the biofilm, such as to provide the biofilm with the characteristic of the functional polypeptide. Exemplary functional polypeptides may be those to which a functional group may be covalently attached either directly or through a linker. For example, by appending to CsgA a peptide capable of undergoing spontaneous covalent modification, a biofilm whose surface can be modified with any protein or compound of interest can be created by subsequent addition of the protein or compound of interest.
Exemplary therapeutic polypeptides include engineered polypeptides with therapeutic function, polypeptides with anti-inflammatory bioactivity (trefoil factors, e.g. TFF1-3; interleukins, e.g. IL-10; other anti-inflammatory cytokines; anti-TNFα factors), polypeptides with anti-microbial bioactivity (e.g. coprisin, cathelicidin, LL-37, thuricin CD, lantibiotics), and polypeptides with anti-cancer bioactivity (growth inhibiting biologics).
A bacterial cell of the methods and compositions described herein can be of any species. Preferably, the bacterial cells are of a species and/or strain which is amenable to culture and genetic manipulation. In some embodiments, the bacterial cell can be a gram-positive bacterial cell. In some embodiments, the bacterial cell can be a gram-negative bacterial cell. In some embodiments, the parental strain of the bacterial cell of the technology described herein can be a strain optimized for protein expression. Non-limiting examples of bacterial species and strains suitable for use in the present technologies include Escherichia coli, E. coli BL21, E. coli Tuner, E. coli Rosetta, E. coli JM101, and derivatives of any of the foregoing. Bacterial strains for protein expression are commercially available, e.g. EXPRESS™ Competent E. coli (Cat. No. 02523; New England Biosciences; Ipswich, Mass.). In some embodiments, the cell is an E. coli cell.
In some embodiments, the nucleic acid encoding an engineered CsgA polypeptide is comprised by a cell expressing wild-type CsgA. In some embodiments, the nucleic acid encoding an engineered CsgA polypeptide is comprised by a cell with a mutation and/or deletion of the wild-type CsgA gene, e.g. such that the cell does not express wild-type CsgA.
In one aspect, described herein is a biofilm comprising an engineered microbial cell comprising one or more engineered CsgA polypeptides and/or comprising a vector or nucleic acid encoding such a polypeptide. As used herein, a “biofilm” refers to a mass of microorganisms which can adhere or is adhering to a surface. A biofilm comprises a matrix of extracellular polymeric substances, including, but not limited to extracellular DNA, proteins, glycopeptides, and polysaccharides. The nature of a biofilm, such as its structure and composition, can depend on the particular species of bacteria present in the biofilm. Bacteria present in a biofilm are commonly genetically or phenotypically different than corresponding bacteria not in a biofilm, such as isolated bacteria or bacteria in a colony.
In some embodiments, the technology described herein relates to a biofilm that is produced by culturing an engineered microbial cell comprising an engineered CsgA polypeptide (and/or comprising a vector or nucleic acid encoding such a polypeptide) under conditions suitable for the production of a biofilm. Conditions suitable for the production of a biofilm can include, but are not limited to, conditions under which the microbial cell is capable of logarithmic growth and/or polypeptide synthesis. Conditions may vary depending upon the species and strain of microbial cell selected. Conditions for the culture of microbial cells are well known in the art. Biofilm production can also be induced and/or enhanced by methods well known in the art, e.g. contacting cells with subinhibitory concentrations of beta-lactam or aminoglycoside antibiotics, exposing cells to fluid flow, contacting cells with exogenous poly-N-acetylglucosamine (PNAG), or contacting cells with quorum sensing signal molecules. In some embodiments, conditions suitable for the production of a biofilm can also include conditions which increase the expression and secretion of CsgA, e.g. by exogenously expressing CsgD.
In some embodiments, the biofilm can comprise the cell which produced the biofilm. In some embodiments, described herein is a composition comprising an engineered CsgA polypeptide as described herein, e.g., a therapeutic polypeptide fused to CsgA.
When expressed by a cell capable of forming curli, e.g. a cell expressing CsgA, CsgB, CsgC, CsgD, CsgE, CsgF, and CsgG or some subset thereof, CsgA units will be assembled to form curli filaments, e.g. polymeric chains of CsgA. In some embodiments, filaments of the polypeptide can be present in the composition. In some embodiments, the filaments can be part of a proteinaceous network, e.g. multiple filaments which can be, e.g. interwoven, overlapping, and/or in contact with each other. In some embodiments, the proteinaceous network can comprise additional biofilm components, e.g. materials typically found in an E. coli biofilm. Non-limiting examples of biofilm components can include biofilm proteins (e.g. FimA, FimH, Ag43, AidA, and/or TibA) and/or non-proteinaceous biofilm components (e.g. cellulose, PGA and/or colanic acid). In some embodiments, the composition can further comprise an engineered microbial cell comprising an engineered CsgA polypeptide and/or comprising a vector or nucleic acid encoding such a polypeptide.
In one aspect, described herein is the use of a cell, composition, or biofilm comprising an engineered CsgA polypeptide (and/or comprising a vector or nucleic acid encoding such a polypeptide) to display a polypeptide, e.g. within the biofilm, within the composition, and/or on the cell surface. As used herein, “display” refers to expressing the polypeptide (e.g. as an activity polypeptide) in such a manner that it can come in contact with the extracellular environment. A displayed polypeptide can be capable of binding with a binding partner, catalyzing an enzymatic reaction, and/or performing any other activity which it would perform as an isolated polypeptide.
It is contemplated herein that a polypeptide displayed within a biofilm (e.g. an activity polypeptide and/or functionalizing polypeptide) will retain more activity than a soluble version of that polypeptide. It is contemplated herein that a polypeptide displayed within a biofilm (e.g. an activity polypeptide and/or functionalizing polypeptide) will retain more activity than a soluble version of that polypeptide when exposed to activity degrading conditions such as, e.g., high or low pH, organic solvents, desiccation, high or low temperature, radiation, etc.
In one aspect, described herein is the use of a cell, composition, or biofilm comprising an engineered CsgA polypeptide (and/or comprising a vector or nucleic acid encoding such a polypeptide), in an application selected from the group consisting of biocatalysis; industrial biocatalysis; immobilized biocatalysis; chemical production; filtration; isolation of molecules from an aqueous solution; water filtration; bioremediation; nanoparticle synthesis; nanowire synthesis; display of optically active materials; biosensors; surface coating; therapeutic biomaterial; biological scaffold; structural reinforcement of an object; and as a delivery system for therapeutic agents. Exemplary, non-limiting embodiments of such applications and specific activity polypeptides for use therein are described in the Examples herein.
It is contemplated herein that a cell, composition and/or biofilm can comprise multiple different engineered CsgA polypeptides, each of which comprises a different activity polypeptide, e.g. an engineered CsgA polypeptide comprising an enzymatic activity polypeptide and an engineered CsgA polypeptide comprising a binding domain activity polypeptide. A cell, composition, and/or biofilm can comprise one or more engineered CsgA polypeptides, e.g., 1, 2, 3, 4, 5, 6, or more engineered CsgA polypeptides.
In some aspects of the invention, provided herein are pharmaceutical compositions comprising the engineered bacterium disclosed herein, and a pharmaceutically acceptable excipient. In some embodiments, the pharmaceutical composition is formulated for oral administration. In other embodiments, the pharmaceutical composition is formulated for rectal administration. Accordingly, the pharmaceutical compositions contemplated herein may be formulated as a pill, a capsule, a lozenge, or a suppository.
In some aspects, provided herein are methods of producing a recombinant polypeptide, comprising culturing the engineered bacterium provided herein under conditions suitable for expression and export of the recombinant polypeptide from the engineered bacterium. In some embodiments, the recombinant polypeptide comprises at least one CsgA subunit and a therapeutic polypeptide, e.g., an engineered CsgA. In some such embodiments, expression of the recombinant polypeptide is not toxic to the engineered bacterium. Preferably, the level of expression and export of the recombinant polypeptide is maintained, as compared to the level of expression and export of the recombinant polypeptide from an engineered bacterium under the same conditions expressed from a conventional plasmid comprising the heterologous nucleic acid sequence and a selectable marker gene. In some embodiments, the recombinant polypeptide is collected from the cell culture medium comprising the engineered bacterium. In some such embodiments, the engineered bacterium is not exposed to a lysing agent prior to collecting the recombinant protein from the cell culture medium. In some preferred embodiments, the recombinant polypeptide is collected from a supernatant of the cell culture medium. The methods disclosed herein may further comprise purifying the recombinant polypeptide.
In some aspects, disclosed herein are recombinant polypeptides produced using the methods disclosed herein. In other aspects, disclosed herein are biofilms comprising the recombinant polypeptide produced using the methods provided herein.
In other aspects of the invention, provided herein are methods for treating a disease or disorder. Such methods may comprise administering to a subject in need thereof an effective amount of an engineered bacterium provided herein or a pharmaceutical composition provided herein. In some embodiments, the engineered bacterium expresses and exports a recombinant polypeptide comprising at least one CsgA subunit and the therapeutic polypeptide, e.g., an engineered CsgA, thereby treating the disease or disorder.
In some embodiments, the engineered bacterium or the pharmaceutical composition is administered orally. In other embodiments, the engineered bacterium or the pharmaceutical composition is administered rectally. In some preferred embodiments, the subject is a mammal; and most preferably, the mammal is a human.
In some embodiments, the disease or disorder is a gastrointestinal disease or disorder, such as a gastrointestinal disease or disorder selected from the group consisting of inflammatory bowel disease, Crohn's disease, ulcerative colitis, colorectal cancer, ulcer, malabsorption, short-gut syndrome, cul-de-sac syndrome, celiac sprue, tropical sprue, hypogammaglobulinemic sprue, enteritis, short bowel syndrome, and gastrointestinal cancer.
In some such embodiments, the engineered bacterium colonizes the gastrointestinal tract of the subject. Preferably, the engineered bacterium retains the engineered cryptic plasmid for at least 1 to 5 days following administration.
In some aspects of the invention, provided herein are vectors, comprising a cryptic plasmid backbone and a heterologous nucleic acid, e.g., an expression cassette. In some embodiments, the heterologous nucleic acid comprises a nucleic acid sequence encoding csgBACEFG operon, and a nucleic acid sequence encoding a therapeutic polypeptide. In some such vectors, the csgBACEFG operon is derived from E. coli. In some preferred embodiments, the heterologous nucleic acid is inserted into cryptic plasmid pMUT1 backbone or pMUT2 backbone at insertion sites as indicated in Table 3. In some embodiments, the heterologous nucleic acid is inserted within a site amplified by a primer pair comprising the sequences set forth in SEQ ID NOs: 21 and 22; SEQ ID NOs: 23 and 24; SEQ ID NOs: 25 and 26; or SEQ ID NOs: 27 and 28.
The therapeutic polypeptides encoded by the vectors disclosed herein may be selected from the group consisting of an antibody, an antibody fragment, an enzyme, a fusion protein, a hormone, an antigen, a thrombolytic agent, a cytokine, an immunotoxin, and a growth factor. In some embodiments, the therapeutic polypeptide is an antibody fragment, and the antibody fragment is a single chain antibody, such as a nanobody. Such single chain antibodies may be specific for an antigen selected from the group consisting of: carcinoembryonic antigen (CEA), glucose transporter 1 (GLUT1), green fluorescent protein (GFP), beta-lactamase, Clostridium difficile Toxin A, Clostridium difficile Toxin B, botulinum toxin (BoTox), cholera toxin (CTX), norovirus capsid protein, rotavirus capsid protein, and Plasmodium membrane protein.
In some preferred embodiments, the therapeutic polypeptide is fused to an amyloid polypeptide. Exemplary amyloid polypeptides may comprise at least one curli subunit.
In certain embodiments, the heterologous nucleic acid further comprises a nucleic acid sequence encoding a polypeptide tag. The polypeptide tag may be selected from the group consisting of a poly-histidine tag, a myc tag, a FLAG tag, a hemagglutinin (HA) tag, and a V5 tag.
In some preferred embodiments, the heterologous nucleic acid is operably linked to an inducible promoter. The inducible promoters contemplated herein may be responsive to an inducer selected from the group consisting of IPTG, arabinose, tetracycline, and permissive temperature change. Preferably, the inducible promoter is a temperature sensitive promoter.
In preferred embodiments, the vector backbone is pMUT1 or pMUT2. Such vectors may further comprise a nucleic acid encoding a detectable protein. Exemplary detectable proteins include fluorescent proteins as are known in the art, e.g., green fluorescent protein (GFP) and red fluorescent protein (RFP).
Definitions of common terms in cell biology and molecular biology can be found in The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0-632-02182-9); Benjamin Lewin, Genes X, published by Jones & Bartlett Publishing, 2009 (ISBN-10: 0763766321); Kendrew et al. (eds.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8) and Current Protocols in Protein Sciences 2009, Wiley Intersciences, Coligan et al., eds.
Unless otherwise stated, the present invention was performed using standard procedures, as described, for example in Sambrook et al., Molecular Cloning: A Laboratory Manual (3 ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., USA (2001); Davis et al., Basic Methods in Molecular Biology, Elsevier Science Publishing, Inc., New York, USA (1995); or Methods in Enzymology: Guide to Molecular Cloning Techniques, Vol. 152, S. L. Berger and A. R. Kimmel Eds., Academic Press Inc., San Diego, USA (1987); and Current Protocols in Protein Science (CPPS) (John E. Coligan, et al., ed., John Wiley and Sons, Inc.), which are all incorporated by reference herein in their entireties.
The description of embodiments of the disclosure is not intended to be exhaustive or to limit the disclosure to the precise form disclosed. While specific embodiments of, and examples for, the disclosure are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the disclosure, as those skilled in the relevant art will recognize. For example, while method steps or functions are presented in a given order, alternative embodiments may perform functions in a different order, or functions may be performed substantially concurrently. The teachings of the disclosure provided herein can be applied to other procedures or methods as appropriate. The various embodiments described herein can be combined to provide further embodiments. Aspects of the disclosure can be modified, if necessary, to employ the compositions, functions and concepts of the above references and application to provide yet further embodiments of the disclosure. Moreover, due to biological functional equivalency considerations, some changes can be made in protein structure without affecting the biological or chemical action in kind or amount. These and other changes can be made to the disclosure in light of the detailed description. All such modifications are intended to be included within the scope of the appended claims.
Specific elements of any of the foregoing embodiments can be combined or substituted for elements in other embodiments. Furthermore, while advantages associated with certain embodiments of the disclosure have been described in the context of these embodiments, other embodiments may also exhibit such advantages, and not all embodiments need necessarily exhibit such advantages to fall within the scope of the disclosure.
The following examples are set forth as being representative of the present disclosure. These examples are not to be construed as limiting the scope of the present disclosure as these and other equivalent embodiments will be apparent in view of the present disclosure, figures and accompanying claims.
Unless otherwise stated, the present methods were performed using standard procedures, as described, for example in Sambrook et al., Molecular Cloning: A Laboratory Manual (3 ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., USA (2001); Davis et al., Basic Methods in Molecular Biology, Elsevier Science Publishing, Inc., New York, USA (1995); or Methods in Enzymology: Guide to Molecular Cloning Techniques, Vol. 152, S. L. Berger and A. R. Kimmel Eds., Academic Press Inc., San Diego, USA (1987); and Current Protocols in Protein Science (CPPS) (John E. Coligan, et al., ed., John Wiley and Sons, Inc.), which are all incorporated by reference herein in their entireties.
DNA cloning
All plasmid assembly was performed using Gibson Assembly, with the exception of the pCryptDel plasmids, where the gRNA array was assembled using Golden Gate assembly because of the many repeats in the DNA sequence. Custom DNA oligos were ordered from Integrated DNA Technologies (IDT) and used in PCR with Q5 polymerase (New England Biolabs, USA) to create amplicons for subsequent Gibson assembly. DNA purification from PCR was done with the Zymoclean™ Gel DNA Recovery Kit (Zymo Research, USA). DNA assembly products were transformed into chemically competent E. coli Mach1 cells (Thermo Fisher Scientific, USA) and plated onto LB Agar plates with appropriate antibiotics.
DNA libraries were generated by designing degenerate bases at selected locations on DNA oligos, flanked by 25 bp of the unmodified sequence. The resulting DNA was synthesized (IDT) and used as primers to make amplicons for plasmid assembly. The resulting pool of assembled plasmid variants was transformed into Mach1 cells and plated onto 10 plates. After overnight incubation at 37° C., the plates were imaged for GFP and RFP fluorescence in a FluorChem™ M Imager (Protein Simple, USA), and colonies were selected.
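The library design described above can be sketched in a few lines of Python. The sketch below is illustrative only: the function names, the placeholder template, and the chosen positions are hypothetical, and only the 25 bp flank length is taken from the text. It places a degenerate base at each selected position, keeps 25 unmodified bases on either side, and reports the theoretical library size.

```python
from itertools import product

# IUPAC degenerate nucleotide codes (standard definitions).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

FLANK = 25  # unmodified bases kept on each side, as stated in the text


def design_degenerate_oligo(template, positions, code="N", flank=FLANK):
    """Return an oligo spanning the mutated positions plus `flank` bases per side.

    `positions` are 0-based indices into `template` (hypothetical inputs).
    """
    first, last = min(positions), max(positions)
    if first < flank or last + flank >= len(template):
        raise ValueError("template too short for the requested flanks")
    bases = list(template)
    for p in positions:
        bases[p] = code
    return "".join(bases[first - flank:last + flank + 1])


def library_size(oligo):
    """Number of distinct sequences encoded by a degenerate oligo."""
    size = 1
    for base in oligo:
        size *= len(IUPAC[base])
    return size


def expand(oligo):
    """Enumerate every concrete sequence (only sensible for small libraries)."""
    return ["".join(combo) for combo in product(*(IUPAC[b] for b in oligo))]


# Toy example: a 60-nt placeholder template with three randomized positions.
template = "ACGT" * 15
oligo = design_degenerate_oligo(template, positions=[28, 29, 30])
print(oligo, len(oligo), library_size(oligo))
```

Comparing the theoretical library size with the number of colonies that can be screened on the plates gives a rough sense of how well a given design is sampled.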
Colony PCR
Assessment of cryptic plasmids was done by colony PCR using 25 μL reactions with Quick-Load Taq PCR mix (New England Biolabs, USA) following the manufacturer's instructions. After the PCR, the reactions were loaded onto a 1% agarose TAE gel with SYBR Safe DNA stain and run in a gel electrophoresis setup (constant 120 V, 35 min). Gels were then imaged in a FluorChem M Imager (Protein Simple).
Bacterial Culture
E. coli bacteria were grown in LB Miller media during plasmid preparation and genetic circuit characterization. For characterization assays, starter cultures of the appropriate bacterial strains were grown overnight in LB media in a shaking incubator. For all temperature sensitive constructs, starter cultures were grown at 30° C., whereas 37° C. was used for all other constructs. Unless explicitly indicated otherwise, all characterization was done at 37° C.
Kinetic plate reader assays were performed by diluting starter culture 1:1000 into the appropriate selective media. Into the wells of black, clear-bottom, 96-well plates (655090, Greiner Bio-One, Germany), 200 μL of the inoculated culture was then added. The plates were then grown in a Synergy HT plate reader (BioTek), reading absorbance (600 nm), GFP (excitation: 485/20 nm, emission: 528/20 nm), RFP (ex: 590/20, em: 645/40). Reads were taken every 10 minutes for 16 hours, and plates were shaken continuously outside of reading (Double Orbital, 548 cpm (2 mm)).
Plasmid Curing
To cure Nissle and any derived strains of cryptic plasmids, cells were transformed with plasmid pFREE or pCryptDel4.8, in order to cure pMUT1 or pMUT2 respectively. After transformation, cells were grown overnight in liquid LB media with 50 μg/mL kanamycin. Then, the overnight culture was diluted 1:1000 into fresh LB media supplemented with 50 μg/mL kanamycin, 0.2% rhamnose and 0.43 μM anhydrotetracycline (ATC), and grown overnight at 37° C. After 24 hours, the culture was streaked out onto several LB agar plates without antibiotics and these were left to grow overnight. Then, the colonies were assessed by colony PCR with primers muta5, muta6, muta7 and muta8 to find colonies that had been cured of cryptic plasmids.
Growth and Gene Expression Characterization
Data from kinetic plate reader runs was initially cleaned by subtracting the background signal and smoothing the time courses for all fluorescence and absorbance data. Growth rates were found by fitting the absorbance curves to a Gompertz model, and subsequently extracting the peak growth rate. Promoter strength was quantified from kinetic fluorescence data by first finding the gradient of the fluorescence signal, normalizing this to the absorbance signal, resulting in a per cell measure of fluorescent protein production per unit time. Promoter strength was then quoted to be the average of this term for an hour around peak exponential phase.
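A minimal sketch of this analysis is given below, using only the Python standard library. It assumes that background subtraction and smoothing have already been applied, that the Gompertz curve has the form y = A·exp(−exp(−k·(t − tc))), and that the plateau A can be approximated by the maximum observed absorbance so that the fit reduces to a linear regression. These are simplifying assumptions for illustration rather than the fitting procedure actually used, and all function names and the synthetic data are hypothetical.

```python
import math


def fit_gompertz(times, od, lo=0.02, hi=0.9):
    """Fit y = A*exp(-exp(-k*(t - tc))) via ln(-ln(y/A)) = -k*t + k*tc.

    A is approximated by max(od) (assumes the culture reached its plateau);
    only points between lo*A and hi*A are used in the linear regression.
    """
    A = max(od)
    xs, zs = [], []
    for t, y in zip(times, od):
        if lo * A < y < hi * A:
            xs.append(t)
            zs.append(math.log(-math.log(y / A)))
    n = len(xs)
    mx, mz = sum(xs) / n, sum(zs) / n
    slope = (sum((x - mx) * (z - mz) for x, z in zip(xs, zs))
             / sum((x - mx) ** 2 for x in xs))
    intercept = mz - slope * mx
    k = -slope
    tc = intercept / k
    return A, k, tc


def peak_growth_rate(A, k):
    """Maximum of dy/dt for the Gompertz curve, reached at t = tc."""
    return A * k / math.e


def promoter_strength(times, fluor, od, t_peak, window=1.0):
    """Mean of (dF/dt)/OD over `window` hours centred on t_peak."""
    values = []
    for i in range(1, len(times) - 1):
        if abs(times[i] - t_peak) <= window / 2:
            dfdt = (fluor[i + 1] - fluor[i - 1]) / (times[i + 1] - times[i - 1])
            values.append(dfdt / od[i])
    if not values:
        raise ValueError("no time points inside the averaging window")
    return sum(values) / len(values)


# Synthetic example: reads every 10 minutes for 16 hours (times in hours).
times = [i / 6 for i in range(97)]
od = [math.exp(-math.exp(-0.8 * (t - 4.0))) for t in times]
# Fluorescence accumulates in proportion to cell density (arbitrary units).
fluor = [0.0]
for i in range(1, len(times)):
    fluor.append(fluor[-1] + 500.0 * od[i - 1] * (times[i] - times[i - 1]))

A, k, tc = fit_gompertz(times, od)
print("peak growth rate:", peak_growth_rate(A, k))
print("promoter strength:", promoter_strength(times, fluor, od, t_peak=tc))
```

With noisy data, a nonlinear least-squares fit that estimates A, k and tc jointly would normally be preferable to the linearization used here.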
Curli Measurement
Curli was measured by the Congo Red (CR) method outlined in Kan et al. 39. Bacterial starter cultures were grown overnight in LB and the relevant antibiotics at 30° C. Then, selective LB media was inoculated 1:1000 with starter culture, and 300 μL was placed into 1 mL deep well plates (780210, Greiner Bio-One, Germany) in a shaking incubator (900 rpm (1 mm)) at either 30° C. or 37° C. CR (0.025%) was added to the media upon inoculation. After 24 hours of growth, 200 μL of each well was transferred into black, clear bottom plates and the absorbance (600 nm) and CR fluorescence (ex: 525 nm, em: 625 nm) were read in a Synergy HT plate reader. The results were then normalized to the host strain without engineered plasmids.
Curli production was also measured by whole cell filtration ELISA to measure the E-tagged CsgA proteins. 80 μL bacterial overnight cultures were added into each well of a 96-well filter plate in triplicate (0.22 μm pore size, Multiscreen-GV, Merck/Millipore Sigma). Samples were vacuum-filtered, and washed in TBST (TBS, 0.1% Tween-20), and blocked for 1.5 hours at 37° C. with 1% bovine serum albumin (BSA) and 0.01% H2O2 in TBST. After additional washing, samples were incubated with HRP-conjugated goat polyclonal E-tag epitope antibody (Novus Biologicals) for 1.5 hours at room temperature (1:5000 in TBST). Following additional washes in TBST, 100 μL Ultra-TMB ELISA reagent (Thermo Scientific) was added to each well and covered with aluminium foil to protect from light. After approximately 15-25 minutes of incubation at room temperature, the reaction was stopped using 50 μL per well of 2 M sulfuric acid. 100 μL were transferred from each well into a 96-well plate and absorbance was measured at 450 nm and 650 nm. The signal was calculated by subtracting the 650 nm absorbance value from the 450 nm absorbance value.
GFP Sequestration Assay
To test the function of the GFP nanobody, bacterial cultures were first grown overnight at stated temperatures in LB media with appropriate antibiotics. They were then pelleted at 3000 g for 10 minutes, washed once in PBS, and resuspended in a solution of PBS containing 4 μg/mL purified sfGFP. The solutions were then left in a rotating mixer for an hour at room temperature, then centrifuged again at 3000 g for 10 minutes. The supernatant GFP signal was then measured in the plate reader, and compared to the fluorescence of the initial sfGFP solution. In order to prevent non-specific GFP protein adsorption to the plasticware used in the experiment, a sterile solution of 1% BSA (bovine serum albumin) in PBS was used to block the plastic tubes prior to the experiment. To do this, the 1.5 mL tubes were filled with 1 mL of the BSA solution and left in a rotating mixer for an hour.
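The comparison between supernatant and initial fluorescence can be expressed as a fraction of sfGFP removed from solution. The short sketch below assumes that fluorescence is proportional to sfGFP concentration over the measured range. The function name, the optional blank correction, and the example readings are hypothetical; only the 4 μg/mL starting concentration is taken from the text.

```python
def fraction_sequestered(initial_fluorescence, supernatant_fluorescence, blank=0.0):
    """Fraction of sfGFP removed from solution by the cells.

    Assumes fluorescence scales linearly with sfGFP concentration; `blank`
    is an optional buffer-only reading subtracted from both values.
    """
    initial = initial_fluorescence - blank
    remaining = supernatant_fluorescence - blank
    if initial <= 0:
        raise ValueError("initial fluorescence must exceed the blank")
    return 1.0 - remaining / initial


# Hypothetical readings in arbitrary fluorescence units.
bound = fraction_sequestered(initial_fluorescence=1000.0,
                             supernatant_fluorescence=250.0)
print(f"{bound:.0%} of the sfGFP was sequestered")
print(f"about {4.0 * bound:.1f} ug/mL removed from the 4 ug/mL solution")
```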
In Vivo Study of Engineered pMUT Plasmid Retention and Protein Expression
The protocol described below was reviewed and approved by the Harvard Medical Area Standing Committee on Animals (HMA IACUC, Ref. No. IS00000516-3). 25 female 8- to 9-week-old C57BL/6NCrl mice were randomly split into five experimental cohorts: WT pMUTs, pM1, pM1-VHH, pM2 and pM2-VHH (N=5). Bacterial suspensions were prepared in advance by growing to mid-exponential phase (OD600 of 0.5) at 30° C. (shaking at 225 RPM), pelleting the cells, resuspending to OD600 of 10 in PBS supplemented with 20% sucrose and 10% glycerol, and flash-freezing in liquid nitrogen. Aliquots of these bacterial suspensions were stored at −80° C. and allowed to thaw immediately preceding daily feeding, in order to maintain consistent bacterial density of the inoculum.
48 hours prior to initial administration of bacteria (day −2), the drinking water was supplemented with 2 g/L carbenicillin (Teknova). Antibiotic-free drinking water was restored 24 hours later (day −1). Starting on day 0, each cohort was fed 50 μL of its respective bacterial suspension by allowing the mice to lap the liquid from a pipette tip (as previously described by Mohawk et al. 48). Bacterial administration was carried out daily from day 0 to day 4. Fecal pellets were collected and weighed daily from day 0 to day 7.
Mice
Female 8- to 9-week-old C57BL/6NCrl mice were obtained from Charles River Laboratories. Mice were housed in sterile vinyl isolators within the Harvard Medical School animal facility, and kept under specific-pathogen-free (SPF) conditions. Both sterile food (JL Rat and Mouse/Auto 6F 5K67, LabDiet) and water were provided ad libitum. All mice were allowed one week to acclimate prior to any experimental procedure. To further minimize impact of living environment on experimental outcomes, mice were randomized between housing isolators at the beginning of the experiment. All experiments were conducted in compliance with the US National Institutes of Health guidelines and approved by the Harvard Medical Area Standing Committee on Animals.
Plasmid Retention Analysis
Immediately following daily collection of fecal pellets, each sample was homogenized in 1 mL of PBS, serially diluted, and plated in quadruplicate to enumerate colony forming units (CFU). Samples were plated on two types of LB agar plates: 25 μg/mL chloramphenicol-only plates (Cm) and 100 μg/mL carbenicillin + 25 μg/mL chloramphenicol plates (Cm+Carb). While all PBP8-derived strains carried a chromosomal Cm resistance gene, only the engineered pMUT plasmids harbored a Carb resistance marker. Plasmid retention rate was therefore estimated by calculating the Cm+Carb to Cm ratio of sample weight-normalized CFU counts. Following the plating procedure, fecal homogenates were flash-frozen and stored at −80° C. for subsequent analysis.
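The retention estimate can be illustrated with the following sketch. The 1 mL homogenate volume and quadruplicate plating are taken from the text; the function names, the plated volume, the dilution factor, and the colony counts are hypothetical values chosen only to show the arithmetic.

```python
from statistics import mean


def cfu_per_mg(colony_counts, dilution_factor, plated_volume_ml, pellet_mg,
               homogenate_ml=1.0):
    """Weight-normalized CFU from replicate plate counts at one dilution."""
    cfu_per_ml = mean(colony_counts) * dilution_factor / plated_volume_ml
    return cfu_per_ml * homogenate_ml / pellet_mg


def plasmid_retention(cm_carb_cfu_per_mg, cm_cfu_per_mg):
    """Ratio of plasmid-bearing cells (Cm+Carb) to all PBP8 cells (Cm)."""
    return cm_carb_cfu_per_mg / cm_cfu_per_mg


# Hypothetical sample: a 20 mg pellet, plated in quadruplicate at a
# 10^4-fold dilution with 0.01 mL per plate (assumed plating volume).
cm = cfu_per_mg([50, 52, 48, 50], 1e4, 0.01, pellet_mg=20.0)
cm_carb = cfu_per_mg([49, 51, 47, 49], 1e4, 0.01, pellet_mg=20.0)
print(f"retention: {plasmid_retention(cm_carb, cm):.2f}")
```

Because both counts come from the same homogenate, the pellet weight cancels in the ratio; it is kept here so that the individual CFU values remain comparable across samples.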
Fecal Filtration ELISA
To detect E-tagged curli fibers in fecal samples, a filtration ELISA protocol was adapted from Praveschotinunt et al. 35. Fecal homogenate was centrifuged at 1000 g for 1 minute to separate large insoluble material, and the supernatant was transferred onto a 96-well filter plate in triplicate (0.22 μm pore size, Multiscreen-GV, Merck/Millipore Sigma). For each sample, the homogenate volume dispensed was normalized to 1.25 mg of fecal pellet per well. After samples were added to the filter plate, the procedure to detect E-tagged material proceeded as described above in the curli measurement section. In each assay, the signal was normalized by dividing by the WT pMUTs control, such that the WT pMUTs control corresponded to 100%.
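The two normalization steps described above can be sketched as follows. The 1.25 mg target mass per well, the 1 mL homogenate volume, the subtraction of the 650 nm reading from the 450 nm reading, and the scaling to the WT pMUTs control are taken from the text; the function names and example readings are hypothetical.

```python
from statistics import mean


def volume_for_target_mass_ul(pellet_mg, target_mg=1.25, homogenate_ul=1000.0):
    """Homogenate volume (uL) that contains `target_mg` of fecal pellet."""
    return target_mg * homogenate_ul / pellet_mg


def corrected_signal(wells):
    """Mean of A450 - A650 over replicate wells given as (A450, A650) pairs."""
    return mean(a450 - a650 for a450, a650 in wells)


def percent_of_control(sample_wells, control_wells):
    """Sample signal expressed relative to the WT pMUTs control (= 100%)."""
    return 100.0 * corrected_signal(sample_wells) / corrected_signal(control_wells)


# Hypothetical triplicate readings as (A450, A650) pairs.
control = [(0.75, 0.25), (0.75, 0.25), (0.75, 0.25)]
sample = [(1.75, 0.25), (1.5, 0.25), (2.0, 0.25)]
print(volume_for_target_mass_ul(pellet_mg=25.0), "uL per well")
print(percent_of_control(sample, control), "% of WT pMUTs control")
```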
EcN's cryptic plasmids were first documented by Hacker et al. in 2002 20, who published the sequences for pMUT1 and pMUT2 on the National Center for Biotechnology Information (NCBI) database with accession numbers A84793 and A95448 respectively. Since then, 3 whole-genome sequencing projects for EcN have been uploaded to NCBI, with 2 fully assembled genomes. The first assembly, ASM71459v1 (Reister et al.21), resulted in a single sequence containing the chromosome and both plasmids, with the plasmids erroneously inserted multiple times within the chromosomal sequence. A later assembly, ASM354697v1, has a genomic sequence separate from the 2 cryptic plasmid sequences (labelled pNissle1 and pMUT2). Here, the pNissle1 sequence contains the sequences of both pMUT1 and pMUT2 and is likely also an erroneous assembly. Also, the pMUT sequences from the whole-genome sequencing projects differed from those originally uploaded, A84793 and A95448, which were sequenced using Sanger sequencing. As such, a correct pMUT1 sequence could not be found on NCBI; one was therefore uploaded for reference (NCBI submission ID 2292834), and NCBI accession CP023342 is referred to for the correct pMUT2 sequence, each of which is incorporated by reference herein in its entirety. The correct pMUT sequences were confirmed by Sanger sequencing the backbones of the pMUT-derived engineered vectors, finding that the sequence traces aligned exactly with those derived from the whole-genome sequencing efforts.
Escherichia coli Nissle 1917 plasmid pMUT1, complete sequence.
Escherichia coli Nissle 1917 plasmid pMUT2, complete sequence
pMUT engineering began by selecting 3 sites, s1-s3, (
The 2 successful insertion cassettes for gene expression were tested (
Inserts ‘AsG’ and ‘TsR’ were cloned into the sites on either pMUT1 or pMUT2 to obtain plasmids pMXsYAsG and pMXsYTsR, where X is either 1 or 2, referring to pMUT1 or pMUT2, and where sY is the insertion site number (
As reported before 18, EcN pMUT plasmid knockouts did not grow differently under lab conditions (
It was found that transforming with the engineered plasmids did not displace the native plasmids. The pMUT plasmids were tested for with DNA primers muta5, muta6, muta7, and muta8 (Table 5), developed by Blum-Oehler et al. 17 to detect pMUT1 and pMUT2 in clinical samples. In a multiplex PCR with these 4 muta primers, a 361 bp product was formed when pMUT1 was present, and a 429 bp product when pMUT2 was present (
Furthermore, primers were designed around the insertion sites on pMUT1 and pMUT2 to distinguish the native and engineered pMUT plasmids. Colonies were expected in which the native pMUTs were knocked out through plasmid incompatibility—a process whereby two plasmids cannot stably coexist in the same bacterial cell line over multiple generations, typically occurring in plasmids containing similar or identical replication mechanisms. However, when unmodified EcN was transformed with an ampicillin resistant engineered pMUT plasmid (pM1s3AsG or pM2s2AsG), and grown on selective media, colony PCR with primers around the insertion sites (primers pMXsY_chk_F and R) produced a short 200 bp product, indicating the presence of native plasmid (
Since transforming EcN with engineered pMUTs did not displace the native plasmids, a strategy was required to remove them. Whilst pMUT curing strategies exist, they rely on plasmid incompatibility to knock out native plasmids 18, which data indicates is not an immediate process and requires multiple weeks of streaking onto selective media. Therefore a CRISPR-Cas9 strategy was used to cleave both native pMUT plasmids, based on the pFREE system of Lauritsen et al. 28. The pFREE plasmid (
It was found that the pFREE plasmid cured pMUT1 (
Three gRNA pairs were designed and tested to cure pMUT2; however, all designs failed until the antitoxin gene relE found on pMUT2 was included on the plasmids (
Disclosed herein are methods of making plasmid vectors for bacteria in the gut. As such, there are several experimental challenges to both controlling and assessing synthetic genetic systems within the bacteria. Since the bacteria are in the gut of the host organism, they cannot be readily interrogated, and furthermore, due to the complex environment of the gut, it is unlikely that bacteria behave as they do under laboratory conditions. This sets a severe limitation on genetic induction systems that require exogenous chemical inducers in the gut due to the difficulty of supplying a steady inducer concentration. Inducers are normally provided in a concentrated form in the water for the animal, so the effective concentration in the gut is not clear.
However, inducible systems are desirable to simplify cloning and in vitro propagation of DNA and bacterial strains, especially for genes that encode products that are toxic or stress-inducing to the bacteria. Synthetic genetic circuits typically require the bacteria to express heterologous proteins, and these can impose significant metabolic burdens on their host 29. For constitutive high-expressing constructs, given a non-zero mutation rate, any defective mutants that relieve the metabolic burden will quickly come to dominate cultures due to faster growth. Therefore, for in vitro cloning and propagation it is desirable to use inducible systems to create an ‘off’ state where the synthetic system does not significantly reduce fitness during culture propagation. Additionally, the uninduced state provides a further internal control in experiments that can provide valuable insight into the performance of the genetic system.
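The takeover argument above can be made concrete with a toy two-population model. The sketch below is illustrative only: the relative fitness cost, the mutation rate, and the number of generations are hypothetical parameters, not measured values. Functional cells pay a fitness cost each generation, a small fraction of them convert to burden-free mutants, and the mutant fraction is tracked over time.

```python
def mutant_fraction_over_time(generations, burden=0.2, mutation_rate=1e-6):
    """Fraction of burden-free mutants after each generation (toy model).

    Functional cells have relative fitness (1 - burden); mutants have
    fitness 1. All parameter values are hypothetical.
    """
    functional, mutant = 1.0, 0.0
    fractions = []
    for _ in range(generations):
        new_mutants = functional * mutation_rate
        functional = (functional - new_mutants) * (1.0 - burden)
        mutant = mutant + new_mutants
        total = functional + mutant
        functional, mutant = functional / total, mutant / total
        fractions.append(mutant)
    return fractions


# Compare a burdensome constitutive construct with a tightly repressed one.
with_burden = mutant_fraction_over_time(100, burden=0.2)
without_burden = mutant_fraction_over_time(100, burden=0.0)
print(f"mutant fraction with burden:    {with_burden[-1]:.3f}")
print(f"mutant fraction without burden: {without_burden[-1]:.6f}")
```

In this model the mutant-to-functional ratio grows by a factor of roughly 1/(1 − burden) per generation, which is why an 'off' state with a negligible burden keeps mutants rare during propagation.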
A temperature-sensitive gene expression system from Piraner et al. 30 was implemented, based on the promoter pTlpA and repressor protein TlpA36. TlpA36 forms a dimer at temperatures below 36° C., and this dimer binds pTlpA and prevents gene expression (
Above a certain critical temperature, the pTlpA promoter is active and acts constitutively, and the promoter was mutated to generate a library with varied expression strengths. The promoter variant could then be selected for a transcriptional unit of interest in order to optimize gene expression. TlpA, from which TlpA36 was derived, binds to the entire pTlpA promoter 31 (
The promoter strengths of the pTlpA variants (labelled pTlpA-A1 to pTlpA-H10) were found by calculating the amount of GFP produced per unit time and per cell (
The performance of the pM1s3AsR_TS* and pM2s2AsR_TS* constructs was then characterized in EcN, in each case measuring the engineered pMUT construct performance in the absence of the native cryptic plasmid. It was found that some of the pM2s2AsR_TS* constructs could not be transformed into EcN ΔpMUT2 cells, and thus only 4 of the pMUT2 derived constructs could be characterized. The characterization data from the E. coli Mach1 cloning strain was broadly indicative of performance in EcN (
Many proteins and peptides have therapeutic potential in the gut33, and as such the secretion of such peptides into the extracellular space from EcN inhabiting the gut is an attractive approach to therapy. Curli are well-characterized bacterial extracellular matrix proteins, secreted natively by E. coli using dedicated machinery 34 to form robust fibers. Engineered curli systems represent a versatile platform for custom protein materials, as they are capable of tolerating mutations and fusions to functional protein domains, and are consequently being developed as gut therapeutics 35.
To express curli, a synthetic csgBACEFG operon was used, which encodes the major and minor curli fiber subunits, csgA and csgB, and the secretion machinery necessary for transport from the periplasm to the extracellular space in csgEFG. CsgC prevents intracellular CsgA polymerization, which would be toxic to the bacterium 36. The CsgA monomer was fused to an E-tag epitope tag in a 37 aa flexible linker (
Overexpression of the csgBACEFG operon can be toxic to cells, and as such the expression strength requires significant tuning to obtain a high yield of curli fibers. Thus, the pTlpA promoter library was used to express a synthetic curli operon to identify an optimal promoter strength. In total, 8 of each ‘csg-Etag’ and ‘csg-Etag-NbGFP’ on pM1s3A vectors, and 3 each on pM2s2A vectors were generated. For each pTlpA*-curli construct, the curli production was characterized using the Congo Red (CR) fluorescence method 39 (
At 37° C., the final temperature sensitive curli expression constructs produced curli, which caused the bacterial cultures to aggregate with fluorescent material upon the addition of CR (
Disclosed herein are methods and compositions to address and improve retention rates of synthetic plasmids in bacteria within the gut. In a preliminary experiment testing engineered EcN in the mouse gut, it was found that engineered synthetic plasmids harbored by EcN were lost during passage through the gut without selection. In the following exemplification, mice were fed PBP8 cells (EcN ΔcsgBACEFG::cat(CamR)) transformed with either plasmid pKAG 40, a pSB4K5 based plasmid containing constitutively expressed sfGFP, or pL6FO 39, a similar synthetic plasmid with an IPTG inducible csgBACEFG operon (
The retention of the engineered pMUTs after passage through the mouse gut was assessed, together with the ability of the plasmid system to produce and secrete proteins in vivo, as this feature is key to therapeutic peptide delivery in the gut. Bacterial gene expression in a mammalian gut differs significantly from expression under laboratory conditions 41, and as such in vitro characterization is unlikely to be representative of in vivo functionality.
Typically, it is difficult to assess the gene expression of engineered bacteria in the gut, because they are hard to isolate without in vitro growth that would disrupt any measurement of in vivo gene expression. Direct detection of heterologously-produced proteins in fecal samples is similarly challenging. For most proteins and affinity tags, proteolytic degradation by intestinal proteases is likely to significantly reduce any measurable signal. This is particularly problematic considering the high background signal one can expect from a complex biological medium such as feces. These experimental limitations were, in large part, the motivation to test the pMUT system using curli fibers and VHH domains. In addition to the potential utility of these proteins, both curli and nanobodies are known for their resistance to harsh conditions 42,43, thereby increasing the likelihood of their detection in fecal pellets.
An experiment was designed to test the retention of the engineered pMUTs in vivo, as well as the expression of protein through the plasmid system within the mouse gut. Four plasmids were tested, expressing either cassette ‘csg-Etag’ or ‘csg-Etag-NbGFP’ on pM1s3ATS* or pM2s2ATS* vectors. In each case, PBP8 cells (EcN ΔcsgBACEFG::cat(CamR)) were used, with the native pMUT knocked out whenever the corresponding engineered version was present. As a negative control, PBP8 harboring both wild-type pMUTs with no engineered plasmids was used, making for a total of 5 experimental cohorts. Conditions were labelled as follows:
Each of the five bacterial strains was administered to five C57BL/6 mice.
The mice were fed bacterial suspension daily for 5 days and monitored for 3 additional days after cessation of bacterial administration (
Each fecal pellet was plated on two types of selective plates: chloramphenicol (Cm), selecting for PBP8 irrespective of plasmid presence or identity; and chloramphenicol with carbenicillin (Cm+Carb), which selected specifically for PBP8 with an engineered plasmid. Plasmid retention rates were calculated as the ratio of Cm+Carb to Cm colony counts. All four engineered pMUT cohorts showed no plasmid loss during GI transit, with none of the retention rates differing significantly from 100% (
Protein expression was tested via fecal filtration ELISA, modified from a previous protocol 35. In both engineered pMUT1 and pMUT2, significant levels of E-tagged curli fibers were detected (
Provided herein are plasmid vectors based on the E. coli Nissle 1917 pMUT cryptic plasmids that have been characterized for their performance in the mouse gut. Also disclosed herein is the development of a simple method to remove the native pMUT plasmids, and generate reliable pMUT plasmid vectors capable of secreting a functional curli material within the mouse gut without plasmid loss. The pMUT-based plasmid vectors provided herein simplified in vivo experiments by forgoing the need for antibiotics for plasmid maintenance or inducers for gene expression through temperature sensitive circuits.
The pMUT plasmids have no known function, but are stable within EcN during passage through the gut. Notably, and as disclosed herein, the pMUT plasmids can be exploited as vectors for recombinant DNA. Whilst previous studies have used the pMUT plasmids 3, and shown their high plasmid retention in vitro 19, in vivo efficacy has never been systematically characterized until the present disclosure. In an attempt to cure the native pMUT plasmids, the data of the instant disclosure suggests that pMUT2 stability in EcN is improved by the RelB/RelE toxin/antitoxin system, as pMUT2 could not be cured without expressing the antitoxin gene from the pCryptDel plasmid.
Despite the common use of plasmids in the development of engineered microbes, they are not typically utilized in clinical applications, where exogenous genetic sequences are instead incorporated into the chromosome of the chassis organism. This is mainly due to concerns regarding horizontal gene transfer (HGT), as any antibiotic resistance gene or virulence factor carried on the plasmid would run the risk of being introduced into the host microbiome 45. While such concerns are valid for most synthetic plasmids, the unique features of the engineered pMUT system provided herein could address these limitations. Most prominently, the absence of antibiotic selection could eliminate the possibility of spreading resistance genes, as the resistance gene can be excised from any engineered bacterium through a recombinase. Furthermore, the presence of these plasmids in wild-type EcN suggests that the risk posed by any sequence found natively on the plasmid is negligible. Indeed, the safety profile of EcN over decades of probiotic use implies that HGT of pMUT-encoded genes is either exceedingly rare, relatively harmless, or both. Lastly, HGT may be used as a tool for in situ microbiome engineering 46. As a selection-free, probiotic-derived plasmid system, the pMUT platform could prove a valuable addition to the toolbox of this emerging microbial intervention strategy. Thus, while the pMUT plasmids could indeed be utilized for the research and development of engineered strains, they could also open the door to plasmid-based production of therapeutics in vivo, in both clinical and preclinical settings.
There are several benefits to using engineered pMUT plasmid vectors compared to genomic incorporation. The first is speed and reliability: plasmid assembly and transformation are the only steps required to produce an engineered EcN strain, and both can be completed within several days. This facilitates the rapid construction of probiotic bacteria and speeds the development and optimization of prototypes. A further benefit is the ability to incorporate relatively large recombinant genetic constructs with ease. Indeed, one of the largest constructs made was around 13 kbp (pM2s2ATScsg-NbGFP), incorporating over 7 kbp of recombinant DNA. Furthermore, both engineered pMUT1 and pMUT2 plasmids could be used simultaneously to house synthetic DNA, allowing for the incorporation of even larger and more complex synthetic DNA systems.
A further benefit of simplifying the process of bacterial engineering is the ability to rapidly and reliably generate multiple variant strains, and thus screen and optimize genetic circuits of interest. The pTlpA promoter library, as exemplified herein, demonstrated how even a relatively small functional change, such as the addition of a fusion protein, can require the redesign of regulatory elements within a genetic circuit for optimal function. Notably, the addition of an anti-GFP nanobody required a weaker promoter for curli expression compared to unmodified CsgA-Etag, suggesting that the nanobody reduced secretion efficacy, likely through the toxicity of expression and secretion. However, the weaker expression did not reduce overall curli production in the nanobody constructs, suggesting that curli production was not limited by the expression of the other genes in the csgBACEFG operon.
In the in vivo experiments provided herein, slower clearance of the WT pMUT control strain compared to those expressing proteins through engineered pMUTs was observed (
This application claims priority to U.S. Provisional Patent Application No. 63/078,622, filed on Sep. 15, 2020, the entire contents of which are expressly incorporated herein by reference.
This invention was made with government support under Grant Number R01DK110770 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/050479 | 9/15/2021 | WO | |

Number | Date | Country |
---|---|---|
63078622 | Sep 2020 | US |