The present disclosure is directed to materials and methods for reducing heterologous DNA damage in bacteria by modifying the heterologous DNA to include one or more deazapurine bases.
DNA that is recognized as foreign to a given cell may be targeted for degradation within the cell, either by its lack of a host-like methylation pattern or by the presence of unusual base modifications relative to the host DNA (Bair and Black, 2007, J Mol Biol 366: 768-778). The subsequent degradation by restriction endonucleases reportedly constitutes effective barriers to the introduction of DNA into bacteria (Briggs et al. Appl. Environ. Microbiol. 1994, 60, 2006-2010; Accetto et al. FEMS Microbiol. Lett. 2005, 247, 177-183; Bair and Black, J. Mol. Biol. 2007, 366, 768-778; Corvaglia et al. Proc. Natl. Acad. Sci. U.S.A. 2010, 107, 11954-11958; Monk et al., 2012, mBio 3(2): e00277-11.doi: 10.1 128/mBio.00277-11).
These endonuclease-based systems are grouped into four main types, type I to type IV, by a number of criteria (Roberts et al. Nucleic Acids Res. 2003, 31, 1805-1812). Systems of type I to type III encompass paired methyltransferase and endonuclease activities, degrading foreign DNA that lacks the proper methylation pattern, whereas the type IV enzymes are endonucleases that only cleave DNA substrates that have been modified (Tock and Dryden, Curr. Opin. Microbiol. 2005, 8, 466-472).
Bacterial transformants provide a key platform for a variety of industrially relevant processes, such as metabolic engineering and biochemical production. However, the introduction and expression of foreign DNA into some bacterial hosts can be an inefficient process. There is a need in the art for new strategies for maximizing the functionality of heterologous DNA in bacteria.
Bacteriophages (phages) are viruses that specifically infect and lyse bacteria. Phage therapy, a method of using whole phage viruses for the treatment of bacterial infectious diseases, was introduced in the 1920s by Felix d'Herelle. Initially, phage therapy was vigorously investigated and numerous studies were undertaken to assess the potential of phage therapy for the treatment of bacterial infection in humans and animals.
With the development of antibiotics in the 1940s, however, interest in phage-based therapeutics declined in the Western world. One of the most important factors that contributed to this decline was the lack of standardized testing protocols and methods of production. The failure to develop industry wide standards for the testing of phage therapies interfered with the documentation of study results, leading to a perceived lack of efficacy as well as problems of credibility regarding the value of phage therapy.
With the rise of antibiotic resistant strains of many bacteria, however, interest in phage-based therapeutics has returned. Even though novel classes of antibiotics may be developed, the prospect that bacteria will eventually develop resistance to the new drugs has intensified the search for non-chemotherapeutic means for controlling, preventing, and treating bacterial infections.
In one aspect, described herein is a bacterial cell comprising a heterologous nucleic acid sequence comprising one or more deazapurine bases. In some embodiments, the one or more deazapurine bases are deazaguanine bases (e.g., 7-deazaguanine bases). Exemplary 7-deazaguanine bases include, but are not limited to, 7-amido-7-deazaguanine (ADG), 7-formamidino-7-deazaguanosine (G+), 7-cyano-7-deazaguanine (PreQ0) and 7-aminomethyl-7-deazaguanine (PreQ1).
In another aspect, described herein is a method of protecting a heterologous nucleic acid sequence from cleavage by restriction enzymes in a host bacterium, the method comprising modifying the heterologous nucleic acid sequence to incorporate one or more deazaguanine bases; and introducing the modified heterologous nucleic acid sequence into the host bacterium, thereby protecting the heterologous nucleic acid sequence from cleavage by restriction enzymes in the host bacterium. In some embodiments, the modifying step occurs in vitro. In this regard, in some embodiments, the modifying step comprises mixing the heterologous nucleic acid sequence with at least one enzyme that is involved in introducing deazaguanine bases in DNA for a time sufficient to promote modification of the heterologous nucleic acid sequence.
In some embodiments, the modifying step comprises introducing the heterologous nucleic acid into a bacterial cell that has been modified to encode at least one enzyme that is involved in introducing deazaguanine bases in DNA.
Exemplary enzymes that are involved in introducing deazaguanine bases in DNA include, but are not limited to, DpdA and Gat-QueC encoded by Enterobacteria phage 9g.
The present disclosure is based, at least in part, on the discovery that a deoxyribonucleic acid (DNA) sequence comprising one or more 7-deazaguanine modifications dramatically decreases the susceptibility of the DNA to endonucleases in bacterial host restriction-modification systems (RM) compared to the same nucleic acid sequence without the 7-deazaguanine modifications. Restriction-modification systems are one of the major defense systems for bacteria to prevent the invasion by foreign nucleic acids5, such as phages, plasmids or integrons. Modifying nucleic acids (e.g., DNA) to incorporate the 7-deazaguanine modifications disclosed herein results in increased functionality or productivity of bacterial transformants because the modified DNA is less susceptible to host bacterial endonucleases.
Wild type bacteria encode for multiple defense systems against mobile genetic elements (MGEs). Many of these MGEs are used as tools for genetic engineering applications or as weapons against pathogens. Hence, the availability of a method that would protect these MGEs from bacterial defenses, particularly restriction enzymes, would greatly enhance their effectiveness. As demonstrated herein, nucleic acids (e.g., DNA) modified by dPreQ0, dPreQ1 or dG+ are protected from cleavage by a wide variety of restriction enzymes.
In one aspect, described herein is a bacterial cell (or bacterium) comprising a heterologous nucleic acid sequence comprising one or more deazaguanine bases. In some embodiments, the deazaguanine bases are 7-deazaguanine bases. Exemplary 7-deazaguanine bases include, but are not limited to, 7-amido-7-deazaguanine (ADG), 7-cyano-7-deazaguanine (PreQ0), 7-formamidino-7-deazaguanosine (G+) and 7-aminomethyl-7-deazaguanine (PreQ1).
In some embodiments, modifying the heterologous nucleic acid with one or more deazaguanine bases results in resistance to degradation by one or more restriction enzymes. In some embodiments, the one or more restriction enzymes is EcoRI (E. coli), EcoRII (E. coli), BamHI (B. amyloiquefaciens), HindIII (H. influenzae), NotI (N. otitidis), HinFI (H. influenzae), Sau3AI (S. aureus), PvuII (P. vulgaris), SmaI (S. marcescens), HaeIII (H. aegyptius), HgaI (H. gallinarum), AliI (A. luteus), EcoRV (E. coli), EcoP15I (E. coli), KpnI (K. pneumonia), PstI (P. stuartii), SacI (S. achromogenes), SalI (S. albus), Seal (S. caespitosus), SpeI (S. natans), SphI (S. phaeochromogenes), StuI (S. tubercidicus) and/or XbaI (X. badrii). Optionally, the heterologous nucleic acid comprising one or more deazaguanine bases is resistant to degradation by one or more of EcoRI, EcoRII, EcoRV and EcoP15I when transformed in E. coli.
The term “heterologous nucleic acid” is a nucleic acid that is not normally present in a particular wild type host cell. The bacterium has been “genetically modified” or “transformed” or “transfected” by heterologous nucleic acid when such nucleic acid(s) has been introduced inside the cell. Nucleic acids include DNA and RNA; can be single- or double-stranded; can be linear, branched or circular; and can be of any length. The heterologous nucleic acid described herein can be any DNA of interest. The DNA may be of genomic, cDNA, semisynthetic, synthetic origin, or any combinations thereof. The heterologous nucleic acid may encode any polypeptide having biological activity of interest or may be a DNA involved in the expression of the polypeptide having biological activity, e.g., a promoter. The heterologous nucleic acid encoding a polypeptide of interest may be obtained from any prokaryotic, eukaryotic, or other source. For purposes of the present disclosure, the term “obtained from” as used herein in connection with a given source shall mean that the polypeptide is produced by the source or by a cell in which a gene from the source has been inserted.
In some embodiments, the heterologous nucleic acid is a mobile genetic element. The term “mobile genetic element” or “MGE” as used herein refers to genetic elements that are not bound to a bacterial host and have the ability to move from one bacterial host to another. In some embodiments, the movement of DNA is within genomes (intracellular mobility). In some embodiments, the movement of DNA is between cells (intercellular mobility). Examples of MGEs include, but are not limited to, transposons, plasmids, bacteriophage nucleic acids, and pathogenicity islands. The MGE can be naturally occurring or engineered. The MGE can be cell-type specific, tissue specific, organism specific, or species specific (e.g., bacteria specific or human specific). The MGE can also be non-specific with respect to cell-type, tissue, organism and/or species.
A nucleic acid may be modified to incorporate one or more deazapurine bases in a cell-free environment or may be similarly modified in a bacterial cell. In some embodiments, the nucleic acid is modified in a bacterial cell. For example, in some embodiments, a nucleic acid (e.g., MGE) is introduced into a bacterial cell (e.g., E. coli, B. cereus, or B. subtilis) that has been modified to encode a transglycosidase (e.g., dpdA gene) and an amidotransferase (e.g., gat-queC gene) from Enterobacteria phage 9g and express their respective proteins, DpdA and Gat-QueC. The bacterial cell in its native state expresses additional enzymes (e.g., FolE, QueD, QueE and QueC) that are involved in the four first steps of PreQ0 synthesis. The expression of these native enzymes with a transglycosidase (and an amidotransferase) results in guanine(s) in the nucleic acid (e.g., MGE) being replaced with 7-cyano-7-deazaguanine (PreQ0) and 7-formamidino-7-deazaguanosine (0). The modified nucleic acid (comprising one or more deazapurine bases) can be collected by lysing the bacterial cell, and then subsequently introduced into a strain of interest.
In some embodiments, the nucleic acid is modified in a cell free environment. In this regard, isolated and purified transglycosidase (e.g., DpdA) and amidotransferases (e.g., Gat-QueC) are mixed with the nucleic acid (e.g., MGE) and the PreQ0 base (commercially available) for a time and temperature sufficient to promote modification of the nucleic acid by 7-formamidino-7-deazaguanosine (G+). The modified nucleic acid (comprising one or more deazapurine bases) can then be purified and introduced into a strain of interest. The use of DpdA alone will provide a nucleic acid modified with dPreQ0.
In some embodiments, a dGPT in a nucleic acid is modified into include a 7-substituted dazapurine dGTP, which DNA polymerases can use as a dNTP substrate to be integrated into newly created DNA (e.g., by PCR) (Cahove et al., ACS Chem. Biol. 11:3165-3171, 2016, the disclosure of which is incorporated herein by reference in its entirety).
In some embodiments, the heterologous nucleic acid is incorporated into a plasmid or other suitable expression vector (e.g., a bacteriophage-based vector). As used herein, the term “plasmid” or “vector” refers to an extrachromosomal nucleic acid, e.g., DNA, construct that is not integrated into a bacterial cell's chromosome. Plasmids are usually circular and capable of autonomous replication. Plasmids may be low-copy, medium-copy, or high-copy, as is well known in the art. Plasmids may optionally comprise a selectable marker, such as an antibiotic resistance gene, which helps select for bacterial cells containing the plasmid and which ensures that the plasmid is retained in the bacterial cell. A plasmid disclosed herein may comprise a nucleic acid sequence encoding a modified heterologous nucleic sequence e.g., a nucleotide sequence comprising one or more 7-deazaguanine bases.
The vector may contain one or more (e.g., two, several) selectable markers that permit easy selection of transformed bacterium (or bacterial cell). A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like. Examples of selectable markers include, but are not limited to, the dal genes from Bacillus subtilis or Bacillus licheniformis, or markers that confer antibiotic resistance such as ampicillin, chloramphenicol, kanamycin, or tetracycline resistance. Suitable markers for yeast host cells are ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3.
General methods, reagents and tools for transforming (e.g., bacteria) can be found, for example, in Sambrook et al (2001) Molecular Cloning: A Laboratory, Manual, 3rd ed., Cold Spring Harbor Laboratory Press, New York. Methods, reagents and tools for transforming yeast are described in “Guide to Yeast Genetics and Molecular Biology,” C. Guthrie and G. Fink, Eds., Methods in Enzymology 350 (Academic Press, San Diego, 2002).
In some embodiments, introduction of the modified heterologous nucleic acid sequence (or vector comprising the modified heterologous nucleic acid sequence) of the present disclosure into a host cell is accomplished by calcium phosphate transfection, DEAE-dextran mediated transfection, electroporation, or other common techniques (See Davis et al., 1986, Basic Methods in Molecular Biology, which is incorporated herein by reference). In one embodiment, a preferred method used to transform E. coli strains is electroporation and reference is made to Dower et al., 1988) NAR 16: 6127-6145. Indeed, any suitable method for transforming host cells can be used. It is not intended that the present disclosure be limited to any particular method for introducing the modified heterologous nucleic acids into host cells.
In some embodiments, the bacterial cell (or bacterium) is modified via CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) technology to express the modified heterologous nucleic acid. A CRISPR genomic locus can be found in the genomes of many bacteria and archaea. The CRISPR locus encodes products that function as a type of immune system to help defend the cell against foreign invaders, such as virus and phage. There are three stages of CRISPR locus function: integration of new sequences into the locus, biogenesis of CRISPR RNA (crRNA), and silencing of foreign invader nucleic acid. Five types of CRISPR systems (e.g., Type I, Type II, Type III, Type U, and Type V) have been identified.
A CRISPR locus includes a number of short repeating sequences referred to as “repeats.” The repeats can form hairpin structures and/or comprise unstructured single-stranded sequences. The repeats usually occur in clusters and frequently diverge between species. The repeats are regularly interspaced with unique intervening sequences referred to as “spacers,” resulting in a repeat-spacer-repeat locus architecture. The spacers are identical to or have high homology with known foreign invader sequences. A spacer-repeat unit encodes a crisprRNA (crRNA), which is processed into a mature form of the spacer-repeat unit. A crRNA comprises a “seed” or spacer sequence that is involved in targeting a target nucleic acid (in the naturally occurring form in prokaryotes, the spacer sequence targets the foreign invader nucleic acid). A spacer sequence is located at the 5′ or 3′ end of the crRNA.
A CRISPR locus also comprises polynucleotide sequences encoding CRISPR Associated (Cas) genes. Cas genes encode endonucleases involved in the biogenesis and the interference stages of crRNA function in prokaryotes. Some Cas genes comprise homologous secondary and/or tertiary structures.
crRNA biogenesis in a Type II CRISPR system in nature requires a trans-activating CRISPR RNA (tracrRNA). The tracrRNA is modified by endogenous RNaseIII, and then hybridizes to a crRNA repeat in the pre-crRNA array. Endogenous RNaseIII is recruited to cleave the pre-crRNA. Cleaved crRNAs are subjected to exoribonuclease trimming to produce the mature crRNA form (e.g., 5′ trimming). The tracrRNA remains hybridized to the crRNA, and the tracrRNA and the crRNA associate with a site-directed polypeptide (e.g., Cas9). The crRNA of the crRNA-tracrRNA-Cas9 complex guides the complex to a target nucleic acid to which the crRNA can hybridize. Hybridization of the crRNA to the target nucleic acid activates Cas9 for targeted nucleic acid cleavage. The target nucleic acid in a Type II CRISPR system is referred to as a protospacer adjacent motif (PAM). In nature, the PAM facilitates binding of a site-directed polypeptide (e.g., Cas9) to the target nucleic acid. Type II systems (also referred to as Nmeni or CASS4) are further subdivided into Type II-A (CASS4) and II-B (CASS4a). Jinek et al., Science, 337(6096):816-821 (2012) showed that the CRISPR/Cas9 system is useful for RNA-programmable genome editing, and International Patent Application Publication Number WO2013/176772 (incorporated herein by reference) provides numerous examples and applications of the CRISPR/Cas endonuclease system for site-specific gene editing.
Exemplary CRISPR/Cas polypeptides include the Cas9 polypeptides in FIG. 1 of Fonfara et al., Nucleic Acids Research, 42: 2577-2590 (2014) (incorporated herein by reference). The CRISPR/Cas gene naming system has undergone extensive rewriting since the Cas genes were discovered. FIG. 5 of Fonfara, supra, provides PAM sequences for the Cas9 polypeptides from various species.
Cas9 polypeptides can introduce double-strand breaks or single-strand breaks in nucleic acids, e.g., genomic DNA. The double-strand break can stimulate a cell's endogenous DNA-repair pathways (e.g., homology-dependent repair (HDR) or non-homologous end joining (NHEJ) or alternative non-homologous end joining (A-NHEJ) or microhomology-mediated end joining (MMEJ)). NHEJ can repair cleaved target nucleic acid without the need for a homologous template. This can sometimes result in small deletions or insertions (indels) in the target nucleic acid at the site of cleavage, and can lead to disruption or alteration of gene expression. HDR can occur when a homologous repair template, or exogenous nucleic acid, is available.
Thus, in some embodiments, homologous recombination is used to insert heterologous nucleic acid into the genome of the host bacterium. The modifications of the target DNA due to NHEJ and/or HDR can lead to, for example, mutations, deletions, alterations, integrations, gene correction, gene replacement, gene tagging, transgene insertion, nucleotide deletion, gene disruption, translocations and/or gene mutation. The processes of deleting genomic DNA and integrating non-native nucleic acid into genomic DNA are examples of genome editing.
In some aspects, the Cas9 nuclease is introduced to the bacterium as a protein (i.e., a protein-based system). Typically, the bacteria is treated chemically, electrically, or mechanically to allow Cas9 nuclease entry into the cell. Alternatively, the Cas9 nuclease is introduced to the bacterium as a nucleic acid (e.g., DNA or mRNA) under conditions which allow production of the nuclease. Guide RNA also is introduced into the bacterium.
A genome-targeting RNA is referred to as a “guide RNA” or “gRNA” herein. A guide RNA comprises at least a spacer sequence that hybridizes to a target nucleic acid sequence of interest, and a CRISPR repeat sequence. In Type II systems, the gRNA also comprises a tracrRNA sequence. In the Type II guide RNA, the CRISPR repeat sequence and tracrRNA sequence hybridize to each other to form a duplex. The duplex binds a site-directed polypeptide, such that the guide RNA and site-direct polypeptide form a complex. The guide RNA provides target specificity to the complex by virtue of its association with the Cas9 nuclease. The guide RNA thus directs the activity of the Cas9 nuclease. In some embodiments, the guide RNA is a single molecule guide RNA (sgRNA).
A single-molecule guide RNA in a Type II system comprises, in the 5′ to 3′ direction, an optional spacer extension sequence, a spacer sequence, a minimum CRISPR repeat sequence, a single-molecule guide linker, a minimum tracrRNA sequence, a 3′ tracrRNA sequence and an optional tracrRNA extension sequence. The optional tracrRNA extension may comprise elements that contribute additional functionality (e.g., stability) to the guide RNA. The single-molecule guide linker links the minimum CRISPR repeat and the minimum tracrRNA sequence to form a hairpin structure. The optional tracrRNA extension comprises one or more hairpins.
A nucleic acid encoding the Cas9 nuclease and/or guide RNA is typically delivered in an expression vector. The exogenous nucleic acid can be delivered in the same vector as the Cas9 nucleic acid, or in a second vector. Any of the expression vectors described herein may be used to deliver Cas9 nuclease-encoding nucleic acid into the bacterium. In many aspects, the expression vector is a plasmid. In some embodiments, an expression vector comprises one or more transcription and/or translation control elements. Depending on the host/vector system utilized, any of a number of suitable transcription and translation control elements, including constitutive and inducible promoters, transcription enhancer elements, transcription terminators, etc., may be used.
The Cas9 nuclease-encoding nucleic acid is operably linked to a promoter that drives protein expression. Exemplary prokaryotic promoters include, but are not limited to, wMel WSP Promote, wDc WSP Promoter and T7. For expressing small RNAs, including guide RNAs used in connection with Cas or Cpf1 endonuclease, promoters such as RNA polymerase III promoters, including for example U6 and H1, can be advantageous. Suitable promoters, as well as parameters for enhancing the use of such promoters, are known in art, and additional information and approaches are regularly being described; see, e.g., Ma, H. et al., Molecular Therapy—Nucleic Acids 3, e161 (2014) doi:10.1038/mtna.2014.12.
In various aspects, the heterologous nucleic acid is of bacteriophage origin. Indeed, in some embodiments, the materials and methods described herein are used to efficiently generate stocks of phage for laboratory or therapeutic use. Phages are an attractive therapeutic option for treating bacterial infections, as phages are more specific than antibiotics, are generally harmless to animals and humans, and have been shown to be effective in combatting antibiotic-resistant bacterial infections. Antibiotic-resistant bacterial infections are an increasing concern in clinical and non-clinical settings. Current first-line treatments rely upon the administration of small-molecule antibiotics to induce bacterial cell death. These broad-spectrum treatments disrupt the patient's normal microflora, allowing resistant bacteria and fungal pathogens to take advantage of vacated niches.
In this regard, described herein is method of producing a bacteriophage composition (e.g., a stock of bacteriophage) comprising (a) modifying a nucleic acid of bacteriophage origin to incorporate one or more deazaguanine bases as described herein; (b) introducing the modified nucleic acid into a host bacteria cell; (c) incubating the host bacteria cell until phage-mediated bacterial lysis occurs; and (d) isolating bacteriophage lysate. Optionally, the bacteriophage lysate is purified to produce a pharmaceutical composition of bacteriophage. The bacteriophage may be further modified to produce one or more anti-bacterial toxins.
Any suitable means for culturing bacterial cells is contemplated. Conditions for the culture and production of bacterial cells are readily available and well-known in the art. Cell culture media in general are set forth in Atlas and Parks (eds.) The Handbook of Microbiological Media (1993) CRC Press, Boca Raton, Fla. which is incorporated herein by reference. Additional information for cell culture is found in available commercial literature such as the Life Science Research Cell Culture Catalogue (1998) from Sigma-Aldrich, Inc (St Louis, Mo.) (“Sigma-LSRCCC”) and, for example, The Plant Culture Catalogue and supplement (1997) also from Sigma-Aldrich, Inc (St Louis, Mo.) (“Sigma-PCCS”), all of which are incorporated herein by reference. Also reference is made to the Manual of Industrial Microbiology and Biotechnology. A. Demain and J. Davies Eds. ASM Press. 1999.
In some embodiments, the cell culture medium is a liquid medium. In some embodiments, the cell culture medium is a semi-solid medium (e.g., cultured in semi-solid agar on a plate of solid agar).
In some embodiments, the bacteria (or bacterial cells) are grown under batch or continuous fermentations conditions. Classical batch fermentation is a closed system, wherein the compositions of the medium is set at the beginning of the fermentation and is not subject to artificial alterations during the fermentation. A variation of the batch system is a fed-batch fermentation. In this variation, the substrate is added in increments as the fermentation progresses. Fed-batch systems are useful when catabolite repression is likely to inhibit the metabolism of the cells and where it is desirable to have limited amounts of substrate in the medium. Batch and fed-batch fermentations are common and well known in the art. Continuous fermentation is a system where a defined fermentation medium is added continuously to a bioreactor and an equal amount of conditioned medium (e.g., containing the desired end-products) is removed simultaneously for processing. Continuous fermentation generally maintains the cultures at a constant high density where cells are primarily in the growth phase where production of end products is enhanced. Continuous fermentation systems strive to maintain steady state growth conditions. Methods for modulating nutrients and growth factors for continuous fermentation processes as well as techniques for maximizing the rate of product formation are well known in the art of industrial microbiology.
In some embodiments, the bacteriophage are isolated or purified from the lysate. For example, the culture medium can be filtered through a very small pore size filter to retain the bacteria and permit the smaller bacteriophage to pass through. Typically, a filter having a pore size in the range of from about 0.01 to about 1 μm can be used (or from about 0.1 to about 0.5 μm, or from about 0.2 to about 0.4 μm). Alternatively or in addition, the culture medium is purified from bacterial debris and endotoxins by dialysis using the largest pore membrane that retains bacteriophages, where the membrane preferably has a molecular cut-off of approximately 104 to about 107 daltons (or from about 105 to about 106 daltons). Many other suitable methods can be performed as disclosed for example in US 2001/0026795; US 2002/0001590; U.S. Pat. Nos. 6,121,036; 6,399,097; 6,406,692; 6,423,299; and WO 02/07742, the disclosures of which are incorporated herein by reference in their entireties.
Bacteria (or bacterial cells) for use according to the disclosure include, but are not limited to, Bacillus, Bacteroides, Bifidobacterium, Brevibacteria, Caulobacter, Clostridium, Enterococcus, Escherichia coli, Lactobacillus, Lactococcus, Listeria, Mycobacterium, Saccharomyces, Salmonella, Staphylococcus, Streptococcus, Vibrio, Bacillus coagulans, Bacillus subtilis, Bacteroides fragilis, Bacteroides subtilis, Bacteroides thetaiotaomicron, Bifidobacterium adolescentis, Bifidobacterium bifidum, Bifidobacterium breve UCC2003, Bifidobacterium infantis, Bifidobacterium lactis, Bifidobacterium longum, Clostridium acetobutylicum, Clostridium butyricum, Clostridium butyricum M-55, Clostridium cochlearum, Clostridium felsineum, Clostridium histolyticum, Clostridium multifermentans, Clostridium novyi-NT, Clostridium paraputrificum, Clostridium pasteureanum, Clostridium pectinovorum, Clostridium perfringens, Clostridium roseum, Clostridium sporogenes, Clostridium tertium, Clostridium tetani, Clostridium tyrobutyricum, Corynebacterium parvum, Escherichia coli MG1655, Escherichia coli Nissle 1917, Listeria monocytogenes, Mycobacterium bovis, Salmonella choleraesuis, Salmonella typhimurium, and Vibrio cholera. In certain embodiments, the bacteria are selected from the group consisting of Enterococcus faecium, Lactobacillus acidophilus, Lactobacillus bulgaricus, Lactobacillus casei, Lactobacillus johnsonii, Lactobacillus paracasei, Lactobacillus plantarum, Lactobacillus reuteri, Lactobacillus rhamnosus, Lactococcus lactis, Oxalobacter formigenes and Saccharomyces boulardii. In some embodiments, the bacterium is E. coli, B. cereus or L. acidophilus.
In some embodiments, the bacterium is a species of the genus Escherichia (e.g., E. coli). In various embodiments, the E. coli bacterial strain used in the processes described herein are derived from strain W3110, strain MG1655, strain B766 (E. coli W) or strain BW25113.
Other examples of useful E. coli strains include, but are not limited to, E. coli strains found in the E. coli Stock Center from Yale University (at website cgsc.biology.yale.edu/index.php); the Keio Collection, available from the National BioResource Project at NBRP E. coli, Microbial Genetics Laboratory, National Institute of Genetics 1111 Yata, Mishima, Shizuoka, 411-8540 Japan (www at shigen.nig.ac.jp/ecoli/strain/top/top.jsp); or strains deposited at the American Type Culture Collection (ATCC).
The bacteriophage described herein are optionally used to treat a bacterial infection in a subject in need thereof. In this regard, a suitable method comprises administering a bacteriophage comprising a heterologous nucleic acid comprising one or more deazapurine bases to the subject. In some embodiments, the bacterial infection is an Actinobacteria, Aquificae, Armatimonadetes, Bacteroidetes, Caldiserica, Chlamydiae, Chloroflexi, Chrysiogenetes, Cyanobacteria, Deferribacteres, Deinococcus-Thermus, Dictyoglomi, Elusimicrobia, Fibrobacteres, Firmicutes (e.g., Bacillus, Listeria, Staphylococcus), Fusobacteria, Gemmatimonadetes, Nitrospirae, Planctomycetes, Proteobacteria (e.g., Acidobacillus, Aeromonas, Burkholderia, Neisseria, Shewanella, Citrobacter, Enterobacter, Envinia, Escherichia, Klebsiella, Kluyvera, Morganella, Salmonella, Shigella, Yersinia, Coxiella, Rickettsia, Legionella, Avibacterium, Haemophilus, Pasteurella, Acinetobacter, Moraxella, Pseudomonas, Vibrio, Xanthomonas), Spirochaetes, Synergistets, Tenericutes (e.g., Mycoplasma, Spiroplasma, Ureaplasma), Thermodesulfobacteria or a Thermotoga infection. Optionally, the bacteriophage targets Salmonella spp., Listeria monocytogenes, MRSA, E. coli, Mycobacterium tuberculosis, Campylobacter spp., and/or Pseudomonas syringae. Alternatively, the bacteriophage is employed to destroy bacteria ex vivo (e.g., for surface sterilization).
In some embodiments, the heterologous nucleic acid (e.g., heterologous nucleic acid present in bacteriophage) is provided in a pharmaceutical composition, wherein the delivery vehicle is a pharmaceutically acceptable carrier. Pharmaceutically acceptable carriers are well known, and one skilled in the pharmaceutical art can easily select carriers suitable for particular routes of administration (Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 1985). Merely to illustrate, in the context of bacteriophage, the delivery vehicle optionally further stabilizes and/or enhances the efficacy of bacteriophage in inhibiting bacterial infection. In some embodiments, the delivery vehicle is a liquid vehicle suitable for administration by infusion or injection. In some embodiments, the delivery vehicle comprises a buffer. Exemplary buffers include, but are not limited to, phosphate buffered saline (PBS), lysogeny broth (LB), phage buffer (100 mM NaCl, 100 mM Tris-HCl, 0.01% (w/v) Gelatin), and Tryptic Soy broth (TSB). In some embodiments, the delivery vehicle is a solid vehicle suitable for administration, e.g., by inhalation or for application by spraying. In some embodiments, the delivery vehicle is a semi-solid or semi-liquid vehicle, such as a gel, cream, paraffin wax, or ointment, suitable for topical application.
All of the U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification, are incorporated herein by reference, in their entireties.
From the foregoing it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention.
Materials/Methods
Media composition: Lysogeny broth1 (LB): 10 g/L tryptone, 5 g/L yeast extract, 10 g/L NaCl, powder order from fisher (BP1426).
Brain heart infusion2 (BHI): Merck cat. 110493
BHI+3: BHI supplemented with 8 μM MnCl2, 0.25 mM, CaCl2, 0.2 mM MgSO4, 50 Mm Tris-HCl pH 7.5, 50 ng/μl choline chloride, 0.4% glycine and 100 μl/ml catalase.
Middlebrook 7H9 broth: 4.7 g Middlebrook 7H9 (Difco), 5 mL 40% glycerol, 900 mL ddH2O.
Middlebrook 7H10 agar: 19.0 g Middlebrook 7H10 (Difco), 12.5 mL 40% glycerol, 4.95 mL 40% dextrose, 5 drops anti-bubble, 990 mL ddH2O.
Middlebrook Top Agar: 4.7 g Middlebrook 7H9 (Difco), 7.0 g BactoAgar, ddH2O up to 1000 mL, 4 drops of anti-bubble.
Salt water (SW) stock (30%): 240 g/L NaCl, 30 g/L MgCl2, 35 g/L MgSO4, 7 g/L KCl, 5 mM Tris-HCl pH 7.5.
Modified growth medium (Rodrigez-Valera 1983) (MGM): for liquid broth 23% SW is used, 20% for agar medium and 18% for soft-agar medium. 5 g/L peptone and 1 g/L yeast extract are also added.
Difco nutrient broth: 3 g/L beef extract, 5 g/L peptone.
To these media, 15 g/L of agar are uses for solid medium and 7 g/L for top-agar medium.
Construction of the E. coli Q− mutants: The E. coli BW25113 folE::kan, queD::kan, queE::kan, queC::kan and tgt::kan mutants were collected from the Keio collection4. Each mutation was transduced using phage P15 in E. coli MG1655. The transductions were verified by PCR (couple of primers used: GO119/GO120 and GO121/GO122 for folE mutation, GO123/GO124 and GO125/GO126 for queD mutation, GO127/GO128 and GO129/GO130 for queE mutation, GO111/GO112 and GO113/GO114 for queC mutation, GO107/GO108 and GO109/GO110 for tgt mutation). The kanamycin cassette was removed from all these strains but Δtgt using pCP20 as described by Datsenko and Wanner6. The resulting strains are listed in Table 1.
Salmonella phage SE1
Streptococcus phage Dp-1
Escherichia phage CAjan
Escherichia phage Greed
Escherichia phage Seurat
Escherichia phage slur01
Escherichia phage
Mycobacterium phage Ares
Mycobacterium phage Boyle
Mycobacterium phage Glass
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Opia
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Tres
Mycobacterium virus Ta17a
Streptococcus virus MS1
Vibrio phage
Vibrio phage vB_VhaS-tm
Vibrio phage VpKK5
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Daffy
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Larva
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Milly
Mycobacterium phage
Mycobacterium phage
Mycobacteriumphage
Mycobacterium phage Oline
Mycobacterium phage
Mycobacterium phage Orion
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Pops
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Soto
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Vista
Mycobacterium phage ZoeJ
Mycobacterium virus
Salmonella phage 7-11
Salmonella phage SE131
Salmonella phage SEN1
Salmonella phage SPN3UB
Salmonella virus BTP1
Vibrio phage
Vibrio phage
Vibrio phage
Vibrio phage Athenal
Vibrio phage
Vibrio phage KVP40
Vibrio phage ValKK3
Vibrio phage VH7D
Vibrio phage phi-ST2
Vibrio phage nt-1
Vibrio phage
Escherichia phage ECML-4
Escherichia phage FEC14
Escherichia phage PhaxI
Escherichia virus CBA120
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Salmonella phage 38
Salmonella phage GG32
Salmonella phage PhiSH19
Salmonella phage SFP10
Salmonella phage STML-
Salmonella phage
Salmonella phage
Salmonella phage
Salmonella phage Vi01
Vibrio phage
Vibrio phage
Vibrio phage
Vibrio phage pVa-21
Salmonella phage SE1
Streptococcus phage Dp-1
Escherichia phage CAjan
Escherichia phage Greed
Escherichia phage Seurat
Escherichia phage slur01
Escherichia phage
Mycobacterium phage Ares
Mycobacterium phage Boyle
Mycobacterium phage Glass
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Opia
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Tres
Mycobacterium virus Ta17a
Streptococcus virus MS1
Vibrio phage
Vibrio phage vB_VhaS-tm
Vibrio phage VpKK5
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Daffy
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Larva
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Milly
Mycobacterium phage
Mycobacterium phage
Mycobacteriumphage
Mycobacterium phage Oline
Mycobacterium phage
Mycobacterium phage Orion
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Pops
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Soto
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage Vista
Mycobacterium phage ZoeJ
Mycobacterium virus
Salmonella phage 7-11
Salmonella phage SE131
Salmonella phage SEN1
Salmonella phage SPN3UB
Salmonella virus BTP1
Vibrio phage
Vibrio phage
Vibrio phage
Vibrio phage Athenal
Vibrio phage
Vibrio phage KVP40
Vibrio phage ValKK3
Vibrio phage VH7D
Vibrio phage phi-ST2
Vibrio phage nt-1
Vibrio phage
Escherichia phage ECML-4
Escherichia phage FEC14
Escherichia phage PhaxI
Escherichia virus CBA120
Mycobacterium phage
Mycobacterium phage
Mycobacterium phage
Salmonella phage 38
Salmonella phage GG32
Salmonella phage PhiSH19
Salmonella phage SFP10
Salmonella phage STML-
Salmonella phage
Salmonella phage
Salmonella phage
Salmonella phage Vi01
Vibrio phage
Vibrio phage
Vibrio phage
Vibrio phage pVa-21
Cloning E. coli tgt: The tgt gene was amplified by PCR from E. coli MG1655 using tgt_pBAD24 KpnI_F and tgt_pBAD24_SphI_R primers. The resulting PCR product and pBAD24 were digested by KpnI and SphI. (NEB) following the recommendation of the manufacturer. The genes were then inserted by ligation using the T4 DNA ligase from NEB, following the manufacturer recommendations. The resulting plasmid was verified by sequencing (data not shown).
Cloning of 9g genes: dpdA, folE, queD, queE and gat-queC genes from Enterobacteria phage 9g (accession number: NC_024146) were amplified by PCR using the couple of primers GO80/GO81, GO92/GO93, GO94/GO95, GO100/GO101 and GO96/GO97, respectively. pBAD24 plasmid and the PCR products were digested by SalI-HF and SbfI-HF (NEB), following the recommendation of the manufacturer. The genes were then inserted by ligation using the T4 DNA ligase from NEB, following the manufacturer recommendations. dpdA and gat-queC were also cloned in pBAD33 using the same methods. The resulting plasmids were verified by sequencing (data not shown). Each resulting plasmid was transformed in different mutants of E. coli MG1655 as listed in Table 1 for the experiment showed in
Plasmid DNA preparation for Mass spectrometry: Overnight cultures were diluted 1/100-fold into 500 mL of LB supplemented with 0.4% arabinose, 100 μg/mL ampicillin and 20 μg/mL of chloramphenicol. Cells were grown overnight and pelleted. The Qiagen maxi-prep kit was used to extract the plasmid following the recommendations of the manufacturer.
Rosebush and Orion DNA purification: Mycobacteriophages and Rosebush and Orion were grown as described previously13. In brief, 30 mL of a dense M. smegmatis culture was mixed with approximately 106 phage particle, 270 mL of top-agar were added and the mixture was plated on 30 large (150×10 mm) solid media plates. After incubation for 36-48 h at 37° C., 10 mls of phage buffer added, incubated for 4 hrs at room temperature, and the phage lysate collected. Following clarification by centrifugation, phage particles were precipitated with the addition of NaCl to a final concentration of 1M and polyethylene glycol 8000 to a final concentration of 10%. The precipitated particles were collected by centrifugation for 10 minutes at 5,500×g at 4° C., and resuspended in 10 mls of phage buffer. The lysate was clarified by centrifuged at 5,500×g for 10 minutes at 4° C., 8.5 g of CsCl was added, and placed in a heat-sealed tube. Samples were centrifuged at 38,000 RPM (98,000×g) for 16 hours, and the visible phage band removed with a syringe through the side of the tube.
Prior to DNA extraction, CsCl was removed by dialysis against phage buffer overnight at 4° C. For DNA extraction, 0.5 mls of phage lysate (˜1012 particles) were incubated with 12.5 mM MgCl2, 0.8 μU/mL DNAse I and 100 μg/mL RNAse at room temperature for 30 minutes. To this, 20 mM EDTA, 50 μg/mL of Proteinase K and 0.5% of SDS were added, vortexed vigorously and incubated at 55° C. for 60 minutes. An equal volume of phenol:chlorophorm:isoamyl-alcohol (25:24:1) was added and the mixture was inverted several time before being centrifuged for 5 minutes at room temperature at 13,000 rpm (16,000×g). This step was repeated several times on the aqueous phase obtained until the white interphase was gone. The DNA was ethanol precipitated from the sample, pelleted, washed with 500 μL of 70% ethanol, dried, and the DNA pellet resuspended in 50 μL ddH2O. DNA concentrations were measured using NanoDrop (ThermoScientific).
HVTV-1 DNA purification: To 30 mL of a stationary phase Haloarcula valismoris grown in MGM 23%, enough phages were added to obtain confluent lysis on plates. 270 mL of MGM 18% top-agar were added and the mixture was completely plated on MGM 20% agar. The phages were grown for 4-5 days at 37° C. then a top layer of HVTV-1 virus buffer14 (1.2 M NaCl, 44 mM MgCl2, 47 mM MgSO4, 1.5 mM CaCl2, 28 mM KCl, 24 mM Tris-HCl pH 7.2) was poured on top of each plate. Phages were allowed to diffuse to the liquid phase for 4 h at 4° C. before being harvested. Debris were pelleted, and phages were precipitated over night at 4° C. by adding 10% polyethylene glycol (PEG 8000) to the supernatant. The phage suspension was centrifuged for 10 minutes at 4,500×g at 4° C. The phage pellet was resuspended in 10 mL of HVTV-1 virus buffer and dialyzed in the same buffer over night at 4° C. to eliminate the last traces of PEG. 12.5 mM MgCl2, 0.8 μU/mL DNAse I and 100 μg/mL RNAse were added and the mixture were incubated at room temperature for ˜30 minutes. 20 mM EDTA, 50 μg/mL of Proteinase K and 0.5% of SDS were added to the mixture, which was then vortexed vigorously and incubate at 55° C. for 60 minutes. A equal volume of phenol:chlorophorm:isoamyl-alcohol (25:24:1) was then added and the mixture was inverted several time before being centrifuged for 5 minutes at room temperature at 4,500×g. This step was repeated several times on the aqueous phase obtained until the white interphase was gone. An equal volume of chloroform was added to the aqueous phase, vortexed and centrifuged again to eliminate the last traces of phenol. The DNA was then ethanol precipitated from the sample and pelleted. The pellet was washed with 500 μL of 70% ethanol. The dried DNA pellet was then resuspended in ˜50 μL dH2O. Concentrations were measured using a NanoDrop® ND-1000 Spectrophotometer (Thermo scientific, Waltham, Mass.).
9g DNA purification: To 30 mL of a stationary phase E. coli MG1655 grown in LB, enough phages were added to obtain confluent lysis on plates. 270 mL of LB top-agar were added and the mixture was completely plated on LB agar. The phages were grown overnight at 37° C. then a top layer of TM buffer (10 mM MgSO4, 10 mM Tris-HCl pH 7.5) was poured on top of each plate. Phages were allowed to diffuse to the liquid phase for 4 h at 4° C. before being harvested. Debris were pelleted, and phages were precipitated over night at 4° C. by adding 1 M of NaCl and 10% polyethylene glycol (PEG 8000) to the supernatant. The phage suspension was centrifuged for 10 minutes at 4,500×g at 4° C. The phage pellet was resuspended in 10 mL of TM buffer and dialyzed in the same buffer over night at 4° C. to eliminate the last traces of PEG. 12.5 mM MgCl2, 0.8 μU/mL DNAse I and 100 μg/mL RNAse were added and the mixture were incubated at room temperature for ˜30 minutes. 20 mM EDTA, 50 μg/mL of Proteinase K and 0.5% of SDS were added to the mixture, which was then vortexed vigorously and incubate at 55° C. for 60 minutes. A equal volume of phenol:chlorophorm:isoamyl-alcohol (25:24:1) was then added and the mixture was inverted several time before being centrifuged for 5 minutes at room temperature at 4,500×g. This step was repeated several times on the aqueous phase obtained until the white interphase was gone. An equal volume of chloroform was added to the aqueous phase, vortexed and centrifuged again to eliminate the last traces of phenol. The DNA was then ethanol precipitated from the sample and pelleted. The pellet was washed with 500 μL of 70% ethanol. The dried DNA pellet was then resuspended in ˜50 μL dH2O. Concentrations were measured using a NanoDrop® ND-1000 Spectrophotometer (Thermo scientific, Waltham, Mass.).
To a solution of compound i16 (130 mg, 0.33 mmol,
To a suspension of i16 (600 mg, 1.53 mmol) in pyridine (10 mL) was added CuCN (1.37 g, 15.3 mmol) with stirring under reflux for 20 h. The reaction mixture was cooled to ambient temperature and solvent evaporated. The resulting solid was washed thoroughly with 20% MeOH in dichloromethane, with the washings combined, evaporated and purified by column chromatography (100-200 mesh silica gel) eluting with 10% to 20% MeOH in dichloromethane to afford dPreq0 (220 mg, 49%) as off-white solid. HRMS (ESI): m/z calculated for C12H14N5O4 [M+H]+ 292.1046, observed 292.1043.
Dry HCl gas was bubbled through a suspension of dPreQ0 (100 mg, 0.34 mmol) in anhydrous MeOH (20 mL) at 0° C. for 2 h. Following stirring at ambient temperature for 16 h, the reaction mixture was evaporated and treated with 7N NH3 in MeOH at 0° C., with stirring for 16 h. The crude reaction mixture was evaporated under vacuum and purified by MPLC using C18 column eluting with acetonitrile and H2O. The fractions containing product was lyophilized to afford dG+ (20 mg, 18%) as an off-white solid18. HRMS (ESI): m/z calculated for C12H17N6O4 [M+H]+ 309.1311, observed 309.1306.
Q detection in tRNA: Overnight cultures were diluted 1/100-fold into 5 mL of LB supplemented with 0.4% arabinose and 100 μg/mL ampicillin and grown for 2 h at 37° C. Cells were harvested by centrifugation at 16,000×g for 2 min at 4° C. Cell pellets were immediately resuspended in 1 mL of Trizol (Life technologies, Carlsbad, Calif.). Small RNAs were extracted using PureLink™ miRNA Isolation kit from Invitrogen (Carlsbad, Calif.) according to manufacturer protocol. The purified RNAs were eluted in 50 μL of RNase free water and tRNA concentrations were measured by NanoDrop® ND-1000 Spectrophotometer (Thermo scientific, Waltham, Mass.). Then, 200 μs were used in 3-(Acrylamido)-phenylboronic acid (APB) assay described in detail previously32 using the (5′-biotin-CCCTCGGTGACAGGCAGG-3′) probe that detects tRNAAsp(GUC) at final concentration of 0.3 μM.
Restriction assay for deazapurine presence in plasmid DNA: E. coli strains containing different variation of pBAD24 and pBAD33 (with or without dpdA or gat-queC from Enterobacteria phage 9g) were grown overnight in LB supplemented with 0.2% of glucose at 37° C. Each strain was diluted 100-fold in LB supplemented with 0.4% of arabinose and grown 6 h at 37° C. Plasmids were extracted using the Qiagen QIAprep Spin Miniprep Kit and 500 ng of plasmid were digested by EcoRI-HF (New England Biolabs, Ipswich Mass.) for 1 h at 37° C. in 20 mL CutSmart buffer. The enzyme was inactivated by 20 min incubation at 80° C. The samples were run on a 0.5% agarose gel, Tris-EDTA acetate (TAE) 1×. The gel was then stained 30 min in 0.5 μg/mL ethidium bromide, then washed 3 times for 15 min in water, and visualized with the Azur Biosystem c200 gel doc (Thermofisher, Waltham, Mass., USA).
Search for phage encoding queuosine and archaeosine biosynthesis proteins: The Viruses nr database from NCBI was queried by three iterations of PSI-BLAST37, default set up as previously suggested50, using the proteins referenced in Table 2, known to be involved in Queuosine (Q) or Archaeosine (G+) biosynthesis, as well as DpdA from Enterobacteria phage 9g, predicted to be involved in the modification of phage DNA, and another DpdA2 from Vibrio phage nt-1, part of a new family identified in this study.
Escherichia coli
Escherichia coli
Escherichia coli
Escherichia coli
Escherichia coli
Escherichia coli
Escherichia coli
Thermoproteus uzoniensis
Aeropyrum pernix K1
Halalkalicoccus paucihalophilus
Bifidobacterium longum
Aeromonas salmonicida
Enterobacteria phage 9g
Vibrio phage nt-1
The PreQ0 specific transporter YhhQ27 was also added. For each virus identified with at least one of these genes, a reverse analysis was done (phage genome again the protein list) to ensure that no protein was missed during the first analysis. Each identified ortholog was verified by HHpred38 for its annotation.
Identification of the host and their gene content: The Virus-Host DB44 was used to gather the host of each phage identified in this study. For phages not referenced in this database, a manual investigation coupling RefSeq42 and the literature was performed (data now shown) Each host identified was queried in the Globi database43 (data not shown) The same analysis was done for the double strand DNA (dsDNA) phages, as only these phages were return in our analysis (data not shown). A list of genomes was created on PubSeed45 from the hosts identified to create a new spreadsheet.
Mass spectrometry analysis: DNA analysis was performed as previously but with several modifications16. Purified DNA (20 μg) was hydrolyzed in 10 mM Tris-HCl (pH 7.9) with 1 mM MgCl2 with Benzonase (20 U), DNase I (4 U), calf intestine phosphatase (17 U) and phosphodiesterase (0.2 U) for 16 h at ambient temperature. Following passage through a 10 kDa filter to remove proteins, the filtrate was lyophilized and resuspended to a final concentration of 0.2 μg/μL (based on initial DNA quantity).
Quantification of the modified 2′-deoxynucleosides (dADG, dQ, dPreQ0, dPreQ1 and dG+) and the four canonical 2′-deoxyribonucleosides (dA, dT, dG, and dC) was achieved by liquid chromatography-coupled triple quadrupole mass spectrometry (LC-MS/MS) and in-line diode array detector (LC-DAD), respectively. Aliquots of hydrolyzed DNA were injected onto a Phenomenex Luna Omega Polar C18 column (2.1×100 mm, 1.6 μm particle size) equilibrated with 98% solvent A (0.1% v/v formic acid in water) and 2% solvent B (0.1% v/v formic acid in acetonitrile) at a flow rate of 0.25 mL/min and eluted with the following solvent gradient: 12% B for 10 min, 1 min ramp to 100% B for 10 min, 1 min ramp to 2% B for 10 min. The HPLC column was coupled to an Agilent 1290 Infinity DAD and an Agilent 6490 triple quadruple mass spectrometer (Agilent, Santa Clara, Calif.). The column was kept at 40° C. and the auto-sampler was cooled at 4° C. The UV wavelength of the DAD was set at 260 nm and the electrospray ionization of the mass spectrometer was performed in positive ion mode with the following source parameters: drying gas temperature 200° C. with a flow of 14 L/min, nebulizer gas pressure 30 psi, sheath gas temperature 400° C. with a flow of 11 L/min, capillary voltage 3,000 V and nozzle voltage 800 V. Compounds were quantified in multiple reaction monitoring (MRM) mode with the following m/z transitions: 310.1→194.1, 310.1→177.1, 310.1→293.1 for dADG, 394.1→163.1, 394.1→146.1, 394.1→121.1 for dQ, 292.1→176.1, 176.1→159.1, 176.1→52.1 for dPreQ0, 296.1→163.1, 296.1→121.1, 296.1→279.1 for dPreQ1, and 309.1→193.1, 309.1→176.1, 309.1→159.1 for dG+. External calibration curves were used for the quantification of the modified canonical 2′-deoxynucleosides. The calibration curves were constructed from replicate measurements of eight concentrations of each standard. A linear regression with r2>0.995 was obtained in all relevant ranges. The limit of detection (LOD), defined by a signal-to-noise ratio (S/N)≥3, ranged from 0.1 to 1 fmol for the modified 2′-deoxynucleosides. Data acquisition and processing were performed using MassHunter software (Agilent, Santa Clara, Calif.).
Restriction assay of phage DNA: 250 ng of phage DNA were digested by different enzymes (New England Biolabs) described in
First, it was determined whether the phage 9g genes predicted to encode PreQ0 synthesis enzymes could complement the Q deficiency phenotype of E. coli derivatives lacking the corresponding orthologs. As shown in
It was predicted that dual expression of the viral gat-queC and dpdA genes in trans would lead to the insertion of 7-deazaguanine derivatives, as dG+, in E. coli DNA. Because the presence of dG+ confers resistance to EcoRI digestion34, restriction profiles were used as a first indication for the presence of modifications in plasmid DNA. The two phage genes were both cloned in pBAD24 and pBAD33. EcoRI cuts pBAD24 once and pBAD33 twice, as shown in the digestion profiles of plasmids extracted from an E. coli derivative co-transformed with the two empty plasmids (
Analysis of dG+, dADG, dPreQ0 and dPreQ1 profiles by liquid chromatography-coupled triple quadrupole mass spectrometry (LC-MS/MS) (
Interestingly, whereas we had failed to complement the Q− phenotype of the E. coli queC strain when expressing the phage 9g Gat-QueC gene, the EcoRI resistance phenotype caused by 7-deazapurine insertion in strains expressing both 9g dpdA and gat-queC was still observed in a ΔqueC background (
Finally, whether the E. coli TGT was required for DpdA activity in E. coli was tested as the active forms of TGT enzymes are known to be dimers36. This does not seem to be the case as the restriction resistance phenotype was still observed in the Δtgt background (
A new sub-family of DpdA encoded by the Vibrio phage nt-1 was identified by investigating genes flanking PreQ0 biosynthesis genes cluster. Indeed, phage nt-1 DpdA (YP_008125322) is not detected with PSI-BLAST when using the E. coli phage 9g DpdA as input sequence and it does not possess the conserved histidine found at position 196 but similarities with members of the TGT family could be detected using HHpred. This protein was renamed DpdA2.
An in silico search for phages that could harbor 7-deazaguanine derivatives in their genomic DNA revealed that a total of 182 viruses deposited in GenBank were found to encode a DpdA homolog and/or at least a G+ synthesis gene (Table 1). Most of these viruses (163/182) were bacteriophages, while 16 archaeal viruses as well as the 3 eukaryotic viruses were found. The latter only encode for FolE, which is most likely to be linked to the folate pathway39. Analyses of the presence/absence patterns of the predicted Q/G+ biosynthesis genes led to classification of these viruses in various groups and in some cases, predict the nature of the 7-deazaguanine base modification. It is important to note that no homologs to the proteins specifically involved in Q biosynthesis such as QueA, QueG, or QueH (see
The first group contains 25 phages and is represented by Enterobacteria phage 9g (KJ419279), Streptococcus phage Dp-1 (NC_015274) and Vibrio phage nt-1 (NC_021529) in
The second group includes 40 phages and is represented by E. coli phage CAjan (NC_028776) and Mycobacterium phage Rosebush (AY129334) in
The third group contains 76 phages including Salmonella phage 7-11 (NC_015938) and Mycobacterium phage Orion (DQ398046) shown in
The last group is composed of 48 phages encoding proteins of the PreQ0/G+ pathway but no DpdA. These phages could boost the production of the Q precursor to increase the level of Q in the host tRNA and increase translation efficiency40. However, it is possible that 7-deazaguanines are inserted in their DNA in a DpdA independent pathway as there is a recent report that the genomes of Capylobacter phages from this group are highly modified by dADG (data not shown).
Phages containing FolE and QueC singletons were discarded from further analysis because FolE is shared between folate and PreQ0 synthesis16 while QueC is also part of a superfamily of ATPase (COG) making their precise role to identify.
All the phages identified above are members of the Caudovirales order and are distributed into various families: Siphoviridae (95), Myoviridae (23), Ackermannviridae (20) and Podoviridae (3). For the Archaeal virus, 12 Ligamenvirales and 2 Bicaudaviridae were identified (data not shown).
To study the interaction between phages containing 7-deazaguanine related genes and their bacterial hosts, metadata on the hosts and their habitat was gathered using RefSeq42 and the Globi database43, and the distribution of Q, G+ and dADG synthesis genes in these organisms was analyzed (data not shown). Interestingly, 106 of the collected phages (˜60%) infect a strain that is the model for a known bacterial pathogen, where only ˜9% of the dsDNA viruses from the Virus-Host database44 infect a strain related to pathogen (data not shown). No clear environment was found for the archaeal hosts.
All phage hosts predicted to modify their DNA with G+ possess the pathway to produce Q in tRNA. Curiously the hosts of the phages coding for a QueF-L and a 9g DpdA homolog do not encode for the PreQ0 biosynthetic pathway (QueDEC, see
There is no clear pattern for the bacterial hosts of phages encoding both DpdA and the whole PreQ0 pathway. Most of them encode the full Q pathway enzymes except for Streptococcus pneumoniae, which lacks PreQ0 pathway genes, Rhodococcus erythropolis, which encodes only TGT, and the Mycobacteria, that possess none of these genes.
The hosts of the phages encoding only DpdA also encode for the full set of Q synthesis enzymes except the Clostridium species, which lack the PreQ0 pathway genes, and the Mycobacterium genus, that possess none of these genes. Sulfolobi were not referenced in PubSeed45, but using BLASTp with default parameters with the genes listed in Table 2 above as queries, all G+ pathway genes were identified. Hence, the 7-deazaguanine intermediates produced by these hosts, Clostridium and Mycobacterium excluded, might be used by phages that lack the biosynthesis proteins to produce a 7-deazaguanine precursor.
Finally, the hosts of the phages that do not encode a DpdA but encode the PreQ0 pathway proteins all encode the full Q synthesis pathway.
A few bacterial hosts, such as 46 different strains of E. coli, Haloarcula valismortis and Vibrio harveyi 1DA3, also harbor homologs of the bacterial DpdA. In these cases, infecting phages could be modified by the host modification machinery.
To test predictions on the nature of phage DNA modification, a set of phages from each group was selected, and their genomic DNA were extracted for mass spectrometry analysis (Table 3).
Escherichia
Escherichia
Mycobacterium
Mycobacterium
Salmonella
Streptococcus
Vibrio phage
Interestingly, no 2′-deoxyqueuosine (dQ) was found in any of the tested samples, correlating with the fact that no phage or virus encodes the specific protein for Q synthesis (QueAGH).
First, phages encoding both a DpdA and one of the amidotransferase homologs were analyzed. Streptococcus phage Dp-1 DNA, encoding for a QueF-L, contained a large amount of dPreQ1 (3,389 modifications per 106 nucleotides, ˜1.7% of the Gs) but no dG+, which would mean that the QueF-L of this phage would actually be functionally closer to the bacterial QueF than the archaeal QueF-L, as predicted by the SSN clustering. Vibrio phage nt-1, encoding an ArcS, was shown to harbor not only dG+ (44 modifications per 106 nucleotides, ˜0.02% of the Gs) but also dPreQ0 and dADG (232 modifications per 106 nucleotides, ˜0.11% of the Gs, and 72 modifications per 106 nucleotides, ˜0.03% of the Gs, respectively). This result might indicate that nt-1 DpdA is more promiscuous and could insert all intermediates of the pathway.
Next, phages of the second group that encode both a DpdA and the four proteins of the PreQ0 biosynthesis pathway but no amidotransferase homolog were investigated. Mycobacterium phage Rosebush was found to harbor dPreQ0 in its DNA (96,530 modifications per 106 nucleotides, ˜28% of the Gs) as does Escherichia phage CAjan (70,628 modifications per 106 nucleotides, ˜32% of the Gs). However, Mycobacterium phage Rosebush was found to also harbor a very small amount of dADG (9 modifications per 106 nucleotides, ˜0.003% of the Gs). These proportions are negligible for Rosebush and could be the result of the natural oxidation of the PreQ0 base.
The genomic DNA of Salmonella phage 7-11 and Mycobacterium phage Orion from the third group of phage, which only encode a DpdA were also analyzed by LC-MS/MS. Mycobacterium phage Orion lacked any 7-deazaguanine modifications in its DNA. This result was expected as none of the phage nor the host encode for the PreQ0 biosynthesis pathway (Mycobacterium smegmatis, Table 3). However, Salmonella phage 7-11 was unexpectedly modified by dADG (50 modifications per 106 nucleotides, ˜0.02% of the Gs), suggesting the presence of a protein responsible for the oxidation of PreQ0 encoded by the phage.
Finally, Halovirus HVTV-1, which encodes the four proteins of the PreQ0 biosynthesis pathway and an ArcS homolog but no DpdA, contained mainly dPreQ1 (88,607 modifications per 106 nucleotides, ˜30% of the Gs) but also relatively small amounts of dADG and dG+ (152 modifications per 106 nucleotides, ˜0.05% of the Gs, and 22 modifications per 106 nucleotides, ˜0.008% of the Gs, respectively). As its host, Haloarcula valismortis, harbors a DpdA homolog, it is possible that the host DpdA inserts PreQ0 in Halovirus HVTV-1 DNA before it is further modified to dPreQ1 or dG+ by the viral ArcS, that would have evolved to perform a nitrile reduction as well, or to dADG by another unidentified protein.
The different modifications present in the phages analyzed above may lead to distinct resistance patterns to host defense mechanism such as RM systems. To test this hypothesis, phage DNA preparations were digested with a set of restriction enzymes that had been shown to be totally or partially inactivated in the presence of the dG+ modification34. As a control, and as shown in
Mycobacteria phage Rosebush DNA that carries PreQ0 showed a slightly different pattern of resistance. The restriction profiles for BamHI, BstXI and EcoRV were identical to those of Enterobacteria phage 9g. However, Rosebush DNA was fully sensitive to HaeIII, MluI and PciI and resisted to NdeI degradation (
Discussion:
As described herein, the presence of 7-deazaguanine modifications was directly linked with a restriction resistance phenotype.
In addition, all 7-deazaguanine modified DNA preparation tested were protected to various degrees from digestion by restriction enzymes. Transplanting the dG+ modification in E. coli reproduced the resistance to cleavage by EcoRI (
Four 7-deazaguanine modifications in DNA were detected: dADG in bacteria, and dG+, dPreQ1 and dPreQ0, all represented in phages. dADG was observed in phage genomes for the first time. The genes involved in the synthesis of these different modifications also were identified. FolE, QueD and QueE from Enterobacteria phage 9g were proven to functionally replace their E. coli orthologs (
Most 7-deazaguanine containing phage genomes also harbor a gene coding for a DpdA homolog. As with its bacterial homolog32, the phage DpdA introduces PreQ0 in DNA (
The combination of comparative genomic analyses and experimental validations described herein has allowed to elucidate pathways for the insertion of dPreQ0, dPreQ1 and dG+ in phage genomes (
HHpred analysis predicted that a homolog of the archaeal QueF-L, that synthesizes G+-tRNA from the PreQ0-tRNA49, was encoded by Streptococcus phage Dp-1. However, we found that this phage was modified by dPreQ1. It is unclear if the reduction occurs on free PreQ0, similarly to the bacterial QueF proteins22, and then the free base PreQ1 is inserted by DpdA, or if the phage QueF is able to modify the DNA-bounded dPreQ0, as does the archaeal QueF-L with tRNA49. However, Halovirus HVTV-1 contains mainly dPreQ1, but also small amounts of dADG and dG+. It is possible that the QueF-L is on the verge of evolving from an amidohydrolase to an amidotransferase reaction, but one cannot rule out that the host ArcS could catalyze the reaction, although the specific PUA domain specific for tRNA bidding makes it highly unlikely.
Interestingly, 7-deazaguanine modifications seem to dramatically decrease the susceptibility of the phage genomes to the host restriction-modification systems (RM). These systems are one of the major defense systems for bacteria to prevent the invasion by foreign DNA5. Phages evolved to escape these RM systems by different methods including modification of their genomic DNA11-14. As demonstrated by the data provided herein, the presence of the dG+ modification was directly linked with the restriction resistance phenotype. In addition, all 7-deazaguanine modified DNA preparations tested were protected to various degrees from digestion by restriction enzymes. It was also observed that introducing the dG+ modification in E. coli reproduced the resistance to cleavage by EcoRI (
The following Example describes an in vivo method for introducing 7-deazaguanine modifications into a heterologous nucleic acid.
Specific laboratory strains of the gram-negative bacteria Escherichia coli and the gram positive Bacillus subtilis will be engineered to encode the dpdA and gat-queC from Enterobacteria phage 9g and produce the respective proteins, DpdA and Gat-QueC, when voluntarily induced by the experimenter (
The advantage of this system is that it requires only a few materials but the strain of interest has to have a compatible MGE with the modifying strain. The number of modifying strains used to produce the modification will be expanded as this technology grows to be more available to diverse species of bacteria.
The following Example describes an in vitro method for introducing 7-deazaguanine modifications into a heterologous nucleic acid.
The Enterobacteria phage 9g dpdA and gat-queC genes will be cloned in an expression plasmid, such as pET28. DpdA and Gat-QueC protein will be expressed in a specific strain of E. coli, such as BL21, and further purified to be used in vitro (
The advantage of this method is that all that is needed is the proteins and PreQ0 to modify a nucleic acid of interest, and thus it can be easily set up in form of a kit. However, this technique is not applicable to phage, unless the phage packaging system is available in vitro.
The present application claims the benefit of priority to U.S. Provisional Application No. 62/816,815, filed Mar. 11, 2019, the disclosure of which is incorporated by reference in its entirety.
This invention was made with government support under GM070641 awarded by The National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US20/21886 | 3/10/2020 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62816615 | Mar 2019 | US |