The present invention, in some embodiments thereof, relates to methods of downregulating prokaryotic genes and, more particularly, but not exclusively, to methods of downregulating bacterial genes.
One of the most successful ways for understanding the function of a gene within an organism is to silence its expression. This is usually done by modifying, interrupting or deleting the DNA of the gene, and is referred to as gene ‘knockout’ [Austin et al., 2004, Nature Genetics 36:921-4]. Gene knockouts are mostly carried out through homologous recombination, and this method was used in multiple studies and in multiple organisms from bacteria to mammals.
Despite the fact that gene targeting by knockout is a powerful tool, it is a complex, labor intensive and time consuming procedure. It is therefore hard to scale up cost-effectively, and most studies are generally limited to a knockout of a single gene rather than a complete pathway. Moreover, knockout is mostly limited to organisms in which homologous recombination is relatively efficient, such as mouse [Austin et al., 2004, Nature Genetics 36:921-4] and yeast [Deutscher et al., 2006, Nature Genetics 38:993-8].
A breakthrough in the search for efficient alternative to gene knockouts in eukaryotes was achieved when it was realized that RNA-interference (RNAi) could be used to silence the expression of specific genes without interruption of their DNA. RNAi is a conserved biological mechanism first discovered in the nematode Caenorhabditis elegans, where it was demonstrated that injection of long dsRNA into this nematode led to sequence-specific degradation of the corresponding mRNAs. This silencing response has been subsequently found in other eukaryotes including fungi, plants and mammals [Fire et al., 1998, Nature 391(6669):806-11; Hannon, 2002, Nature 418(6894):244-51].
While the role of RNAi is, at least in part, to protect against viral infections and mobile element infestations, it was also shown that artificial transfection of short dsRNA duplexes, which target specific endogenous mRNAs, into mammalian cells can trigger gene specific silencing [Elbashir et al., 2001, Nature. 2001 May 24; 411(6836):494-8]. These short dsRNAs (called siRNAs) are converted into single strands by the RNA-induced silencing protein complex (RISC), and the RISC-siRNA complex identifies target mRNAs by base pairing, leading to their degradation by an RNA nuclease. Researchers are now using this RNAi gene silencing technology and its derivatives to understand the biological function of endogenous genes in many eukaryotic organisms, and usage of this technology has led to important scientific breakthroughs.
The enormous advantages of RNAi in genetic studies have so far not been reproduced in prokaryotes (bacteria and archaea), because the RNAi system seems to be limited to the eukaryotic lineage.
U.S. Patent Application 20040053289 teaches the use of si hybrids to down-regulate prokaryotic genes.
CRISPR is a genetic system comprised of a cluster of short repeats (24-47 bp long), interspersed by similarly sized non repetitive sequences (called spacers). Additional components of the system include CRISPR-associated (CAS) genes and a leader sequence (
CRISPR arrays and CAS genes vary greatly among microbial species. The direct repeat sequences frequently diverge between species, and extreme sequence divergence is also observed in the CAS genes. The size of the repeat can vary between 24 and 47 bp, with spacer sizes of 26-72 bp. The number of repeats per array can vary from 2 to the current record holder, Haliangium ochraceum, which has 382 repeats in one array and, although many genomes contain a single CRISPR locus, M. jannaschii has 18 loci. Finally, although in some CRISPR systems only 6, or fewer, CAS genes have been identified, others involve more than 20. Despite this diversity, most CRISPR systems have some conserved characteristics (
It was recently demonstrated experimentally that in response to phage infection, bacteria integrate new spacers that are derived from phage genomic sequences, resulting in CRISPR-mediated phage resistance. The new repeat-spacer units were added at the leader-proximal end of the array, and had to match the phage sequence exactly (100% identity), to provide complete resistance. When such phage-derived spacers were artificially introduced into the CRISPR array of a phage-sensitive S. thermophilus strain, it became phage-resistant [Barrangou et al, 2007, Science 315(5819): 1709-12]. Indeed, spacers found in naturally occurring CRISPR arrays are frequently derived from phages and other extrachromosomal elements [Bolotin et al., 2005, Microbiology 151(Pt 8): 2551-61].
Additional background art includes Horvath et al [International Publication No. WO2008/108989, Bronus et al [Science, 321, 960 (2008)], U.S. Application No: 20100076057 and Sorek et al., [Nature Reviews Microbiology, 6, 181, 2008].
According to an aspect of some embodiments of the present invention there is provided an isolated polynucleotide, comprising
(i) a clustered, regularly interspaced short palindromic repeat (CRISPR) array nucleic acid sequence wherein at least one spacer of the CRISPR is sufficiently complementary to a portion of at least one prokaryotic gene so as to down-regulate expression of the prokaryotic gene; and
(ii) a nucleic acid sequence encoding at least one CRISPR associated (CAS) polypeptide of a repeat associated mysterious protein (RAMP) family.
According to an aspect of some embodiments of the present invention there is provided a nucleic acid construct comprising the isolated polynucleotide of the present invention.
According to an aspect of some embodiments of the present invention there is provided a nucleic acid construct system comprising:
(i) a first nucleic acid construct comprising an isolated polynucleotide having a clustered, regularly interspaced short palindromic repeat (CRISPR) array nucleic acid sequence wherein at least one spacer of the CRISPR is sufficiently complementary to a portion of at least one prokaryotic gene so as to down-regulate expression of the prokaryotic gene; and
(ii) a second nucleic acid construct comprising an isolated polynucleotide having a nucleic acid sequence encoding at least one CRISPR associated (CAS) polypeptide of a repeat associated mysterious protein (RAMP) family.
According to an aspect of some embodiments of the present invention there is provided a method of down-regulating expression of a gene of a prokaryotic cell, the method comprising introducing into the cell a CRISPR system polynucleotide encoding a CRISPR array and at least one CRISPR associated (CAS) polypeptide of a repeat associated mysterious protein (RAMP) family, wherein a spacer of the CRISPR array is sufficiently complementary with a portion of the gene to down-regulate expression of the gene, thereby down-regulating expression of gene of a prokaryotic cell.
According to an aspect of some embodiments of the present invention there is provided a nucleic acid construct comprising two repeat sequences of a CRISPR array flanking a cloning site or making a cloning site when concatenated.
According to an aspect of some embodiments of the present invention there is provided a method of treating a bacterial infection in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of an isolated polynucleotide, comprising a clustered, regularly interspaced short palindromic repeat (CRISPR) system nucleic acid sequence, the CRISPR system encoding a CRISPR array and at least one CRISPR associated (CAS) polypeptide of a repeat associated mysterious protein (RAMP) family wherein at least one spacer of the CRISPR array is sufficiently complementary to a portion of at least one bacterial gene so as to down-regulate expression of the bacterial gene, the bacterial gene being a vital bacterial gene or a bacterial virulence gene, thereby treating the bacterial infection.
According to an aspect of some embodiments of the present invention there is provided a method of treating an antibiotic-resistant bacterial infection in a subject in need thereof, the method comprising administering to the subject the isolated polynucleotide of the present invention, thereby treating the antibiotic-resistant bacterial infection.
According to an aspect of some embodiments of the present invention there is provided a method of annotating a prokaryotic gene, the method comprising:
(a) introducing the isolated polynucleotide of the present invention into a prokaryote under conditions that allow downregulation of the prokaryotic gene; and
(b) assaying a phenotype of the prokaryote, wherein a change in phenotype following the introducing is indicative of the prokaryotic gene being associated with the phenotype.
According to some embodiments of the invention, the at least one spacer comprises at least two spacers, each being sufficiently complementary to a portion of different prokaryotic genes so as to down-regulate expression of the different prokaryotic genes.
According to some embodiments of the invention, the at least one spacer comprises 26-72 base pairs.
According to some embodiments of the invention, the prokaryotic gene is a bacterial gene.
According to some embodiments of the invention, the bacterial gene is associated with down-regulation of biofuel production.
According to some embodiments of the invention, the bacterial gene is selected from the group consisting of acetate kinase, phosphate acetyltransferase and L-lactate dehydrogenase.
According to some embodiments of the invention, the bacterial gene is a genetic repressor CcpN.
According to some embodiments of the invention, the bacterial gene is an antibiotic resistance gene.
According to some embodiments of the invention, the antibiotic resistance gene is a methicillin resistance gene or a vancomycin resistance gene.
According to some embodiments of the invention, the bacterial gene is a bacterial virulence gene.
According to some embodiments of the invention, the bacterial gene is a ribosomal RNA gene, a ribosomal protein gene or a tRNA synthestase gene.
According to some embodiments of the invention, the bacterial gene is selected from the group consisting of dnaB, fabI, folA, gyrB, murA, pytH, metG, and tufA(B).
According to some embodiments of the invention, the prokaryotic gene is an archaeal gene.
According to some embodiments of the invention, the isolated polynucleotide further comprises a nucleic acid sequence encoding a CRISPR leader sequence.
According to some embodiments of the invention, the mesophilic organism is Neisseria sicca ATCC29256.
According to some embodiments of the invention, the nucleic acid construct comprises a nucleic acid sequence encoding at least one polypeptide having at sequence selected from the group consisting of SEQ ID NOs: 1354-1360.
According to some embodiments of the invention, the nucleic acid construct encodes each of the polypeptides as set forth in SEQ ID NOs: 1354-1360.
According to some embodiments of the invention, the at least one CAS polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 1354-1360.
According to some embodiments of the invention, the first nucleic acid construct comprises a leader sequence upstream to the at least one spacer as set forth in SEQ ID NO: 1374.
According to some embodiments of the invention, a repeat sequence of the CRISPR array is as set forth in SEQ ID NO: 1369, 1373, 1374 or 1375.
According to some embodiments of the invention, the nucleic acid construct further comprises a leader sequence of the CRISPR array.
According to some embodiments of the invention, CRISPR array is of a RAMP module.
According to some embodiments of the invention, the nucleic acid construct further comprises at least one spacer sequence of the CRISPR array.
According to some embodiments of the invention, the at least one CRISPR associated (CAS) polypeptide of a RAMP family is of a mesophilic organism.
According to some embodiments of the invention, the at least one CRISPR associated (CAS) polypeptide of a RAMP family is of a thermophilic organism.
According to some embodiments of the invention, the nucleic acid sequence encoding at least one CRISPR associated (CAS) polypeptide of a RAMP family comprises a sequence encoding a RAMP module.
According to some embodiments of the invention, the isolated polynucleotide is non-naturally occurring.
According to some embodiments of the invention, the nucleic acid construct comprises a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1, 6 and 11.
According to some embodiments of the invention, the nucleic acid construct comprises a cis regulatory element.
According to some embodiments of the invention, the cis regulatory element is a promoter.
According to some embodiments of the invention, the promoter is an inducible promoter.
According to some embodiments of the invention, the gene is not introduced into the cell by a bacteriophage.
According to some embodiments of the invention, the gene is integrated into a chromosome of the cell.
According to some embodiments of the invention, the gene is endogenous to the prokaryotic cell.
According to some embodiments of the invention, the gene is epichromosomal.
According to some embodiments of the invention, the method further comprises introducing into the cell a naïve CRISPR array system.
According to some embodiments of the invention, the bacterial infection is induced by methicillin resistant Staphylococcus aureus or vancomycin resistant Staphylococcus aureus.
Unless otherwise defined, all technical and/or scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of the invention, exemplary methods and/or materials are described below. In case of conflict, the patent specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and are not intended to be necessarily limiting.
Some embodiments of the invention are herein described, by way of example only, with reference to the accompanying drawings. With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of embodiments of the invention. In this regard, the description taken with the drawings makes apparent to those skilled in the art how embodiments of the invention may be practiced.
In the drawings:
The present invention, in some embodiments thereof, relates to methods of downregulating prokaryotic genes and, more particularly, but not exclusively, to methods of downregulating bacterial genes.
Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not necessarily limited in its application to the details set forth in the following description or exemplified by the Examples. The invention is capable of other embodiments or of being practiced or carried out in various ways.
The clusters of regularly interspaced short palindromic repeat (CRISPR) system is associated with defense of bacteria and archaea providing protection thereto against invading phages. Resistance is acquired by incorporating short stretches of invading DNA sequences in genomic CRISPR loci. These integrated sequences are thought to function as a genetic memory that prevents the host from being infected by viruses containing this recognition sequence.
It has previously been proposed that CRISPR's ability to down-regulate extrachromosomal DNA may be manipulated such that it can also down-regulate genes that are integrated into the chromosome of a cell. Thus, it was proposed that in order to selectively down-regulate a gene of interest, a CRISPR-bearing plasmid may be transformed into a prokaryotic cell of choice (e.g. bacterial cell) with one of the spacers being changed to match the gene of interest (Sorek et al, 2008, Nature Reviews Microbiology, 6, 181).
However, previous attempts to engineer the CRISPR system to down regulate endogenous genes have failed, due to the discovery that many CRISPR systems target DNA rather than RNA [Marraffini & Sontheimer, Science 322, 1843 (2008)]. In order to circumvent this problem, the present inventors suggest using a specific CRISPR subtype, called the RAMP module (or cmr module). Recent evidence suggests that the RAMP module (or cmr module) is unique, and performs its action by silencing RNA rather than DNA [Hale et al, Cell 139, 863, 2009]. However, most natural RAMP modules, including the one that was experimentally examined [Hale et al, Cell 139, 863, 2009], occur within thermophilic (high-temperature-adapter) prokaryotes, and are thus not expected to work well in human pathogens and in model organisms such as E. coli.
The present inventors propose engineering polynucleotides of the CRISPR system which encode a CRISPR array and at least one CRISPR associated (CAS) polypeptide of a repeat associated mysterious protein (RAMP) family so that its spacers will target endogenous genes. This targeting is expected to result in degradation of the targeted mRNA, thus allowing selective silencing of specific genes of choice within bacteria, i.e., selective gene knock-down without manipulation of the original microbial genome (as explained above).
Thus, according to one aspect of the present invention there is provided a method of down-regulating expression of a gene of a prokaryotic cell. The method comprises introducing into the cell a CRISPR system encoding a CRISPR array and at least one CRISPR associated (CAS) polypeptide of a repeat associated mysterious protein (RAMP) family, wherein a spacer of the CRISPR array is sufficiently complementary with a portion of the gene to down-regulate expression of the gene.
In an exemplary embodiment, the gene is not introduced into the cell by a bacteriophage.
As used herein, the phrase “down-regulating” refers to reducing or inhibiting the expression level of the gene, on the RNA and optionally on the protein level. According to one embodiment, the gene is down-regulated by at least 10%. According to another embodiment, the gene is down-regulated by at least 20%. According to another embodiment, the gene is down-regulated by at least 30%. According to another embodiment, the gene is down-regulated by at least 40%. According to another embodiment, the gene is down-regulated by at least 50%. According to another embodiment, the gene is down-regulated by at least 60%. According to another embodiment, the gene is down-regulated by at least 70%. According to another embodiment, the gene is down-regulated by at least 80%. According to another embodiment, the gene is down-regulated by at least 90%. According to another embodiment, the gene is down-regulated by 100% (i.e. inhibiting gene expression).
As used herein, the term “gene” refers to a DNA sequence which encodes a polypeptide or a non-coding, functional RNA.
The phrase “gene of a prokaryotic cell” refers to a gene that is present in the prokaryotic cell but not necessarily integrated into the chromosome of the prokaryotic cell.
According to one embodiment, the genes which are down-regulated are those that are integrated into the chromosome of the prokaryote.
According to another embodiment, the genes which are down-regulated are those that remain outside the chromosome i.e. remain epichromosomal.
According to another embodiment, the gene is endogenous to the cell. The term “endogenous gene” refers to a native gene in its natural location in the genome of an organism.
Examples of contemplated genes that may be downregulated according to the method of this aspect of the present invention, include, but are not limited to genes associated with down-regulation of organic material production in bacteria.
Thus, for example, the present invention contemplates the down-regulation of genes whose knockout enhanced the production of ethanol as a biofuel. Shaw et al., (PNAS 2008, Sep. 16; 105(37):1769-74) teaches the knock-out of a number of genes (namely acetate kinase, phosphate acetyltransferase and L-lactate dehydrogenase, examples of sequences of each can be found in refseq accession no: NC—009012, Clostridium thermocellum ATCC 27405, complete genome) that resulted in the production of ethanol at high yields.
Tannler S et al [Metab Eng. 2008 September; 10(5):216-26], teaches enhanced ethanol production in bacteria by down-regulating expression of the gluconeogenic genes gapB and pckA (examples of sequences of each can be found in refseq accession no: NC—000964, Bacillus subtilis subsp. subtilis str. 168, complete genome) through knockout of their genetic repressor CcpN.
The present invention also contemplates the down-regulation of genes whose knockout enhanced the production of hydrogen as a biofuel.
Vardar-Schara et al [Microbial Biotechnology Vol 1, Issue 2, Pages 107-125], incorporated herein by reference, teaches a number of strains of genetically engineered bacteria which generate hydrogen. Vardar-Schara et al states therein, that the hydrogen yield is suboptimal in a number of those strains due to the presence of uptake hydrogenases. Accordingly, the present invention contemplates downregulation of these uptake hydrogenases, Hyd-1 and -2 (hyaB and hybC respectively) for the enhancement of hydrogen production. Examples of sequences of each can be found in refseq accession no: NC—011742, Escherichia coli S88, complete genome.
In addition, the present invention contemplates down-regulation of genes of metabolic pathways that compete for hydrogen production.
Exemplary genes that may be down-regulated to increase hydrogen production in bacteria include, but are not limited to lactate dehydrogenase (ldhA), the FHL repressor (hycA), fumarate reductase (frdBC), the Tat system (tatA-E), the alpha subunit of the formate dehydrogenase-N and -O (fdnG and fdoG respectively), the alpha subunit of nitrate reductase A (narG), pyruvate dehydrogenase (aceE), pyruvate oxidase (poxB) and proteins that transport formate (focA and focB)—see Vardar-Schara et al [Microbial Biotechnology Vol 1, Issue 2, Pages 107-125]. Examples of sequences of each can be found in refseq accession no: NC—011742, Escherichia coli S88, complete genome.
Lactic Acid Bacteria (LAB) play an essential role in the preservation, taste and texture of cheese, yogurt, sausage, sauerkraut and a large variety of traditional indigenous fermented foods. Down-regulation of such genes would ensure for example that lactic acid bacteria used in the food industry would have a better taste or smell. According to another embodiment, the genes that are down-regulated in bacteria are those which are involved in taste or odor.
For example, the buttermilk aroma diacetyl is formed from the carbon metabolism of dairy Lactococcus bacteria during buttermilk fermentation. Lactococcal strains that have low levels of a diacetyl reductase, acetoin reductase and butanediol dehydrogenase have been found to produce more diacetyl. Down-regulation of such enzymes would therefore be beneficial. Examples of sequences of each can be found in refseq accession no: NC—009004, Lactococcus lactis subsp. cremoris MG1363.
For example, down-regulation of those Lactobacillus bulgaricus genes associated with lactic acid production would be beneficial for the generation of mild forms of yoghurt.
As used herein, the expression “lactic acid bacterium” refers to a group of gram-positive, microaerophilic or anaerobic bacteria having in common the ability to ferment sugars and citrate with the production of acids including lactic acid as the predominantly produced acid, acetic acid, formic acid and propionic acid. The industrially most useful lactic acid bacteria are found among Lactococcus species, Streptococcus species, Lactobacillus species, Leuconostoc species, Oenococcus species and Pediococcus species. In the dairy industry, the strict anaerobes belonging to the genus Bifidobacterium is generally included in the group of lactic acid bacteria as these organisms also produce lactic acid and are used as starter cultures in the production of dairy products.
It will be appreciated that the present invention may be used to enhance production of any industrial, agricultural, pharmaceutical (e.g. recombinant protein production) product in bacteria by suppressing genes associated with lower levels of expression of that industrial product.
Other examples of bacterial genes that may be downregulated according to the method of this aspect of the present invention are genes that if down-regulated would aid in the treatment of a bacterial infection. Such genes include for example, antibiotic resistance genes, bacterial virulence genes and genes that are essential for the growth of bacteria.
The phrase “antibiotic resistance genes” as used herein refers to genes that confer resistance to antibiotics, for example by coding for enzymes which destroy it, by coding for surface proteins which prevent it from entering the microorganism, or by being a mutant form of the antibiotic's target so that it can ignore it.
Example of antibiotic resistance genes may be found on the ARDB—Antibiotic Resistance Genes Database—www.ardbdotcbcbdotumddotedu/. Particular examples of antibiotic resistance genes include, but are not limited to methicillin resistance gene or a vancomycin resistance gene.
The phrase “virulence gene” as used herein refers to a nucleic acid sequence of a microorganism, the presence and/or expression of which correlates with the pathogenicity of the microorganism. In the case of bacteria, such virulence genes may in an embodiment comprise chromosomal genes (i.e. derived from a bacterial chromosome), or in a further embodiment comprise a non-chromosomal gene (i.e. derived from a bacterial non-chromosomal nucleic acid source, such as a plasmid). In the case of E. coli, examples of virulence genes and classes of polypeptides encoded by such genes are described below. Virulence genes for a variety of pathogenic microorganisms are known in the art.
Examples of virulence genes include, but are not limited to genes encoding toxins, hemolysins, fimbrial and afimbrial adhesins, cytotoxic factors, microcins and colicins and also those identified in Sun et al., Nature medicine, 2000; 6(11): 1269-1273.
According to one embodiment of the invention, the bacterial virulence gene may be selected from the group consisting of actA (example is given in genebank accession no: NC—003210.1), Tem (example is given in genebank accession no: NC—009980), Shv (example is given in genebank accession no: NC—009648), oxa-1 (example is given in genebank accession no: NW—139440), oxa-7 (example is given in genebank accession no: X75562), pse-4 (example is given in genebank accession no: J05162), ctx-m (example is given in genebank accession no: NC—010870), ant(3″)-Ia (aadA1) (example is given in genebank accession no: DQ489717), ant(2″)-Ia (aadB)b (example is given in genebank accession no: DQ176450), aac(3)-IIa (aacC2) (example is given in genebank accession no: NC—010886), aac(3)-IV (example is given in genebank accession no: DQ241380), aph(3′)-Ia (aphA1) (example is given in genebank accession no: NC—007682), aph(3′)-IIa (aphA2) (example is given in genebank accession no: NC—010170), tet(A) (example is given in genebank accession no: NC—005327), tet(B) (example is given in genebank accession no: FJ411076), tet(C) (example is given in genebank accession no: NC—010558), tet(D) (example is given in genebank accession no: NC—010558), tet(E) (example is given in genebank accession no: M34933), tet(Y) (example is given in genebank accession no: AB089608), catI (example is given in genebank accession no: NC—005773), catII NC—010119, catIII (example is given in genebank accession no: X07848), floR (example is given in genebank accession no: NC—009140), dhfrI (example is given in genebank accession no: NC—002525), dhfrV (example is given in genebank accession no: NC—010488), dhfrVII (example is given in genebank accession no: DQ388126), dhfrIX (example is given in genebank accession no: NC—010410), dhfrXIII (example is given in genebank accession no: NC—000962), dhfrXV (example is given in genebank accession no: Z83311), suII (example is given in genebank accession no: NC—000913), suIII (example is given in genebank accession no: NC—000913), integron class 1 3′-CS (example is given in genebank accession no: AJ867812), vat (example is given in genebank accession no: NC—011742), vatC (example is given in genebank accession no: AF015628), vatD (example is given in genebank accession no: AF368302), vatE (example is given in genebank accession no: NC—004566), vga (example is given in genebank accession no: AF117259), vgb (example is given in genebank accession no: AF117258), and vgbB (example is given in genebank accession no: AF015628).
As mentioned, in order to kill various pathogenic bacteria, CRISPR could be used in order to silence essential genes (i.e., compatible with life) in the bacteria. Essential genes could be identified by their conservation among several pathogens (Payne et al, Nature Reviews Drug Discovery 6, 29-40 (January 2007)). Such genes include ribosomal RNA genes (16S and 23S), ribosomal protein genes, tRNA-synthetases, as well as additional genes shown to be essential such as dnaB, fabI, folA, gyrB, murA, pytH, metG, and tufA(B) NC—009641 for Staphylococcus aureus subsp. aureus str. Newman and NC—003485 for Streptococcus pyogenes MGAS8232 (DeVito et al, Nature Biotechnology 20, 478-483 (2002)).
As mentioned, the present invention teaches a method of down-regulating a gene (or genes) in a prokaryotic cell.
Examples of prokaryotic cells include, but are not limited to bacterial cells and archaeal cells (e.g. those belonging to the two main phyla, the Euryarchaeota and Crenarchaeota).
The bacteria whose genes may be down-regulated may be gram positive or gram negative bacteria. The bacteria may also be photosynthetic bacteria (e.g. cyanobacteria).
The term “Gram-positive bacteria” as used herein refers to bacteria characterized by having as part of their cell wall structure peptidoglycan as well as polysaccharides and/or teichoic acids and are characterized by their blue-violet color reaction in the Gram-staining procedure. Representative Gram-positive bacteria include: Actinomyces spp., Bacillus anthracis, Bifidobacterium spp., Clostridium botulinum, Clostridium perfringens, Clostridium spp., Clostridium tetani, Corynebacterium diphtheriae, Corynebacterium jeikeium, Enterococcus faecalis, Enterococcus faecium, Erysipelothrix rhusiopathiae, Eubacterium spp., Gardnerella vaginalis, Gemella morbillorum, Leuconostoc spp., Mycobacterium abcessus, Mycobacterium avium complex, Mycobacterium chelonae, Mycobacterium fortuitum, Mycobacterium haemophilium, Mycobacterium kansasii, Mycobacterium leprae, Mycobacterium marinum, Mycobacterium scrofulaceum, Mycobacterium smegmatis, Mycobacterium terrae, Mycobacterium tuberculosis, Mycobacterium ulcerans, Nocardia spp., Peptococcus niger, Peptostreptococcus spp., Proprionibacterium spp., Staphylococcus aureus, Staphylococcus auricularis, Staphylococcus capitis, Staphylococcus cohnii, Staphylococcus epidermidis, Staphylococcus haemolyticus, Staphylococcus hominis, Staphylococcus lugdanensis, Staphylococcus saccharolyticus, Staphylococcus saprophyticus, Staphylococcus schleiferi, Staphylococcus similans, Staphylococcus warneri, Staphylococcus xylosus, Streptococcus agalactiae (group B streptococcus), Streptococcus anginosus, Streptococcus bovis, Streptococcus canis, Streptococcus equi, Streptococcus milleri, Streptococcus mitior, Streptococcus mutans, Streptococcus pneumoniae, Streptococcus pyogenes (group A streptococcus), Streptococcus salivarius, Streptococcus sanguis.
The term “Gram-negative bacteria” as used herein refer to bacteria characterized by the presence of a double membrane surrounding each bacterial cell. Representative Gram-negative bacteria include Acinetobacter calcoaceticus, Actinobacillus actinomycetemcomitans, Aeromonas hydrophila, Alcaligenes xylosoxidans, Bacteroides, Bacteroides fragilis, Bartonella bacilliformis, Bordetella spp., Borrelia burgdorferi, Branhamella catarrhalis, Brucella spp., Campylobacter spp., Chalmydia pneumoniae, Chlamydia psittaci, Chlamydia trachomatis, Chromobacterium violaceum, Citrobacter spp., Eikenella corrodens, Enterobacter aerogenes, Escherichia coli, Flavobacterium meningosepticum, Fusobacterium spp., Haemophilus influenzae, Haemophilus spp., Helicobacter pylori, Klebsiella spp., Legionella spp., Leptospira spp., Moraxella catarrhalis, Morganella morganii, Mycoplasma pneumoniae, Neisseria gonorrhoeae, Neisseria meningitidis, Pasteurella multocida, Plesiomonas shigelloides, Prevotella spp., Proteus spp., Providencia rettgeri, Pseudomonas aeruginosa, Pseudomonas spp., Rickettsia prowazekii, Rickettsia rickettsii, Rochalimaea spp., Salmonella spp., Salmonella typhi, Serratia marcescens, Shigella spp., Treponema carateum, Treponema pallidum, Treponema pallidum endemicum, Treponema pertenue, Veillonella spp., Vibrio cholerae, Vibrio vulnificus, Yersinia enterocolitica and Yersinia pestis.
As mentioned, the method of the present invention is effected by introducing into the cell a CRISPR system polynucleotide encoding a CRISPR array and at least one CRISPR associated (CAS) polypeptide of a repeat associated mysterious protein (RAMP) family, wherein a spacer of the CRISPR array is sufficiently complementary with a portion of the gene to down-regulate expression of the gene.
According to another embodiment the method is effected by introducing into the cell a first polynucleotide which encodes the CRISPR array and a second polynucleotide which encodes at least one Cas polypeptide of the RAMP family.
Below is a short summary of CRISPR arrays.
CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) arrays together with the CAS genes form the CRISPR system.
CRISPR arrays also known as SPIDRs (SPacer Interspersed Direct Repeats) constitute a family of recently described DNA loci that are usually specific to a particular bacterial species. The CRISPR array is a distinct class of interspersed short sequence repeats (SSRs) that were first recognized in E. coli (Ishino et al, J. Bacteriol., 169:5429-5433). In subsequent years, similar CRISPR arrays were found in Mycobacterium tuberculosis, Haloferax mediterranei, Methanocaldococcus jannaschii, Thermotoga maritima and other bacteria and archaea. The repeats of CRISPR arrays are short elements that occur in clusters that are always regularly spaced by unique intervening sequences with a constant length. Although the repeat sequences are highly conserved between strains, the number of interspersed repeats and the sequences of the spacer regions differ from strain to strain. The repeat sequences are partially palindromic DNA repeats typically of 24 to 47 bp, containing inner and terminal inverted repeats of up to 11 bp. These repeats have been reported to occur from 1 to 382 times. Although isolated elements have been detected, they are generally arranged in clusters (up to about 20 or more per genome) of repeated units spaced by unique intervening 26-72 bp sequences.
As used herein, the phrase “CRISPR array polynucleotide” refers to a DNA or RNA segment which comprises sufficient CRISPR repeats such that it is capable of downregulating a complementary gene.
According to one embodiment, the CRISPR array polynucleotide comprises at least 2 repeats with 1 spacer between them.
According to another embodiment, the CRISPR array polynucleotide comprises at least 4 repeats with spacers inbetween each.
According to still another embodiment, at least one, at least two, at least three, at least four of the spacers of the CRISPR array polynucleotide are the native sequences of the array.
According to one embodiment, the CRISPR array polynucleotide comprises at least 1 spacer flanked on the 5′ end by 4-10 bases from the 3′ end of the repeat.
In an exemplary embodiment, the CRISPR array polynucleotide comprises all of the CRISPR repeats, starting with the first nucleotide of the first CRISPR repeat and ending with the last nucleotide of the last (terminal) repeat.
Various computer software and web resources are available for the analysis of and identification of CRISPR systems and therefore CRISPR arrays. These tools include software for CRISPR detection, such as PILERCR, CRISPR Recognition Tool and CRISPRFinder; online repositories of pre-analyzed CRISPRs, such as CRISPRdb; and tools for browsing CRISPRs in microbial genomes, such as Pygram. Databases for CRISPR systems include: www.crisprdotu-psuddotfr/crispr/CRISPRHomePagedotphp.
It has been revealed that CRISPR systems are found in approximately 40% and 90% of sequenced bacterial and archaeal genomes, respectively, and the present inventor contemplates the use of CRISPR arrays from all such CRISPR systems that target RNA.
According to one embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate the gene of interest, is 100% homologous to the naturally occurring (wild-type) sequence.
According to another embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 99% homologous to the naturally occurring (wild-type) sequence.
According to another embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 98% homologous to the naturally occurring (wild-type) sequence.
According to another embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 97% homologous to the naturally occurring (wild-type) sequence.
According to another embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 96% homologous to the naturally occurring (wild-type) sequence.
According to another embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 95% homologous to the naturally occurring (wild-type) sequence.
According to one embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 90% homologous to the naturally occurring (wild-type) sequence.
According to one embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 80% homologous to the naturally occurring (wild-type) sequence.
According to one embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 75% homologous to the naturally occurring (wild-type) sequence.
According to one embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 70% homologous to the naturally occurring (wild-type) sequence.
According to one embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 65% homologous to the naturally occurring (wild-type) sequence.
According to one embodiment, the CRISPR array polynucleotide comprises a nucleic acid sequence which, apart from the spacer, (or spacers) which is replaced so as to down-regulate a gene of interest, is 60% homologous to the naturally occurring (wild-type) sequence.
The present invention contemplates modification of the CRISPR system polynucleotide sequence such that the codon usage is optimized for the organism in which it is being introduced (e.g. E. coli).
Thus for example Cas6 polynucleotide sequence derived from Neisseria sicca codon optimized for use in E. coli is set forth in SEQ ID NO: 1361. A Cmr 1 polynucleotide sequence derived from Neisseria sicca codon optimized for use in E. coli is set forth in SEQ ID NO: 1362. A Cmr2 polynucleotide sequence derived from Neisseria sicca codon optimized for use in E. coli is set forth in SEQ ID NO: 1363. A Cmr3 polynucleotide sequence derived from Neisseria sicca codon optimized for use in E. coli is set forth in SEQ ID NO: 1364. A Cmr4 polynucleotide sequence derived from Neisseria sicca codon optimized for use in E. coli is set forth in SEQ ID NO: 1365. A Cmr5 polynucleotide sequence derived from Neisseria sicca codon optimized for use in E. coli is set forth in SEQ ID NO: 1366. A Cmr6 polynucleotide sequence derived from Neisseria sicca codon optimized for use in E. coli is set forth in SEQ ID NO: 1367.
Contemplated CRISPR systems that may be used according to this aspect of the present invention include, but are not limited to the CRISPR systems which encode at least one CRISPR associated (CAS) polypeptide of a repeat associated mysterious protein (RAMP) family.
As used herein, the term “cas gene” refers to the genes that are generally coupled, associated or close to or in the vicinity of flanking CRISPR arrays that encode CAS proteins.
According to one embodiment the cas gene is defined as such in one of the TIGRFAM profiles that were defined in [Haft et al, PLoS Comput Biol. 1, e60, 2005].
Preferably, a cas gene comprises at least 50%, more preferably at least 65%, more preferably at least 80%, more preferably at least 85%, more preferably at least 90%, more preferably at least 95%, more preferably at least 96%, more preferably at least 97%, more preferably at least 98%, most preferably at least 99% of the wild-type sequence. Preferably, a cas gene retains 50%, more preferably 60%, more preferably 70%, more preferably 80%, more preferably 85%, more preferably 90%, more preferably 95%, more preferably 96%, more preferably 97%, more preferably 98%, or most preferably 99% activity of the wild-type polypeptide or nucleotide sequence.
CRISPR arrays are typically found in the vicinity of four genes named cas1 to cas4. The most common arrangement of these genes is cas3-cas4-cas 1-cas2. The Cas3 protein appears to be a helicase, whereas Cas4 resembles the RecB family of exonucleases and contains a cysteine-rich motif, suggestive of DNA binding. The cas1 gene (NCBI COGs database code: COG1518) is especially noteworthy, as it serves as a universal marker of the CRISPR system (linked to all CRISPR systems except for that of Pyrococcus abyssii). Cas2 remains to be characterized cas1-4 are typically characterized by their close proximity to the CRISPR loci and their broad distribution across bacterial and archaeal species. Although not all cas1-4 genes associate with all CRISPR loci, they are all found in multiple subtypes.
In addition, there is another cluster of three genes associated with CRISPR structures in many bacterial species, referred to herein as cas1 B, cas5 and cas6; (See, [Barrangou et al, 2007, Science 315(5819): 1709-12]). In some embodiments, the cas gene is selected from cas1, cas2, cas3, cas4, cas IB, cas5 and/or cas6, fragments, variants, homologues and/or derivatives thereof. In some additional embodiments, a combination of two or more cas genes find use, including any suitable combinations, including those provided in WO 07/025097, incorporated herein by reference.
In some embodiments, the cas genes comprise DNA, while in other embodiments, the cas comprise RNA. In some embodiments, the nucleic acid is of genomic origin, while in other embodiments, it is of synthetic or recombinant origin. In some embodiments, the cas genes are double-stranded or single-stranded whether representing the sense or antisense strand or combinations thereof.
In some embodiments it is preferred that the cas gene is the cas gene that is closest to the leader sequence or the first CRISPR repeat at the 5′ end of the CRISPR locus—such as cas4 or cas6.
Exemplary CAS polypeptides of the RAMP subtype include Csm3-5, Cmr1, Cmr2, Cmr3, Cmr4, Cmr6 and Csx7.
Exemplary cas gene sequences of the RAMP subtype are set forth in SEQ ID NOs: 1-1339.
According to a specific embodiment, the CRISPR system encodes at least a Cas6 and the six gene operon Cmr1-Cmr6 of the Neisseria sicca bacteria. The CRISPR system may further encode Cas1 and Cas2 of the Neisseria sicca bacteria.
According to one embodiment, the CRISPR system polynucleotide sequence encodes at least 2, at least 3, at least 4, at least 5, at least 6, at least 7 or all the CAS polypeptides of the RAMP family.
According to another embodiment, the CRISPR system polynucleotide sequence encodes a combination of cas polypeptides that is found to be naturally occurring, wherein at least one cas polypeptide belongs to the RAMP subtype. Such a combination is referred to herein as a RAMP module.
Examples of CRISPR systems of the RAMP module are detailed in Table 1 in the Examples section herein below.
It will be appreciated that a given set of cas genes or proteins is typically associated with a given repeated sequence within a particular CRISPR array. Thus, cas genes appear to be specific for a given DNA repeat (i.e., cas genes and the repeated sequence form a functional pair).
Thus, for example if the CRISPR array which is used to downregulate a gene comprises the same repeat sequences as the CRISPR array from the Myxococcus xanthus DK 1622 RAMP module CRISPR system, then the cas genes that may also be comprised in the polynucleotide may be those from the Myxococcus xanthus DK 1622 CRISPR system. According to one embodiment, the same number of cas genes are added to the polynucleotide as the number of cas genes that appear in the original system. According to another embodiment, at least one of the cas genes that appears in the original system is added to the polynucleotide.
Thus for example, cas genes that appear in the Myxococcus xanthus DK 1622 RAMP module CRISPR system include Cas6 (SEQ ID NO: 2), cmr6 SEQ ID NO: 4, cmr5 SEQ ID NO: 6, cmr4 SEQ ID NO: 8, cmr3 SEQ ID NO: 10, cmr2 SEQ ID NO: 12, cmr1 SEQ ID NO: 14, and a putative CRISPR-associated protein SEQ ID NO: 1338. Polynucleotides comprising repeat sequences from the Myxococcus xanthus DK 1622 CRISPR system may therefore preferably comprise at least one of these sequences.
As another example, Neisseria sicca Cas proteins include the Cas6 protein (SEQ ID NO: 1354), and the six Cmr1-Cmr6 proteins (SEQ ID NOs: 1355, 1356, 1357, 1358, 1359 and 1360). Polynucleotides comprising repeat sequences from Neisseria sicca (e.g. as set forth in SEQ ID NOs:1369, 1373, 1374 or 1375) and the leader sequence from Neisseria sicca (as set forth in SEQ ID NO: 1372) may therefore preferably comprise polynucleotide sequences encoding the above mentioned Cas proteins.
Typically, once a CRISPR system which encodes at least one cas protein of the RAMP module is identified it may be amplified and isolated.
Amplification of the CRISPR system may be achieved by any method known in the art, including polymerase chain reaction (PCR). In the present invention, oligonucleotide primers may be designed for use in PCR reactions to amplify all or part of a CRISPR array.
The term “primer” refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, which is capable of acting as a point of initiation of synthesis when placed under conditions in which synthesis of a primer extension product which is complementary to a nucleic acid strand is induced (i.e., in the presence of nucleotides and an inducing agent—such as DNA polymerase and at a suitable temperature and pH). In some embodiments, the primer is single stranded for maximum efficiency in amplification, although in other embodiments, the primer is double stranded. In some embodiments, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis of extension products in the presence of the inducing agent. The exact length of the primers depends on many factors, including temperature, source of primer, and the use of the method. PCR primers are typically at least about 10 nucleotides in length, and most typically at least about 20 nucleotides in length. Methods for designing and conducting PCR are well known in the art, and include, but are not limited to methods using paired primers, nested primers, single specific primers, degenerate primers, gene-specific primers, vector-specific primers, partially mismatched primers, etc.
Exemplary primers that can amplify the M. xanthus CRISPR array are set forth in SEQ ID NO: 1350 and 1351.
As mentioned, in order to down-regulate the prokaryotic gene of interest, a spacer of the CRISPR array is replaced with a nucleic acid sequence, the nucleic acid sequence being sufficiently complementary to a portion of the prokaryotic gene.
As used herein, the term “spacer” refers to a non-repetitive spacer sequence that is found between multiple short direct repeats (i.e., CRISPR repeats) of CRISPR arrays. In some preferred embodiments, CRISPR spacers are located in between two identical CRISPR repeats. In some embodiments, CRISPR spacers are located in between two partial repeats. In some embodiments, CRISPR spacers are identified by sequence analysis at the DNA stretches located in between two CRISPR repeats.
In some preferred embodiments, CRISPR spacer is naturally present in between two identical, short direct repeats that are palindromic.
The phrase “portion of a gene” relates to a portion from the coding or non-coding region of the gene.
The phrase “sufficiently complementary” as used herein, refers to the sequence of the spacer being adequately complementary such that it is capable of downregulating expression of the gene.
According to one embodiment of this aspect of the present invention, a sequence which is sufficiently complementary to a portion of the prokaryotic gene is one which is at least about 70, about 75, about 80, about 85, or about 90% identical, or at least about 91, about 92, about 93, about 94, about 95, about 96, about 97, about 98, or about 99% identical to the prokaryotic gene. In some preferred embodiments, the sequence is 100% complementary to the prokaryotic gene.
Assays to test down-regulation of expression are known in the art and may be effected on the RNA or protein level.
Methods of Detecting the Expression Level of RNA
The expression level of the RNA in prokarytoic cells can be determined using methods known in the arts.
Northern Blot Analysis:
This method involves the detection of a particular RNA in a mixture of RNAs. An RNA sample is denatured by treatment with an agent (e.g., formaldehyde) that prevents hydrogen bonding between base pairs, ensuring that all the RNA molecules have an unfolded, linear conformation. The individual RNA molecules are then separated according to size by gel electrophoresis and transferred to a nitrocellulose or a nylon-based membrane to which the denatured RNAs adhere. The membrane is then exposed to labeled DNA probes. Probes may be labeled using radio-isotopes or enzyme linked nucleotides. Detection may be using autoradiography, colorimetric reaction or chemiluminescence. This method allows both quantitation of an amount of particular RNA molecules and determination of its identity by a relative position on the membrane which is indicative of a migration distance in the gel during electrophoresis.
RT-PCR Analysis:
This method uses PCR amplification of relatively rare RNAs molecules. First, RNA molecules are purified from the cells and converted into complementary DNA (cDNA) using a reverse transcriptase enzyme (such as an MMLV-RT) and primers such as, oligo dT, random hexamers or gene specific primers. Then by applying gene specific primers and Taq DNA polymerase, a PCR amplification reaction is carried out in a PCR machine. Those of skills in the art are capable of selecting the length and sequence of the gene specific primers and the PCR conditions (i.e., annealing temperatures, number of cycles and the like) which are suitable for detecting specific RNA molecules. It will be appreciated that a semi-quantitative RT-PCR reaction can be employed by adjusting the number of PCR cycles and comparing the amplification product to known controls.
RNA In Situ Hybridization Stain:
In this method DNA or RNA probes are attached to the RNA molecules present in the cells. Generally, the cells are first fixed to microscopic slides to preserve the cellular structure and to prevent the RNA molecules from being degraded and then are subjected to hybridization buffer containing the labeled probe. The hybridization buffer includes reagents such as formamide and salts (e.g., sodium chloride and sodium citrate) which enable specific hybridization of the DNA or RNA probes with their target mRNA molecules in situ while avoiding non-specific binding of probe. Those of skills in the art are capable of adjusting the hybridization conditions (i.e., temperature, concentration of salts and formamide and the like) to specific probes and types of cells. Following hybridization, any unbound probe is washed off and the slide is subjected to either a photographic emulsion which reveals signals generated using radio-labeled probes or to a colorimetric reaction which reveals signals generated using enzyme-linked labeled probes.
In Situ RT-PCR Stain:
This method is described in Nuovo G J, et al. [Intracellular localization of polymerase chain reaction (PCR)-amplified hepatitis C cDNA. Am J Surg Pathol. 1993, 17: 683-90] and Komminoth P, et al. [Evaluation of methods for hepatitis C virus detection in archival liver biopsies. Comparison of histology, immunohistochemistry, in situ hybridization, reverse transcriptase polymerase chain reaction (RT-PCR) and in situ RT-PCR. Pathol Res Pract. 1994, 190: 1017-25]. Briefly, the RT-PCR reaction is performed on fixed cells by incorporating labeled nucleotides to the PCR reaction. The reaction is carried on using a specific in situ RT-PCR apparatus such as the laser-capture microdissection PixCell I LCM system available from Arcturus Engineering (Mountainview, Calif.).
DNA Microarrays/DNA Chips:
The expression of thousands of genes may be analyzed simultaneously using DNA microarrays, allowing analysis of the complete transcriptional program of an organism during specific developmental processes or physiological responses. DNA microarrays consist of thousands of individual gene sequences attached to closely packed areas on the surface of a support such as a glass microscope slide. Various methods have been developed for preparing DNA microarrays. In one method, an approximately 1 kilobase segment of the coding region of each gene for analysis is individually PCR amplified. A robotic apparatus is employed to apply each amplified DNA sample to closely spaced zones on the surface of a glass microscope slide, which is subsequently processed by thermal and chemical treatment to bind the DNA sequences to the surface of the support and denature them. Typically, such arrays are about 2×2 cm and contain about individual nucleic acids 6000 spots. In a variant of the technique, multiple DNA oligonucleotides, usually 20 nucleotides in length, are synthesized from an initial nucleotide that is covalently bound to the surface of a support, such that tens of thousands of identical oligonucleotides are synthesized in a small square zone on the surface of the support. Multiple oligonucleotide sequences from a single gene are synthesized in neighboring regions of the slide for analysis of expression of that gene. Hence, thousands of genes can be represented on one glass slide. Such arrays of synthetic oligonucleotides may be referred to in the art as “DNA chips”, as opposed to “DNA microarrays”, as described above [Lodish et al. (eds.). Chapter 7.8: DNA Microarrays: Analyzing Genome-Wide Expression. In: Molecular Cell Biology, 4th ed., W. H. Freeman, New York. (2000)].
Oligonucleotide Microarray—
In this method oligonucleotide probes capable of specifically hybridizing with the polynucleotides of the present invention are attached to a solid surface (e.g., a glass wafer). Each oligonucleotide probe is of approximately 20-25 nucleic acids in length. To detect the expression pattern of the polynucleotides of the present invention in a specific cell sample (e.g., blood cells), RNA is extracted from the cell sample using methods known in the art (using e.g., a TRIZOL solution, Gibco BRL, USA). Hybridization can take place using either labeled oligonucleotide probes (e.g., 5′-biotinylated probes) or labeled fragments of complementary DNA (cDNA) or RNA (cRNA). Briefly, double stranded cDNA is prepared from the RNA using reverse transcriptase (RT) (e.g., Superscript II RT), DNA ligase and DNA polymerase I, all according to manufacturer's instructions (Invitrogen Life Technologies, Frederick, Md., USA). To prepare labeled cRNA, the double stranded cDNA is subjected to an in vitro transcription reaction in the presence of biotinylated nucleotides using e.g., the BioArray High Yield RNA Transcript Labeling Kit (Enzo, Diagnostics, Affymetix Santa Clara Calif.). For efficient hybridization the labeled cRNA can be fragmented by incubating the RNA in 40 mM Tris Acetate (pH 8.1), 100 mM potassium acetate and 30 mM magnesium acetate for 35 minutes at 94° C. Following hybridization, the microarray is washed and the hybridization signal is scanned using a confocal laser fluorescence scanner which measures fluorescence intensity emitted by the labeled cRNA bound to the probe arrays.
For example, in the Affymetrix microarray (Affymetrix®, Santa Clara, Calif.) each gene on the array is represented by a series of different oligonucleotide probes, of which, each probe pair consists of a perfect match oligonucleotide and a mismatch oligonucleotide. While the perfect match probe has a sequence exactly complimentary to the particular gene, thus enabling the measurement of the level of expression of the particular gene, the mismatch probe differs from the perfect match probe by a single base substitution at the center base position. The hybridization signal is scanned using the Agilent scanner, and the Microarray Suite software subtracts the non-specific signal resulting from the mismatch probe from the signal resulting from the perfect match probe.
Methods of Detecting Expression and/or Activity of Proteins
Expression and/or activity level of proteins expressed in prokaryotic cells can be determined using methods known in the arts.
Enzyme Linked Immunosorbent Assay (ELISA):
This method involves fixation of a sample (e.g., fixed cells or a proteinaceous solution) containing a protein substrate to a surface such as a well of a microtiter plate. A substrate specific antibody coupled to an enzyme is applied and allowed to bind to the substrate. Presence of the antibody is then detected and quantitated by a colorimetric reaction employing the enzyme coupled to the antibody. Enzymes commonly employed in this method include horseradish peroxidase and alkaline phosphatase. If well calibrated and within the linear range of response, the amount of substrate present in the sample is proportional to the amount of color produced. A substrate standard is generally employed to improve quantitative accuracy.
Western Blot:
This method involves separation of a substrate from other protein by means of an acrylamide gel followed by transfer of the substrate to a membrane (e.g., nylon or PVDF). Presence of the substrate is then detected by antibodies specific to the substrate, which are in turn detected by antibody binding reagents. Antibody binding reagents may be, for example, protein A, or other antibodies. Antibody binding reagents may be radiolabeled or enzyme linked as described hereinabove. Detection may be by autoradiography, colorimetric reaction or chemiluminescence. This method allows both quantitation of an amount of substrate and determination of its identity by a relative position on the membrane which is indicative of a migration distance in the acrylamide gel during electrophoresis.
Radio-Immunoassay (RIA):
In one version, this method involves precipitation of the desired protein (i.e., the substrate) with a specific antibody and radiolabeled antibody binding protein (e.g., protein A labeled with I125) immobilized on a precipitable carrier such as agarose beads. The number of counts in the precipitated pellet is proportional to the amount of substrate.
In an alternate version of the RIA, a labeled substrate and an unlabelled antibody binding protein are employed. A sample containing an unknown amount of substrate is added in varying amounts. The decrease in precipitated counts from the labeled substrate is proportional to the amount of substrate in the added sample.
Fluorescence Activated Cell Sorting (FACS):
This method involves detection of a substrate in situ in cells by substrate specific antibodies. The substrate specific antibodies are linked to fluorophores. Detection is by means of a cell sorting machine which reads the wavelength of light emitted from each cell as it passes through a light beam. This method may employ two or more antibodies simultaneously.
Immunohistochemical Analysis:
This method involves detection of a substrate in situ in fixed cells by substrate specific antibodies. The substrate specific antibodies may be enzyme linked or linked to fluorophores. Detection is by microscopy and subjective or automatic evaluation. If enzyme linked antibodies are employed, a colorimetric reaction may be required. It will be appreciated that immunohistochemistry is often followed by counterstaining of the cell nuclei using for example Hematoxyline or Giemsa stain.
In Situ Activity Assay:
According to this method, a chromogenic substrate is applied on the cells containing an active enzyme and the enzyme catalyzes a reaction in which the substrate is decomposed to produce a chromogenic product visible by a light or a fluorescent microscope.
In Vitro Activity Assays:
In these methods the activity of a particular enzyme is measured in a protein mixture extracted from the cells. The activity can be measured in a spectrophotometer well using colorimetric methods or can be measured in a non-denaturing acrylamide gel (i.e., activity gel). Following electrophoresis the gel is soaked in a solution containing a substrate and colorimetric reagents. The resulting stained band corresponds to the enzymatic activity of the protein of interest. If well calibrated and within the linear range of response, the amount of enzyme present in the sample is proportional to the amount of color produced. An enzyme standard is generally employed to improve quantitative accuracy.
According to one embodiment, the spacer which is replaced has the same number of base pairs as the “replacing spacer” i.e. the one that is complementary to the prokaryotic gene.
According to another embodiment, at least two spacers of the CRISPR are replaced with a nucleic acid sequence, each nucleic acid sequence being sufficiently complementary to opposite strands of the gene.
The present inventor envisages that it is possible to replace any number of the spacers of the wild-type CRISPR.
The replacement spacers may target the same gene or a number of different genes. According to one embodiment, at least about 10% of the spacers are exchanged for a replacing spacer. According to another embodiment, at least about 20% of the spacers are exchanged for a replacing spacer. According to another embodiment, at least about 30% of the spacers are exchanged for a replacing spacer. According to another embodiment, at least about 40% of the spacers are exchanged for a replacing spacer. According to another embodiment, at least about 50% of the spacers are exchanged for a replacing spacer. According to another embodiment, at least about 60% of the spacers are exchanged for a replacing spacer. According to another embodiment, at least about 70% of the spacers are exchanged for a replacing spacer. According to another embodiment, at least about 80% of the spacers are exchanged for a replacing spacer. According to another embodiment, at least about 90% of the spacers are exchanged for a replacing spacer. According to still another embodiment, about 100% of the spacers are exchanged for a replacing spacer.
According to one embodiment, at least one of the replaced spacers in the CRISPR array of the present invention is the one which is at the most 5′ end of the array.
It will be appreciated that the polynucleotide which comprises the modified CRISPR array of the present invention may also comprise other sequences.
Thus, for example, the modified CRISPR array of the present invention may also comprise a leader sequence 5′ to the array.
The CRISPR leader is a conserved DNA segment of defined size which is located immediately upstream of the first repeat.
The leader sequence can be of a different length in different bacteria. In some embodiments, the leader sequence is at least about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, about 200, about 300, about 400, or about 500 or more nucleotides in length.
According to one embodiment, the leader sequence is directly 5′ to the array with no intervening base pairs.
According to another embodiment the leader sequence is the same leader sequence found in the wild type CRISPR system.
Thus, for example if the CRISPR array that is introduced into the prokaryotic cell is derived from the Myxococcus xanthus DK 1622 CRISPR system of the RAMP module, then the leader sequence is that leader sequence that is found in the wild-type Myxococcus xanthus DK 1622 CRISPR system of the RAMP module.
Alternatively, if the CRISPR array that is introduced into the prokaryotic cell is derived from the Neisseria sicca CRISPR system of the RAMP module, then the leader sequence is that leader sequence that is found in the wild-type Neisseria sicca CRISPR system of the RAMP module (e.g. as set forth in SEQ ID NO: 1372).
According to an exemplary embodiment, where a prokaryotic cell already comprises a CRISPR system, the CRISPR array polynucleotide which is introduced into the cell comprises the identical CRISPR array repeat sequence which is endogenous to that bacteria (it does not necessarily have to have an array of the same size). Accordingly, the choice of CRISPR array that is introduced into a cell is mainly dependent on the prokaryote whose gene or genes are being down-regulated.
In the case where the prokaryotic cell does not comprise a CRISPR system it will be appreciated that any CRISPR array may be introduced into the cell. According to this embodiment, the other components which make up the CRISPR system are also introduced into the cell. Such components typically match the CRISPR array (i.e. originate from the same CRISPR system). The other components may be introduced into the cell (together with a non-modified, native spacer, or on their own) prior to administration of the CRISPR array with the modified spacer. Alternatively, the other components may be introduced into the cell concommitant with (on the same or on a separate vector) the CRISPR array with the modified spacer.
Typically, the polynucleotides of the present invention are inserted into nucleic acid constructs so that they are capable of being expressed and propagated in bacterial cells.
Such nucleic acid constructs typically comprise a prokaryotic origin of replication and other elements which drive the expression of the CRISPR array and associated cas genes.
Preferably, the promoter utilized by the nucleic acid construct of the present invention is active in the specific cell population transformed.
Constitutive promoters suitable for use with the present invention are promoter sequences which are active under most environmental conditions and most types of cells such as the cytomegalovirus (CMV) and Rous sarcoma virus (RSV).
According to one embodiment, the promoter is an inducible promoter, i.e., a promoter that induces the CRISPR expression only in a certain condition (e.g. heat-induced promoter) or in the presence of a certain substance (e.g., promoters induced by Arabinose, Lactose, IPTG etc).
Examples of bacterial constructs include the pET series of E. coli expression vectors [Studier et al. (1990) Methods in Enzymol. 185:60-89).
Additional nucleic acid constructs contemplated by the present inventors are those that are engineered such any spacer of choice can be inserted (using a simple blunt-end or sticky end ligation) between two repeat sequences (as exemplified in
The nucleic acid construct may optionally comprise a leader sequence upstream of the first repeat originating from the CRISPR system of the RAMP module.
In addition, the construct may comprise spacer sequences on one or both sides of the repeat sequences, as illustrated in
Methods of introducing the polynucleotides of the present invention into prokaryotic cells are well known in the art—these include, but are not limited to, transforming with a recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vector containing the CRISPR array sequence.
It will be appreciated that downregulating bacterial genes that are essential or vital to bacterial functioning may be used as a method for treating a bacterial infection. Similarly, downregulating bacterial genes that are associated with bacterial virulence may also be used as a method for treating a bacterial infection.
Thus, according to another aspect of the present invention, there is provided a method of treating a bacterial infection in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of an isolated polynucleotide, comprising a clustered, regularly interspaced short palindromic repeat (CRISPR) array nucleic acid sequence wherein at least one spacer of the CRISPR is sufficiently complementary to a portion of at least one bacterial gene so as to down-regulate expression of the bacterial gene, the bacterial gene being a vital bacterial gene or a bacterial virulence gene, thereby treating the bacterial infection.
As used herein, the term “treating” includes abrogating, substantially inhibiting, slowing or reversing the progression of a condition, substantially ameliorating clinical or aesthetical symptoms of a condition or substantially preventing the appearance of clinical or aesthetical symptoms of a condition.
The phrase “bacterial infection” as used herein, refers the invasion and colonization of bacteria in a bodily tissue producing subsequent tissue injury and disease.
The bacterial infection may be on the body surface, localized (e.g., contained within an organ, at a site of a surgical wound or other wound, within an abscess), or may be systemic (e.g., the subject is bacteremic, e.g., suffers from sepsis). Of particular interest is the treatment of bacterial infections that are amenable to therapy by topical application of the phage of the invention. Also of particular interest is the treatment of bacterial infections that are present in an abscess or are otherwise contained at a site to which the phage of the invention can be administered directly.
The present invention also contemplates coating of surfaces other than body surfaces with the constructs of the present invention. U.S. Pat. No. 6,627,215 teaches coating of wound dressings with nucleic acids that comprise anti-bacterial activity. U.S. Pat. No. 6,617,142 teaches methods for attaching DNA or RNA to medical device surfaces.
Contacting a surface with the constructs can be effected using any method known in the art including spraying, spreading, wetting, immersing, dipping, painting, ultrasonic welding, welding, bonding or adhering. The peptides of the present invention may be attached as monolayers or multiple layers.
The present invention coating a wide variety of surfaces with the constructs of the present invention including fabrics, fibers, foams, films, concretes, masonries, glass, metals, plastics, polymers, and like.
An exemplary solid surface that may be coated with the peptides of the present invention is an intracorporial or extra-corporial medical device or implant.
An “implant” as used herein refers to any object intended for placement in a human body that is not a living tissue. The implant may be temporary or permanent. Implants include naturally derived objects that have been processed so that their living tissues have been devitalized. As an example, bone grafts can be processed so that their living cells are removed (acellularized), but so that their shape is retained to serve as a template for ingrowth of bone from a host. As another example, naturally occurring coral can be processed to yield hydroxyapatite preparations that can be applied to the body for certain orthopedic and dental therapies. An implant can also be an article comprising artificial components.
Thus, for example, the present invention therefore envisions coating vascular stents with the peptides of the present invention. Another possible application of the peptides of the present invention is the coating of surfaces found in the medical and dental environment.
Surfaces found in medical environments include the inner and outer aspects of various instruments and devices, whether disposable or intended for repeated uses. Examples include the entire spectrum of articles adapted for medical use, including scalpels, needles, scissors and other devices used in invasive surgical, therapeutic or diagnostic procedures; blood filters, implantable medical devices, including artificial blood vessels, catheters and other devices for the removal or delivery of fluids to patients, artificial hearts, artificial kidneys, orthopedic pins, plates and implants; catheters and other tubes (including urological and biliary tubes, endotracheal tubes, peripherably insertable central venous catheters, dialysis catheters, long term tunneled central venous catheters peripheral venous catheters, short term central venous catheters, arterial catheters, pulmonary catheters, Swan-Ganz catheters, urinary catheters, peritoneal catheters), urinary devices (including long term urinary devices, tissue bonding urinary devices, artificial urinary sphincters, urinary dilators), shunts (including ventricular or arterio-venous shunts); prostheses (including breast implants, penile prostheses, vascular grafting prostheses, aneurysm repair devices, heart valves, artificial joints, artificial larynxes, otological implants), anastomotic devices, vascular catheter ports, clamps, embolic devices, wound drain tubes, hydrocephalus shunts, pacemakers and implantable defibrillators, and the like. Other examples will be readily apparent to practitioners in these arts.
Surfaces found in the medical environment include also the inner and outer aspects of pieces of medical equipment, medical gear worn or carried by personnel in the health care setting. Such surfaces can include counter tops and fixtures in areas used for medical procedures or for preparing medical apparatus, tubes and canisters used in respiratory treatments, including the administration of oxygen, of solubilized drugs in nebulizers and of anesthetic agents. Also included are those surfaces intended as biological barriers to infectious organisms in medical settings, such as gloves, aprons and faceshields. Commonly used materials for biological barriers may be latex-based or non-latex based. Vinyl is commonly used as a material for non-latex surgical gloves. Other such surfaces can include handles and cables for medical or dental equipment not intended to be sterile. Additionally, such surfaces can include those non-sterile external surfaces of tubes and other apparatus found in areas where blood or body fluids or other hazardous biomaterials are commonly encountered. Other surfaces include medical gauzes and plasters such as band-aids.
Other surfaces related to health include the inner and outer aspects of those articles involved in water purification, water storage and water delivery, and those articles involved in food processing. Thus the present invention envisions coating a solid surface of a food or beverage container to extend the shelf life of its contents.
Surfaces related to health can also include the inner and outer aspects of those household articles involved in providing for nutrition, sanitation or disease prevention. Examples can include food processing equipment for home use, materials for infant care, tampons and toilet bowls.
Typically, the subject being treated is a mammalian subject—e.g. human, fowl, rodent or primate.
According to one embodiment, the above-mentioned nucleic acid construct is administered as naked DNA or in a carrier—such as a liposome. According to another embodiment of this aspect of the present invention the polynucleotides are delivered to the bacteria using a targeting moiety (see for example Yacoby and Benhar, Infect Disord Drug Targets. 2007 September; 7(3):221-9).
Thus, according to another embodiment of this aspect of the present invention, the subject is administered with the polynucleotides of the present invention using bacteriophages. According to one embodiment, the bacteriophages are lytic phages.
Treatment of bacterial infections using bacteriophages is well known in the art.
Bacteriophage(s) suitable for use in treatment of a subject can be selected based upon the suspected bacterial pathogen infecting the subject. Methods for diagnosis of bacterial infections and determination of their sensitivities are well known in the art. Where such diagnosis involves culturing a biological sample from the subject, the clinician can at the same time test the susceptibility of the infecting pathogen to growth inhibition by one or more therapeutic phages that are candidates for subsequent therapy.
In order to address the problem of rapid clearance by the spleen, liver and the reticulo-endothelial system of bacteriophages, the present inventors contemplate the use of long-circulating variants of wild type phages (see for example Merrill et al (Proc. Natl. Acad. Sci. USA 93, 3188 (1996) and U.S. Pat. No. 5,688,501) or holing modified bacteriophages—see for example U.S. Pat. Appl. 20040156831.
Selection of the gene to be downregulated will depend on the bacterial infection being treated. According to one embodiment, the spacers of the modified CRISPRs of the present invention are designed such that they target a gene that is highly conserved among bacteria; such spacers will lead to broad-spectrum killing. According to one embodiment, the spacers of the modified CRISPRs of the present invention are designed such that they target a gene that is unique to a specific bacteria; such spacers will lead to narrow-spectrum killing.
According to another embodiment, the bacterial infection being treated is an antibiotic resistant bacterial infection—e.g. infections induced by methicillin resistant Staphylococcus aureus or vancomycin resistant Staphylococcus aureus. In this case, the spacers of the modified CRISPRs of the present invention are designed such that they target an antibiotic resistance gene.
The bacteriophages comprising the modified CRISPRs of the present invention may be administered per se, or as part of a pharmaceutical composition.
As used herein a “pharmaceutical composition” refers to a preparation of one or more of the active ingredients described herein with other chemical components such as physiologically suitable carriers and excipients. The purpose of a pharmaceutical composition is to facilitate administration of the active agent to an organism.
Herein the term “active ingredient” refers to the modified CRISPR accountable for the biological effect.
Hereinafter, the phrases “physiologically acceptable carrier” and “pharmaceutically acceptable carrier” which may be interchangeably used refer to a carrier or a diluent that does not cause significant irritation to an organism and does not abrogate the biological activity and properties of the administered compound. An adjuvant is included under these phrases.
Herein the term “excipient” refers to an inert substance added to a pharmaceutical composition to further facilitate administration of an active ingredient. Examples, without limitation, of excipients include calcium carbonate, calcium phosphate, various sugars and types of starch, cellulose derivatives, gelatin, vegetable oils and polyethylene glycols.
Techniques for formulation and administration of drugs may be found in “Remington's Pharmaceutical Sciences,” Mack Publishing Co., Easton, Pa., latest edition, which is incorporated herein by reference.
Suitable routes of administration may, for example, include oral, rectal, transmucosal, especially transnasal, intestinal or parenteral delivery, including intramuscular, subcutaneous and intramedullary injections as well as intrathecal, direct intraventricular, intracardiac, e.g., into the right or left ventricular cavity, into the common coronary artery, intravenous, intraperitoneal, intranasal, or intraocular injections.
Alternately, one may administer the pharmaceutical composition in a local rather than systemic manner, for example, via injection of the pharmaceutical composition directly into a tissue region of a patient.
Pharmaceutical compositions of the present invention may be manufactured by processes well known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes.
Pharmaceutical compositions for use in accordance with the present invention thus may be formulated in conventional manner using one or more physiologically acceptable carriers comprising excipients and auxiliaries, which facilitate processing of the active ingredients into preparations which, can be used pharmaceutically. Proper formulation is dependent upon the route of administration chosen.
For injection, the active ingredients of the pharmaceutical composition may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hank's solution, Ringer's solution, or physiological salt buffer. For transmucosal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.
For oral administration, the pharmaceutical composition can be formulated readily by combining the active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers enable the pharmaceutical composition to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for oral ingestion by a patient. Pharmacological preparations for oral use can be made using a solid excipient, optionally grinding the resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries if desired, to obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carbomethylcellulose; and/or physiologically acceptable polymers such as polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate.
Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar solutions may be used which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium dioxide, lacquer solutions and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to characterize different combinations of active compound doses.
Pharmaceutical compositions which can be used orally, include push-fit capsules made of gelatin as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules may contain the active ingredients in admixture with filler such as lactose, binders such as starches, lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the active ingredients may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added. All formulations for oral administration should be in dosages suitable for the chosen route of administration.
For buccal administration, the compositions may take the form of tablets or lozenges formulated in conventional manner.
For administration by nasal inhalation, the active ingredients for use according to the present invention are conveniently delivered in the form of an aerosol spray presentation from a pressurized pack or a nebulizer with the use of a suitable propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichloro-tetrafluoroethane or carbon dioxide. In the case of a pressurized aerosol, the dosage unit may be determined by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use in a dispenser may be formulated containing a powder mix of the compound and a suitable powder base such as lactose or starch.
The pharmaceutical composition described herein may be formulated for parenteral administration, e.g., by bolus injection or continuos infusion. Formulations for injection may be presented in unit dosage form, e.g., in ampoules or in multidose containers with optionally, an added preservative. The compositions may be suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents.
Pharmaceutical compositions for parenteral administration include aqueous solutions of the active preparation in water-soluble form. Additionally, suspensions of the active ingredients may be prepared as appropriate oily or water based injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acids esters such as ethyl oleate, triglycerides or liposomes. Aqueous injection suspensions may contain substances, which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol or dextran. Optionally, the suspension may also contain suitable stabilizers or agents which increase the solubility of the active ingredients to allow for the preparation of highly concentrated solutions.
Alternatively, the active ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile, pyrogen-free water based solution, before use.
The pharmaceutical composition of the present invention may also be formulated in rectal compositions such as suppositories or retention enemas, using, e.g., conventional suppository bases such as cocoa butter or other glycerides.
Pharmaceutical compositions suitable for use in context of the present invention include compositions wherein the active ingredients are contained in an amount effective to achieve the intended purpose. More specifically, a therapeutically effective amount means an amount of active ingredients effective to prevent, alleviate or ameliorate symptoms of bacterial infection or prolong the survival of the subject being treated.
Determination of a therapeutically effective amount is well within the capability of those skilled in the art, especially in light of the detailed disclosure provided herein.
For any preparation used in the methods of the invention, the therapeutically effective amount or dose can be estimated initially from in vitro and cell culture assays. For example, a dose can be formulated in animal models to achieve a desired concentration or titer. Such information can be used to more accurately determine useful doses in humans.
Toxicity and therapeutic efficacy of the active ingredients described herein can be determined by standard pharmaceutical procedures in vitro, in cell cultures or experimental animals. The data obtained from these in vitro and cell culture assays and animal studies can be used in formulating a range of dosage for use in human. The dosage may vary depending upon the dosage form employed and the route of administration utilized. The exact formulation, route of administration and dosage can be chosen by the individual physician in view of the patient's condition. (See e.g., Fingl, et al., 1975, in “The Pharmacological Basis of Therapeutics”, Ch. 1 p. 1).
Dosage amount and interval may be adjusted individually to provide tissue or blood levels of the active ingredient are sufficient to induce or suppress the biological effect (minimal effective concentration, MEC). The MEC will vary for each preparation, but can be estimated from in vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics and route of administration. Detection assays can be used to determine plasma concentrations.
Depending on the severity and responsiveness of the condition to be treated, dosing can be of a single or a plurality of administrations, with course of treatment lasting from several days to several weeks or until cure is effected or diminution of the disease state is achieved.
The amount of a composition to be administered will, of course, be dependent on the subject being treated, the severity of the affliction, the manner of administration, the judgment of the prescribing physician, etc.
Compositions of the present invention may, if desired, be presented in a pack or dispenser device, such as an FDA approved kit, which may contain one or more unit dosage forms containing the active ingredient. The pack may, for example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may be accompanied by instructions for administration. The pack or dispenser may also be accommodated by a notice associated with the container in a form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals, which notice is reflective of approval by the agency of the form of the compositions or human or veterinary administration. Such notice, for example, may be of labeling approved by the U.S. Food and Drug Administration for prescription drugs or of an approved product insert. Compositions comprising a preparation of the invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an appropriate container, and labeled for treatment of an indicated condition, as is further detailed above.
As mentioned, the modified bacteria of the present invention may be present in food products as well as in food additives.
The phrase “food additive” [defined by the FDA in 21 C.F.R. 170.3(e)(1)] includes any liquid or solid material intended to be added to a food product. This material can, for example, include an agent having a distinct taste and/or flavor or a physiological effect (e.g., vitamins).
The food additive composition of the present invention can be added to a variety of food products.
As used herein, the phrase “food product” describes a material consisting essentially of protein, carbohydrate and/or fat, which is used in the body of an organism to sustain growth, repair and vital processes and to furnish energy. Food products may also contain supplementary substances such as minerals, vitamins and condiments. See Merriani-Webster's Collegiate Dictionary, 10th Edition, 1993. The phrase “food product” as used herein further includes a beverage adapted for human or animal consumption. A food product containing the food additive of the present invention can also include additional additives such as, for example, antioxidants, sweeteners, flavorings, colors, preservatives, nutritive additives such as vitamins and minerals, amino acids (i.e. essential amino acids), emulsifiers, pH control agents such as acidulants, hydrocolloids, antifoams and release agents, flour improving or strengthening agents, raising or leavening agents, gases and chelating agents, the utility and effects of which are well-known in the art.
It will be appreciated that the CRISPR polynucleotides of the present invention may also be used to identify a function of a particular prokaryotic gene.
According to this aspect of the present invention, the modified CRISPR polynucleotides of the present invention are introduced into prokaryotic cells, wherein a spacer of the CRISPR is directed against a prokaryotic gene of unknown function. A phenotype of the prokaryote is then assayed. Depending on the outcome of the assay, the function of the gene can be determined (i.e. annotated).
According to one embodiment, the phenotype is examined using “phenotype microarray analysis”—see for example Zhou et al Journal of Bacteriology, August 2003, p. 4956-4972, Vol. 185, No. 16.
The modified CRISPR arrays of the present invention may also be used to generate a library of clones, each of which containing a different down-regulated gene. Such libraries have been prepared for Escherichia coli K-12, [Baba et al., Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection” Molecular Systems Biology 2 Article number: 2006.0008]. However, construction of this library was extremely labor intensive and expensive. Organisms of interest for production of such libraries include bio-energy relevant organisms such as Synechocystis sp. PCC 6803 or in which it is needed to determine which genes are needed to be silenced in order to enhance the production of the desired output biofuel. This could also be done for organisms generating a biotechnologically relevant product (other than a bio-fuel). Organisms of interest for production of such libraries include human pathogens, animal pathogens and plant pathogens.
As mentioned, the constructs of the present invention may be useful in the generation of biofuels in bacteria.
Thus, according to another aspect of the present invention, there is provided a method of generating an organic material in bacteria, the method comprising downregulating a gene which compromises the generation of the organic material in the bacteria.
Using the CRISPR polynucleotides described herein, new virulence genes can be discovered in pathogens. According to this embodiment, a library of isolated polynucleotides, each comprising a clustered, regularly interspaced short palindromic repeat (CRISPR) array nucleic acid sequence wherein at least one spacer of the CRISPR is sufficiently complementary to a portion of at least one bacterial gene, will be constructed such that each isolated polynucleotide down regulates one gene in a specific pathogen. This library may then by used to serially down regulate each gene in the pathogen. Concomitant with down regulation of each of the genes separately, a virulence assay may be conducted, such that the virulence is measured when the specific gene is down regulated. Genes whose down regulation interferes with virulence will be identified as virulence-associated genes. Vaccines or antibiotics could be made to target these specific genes.
It is expected that during the life of a patent maturing from this application many relevant CRISPR arrays and systems will be identified and the scope of the term CRISPR array is intended to include all such new technologies a priori
As used herein the term “about” refers to ±10%
The terms “comprises”, “comprising”, “includes”, “including”, “having” and their conjugates mean “including but not limited to”.
The term “consisting of means “including and limited to”.
The term “consisting essentially of” means that the composition, method or structure may include additional ingredients, steps and/or parts, but only if the additional ingredients, steps and/or parts do not materially alter the basic and novel characteristics of the claimed composition, method or structure.
As used herein the term “method” refers to manners, means, techniques and procedures for accomplishing a given task including, but not limited to, those manners, means, techniques and procedures either known to, or readily developed from known manners, means, techniques and procedures by practitioners of the chemical, pharmacological, biological, biochemical and medical arts.
It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable subcombination or as suitable in any other described embodiment of the invention. Certain features described in the context of various embodiments are not to be considered essential features of those embodiments, unless the embodiment is inoperative without those elements.
Various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below find experimental support in the following examples.
Reference is now made to the following examples, which together with the above descriptions illustrate some embodiments of the invention in a non limiting fashion.
Reference is now made to the following examples, which together with the above descriptions, illustrate the invention in a non limiting fashion.
Generally, the nomenclature used herein and the laboratory procedures utilized in the present invention include molecular, biochemical, microbiological and recombinant DNA techniques. Such techniques are thoroughly explained in the literature. See, for example, “Molecular Cloning: A laboratory Manual” Sambrook et al., (1989); “Current Protocols in Molecular Biology” Volumes I-III Ausubel, R. M., ed. (1994); Ausubel et al., “Current Protocols in Molecular Biology”, John Wiley and Sons, Baltimore, Md. (1989); Perbal, “A Practical Guide to Molecular Cloning”, John Wiley & Sons, New York (1988); Watson et al., “Recombinant DNA”, Scientific American Books, New York; Birren et al. (eds) “Genome Analysis: A Laboratory Manual Series”, Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 5,272,057; “Cell Biology: A Laboratory Handbook”, Volumes I-III Cellis, J. E., ed. (1994); “Culture of Animal Cells—A Manual of Basic Technique” by Freshney, Wiley-Liss, N. Y. (1994), Third Edition; “Current Protocols in Immunology” Volumes I-III Coligan J. E., ed. (1994); Stites et al. (eds), “Basic and Clinical Immunology” (8th Edition), Appleton & Lange, Norwalk, Conn. (1994); Mishell and Shiigi (eds), “Selected Methods in Cellular Immunology”, W. H. Freeman and Co., New York (1980); available immunoassays are extensively described in the patent and scientific literature, see, for example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 4,098,876; 4,879,219; 5,011,771 and 5,281,521; “Oligonucleotide Synthesis” Gait, M. J., ed. (1984); “Nucleic Acid Hybridization” Hames, B. D., and Higgins S. J., eds. (1985); “Transcription and Translation” Hames, B. D., and Higgins S. J., eds. (1984); “Animal Cell Culture” Freshney, R. I., ed. (1986); “Immobilized Cells and Enzymes” IRL Press, (1986); “A Practical Guide to Molecular Cloning” Perbal, B., (1984) and “Methods in Enzymology” Vol. 1-317, Academic Press; “PCR Protocols: A Guide To Methods And Applications”, Academic Press, San Diego, Calif. (1990); Marshak et al., “Strategies for Protein Purification and Characterization—A Laboratory Course Manual” CSHL Press (1996); all of which are incorporated by reference as if fully set forth herein. Other general references are provided throughout this document. The procedures therein are believed to be well known in the art and are provided for the convenience of the reader. All the information contained therein is incorporated herein by reference.
Currently, there is no database or public resource that connects between CRISPR array and its associated cas genes. To identify RAMP modules in prokaryotes that can function in downregulating prokaryotic genes, the present inventors computationally scanned the genomes of 334 CRISPR-bearing organisms. Non-questionable CRISPR arrays, spacers, and repeats were obtained from CRISPRdb [Grissa et al, BMC Bioinformatics 8, 172, 2007]. For each CRISPR-bearing organism, information regarding cas genes was obtained from both the Genbank file and from IMG (Integrated Microbial Genomes) at JGI (www.imgdotjgidotdoedotgov/cgi-bin/pub/maindotcgi). For this example, cas genes were defined as genes that were assigned one of the TIGRFAM profiles that were defined in [Haft et al, PLoS Comput Biol. 1, e60, 2005]. For this example, an array of cas genes (cas array) was defined as one or more consecutive cas genes with up to 2 intervening non-cas genes. A “CRISPR system” was defined for this example as a cas array associated to its closest CRISPR array in the analyzed genome. RAMP modules were defined as a cas array+CRISPR array, where the cas array contains at least 4 genes, and where at least one gene belongs to the RAMP subtype, as defined by [Haft et al, PLoS Comput Biol. 1, e60, 2005].
Characterization of RAMP Modules in Bacterial Genomes:
The search above identified 73 RAMP modules (Table 1, herein below).
Myxococcus xanthus
Bacillus halodurans
Chlorobium limicola
Chloroherpeton thalassium
Clostridium botulinum
Clostridium botulinum
Clostridium botulinum
Clostridium botulinum
Cyanothece sp.
Halorhodospira halophila
Herpetosiphon aurantiacus
Herpetosiphon aurantiacus
Marinomonas sp.
Methanococcus vannielii
Methanosarcina barkeri
fusaro
Methanosarcina mazei
Nostoc sp. PCC 7120
Porphyromonas gingivalis
Porphyromonas gingivalis
Prosthecochloris aestuarii
Sorangium cellulosum
Streptococcus thermophilus
Streptococcus thermophilus
Synechococcus sp.
Synechocystis sp.
Syntrophus aciditrophicus
Aquifex aeolicus VF5
Archaeoglobus fulgidus
Caldivirga maquilingensis
Candidatus Desulforudis
audaxviator MP104C
Candidatus Desulforudis
audaxviator MP104C
Chloroflexus aurantiacus
Deinococcus geothermalis
Hyperthermus butylicus
Ignicoccus hospitalis
Methanopyrus kandleri
Methanosaeta thermophila
Methanothermobacter
thermautotrophicus Delta H
Pelotomaculum
thermopropionicum SI
Pelotomaculum
thermopropionicum SI
Pyrobaculum arsenaticum
Pyrobaculum calidifontis
Pyrobaculum calidifontis
Pyrococcus furiosus
Roseiflexus castenholzii
Roseiflexus sp. RS-1
Roseiflexus sp. RS-1
Roseiflexus sp. RS-1
Rubrobacter xylanophilus
Sulfolobus solfataricus P2
Sulfolobus solfataricus P2
Sulfolobus solfataricus P2
Sulfolobus solfataricus P2
Sulfolobus tokodaii 7
Sulfolobus tokodaii 7
Sulfurihydrogenibium sp.
Synechococcus sp.
Synechococcus sp.
Synechococcus sp. JA-3-3Ab
Synechococcus sp. JA-3-3Ab
Thermoanaerobacter
tengcongensis MH4
Thermobifida
fusca YX
Thermofilum pendens Hrk 5
Thermofilum pendens Hrk5
Thermoproteus neutrophilus
Thermosipho
melanesiensis
Thermotoga lettingae
Thermotoga maritima
Thermotoga sp. RQ2
Thermotoga sp. RQ2
Thermus thermophilus
Thermus thermophilus
Of these, 46 were in thermophilic organisms (organisms that optimally grow in high temperatures), and 27 in mesophiles (organisms optimally growing at a near body temperature). It is expected that proteins expressed in thermophiles are adapted to work in high temperatures, and might have a less optimal function when expressed in temperatures near 37° C. Therefore, in order to use RAMP modules from thermophiles, it can be hypothesized that specific amino acids should be changed in the cas genes to allow function at lower temperatures. This could be done in a process of molecular evolution.
The sequences of the 73 identified RAMP modules, as well as their associated repeat arrays and cas gene sequences, are set forth in SEQ ID NOs 1-1339).
As a proof of concept, a RAMP module of the mesophilic organism Myxococcus xanthus DK 1622 may be used to silence three genes in E. coli. The RAMP module in this organism is found between positions 8888371-8896616 in its genome (NCBI accession NC—008095), and contains 8 genes (
The cas array of the RAMP module will be amplified from the genome of Myxococcus xanthus DK 1622 using the following primers
AAAAAGATCTGATGAGACCACGAGGAGGTGATGTC (SEQ ID NO: 1350);
TTAATTAACACCGGCAAGCCTTCACGCGGCC (SEQ ID NO: 1351).
The amplified DNA will be cloned in a plasmid under the control of an inducible promoter and will be transformed to the E. coli strain BL21-AI.
In a modified version of this experiment, the RAMP module that will be cloned will contain only some of the cas genes, where one or more of the genes are omitted.
In a modified version of this experiment, another cas array will be cloned into the E. coli in conjugation with the plasmid described above. This cas array is of the Tneap subtype which resides on the genome of Myxococcus xanthus DK 1622 nearby the RAMP module (
The cas array of the Tneap subtype will be amplified from the genome of Myxococcus xanthus DK 1622 using the following primers
CGGATCCGTTGGCGCGGAGCGTCGGTTG (SEQ ID NO: 1352);
AAGCTTTCACAGCACCTTGAA (SEQ ID NO: 1353).
In a further modification, the Tneap cas array that will be cloned will contain only some of the cas genes, where one or more of the genes are omitted (for example, the Tneap cas array without cas 1 and cas2). SEQ ID No: 1349 provides the polynucleotide sequence of the cas array of the Myxococcus xanthus DK 1622 Genbank AC—008095 CRISPR Tneap system without the cas1 and cas2 genes
To allow silencing of the selected genes, DNA constructs containing CRISPR arrays will be cloned in another plasmid under the control of an inducible promoter and will also be transformed to the E. coli strain BL21-AI (
To show that more than one gene can be silenced by the same RAMP module in parallel, a CRISPR array will be designed that targets both GFP and malF at the same time (
As a control for the above described experiments a CRISPR array construct that does not target any E coli gene will also be transformed (
Degradation of the RNA of the selected genes can be further verified by Northern Blot or quantitative PCR.
The CRISPR constructs for silencing GFP, RFP and malF expression in E. coli are illustrated in
The RAMP CRISPR module of Neisseria sicca ATCC29256, a mesophilic bacteria isolated from the pharyngeal mucosa of healthy man was selected for experimentation. A draft genome of this organism is available, and in this genome (NCBI locus NZ_ACK002000045) a RAMP module was identified. This module was cloned into E. coli BL21 (DE3) on the pET-Duet compatible plasmids system, so that the CRISPR array (crRNA) was on one plasmid, and the cas genes were divided to two operons on two plasmids, all under an expression control inducible by IPTG (
To test whether the RAMP system cloned into E. coli was active, the expression of each system was induced in E. coli BL21 using 0.1 mM IPTG. Total RNA was extracted and Northern blots were performed with probes designed against one of the spacers in the array. From the Northern blots, clear processing of the pre-crRNA was observed, confirming that the protein responsible for pre-crRNA processing are active (
Next, the present inventors designed an experimental system that would allow high throughput measurements of RNA silencing. For this, they first engineered into the CRISPR array four spacers that target green fluorescence protein (GFP). Next, they prepared a two-gene construct, which includes GFP and RFP, cloned on a pRSF-Duet plasmid that is compatible with the plasmids carrying the CRISPR system (
Fluorescence levels of GFP and RFP following induction of the RAMP module that expresses crRNA with four spacers targeting different regions in the GFP gene were tested (
In order to make a RAMP-based system in which any gene of choice could be targeted, the present inventors designed a unique repeat-spacer array construct, based on the repeat sequences of the Neisseria sicca RAMP module, in which any spacer of choice could be inserted using a simple blunt-end ligation and cloning reaction (
Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims.
All publications, patents and patent applications mentioned in this specification are herein incorporated in their entirety by reference into the specification, to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention. To the extent that section headings are used, they should not be construed as necessarily limiting.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IL12/50194 | 5/31/2012 | WO | 00 | 12/2/2013 |
Number | Date | Country | |
---|---|---|---|
61491943 | Jun 2011 | US |