This invention relates generally to the use of recombinases.
Phage recombinases are splicing enzymes used by virions to insert or remove their genomic DNA from a host chromosome. The use of recombinases (or integrases) for genomic manipulation is well established. Site-specific recombinases are significant tools in a variety of applications in research, medicine, and biotechnology. Conditional gene targeting using site-specific recombinases has enabled the functional analysis of genes, which cannot be inactivated in the germline. Site-specific recombinases also allow the precise integration of open reading frames (ORFs) encoding proteins of interest into highly active gene loci in cell lines and transgenic animals. Recombinases are disclosed in the following references: Groth, Amy C.; Calos, Michele P. J. Mol. Biol (2004) 335, 667-678; Silver, Daniel P.; Livingston, David M. Molecular Cell (2001), 8, 233-243; Sauer, Brian; McDermott, Jeffrey. Nucleic Acids Research (2004) 32(20), 6086-6095; Yagil, Ezra; Dorgai, László; Weisberg, Robert A. J. Mol. Biol (1995) 252, 163-177; and, Dorgai, László; Yagil, Ezra; Weisberg, Robert A. J. Mol. Biol (1995) 252, 178-188.
The present invention provides for a recombinant nucleic acid comprising a nucleotide sequence comprising a plurality of constructs, wherein each construct independently comprises a nucleotide sequence of interest flanked by a pair of recombinase recognition sequences. Each pair of recombinase recognition sequences is recognized by a distinct recombinase. Optionally, each construct can, independently, further comprise one or more genes encoding a recombinase capable of recognizing the pair of recombinase recognition sequences of the construct.
The present invention provides for a recombinant nucleic acid comprising a first construct and a second construct; wherein the first construct comprises a nucleotide sequence encoding a first recognition sequence of a first recombinase, a second recognition sequence of the first recombinase, and a first nucleotide sequence of interest located between the first and second recognition sequence of the first recombinase; wherein the second construct comprises a nucleotide sequence encoding a first recognition sequence of a second recombinase, a second recognition sequence of the second recombinase, and a second nucleotide sequence of interest located between the first and second recognition sequence of the second recombinase; wherein the second construct is located downstream of the first construct; wherein the first recombinase and the second recombinase do not cross react with the recognition sequence of the other.
A recombinase that can be used in the present invention is an orthogonal (non-cross reacting), site-selective recombinase (OSSR). An OSSR is a recombinase that recognizes a specific recognition site or nucleotide sequence and does not cross-react with the recognition site or nucleotide sequence of another recombinase.
The present invention also provides for a recombinant vector comprising the recombinant nucleic acid. The present invention also provides for a vector or expression vector comprising a recombinant nucleic acid of the present invention.
The present invention further provides for a host cell comprising any of the recombinant nucleic acid or vector of the present invention. In some embodiments, the recombinant nucleic acid is integrated into a chromosome or replicon of the host cell. The host cell can be an eukaryotic or a prokaryotic cell.
The present invention further provides for a host organism comprising one or more host cells of the present invention. In some embodiments, all of the cells of the host organism comprise a recombinant nucleic acid of the present invention.
The present invention provides for a method of excising or deleting one or more nucleotide sequence of interest from a host cell, comprising: (a) providing a signal to a host cell to activate expression from a promoter in the host cell, wherein the host cell comprises a promoter upstream of a plurality of constructs, wherein each construct independently comprises a nucleotide sequence of interest flanked by a pair of recombinase recognition sequences; and (b) excising or deleting one or more nucleotide sequence of interest.
The present invention provides for a method of excising or deleting a first nucleotide sequence of interest from a host cell, comprising: (a) providing a signal to a host cell to activate expression from a promoter in the host cell, wherein the host cell comprises a promoter upstream of a first construct and a second construct; and (b) excising or deleting a first nucleotide sequence of interest; wherein the first construct comprises a nucleotide sequence encoding a first recognition sequence of a first recombinase, a second recognition sequence of the first recombinase, and the first nucleotide sequence of interest located between the first and second recognition sequence of the first recombinase; wherein the second construct comprises a nucleotide sequence encoding a first recognition sequence of a second recombinase, a second recognition sequence of the second recombinase, and a second nucleotide sequence of interest located between the first and second recognition sequence of the second recombinase; wherein the second construct is located downstream of the first construct; wherein the first recombinase and the second recombinase do not cross react with the recognition sequence of the other.
The present invention further provides for a system capable of noise canceling with non-coding interfering RNA suppression.
The present invention further provides for a system capable of noise canceling with dominant negative complexation.
The present invention further provides for a system comprising a switch that is controlled by the relative expression of two variable promoters
The foregoing aspects and others will be readily appreciated by the skilled artisan from the following description of illustrative embodiments when read in conjunction with the accompanying drawings.
Before the present invention is described, it is to be understood that this invention is not limited to particular embodiments described, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only by the appended claims.
Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limits of that range is also specifically disclosed. Each smaller range between any stated value or intervening value in a stated range and any other stated or intervening value in that stated range is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included or excluded in the range, and each range where either, neither or both limits are included in the smaller ranges is also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are now described. All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited.
It must be noted that as used herein and in the appended claims, the singular forms “a”, “and”, and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a sequence” includes a plurality of such sequences, and so forth.
These and other objects, advantages, and features of the invention will become apparent to those persons skilled in the art upon reading the details of the invention as more fully described below.
In some embodiments, the recombinant nucleic acid comprises two or more constructs, three or more constructs, four or more constructs, five or more constructs, or ten or more constructs. In some embodiments, the recombinant nucleic acid comprises up to ten constructs.
In some embodiments, the recombinant nucleic acid further comprises a third construct comprising a nucleotide sequence encoding a first recognition sequence of a third recombinase, a second recognition sequence of the third recombinase, and a third nucleotide sequence of interest located between the first and second recognition sequence of the third recombinase. In some embodiment, the recombinant nucleic acid further comprises a fourth, fifth, or/and etc. construct(s), each construct comprising a nucleotide sequence encoding a first recognition sequence of a unique recombinase, a second recognition sequence of the unique recombinase, and a nucleotide sequence of interest located between the first and second recognition sequence of the unique recombinase, wherein the unique recombinase is a recombinase is distinct of any of the other recombinase which recognizes a recognition sequence within the recombinant nucleic acid.
See
The use of recombinases for manipulation of genomic sequences is well known to those skilled in the art. Commonly, the recombinase is used as follows: a target DNA containing a selection marker is initially inserted into a chromosome at a desired location. Following selection, an OSSR is used to extract the selection marker. In this method, the toolkit of two recombinases and one selection marker can be used repeatedly to significantly modify the organism under study. The orthogonality of the recombinase is important to prevent destructive or unpredictable recombination events.
A recombinase that can be used in the present invention is an orthogonal (non-cross reacting), site-selective recombinase (OSSR). An OSSR is a recombinase that recognizes a specific recognition site or nucleotide sequence and does not cross-react with the recognition site or nucleotide sequence of another recombinase. In some embodiments, each recombinase recognizes a pair of identical DNA sequences is that is about 50 to 60 basepair in length. In some embodiments, the recombinase recognizes a pair of identical DNA sequences is that is about 52 to 58 basepair in length. In some embodiments, the recombinase recognizes a pair of identical DNA sequences is that is about 53, 54, 55, 56 or 57 basepair in length. If only two sites are present, and they are oriented in the same direction on a DNA strand, the sequence between the sites is excised into a loop, with one recognition site remaining on the annealed DNA and one site incorporated into the loop. If more than two sites are present, or the orientation between sites is different, multiple activities can occur, including flipping and mis-annealing of the genomic DNA.
To be useful for the present invention, the recombinase and its recognition sites must display the following properties: (1) Engineered recognition sites should be unique to the cell (that is, no native genomic sequence matches the recognition site). (2) Recombinases must not bind to the recognition sites associated with other recombinases. (3) Recombinases bound to a recognition site must not be able to bind to the “partner” recombinase associated with a different recognition site. (The functional unit of a recombinase is a dimer of dimers. For example, dimer AA normally binds to site a. This then interacts with another dimer AA bound to a different site a to cause a recombination event. If dimer AA recognizes site a, and dimer BB recognizes site b, if AA is capable of binding to BB in the presence of a and b, a cross reaction would occur.) (4) Recognition sites should be kept to a total of two per cell per recombinase in all cases where this is possible during the operation of the invention.
Recombinases useful for this invention include, but are not limited to, to the recombinases listed in Table 1.
Bacillus
subtilis
Bacillus
subtilis
Bacillus
subtilis
Clostridium
butyricum
Escherichia
coli
Escherichia
coli
Escherichia
coli
Mycobacterium
smegmatis
Escherichia
coli
Dichelobacter
nodosus
Escherichia
coli
Escherichia
coli
Escherichia
coli
Escherichia
coli
Escherichia
coli
Haemophilus
influenzae
Haemophilus
influenzae
Haemophilus
influenzae
Haemophilus
influenzae
Escherichia
coli
Haemophilus
influenzae
Acholeplasma sp.
Mycobacterium
tuberculosis
Staphylococcus
aureus
Escherichia
coli
Lactobacillus
leichmannii
Lactobacillus
leichmannii
Oenococcus
oeni
Methanococcus
jannaschi
Mycobacterium
leprae
Mycobacterium
paratuberculosis
Mycobacterium
tuberculosis
Mycobacterium
tuberculosis
Lactobacillus
delbrueckii
Myxococcus
xanthus
Alcaligenes
eutrophus
Chlorobium
limicola
Nostoc sp.
Amycolatopsis
methanolica
Rhizobium sp.
Rhizobium sp.
Rhizobium sp.
Rhizobium sp.
Rhizobium sp.
Rhizobium sp.
Rhizobium sp.
Rhizobium sp.
Rhizobium sp.
Streptomyces
ambofaciens
Salmonella
dublin
Saccharopolyspora
erythraea
Saccharopolyspora
erythraea
Lactobacillus
delbrueckii
Staphylococcus
aureus
Staphylococcus
aureus
Escherichia
coli
phage
Lactobacillus
gasseri
Pseudomonas
aeruginosa
Lactobacillus sp.
Lactococcus
lactis
Escherichia
coli
Escherichia
coli
Escherichia
coli
Escherichia
coli
Salmonella
typhimurium
Escherichia
coli
Escherichia
coli
Pseudomonas
aeruginosa
Proteus
mirabilis
Escherichia
coli
Escherichia
coli
Shigella
flexneri
Streptomyces
coelicolor
Serratia
marcescens
Methanosarcina
acetivorans
Sulfolobus sp.
Streptococcus
pyogenes
Escherichia
coli
Bacillus
thuringiensis
Pseudomonas sp.
Streptococcus
pneumoniae
Lactobacillus
lactis
Staphylococcus
aureus
Staphylococcus
aureus
Escherichia
coli
Entercoccus
faecalis
Lactobacillus
lactis
Bergeyella
zoohelcum
Anabaena sp.
Anabaena sp.
Saccharomyces
cerevisiae
Kluyveromyces
lactis
Zygosaccharomyces
bailii
Zygosaccharomyces
bisporus
Zygosaccharomyces
fermentati
Zygosaccharomyces
rouxii
Helicobacter
pylori
Helicobacter
pylori
Escherichia
coli
Escherichia
coli
Escherichia
coli
Escherichia
coli
Mycobacterium
gordonae
Mycobacterium
leprae
Mycobacterium
tuberculosis
Escherichia
coli
Pseudomonas
fluorescens
Protothera
wickerhamii
Streptococcus
thermophilus
Lactobacillus
lactis
Salmonella
typhimurium
Salmonella
typhimurium
Synechocystis sp.
Dichelobacter
nodosus
Vibrio
cholerae
Methanothermobacter
marburgensis
Escherichia
coli
Salmonella
infantis
Lactococcus
lactis
Staphylococcus
aureus
Mycobacterium sp.
Helicoverpa
zea
Kluveromyces
waltii
Clostridium
butyricum
Haemophilus
influenzae
Bacteroides
uniformis
Streptococcus
pneumoniae
Streptococcus
pyogenes
Proteus
mirabilis
Proteus
mirabilis
Shigella
flexneri
Streptococcus
thermophilus
Streptomyces
fradiae
Mycobacterium sp.
Rhodococcus
erythropolis
Serratia
marcescens
Agrobacterium
tumefaciens
Aquifex
aeolicus
Staphylococcus
aureus
Enterobacter
aerogenes
Shigella
flexneri
Escherichia
coli
Streptomyces
rimosus
Streptomyces
venezuelae
Spodoptera
exigua
Salmonella
typhimurium
Pseudomonas
putida
Lactobacillus
casei
Aquifex
aeolicus
Mesorhizobium
loti
Selenomonas
ruminantium
Coxiella
burnetti
Panagrellus
redivivus
Caenorhabditis
elegans
Vibrio
cholerae
Streptococcus
mutans NG8
Rhizobium
leguminosarum
Pyrococcus
horikoshii
Rhodobacter
capsulatus
Rhodobacter
capsulatus
Enterococcus
faecium
Methanothermobacter
marburgensis
Salmonella
typhimurium
Mycobacterium
tuberculosis
Treponema
pallidum
Treponema
pallidum
Chlamydia
trachomatis
Chlamydia
trachomatis
Staphylococcus
aureus
Sphingomonas
aromaticivorans
Escherichia
coli O157:H7
Staphylococcus
aureus
Yersinia
pestis
Rickettsia
prowazekii
Rickettsia
prowazekii
Vibrio
cholerae
Actinob.
actinomycetemcomitans
Mycoplasma
arthritidis
Oenococcus
oeni
Shigella
flexneri
Ralstonia
eutropha
Helicobacter
pylori
Chlamydia
pneumoniae
Chlamydia
pneumoniae
Vibrio
cholerae
Pseudomonas
putida
Pantoea
citrea
Escherichia
coli
Corynebacterium
glutamicum
Streptomyces
coelicolor
Corynebacterium
glutamicum
Bacillus
halodurans
Xylella
fastidiosa
Bacillus
halodurans
Pseudomonas
aeruginosa
Vibrio
cholerae
Vibrio
cholerae
Neisseria
meningitidis ser. A
Neisseria
meningitidis ser. B
Xylella
fastidiosa
Chlamydia
muridarum
Staphylococcus
aureus
Neisseria
meningitidis ser. B
Neisseria
meningitidis ser. A
Chlamydia
muridarum
Pyrococcus
abysii
Deinococcus
radiodurans
Agrobacterium
tumefaciens
Helicobacter
pylori J
Thermotoga
maritima
Campylobacter
jejuni
Aeropyrum
pernix
Pseudomonas
syringae
Streptococcus
pneumoniae
Xestia
nigrum
Plutella
xylostella
Bacillus
anthracis
Bacteroides
fragilis
Deinococcus
radiodurans
Bradyrhizobium
japonicum
Bacillus
halodurans
Bacillus
anthracis
Pseudomonas
aeruginosa
Chlamydia
trachomatis
Bacteroides
fragilis
Sphingomonas
aromaticivorans
Chlamydia
trachomatis
Bacillus
halodurans
Escherichia
coli
Chlamydia
psittaci
Vibrio
cholerae
Deinococcus
radiodurans
Chlamydophila
pneumoniae
Escherichia
coli
Ureaplasma
urealyticum
Escherichia
coli
Lactococcus sp.
Acyrthosiphon
pisum
Methanococcus
maripaludis
Shigella
flexneri
Ureaplasma
urealyticum
Escherichia
coli
Bacillus
halodurans
Salmonella
enterica
Deinococcus
radiodurans
Streptomyces
coelicolor
Agrobacterium
rhizogenes
Streptomyces
coelicolor
Helicobacter
pylori
Xylella
fastidiosa
Ureaplasma
urealyticum
Bacillus
anthracis
Chlamydia
psittaci
Bacillus
subtilis
Pseudomonas
aeruginosa
Xylella
fastidiosa
Xylella
fastidiosa
Chlamydia
trachomatis
Escherichia
coli
Deinococcus
radiodurans
Escherichia
coli
Streptococcus
thermophilus
Chlamydia
trachomatis
Xylella
fastidiosa
Haemophilus
influenzae
Dichelobacter
nodosus
Bacteroides
fragilis
Shigella
sonnei
Pseudomonas
syringiae
Pseudomonas
putida
Chlamydia
trachomatis
Escherichia
coli
Bacillus
halodurans
Xylella
fastidiosa
Halobacterium sp.
Shigella
dysenteriae
Coxiella
burnetti
Pasteurella
multocida
Bacillus
subtilis
Pseudomonas
pavonaceae
Pseudomonas
aeruginosa
Xylella
fastidiosa
Streptomyces
coelicolor
Lactococcus
lactis
Mycobacterium sp.
Sphingomonas
aromaticivorans
Xylella
fastidiosa
Escherichia
coli
Anticarsia
gemmatalis
Lactobacillus
helveticus
Staphylococcus
aureus
Lymantria
dispar
Orgyia
pseudotsugata
Streptomyces
coelicolor
Borrelia
burgdorferi
Sulfolobus sp.
Yersinia
pestis
Autographica
californica
Vibrio
cholerae
Bombyx
mori
Staphylococcus
aureus
Porphyromonas
gingivalis
Archaeoglobus
fulgidus
Chlamydia
trachomatis
Salmonella
typhi
Aeropyrum
pernix
Pseudomonas
syringiae
Halobacterium sp.
Methanococcus
jannaschi
Rhizobium sp.
Borrelia
burgdorferi
Streptomyces
albus
Halobacterium sp.
Vibrio
cholerae
Euplotes
crassus
Euplotes
crassus
Pseudomonas
putida
Borrelia
hermsii
Neisseria
meningitidis
Borrelia
burgdorferi
Borrelia
burgdorferi
Borrelia
burgdorferi
Bacillus
thuringensis
Streptococcus
mitis
Streptococcus
pneumoniae
Enterococcus
faecium
Escherichia
coli O157:H7
Methanothermobacter
wolfeii
Escherichia
coli O157:H7
Escherichia
coli O157:H7
Escherichia
coli O157:H7
Escherichia
coli O157:H7
Escherichia
coli O157:H7
Escherichia
coli O157:H7
Escherichia
coli O157:H7
Lactococcus
lactis
Lactococcus
lactis
Streptococcus
pyogenes
Lactococcus
lactis
Lactococcus
lactis
Streptococcus
pyogenes
Streptococcus
pyogenes
Lactococcus
lactis
Lactococcus
lactis
Lactococcus
lactis
Streptococcus
pyogenes
Streptococcus
pyogenes
Lactococcus
lactis
Escherichia
coli O157:H7
Escherichia
coli O157:H7
Mesorhizobium
loti
Mesorhizobium
loti
Rhizobium
rhizogenes
Mezorhizobium
loti
Mezorhizobium
loti
Mezorhizobium
loti
Mezorhizobium
loti
Salmonella
enterica
Mezorhizobium
loti
Escherichia
coli
Mezorhizobium
loti
Caulobacter
crescentus
Mezorhizobium
loti
Mezorhizobium
loti
Mezorhizobium
loti
Staphylococcus
aureus
Caulobacter
crescentus
Caulobacter
crescentus
Vibrio
salmonicida
Staphylococcus
aureus
Sulfolobus
solfataricus
Clostridium
perfringens
Mycoplasma
pulmonis
Thermoplasma
volcanium
Thermoplasma
acidophilum
Thermoplasma
volcanium
Salmonella
enterica
Pasteurella
multocida
Pasteurella
multocida
Mesorhizobium
loti
Deinococcus
radiodurans
Halobacterium sp.
Pasteurella
multocida
Porphyromonas
gingivalis
Porphyromonas
gingivalis
Rhodobacter
capsulatus
Mesorhizobium
loti
Mesorhizobium
loti
Mesorhizobium
loti
Mesorhizobium
loti
Mesorhizobium
loti
Mesorhizobium
loti
Mesorhizobium
loti
In some embodiments, the suitable recombinases are the recombinases listed as numbers 7, 12, 93, 95, 97, and 98 in Table 1.
A method to identify an OSSR is by determining by identifying the catalytic residues. Identifying orthogonality is done by preparing a plasmid containing a gene of interest (such as Kanamycin resistance) that is flanked by putative recombinase recognition sites. Co-transformation of a cell with this plasmid and a plasmid containing the putative recombinase will either result in excision of the gene of interest or no reaction. If a reaction occurs, the cell will then be susceptible to treatment with Kanamycin. This is identifiable by replica plating of viable colonies onto an agar plate containing Kanamycin.
In some embodiments, an integration construct is constructed that can be integrated into a genome using lambda red (a promiscuous recombinase). The integration construct would comprise a promoter blocked by a terminator, with the terminator flanked by recognition sites of one of the six recombinases. By making competent cell stocks of each cell line, each recombinase gene is added as a plasmid and then the activity of the reporter gene is measured. This would determine if each protein was capable of working on multiple sequences.
The paper “DNA recombination with a heterospecific Cre homolog identified from comparison of the pac-c1 regions of P1-related phages” (Sauer and McDermott, Nucleic Acids Research, 2004, Vol. 32, No. 20, 6086-6095; hereby incorporated by reference) performed a similar experiment using a multi-copy plasmid to check cross reactivity between the recombinase Cre and a homolog protein labeled Dre. A marker for Zeomycin resistance was placed between the recognition sites, and then cell lines were transformed with a separate plasmid containing the recombinase under study, either Cre or Dre. By growing these cells in the presence or absence of the antibiotic, a perfect record of viability was reported. When using the non-cross reacting gene, a perfect record of death was reported when using the appropriate gene to remove the resistance marker. This screen addresses point (2) listed above, that a recombinase will not bind to the recognition sites of another. However, this screen does not determine whether a mixed recognition site package (site cre, GENE, site dre, as well as the reverse) is capable of initiating recombination.
The recombinant nucleic acids of the invention can be constructed using methods well known to one skilled in the art. One such method includes the “West Coast BioBricks System” (BioBricks) for which separate constructs have been made for many recognition sites, promoters, resistance markers, and other biological device pieces. These pieces allow sequential assembly of complicated constructs. Exemplary components useful for the BioBricks system are shown in
One can use the BioBricks system to make constructs similar to those described in the referenced paper, with many different recognition sites flanking an antibiotic resistance marker. All relevant combinations will be made for each construct. For example, for recombinases A, B, and C that recognize sites a, b, and c, respectively, the following constructs can be made:
As described above, Marker 1 is removable but Marker 2 is not. Stocks of competent cells are made for each construct. These cells are then transformed with plasmids containing one or more recombinases each to cover all potential combinations. Those plasmids should also harbor resistance marker 3. These transformations are then be plated on agar plates containing antibiotics 2 and 3, and incubated to give rise to resultant colonies. Using the replica plating technique, colonies are then transferred to a plate containing antibiotic 1. Colonies are counted to assess viability.
In some embodiments, the recombinant nucleic acid further comprises a promoter that is upstream of the constructs and is capable of transcribing one or more of the constructs in a suitable host cell.
In some embodiments, the recombinant nucleic acid further comprises one or more target nucleotide sequences that are downstream of the constructs, wherein the one or more target sequences are transcribed when all of the constructs are deleted or excised. The target nucleotide sequences can encode an ORF, interference RNA, antisense RNA, or the like.
In some embodiments, the nucleotide sequence of interest comprises one or more ORF, interference RNA, antisense RNA, or the like, or one or more a terminator, or both thereof.
In some embodiments, the ORF encodes a polypeptide. The polypeptide can be a selective marker, an enzyme, a polypeptide that causes the death of the host cell in which the recombinant nucleic acid is located, or the like.
In some embodiments, each construct can further comprise a terminator located between the nucleotide sequence of interest and the second recognition sequence.
In some embodiments, the construct is located or inserted within two ORFs, that when the construct is excised from the recombinant nucleic acid, the two ORFs form a single ORF encoding a polypeptide of a certain biological function. In some embodiments, the certain biological function is one that causes the death of a host cell comprising the recombinant nucleic acid.
In some embodiment of the present invention, the recombinant nucleic acid comprises a synthetic telomere. The synthetic telomere is one application of multiple OSSRs that function as a counting mechanism for host cell, such as a bacteria, such as E. coli. The synthetic telomere makes use of cell cycle sensitive, low level expression of recombinases that, over time, cleaves inhibitory sequences from the genome, and concludes in the expression of a target gene. Such a device requires OSSRs that do not prevent miscounting, or worse, genomic scrambling. In some embodiments, the synthetic telomere is capable of a time dependence that based on the number of OSSRs present. The recombinases can be used to remove a section of the recombinant nucleic acid, such as DNA. Using the synthetic telomere, nucleotide sequence is removed as it is processed, preventing it from being read or interfering with the counting mechanism.
In some applications of the synthetic telomere, the synthetic telomere can function as a fuse that “burns down” until it reaches its target (i.e., the promoter becomes adjacent or operably linked to a gene or nucleotide sequence) and causes the expression of the gene or nucleotide sequence. The expression of the gene or nucleotide sequence can in turn directly or indirectly cause the activation or up regulation of the expression of one or more genes, and/or the repression or down regulation of the expression of one or more genes.
A synthetic teleomere functions as follows: (1) A suitable signal from the cell causes the transcription of the first recombinase. The second recombinase cannot be transcribed because of the presence of a terminator element. (2) Once the recombinase is translated and folded, the gene for the recombinase and the associated terminator element are excised by a recombination event. This DNA loop will be broken down by the host cell. (3) The next time the initiating signal fires, the second recombinase is transcribed. The third recombinase cannot be transcribed because of the presence of a terminator element. (4) The process repeats until all recombinase-terminator elements have been removed, and a reporter gene at the end of the sequence is expressed.
The synthetic teleomere acts as a time delay between the activation of transcription and the expression of a target gene. However, if the signal pulses are associated with a distinct phenomenon, such as cell cycles, night and day, or chemical washes, the invention now serves as a counting mechanism (albeit one that always counts down).
The present invention also provides for a recombinant vector comprising the recombinant nucleic acid. The recombinant nucleic acid can be a double-stranded or single-stranded DNA, or RNA. In some embodiments, the recombinant nucleic acid is integrated into a chromosome of a host cell. In some embodiments, the recombinant nucleic acid can further comprise sequences sufficient for having the recombinant nucleic acid stably replicate in a host cell. The recombinant nucleic acid can be replicon capable of stable maintenance in a host cell. In some embodiments, the replicon is a plasmid. The present invention also provides for a vector or expression vector comprising a recombinant nucleic acid of the present invention.
It will, be apparent to one of skill in the art that a variety of recombinant vectors can be utilized in the practice of aspects of the invention. As used herein, “vector” refers to polynucleotide elements that are used to introduce recombinant nucleic acid into cells for either expression or replication. Selection and use of such vehicles is routine in the art. An “expression vector” includes vectors capable of expressing DNAs that are operatively linked with regulatory sequences, such as promoter regions. Thus, an expression vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus or other vector that, upon introduction into an appropriate host cell, results in expression of the cloned DNA. Appropriate expression vectors are well known to those of skill in the art and include those that are replicable in eukaryotic cells and/or prokaryotic cells and those that remain episomal or those that integrate into the host cell genome.
Selectable markers can also be included in the recombinant expression vectors. A variety of markers are known which are useful in selecting for transformed cell lines and generally comprise a gene whose expression confers a selectable phenotype on transformed cells when the cells are grown in an appropriate selective medium. Such markers include, for example, genes that confer antibiotic resistance or sensitivity.
Methods for introducing the recombinant vectors of the present invention into suitable hosts are known to those of skill in the art and typically include the use of CaCl2 or other agents, such as divalent cations, lipofection, DMSO, protoplast transformation, conjugation, and electroporation.
Activating expression from the promoter results in the expression of the first recombinase which in turn proceeds to recognize the two recognition sequence of the first recombinase and excise or delete the first nucleotide sequence of interest. Activating expression of each respective recombinase results in the recognition of the two recognition sequence of the respective recombinase and the excision or deletion of the corresponding nucleotide sequence of interest.
In some embodiments, the method further comprises: introducing a recombinant nucleic acid into a host cell prior to the providing step, wherein the recombinant nucleic acid comprises at least the first construct and the second construct.
The present invention further provides for a compositions and methods attenuating signal (non-coding RNA interference and Dominant-Negative complexation) by coupling to the activation of a recombinase, an enzyme which can toggle a DNA element between two states. A target DNA element can be designed to, the difference between the two states is essentially arbitrary and almost unlimited. It is not necessary that the same gene product be produced in both states. When used in traditional systems, these signal attenuation methods are limited by the fact that the intensity of the desired output is directly tied to the level of attenuation. When recombinases are used in traditional systems, they are typically limited by the noise generated in attempting to control the recombinase gene, which can cause the switch to enter the improper state. It has not been previously demonstrated that these various devices can be operated in concert to both: (1) have greater control over the properties of both devices, and (2) produce unique genetic devices that would not be possible without the marriage of these functions.
Non-coding RNA interference is a method of preventing protein expression from a given mRNA. by the introduction of a second, non-coding RNA (hereafter ncRNA) that renders the target mRNA unreadable by the protein synthesis machinery. A number of design styles have been used to demonstrate this, but the key component is that the non-coding RNA typically binds the coding RNA in a 1:1 ratio. Thus, an effect is produced in direct proportion with the number of readable copies, where readable copies=(mRNA−(non-coding RNA)), modulated by the binding constant. This system is itself subject to noise near the equivalence point, but is otherwise a very tuneable system.
Dominant-Negative complexation involves intentional production of a non-functional version of the active enzyme (usually a point mutation in the active site) that attenuates the active enzyme in two ways: (1) Competition for the binding site, and (2) Interference in the formation of functional multimers. In both cases, the level of attenuation is a function of concentration of the active enzyme relative to inactive enzyme. However, in systems where every subunit of a complex must be functional to produce an active enzyme (as with recombinases), the multimer effect allows even small amounts of inactive protein to nullify an otherwise-active complex. The intensity of this effect is increased as the number of monomers required in the active complex grows (recombinases are functional tetramers.) Of particular importance is that the recombinase reaction is fully reversible if it does not go to completion! Therefore, a “mixed” multimer of functional and non-functional monomers will not produce a side-product, but will instead continue to suppress function of active multimers by competing for the binding site.
Recombinases are unique among DNA manipulating enzymes for several reasons: (1) The output of their function is digital. When properly designed, the two states can be assigned any arbitrary function. The intensity or type of activation event has no mandatory influence on the two states in any way. It is even possible to toggle a switch between a “sensitive” and “insensitive” state, so that the output can be made dependent or independent to any other control mechanism. (2) The mechanism of action allows for large numbers of recombinases to be used without concern for exhausting available sites or causing cross reactions between multiple sites. Thus one well-designed circuit could easily be adapted for a different output, and multiple circuits could be used in the same cell. This is important in the construction of logic circuits or decision trees that require multiple events recognized over time before a final state is reached. (3) In addition to not interfering with each other, recombinases have a very limited set of required interactions to perform their function: they need to bind identical monomers and bind to DNA. By not involving any other cellular machinery in their function, the system is simplified and the risk of side reactions and cell overload is reduced.
Two different modes of use are possible, integrated/excised or “facing left”/“facing right”. To simplify, further circuits are described using the “facing left”/“facing right” convention although this system works with both designs. The main difference between the two modes is that integrated/excised is less reversible and thus less likely to be scrambled. However, it functions by removing a DNA element, so genetic information is lost permanently when the switch is activated. Alternatively, the “facing left/facing right” circuit is reusable, but can lead to scrambling of the switch if not properly controlled.
A system, for noise canceling in recombinase circuits, works as follows (
1. A constitutive promoter is followed by a non-coding, interfering RNA, producing that RNA at a set level.
2. An inducible promoter is followed by a recombinase gene. This mRNA is interfered with by the product of promoter 1.
3. Leaky expression of the inducible promoter does not lead to recombinase expression, and the switch remains “unflipped”
4. Induction of promoter 2 produces recombinase mRNA in sufficient quantity that it cannot entirely be inhibited, and recombinase is produced.
5. The produced recombinase “flips” the switch.
Another system follows a similar plan, but uses a Dominant-Negative enzyme (
1. A constitutive promoter is followed by an active-site knockout of a recombinase, producing RNA and protein at a set level.
2. An inducible promoter is followed by a functional recombinase.
3. Leaky expression of the inducible promoter does not lead to recombinase function, and the switch remains “unflipped”.
4. Induction of promoter 2 produces recombinase in sufficient quantity that (a) functional tetramers form and (b) can compete with non-functional tetramers for DNA binding sites.
5. The functional recombinase tetramers “flip” the switch.
Note that system requires lower levels of its inhibitory product to cause inhibition. This makes it stronger, but also harder to titrate exact levels. The above two systems allow for noise canceling. By replacing promoter 1 with an inducible promoter, the threshold of activation is no longer a static value, and the amount of induction required for flipping the switch changes based on the cellular environment. This is summarized for both types of systems in
With careful system construction any two promoters can have an output tied to a change in their expression level ratio rather than in absolute abundance, if they dominant-negative system is used. This is because various ribosome binding sites can be used to alter the relationship between RNA level and protein level to a range where the system responds.
The invention having been described, the following examples are offered to illustrate the subject invention by way of illustration, not by way of limitation.
An embodiment of the invention is a nucleic acid comprising a nucleotide sequence comprising: promoter—ribosome binding site—recognition site 1a—terminator—recognition site 1b—recognition site 2a—terminator—recognition site 2b—gene of interest, the gene of interest cannot be expressed until the sequence is removed of terminators by treatment with both recombinase 1 and recombinase 2. The expression cassettes for recombinase 1 and 2 do not need to be part of the named construct, nor do they need to be activated simultaneously (hence the name “fusebox”). Once each of the recombinases is activated a single time, it activates a permanent change in cellular state, bringing the cell closer to the expression of the gene of interest. Such a system would allow for multiple, non-simultaneous “checkpoints” to be identified before the gene of interest would be expressed.
This example demonstrates the efficacy of the cross testing method, initiation the development of our compatibility grid, and the design a circuit using the information thus gained.
Three of the most commonly used (in recombinant systems) recombinases are tested: cre, dre, and FlpSc (here the Sc annotation means from S. cerevisiae), each paired with one of their better characterized att sites:
As multiple sequences for multiple att sites appear across publications, an internal reference code is prepared for each att site. For example, the above sites are referred to as CreA, DreA, and ScA, respectively. This allows keeping track of the various sequences independent of their references while still retaining the reference and sequence information for later publication. It should also be noted that each recombinase has more than a single att site. Most have either two or four, although for commonly used proteins like lox, more are known. These sites need to be cross-tested against each other as well as with att sites that putatively respond to a different recombinase. Such experiments produce independent, non-cross reactive effects across an entire genome with a single enzyme.
Two separate classes of constructs are prepared:
Recombinase constructs: p15A origin, Cm resistance, pTET promoter, and one or two recombinases in an operon.
Att excision constructs: ColE1 origin, Amp resistance, pLacUV5 promoter, attx—Red Fluorescent Protein—atty, where attx and atty are various att recognition sites described above.
These vectors are then co-transformed into E. coli strain DH10B, outgrown for one hour in SOC media, and plated on LB plates containing Ampicillin, Chloroamphenicol, IPTG (to induce RFP expression) and aTc (to induce recombinase expression). Although RFP is induced, the constructs are tested using colony PCR to determine if the corresponding part of the test construct is short, indicating an excision product, or long, indicating that RFP has not been excised. These tests can also be performed using more recombinases and att sites, as well as different copy number of the target plasmid and multiple excision targets, including Kan resistance and removal of a terminator that is blocking beta-galactosidase production (i.e. an “on” switch).
Below, these pairs are drawn out in a grid. The grid is filled in with the expected behavior of each crosstest. Table 2 shows the expected cutting patterns for various combinations of recombinases and att sites.
As can be see in the right half of Table 2, the absence of cutting on “mis-matched” att sites is critical to the function of our recombinase circuits. While it would be difficult to test greater numbers of recombinases or sites simultaneously, we hope to map out possible conflicts by analyzing each pairwise combination. If future discrepancies arise in more complicated systems, using this information will allow us to more readily identify the cause of the incompatability.
In each set of experiments, the desired behavior is observed. Most importantly, when Cre and Dre are co-expressed, the mixture can correctly excise a CreA-CreA construct, a DreA-DreA construct, but shows no activity on a CreA-DreA construct. This was not necessarily true because the proteins are so similar that antibodies raised to Cre will bind to Dre in a western blot. The exact reason for this relationship is not known, but may have to do with the nature of the Holliday junction resolution (i.e., which nucleotides immediately flank the recombination site) rather than the protein quaternary structure.
A number of cross tests are also performed using FlpSc. Surprisingly, FlpSc is a “bad partner” in crosstests involving Cre. Since FlpSc and Cre are not even in the same sub-family, this behavior is not expected. This result may be related to copy number or att site choice or FlpSc is a less accessible enzyme for advanced circuit design.
The current data of the compatibility grid is in Table 3 below.
It currently appears that both Cre and Dre show some cross activity on at least one att site that is normally associated with the other protein. Although it may be possible to work around this problem by using alternate aft sites, other behaviors of FlpSc suggest that screening other recombinases will prove more immediately fruitful. When FlpSc is co-expressed with another protein, we have been unable to obtain a PCR product when we analyze the colonies. FlpSc may be reacting between sites on multiple plasmids, a behavior reported in the literature for Cre and Dre, but this not yet conclusively verified. The inclusion of copy number variation experiments and changing the reporter to Kanamycin resistance and testing for cell survivability, as described previously, may resolve this.
Using the information obtained regarding the compatibilities of Cre and Dre recombinases, an “or” gate is designed as shown in
As discussed above, the following describes a method for discovering new att sites that make an orthogonal pair but still respond to an existing recombinase. For example, there are many pairs of sites known that the Cre recombinase can act on, but that the individual sites themselves are not competent to resolve, and thus form exclusive pairs. The information gained from performing the crosstesting work, together with sequence information of the known competent pairs of att sites allows one to determine which residues of the crossover region of the att sites generate this site-match specificity that is independent of the recombinase recognition affinity. Increasing the number of att sites that respond to a single recombinase has great potential for introducing genome-wide modifications in a living organism simultaneously and without side-reactions in response to a single stimulus event. Such a tool would be useful both in optimization of fermentative production of small molecules and in the control of genetically engineered organisms in the field, for example plants that begin breaking down their own cellulose in response to a certain signal.
A primary concern in the use of recombinase based circuits is that since the input signal intensity is effectively decoupled from the output signal intensity (i.e. a different promoter can be used for the recombinase and the target gene, as shown in the “or” gate circuit), there may be problems with noise and fidelity. Unlike traditional promoter systems which respond to a variety of inducer concentrations, a recombinase circuit can be a “single fire” system: as soon as the target DNA sequence is excised (or inverted, in some designs), the new circuit becomes active and typically cannot be undone in a controlled fashion. As such controlling more complex circuits requires compensation for this sensitivity and unidirectional. Allowing the att sites and inverted DNA sections to remain in the cell is more likely to scramble the circuit than to allow for the generation of more intricate pathways. However, since the system is effectively completely irreversible, a mechanism is needed to prevent noisy signal from cutting our circuits. This can be done by expressing dominant negative recombinases along side the functional circuits, as described herein, that is using dominant-negative repression and non-coding RNA interference for tuning recombinase based genetic circuits to allow for decoupling of input and output signal strengths by setting a threshold effect (i.e. band-pass filter).
In a simple system, dominant negative recombinase monomers can be expressed with a constitutive promoter of known output. This drastically reduces the chance that 4 functional monomers can access the target sequence and fire until they are in a concentration in excess of the dominant negative monomers. One then measures the ratio of inactive to active monomers that allows activation. Over time, any circuit could fire regardless of the amount of inactive monomer present, so one can test these constructs in longitudinal experiments to assess their stability, which is especially relevant for field applications. The dominant negative recombinases are expected to be specific for their parent construct. Thus, expressing CreX (where the X signifies the inactive dominant-negative mutant) inhibits the activity of Cre but not Dre, while expressing DreX inhibits the activity of Dre and not Cre. By differentially expressing both CreX and DreX, one is able to set independent thresholds for the activity of each enzyme. This type of behavior is useful for therapeutic applications, where a “tumor-hunter” bacteria must carefully calibrate multiple signals in identifying a tumor before it is allowed to initiate production of a toxic chemical.
In another application, the dominant-negative mutant is put under control of a variable strength promoter, while the active recombinase is also under control of a variable strength promoter. As before, when thee construct is tuned so that the initial active:inactive ratio is below the activity threshold, it will not fire. In this case however, since both promoter strengths are changing, the difference in initial promoter strength can be lower (while protein copy number difference is maintained through differences in ribosome binding site strength). This allows the use of native promoter mechanisms to control a circuit in response to a change in cell physiology. This allows access to more “hands-off” circuits, where a cell responds to natural changes in its own environment rather than induced changes (i.e., adding IPTG or arabinose), and to do so in such a way where a strong response can still be generated by use of the active recombinase causing a much stronger promoter to join up to a target DNA.
This approach is also useful in that, in many circuit designs, one can think of a circuit as being made of “cheap” DNA and “expensive” proteins. In other words, it is energetically costly to maintain a protein gradient of activator and repressor pairs, but energetically cheaper to maintain the DNA that encodes those proteins and the DNA that encodes the target DNA elements they act upon. Since recombinases can induce a permanent change in the DNA of a cell without the constant presence of the protein, circuits can be designed that excise both the recombinase and the dominant negative mutant when they are no longer needed, lowering the protein burden on the cell, and only then expressing the next set of necessary proteins. Using this “sliding window” approach allows the circuit to progress towards completion without an exponential increase in protein production to maintain it.
A knockout of Dre, named DreX(Y324F) is prepared. Tyrosine Y324 is annotated as the active site tyrosine for this enzyme. However, after replacing this residue with phenylalanine, as determined in the sequencing result shown below, one still observes activity of the enzyme using our crosstesting method described herein. See
Seven recombinases (Cre, Dre, FlpSc, FlpZb, FlpZf, ΦC31, and Bxb1, where Sc Zb and Zf are different species of yeast) are analyzed for compatibility (ability to act specifically and independently upon their own attachment, att, sites when present in a single cell) in an experimental system (see Tables 4-7). Each is able to produce the desired recombination product using its cognate att sites in our system. Cre is compatible with Dre under some conditions, although some recombination is observed upon sequences flanked by mismatched CreA-DreA att sites (half-sites). This is the only half-site combination that resulted in an excision product. Analysis of the CreA and DreA sites suggest that a functional Holliday junction could be formed, although the identity of the cleavage product has not been determined. Dre and FlpSc are compatible under all conditions, while FlpSc and Cre demonstrated cross-reactivity. The remaining four recombinases show high levels of cross reactivity with non-cognate att sites.
The incompatibility between Cre and Dre is easily resolvable by changing the bases of the crossover region to make CreA and DreA an incompatible pair (one that cannot form a holiday junction that permits strand exchange.) The incompatibility between FlpSc and Cre may not be resolvable, but this pairing (and many additional enzymes and binding sites) is currently being testing using a plate based assay that significantly increases our analytical throughput. A flowchart of this process is shown in
12/12
12/12
12/12
12/12
12/12
12/12
8/8
While the present invention has been described with reference to the specific embodiments thereof, it should be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the true spirit and scope of the invention. In addition, many modifications may be made to adapt a particular situation, material, composition of matter, process, process step or steps, to the objective, spirit and scope of the present invention. All such modifications are intended to be within the scope of the claims appended hereto.
This application claims priority as a continuation-in-part application of PCT International Patent Application No. PCT/US2009/060275, filed Oct. 9, 2009, which claims priority to U.S. Provisional Patent Application Ser. No. 61/104,239, filed on Oct. 9, 2008; which are hereby incorporated by reference.
The invention described and claimed herein was made in part utilizing funds supplied by the U.S. Department of Energy under Contract No. DE-AC02-05CH11231. The government has certain rights in this invention.
Number | Name | Date | Kind |
---|---|---|---|
20030022375 | Itoh et al. | Jan 2003 | A1 |
20060073593 | Byrd et al. | Apr 2006 | A1 |
20060078902 | Bunting | Apr 2006 | A1 |
Number | Date | Country | |
---|---|---|---|
20120135524 A1 | May 2012 | US |
Number | Date | Country | |
---|---|---|---|
61104239 | Oct 2008 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2009/060275 | Oct 2009 | US |
Child | 13088288 | US |