The present invention relates to I-CreI variants which can in particular recognise and cleave DNA targets which do not comprise the same nucleotides at positions ±6 and ±7 which are present in the wild type I-CreI target. The present invention also relates to I-CreI variants which can recognise and cleave targets which do not comprise the wild type nucleotides at positions ±4, ±5, ±6, ±7 and to I-CreI variants which can recognise and cleave targets which do not comprise the wild type nucleotides at positions ±4, ±5, ±6, ±7, ±8, ±9 and ±10.
Since the first gene targeting experiments in yeast more than 25 years ago (Hinnen et al, 1978; Rothstein, 1983), HR has been used to insert, replace or delete genomic sequences in a variety of cells (Thomas and Capecchi, 1987; Capecchi, 2001; Smithies, 2001). Targeted events occur at a very low frequency in mammalian cells, making the use of innate HR impractical. The frequency of HR can be significantly increased by a specific DNA double-strand break (DSB) at a locus (Rouet et al, 1994; Choulika et al, 1995). Such DSBs can be induced by meganucleases, sequence-specific endonucleases that recognize large DNA recognition target sites (12 to 30 bp).
Meganucleases show high specificity to their DNA target; these proteins can cleave a unique chromosomal sequence and therefore do not affect global genome integrity. Meganucleases are essentially represented by homing endonucleases, a widespread class of proteins found in eukaryotes, bacteria and archae (Chevalier and Stoddard, 2001). Early studies of the I-SceI and HO homing endonucleases have illustrated how the cleavage activity of these proteins can be used to initiate HR events in living cells and have demonstrated the recombinogenic properties of chromosomal DSBs (Dujon et al, 1986; Haber, 1995). Since then, meganuclease-induced HR has been successfully used for genome engineering purposes in bacteria (Posfai et al, 1999), mammalian cells (Sargent et al, 1997; Donoho et al, 1998; Cohen-Tannoudji et al, 1998), mice (Gouble et al, 2006) and plants (Puchta et al, 1996; Siebert and Puchta, 2002).
I-CreI is a meganuclease which has been studied extensively and for which the inventors and their collaborators have been able to change the I-CreI specificity toward the nucleotides at positions ±10, ±9, ±8 (10NNN region, WO2007/049156) or nucleotides at positions ±5, ±4, ±3 (5NNN region, WO2006/097853) of the wild type palindromic target of I-CreI (
In previous work the specificity of I-CreI towards the 7NN nucleotides (bases at positions ±7, ±6) of the C1221 target was not modified, as a natural partial degeneracy of I-CreI exists towards this portion of the target. This 7NN degeneracy can be deduced from the wild type I-CreI C1234 target (
The need to generate I-CreI variants with new specificity has led the inventors to consider the possibility of changing the specificity of I-CreI toward the 7NN nucleotides by introducing substitutions at positions 26, 28 and 42 of I-CreI (
For the first time the inventors have experimentally shown that it is possible to intentionally change I-CreI specificity towards the 7NN nucleotides. In particular the inventors have shown that all the 16 (44) 7NN_P targets are cleaved by at least one I-CreI variant. In addition, the inventors have also shown for the first time that the 7NN nucleotides define a new DNA region able to behave as the 10NNN and 5NNN regions previously identified by the inventors, as the set of mutations conferring the new 7NN specificity can be combined in a combinatorial manner with other sets of mutations.
In order to optimize the time required for the whole engineering process (an I-CreI variant with a modified specificity toward the three 10NNN, 7NN and 5NNN DNA regions), the inventors have shown how to generate directly I-CreI variants with a modified specificity toward the nucleotides at positions ±4, ±5, ±6 and ±7 (the 256 (44) palindromic 7NNNN_P targets,
Finally, the inventors have shown how to generate a variant directed against the complete 10NNNNNNN region, that is a target which can vary from the wild type target sequence of I-CreI at each of nucleotides ±4, ±5, ±6, ±7, ±8, ±9 and ±10, from the combination of two sets of mutations that confer new specificity toward the 7NNNN and 10NNN regions.
In the present Patent Application the terms meganuclease(s) and variant(s) and variant meganuclease(s) will be used interchangeably herein.
According to a first aspect of the present invention there is provided an I-CreI variant, having at least two substitutions, said variant being able to cleave a 7NNNN_P palindromic DNA target sequence (SEQ ID NO: 44) other than the wild type I-CreI DNA target sequence (SEQ ID NO: 40), and being obtainable by a method comprising at least the steps of:
(a) constructing a first series of I-CreI variants having at least one substitution in a position selected from the group: 26, 28, 42,
(b) constructing a second series of I-CreI variants having at least one substitution in a position selected from the group: 44, 68, 77,
(c) selecting and/or screening the variants from the first series of step (a) which are able to cleave a mutant I-CreI site wherein the nucleotides in positions ±7 to ±6 of the wild type I-CreI site have been replaced with the nucleotides which are present in positions ±7 to ±6 of said 7NNNN_P DNA target sequence,
(d) selecting and/or screening the variants from the first series of step (b) which are able to cleave a mutant I-CreI site wherein the nucleotides in positions ±5 to ±4 of the wild type I-CreI site have been replaced with the nucleotides which are present in positions ±5 to ±4 of said 7NNNN_P DNA target sequence,
(e) combining in a single variant, the mutation(s) in positions 26, 28, 42, and 44, 68, 77 of two variants from step (c) and step (d), to obtain a novel homodimeric I-CreI variant which cleaves a sequence wherein the nucleotide quartet in positions ±7 to ±4 is identical to the nucleotide quartet which is present in positions ±7 to ±4 of said 7NNNN_P DNA target sequence.
According to another aspect of the invention, homodimeric variants able to cleave a 7NNNN_P DNA target sequence can be directly generated without a combinatorial step. There is provided an I-CreI variant, having at least two substitutions, said variant being able to cleave a 7NNNN_P palindromic DNA target sequence (SEQ ID NO: 44) other than the wild type I-CreI DNA target sequence (SEQ ID NO: 40), and being obtainable by a method comprising at least the steps of:
(a′) constructing I-CreI variants having at least one substitution in a position selected from the group: 26, 28, 42, 44, 68, 77
(b′) selecting and/or screening the variants from step (a′) which are able to cleave a 7NNNN_P palindromic DNA target sequence site wherein the nucleotides in positions ±7 to ±4 of the wild type I-CreI site have been replaced with the nucleotides which are present in positions ±7 to ±4 of said 7NNNN_P DNA target sequence.
Preferably, the variants obtained in step (e) and in step (b′), also called 7NNNN cutters, are heterodimers, resulting from the association of a first and a second monomer having different mutations in positions 26, 28, 42, 44, 68, 77 of I-CreI, said heterodimers being able to cleave a non-palindromic DNA target sequence.
The inventors have now proven therefore that it is possible to create an I-CreI variant which can recognise and cleave a DNA target modified at the 7NNNN positions in a single round of selection.
According to a second aspect of the present invention there is provided an I-CreI variant, having at least two substitutions, said variant being able to cleave a 10NNNNNNN_P palindromic DNA target sequence other than the wild type I-CreI DNA target sequence (SEQ ID NO: 40), and being obtainable by a method comprising at least the steps of:
(A) selecting variants of step (c) having at least one substitution in a position selected from the group: 26, 28, 42, which are able to cleave a mutant I-CreI site wherein the nucleotides in positions ±7 to ±6 of the wild type I-CreI site have been replaced with the nucleotides which are present in positions ±7 to ±6 of said 10NNNNNNN_P DNA target sequence; or
(A′) selecting 7NNNN cutters of steps (e) and (b′) having at least two substitutions in a position selected from the group: 26, 28, 42, 44, 68, 77, which are able to cleave a mutant I-CreI site wherein the nucleotides in positions ±7 to ±4 of the wild type I-CreI site have been replaced with the nucleotides which are present in positions ±7 to ±4 of said 10NNNNNNN_P DNA target sequence,
(B) constructing a series of I-CreI variants having at least one substitution in a position selected from the group: 30, 32, 33, 38, 40,
(C) selecting and/or screening the variants from the first series of step (B) which are able to cleave a mutant I-CreI site wherein the nucleotides in positions ±10 to ±8 of the wild type I-CreI site have been replaced with the nucleotides which are present in positions ±10 to ±8 of said 10NNNNNNN_P DNA target sequence,
(D) combining in a single variant, the mutation(s) in positions 26, 28, 42, 44, 68, 77 and 30, 32, 33, 38, 40 of two variants from step (A) or (A′), and step (C), to obtain a novel homodimeric I-CreI variant which cleaves a sequence wherein the nucleotide septet in positions ±10 to ±4 is identical to the nucleotide septet which is present in positions ±10 to ±4 of said 10NNNNNNN_P DNA target sequence.
Preferably, the variant obtained in step (D) is a heterodimer, resulting from the association of a first and a second monomer having different mutations in positions 26 to 42 and 44 to 77 of I-CreI, said heterodimer being able to cleave a non-palindromic DNA target sequence.
The inventors have also shown that it is possible to generate I-CreI variants which cleavages a target which is variable across the entire 10NNNNNNN portion of the target in a simple two step process.
According to a further aspect of the present invention the variant may be obtained by a method comprising the additional steps of:
(i) constructing a third series of variants having at least one additional substitution in at least one of the monomers in said heterodimers,
(ii) combining said third series variants of step (i) and screening the resulting heterodimers for altered cleavage activity against said DNA target.
Preferably in step (i) said at least one additional substitution is introduced by site directed mutagenesis in a DNA molecule encoding said third series of variants, and/or by random mutagenesis in a DNA molecule encoding said third series of variants.
Preferably steps (i) and (ii) are repeated at least two times and wherein the heterodimers selected in step (i) of each further iteration are selected from heterodimers screened in step (ii) of the previous iteration which showed increased cleavage activity against said DNA target.
Preferably the residue at position 75 of I-CreI is not substituted.
Preferably the variant comprises one or more substitutions on the entire I-CreI sequence that improve the binding and/or the cleavage properties of the variant towards said DNA target sequence.
Preferably the substitutions are replacement of the initial amino acids with amino acids selected in the group consisting of A, D, E, F, G, H, I, K, M, N, P, Q, R, S, T, Y, C, W, L and V.
Preferably the variant is an obligate heterodimer, wherein the first and the second monomer, respectively, further comprises the D137R mutation and the R51D mutation.
Preferably the obligate heterodimer, wherein the first monomer further comprises the K7R, E8R, E61R, K96R and L97F or K7R, E8R, F54W, E61R, K96R and L97F mutations and the second monomer further comprises the K7E, F54G, L58M and K96E or K7E, F54G, K57M and K96E mutations.
Alternatively the variant consists of a single polypeptide chain comprising two monomers or core domains of one or two variant(s) according to the present invention or a combination of both.
Preferably the first and the second monomers are connected by a peptide linker.
It is understood that the scope of the present invention also encompasses the I-CreI variants, including heterodimers, obligate heterodimers, single chain meganuclease as non limiting examples, having at least one substitution in a position selected from the group 26, 28, 42.
According to another aspect of the present invention there is provided a polynucleotide fragment encoding the variant as defined above.
According to another aspect of the present invention there is provided an expression vector comprising at least one polynucleotide fragment as defined above.
The present invention also relates to a host cell, characterized in that it is modified by a polynucleotide or a vector according to the present invention.
The recombinant vectors comprising said polynucleotide may be obtained and introduced in a host cell by the well-known recombinant DNA and genetic engineering techniques.
The polypeptide of the invention may be obtained by culturing the host cell containing an expression vector comprising a polynucleotide sequence encoding said polypeptide, under conditions suitable for the expression of the polypeptide, and recovering the polypeptide from the host cell culture.
The present invention also relates to a non-human transgenic animal, characterized in that all or part of its constituent cells is modified by a polynucleotide or a vector according to the present invention.
The present invention also relates to a transgenic plant, characterized in that all or part of its constituent cells is modified by a polynucleotide or a vector according to the present invention.
The present invention also relates to the use of a meganuclease according to the present invention in a therapeutic method, in particular a meganuclease according to the present invention can be used for genome therapy ex vivo (gene cell therapy) and genome engineering. Most particularly the described meganucleases could be used to insert, delete or repair an endogenous or exogenous coding sequence.
To do this the meganuclease (or a polynucleotide encoding said meganuclease) and/or the targeting DNA are contained within a vector. Vectors comprising targeting DNA and/or nucleic acid encoding a meganuclease can be introduced into a cell by a variety of methods (e.g., injection, direct uptake, projectile bombardment, liposomes, electroporation). Meganucleases can be stably or transiently expressed into cells using expression vectors. Techniques of expression in eukaryotic cells are well known to those in the art. (See Current Protocols in Human Genetics: Chapter 12 “Vectors For Gene Therapy” & Chapter 13 “Delivery Systems for Gene Therapy”). Optionally, it may be preferable to incorporate a nuclear localization signal into the recombinant protein to be sure that it is expressed within the nucleus.
Once in a cell, the meganuclease and if present, the vector comprising targeting DNA and/or nucleic acid encoding the meganuclease are imported or translocated by the cell from the cytoplasm to the site of action in the nucleus. Whilst within the nucleus the meganuclease will cut any targets present in the genome and the vector resulting in double strand breaks which will be repaired by the endogenous repair mechanisms of the host cell and when a repair occurs between the genome and vector sequence this will result in a genome engineering event such as an insertion, deletion or repair.
For purposes of therapy, the meganucleases and a pharmaceutically acceptable excipient are administered in a therapeutically effective amount. Such a combination is said to be administered in a “therapeutically effective amount” if the amount administered is physiologically significant. An agent is physiologically significant if its presence results in a detectable change in the physiology of the recipient. In the present context, an agent is physiologically significant if its presence results in a decrease in the severity of one or more symptoms of the targeted disease and in a genome correction of the lesion or abnormality.
Definitions
Throughout the present Patent Application a number of terms and features are used to present and describe the present invention, to clarify the meaning of these terms a number of definitions are set out below and wherein a feature or term is not otherwise specifically defined or obvious from its context the following definitions apply.
For a better understanding of the invention and to show how the same may be carried into effect, there will now be shown by way of example only, specific embodiments, methods and processes according to the present invention with reference to the accompanying drawings in which:
There will now be described by way of example a specific mode contemplated by the Inventors. In the following description numerous specific details are set forth in order to provide a thorough understanding. It will be apparent however, to one skilled in the art, that the present invention may be practiced without limitation to these specific details. In other instances, well known methods and structures have not been described so as not to unnecessarily obscure the description.
In this example, the inventors successfully altered the 7NN specificity of the I-CreI protein. For that purpose, a variant library was built in yeast where I-CreI residues Gln26, Lys28 and Thr42 were randomized. Analysis of the structure of I-CreI in complex with its DNA target shows that residues Lys28 and Gln26 interact respectively with the bases at positions 7 and 6 of the target complementary strand. In addition, the residue Thr42 located on the β3 β-strand of I-CreI is oriented toward the 7NN region (
The 7NN _P targets (
In order to generate I-CreI derived coding sequences with the randomization of residues at positions 26, 28 and 42, two separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-37) or the 3′ end (positions 32-167) of the I-CreI coding sequence. For the 5′ end, PCR amplification is carried out using the Gal10F primer (SEQ ID NO: 2) and the Ulib7NNRev (SEQ ID NO: 3). For the 3′ end, PCR amplification is carried out using the Gal10R primer (SEQ ID NO: 4) and a primer specific to the I-CreI coding sequence for amino acids 32-46 (Ulib7NNFor: 5′ tcttataagtttaaacatcagctaagcttgnvktttcaggtgact-3′, SEQ ID NO: 5). Then, to generate the variant library called Ulib26-28-42, 25 ng of each of the two overlapping PCR fragments and 75 ng of vector DNA (pCLS0542) linearized by digestion with NcoI and EagI were used to transform the yeast Saccharomyces cerevisiae strain FYC2-6A (MATα, trp1Δ63, leu2Δ1, his3Δ200) using a high efficiency LiAc transformation protocol (Gietz and Woods 2002). An intact coding sequence containing the mutations is generated by in vivo homologous recombination in yeast. 2232 clones were picked for further experiment. They represent 66% of the theoretical protein diversity of Ulib26-28-42.
The 2232 clones from the Ulib26-28-42 variant library were screened for cleavage against the sixteen 7NN_P targets using our yeast screening assay. The primary screening yielded 836 positive clones that cleave at least one target. All the sixteen targets were cleaved at least by one variant. 465 positive clones were rearranged, sequenced and processed again for a secondary screening (
To change the specificity of I-CreI toward the 5NN nucleotides, two variant libraries were generated in yeast: Ulib44-68 by randomizing residues Gln44, Arg68 and Ulib44-68-77 by randomizing residues Gln44, Arg68 and Ile77. Residue Gln44 interacts with the base at position 4 of the target complementary strand, Arg68 interacts with the nucleotide at position 5 and Ile77 is oriented toward nucleotides at positions 6 and 5 of the DNA target. Both libraries were screened against the sixteen 5NN_P targets.
The 5NN_P targets were cloned as follows: an oligonucleotide corresponding to the target sequence flanked by gateway cloning sequence was ordered from Proligo (5′-TGGCATACAAGTTTTCAAAACNNCGTACGNNGTTTTGACAATCGTCTG TCA-3′, SEQ ID NO: 6). Double-stranded target DNA, generated by PCR amplification of the single stranded oligonucleotide, was cloned using the Gateway protocol (Invitrogen) into yeast reporter vector (pCLS1055). Yeast reporter vector was transformed into S. cerevisiae strain FYBL2-7B (MAT α, ura3 851, trp1 63, leu2 1, lys2 202).
In order to generate I-CreI derived coding sequences with the randomization of residues at positions 44 and 68, two separate overlapping PCR reactions were carried out that amplify respectively the residues 1 to 59 and the residues 54 to 167 of the I-CreI coding sequence. The first PCR fragment was amplified using the primers Gal10F (SEQ ID NO: 2) and Cre44Rev (5′-cactagtttgtccagaaaccaacggcgctgggtatttgagtcacmnriaaaggtcaagct-3′, SEQ ID NO: 7), and the second fragment with the Cre68For primer (5′-tactggacaaaetagtggatgaaattggegttggttaegtannkgatcgeggatec-3′, SEQ ID NO: 8) and Gal10R (SEQ ID NO: 4) primers. To generate the variant library called Ulib44-68, 25 ng of each PCR fragment and 75 ng of vector DNA (pCLS0542) linearized by digestion with NcoI and EagI were used to transform the yeast Saccharomyces cerevisiae strain FYC2-6A (MATα, trp1Δ63, leu2Δ1, his3Δ200) using a high efficiency LiAc transformation protocol (Gietz and Woods 2002). An intact coding sequence containing the mutations is generated by in vivo homologous recombination in yeast. 1116 clones were picked for further experiment, representing 2.8 times the theoretical protein diversity of Ulib44-68.
In order to generate I-CreI derived coding sequences with the randomization of residues at positions 44, 68 and 77, three separate overlapping PCR reactions were carried out that amplify respectively the residues 1 to 43, the residues 37 to 67 and the residues 63 to 167 of the I-CreI coding sequence. The first PCR fragment was amplified using the primers Gal10F (SEQ ID NO: 2) and Cre43Rev (5′-aaaggtcaagettagctgatgataaa-3′, SEQ ID NO: 9), the second fragment with the Cre44For (5′-catcagctaagcttgacctttnnkgtgactcaaaagacc-3′, SEQ ID NO: 10) and Cre67Rev (SEQ ID NO: 11) primers, and the third fragment with the Cre68-77For (SEQ ID NO: 12) and Gal10R (SEQ ID NO: 4) primers. Before transforming the yeast strain, an assembly PCR was performed with the two first PCR fragments using the Gal10R (SEQ ID NO: 4) and Cre67Rev (SEQ ID NO: 11) primers. Then, to generate the variant library called Ulib44-68-77, 25 ng of each of the assembly PCR fragment and the Cre68-77For-Gal10R PCR fragment and 75 ng of vector DNA (pCLS0542) linearized by digestion with NcoI and EagI were used to transform the yeast Saccharomyces cerevisiae strain FYC2-6A (MATα, trp1Δ63, leu2Δ1, his3Δ200) using a high efficiency LiAc transformation protocol (Gietz and Woods 2002). An intact coding sequence containing the mutations is generated by in vivo homologous recombination in yeast. 2232 clones were picked for further experiment. They represent 28% of the theoretical protein diversity of Ulib44-68-77.
The 1116 clones constituting the Ulib44-68 library were screened against the sixteen 5NN_P targets using our yeast screening assay (
The 2232 clones constituting the Ulib44-68-77 library were screened against the sixteen 5NN_P targets using the yeast screening assay described above (
The 7TATA_P target is a combination of the 5TA_P and 7TA_P targets (
I-CreI variants cleaving the 7TA_P or 5TA_P targets were identified previously. In order to generate I-CreI derived coding sequence containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-50) of variants from the Ulib26-28-42 library or the 3′ end (positions 43-167) of variants belonging to the Ulib44-68-77 library. For both the 5′ and 3′ end, PCR amplification is carried out using the Gal10F (SEQ ID NO: 2) and Gal10R (SEQ ID NO: 4) primers specific to the vector and primers specific to the I-CreI coding sequence for amino acids 43-50: Comb75assFor (5′-tttXXXgtgactcaaaagacccag-3′, SEQ ID NO: 13) and Comb75assRev (5′-ctgggtettttgagtcacXXXaaa-3′, SEQ ID NO: 14) where XXX codes for residue 44. The PCR fragments resulting from the amplification reaction realized with the same primers and with the same coding sequence for residue 44 were pooled. Then, each pool of PCR fragments resulting from the reaction with primers Gall OF (SEQ ID NO: 2) and Comb75assRev (SEQ ID NO: 14, for Ulib26-28-42 variants) or Comb75assFor (SEQ ID NO: 13) and Gal10R (SEQ ID NO: 4, for Ulib44-68-77 variants) was mixed in an equimolar ratio. Finally, approximately 25 ng of each final pool of the two overlapping PCR fragments and 75 ng of vector DNA (pCLS0542) linearized by digestion with NcoI and EagI were used to transform the yeast Saccharomyces cerevisiae strain FYC2-6A (MATα, trp1Δ63, leu2Δ1, his3Δ200) using a high efficiency LiAc transformation protocol (Gietz and Woods 2002). An intact coding sequence containing both groups of mutations is generated by in vivo homologous recombination in yeast.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68 and 77 of twenty 5TA_P cutters coming from Ulib44-68 or Ulib44-68-77 with mutations at positions 26, 28 and 42 of twenty 7TA_P cutters coming from Ulib26-28-42. The resulting combinatorial library has a complexity of 400 variants. This library was transformed into yeast and 1116 clones (2.8 times the diversity) were screened for cleavage against the 7TATA_P DNA target. 714 clones of the combinatorial 7TATA library turned out to be positive. Only 93 clones were rearranged and sequenced. They yielded 55 unique sequences corresponding to novel combinatorial meganucleases. An example of such meganucleases is given in Table 1. The secondary screening confirmed their strong cleavage efficacy against the 7TATA_P target (
The 7TTCT_P target is a combination of the 5CT_P and 7TT_P targets (
As described in example 3.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68 and 77 of 34 5CT_P cutters coming from Ulib44-68 or Ulib44-68-77 with mutations at positions 26, 28 and 42 of 16 7TT_P cutters coming from Ulib26-28-42. The resulting combinatorial library has a complexity of 544 variants. This library was transformed into yeast and 1116 clones (2 times the diversity) were screened for cleavage against the 7TTCT_P DNA target. The primary screening allowed for obtaining 78 positive clones, which were rearranged and sequenced. These 78 positive clones correspond to 34 unique sequences of novel combinatorial meganucleases. A panel of such meganucleases derived from I-CreI is given in Table 2. The cleavage of the 7TTCT_P target was confirmed by a secondary screening (
The 7GACT P target is a combination of the 5CT_P and 7GA_P targets (
As described in example 3.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68 and 77 of 34 variants coming from Ulib44-68 or Ulib44-68-77 that cleave the 5CT_P target (the same variants that were used for example 5) with mutations at positions 26, 28 and 42 of 15 variants coming from Ulib26-28-42 that cleave the 7GA_P target. The resulting combinatorial library has a complexity of 510 variants. This library was transformed into yeast and 1116 clones (2.2 times the diversity) were screened for cleavage against the 7GACT_P DNA target. The primary screening allowed for obtaining 850 positive clones. The 93 positive clones that gave the strongest signal for cleavage were rearranged and sequenced, yielding to 60 confirmed unique variant sequences of novel combinatorial meganucleases. A panel of such meganucleases derived from I-CreI is given in Table 3. The strong cleavage of the 7GACT_P target was confirmed by a secondary screening (
In this example, the inventors show how they were able to generate directly 7NNNN_P cutters by screening a high diversity variant library in yeast. This library was built by randomizing residues at positions 26, 28, 42, 44, 68 and 77 and screened against the 256 7NNNN_P targets.
The 7NNNN_P targets (
b) Generation of the Ulib7NNNN variant library
In order to generate I-CreI derived coding sequences with the randomization of residues at positions 26, 28, 42, 44, 68 and 77, three separate overlapping PCR reactions were carried out that amplify respectively the residues 1 to 37, the residues 32 to 67 and the residues 63 to 167 of the I-CreI coding sequence. The first PCR fragment was amplified using the primers Gal10F (5′-GCAACTTTAGTGCTGACACATACAGG-3′, SEQ ID NO: 2) and Ulib7NNRev (5′-atgataaacttataagactggtttggmbnaatmbnagegatgatgct-3′, SEQ ID NO: 3), the second fragment with the Ulib7NNForBis (5′-tatataagtttaaacatcagetaagettgnvkittnnkgtgactcaaaag-3′, SEQ ID NO: 5) and Cre67Rev (5′-tacgtaaccaacgccaatttcatccac-3′, SEQ ID NO: 11) primers, and the third fragment with the Cre68-77For (5′-ggcgttggttacgtannkgatcgcggatccgatcegattaennkttaagegaaatc-3′, SEQ ID NO: 12) and Gal10R (5′-ACAACCTTGATTGGAGACTTGACC-3′, SEQ ID NO: 4) primers.
The nvk code in the oligonucleotides allows the degeneracy at the positions 26, 28 and 42 among the 15 following amino acids: A, C, D, E, G, H, K, N P, Q, R, 5, T, W Y. The nnk code in the oligonucleotides allows the degeneracy at the positions 44, 68 and 77 among the 20 possible amino acids Before transforming the yeast strain, an assembly PCR was performed with the two first PCR fragments using the Gal10R and Cre67Rev primers. Then, to generate the Ulib7NNNN variant library, 25 ng of each of the assembly PCR fragment and the Cre68-77For PCR fragment and 75 ng of vector DNA pCLS0542 (
Screening was performed as described previously (Arnould, Chames et al. 2006). Mating was performed using a colony gridder (QpixII, Genetix). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (about 4 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of different reporter-harboring yeast strains for each target. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, with galactose (1%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using proprietary software.
The 4464 clones constituting the Ulib7NNNN library were screened against the 256 (44) 7NNNN_P targets using our yeast screening assay (
Some of the variants that were isolated during the primary screening of the Ulib7NNNN library (Example 6) had saturating activities in yeast toward the 7TATA_P or 7GACT_P targets like some of the variants that were obtained in examples 3 or 5. To compare the cleavage activity of different variants that were obtained by the two processes (either by the screening of the Ulib7NNNN library or by the 7NN×5NN combinatorial process), they were further evaluated using an extrachromosomal SSA assay in CHO-K1 cells.
a) Recloning of I-CreI Derived Variants into a Mammalian Expression Vector
The variant ORF was amplified by PCR using the primers CCM2For (5′-aagcagagetetctggetaactagagaacecactgettaetggettategaccatggccaataccaaatataacaaag agttec-3′: SEQ ID NO: 17) and CCMRevBis (5′-CTGCTCTAGATTAGTCGGCCGCCGGGGAGGATTICTTC-3′: SEQ ID NO: 18). The PCR fragment was digested by the restriction enzymes SacI and XbaI, and was then ligated into the vector pCLS1088 (
CHO-K1 cells were transfected with 200 ng of DNA containing various amounts of meganuclease expression vectors (0 to 12 ng) and 150 ng of the reporter plasmid, in the presence of Polyfect transfection reagent in accordance with the manufacturer's protocol (Qiagen). The culture medium was removed 72 hours after transfection, and 150 μl of lysis/detection buffer was added for β-galactosidase liquid assay (typically, for 1 liter of buffer, we used 100 ml of lysis buffer (10 mM Tris-HCl pH7.5, 150 mM NaCl, 0.1% Triton X100, 0.1 mg/ml BSA, protease inhibitors), 10 ml of Mg 100× buffer (MgCl2 100 mM, 2-mercaptoethanol 35%), 110 ml of an 8 mg/ml solution of ONPG and 780 ml of 0.1M sodium phosphate pH7.5). After incubation at 37° C., we measured optical density at 420 nm. The entire process was performed on 96-well plate format using an automated Velocity11 BioCel platform
Table 4 indicates the variants that were subcloned into a mammalian expression vector and further submitted to an extrachromosomal SSA aasy in CHO-K1 cells.
These results demonstrate that I-CreI derived variants able to cleave 7NNNN_P targets can be generated directly with the screening of a variant library and that some of these cutters can be compared in terms of cleavage activity to variants that have been obtained through a combinatorial process as described in examples 3 to 5.
In the present example the inventors engineer an I-CreI variant with a modified specificity toward nucleotides ±10 to ±4 as shown in
Material and methods
The SeqFullComb variant library was generated from the DNA of four 7GACT_P cutters called BrA to D (BrA and BrB are the same variants as those given in Table 4), whose sequence is given in Table 4 below. To build SeqFullComb, which contains mutations at positions 32, 33 and 38, two separate overlapping PCR reactions were carried out on each 7GACT_P variant that amplify the 5′ end (aa positions 1-25) or the 3′ end (aa positions 21-167) of the I-CreI derived variants coding sequence. For the 5′ end, PCR amplification is carried out using the Gal10F (SEQ ID NO: 2) and 107Rev (5′-agegatgatgctaccgtcaecgtc-3′, SEQ ID NO: 29). For the 3′ end, PCR amplification is carried out on each of the BrA to BrD variants using the Gal10R (SEQ ID NO: 4) primer and a primer covering residues 21 to 41 specific of the chosen variant sequence. The primers corresponding to the BrA to BrD variants are respectively: SeqBrAFor (5′-ggtagcatcatcgetactattactccaaaccagnvknvkaagtttaaacatnvkctaagettg-3′, SEQ ID NO: 30), SeqBrBFor (5′-ggtageatcatcgettgtatttetccaaaccagnvknvkaagtttaaacatnyketaagettg-3′, SEQ ID NO: 31), SeqBrCFor (5′-ggtagcatcategctgctattaatccaaaccagnyknvkaagtttaaacatnvkctaagettg-3′, SEQ ID NO: 32) and SeqBrDFor (5′-ggtagcatcatcgctgctattactccaaaccagnyknvkaagtttaaacatnvkctaagettg, SEQ ID NO: 33). The nvk codons at positions 32, 33 and 38 allows the degeneracy at these positions among all the 20 possible amino acids but F, L, M, I and V. Then, the four resulting PCR fragments were mixed in an equimolar ratio to 25 ng final and pooled with 25 ng of the Gal10F-107Rev PCR fragment. This mix was then added to 75 ng of vector DNA (pCLS0542) linearized by digestion with NcoI and EagI that were used to transform the yeast Saccharomyces cerevisiae strain FYC2-6A (MATα, trp1Δ63, leu2Δ1, his3Δ200) using a high efficiency LiAc transformation protocol (Gietz and Woods 2002). An intact coding sequence containing mutations at desired positions is generated by in vivo homologous recombination in yeast. 2232 clones were picked for further experiments. They represent 16.5% of the SeqFullComb protein diversity. Results
The SeqFullComb library was generated from the four BrA to BrD variants (Table 5)
The 2232 clones constituting the SeqFullComb library were screened for cleavage of the FullComb_P DNA target using our yeast screening assay. The primary screening yielded 27 positive clones that resulted after sequencing in 22 unique variant sequences. The secondary screening confirmed the cleavage activity toward the FullComb_P target for the vast majority of the variants (
Taking advantage of the previous screening of the Ulib7NNNN library, this result demonstrates that I-CreI variants with a modified specificity toward nucleotides ±10 to ±4 of the C1221 target can be engineered in only one combinatorial step.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2010/051904 | 4/30/2010 | WO | 00 | 2/4/2013 |