The invention relates to the use of meganuclease variants which cleave at least one target in the provirus of a retrovirus and in particular cleave the genomic insertion of an integrating Virus genome and in particular to meganuclease variants which cleave the Human Immunodeficiency Virus genome following genomic insertion, for the treatment of an infection of one or more of these viruses. The present Invention also relates to such variants and to vectors encoding such variants, as well as to a cell or multi-cellular organism modified by such a vector and to the use of said meganuclease variant and derived products for genome engineering and for in vivo and ex vivo (gene cell therapy) genome therapy.
Viral infections of various sorts are a serious and continuing health, agricultural and economic problem worldwide. In particular viruses present specific treatment and control problems as they always comprise an intracellular stage to their life cycle, in which the nucleic acid genome of the virus is inserted into a host cell and normally transported to the nucleus. During this stage of the virus life cycle, the virus genome can enter into a dormant state whilst inside a host cell, during which time the production of new virus particles/proteins/copies of the viral genome ceases. These characteristics present a significant problem as most medicaments and treatments for viral infection consist of compounds which affect aspects of virus biology involved in the active stages of the virus life cycle, such as compounds which target/inactivate a viral enzyme or structural protein. Therefore whilst in a dormant state the viral genome resident in the cytoplasm or nucleus of a host cell can not be affected by most conventional anti-virus medicaments and therefore persists.
One group of viruses presents additional problems as they integrate into the host cell genome. This group, called retroviruses, like other viruses are transmitted via the infection of new host cells by virus particles and can also cause the endemic infection of the progeny cells of a host cell in which they are genomically integrated. This second mode of transmission, particularly when the retrovirus genome is dormant can result in the clonal expansion of the retrovirus containing cells, which in turn can cause significant problems once the retrovirus genomes activate.
The present invention therefore relates to Retroviruses which are contained with the family Retroviridae which comprises in turn seven genera. Alpharetrovirus, Betaretrovirus, Gammaretrovirus, Deltaretrovirus, Epsilonretrovirus, Lentivirus and Spumavirus. These groups of viruses are responsible for several important diseases such as Human T-lymphotrophic virus (Gammaretrovirus), Rous Sarcoma (Alpharetrovirus) and Human Immunodeficiency Virus (Lentivirus).
The Human Immunodeficiency Virus (HIV) (
The retroviral genome harbors the sequences coding for the viral enzymatic, structural and regulatory proteins. In addition, the genomic RNA molecule contains a series of non-coding sequences that have important functions in different steps of the viral life cycle (
The “2007 AIDS epidemic update” report, issued by the UNAIDS (Joint United Nations Programme on HIV/AIDS), indicates that 33.2 million [30.6-36.1 million] people were estimated to be living with HIV, 2.5 million [1.8-4.1 million] people became newly infected with HIV and 2.1 million [1.9-2.4 million] people died of AIDS in 2007.
HIV is characterized by a high genetic variability, due to the rapid viral turnover (1010-1012 viral particles produced per day) in an HIV-infected individual, combined with the high mutation rate arising during reverse transcription (10−4 per nucleotide). Two types of HIV, HIV-1 and HIV-2, which are closely related to each other, have been identified to date (Sharp et al., Philos Trans R Soc Lond B Biol Sci, 2001, 356, 867-76). Most AIDS worldwide is caused by the more virulent HIV-1, while HIV-2 is endemic in West Africa. Both viruses appear to have spread to humans from other primate species and the best evidence from sequence relationships suggests that HIV-1 has passed to humans on at least three independent occasions from the chimpanzee, Pan troglodytes and HIV-2 from the sooty mangabey, Cercocebus atys.
The three zoonotic transmissions that generated the HIV-1 type viruses gave rise to three different viral groups: M, O and N. The M group (for main), represents the substantial majority of worldwide infections. The O (for outlier) and N (for non-M/non-O) groups remain essentially restricted to Central Africa (Sharp et al., Philos Trans R Soc Lond B Biol Sci, 2001, 356, 867-76).
HIV is transmitted by direct sexual contact, by blood or blood products, and from an infected mother to infant, either intrapartum, perinatally, or via breast milk. Infection of humans with HIV-1 causes a dramatic decline in the number of CD4+ T lymphocytes. When the number of CD4+ cells is very reduced, opportunistic infections and neoplasms occur (Simon et al., Lancet, 2006, 368, 489-504).
Antiretroviral treatment for HIV infection consists of drugs which work by slowing down the replication of HIV in the body. Currently, there are around 30 antiretroviral drugs approved to treat people infected with HIV in various countries around the world. There are several classes of anti-HIV drugs that attack the virus in different ways and the most common classes of antiretrovirals are nucleoside or nucleotide reverse transcriptase inhibitors, non-nucleoside reverse transcriptase inhibitors, protease inhibitors and entry inhibitors (Flexner C, Nature Reviews Drug Discovery, 2007, 6, 959-966).
People with HIV need to continuously take antiretroviral drugs. Furthermore for antiretroviral treatment to be effective for a long time, it has been found that more than one antiretroviral drug must be taken at a time as single drug treatment regimes invariably lead to HIV resistance to the single drug negating its therapeutic effects.
Combination Therapy, wherein at least two and normally three different medicaments are taken simultaneously prolongs the period of time before resistance develops for one or more of the medicaments. The term Highly Active Antiretroviral Therapy (HAART) is used to describe a combination of three or more anti-HIV drugs. HAART typically combines drugs from at least two different classes of antiretroviral drugs and has been shown to effectively suppress the virus when used properly. Highly active antiretroviral therapy has revolutionalized how people infected with HIV are treated, and reduces the rate at which resistance develops.
Normally when anti-HIV treatment is started, the viral load drops to an undetectable level. When drug resistance develops, the amount of HIV in the blood rises and the risk of the person becoming ill increases and this usually means that the drug regimen needs to be changed (Martinez-Cajas and Wainberg, Drugs, 2008, 68, 43-72).
Currently available HIV treatments have converted HIV infection into a chronic disease, increasing the lifespan of infected individuals. Anti-HIV drugs can reduce the rate of viral replication, retarding therefore the onset of AIDS. Nevertheless, the emergence of strains resistant to these existing treatments, require the continual development of new therapeutic strategies (Rossi et al., Nat. Biotechnol., 2007, 25, 1444-54). Although there are currently no vaccines to prevent or treat HIV, researchers are developing and testing several potential HIV vaccines, either for preventive and/or therapeutic purposes. However, vaccine development encounters the same problem as anti-HIV drugs concerning the rapid viral evolution and the subsequent development of resistance or in the case of a vaccine an evolved HIV strain which no longer comprises the epitope used in the vaccine and hence is not affected by the immune response elicited by the vaccine. At the present time the general consensus in the scientific and medical community is that therapeutic HIV vaccines will not be able to completely eliminate HIV infection, because the virus “hides” in certain cells of the body, where it can last silent for decades meaning that any effect of the vaccine will have been lost.
A new field for the treatment of HIV infection is the development of genetic therapies against HIV. Gene therapy could allow the prevention of progressive HIV infection by persistently blocking viral replication. Gene-targeting strategies are being developed with RNA-based agents such as ribozymes, aptamers and small interfering RNAs and protein-based agents. Among the last group, the use of zinc-finger nucleases against the CCR5 receptor, a protein present on the surface of immune cells that is required to mediate viral entry, is currently in Phase I clinical trials. In this case, the disruption of the CCR5 receptor from the immune cells by the nucleases is proposed to render the patient's cells permanently resistant to CCR5-specific strains of HIV. This approach is based on the fact that people with natural mutations on this receptor are resistant to HIV infection.
To date however the number of effective anti-HIV/retrovirus therapies is very small, due in part to the limited number of target genes/proteins/pathways present in the relatively simple retrovirus genome/life cycle as well as to the rapid creation of ‘escape’ mutants by the retrovirus during replication which allow members of the virus population to evade therapeutic compounds that more slowly evolving pathogens such as bacteria or protozoa would not be able to develop resistance to with the same speed.
In addition due to the existence of dormant intragenomic copies of the provirus which are not affected by any current therapy, the curing of HIV infection (AIDS) is currently simply not possible.
An interesting target that has not been pursued in the fight against the AIDS pandemic and more generally retroviruses is the genomically integrated provirus and/or the reverse transcribed DNA version of the retrovirus genome prior to its integration, since targeting the proviral DNA could lead to the elimination or inactivation of the structure that allows the virus to multiply and the infection to propagate. One novel way to inactivate the provirus which the inventors have decided to investigate is by the use of nucleases that could cleave the integrated form of the virus and generate mutations and/or deletions in the provirus following the action of the cellular DNA repair machinery.
An important point to be considered in this kind of approach is the choice of the target sequences. In a first instance, the target sequences should be located in the coding sequences of essential genes, since the inactivation of an accessory gene may not lead to viral eradication. The viral genome also contains essential regulatory sequences that are located in the long terminal repeats (LTRs) that flank the viral genome in the provirus. Even if mutations in these regions would be expected to have a less drastic effect than a mutation in an essential gene, the fact that they are duplicated sequences could be useful in an approach of “virus clipping”, meaning the excision of long regions of the proviral DNA by the action of a nuclease cleaving twice in the viral sequence. Another important point that should be considered is the degree of sequence variation that is observed in the target sequences among different circulating viral isolates. As discussed above HIV is characterized by a high degree of sequence variability due to the nature of the viral reverse transcriptase. It is therefore essential to check the sequence conservation of the target among the different isolates.
The inventors have developed a new molecular medicine approach based on the inactivation of the retrovirus provirus through the use of tailored meganucleases specifically targeting the proviral DNA, using the HIV-1 provirus in the genome of the infected cell as a model. The principle of this new therapeutic strategy is that the tailored meganucleases against targets in the provirus will generate a double strand break (DSB) at their target sequences, chosen to be located in genes/regulatory sequences/structural sequences that are essential for the virus to replicate or alternatively target sequences which are present in multiple copies in the provirus, for instance in the two flanking LTR regions, so allowing the provirus or a portion thereof to be excised.
The epidemiology of HIV, particularly in sub Saharan Africa, makes research into the HIV virus a major and extremely active area of research. The manipulation of the HIV provirus is one area of research in which to date reagents have not been readily available as workers have instead concentrated on attempting to manipulate the HIV virion per se. Therefore the means to easily engineer the HIV provirus in situ in the genome of an infected cell/organism would likely provide valuable insights into this aspect of HIV biology and potentially open new avenues of attack in combating HIV.
Even if the meganuclease targets have been selected following the criteria mentioned above, namely in essential genes and particularly in sequences showing the highest degree of conservation, the capacity of the virus to generate escape mutants under the selective pressure of a drug/therapy must be considered.
To minimize the effect of drug resistance(s), “Combination Therapy” has already been shown to counter act this feature of HIV biology. In the same way, the possibility of using a combination of meganucleases could help to prevent any resistance that could be generated during viral replication. In addition although HIV shows a very high level of genetic change, not all of the components of the HIV genome are as capable of supporting change as others. Generally speaking it is those portions of the virus which are immunogenic, that is present upon the exterior of the virus particle where they can interact with the components of the hosts immune system, which are most able to support high levels of variability. Whereas the essential internal structural or packaging components of HIV are less able to continue to function following changes in their coding sequences. These differences do not affect the ability of HIV to evolve so as to elude the host immune response, but have proven useful in specifically engineering drugs for which it is more difficult for HIV to develop resistance. The increased levels of conservation of some provirus sequences can also be used to further hone the meganuclease(s) according to the present invention.
In vivo meganucleases are essentially represented by homing endonucleases. Homing Endonucleases (HEs) are a widespread family of natural meganucleases including hundreds of proteins families (Chevalier, B. S. and B. L. Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774). These proteins are encoded by mobile genetic elements which propagate by a process called “homing”: the endonuclease cleaves a cognate allele from which the mobile element is absent, thereby stimulating a homologous recombination event that duplicates the mobile DNA into the recipient locus. Given their exceptional cleavage properties in terms of efficacy and specificity, they could represent ideal scaffolds to derive novel, highly specific endonucleases.
HEs belong to four major families. The LAGLIDADG family (SEQ ID NO:373), named after a conserved peptide motif involved in the catalytic center, is the most widespread and the best characterized group. Seven structures are now available. Whereas most proteins from this family are monomeric and display two LAGLIDADG motifs (SEQ ID NO:373), a few have only one motif, and thus dimerize to cleave palindromic or pseudo-palindromic target sequences.
Although the LAGLIDADG peptide (SEQ ID NO:373) is the only conserved region among members of the family, these proteins share a very similar architecture (
The making of functional chimeric meganucleases, by fusing the N-terminal I-DmoI domain with an I-CreI monomer (Chevalier et al., Mol. Cell., 2002, 10, 895-905; Epinat et al., Nucleic Acids Res, 2003, 31, 2952-62; International PCT Applications WO 03/078619 and WO 2004/031346) have demonstrated the plasticity of LAGLIDADG proteins.
Different groups have also used a semi-rational approach to locally alter the specificity of the I-CreI (Seligman et al., Genetics, 1997, 147, 1653-1664; Sussman et al., J. Mol. Biol., 2004, 342, 31-41; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156; Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Rosen et al., Nucleic Acids Res., 2006, 34, 4791-4800; Smith et al., Nucleic Acids Res., 2006, 34, e149), I-SceI (Doyon et al., J. Am. Chem. Soc., 2006, 128, 2477-2484), PI-SceI (Gimble et al., J. Mol. Biol., 2003, 334, 993-1008) and I-MsoI (Ashworth et al., Nature, 2006, 441, 656-659).
In addition, hundreds of I-CreI derivatives with locally altered specificity were engineered by combining the semi-rational approach and High Throughput Screening:
Two different variants were combined and assembled in a functional heterodimeric endonuclease able to cleave a chimeric target resulting from the fusion of two different halves of each variant DNA target sequence (Arnould et al., precited; International PCT Applications WO 2006/097854 and WO 2007/034262), as illustrates in
Furthermore, residues 28 to 40 and 44 to 77 of I-CreI were shown to form two separable functional subdomains, able to bind distinct parts of a homing endonuclease half-site target sequence (Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/049095 and WO 2007/057781).
The combination of mutations from the two subdomains of I-CreI within the same monomer allowed the design of novel chimeric molecules (homodimers) able to cleave a palindromic combined DNA target sequence comprising the nucleotides at positions ±3 to 5 and ±8 to 10 which are bound by each subdomain (Smith et al., Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/049095 and WO 2007/057781).
The combination of the two former steps allows a larger combinatorial approach, involving four different subdomains. The different subdomains can be modified separately and combined to obtain an entirely redesigned meganuclease variant (heterodimer or single-chain molecule) with chosen specificity, as illustrated in
The method for producing meganuclease variants and the assays based on cleavage-induced recombination in mammal or yeast cells, which are used for screening variants with altered specificity are described in the International PCT Application WO 2004/067736; Epinat et al., Nucleic Acids Res., 2003, 31, 2952-2962; Chames et al., Nucleic Acids Res., 2005, 33, e178, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458. These assays result in a functional LacZ reporter gene which can be monitored by standard methods.
These variants can be used to cleave genuine chromosomal sequences and have paved the way for novel perspectives in several fields, including gene therapy.
Even though the base-pairs ±1 and ±2 do not display any contact with the protein, it has been shown that these positions are not devoid of content information (Chevalier et al., J. Mol. Biol., 2003, 329, 253-269), especially for the base-pair ±1 and could be a source of additional substrate specificity (Argast et al., J. Mol. Biol., 1998, 280, 345-353; Jurica et al., Mol. Cell., 1998, 2, 469-476; Chevalier, B. S. and B. L. Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774). In vitro selection of cleavable I-CreI targets (Argast et al., precited) randomly mutagenized, revealed the importance of these four base-pairs on protein binding and cleavage activity. It has been suggested that the network of ordered water molecules found in the active site was important for positioning the DNA target (Chevalier et al., Biochemistry, 2004, 43, 14015-14026). In addition, the extensive conformational changes that appear in this region upon I-CreI binding suggest that the four central nucleotides could contribute to the substrate specificity, possibly by sequence dependent conformational preferences (Chevalier et al., 2003, precited).
Therefore the inventors seeing the problems associated with retroviruses and in particular HIV, have generated a new class of reagents which can be used to specifically target and manipulate the retroviral provirus. This new class of anti-retroviral molecules can recognize and cleave the integrated provirus either in vitro or in vivo, these reagents can be used for a variety of purposes for instance in research as well as in novel treatment regimes.
According to a first aspect of the present invention there is provided an I-CreI variant which cleaves a target in the provirus of a pathogenic virus, for use in treating an infection of said virus.
The inventors therefore provide a set of I-CreI variants which can recognise and cut targets in a genomically integrated provirus (GIP). Such I-CreI variants provide a new therapeutic route to retrovirus and in particular HIV treatment by HIV provirus inactivation or alteration. This new class of enzymes is also potentially useful in studies into the transcriptional and regulatory behaviour of the provirus.
This new class of anti-HIV medicament can act in a number of ways including by non-homologous end joining, the replacement/removal by homologous recombination with an introduced DNA targeting construct of a portion of the provirus or the removal of the provirus following recombination between chromosome arms. Each of these different mechanisms is discussed in detail below.
In the present patent application the genomically integrated provirus (GIP) refers to the DNA sequence present in one or several places in the host cell genome which was inserted following reverse transcription of the RNA virus genome and its integration into the host genome.
In the present patent application the terms meganuclease (s) and variant (s) and variant meganuclease (s) will be used interchangeably herein.
The inventors have therefore created a new class of meganuclease based reagents which are useful for the treatment of a retrovirus infection and the most important and potentially useful feature of these enzymes is that instead of acting upon the virion or any component thereof they act upon the genomic insertion of the virus.
Targeting the integrated provirus would allow a clinician to eliminate the structure which leads to the generation of further viral particles, acting at a level that no other anti-viral therapeutic approaches have yet been developed. Conversely, prior art therapies which act upon the different steps of the viral life cycle allow to a clinician to inhibit viral replication, but do not eliminate the source of the virions, which therefore allows for the amplification of the viral infection when the treatment is withdrawn or resistance develops.
These variants also allow the targeting of the DNA version of the virus genome before it has integrated into the host cell genome. By inactivating the virus genome before it can integrate into the host cell genome, the claimed variants can act during the early step of cell infection in a way which no current antiretroviral medicament can.
The Inventors have validated this new class of anti-retrovirus reagents by generating meganuclease variants to a series of DNA targets in the genome of the HIV provirus (
These target sequences are present in the U3 and U5 LTR regions, the coding sequence of the structural gene gag and more specifically in the p7 and p24 proteins therein and in the structural gene pol, specifically in the protease gene. These seven targets were selected based on their therapeutic potential.
As mentioned before, one potential therapeutic approach would be to cleave both LTRs of the integrated provirus which would in turn lead to excision of the viral genome from the infected cells. The inventors have shown that it is possible to generate I-CreI variants which can cleave targets in the U3 (target HIV1—1 (SEQ ID NO:319)) and U5 (target HIV1—3 (SEQ ID NO:321)) LTRs in the present patent application.
An alternative therapeutic approach would be to targeting one or more essential genes, the p24 protein is a structural component of the viral capsid and is essential for the virus to replicate. The inventors have shown that it is possible to generate I-CreI variants which can cleave targets in the p24 gene (target HIV1—4 (SEQ ID NO:322)) and (target HIV1—7 (SEQ ID NO:366)) in the present patent application. These two targets do not overlap and hence these two enzymes could be used simultaneously so further reducing the chances of resistance developing and/or causing an excision of the portion of p24 situated between the two cleavage sites.
The HIV protease is also an essential protein that is needed for viral particle maturation, without which viral particles remain in an immature state and are not infectious. The inventors have shown that it is possible to generate I-CreI variants which can cleave targets in the protease gene (target HIV1—5 (SEQ ID NO:323)) and (target HIV1—9 (SEQ ID NO:368)) in the present patent application. These two targets do not overlap and hence these two enzymes could be used simultaneously so further reducing the chances of resistance developing and/or causing an excision of the portion of protease situated between the two cleavage sites.
The HIV nucleocapsid protein (p7, ou NC) is bound to the single-stranded RNA genome. This protein plays a key role in the HIV life cycle since, being an RNA chaperone, its activity is required for efficient reverse transcription, making it an interesting target for antiviral therapy. The inventors have shown that it is possible to generate I-CreI variants which can cleave targets in the p7 gene (target HIV1—8 (SEQ ID NO:367)).
The inventors have therefore established that meganuclease variants can be generated in both the sequences of essential genes as well as in regulatory non-coding sequences essential for viral replication.
These targets were also selected based on a screen on the “Los Alamos National Laboratory” Sequence database (www.hiv.lanl.gov) to determine their degree of conservation among circulating isolates, which showed a high degree of sequence conservation among the different viral strains for which the complete sequence of their genome is available.
In the present patent application essential genes of the GIP provirus are those genes which must remain active in order for the GIP provirus to be converted into further virions which are able to exit the host cell and infect further cells. In addition to essential genes, other types of essential genetic elements can exist such as the regulatory elements of essential genes and/or structural sequence elements of the HIV provirus that are necessary for its packaging and/or insertion into the genome.
According to a further aspect of the present invention the pathogenic virus is from a genus selected from the group consisiting of: Alpharetrovirus, Betaretrovirus, Gammaretrovirus, Deltaretrovirus, Epsilonretrovirus, Lentivirus and Spumavirus.
Multiple examples of genomic sequences for viruses of the specified types are available from public databases such as the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/) or the virus genomics and bioinformatics resources centre at University College London (http://www.biochem.ucl.ac.uk/bsm/virus_database/VIDA.html).
In particular the virus is selected from the group consisting of: Human T-lymphotrophic virus, Rous Sarcoma and Human Immunodeficiency Virus.
Most particularly the virus is either Human Immunodeficiency Virus Type 1 (HIV1) or Human Immunodeficiency Virus Type 2 (HIV2).
In particular the DNA target is within a DNA sequence essential for HIV replication, viability, packaging or virulence.
In particular the DNA target is within an essential gene or regulatory element or structural element of the HIV provirus.
In particular the DNA target is within the open reading frame of the HIV provirus encoding a gene or regulatory element of a gene selected from the group: GAG, POL, ENV, TAT and REV.
In particular the target in the HIV1 provirus is selected from the group consisting of the sequences SEQ ID NO: 319 to 342 and SEQ ID NO: 366 to 368.
In particular the variant is selected from one of the sequences SEQ ID NO: 1-13; SEQ ID NO: 26-46; SEQ ID NO: 59-85; SEQ ID NO: 88-94; SEQ ID NO: 97-165; SEQ ID NO: 168-174; SEQ ID NO: 177-186; SEQ ID NO: 189-238; SEQ ID NO: 241-242; SEQ ID NO: 245-253; SEQ ID NO: 256-316; SEQ ID NO: 346-365.
In particular the variant is characterized in that at least one of the two I-CreI monomers has at least two substitutions, one in each of the two functional subdomains of the LAGLIDADG core domain (SEQ ID NO:373) situated from positions; in particular said substitution(s) in the first functional subdomain comprise a substitution in at least one of positions 26, 28, 30, 32, 33, 38 and/or 40 and said substitution(s) in the second functional subdomain comprise a substitution in at least one of positions positions 44, 68, 70, 75 and/or 77 and being obtainable by a method comprising at least the steps of:
(a) constructing a first series of I-CreI variants having at least one substitution in a first functional subdomain of the LAGLIDADG core domain (SEQ ID NO:373) comprising at least one substitution at a position selected from the group: 26, 28, 30, 32, 33, 38 and/or 40 of I-CreI,
(b) constructing a second series of I-CreI variants having at least one substitution in a second functional subdomain of the LAGLIDADG core domain (SEQ ID NO:373) comprising at least one substitution at a position selected from the group: 44, 68, 70, 75 and/or 77 of I-CreI,
(c) selecting and/or screening the variants from the first series of step (a) which are able to cleave a DNA target sequence selected from the group SEQ ID NO: 319 to 342 and SEQ ID NO: 366 to 368, wherein at least one of (i) the nucleotide triplet in positions −10 to −8 of the I-CreI site has been replaced with the nucleotide triplet which is present in positions −10 to −8 of the selected DNA target sequence from said provirus and (ii) the nucleotide triplet in positions +8 to +10 has been replaced with the reverse complementary sequence of the nucleotide triplet which is present in position −10 to −8 of said DNA target sequence from said provirus,
(d) selecting and/or screening the variants from the second series of step (b) which are able to cleave a mutant I-CreI site wherein at least one of (i) the nucleotide triplet in positions −5 to −3 of the I-CreI site has been replaced with the nucleotide triplet which is present in positions −5 to −3 of said DNA target sequence from said provirus and (ii) the nucleotide triplet in positions +3 to +5 has been replaced with the reverse complementary sequence of the nucleotide triplet which is present in position −5 to −3 of said DNA target sequence from said provirus,
(e) selecting and/or screening the variants from the first series of step (a) which are able to cleave a mutant I-CreI site wherein at least one of (i) the nucleotide triplet in positions +8 to +10 of the I-CreI site has been replaced with the nucleotide triplet which is present in positions +8 to +10 of said DNA target sequence from said provirus and (ii) the nucleotide triplet in positions −10 to −8 has been replaced with the reverse complementary sequence of the nucleotide triplet which is present in positions +8 to +10 of said DNA target sequence from said provirus,
(f) selecting and/or screening the variants from the second series of step (b) which are able to cleave a mutant I-CreI site wherein at least one of (i) the nucleotide triplet in positions +3 to +5 of the I-CreI site has been replaced with the nucleotide triplet which is present in positions +3 to +5 of said DNA target sequence from said provirus and (ii) the nucleotide triplet in positions −5 to −3 has been replaced with the reverse complementary sequence of the nucleotide triplet which is present in positions +3 to +5 of said DNA target sequence from said provirus,
(g) combining in a single variant, the mutation(s) in positions 26, 28, 30, 32, 33, 38 and/or 40 and 44, 68, 70, 75 and/or 77 of two variants from step (c) and step (d), to obtain a novel homodimeric I-CreI variant which cleaves a sequence wherein (i) the nucleotide triplet in positions −10 to −8 is identical to the nucleotide triplet which is present in positions −10 to −8 of said DNA target sequence from said provirus, (ii) the nucleotide triplet in positions +8 to +10 is identical to the reverse complementary sequence of the nucleotide triplet which is present in positions −10 to −8 of said DNA target sequence from said provirus, (iii) the nucleotide triplet in positions −5 to −3 is identical to the nucleotide triplet which is present in positions −5 to −3 of said DNA target sequence from said provirus and (iv) the nucleotide triplet in positions +3 to +5 is identical to the reverse complementary sequence of the nucleotide triplet which is present in positions −5 to −3 of said DNA target sequence from said provirus, and/or
(h) combining in a single variant, the mutation(s) in positions 26, 28, 30, 32, 33, 38 and/or 40, and 44, 68, 70, 75 and/or 77 of two variants from step (e) and step (f), to obtain a novel homodimeric I-CreI variant which cleaves a sequence wherein (i) the nucleotide triplet in positions +8 to +10 of the I-CreI site has been replaced with the nucleotide triplet which is present in positions +8 to +10 of said DNA target sequence from said provirus and (ii) the nucleotide triplet in positions −10 to −8 is identical to the reverse complementary sequence of the nucleotide triplet in positions +8 to +10 of said DNA target sequence from said provirus, (iii) the nucleotide triplet in positions +3 to +5 is identical to the nucleotide triplet which is present in positions +3 to +5 of said DNA target sequence from said provirus, (iv) the nucleotide triplet in positions −5 to −3 is identical to the reverse complementary sequence of the nucleotide triplet which is present in positions +3 to +5 of said DNA target sequence from said provirus,
(i) combining the variants obtained in steps (g) and (h) to form heterodimers, and
(j) selecting and/or screening the heterodimers from step (i) which are able to cleave said DNA target sequence from said provirus.
A combinatorial approach, as illustrated schematically in
In particular the heterodimer of step (i) may comprise monomers obtained in steps (g) and (h), with the same DNA target recognition and cleavage activity properties.
Alternatively the heterodimer of step (i) may comprise monomers obtained in steps (g) and (h), with different DNA target recognition and cleavage activity properties.
In particular the first series of I-CreI variants of step (a) are derived from a first parent meganuclease.
In particular the second series of variants of step (b) are derived from a second parent meganuclease.
In particular the first and second parent meganucleases are identical.
Alternatively the first and second parent meganucleases are different.
In particular the variant may be obtained by a method comprising the additional steps of:
(k) selecting heterodimers from step (j) and constructing a third series of variants having at least one substitution in at least one of the monomers of said selected heterodimers,
(l) combining said third series variants of step (k) and screening the resulting heterodimers for enhanced cleavage activity against said DNA target from the GIP.
The inventors have found that although specific meganucleases can be generated to a particular target in the GIP using the above method, that such meganucleases can be improved further by the additional rounds of substitution and selection against the intended target. Meganuclease generated to targets in the GIP using other methods are also comprised within the present patent application.
In particular in said step (k) the substitutions in the third series of variants are introduced by site directed mutagenesis in a DNA molecule encoding said third series of variants, and/or by random mutagenesis in a DNA molecule encoding said third series of variants.
In the additional rounds of substitution and selection, the substitution of residues in the meganucleases can be performed randomly, that is wherein the chances of a substitution event occurring are equal chance across all the residues of the meganuclease. Or on a site directed basis wherein the chances of certain residues being subject to a substitution is higher than other residues.
In particular steps (k) and (l) are repeated at least two times and wherein the heterodimers selected in step (k) of each further iteration are selected from heterodimers screened in step (l) of the previous iteration which showed increased cleavage activity against said DNA target from the GIP.
The inventors have found that the meganucleases can be further improved by using multiple iterations of the additional steps (k) and (l).
Through the inventors work they have identified the residues in the first subdomain which when altered have most effect upon altering the I-CreI enzymes specificity.
Through the inventors work they have identified the residues in the second subdomain which when altered have most effect upon altering the I-CreI enzymes specificity.
In particular the variant comprises one or more substitutions in positions 137 to 143 of I-CreI that modify the specificity of the variant towards the nucleotide in positions ±1 to 2, ±6 to 7 and/or ±11 to 12 of the target site in the GIP.
In particular the variant comprises one or more substitutions on the entire I-CreI sequence that improve the binding and/or the cleavage properties of the variant towards said DNA target sequence from the GIP.
As well as specific mutations at the residue identified above, the present invention also encompasses the substitution of any of the residues present in the I-CreI enzyme.
In particular the variant is a heterodimer, resulting from the association of a first and a second monomer having different mutations in positions 26, 28, 30, 32, 33, 38 and/or 40, and 44, 68, 70, 75 and/or 77 of I-CreI, said heterodimer being able to cleave a non-palindromic DNA target sequence from the HIV provirus.
As explained above the I-CreI enzyme acts as a dimer, by ensuring that the variant is a heterodimer this allows a specific combination of two different I-CreI monomers which increases the possible targets cleaved by the variant.
In particular the heterodimeric variant is an obligate heterodimer variant having at least one pair of mutations in corresponding residues of the first and the second monomers which mediate an intermolecular interaction between the two I-CreI monomers, wherein the first mutation of said pair(s) is in the first monomer and the second mutation of said pair(s) is in the second monomer and said pair(s) of mutations impairs the formation of functional homodimers from each monomer without preventing the formation of a functional heterodimer, able to cleave the genomic DNA target from the HIV provirus.
The inventors have previously established a number of residue changes which can ensure an I-CreI monomer is an obligate heterodimer (WO2008/093249).
In particular the monomers have at least one of the following pairs of mutations, respectively for the first and the second monomer:
a) the substitution of the glutamic acid in position 8 with a basic amino acid, preferably an arginine (first monomer) and the substitution of the lysine in position 7 with an acidic amino acid, preferably a glutamic acid (second monomer); the first monomer may further comprise the substitution of at least one of the lysine residues in positions 7 and 96, by an arginine,
b) the substitution of the glutamic acid in position 61 with a basic amino acid, preferably an arginine (first monomer) and the substitution of the lysine in position 96 with an acidic amino acid, preferably a glutamic acid (second monomer); the first monomer may further comprise the substitution of at least one of the lysine residues in positions 7 and 96, by an arginine,
c) the substitution of the leucine in position 97 with an aromatic amino acid, preferably a phenylalanine (first monomer) and the substitution of the phenylalanine in position 54 with a small amino acid, preferably a glycine (second monomer); the first monomer may further comprise the substitution of the phenylalanine in position 54 by a tryptophane and the second monomer may further comprise the substitution of the leucine in position 58 or lysine in position 57, by a methionine, and
d) the substitution of the aspartic acid in position 137 with a basic amino acid, preferably an arginine (first monomer) and the substitution of the arginine in position 51 with an acidic amino acid, preferably a glutamic acid (second monomer).
In particular the variant, which is an obligate heterodimer, wherein the first and the second monomer, respectively, further comprises the D137R mutation and the R51D mutation.
In particular the variant, which is an obligate heterodimer, wherein the first monomer further comprises the K7R, E8R, E61R, K96R and L97F or K7R, E8R, F54W, E61R, K96R and L97F mutations and the second monomer further comprises the K7E, F54G, L58M and K96E or K7E, F54G, K57M and K96E mutations.
According to a further aspect of the present invention there is provided a single-chain chimeric meganuclease which comprises two monomers or core domains of one or two variant(s) according to the first aspect of the present invention, or a combination of both.
An alternative approach to ensuring that the variant consists of a specific combination of monomers is to link the selected monomers for instance using a peptide linker.
In particular the single-chain meganuclease comprises a first and a second monomer according to the first aspect of the present invention, connected by a peptidic linker.
According to a further aspect of the present invention the I-CreI variant is combined with other antiretroviral drugs.
Most antiretroviral drugs have at least three names. Sometimes a drug is referred to by its research or chemical name, such as AZT. The second name is the generic name for all drugs with the same chemical structure; for example AZT is also known as zidovudine. The third name is the brand name given by the pharmaceutical company; one of the brand names for zidovudine is Retrovir. Lastly, an abbreviation of the common name might sometimes also be used, such as ZDV, which is the fourth name given to zidovudine.
Lists of drugs approved for use in the USA are provided below:
Due to the constant evolution of resistance to existing HIV medicaments additional antiretroviral drugs continue to be developed and approved for the treatment of HIV infections.
In accordance with this further aspect of the present invention the I-CreI variant is combined with other antiretroviral agents such as those listed above or with other meganucleases directed against different targets in the HIV provirus.
According to a preferred embodiment of the present invention I-CreI variants according to the present invention are used only once the viral load of an individual has been reduced significantly using antiretroviral drugs. The I-CreI variants are then used to eliminate as many proviruses as possible whilst the HIV virus population is in its enforced dormant state.
Using this strategy it is conceivable that an existent HIV infection could be cured. Perhaps more likely the reduction in the number of active proviruses will lead to a decrease in the number of new virus particles being produced which in turn will reduce the chances of resistant virus particles being generated against any of the medicaments being used to suppress HIV replication. Allowing the use for longer periods of time of the medicaments, so reducing the chances that an individual will ever be infected with HIV particles which are resistant to all anti-HIV medicaments.
In accordance with a further aspect of the present invention there is also provided a kit of parts comprising at least one I-CreI according to the present invention either in the form of a peptide or a nucleotide encoding the variant(s) and one or more other anti-HIV medicaments, together with instructions for the administration of the variant and other anti-HIV medicaments to a patient.
According to the present invention, the meganuclease when used as a polypeptide is associated with:
Alternatively, the meganuclease in the form of a polynucleotide encoding said meganuclease in a vector. Vectors comprising targeting DNA and/or nucleic acid encoding a meganuclease can be introduced into a cell by a variety of methods (e.g., injection, direct uptake, projectile bombardment, liposomes, electroporation). Meganucleases can be stably or transiently expressed into cells using expression vectors. Techniques of expression in eukaryotic cells are well known to those in the art. (See Current Protocols in Human Genetics: Chapter 12 “Vectors For Gene Therapy” & Chapter 13 “Delivery Systems for Gene Therapy”). Optionally, it may be preferable to incorporate a nuclear localization signal into the recombinant protein to be sure that it is expressed within the nucleus.
The meganuclease may also comprise a nuclear localization signal (NLS) which is an amino acid sequence which acts like a ‘tag’ on the exposed surface of a protein. The NLS is used to target the protein to the cell nucleus through the Nuclear Pore Complex and to direct a newly synthesized protein into the nucleus via its recognition by cytosolic nuclear transport receptors. Typically, this signal consists of one or more short sequences of positively charged lysines or arginines.
According to a second aspect of the present invention there is provided a polynucleotide fragment encoding the variant according to the first aspect of the present invention or the single-chain chimeric meganuclease according to a second aspect of the present invention.
According to a third aspect of the present invention there is provided an expression vector comprising at least one polynucleotide fragment according to the second aspect of the present invention.
In particular the expression vector, includes a targeting construct comprising a sequence to be introduced flanked by sequences sharing homologies with the regions surrounding said DNA target sequence from the provirus.
One important use of a variant according to the present invention is in increasing the incidence of homologous recombination events at or around the site where the variant cleaves its target. The present invention therefore also relates to a unified genetic construct which encodes the variant under the control of suitable regulatory sequences as well as sequences homologous to portions of the provirus surrounding the variant DNA target site. Following cleavage of the target site by the variant these homologous portions can act as complementary sequences in a homologous recombination reaction with the provirus replacing the existing provirus sequence with a new sequence engineered between the two homologous portions in the unified genetic construct.
Preferably, homologous sequences of at least 50 bp, preferably more than 100 bp and more preferably more than 200 bp are used. Shared DNA homologies are located in regions flanking upstream and downstream the site of the break and the DNA sequence to be introduced should be located between the two arms.
Therefore, the targeting construct is preferably from 200 bp to 6000 bp, more preferably from 1000 bp to 2000 bp; it comprises: a sequence which has at least 200 bp of homologous sequence flanking the target site, for repairing the cleavage and a sequence for inactivating the provirus and/or a sequence of an exogenous gene of interest which it is intended to insert at the site of the DNA repair event by homologous recombination.
For the insertion of a sequence, DNA homologies are generally located in regions directly upstream and downstream to the site of the break (sequences immediately adjacent to the break; minimal repair matrix). However, when the insertion is associated with a deletion of ORF sequences flanking the cleavage site, shared DNA homologies are located in regions upstream and downstream the region of the deletion.
A vector which can be used in the present invention includes, but is not limited to, a viral vector, a plasmid, a RNA vector or a linear or circular DNA or RNA molecule which may consists of a chromosomal, non chromosomal, semi-synthetic or synthetic nucleic acids. Preferred vectors are those capable of autonomous replication (episomal vector) and/or expression of nucleic acids to which they are linked (expression vectors). Large numbers of suitable vectors are known to those of skill in the art and commercially available.
Viral vectors include retrovirus, adenovirus, parvovirus (e.g. adeno associated viruses), coronavirus, negative strand RNA viruses such as orthomyxovirus (e.g., influenza virus), rhabdovirus (e.g., rabies and vesicular stomatitis virus), paramyxovirus (e.g. measles and Sendai), positive strand RNA viruses such as picornavirus and alphavirus, and double-stranded DNA viruses including adenovirus, herpesvirus (e.g., Herpes Simplex virus types 1 and 2, Epstein-Barr virus, cytomegalovirus), and poxvirus (e.g., vaccinia, fowlpox and canarypox). Other viruses include Norwalk virus, togavirus, flavivirus, reoviruses, papovavirus, hepadnavirus, and hepatitis virus, for example. Examples of retroviruses include: avian leukosis-sarcoma, mammalian C-type, B-type viruses, D type viruses, HTLV-BLV group, lentivirus, spumavirus (Coffin, J. M., Retroviridae: The viruses and their replication, In Fundamental Virology, Third Edition, B. N. Fields, et al., Eds., Lippincott-Raven Publishers, Philadelphia, 1996).
Vectors can comprise selectable markers, for example: neomycin phosphotransferase, histidinol dehydrogenase, dihydrofolate reductase, hygromycin phosphotransferase, herpes simplex virus thymidine kinase, adenosine deaminase, glutamine synthetase, and hypoxanthine-guanine phosphoribosyl transferase (HRPT) for eukaryotic cell culture; TRP1 for S. cerevisiae; tetracycline, rifampicin or ampicillin resistance in E. coli.
In particular for the purposes of gene therapy and in accordance with a preferred embodiment of the present invention, the viral vector is selected from the group comprising lentiviruses, Adeno-associated viruses (AAV) and Adenoviruses.
In accordance with another aspect of the present invention the variant and targeting construct may be on different nucleic acid constructs.
In accordance with another aspect of the present invention the variant in the form of a peptide and the targeting construct as a nucleic acid molecule may be used in combination.
In particular, wherein said sequence to be introduced is a sequence which inactivates the HIV provirus.
In particular, wherein the sequence which inactivates the HIV provirus comprises in the 5′ to 3′ orientation: a first transcription termination sequence and a marker cassette including a promoter, the marker open reading frame and a second transcription termination sequence, and said sequence interrupts the transcription of the coding sequence.
In particular, wherein said sequence sharing homologies with the regions surrounding DNA target sequence is from the HIV provirus or a fragment of the HIV provirus comprising sequences upstream and downstream of the cleavage site, so as to allow the deletion of coding sequences flanking the cleavage site.
According to a fourth aspect of the present invention there is provided a host cell which is modified by a polynucleotide according to a second aspect of the present invention or a vector according to a third aspect of the present invention.
A cell according to the present invention may be made according to a method, comprising at least the step of:
(a) introducing into a cell, a meganuclease, as defined above, so as to induce a double stranded cleavage at a site of interest of the GIP comprising a DNA recognition and cleavage site of said meganuclease, and thereby generate a genomically modified cell having repaired the double-strands break, by non-homologous end joining, and
(b) isolating the genomically modified cell of step (a), by any appropriate mean.
The cell which is modified may be any cell of interest. For making transgenic/knock-out animals, the cells are pluripotent precursor cells such as embryo-derived stem (ES) cells, which are well-known in the art. For making recombinant cell lines, the cells may advantageously be human cells, for example PerC6 (Fallaux et al., Hum. Gene Ther. 9, 1909-1917, 1998) or HEK293 (ATCC # CRL-1573) cells or an immortal T lymphocyte line such as Jurkat (Schneider et al (1977). Int J Cancer 19 (5): 621-6.). The meganuclease can be provided directly to the cell or through an expression vector comprising the polynucleotide sequence encoding said meganuclease linked to regulatory sequences suitable for directing its expression in the cell used.
Such a modified cell line would have a number of potential uses including the elucidation of aspects of the biology of the modified GIP as well as a model for screening compounds and other substances for therapeutic effects against cells comprising the modified GIP.
According to a fifth aspect of the present invention there is provided a non-human transgenic animal which is modified by a polynucleotide according to a second aspect of the present invention or a vector according to a third aspect of the present invention.
The subject-matter of the present invention is also a method for making an animal which comprises a modified GIP, comprising at least the step of:
(a) introducing into a pluripotent precursor cell or an embryo of an animal, a meganuclease, as defined above, so as to induce a double stranded cleavage at a site of interest of the GIP comprising a DNA recognition and cleavage site of said meganuclease, and thereby generate a genomically modified precursor cell or embryo having repaired the double-strands break by non-homologous end joining,
(b) developing the genomically modified animal precursor cell or embryo of step (a) into a chimeric animal, and
(c) deriving a transgenic animal from a chimeric animal of step (b).
Alternatively, the GIP may be inactivated by insertion of a sequence of interest by homologous recombination between the genome of the animal and a targeting DNA construct according to the present invention.
In particular the targeting DNA is introduced into the cell under conditions appropriate for introduction of the targeting DNA into the site of interest.
In particular, step (b) comprises the introduction of the genomically modified precursor cell obtained in step (a), into blastocysts, so as to generate chimeric animals.
Such a transgenic animal could be used as a multicellular animal model to elucidate aspects of the biology of the GIP, by means of engineering the provirus present in the progenitor cell line. Such transgenic animals also could be used to screen and characterise the effects of for instance novel anti-HIV medicaments.
In particular the targeting DNA construct is inserted in a vector.
For making transgenic animals/recombinant cell lines, including human cell lines expressing an heterologous protein of interest, the targeting DNA comprises the sequence of the exogenous gene encoding the protein of interest, and eventually a marker gene, flanked by sequences upstream and downstream of and essential gene in the HIV provirus, as defined above, so as to generate genomically modified cells (animal precursor cell or embryo/animal or human cell) having replaced the HIV gene by the exogenous gene of interest, by homologous recombination.
The exogenous gene and the marker gene are inserted in an appropriate expression cassette, as defined above, in order to allow expression of the heterologous protein/marker in the transgenic animal/recombinant cell line.
The meganuclease can be used either as a polypeptide or as a polynucleotide construct encoding said polypeptide. It is introduced into somatic cells of an individual, by any convenient means well-known to those in the art, which are appropriate for the particular cell type, alone or in association with either at least an appropriate vehicle or carrier and/or with the targeting DNA.
Once in a cell, the meganuclease and if present, the vector comprising targeting DNA and/or nucleic acid encoding a meganuclease are imported or translocated by the cell from the cytoplasm to the site of action in the nucleus.
According to a sixth aspect of the present invention there is provided a transgenic plant which is modified by a polynucleotide according to a second aspect of the present invention or a vector according to a third aspect of the present invention.
According to a further aspect of the present invention there is provided the use of at least one variant or at least one single-chain chimeric meganuclease as defined above, or at least one vector according to the third aspect of the present invention, for genome engineering for non-therapeutic purposes.
In particular the variant or single-chain chimeric meganuclease or vector is associated with a targeting DNA construct.
In particular the use of the variant is for inducing a double-strand break in a site of interest within the GIP, thereby inducing a DNA recombination event, a DNA loss or cell death.
According to the invention, said double-strand break is for: modifying a specific sequence in the GIP, so as to induce restoration of a GIP function such as replication in studies upon the biology of the virus, or to attenuate or activate the GIP or a gene therein, introducing a mutation into a site of interest of a GIP gene, introducing an exogenous gene or a part thereof, inactivating or deleting the GIP or a part thereof or leaving the DNA unrepaired and degraded.
In particular this present aspect of the present invention relates to the use of a meganuclease variant to treat HIV infection, by inactivating the HIV provirus by therapeutic genome engineering.
According to one aspect of the present invention the use of the meganuclease according to the present invention, comprises at least the following steps:
1) introducing a double-strand break at at least one site of interest in the HIV provirus comprising at least one recognition and cleavage site of said meganuclease, by contacting said cleavage site with said meganuclease;
2) providing a targeting DNA construct comprising the sequence to be introduced flanked by sequences sharing homologies to the targeted locus.
Wherein the meganuclease is provided directly to the cell or through an expression vector comprising the polynucleotide sequence encoding of the meganuclease and is suitable for its expression in the host cell.
This strategy is used to introduce a DNA sequence at the target site, for example to generate a HIV provirus knock-in or knock-out animal model or cell lines that can be used for drug testing or in the case of a cell line, which can be used for administration into a patient from whom it was derived.
According to a further aspect of the present invention the use of the meganuclease, comprises at least the following steps:
1) introducing a double-strand break at a site of interest of the HIV provirus comprising at least one recognition and cleavage site of said meganuclease, by contacting said cleavage site with said meganuclease;
2) maintaining said broken genomic locus under conditions appropriate for homologous recombination with chromosomal DNA sharing homologies to regions surrounding the cleavage site.
As well as inactivating the provirus using a targeting construct, a significant number of inter chromosome arm recombination events are also expected to occur following cleavage of the provirus target. The recombination of chromosome arms occurs most frequently during mitosis, but can also occur as part of the repair mechanism for DNA strand breaks. Such an inter chromosome arm recombination event would either lead to the elimination of the non homologous portions on either side of the break (e.g. the provirus) or more likely cause portions of the provirus to be recombined onto different chromosome arms. In either event this would lead to the inactivation of the provirus.
According to still further aspect of the present invention the use of the meganuclease, comprises at least the following steps:
1) introducing a double-strand break at a site of interest of the HIV provirus comprising at least one recognition and cleavage site of said meganuclease, by contacting said cleavage site with said meganuclease;
2) maintaining said broken genomic locus under conditions appropriate for repair of the double-strands break by non-homologous end joining.
According to a further aspect of the present invention the variant is used for genome therapy to knock-out in animals/cells the GIP, in particular a sequence is introduced which inactivates the HIV provirus.
All HIV proviruses present in the cell have to be targeted in order to totally inactivate the pathogenicity of the virus. In addition, the introduced sequence may also delete the HIV provirus or part thereof, and introduce an exogenous gene or part thereof (knock-in/gene replacement). For making knock-in animals/cells the DNA which repairs the site of interest may comprise the sequence of an exogenous gene of interest, and a selection marker, such as the G418 resistance gene. Alternatively, the sequence to be introduced can be any other sequence used to alter the chromosomal DNA in some specific way including a sequence used to modify a specific sequence, to attenuate or activate the endogenous gene of interest in the HIV provirus or to introduce a mutation into a site of interest in the HIV provirus. Such chromosomal DNA alterations may be used for genome engineering (animal models and recombinant cell lines including human cell lines).
Inactivation of the HIV provirus may occur by insertion of a transcription termination signal that will interrupt the transcription of an essential gene such as GAG, POL and ENV and result in a truncated protein. In this case, the sequence to be introduced comprises, in the 5′ to 3′ orientation: at least a transcription termination sequence (polyA1), preferably said sequence further comprises a marker cassette including a promoter and the marker open reading frame (ORF) and a second transcription termination sequence for the marker gene ORF (polyA2). This strategy can be used with any variant cleaving a target downstream of the relevant gene promoter and upstream of the stop codon.
Inactivation of the HIV provirus may also occur by insertion of a marker gene within an essential gene of HIV, which would disrupt the coding sequence. The insertion can in addition be associated with deletions of ORF sequences flanking the cleavage site and eventually, the insertion of an exogenous gene of interest (gene replacement).
In addition, inactivation of the HIV provirus may also occur by insertion of a sequence that would destabilize the mRNA transcript of an essential gene.
The present invention also provides a composition characterized in that it comprises at least one variant as defined above (variant or single-chain derived chimeric meganuclease) and/or at least one expression vector encoding the variant, as defined above.
The administration of the provirus targeting variant in as both a peptide and nucleotide form allows for the immediate action of the variant as as its persistence in the target cell.
In particular the composition comprises more than one variant, wherein each of the variants is directed towards a different target sequence in the provirus.
In particular the composition comprises a targeting DNA construct comprising a sequence which inactivates the HIV provirus, flanked by sequences sharing homologies with the genomic DNA cleavage site of said variant, as defined above.
Preferably, said targeting DNA construct is either included in a recombinant vector or it is included in an expression vector comprising the polynucleotide(s) encoding the variant according to the invention.
The subject-matter of the present invention is also the use of at least one meganuclease and/or one expression vector, as defined above, for the preparation of a medicament for preventing, improving or curing HIV infection in an individual in need thereof.
The subject-matter of the present invention is also the use of at least one variant and/or one expression vector, as defined above, for the preparation of a medicament for preventing, improving or curing a pathological condition associated with HIV infection in an individual in need thereof.
As discussed above the variants according to the present invention provide a possible means to prevent chromosomal integration of a target cell with the retrovirus genome. The first step of the viral infection following viral entry into the target cell is the reverse transcription (RT) of the viral genomic RNA. During this RT process, a linear double stranded DNA molecule is formed which then enters the nucleus so that it can be integrated in the cellular genome. Meganuclease variants of the present invention are also able to cleave the pre-integration complex (PIC), which is an episomal double stranded DNA molecule, conferring a protective effect during the earliest steps of viral infection, of a cell population.
The use of the meganuclease may comprise at least the step of (a) inducing in somatic tissue(s) of the donor/individual a double stranded cleavage at a site of interest of the HIV provirus comprising at least one recognition and cleavage site of said meganuclease by contacting said cleavage site with said meganuclease, and (b) introducing into said somatic tissue(s) a targeting DNA, wherein said targeting DNA comprises (1) DNA sharing homologies to the region surrounding the cleavage site and (2) DNA which inactivates the HIV provirus upon recombination between the targeting DNA and the chromosomal DNA, as defined above. The targeting DNA is introduced into the somatic tissues(s) under conditions appropriate for introduction of the targeting DNA into the site of interest. The targeting construct may comprise sequences for deleting the HIV provirus or a portion thereof and introducing the sequence of an exogenous gene of interest (gene replacement).
In this case the use of the meganuclease comprises at least the step of: inducing in somatic tissue(s) of the donor/individual a double stranded cleavage at a site of interest of the HIV provirus comprising at least one recognition and cleavage site of the meganuclease by contacting the cleavage site with the meganuclease, and thereby inducing mutagenesis of an open reading frame in the HIV provirus by repair of the double-strands break by non-homologous end joining.
According to the present invention, said double-stranded cleavage may be induced, ex vivo by introduction of said meganuclease into infected cells isolated for instance from the circulatory system of the donor/individual and then transplantation of the modified cells back into the diseased individual.
The subject-matter of the present invention is also a method for preventing, improving or curing HIV infection, in an individual in need thereof, said method comprising at least the step of administering to said individual a composition as defined above, by any means.
For purposes of therapy, the meganucleases and a pharmaceutically acceptable excipient are administered in a therapeutically effective amount. Such a combination is said to be administered in a “therapeutically effective amount” if the amount administered is physiologically significant. An agent is physiologically significant if its presence results in a detectable change in the physiology of the recipient. In the present context, an agent is physiologically significant if its presence results in a decrease in the severity of one or more symptoms of the targeted HIV infection.
In particular as far as possible the meganuclease comprising compositions should be non-immunogenic, i.e., engender little or no adverse immunological response. A variety of methods for ameliorating or eliminating deleterious immunological reactions of this sort can be used in accordance with the invention. One means of achieving this is to ensure that the meganuclease is substantially free of N-formyl methionine. Another way to avoid unwanted immunological reactions is to conjugate meganucleases to polyethylene glycol (“PEG”) or polypropylene glycol (“PPG”) (preferably of 500 to 20,000 daltons average molecular weight (MW)). Conjugation with PEG or PPG, as described by Davis et al. (U.S. Pat. No. 4,179,337) for example, can provide non-immunogenic, physiologically active, water soluble endonuclease conjugates with anti-viral activity. Similar methods also using a polyethylene-polypropylene glycol copolymer are described in Saifer et al. (U.S. Pat. No. 5,006,333).
In accordance with a further aspect of the present invention, the invention also relates to meganuclease variants, related materials and uses thereof which recognize non-virus retroelements and/or the integrated genomes of viruses which do not have mechanisms to integrate into the host cell genome.
Non-virus retroelements are endogenous genomic DNA elements that include the gene for reverse transcriptase and are also known as class I transposable elements. These retrotransposons, include the long terminal repeat (LTR) retrotransposons, non-LTR retroposons and group II mitochondrial introns. They are though to be derived from partially inactivated retroviruses which have lost the ability to form infective virus particles. These genetic elements however are increasingly becoming associated with various diseases, in particular cancers and immune disorders which result form the integration of the element into a site close to a gene (s) whose misregulation leads to the observed disease phenotype.
The present invention therefore also relates to meganuclease variants which can be used to cleave a genomic retrotransposon either in a specific tissue or cell type or more generally so as to treat the disease phenotype using one or more of the mechanisms described above.
The present invention also relates to meganuclease variants which can recognise and cleave targets in genomic insertions of viruses which do not normally insert into the host cell genome. The non-specific insertion of viral genetic material into the host cell genome is a disease causing mechanism which is currently being investigated. For example in the important virus hepatitis B, chronic infection with this virus is associated with a greatly elevated risk of hepatocellular carcinoma. In the past this association has been explained as a side effect of the episomal hepatitis B genome upon the hepatocyte host cells. Although this is doubtless true, recently the random genomic insertion of copies of the hepatitis B genome into the host cell genome has also been shown to be a causative factor in hepatocyte carcinoma (Goodarzi et al., 2008, Hep. Mon; 8 (2): 129-133).
Hepatocellular carcinoma is one of the most common cancers in the world and hence a treatment for this condition, using a meganuclease variant which can cleave the randomly integrated hepatitis B genome and have a therapeutic affect upon hepatocytes via one or more of mechanisms detailed above is therefore also within the scope of the present invention as are more generally meganuclease variants to genomically integrated copies of virus genetic material which cause a disease phenotype.
Throughout the present patent application a number of terms and features are used to present and describe the present invention, to clarify the meaning of these terms a number of definitions are set out below and wherein a feature or term is not otherwise specifically defined or obvious from its context the following definitions apply.
The amount of p24 present in cell culture supernatants was determined by ELISA. A sample transfected by a non related meganuclease (NRM, see text) is used for normalization. In this way, the amount of p24 produced by these cells, expressed in fg/cell is considered as 100% of VLP production. The amount of p24 produced by HIV meganuclease transfected cells is represented as the percentage of VLP production respect to the amount produced by the NRM transfected cells. The values represent the data from at least 3 independent transfections.
There will now be described by way of example a specific mode contemplated by the Inventors. In the following description numerous specific details are set forth in order to provide a thorough understanding. It will be apparent however, to one skilled in the art, that the present invention may be practiced without limitation to these specific details. In other instances, well known methods and structures have not been described so as not to unnecessarily obscure the description.
The HIV1—1 target (SEQ ID NO:319) is a 22 bp (non-palindromic) target located in U3 region of the proviral LTRs (
The HIV1—1 sequence (SEQ ID NO:319) is partly a patchwork of the 10AGA_P (SEQ ID NO:381), 10TGG_P (SEQ ID NO:379), 5TAC_P (SEQ ID NO:389) and 5_CTG_P (SEQ ID NO:387) targets (these designations describe the 3 bp starting at the indicated nucleotide of the I-CreI target, for instance 10AGA_P (SEQ ID NO:381) indicates that nucleotides −10, −9 and −8 are A(−10) G(−9) A(−8) (
The 10AGA_P (SEQ ID NO:381), 10TGG_P (SEQ ID NO:379), 5TAC_P (SEQ ID NO:389) and 5_CTG_P (SEQ ID NO:387) target sequences are 24 bp derivatives of C1221, a palindromic sequence cleaved by I-CreI (Arnould et al., precited). However, the structure of I-CreI bound to its DNA target suggests that the two external base pairs of these targets (positions −12 and 12) have no impact on binding and cleavage (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269), and in this study, only positions −11 to 11 were considered. Consequently, the HIV1—1 series of targets (SEQ ID NO:319 to 324) were defined as 22 bp sequences instead of 24 bp. HIV1—1 (SEQ ID NO:319) differs from C1221 (SEQ ID NO: 343) in the 4 bp central region. According to the structure of the I-CreI protein bound to its target, there is no contact between the 4 central base pairs (positions −2 to 2) and the I-CreI protein (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269). Thus, the bases at these positions should not impact the binding efficiency. However, they could affect cleavage, which results from two nicks at the edge of this region. Thus, the ACAC sequence in −2 to 2 was first substituted with the GTAC sequence from C1221, resulting in target HIV1—1.2 (SEQ ID NO:320) (
This example shows that I-CreI variants can cut the HIV1—1.3 DNA target sequence (SEQ ID NO:321) derived from the left part of the HIV1—1.2 target (SEQ ID NO:320) in a palindromic form (
HIV1—1.3 (SEQ ID NO:321) is similar to 10AGA_P (SEQ ID NO:381) at positions ±1, ±2, +6, ±8, ±9, and ±10 and to 5TAC_P (SEQ ID NO:389) at positions ±1, +2, ±3, +4, ±5 and ±6. It was hypothesized that positions ±7 and ±11 would have little effect on the binding and cleavage activity. Variants able to cleave the 10AGA_P (SEQ ID NO:381) target were obtained by mutagenesis of I-CreI N75 or D75, at positions 28, 30, 32, 33, 38, 40 and 70, as described previously in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156. Variants able to cleave 5TAC_P (SEQ ID NO:389) were obtained by mutagenesis on I-CreI N75 at positions 24, 44, 68, 70, 75 and 77 as described in Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156.
Both sets of proteins are mutated at position 70. However, the existence of two separable functional subdomains was hypothesized. This implies that this position has little impact on the specificity at bases 10 to 8 of the target. Mutations at positions 24 found in variants cleaving the 5TAC_P target (SEQ ID NO:389) will be lost during the combinatorial process. But it was hypothesized that this will have little impact on the capacity of the combined variants to cleave the HIV1—1.3 target (SEQ ID NO:321).
Therefore, to check whether combined variants could cleave the HIV1—1.3 target (SEQ ID NO:321), mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TAC_P (SEQ ID NO:389) were combined with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10AGA_P (SEQ ID NO:381).
a) Construction of Target Vector
The target was cloned as follows: an oligonucleotide corresponding to the HIV1—1.3 (SEQ ID NO:321) target sequence flanked by gateway cloning sequences was ordered from PROLIGO:
5′ TGGCATACAAGTTTGCAGAACTACGTACGTAGTTCTGCCAATCGTCTGTCA 3′ (SEQ ID NO: 14). The same procedure was followed for cloning the HIV1—1.5 target (SEQ ID NO:323), using the oligonucleotide:
5′ TGGCATACAAGTTTGCAGAACTACACACGTAGTTCTGCCAATCGTCTGTCA 3′ (SEQ ID NO: 15). Double-stranded target DNA, generated by PCR amplification of the single stranded oligonucleotide, was cloned using the Gateway protocol (INVITROGEN) into the yeast reporter vector (pCLS1055,
b) Construction of Combinatorial Mutants
I-CreI variants cleaving 10AGA_P (SEQ ID NO:381) or 5TAC_P (SEQ ID NO:389) were previously identified, as described in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458; International PCT Applications WO 2006/097784 and WO 2006/097853, respectively for the 10AGA_P (SEQ ID NO:381) and 5TAC_P (SEQ ID NO:389) targets. In order to generate I-CreI derived coding sequences containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-43) or the 3′ end (positions 39-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using primers (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) specific to the vector (pCLS0542,
The PCR fragments resulting from the amplification reaction using the same primers and with the same coding sequence for residue 40 were pooled. Then, each pool of PCR fragments resulting from the reaction with primers Gal10F and assR or assF and Gal10R was mixed in an equimolar ratio. Finally, approximately 25 ng of each final pool of the two overlapping PCR fragments and 75 ng of vector DNA (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
Screening was performed as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458). Mating was performed using a colony gridder (QpixII, GENETIX). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of the reporter-harboring yeast strain. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software.
d) Sequencing of Variants
To recover the variant expression plasmids, yeast DNA was extracted using standard protocols and used to transform E. coli. Sequencing of variant ORFs was then performed on the plasmids by MILLEGEN SA. Alternatively, ORFs were amplified from yeast DNA by PCR (Akada et al., Biotechniques, 2000, 28, 668-670), and sequencing was performed directly on the PCR product by MILLEGEN SA.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TAC_P (SEQ ID NO:389) with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10AGA_P (SEQ ID NO:381) on the I-CreI scaffold, resulting in a library of complexity 1600. Examples of combinatorial variants are displayed in Table I. In Table I the peptide sequence of these two subdomains are provided in the first column and second row respectively.
This library was transformed into yeast and 3348 clones (2 times the diversity) were screened for cleavage against the HIV1—1.3 (SEQ ID NO:321) and HIV1—1.5 (SEQ ID NO:323) DNA targets. 36 positive clones were found to cleave the HIV1—1.3 target (SEQ ID NO:321), which after sequencing turned out to correspond to 31 different novel endonuclease variants (Table II). Those variants showed no cleavage activity of the HIV1—1.5 DNA target (SEQ ID NO:323). Examples of positives are shown in
This example shows that I-CreI variants can cleave the HIV1—1.4 DNA target sequence (SEQ ID NO:322) derived from the right part of the HIV1—1.2 target (SEQ ID NO:320) in a palindromic form (
HIV1—1.4 (SEQ ID NO:322) is similar to 5CTG_P (SEQ ID NO:387) at positions ±1, +2, ±3, ±4, ±5 and ±8 and to 10TGG_P (SEQ ID NO:379) at positions ±1, ±2, ±3, ±4, ±8, ±9 and ±10. It was hypothesized that positions ±6, ±7 and ±11 would have little effect on the binding and cleavage activity. Variants able to cleave 5CTG_P (SEQ ID NO:387) were obtained by mutagenesis of I-CreI N75 at positions 44, 68, 70, 75 and 77, as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156). Variants able to cleave the 10TGG_P target (SEQ ID NO:379) were obtained by mutagenesis of I-CreI N75 or D75, at positions 28, 30, 32, 33, 38, 40 and 70, as described previously in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156.
Both sets of proteins are mutated at position 70. However, the existence of two separable functional subdomains was hypothesized. This implies that this position has little impact on the specificity at bases 10 to 8 of the target.
Therefore, to check whether combined variants could cleave the HIV1—1.4 target (SEQ ID NO:322), mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TTC_P (SEQ ID NO:388) were combined with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10GGA_P (SEQ ID NO:380).
a) Construction of Target Vector
The experimental procedure is as described in example 2, with the exception that different oligonucleotides corresponding to the HIV1—1.4 (SEQ ID NO:322) and HIV1—1.6 (SEQ ID NO:324) targets. The oligonucleotide used for the HIV1—1.4 target (SEQ ID NO:322) was:
5′TGGCATACAAGTTTCCTGGCCCTGGTACCAGGGCCAGGCAATCGTCTGTCA 3′ (SEQ ID NO: 20),
and
5′TGGCATACAAGTTTCCTGGCCCTGACACCAGGGCCAGGCAATCGTCTGTCA 3′ (SEQ ID NO: 21) for HIV1—1.6 target (SEQ ID NO:324).
b) Construction of Combinatorial Variants
I-CreI variants cleaving 10TGG_P (SEQ ID NO:379) or 5CTG_P (SEQ ID NO:387) were previously identified, as described in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458; International PCT Applications WO 2006/097784 and WO 2006/097853, respectively for the 10TGG_P (SEQ ID NO:379) and 5CTG_P (SEQ ID NO:387) targets. In order to generate I-CreI derived coding sequences containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-43) or the 3′ end (positions 39-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using primers (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) specific to the vector (pCLS1107,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
Screening was performed as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458). Mating was performed using a colony gridder (QpixII, GENETIX). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of the reporter-harboring yeast strain. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking tryptophan, adding G418, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software. Positives resulting clones were verified by sequencing (MILLEGEN) as described in example 2.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5CTG_P (SEQ ID NO:387) with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10TGG_P (SEQ ID NO:379) on the I-CreI scaffold, resulting in a library of complexity 1600. Examples of combinatorial variants are displayed in Table III. This library was transformed into yeast and 3348 clones (2 times the diversity) were screened for cleavage against the HIV1—1.4 (SEQ ID NO:322) and HIV1—1.6 (SEQ ID NO:324) DNA targets. A total of 32 positive clones were found to cleave HIV1—1.4 (SEQ ID NO:322). Sequencing of these 32 clones allowed the identification of 25 novel endonuclease variants. One of those variants showed cleavage activity on the HIV1—1.6 DNA target (SEQ ID NO:324). Examples of positives are shown in
I-CreI variants able to cleave each of the palindromic HIV1—1.2 (SEQ ID NO:320) derived targets (HIV1—1.3 (SEQ ID NO:321) and HIV1—1.4 (SEQ ID NO:322)) were identified in example 2 and example 3. Pairs of such variants (one cutting HIV1—1.3 (SEQ ID NO:321) and one cutting HIV1—1.4 (SEQ ID NO:322)) were co-expressed in yeast. Upon co-expression, there should be three active molecular species, two homodimers, and one heterodimer. It was assayed whether the heterodimers that should be formed, cut the HIV1—1.2 (SEQ ID NO:320) and the non palindromic HIV1—1 (SEQ ID NO:319) targets.
a) Construction of Target Vector
The experimental procedure is as described in example 2, with the exception that an oligonucleotide corresponding to the HIV1—1.2 target sequence (SEQ ID NO:320):
5′TGGCATACAAGTTTGCAGAACTACGTACCAGGGCCAGGCAATCGTCTGTCA 3′ (SEQ ID NO: 22) or the HIV1—1 target sequence (SEQ ID NO:319):
5′TGGCATACAAGTTTGCAGAACTACACACCAGGGCCAGGCAATCGTCTGTCA 3′ (SEQ ID NO: 23) was used.
b) Co-Expression of Variants
Yeast DNA was extracted from variants cleaving the HIV1—1.4 target (SEQ ID NO:322) in the pCLS1107 expression vector using standard protocols and was used to transform E. coli. The resulting plasmid DNA was then used to transform yeast strains expressing a variant cutting the HIV1—1.3 target (SEQ ID NO:321) in the pCLS0542 expression vector. Transformants were selected on synthetic medium lacking leucine and containing G418.
c) Mating of Meganucleases Coexpressing Clones and Screening in Yeast
Mating was performed using a colony gridder (QpixII, Genetix). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of different reporter-harboring yeast strains for each target. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, adding G418, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software.
Co-expression of variants cleaving the HIV1—1.4 target (SEQ ID NO:322) (6 variants chosen among those described in Table III and Table IV) and six variants cleaving the HIV1—1.3 target (SEQ ID NO:321) (described in Tables I and II) resulted in cleavage of the HIV1—1.2 target (SEQ ID NO:320) in most of the cases (
I-CreI variants able to cleave the HIV1—1.2 (SEQ ID NO:320) and HIV1—1 (SEQ ID NO:319) target by assembly of variants cleaving the palindromic HIV1—1.3 (SEQ ID NO:321) and HIV1—1.4 (SEQ ID NO:322) target have been previously identified in example 4. However, these variants display stronger activity with the HIV1—1.2 target (SEQ ID NO:320) compared to the HIV1—1 target (SEQ ID NO:319).
Therefore six variants cleaving HIV1—1.3 (SEQ ID NO:321) were mutagenized, and variants were screened for cleavage activity of HIV1—1.3 (SEQ ID NO:321) and HIV1—1.5 (SEQ ID NO:323) targets. Additionally the mutants with the strongest activity were screened for cleavage activity of HIV1—1 (SEQ ID NO:319) when co-expressed with a variant cleaving HIV1—1.4 (SEQ ID NO:322). According to the structure of the I-CreI protein bound to its target, there is no contact between the 4 central base pairs (positions −2 to 2) and the I-CreI protein (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269). Thus, it is difficult to rationally choose a set of positions to mutagenize, and mutagenesis was performed on the whole protein. Random mutagenesis results in high complexity libraries. Therefore, to limit the complexity of the variant libraries to be tested, only one of the two components of the heterodimers cleaving HIV1_(SEQ ID NO:319) was mutagenized.
Thus, in a first step, proteins cleaving HIV1—1.3 (SEQ ID NO:321) were mutagenized and their homodimeric cleavage activity was determined, and in a second step, it was assessed whether they could cleave HIV1—1 (SEQ ID NO:319) when co-expressed with a protein cleaving HIV1—1.4 (SEQ ID NO:322).
a) Construction of Libraries by Random Mutagenesis
Random mutagenesis was performed on a pool of chosen variants, by PCR using Mn2+. PCR reactions were carried out that amplify the I-CreI coding sequence using the primers preATGCreFor (5′-gcataaattactatacttctatagacacgcaaacacaaatacacagcggccttgccacc-3′; SEQ ID NO: 24) and ICreIpostRev (5′-ggctcgaggagctcgtctagaggatcgctcgagttatcagtcggccgc-3′; SEQ ID NO: 25), which are common to the pCLS0542 (
b) Mating of Meganuclease Expressing Clones and Screening in Yeast
Mating was performed as previously described in example 2. Positive resulting clones were verified by sequencing (MILLEGEN) as described in example 2.
c) Variant-Target Yeast Strains, Screening and Sequencing
The yeast strain FYBL2-7B (MAT a, ura3Δ851, trp1Δ63, leu2Δ1, lys2Δ202) containing the HIV1—1 target (SEQ ID NO:319) in the yeast reporter vector (pCLS1055,
Six variants cleaving HIV1—1.3 (SEQ ID NO:321), were pooled, randomly mutagenized and transformed into yeast. The sequences of the variants subjected to random mutagenesis are described in table VII.
2232 transformed clones were screened for cleavage against the HIV1—1.3 (SEQ ID NO:321) and HIV1—1.5 (SEQ ID NO:323) DNA targets. A total of 297 positive clones were found to cleave HIV1—1.3 (SEQ ID NO:321), while only 6 of those cleaved the HIV1—1.5 target (SEQ ID NO:323). Sequencing of the 93 clones showing the strongest activity allowed the identification of 51 novel endonuclease variants. An example of the identified variants is presented in table VIII and in
132V 163R
111H
The 93 clones showing the highest cleavage activity on target HIV1—1.3 (SEQ ID NO:321) were then mated with a yeast strain that contains (i) the HIV1—1 target (SEQ ID NO:319) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—1.4 target (SEQ ID NO:322) (I-CreI 33T, 40K, 44R, 68Y, 70S, 77N+132V or KNSTQK/RYSDN+132V (SEQ ID NO:46), according to the nomenclature of Table I). After mating with this yeast strain, 41 clones were found to cleave the HIV1—1 target (SEQ ID NO:319) more efficiently than the original variant. Thus, 41 positives contained proteins able to form heterodimers with KNSTQK/RYSDN+132V (SEQ ID NO: 46), that showed cleavage activity on the HIV1—1 target (SEQ ID NO:319). An example of positive clones is shown in
In order to further improve the activity of the obtained meganucleases, a second round of random mutagenesis was carried out following the same rationale of example 5. For this purpose, four variants cleaving HIV1—1.3 (SEQ ID NO:321) were mutagenized, and variants were screened for cleavage activity of HIV1—1.3 (SEQ ID NO:321) and HIV1—1.5 (SEQ ID NO:323) targets. Additionally the mutants with the strongest activity were screened for cleavage activity of HIV1—1 (SEQ ID NO:319) when co-expressed with a variant cleaving HIV1—1.4 (SEQ ID NO:322).
The materials and methods have previously been described in example 5.
A) Results
Four variants cleaving HIV1—1.3 (SEQ ID NO:321), were pooled, randomly mutagenized and transformed into yeast. The four variants submitted to random mutagenesis correspond to variants described in Table VIII (SEQ ID NO: 26, 27, 28 and 29).
2232 transformed clones were screened for cleavage against the HIV1—1.3 (SEQ ID NO:321) and HIV1—1.5 (SEQ ID NO:323) DNA targets. A total of 79 positive clones were found to cleave HIV1—1.3 (SEQ ID NO:321), while 60 of those cleaved also the HIV1—1.5 target (SEQ ID NO:323). Sequencing of the 79 clones allowed the identification of 47 novel endonuclease variants. An example of the identified variants is presented in table IX and
The 79 clones showing cleaving target HIV1—1.3 (SEQ ID NO:321) were then mated with a yeast strain that contains (i) the HIV1—1 target (SEQ ID NO:319) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—1.4 target (SEQ ID NO:322) (I-CreI 33T, 40K, 44R, 68Y, 70S, 77N, 132V or KNSTQK/RYSDN+132V (SEQ ID NO:46), according to the nomenclature of Table I). After mating with this yeast strain, 76 clones were found to cleave the HIV1—1 target (SEQ ID NO:319). Thus, 76 positives contained proteins able to form heterodimers with KNSTQK/RYSDN+132V (SEQ ID NO: 46) showing cleavage activity on the HIV1—1 target (SEQ ID NO:319). An example of positives is shown in
89A 99R 111H 132V 155Q 163R
100E 105A
The I-CreI variants cleaving HIV1—1.3 (SEQ ID NO:321) described in Table IX issued from random mutagenesis in examples 5 and 5bis were also mutagenized by introducing selected amino-acid substitutions in the proteins and screening for more efficient variants cleaving HIV1—1 (SEQ ID NO:319) in combination with a variant cleaving HIV1—1.4 (SEQ ID NO:322).
Six amino-acid substitutions have been found in previous studies to enhance the activity of I-CreI derivatives: these mutations correspond to the replacement of Glycine 19 with Serine (G19S), Phenylalanine 54 with Leucine (F54L), Glutamic acid 80 with Lysine (E80K), Phenylalanine 87 with Leucine (F87L), Valine 105 with Alanine (V105A) and Isoleucine 132 with Valine (I132V). These mutations were introduced into the coding sequence of proteins cleaving HIV1—1.3 (SEQ ID NO:321), and the resulting proteins were tested for their ability to induce cleavage of the HIV1—1 target (SEQ ID NO:319), upon co-expression with a variant cleaving HIV1—1.4 (SEQ ID NO:322), as well as for the ability to cleave targets HIV1—1.3 (SEQ ID NO:321) and HIV1—1.5 (SEQ ID NO:323).
a) Site-Directed Mutagenesis
Site-directed mutagenesis libraries were created by PCR on a pool of chosen variants. For example, to introduce the G19S substitution into the coding sequence of the variants, two separate overlapping PCR reactions were carried out that amplify the 5′ end (residues 1-24) or the 3′ end (residues 14-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using a primer with homology to the vector (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) and a primer specific to the I-CreI coding sequence for amino acids 14-24 that contains the substitution mutation G19S (G19SF 5′-gccggctttgtggactctgacggtagcatcatc-3′ (SEQ ID NO: 47) or G19SR 5′-gatgatgctaccgtcagagtccacaaagccggc-3′ (SEQ ID NO: 48)).
The same strategy is used with the following pair of oligonucleotides to introduce the mutations leading to the F54L, E80K, F87L, V105A and II132V substitutions in the coding sequences of the variants, respectively:
For each substitution to be introduced, the resulting PCR products contain 33 bp of homology with each other. The PCR fragments were purified. The ten PCR fragments were pooled en equimolar amounts to generate a mix containing 50 ng of PCR DNA and 75 ng of vector DNA (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
The experimental procedure is as described in example 5.
d) Sequencing of Variants
The experimental procedure is as described in example 2.
A library containing a population harboring the six amino-acid substitutions (Glycine 19 with Serine, Phenylalanine 54 with Leucine, Glutamic acid 80 with Lysine, Phenylalanine 87 with Leucine, Valine 105 with Alanine and Isoleucine 132 with Valine) was constructed on a pool of five variants cleaving HIV1—1.3 (SEQ ID NO:321) (described in Table X). 558 transformed clones were screened for cleavage against the HIV1—1.3 (SEQ ID NO:321) and HIV1—1.5 (SEQ ID NO:323) DNA targets. A total of 395 positive clones were found to cleave HIV1—1.3 (SEQ ID NO:321), while 349 of those cleaved also the HIV1—1.5 target (SEQ ID NO:323). An example of positive variants is shown in
The 558 transformed clones were also mated with a yeast strain that contains (i) the HIV1—1 target (SEQ ID NO:319) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—1.4 target (SEQ ID NO:322) (I-CreI 33T, 40K, 44R, 68Y, 70S, 77N+132V or KNSTQK/RYSDN+132V (SEQ ID NO:46), according to the nomenclature of Table I). After mating with this yeast strain, 458 clones were found to cleave the HIV1—1 (SEQ ID NO:319). Thus, 458 positives contained proteins able to form heterodimers with KNSTQK/RYSDN+132V (SEQ ID NO: 46) showing cleavage activity on the HIV1—1 target (SEQ ID NO:319). An example of positives is shown in
Sequencing of the 186 clones with the highest cleavage activity on the HIV1—1 target (SEQ ID NO:319) allowed the identification of 138 different endonuclease variants.
The sequence of the five best I-CreI variants cleaving the HIV1—1 target (SEQ ID NO:319) when forming a heterodimer with the KNSTQK/RYSDN+132V variant (SEQ ID NO:46) are listed in Table XI.
As a complement to example 4 we also decided to perform random mutagenesis with variants that cleave HIV1—1.4 (SEQ ID NO:322). The mutagenized proteins cleaving HIV1—1.4 (SEQ ID NO:322) were then tested to determine if they could efficiently cleave HIV1—1 (SEQ ID NO:319) when co-expressed with a protein cleaving HIV1—1.3 (SEQ ID NO:321).
a) Construction of Libraries by Random Mutagenesis
Random mutagenesis was performed on a pool of chosen variants, by PCR using Mn2+. PCR reactions were carried out that amplify the I-CreI coding sequence using the primers preATGCreFor (5′-gcataaattactatacttctatagacacgcaaacacaaatacacagcggccttgccacc-3′; SEQ ID NO: 24) and ICreIpostRev (5′-ggctcgaggagctcgtctagaggatcgctcgagttatcagtcggccgc-3′; SEQ ID NO: 25). Approximately 25 ng of the PCR product and 75 ng of vector DNA (pCLS1107,
b) Variant-Target Yeast Strains, Screening and Sequencing
The yeast strain FYBL2-7B (MAT a, ura3Δ851, trp1Δ63, leu2Δ1, lys2Δ202) containing the HIV1—1 target (SEQ ID NO:319) in the yeast reporter vector (pCLS1055,
Six variants cleaving HIV1—1.4 (SEQ ID NO:322) were pooled, randomly mutagenized and transformed into yeast. The sequences of the variants subjected to random mutagenesis are described in table XII.
2232 transformed clones were screened for cleavage against the HIV1—1.4 (SEQ ID NO:322) and HIV1—1.6 DNA targets (SEQ ID NO:324). A total of 388 positive clones were found to cleave HIV1—1.4 (SEQ ID NO:322), while 88 of those also cleaved the HIV1—1.6 target (SEQ ID NO:324). Sequencing of the 89 clones showing the strongest activity allowed the identification of 50 novel endonuclease variants. An example of the identified variants is presented in table XIII and in
161P
The 89 clones showing the highest cleavage activity on target HIV1—1.4 (SEQ ID NO:322) were then mated with a yeast strain that contains (i) the HIV1—1 target (SEQ ID NO:319) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—1.3 target (SEQ ID NO:321) (I-CreI 30G, 38R, 44V, 68E, 75N, 77R, 54L, 80K, 81, 132V, 163R or KGSYRS/VERNR+54L+80K+81T+132V+163R (SEQ ID NO:26), according to the nomenclature of Table I). After mating with this yeast strain, 88 clones were found to cleave the HIV1—1 target (SEQ ID NO:319). Thus, 46 positives contained proteins able to form heterodimers with KGSYRS/VERNR+54L+80K+81T+132V+163R (SEQ ID NO: 26), that showed cleavage activity on the HIV1—1 target (SEQ ID NO:319). An example of positives is shown in
In order to further improve the activity of the obtained meganucleases, a second round of random mutagenesis was carried out following the same rationale of example 7. For this purpose, four variants cleaving HIV1—1.4 (SEQ ID NO:322) were mutagenized, and variants were screened for cleavage activity of HIV1—1.4 (SEQ ID NO:322) and HIV1—11.6 (SEQ ID NO:324) targets. Additionally the mutants with the strongest activity were screened for cleavage activity of HIV1—1 (SEQ ID NO:319) when co-expressed with a variant cleaving HIV1—1.3 (SEQ ID NO:321).
The materials and methods have previously been described in example 7.
Four variants cleaving HIV1—1.4 (SEQ ID NO:322), were pooled, randomly mutagenized and transformed into yeast. The four variants submitted to random mutagenesis correspond to variants described in Table XIII (SEQ ID NO: 46, 68, 69 and 71).
2232 transformed clones were screened for cleavage against the HIV1—1.4 (SEQ ID NO:322) and HIV1—1.6 DNA (SEQ ID NO:324) targets. A total of 59 positive clones were found to cleave HIV1—1.4 (SEQ ID NO:322), while 16 of those cleaved also the HIV1—1.6 (SEQ ID NO:324) target. Sequencing of the 49 clones allowed the identification of 35 novel endonuclease variants. An example of the identified variants is presented in table XIV and
The 59 clones showing cleaving target HIV1—1.4 (SEQ ID NO:322) were then mated with a yeast strain that contains (i) the HIV1—1 target (SEQ ID NO:319) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—1.3 target (SEQ ID NO:321) (I-CreI 30G, 38R, 44N, 68Y, 70S, 75R, 77Y+79N or KGSYRS/NYSRY+79N (SEQ ID NO:28), according to the nomenclature of Table I). After mating with this yeast strain, 42 clones were found to cleave the HIV1—1 (SEQ ID NO:319). Thus, 42 positives contained proteins able to form heterodimers with KGSYRS/NYSRY+79N (SEQ ID NO: 28) showing cleavage activity on the HIV1—1 target (SEQ ID NO:319). An example of positives is shown in
The HIV1—3 target (SEQ ID NO:321) is a 22 bp (non-palindromic) target located in U5 region of the proviral LTRs. Since the LTRs are duplicated sequences flanking the viral ORFs in the integrated provirus, the HIV1—3 target (SEQ ID NO:321) is present twice in the HIV1 provirus. This target is precisely located at positions 599-620 and 9674-9695 of the HIV-1 pNL4-3 vector (accession number AF324493, Adachi et al., J. Virol., 1986, 59, 284-291), a subtype B infectious molecular clone.
The HIV1—3 sequence (SEQ ID NO: 325) is partly a patchwork of the 10CAG_P (SEQ ID NO:374), 10ACA_P (SEQ ID NO:375), 5CCT_P (SEQ ID NO:384) and 5_GAC_P (SEQ ID NO:385) targets (
The 10CAG_P (SEQ ID NO:374), 10ACA_P (SEQ ID NO:375), 5CCT_P (SEQ ID NO:384) and 5_GAC_P (SEQ ID NO:385) target sequences are 24 bp derivatives of C1221, a palindromic sequence cleaved by I-CreI (Arnould et al., precited). However, the structure of I-CreI bound to its DNA target suggests that the two external base pairs of these targets (positions −12 and 12) have no impact on binding and cleavage (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269), and in this study, only positions −11 to 11 were considered. Consequently, the HIV1—3 series of targets (SEQ ID NO:325 to 330) were defined as 22 bp sequences instead of 24 bp. HIV1—3 (SEQ ID NO:325) differs from C1221 (SEQ ID NO:343) in the 4 bp central region. According to the structure of the I-CreI protein bound to its target, there is no contact between the 4 central base pairs (positions −2 to 2) and the I-CreI protein (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269). Thus, the bases at these positions should not impact the binding efficiency. However, they could affect cleavage, which results from two nicks at the edge of this region. Thus, the TTTA sequence in −2 to 2 was first substituted with the GTAC sequence from C1221 (SEQ ID NO:343), resulting in target HIV1—3.2 (SEQ ID NO: 326,
This example shows that I-CreI variants can cut the HIV1—3.3 target (SEQ ID NO:327) sequence derived from the left part of the HIV1—3.2 target (SEQ ID NO:326) in a palindromic form (
HIV1—3.3 (SEQ ID NO:327) is similar to 10CAG_P (SEQ ID NO:374) at positions ±1, ±2, ±6, ±8, ±9, and ±10 and to 5CCT_P (SEQ ID NO:384) at positions ±1, ±2, ±3, ±4, ±5 and ±6. It was hypothesized that positions ±7 and ±11 would have little effect on the binding and cleavage activity. Variants able to cleave the 10CAG_P (SEQ ID NO:374) target were obtained by mutagenesis of I-CreI N75 or D75, at positions 28, 30, 32, 33, 38, 40 and 70, as described previously in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156. Variants able to cleave 5CCT_P (SEQ ID NO:384) were obtained by mutagenesis on I-CreI N75 at positions 24, 44, 68, 70, 75 and 77 as described in Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156.
Both sets of proteins are mutated at position 70. However, the existence of two separable functional subdomains was hypothesized. This implies that this position has little impact on the specificity at bases 10 to 8 of the target. Mutations at positions 24 found in variants cleaving the 5CCT_P (SEQ ID NO:384) target will be lost during the combinatorial process. But it was hypothesized that this will have little impact on the capacity of the combined variants to cleave the HIV1—3.3 target (SEQ ID NO:327).
Therefore, to check whether combined variants could cleave the HIV1—3.3 target (SEQ ID NO:327), mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5CCT_P (SEQ ID NO:384) were combined with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10CAG_P (SEQ ID NO:374).
a) Construction of Target Vector
The target was cloned as follows: an oligonucleotide corresponding to the HIV1—3.3 target sequence (SEQ ID NO:327) flanked by gateway cloning sequences was ordered from PROLIGO:
5′ TGGCATACAAGTTTCTCAGACCCTGTACAGGGTCTGAGCAATCGTCTGTCA 3′ (SEQ ID NO: 86). The same procedure was followed for cloning the HIV1—3.5 target (SEQ ID NO:329), using the oligonucleotide:
5′ TGGCATACAAGTTTCTCAGACCCTTTTAAGGGTCTGAGCAATCGTCTGTCA 3′ (SEQ ID NO: 87). Double-stranded target DNA, generated by PCR amplification of the single stranded oligonucleotide, was cloned using the Gateway protocol (INVITROGEN) into the yeast reporter vector (pCLS1055,
b) Construction of Combinatorial Mutants
I-CreI variants cleaving 10CAG_P (SEQ ID NO:374) or 5CCT_P (SEQ ID NO:384) were previously identified, as described in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458; International PCT Applications WO 2006/097784 and WO 2006/097853, respectively for the 10AGA_P (SEQ ID NO:381) and 5TAC_P (SEQ ID NO:389) targets. In order to generate I-CreI derived coding sequences containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-43) or the 3′ end (positions 39-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using primers (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) specific to the vector (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
Screening was performed as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458). Mating was performed using a colony gridder (QpixII, GENETIX). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of the reporter-harboring yeast strain. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software.
d) Sequencing of Variants
To recover the variant expression plasmids, yeast DNA was extracted using standard protocols and used to transform E. coli. Sequencing of variant ORFs was then performed on the plasmids by MILLEGEN SA. Alternatively, ORFs were amplified from yeast DNA by PCR (Akada et al., Biotechniques, 2000, 28, 668-670), and sequencing was performed directly on the PCR product by MILLEGEN SA.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5CCT_P (SEQ ID NO:384) with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10CAG_P (SEQ ID NO:374) on the I-CreI scaffold, resulting in a library of complexity 1600. Examples of combinatorial variants are displayed in Table XV. This library was transformed into yeast and 3348 clones (2 times the diversity) were screened for cleavage against the HIV1—3.3 (SEQ ID NO:327) and HIV1—3.5 (SEQ ID NO:329) DNA targets. 10 positive clones were found to cleave the HIV1—3.3 target (SEQ ID NO:327), which after sequencing turned out to correspond to 7 different novel endonuclease variants (Table XVI). These variants showed no cleavage activity of the HIV1—3.5 DNA target (SEQ ID NO:329). Examples of positives are shown in
This example shows that I-CreI variants can cleave the HIV1—3.4 DNA target sequence (SEQ ID NO:328) derived from the right part of the HIV1—3.2 target (SEQ ID NO:326) in a palindromic form (
HIV1—3.4 (SEQ ID NO:328) is similar to 5GAC_P (SEQ ID NO:385) at positions ±1, ±2, ±3, ±4, ±5 and ±8 and to 10ACA_P (SEQ ID NO:375) at positions ±1, ±2, ±3, ±4, ±8, ±9 and ±10. It was hypothesized that positions ±6, ±7 and ±11 would have little effect on the binding and cleavage activity. Variants able to cleave 5GAC_P (SEQ ID NO:385) were obtained by mutagenesis of I-CreI N75 at positions 44, 68, 70, 75 and 77, as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156). Variants able to cleave the 10ACA_P target (SEQ ID NO:375) were obtained by mutagenesis of I-CreI N75 or D75, at positions 28, 30, 32, 33, 38, 40 and 70, as described previously in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156.
Both sets of proteins are mutated at position 70. However, the existence of two separable functional subdomains was hypothesized. This implies that this position has little impact on the specificity at bases 10 to 8 of the target.
Therefore, to check whether combined variants could cleave the HIV1—3.4 target (SEQ ID NO:328), mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5GAC_P (SEQ ID NO:385) were combined with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10ACA_P (SEQ ID NO:375).
a) Construction of Target Vector
The experimental procedure is as described in example 2, with the exception that different oligonucleotides corresponding to the HIV1—3.4 (SEQ ID NO:328) and HIV1—3.6 targets (SEQ ID NO:330). The oligonucleotide used for the HIV1—3.4 target (SEQ ID NO:328) was:
5′ TGGCATACAAGTTTCCACACTGACGTACGTCAGTGTGGCAATCGTCTGTCA 3′ (SEQ ID NO: 95), and
5′ TGGCATACAAGTTTCCACACTGACTTTAGTCAGTGTGGCAATCGTCTGTCA 3′ (SEQ ID NO: 96) for HIV1—3.6 target (SEQ ID NO:330).
b) Construction of Combinatorial Variants
I-CreI variants cleaving 100ACA_P (SEQ ID NO:375) or 5GAC_P (SEQ ID NO:385) were previously identified, as described in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458; International PCT Applications WO 2006/097784 and WO 2006/097853, respectively for the 10ACA_P (SEQ ID NO:375) and 5GAC_P (SEQ ID NO:385) targets. In order to generate I-CreI derived coding sequences containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-43) or the 3′ end (positions 39-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using primers (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) specific to the vector (pCLS1107,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
Screening was performed as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458). Mating was performed using a colony gridder (QpixII, GENETIX). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of the reporter-harboring yeast strain. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking tryptophan, adding G418, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software. Positives resulting clones were verified by sequencing (MILLEGEN) as described in example 2.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5GAC_P (SEQ ID NO:385) with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10ACA_P (SEQ ID NO:375) on the I-CreI scaffold, resulting in a library of complexity 2280. Examples of combinatorial variants are displayed in Table XVII. This library was transformed into yeast and 3348 clones (1.5 times the diversity) were screened for cleavage against the HIV1—3.4 (SEQ ID NO:328) and HIV1—3.6 DNA (SEQ ID NO:330) targets. A total of 305 positive clones were found to cleave HIV1—3.4 (SEQ ID NO:328), and two of those variants showed cleavage activity on the HIV1—3.6 (SEQ ID NO:330) target. DNA Sequencing of these 93 strongest clones allowed the identification of 64 novel endonuclease variants. Examples of positives are shown in
I-CreI variants able to cleave each of the palindromic HIV1—3.2 (SEQ ID NO:326) derived targets (HIV1—3.3 (SEQ ID NO:327) and HIV1—3.4 (SEQ ID NO:328)) were identified in example 9 and example 10. Pairs of such variants (one cutting HIV1—3.3 (SEQ ID NO:327) and one cutting HIV1—3.4 (SEQ ID NO:328)) were co-expressed in yeast. Upon co-expression, there should be three active molecular species, two homodimers, and one heterodimer. It was assayed whether the heterodimers that should be formed, cut the HIV1—3.2 (SEQ ID NO:326) and the non palindromic HIV1—3 (SEQ ID NO:325) targets.
a) Construction of Target Vector
The experimental procedure is as described in example 9, with the exception that an oligonucleotide corresponding to the HIV1—3.2 target sequence (SEQ ID NO:326): 5′-TGGCATACAAGTTTCTCAGACCCTGTACGTCAGTGTGGCAATCGTCTGTCA 3′ (SEQ ID NO: 317) or the HIV1—3 target sequence (SEQ ID NO:325):
5′ TGGCATACAAGTTTCTCAGACCCTTTTAGTCAGTGTGGCAATCGTCTGTCA 3′ (SEQ ID NO: 318) was used.
b) Co-Expression of Variants
Yeast DNA was extracted from variants cleaving the HIV1—3.4 (SEQ ID NO:328) target in the pCLS1107 expression vector using standard protocols and was used to transform E. coli. The resulting plasmid DNA was then used to transform yeast strains expressing a variant cutting the HIV1—3.3 (SEQ ID NO:327) target in the pCLS0542 expression vector. Transformants were selected on synthetic medium lacking leucine and containing G418.
c) Mating of Meganucleases Coexpressing Clones and Screening in Yeast
Mating was performed using a colony gridder (QpixII, Genetix). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of different reporter-harboring yeast strains for each target. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, adding G418, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software.
Co-expression of variants cleaving the HIV1—3.4 target (SEQ ID NO:328) (4 variants) and five variants cleaving the HIV1—3.3 target (SEQ ID NO:327) didn't result in cleavage of the HIV1—3 target (SEQ ID NO:325), though most of the couples were able to cleave the HIV1—3.2 target (SEQ ID NO:326).
I-CreI variants able to cleave the HIV1—3.3 target (SEQ ID NO:327) have been previously identified in example 9.
These variants display, however, weak cleavage activity and where therefore mutagenized in order to improve their activity. Four mutants were selected for random mutagenesis and the variants obtained were screened for cleavage activity of HIV1—3.3 (SEQ ID NO:327) and HIV1—3.5 (SEQ ID NO:329) targets. According to the structure of the I-CreI protein bound to its target, there is no contact between the 4 central base pairs (positions −2 to 2) and the I-CreI protein (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269). Thus, it is difficult to rationally choose a set of positions to mutagenize, and mutagenesis was performed on the whole protein.
a) Construction of Libraries by Random Mutagenesis
Random mutagenesis was performed on a pool of chosen variants, by PCR using Mn2+. PCR reactions were carried out that amplify the I-CreI coding sequence using the primers preATGCreFor (5′-gcataaattactatacttctatagacacgcaaacacaaatacacagcggccttgccacc-3′; SEQ ID NO: 24) and ICreIpostRev (5′-ggctcgaggagctcgtctagaggatcgctcgagttatcagtcggccgc-3′; SEQ ID NO: 25), which are common to the pCLS0542 (
b) Mating of Meganuclease Expressing Clones and Screening in Yeast
Experiments were performed as previously described in example 9. Positive resulting clones were verified by sequencing (MILLEGEN) as described in example 9.
Four variants cleaving HIV1—3.3 (SEQ ID NO:327), were pooled, randomly mutagenized and transformed into yeast. The sequences of the variants subjected to random mutagenesis are described in table XVI (SEQ ID 88 to 91).
2232 transformed clones were screened for cleavage against the HIV1—3.3 (SEQ ID NO:327) and HIV1—3.5 (SEQ ID NO:329) DNA targets. A total of 51 positive clones were found to cleave HIV1—3.3 (SEQ ID NO:327), while none of those cleaved the HIV1—3.5 target (SEQ ID NO:329). Sequencing of the 51 clones allowed the identification of 35 novel endonuclease variants. An example of the identified variants is presented in table XIX and in
33S 38Y 40R 44K 68S 70N 102V
33S 38Y 40R 44K 68S 70N
24V 32K 33A 35Y 44K 68E 70S 75N 77R
In order to further improve the activity of the obtained meganucleases, a second round of random mutagenesis was carried out following the same rationale of example 7bis. For this purpose, ten variants cleaving HIV1—3.3 (SEQ ID NO:327) were mutagenized, and variants were screened for cleavage activity of HIV1—3.3 (SEQ ID NO:327) and HIV1—3.5 (SEQ ID NO:329) targets. The materials and methods have previously been described in example 11.
Ten variants cleaving HIV1—3.3 (SEQ ID NO:327), were pooled, randomly mutagenized and transformed into yeast. The variants submitted to random mutagenesis correspond to variants described in Table XIX (SEQ ID NO: 105 to 114).
2232 transformed clones were screened for cleavage against the HIV1—3.3 (SEQ ID NO:327) and HIV1—3.5 (SEQ ID NO:329) DNA targets. A total of 262 positive clones were found to cleave HIV1—3.3 (SEQ ID NO:327), while 24 of those cleaved also, though weakly, the HIV1—3.5 target (SEQ ID NO:329). Sequencing of the 93 clones showing the strongest cleavage activity in the HIV1—3.3 target (SEQ ID NO:327) allowed the identification of 69 novel endonuclease variants. An example of the identified variants is presented in table XX and
129A 154C 158Q
16L 32K 33A 43L 44K 50R 68E 70S 75N 77R 81V
7E 33S 38Y 40R 44K 68E 70S 75Y 77R 96R 105A
2S 32K 33A 44K 68E 70S 75N 77R 132N
Five I-CreI variants cleaving HIV1—3.3 (SEQ ID NO:327) after two cycles of random mutagenesis (examples 12 and 12bis) were mutagenized by introducing selected amino-acid substitutions in the proteins and screening for more efficient variants cleaving HIV1—3 (SEQ ID NO:325) in combination with a variant cleaving HIV1—3.4 (SEQ ID NO:328).
Six amino-acid substitutions have been found in previous studies to enhance the activity of I-CreI derivatives: these mutations correspond to the replacement of Glycine 19 with Serine (G19S), Phenylalanine 54 with Leucine (F54L), Glutamic acid 80 with Lysine (E80K), Phenylalanine 87 with Leucine (F87L), Valine 105 with Alanine (V105A) and Isoleucine 132 with Valine (I132V). These mutations were introduced into the coding sequence of proteins cleaving HIV1—3.3 (SEQ ID NO:327), and the resulting proteins were tested for their ability to induce cleavage of the HIV1—3 target (SEQ ID NO:325), upon co-expression with a variant cleaving HIV1—3.4 (SEQ ID NO:328), as well as for the ability to cleave targets HIV1—3.3 (SEQ ID NO:327) and HIV1—3.5 (SEQ ID NO:329).
a) Site-Directed Mutagenesis
Site-directed mutagenesis libraries were created by PCR on a pool of chosen variants. For example, to introduce the G19S substitution into the coding sequence of the variants, two separate overlapping PCR reactions were carried out that amplify the 5′ end (residues 1-24) or the 3′ end (residues 14-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using a primer with homology to the vector (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17) and a primer specific to the I-CreI coding sequence for amino acids 14-24 that contains the substitution mutation G19S (G19SF 5′-gccggctttgtggactctgacggtagcatcatc-3′ (SEQ ID NO: 47) or G19SR 5′-gatgatgctaccgtcagagtccacaaagccggc-3′ (SEQ ID NO: 48)).
The same strategy is used with the following pair of oligonucleotides to introduce the mutations leading to the F54L, E80K, F87L, V105A and I132V substitutions in the coding sequences of the variants, respectively:
For each substitution to be introduced, the resulting PCR products contain 33 bp of homology with each other. The PCR fragments were purified. The ten PCR fragments were pooled en equimolar amounts to generate a mix containing 50 ng of PCR DNA and 75 ng of vector DNA (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
The experimental procedure is as described in example 11.
d) Sequencing of Variants
The experimental procedure is as described in example 9.
A library containing a population harboring the six amino-acid substitutions (Glycine 19 with Serine, Phenylalanine 54 with Leucine, Glutamic acid 80 with Lysine, Phenylalanine 87 with Leucine, Valine 105 with Alanine and Isoleucine 132 with Valine) was constructed on a pool of five variants cleaving HIV1—3.3 (SEQ ID NO:327) (described in Table XX, SEQ ID NO:115 to 119). 558 transformed clones were screened for cleavage against the HIV1—3.3 (SEQ ID NO:327) and HIV1—3.5 (SEQ ID NO:329) DNA targets. A total of 376 positive clones were found to cleave HIV1—3.3 (SEQ ID NO:327), while 54 of those cleaved also the HIV1—3.5 target (SEQ ID NO:329). An example of positive variants is shown in
The 558 transformed clones were also mated with a yeast strain that contains (i) the HIV1—3 target (SEQ ID NO:325) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—3.4 target (SEQ ID NO:328) (38Y, 44Y, 68S, 70S, 75R, 77V, 43L, 81V, 105A, 107R or KNSYYS/YSSRV+43L+81V+105A+107R (SEQ ID NO:125), according to the nomenclature of Table I). After mating with this yeast strain, 386 clones were found to cleave the HIV1—3 (SEQ ID NO:325). Thus, 386 positives contained proteins able to form heterodimers with KNSYYS/YSSRV+43L+81V+105A+107R (SEQ ID NO: 125) showing cleavage activity on the HIV1—3 target (SEQ ID NO:325). An example of positives is shown in
Sequencing of 93 clones with the high cleavage activity on the HIV1—3 (SEQ ID NO:325) and/or HIV1—3.3 target (SEQ ID NO:327) allowed the identification of 62 different endonuclease variants.
As an example, ten I-CreI variants cleaving the HIV1—3 target (SEQ ID NO:325) when forming a heterodimer with the KNSYYS/YSSRV variant (SEQ ID NO:125) are listed in Table XXI.
19S 32K 33A 43L 44K 49A 68E 70S 75N 77R 81V 85R 89A 129A 154C 158Q
19S 32K 33A 44K 68E 70S 72T 73I 75N 77R 81V 85R 105A
19S 30C 33C 44K 54L 68E 70S 75N 77R
19S 32K 33A 44K 68E 70S 72T 75N 77R 80K 92R 96R 105A 154R
As a complement to example 5 we also decided to perform random mutagenesis with variants that cleave HIV1—3.4 (SEQ ID NO:328). The mutagenized proteins cleaving HIV1—3.4 (SEQ ID NO:328) were then tested to determine the efficiency of cleavage of the HIV1—3.4 (SEQ ID NO:328) and HIV1—3.6 (SEQ ID NO:330) targets.
a) Construction of Libraries by Random Mutagenesis
Random mutagenesis was performed on a pool of chosen variants, by PCR using Mn2+. PCR reactions were carried out that amplify the I-CreI coding sequence using the primers preATGCreFor (5′-gcataaattactatacttctatagacacgcaaacacaaatacacagcggccttgccacc-3′; SEQ ID NO: 24) and ICreIpostRev (5′-ggctcgaggagctcgtctagaggatcgctcgagttatcagtcggccgc-3′; SEQ ID NO: 25). Approximately 25 ng of the PCR product and 75 ng of vector DNA (pCLS1107,
b) Mating of Meganuclease Expressing Clones and Screening in Yeast
Mating was performed as previously described in example 9. Positive resulting clones were verified by sequencing (MILLEGEN) as described in example 9.
Five variants cleaving HIV1—3.4 (SEQ ID NO:328) were pooled, randomly mutagenized and transformed into yeast. The sequences of the variants subjected to random mutagenesis are described in table XVIII (SEQ ID NO:97 to 101).
2232 transformed clones were screened for cleavage against the HIV1—3.4 (SEQ ID NO:328) and HIV1—3.6 (SEQ ID NO:330) DNA targets. A total of 645 positive clones were found to cleave HIV1—3.4 (SEQ ID NO:328), while 156 of those also cleaved the HIV1—3.6 target (SEQ ID NO:330). Sequencing of the 93 clones showing the strongest activity allowed the identification of 52 novel endonuclease variants. An example of the identified variants is presented in table XXII and in
In order to further improve the activity of the obtained meganucleases, a second round of random mutagenesis was carried out following the same rationale of example 6. For this purpose, ten variants cleaving HIV1—3.4 (SEQ ID NO:328) were mutagenized, and variants were screened for cleavage activity of HIV1—3.4 (SEQ ID NO:328) and HIV1—3.6 (SEQ ID NO:330) targets. The materials and methods have previously been described in example 11.
A) Results
Ten variants cleaving HIV1—3.4 (SEQ ID NO:328), were pooled, randomly mutagenized and transformed into yeast. The variants submitted to random mutagenesis correspond to variants described in Table XXII (SEQ ID NO: 136 to 145).
2232 transformed clones were screened for cleavage against the HIV1—3.4 (SEQ ID NO:328) and HIV1—3.6 (SEQ ID NO:330) targets. A total of 178 positive clones were found to cleave HIV1—3.4 (SEQ ID NO:328), while 63 of those cleaved also the HIV1—3.6 target (SEQ ID NO:330). Sequencing of the 93 clones showing the strongest cleavage activity in the HIV1—3.4 target (SEQ ID NO:328) allowed the identification of 62 novel endonuclease variants. An example of the identified variants is presented in table XXIII and
24V 38Y 43L 44Y 68S 70S 75R 77V 105A 153G
4I 38Y 44Y 54I 68S 70S 75R 77V 105A
33C 38S 70S 75N 77K
7R 38Y 40C 44Y 54I 68S 69V 70S 75R 77V
Four of the improved I-CreI variants cleaving HIV1—3.4 (SEQ ID NO:328) described in Table XXIII and used for a second round of random mutagenesis in example 14bis were also mutagenized by introducing selected amino-acid substitutions in the proteins and screening for variants cleaving HIV1—3 (SEQ ID NO:325) in combination with a variant cleaving HIV1—3.3 (SEQ ID NO:327).
Six amino-acid substitutions have been found in previous studies to enhance the activity of I-CreI derivatives: these mutations correspond to the replacement of Glycine 19 with Serine (G19S), Phenylalanine 54 with Leucine (F54L), Glutamic acid 80 with Lysine (E80K), Phenylalanine 87 with Leucine (F87L), Valine 105 with Alanine (V105A) and Isoleucine 132 with Valine (I132V). These mutations were introduced into the coding sequence of proteins cleaving HIV1—3.3 (SEQ ID NO:327), and the resulting proteins were tested for their ability to induce cleavage of the HIV1—3 target (SEQ ID NO:325), upon co-expression with a variant cleaving HIV1—3.4 (SEQ ID NO:328).
Site-directed mutagenesis libraries were created by PCR on a pool of chosen variants. For example, to introduce the G19S substitution into the coding sequence of the variants, two separate overlapping PCR reactions were carried out that amplify the 5′ end (residues 1-24) or the 3′ end (residues 14-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using a primer with homology to the vector (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17) and a primer specific to the I-CreI coding sequence for amino acids 14-24 that contains the substitution mutation G19S (G19SF 5′-gccggctttgtggactctgacggtagcatcatc-3′ (SEQ ID NO: 47) or G19SR 5′-gatgatgctaccgtcagagtccacaaagccggc-3′ (SEQ ID NO: 48)).
The same strategy is used with the following pair of oligonucleotides to introduce the mutations leading to the F54L, E80K, F87L, V105A and I132V substitutions in the coding sequences of the variants, respectively:
For each substitution to be introduced, the resulting PCR products contain 33 bp of homology with each other. The PCR fragments were purified. The ten PCR fragments were pooled en equimolar amounts to generate a mix containing 50 ng of PCR DNA and 75 ng of vector DNA (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
The experimental procedure is as described in example 11.
d) Sequencing of Variants
The experimental procedure is as described in example 9.
A library containing a population harboring the six amino-acid substitutions (Glycine 19 with Serine, Phenylalanine 54 with Leucine, Glutamic acid 80 with Lysine, Phenylalanine 87 with Leucine, Valine 105 with Alanine and Isoleucine 132 with Valine) was constructed on a pool of four variants cleaving HIV1—3.4 (SEQ ID NO:328) (SEQ ID NO:136 to 139, Table XXII). 317 transformed clones were screened for cleavage against the HIV1—3.4 (SEQ ID NO:328) and HIV1—3.6 (SEQ ID NO:330) DNA targets. A total of 311 positive clones were found to cleave HIV1—3.4 (SEQ ID NO:328), while 262 of those cleaved also the HIV1—3.6 target (SEQ ID NO:330). An example of positive variants is shown in
The 317 transformed clones were also mated with a yeast strain that contains (i) the HIV1—3 target (SEQ ID NO:325) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—3.3 target (SEQ ID NO:327) (I-CreI 32K, 33A, 44K, 68E, 70S, 75N, 77R, +132N or KNKAQS/KESNR+132N (SEQ ID NO:109), according to the nomenclature of Table I). After mating with this yeast strain, 264 clones were found to cleave the HIV1—3 (SEQ ID NO:325). Thus, 264 positives contained proteins able to form heterodimers with KNKAQS/KESNR+132N (SEQ ID NO: 109, Table XIX) showing cleavage activity on the HIV1—3 target (SEQ ID NO:325). An example of positive clones is shown in
Sequencing of the 317 clones allowed the identification of 69 different endonuclease variants.
As an example, ten I-CreI variants cleaving the HIV1—3 target (SEQ ID NO:325) when forming a heterodimer with the KNKAQS/KESNR+132N variant (SEQ ID NO:109) are listed in Table XXIV.
The HIV1—4 target (SEQ ID NO:331) is a 22 bp (non-palindromic) target located in the gag gene of the HIV1 provirus. This target is precisely located at positions 1629-1650 of the HIV-1 pNL4-3 vector (accession number AF324493, Adachi et al., J. Virol., 1986, 59, 284-291), a subtype B infectious molecular clone.
The HIV1—4 sequence (SEQ ID NO: 331) is partly a patchwork of the 100AGC_P (SEQ ID NO:383), 10TGT_P (SEQ ID NO:382), 5TCT_P (SEQ ID NO:390) and 5_TAT_P (SEQ ID NO:391) targets (
The 10AGC_P (SEQ ID NO:383), 10TGT_P (SEQ ID NO:382), 5TCT_P (SEQ ID NO:390) and 5_TAT_P (SEQ ID NO:391) target sequences are 24 bp derivatives of C1221, a palindromic sequence cleaved by I-CreI (Arnould et al., precited). However, the structure of I-CreI bound to its DNA target suggests that the two external base pairs of these targets (positions −12 and 12) have no impact on binding and cleavage (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269), and in this study, only positions −11 to 11 were considered. Consequently, the HIV1—4 series of targets (SEQ ID NO:331 to 336) were defined as 22 bp sequences instead of 24 bp. HIV1—4 (SEQ ID NO:331) differs from C1221 (SEQ ID NO:343) in the 4 bp central region. According to the structure of the I-CreI protein bound to its target, there is no contact between the 4 central base pairs (positions −2 to 2) and the I-CreI protein (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269). Thus, the bases at these positions should not impact the binding efficiency. However, they could affect cleavage, which results from two nicks at the edge of this region. Thus, the GGAC sequence in −2 to 2 was first substituted with the GTAC sequence from C1221 (SEQ ID NO:343), resulting in target HIV1—4.2 (SEQ ID NO: 332,
This example shows that I-CreI variants can cut the HIV1—4.3 DNA target sequence (SEQ ID NO:333) derived from the left part of the HIV1—4.2 target (SEQ ID NO:332) in a palindromic form (
HIV1—4.3 (SEQ ID NO:333) is similar to 10AGC_P (SEQ ID NO:383) at positions ±1, ±2, ±6, ±8, ±9, and ±10 and to 5TCT_P (SEQ ID NO:390) at positions ±1, ±2, ±3, ±4, ±5 and ±6. It was hypothesized that positions ±7 and ±11 would have little effect on the binding and cleavage activity. Variants able to cleave the 10AGC_P (SEQ ID NO:383) target were obtained by mutagenesis of I-CreI N75 or D75, at positions 28, 30, 32, 33, 38, 40 and 70, as described previously in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156. Variants able to cleave 5TCT_P (SEQ ID NO:390) were obtained by mutagenesis on I-CreI N75 at positions 24, 44, 68, 70, 75 and 77 as described in Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156.
Both sets of proteins are mutated at position 70. However, the existence of two separable functional subdomains was hypothesized. This implies that this position has little impact on the specificity at bases 10 to 8 of the target. Mutations at positions 24 found in variants cleaving the 5TCT_P target (SEQ ID NO:390) will be lost during the combinatorial process. But it was hypothesized that this will have little impact on the capacity of the combined variants to cleave the HIV1—4.3 target (SEQ ID NO:333).
Therefore, to check whether combined variants could cleave the HIV1—4.3 target (SEQ ID NO:333), mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TCT_P (SEQ ID NO:390) were combined with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10AGC_P (SEQ ID NO:383).
a) Construction of Target Vector
The target was cloned as follows: an oligonucleotide corresponding to the HIV1—4.3 target sequence (SEQ ID NO:333) flanked by gateway cloning sequences was ordered from PROLIGO: 5′ TGGCATACAAGTTTCCAGCATTCTGTACAGAATGCTGGCAATCGTCTGTCA 3′ (SEQ ID NO: 166). The same procedure was followed for cloning the HIV1—4.5 target (SEQ ID NO:335), using the oligonucleotide: 5′TGGCATACAAGTTTCCAGCATTCTGGACAGAATGCTGGCAATCGTCTGTCA 3′ (SEQ ID NO: 167). Double-stranded target DNA, generated by PCR amplification of the single stranded oligonucleotide, was cloned using the Gateway protocol (INVITROGEN) into the yeast reporter vector (pCLS1055,
b) Construction of Combinatorial Mutants
I-CreI variants cleaving 10AGC_P (SEQ ID NO:383) or 5TCT_P (SEQ ID NO:390) were previously identified, as described in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458; International PCT Applications WO 2006/097784 and WO 2006/097853, respectively for the 10AGC_P (SEQ ID NO:383) and 5TCT_P (SEQ ID NO:390) targets. In order to generate I-CreI derived coding sequences containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-43) or the 3′ end (positions 39-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using primers (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) specific to the vector (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
Screening was performed as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458). Mating was performed using a colony gridder (QpixII, GENETIX). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of the reporter-harboring yeast strain. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software.
d) Sequencing of Variants
To recover the variant expression plasmids, yeast DNA was extracted using standard protocols and used to transform E. coli. Sequencing of variant ORFs was then performed on the plasmids by MILLEGEN SA. Alternatively, ORFs were amplified from yeast DNA by PCR (Akada et al., Biotechniques, 2000, 28, 668-670), and sequencing was performed directly on the PCR product by MILLEGEN SA.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TCT_P (SEQ ID NO:390) with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10AGC_P (SEQ ID NO:383) on the I-CreI scaffold, resulting in a library of complexity 3800. Examples of combinatorial variants are displayed in Table XXV. This library was transformed into yeast and 3348 clones were screened for cleavage against the HIV1—4.3 (SEQ ID NO:333) and HIV1—4.5 (SEQ ID NO:335) DNA targets. 7 positive clones were found to cleave the HIV1—4.3 target (SEQ ID NO:333), which after sequencing turned out to correspond to 7 different novel endonuclease variants (Table XXVI). Those variants showed no cleavage activity of the HIV1—4.5 DNA target (SEQ ID NO:335). Examples of positives are shown in
This example shows that I-CreI variants can cleave the HIV1—4.4 (SEQ ID NO:334) DNA target sequence derived from the right part of the HIV1—4.2 target (SEQ ID NO:332) in a palindromic form (
HIV1—4.4 (SEQ ID NO:334) is similar to 5TAT_P (SEQ ID NO:391) at positions ±1, ±2, ±3, ±4, ±5 and ±8 and to 100TGT_P (SEQ ID NO:382) at positions ±1, ±2, ±3, ±4, ±8, ±9 and ±10. It was hypothesized that positions ±6, ±7 and +11 would have little effect on the binding and cleavage activity. Variants able to cleave 5TAT_P (SEQ ID NO:391) were obtained by mutagenesis of I-CreI N75 at positions 44, 68, 70, 75 and 77, as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156). Variants able to cleave the 10TGT_P target (SEQ ID NO:382) were obtained by mutagenesis of I-CreI N75 or D75, at positions 28, 30, 32, 33, 38, 40 and 70, as described previously in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156.
Both sets of proteins are mutated at position 70. However, the existence of two separable functional subdomains was hypothesized. This implies that this position has little impact on the specificity at bases 10 to 8 of the target.
Therefore, to check whether combined variants could cleave the HIV1—4.4 target (SEQ ID NO:334), mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TAT_P (SEQ ID NO:391) were combined with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10TGT_P (SEQ ID NO:382).
a) Construction of Target Vector
The experimental procedure is as described in example 17, with the exception that different oligonucleotides corresponding to the HIV1—4.4 (SEQ ID NO:334) and HIV1—4.6 (SEQ ID NO:336) targets. The oligonucleotide used for the HIV1—4.4 target (SEQ ID NO:334) was:
and
5′TGGCATACAAGTTTCTTGTCTTATGGACATAAGACAAGCAATCGTCTGTCA3′ (SEQ ID NO: 176) for HIV1—4.6 target (SEQ ID NO:336).
b) Construction of Combinatorial Variants
I-CreI variants cleaving 10TGT_P (SEQ ID NO:382) or 5TAT_P (SEQ ID NO:391) were previously identified, as described in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458; International PCT Applications WO 2006/097784 and WO 2006/097853, respectively for the 10TGT_P (SEQ ID NO:382) and 5TAT_P (SEQ ID NO:391) targets. In order to generate I-CreI derived coding sequences containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-43) or the 3′ end (positions 39-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using primers (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) specific to the vector (pCLS1107,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
Screening was performed as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458). Mating was performed using a colony gridder (QpixII, GENETIX). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of the reporter-harboring yeast strain. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking tryptophan, adding G418, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software. Positives resulting clones were verified by sequencing (MILLEGEN) as described in example 2.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TAT_P (SEQ ID NO:391) with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10TGT_P (SEQ ID NO:382) on the I-CreI scaffold, resulting in a library of complexity 1406. Examples of combinatorial variants are displayed in Table XXVII. This library was transformed into yeast and 3348 clones (2.3 times the diversity) were screened for cleavage against the HIV1—4.4 (SEQ ID NO:334) and HIV1—4.6 (SEQ ID NO:336) DNA targets. A total of 210 positive clones were found to cleave HIV1—4.4 (SEQ ID NO:334). 40 of these clones were also able to cleave the HIV1—4.6 (SEQ ID NO:336) DNA target. Sequencing of these 93 clones with the strongest activity allowed the identification of 45 novel endonuclease variants. Examples of positives are shown in
I-CreI variants able to cleave each of the palindromic HIV1—4.2 (SEQ ID NO:332) derived targets (HIV1—4.3 (SEQ ID NO:333) and HIV1—4.4 (SEQ ID NO:334)) were identified in example 2 and example 3. Pairs of such variants (one cutting HIV1—4.3 (SEQ ID NO:333) and one cutting HIV1—4.4 (SEQ ID NO:334)) were co-expressed in yeast. Upon co-expression, there should be three active molecular species, two homodimers, and one heterodimer. It was assayed whether the heterodimers that should be formed, cut the HIV1—4.2 (SEQ ID NO:332) and the non palindromic HIV1—4 (SEQ ID NO:331) targets.
a) Construction of Target Vector
The experimental procedure is as described in example 2, with the exception that an oligonucleotide corresponding to the HIV1—4.2 target sequence (SEQ ID NO:332): 5′TGGCATACAAGTTTCCAGCATTCTGTACATAAGACAAGCAATCGTCTGTCA
3′ (SEQ ID NO: 187) or the HIV1—4 target sequence (SEQ ID NO:331): 5′ TGGCATACAAGTTTCCAGCATTCTGGACATAAGACAAGCAATCGTCTGTCA3′
(SEQ ID NO: 188) was used.
b) Co-Expression of Variants
Yeast DNA was extracted from variants cleaving the HIV1—4.4 (SEQ ID NO:334) target in the pCLS1107 expression vector using standard protocols and was used to transform E. coli. The resulting plasmid DNA was then used to transform yeast strains expressing a variant cutting the HIV1—4.3 target (SEQ ID NO:333) in the pCLS0542 expression vector. Transformants were selected on synthetic medium lacking leucine and containing G418.
c) Mating of Meganucleases Coexpressing Clones and Screening in Yeast
Mating was performed using a colony gridder (QpixII, Genetix). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of different reporter-harboring yeast strains for each target. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, adding G418, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software.
Co-expression of variants cleaving the HIV1—4.4 target (SEQ ID NO:334) (10 variants corresponding to those described in Table XXVIII, SEQ ID 177 to 186) and six variants cleaving the HIV1—4.3 target (SEQ ID NO:333) (Table XXVI, SEQ ID 168 and 170 to 174) resulted in cleavage of the HIV1—4.2 (SEQ ID NO:332) target in most of the cases (
The assembly of I-CreI variants cleaving the palindromic HIV1—4.3 (SEQ ID NO:333) and HIV1—4.4 target (SEQ ID NO:334) to cleave the HIV1—4.2 (SEQ ID NO:332) and HIV1—4 (SEQ ID NO:331) have been previously identified in example 4. However, these variants display activity with the HIV1—4.2 target (SEQ ID NO:332) and not with the HIV1—4 target (SEQ ID NO:331).
Therefore seven variants cleaving HIV1—4.3 (SEQ ID NO:333) were mutagenized, and variants were screened for cleavage activity of HIV1—4.3 (SEQ ID NO:333) and HIV1—4.5 (SEQ ID NO:335) targets. Additionally the mutants with the strongest activity were screened for cleavage activity of HIV1—4 (SEQ ID NO:331) when co-expressed with a variant cleaving HIV1—4.4 (SEQ ID NO:334). According to the structure of the I-CreI protein bound to its target, there is no contact between the 4 central base pairs (positions −2 to 2) and the I-CreI protein (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269). Thus, it is difficult to rationally choose a set of positions to mutagenize, and mutagenesis was performed on the whole protein. Random mutagenesis results in high complexity libraries. Therefore, to limit the complexity of the variant libraries to be tested, only one of the two components of the heterodimers cleaving HIV1—4 (SEQ ID NO:331) was mutagenized.
Thus, in a first step, proteins cleaving HIV1—4.3 (SEQ ID NO:333) were mutagenized and their homodimeric cleavage activity was determined, and in a second step, it was assessed whether they could cleave HIV1—4 (SEQ ID NO:331) when co-expressed with a protein cleaving HIV1—4.4 (SEQ ID NO:334).
a) Construction of Libraries by Random Mutagenesis
Random mutagenesis was performed on a pool of chosen variants, by PCR using Mn2+. PCR reactions were carried out that amplify the I-CreI coding sequence using the primers preATGCreFor (5′-gcataaattactatacttctatagacacgcaaacacaaatacacagcggccttgccacc-3′; SEQ ID NO: 24) and ICreIpostRev (5′-ggctcgaggagctcgtctagaggatcgctcgagttatcagtcggccgc-3′; SEQ ID NO: 25), which are common to the pCLS0542 (
b) Mating of Meganuclease Expressing Clones and Screening in Yeast
Experiments were performed as previously described in example 17. Positive resulting clones were verified by sequencing (MILLEGEN) as described in example 17.
c) Variant-Target Yeast Strains, Screening and Sequencing
The yeast strain FYBL2-7B (MAT a, ura3Δ851, trp1Δ63, leu2Δ1, lys2Δ202) containing the HIV1—4 target (SEQ ID NO:331) in the yeast reporter vector (pCLS1055,
Seven variants cleaving HIV1—4.3 (SEQ ID NO:333), were pooled, randomly mutagenized and transformed into yeast. The sequences of the variants subjected to random mutagenesis are described in table XXVI.
2232 transformed clones were screened for cleavage against the HIV1—4.3 (SEQ ID NO:333) and HIV1—4.5 (SEQ ID NO:335) DNA targets. A total of 249 positive clones were found to cleave HIV1—4.3 (SEQ ID NO:333), while 12 of them cleaved also the HIV1—4.5 target (SEQ ID NO:335). Sequencing of the 93 clones showing the strongest activity allowed the identification of 60 novel endonuclease variants. An example of the identified variants is presented in table XXX and in
The 93 clones showing the highest cleavage activity on target HIV1—4.3 (SEQ ID NO:333) were then mated with a yeast strain that contains (i) the HIV1—4 target (SEQ ID NO:331) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—4.4 target (SEQ ID NO:334) (I-CreI 30H, 33M, 38A, 44N, 68Y, 70S, 75Y, 77R or KHSMAS/NYSYR (SEQ ID NO:177), according to the nomenclature of Table I). After mating with this yeast strain, no clones were found to cleave the HIV1—4 (SEQ ID NO:331) when forming heterodimers with KHSMAS/NYSYR (SEQ ID NO: 177, Table XXIX).
In order to further improve the activity of the obtained meganucleases, a second round of random mutagenesis was carried out following the same rationale of example 20. For this purpose, four variants cleaving HIV1—4.3 (SEQ ID NO:333) were mutagenized, and variants were screened for cleavage activity of HIV1—4.3 (SEQ ID NO:333) and HIV1—4.5 (SEQ ID NO:335) targets. Additionally the mutants with the strongest activity were screened for cleavage activity of HIV1—4 (SEQ ID NO:331) when co-expressed with a variant cleaving HIV1—4.4 (SEQ ID NO:334).
The materials and methods have previously been described in example 20.
A) Results
Six variants cleaving HIV1—4.3 (SEQ ID NO:333), were pooled, randomly mutagenized and transformed into yeast. The six variants submitted to random mutagenesis correspond to variants described in Table XXX (SEQ ID NO: 189 to 194).
2232 transformed clones were screened for cleavage against the HIV1—4.3 (SEQ ID NO:333) and HIV1—4.5 (SEQ ID NO:335) DNA targets. A total of 377 positive clones were found to cleave HIV1—4.3 (SEQ ID NO:333), while 208 of those cleaved also the HIV1—4.5 target (SEQ ID NO:335). Sequencing of the 93 clones with the highest activity allowed the identification of 53 novel endonuclease variants. An example of the identified variants is presented in table XXXI and
The 93 clones showing cleaving target HIV1—4.3 (SEQ ID NO:333) were then mated with a yeast strain that contains (i) the HIV1—4 target (SEQ ID NO:331) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—4.4 target (SEQ ID NO:334) (I-CreI 30H, 33M, 38A, 44A, 68Y, 70S, 75Y, 77R, 155R or KHSMAS/AYSYR+155R (SEQ ID NO:199), according to the nomenclature of Table I). After mating with this yeast strain, all the 93 clones were found to cleave the HIV1—4 (SEQ ID NO:331). Thus, 93 positives contained proteins able to form heterodimers with KHSMAS/AYSYR+155R (SEQ ID NO: 199) showing cleavage activity on the HIV1—4 target (SEQ ID NO:331). An example of positives is shown in
I-CreI variants cleaving HIV1—4.3 (SEQ ID NO:333) were also mutagenized by introducing selected amino-acid substitutions in the proteins and screening for more efficient variants cleaving HIV1—4 (SEQ ID NO:331) in combination with a variant cleaving HIV1—4.4 (SEQ ID NO:334).
Six amino-acid substitutions have been found in previous studies to enhance the activity of I-CreI derivatives: these mutations correspond to the replacement of Glycine 19 with Serine (G19S), Phenylalanine 54 with Leucine (F54L), Glutamic acid 80 with Lysine (E80K), Phenylalanine 87 with Leucine (F87L), Valine 105 with Alanine (V105A) and Isoleucine 132 with Valine (I132V). These mutations were introduced into the coding sequence of proteins cleaving HIV1—4.3 (SEQ ID NO:333), and the resulting proteins were tested for their ability to induce cleavage of the HIV1—4 target (SEQ ID NO:331), upon co-expression with a variant cleaving HIV1—4.4 (SEQ ID NO:334), as well as for the ability to cleave targets HIV1—4.3 (SEQ ID NO:333) and HIV1—4.5 (SEQ ID NO:335).
a) Site-Directed Mutagenesis
Site-directed mutagenesis libraries were created by PCR on a pool of chosen variants. For example, to introduce the G19S substitution into the coding sequence of the variants, two separate overlapping PCR reactions were carried out that amplify the 5′ end (residues 1-24) or the 3′ end (residues 14-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using a primer with homology to the vector (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) and a primer specific to the I-CreI coding sequence for amino acids 14-24 that contains the substitution mutation G19S (G19SF 5′-gccggctttgtggactctgacggtagcatcatc-3′ (SEQ ID NO: 47) or G19SR 5′-gatgatgctaccgtcagagtccacaaagccggc-3′ (SEQ ID NO: 48)).
The same strategy is used with the following pair of oligonucleotides to introduce the mutations leading to the F54L, E80K, F87L, V105A and I132V substitutions in the coding sequences of the variants, respectively:
For each substitution to be introduced, the resulting PCR products contain 33 bp of homology with each other. The PCR fragments were purified. The ten PCR fragments were pooled en equimolar amounts to generate a mix containing 50 ng of PCR DNA and 75 ng of vector DNA (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
The experimental procedure is as described in example 20.
d) Sequencing of Variants
The experimental procedure is as described in example 17.
A library containing a population harboring the six amino-acid substitutions (Glycine 19 with Serine, Phenylalanine 54 with Leucine, Glutamic acid 80 with Lysine, Phenylalanine 87 with Leucine, Valine 105 with Alanine and Isoleucine 132 with Valine) was constructed on a pool of six variants cleaving HIV1—4.3 (SEQ ID NO:333) (described in Table XXXI, SEQ ID NO:200 to 205).
558 transformed clones were mated with a yeast strain that contains (i) the HIV1—4 target (SEQ ID NO:331) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—4.4 target (SEQ ID NO:334) (30H, 33M, 38A, 44N, 68Y, 70S, 75Y, 77R or KHSMAS/NYSYR (SEQ ID NO:177), according to the nomenclature of Table I). After mating with this yeast strain, 486 clones were found to cleave the HIV1—4 (SEQ ID NO:331). Thus, 486 positives contained proteins able to form heterodimers with KHSMAS/NYSYR (SEQ ID NO: 177) showing cleavage activity on the HIV1—4 target (SEQ ID NO:331). An example of positive variants is shown in
Sequencing of the 93 clones with the highest cleavage activity on the HIV1—4 target (SEQ ID NO:331) allowed the identification of 34 different endonuclease variants. These 93 clones were also tested for their ability to cleave the HIV1—4.3 (SEQ ID NO:333) and HIV1—4.5 (SEQ ID NO:335) targets. In this case, 71 clones were able to cleave the HIV1—4.3 target (SEQ ID NO:333), and 69 the HIV1—4.5 target (SEQ ID NO:335) (see
The sequence of ten I-CreI variants cleaving the HIV1—4 target (SEQ ID NO:331) when forming a heterodimer with the KHSMAS/NYSYR variant are listed in Table XXXII.
19S 28Q 38R 40K 44K 68T 70G 75N 80K 114T
19S 28Q 38R 40K 44K 54L 68T 70G 75N 114T
19S 28Q 38R 40K 44K 68T 70G 75N 80K 114T 147A
19S 28Q 38R 40K 41M 44K 54L 68T 70G 75N 80K
19S 28Q 38R 40K 42A 44K 54L 68T 70G 75N 80K
105A 114T
19S 28Q 38R 40K 42S 44K 54L 68T 70G 75N 80K
19S 28Q 38R 40K 41M 44K 54L 68T 70G 75N 80K
19S 28Q 38R 40K 44K 68T 70G 75N 123M 132V
The assembly of I-CreI variants cleaving the palindromic HIV1—4.3 (SEQ ID NO:333) and HIV1—4.4 target (SEQ ID NO:334) to cleave the HIV1—4.2 (SEQ ID NO:332) and HIV1—4 (SEQ ID NO:331) have been previously described in example 19. However, these variants display activity with the HIV1—4.2 target (SEQ ID NO:332) and not with the HIV1—4 target (SEQ ID NO:331).
As a complement to example 4 we also decided to perform random mutagenesis with variants that cleave HIV1—4.4 (SEQ ID NO:334). Therefore ten variants cleaving HIV1—4.3 (SEQ ID NO:333) were mutagenized, and variants were screened for cleavage activity of HIV1—4.4 (SEQ ID NO:334) and HIV1—4.6 (SEQ ID NO:336) targets. Additionally the mutants with the strongest activity were screened for cleavage activity of HIV1—4 (SEQ ID NO:331) when co-expressed with a variant cleaving HIV1—4.3 (SEQ ID NO:333).
a) Construction of Libraries by Random Mutagenesis
Random mutagenesis was performed on a pool of chosen variants, by PCR using Mn2+. PCR reactions were carried out that amplify the I-CreI coding sequence using the primers preATGCreFor (5′-gcataaattactatacttctatagacacgcaaacacaaatacacagcggccttgccacc-3′; SEQ ID NO: 24) and ICreIpostRev (5′-ggctcgaggagctcgtctagaggatcgctcgagttatcagtcggccgc-3′; SEQ ID NO: 25). Approximately 25 ng of the PCR product and 75 ng of vector DNA (pCLS1107,
b) Variant-Target Yeast Strains, Screening and Sequencing
The yeast strain FYBL2-7B (MAT a, ura3Δ851, trp1Δ63, leu2Δ1, lys2Δ202) containing the HIV1—4 target (SEQ ID NO:331) in the yeast reporter vector (pCLS1055,
Ten variants cleaving HIV1—4.4 (SEQ ID NO:334) were pooled, randomly mutagenized and transformed into yeast. The sequences of the variants subjected to random mutagenesis are described in table XXXII.
2232 transformed clones were screened for cleavage against the HIV1—4.4 (SEQ ID NO:334) and HIV1—4.6 (SEQ ID NO:336) DNA targets. A total of 210 positive clones were found to cleave HIV1—4.4 (SEQ ID NO:334), while 32 of those also cleaved the HIV1—4.6 target (SEQ ID NO:336). Sequencing of the 93 clones showing the strongest activity allowed the identification of 65 novel endonuclease variants. An example of the identified variants is presented in table XXXIII and in
26H 30H 33M 38A 44N 68Y 70S 75Y 77R
The 93 clones showing the highest cleavage activity on target HIV1—4.4 (SEQ ID NO:334) were then mated with a yeast strain that contains (i) the HIV1—4 target (SEQ ID NO:331) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—4.3 target (SEQ ID NO:333) (I-CreI 28Q, 38R, 40K, 44K, 68T, 70G, 75N+132V or QNSYRK/KTGNI+132V (SEQ ID NO:190), according to the nomenclature of Table I). After mating with this yeast strain, 90 clones were found to cleave the HIV1—4 target (SEQ ID NO:331). Thus, 90 positives contained proteins able to form heterodimers with QNSYRK/KTGNI+132V (SEQ ID NO: 190, Table XXX), that showed cleavage activity on the HIV1—4 target (SEQ ID NO:331). An example of positives is shown in
Four of the I-CreI variants cleaving HIV1—4.4 (SEQ ID NO:334) described in Table XXXVII were mutagenized by introducing selected amino-acid substitutions in the proteins and screening for more efficient variants cleaving HIV1—4 (SEQ ID NO:331) in combination with a variant cleaving HIV1—4.3 (SEQ ID NO:333).
Six amino-acid substitutions have been found in previous studies to enhance the activity of I-CreI derivatives: these mutations correspond to the replacement of Glycine 19 with Serine (G19S), Phenylalanine 54 with Leucine (F54L), Glutamic acid 80 with Lysine (E80K), Phenylalanine 87 with Leucine (F87L), Valine 105 with Alanine (V105A) and Isoleucine 132 with Valine (I132V). These mutations were introduced into the coding sequence of proteins cleaving HIV1—4.4 (SEQ ID NO:334), and the resulting proteins were tested for their ability to induce cleavage of the HIV1—4 target (SEQ ID NO:331), upon co-expression with a variant cleaving HIV1—4.3 (SEQ ID NO:333).
a) Site-Directed Mutagenesis
Site-directed mutagenesis libraries were created by PCR on a pool of chosen variants. For example, to introduce the G19S substitution into the coding sequence of the variants, two separate overlapping PCR reactions were carried out that amplify the 5′ end (residues 1-24) or the 3′ end (residues 14-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using a primer with homology to the vector (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ or Gal10R 5′-acaaccttgattggagacttgacc-3′) and a primer specific to the I-CreI coding sequence for amino acids 14-24 that contains the substitution mutation G19S (G19SF 5′-gccggctttgtggactctgacggtagcatcatc-3′ (SEQ ID NO: 47) or G19SR 5′-gatgatgctaccgtcagagtccacaaagccggc-3′ (SEQ ID NO: 48)). The resulting PCR products contain 33 bp of homology with each other. The PCR fragments were purified. Approximately 25 ng of each of the two overlapping PCR fragments and 75 ng of vector DNA (pCLS1107,
The same strategy is used with the following pair of oligonucleotides to create other libraries containing the F54L, E80K, F87L, V105A and I132V substitutions, respectively:
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
The experimental procedure is as described in example 22.
d) Sequencing of Variants
The experimental procedure is as described in example 17.
A library containing a population harboring the six amino-acid substitutions (Glycine 19 with Serine, Phenylalanine 54 with Leucine, Glutamic acid 80 with Lysine, Phenylalanine 87 with Leucine, Valine 105 with Alanine and Isoleucine 132 with Valine) was constructed on a pool of four variants cleaving HIV1—4.4 (SEQ ID NO:334) (see Table XXXIII, SEQ ID NO:199, 177, 221 and 228).
558 transformed clones were mated with a yeast strain that contains (i) the HIV1—4 target (SEQ ID NO:331) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—4.3 target (SEQ ID NO:333) (28Q, 38R, 40K, 44K, 68, 70G, 75N or QNSYRK/KTGNI+132V (SEQ ID NO:190), according to the nomenclature of Table I). After mating with this yeast strain, 16 clones were found to cleave the HIV1—4 (SEQ ID NO:331). Thus, 16 positives contained proteins able to form heterodimers with QNSYRK/KTGNI+132V (SEQ ID NO: 190, Table XXX) showing cleavage activity on the HIV1—4 target (SEQ ID NO:331). An example of positive variants is shown in
Sequencing of these positive clones allowed the identification of 10 different endonuclease variants. The clones cleaving the HIV1—4 target (SEQ ID NO:331) were also tested for their ability to cleave the HIV1—4.4 (SEQ ID NO:334) and HIV1—4.6 (SEQ ID NO:336) targets (see
The sequence of ten I-CreI variants cleaving the HIV1—4 target (SEQ ID NO:331) when forming a heterodimer with the KHSMAS/NYSYR variant (SEQ ID NO:177) are listed in Table XXXIV.
The HIV1—5 target (SEQ ID NO:337) is a 22 bp (non-palindromic) target located in the pol gene of the HIV1 provirus. This target is precisely located at positions 2317-2338 of the HIV-1 pNL4-3 vector (accession number AF324493, Adachi et al., J. Virol., 1986, 59, 284-291), a subtype B infectious molecular clone.
The HIV1—5 sequence (SEQ ID NO: 337) is partly a patchwork of the 10TCT_P (SEQ ID NO:377), 10CTG_P (SEQ ID NO:378), 5TAG_P (SEQ ID NO:386) and 5_CCT_P (SEQ ID NO:384) targets (
The 10TCT_P (SEQ ID NO:377), 10CTG_P (SEQ ID NO:378), 5TAG_P (SEQ ID NO:386) and 5_CCT_P (SEQ ID NO:384) target sequences are 24 bp derivatives of C1221 (SEQ ID NO:343), a palindromic sequence cleaved by I-CreI (Arnould et al., precited). However, the structure of I-CreI bound to its DNA target suggests that the two external base pairs of these targets (positions −12 and 12) have no impact on binding and cleavage (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269), and in this study, only positions −11 to 11 were considered. Consequently, the HIV1—5 series of targets (SEQ ID NO:337 to 342) were defined as 22 bp sequences instead of 24 bp. HIV1—5 (SEQ ID NO:337) differs from C1221 (SEQ ID NO:343) in the 4 bp central region. According to the structure of the I-CreI protein bound to its target, there is no contact between the 4 central base pairs (positions −2 to 2) and the I-CreI protein (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269). Thus, the bases at these positions should not impact the binding efficiency. However, they could affect cleavage, which results from two nicks at the edge of this region. Thus, the ATAC sequence in −2 to 2 was first substituted with the GTAC sequence from C1221 (SEQ NO:343), resulting in target HIV1—5.2 (SEQ ID NO: 338,
This example shows that I-CreI variants can cut the HIV1—5.3 (SEQ ID NO:339) DNA target sequence derived from the left part of the HIV1—5.2 target (SEQ ID NO:338) in a palindromic form (
HIV1—5.3 (SEQ ID NO:339) is similar to 10TCT_P (SEQ ID NO:377) at positions ±1, ±2, ±6, ±8, ±9, and ±10 and to 5TAG_P (SEQ ID NO:386) at positions ±1, ±2, ±3, ±4, ±5 and ±6. It was hypothesized that positions ±7 and ±11 would have little effect on the binding and cleavage activity. Variants able to cleave the 10TCT_P target (SEQ ID NO:377) were obtained by mutagenesis of I-CreI N75 or D75, at positions 28, 30, 32, 33, 38, 40 and 70, as described previously in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156. Variants able to cleave 5TAG_P (SEQ ID NO:386) were obtained by mutagenesis on I-CreI N75 at positions 24, 44, 68, 70, 75 and 77 as described in Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156.
Both sets of proteins are mutated at position 70. However, the existence of two separable functional subdomains was hypothesized. This implies that this position has little impact on the specificity at bases 10 to 8 of the target. Mutations at positions 24 found in variants cleaving the 5TAG_P target (SEQ ID NO:386) will be lost during the combinatorial process. But it was hypothesized that this will have little impact on the capacity of the combined variants to cleave the HIV1—5.3 target (SEQ ID NO:339).
Therefore, to check whether combined variants could cleave the HIV1—5.3 target (SEQ ID NO:339), mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TAG_P (SEQ ID NO:386) were combined with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10TCT_P (SEQ ID NO:377).
a) Construction of Target Vector
The target was cloned as follows: an oligonucleotide corresponding to the HIV1—5.3 target sequence (SEQ ID NO:339) flanked by gateway cloning sequences was ordered from PROLIGO: 5′ TGGCATACAAGTTTGCTCTATTAGGTACCTAATAGAGCCAATCGTCTGTCA 3′ (SEQ ID NO: 52). The same procedure was followed for cloning the HIV1—5.5 target (SEQ ID NO:341), using the oligonucleotide: 5′ TGGCATACAAGTTTGCTCTATTAGATACCTAATAGAGCCAATCGTCTGTCA
3′ (SEQ ID NO: 53). Double-stranded target DNA, generated by PCR amplification of the single stranded oligonucleotide, was cloned using the Gateway protocol (INVITROGEN) into the yeast reporter vector (pCLS1055,
b) Construction of Combinatorial Mutants
I-CreI variants cleaving 10TCT_P (SEQ ID NO:377) or 5TAG_P (SEQ ID NO:386) were previously identified, as described in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458; International PCT Applications WO 2006/097784 and WO 2006/097853, respectively for the 10TCT_P (SEQ ID NO:377) and 5TAG_P (SEQ ID NO:386) targets. In order to generate I-CreI derived coding sequences containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-43) or the 3′ end (positions 39-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using primers (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) specific to the vector (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
Screening was performed as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458). Mating was performed using a colony gridder (QpixII, GENETIX). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of the reporter-harboring yeast strain. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software.
d) Sequencing of Variants
To recover the variant expression plasmids, yeast DNA was extracted using standard protocols and used to transform E. coli. Sequencing of variant ORFs was then performed on the plasmids by MILLEGEN SA. Alternatively, ORFs were amplified from yeast DNA by PCR (Akada et al., Biotechniques, 2000, 28, 668-670), and sequencing was performed directly on the PCR product by MILLEGEN SA.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5TAG_P (SEQ ID NO:386) with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10TCT_P (SEQ ID NO:377) on the I-CreI scaffold, resulting in a library of complexity 1920. Examples of combinatorial variants are displayed in Table XXXV, none of the variants tested from the combinatorial library produced a positive result. This library was transformed into yeast and 3348 clones (1.7 times the diversity) were screened for cleavage against the HIV1—5.3 (SEQ ID NO:339) and HIV1—5.5 (SEQ ID NO:341) DNA targets. Two positive clones were found (though having weak cleavage activity), which after sequencing turned out to correspond to 2 different novel endonuclease variants (Table XXXVI). These two positives are shown in
This example shows that I-CreI variants can cleave the HIV1—5.4 DNA target sequence (SEQ ID NO:340) derived from the right part of the HIV1—5.2 target (SEQ ID NO:338) in a palindromic form (
HIV1—5.4 (SEQ ID NO:340) is similar to 5CCT_P (SEQ ID NO:384) at positions ±1, ±2, ±3, ±4, ±5 and ±8 and to 10CTG_P (SEQ ID NO:378) at positions ±1, ±2, ±3, ±4, ±8, ±9 and ±10. It was hypothesized that positions ±6, ±7 and ±11 would have little effect on the binding and cleavage activity. Variants able to cleave 5CCT_P (SEQ ID NO:384) were obtained by mutagenesis of I-CreI N75 at positions 44, 68, 70, 75 and 77, as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458; Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2006/097784, WO 2006/097853, WO 2007/060495 and WO 2007/049156). Variants able to cleave the 10TGG_P target (SEQ ID NO:379) were obtained by mutagenesis of I-CreI N75 or D75, at positions 28, 30, 32, 33, 38, 40 and 70, as described previously in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156.
Both sets of proteins are mutated at position 70. However, the existence of two separable functional subdomains was hypothesized. This implies that this position has little impact on the specificity at bases 10 to 8 of the target.
Therefore, to check whether combined variants could cleave the HIV1—5.4 target (SEQ ID NO:340), mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5CCT_P (SEQ ID NO:384) were combined with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 10CTG_P (SEQ ID NO:378).
a) Construction of Target Vector
The experimental procedure is as described in example 2, with the exception that different oligonucleotides corresponding to the HIV1—5.4 (SEQ ID NO:340) and HIV1—5.6 (SEQ ID NO:342) targets. The oligonucleotide used for the HIV1—5.4 target (SEQ ID NO:340) was: 5′ TGGCATACAAGTTTATCTGCTCCTGTACAGGAGCAGATCAATCGTCTGTCA 3′ (SEQ ID NO: 243), and 5′ TGGCATACAAGTTTATCTGCTCCTATACAGGAGCAGATCAATCGTCTGTCA 3′ (SEQ ID NO: 244) for HIV1—5.6 target (SEQ ID NO:342).
b) Construction of Combinatorial Variants
I-CreI variants cleaving 10CTG_P (SEQ ID NO:378) or 5CCT_P (SEQ ID NO:384) were previously identified, as described in Smith et al. Nucleic Acids Res., 2006, 34, e149; International PCT Applications WO 2007/060495 and WO 2007/049156, and Arnould et al., J. Mol. Biol., 2006, 355, 443-458; International PCT Applications WO 2006/097784 and WO 2006/097853, respectively for the 10CTG_P (SEQ ID NO:378) and 5CCT_P (SEQ ID NO:384) targets. In order to generate I-CreI derived coding sequences containing mutations from both series, separate overlapping PCR reactions were carried out that amplify the 5′ end (aa positions 1-43) or the 3′ end (positions 39-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using primers (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) specific to the vector (pCLS1107,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
Screening was performed as described previously (Arnould et al., J. Mol. Biol., 2006, 355, 443-458). Mating was performed using a colony gridder (QpixII, GENETIX). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of the reporter-harboring yeast strain. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking tryptophan, adding G418, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software. Positives resulting clones were verified by sequencing (MILLEGEN) as described in example 2.
I-CreI combinatorial variants were constructed by associating mutations at positions 44, 68, 70, 75 and 77 from proteins cleaving 5CCT_P (SEQ ID NO:384) with the 28, 30, 32, 33, 38 and 40 mutations from proteins cleaving 100CTG_P (SEQ ID NO:378) on the I-CreI scaffold, resulting in a library of complexity 1600. Examples of combinatorial variants are displayed in Table XXXXI. This library was transformed into yeast and 3348 clones (2 times the diversity) were screened for cleavage against the HIV1—5.4 (SEQ ID NO:340) and HIV1—5.6 (SEQ ID NO:342) DNA targets. A total of 10 positive clones were found to cleave HIV1—5.4 (SEQ ID NO:340). Sequencing of these 10 clones allowed the identification of 9 novel endonuclease variants, which are represented in Table XXXVII. Examples of positives are shown in
I-CreI variants able to cleave each of the palindromic HIV1—5.2 (SEQ ID NO:338) derived targets (HIV1—5.3 (SEQ ID NO:339) and HIV1—5.4 (SEQ ID NO:340)) were identified in example 25 and example 26. Pairs of such variants (one cutting HIV1—5.3 (SEQ ID NO:339) and one cutting HIV1—5.4 (SEQ ID NO:340)) were co-expressed in yeast. Upon co-expression, there should be three active molecular species, two homodimers, and one heterodimer. It was assayed whether the heterodimers that should be formed, cut the HIV1—5.2 (SEQ ID NO:338) and the non palindromic HIV1—5 targets (SEQ ID NO:337).
a) Construction of Target Vector
The experimental procedure is as described in example 2, with the exception that an oligonucleotide corresponding to the HIV1—5.2 target sequence: 5′TGGCATACAAGTTTGCTCTATTAGGTACAGGAGCAGATCAATCGTCTGTCA3′ (SEQ ID NO: 254) or the HIV1—5 target sequence: 5′TGGCATACAAGTTTGCTCTATTAGATACAGGAGCAGATCAATCGTCTGTCA 3′ (SEQ ID NO: 255) was used.
b) Co-Expression of Variants
Yeast DNA was extracted from variants cleaving the HIV1—5.4 (SEQ ID NO:340) target in the pCLS1107 expression vector using standard protocols and was used to transform E. coli. The resulting plasmid DNA was then used to transform yeast strains expressing a variant cutting the HIV1—5.3 (SEQ ID NO:339) target in the pCLS0542 expression vector. Transformants were selected on synthetic medium lacking leucine and containing G418.
c) Mating of Meganucleases Coexpressing Clones and Screening in Yeast
Mating was performed using a colony gridder (QpixII, Genetix). Variants were gridded on nylon filters covering YPD plates, using a low gridding density (4-6 spots/cm2). A second gridding process was performed on the same filters to spot a second layer consisting of different reporter-harboring yeast strains for each target. Membranes were placed on solid agar YPD rich medium, and incubated at 30° C. for one night, to allow mating. Next, filters were transferred to synthetic medium, lacking leucine and tryptophan, adding G418, with galactose (2%) as a carbon source, and incubated for five days at 37° C., to select for diploids carrying the expression and target vectors. After 5 days, filters were placed on solid agarose medium with 0.02% X-Gal in 0.5 M sodium phosphate buffer, pH 7.0, 0.1% SDS, 6% dimethyl formamide (DMF), 7 mM β-mercaptoethanol, 1% agarose, and incubated at 37° C., to monitor β-galactosidase activity. Results were analyzed by scanning and quantification was performed using appropriate software.
Co-expression of variants cleaving the HIV1—5.4 target (SEQ ID NO:340) (9 variants chosen among those described in Table XXXVIII) and the two variants cleaving the HIV1—5.3 target (SEQ ID NO:339) (described in Table XXXVI) resulted in cleavage of the HIV1—5.2 target (SEQ ID NO:338) in one of the cases (
I-CreI variants able to cleave the HIV1—5.3 (SEQ ID NO:339) have been identified in example 25. Since these two variants show a weak activity, and only one of them is able to cleave the HIV1—5.2 target (SEQ ID NO:338) when assembled with a meganuclease cleaving the HIV1—5.4 (SEQ ID NO:340), these two variants were mutagenized, and the clones generated were screened for cleavage activity of HIV1—5.3 (SEQ ID NO:339) and HIV1—5.5 (SEQ ID NO:341) targets. Additionally the mutants with the strongest activity were screened for cleavage activity of HIV1—5 (SEQ ID NO:337) when co-expressed with a variant cleaving HIV1—5.4 (SEQ ID NO:340). According to the structure of the I-CreI protein bound to its target, there is no contact between the 4 central base pairs (positions −2 to 2) and the I-CreI protein (Chevalier et al., Nat. Struct. Biol., 2001, 8, 312-316; Chevalier and Stoddard, Nucleic Acids Res., 2001, 29, 3757-3774; Chevalier et al., J. Mol. Biol., 2003, 329, 253-269). Thus, it is difficult to rationally choose a set of positions to mutagenize, and mutagenesis was performed on the whole protein. Random mutagenesis results in high complexity libraries. Therefore, to limit the complexity of the variant libraries to be tested, only one of the two components of the heterodimers cleaving HIV1—5 (SEQ ID NO:337) was mutagenized.
Thus, in a first step, proteins cleaving HIV1—5.3 (SEQ ID NO:339) were mutagenized and their homodimeric cleavage activity was determined, and in a second step, it was assessed whether they could cleave HIV1—5 (SEQ ID NO:337) when co-expressed with a protein cleaving HIV1—5.4 (SEQ ID NO:340).
a) Construction of Libraries by Random Mutagenesis
Random mutagenesis was performed on a pool of chosen variants, by PCR using Mn2+. PCR reactions were carried out that amplify the I-CreI coding sequence using the primers preATGCreFor (5′-gcataaattactatacttctatagacacgcaaacacaaatacacagcggccttgccacc-3′; SEQ ID NO: 24) and ICreIpostRev (5′-ggctcgaggagctcgtctagaggatcgctcgagttatcagtcggccgc-3′; SEQ ID NO: 25), which are common to the pCLS0542 (
b) Mating of Meganuclease Expressing Clones and Screening in Yeast
Experiment were performed as previously described in example 25. Positive resulting clones were verified by sequencing (MILLEGEN) as described in example 25.
c) Variant-Target Yeast Strains, Screening and Sequencing
The yeast strain FYBL2-7B (MAT a, ura3Δ851, trp1Δ63, leu2Δ1, lys2Δ202) containing the HIV1—5 target (SEQ ID NO:337) in the yeast reporter vector (pCLS1055,
Two variants cleaving HIV1—5.3 (SEQ ID NO:339), were pooled, randomly mutagenized and transformed into yeast. The sequences of the variants subjected to random mutagenesis are described in table XXXVI.
2232 transformed clones were screened for cleavage against the HIV1—5.3 (SEQ ID NO:339) and HIV1—5.5 (SEQ ID NO:341) DNA targets. A total of 20 positive clones were found to cleave HIV1—5.3 (SEQ ID NO:339), while none of those cleaved the HIV1—5.5 target (SEQ ID NO:341). Sequencing of the 20 clones allowed the identification of 13 novel endonuclease variants. An example of these variants is presented in table XXXIX and in
6S 30S 33C 38Y 44A 68H 70S 75Y 77N 80K
The 20 clones showing cleavage activity on target HIV1—5.3 (SEQ ID NO:339) were also mated with a yeast strain that contains (i) the HIV1—5 target (SEQ ID NO:337) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—5.4 target (SEQ ID NO:340) (SEQ ID 252; I-CreI 30T, 33G, 44K, 68Y, 70S, 77R+151A or KTSGQS/KYSDR+151A, according to the nomenclature of Table I). After mating with this yeast strain, no clones were found to cleave the HIV1—5 target (SEQ ID NO:337).
In order to further improve the activity of the obtained meganucleases, a second round of random mutagenesis was carried out following the same rationale of example 28. For this purpose, ten variants cleaving HIV1—5.3 (SEQ ID NO:339) were mutagenized, and variants were screened for cleavage activity of HIV1—5.3 (SEQ ID NO:339) and HIV1—5.5 (SEQ ID NO:341) targets. Additionally, the mutants with the strongest activity were screened for cleavage activity of HIV1—5 (SEQ ID NO:337) when co-expressed with a variant cleaving HIV1—5.4 (SEQ ID NO:340).
The materials and methods have previously been described in example 28.
Ten variants cleaving HIV1—5.3 (SEQ ID NO:339), were pooled, randomly mutagenized and transformed into yeast. The variants submitted to random mutagenesis correspond to variants described in Table XXXIX (SEQ ID NO: 256 to 265).
2232 transformed clones were screened for cleavage against the HIV1—5.3 (SEQ ID NO:339) and HIV1—5.5 (SEQ ID NO:341) DNA targets. A total of 80 positive clones were found to cleave HIV1—5.3 (SEQ ID NO:339), while 25 of those cleaved also the HIV1—5.5 target (SEQ ID NO:341). Sequencing of the 80 clones allowed the identification of 39 novel endonuclease variants. An example of the identified variants is presented in table XXXX and
The 80 clones showing cleavage activity on target HIV1—5.3 (SEQ ID NO:339) were then mated with a yeast strain that contains (i) the HIV1—5 target (SEQ ID NO:337) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—5.4 target (SEQ ID NO:340) (I-CreI 30S, 33N, 44K, 68Y, 70S, 77R+103T or KSSNQS/KYSDR+103T (SEQ ID NO:276), according to the nomenclature of Table I). After mating with this yeast strain, 4 clones were found to cleave the HIV1—5 (SEQ ID NO:337). Thus, 4 positives contained proteins able to form heterodimers with KSSNQS/KYSDR+103T (SEQ ID NO: 276) showing cleavage activity on the HIV1—5 target (SEQ ID NO:337). An example of positives is shown in
24F 33C 38Y 44A 68Y 70Q 72Y 75N 107R 153Y
163G 164G
Three of the I-CreI variants cleaving HIV1—5.3 (SEQ ID NO:339) described in Table XL were mutagenized by introducing selected amino-acid substitutions in the proteins and screening for more efficient variants cleaving HIV1—5 (SEQ ID NO:337) in combination with a variant cleaving HIV1—5.4 (SEQ ID NO:340).
Six amino-acid substitutions have been found in previous studies to enhance the activity of I-CreI derivatives: these mutations correspond to the replacement of Glycine 19 with Serine (G19S), Phenylalanine 54 with Leucine (F54L), Glutamic acid 80 with Lysine (E80K), Phenylalanine 87 with Leucine (F87L), Valine 105 with Alanine (V105A) and Isoleucine 132 with Valine (I132V). These mutations were introduced into the coding sequence of proteins cleaving HIV1—5.3 (SEQ ID NO:339), and the resulting proteins were tested for their ability to induce cleavage of the HIV1—5 target (SEQ ID NO:337), upon co-expression with a variant cleaving HIV1—5.4 (SEQ ID NO:340), as well as for the ability to cleave targets HIV1—5.3 (SEQ ID NO:339) and HIV1—5.5 (SEQ ID NO:341).
a) Site-Directed Mutagenesis
Site-directed mutagenesis libraries were created by PCR on a pool of chosen variants. For example, to introduce the G19S substitution into the coding sequence of the variants, two separate overlapping PCR reactions were carried out that amplify the 5′ end (residues 1-24) or the 3′ end (residues 14-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using a primer with homology to the vector (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) and a primer specific to the I-CreI coding sequence for amino acids 14-24 that contains the substitution mutation G19S (G19SF 5′-gccggctttgtggactctgacggtagcatcatc-3′ (SEQ ID NO: 47) or G19SR 5′-gatgatgctaccgtcagagtccacaaagccggc-3′ (SEQ ID NO: 48)).
The same strategy is used with the following pair of oligonucleotides to introduce the mutations leading to the F54L, E80K, F87L, V105A and I132V substitutions in the coding sequences of the variants, respectively:
For each substitution to be introduced, the resulting PCR products contain 33 bp of homology with each other. The PCR fragments were purified. The ten PCR fragments were pooled en equimolar amounts to generate a mix containing 50 ng of PCR DNA and 75 ng of vector DNA (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
The experimental procedure is as described in example 28.
d) Sequencing of Variants
The experimental procedure is as described in example 25.
A library containing a population harboring the six amino-acid substitutions (Glycine 19 with Serine, Phenylalanine 54 with Leucine, Glutamic acid 80 with Lysine, Phenylalanine 87 with Leucine, Valine 105 with Alanine and Isoleucine 132 with Valine) was constructed on a pool of three variants cleaving HIV1—5.3 (SEQ ID NO:339) (SEQ ID NO: 266, 269 and 270; described in Table XL). 558 transformed clones were screened for cleavage against the HIV1—5.3 (SEQ ID NO:339) and HIV1—5.5 (SEQ ID NO:341) DNA targets. A total of 450 positive clones were found to cleave HIV1—5.3 (SEQ ID NO:339), while 435 of those cleaved also the HIV1—5.5 target (SEQ ID NO:341). An example of positive variants is shown in
The 558 transformed clones were also mated with a yeast strain that contains (i) the HIV1—5 target (SEQ ID NO:337) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—5.4 target (SEQ ID NO:340) (I-CreI 30S, 33N, 44K, 68Y, 70S, 77R+103T or KSSNQS/KYSDR+103T (SEQ ID NO:276), according to the nomenclature of Table I). After mating with this yeast strain, 444 clones were found to cleave the HIV1—5 (SEQ ID NO:337). Thus, 444 positives contained proteins able to form heterodimers with KSSNQS/KYSDR+103T (SEQ ID NO: 276) showing cleavage activity on the HIV1—5 target (SEQ ID NO:337). An example of positive clones is shown in
Sequencing of the 93 clones with the highest cleavage activity on the HIV1—5 target (SEQ ID NO:337) allowed the identification of 50 different endonuclease variants.
The sequence of ten I-CreI variants cleaving the HIV1—5 target (SEQ ID NO:337) when forming a heterodimer with the KSSNQS/KYSDR+103T variant (SEQ ID NO:276) are listed in Table XLI.
19S 33C 38Y 44A 50R 60E 68Y 70Q 75N 79T 85R
19S 33C 38Y 44A 68Y 70Q 75N 79T 85R 105A
105A 132V
19S 33C 38Y 44A 50R 60E 68Y 70Q 75N 79T 85R
As a complement to example 29 we also decided to perform random mutagenesis with variants that cleave HIV1—5.4 (SEQ ID NO:340). The variants generated were screened for their cleavage activity on targets HIV1—5.4 (SEQ ID NO:340) and HIV1—5.6 (SEQ ID NO:342); and the mutagenized proteins cleaving HIV1—5.4 (SEQ ID NO:340) were then tested to determine if they could efficiently cleave HIV1—5 (SEQ ID NO:337) when co-expressed with a protein cleaving HIV1—5.3 (SEQ ID NO:339).
a) Construction of Libraries by Random Mutagenesis
Random mutagenesis was performed on a pool of chosen variants, by PCR using Mn2+. PCR reactions were carried out that amplify the I-CreI coding sequence using the primers preATGCreFor (5′-gcataaattactatacttctatagacacgcaaacacaaatacacagcggccttgccacc-3′; SEQ ID NO: 24) and ICreIpostRev (5′-ggctcgaggagctcgtctagaggatcgctcgagttatcagtcggccgc-3′; SEQ ID NO: 25). Approximately 25 ng of the PCR product and 75 ng of vector DNA (pCLS1107,
b) Variant-Target Yeast Strains, Screening and Sequencing
The yeast strain FYBL2-7B (MAT a, ura3Δ851, trp1Δ63, leu2Δ1, lys2Δ202) containing the HIV1—5 target (SEQ ID NO:337) in the yeast reporter vector (pCLS1055,
Nine variants cleaving HIV1—5.4 (SEQ ID NO:340) were pooled, randomly mutagenized and transformed into yeast. The sequences of the variants subjected to random mutagenesis are described in Table XXXVIII.
2232 transformed clones were screened for cleavage against the HIV1—5.4 (SEQ ID NO:340) and HIV1—5.6 (SEQ ID NO:342) DNA targets. A total of 53 positive clones were found to cleave HIV1—5.4 (SEQ ID NO:340), while 6 of those also cleaved the HIV1—5.6 target (SEQ ID NO:342). Sequencing of the 53 clones showing the strongest activity allowed the identification of 42 novel endonuclease variants. An example of the identified variants is presented in Table XLII and in
The 53 positive clones showing the highest cleavage activity on target HIV1—5.4 (SEQ ID NO:340) were then mated with a yeast strain that contains (i) the HIV1—5 target (SEQ ID NO:337) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—5.3 target (SEQ ID NO:339) (I-CreI 33C, 38Y, 44A, 68Y, 70Q, 75N+89A or KNSCYS/AYQNI+89A, according to the nomenclature of Table I; SEQ ID NO:256). After mating with this yeast strain, no clones were found to cleave the HIV1—5 target (SEQ ID NO:337).
In order to further improve the activity of the obtained meganucleases, a second round of random mutagenesis was carried out following the same rationale of example 30. For this purpose, six variants cleaving HIV1—5.4 (SEQ ID NO:340) were mutagenized, and variants were screened for cleavage activity of HIV1—5.4 (SEQ ID NO:340) and HIV1—5.6 (SEQ ID NO:342) targets. Additionally the mutants were screened for cleavage activity of HIV1—5 (SEQ ID NO:337) when co-expressed with a variant cleaving HIV1—5.3 (SEQ ID NO:339).
The materials and methods have previously been described in example 30.
Six variants cleaving HIV1—5.4 (SEQ ID NO:340), were pooled, randomly mutagenized and transformed into yeast. The six variants submitted to random mutagenesis correspond to variants described in Table XLII (SEQ ID NO: 276 and 288 to 292).
2232 transformed clones were screened for cleavage against the HIV1—5.4 (SEQ ID NO:340) and HIV1—5.6 (SEQ ID NO:342) DNA targets. A total of 21 positive clones were found to cleave HIV1—5.4 (SEQ ID NO:340), while 9 of those cleaved also the HIV1—5.6 target (SEQ ID NO:342). Sequencing of the 21 clones allowed the identification of 16 novel endonuclease variants. An example of the identified variants is presented in Table XLIII and
The 21 positive clones showing cleavage activity on target HIV1—5.4 (SEQ ID NO:340) were then mated with a yeast strain that contains (i) the HIV1—5 target (SEQ ID NO:337) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—5.3 target (SEQ ID NO:339) (I-CreI 33C, 38Y, 44A, 68Y, 70Q, 75N+89A or KNSCYS/AYQNI+89A, according to the nomenclature of Table I; SEQ ID NO:256). After mating with this yeast strain, no clones were found to cleave the HIV1—5 target (SEQ ID NO:337).
6S 33S 44R 54I 68Y 70S 75N 77Q 124V 158R 163T
2Y 16L 30S 33S 44R 66H 68Y 70S 75N 77N 82E 89S
103S 147A
128R 146V 151A
Two of the I-CreI variants cleaving HIV1—5.4 (SEQ ID NO:340) described in Table XLIII were mutagenized by introducing selected amino-acid substitutions in the proteins and screening for more efficient variants cleaving HIV1—5.4 (SEQ ID NO:340) and HIV1—5.6 (SEQ ID NO:342), as well as for cleavage of the HIV1—5 (SEQ ID NO:337) target when in combination with a variant cleaving HIV1—5.3 (SEQ ID NO:339).
Six amino-acid substitutions have been found in previous studies to enhance the activity of I-CreI derivatives: these mutations correspond to the replacement of Glycine 19 with Serine (G19S), Phenylalanine 54 with Leucine (F54L), Glutamic acid 80 with Lysine (E80K), Phenylalanine 87 with Leucine (F87L), Valine 105 with Alanine (V105A) and Isoleucine 132 with Valine (I132V).
a) Site-Directed Mutagenesis
Site-directed mutagenesis libraries were created by PCR on a pool of chosen variants. For example, to introduce the G19S substitution into the coding sequence of the variants, two separate overlapping PCR reactions were carried out that amplify the 5′ end (residues 1-24) or the 3′ end (residues 14-167) of the I-CreI coding sequence. For both the 5′ and 3′ end, PCR amplification is carried out using a primer with homology to the vector (Gal10F 5′-gcaactttagtgctgacacatacagg-3′ (SEQ ID NO: 16) or Gal10R 5′-acaaccttgattggagacttgacc-3′ (SEQ ID NO: 17)) and a primer specific to the I-CreI coding sequence for amino acids 14-24 that contains the substitution mutation G19S (G19SF 5′-gccggctttgtggactctgacggtagcatcatc-3′ (SEQ ID NO: 47) or G19SR 5′-gatgatgctaccgtcagagtccacaaagccggc-3′ (SEQ ID NO: 48)).
The same strategy is used with the following pair of oligonucleotides to introduce the mutations leading to the F54L, E80K, F87L, V105A and I132V substitutions in the coding sequences of the variants, respectively:
For each substitution to be introduced, the resulting PCR products contain 33 bp of homology with each other. The PCR fragments were purified. The ten PCR fragments were pooled en equimolar amounts to generate a mix containing 50 ng of PCR DNA and 75 ng of vector DNA (pCLS0542,
c) Mating of Meganuclease Expressing Clones and Screening in Yeast
The experimental procedure is as described in example 28.
d) Sequencing of Variants
The experimental procedure is as described in example 25.
A library containing a population harboring the six amino-acid substitutions (Glycine 19 with Serine, Phenylalanine 54 with Leucine, Glutamic acid 80 with Lysine, Phenylalanine 87 with Leucine, Valine 105 with Alanine and Isoleucine 132 with Valine) was constructed on a pool of two variants cleaving HIV1—5.4 (SEQ ID NO:340) (SEQ ID NO: 297 and 299; described in Table XLIII). 558 transformed clones were screened for cleavage against the HIV1—5.4 (SEQ ID NO:340) and HIV1—5.6 (SEQ ID NO:342) DNA targets. A total of 378 positive clones were found to cleave HIV1—5.4 (SEQ ID NO:340), while 321 of those cleaved also the HIV1—5.6 target (SEQ ID NO:342). An example of positive variants is shown in
The 558 transformed clones were also mated with a yeast strain that contains (i) the HIV1—5 target (SEQ ID NO:337) in a reporter plasmid (ii) an expression plasmid containing a variant that cleaves the HIV1—5.3 target (SEQ ID NO:339) (I-CreI 33C, 38Y, 44A, 68Y, 70Q, 75N+89A or KNSCYS/AYQNI+89A (SEQ ID NO:256), according to the nomenclature of Table I). After mating with this yeast strain, 137 clones were found to cleave the HIV1—5 (SEQ ID NO:337). Thus, 137 positives contained proteins able to form heterodimers with KNSCYS/AYQNI+89A (SEQ ID NO: 256) showing cleavage activity on the HIV1—5 target (SEQ ID NO:337). An example of positives is shown in
Sequencing of the 93 clones with the highest cleavage activity on the HIV1—5 target (SEQ ID NO:337) allowed the identification of 48 different endonuclease variants.
The sequence of ten I-CreI variants cleaving the HIV1—5 target (SEQ ID NO:337) when forming a heterodimer with the KNSCYS/AYQNI+89A (SEQ ID NO:256) variant are listed in Table XXXXIV.
19S 33S 44R 541 68Y 70S 75N 77Q 124V 158R 163T
19S 30S 33N 44K 68Y 70S 77R 103T 132V 142R
19S 30S 33N 44K 68Y 70S 77R 103T 142R 160E
Coexpression of the variants cleaving the non-palindromic targets used during the custom meganuclease development process described in previous examples leads to cleavage of the corresponding DNA target in yeast. Different mutants were selected, either showing a high cleavage activity as heterodimers in the corresponding non-palindromic targets, or a high cleavage activity as homodimers in the HIV1_N.5 and in the HIV1_N.6 pseudo-palindromic targets (N standing for any of the targets described in the present patent application: 1, 3, 4, 5, 7, 8 and 9). In all cases the mutant cleaving the HIV1_N.5 target and the mutant cleaving the HIV1_N.6 target will be called Ma and Mb. This nomenclature is not related to the identity of the HIV1_N.5 or HIV1_N.6 cutter, but to the position in the single chain molecule (Ma being the N-terminal mutant and Mb being the C-terminal mutant).
Single chain constructs were engineered using the linker RM2 (AAGGSDKYNQALSKYNQALSKYNQALSGGGGS) (SEQ ID NO: 345) resulting in the production of the canonical single chain molecule: Ma-RM2-Mb. During this design step, the G19S mutation was introduced in the C-terminal (Mb) mutant. In addition, mutations K7E and K96E were introduced into the Ma mutant, while mutations E8K and E61R were introduced into the Mb mutant. This leads to the generation of the single chain molecule: Ma(K7E K96E)-RM2-Mb(E8K E61R) that is called SCOH-HIV1-MaMb.
Four additional amino-acid substitutions have been found in previous studies to enhance the activity of I-CreI derivatives: these mutations correspond to the replacement of Phenylalanine 54 with Leucine (F54L), Glutamic acid 80 with Lysine (E80K), Valine 105 with Alanine (V105A) and Isoleucine 132 with Valine (I132V). Certain combinations of these mutations were introduced into the coding sequence of N-terminal and C-terminal protein fragment (if these mutations were not present in the original mutants). The coding sequences of the single chain proteins were cloned into a mammalian expression vector, and their activity on the corresponding target in the HIV1 genome was tested in a cellular model developed for this purpose. Table XLV shows an example of the single chain molecules that have been generated for the different HIV1 targets.
a) Cloning of the SC_OH Single Chain Molecules
A series of synthetic gene assemblies were ordered to MWG-EUROFINS. Synthetic genes coding for the different single chain variants targeting the HIV1 provirus were cloned in pCLS1853 (
The efficacy of HIV meganucleases to cleave the corresponding proviral DNA target was assessed in a cellular system containing a defective integrated provirus. This cellular model produces viral-like particles (VLPs) containing all the essential HIV1 proteins with the exception of the viral envelope glycoproteins. Nevertheless, the produced VLPs are not able to infect the cells due to the absence of entry-mediating proteins in the viral envelope. Production of VLPs can be measured in the supernatants of cultured cells using an HIV1-p24 ELISA kit. The VLP-producing cells were transfected with the plasmids coding for the different versions of the SCOH-HIV1 meganucleases (SEQ ID NO:346 to 365) and the antiviral effect was measured by the reduction in the titres of p24 present in the supernatants of transfected cells respect to a “control” sample in which the cells were transfected by a non-related meganuclease (NRM), which has no cleavage activity on the HIV1 proviral DNA.
1) Material and Methods
a) Generation of a Cellular System Allowing to Test the Antiviral Activity of HIV1 Meganucleases (SEQ ID NO:346 to 365)
A cell line capable of producing non-replicative VLPs was generated in order to dispose of a model allowing to determine the efficacy of antiviral meganucleases. With the aim of introducing an HIV provirus in the cells, a lentiviral vector pseudotyped by the VSV envelope protein was used to transduce the HEK-293 human cell line. In order to avoid viral replication on the cellular model, the integrated provirus harbours deletion of the HIV1 accessory proteins (Vif, Vpr, Vpu and Nef) as well as of the viral envelope glycoprotein (env). A cassette conferring puromycin resistance to the cell line was introduced, as well as the EGFP coding sequence (EF1alfa.p-PuroR-IRES-EGFP) to replace the env coding sequence.
For safety reasons, two other HIV1 essential proteins have been deleted from the proviral sequence, those of the Tat and the Rev proteins, which are essential for the production of viral progeny.
To produce the cellular system, two retroviral vectors were generated harbouring either the tat or the rev coding sequences. These two vectors were used to sequentially transduce HEK-293 cells, leading to the generation of a cell line able to produce the tat and rev proteins after integration of the retroviral vectors in the cellular genome. The generated cell line was then transduced by a lentiviral expression vector that, after integration of the dsDNA resulting from reverse transcription, would generate the pseudo-HIV1 provirus containing the meganuclease target hits. The structure of the integrated provirus correspond to the sequence elements U3RU5(HIV)-PsiGAGPOL(HIV)-EF1a:Puro:IRES:GFP-U3RU5(HIV) and is schematically represented in
The cells were tested for their ability to produce VLPs by determining the presence of the HIV1 p24 protein in the culture supernatants using the Alliance® HIV1-p24 ELISA Kit (Perkin Elmer Inc, Waltham, Mass., USA). In a next step, the VLP producing cells were subjected to clonal dilutions in order to characterize the number of pseudo HIV1 integrated provirus in different clones. A cellular clone (HEK293-VLP-CL40) containing between 1 and 2 copies of the pseudo HIV1 provirus (as determined by qPCR) was used for assessing the antiviral activity of meganucleases.
HEK293-VLP-CL40 cells were cultured in DMEM media supplemented with 2 mM L-glutamine, penicillin (100 IU/ml), streptomycin (100 mg/ml), amphotericin B (Fongizone: 0.25 mg/ml, Invitrogen-Life Science) and 10% of foetal bovine serum (FBS).
b) Transfection of HEK293-VLP-CL40 Cells
The day before transfection, HEK293-VLP-CL40 cells were seeded in 12-well culture plates (Falcon, Becton Dickinson, Le Pont De Claix, France) at 105 cells per well and incubated overnight at 37° C. in 1 ml of complete growth medium. The cultures were about 70% confluent on the day of transfection. Transfection with 1 g of plasmid expressing I-CreI variants cleaving different HIV1 target sequences was done using FuGENE® HD Transfection Reagent (Roche Diagnostics, Indianapolis, Ind., USA) according to manufacturer's instruction. Transfection media was replaced 24 h after transfection and cells were kept at 37° C. in complete growth medium for other 24 hours.
c) Cell Harvesting and p24 Determination
Cell supernatants were harvested 48 h post-transfection and p24 titres were either measured immediately or the supernatants were kept at −20° C. for ulterior quantification of the p24. HEK-293-CL40 transfected cells were then recovered and counted, prior to centrifugation at 1500 rpm for 5 minutes and storage of the dry cellular pellet at −20° C. for ulterior extraction of the genomic DNA.
The amount of p24 present in cellular supernatants was determined using the Alliance® HIV1-p24 ELISA Kit (Perkin Elmer Inc, Waltham, Mass., USA) according to the manufacturer's instructions. Results were expressed as p24 in pg/ml (or as pg/well, according to the cell culture conditions). The production of p24 was normalized by the number of cells present in the well at the moment of media harvesting, and expressed as p24 levels in fg/cell.
2) Results
The single chain molecules described in Table XLV (SEQ ID NO: 346 to 365) were tested for their ability to target the HIV1 provirus and reduce the amount of VLPs produced in the HEK293-VLP-CL40 cellular model. Cells were transfected with 1 μg of plasmid expressing the meganuclease variants and the level of p24 present in the culture supernatants was determined 48 h after transfection, as previously described. As a control, a non related meganuclease (NRM) was transfected. This NRM is not active against the HIV1 provirus and should have no effect on the level of p24 produced by NRM transfected cells. The p24 levels of NRM transfected cells, expressed in fg/cell, was considered as 100% of VLP production, and the p24 levels present in samples transfected with HIV meganucleases were compared to the NRM value, in order to determine the percentage of VLP production in these samples.
a) Sequences Targeted in the HIV1 Provirus by the HIV1 Meganucleases
The meganuclease target sites have already been described except for the HIV1—7 (SEQ ID NO:366), HIV1—8 (SEQ ID NO:367) and HIV1—9 (SEQ ID NO:368) targets.
The HIV1—1 target (SEQ ID NO:319), described in example 1, is located in the U3 region of the proviral LTRs; while the HIV1—3 target (SEQ ID NO:325), described in example 8, is located in the U5 region of the proviral LTRs. Since the LTRs are duplicated sequences flanking the viral ORFs in the integrated provirus, each of these two targets are present twice in the HIV1 provirus.
The HIV1—4 target (SEQ ID NO:331) has been described in example 16, and is located in the gag gene of the HIV1 provirus, more precisely in the coding sequence of the p24 (CApsid) protein. The HIV1—7 target (G GAG CC ACC CCAC AAG AT TTA A, SEQ ID NO: 366) also cleaves the coding sequence of the p24 protein, though at a different position. The HIV1—7 target (SEQ ID NO:366) is also a 22 bp (non-palindromic) target precisely located at positions 1321-1342 of the HIV-1 pNL4-3 vector (accession number AF324493, Adachi et al., J. Virol., 1986, 59, 284-291), a subtype B infectious molecular clone.
The HIV1—5 target (SEQ ID NO:337) has been described in example 24, and is located in the pol gene of the HIV1 provirus, more precisely in the sequence coding for the PRotease protein. The HIV1—9 target (SEQ ID NO:368) also cleaves the coding sequence of the protease, though at a different position. The HIV1—9 target (A GAA AT CTG TTGA CTC AG ATT G, SEQ ID NO: 368) is also a 22 bp (non-palindromic) target located at positions 2511-2532 of the HIV-1 pNL4-3 vector.
The HIV1—8 target (G GGC CC CTA GGAA AAA GG GCT G, SEQ ID NO: 367) is a 22 bp (non-palindromic) target located in the gag gene of the HIV1 provirus. This target is precisely located at positions 2006-2027 of the HIV-1 pNL4-3 vector, on the coding sequence of the p7 (NC, NucleoCapsid) protein.
Over again, it should be noted that two cleavage sites are present in the HIV1 proviral DNA for targets HIV1—1 (SEQ ID NO:319) and HIV1—3 (SEQ ID NO:325), while the remaining targets present only one cleavage site in the integrated provirus.
The presence of the HIV1 meganuclease cleavage sites in the HEK293-VLP-CL40 cells was confirmed by sequencing and their position is represented in
b) I-CreI Variants Targeting the HIV1 Genome Induce a Decrease in p24 Titres in a Cellular Model Harbouring an HIV1 Provirus
p24 titres were determined 48 hours after transfection with the HIV1 meganucleases as previously described. The values, expressed as p24 in fg/cell, were normalized respect to the amount of p24 released in a well transfected by a NRM, which was considered to be 100% for VLP production.
A significant reduction of p24 titers, ranging from 35-40%, is observed also for other I-CreI variants cleaving different targets in the HIV1 provirus (SCOH-HIV1—1-B, SEQ ID NO: 347; SCOH-HIV1—7-A, SEQ ID NO: 357; SCOH-HIV1—8-D, SEQ ID NO: 362; and SCOH-HIV1—9-B, SEQ ID NO: 364).
I-CreI variants targeting the HIV1—8 target (SEQ ID NO:367), as well as their activity have been described in Examples 32 and 33. The efficiency of two of the HIV1—8 meganucleases (SEQ ID NO:359 to 362) to cleave their endogenous DNA target sequence was next tested. This example will demonstrate that meganucleases engineered to cleave the HIV1—8 target sequence (SEQ ID NO:367) cleave their cognate endogenous site in human cells harboring an integrated HIV1 provirus (HEK293-VLP-CL40 cells).
Repair of double-strand break by non homologous end-joining (NHEJ) can generate small deletions and insertions (InDel) (
Two Single Chain I-CreI variants targeting the HIV1—8 target (SEQ ID NO:367) cloned in the pCLS1853 plasmid were used for this experiment. The day previous to the experiment, cells derived from the human embryonic kidney cell line, 293-H (HEK293-VLP-CL40) were seeded in a 10 cm dish at density of 106 cells/dish.
The following day, cells were transfected with 3 μg of an empty plasmid or a meganuclease-expressing plasmid using FuGene® HD Transfection Reagent (Roche Diagnostics, Indianapolis, Ind., USA) according to manufacturer's instruction. 72 hours after transfection, cells were collected and diluted (dilution 1/20) in fresh culture medium. After 7 days of culture, cells were collected and genomic DNA extracted. 200 ng of genomic DNA were used to amplify the endogenous locus surrounding the meganuclease cleavage site by PCR amplification. A 325 bp fragment corresponding to the HIV1—8 locus was amplified using specific PCR primers HI8f (SEQ ID NO 369; 5′-GACCCGGCCATAAAGCAAGAGTTTTGGCTG-3′) and HI8r (SEQ ID NO 370; 5′-AAGCTCTCTTCTGGTGGGGCTGTTGGCTCT-3′). PCR amplification was performed to obtain a fragment flanked by specific adaptator sequences (SEQ ID NO 371; 5′-CCATCTCATCCCTGCGTGTCTCCGACTCAG-3′ and 25 SEQ ID NO: 372 5′-CCTATCCCCTGTGTGCCTTGGCAGTCTCAG-3′) provided by the company offering sequencing service (GATC Biotech AG, Germany) on the 454 sequencing system (454 Life Sciences).
An average of 3,000 sequences was obtained from pools of the amplicons (500 ng). After sequencing, different samples were identified based on barcode sequences introduced in the first of the above adaptators. 15 sequences showed the presence of insertions or deletions in the cleavage site of HIV1—8 meganucleases (SEQ ID NO:359 to 362).
Table XLVI summarizes the results that were obtained.
The analysis of the genomic DNA extracted from cells transfected with the meganucleases targeting the HIV1—8 locus showed that around 1% of the analyzed sequences contained InDel events within the recognition site of HIV1—8 meganucleases (SEQ ID NO:359 to 362) (Table XLVI). Since small deletions or insertions could be related to PCR or sequencing artefacts, the same locus was analyzed after transfection with a plasmid that does not express the meganuclease. The analysis of the HIV1—8 locus revealed that no InDel events could be detected. These data demonstrate that meganucleases engineered to target the HIV1—8 locus are active in human cells and can cleave their cognate endogenous sequence. Moreover, it shows that meganucleases have the ability to generate small InDel events within a sequence which would disrupt a gene ORF and thus inactivate the corresponding gene expression product.
Number | Date | Country | Kind |
---|---|---|---|
PCT/IB2009/005582 | Apr 2009 | IB | international |
The present application is a division of U.S. Ser. No. 13/265,575, filed Feb. 10, 2012, which is a National Stage (371) of PCT/IB2010/051746, filed Apr. 21, 2010, and claims priority to PCT/IB2009/005582, filed Apr. 21, 2009.
Number | Date | Country | |
---|---|---|---|
Parent | 13265575 | Feb 2012 | US |
Child | 14064775 | US |