The invention relates to methods for introducing mutation within genes (gene editing), which use the reparation machinery of the cell and in particular gene editing by the CRISPR system.
Gene editing (GE) by the CRISPR/Cas system is a tool widely used worldwide to edit prokaryotic and eukaryotic genomes. (Jinek, M. et al. Science 337, 816-821 2012, Cong, L. et al. Science 339, 819-823 2013) Three types of editing can be obtained after a Double Stranded-Break (DSB) made by a CRISPR/Cas system. SDN1 editing means that the repair of the DNA strands is made by one pathway of the cellular DNA repair machinery: the Non-Homologous End Joining pathway (NHEJ). The repair can result in small insertions or deletions (indels) at the cleavage site.
SDN2 is an editing where the repair is made by the insertion of a sequence by homologous recombination (HR) using a template. The insertion is usually identical to the target sequence but comprises a desired correction.
SDN3 is also an editing where the repair is made by HR but the insertion is a new genetic material.
Base editing (BE) has become an important tool in genome engineering. Currently, tools exist to target Cytidine residues and more recently Adenine residues (Cytidine Base Editors and Adenine Base Editors) (Kim (2018), Rees and Liu (2018)). These Base Editors consist of a cytidine deaminase or an adenine deaminase linked to a DNA targeting system allowing the deaminase activity to be directed to a specific DNA region. Deaminase activity for the cytidine deaminase converts in majority a C residue to a T. However, depending on the enzyme and BE construction used, other bases can be introduced. The Adenine BE is more specific, converting an A residue only to a G residue.
The applicant has developed a method that is both an alternative and an improvement to CRISPR/Cas9 Base Editing and to small SDN2 by using iterative Cas9 gene editing to modify and direct the change of several bases in one or two codons without a Base-editor.
The technique is able to direct the editing of one or two codons in a nucleic acid sequence and can generate a modification of one or two specific amino acids in the resulting protein sequence. The Directed Codon Re-Write is achieved at the position cleaved by the CRISPR Cas endonuclease which makes it very versatile when used for example with a broad PAM variant such as xCas9. As it doesn't require a template sequence for SDN2 and homologous recombination, it is expected to be more efficient than oligo-based SDN2.
The method herein disclosed is alternatively called Codon replacement or codon re-write or directed codon re-write.
A codon is a series of three successive nucleotides in a nucleic acid sequence encoding an amino acid or a stop signal during protein synthesis.
Nucleotide and base are used alternatively to designate the basic components of a nucleic acid sequence: Adenine (A), Cytosine (C), Guanine (G) and Thymine (T).
A PAM is a Protospacer Adjacent Motif, which is a DNA sequence (usually 2-6-base pair) in the DNA sequence targeted by the endonuclease in the CRISPR system. The motif is essential for the recognition of the targeted region by the CRISPR endonuclease. A PAM sequence is specific of a CRISPR endonuclease. The PAM sequence of a CRISPR endonuclease determines the possible area that can be targeted in the targeted genome.
The wild type sequence, or original sequence, or initial sequence, is the target sequence before performance of the method, and thus before introduction of mutations.
The method of the invention is based on the gene editing CRISPR technology and the error-prone repairs of the NHEJ cellular DNA repair machinery. The NHEJ pathway is the predominant mechanism for DSB repairs. This pathway joins two broken DNA ends (Lieber 2010). The NHEJ pathway will either repair the DNA and restore the integrity of the original DNA sequence or introduce errors in the repaired DNA, for example small deletions or insertions at the junction.
The codon re-write machinery requires a CRISPR endonuclease generating blunt ends and at least 2 guide RNA (gRNA) used successively or concomitantly.
The method of the invention allows to introduce specific modifications in a targeted organism.
The method can be applied to eukaryotic cells (animal, fungal or plant cells; preferably plants cells; more preferably corn, wheat, rice, rapeseed, sunflower, barley, rye, soybean, cotton, sorghum, beetroot) or prokaryotic cells.
In more details, the invention thus relates to a method for introducing a specific mutation at a specified position of a target gene, comprising
Consequently, the method allows obtaining a cell of an organism in which the specific mutation has been introduced at the specified position (or location) in the target gene. The target gene codes for a given protein. The method is performed when one wishes to modify one or two codons of the protein, so that one or two amino acids of the protein sequence are modified.
The first guide RNA will trigger cleavage of the double-strand DNA at a given position in the target gene. Said position depends on both the location of the PAM close to the given position (it is reminded that the protospacer adjacent motif (PAM) is a short DNA sequence usually 2-6 base pairs in length that follows the DNA region targeted for cleavage by the CRISPR system), and of the endonuclease used.
After cleavage of the DNA strand, the cell DNA repair machinery starts working to repair the cleaved DNA. During this process, some errors can happen, so that usually one, two or three nucleotides are deleted in the repaired DNA sequence at the location of the cleavage. Larger deletions can also occur but are not desired for the codon-rewrite method. Alternatively, some nucleotides insertions are also possible during the process of DNA repair by the cell machinery.
The invention thus uses the fact that the first guide RNA directs the nuclease to cleave the target gene at the targeted position, and that small deletions from 1 to 3 nucleotides are introduced by reparation at the cleavage site in some cells. Consequently, some cells contain a modified targeted position, with one, two or three nucleotides missing at this location.
The second guide RNA directs the nuclease to cleave the target gene at the modified targeted position. Upon this second cleavage, it is intended to have insertion of a desired nucleotide during reparation at the cleavage site, the desired nucleotide being different from the original nucleotide at this position, thereby introducing by reparation of the target gene the specific mutation at the specified position of the target gene.
This embodiment as explained when only two guides RNA are used (substitution of one nucleotide by successive deletion and insertion) can be broadened, as disclosed below, for substituting more than one nucleotide, using more guides RNA, to take advantage of the fact that the repair can introduce deletion of more than one nucleotide after cleavage directed by the first guide RNA.
In one embodiment, the plurality of cells is a tissue of the organism (a part of the organism which is typically self-contained and has a specific function). In another embodiment, the plurality of cells is a tissue. In another embodiment, the plurality of cells is a suspension of cells of the organism. In yet another embodiment, the plurality of cells are cells that have been made adherent to the culture recipient.
In a preferred embodiment, the organism is a plant. In particular, the plant is a monocotyledon, preferably a cereal. As monocotyledonous plants, one can cite cereals like rice, wheat, barley, sorghum, oat, maize but also sugarcane. The method can advantageously be performed with wheat or maize cells. In another embodiment, the plant is a dicotyledon. Among dicotyledonous plants, one can cite soybean, cotton, tomato, beet, sunflower, or rapeseed.
In a preferred embodiment, the plurality of plant cells are microspores, ovules, protoplasts, zygotes. In a specific embodiment, the cells are plated for biolistics transformation.
In a preferred embodiment the plant tissue is an embryo, meristems like apical meristem, leaves, roots, stems, hypocotyls.
When the method is performed on plant cells, one can use the totipotency property of such plant cells, which makes it possible to regenerate a whole plant from a given cell (for instance after growing the cell and forming a callus from the cultured cells), and hence a plant in which the mutations have been introduced at the specified target locus.
When the method is performed on plant germinal cells (pollen, ovule), the plant germinal cell can be used to fertilize another germinal cell, and to generate a plant so as to introduce the mutation in the resulting plant. In another embodiment, a double haploid is generated, and a plant is regenerated.
In one embodiment, the mutation is the replacement of one nucleotide in the target gene. In this embodiment, one can make use of only two guides RNA. The first guide is intended to direct the first cleavage. it is reminded that the targeting specificity of CRISPR-Cas9 is determined by a 20-nt sequence at the 5′ end of the gRNA. After base pairing of the gRNA to the target, Cas9 mediates the double strand break about 3-4 nucleotides upstream of PAM at the desired target sequence
The second guide RNA is designed to direct the cleavage to the initial sequence at the locus, minus one nucleotide at the site where the cleavage has occurred with the first gRNA. Hence, the second gRNA will only recognize and direct a cleavage for the cells in which repair after the first cleavage was not proper and one deletion was introduced during this repair step. The endonuclease thus cleaves again the double stranded site at the target site with the deletion.
Repair will then occur again, and a random nucleotide may be inserted at the cleavage site. It is thus possible to screen for identifying the cells in which the desired nucleotide has been inserted, in substitution to the original nucleotide, thereby obtaining a cell in which the mutation has been introduced.
In another embodiment, a third guide RNA (ternary guide) is introduced in the cells with the endonuclease protein and the first and second guide RNAs, and
In this embodiment, three guides RNA are used and the method takes advantage of the fact that a deletion of two desired nucleotides may be introduced when the cleavage directed by the first gRNA is repaired, thereby leading to a modified sequence.
In this embodiment, the second guide RNA recognizes the modified sequence (original sequence minus the deletion of two desired nucleotides at the first cleavage site) and is able to direct the endonuclease to perform a cleavage in the cells having this modified sequence. After the cleavage, the repair mechanism will close the double strand, and one random nucleotide may be inserted, leading to secondary modified sequences.
The third RNA guide is specific to the secondary modified sequence in which a first desired nucleotide has been inserted. This first desired nucleotide is different from the original nucleotide that was at this location. The third gRNA directs the endonuclease to cleave the secondary modified sequence.
A second desired nucleotide may be inserted at the repair site. It is thus possible to screen for identifying the cells in which the second desired nucleotide has been inserted.
It is to be noted that the second desired nucleotide, which is inserted after cleavage directed by the third gRNA, may be identical to the original nucleotide that was present at the same location in the original (non-mutated sequence). This may occur when one wishes to modify only the first desired nucleotide in the two nucleotide-deletion.
In this embodiment, one or two codons of the original target gene can be modified if the specific mutation overlaps two codons.
In another embodiment, four guides RNA are used: the first gRNA directs a cleavage in the original sequence, the second gRNA is specific for the original sequence with a deletion of three desired nucleotides at the cleavage site, the third gRNA is specific for the original sequence with a deletion of three desired nucleotides and one insertion of a first desired nucleotide different from the original nucleotide, and the fourth gRNA is specific for the original sequence in which a deletion of three desired nucleotides was made after the first cleavage and after which two desired nucleotides have been inserted successively during repairs (after the cleavages with the second and third gRNA).
In short, the first gRNA directs a cleavage at the predetermined site. Repair may introduce a deletion of three desired nucleotides, thereby leading to a first modified sequence.
The second gRNA is specific to the first modified sequence and directs the cleavage by the endonuclease. Upon repair, insertion of one random nucleotide can arise, thereby leading to a second modified sequence.
The third gRNA is specific for the second modified sequence in which a first desired nucleotide (different from the original one) has been inserted. It directs an endonuclease cleavage and repair, upon which one random nucleotide may be inserted (downstream the first desired nucleotide), leading to a third modified sequence.
The fourth gRNA (quaternary guide) is specific for the third modified sequence, in which a second desired nucleotide has been inserted. It directs an endonuclease cleavage and repair, upon which one random nucleotide may be inserted (downstream the second desired nucleotide), leading to a fourth modified sequence.
Upon screening of the cells having fourth modified sequences, it is possible to identify the cells for which a third desired nucleotide has been inserted downstream the second desired nucleotide.
It is to be noted that, although the first desired nucleotide is different from the original nucleotide at the same location, one or the other two desired nucleotides may be identical to the ones that were in the original sequence. In one embodiment, the first desired nucleotide is different, but the two last reintroduced nucleotides are identical to the ones in the original sequence. In another embodiment, the two first desired nucleotides are different but the last one reintroduced is identical to the one in the original sequence. In another embodiment, the three desired nucleotides are different than the ones in the original sequence.
In this embodiment, a fourth guide RNA is introduced in the cells with the endonuclease protein and the first, second and third guide RNAs, and
The endonuclease, and the guides RNA can be delivered into plants as ribonucleoprotein complexes (RNP), or using vectors comprising genetic constructs encoding the different elements.
The delivery methods are known by the skilled person. For example, mention may be made of electroporation, biolistics, virus-mediated transformation, Agrobacterium mediated plant transformation (Ishida et al., 1996)
Vectors comprising genetic constructs encoding the different elements may be plasmids. The nucleic acids encoding the endonuclease and the gRNA are placed under the control of promoters. Promoters of the invention comprise constitutive promoters like the ZmUbi promoter, the TaU6 promoter, the maize U6 promoter, the maize U3 promoter, the rice U3 promoter and the rice U6 promoter.
Optionally, the vector can comprise a selectable marker like bar, nptll, hygromycin and or a visual marker (GFP, luciferase, GUS).
The RNP complexes comprising the endonuclease and the gRNAs are produced and assembled in vitro and delivered in the target cells (bombardment, electroporation, PEG transformation).
As indicated above, it is preferred when the mutation induces a change of an amino acid in the protein coded by the gene. In another embodiment, the mutation results in a change in two consecutive amino acids of the protein (when the ability of repair to introduce two or three deletions is used, when the specified position comprises two or three nucleotides of two codons of a gene encoding a protein, and wherein the mutation includes the mutation of at least two of these nucleotides, thereby triggering a change of two amino acids in the protein). In another embodiment, the mutation creates a stop codon in the coding sequence, thereby leading to a truncated protein. In another embodiment, the mutation is introduced in a regulatory region of the gene, for instance in the gene's promoter so as to reduce or increase expression of the gene.
In the preferred embodiment, the cells are cultured in vitro to increase their number. In another embodiment, when the organism is a plant, the cells in which the target gene has been mutated are recovered, and a whole plant/plantlet is regenerated (in particular, the cells are allowed to form a callus and the plant is regenerated from the callus). A sample of each regenerated plant/plantlet is taken for analysis. The whole DNA is extracted from each sample and sequenced to detect the presence of the specific mutation. Alternatively, primers specific of the target region in the target genes are used to amplify the target region. The target region is sequenced to detect the presence of the specific mutation.
Screening may be performed on plants regenerated from the plurality of cells. One should note that the regenerated plants are generally chimeras plants (as they contain various mutated cells (several mutation events can be present in view of the iterative method herein disclosed) as well as non-mutated cells. One way of obtaining plants with the mutation in all cells would be to screen the regenerated plants to select the ones in which it is most likely that germinal cells would contain the specific mutation.
One way to perform this screening would be to use a sample (somatic tissue) of the regenerated plant, as described above and to perform a quantitative analysis of the DNA bearing the mutation. The higher the mutation rate in somatic cells, the more likely the mutation occurred early in the cell divisions and regeneration and the higher the chance that germ cells are also mutated and thus that the mutation is transmissible. Consequently, by screening multiple regenerated plants using similar samples from each plant (for instance a circle taken from a leaf by a punch), it is possible, by quantifying the amount of mutated DNA to evaluate the quantity of cells that contain the specific mutation and to select the plants from which these cells were harvested. For instance, it is possible to rank the plants according to the amount of mutation in the sample (thus the importance to take similar samples for mutation DNA analysis), and select the first decile (the 10% of plants that have the highest amount of mutations in the analyzed DNA) for further processing. In another embodiment, the 5% of plants, or the 2% of plants, or the 1% of plants having the highest amount of mutations in the analyzed DNA are selected.
Such selected plants can be used for further crosses with other plants. A percentage of the next generation plants will be heterozygous for the specific mutation.
Another method could be not to analyze the regenerated plants, but to cross these plants and to screen for the presence of the specific mutation in the progeny.
If the specific mutation introduces a herbicide tolerance in plants, the cells are selected on a selective medium comprising said herbicide.
In another embodiment, a plant comprising at least one cell containing a mutation at the specified position of the target gene is regenerated from the cell population.
The endonuclease that is used can be selected from endonucleases known in the art. It is preferably a CRISPR Cas endonuclease, most preferably a Class 2 type II (Cas9) endonuclease. One can use a Cas9 originating from any known host, such as Streptococcus pyogenes (SpCas9) Staphylococcus aureus (SaCas9), Campylobacter jejuni (CjCas9), Streptococcus thermophilus (St1Cas9), Neisseria meningitidis (MnCas9), Francisella novicida (FnCas9), E1369R/E1449H/R1556A triple mutant of Francisella novicida (RHA FnCas9), improved Cas9 (xCas9 described by Hu et al, Nature. 2018 Apr. 5; 556(7699):57-63), or Cas9-NG (Ren et al, Molecular Plants, 12 (7), 2019, 1015-1026). The endonuclease to be used in the methods herein disclosed must generate blunt ends when cleaving the target sequence.
The invention also relates to a method for producing a plant comprising at least one cell containing a mutation at a specified position of a target gene, comprising performing the method for introducing a mutation in the target gene, as disclosed above, and regenerating the plant from the cells obtained after performance of the method. This method may also include a step of selecting a plant from the plurality of plants regenerated from the plurality of cells, according to the method indicated above.
The invention also relates to a method for designing guides necessary for the introduction of a specific mutation at a specified position of a target gene of a cell of an organism, comprising:
The method is thus a directed iterative enrichment in the random mutations obtained from the non-homologous end joining repair following CRISPR application to obtain the desired mutation using specifically designed guides depending on the intended modification, to direct codon re-writing.
The first guide (primary guide) and the endonuclease of the invention are able to target the native (wild-type) sequence inducing a break at the desired nucleotide(s) to re-write. A reasonably high frequency of small deletions from 1 to 3 nucleotides or insertions of 1 nucleotide will appear in some cells.
Introduced simultaneously or subsequently, a second guide (secondary guide) that recognizes the native sequence having the desired deletion (1 to 3 nucleotides) is delivered to the cells. In the case of the occurrence of the desired deletion, the mutated sequence will be cut again by the endonuclease thanks to the secondary guide. This time taking advantage of the high frequency of insertions, a fraction of cells will insert a new nucleotide.
If a Re-write of 1 nucleotide in a codon is targeted, the codon re-write machinery comprises at least 2 guide RNAs. The codon re-write will be a partial codon-rewrite.
If a Re-write of 2 nucleotides in a codon is targeted, the codon re-write machinery comprises at least three guide RNAs. The codon re-write will be a partial codon-rewrite. If a Re-write of 3 nucleotides in a codon is targeted, the codon re-write machinery comprises at least four guide RNAs. The codon re-write will be a complete codon-rewrite.
Herein are disclosed several steps:
The modification of the targeted codon will induce a modification at the protein level with an amino acid modification within the sequence of the protein encoded by the targeted nucleic acid of interest compared to the wild-type protein. In some embodiments the mutation introduces a STOP codon in the gene sequence.
This amino acid modification in the protein brings a specific trait to the targeted organism.
Traits of interest in plants are for example: yield improvement, improvement of the uptake of nutrients, improvement of the plant architecture, improvement of tolerance to abiotic stress (heat, cold, drought, salinity, osmotic stress . . . ), improvement of organoleptic properties, tolerance to resistance to biotic stress (insects, nematodes, fungi, bacteria, viruses . . . ) tolerance to herbicides/antibiotics, improvement of seed oil content, improvement of seed protein content.
The skilled person knows the PAM specificity of different endonucleases.
Endonucleases of the invention are CRISPR endonucleases generating blunt ends, typically class 2 Type II CRISPR nucleases. Class 2 Type II nucleases of the invention can be SpCas9 as described in WO2014093661 or WO2013176772, SaCas9, CjCas9, St1Cas9, NmCas9, FnCas9, RHA FnCas9 in Wang et al (2020), xCas9 (Hu et al, 2018), Cas9-NG (Nishimasu et al, 2018).
The most common NHEJ repair errors observed at a cutting site are deletions of 1 to 3 nucleotides and insertions of 1 nucleotide.
With the Cas9 endonuclease, the two nuclease domains in Cas9 will each cleave one of the DNA strands 3 bases upstream of the PAM, leaving a blunt end DNA double stranded break (DSB). Deletion of the 4th, of the 4th and 5th, or of the 4th, 5th and 6th nucleotide or insertions of one nucleotide at the 4th position upstream (5′) of the PAM sequence are frequent modifications introduced in the wild type sequence.
The delivery of the codon-rewrite machinery depends on the targeted cells. The person skilled in the art knows how to adapt the delivery method.
The codon-rewrite machinery can be delivered by way of example as expression cassettes in nucleic acid vectors or as ribonucleoprotein complexes.
The total DNA of the cells is extracted, and the targeted region is amplified with adequate primers and sequenced. The targeted region is aligned with the wild type sequence and the introduction of the editing is verified.
The identification of the edited cells can be made by extracting total RNA or total protein.
It is also possible to identify edited cells visually for example if the codon modification introduces a morphological difference or a resistance.
If the edited cells are plant cells, the edited cells can be regenerated into edited plants. The skilled-person knows the regeneration method suitable for each plant.
Before starting the codon re-write method, it is possible to test the targeted region within the targeted nucleic acid sequence of interest with a guide RNA in order to determine the frequencies of deletions and insertions of each potential nucleotide obtained with the cellular DNA repair machinery.
In order to perform this test, the guide RNA is provided with the chosen endonuclease into cells. The cells are maintained in adequate conditions for the editing. The total DNA of the cells is extracted, the targeted region is amplified with adequate primers and the amplicons are sequenced.
The frequencies of deletions and insertions of the four nucleotides at the targeted region can thus be measured.
This step is helpful to optimize the design of the guides towards the most probable insertions or deletions events at the location of the cut.
In some cases, the modification of a codon can be performed through several different scenarios of deletions and insertions. This step can increase the final efficiency by allowing to design of the guides based on the more probable scenario.
A) Identification of the Target Codon.
The chosen target is the Fusarium Head Blight susceptibility gene HRC (Su et al.), a gene that encodes a putative histidine-rich calcium-binding protein from Triticum aestivum (GenBank: MK450306.1). A maize HRC gene (Zea maize cvA188) was identified (SEQ ID NO: 1). The target codon GAT encoding an Aspartic Acid amino acid (Asp) at position 94 in ZmHRC protein (SEQ ID NO: 2) is intended to be replaced by TTT or TTC to encode a Phenylalanine amino acid (Phe) at the same position. The desired replacement is a Asp-94-Phe mutation in ZmHRC.
A primary gRNA targeting Asp94 codon in SEQ ID NO: 1 was designed on the coding strand to introduce a cleavage point after Asp94 codon of the wild type sequence. (
The expected cleavage site by the Cas9 nuclease is situated three bases upstream (5′) of the PAM site (underlined). The cleavage is situated after the T in the GAT codon for Asp94.
B) Genome Editing in Maize Plant Cells by Delivering CRISPR Cas9 and Guide RNA and Regenerating Maize Plants
A Cas9 gRNA guide is designed (SEQ ID NO: 4) to target the cleavage site described in Example 1 A) and the nucleic acid expressing said guide is cloned into high copy number plasmids behind a maize U6 promoter (SEQ ID NO: 5) to form SEQ ID NO: 6.
The Cas9 gene from Streptococcus pyogenes was codon-optimized for Poaceae using standard techniques known in the art (SEQ ID NO: 7) and nuclear localisation signals (NLS) were added at both ends: Simian virus 40 (SV40) monopartite amino terminal NLS encoded by SEQ ID NO: 8 and Xenopus laevis Nucleoplasmin NLS encoded by SEQ ID NO: 9 at the amino- and carboxyl-termini respectively. The optimized sequence of Cas9 was cloned by standard methods under control of the maize ubiquitin promoter and first intron (Christensen et al.) and Agrobacterium tumefaciens NOS terminator (Depicker et al.)
The Cas9 expression cassette and the gRNA cassette(s) were introduced into a binary vector pMRT with reporter and selectable marker cassettes to form pBIOS12094 vector and transformed into Agrobacterium strain Super virulent LBA4404/pSB1 (Komari et al.)
Maize A188 immature embryos were transformed with pBIOS12094 in Agrobacterium strain super virulent LBA4404/pSB1 and plants regenerated according to the procedure described by Ishida et al, 2007.
C) Analysis of the Mutation Profile Obtained with the Primary Guide
To detect the editing, the DNA from the regenerated maize plants is extracted from leave samples and the Asp94 target site is amplified using primers PP_03359_F (SEQ ID NO: 10) and PP_03359_R (SEQ ID NO: 11). Amplicons are sequenced using Next Generation Sequencing (NGS) technology. The number of sequences with indels of at least 1 nucleotide in a 10-nucleotide region centred on the Asp94 target site is assessed. (
The repartition of the indel types in 105 T0 transgenic plants is shown in
The deletions of 3 nucleotides and insertions of 1 nucleotide at the locus of Asp94 needed for the Asp-94-Phe mutation are possible and frequent. Additionally, insertions of T nucleotides are frequent among the insertions of 1 nucleotide.
The replacement of GAT by TTT for the codon of Asp94 can occur at a reasonable frequency. The expected modifications are a deletion of the three nucleotides of the GAT codon and three successive additions of T.
D) Design of the Secondary Guide RNA
Among the various mutations which can be obtained using the primary guide, the deletion of three nucleotides (GAT) is desired (del3). The secondary guide is designed to be specific of the ZmHRC sequence with the deletion of the three nucleotides (GAT) of Asp94 codon. (
A Cas9 gRNA guide (SEQ ID NO: 13) is designed to target ZmHRC_Asp94-del3, the nucleic acid expressing said guide is cloned into high copy number plasmids behind a maize U6 promoter (SEQ ID NO: 5) to form SEQ ID NO: 14.
E) Design of the Tertiary Guide RNA.
Among the various mutations which can be obtained using the secondary guide, the insertion of 1 nucleotide (T) is desired (del3insT). The tertiary guide is designed to be specific of the ZmHRC_Asp94-del3 sequence with insertion of 1 nucleotide (T) in ZmHRC_Asp94-del3. (
A Cas9 gRNA guide (SEQ ID NO: 16) is designed to target ZmHRC_Asp94-del3insT and the nucleic acid expressing said guide is cloned into high copy number plasmids behind a maize U6 promoter (SEQ ID NO: 5) to form SEQ ID NO: 17.
F) Design of the Quaternary Guide RNA.
Among the various mutations which can be obtained using the tertiary guide, the insertion of 1 nucleotide (T) is desired (del3insTT). The quaternary guide is designed to be specific of the ZmHRC_Asp94-del3insTT sequence with insertion of 1 nucleotide (T) in ZmHRC_Asp94-del3insT. (
A Cas9 gRNA guide (SEQ ID NO: 19) is designed to target ZmHRC_Asp94-del3insTT and the nucleic acid expressing said guide is cloned into high copy number plasmids behind a maize U6 promoter (SEQ ID NO: 5) to form SEQ ID NO: 20.
Among the various mutations which can be obtained using the quaternary guide, the insertion of 1 nucleotide (T) will occur, thus generating the desired ZmHRC_Asp-94-Phe codon rewrite. (
G) Codon Rewrite in Maize Plant Cells by Delivering CRISPR Cas9 and Guide RNA and Regenerating Maize Plants
The Cas9 expression cassette and the gRNA cassettes for the primary, secondary, tertiary and quaternary guides were introduced into a binary vector pMRT with reporter and selectable marker cassettes to form pZmHRC_Asp-94-Phe_Cd-RW vector and transformed into Super virulent Agrobacterium strain LBA4404/pSB1 (as in example 1A).
Maize A188 immature embryos are transformed with pZmHRC_Asp-94-Phe_Cd-RW in Agrobacterium strain super virulent LBA4404/pSB1 and plants are regenerated according to the procedure described in example 1B.
H) Analysis of the Mutation Profile Obtained in Maize Plants
To detect the editing, the DNA from the regenerated maize plants, DNA is extracted from leave samples and the ZmHRC_Asp94 target site is amplified using primers PP_03359_F (SEQ ID NO: 10) and PP_03359_R (SEQ ID NO: 11). Amplicons are sequenced using Next Generation Sequencing (NGS) technology. Plants containing the desired Asp-94-Phe codon rewrite are identified and their phenotype evaluated in presence of the Fusarium pathogen.
A) Identification of the Target Codon.
The chosen target is the lipoxygenase 1 gene (Lox1) from Triticum aestivum (GenBank: GQ166692.1) coding for a 9-lipoxigenase. The silencing of this gene renders wheat resistant to Fusarium headblight.
In this experiment to modify the Lox1 gene, the target codon GTC encoding a Valine amino acid (Val) at position 286 in TaLox1 is replaced by GAA or GAG to encode a Glutamic acid amino acid (Glu) at the same position. The desired replacement is a Val-286-Glu mutation in TaLox1.
A functional gRNA targeting TaLox1 has been identified by Wang et al, 2018 on the complementary strand. This guide was chosen to design the primary guide. (
The expected cleavage site by the Cas9 nuclease is situated three bases upstream (5′) of the PAM site after the G in the codon for Val286 of the Lox1 protein as shown by Wang et al, 2018.
According to the codon table, Asp is encoded either by GAA or GAG. The minimum path for the Val-286-Glu mutation involves the replacement of the last 2 nucleotides (TC) of the GTC codon by AA or AG.
B) Genome Editing in Wheat Plant Cells by Delivering CRISPR Cas9 and Guide RNA and Regenerating Wheat Plants
A Cas9 gRNA guide (SEQ ID NO: 22) is designed to target the site described in Example 2 A) and the nucleic acid expressing said guide is cloned into high copy number plasmids behind a wheat U6 promoter (SEQ ID NO: 23) to form SEQ ID NO: 24.
The Cas9 expression cassette and the gRNA cassette(s) were introduced into a binary vector pMRT with reporter and selectable marker cassettes to form pBIOS12093 vector and transformed into Agrobacterium strain EHA105.
Wheat immature embryos are transformed, selected and regenerated as described by Ishida et al, 2015.
C) Analysis of the Mutation Profile Obtained with the Primary Guide
To detect the editing, the DNA from the wheat plants is extracted from leave samples and the Val286 target site is amplified using primers PP_03344_F (SEQ ID NO: 25) and PP_03344_R (SEQ ID NO: 26). Amplicons are sequenced using Next Generation Sequencing (NGS) technology. The number of sequences with indels of at least 1 nucleotide in a 10-nucleotide region centred on Val286 target site is assessed (
The repartition of the indel types in 51 T0 transgenic plants is shown in
The deletions of 2 nucleotides and insertions of 1 nucleotide at the locus of Val286 needed for the Val-286-Glu mutation are therefore possible. Additionally, all 4 nucleotides types are found among the insertions of 1 nucleotide.
The replacement of TC by AA or AG in the GTC codon of Val286 can occur at a reasonable frequency. The replacement AA should occur at a higher frequency than AG.
D) Design of the Secondary Guide RNA.
Among the various mutations which can be obtained using the primary guide, the deletion of two nucleotide (TC) is desired. The secondary guide is designed to be specific of the TaLox1 sequence with the deletion of two nucleotides (TC) of the Val286 codon. (
A Cas9 gRNA guide (SEQ ID NO: 28) is designed to target TaLox1_Val286-del2 and the nucleic acid expressing said guide is cloned into high copy number plasmids behind a wheat U6 promoter (SEQ ID NO: 23) to form SEQ ID NO: 29.
E) Design of the Tertiary Guide RNA.
Among the various mutations which can be obtained using the secondary guide, the insertion of A or G nucleotide is desired. The frequency of A insertions being higher at the target locus (Example 2C) the insertion of A is preferred. The tertiary guide is designed to be specific of the TaLox1_Val286-del2 sequence with the insertion of 1 nucleotide (A) in TaLox1_Val286-del2. (
A Cas9 gRNA guide (SEQ ID NO: 31) is designed to target TaLox1_Val286-del2insA and the nucleic acid expressing said guide is cloned into high copy number plasmids behind a wheat U6 promoter (SEQ ID NO: 23) to form SEQ ID NO: 32.
Among the various mutations which can be obtained using the tertiary guide, the insertion of A will occur generating the desired Val-286-Glu codon rewrite. (
F) Codon Rewrite in Wheat Plant Cells by Delivering CRISPR Cas9 and Guide RNAs and Regenerating Wheat Plants
The Cas9 expression cassette and the gRNA cassettes for the primary, secondary and tertiary guides were introduced into a binary vector with reporter and selectable marker cassettes to form pTaLox1_Val-286-Glu_Cd-RW vector and transformed into Agrobacterium strain EHA105.
Wheat immature embryos are transformed, selected and regenerated as described by Ishida et al, 2015.
G) Analysis of the Mutation Profile Obtained with the Primary Guide
To detect the editing, the DNA from the wheat plants is extracted from leave samples and the Val286 target site is amplified using primers PP_03344_F (SEQ ID NO: 25) and PP_03344_F (SEQ ID NO: 26). Amplicons are sequenced using Next Generation Sequencing (NGS) technology. Plants containing the desired Val-286-Glu codon rewrite are identified and their phenotype evaluated.
A) Identification of the Target Codon.
The chosen target is the herbicide and selectable marker gene acetohydroxyacid synthase (AHAS or ALS). The substitution of a Serine (Ser) by an Asparagine (Asn) at position 621 in the maize ALS protein lead to resistance of maize callus to sulfonylurea herbicides such as chlorosulfuron or imazethapyr (Zhu et al.). Maize has two ALS genes: ALS1 (SEQ ID NO: 33) and ALS2 (SEQ ID NO: 34). Guide RNAs (gRNAs) are designed to introduce the Ser-621-Asn mutation into ZmALS2.
The target codon AGT encoding a Serine amino acid (Ser) at position 621 in ZmALS2 has to be replaced by AAT or AAC to encode an Asparagine amino acid (Asn) at the same position to introduce an herbicide resistance in maize. The desired replacement is a Ser-621-Asn mutation in ZmALS2.
The base replacement to introduce the Ser-621-Asn mutation requires the following minimal change: the deletion of G in AGT and a replacement with A to recreate AAT.
B) Design of a Primary Guide RNA
An optimal Cas9 PAM sequence is identified in the vicinity of the targeted nucleotide ZmALS2 as to introduce a cut directly after the targeted G in the Ser621 codon. (
A Cas9 gRNA guide is designed (SEQ ID NO: 36) and the nucleic acid expressing said guide is cloned into high copy number plasmids behind a maize U6 promoter (SEQ ID NO: 5) to form SEQ ID NO: 37.
C) Design of a Secondary Guide RNA
Among the various mutations which can be obtained using the primary guide, the deletion of one nucleotide (G) is desired. The secondary guide is designed to be specific of the ZmALS2 sequence with the deletion of one nucleotide (G) in ZmALS2_Ser 621. (
A Cas9 gRNA guide (SEQ ID NO: 39) is designed to target ZmALS2_Ser621-del1 and the nucleic acid expressing said guide is cloned into high copy number plasmids behind a maize U6 promoter (SEQ ID NO: 5) to form SEQ ID NO: 40.
The secondary guide will introduce a cleavage at the location of the deleted G to induce new mutations at this site. Some of these mutations will be insertions of nucleotide (A) to form Ser-621-Asn mutation into ZmALS2 (
D) Codon Rewrite in Maize Protoplasts
In one experiment, both the primary gRNA (ID SEQ NO:37) and secondary gRNA (ID SEQ NO:40) plasmids are co-transformed with Cas9 cassette and a reporter cassette into maize A188 protoplasts using a standard PEG-based protocol (Cao et al.)
To detect the editing, the DNA from the maize protoplasts is extracted and the Ser-621-Asn target site is amplified using primers ZmALS_621_for (SEQ ID NO: 41) and ZmALS_621_rev (SEQ ID NO: 42). Amplicons are sequenced using Next Generation Sequencing (NGS) technology. The number of sequences with the desired G to A (Ser-621-Asn) edit is assessed to determine the relative efficiency of codon rewrite.
E) Codon Rewrite in Maize Callus
In a second experiment, the Cas9 expression cassette and both the primary gRNA (ID SEQ NO: 37) and secondary gRNA (ID SEQ NO: 40) were introduced into a binary vector pMRT with reporter and selectable marker cassettes to form pZmALS2_Ser-621-Asn_Cd-RW vector.
pZmALS2_Ser-621-Asn_Cd-RW is introduced into maize Black Mexican Sweet (BMS) cell suspensions by biolistic as described by Kirihara, 1994. Individual chlorosulfuron-resistant calli are isolated and DNA extracted by standard protocol.
The Ser-621-Asn target site is amplified using primers ZmALS_621_for (SEQ ID NO: 41) and ZmALS_621_rev (SEQ ID NO: 42). Amplicons are sequenced to confirm the specific codon rewrite.
F) Codon Rewrite to Obtain Herbicide Tolerance Maize Plants by Targeting the AHAS Gene
In a third experiment, pZmALS2_Ser-621-Asn_Cd-RW is transformed into Agrobacterium strain Super virulent LBA4404/pSB1 (Komari et al.) and used to transform maize A188 immature embryos and plants regenerated according the procedure described by Ishida et al, 2007.
On T0 plants, the Ser-621-Asn target site is amplified using primers ZmALS_621_for (SEQ ID NO: 41) and ZmALS_621_rev (SEQ ID NO: 42). Amplicons are sequenced to confirm the specific codon rewrite and evaluate the frequency. Chlorosulfuron resistance is confirmed on T1 plants as described by Svitashev et al, 2015.
Number | Date | Country | Kind |
---|---|---|---|
20306150.2 | Oct 2020 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2021/077206 | 10/1/2021 | WO |