The disclosure relates to the field of fluorescence activated cell sorting to generate plants. In a preferred embodiment, the disclosure describes FACS enrichment of edited, regenerable protoplasts to generate fertile edited plants.
The Fluorescence Activated Cell Sorter (FACS) was invented in the late 1960s by Bonner, Sweet, Hulett, Herzenberg, and others to do flow cytometry and cell sorting of viable cells. Becton Dickinson Immunocytometry Systems introduced the commercial machines in the early 1970s. Fluorescence-activated cell sorting (FACS) is a specialized type of flow cytometry. It provides a method for sorting a heterogeneous mixture of biological cells into two or more containers, one cell at a time, based upon the specific light scattering and fluorescent characteristics of each cell. It is a useful scientific instrument, as it provides fast, objective and quantitative recording of fluorescent signals from individual cells as well as physical separation of cells of particular interest.
The cell suspension is entrained in the center of a narrow, rapidly flowing stream of liquid. The flow is arranged so that there is a large separation between cells relative to their diameter. A vibrating mechanism causes the stream of cells to break into individual droplets. The system is adjusted so that there is a low probability of more than one cell per droplet. Just before the stream breaks into droplets, the flow passes through a fluorescence measuring station where the fluorescent character of interest of each cell is measured. An electrical charging ring is placed just at the point where the stream breaks into droplets. A charge is placed on the ring based on the immediately prior fluorescence intensity measurement, and the opposite charge is trapped on the droplet as it breaks from the stream. The charged droplets then fall through an electrostatic deflection system that diverts droplets into containers based upon their charge. In some systems, the charge is applied directly to the stream, and the droplet breaking off retains charge of the same sign as the stream. The stream is then returned to neutral after the droplet breaks off.
A wide range of fluorophores can be used as labels in flow cytometry. Fluorophores, or simply “fluors,” are typically attached to an antibody that recognizes a target feature on or in the cell; they may also be attached to a chemical entity with affinity for the cell membrane or another cellular structure. Each fluorophore has a characteristic peak excitation and emission wavelength, and the emission spectra often overlap. Consequently, the combination of labels which can be used depends on the wavelength of the lamp(s) or laser(s) used to excite the fluorochromes and on the detectors available.
Fluorescence-activated cell sorting (FACS) provides a rapid means of isolating large numbers of fluorescently tagged cells from a heterogeneous mixture of cells. Collections of transgenic plants with cell type-specific expression of fluorescent marker genes such as green fluorescent protein (GFP) are ideally suited for FACS-assisted studies of individual cell types.
It has been demonstrated that flow cytometric analysis and fluorescence activated cell sorting (FACS) of plant protoplasts is practicable, moreover, this technique has yielded valuable results in a number of different fields of research (Harkins and Galbraith, 1984; Galbraith et al., 1995; Sheen et al., 1995). For instance, FACS of protoplasts from Arabidopsis plants expressing tissue-specific fluorescent protein markers has been used to examine both basal and environmentally stimulated transcriptional profiles in particular cell types (Birnbaum et al., 2003; Brady et al., 2007; Gifford et al., 2008; Dinneny et al., 2008) and flow cytometry has been employed to analyze reactive oxygen species production and programmed cell death tobacco protoplasts (Nicotiana tabacum; Lin et al., 2006). A broad selection of fluorescent tools is available to study a plethora of physiological parameters in plants, e.g., cis-regulatory elements fused to fluorescent proteins (Haseloff and Siemering, 2006), genetically-encoded molecular sensors (Looger et al., 2005) or dye-based sensors (Haugland, 2002) can be used in combination with cytometry to measure diverse biological processes. However, there are certain inefficiencies with this process due to the sensitivities of the assays and thus there is room for improvement.
A particular embodiment of the disclosure relates to a method for generating a plant from a population of plant cells by isolating a plant protoplast utilizing a polynucleotide of interest by providing a population of plant protoplasts having at least one protoplast comprising a polynucleotide of interest and a fluorescent marker, wherein the population is substantially free of plant protoplasts comprising the fluorescent marker and not comprising the polynucleotide of interest; wherein the plant protoplast is encapsulated by sodium alginate; separating the at least one protoplast comprising the polynucleotide of interest and the fluorescent marker from the remaining plant protoplasts in the population, thereby isolating a plant protoplast comprising the polynucleotide of interest; regenerating a plant from said isolated plant protoplast; and culturing said plant.
In another embodiment of the invention, there can be a plant regenerated by isolating a plant protoplast comprising a polynucleotide of interest integrated into the genome of the plant protoplast by providing a population of plant protoplasts having at least one protoplast comprising a polynucleotide of interest and a fluorescent marker; wherein the plant protoplast is encapsulated by sodium alginate; recovering microcalli from the population of protoplasts comprising the polynucleotide of interest and the fluorescent marker wherein the at least one protoplast comprises the polynucleotide of interest and the fluorescent marker has been transformed with the polynucleotide of interest and a polynucleotide encoding the fluorescent marker; regenerating a plant from said microcalli; and culturing said plant.
Alternative embodiments include methods for producing a transgenic plant, the method can include providing a population of plant protoplasts having at least one protoplast comprising a polynucleotide of interest and a fluorescent marker, wherein the at least one protoplast comprises a site-specific nuclease, such that the polynucleotide of interest is capable of being integrated in the genome of the at least one plant protoplast by homologous recombination at a recognition site of the site-specific nuclease and wherein the plant protoplast is encapsulated by sodium alginate; separating the at least one protoplast comprising the polynucleotide of interest and the fluorescent marker from the remaining plant protoplasts in the population; regenerating the transgenic plant from the at least one protoplast; and culturing said transgenic plant.
The foregoing and other features will become more apparent from the following detailed description of several embodiments, which proceeds with reference to the accompanying figures.
FIGS. 3A-3M′: Shows a sequence alignment of FAD3 gene sequences, generated using A
Transient transformation of protoplasts is a widely utilized tool in plant research that is swift and unproblematic. The technique can be used, for example, to monitor the regulation of promoter elements, to analyze gene expression or enzymatic activity in response to a variety of stimuli, to examine the roles of transcription factors or signal-transduction cascade components or to study the subcellular localization of proteins (Sheen, 2001; Yoo et al., 2007). As opposed to stable transformation of plants (Arabidopsis thaliana being the most commonly used platform), which generally takes months and requires the use of a transfecting agent (usually Agrobacterium tumefaciens), transfection of protoplasts can be achieved in just one day and entails only raw DNA and either a chemical- or electroporation-based transfection method. Additionally, transient transformation analyses can overcome problems encountered with stable over-expression such as pleiotropic developmental effects or nonviability, when a cell-based assay is appropriate. However, due to the fact that protoplast transformation efficiency is never 100%, results can be convoluted by the non-transformed cells.
Transformation efficiencies are often low and variable (e.g. Cummins et al., 2007; <10%) and depend on the employed method as well as properties of the protoplasts and DNA used. The present invention relates to the field of plant biotechnology, but can be used for all biological purposes. In particular, embodiments of the present invention relate to the generation of native or transgenic plant cell lines from a heterogeneous population of plant cells through flow cytometric sorting. Such plant cell line may either be a monocot or a dicot. As will be apparent for a skilled person, the invention also uses the plant cell line for the regeneration of whole fertile plants.
One of the embodiments of the present inventions relates to FACS based sensitivity selection wherein there is better selection to select for successfully transformed cells. It has previously been reported that transformation of a population of plant cells such as a plant suspension culture frequently results in transgenic cultures that heterogeneous and inconsistent expression levels. The present invention is primarily concern with the provision of a plant based system. The ability to isolate and grow single cells has numerous possible applications. For example, methods outlined herein have utility in the improvement of processes related to the productivity of plant cell cultures. However, this application has broad applicability for all cells.
Embodiments of the present invention include the use of flow cytometric sorting such as FACS technology to separate or isolate single, i.e. individualized protoplast that are prepared from a population of plant cells using materials and methods known in the art. These protoplasts can be transformed and are capable of 1) producing a fluorescent marker protein or polypeptide; 2) producing a desired product; and/or 3) surviving in the presence of a selection agent. Sorting criteria for FACS can be selected from the group comprising the genetic background (e.g., ploidy, aneuploidy), mutants, transgenics, gene exchange products, and fluorescence (autofluorescence (chloroplasts, metabolites), fluorescent proteins or enzyme mediation fluorescence). Any fluorescent protein may be used. A selection agent may or may not be used.
After separation or isolation of the single protoplasts by flow cytometric sorting, each single transformed protoplast is regenerated until the formation of a microcolony (microcallus) by co-cultivation. The plant source origin is not limited but is restricted to those lines, varieties and species whose protoplasts have the potential to regenerate until the formation of a microcolony or microcallus. The present invention will thus be applicable to all plant varieties and species for which a regeneration protocol has been established or will be provided. Thus, the present invention can be carried out with all plant varieties and species for which a regeneration portion has been established or will be provided for in the future.
The microcolony itself may be separated or removed from the feeder cell material and cultivated until the formation of a plant cell line.
Embodiments of the present invention can also include the generation of a callus tissue by 1) transferring the microcolony or microcallus to a solid cultivation medium and 2) cultivating the microcolony or microcallus in the presence of at least one selection agent until the formation of a transgenic callus tissue from which a transgenic plant cell line can be established by transferring the callus tissue to a liquid cultivation medium. The microcolony can also be removed or separated from the feeder cell material by mechanical means, i.e., by clone picking. In this case, no selection agent is needed and the cells comprised by the microcolony do not need to display resistance against any selection agent.
In some embodiments the cells can comprise a heterogeneous population of plant cells that are native or non-transgenic cells that, before being subjected to flow cytometric sorting and are stably or transiently transformed with at least one expression vector comprising at least one heterologous nucleic acid sequence which can be operably linked to a functional promoter, wherein said at least one heterologous nucleic acid sequence codes for a desired product. In additional embodiments the at least one heterologous nucleic acid sequence is operably linked to at least one functional promoter wherein the at least one heterologous nucleic acid sequence codes for a fluorescent marker protein or polypeptide and at least one heterologous nucleic acid sequence for resistance to a selection agent or for a desired product. Additional embodiments can include wherein the cells may additionally comprise a heterologous nucleic acid sequence that codes for a desired product to be accumulated in the transgenic plant cell line as provided.
In other embodiments the genome of the host cell can be expressed so that the recombinant protein or peptide can be modified by recombination, for example homologous recombination or heterologous recombination.
Any (transgenic) monoclonal or diclonal plant cell lines established can be treated or cultivated in the presence of precursors, inducers, hormones, stabilizers, inhibitors, RNAi/siRNA molecules, signaling compounds, enzymes and/or elicitors in addition to or instead of the vector suspension, for the production of recombinant proteins or metabolites.
Heterologous nucleic acids may encode genes of bacterial, fungal, plant or non-plant origin such as fusion proteins, and proteins of animal origin. Polypeptides produced may be utilized for producing polypeptides which can be purified therefrom for use elsewhere. Proteins that can be produced in a process of the invention include heterodimers, immunoglobulins, fusions antibodies and single chain antibodies. Furthermore, the above genes may be altered to produce proteins with altered characteristics.
Embodiments of the present invention include the ability to produce a large variety of proteins and polypeptides. These embodiments can also include methods for the production of at least one desired product selected from the group consisting of heterologous proteins or polypeptides, secondary metabolites, and markers. The method comprises to use the plant cell line as established according to the invention in order to produce and accumulate the at least one desired product which is subsequently obtained or isolated from the producing cells or from the cultivation medium.
Additional methods include methods of generating at least an extracellular heterologous protein comprising the steps of 1) stably introducing into a target cell comprised by the starting population of plant cells a first nucleic acid comprising the nucleotide sequence coding for the heterologous protein or desired product; 2) preparing protoplasts form plant suspension cells provided from said plant suspension culture, wherein the protoplasts are additionally transformed and capable of i) producing a fluorescent marker protein or polypeptide and ii) surviving in presence of a selection agent; 3) separating single transformed protoplasts by subjecting the preparation of protoplasts to FACS; 4) regenerating a separated single transformed protoplast until the formation of a microcolony or microcallus by co-cultivation in the presence of feeder cell material; 5) generating callus tissue by i) transferring the microcolony or microcallus to solid cultivation medium and ii) cultivating the microcolony or microcallus in the presence of at least one selection agent until the formation of a transgenic callus tissue; 6) establishing a transgenic plant cell line by transferring the callus tissue to liquid cultivation medium; 7) causing or permitting expression from the nucleic acid of the heterologous protein or desired product by providing appropriate cultivation conditions; and 8) harvesting the accumulated heterologous protein or desired product from the producing cells. Such isolation can be by entirely conventional means and may or may not entail partial or complete purification.
More than one gene may be used in each construct. Multiple vectors, each including one or more nucleotide sequences encoding heterologous protein of choice, may be introduced into the target cells as described herein or elsewhere. This can also be useful for producing multiple subunits of an enzyme.
The fluorescent marker protein or polypeptide can be a protein detectable by fluorescence such as GUS, fluorescent proteins such as GFP or DsRed, luciferase, etc. Preferably the reported is a non-invasive marker such as DsRed or GFP.
The techniques of this invention may be used to select for certain plants to be grown. Selection of a gene of interest may be handled in a number of ways. A large number of techniques are available for inserting DNA into a plant host cell. These techniques include transformation with T-DNA using Agrobacterium tumefaciens or Agrobacterium rhizogenes as transformation agent, fusion, injection, biolistics (microparticle bombardment), silicon carbide whiskers, aerosol beaming, PEG, or electroporation as well as other possible methods. If Agrobacteria are used for the transformation, the DNA to be inserted has to be cloned into special plasmids, namely either into an intermediate vector or into a binary vector. The intermediate vectors can be integrated into the Ti or Ri plasmid by homologous recombination owing to sequences that are homologous to sequences in the T-DNA. The Ti or Ri plasmid also comprises the vir region necessary for the transfer of the T-DNA. Intermediate vectors cannot replicate themselves in Agrobacteria. The intermediate vector can be transferred into Agrobacterium tumefaciens by means of a helper plasmid (conjugation). Binary vectors can replicate themselves both in E. coli and in Agrobacteria. They comprise a selection marker gene and a linker or polylinker which are framed by the right and left T-DNA border regions. They can be transformed directly into Agrobacteria (Holsters, 1978). The Agrobacterium used as host cell is to comprise a plasmid carrying a vir region. The vir region is necessary for the transfer of the T-DNA into the plant cell. Additional T-DNA may be contained. The bacterium so transformed is used for the transformation of plant cells. Plant explants can be cultivated advantageously with Agrobacterium tumefaciens or Agrobacterium rhizogenes for the transfer of the DNA into the plant cell. Whole plants can then be regenerated from the infected plant material (for example, pieces of leaf, segments of stalk, roots, but also protoplasts or suspension-cultivated cells) in a suitable medium, which may contain antibiotics or biocides for selection. The plants so obtained can then be tested for the presence of the inserted DNA. No special demands are made of the plasmids in the case of injection and electroporation. It is possible to use ordinary plasmids, such as, for example, pUC derivatives.
The transformed cells grow inside the plants in the usual manner. They can form germ cells and transmit the transformed trait(s) to progeny plants. Such plants can be grown in the normal manner and crossed with plants that have the same transformed hereditary factors or other hereditary factors. The resulting hybrid individuals have the corresponding phenotypic properties.
In some preferred embodiments of the invention, genes encoding proteins of interest are expressed from transcriptional units inserted into the plant genome. Preferably, said transcriptional units are recombinant vectors capable of stable integration into the plant genome and enable selection of transformed plant lines expressing mRNA encoding the proteins.
Once the inserted DNA has been integrated in the genome, it is relatively stable there (and does not come out again). It normally contains a selection marker that confers on the transformed plant cells resistance to a biocide or an antibiotic, such as kanamycin, G418, bleomycin, hygromycin, or chloramphenicol, inter alia. Plant selectable markers also typically can provide resistance to various herbicides such as glufosinate (e.g., PAT/bar), glyphosate (EPSPS), ALS-inhibitors (e.g., imidazolinone, sulfonylurea, triazolopyrimidine sulfonanilide, et al.), bromoxynil, HPPD-inhibitor resistance, PPO-inhibitors, ACC-ase inhibitors, and many others. The individually employed marker should accordingly permit the selection of transformed cells rather than cells that do not contain the inserted DNA. The gene(s) of interest are preferably expressed either by constitutive or inducible promoters in the plant cell. Once expressed, the mRNA is translated into proteins, thereby incorporating amino acids of interest into protein. The genes encoding a protein expressed in the plant cells can be under the control of a constitutive promoter, a tissue-specific promoter, or an inducible promoter.
Several techniques exist for introducing foreign recombinant vectors into plant cells, and for obtaining plants that stably maintain and express the introduced gene. Such techniques include the introduction of genetic material coated onto microparticles directly into cells (U.S. Pat. No. 4,945,050 to Cornell and U.S. Pat. No. 5,141,131 to DowElanco, now Dow AgroSciences, LLC). In addition, plants may be transformed using Agrobacterium technology, see U.S. Pat. No. 5,177,010 to University of Toledo; U.S. Pat. No. 5,104,310 to Texas A&M; European Patent Application 0131624B1; European Patent Applications 120516, 159418B1 and 176,112 to Schilperoot; U.S. Pat. Nos. 5,149,645, 5,469,976, 5,464,763 and 4,940,838 and 4,693,976 to Schilperoot; European Patent Applications 116718, 290799, 320500, all to Max Planck; European Patent Applications 604662 and 627752, and U.S. Pat. No. 5,591,616, to Japan Tobacco; European Patent Applications 0267159 and 0292435, and U.S. Pat. No. 5,231,019, all to Ciba Geigy, now Syngenta; U.S. Pat. Nos. 5,463,174 and 4,762,785, both to Calgene; and U.S. Pat. Nos. 5,004,863 and 5,159,135, both to Agracetus. Other transformation technology includes whiskers technology. See U.S. Pat. Nos. 5,302,523 and 5,464,765, both to Zeneca, now Syngenta. Other direct DNA delivery transformation technology includes aerosol beam technology. See U.S. Pat. No. 6,809,232. Electroporation technology has also been used to transform plants. See WO 87/06614 to Boyce Thompson Institute; U.S. Pat. Nos. 5,472,869 and 5,384,253, both to Dekalb; and WO 92/09696 and WO 93/21335, both to Plant Genetic Systems. Furthermore, viral vectors can also be used to produce transgenic plants expressing the protein of interest. For example, monocotyledonous plants can be transformed with a viral vector using the methods described in U.S. Pat. No. 5,569,597 to Mycogen Plant Science and Ciba-Geigy (now Syngenta), as well as U.S. Pat. Nos. 5,589,367 and 5,316,931, both to Biosource, now Large Scale Biology.
As mentioned previously, the manner in which the DNA construct is introduced into the plant host is not critical to this invention. Any method that provides for efficient transformation may be employed. For example, various methods for plant cell transformation are described herein and include the use of Ti or R1-plasmids and the like to perform Agrobacterium mediated transformation. In many instances, it will be desirable to have the construct used for transformation bordered on one or both sides by T-DNA borders, more specifically the right border. This is particularly useful when the construct uses Agrobacterium tumefaciens or Agrobacterium rhizogenes as a mode for transformation, although T-DNA borders may find use with other modes of transformation. Where Agrobacterium is used for plant cell transformation, a vector may be used which may be introduced into the host for homologous recombination with T-DNA or the Ti or Ri plasmid present in the host. Introduction of the vector may be performed via electroporation, tri-parental mating and other techniques for transforming gram-negative bacteria which are known to those skilled in the art. The manner of vector transformation into the Agrobacterium host is not critical to this invention. The Ti or Ri plasmid containing the T-DNA for recombination may be capable or incapable of causing gall formation, and is not critical to said invention so long as the vir genes are present in said host.
In some cases where Agrobacterium is used for transformation, the expression construct being within the T-DNA borders will be inserted into a broad spectrum vector such as pRK2 or derivatives thereof as described in Ditta et al. (1980) and EPO 0 120 515. Included within the expression construct and the T-DNA will be one or more markers as described herein which allow for selection of transformed Agrobacterium and transformed plant cells. The particular marker employed is not essential to this invention, with the preferred marker depending on the host and construction used.
For transformation of plant cells using Agrobacterium, explants may be combined and incubated with the transformed Agrobacterium for sufficient time to allow transformation thereof. After transformation, the Agrobacteria are killed by selection with the appropriate antibiotic and plant cells are cultured with the appropriate selective medium. Once calli are formed, shoot formation can be encouraged by employing the appropriate plant hormones according to methods well known in the art of plant tissue culturing and plant regeneration. However, a callus intermediate stage is not always necessary. After shoot formation, said plant cells can be transferred to medium which encourages root formation thereby completing plant regeneration. The plants may then be grown to seed and said seed can be used to establish future generations. Regardless of transformation technique, the gene encoding a bacterial protein is preferably incorporated into a gene transfer vector adapted to express said gene in a plant cell by including in the vector a plant promoter regulatory element, as well as 3′ non-translated transcriptional termination regions such as Nos and the like.
In addition to numerous technologies for transforming plants, the type of tissue that is contacted with the foreign genes may vary as well. Such tissue would include but would not be limited to embryogenic tissue, callus tissue types I, II, and III, hypocotyl, meristem, root tissue, tissues for expression in phloem, and the like. Almost all plant tissues may be transformed during dedifferentiation using appropriate techniques described herein.
As mentioned above, a variety of selectable markers can be used, if desired. Preference for a particular marker is at the discretion of the artisan, but any of the following selectable markers may be used along with any other gene not listed herein which could function as a selectable marker. Such selectable markers include but are not limited to aminoglycoside phosphotransferase gene of transposon Tn5 (Aph II) which encodes resistance to the antibiotics kanamycin, neomycin and G41; hygromycin resistance; methotrexate resistance, as well as those genes which encode for resistance or tolerance to glyphosate; phosphinothricin (bialaphos or glufosinate); ALS-inhibiting herbicides (imidazolinones, sulfonylureas and triazolopyrimidine herbicides), ACC-ase inhibitors (e.g., ayryloxypropionates or cyclohexanediones), and others such as bromoxynil, and HPPD-inhibitors (e.g., mesotrione) and the like.
In addition to a selectable marker, it may be desirous to use a reporter gene. In some instances a reporter gene may be used with or without a selectable marker. Reporter genes are genes that are typically not present in the recipient organism or tissue and typically encode for proteins resulting in some phenotypic change or enzymatic property. Examples of such genes are provided in Weising et al., 1988. Preferred reporter genes include the beta-glucuronidase (GUS) of the uidA locus of E. coli, the chloramphenicol acetyl transferase gene from Tn9 of E. coli, the green fluorescent protein from the bioluminescent jellyfish Aequorea victoria, and the luciferase genes from firefly Photinus pyralis. An assay for detecting reporter gene expression may then be performed at a suitable time after said gene has been introduced into recipient cells. A preferred such assay entails the use of the gene encoding beta-glucuronidase (GUS) of the uidA locus of E. coli as described by Jefferson et al. (1987), to identify transformed cells.
In addition to plant promoter regulatory elements, promoter regulatory elements from a variety of sources can be used efficiently in plant cells to express foreign genes. For example, promoter regulatory elements of bacterial origin, such as the octopine synthase promoter, the nopaline synthase promoter, the mannopine synthase promoter; promoters of viral origin, such as the cauliflower mosaic virus (35S and 19S), 35T (which is a re-engineered 35S promoter, see U.S. Pat. No. 6,166,302, especially Example 7E) and the like may be used. Plant promoter regulatory elements include but are not limited to ribulose-1,6-bisphosphate (RUBP) carboxylase small subunit (ssu), beta-conglycinin promoter, beta-phaseolin promoter, ADH promoter, heat-shock promoters, and tissue specific promoters. Other elements such as matrix attachment regions, scaffold attachment regions, introns, enhancers, polyadenylation sequences and the like may be present and thus may improve the transcription efficiency or DNA integration. Such elements may or may not be necessary for DNA function, although they can provide better expression or functioning of the DNA by affecting transcription, mRNA stability, and the like. Such elements may be included in the DNA as desired to obtain optimal performance of the transformed DNA in the plant. Typical elements include but are not limited to Adh-intron 1, Adh-intron 6, the alfalfa mosaic virus coat protein leader sequence, osmotin UTR sequences, the maize streak virus coat protein leader sequence, as well as others available to a skilled artisan. Constitutive promoter regulatory elements may also be used thereby directing continuous gene expression in all cells types and at all times (e.g., actin, ubiquitin, CaMV 35S, and the like). Tissue specific promoter regulatory elements are responsible for gene expression in specific cell or tissue types, such as the leaves or seeds (e.g., zein, oleosin, napin, ACP, globulin and the like) and these may also be used.
Promoter regulatory elements may also be active (or inactive) during a certain stage of the plant's development as well as active in plant tissues and organs. Examples of such include but are not limited to pollen-specific, embryo-specific, corn-silk-specific, cotton-fiber-specific, root-specific, seed-endosperm-specific, or vegetative phase-specific promoter regulatory elements and the like. Under certain circumstances it may be desirable to use an inducible promoter regulatory element, which is responsible for expression of genes in response to a specific signal, such as: physical stimulus (heat shock genes), light (RUBP carboxylase), hormone (Em), metabolites, chemical (tetracycline responsive), and stress. Other desirable transcription and translation elements that function in plants may be used. Numerous plant-specific gene transfer vectors are known in the art.
Plant RNA viral based systems can also be used to express bacterial protein. In so doing, the gene encoding a protein can be inserted into the coat promoter region of a suitable plant virus which will infect the host plant of interest. The protein can then be expressed thus providing protection of the plant from herbicide damage. Plant RNA viral based systems are described in U.S. Pat. No. 5,500,360 to Mycogen Plant Sciences, Inc. and U.S. Pat. Nos. 5,316,931 and 5,589,367 to Biosource.
Means of further increasing tolerance or resistance levels. It is shown herein that plants of the subject invention can be imparted with novel herbicide resistance traits without observable adverse effects on phenotype including yield. Such plants are within the scope of the subject invention. Plants exemplified and suggested herein can withstand 2×, 3×4× and 5× typical application levels, for example, of at least one subject herbicide. Improvements in these tolerance levels are within the scope of this invention. For example, various techniques are known in the art, and can foreseeably be optimized and further developed, for increasing expression of a given gene.
One such method includes increasing the copy number of the subject genes (in expression cassettes and the like). Transformation events can also be selected for those having multiple copies of the genes.
Strong promoters and enhancers can be used to “supercharge” expression. Examples of such promoters include the preferred 35T promoter which uses 35S enhancers. 35S, maize ubiquitin, Arabidopsis ubiquitin, A.t. actin, and CSMV promoters are included for such uses. Other strong viral promoters are also preferred. Enhancers include 4 OCS and the 35S double enhancer. Matrix attachment regions (MARs) can also be used to increase transformation efficiencies and transgene expression, for example.
Shuffling (directed evolution) and transcription factors can also be used for embodiments according to the subject invention.
Variant proteins can also be designed that differ at the sequence level but that retain the same or similar overall essential three-dimensional structure, surface charge distribution, and the like. See e.g. U.S. Pat. No. 7,058,515; Larson et al., Protein Sci. 2002 11: 2804-2813, “Thoroughly sampling sequence space: Large-scale protein design of structural ensembles”; Crameri et al.,Nature Biotechnology 15, 436-438 (1997), “Molecular evolution of an arsenate detoxification pathway by DNA shuffling”; Stemmer, W. P. C. 1994, DNA shuffling by random fragmentation and reassembly: in vitro recombination for molecular evolution, Proc. Natl. Acad. Sci. USA 91: 10747-10751; Stemmer, W. P. C. 1994, Rapid evolution of a protein in vitro by DNA shuffling, Nature 370: 389-391; Stemmer, W. P. C. 1995, Searching sequence space. Bio/Technology 13: 549-553; Crameri, A., Cwirla, S, and Stemmer, W. P. C. 1996, Construction and evolution of antibody-phage libraries by DNA shuffling, Nature Medicine 2: 100-103; and Crameri, A., Whitehorn, E. A., Tate, E. and Stemmer, W. P. C., 1996, Improved green fluorescent protein by molecular evolution using DNA shuffling, Nature Biotechnology 14: 315-319.
The activity of recombinant polynucleotides inserted into plant cells can be dependent upon the influence of endogenous plant DNA adjacent the insert. Thus, another option is taking advantage of events that are known to be excellent locations in a plant genome for insertions. See e.g. WO 2005/103266 A1, relating to crylF and crylAc cotton events; FAD2, FAD3, wherein genes such as AAD1 or AAD12 or others can be substituted in those genomic loci in place of such inserts. Thus, targeted homologous recombination, for example, can be used according to the subject invention. This type of technology is the subject of for example, WO 03/080809 A2 and the corresponding published U.S. application (USPA 20030232410), relating to the use of zinc fingers for targeted recombination. The use of recombinases (cre-10x and flp-frt for example) is also known in the art.
Computational design of 5′ or 3′ UTR most suitable for synthetic hairpins can also be conducted within the scope of the subject invention. Computer modeling in general, as well as gene shuffling and directed evolution, are discussed elsewhere herein. More specifically regarding computer modeling and UTRs, computer modeling techniques for use in predicting/evaluating 5′ and 3′ UTR derivatives of the present invention include, but are not limited to: MFold version 3.1 available from Genetics Corporation Group, Madison, Wis. (see Zucker et al., Algorithms and Thermodynamics for RNA Secondary Structure Prediction: A Practical Guide. In RNA Biochemistry and Biotechnology, 11-43, J. Barciszewski & B. F. C. Clark, eds., NATO ASI Series, Kluwer Academic Publishers, Dordrecht, N L, (1999); Zucker et al., Expanded Sequence Dependence of Thermodynamic Parameters Improves Prediction of RNA Secondary Structure. J. Mol. Biol. 288, 911-940 (1999); Zucker et al., RNA Secondary Structure Prediction. In Current Protocols in Nucleic Acid Chemistry, S. Beaucage, D. E. Bergstrom, G. D. Glick, and R. A. Jones eds., John Wiley & Sons, New York, 11.2.1-11.2.10, (2000)), COVE (RNA structure analysis using covariance models (stochastic context free grammar methods)) v. 2.4.2 (Eddy & Durbin, Nucl. Acids Res. 1994, 22: 2079-2088) which is freely distributed as source code and which can be downloaded by accessing the website genetics.wust1.edu/eddy/software/, and FOLDALIGN, also freely distributed and available for downloading at the website bioinf.au.dk. FOLDALIGN/ (see Finding the most significant common sequence and structure motifs in a set of RNA sequences. J. Gorodkin, L. J. Heyer and G. D. Stormo. Nucleic Acids Research, Vol. 25, no. 18 pp 3724-3732, 1997; Finding Common Sequence and Structure Motifs in a set of RNA Sequences. J. Gorodkin, L. J. Heyer, and G. D. Stormo. ISMB 5; 120-123, 1997).
Embodiments of the subject invention can be used in conjunction with naturally evolved or chemically induced mutants (mutants can be selected by screening techniques, then transformed with other genes). Plants of the subject invention can be combined with various resistance genes and/or evolved resistance genes. Traditional breeding techniques can also be combined with the subject invention to powerfully combine, introgress, and improve selection of traits.
All references, including publications, patents, and patent applications, cited herein are hereby incorporated by reference to the extent they are not inconsistent with the explicit details of this disclosure, and are so incorporated to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein. The references discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the inventors are not entitled to antedate such disclosure by virtue of prior invention.
The following Examples are provided to illustrate certain particular features and/or aspects. These Examples should not be construed to limit the disclosure to the particular features or aspects described.
BAC Construction
A Bacterial Artificial Chromosome (BAC) library was sourced from a commercial vendor (Amplicon Express, Pullman, Wash.). The BAC library consisted of 110,592 BAC clones containing high molecular weight genomic DNA (gDNA) fragments isolated from Brassica napus L. var. DH10275. The gDNA was digested with either the BamHI or HinDIII restriction enzyme. Isolated gDNA fragments of about 135 Kbp were ligated into the pCC1BAC vector (Epicentre, Madison, Wis.) and transformed into Escherichia coli str. DH10B (Invitrogen). The BAC library was made up of an even number of BAC clones that were constructed using the two different restriction enzymes. As such, the Hind III constructed BAC library consisted of 144 individual 384-well plates. Likewise, the BamHI constructed BAC library consisted of 144 individual 384-well plates. A total of 110,592 BAC clones were isolated and arrayed into 288 individual 384-well plates. Each of the 288 individual 384 well plates were provided by the vendor as a single DNA extraction for rapid PCR based screening. The resulting BAC library covers approximately 15 Gbp of gDNA, which corresponds to a 12-fold genome coverage of Brassica napus L. var. DH10275genome (estimate of the Brassica napus L. genome is ca. 1.132 Gbp as described in Johnston et al. (2005) Annals of Botany 95:229-235).
Sequence Analysis of Fad2 Coding Sequences Isolated From the BAC Library
The constructed BAC library was used to isolate FAD2 gene coding sequences. Sequencing experiments were conducted to identify the specific gene sequences of four FAD2 gene paralogs from Brassica napus L. var. DH10275.
The FAD2 gene sequence was initially identified within the model species Arabidopsis thaliana. The gene sequence is listed in Genbank as Locus Tag: At3g12120. Comparative genomic relationships between the model plant species Arabidopsis thaliana and the diploid Brassica rapa, one of the progenitors of the tetraploid Brassica napus, have been previously described. (Schranz et al. (2006) Trends in Plant Science 11(11):535-542). With specific relation to the FAD2 gene the comparative analysis predicted that 3-4 copies of the gene may occur within the diploid Brassica genome. Additional genetic mapping studies were completed by Scheffler et al. (1997) Theoretical and Applied Genetics 94; 583-591. The results of these genetic mapping studies indicated that four copies of the FAD2 gene were present in Brassica napus.
Sequencing analysis of the BAC library which was constructed from B. napus L. var. DH12075 resulted in the isolation of four BAC sequences (SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4) from which the coding sequences for the FAD2A (SEQ ID NO:5), FAD2-1 (SEQ ID NO:6), FAD2-2 (SEQ ID NO:7), and FAD2-3(SEQ ID NO:8) genes were determined. The FAD2A, FAD2-1, FAD2-2, and FAD2-3 gene sequences were identified and genetically mapped. Sequence analysis of the four FAD2 genes was conducted using a sequence alignment program and a neighbor-joining tree using percentage of identity. The sequence alignment was made via the A
Next, the newly isolated FAD2 sequences from Brassica napus were used to BLAST genomic libraries isolated from a Brassica rapa genomic BAC library and Brassica oleracea shotgun genomic sequence reads. Both, Brassica rapa and Brassica oleracea are diploid progenitors of Brassica napus which is an amphidiploid species (AC genome, n=19). Brassica napus derived from a recent hybridization event between Brassica rapa (A sub-genome, n=10) and Brassica oleracea (C sub-genome, n=9). The diploid progenitor sequences were compared to the four different FAD2 coding sequences isolated from Brassica napus using a BLASTn analysis. This sequence analysis identified specific, annotated gene sequences from Brassica rapa and Brassica oleracea which shared the highest sequence similarity to the newly discovered Brassica napus FAD2 sequences. Table 1 lists the newly identified FAD2 coding sequence and the corresponding progenitor reference sequence accession number and source organism.
B. rapa
B. oleracea
B. rapa
B. oleracea
The FAD2 genes exist in the Brassica napus genome as two copies of each gene per sub-genome. One copy of each gene is located on the A sub-genome, and likewise one copy of each gene is located on the C sub-genome. New naming conventions are described to indicate which sub-genome that each gene is located on. The high levels of sequence similarity between the four different FAD2 coding sequences isolated from the Brassica napus BAC genomic DNA library and the progenitor sequence data suggest that FAD2-3 is a duplicate of the FAD2 sequence from the C sub-genome and could be relabeled as FAD2C; FAD2-1 is a duplicate of the FAD2 sequence from the A sub-genome and could therefore be labeled as FAD2A′; and finally, FAD2-2 is a second copy that was duplicated from the FAD2 sequence of the C sub-genome and could be labeled as FAD2C′.
Sequence Analysis of Fad3 Coding Sequences Isolated From the BAC library
The constructed BAC library was used to isolate FAD3 gene coding sequences. Sequencing experiments were conducted to identify the specific gene sequences of five FAD3 gene paralogs from Brassica napus L. var. DH10275.
The FAD3 gene sequence was initially identified within the model species Arabidopsis thaliana. The gene sequence is listed in Genbank as Locus Tag: At2g29980. Comparative genomic relationships between the model plant species Arabidopsis thaliana and the diploid Brassica rapa, one of the progenitors of the tetraploid Brassica napus, have been previously described. (Schram et al. (2006) Trends in Plant Science 11(11):535-542). With specific relation to the FAD gene the comparative analysis predicted that 3-4 copies of the gene may occur within the diploid Brassica genome. Additional genetic mapping studies were completed by Scheffler et al. (1997) Theoretical and Applied Genetics 94; 583-591. The results of these genetic mapping studies indicated that six copies of the FAD3 gene were present in Brassica napus.
Previous sequencing efforts focused on the FAD3 genes from Brassica napus had identified and genetically mapped both A and C genome specific copies (Hu et al., (2006) Theoretical and Applied Genetics, 113(3): 497-507). A collection of EST sequences from seed specific cDNA libraries had previously been constructed and sequenced from the plant line DH12075 by Andrew Sharpe of Agriculture and Agri-food Canada, 107 Science Place, Saskatoon, Saskatchewan. As a collection of ESTs from the doubled haploid canola plant DH12075 full length gene sequences were not available, moreover the indications of sequence quality and confidence of correctly called nucleotides was also not available. Consequently, sequence variation between different FAD gene sequence reads could not be unequivocally attributed to different gene copies of the various paralogs of the FAD3 gene family, nor was the genomic sequence available. However, when a combined sequence analysis was performed with the ESTs as well as the two FAD3A and FAD3C full length gene sequences described in Hu et al., (2006), ESTs that matched both of the genes were identified along with an additional 3 haplotypes. As a result, a total of six unique haplotypes of FAD3 were identified. Following the assembly of all available data for the various FAD3 haplotypes, high levels of exon sequence divergence in exon 1 was identified. The divergence of the FAD3 sequence in exon 1 was identified as an opportunity which could be utilized for the design of gene/allele specific PCR primers. In addition, exons were identified that were either minimally differentiated between haplotypes (e.g., exons 5, 6, 7 and 8 had 1-3 bp that varied between FAD3A and FAD3C) or that were devoid of sequence variation (e.g., exons 2 and 3).
Sequencing analysis of the BAC library which was constructed from B. napus L. var. DH12075 resulted in the isolation of six BAC sequences (SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, and SEQ ID NO:14) from which the coding sequences for the FAD3A (SEQ ID NO:15), FAD3A′ (SEQ ID NO:16), FAD3A″ (SEQ ID NO:17), FAD3C (SEQ ID NO:18), FAD3C″ (SEQ ID NO:19), and FAD3C′ (SEQ ID NO:20) genes were determined. The FAD3A, FAD3A′, FAD3A″, FAD3C, FAD3C″, and FAD3C′ gene sequences were identified and genetically mapped.
Sequence analysis of the six FAD3 genes was conducted using a sequence alignment program and a neighbor-joining tree using percentage of identity. The sequence alignment was made via the A
PCR Based Screening
A cohort of PCR primers were design to screen the aforementioned BAC library. The primers were designed as either universal primers, which would amplify all members of the gene family, or as gene specific primers for targeted allele amplification. The PCR primers were designed to be 20 bp long (+/−1 bp) and contain a G/C content of 50% (+/−8%). Table 2 and Table 3 lists the primers which were designed and synthesized. The clones of the BAC library were pooled and screened via the Polymerase Chain Reaction (PCR).
A Two different sets of conditions were used for the polymerase chain reactions (PCR). The first series of PCR reactions contained: 1×PCR buffer (containing dNTPs); 1.5 mM MgCl2; 200 μM of 0.25 U I
Following PCR based screening and conformational Sanger sequencing, a collection of plates were identified that contained the various different FAD2 and FAD3 gene family members. A total of four unique FAD2 and FAD3 paralogous gene sequences were identified (Table 4 and Table 5). A total of two plates per each FAD2 and FAD3 paralogous gene sequence were chosen to undergo plate screening to identify the specific well and clone within the plate that contained the FAD2 and FAD3 gene (Table 4 and Table 5). The specific wells were identified for both of the plates and an individual clone was selected for each of the FAD2 and FAD3 gene family members.
The single BAC clone, for each identified FAD gene family member, was further analysed via sequencing. The DNA was isolated for the BAC clone and was prepared for sequencing using a L
After the full genomic sequence of all four FAD2 and six FAD3 genes had been identified and fully characterized, zinc finger nucleases were designed to bind to the sequences for each specific gene family member.
Zinc finger proteins directed against DNA sequences encoding various functional sequences of the FAD2 gene locus were designed as previously described. See, e.g., Urnov et al. (2005) Nature 435:646-651. Exemplary target sequence and recognition helices are shown in Table 6 and Table 8 (recognition helix regions designs) and Table 7 and Table 9 (target sites). In Table 8 and Table 9, nucleotides in the target site that are contacted by the ZFP recognition helices are indicated in uppercase letters; non-contacted nucleotides indicated in lowercase. Zinc Finger Nuclease (ZFN) target sites were designed to bind five target sites of FAD2A, and seven target sites of FAD3. The FAD2 and FAD3 zinc finger designs were incorporated into zinc finger expression vectors encoding a protein having at least one finger with a CCHC structure. See, U.S. Patent Publication No. 2008/0182332. In particular, the last finger in each protein had a CCHC backbone for the recognition helix. The non-canonical zinc finger-encoding sequences were fused to the nuclease domain of the type IIS restriction enzyme FokI (amino acids 384-579 of the sequence of Wah et al., (1998) Proc. Natl. Acad. Sci. USA 95:10564-10569) via a four amino acid ZC linker and an opaque-2 nuclear localization signal derived from Zea mays to form FAD2A zinc-finger nucleases (ZFNs). Expression of the fusion proteins was driven by a relatively strong constitutive promoter such as a promoter derived from the Cassaya Vein Mosaic Virus (CsVMV) promoter and flanked by the Agrobacterium tumefaciens ORF23 3′ UnTranslated Region (AtuORF23 3′UTR v1). The self-hydrolyzing 2A encoding nucleotide sequence from Thosea asigna virus (Szymczak et al., 2004) was added between the two Zinc Finger Nuclease fusion proteins that were cloned into the construct. Exemplary vectors are described below.
The optimal zinc fingers were verified for cleavage activity using a budding yeast based system previously shown to identify active nucleases. See, e.g., U.S. Patent Publication No. 20090111119; Doyon et al. (2008) Nat. Biotechnol. 26:702-708; Geurts et al. (2009) Science 325:433. Zinc fingers for the various functional domains were selected for in-vivo use. Of the numerous ZFNs that were designed, produced and tested to bind to the putative FAD genomic polynucleotide target sites, a ZFNs were identified as having in vivo activity at high levels, and selected for further experimentation. These ZFNs were characterized as being capable of efficiently binding and cleaving the unique FAD2 genomic polynucleotide target sites in planta.
Construct Assembly
Plasmid vectors containing ZFN expression constructs of the exemplary zinc finger nucleases, which were identified using the yeast assay, as described in Example 2, were designed and completed using skills and techniques commonly known in the art. Each zinc finger-encoding sequence was fused to a sequence encoding an opaque-2 nuclear localization signal (Maddaloni et al. (1989) Nuc. Acids Res. 17(18):7532), that was positioned upstream of the zinc finger nuclease.
Next, the opaque-2 nuclear localization signal::zinc finger nuclease fusion sequence was paired with the complementary opaque-2 nuclear localization signal::zinc finger nuclease fusion sequence. As such, each construct consisted of a single open reading frame comprised of two opaque-2 nuclear localization signal::zinc finger nuclease fusion sequences separated by the 2A sequence from Thosea asigna virus (Mattion et al. (1996) J. Virol. 70:8124-8127). Expression of the fusion proteins was driven by a relatively strong constitutive promoter such as a promoter derived from the Cassaya Vein Mosaic Virus (CsVMV) promoter and flanked by the Agrobacterium tumefaciens ORF23 3′ UnTranslated Region (AtuORF23 3′UTR).
The vectors were assembled using the IN-FUSION™ Advantage Technology (Clontech, Mountain View, Calif.). Restriction endonucleases were obtained from New England BioLabs (NEB; Ipswich, Mass.) and T4 DNA Ligase (Invitrogen) was used for DNA ligation. Plasmid preparations were performed using NUCLEOSPIN® Plasmid Kit (Macherey-Nagel Inc., Bethlehem, Pa.) or the Plasmid Midi Kit (Qiagen) following the instructions of the suppliers. DNA fragments were isolated using QIAquick Gel Extraction Kit™ (Qiagen) after agarose Tris-acetate gel electrophoresis. Colonies of all assembled plasmids were initially screened by restriction digestion of miniprep DNA. Plasmid DNA of selected clones was sequenced by a commercial sequencing vendor (Eurofins MWG Operon, Huntsville, Ala.). Sequence data were assembled and analyzed using the SEQUENCHER™ software (Gene Codes Corp., Ann Arbor, Mich.). Before delivery to B. napus protoplasts, Plasmid DNA was prepared from cultures of E. coli using the Pure Yield P
The resulting eleven plasmid constructs; pDAB104008 (containing the ZFN24845 and ZFN24844 construct), pDAB104009 (containing the ZFN24820 and ZFN24821 construct), pDAB104010 (containing the ZFN24828 and ZFN24829 construct) (
The resulting plasmid constructs; pDAB107824 (ZFNs 28025-2A-28026), pDAB107815 (ZFNs 27961-2A-27962), pDAB107816 (ZFNs 27969-2A-27970), pDAB107817 (ZFNs 27973-2A-27974), pDAB107825 (ZFNs 28035-2A-28036), pDAB107826 (ZFNs 28039-2A-28040), pDAB107818 (ZFNs 27987-2A-27988), pDAB107827 (ZFNs 28051-2A-28052), pDAB 107821 (ZFNs 28004-2A-28005), pDAB 107819 (ZFNs 27989-2A-27990), pDAB 107828 (ZFNs 28053-2A-28054), pDAB107829 (ZFNs 28055-2A-28056), pDAB107820 (ZFNs 27991-2A-27992), pDAB107822 (ZFNs 28021-2A-28022) and pDAB107823 (ZFNs 28023-2A-28024) were confirmed via restriction enzyme digestion and via DNA sequencing.
Preparation of DNA for Transfection
Plasmid DNA of the above described vectors was sterilized by precipitation and washing in 100% (v/v) ethanol and dried in a laminar flow hood. The DNA pellet was suspended in 30 μL of sterile double-distilled water at a final concentration of 0.7 μg/μl for transfection into protoplast cells as described below. The preparation of the plasmid DNA was undertaken to result in supercoiled plasmid DNA for transient transfection and linearized plasmid DNA for stable transfection. The addition of carrier DNA (e.g. fish-sperm DNA) to the transforming plasmid was not required for the transient transfection of protoplast cells. For transient studies about 30 μg of plasmid DNA per 106 protoplasts was used per transformation.
Transfection
Transfection of Brassica napus L. var. DH10275 was completed as described in Spangenberg et al., (1986) Plant Physiology 66: 1-8, the media formulations are described in Spangenberg G. and Protrykus I. (1995) Polyethylene Glycol-Mediated Direct Gene Transfer in Tobacco Protoplasts. In: Gene Transfer to Plants. (Protrykus I. and Spangenberg G. Eds.) Springer-Verlag, Berlin. Brassica napus seeds were surface sterilized in 70% ethanol. The seeds were immersed in 12 mL of the 70% ethanol solution and mixed by gently rocking the cocktail for 10 minutes. The 70% ethanol solution was removed by decanting the solution and exchanged with a seed sterilization solution consisting of 1% w/v calcium hypochlorite and 0.1% v/v Tween-20. The seeds were immersed in the seed sterilization solution and mixed by gently rocking the cocktail for 25 minutes. The seed sterilization solution was decanted and the sterilized seeds were rinsed three times in 50 mL of sterile water. Finally, the seeds were transferred to a sterile 80 mm W
A volume of liquid PS medium (about 10 mL) was decanted into a sterile Petri dish. Using sterile forceps and a scalpel, an aerial portion of the four to five day old seedling in the 4-leaf stage of growth and development, was removed and discarded. Hypocotyl segments in lengths of 20-40 mm were determined to produce the highest population of small, cytoplasmic-rich protoplasts. The hypocotyl segments were aseptically excised and transferred to liquid PS medium. The excised hypocotyl segments were grouped together and cut transversely into 5-10 mm segments. Next, the hypocotyl segments were transferred to fresh PS medium and incubated at room temperature for 1 hour. The plasmolyzed hypocotyls were transferred to a Petri dish containing enzyme solution. Care was taken to immerse all of the hypocotyl segments into the solution. The Petri dishes were sealed with P
Protoplast cells were released from the hypocotyl segments. The overnight hypocotyl digests were gently agitated to release protoplasts into the enzyme solution. The Petri dish was angled slightly to aid the transfer of the digesting suspension which consisted of enzyme solution and plant debris. Using a 10 mL pipette the digesting suspension was transferred to a sterilized protoplast filtration (a filter of 100 micron mesh) unit to further separate the protoplasts from the plant debris. The filtration unit was tapped gently to release the excess liquid that had been caught in the sieve. The protoplast suspension, about 8 to 9 mL, was gently mixed and distributed into 14 mL sterile plastic round-bottomed centrifuge tubes. Each suspension was overlaid with 1.5 mL of W5 solution. The W5 solution was carefully dispensed over the protoplast suspension at an angle and dispensed drop-by-drop with minimal agitation. The addition of the W5 solution to the protoplast suspension resulted in the production of a protoplast rich interface. This interface was collected using a pipette. Next, the collected protoplasts were transferred into a new 14 mL centrifuge tube, and gently mixed. The yield or obtained protoplasts were determined using a hemocytometer to determine the number of protoplasts per milliliter. The method was repeated, wherein leaf tissue was digested to produce mesophyll protoplasts.
Next, W5 solution was added to a volume of 10 mL and the protoplasts were pelleted at 70 g, before removing the W5 solution. The remaining protoplast suspension was resuspended by gentle shaking. Each tube containing the protoplast suspension was filled with 5 mL of W5 solution and incubated at room temperature from one to four hours. The protoplast suspensions were pelleted at 70 g, and all of the W5 solution was removed. Next, 300 μL of transformation buffer was added to each of the pelleted protoplast suspensions which contained the isolated protoplasts. To each of the tubes, 10 μg of plasmid DNA was added to the protoplast suspensions. The plasmid DNA consisted of the Zinc Finger Nuclease constructs described above (e.g., pDAB104010). Next, 300 μL of pre-warmed PEG 4000 solution was added to the protoplast suspension and the tubes were gently tapped. The protoplast suspensions and transformation mixture was allowed to incubate at room temperature for fifteen minutes without any agitation. An additional 10 mL of W5 solution was added to each tube in sequential aliquots of 1 mL, 1 mL, 1 mL, 2 mL, 2 mL, and 3 mL with gentle inversion of the tubes between each addition of W5 solution. The protoplasts were pelleted by spinning in a centrifuge at 70 g. All of the W5 solution was removed leaving a pure protoplast suspension.
Next, 0.5 mL of K3 medium was added to the pelleted protoplast cells and the cells were resuspended. The resuspended protoplast cells were placed in the center of a Petri dish and 5 mL of K3 and 0.6 mL Sea Plaque™ agarose (Cambrex, East Rutherford, N.J.) in a 1:1 concentration. The Petri dishes were shaken in a single gentle swirling motion and left to incubate for 20-30 minutes at room temperature. The Petri dishes were sealed with P
Genomic DNA Isolation from Canola Protoplasts
Transfected protoplasts were supplied in individual 1.5 or 2.0 mL microfuge tubes. The cells were pelleted at the base of the tube in a buffer solution. DNA extraction was carried out by snap freezing the cells in liquid nitrogen followed by freeze drying the cells, for about 48 hours in A L
Testing of Fad2a and Fad3 ZFNs for Genomic DNA Sequence Cleavage in Canola Protoplasts
The design of the ZFN target sites for the FAD2A and FAD3 gene loci were clustered, so that multiple pairs of ZFN were design to overlap the target sites. The clustering of ZFN target sites enabled PCR primers to be designed that would amplify the surrounding genomic sequence from all FAD2A and FAD3 gene family members within a 100 bp window as to encapsulate all of the overlapping ZFN target sites. As such, the Illumina short read sequence technology could be used to assess the integrity of the target ZFN site of the transfected protoplasts. In addition, the PCR primers designed needed to include specific nucleotide bases that would attribute sequence reads to the specific gene member of the FAD2A and FAD3 family. Therefore, all of the PCR primers would be required to bind 5-10 nucleotides away from any ZFN target cut site as non-homologous end joining (NHEJ) activity is known to cause small deletions that could remove a priming site, inhibit amplification and therefore distort the assessment of NHEJ activity.
Primers were designed to bind to all of the ZFN target loci for the FAD2A and FAD3 gene families (Table 11) and were empirically tested for amplification of all gene family members through Sanger based sequencing of PCR amplification products. In several instances primers could not be developed that would distinguish all gene family members (Table 12 and Table 13), however in all instances the target gene sequences of FAD2A and FAD3, could be distinguished. Following PCR primer design custom DNA barcode sequences were incorporated into the PCR primers that were used to distinguish the different ZFN target loci and identify specific sequence reads to a transfection and ZFN (Tables 11, 12 and 13).
Following DNA extraction of canola protoplasts transfected with the ZFN, PCR amplification of the target ZFN loci was performed to generate the requisite loci specific DNA molecules in the correct format for Illumina based sequencing by synthesis technology. Each assay was optimised to work on 25 ng starting DNA (about 12,500 cell equivalents of the Brassica napus genome). Multiple reactions were performed, per sample to provide the coverage required to assess NHEJ efficiency and specificity at the appropriate level, about sixteen PCR reactions equivalent to 200,000 copies of the Brassica napus genome taken from individual protoplasts. PCR amplification master-mixes were made for all samples to be tested with the same assay and one reaction, performed in triplicate, was assayed using a quantitative PCR method that was used to determine the optimal number of cycles to perform on the target tissue, to ensure that PCR amplification had not become reagent limited and was still in an exponential amplification stage. The experimentation with the necessary negative control reactions, was performed in 96 well format using a MX3000P
Method of Data Analysis for Detection of Non-Homologous End Joining at Target Zinc Finger Sites
Following completion of the sequencing reaction and primary data calling performed using the Illumina bioinformatic pipeline for base calling, full analysis was performed to identify deleted bases at the target ZFN site in each instance. A custom PERL script was designed to extract and sort barcodes from DNA sequences computationally following a list of input sequences. The barcode had to match the reference sequence at a Phred score of greater than 30 to be accepted, to reduce misattributing sequence reads. After the sequence reads had been binned into the different barcode groups that had been used, a quality filter was passed across all sequences. The quality filter was a second custom developed PERL script. Sequence reads were excluded if there were more than three bases called as “N,” or if the median Phred score was less than 20, or if there were 3 consecutive bases with a Phred score of less than 20, or if the sequence read was shorter than 40 bp in length. The remaining sequences were merged where both of the paired sequence reads were available using the N
Using the gapped FASTA file a conversion of the gapped base position number to the input reference was performed using a fourth custom PERL script. This enabled bases that discriminate the different gene family members (either homoeologous or paralogous sequence variation between the different gene family members) to be identified in the assembled data. Once the conversion of base numbering had been performed it was possible to generate haplotype reports for each unique sequence reads and assign the reads to specific gene family members. Once the reads had been grouped by gene a 10 bp window was identified and assessed that surrounded the ZFN target site. The number of sequences with deletions was recorded per gene along with the number of missing bases.
The data was then graphically displayed as a multiple line graph, with the number of sequences with 1 through 10 bases deleted at the target ZFN site per 10,000 sequence reads (
From these results highest level of ZFN activity at a FAD2 target site, as determined by the greater activity of NHEJ, was identified at locus E. The ZFNs which were encoded on plasmid pDAB104010 (i.e., ZFN24828 and 24829) were selected for in planta targeting of an Engineered Transgene Integration Platform (ETIP) given its characteristics of significant genomic DNA cleavage activity and minimal non-target activity.
The plasmid vector constructs described below were built using methods and techniques commonly known by one with skill in the art. The application of specific reagents and techniques described within this paragraph are readily known by those with skill in the art, and could be readily interchanged with other reagents and techniques to achieve the desired purpose of building plasmid vector constructs. The restriction endonucleases were obtained from New England BioLabs (NEB; Ipswich, Mass.). Ligations were completed with T4 DNA Ligase (Invitrogen, Carlsbad, Calif.). Gateway reactions were performed using GATEWAY® LR CLONASE® enzyme mix (Invitrogen) for assembling one entry vector into a single destination vector. IN-FUSION™ reactions were performed using IN-FUSION™ Advantage Technology (Clontech, Mountain View, Calif.) for assembling one entry vector into a single destination vector Plasmid preparations were performed using NUCLEOSPIN® Plasmid Kit (Macherey-Nagel Inc., Bethlehem, Pa.) or the Plasmid Midi Kit® (Qiagen) following the instructions of the suppliers. DNA fragments were isolated using QIAquick Gel Extraction Kit™ (Qiagen) after agarose Tris-acetate gel electrophoresis. Colonies of all assembled plasmids were initially screened by restriction digestion of miniprep DNA. Plasmid DNA of selected clones was sequenced by a commercial sequencing vendor (Eurofins MWG Operon, Huntsville, Ala.). Sequence data were assembled and analyzed using the SEQUENCHER™ software (Gene Codes Corp., Ann Arbor, Mich.).
Direct-Delivery Vectors for Precision Integration of ETIP in the Fad2A Locus of Canola
Standard cloning methods were used in the construction of the ETIP-containing vectors pDAS000130 (
The ETIP sequence was synthesized by a commercial gene synthesis vendor (GeneArt, Life Technologies). The 1 kb segments of FAD2A genome sequence were amplified from genomic DNA purified from leaf tissue of B. napus DH 12075 using a Qiagen DN
Direct-Delivery Vectors for Precision Integration of ETIP in the Fad 3 Locus of Canola
Standard cloning methods were used in the construction of the ETIP-containing vectors pDAS000271 (
The ETIP sequence was synthesized by a commercial gene synthesis vendor (GeneArt, Life Technologies). The 1 kb segments of FAD3A and FAD3C genome sequence were amplified from genomic DNA purified from leaf tissue of B. napus DH12075 using a Qiagen DN
Control Vectors
A control vector was used to develop a Fluorescence Activated Cell Sorting (FACS) cell based sorting method. Standard cloning methods were used in the construction of a control vector, pDAS000031 (
Construction of Binary Vectors for Random Integration of ETIP in Canola
Two binary vectors were constructed for random integration of an ETIP T-Strand sequence within the genome of Brassica napus. Standard cloning methods were used in the construction of the ETIP-containing vectors pDAS000036 (
The expression cassettes and ELP were synthesized with Multi-Gateway sites by a commercial gene synthesis vendor (GeneArt, Life Technologies). Entry clones were constructed of each expression cassette and ELP using BP clonase II enzyme mix™ (Invitrogen, Life Technologies) and the pDONR221 vector suite™ (Invitrogen, Life Technologies). The Entry clones were then used in a Multi-Gateway reaction with a Gateway-enabled binary vector using LR Clonase II Plus Enzyme mix™ (Invitrogen, Life Technologies). Colonies of all assembled plasmids were initially screened by restriction digestion of miniprep DNA. Restriction endonucleases were obtained from New England BioLabs (NEB; Ipswich, Mass.) and Promega (Promega Corporation, WI). Plasmid preparations were performed using the QIAprep Spin Miniprep Kit™ (Qiagen, Hilden) or the Pure Yield Plasmid Maxiprep System™ (Promega Corporation, WI) following the instructions of the suppliers. Plasmid DNA of selected clones was sequenced using ABI Sanger Sequencing and Big Dye Terminator v3.1 cycle sequencing protocol™ (Applied Biosystems, Life Technologies). Sequence data were assembled and analyzed using the SEQUENCHER™ software (Gene Codes Corporation, Ann Arbor, Mich.).
Transformation of Brassica napus
The ETIP constructs (pDAS000036, pDAS000037), the DS-Red control construct (pDAS000031), and the FAD2A, FAD3A, and FAD3C site specific constructs (pDAS000130, and pDAS000271-pDAS000275) and accompanying Zinc Finger Nuclease (pDAB104010, pDAB10728, and pDAB10729) described in Example 4. The binary vectors were transformed into Agrobacterium tumefaciens strain GV3101. PM90. Transformation of Brassica napus protoplast cells was completed using the transfection protocol described in Example 3 with some modification.
The modifications to the protocol included the use of sodium alginate instead of Sea Plaque™ agarose. The transfection experiments in which both the Zinc Finger Nuclease construct and the ETIP construct were co-delivered into Brassica napus protoplast cells were completed at DNA concentrations comprising a 5:1 molar ratio of plasmid DNA. The other ETIP and control plasmid constructs were transformed at concentrations of 30 μg of plasmid DNA. As such, pDAS000130 consisted of a concentration of 27.8 μg of plasmid DNA and pDAB 104010 consisted of a concentration of 2.2 μg of plasmid DNA. The other ETIP and control plasmid constructs were transformed at concentrations of 30 μg of plasmid DNA.
Additional modifications to the protocol included the propagation of whole plants from the transformed protoplast cells in medium containing 1.5 mg/mL of hygromycin. The propagation of whole plants required that the A medium was replaced every two weeks and the growth of the protoplast-derived colonies was monitored. After the protoplast-derived colonies had grown to approximately 2-3 mm in diameter, the colonies were transferred into individual wells of a 12-well C
Genomic DNA was extracted from leaf tissue of all putative transgenic plants using a DN
A second screen was completed, where gDNA from each transgenic plant was analyzed by PCR using five sets of primers designed to amplify the binary vector outside of the T-DNA region [(1F SEQ ID NO:156 ATGTCCACTGGGTTCGTGCC; 1R SEQ ID NO:157 GAAGGGAACTTATCCGGTCC) (2F SEQ ID NO:158 TGCGCTGCCATTCTCCAAAT; 2R SE ID NO:159 ACCGAGCTCGAATTCAATTC) (3F SEQ ID NO:160 CCTGCATTCGGTTAAACACC; 3R SEQ ID NO:161 CCATCTGGCTTCTGCCTTGC) (4F SEQ ID NO:162 ATTCCGATCCCCAGGGCAGT; 4R SEQ ID NO:163 GCCAACGTTGCAGCCTTGCT) (5F SEQ ID NO:164 GCCCTGGGATGTTGTTAAGT; 5R SEQ ID NO:165 GTAACTTAGGACTTGTGCGA)]. Plants from which PCR products of the correct and expected size were amplified with primer sets 3 and 4 were considered to have backbone integration.
DNA from plants with no backbone integration was purified from 20 g of leaf tissue using a modified CTAB method (Maguire et al., (1994) Plant Molecular Biology Reporter, 12(2): 106-109). The isolated gDNA was digested with several restriction enzymes and 10 μg of gDNA was separated by electrophoresis on an agarose gel and transferred to membrane using a standard Southern blotting protocol. Membranes were probed using the DIG Easy Hyb System™ (Roche, South San Francisco, Calif.) following the manufacturer's instructions. Probes to each expression cassette to the ELP and to an endogenous control gene, actin, were amplified from the ETIP construct using the following primers: (IPT-F SEQ ID NO:166 TCTCTACCTTGATGATCGG; IPT-R SEQ ID NO:167 AACATCTGCTTAACTCTGGC; dsRED-F SEQ ID NO:168 ATGGCTTCATCTGAGAACG; dsRED-R SEQ ID NO:169 TTCCGTATTGGAATTGAGG; PAT-F SEQ ID NO:170 TTGCTTAAGTCTATGGAGGCG; PAT-R SEQ ID NO:171 TGGGTAACTGGCCTAACTGG; ELP-F SEQ ID NO:172 ATGATATGTAGACATAGTGGG; ELP-R SEQ ID NO:173 AGGGTGTAAGGTACTAGCC; Hph-F SEQ ID NO:174 TGTTGGTGGAAGAGGATACG; Hph-R SEQ ID NO:175 ATCAGCAGCAGCGATAGC; actin-F SEQ ID NO:176 GTGGAGAAGAACTACGAGCTACCC; actin-R SEQ ID NO:177 GACTCATCGTACTCTCCCTTCG).
The ETIP sequence was amplified and sequenced from all plants containing only a single copy of the ETIP. The sequence of each T-DNA insert was analyzed by direct sequencing of PCR products using the AB13730xI™ (Applied Biosystems, Life Technologies). The T-DNA insert was amplified from genomic DNA, using Phusion Hot Start II Polymerase™ (Finnzymes, Thermo Fisher Scientific). The amplification reactions of the T-DNA were completed with multiple primer pairs to amplify overlapping sequences of approximately 2 Kbp in length. Each PCR product was sequenced with multiple primers to ensure complete coverage. The PCR reactions were treated with shrimp alkaline phosphatase and exonuclease I (Applied Biosystems, Life Technologies) to inactivate excess primer prior to the sequencing PCR reaction. The sequences flanking the T-DNA insert of each single copy ETIP line were identified by digestion of purified genomic DNA with eight restriction endonucleases followed by ligation of double-stranded adapters specific for the overhangs created by the restriction endonucleases. Following this ligation step a PCR was performed with a biotinylated primer to either the 3′ or 5′ end of the ETIP and a primer to each adapter. The PCR products were captures and cleaned on Ampure Solid Phase Reversible Immobilization (SPRI) beads™ (Agencourt Bioscience Corporation, Beckman Coulter Company). A nested PCR was performed and all products were sequenced using ABI Sanger Sequencing and Big Dye Terminator v3.1 cycle™ sequencing protocol (Applied Biosystems, Life Technologies). Sequence data were assembled and analyzed using the SEQUENCHER™ software (Gene Codes Corp., Ann Arbor, Mich.).
Southern Blot Analysis
Specific restriction enzymes were selected to digest gDNA samples prior to Southern probing. The putative transgenic plants were analyzed by digesting the genomic DNA with EcoRI and SwaI. Next, the digested gDNA and uncut gDNA samples were probed with either polynucleotide fragments comprising PATv6, IPT or ELP gene elements as these polynucleotide probe fragments enabled differentiation of multiple inserts in EcoRI digests as well as in the SwaI digests. Identified single copy transgenic plant lines were then further analyzed with all six probes to identify the presence of all essential elements of the inserted vector.
Accordingly, 67 independent events transformed with ETIP-pDAS000036 were sampled and tested for the presence of the transgene (hph), and the presence of vector backbone. Of the 67 plants tested, 47 were found to have the transgene integrated within the genome. From the 47 transgenic plants, 17 of the plants were found to contain vector backbone (Table 14). The remaining 30 plants that contained no significant portion of vector backbone (absence of Ori or SpecR) were sampled for Southern analysis. As a general rule, the plants were screened initially with the IPT probe, and plant lines identified as putative single copy lines were further tested with probes comprising the dsRED, PAT, ELP and hph gene elements in order to confirm the presence of the whole cassette.
Likewise, 52 independent events transformed with ETIP-pDAS000037 and surviving in soil were sampled and tested for the presence of the transgene (hph), and the presence of vector backbone. Of the 52 plants tested, 48 were found to have the transgene integrated within the genome. From the 48 transgenic plants, 23 of the plants were found to contain vector backbone as well and 3 plants were not tested (Table 14). The remaining 22 plants that contained no significant portion of vector backbone (absence of Ori or SpecR) were sampled for Southern analysis. These transgenic plants were initially screened with the IPT probe, and the plant lines were identified as putative single copy lines, and were further tested with the dsRED, PAT, ELP, hph and actin probes in order to confirm results. Once the identification of 5 independent single copy lines were obtained, Southern analysis was terminated on the remaining plants. In total, 11 ETIP-pDAS000037 lines underwent Southern analysis.
Results of ETIP Transgenic Canola Transformed with PDAS000036 AND PDAS000037
The transgenic Brassica napus events which were produced via transformation of pDAS000036 and pDAS000037 resulted in the production of single copy, full length T-strand insertions. Three to four events for each plant were fully characterized, and were putatively mapped to specific chromosomes within the Brassica napus genome. Although a few single base-pair rearrangements occurred during the T-strand integration, the selected events contained full length expression cassettes which are capable of driving robust expression of the transgene. The selected T0 events were grown to the T1 stage of development. The T1 were res-screened using the above described PCR assays to determine the zygosity of the integrated T-strand. Screened events were categorized as homozygous, hemizygous, or null.
The ETIP sequence was amplified and sequenced from all transgenic events containing only a single copy of the integrated ETIP sequence. The sequence of each T-DNA insert was analyzed by direct sequencing of PCR products. The T-DNA insert was amplified from genomic DNA, using Phusion Hot Start II Polymerase™ (Finnzymes, Thermo Fisher Scientific). Next, the T-DNA was amplified with multiple primer pairs to amplify overlapping sequences of approximately 2 Kb in length. Each PCR product was sequenced with multiple primers to ensure complete coverage. The PCR reactions were treated with Shrimp Alkaline Phosphotase and Exonuclease I (Applied Biosystems, Life Technologies) to inactivate excess primer prior to the sequencing PCR reaction.
The sequences flanking the T-DNA insert of each single copy ETIP line was identified by digestion of purified genomic DNA with eight restriction endonucleases followed by ligation of double-stranded adapters specific for the overhangs created by the restriction endonucleases. Following this step a PCR reaction was performed with a biotinylated primer to either the 3′ or 5′ end of the ETIP and a primer to each adapter. The PCR products were captured and cleaned on Ampure Solid Phase Reversible Immobilization™ (SPRI) beads (Agencourt Bioscience Corporation, Beckman Coulter Company). A nested PCR was performed and all products were sequenced using ABI Sanger Sequencing and Big Dye Terminator v3.1 cycle sequencing protocol (Applied Biosystems, Life Technologies). Sequence data were assembled and analyzed using the SEQUENCHER™ software (Gene Codes Corp., Ann Arbor, Mich.). Eight ETIP lines were identified and selected for flanking sequence analysis (Table 15). The left and right flanking sequences (also described as border or junction sequences) are provided as SEQ ID NO:431-SEQ ID NO:446, the underlined sequences indicated plasmid vector, the non-underlined sequences indicate genomic flanking sequence.
CGTTTTTAATGTACTGAATTAACGCCGAATTGAATTCGAGCTCGGTACCC
GGGGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTAGCTTGAGCTTG
GATC
CAATGACTTCAAATCTACTTGAAGGCATGGAGTATAAGCCATGTTCCTTT
CAGAGGGGACTGTACTTCTGTAGATTACTTTCCCTCATTAACCAGATCTG
GCCGGCCTACCCAGCTTTCTTGTACACATAGCGACCGAGCTCGAGCCGAA
TGAATTAACGCCGAATTGAATTCGAGCTCGGTACCCGGGGATCCTCTAGA
GTC
TGAAGCAATGACTTCAAATCTACTTGAAGGCATGGAGTATAAGCCATGTT
CCTTTCAGAGGGGACTGTACTTCTGTAGATTACTTTCCCTCATTAACCAG
ATCTGGCCGGCCTACCCAGCTTTCTTGTACAAAGTGGTGATAAACTATCG
TTTTTAATGTACTGAATTAACGCCGAATTGAATTCGAGCTCGGTACCCGG
GGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTAGCTTGAGCTTGGA
TCAGATTGTC
TCAGTGTTTGATTAAAGATAAAATTTGATTTTTCATTACATAATAATCCA
GTGTGAACAAATTGACGCTTAGACAACTTAATAACACATTGCGGACGTTT
TTAATGTACTGAATTAACGCCGAATTGAATTCGAGCTCGGTACCCGGGGA
TCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTA
TTCAGAGGGGACTGTACTTCTGTAGATTACTTTCCCTCATTAACCAGATC
TGGCCGGCCTACCCAGCTTTCTTGTACAAAGTGGTGATAAACTATCAGTG
TTTGAACATATATATACGCATAATATTCTCAGAACCCGACCCATTGGTTG
TAGACAACTTAATAACACATTGCGGACGTTTTTAATGTACTGAATTAACG
CCGAATTGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGC
AGGCATGCAAGCTTAGCTTGAGCTTGGAT
ACTTCAAATCTACTTGAAGGCATGGAGTATAAGCCATGTTCCTTTCAGAG
GGGACTGTACTTCTGTAGATTACTTTCCCTCATTAACCAGATCTGGCCGG
CCTACCCAGCTTTCTTGTACAAAGTGACGATAAACTATCAGTCTTTCAAA
AATTGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGG
CATGCAAGCTTAGCTTGAGCTTGGATCAGATTGTCGTTTCCCGC
CATGTTCCTTTCAGAGGGGACTGTACTTCTGTAGATTACTTTCCCTCATT
AACCAGATCTGGCCGGCCTACCCAGCTTTCTTGTACAAAGTGGTGATAAA
CTATCAGTGTTTGACTGAATTTTAATTTCTAATTTTTGTAAAAAATTTGT
TTTTTAATGTACTGAATTAACGCCGAATTGAATTCGAGCTCGGTACCCGG
GGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTAGCTTGAGCTTGGA
TCAGATTGTC
CAATGACTTCAAATCTACTTGAAGGCATGGAGTATAAGCCATGTTCCTTT
CAGAGGGGACTGTACTTCTGTAGATTACTTTCCCTCATTAACCAGATCTG
GCCGGCCTACCCAGCTTTCTTGTACAAAGTGGTGATAAACTATCAGTGTT
TGAAATAATCGGATATTTAATTTTCTTAGACAGTTCATTAGTAGTTGATC
ACGTTTTTAATGTACTGAATTAACGCCGAATTGAATTCGAGCTCGGTACC
CGGGGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTAGCTTGAGCTT
GGATCAGATTGTCGTTTC
AGATTACTTTCCCTCATTAACCAGATCTGGCCGGCCTACCCAGCTTTCTT
GTACAAAGTGGTGATAAACTATCAGTGTTAGATCCCCGACCGACCGCCCA
Mapping of ETIPS
For each transgenic event containing a single copy insertion of the ETIP, the flanking sequence was taken following manual assembly and used as the query in a local BLAST analysis. There were a total of eight plants that had single copy integrations identified by this process (Table 16 and Table 17). A collection of 595,478 genomic derived shotgun sequences from Brassica oleracea were downloaded from the NCBI GSS database and formatted as a nucleotide BLAST database. The flanking ETIP sequences were then BLASTn compared to the database and all matches were manually examined. The most significant sequence match to the flanking ETIP sequence from the B. oleracea database was then taken and aligned against the online Brassica rapa genome sequence (http://brassicadb.org/brad/blastPage.php) where the position in the genome that had the most significant sequence match was also retrieved. In instances where a only the 5′ or 3′ flanking sequences provided significant matches with the B. oleracea genome sequences, it was assumed that the unaligned or unmatched sequence had either; identified missing sequence from the database, or that there had been significant genome rearrangements generated during the integration of the ETIP. For the samples that generated significant BLASTn matches from the analysis the flanking ETIP sequence, the most significant B. oleracea GSS matching sequence along with the most significant matching sequence from the B. rapa genome, were then manually aligned in Sequencher™ v5.0 software (Gene Codes Corp., Ann Arbor, Mich.) for each of the eight single copy ETIP plants. The three sequences were then compared and the most similar sequence from either of the diploid Brassica species compared to the flanking ETIP was designated the genome that the ETIP was located in. For the majority of the samples significant variation did exist between the two diploid Brassica genome sequences and the B. napus derived flanking ETIP sequence showed a predominant association with one or other of the diploid sequences. There were instances however, where there was insufficient sequence variation between the diploids and a linkage group assignment may have been possible but a sub-genome assignment was not possible. The specific genome location was then predicted from the location from the Brassica rapa genome sequence. In instances where the ETIP was identified as being integrated into the B. oleracea C genome, the comparative synteny between the diploid Brassica genomes described in Parkin et al. (Genetics 2005, 171: 765-781) was used to extrapolate the genomic location into the Brassica napus C sub-genome. In addition the sequences identified were BLASTn compared to the Arabidopsis thaliana genomes coding sequences (TAIR 9 CDS downloaded from http://arabidopsis.org/index.jsp) and the identity of any gene sequences disrupted were identified, as well as a confirmation of genomic location following the Arabidopsis Brassica synteny described in Schranz et al. (Trend in Plant Science 2006, 11, 11: 535-542).
Arabidopsis gene
Arabidopsis genes
The homozygous events are used to produce protoplasts via the previously described method. The protoplasts are subsequently co-transformed with a Zinc Finger Nuclease that is designed to target a Zinc Finger binding site which is incorporated within the ETIP sequence and a donor plasmid which shares homology with specific regions of the ETIP. The Zinc Finger Nuclease cleaves the ETIP locus and the donor plasmid is integrated within the genome of Brassica napus cells via homology directed repair. As a result of the integration of the donor plasmid, the partial DS-red transgene is repaired to a full length DS-red transgene. The expression of the now fully operational DS-red transgene is used to sort protoplast cells with a FACS method. Putative transgenic plants are sorted using the FACS method described in Example 7 and the isolated protoplasts are regenerated into mature plants. The integration of the donor plasmid is confirmed within the ETIP-targeted plants using molecular confirmation methods. As such, the ETIP locus serves as a site-specific locus for gene targeted integration of a donor polynucleotide sequence.
The genomic targeting locations provide genomic locations that do not alter the plants normal phenotype. The resulting events, wherein a transgene is targeted within an ETIP present no agronomically meaningful unintended differences when the ETIP events are compared to the control plants. In addition, the protein expression levels of transgenes integrated within the ETIP locus are robustly expressed and consistent and stable across multiple genomic locations. The disclosed genomic sequences of SEQ ID NO:431 to SEQ ID NO:446 provide genomic locations within the brassica genome that are targetable for the integration of gene expression cassettes comprising a transgene.
Molecular Confirmation of Fad2a Integration of ETIPS in Canola
Genomic DNA was extracted from leaf tissue of all putative transgenic plants using a DNeasy Plant Mini Kit™ (Qiagen) following the manufacturer's instructions, with the exception that tissue was eluted in 80 μl of AE buffer. Thirty milligrams of young leaf tissue from regenerated plants was snap frozen in liquid nitrogen before being ground to a powder.
Molecular characterization of the FAD2A locus was performed using three independent assays. Assays were designed and optimized using the following controls; characterized transgenic events comprising a single randomly integrated transgene, characterized transgenic event with five randomly integrated transgenes, wild-type canola c.v. DH12075 plants and non-template control reactions. The results from the three following molecular analyses are considered together in order to provide evidence for integration of the ETIP at FAD2A via HDR.
Identifying Transgene Integration by Real-Time Polymerase Chain Reaction
Four replicates of each plant were analyzed using primers specific to the hph (also described as hpt) target gene (SEQ ID NO:447, hpt F791 5′ CTTACATGCTTAGGATCGGACTTG 3′; SEQ ID NO:448, hpt R909 5′ AGTTCCAGCACCAGATCTAACG 3′; SEQ ID NO:449, hpt Taqman 872 5′ CCCTGAGCCCAAGCAGCATCATCG 3′ FAM) (
Detection of Disrupted FAD2A ZFN Site
Each plant was analyzed for presence or absence of amplification of endogenous target in the disrupted locus test, which is a dominant assay. The assay is a SYBR® Green I qPCR assay and in singleplex, but with each reaction run simultaneously on the same PCR plate, targets an endogenous locus (FAD2A/2C.RB.UnE.F1, SEQ ID NO:453, 5′ CTTCCACTCCTTCCTCCTCGT*C 3′ and FAD2A/2C.RB.UnE.R1, 5′ SEQ ID NO:454, GCGTCCCAAAGGGTTGTTGA*G 3′) and the ZFN locus (locus at which the ZFN pDAB104010 binds and cuts the genome) (FAD2A.UnE.F1, SEQ ID NO:455, 5′ TCTCTACTGGGCCTGCCAGGG*C 3′ and FAD2A.UnE.R1, SEQ ID NO:456, 5′ CCCCGAGACGTTGAAGGCTAAGTACAA*A 3′) (
Plants that had amplification of the endogenous target but no amplification of the ZFN target, were scored as positive for the disrupted locus test and were considered to have a disrupted ZFN locus. This assay was considered to be positive when the ZFN binding site on both alleles at the FAD2A locus have been disrupted.
PCR Detection of Transgene Integration at FAD2A Via Homology Directed Repair
Each putative plant transformant was analysed using endpoint with PCR primers designed to amplify the transgene target hph (hph_ExoDigPC_F1, SEQ ID NO:457, 5′ TTGCGCTGACGGATTCTACAAGGA 3′ and hph_ExoDigPC_R1, SEQ ID NO:458, 5′ TCCATCAGTCCAAACAGCAGCAGA 3′), the FAD2A endogenous locus (FAD2A.Out.F1, SEQ ID NO:459, 5′ CATAGCAGTCTCACGTCCTGGT*C 3′ and FAD2A.Out.Rvs3, SEQ ID NO:460, 5′ GGAAGCTAAGCCATTACACTGTTCA*G 3′), the region spanning the 5′ end of any transgene inserted into the FAD2A locus via HDR, upstream of the transgene into the FAD2 A locus (FAD2A.Out.F1, SEQ ID NO:461, 5′ CATAGCAGTCTCACGTCCTGGT*C 3′ and QA520, SEQ ID NO:462, 5′ CCTGATCCGTTGACCTGCAG 3′) and the region spanning the 3′ end of any transgene inserted into the FAD2A locus via HDR, downstream of the transgene into the FAD2 A locus (QA558, SEQ ID NO:463, 5′ GTGTGAGGTGGCTAGGCATC 3′ and FAD2A.Out.Rvs3, SEQ ID NO:464, 5′ GGAAGCTAAGCCATTACACTGTTCA*G 3′) (
Amplification of the 5′ transgene-genome flanking target and/or amplification of the 3′ transgene-genome flanking target indicated a putative insertion event. It must be noted that due to the approximately 1,000 bp FAD2A homology arms in the pDAS000130 cassette (comprising polynucleotide sequences with 100% sequence identity to the FAD2A regions immediately upstream and downstream of the ZFN cut site), the PCR reactions were subject to false positive PCR product amplification due to PCR chimerism arising from amplification of off-target ETIP integration events. Amplification of the hph target confirmed transgene integration had occurred Amplification of the FAD2A target suggests that the FAD2A locus is intact or contains only a partial insertion. Due to the size of the ETIP (11,462 bp for the ETIP cassettes or 13,472 bp including the FAD2A homologous arms and the ETIP cassettes) it is expected that the FAD2A primers would not amplify a product when an intact ETIP is integrated into the FAD2A locus.
Southern Detection of FAD2A Editing
Plants that had amplification of either a 5′ genome-transgene flanking target product and/or amplification of a 3′ transgene-genome flanking target, or no amplification of the ZFN locus target, or both, were subject to Southern analysis for detection of transgene integration at the FAD2A locus. Genomic DNA was purified from 5 g of leaf tissue using a modified CTAB method (Maguire, T. L., G. G. Collins, and M. Sedgley A modified CTAB DNA extraction procedure for plants belonging to the family proteaceae. Plant Molecular Biology Reporter, 1994. 12(2): p. 106-109). Next, 12 μg of genomic DNA was digested with Kpn1-HF (New England BioLabs) and digestion fragments were separated by electrophoresis on a 0.8% agarose gel before transfer to membrane using a standard Southern blotting protocol. Primers to FAD2A 5′ target region (F, SEQ ID NO:465, 5′ AGAGAGGAGACAGAGAGAGAGT 3′ and R, SEQ ID NO:466, 5′ AGACAGCATCAAGATTTCACACA 3′), FAD2A 3′ target region (F, SEQ ID NO:467, 5′ CAACGGCGAGCGTAATCTTAG 3′ and R, SEQ ID NO:468, 5′ GTTCCCTGGAATTGCTGATAGG 3′) and hph (F, SEQ ID NO:469, 5′ TGTTGGTGGAAGAGGATACG 3′ and R, SEQ ID NO:470, 5′ ATCAGCAGCAGCGATAGC 3′) were used to generate probes to detect the presence of the ETIP within the FAD2A locus using the DIG E
Membrane-bound genomic DNA was probed in a specific order; firstly FAD2A 5′ sequences were probed, then the FAD2A 3′ sequences were probe, and finally the hph sequences were probed (
Results of Molecular and Southern Analysis
Following transfection, culturing, and selection the transgenic plants were transferred to soil. From this process, 139 plants survived and had tissue sampled for gDNA extraction and analysis. All 139 plants were analyzed for copy number estimation. Of these 139 plants, 56 were positive for the ETIP and 11 of the 56 positive plants had a putative single copy integration (
Results of Etip Trans Genic Canola Transformed with pDAS000130 and pDAB104010.
The transgenic Brassica napus events which are produced via transformation of pDAS000130 and pDAB104010 result in the integration of a single copy, full length T-strand insertion of the ETIP polynucleotide sequence from pDAS000130 within the FAD2A locus. Three to four events are fully characterized and confirmed to contain the integrated ETIP. The confirmation is completed using an in-out PCR amplification method, and further validated via Southern blot. The selected T0 events are grown to the T1 stage of development. The T1 plants are re-screened to determine the zygosity of the integrated T-strand. Screened events are categorized as homozygous, hemizygous, or null.
The homozygous events are used to produce protoplasts via the previously described method. The protoplasts are subsequently co-transformed with a Zinc Finger Nuclease that is designed to target a Zinc Finger binding site which is incorporated within the ETIP sequence and a donor plasmid which shares homology with specific regions of the ETIP wherein the donor is integrated within the ETIP via an HDR mechanism. Likewise, the protoplasts are subsequently co-transformed with a Zinc Finger Nuclease that is designed to target a Zinc Finger binding site which is incorporated within the ETIP sequence and a donor plasmid which does not share homology with specific regions of the ETIP, wherein the donor is integrated within the ETIP via an non-homologous end joining mechanism. The Zinc Finger Nuclease cleaves the ETIP locus and the donor plasmid is integrated within the genome of Brassica napus cells via homology directed repair or non-homologous end joining. As a result of the integration of the donor plasmid, the partial DS-red transgene is repaired to a full length DS-red transgene. The expression of the now fully operational DS-red transgene is used to sort protoplast cells with a FACS method. Putative transgenic plants are sorted using the FACS method described in Example 7 and the isolated protoplasts are regenerated into mature plants. The integration of the donor plasmid is confirmed within the ETIP-targeted plants using molecular confirmation methods. As such, the ETIP locus serves as a site-specific locus for gene targeted integration of a donor polynucleotide sequence.
Results of Etip Transgenic Canola Transformed with Zinc Finger Nuclease and pDAS000271-pDAS000275 Etip Constructs
The transgenic Brassica napus events which are produced via transformation of ETIP and Zinc Finger Nuclease constructs result in the integration of a single copy, full length T-strand insertion of the ETIP polynucleotide sequence from pDAS000273 or pDAS275 within the FAD3A locus, and from pDAS000271, pDAS000272 or pDAS000274 into the FAD3C locus. Three to four events are fully characterized and confirmed to contain the integrated ETIP. The confirmation is completed using an in-out PCR amplification method, and further validated via Southern blot. The selected T0 events are grown to the T1 stage of development. The T1 plants are res-screened to determine the zygosity of the integrated T-strand. Screened events are categorized as homozygous, hemizygous, or null.
The homozygous events are used to produce protoplasts via the previously described method. The protoplasts are subsequently co-transformed with a Zinc Finger Nuclease that is designed to target a Zinc Finger binding site which is incorporated within the ETIP sequence and a donor plasmid which shares homology with specific regions of the ETIP. The Zinc Finger Nuclease cleaves the ETIP locus and the donor plasmid is integrated within the genome of Brassica napus cells via homology directed repair. As a result of the integration of the donor plasmid, the partial DS-red transgene is repaired to a full length DS-red transgene. The expression of the now fully operational DS-red transgene is used to sort protoplast cells with a FACS method. Putative transgenic plants are sorted using the FACS method described in Example 7 and the isolated protoplasts are regenerated into mature plants. The integration of the donor plasmid is confirmed within the ETIP-targeted plants using molecular confirmation methods. As such, the ETIP locus serves as a site-specific locus for gene targeted integration of a donor polynucleotide sequence.
Brassica napus protoplasts that were transfected with the DS-Red control construct, pDAS000031, were sorted via FACS-mediated cell sorting using a BD Biosciences Influx-Cell sorter™ (San Jose, Calif.). The protoplast cells were isolated and transfected as described in Example 3. After the cells had been transfected with pDAS000031, the cells were sorted using the FACS sorter with the conditions described in Table 21.
The protoplasts which expressed the DS-red transgene were sorted and isolated. The FACS isolated protoplasts were counted using the sorter. About 1×105 to 1.8×105 of cells were placed in a well of a 24-well micro titer plate on the first day after the FACS isolation. The cells were transferred to a bead culture for 5 to 20 days. Similar conditions were tested, wherein about 1×104 of cells were placed in a well of a 2 or 4-well micro titer plate on the second day after the FACS isolation. The various conditions that were tested resulted in the recovery of cells at a viability or 95-98% of the total isolated protoplast cells. The FACS sorted protoplast cells were transferred to a bead culture for 3-20 days. The FACS sorted protoplast cells were regenerated into plants on media which contained 1.5 mg/mL of hygromycin using the above described protocol. The putative transgenic plants were confirmed to contain an intact T-strand insert from pDAS000031 via molecular conformation protocols.
Targeting of Etip Lines with ZFN Mediated Homologous Recombination Of Ds-Red
A canola line containing the T-strand insert from pDAS000036 was obtained and confirmed via molecular characterization to contain a full length, single copy of the T-strand. This canola event was labeled as pDAS000036-88 and was used to produce protoplasts via the previously described method. The protoplasts were isolated and ˜50,000 canola protoplast cells were subsequently co-transformed with a Zinc Finger Nuclease, either pDAS000074 (
The donor plasmid DNA and ZFN plasmid DNA were mixed at various concentrations and used to transfect the canola protoplast cells containing Event pDAS000036-88, and the transgenic protoplast cells were sorted using the FACS transfection that was previously described. Table 22 describes the various transfection experiments and the DNA concentrations which were used for the transfection of the canola protoplasts containing Event pDAS000036-88. The ZFN and donor plasmid DNA was isolated and prepared for the transfections via the previously described methods.
After the transfection experiments were completed the protoplasts were incubated at room temperature for 48 hours and sorted using the above described FACS protocol. Each experiment was sorted independently and Zinc Finger-mediated introgression of a transgene was confirmed via identification of individual events which expressed the DS-red transgene.
Selected explants were transferred and cultured upon regeneration media containing phosphothrinocin. After the culturing period the surviving explants were transferred to elongation medium and root induction medium for culturing and plant development. Whole plants that consisted of developed root and shoot structures were transferred into soil and further propagated in the greenhouse. The tissue culture process utilized media and culture conditions as previously described above. The results of plants produced from the tissue culturing process are shown in Table 23 below.
The FACS sorting method is directly applicable to screen any fluorescent transgene sequence and is used to isolate a proportion of any protoplast, herein Brassica napus protoplast cells that are targeted with a fluorescent transgene via homology mediated repair within a specific site in the ETIP region within a genomic locus.
While certain exemplary embodiments have been described herein, those of ordinary skill in the art will recognize and appreciate that many additions, deletions, and modifications to the exemplary embodiments may be made without departing from the scope of the following claims. In addition, features from one embodiment may be combined with features of another embodiment.
The present application claims priority to the benefit of U.S. Provisional Patent Application Ser. No. 61/697,890, filed Sep. 7, 2012, the disclosure of which is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
61697890 | Sep 2012 | US |