The invention relates to the field of molecular biology, cell biology and biotechnology. More, in particular, the invention relates to an efficient method for obtaining site-specific, marker-less integration (or deletion) of a sequence of interest in (or from) the genome of a cell.
Autonomously replicating plasmids are widely used for the introduction of desired DNA sequences into a host cell. However, stability is relatively low and plasmids are often lost rapidly when selection pressure is relieved. Furthermore, plasmid copy number varies per cell, resulting in low reproducibility. For these reasons, in biotechnology, the desired DNA is often inserted into the host genome by homologous recombination or by use of a host-specific integration system. This increases stability, while reproducibility is also higher.
Integration methods invariably require a marker on the delivery plasmid, such as an antibiotic-resistance gene or an antibiotic-free selectable marker, such as BADH (betaine aldehyde dehydrogenase, see, for example, PCT International Patent Publication WO 01/64023), which is unattractive from a biotechnological point of view, certainly if (part of) the production organism ends up in the food chain. Examples of the latter are genetically modified plants and vegetables or biomass of microbial origin (e.g., fungi) used in animal feeds. The use of chromosomal insertion by homologous recombination with concomitant loss of the vector sequences and its marker is, therefore, preferable.
After a certain recombination event, two possible recombinants can arise. The desired recombinant can either be selected or screened for. Selection is defined as having a way to force the presence of the desired situation, such as adding an antibiotic requires the presence of antibiotic resistance, while screening is defined as searching for a certain nonselectable characteristic, such as DNA sequencing, to establish the presence or absence of a change (insertion, deletion, mutation) in the genomic DNA.
One of the main problems with existing gene replacement protocols is that if there is no selectable marker left on the genome, this typically results in seemingly endless screening for the desired recombinants, without the certainty of success, making the process highly cost-ineffective and time consuming.
An overview of the various possibilities for gene inactivation strategies, with focus on actinomycetes, is given in Kieser et al. 2000. Gene replacement is a two-step procedure, with an insertion step and an excision event. While the insertion step can be selected due to the presence of a selectable marker on the plasmid, typically an antibiotic resistance marker, the second step often involves an indirect screening step (such as loss of the resistance cassette) unless a marker is left on the genome. However, the latter possibility is highly undesirable in biotechnological production strains.
Therefore, methods have been sought to achieve a more effective final recombination event. The main goal is, therefore, to find a way to avoid the need of screening in the last step, a process often involving replicating thousands of colonies to find a recombinant with the desired phenotype. One possible solution is to build in a counter-selectable marker into the delivery plasmid. For this purpose, the glkA gene has been used in streptomycetes by several laboratories (e.g., Buttner et al. 1990, van Wezel et al. 1995) and forms the basis for the disruption vector pIJ2581 (van Wezel and Bibb 1996). In such experiments, loss of the delivery plasmid is forced by growth on the glucose analogue 2-deoxyglucose, which is lethal for glkA+ strains. A disadvantage is that the host remains a glkA mutant, and this typically requires replacement by the wild-type glkA gene, for example, by crossing. Furthermore, unless a selectable marker, such as an antibiotic-resistance marker (or an antibiotic free selectable marker), is inserted into the target sequence on the genome, secondary recombinants still have to be screened for the correct recombination event, such as deletion or insertion of a DNA sequence. This is typically done by isolating genomic DNA from a large number of recombinants, followed by PCR and DNA sequencing, which is time consuming, and often the desired mutation or insertion is not found in the screen. A recent method designed to allow screening for deletions involves an initial gene replacement experiment whereby the DNA sequence to be deleted is replaced by a resistance marker, followed by recombinatory removal of the resistance marker, resulting in the desired deletion. This involves two cross-overs instead of one double, still without positive selection in the final step.
While such intensive screening programs to find the desired second recombination event are problematic in prokaryotes, growing hundreds of recombinant species to find a desired genotype without the aid of a selectable marker is often impossible in plants and other higher eukaryotes. For this reason, in plant biotechnology, scientists resort to the unfavorable solution of leaving a selectable marker (such as bialaphos or kanamycin-resistance) on the genome of the recombinant.
In recent years, the use of selectable marker genes for the identification of recombinant microorganisms has become subject to intensive debate. The spread of such marker genes into the environment is highly undesirable and even more so if these marker genes encode resistance against antimicrobial agents. Therefore, other markers (such as BADH) have been developed to reduce this environmental risk, although their use is also subject to restraints. Therefore, ideally, one would like to produce recombinant organisms that do not contain any additional DNA other than the DNA of interest. An example of a method for marker-free integration of DNA into the genome of microorganisms (fungi and bacteria) is presented in European patent application 0 635 574 A1. In this case, the DNA of interest is cloned on a vector between two identical DNA sequences and, after integration of the plasmid into the host genome, the inventors select for a second recombination event, removing all vector sequences other than the DNA of interest and a single element of the repeat sequence. However, while such a method results in a marker-free recombinant strain, still a significant section of additional (and in principle undesired) DNA is left in the host genome, typically with a minimum length of several hundreds of base pairs. Furthermore, vector systems with repeat sequences typically have reduced stability due to possible recombination events between the repeats during plasmid replication.
There is, therefore, a clear need for an integration (or deletion) method in which all steps are selectable and particularly, the final recombination event, where the only change to the host genome is (preferably) the desired insertion (or deletion), without leaving a selection marker (e.g., an antibiotic resistance gene) or a mutant host (e.g., a glucokinase mutant in bacteria or an ethylene receptor mutant in plants).
Disclosed is a reliable and highly effective method for inserting (or deleting) a sequence of interest (preferably DNA) into (or from) the host genome, which is, for example, used for the insertion of an expression cassette for enzyme production. The method is easily translated to any organism, provided that the suitable criteria for the organism are met. For example, the choice of selection marker (which is used in the method according to the invention but which is later removed) is adapted to the organism. Upon reading the detailed description of the present invention, it will be clear to a person skilled in the art that the invention is applicable to a broad range of hosts (cells), as the only principal requirement is the availability of a transformation procedure and a selectable (or screenable) host gene.
Provided are multiple methods resulting in a site-specific, markerless integration (or deletion) of a sequence of interest. The methods are based on the following principles:
The genomic presence in a cell (host cell) of a selectable or screenable gene X. As a result of a mutation, this gene X can be essentially sensitive or insensitive to a certain component or condition Z or, as a result of a mutation in gene X, the host cell is made dependent on the presence of a certain component or condition Z.
A plasmid on which the desired insertion (or deletion) is present, further harboring a truncated version of gene X and a selectable marker (such as an antibiotic-resistance or an antibiotic-free selectable marker gene) to select or screen for the presence or absence of vector sequences in the host cell.
Positive selection of the final recombination step, avoiding complicated and time-consuming screening for the desired recombinants. In a preferred situation, both recombination steps can be positively selected.
When the starting host cell contains a mutant gene X on the genome, the final recombinant has a gene X without a mutation and, preferably, the genome further only comprises the desired insertion or deletion. When the starting host cell contains an original gene X, the final recombinant bears a mutation in gene X and (preferably) the genome further only comprises the desired insertion or deletion.
Some of the possibilities are exemplified in
Table 1 discloses an overview of selection criteria that can be applied in the recombination scheme as depicted in
Preferably, gene X is an endogenous gene. As a result of a mutation, gene X can be essentially sensitive or insensitive to a certain component or condition Z or, as a result of a mutation in gene X, the host cell is made dependent on the presence of a certain component or condition Z. The term “gene X” is not restricted to the sequence encoding the open reading frame of the corresponding protein, but typically also comprises the necessary sequences for proper transcription and translation, in particular, promoter and/or termination sequences as well as the ribosome-binding site.
The invention provides in one embodiment, a method for obtaining site-specific, marker-less integration of a sequence of interest in the genome of a cell, wherein the genome comprises a selectable or screenable gene X and a sequence Y, the method comprising:
The positive selection for the second recombination event for obtaining a cell with a recombinant genome in which an internal recombination event has occurred through gene X and the truncated version of gene X is based on the presence of the final desired genomic version of gene X. This is explained in more detailed hereunder.
In a preferred embodiment, the invention provides a method wherein the plasmid essentially cannot replicate during the first recombination event. Hence, when a selection step is performed for the presence of the selection marker of the plasmid, only cells in which the genetic information from the plasmid has been integrated in the genome will survive. Preferably, the plasmid can replicate in the cell to multiple copies after the transfer of the plasmid to the cell and replication is blocked during the first recombination event. Such a feature is, for example, obtained by providing the plasmid with a conditionally dependent ori. This increases the efficiency of the first recombination event.
In yet another preferred embodiment the invention provides a method that further comprises a check after the second recombination event for loss of the selection marker of the plasmid. This is easily performed by comparing the growth of, for example, bacterial colonies on plates with and without the corresponding antibiotic. In yet another preferred embodiment, the invention provides a method wherein the obtained cell in which an internal recombination event has occurred through gene X and the truncated version of gene X is checked for the presence of the sequence of interest, for example, PCR analysis followed by sequence analysis.
In a preferred embodiment, the invention comprises a method, wherein the selectable or screenable gene X is selectable or screenable via a component or a chemical and/or physical condition or wherein the cell is dependent on the presence of the component or condition due to the presence of the selectable or screenable gene X. Examples of suitable combinations of gene X and component or condition will be outlined below. Some examples of a component or a chemical and/or physical condition are temperature, light, H2O2, vitamins and amino acids.
In another preferred embodiment, the invention provides a method wherein the truncated version of gene X is inactive through truncation but otherwise original (for example, as illustrated in
In an even more preferred embodiment, the invention provides a method, wherein the final recombinant has, except for the desired insertion, an original genome (more specifically, with an original gene X) and, even more preferably, a method wherein both recombination steps are selectable. However, it is clear from
The term “original” is herein used to refer to the starting situation before a method according to the invention, possibly preceded by a method for preparing the cell in which the site-specific, marker-less integration or deletion must take place, is applied. For example, when gene X is glkA (encoding glucose kinase), the original genome comprises a glk gene, which results in a sensitive phenotype of the cell to 2-deoxyglucose (2-DOG). First, a mutant of the glk gene is produced (by methods known to a person skilled in the art), which mutant renders the cell insensitive to 2-DOG (hence, a selectable gene X is obtained). Then, a method according to the invention is applied and after the final recombination event, the resulting cell will comprise a glk gene that renders the cell (again) sensitive for 2-DOG. It is clear that this logic can be applied mutatis mutandis to the other examples of gene X herein disclosed.
The method according to the invention can be carried out with different types of gene X and non-limiting examples are disclosed herein.
In one of the embodiments, gene X is mutated such that it is essentially insensitive to a certain component or condition Z and, hence, the invention provides a method for obtaining site-specific, marker-less integration of a sequence of interest in the genome of a cell, wherein the genome comprises a gene X, which, as a result of a mutation, is essentially insensitive to a certain component or condition Z, the genome further comprising a sequence Y, the method comprising:
The choice for condition V is based on the final (desired) outcome of the genomic version of gene X. Hence, this choice is based on the characteristics/properties of the genomic version of gene X (in the host cell) before the first recombination event through sequence Y of the genome and sequence Y of the plasmid and after a second recombination event through the sequences of gene X and the truncated inactive but otherwise original version of gene X. The choice for condition V will be exemplified in more detail at the different discussions on the figures.
In a preferred embodiment, sequence Y of the genome is located downstream of gene X, which, as a result of a mutation, is essentially insensitive to a certain component or condition Z and wherein the plasmid comprises:
The opposite situation, with selectable gene X downstream instead of upstream of sequence Y, is also possible (
Any sequence can be used as the sequence of interest. Preferably, the sequence enables the production of a product/protein of interest not present as such or present in low concentrations in the cell. Hence, the sequence of interest preferably also comprises the necessary elements for proper transcription and/or translation (such as functionally linked promoter and terminator sequences), for example, a sequence specifying an enzyme, an enzyme inhibitor, an antitumor agent, a bioinsecticide, a part of an antibody (for example, a heavy chain or a light chain), or an anti-migraine agent. However, it should be kept in mind that not only sequences can be inserted according to a method of the invention, but also deletions can be introduced. Examples of the creation of deletions upstream or downstream of a selectable or screenable gene X are shown in
As used herein, the term “vector” is used to indicate a so-called empty (without any extra sequences) cloning vector (for example, pUC18 or pBR322). The term “plasmid” is used to refer to a vector in which a sequence has been cloned (for example, a sequence of interest). Hence, in the final step of a method according to the invention, it is checked whether all vector-related sequences have been removed from the genome, leaving only the sequence of interest behind. The removal of the vector sequences is, for example, determined by screening for loss of the selection marker.
The strategy for constructing the plasmid is briefly described. First, the desired site-specific genomic location for insertion (or deletion) upstream or downstream of a selectable or screenable gene X is determined. This position is preferably chosen such that the promoter and/or terminator can provide (proper) transcription, while translational signals such as the ribosome-binding site are also left intact. In the situation described in
Any common selection marker can be used to identify the presence of vector sequences. The person skilled in the art will know how to select the proper selection marker for each cell type. For example, ampicillin, apramycin or kanamycin for an E. Coli cell, apramycin, hygromycin or kanamycin for a streptomycete, or kanamycin, cyanamid or hygromycin for a plant cell.
The plasmid used in the method according to the invention further comprises all necessary elements for cloning and propagation in a host other than the host that is the target for the chromosomal insertion or deletion, for example; an origin of replication (ori) enabling the production or maintaining of the plasmid in E. coli. The person skilled in the art is very well capable of selecting all the necessary elements and a detailed discussion on this item is, therefore, not provided.
In a preferred embodiment, gene X, which, as a result of a mutation, is essentially insensitive to a certain component or condition Z, is mutated close to the 3′ end of gene X in the situation outlined in
When the starting host cell contains a mutant gene X on the genome, the final recombinant has a gene X without a mutation and the genome further only comprises the desired insertion or deletion. When the starting host cell contains an original gene X, the final recombinant bears a mutation in gene X and the genome further only comprises the desired insertion or deletion.
In a preferred embodiment, the method according to the invention comprises a screening step after the first recombination event to rule out that recombination has occurred upstream of the cloned DNA, but downstream of the site of mutation (see
The length of gene X is not critical. In principle, such a gene will be at least several 100 bp because it encodes a protein that is essentially insensitive to a certain component or condition Z. However, it is clear to a person skilled in the art that, in general, the frequency of the recombination increases with increasing lengths of gene X and sequence Y. With regard to applications in actinomycetes and especially in streptomycetes, the lengths of gene X and sequence Y are preferably at least 400 to 500 bp to ensure an acceptable frequency. A person skilled in the art is very well capable of selecting, based on the cell in which the integration must take place, a suitable length of gene X. For example, for use of a method according to the invention in E. coli, the length of gene X can be reduced well below 100 bp.
In principle, it is possible to use non-homologous DNA sequences for recombination, although recombination frequencies are strongly reduced. Several examples are given in Kieser et al. (2000). However, to get recombination between heterologous genes at a reasonable frequency, more than 95% identity between the genes or regions in which recombination is desired, is highly desirable, and certainly when there is no positive selection, such as in the final step of most double cross-over events. Hence, in a preferred embodiment, the sequence Y on the genome and sequence Y on the plasmid are at least 95% identical. The same is true for gene X on the genome and gene X on the plasmid.
In an even more preferred embodiment, the mutation in gene X comprises a point mutation and, more preferably, a point mutation at the 3′ or 5′ end which ensures a final recombinant with an original gene X.
In yet an even more preferred embodiment, the invention provides a method according to the invention wherein the substantial part of sequence Y located downstream of the sequence of interest is approximately of the same length as the 5′ truncated inactive but otherwise original version of gene X, to improve the probability of the desired second recombination event. A substantial part is herein defined as a part that is capable of providing recombination. The length and overall homology depends on the cell used. For example, recombination in a hyperrecombinant E. coli strain can take place with sequences as small as 40 bp. Recombination in streptomycetes typically involves sequences of at least 400 bp. A person skilled in the art knows how to select the proper length and, hence, no further details are provided.
It is clear from the description in
In principle, every gene whose wild-type product confers sensitivity to a certain component or condition Z can be applied in a method according to the invention. All that is preferably needed is a genomic mutant of the gene, preferably with the mutation close to the 3′ end of the gene (for the situation as depicted in
In a preferred embodiment, gene X is followed by a transcriptional terminator on the genome and insertion of the sequence of interest has no effect on the proper transcription and/or translation of downstream-located genes. This avoids polar effects on downstream-located genes. A detailed analysis of these problems and ways to avoid them are outlined in the experimental part.
Another example of such a combination of a gene X that is essentially insensitive to a certain component or condition Z and sensitive to a component or condition V, is mutated rpsL (encoding r-protein S12) and streptomycin. In this special case, both V and Z are streptomycin. Several streptomycin-dependent mutants are known in prokaryotes (Timms et al. 1992), which require streptomycin for growth due to strongly enhanced accuracy of translation in these mutants, which is counteracted by streptomycin. Replacing glkA by rpsL in
Yet another example of such a combination of a gene X that is essentially insensitive to a certain component or condition Z and sensitive to a component or condition V, which is applicable in plants, is a mutated gene for the ethylene receptor protein 1 (ERP1). An alignment of EPR1 homologues from various plants is shown in
In yet another embodiment, gene X is mutated such that the (host) cell requires the presence of a component or condition Z and, hence, the invention provides a method for obtaining site-specific, marker-less integration of a sequence of interest in the genome of a cell, wherein the genome comprises a mutated gene X and wherein the cell is, due to the mutated gene X, dependent on the presence of a certain component or condition Z, the genome further comprising a sequence Y, the method comprising:
In a preferred embodiment, sequence Y of the genome is located downstream of the mutated gene X, wherein the plasmid comprises:
Cells or host cells for use in such a method are readily obtainable by, for example, classical mutagenesis methods known by the person skilled in the art.
This part of the invention is illustrated in
Preferably, the mutated gene X is a mutated amino acid biosynthesis gene and component or condition Z is the corresponding amino acid. In another preferred embodiment, the mutated gene X is a mutated vitamin biosynthesis gene and component or condition Z is the corresponding vitamin.
Such cells or host cells are readily obtainable by, for example, classical mutagenesis methods known by the person skilled in the art.
This part of the invention is exemplified by the use of auxotrophic markers. Auxotrophy is the inability of, in general, microorganisms to synthesize certain compounds, such as amino acids, from precursors. In contrast to corresponding wild-type strains, auxotrophic variants do not grow on so-called minimal media. Auxotrophic strains only grow on minimal media supplemented with the required growth factors, such as vitamins and/or amino acids.
This part of the invention is exemplified in
Recombinants obtained via the second possible recombination are screened, for example, by comparing the growth of recombinants in the presence or absence of the corresponding amino acid. In a further recombination event, selection is made for recombination between the sequence upstream of the mutation in the chromosomally located (mutant) copy of the amino acid biosynthesis gene and the 5′ truncated inactive but otherwise original version of the amino acid biosynthesis gene. These final recombinants are then selected by their ability to grow on media which do not contain the corresponding amino acid and, optionally, screened for absence of the selection marker of the plasmid. Also optionally, the presence of the sequence of interest is confirmed by, for example, a PCR and/or sequence analysis.
This method provides a positive selection step for identifying the desired final recombinants and, hence, the success rate of identifying a final desired recombinant is optimized, avoiding failed experiments, and experimental time and effort reduced significantly.
In principle, every gene whose original product confers the ability to grow without the need for amino acids, vitamins and other essential building blocks can be applied in a method as described above. All that is required is an endogenous gene located on the genome, with a mutation at either end of the gene, making the cell dependent on a certain component or condition Z.
In yet another embodiment, the genome comprises a gene X, which, as a result of a mutation, is essentially sensitive to a component or condition Z and, hence, the invention provides a method for obtaining site-specific, marker-less integration of a sequence of interest in the genome of a cell, wherein the genome comprises a gene X, which, as a result of a mutation, is essentially sensitive to a certain component or condition Z, the genome further comprising a sequence Y, the method comprising:
In a preferred embodiment, the invention provides a method, wherein sequence Y of the genome is located downstream of gene X, which, as a result of a mutation, is essentially sensitive to a certain component or condition Z and wherein the plasmid comprises:
Again, it is clear to a person in the art that, similarly to the situations illustrated in
Preferably, gene X, which, as a result of a mutation, is essentially sensitive to a certain component or condition Z, is a mutated peroxidase or catalase gene and component or condition Z is H2O2. Another example of a gene X, which, as a result of a mutation, is essentially sensitive to a certain component or condition Z, is a gene which is sensitive to a certain antibiotic and that becomes resistant after the final recombination event. Hence, in another preferred embodiment, gene X, which, as a result of a mutation, is essentially sensitive to a certain component or condition Z, is a mutated β-lactamase and component or condition Z is a β-lactam-antibiotic. Yet another example of a gene X, which, as a result of a mutation, is essentially sensitive to a certain component or condition Z, is a gene that is sensitive to elevated or reduced temperatures, known as heat-shock or cold-shock conditions, respectively.
Cells or host cells for use in such a method are readily obtainable by, for example, classical mutagenesis methods known by the person skilled in the art.
This method is exemplified by the use of catalase as a gene X that is as a result of a mutation essentially sensitive to H2O2 and proceeds through the following steps (see
Examples of genes which can be used in this part of the invention are katG (E. coli, Synechocystis PCC6803), cpeB (S. coelicolor).
Another example of a gene X, which, as a result of a mutation, is essentially sensitive to a certain component or condition Z, is a thermosensitive (Ts) mutation that renders the (host) cell sensitive to higher temperatures. While this does not allow positive selection for one of the two alternative recombination events (1 and 2 in
In the case a gene X is used that is sensitive to component or condition Z, another way of providing the corresponding truncation is by providing a transciptionally silent gene X, for example, by creating a mutation in a crucial part of the promoter consensus sequence. In such a case, a complete gene X is present, but due to the lack of an active promoter, no transcript and, thus, no protein is produced. Since this is always a mutation at the front (5′) end of gene X, a scheme such as depicted in
Furthermore, it is noted that in the case a final recombinant with a mutant gene X is not a problem, for example, in a laboratory strain, a method according to the invention can also be performed as illustrated in
In a preferred embodiment, the invention provides a method for obtaining site-specific, marker-less integration of a sequence of interest in the genome of a cell as outlined herein, wherein the cell is a eukaryotic cell, for example, a plant cell. The plant cell can, for example, be obtained by using the characteristics of a mutated and wild-type erp1 gene.
It is clear that a method according to the invention can be applied both to a prokaryotic cell and to a eukaryotic cell. Typical examples of a cell in which integration according to a method of the invention can be obtained are actinomycetes, more preferably streptomycetes. Production of a protein encoded by the sequence of interest in a prokaryotic cell typically involves secretion of the protein into the extracellular media and, hence, the presence of a marker gene does not interfere significantly with the isolation of marker-free protein. However, in the case a protein is produced in, for example, the leaves of a plant, isolation of the protein can be contaminated with a protein encoded by a marker gene. Furthermore, there is the risk of spread of the marker into the environment when recombinant plants are grown in fields. This is currently one of the biggest problems in plant biotechnology. Acceptance by governments, as well as by the public, would greatly benefit from a method that produces recombinant plants not polluted with additional marker genes. In an even more preferred situation, if only plant sequences are used and, preferably, homologous plant sequences, the recombinant plant will contain only endogenous sequences, lacking DNA from, for example, bacterial or fungal origin. Hence, the present invention is particularly advantageous for providing a eukaryotic cell with a sequence encoding a protein or RNA molecule of interest.
Besides a method for obtaining site-specific, marker-less integration of a sequence of interest in the genome of a cell, the invention also provides a cell obtainable according to any one of the invention's methods. Preferably, the cell is a eukaryotic cell and, even more preferably, the eukaryotic cell is a plant cell. Non-limiting examples of dicot plants are Brassica, potato, tomato, soy bean, sugar beet, and Arabidopsis, and examples of monocot plants are rice, maize, wheat, and barley.
The invention also provides an organism which comprises a cell according to the invention. Preferably, the organism is a non-human organism/animal and, even more preferably, the organism is a plant.
In yet another embodiment, the invention provides a method for producing an antibiotic or a useful protein comprising culturing a cell according to the invention or an organism (preferably a non-human organism/animal) according to the invention and harvesting the antibiotic or protein from the cell, organism or culture.
The invention will be explained in more detail in the following description, which is not limiting the invention.
Experimental Part & Results
Bacterial Strains and Culturing Conditions
E. coli K-12 strains JM109 was used for propagating plasmids and was grown and transformed by standard procedures (Sambrook et al. 1989). E. coli ET12567 (MacNeil et al. 1992) was used to isolate DNA for transformation of plasmid DNA to Streptomyces coelicolor. Transformants were selected in L broth containing 1% (w/v) glucose, and ampicillin at a final concentration of 200 μg ml−1. L broth with 1% glucose and 30 μg ml−1 chloramphenicol was used to grow ET12567.
Streptomyces coelicolor A3(2) M145 was obtained from the John Innes Centre strain collection. Protoplast preparation and transformation were performed as described by Kieser et al. (2000). SFM medium (mannitol, 20 g 1−1; soya flour, 20 g 1−1; agar, 20 g 1−1, dissolved in tap water) was used to make spore suspensions. Minimal Medium (MM) and R2YE agar plates (Kieser et al. 2000) were used for selection experiments; R2YE was also used for regenerating protoplasts and, after addition of the appropriate antibiotic, for selecting recombinants. For standard cultivation of Streptomyces, YEME (Kieser et al. 2000) or tryptone soy broth (Difco) containing 10% (w/v) sucrose (designated TSBS) were used. Liquid cultures to select for glucose utilization were performed in NMMP (minimal medium), with 1% (w/v) mannitol or 1% (w/v) glucose as the carbon source.
Construction of pMBS011
As the basis for a recombination plasmid, we used pIJ2581 (5192 bp; Genbank Accession X98363; van Wezel and Bibb, 1996), a construct based on pBluescript SK+ (Strategene), with bla as selectable marker in E. coli, and tsr as selectable marker in Streptomyces. The plasmid has both ColE1 and f1 (+) origins of replication, the latter allowing the production of single-stranded DNA in the presence of helper phage (Sambrook et al. 1989). Single-stranded DNA increases the transformation efficiency in Streptomyces (Hilleman et al. 1991). The plasmid lacks a Streptomyces origin of replication and can, therefore, only be maintained by integration into the host genome through cloned homologous sequences.
A 2080 bp sequence harboring all but the first 36 bp of glkA, as well as 1162 bp of the downstream sequence, was amplified from the S. coelicolor M145 genome using the 30-mer oligonucleotides glkX and glkY. These oligonucleotides were designed so as to add SmaI and KpnI sites to the beginning and the end of the DNA fragment, respectively. The exact sequence inserted is shown in
Mutations that Inactivate Glucose Kinase and Confer 2-Deoxy-Glucose Resistance
The non-utilizable glucose analogue 2-deoxy-glucose (2-DOG) is lethal when introduced in bacterial strains that have an active glucokinase (designated glucose kinase in streptomycetes). Strains harboring mutant glkA genes fail to grow on glucose, but are resistant to 2-DOG. Introduction of an active glucokinase or restoration of the wild-type gene by recombination restores full glycolysis and glucose utilization and renders the cells sensitive to 2-DOG.
An alignment of several bacterial glucokinases is shown in
To create Streptomyces coelicolor strains mutant for glucose kinase, these were grown on solid MMD plates, consisting of MM (Kieser et al. 2000) with 1% (w/v) mannitol and 100 mM 2-deoxyglucose, the latter compound being lethal for Glk+ strains. Therefore, colonies that develop on this medium have to be Glk−. Colonies that were able to grow on MMD were selected and tested for glucose kinase activity. Glucose kinase-deficient (ΔglkA) strains were checked by PCR, which showed that the nature of the mutations varied from large deletions to point mutations. For the experiments described herein, a generated mutant glkA harboring a small deletion corresponding to aa 257-262 (see
Marker-Less Insertion of the EGFP Gene into the S. coelicolor Genome with Positive Selection Construction of the Insertion Vector
For demonstration of the integration method, using pMBS011 as integration vector, the gene for EGFP (enhanced green fluorescent protein) was chosen to be inserted into the S. coelicolor genome. For this purpose, we amplified an approximately 1 kb DNA fragment harboring the EGFP gene and its RBS with oligonucleotides GfpX and GfpY, designed so as to provide BglII sites at either end. The PCR-amplified EGFP gene-containing DNA fragment was digested with BglII and inserted into BclI-digested pMBS011. DNA from the latter was isolated from E. coli strain ET12567 (mutant for several modification genes, including dam dcm; McNeill et al. 1992) to allow digestion of the normally dam-methylated BclI site. After ligation, the DNA was re-digested with BclI so as to guarantee the absence of vector without insert. All colonies tested contained the expected plasmid, which was designated pMBS012.
Construct pMBS012 has the N-truncated but otherwise wild-type glkA gene followed by the EGFP gene, which, in turn, is followed by ORFs 6E10.19 and the end of ORF6E10.18, which is oppositely oriented (not indicated in
Insertion of the EGFP Gene into the S. coelicolor Genome
An S. coelicolor glkA mutant lacking the codons for amino acid residues 257-262 (IVGGGL,
In this way, positive selection was achieved of recombinants in which recombination event 2 (
The viable primary recombinants were streaked on MM plates with 2-DOG and subsequently replicated onto MM with glucose as the sole carbon source to allow recombination events to occur and spores harvested. These were used to inoculate a liquid NMMP culture (Kieser et al. 2000) with mannitol as the sole carbon source, grown until OD600 of 0.5, washed twice in NMMP without carbon source, resuspended in NMMP with glucose as the sole carbon source, and grown until stationary phase was reached (typically overnight). Only mycelium with a wild-type glucose kinase gene can utilize glucose and such recombinants must have arisen from a further recombination event through the homologous glkA sequences, resulting in a wild-type glucose kinase gene followed by the EGFP gene on the genome.
The mycelium was plated on MM plates with glucose as the sole carbon source to select the population with a wild-type glkA gene. Colonies that appeared were tested for sensitivity to 2-DOG and thiostrepton. The majority of the colonies tested harbored a wild-type glkA gene and were sensitive to thiostrepton.
Southern hybridization on genomic DNA isolated from two independent colonies confirmed that these had the expected EGFP insertion.
Thus, we succeeded in inserting a DNA fragment into the genome of S. coelicolor M145, without leaving any selectable marker or other undesired sequences behind, and with both recombination steps positively selectable, namely resistance to thiostrepton and 2-DOG (step 1), and ability to grow on glucose (step 2). DNA sequencing confirmed the presence of the expected insert.
Thus, we believe that this is an important step forward in creating recombinant microorganisms, especially those which are notoriously hard to screen, such as actinomycetes.
Solving Possible Polar Effects of Insertions on the Transcription of Downstream Genes
It is possible that insertion of a DNA sequence into the genome affects the transcription of downstream-located genes, resulting in so-called polar effects. For example, this occurs if, in the final situation in
In a more specific case, gene X and sequence Y are also part of the operon, where on the genome they are immediately preceded by the operon promoter and followed by one or more genes that also depend on this promoter. In such a case, insertion of a plasmid by recombination through gene X or sequence Y results in block of transcription of all downstream-located genes. This is lethal if one or more of the downstream-located genes is essential for growth. Negative effects of the insertion can only be counteracted by making sure that two promoters are present on the plasmid, one promoter either upstream of the truncated gene X (
In the case of a glucose kinase gene, it is likely that insertion of DNA into the BclI site (
Recombination Events Between Mutation in Gene X and the Cloned DNA
There is a possibility of a recombination event upstream of the cloned DNA but downstream of the site of mutation. This event is exemplified in
The chance PB for this undesirable recombination to occur can, in our experience, be estimated by the formula: PB=½(A/B)2×100%. For example, with a ratio A:B=2:1, it follows that the chance of recombination through sequence B is approximately 13%. In most cases, the experimenter will be able to choose the situation such that the ratio A:B is much larger, as in the cases listed in Table 2, so that PB becomes negligible. While in principle area B should be minimized, in practice, it follows that as long as the ratio A:B exceeds 2:1, checking a few recombinants is sufficient to identify the correct recombinant to enter recombination step 2.
In the glkA experiment described in the experimental section where the ratio A:B is around 5:2, all three colonies checked had undergone recombination through area A.
In the example of the use of the EPR1 gene for recombination in plants, mutations all lie between nucleotide positions 100-300, while the whole gene is more than 2000 bp long. In such a case, the chance of finding the desired recombination through area A is close to 100%.
“Selectable” means positive selection for desired recombination event possible. Non-selectable means desired situation needs to be screened for, e.g., by replicating colonies to agar plates with and without the selectable compound or condition Z, looking for Z-sensitive colonies. Situation 3 differs from 1 and 2 in that the mutation needed for screening/selection lies on the plasmid rather than on the genome.
Streptomyces coelicolor
Escherichia coli
Streptomyces coelicolor
Escherichia coli
Synechocystis PCC 6803
Streptomyces coelicolor
Escherichia coli
Streptomyces coelicolor
Arabidopsis, Nicotiana
Halobacterium
Escherichia coli
Escherichia coli
Escherichia coli
Number | Date | Country | Kind |
---|---|---|---|
02080000.9 | Nov 2002 | EP | regional |
This application is a continuation of PCT International Patent Application No. PCT/NL2003/000835, filed on Nov. 27, 2003, designating the United States of America, and published, in English, as PCT International Publication No. WO 2004/048580 A1 on Jun. 10, 2004, which application claims priority to European Patent Application Serial No. 02080000.9 filed on Nov. 28, 2002, the contents of each of which are hereby incorporated herein by this reference.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/NL03/00835 | Nov 2003 | US |
Child | 11140625 | May 2005 | US |