METHODS OF TRANSIENT PROTEIN AND GENE EXPRESSION IN CELLS

INCORPORATION OF THE SEQUENCE LISTING

The contents of the text file submitted electronically herewith are incorporated herein by reference in their entirety: A computer readable format copy of the Sequence Listing (filename: ZYMR_055_01WO_SeqList_ST25.txt, date recorded May 25, 2021, file size ˜93 KB).

FIELD OF THE DISCLOSURE

The present disclosure provides methods for producing gene-edited cells free of gene-editing system molecules through the manipulation of prototrophy. Exemplary system molecules include those required for CRISPR editing techniques, such as plasmids and genes encoding such molecules. The methods may employ constructs that temporarily disrupt prototrophy, the removal of which restores prototrophy. Also disclosed are gene-edited cells and populations of gene-edited cells comprising these constructs. The present methods and compositions may be used to achieve desired gene editing of a host cell in the absence of extraneous genetic material remaining from the genetic engineering technique itself.

BACKGROUND

CRISPR gene editing is a commonly used genetic engineering technique by which the genomes of living organisms may be modified. It is based on a simplified version of the bacterial CRISPR-Cas9 antiviral defense system. In many organisms, genome editing using CRISPR nucleases such as Cas9 or Cas12a may involve the introduction of DNA encoding two components: DNA expressing the Cas nuclease, and DNA expressing the guide RNA (gRNA). However, use of CRISPR gene editing suffers from three notable difficulties.

First, in applications requiring a strain without exogenous DNA remaining in the cell (for example, during a fermentation), DNA expressing different guide RNAs must be introduced and sequentially removed from the organism. This often requires multiple rounds of genetic engineering to introduce and then remove the guide RNAs.

Second, plasmids containing selectable/counterselectable metabolic genes are an attractive method to introduce and then remove plasmids expressing gRNAs. However, this requires the use of auxotrophic strains which depend on the presence of the plasmid to provide the required metabolic gene or require specially supplemented growth media. Auxotrophic strains are undesirable for use in fermentation as their metabolism may differ substantially from prototrophic strains. Thus it is desirable to restore the prototrophy of a strain before use in a fermentation, which traditionally requires an additional transformation to re-introduce a construct expressing the wild-type metabolic gene.

Third, expressing the Cas nuclease from DNA integrated into the genome of an organism can have advantages over expression from plasmids due to lower toxicity and less cell-to-cell variability. However, in many cases, the DNA encoding the Cas nuclease must then be removed from the organism before it can be used in downstream processes (e.g. in fermentations), which necessitates further manipulation of the cell genome to achieve the desired result.

Each of these challenges add time, expense, and difficulty to the process of genetic engineering through CRISPR.

Within yeast, alternative genome editing methods make use of mating to combine desired gene edits of interest from different strains. However, these methods are complicated by the desire to obtain haploid yeast cells from a process that requires mating competent yeast that produce diploid cells.

There is an ongoing and unmet need for improved methods to streamline genetic engineering and the removal of extraneous genetic material left over from the engineering process.

BRIEF SUMMARY

In one aspect, the present disclosure provides a method for producing a population of gene-edited cells free of gene-editing system molecules, comprising: (a) introducing an integrating nucleic acid construct into a population of cells that comprise a target gene of interest and that are prototrophic for a nutrient, wherein the integrating nucleic acid construct integrates into a gene that is required for prototrophy for the nutrient; and wherein the integrating nucleic acid construct comprises: a first nucleotide sequence encoding a gene-editing protein; a second nucleotide sequence encoding a dominant selectable marker; and a pair of repeat nucleotide sequences flanking the first nucleotide sequence and the second nucleotide sequence; (b) selecting for expression of the dominant selectable marker to produce a population of cells that are auxotrophic for the nutrient; (c) introducing a non-integrating nucleic acid construct into the population of cells produced in step (b); wherein the non-integrating nucleic acid construct comprises: a third nucleotide sequence encoding a gene-editing nucleic acid that introduces an edit into the gene of interest; and a fourth nucleotide sequence encoding a protein that complements the auxotrophy for the nutrient, wherein the fourth nucleotide sequence cannot recombine with the cellular genome; (d) simultaneously selecting for expression of the dominant selectable marker and for prototrophy for the nutrient to produce a population of cells that comprise the edited gene of interest; (e) removing the non-integrating nucleic acid nucleic acid construct from the population of cells produced in step (d) by growing the cells on media that selects against expression of the protein that complements the auxotrophy for the nutrient to produce a population of cells that comprise the edited gene of interest and are free of the non-integrating nucleic acid construct; and (f) removing the integrating nucleic acid construct from the population of cells produced in step (e) by growing the cells on media that selects for prototrophy for the nutrient to produce a population of cells that comprise the edited gene of interest and that are free of the integrating nucleic acid construct.

In some embodiments, the cells are fungal cells or bacterial cells.

In some embodiments, the fungal cells are Fusarium spp., Kluyveromyces spp., Penicillium spp., Pichia spp., Saccharomyces spp., Schizosaccharomyces spp. or Yarrowia spp.

In some embodiments, the fungal cells are Kluyveromyces lactis, Kluyveromyces marxianus, Pichia pastoris, Saccharomyces cerevisiae, Schizosaccharomyces pombe or Yarrowia lipolytica.

In some embodiments, the bacterial cells are Agrobacterium spp., Arthrobacterspecies spp., Bacillus spp., Clostridium spp., Corynebacterium spp., Cupriavidus spp., Escherichia spp., Erwinia spp., Geobacillus spp., Lactobacillus spp., Pantoea spp., Propionibacterium spp., Pseudomonas spp., Sphingomonas spp., Streptococcus spp., Streptomyces spp., Xanthomonas spp., or Zymomonas spp.

In some embodiments, the bacterial cells are Bacillus clausii, Bacillus lichenifonnis, Bacillus subtilis, Clostridium acetobutylicum, Corynebacterium glutamicum, Cupriavidus necator, Escherichia coli, Geobacillus thermoglucosidasius, Propionibacterium freudenreichii, Sphingomonas elodea, or Xanthomonas campestris.

In some embodiments, the gene-editing protein is an endonuclease.

In some embodiments, the endonuclease is an RNA-guided endonuclease.

In some embodiments, the RNA-guided endonuclease is a CRISPR Class 2 endonuclease.

In some embodiments, the CRISPR Class 2 endonuclease is selected from the list consisting of: cas9, cas12a, cas12b1, cas12b2, cas12c, cas12d, cas12e, cas12f1, cas12f2, cas12f3, cas12g, cas12h, cas12i, cas12k, cas13a, cas13b1, cas13b2, cas13c, cas13d, c2c4, c2c8, c2c9, c2c10, and Cms1 endonucleases.

In some embodiments, the CRISPR Class 2 endonuclease is cas9 or cas12a.

In some embodiments, the gene-editing nucleic acid is a guide RNA (gRNA).

In some embodiments, the guide RNA is a single guide RNA (sgRNA).

In some embodiments, the RNA-guided endonuclease is a CRISPR Class 1 endonuclease.

In some embodiments, the CRISPR Class 1 endonuclease is Cas3 or Cas10.

In some embodiments, the dominant selectable marker is hygromycin B phosphotransferase (hygR), nourseothricin N-acetyl transferase (Nat), KanMX, patMX, zeocin antibiotic resistance (Zeo), AmdS, or thymidine kinase (Tk).

In some embodiments, the gene that is required for prototrophy for the nutrient is URA3, LYS2, LYS5, CAN1, amdS, FCY1, FCA1, GAP1, HSV_TK or TRP1.

In some embodiments, the protein that complements the auxotrophy for the nutrient is Kluyveromyces lactis URA3 (K1URA3).

In some embodiments, the media that selects against expression of the protein that complements the auxotrophy for the nutrient comprises 5-FOA, alpha-aminoadipate, canavanine, fluoroacetamide, 5-fluorocytosine, D-histidine, antifolate media, or 5-fluoroanthranilic acid.

In some embodiments, the nutrient is uracil, lysine, arginine, acetamide, cytosine, L-citrulline, FUdR or tryptophan.

In some embodiments, the non-integrating nucleic acid construct is a plasmid.

In one aspect, the present disclosure provides a method for producing a population of gene-edited Saccharomyces cerevisiae cells free of Cas9 and sgRNA, comprising: (a) introducing an integrating nucleic acid construct into a population of S. cerevisiae cells that comprise a target gene of interest and that are prototrophic for uracil, wherein the integrating nucleic acid construct integrates into the URA3 gene; and wherein the integrating nucleic acid construct comprises: a first nucleotide sequence encoding Cas9; a second nucleotide sequence encoding HygR; and a pair of repeat nucleotide sequences flanking the first nucleotide sequence and the second nucleotide sequence; (b) selecting for expression of HygR to produce a population of cells that are auxotrophic for uracil; (c) introducing a non-integrating nucleic acid construct into the population of cells produced in step (b); wherein the non-integrating nucleic acid construct comprises: a third nucleotide sequence encoding an sgRNA that introduces an edit into the gene of interest; and a fourth nucleotide sequence encoding Kluyveromyces lactis URA3 (K1URA3) protein; (d) simultaneously selecting for expression of HygR and for prototrophy for uracil to produce a population of cells that comprise the edited gene of interest; (e) removing the non-integrating nucleic acid nucleic acid construct from the population of cells produced in step (d) by growing the cells on media that selects against expression of K1URA3 protein to produce a population of cells that comprise the edited gene of interest and are free of the non-integrating nucleic acid construct; and (f) removing the integrating nucleic acid construct from the population of cells produced in step (e) by growing the cells on media that selects for prototrophy for uracil to produce a population of cells that comprise the edited gene of interest and that are free of the integrating nucleic acid construct.

In one aspect, the present disclosure provides a population of cells comprising a nucleic acid construct integrated into a gene that is required for prototrophy for a nutrient, wherein the integrated nucleic acid construct comprises: a first nucleotide sequence encoding a gene-editing protein; a second nucleotide sequence encoding a dominant selectable marker; and a pair of repeat nucleotide sequences flanking the first nucleotide sequence and the second nucleotide sequence.

In some embodiments, the non-integrating nucleic acid construct comprises: a third nucleotide sequence encoding a gene-editing nucleic acid that introduces an edit into a gene of interest; and a fourth nucleotide sequence encoding a protein that complements the auxotrophy for the nutrient, wherein the fourth nucleotide sequence cannot recombine with the cellular genome.

In one aspect, the present disclosure provides a population of cells comprising an edited gene of interest and a nucleic acid construct integrated into a gene that is required for prototrophy for a nutrient, wherein the integrated nucleic acid construct comprises: a first nucleotide sequence encoding a gene-editing protein; a second nucleotide sequence encoding a dominant selectable marker; and a pair of repeat nucleotide sequences flanking the first nucleotide sequence and the second nucleotide sequence.