The present invention relates to the field of biochemistry, specifically to the bioproduction of glycolate.
Glycolate, the conjugate base of glycolic acid, is the simplest α-hydroxy acid with the gross brute formula C2H4O3. It has multiple applications, primarily as a skin/personal care agent, but also in the textile industry as a dyeing and tanning agent and in food processing as a flavoring agent and as a preservative. It is also used in adhesives and plastics and is often included into emulsion polymers, solvents and additives for ink and paint, in order to improve flow properties and impart gloss. Glycolate can be produced either via chemical synthesis or via microbial fermentation. Currently, most of the glycolate is chemically manufactured by high-pressure, high-temperature carbonylation of formaldehyde (Loder, 1939). Glycolate can also be produced through bioconversion of glycolonitrile using microbial nitrilases (He et al., 2010) or bioconversion of ethylene glycol to glycolate by bacteria such as Gluconobacter oxydans (Wei et al., 2009). WO2013050659 relates to the production of glycolic acid in eukaryotic cells, including yeast cells and filamentous fungi, genetically modified to express a glyoxylate reductase gene to produce glycolic acid. WO2016193540 relates to the production of glycolic acid in eukaryotic cells wherein the entire glycolic acid production pathway is introduced into the cytosol. EP2233562 relates to the production of glycolic acid in E. coli. WO2011036213 relates to the production of glycolic acid in bacteria and yeast wherein the pH is first lower than 7 and subsequently is higher than 7. WO2007140816 relates to the production of glycolate in E. coli transformed i) to attenuate the glyoxylate consuming pathways to other compounds than glycolate ii) to use an NADPH glyoxylate reductase to convert glyoxylate to glycolate iii) to attenuate the level of all the glycolate metabolizing enzymes and iv) increase the flux in the glyoxylate/glycolate pathway. WO2017059236 relates to the production of glycolate by fermentation of pentose sugars like xylulose and ribulose.
However, for production of glycolate using chemotrophic microorganisms, substrates such as glucose are needed, which makes the production process economically, and with respect to sustainability, rather inefficient. Taubert et al., 2019 have reported production of glycolate in the unicellular algae Chlamydomonas using CO2 as carbon source under modulated culture conditions. Cyanobacteria have been reported for the production of metabolites and organic acids.
WO2009078712 relates to the production of various compounds in cyanobacteria, such as butanol, ethanol, ethylene, succinate, propanol, acetone and D-lactate. WO2011136639 relates to the production of L-lactate in cyanobacteria. WO2014092562 relates to the production of acetoin, 2,3-butanediol and 2-butanol in cyanobacteria. WO2015147644 relates to the production of erythritol in cyanobacteria. WO2016008883 relates to the production of various monoterpenes in cyanobacteria. WO2016008885 relates to the production of various sesquiterpenes in cyanobacteria. Eisenhut et al. (2008) relate to the CO2 concentrating mechanism of cyanobacteria. A Synechocystis mutant overexpressing the putative phosphoglycolate phosphatases slr0458 was constructed. Compared with the wild type, the mutant grew slower under limiting CO2 concentration and the intracellular 2-phosphoglycolate level was considerably smaller than in the wild type Synechocystis. Haimovich-Dayan et al, 2014 investigates the photorespiratory 2-phosphoglycolate (2PG) metabolism in Synechocystis PCC6803; it is demonstrated that a mutant defective in its two glycolate dehydrogenases (ΔglcD1/ΔglcD2) was unable to grow under low CO2 conditions. Pierce et al, 1989 demonstrates that the native ribulose bisphosphate carboxylase (Rubisco) is essential for both photoautotrophic growth and photoheterotrophic growth of the cyanobacterium Synechocystis PCC6803. By exchanging the native Rubisco for a heterologous one (from Rhodospirillum rubrum) with a lower affinity for CO2, a mutant was obtained that was extremely sensitive to the CO2/O2 ratio supplied during growth and was unable to grow at all in air. As depicted here above, one has succeeded in producing various compounds in cyanobacteria; however, the yields vary between products and for some products the yield appears still too low to be commercially relevant.
While cyanobacteria natively produce some glycolate, there is no disclosure nor suggestion of producing glycolate on a commercially relevant scale. At present, there is thus no efficient bioprocess for the production of glycolate available, neither on laboratory scale, nor on industrial scale, while using CO2 as the substrate. Thus, in view of the state of the art, there is still a need for an alternative, more sustainable and improved glycolate production process, without the need for expensive and complicated starting materials, and with a commercially relevant yield.
Synechocystis PCC6803
Synechocystis PCC6803
Synechocystis PCC6803
Synechocystis PCC6803
Synechocystis PCC6803
Synechocystis PCC6803
Escherichia coli
Escherichia coli
Chlamydomonas
reinhardtii
Chlamydomonas
reinhardtii
Synechococcus
elongatus PCC7942
Synechococcus
elongatus PCC7942
Escherichia coli
Escherichia coli
Rhodospirillum rubrum
Rhodospirillum rubrum
Rhodospirillum rubrum
Rhodospirillum rubrum
Archaeolobus fulgidus
Archaeolobus fulgidus
Synechocystis PCC6803
Synechocystis PCC6803
Synechocystis PCC6803
Synechocystis PCC6803
Escherichia coli
Escherichia coli
Mycobacterium
tuberculosis
Mycobacterium
tuberculosis
Arabidopsis thaliana
Arabidopsis thaliana
Synechococcus
elongatus PCC 7942
Synechococcus
elongatus PCC 7942
Synechocystis PCC6803
Synechococcus
elongatus PCC 7942
Synechococcus
elongatus PCC 7942
Synechococcus
elongatus PCC 7942
Synechococcus
elongatus PCC 7943
Synechococcus
elongatus PCC 7944
Synechococcus
elongatus PCC 7942
Rhodopseudomonas
capsulatus
Rhodopseudomonas
capsulatus
Rhodobacter
sphaeroides
Rhodobacter
sphaeroides
Synechocystis PCC6803
Synechococcus PCC7002
Synechococcus PCC7002
Synechococcus PCC7002
Synechococcus PCC7002
Synechococcus PCC7002
Synechococcus PCC7002
Synechococcus
elongatus PCC 7942
Synechococcus
elongatus PCC 7942
The inventors have arrived at an improved process for the production of extracellular glycolate with a commercially relevant yield.
Accordingly, the invention provides for a recombinant host cell for the production of extracellular glycolate, wherein the host cell:
The production of extracellular glycolate is herein to be construed in such a way that the glycolate produced in the host cell is secreted, whether actively or passively, by the host cell, e.g. mediated by a transporter and/or a permease, and/or via non-facilitated diffusion across the cyanobacterial cell envelope. Leakage of the glycolate by lysis of host cells is preferably not within the scope of the invention.
Substantially unable to anabolize glycolate is herein to be construed that less than about 10% of the glycolate produced is anabolized by the host cell. In an embodiment, less than about 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2% or less than 1% of glycolate produced is anabolized by the host cell. In the recombinant host cell for the production of extracellular glycolate, the ribulose 1,5-bisphosphate carboxylase/oxygenase (Rubisco) activity may be with increased selectivity for 02 compared to the Rubisco of the parent cell. The selectivity may be increased by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or at least 100%. The selectivity may be increased by at least 1-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold, 1 log, 2 log, 3 log, or at least 4 log.
In the recombinant host cell for the production of extracellular glycolate, the host cell may be substantially unable to metabolize glycolate due to reduced or eliminated glycolate dehydrogenase, glycolate oxidase activity and/or lactate dehydrogenase activity relative to the parent cell. The glycolate dehydrogenase activity may be reduced by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or may be reduced completely (elimination). The lactate dehydrogenase activity may be reduced by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or may be reduced completely (elimination). The person skilled in the art knows how to reduce activity of an enzyme, e.g. by reduction of expression of the sequence encoding the enzyme by gene disruption (knock-out) or down regulation.
Accordingly, in the recombinant host cell for the production of extracellular glycolate, the reduced or eliminated glycolate dehydrogenase and/or glycolate oxidase activity relative to the parent cell may be due to targeted gene disruption of deletion of a glycolate dehydrogenase and/or glycolate oxidase and/or lactate dehydrogenase. Preferred glycolate dehydrogenase, glycolate oxidase and lactate dehydrogenase are the ones described elsewhere herein.
In the recombinant host cell for the production of extracellular glycolate, the host cell may overexpress glyoxylate reductase and/or isocitrate lyase in view of the parent cell. Overexpression of an enzyme herein preferably means that activity of the enzyme is increased by at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or at least 100%. The activity may be increased by at least 1-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, 10-fold, 1 log, 2 log, 3 log, or at least 4 log. Preferred glyoxylate reductase and isocitrate lyase are the ones described elsewhere herein.
In the recombinant host cell for the production of extracellular glycolate, the host cell may overexpress phosphoglycolate phosphatase in view of the parent cell. A preferred phosphoglycolate phosphatase is the one described elsewhere herein.
In the recombinant host cell for the production of extracellular glycolate, the host cell may comprises a ribulose bisphosphate carboxylase (Rubisco) that has decreased selectivity for CO2 over 02 (given by the specificity constant Sc/o=(kccat/Kc)/(kocat/Ko)), with similar or higher intrinsic turnover rate (ko/ccat) compared to the native Rubisco of the parent host cell. This specificity constant Sc/o is a measure of the relative capacities of the enzyme to catalyse carboxylation and oxygenation of ribulose 1,5-bisphosphate. It is calculated, based on the turnover numbers (maximum per active site catalytic rates in units of s−1) for carboxylation (kccat) and oxygenation (kocat), as well as KC and KO, which indicate the Michaelis constants (half-saturation concentrations in μM) for carboxylation and oxygenation, respectively. Preferably, the Rubisco is one as listed in Table 1. More preferably, the Rubisco has a polypeptide sequence that has at least 70% sequence identity with SEQ ID NO: 16, 18, 20, 86 or 87, 91 or 93.
In one embodiment, the recombinant host cell for the production of extracellular glycolate comprises both a ribulose bisphosphate carboxylase (Rubisco) that has decreased selectivity for CO2 over O2 (as described above) and a Rubisco that does not have a decreased selectivity for CO2 over O2. Preferably, the Rubisco that does not have a decreased selectivity for CO2 over O2 is a Rubisco that is endogenous to the host cell. In one embodiment, the endogenous Rubisco is the endogenous Rubisco from Synechocystis and/or the Rubisco with decreased selectivity for CO2 over 02 has a polypeptide sequence that has at least 70% sequence identity with SEQ ID NO: 16, 18, 20, 86 or 87, 91 or 93.
Synechococcus PCC6103
Synechococcus PCC6103
Synechococcus PCC6103
Synechococcus PCC6103
Rhodospirillum rubrum
Rhodospirillum rubrum
Archaeolobus fulgidus
Rhodopseudomonas
capsulatus
Rhodobacter
sphaeroides
In the recombinant host cell for the production of extracellular, the host cell may express a Rubisco with a specificity constant Sc/o<55. Preferably, Sc/o<54, 53, 52, 51, 50, 49, 48, 47, 46, 45, 44, 43, 42, 41, 40, 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, or Sc/o<5.
In the recombinant host cell for the production of extracellular glycolate, the host cell may express a type II or type III Rubisco. There are four forms of Rubisco found in nature. Only forms I, II and III catalyse the carboxylation or oxygenation of ribulose bisphosphate. Form I is the most abundant form, found in eukaryotes and bacteria. It forms a hexadecamer consisting of eight large (L) and eight small (S) subunits. This form of Rubisco tends to have a high specificity for CO2 (SC/O˜40-170), but relatively poor catalytic rate (kcat). Form II of Rubisco contains only dimers of L subunits, and in contrast to form I of Rubisco, form II tends to have a higher kcat but a lower specificity for CO2 (SC/O˜10-20) (Mueller-Cajar et al., 2007). Form III is found primarily in archae and is also comprised of dimers of L subunits (Tabita et al., 2008).
In an embodiment, the recombinant host cell for the production of extracellular glycolate expresses a Rubisco of Rhodospirillum rubrum, optionally comprising a H44N mutation. Preferably, the Rubisco has at least 70% sequence identity with SEQ ID NO: 16, 18, 20, 86 or 87. More preferably, the Rubisco has at least 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99% sequence identity with SEQ ID NO: 16, 18, 20, 86, 87, 91 or 93. Most preferably, the Rubisco has a polypeptide sequence as set forward in SEQ ID NO: 16, 18, 20, 86, 87, 91 or 93. The recombinant host cell for the production of extracellular glycolate, preferably is a photosynthetic cell, including algae and cyanobacteria. Preferred photosynthetic host cells include but are not limited to the following genera: Acanthoceras, Acanthococcus, Acaryochloris, Achnanthes, Achnanthidium, Actinastrum, Actinochloris, Actinocyclus, Actinotaenium, Amphichrysis, Amphidinium, Amphikrikos, Amphipleura, Amphiprora, Amphithrix, Amphora, Anabaena, Anabaenopsis, Aneumastus, Ankistrodesmus, Ankyra, Anomoeoneis, Apatococcus, Aphanizomenon, Aphanocapsa, Aphanochaete, Aphanothece, Apiocystis, Apistonema, Arthrodesmus, Artherospira, Ascochloris, Asterionella, Asterococcus, Audouinella, Aulacoseira, Bacillaria, Balbiania, Bambusina, Bangia, Basichlamys, Batrachospermum, Binuclearia, Bitrichia, Blidingia, Botrdiopsis, Botrydium, Botryococcus, Botryosphaerella, Brachiomonas, Brachysira, Brachytrichia, Brebissonia, Bulbochaete, Bumilleria, Bumilleriopsis, Caloneis, Calothrix, Campylodiscus, Capsosiphon, Carteria, Catena, Cavinula, Centritractus, Centronella, Ceratium, Chaetoceros, Chaetochloris, Chaetomorpha, Chaetonella, Chaetonema, Chaetopeltis, Chaetophora, Chaetosphaeridium, Chamaesiphon, Chara, Characiochloris, Characiopsis, Characium, Charales, Chilomonas, Chlainomonas, Chlamydoblepharis, Chlamydocapsa, Chlamydomonas, Chlamydomonopsis, Chlamydomyxa, Chlamydonephris, Chlorangiella, Chlorangiopsis, Chlorella, Chlorobotrys, Chlorobrachis, Chlorochytrium, Chlorococcum, Chlorogloea, Chlorogloeopsis, Chlorogonium, Chlorolobion, Chloromonas, Chlorophysema, Chlorophyta, Chlorosaccus, Chlorosarcina, Choricystis, Chromophyton, Chromulina, Chroococcidiopsis, Chroococcus, Chroodactylon, Chroomonas, Chroothece, Chrysamoeba, Chrysapsis, Chrysidiastrum, Chrysocapsa, Chrysocapsella, Chrysochaete, Chrysochromulina, Chrysococcus, Chrysocrinus, Chrysolepidomonas, Chrysolykos, Chrysonebula, Chrysophyta, Chrysopyxis, Chrysosaccus, Chrysophaerella, Chrysostephanosphaera, Clodophora, Clastidium, Closteriopsis, Closterium, Coccomyxa, Cocconeis, Coelastrella, Coelastrum, Coelosphaerium, Coenochloris, Coenococcus, Coenocystis, Colacium, Coleochaete, Collodictyon, Compsogonopsis, Compsopogon, Conjugatophyta, Conochaete, Coronastrum, Cosmarium, Cosmioneis, Cosmocladium, Crateriportula, Craticula, Crinalium, Crucigenia, Crucigeniella, Cryptoaulax, Cryptomonas, Cryptophyta, Ctenophora, Cyanodictyon, Cyanonephron, Cyanophora, Cyanophyta, Cyanothece, Cyanothomonas, Cyclonexis, Cyclostephanos, Cyclotella, Cylindrocapsa, Cylindrocystis, Cylindrospermum, Cylindrotheca, Cymatopleura, Cymbella, Cymbellonitzschia, Cystodinium Dactylococcopsis, Debarya, Denticula, Dermatochrysis, Dermocarpa, Dermocarpella, Desmatractum, Desmidium, Desmococcus, Desmonema, Desmosiphon, Diacanthos, Diacronema, Diadesmis, Diatoma, Diatomella, Dicellula, Dichothrix, Dichotomococcus, Dicranochaete, Dictyochloris, Dictyococcus, Dictyosphaerium, Didymocystis, Didymogenes, Didymosphenia, Dilabifilum, Dimorphococcus, Dinobryon, Dinococcus, Diplochloris, Diploneis, Diplostauron, Distrionella, Docidium, Drapamaldia, Dunaliella, Dysmorphococcus, Ecballocystis, Elakatothrix, Ellerbeckia, Encyonema, Enteromorpha, Entocladia, Entomoneis, Entophysalis, Epichrysis, Epipyxis, Epithemia, Eremosphaera, Euastropsis, Euastrum, Eucapsis, Eucocconeis, Eudorina, Euglena, Euglenophyta, Eunotia, Eustigmatophyta, Eutreptia, Fallacia, Fischerella, Fragilaria, Fragilariforma, Franceia, Frustulia, Curcilla, Geminella, Genicularia, Glaucocystis, Glaucophyta, Glenodiniopsis, Glenodinium, Gloeocapsa, Gloeochaete, Gloeochrysis, Gloeococcus, Gloeocystis, Gloeodendron, Gloeomonas, Gloeoplax, Gloeothece, Gloeotila, Gloeotrichia, Gloiodictyon, Golenkinia, Golenkiniopsis, Gomontia, Gomphocymbella, Gomphonema, Gomphosphaeria, Gonatozygon, Gongrosia, Gongrosira, Goniochloris, Gonium, Gonyostomum, Granulochloris, Granulocystopsis, Groenbladia, Gymnodinium, Gymnozyga, Gyrosigma, Haematococcus, Hafniomonas, Hallassia, Hammatoidea, Hannaea, Hantzschia, Hapalosiphon, Haplotaenium, Haptophyta, Haslea, Hemidinium, Hemitoma, Heribaudiella, Heteromastix, Heterothrix, Hibberdia, Hildenbrandia, HiIlea, Holopedium, Homoeothrix, Hormanthonema, Hormotila, Hyalobrachion, Hyalocardium, Hyalodiscus, Hyalogonium, Hyalotheca, Hydrianum, Hydrococcus, Hydrocoleum, Hydrocoryne, Hydrodictyon, Hydrosera, Hydrurus, Hyella, Hymenomonas, lsthmochloron, Johannesbaptistia, Juranyiella, Karayevia, Kathablepharis, Katodinium, Kephyrion, Keratococcus, Kirchneriella, Klebsormidium, Kolbesia, Koliella, Komarekia, Korshikoviella, Kraskella, Lagerheimia, Lagynion, Lamprothamnium, Lemanea, Lepocinclis, Leptosira, Lobococcus, Lobocystis, Lobomonas, Luticola, Lyngbya, Malleochloris, Mallomonas, Mantoniella, Marssoniella, Martyana, Mastigocoleus, Gastogloia, Melosira, Merismopedia, Mesostigma, Mesotaenium, Micractinium, Micrasterias, Microchaete, Microcoleus, Microcystis, Microglena, Micromonas, Microspora, Microthamnion, Mischococcus, Monochrysis, Monodus, Monomastix, Monoraphidium, Monostroma, Mougeotia, Mougeotiopsis, Myochloris, Myromecia, Myxosarcina, Naegeliella, Nannochloris, Nautococcus, Navicula, Neglectella, Neidium, Nephroclamys, Nephrocytium, Nephrodiella, Nephroselmis, Netrium, Nitella, Nitellopsis, Nitzschia, Nodularia, Nostoc, Ochromonas, Oedogonium, Oligochaetophora, Onychonema, Oocardium, Oocystis, Opephora, Ophiocytium, Orthoseira, Oscillatoria, Oxyneis, Pachycladella, Palmella, Palmodictyon, Pnadorina, Pannus, Paralia, Pascherina, Paulschulzia, Pediastrum, Pedinella, Pedinomonas, Pedinopera, Pelagodictyon, Penium, Peranema, Peridiniopsis, Peridinium, Peronia, Petroneis, Phacotus, Phacus, Phaeaster, Phaeodermatium, Phaeophyta, Phaeosphaera, Phaeothamnion, Phormidium, Phycopeltis, Phyllariochloris, Phyllocardium, Phyllomitas, Pinnularia, Pitophora, Placoneis, Planctonema, Planktosphaeria, Planothidium, Plectonema, Pleodorina, Pleurastrum, Pleurocapsa, Pleurocladia, Pleurodiscus, Pleurosigma, Pleurosira, Pleurotaenium, Pocillomonas, Podohedra, Polyblepharides, Polychaetophora, Polyedriella, Polyedriopsis, Polygoniochloris, Polyepidomonas, Polytaenia, Polytoma, Polytomella, Porphyridium, Posteriochromonas, Prasinochloris, Prasinocladus, Prasinophyta, Prasiola, Prochlorphyta, Prochlorothrix, Protoderma, Protosiphon, Provasoliella, Prymnesium, Psammodictyon, Psammothidium, Pseudanabaena, Pseudenoclonium, Psuedocarteria, Pseudochate, Pseudocharacium, Pseudococcomyxa, Pseudodictyosphaerium, Pseudokephyrion, Pseudoncobyrsa, Pseudoquadrigula, Pseudosphaerocystis, Pseudostaurastrum, Pseudostaurosira, Pseudotetrastrum, Pteromonas, Punctastruata, Pyramichlamys, Pyramimonas, Pyrrophyta, Quadrichloris, Quadricoccus, Quadrigula, Radiococcus, Radiofilum, Raphidiopsis, Raphidocelis, Raphidonema, Raphidophyta, Peimeria, Rhabdoderma, Rhabdomonas, Rhizoclonium, Rhodomonas, Rhodophyta, Rhoicosphenia, Rhopalodia, Rivularia, Rosenvingiella, Rossithidium, Roya, Scenedesmus, Scherffelia, Schizochlamydella, Schizochlamys, Schizomeris, Schizothrix, Schroederia, Scolioneis, Scotiella, Scotiellopsis, Scourfieldia, Scytonema, Selenastrum, Selenochloris, Sellaphora, Semiorbis, Siderocelis, Diderocystopsis, Dimonsenia, Siphononema, Sirocladium, Sirogonium, Skeletonema, Sorastrum, Spermatozopsis, Sphaerellocystis, Sphaerellopsis, Sphaerodinium, Sphaeroplea, Sphaerozosma, Spiniferomonas, Spirogyra, Spirotaenia, Spirulina, Spondylomorum, Spondylosium, Sporotetras, SpumeIla, Staurastrum, Stauerodesmus, Stauroneis, Staurosira, Staurosirella, Stenopterobia, Stephanocostis, Stephanodiscus, Stephanoporos, Stephanosphaera, Stichococcus, Stichogloea, Stigeoclonium, Stigonema, Stipitococcus, Stokesiella, Strombomonas, Stylochrysalis, Stylodinium, Styloyxis, Stylosphaeridium, Surirella, Sykidion, Symploca, Synechococcus, Synechocystis, Synedra, Synochromonas, Synura, Tabellaria, Tabularia, Teilingia, Temnogametum, Tetmemorus, Tetrachlorella, Tetracyclus, Tetradesmus, Tetraedriella, Tetraedron, Tetraselmis, Tetraspora, Tetrastrum, Thalassiosira, Thamniochaete, Thorakochloris, Thorea, Tolypella, Tolypothrix, Trachelomonas, Trachydiscus, Trebouxia, Trentepholia, Treubaria, Tribonema, Trichodesmium, Trichodiscus, Trochiscia, Tryblionella, Ulothrix, Uroglena, Uronema, Urosolenia, Urospora, Uva, Vacuolaria, Vaucheria, Volvox, Volvulina, Westella, Woloszynskia, Xanthidium, Xanthophyta, Xenococcus, Zygnema, Zygnemopsis, and Zygonium.
More preferred host cells are a Synechocystis or a Synechococcus, or an Anabaena species. The recombinant host cell for the production of extracellular glycolate is preferably a host cell expressing a heterologous Phosphoribulokinase (PRK).
The recombinant host cell for the production of extracellular glycolate is preferably a host cell selected from the group consisting of a bacterial cell, and a fungal cell, preferably a yeast cell. When the host cell is a bacterial host cell, the host cell is preferably an Escherichia spp., Streptomyces spp., Zymonas spp., Acetobacter spp., Citrobacter spp., Rhizobium spp., Clostridium spp., Corynebacterium spp., Streptococcus spp., Xanthomonas spp., Lactobacillus spp., Lactococcus spp., Bacillus spp., Alcaligenes spp., Pseudomonas spp., Aeromonas spp., Azotobacter spp., Comamonas spp., Mycobacterium spp., Rhodococcus spp., Gluconobacter spp., Ralstonia spp., Acidithiobacillus spp., Microlunatus spp., Geobacter spp., Geobacillus spp., Arthrobacter spp., Flavobacterium spp., Serratia spp., Saccharopolyspora spp., Thermus spp., Stenotrophomonas spp., Chromobacterium spp., Sinorhizobium spp., Saccharopolyspora spp., Agrobacterium spp. or Pantoea spp. A preferred Escherichia spp. is Escherichia coli When the host cell is a fungal host cell, the host cell is preferably a Saccharomyces spp., Schizosaccharomyces spp., Pichia spp., Kluyveromyces spp., Candida spp., Talaromyces spp., Brettanomyces spp., Pachysolen spp., Debaryomyces spp., Yarrowia spp. Aspergillus spp., Penicillium spp., Fusarium spp., Rhizopus spp., Acremonium spp., Neurospora spp., Sordaria spp., Magnaporthe spp., Allomyces spp., Ustilago spp., Botrytis spp., or a Trichoderma spp. A preferred fungal cell is a Saccharomyces spp. cell.
The host cells defined herein can conveniently be used for the production of extracellular glycolate. Accordingly, the invention further provides for, a process for the production of extracellular glycolate comprising;
The person skilled in the art knows how to culture the host cells defined herein and knows how to purify glycolate from a culture broth. The culture broth can e.g. be separated from the host cells by centrifugation or membrane filtration and can subsequently purified by e.g. removal of excess water. Preferably, the yield of the process is at least 0.1 gram glycolate per litre culture broth, more preferably at least 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, or at least 10 gram glycolate per litre culture broth.
The terms “homology”, “sequence identity” and the like are used interchangeably herein. Sequence identity is herein defined as a relationship between two or more amino acid (polypeptide or protein) sequences or two or more nucleic acid (polynucleotide) sequences, as determined by comparing the sequences. In the art, “identity” also means the degree of sequence relatedness between amino acid or nucleic acid sequences, as the case may be, as determined by the match between strings of such sequences. “Similarity” between two amino acid sequences is determined by comparing the amino acid sequence and its conserved amino acid substitutes of one polypeptide to the sequence of a second polypeptide. “Identity” and “similarity” can be readily calculated by known methods.
“Sequence identity” and “sequence similarity” can be determined by alignment of two peptide or two nucleotide sequences using global or local alignment algorithms, depending on the length of the two sequences. Sequences of similar lengths are preferably aligned using a global alignment algorithms (e.g. Needleman Wunsch) which aligns the sequences optimally over the entire length, while sequences of substantially different lengths are preferably aligned using a local alignment algorithm (e.g. Smith Waterman). Sequences may then be referred to as “substantially identical” or “essentially similar” when they (when optimally aligned by for example the programs GAP or BESTFIT using default parameters) share at least a certain minimal percentage of sequence identity (as defined below). GAP uses the Needleman and Wunsch global alignment algorithm to align two sequences over their entire length (full length), maximizing the number of matches and minimizing the number of gaps. A global alignment is suitably used to determine sequence identity when the two sequences have similar lengths. Generally, the GAP default parameters are used, with a gap creation penalty=50 (nucleotides)/8 (proteins) and gap extension penalty=3 (nucleotides)/2 (proteins). For nucleotides the default scoring matrix used is nwsgapdna and for proteins the default scoring matrix is Blosum62 (Henikoff & Henikoff, 1992, PNAS 89, 915-919). Sequence alignments and scores for percentage sequence identity may be determined using computer programs, such as the GCG Wisconsin Package, Version 10.3, available from Accelrys Inc., 9685 Scranton Road, San Diego, Calif. 92121-3752 USA, or using open source software, such as the program “needle” (using the global Needleman Wunsch algorithm) or “water” (using the local Smith Waterman algorithm) in EmbossWlN version 2.10.0, using the same parameters as for GAP above, or using the default settings (both for ‘needle’ and for ‘water’ and both for protein and for DNA alignments, the default Gap opening penalty is 10.0 and the default gap extension penalty is 0.5; default scoring matrices are Blossum62 for proteins and DNAFull for DNA). When sequences have a substantially different overall lengths, local alignments, such as those using the Smith Waterman algorithm, are preferred.
Alternatively, percentage similarity or identity may be determined by comparing against public databases, using algorithms such as FASTA, BLAST, etc. Thus, the nucleic acid and protein sequences of the invention can further be used as a “query sequence” to perform a comparison against public databases to, for example, identify other family members or related sequences. Such searches can be performed using the BLASTn and BLASTx programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be performed with the NBLAST program, score=100, wordlength=12 to obtain nucleotide sequences homologous to oxidoreductase nucleic acid molecules of the invention. BLAST protein searches can be performed with the BLASTx program, score=50, wordlength=3 to obtain amino acid sequences homologous to protein molecules of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 25(17): 3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., BLASTx and BLASTn) can be used. See the homepage of the National Center for Biotechnology Information at www.ncbi.nlm.nih.gov/.
Optionally, in determining the degree of amino acid similarity, the skilled person may also take into account so-called “conservative” amino acid substitutions, as will be clear to the skilled person. Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains. Examples of classes of amino acid residues for conservative substitutions are known to the person skilled in the art.
The term “homologous” when used to indicate the relation between a given (recombinant) nucleic acid or polypeptide molecule and a given host organism or host cell, is understood to mean that in nature the nucleic acid or polypeptide molecule is produced by a host cell or organisms of the same species, preferably of the same variety or strain. If homologous to a host cell, a nucleic acid sequence encoding a polypeptide will typically (but not necessarily) be operably linked to another (heterologous) promoter sequence and, if applicable, another (heterologous) secretory signal sequence and/or terminator sequence than in its natural environment. It is understood that the regulatory sequences, signal sequences, terminator sequences, etc. may also be homologous to the host cell. When used to indicate the relatedness of two nucleic acid sequences the term “homologous” means that one single-stranded nucleic acid sequence may hybridize to a complementary single-stranded nucleic acid sequence. The degree of hybridization may depend on a number of factors, including the amount of identity between the sequences and the hybridization conditions such as temperature and salt concentration as discussed later.
The term “heterologous”, when used with respect to a nucleic acid (DNA or RNA) or protein, refers to a nucleic acid or protein that does not occur naturally as part of the organism, cell, genome or DNA or RNA sequence in which it is present, or that is found in a cell or location or locations in the genome or DNA or RNA sequence that differ from that in which it is found in nature. A heterologous nucleic acid or protein is not endogenous to the cell into which it is introduced, but has been obtained from another cell or synthetically or recombinantly produced. Generally, though not necessarily, such nucleic acids encode proteins that are not normally produced by the cell in which the DNA is transcribed or expressed. Similarly, exogenous RNA encodes for proteins not normally expressed in the cell in which the exogenous RNA is present. Heterologous nucleic acids and proteins may also be referred to as foreign nucleic acids or proteins. Any nucleic acid or protein that one of skill in the art would recognize as heterologous or foreign to the cell in which it is expressed is herein encompassed by the term heterologous nucleic acid or protein. The term heterologous also applies to non-natural combinations of nucleic acid or amino acid sequences, i.e. combinations where at least two of the combined sequences are foreign with respect to each other.
Any reference to nucleotide or amino acid sequences accessible in public sequence databases herein refers to the version of the sequence entry as available on the filing date of this document. In this document and in its claims, the verb “to comprise” and its conjugations is used in its non-limiting sense to mean that items following the word are included, but items not specifically mentioned are not excluded. In addition, the verb “to consist” may be replaced by “to consist essentially of” meaning that a product or a composition may comprise additional component(s) than the ones specifically identified; said additional component(s) not altering the unique characteristic of the invention. In addition, reference to an element by the indefinite article “a” or “an” does not exclude the possibility that more than one of the elements is present, unless the context clearly requires that there be one and only one of the elements. The indefinite article “a” or “an” thus usually means “at least one”.
All patent and literature references cited in the specification are hereby incorporated by reference in their entirety.
The following examples are offered for illustrative purposes only, and are not intended to limit the scope of the invention in any way.
Escherichia coli strains XL-1 blue (Stratagene), Turbo (NEB) or CopyCutter EP1400 (Epicentre biotechnologies) were used for plasmid amplification and manipulation, grown at 37° C. in Lysogeny Broth (LB) or on LB agar. Strains of Synechocystis sp. PCC 6803 and Synechococcus elongatus PCC 7942 were cultivated either on BG-11 plates or in BG-11 medium (Sigma Aldrich) optionally supplemented with 10 mM TES-KOH (pH=8), and/or 10 mM bicarbonate. BG-11 agar plates were supplemented with 10 mM TES-KOH (pH=8), 0.3% (w/v) sodium thiosulfate and 5 mM glucose. Strains of Synechococcus PCC 7002 were cultivated in the same medium as Synechocystis, but supplemented with 4 μg/l cyanocobalamin. When appropriate, the following antibiotics were used: ampicillin (100 μg/ml), kanamycin (20 or 50 μg/ml, for Synechocystis and E. coli, respectively), spectinomycin (25 μg/ml), streptomycin (10 μg/ml), and chloramphenicol (20 μg/ml). Strains were grown in Erlenmeyer flasks at 30° C., shaking 120 rpm. Alternatively, the strains were grown in the MC-1000 cultivator (Photon System) or in a 10 ml culture vial (CelIDEG), according to manufacturers' protocols. At several time points samples were taken from the culture vessel to analyze cell density and product formation by HPLC analysis, using a UV and RI detector.
Restriction endonucleases were purchased from Thermo Scientific. Amplification for high fidelity reactions used for cloning or sequencing was performed using Herculase II Fusion polymerase (Agilent), using a Biometra TRIO thermocycler. Primers used are mentioned in Table 2. Cloning was performed in E. coli using CaCl2)-competent XL1-blue, Turbo or CopyCutter EPI400 cells, according to manufacturer protocol.
Natural transformation for genomic integration of exogenous genes or deletion of endogenous genes in Synechocystis was performed using plates with increasing concentrations of antibiotic for growing the transformants to drive segregation. In the case of making markerless mutants, we used the mazF gene, encoding an endoribonuclease, driven by a Ni2+ inducible promoter system that allows for counter-selection (Cheah et al., 2013). This PBRS-mazF cassette [SEQ ID NO: 33] was synthesized at a gene synthesis company (GenScript) and then combined with different antibiotic markers. This cassette was used in combination with homologous regions targeting a specific part of the genome, first introducing and fully segregating it, before removing the marker based on Ni2+ selection.
Conjugation of RSF1010-based plasmids from E. coli XL-1 to Synechocystis was performed by tri-parental mating using E. coli J53 (pRP4) as the helper strain. Correct insertion of the genes and full segregation, as well as insertion of conjugation plasmids, were verified by colony PCR with specific primers (Table 2) and MyTaq DNA polymerase (Bioline).
The genome of Synechocystis contains two glycolate dehydrogenase genes: s110404 (glcD1) [SEQ ID NO: 1, 2] and slr0806 (glcD2) [SEQ ID NO: 3, 4]. While we have deleted the whole glcD1 gene, we left some of the glcD2 intact as there is an antisense RNA present in the sequence, for which we wanted to preserve the function. To enable deletion, we have amplified the homologous regions (˜1000 bp) surrounding the genes with specific primers (#1-8; Table 2), fused them by fusion PCR while introducing restriction sites, and inserted this sequence into a pUC-18 backbone. Next, the Omega marker gene (conferring resistance to spectinomycin) and the PBRS-mazF cassette, that allows counter-selection to create markerless deletions, were inserted into these vectors.
The resulting vectors were transformed into Synechocystis, first introducing and fully segregating the Δsll0404::spR. After making the resulting strain fully markerless (ΔglcD1), we introduced the next construct Δslr0806::spR and fully segregated the resulting strain. This strain was then again made fully markerless and was named SGP009m (ΔglcD1/2). After culturing the strain, we established that it was accumulating extracellular glycolate (
Genes encoding phosphoglycolate phosphatase (PGP) [SEQ ID NO: 7, 8, 9, 10, 11, 12] were inserted into a vector targeting the slr0168 gene in the Synechocystis genome, pHKH-RFP [SEQ ID NO: 35]. These genes were either synthesized with codon-optimization (Genscript) or amplified from their host genome with specific primers (#13-14; Table 2). The genes were expressed with one of the following promoters: Ptrc, PcpcBA, PrbcL or PpsbA2 [SEQ ID NO: 37,38,39,40]. The resulting constructs were introduced into SGP009m, and tested for production of glycolate. An example of one of these strains, SGP026 is shown in
The nucleotide sequence encoding glycolate permease [SEQ ID NO: 13, 14] was synthesized with codon-optimization (Genscript) and inserted with a PpsbA2 promoter [SEQ ID NO: 39] into a vector targeting the slr0168 gene in the Synechocystis genome, pHKH-RFP [SEQ ID NO: 35]. The resulting construct was introduced into SGP009m, and tested for productivity of glycolate. A result of one of those strains is shown in
The nucleotide sequence encoding glyoxylate reductase [SEQ ID NO: 29,30] was synthesized with codon-optimization (Baseclear) and inserted with a Ptrc1 promoter [SEQ ID NO: 37] into the broad host range RSF1010-derivative plasmid pAVO+(van der Woude et al., 2016) [SEQ ID NO: 36]. The resulting construct, as well as an empty pAVO+ was introduced into Synechocystis wildtype and the ΔglcD1/2 strain SGP009m through conjugation. The resulting strains were tested for productivity of glycolate, as shown in
To remove the capacity of Synechocystis for carboxysome formation, we removed one of the genes encoding a central carboxysome component, ccmM. To delete the ccmM gene [SEQ ID NO: 34], we amplified the homologous regions (˜1000 bp) surrounding the gene with specific primers (#36-39; Table 2), fused them by fusion PCR while introducing restriction sites, and inserted this sequence into a pBSKII+ vector. Next, the marker gene (conferring resistance to spectinomycin) and the PBRS-mazF cassette (allowing counter-selection to create markerless deletions) were inserted in this vector. The resulting vector was introduced in the mutant Synechocystis strain SGP026 using the spectinomycin marker, and, after full segregation was achieved, the marker was removed through recombination based on Ni2+ selection. The resulting strain SGP105 was tested for glycolate productivity (Table 3).
Additional to the deletion of the glycolate dehydrogenase, also the gene encoding lactate dehydrogenase (slr1556) [SEQ ID NO: 5, 6] was deleted. To this end, we have amplified the homologous regions (˜1000 bp) surrounding the genes with specific primers (#15-18; Table 2), fused them by fusion PCR while introducing restriction sites, and inserted this sequence into a pUC-18 backbone. Next, the Omega marker gene (conferring resistance to spectinomycin) and the PBRS-mazF cassette, that allows counter-selection to create markerless deletions, were inserted into the vector. The resulting vector was introduced into SGP026. After full segregation of the construct, the deletion was made markerless, resulting in strain SGP201 (Table 3).
To replace the endogenous genes encoding the Rubisco operon [SEQ ID NO: 21] of Synechocystis, we amplified the homologous regions (˜1000 bp) surrounding the rbcLXS genes with specific primers (#25-28; Table 2), fused them by fusion PCR while introducing a number of restriction sites, and inserted this sequence in a pBSKII+ vector. Next, a nucleotide sequence encoding heterologous Rubisco, rbcM [SEQ ID NO: 15, 16], was amplified from Rhodospirillum rubrum with specific primers (#29-30; Table 2), and placed behind a PcpcBA promoter [SEQ ID NO: 38] inside the rbcLXS-targeting vector. Lastly, the marker gene (conferring resistance to chloramphenicol) and the PBRS-mazF cassette that allows counter-selection to create markerless deletions, were inserted in this vector. The resulting vector was used first to replace rbcLXS operon in the mutant Synechocystis strain SGP201 (Table 3) using the chloramphenicol marker, and, after full segregation was achieved, the marker was removed through recombination based on Ni2+ selection. The resulting strain was tested for glycolate productivity (
The nucleotide sequence encoding glyoxylate reductase [SEQ ID NO: 29,30] was synthesized with codon-optimization (Baseclear) and inserted in operon with a nucleotide sequence encoding isocitrate lyase[SEQ ID NO: 25,26], amplified with specific primers (#21-22; Table 2), driven by a Ptrc1 promoter [SEQ ID NO: 37] into the broad host range RSF1010-derivative plasmid pAVO+(van der Woude et al., 2016). The resulting construct was introduced into the ΔglcD1+ΔglcD2+Δldh strain SAW082m through conjugation, and tested for production of glycolate. A result of one of those strains is shown in
The PGPase overexpression cassette of SGP237 was replaced with only a kanamycin resistance marker, resulting in strain SGP338 (table 3). The strain was tested for productivity (
To test multiple different Rubisco enzymes, nucleotide sequence encoding heterologous Rubisco, rbcM [SEQ ID NO: 90, 91, 92, 93], was amplified from Rhodopseudomonas capsulatus or Rhodobacter sphaeroides with specific primers (#29-30; Table 2), and placed behind a Pcpt promoter [SEQ ID NO: 94] inside the rbcLXS-targeting vector. These sequences were introduced at the same site as rbcMfrom R. rubrum (strain SGP237, table 3), the marker was removed through recombination based on Ni2+ selection. The resulting strains (SGP340 or SGP343) were tested for glycolate productivity (
To make a strain with both form I and form II rubisco, we placed back the endogenous Rubisco in SGP237. To this end, the Rubisco operon [SEQ ID NO: 21] of Synechocystis was amplified with specific primers and cloned behind a behind a Ptrc promoter [SEQ ID NO: 37] in a vector targeting neutral site NSC2. The vector was introduced in SGP237 and the resulting strains (SGP340 or SGP343) were tested for glycolate productivity (
To delete the gene encoding glycolate dehydrogenase in Synechococcus PCC7002 [SEQ ID NO: 95,96], we have amplified the homologous regions (˜1000 bp) surrounding the genes with specific primers (#15-18; Table 2), fused them by fusion PCR while introducing restriction sites, and inserted this sequence into a pBSKII+ vector. Next, the kanR marker gene (conferring resistance to kanamycin) and the PBRS-mazF cassette, that allows counter-selection to create markerless deletions, were inserted into the vector. The resulting vector was introduced into Synechococcus PCC7002 and full segregation resulted in strain ScGP001 (Table 4).
To replace the endogenous genes encoding the Rubisco operon [SEQ ID NO: 97] of Synechococcus PCC7002, we amplified the homologous regions (˜1000 bp) surrounding the rbcLXS genes with specific primers (#25-28; Table 2), fused them by fusion PCR while introducing a number of restriction sites, and inserted this sequence in a pBSKII+ vector. Next, a nucleotide sequence encoding heterologous Rubisco, rbcM [SEQ ID NO: 15, 16], was amplified from Rhodospirillum rubrum with specific primers (#29-30; Table 2) and placed behind a PcpcBA promoter [SEQ ID NO: 38] inside the rbcLXS-targeting vector. Lastly, the marker gene (conferring resistance to chloramphenicol) and the PBRS-mazFcassette that allows counter-selection to create markerless deletions, were inserted in this vector. The resulting vector was used first to replace rbcLXS operon in the mutant Synechococcus PCC7002 strain ScGP001 (Table 4) using the chloramphenicol marker and full segregation was achieved. The resulting strain ScGP006 was tested for glycolate productivity (
To replace the endogenous genes encoding the Rubisco operon [SEQ ID NO: 84] of Synechococcus elongatus PCC7942, we amplified the homologous regions (˜1000 bp) surrounding the rbcLXS genes with specific primers (#25-28; Table 2), fused them by fusion PCR while introducing a number of restriction sites, and inserted this sequence in a pBSKII+ vector. Next, a nucleotide sequence encoding heterologous Rubisco, rbcM [SEQ ID NO: 15, 16], was amplified from Rhodospirillum rubrum with specific primers (#29-30; Table 2) and placed behind a PcpcBA promoter [SEQ ID NO: 38] inside the rbcLXS-targeting vector, together with the marker gene camR (conferring resistance to chloramphenicol). The resulting vector was used to replace rbcLXSoperon in Synechococcus elongatus PCC7942 using the chloramphenicol marker and full segregation resulted in strain SeGP002 (Table 4).
To delete the gene encoding glycolate dehydrogenase in Synechococcus elongatus PCC7942 [SEQ ID NO: 101,102], we have amplified the homologous regions (˜1000 bp) surrounding the genes with specific primers (#15-18; Table 2), fused them by fusion PCR while introducing restriction sites, and inserted this sequence into a pBSKII+ vector. Next, we introduced a gene encoding phosphoglycolate phosphatase (PGP) [SEQ ID NO: 9, 10] behind a Ptrc promoter [SEQ ID NO: 37] and the kanR marker gene (conferring resistance to kanamycin) into the vector. The resulting vector was introduced into Synechococcus elongatus PCC7942 strain SeGP002 and full segregation was achieved. The resulting strain SeGP004 (Table 4) was tested for glycolate productivity (
Synechocystis PCC6803 strain list with glycolate
Synechocystis
Synechococcus strain list with glycolate
Synechococcus
Synechococcus PCC7002
Synechococcus PCC7002
Synechococcus
elongatus PCC7942
Synechococcus
elongatus PCC7942
Number | Date | Country | Kind |
---|---|---|---|
19153544.2 | Jan 2019 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/051785 | 1/24/2020 | WO | 00 |