The instant application contains a Sequence Listing which has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jun. 19, 2020, is named G0919.70033WO00-SEQ-OMJ.txt, and is 1.76 megabytes (MB) in size.
The present disclosure relates to enzymes, nucleic acids, and cells useful for the conversion of leucine to isopentanol.
Maple syrup urine disease (MSUD) is a metabolic disorder caused by a deficiency of the branched-chain alpha-keto acid dehydrogenase complex (BCKDC), leading to a buildup of the branched-chain amino acids (leucine, isoleucine, and valine) and their toxic by-products (ketoacids) in the blood and urine. MSUD gets its name from the distinctive sweet odor of affected individual's urine, particularly prior to diagnosis, and during times of acute illness. There remains a need for improved treatments for MSUD and other conditions characterized by excessive branched-chain amino acids.
The present disclosure is based, at least in part, on generation of engineered cells containing enzymes for consuming leucine, for example, by converting leucine to isopentanol. Such engineered cells are useful, e.g., to treat diseases associated with accumulation of leucine such as MSUD.
Aspects of the disclosure relate to host cells that comprise a heterologous polynucleotide encoding a leucine dehydrogenase (LeuDH) enzyme, wherein the LeuDH enzyme comprises an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 2, 4, 6, 8, 10, and 12. In some embodiments, the LeuDH enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO: 2. In some embodiments, the LeuDH enzyme comprises SEQ ID NO: 2. In some embodiments, the LeuDH enzyme comprises: V at a residue corresponding to residue 13 in SEQ ID NO: 27; W at a residue corresponding to residue 16 in SEQ ID NO: 27; Q at a residue corresponding to residue 42 in SEQ ID NO: 27; T, Y, F, E, or W at a residue corresponding to residue 43 in SEQ ID NO: 27; I, H, K, or Y at a residue corresponding to residue 44 in SEQ ID NO: 27; T, E, A, S, or K at a residue corresponding to residue 67 in SEQ ID NO: 27; K at a residue corresponding to residue 71 in SEQ ID NO: 27; S at a residue corresponding to residue 73 in SEQ ID NO: 27; R, H, Y, S, K, or W at a residue corresponding to residue 76 in SEQ ID NO: 27; Y at a residue corresponding to residue 92 in SEQ ID NO: 27; H at a residue corresponding to residue 93 in SEQ ID NO: 27; G at a residue corresponding to residue 95 in SEQ ID NO: 27; G at a residue corresponding to residue 100 in SEQ ID NO: 27; C at a residue corresponding to residue 105 in SEQ ID NO: 27; G at a residue corresponding to residue 111 in SEQ ID NO: 27; M at a residue corresponding to residue 113 in SEQ ID NO: 27; N, or V at a residue corresponding to residue 115 in SEQ ID NO: 27; R, N, or W at a residue corresponding to residue 116 in SEQ ID NO: 27; A at a residue corresponding to residue 120 in SEQ ID NO: 27; D at a residue corresponding to residue 122 in SEQ ID NO: 27; E at a residue corresponding to residue 136 in SEQ ID NO: 27; D at a residue corresponding to residue 140 in SEQ ID NO: 27; M at a residue corresponding to residue 141 in SEQ ID NO: 27; S at a residue corresponding to residue 160 in SEQ ID NO: 27; F at a residue corresponding to residue 185 in SEQ ID NO: 27; N at a residue corresponding to residue 196 in SEQ ID NO: 27; Y at a residue corresponding to residue 228 in SEQ ID NO: 27; M at a residue corresponding to residue 248 in SEQ ID NO: 27; C at a residue corresponding to residue 256 in SEQ ID NO: 27; Q or C at a residue corresponding to residue 293 in SEQ ID NO: 27; K or N at a residue corresponding to residue 296 in SEQ ID NO: 27; R, Q, or K at a residue corresponding to residue 297 in SEQ ID NO: 27; C or D at a residue corresponding to residue 300 in SEQ ID NO: 27; T or S at a residue corresponding to residue 302 in SEQ ID NO: 27; C at a residue corresponding to residue 305 in SEQ ID NO: 27; F at a residue corresponding to residue 319 in SEQ ID NO: 27; and/or M at a residue corresponding to residue 330 in SEQ ID NO: 27.
Further aspects of the disclosure relate to host cells that comprise a heterologous polynucleotide encoding a leucine dehydrogenase (LeuDH) enzyme, wherein the LeuDH enzyme comprises: V at a residue corresponding to residue 13 in SEQ ID NO: 27; W at a residue corresponding to residue 16 in SEQ ID NO: 27; Q at a residue corresponding to residue 42 in SEQ ID NO: 27; T, Y, F, E, or W at a residue corresponding to residue 43 in SEQ ID NO: 27; I, H, K, or Y at a residue corresponding to residue 44 in SEQ ID NO: 27; T, E, A, S, or K at a residue corresponding to residue 67 in SEQ ID NO: 27; K at a residue corresponding to residue 71 in SEQ ID NO: 27; S at a residue corresponding to residue 73 in SEQ ID NO: 27; R, H, Y, S, K, or W at a residue corresponding to residue 76 in SEQ ID NO: 27; Y at a residue corresponding to residue 92 in SEQ ID NO: 27; H at a residue corresponding to residue 93 in SEQ ID NO: 27; G at a residue corresponding to residue 95 in SEQ ID NO: 27; G at a residue corresponding to residue 100 in SEQ ID NO: 27; C at a residue corresponding to residue 105 in SEQ ID NO: 27; G at a residue corresponding to residue 111 in SEQ ID NO: 27; M at a residue corresponding to residue 113 in SEQ ID NO: 27; N, or V at a residue corresponding to residue 115 in SEQ ID NO: 27; R, N, or W at a residue corresponding to residue 116 in SEQ ID NO: 27; A at a residue corresponding to residue 120 in SEQ ID NO: 27; D at a residue corresponding to residue 122 in SEQ ID NO: 27; E at a residue corresponding to residue 136 in SEQ ID NO: 27; D at a residue corresponding to residue 140 in SEQ ID NO: 27; M at a residue corresponding to residue 141 in SEQ ID NO: 27; S at a residue corresponding to residue 160 in SEQ ID NO: 27; F at a residue corresponding to residue 185 in SEQ ID NO: 27; N at a residue corresponding to residue 196 in SEQ ID NO: 27; Y at a residue corresponding to residue 228 in SEQ ID NO: 27; M at a residue corresponding to residue 248 in SEQ ID NO: 27; C at a residue corresponding to residue 256 in SEQ ID NO: 27; Q or C at a residue corresponding to residue 293 in SEQ ID NO: 27; K or N at a residue corresponding to residue 296 in SEQ ID NO: 27; R, Q, or K at a residue corresponding to residue 297 in SEQ ID NO: 27; C or D at a residue corresponding to residue 300 in SEQ ID NO: 27; T or S at a residue corresponding to residue 302 in SEQ ID NO: 27; C at a residue corresponding to residue 305 in SEQ ID NO: 27; F at a residue corresponding to residue 319 in SEQ ID NO: 27; and M at a residue corresponding to residue 330 in SEQ ID NO: 27.
Further aspects of the disclosure relate to host cells that comprise a heterologous polynucleotide encoding a leucine dehydrogenase (LeuDH) enzyme, wherein relative to SEQ ID NO: 27, the LeuDH enzyme comprises an amino acid substitution at amino acid residue: 42, 43, 44, 67, 71, 76, 78, 113, 115, 116, 136, 293, 296, 297 and/or 300. In some embodiments, the LeuDH enzyme comprises: A, Q, or T at residue 42; E, F, T, W, or Y at residue 43; H, I, K, or Y at residue 44; A, E, K, Q, S, or T at residue 67; C, D, H, K, M, or T at residue 71; E, F, H, I, K, M, R, S, T, W, or Y at residue 76; C, F, H, K, Q, V, or Y at residue 78; F, M, Q, V, W, or Y at residue 113; N, Q, S, T, or V at residue 115; A, L, M, N, R, S, V, or W at residue 116; E, F, L, R, S, or Y at residue 136; A, C, Q, S, or T at residue 293; A, C, E, I, K, L, N, S, or T at residue 296; C, D, E, F, H, K, L, M, N, Q, R, T, W, or Y at residue 297; and/or A, C, D, F, H, K, M, N, Q, R, S, T, W, or Y at residue 300.
Further aspects of the present disclosure relate to non-naturally occurring LeuDH enzymes, wherein relative to SEQ ID NO: 27, the LeuDH enzyme comprises an amino acid substitution at amino acid residue: 42, 43, 44, 67, 71, 76, 78, 113, 115, 116, 136, 293, 296, 297 and/or 300. In some embodiments, the LeuDH enzyme comprises: A, Q, or T at residue 42; E, F, T, W, or Y at residue 43; H, I, K, or Y at residue 44; A, E, K, Q, S, or T at residue 67; C, D, H, K, M, or Tat residue 71; E, F, H, I, K, M, R, S, T, W, or Y at residue 76; C, F, H, K, Q, V, or Y at residue 78; F, M, Q, V, W, or Y at residue 113; N, Q, S, T, or V at residue 115; A, L, M, N, R, S, V, or W at residue 116; E, F, L, R, S, or Y at residue 136; A, C, Q, S, or T at residue 293; A, C, E, I, K, L, N, S, or T at residue 296; C, D, E, F, H, K, L, M, N, Q, R, T, W, or Y at residue 297; and/or A, C, D, F, H, K, M, N, Q, R, S, T, W, or Y at residue 300.
Further aspects of the disclosure relate to host cells that comprise a heterologous polynucleotide encoding a branched chain α-ketoacid decarboxylase (KivD) enzyme, wherein the KivD enzyme comprises an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 14, 16, and 18. In some embodiments, the KivD enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO: 18. In some embodiments, the KivD enzyme comprises SEQ ID NO: 18. In some embodiments, the KivD enzyme comprises: Y at a residue corresponding to residue 33 in SEQ ID NO: 29; Q at a residue corresponding to residue 44 in SEQ ID NO: 29; M at a residue corresponding to residue 117 in SEQ ID NO: 29; I at a residue corresponding to residue 129 in SEQ ID NO: 29; W at a residue corresponding to residue 185 in SEQ ID NO: 29; I at a residue corresponding to residue 190 in SEQ ID NO: 29; I at a residue corresponding to residue 225 in SEQ ID NO: 29; Y at a residue corresponding to residue 227 in SEQ ID NO: 29; L at a residue corresponding to residue 311 in SEQ ID NO: 29; G at a residue corresponding to residue 312 in SEQ ID NO: 29; T at a residue corresponding to residue 313 in SEQ ID NO: 29; P at a residue corresponding to residue 328 in SEQ ID NO: 29; W at a residue corresponding to residue 341 in SEQ ID NO: 29; H at a residue corresponding to residue 345 in SEQ ID NO: 29; C at a residue corresponding to residue 347 in SEQ ID NO: 29; R at a residue corresponding to residue 420 in SEQ ID NO: 29; D at a residue corresponding to residue 494 in SEQ ID NO: 29; C at a residue corresponding to residue 508 in SEQ ID NO: 29; and/or F at a residue corresponding to residue 550 in SEQ ID NO: 29.
Further aspects of the disclosure relate to host cells that comprise a heterologous polynucleotide encoding a branched chain α-ketoacid decarboxylase (KivD) enzyme, wherein the KivD enzyme comprises: Y at a residue corresponding to residue 33 in SEQ ID NO: 29; Q at a residue corresponding to residue 44 in SEQ ID NO: 29; M at a residue corresponding to residue 117 in SEQ ID NO: 29; I at a residue corresponding to residue 129 in SEQ ID NO: 29; W at a residue corresponding to residue 185 in SEQ ID NO: 29; I at a residue corresponding to residue 190 in SEQ ID NO: 29; I at a residue corresponding to residue 225 in SEQ ID NO: 29; Y at a residue corresponding to residue 227 in SEQ ID NO: 29; L at a residue corresponding to residue 311 in SEQ ID NO: 29; G at a residue corresponding to residue 312 in SEQ ID NO: 29; T at a residue corresponding to residue 313 in SEQ ID NO: 29; P at a residue corresponding to residue 328 in SEQ ID NO: 29; W at a residue corresponding to residue 341 in SEQ ID NO: 29; H at a residue corresponding to residue 345 in SEQ ID NO: 29; C at a residue corresponding to residue 347 in SEQ ID NO: 29; R at a residue corresponding to residue 420 in SEQ ID NO: 29; D at a residue corresponding to residue 494 in SEQ ID NO: 29; C at a residue corresponding to residue 508 in SEQ ID NO: 29; and F at a residue corresponding to residue 550 in SEQ ID NO: 29.
Further aspects of the disclosure relate to host cells that comprise a heterologous polynucleotide encoding an alcohol dehydrogenase (Adh) enzyme wherein the Adh enzyme comprises an amino acid sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 20, 22, and 24. In some embodiments, the Adh enzyme comprises an amino acid sequence that is at least 90% identical to SEQ ID NO: 24. In some embodiments, the Adh enzyme comprises SEQ ID NO: 24. In some embodiments, the Adh enzyme comprises: P at a residue corresponding to residue 9 in SEQ ID NO: 31; G at a residue corresponding to residue 16 in SEQ ID NO: 31; Q at a residue corresponding to residue 23 in SEQ ID NO: 31; R at a residue corresponding to residue 28 in SEQ ID NO: 31; A at a residue corresponding to residue 30 in SEQ ID NO: 31; K at a residue corresponding to residue 93 in SEQ ID NO: 31; L at a residue corresponding to residue 98 in SEQ ID NO: 31; R at a residue corresponding to residue 99 in SEQ ID NO: 31; P at a residue corresponding to residue 114 in SEQ ID NO: 31; K at a residue corresponding to residue 115 in SEQ ID NO: 31; Y at a residue corresponding to residue 119 in SEQ ID NO: 31; Y at a residue corresponding to residue 194 in SEQ ID NO: 31; P at a residue corresponding to residue 242 in SEQ ID NO: 31; K at a residue corresponding to residue 249 in SEQ ID NO: 31; E at a residue corresponding to residue 255 in SEQ ID NO: 31; D at a residue corresponding to residue 260 in SEQ ID NO: 31; H at a residue corresponding to residue 269 in SEQ ID NO: 31; Q at a residue corresponding to residue 281 in SEQ ID NO: 31; L at a residue corresponding to residue 325 in SEQ ID NO: 31; M at a residue corresponding to residue 333 in SEQ ID NO: 31; P at a residue corresponding to residue 334 in SEQ ID NO: 31; and/or Q at a residue corresponding to residue 348 in SEQ ID NO: 31.
Further aspects of the disclosure relate to host cells that comprises a heterologous polynucleotide encoding a an alcohol dehydrogenase (Adh) enzyme, wherein the Adh enzyme comprises: P at a residue corresponding to residue 9 in SEQ ID NO: 31; G at a residue corresponding to residue 16 in SEQ ID NO: 31; Q at a residue corresponding to residue 23 in SEQ ID NO: 31; R at a residue corresponding to residue 28 in SEQ ID NO: 31; A at a residue corresponding to residue 30 in SEQ ID NO: 31; K at a residue corresponding to residue 93 in SEQ ID NO: 31; L at a residue corresponding to residue 98 in SEQ ID NO: 31; R at a residue corresponding to residue 99 in SEQ ID NO: 31; P at a residue corresponding to residue 114 in SEQ ID NO: 31; K at a residue corresponding to residue 115 in SEQ ID NO: 31; Y at a residue corresponding to residue 119 in SEQ ID NO: 31; Y at a residue corresponding to residue 194 in SEQ ID NO: 31; P at a residue corresponding to residue 242 in SEQ ID NO: 31; K at a residue corresponding to residue 249 in SEQ ID NO: 31; E at a residue corresponding to residue 255 in SEQ ID NO: 31; D at a residue corresponding to residue 260 in SEQ ID NO: 31; H at a residue corresponding to residue 269 in SEQ ID NO: 31; Q at a residue corresponding to residue 281 in SEQ ID NO: 31; L at a residue corresponding to residue 325 in SEQ ID NO: 31; M at a residue corresponding to residue 333 in SEQ ID NO: 31; P at a residue corresponding to residue 334 in SEQ ID NO: 31; and Q at a residue corresponding to residue 348 in SEQ ID NO: 31.
In some embodiments, the host cell is a plant cell, an algal cell, a yeast cell, a bacterial cell, or an animal cell. In some embodiments, the host cell is a yeast cell. In some embodiments, the yeast cell is a Saccharomyces cell, a Yarrowia cell or a Pichia cell. In some embodiments, the host cell is a bacterial cell. In some embodiments, the bacterial cell is an E. coli cell or a Bacillus cell.
In some embodiments, the host cell further comprises a heterologous polynucleotide encoding a branched-chain amino acid transport system 2 carrier protein (BrnQ). In some embodiments, the BrnQ protein is at least 90% identical to the amino acid sequence of SEQ ID NO: 35. In some embodiments, the BrnQ protein comprises the amino acid sequence of SEQ ID NO: 35.
In some embodiments, the heterologous polynucleotide is operably linked to an inducible promoter. In some embodiments, the heterologous polynucleotide is expressed in an operon. In some embodiments, the operon expresses more than one heterologous polynucleotide, and a ribosome binding site may be present between each heterologous polynucleotide.
In some embodiments, the host cell further comprises a heterologous polynucleotide encoding a KivD enzyme and/or a heterologous polynucleotide encoding an Adh enzyme.
In some embodiments, the host cell further comprises a heterologous polynucleotide encoding a LeuDH enzyme and/or a heterologous polynucleotide encoding an Adh enzyme.
In some embodiments, the host cell further comprises a heterologous polynucleotide encoding a LeuDH enzyme and/or a heterologous polynucleotide encoding a KivD enzyme.
In some embodiments, the host cell is capable of producing isopentanol from leucine. In some embodiments, the host cell consumes at least two-fold more leucine relative to a control host cell that comprises a heterologous polynucleotide encoding a control LeuDH enzyme comprising the sequence of SEQ ID NO: 27, a heterologous polynucleotide encoding a control KivD enzyme comprising the sequence of SEQ ID NO: 29, a heterologous polynucleotide encoding a control Adh enzyme comprising the sequence of SEQ ID NO: 31, and a heterologous polynucleotide encoding a control BrnQ protein comprising the sequence of SEQ ID NO: 35.
Further aspects of the disclosure relate to methods comprising culturing any of the host cells disclosed in this application.
Further aspects of the disclosure relate to methods for producing isopentanol from leucine comprising culturing any of the host cells disclosed in this application.
Further aspects of the disclosure relate to non-naturally occurring nucleic acids comprising a sequence that is at least 90% identical to a nucleic acid sequence selected from SEQ ID NOs: 1, 3, 5, 7, 9, and 11.
Further aspects of the disclosure relate to non-naturally occurring nucleic acids comprising a sequence that is at least 90% identical to a nucleic acid sequence selected from SEQ ID NOs: 13, 15, and 17.
Further aspects of the disclosure relate to non-naturally occurring nucleic acids comprising a sequence that is at least 90% identical to a nucleic acid sequence selected from SEQ ID NOs: 19, 21, and 23.
Further aspects of the disclosure relate to non-naturally occurring nucleic acids encoding a sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 2, 4, 6, 8, 10, and 12.
Further aspects of the disclosure relate to non-naturally occurring nucleic acids encoding a sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 14, 16, and 18.
Further aspects of the disclosure relate to non-naturally occurring nucleic acids encoding a sequence that is at least 90% identical to a sequence selected from SEQ ID NOs: 20, 22, and 24.
Further aspects of the disclosure relate to vectors comprising any of the non-naturally occurring nucleic acids disclosed in this application.
Further aspects of the disclosure relate to expression cassettes comprising any of the non-naturally occurring nucleic acids disclosed in this application.
Each of the limitations of the invention can encompass various embodiments of the invention. It is, therefore, anticipated that each of the limitations of the invention involving any one element or combinations of elements can be included in each aspect of the invention. This invention is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the drawings. The invention is capable of other embodiments and of being practiced or of being carried out in various ways.
The accompanying drawings are not intended to be drawn to scale. In the drawings, each identical or nearly identical component that is illustrated in various figures is represented by a like numeral. For purposes of clarity, not every component may be labeled in every drawing. In the drawings:
The present disclosure provides, in some aspects, cells and combination of enzymes of the branched-chain amino acid (BCAA) pathway that are engineered for leucine consumption. These BCAA pathway enzymes include leucine dehydrogenase (LeuDH), ketoisovalerate decarboxylase (KivD), and alcohol dehydrogenase (Adh). The disclosed enzymes and host cells comprising such enzymes may be used to promote leucine consumption, e.g., in a subject suffering from a disorder associated with a buildup of BCAA (e.g., leucine) such as maple syrup urine disease (MSUD) and in other medical and industrial settings.
As used in this disclosure “leucine dehydrogenase (LeuDH)” refers to an enzyme that catalyzes the reversible deamination of branched-chain L-amino acids (e.g., L-leucine, L-valine, L-isoleucine) to their 2-oxo analogs. A LeuDH enzyme may use L-leucine as a substrate. In some embodiments, LeuDH exhibits specificity for L-leucine compared to L-valine and/or L-isoleucine. In some embodiments, LeuDH produces ketoisocaproate (also known as 2-oxoisocaproate) from L-leucine.
In some embodiments, a host cell comprises a LeuDH enzyme and/or a heterologous polynucleotide encoding such an enzyme. In some embodiments, a host cell comprises a heterologous polynucleotide encoding a LeuDH enzyme comprising an amino acid sequence that is at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to any one of SEQ ID NO: 2, 4, 6, 8, 10, 12, or 257-475, a LeuDH enzyme in Table 3 or Table 4, or a LeuDH enzyme otherwise described in this disclosure. In some embodiments, a host cell comprises a heterologous polynucleotide that is at least 90% (e.g., at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to any one of SEQ ID NO: 1, 3, 5, 7, 9, 11, or 37-255, a polynucleotide encoding a LeuDH enzyme in Table 3 or Table 4, or a LeuDH enzyme otherwise described in this disclosure.
In some embodiments, a host cell comprises LeuDH from Bacillus cereus. In other embodiments, a host cell does not comprise LeuDH from Bacillus cereus.
LeuDH from Bacillus cereus can comprise the amino acid sequence of UniProtKB—P0A392 (SEQ ID NO: 27):
In some embodiments, the amino acid sequence of SEQ ID NO: 27 is encoded by the nucleic acid sequence:
In some embodiments, a host cell that expresses a heterologous polynucleotide encoding a LeuDH enzyme may increase conversion of leucine to ketoisocaproate by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more) relative to a control. In some embodiments, the control is a host cell that expresses a heterologous polynucleotide encoding SEQ ID NO: 27. In some embodiments, the control is an E. coli Nissle strain SYN1980 ΔleuE, ΔilvC, lacZ:tetR-Ptet-livKHMGF, tetR-Ptet-leuDH(Bc)-kivD-adh2-brnQ-rrnB ter (pSC101), such as is described in U.S. Patent Application Publication No. US20170232043.
In some embodiments, a host cell that expresses a heterologous polynucleotide encoding a LeuDH enzyme may exhibit at least 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more) more activity on leucine relative to valine. In some embodiments, a host cell that expresses a heterologous polynucleotide encoding a LeuDH enzyme may exhibit at least 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more) more activity on leucine relative to isoleucine.
In some embodiments, a LeuDH comprises a sequence that is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or is 100% identical to SEQ ID NO: 27, any one of SEQ ID NO: 2, 4, 6, 8, 10, 12, or 257-475, any one of SEQ ID NO: 1, 3, 5, 7, 9, 11, or 37-255, an amino acid or polynucleotide sequence of a LeuDH enzyme in Table 3 or Table 4, or a LeuDH enzyme otherwise described in this disclosure.
In some embodiments, such a LeuDH enzyme comprises: V at a residue corresponding to residue 13 in SEQ ID NO: 27; W at a residue corresponding to residue 16 in SEQ ID NO: 27; Q at a residue corresponding to residue 42 in SEQ ID NO: 27; T, Y, F, E, or W at a residue corresponding to residue 43 in SEQ ID NO: 27; I, H, K, or Y at a residue corresponding to residue 44 in SEQ ID NO: 27; T, E, A, S, or K at a residue corresponding to residue 67 in SEQ ID NO: 27; K at a residue corresponding to residue 71 in SEQ ID NO: 27; S at a residue corresponding to residue 73 in SEQ ID NO: 27; R, H, Y, S, K, or W at a residue corresponding to residue 76 in SEQ ID NO: 27; Y at a residue corresponding to residue 92 in SEQ ID NO: 27; H at a residue corresponding to residue 93 in SEQ ID NO: 27; G at a residue corresponding to residue 95 in SEQ ID NO: 27; G at a residue corresponding to residue 100 in SEQ ID NO: 27; C at a residue corresponding to residue 105 in SEQ ID NO: 27; G at a residue corresponding to residue 111 in SEQ ID NO: 27; M at a residue corresponding to residue 113 in SEQ ID NO: 27; N, or V at a residue corresponding to residue 115 in SEQ ID NO: 27; R, N, or W at a residue corresponding to residue 116 in SEQ ID NO: 27; A at a residue corresponding to residue 120 in SEQ ID NO: 27; D at a residue corresponding to residue 122 in SEQ ID NO: 27; E at a residue corresponding to residue 136 in SEQ ID NO: 27; D at a residue corresponding to residue 140 in SEQ ID NO: 27; M at a residue corresponding to residue 141 in SEQ ID NO: 27; S at a residue corresponding to residue 160 in SEQ ID NO: 27; F at a residue corresponding to residue 185 in SEQ ID NO: 27; N at a residue corresponding to residue 196 in SEQ ID NO: 27; Y at a residue corresponding to residue 228 in SEQ ID NO: 27; M at a residue corresponding to residue 248 in SEQ ID NO: 27; C at a residue corresponding to residue 256 in SEQ ID NO: 27; Q or C at a residue corresponding to residue 293 in SEQ ID NO: 27; K or N at a residue corresponding to residue 296 in SEQ ID NO: 27; R, Q, or K at a residue corresponding to residue 297 in SEQ ID NO: 27; C or D at a residue corresponding to residue 300 in SEQ ID NO: 27; T or S at a residue corresponding to residue 302 in SEQ ID NO: 27; C at a residue corresponding to residue 305 in SEQ ID NO: 27; F at a residue corresponding to residue 319 in SEQ ID NO: 27; and/or M at a residue corresponding to residue 330 in SEQ ID NO: 27.
In some embodiments, a LeuDH enzyme comprises: V at a residue corresponding to residue 13 in SEQ ID NO: 27; W at a residue corresponding to residue 16 in SEQ ID NO: 27; Q at a residue corresponding to residue 42 in SEQ ID NO: 27; T, Y, F, E, or W at a residue corresponding to residue 43 in SEQ ID NO: 27; I, H, K, or Y at a residue corresponding to residue 44 in SEQ ID NO: 27; T, E, A, S, or K at a residue corresponding to residue 67 in SEQ ID NO: 27; K at a residue corresponding to residue 71 in SEQ ID NO: 27; S at a residue corresponding to residue 73 in SEQ ID NO: 27; R, H, Y, S, K, or W at a residue corresponding to residue 76 in SEQ ID NO: 27; Y at a residue corresponding to residue 92 in SEQ ID NO: 27; H at a residue corresponding to residue 93 in SEQ ID NO: 27; G at a residue corresponding to residue 95 in SEQ ID NO: 27; G at a residue corresponding to residue 100 in SEQ ID NO: 27; C at a residue corresponding to residue 105 in SEQ ID NO: 27; G at a residue corresponding to residue 111 in SEQ ID NO: 27; M at a residue corresponding to residue 113 in SEQ ID NO: 27; N, or V at a residue corresponding to residue 115 in SEQ ID NO: 27; R, N, or W at a residue corresponding to residue 116 in SEQ ID NO: 27; A at a residue corresponding to residue 120 in SEQ ID NO: 27; D at a residue corresponding to residue 122 in SEQ ID NO: 27; E at a residue corresponding to residue 136 in SEQ ID NO: 27; D at a residue corresponding to residue 140 in SEQ ID NO: 27; M at a residue corresponding to residue 141 in SEQ ID NO: 27; S at a residue corresponding to residue 160 in SEQ ID NO: 27; F at a residue corresponding to residue 185 in SEQ ID NO: 27; N at a residue corresponding to residue 196 in SEQ ID NO: 27; Y at a residue corresponding to residue 228 in SEQ ID NO: 27; M at a residue corresponding to residue 248 in SEQ ID NO: 27; C at a residue corresponding to residue 256 in SEQ ID NO: 27; Q or C at a residue corresponding to residue 293 in SEQ ID NO: 27; K or N at a residue corresponding to residue 296 in SEQ ID NO: 27; R, Q, or K at a residue corresponding to residue 297 in SEQ ID NO: 27; C or D at a residue corresponding to residue 300 in SEQ ID NO: 27; T or S at a residue corresponding to residue 302 in SEQ ID NO: 27; C at a residue corresponding to residue 305 in SEQ ID NO: 27; F at a residue corresponding to residue 319 in SEQ ID NO: 27; and M at a residue corresponding to residue 330 in SEQ ID NO: 27.
In some embodiments, a LeuDH enzyme comprises at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, at least 39, at least 40, at least 41, at least 42, at least 43, at least 44, at least 45, at least 46, at least 47, at least 48, at least 49, at least 50, at least 60, at least 70, at least 80, at least 90, or at least 100 amino acid substitutions, deletions, insertions, or additions relative to SEQ ID NO: 27, any one of SEQ ID NO: 2, 4, 6, 8, 10, 12, or 257-475, a LeuDH enzyme in Table 3 or Table 4, or a LeuDH enzyme otherwise described in this disclosure.
In some embodiments, a LeuDH enzyme comprises an amino acid substitution at one or more residues relative to SEQ ID NO: 27. In some embodiments, a LeuDH enzyme comprises an amino acid substitution at a residue corresponding to position 42 in SEQ ID NO: 27, at a residue corresponding to position 43 in SEQ ID NO: 27, at a residue corresponding to position 44 in SEQ ID NO: 27, at a residue corresponding to position 67 in SEQ ID NO: 27, at a residue corresponding to position 71 in SEQ ID NO: 27, at a residue corresponding to position 76 in SEQ ID NO: 27, at a residue corresponding to position 78 in SEQ ID NO: 27, at a residue corresponding to position 113 in SEQ ID NO: 27, at a residue corresponding to position 115 in SEQ ID NO: 27, at a residue corresponding to position 116 in SEQ ID NO: 27, at a residue corresponding to position 136 in SEQ ID NO: 27, at a residue corresponding to position 293 in SEQ ID NO: 27, at a residue corresponding to position 296 in SEQ ID NO: 27, at a residue corresponding to position 297 in SEQ ID NO: 27, and/or at a residue corresponding to position 300 in SEQ ID NO: 27. In some embodiments, a LeuDH enzyme comprises: A, Q, or T at a residue corresponding to position 42 in SEQ ID NO: 27; E, F, T, W, or Y at a residue corresponding to position 43 in SEQ ID NO: 27; H, I, K, or Y at a residue corresponding to position 44 in SEQ ID NO: 27; A, E, K, Q, S, or T at a residue corresponding to position 67 in SEQ ID NO: 27; C, D, H, K, M, or T at a residue corresponding to position 71 in SEQ ID NO: 27; E, F, H, I, K, M, R, S, T, W, or Y at a residue corresponding to position 76 in SEQ ID NO: 27; C, F, H, K, Q, V, or Y at a residue corresponding to position 78 in SEQ ID NO: 27; F, M, Q, V, W, or Y at a residue corresponding to position 113 in SEQ ID NO: 27; N, Q, S, T, or V at a residue corresponding to position 115 in SEQ ID NO: 27; A, L, M, N, R, S, V, or W at a residue corresponding to position 116 in SEQ ID NO: 27; E, F, L, R, S, or Y at a residue corresponding to position 136 in SEQ ID NO: 27; A, C, Q, S, or T at a residue corresponding to position 293 in SEQ ID NO: 27; A, C, E, I, K, L, N, S, or T at a residue corresponding to position 296 in SEQ ID NO: 27; C, D, E, F, H, K, L, M, N, Q, R, T, W, or Y at a residue corresponding to position 297 in SEQ ID NO: 27; and/or A, C, D, F, H, K, M, N, Q, R, S, T, W, or Y at a residue corresponding to position 300 in SEQ ID NO: 27.
In some embodiments, relative to SEQ ID NO: 27, a LeuDH enzyme comprises an amino acid substitution at amino acid residue: 42, 43, 44, 67, 71, 76, 78, 113, 115, 116, 136, 293, 296, 297 and/or 300. In some embodiments, a LeuDH enzyme comprises A, Q, or T at residue 42; E, F, T, W, or Y at residue 43; H, I, K, or Y at residue 44; A, E, K, Q, S, or T at residue 67; C, D, H, K, M, or T at residue 71; E, F, H, I, K, M, R, S, T, W, or Y at residue 76; C, F, H, K, Q, V, or Y at residue 78; F, M, Q, V, W, or Y at residue 113; N, Q, S, T, or V at residue 115; A, L, M, N, R, S, V, or W at residue 116; E, F, L, R, S, or Y at residue 136; A, C, Q, S, or T at residue 293; A, C, E, I, K, L, N, S, or T at residue 296; C, D, E, F, H, K, L, M, N, Q, R, T, W, or Y at residue 297; and/or A, C, D, F, H, K, M, N, Q, R, S, T, W, or Y at residue 300.
As used in this disclosure “ketoisovalerate decarboxylase (KivD)” refers to an enzyme that catalyzes the decarboxylation of alpha-keto acids derived from amino acid transamination into aldehydes. A KivD may use ketoisocaproate as a substrate. In some embodiments, KivD produces isovaleraldehyde from ketoisocaproate.
In some embodiments, a host cell comprises a KivD enzyme and/or a heterologous polynucleotide encoding such an enzyme. In some embodiments, a host cell comprises a heterologous polynucleotide encoding a KivD enzyme comprising an amino acid sequence that is at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to any one of SEQ ID NO: 14, 16, 18, or 533-588, a KivD enzyme in Table 3 or Table 5, or a KivD enzyme otherwise described in this disclosure. In some embodiments, a host cell comprises a heterologous polynucleotide that is at least 90% (e.g., at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to any one of SEQ ID NO: 13, 15, 17 or 477-532, a polynucleotide encoding a KivD enzyme in Table 3 or Table 5, or a polynucleotide encoding a KivD enzyme otherwise described in this disclosure.
In some embodiments, a host cell comprises KivD from Lactococcus lactis. In other embodiments, a host cell does not comprise KivD from Lactococcus lactis.
KivD from Lactococcus lactis can comprise the amino acid sequence of UniProtKB—Q684J7 (SEQ ID NO: 29):
In some embodiments, the amino acid sequence of SEQ ID NO: 29 is encoded by the nucleic acid sequence:
In some embodiments, a host cell that expresses a heterologous polynucleotide encoding a KivD enzyme may increase conversion of ketoisocaproate to isovaleraldehyde by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more) relative to a control. In some embodiments, a control is a host cell that expresses a heterologous polynucleotide encoding SEQ ID NO: 29. In some embodiments, the control is an E. coli Nissle strain SYN1980 ΔleuE, ΔilvC, lacZ:tetR-Ptet-livKHMGF, tetR-Ptet-leuDH(Bc)-kivD-adh2-brnQ-rrnB ter (pSC101), such as is described in U.S. Patent Application Publication No. US20170232043.
In some embodiments, a KivD enzyme comprises a sequence that is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or is 100% identical to SEQ ID NO: 29, any one of SEQ ID NO: 14, 16, 18, or 533-588, any one of SEQ ID NO: 13, 15, 17 or 477-532, an amino acid or polynucleotide sequence encoding a KivD enzyme in Table 3 or Table 5, or a KivD enzyme otherwise described in this disclosure.
In some embodiments, a KivD enzyme comprises at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, at least 39, at least 40, at least 41, at least 42, at least 43, at least 44, at least 45, at least 46, at least 47, at least 48, at least 49, at least 50, at least 60, at least 70, at least 80, at least 90, or at least 100 amino acid substitutions, deletions, insertions, or additions relative to SEQ ID NO: 29, any one of SEQ ID NO: 14, 16, 18, or 533-588, a KivD enzyme in Table 3 or Table 5, or a KivD enzyme otherwise described in this disclosure.
In some embodiments, a KivD enzyme comprises: Y at a residue corresponding to residue 33 in SEQ ID NO: 29; Q at a residue corresponding to residue 44 in SEQ ID NO: 29; M at a residue corresponding to residue 117 in SEQ ID NO: 29; I at a residue corresponding to residue 129 in SEQ ID NO: 29; W at a residue corresponding to residue 185 in SEQ ID NO: 29; I at a residue corresponding to residue 190 in SEQ ID NO: 29; I at a residue corresponding to residue 225 in SEQ ID NO: 29; Y at a residue corresponding to residue 227 in SEQ ID NO: 29; L at a residue corresponding to residue 311 in SEQ ID NO: 29; G at a residue corresponding to residue 312 in SEQ ID NO: 29; T at a residue corresponding to residue 313 in SEQ ID NO: 29; P at a residue corresponding to residue 328 in SEQ ID NO: 29; W at a residue corresponding to residue 341 in SEQ ID NO: 29; H at a residue corresponding to residue 345 in SEQ ID NO: 29; C at a residue corresponding to residue 347 in SEQ ID NO: 29; R at a residue corresponding to residue 420 in SEQ ID NO: 29; D at a residue corresponding to residue 494 in SEQ ID NO: 29; C at a residue corresponding to residue 508 in SEQ ID NO: 29; and/or F at a residue corresponding to residue 550 in SEQ ID NO: 29.
In some embodiments, a KivD enzyme comprises: Y at a residue corresponding to residue 33 in SEQ ID NO: 29; Q at a residue corresponding to residue 44 in SEQ ID NO: 29; M at a residue corresponding to residue 117 in SEQ ID NO: 29; I at a residue corresponding to residue 129 in SEQ ID NO: 29; W at a residue corresponding to residue 185 in SEQ ID NO: 29; I at a residue corresponding to residue 190 in SEQ ID NO: 29; I at a residue corresponding to residue 225 in SEQ ID NO: 29; Y at a residue corresponding to residue 227 in SEQ ID NO: 29; L at a residue corresponding to residue 311 in SEQ ID NO: 29; G at a residue corresponding to residue 312 in SEQ ID NO: 29; T at a residue corresponding to residue 313 in SEQ ID NO: 29; P at a residue corresponding to residue 328 in SEQ ID NO: 29; W at a residue corresponding to residue 341 in SEQ ID NO: 29; H at a residue corresponding to residue 345 in SEQ ID NO: 29; C at a residue corresponding to residue 347 in SEQ ID NO: 29; R at a residue corresponding to residue 420 in SEQ ID NO: 29; D at a residue corresponding to residue 494 in SEQ ID NO: 29; C at a residue corresponding to residue 508 in SEQ ID NO: 29; and F at a residue corresponding to residue 550 in SEQ ID NO: 29.
As used in this disclosure “alcohol dehydrogenase (Adh)” refers to an enzyme that catalyzes the conversion of ethanol to acetaldehyde. An Adh may use isovaleraldehyde as a substrate. In some embodiments, Adh produces isopentanol from isovaleraldehyde.
In some embodiments, a host cell comprises an Adh enzyme and/or a heterologous polynucleotide encoding such an enzyme. In some embodiments, a host cell comprises a heterologous polynucleotide encoding an Adh enzyme comprising an amino acid sequence that is at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to any one of SEQ ID NO: 20, 22, 24, or 645-700, an Adh enzyme in Table 3 or Table 6, or an Adh enzyme otherwise described in this disclosure. In some embodiments, a host cell comprises a heterologous polynucleotide that is at least 90% (e.g., at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to any one of SEQ ID NO: 19, 21, 23 or 589-644, a polynucleotide encoding a Adh enzyme in Table 3 or Table 6, or an Adh enzyme otherwise described in this disclosure.
In some embodiments, a host cell comprises Adh from Saccharomyces cerevisiae. In other embodiments, a host cell does not comprise Adh from Saccharomyces cerevisiae.
Adh from Saccharomyces cerevisiae can comprises the amino acid sequence of UniProtKB—P00331 (SEQ ID NO: 31):
In some embodiments, the amino acid sequence of SEQ ID NO: 3 μs encoded by the nucleic acid sequence:
In some embodiments, a host cell that expresses a heterologous polynucleotide encoding an Adh enzyme may increase conversion of isovaleraldehyde to isopentanol by 0.5-fold, 1-fold, 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, 5-fold, 5.5-fold, or 6-fold more (e.g., 2-fold to 6-fold more) relative to a control. In some embodiments, a control is a host cell that expresses a heterologous polynucleotide encoding SEQ ID NO: 31. In some embodiments, a control is a host cell that expresses a heterologous polynucleotide encoding SEQ ID NO: 31. In some embodiments, the control is an E. coli Nissle strain SYN1980 ΔleuE, ΔilvC, lacZ:tetR-Ptet-livKHMGF, tetR-Ptet-leuDH(Bc)-kivD-adh2-brnQ-rrnB ter (pSC101), such as is described in U.S. Patent Application Publication No. US20170232043.
In some embodiments, an Adh comprises a sequence that is at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or is 100% identical to SEQ ID NO: 31, any one of SEQ ID NO: 20, 22, 24, or 645-700, any one of SEQ ID NO: 19, 21, 23 or 589-644, an amino acid or polynucleotide sequence encoding a Adh enzyme in Table 3 or Table 6, or an Adh enzyme otherwise disclosed in this disclosure.
In some embodiments, an Adh comprises at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 31, at least 32, at least 33, at least 34, at least 35, at least 36, at least 37, at least 38, at least 39, at least 40, at least 41, at least 42, at least 43, at least 44, at least 45, at least 46, at least 47, at least 48, at least 49, at least 50, at least 60, at least 70, at least 80, at least 90, or at least 100 amino acid substitutions, deletions, insertions, or additions relative to SEQ ID NO: 31, any one of SEQ ID NO: 20, 22, 24, or 645-700, an Adh enzyme in Table 3 or Table 6, or an Adh enzyme otherwise disclosed in this disclosure.
In some embodiments, an Adh comprises P at a residue corresponding to residue 9 in SEQ ID NO: 31; G at a residue corresponding to residue 16 in SEQ ID NO: 31; Q at a residue corresponding to residue 23 in SEQ ID NO: 31; R at a residue corresponding to residue 28 in SEQ ID NO: 31; A at a residue corresponding to residue 30 in SEQ ID NO: 31; K at a residue corresponding to residue 93 in SEQ ID NO: 31; L at a residue corresponding to residue 98 in SEQ ID NO: 31; R at a residue corresponding to residue 99 in SEQ ID NO: 31; P at a residue corresponding to residue 114 in SEQ ID NO: 31; K at a residue corresponding to residue 115 in SEQ ID NO: 31; Y at a residue corresponding to residue 119 in SEQ ID NO: 31; Y at a residue corresponding to residue 194 in SEQ ID NO: 31; P at a residue corresponding to residue 242 in SEQ ID NO: 31; K at a residue corresponding to residue 249 in SEQ ID NO: 31; E at a residue corresponding to residue 255 in SEQ ID NO: 31; D at a residue corresponding to residue 260 in SEQ ID NO: 31; H at a residue corresponding to residue 269 in SEQ ID NO: 31; Q at a residue corresponding to residue 281 in SEQ ID NO: 31; L at a residue corresponding to residue 325 in SEQ ID NO: 31; M at a residue corresponding to residue 333 in SEQ ID NO: 31; P at a residue corresponding to residue 334 in SEQ ID NO: 31; and/or Q at a residue corresponding to residue 348 in SEQ ID NO: 31.
In some embodiments, an Adh comprises P at a residue corresponding to residue 9 in SEQ ID NO: 31; G at a residue corresponding to residue 16 in SEQ ID NO: 31; Q at a residue corresponding to residue 23 in SEQ ID NO: 31; R at a residue corresponding to residue 28 in SEQ ID NO: 31; A at a residue corresponding to residue 30 in SEQ ID NO: 31; K at a residue corresponding to residue 93 in SEQ ID NO: 31; L at a residue corresponding to residue 98 in SEQ ID NO: 31; R at a residue corresponding to residue 99 in SEQ ID NO: 31; P at a residue corresponding to residue 114 in SEQ ID NO: 31; K at a residue corresponding to residue 115 in SEQ ID NO: 31; Y at a residue corresponding to residue 119 in SEQ ID NO: 31; Y at a residue corresponding to residue 194 in SEQ ID NO: 31; P at a residue corresponding to residue 242 in SEQ ID NO: 31; K at a residue corresponding to residue 249 in SEQ ID NO: 31; E at a residue corresponding to residue 255 in SEQ ID NO: 31; D at a residue corresponding to residue 260 in SEQ ID NO: 31; H at a residue corresponding to residue 269 in SEQ ID NO: 31; Q at a residue corresponding to residue 281 in SEQ ID NO: 31; L at a residue corresponding to residue 325 in SEQ ID NO: 31; M at a residue corresponding to residue 333 in SEQ ID NO: 31; P at a residue corresponding to residue 334 in SEQ ID NO: 31; and Q at a residue corresponding to residue 348 in SEQ ID NO: 31.
As used in this disclosure “Branched-chain amino acid transport system 2 carrier protein (BrnQ)” refers to a component of the LIV-II transport system for branched-chain amino acids. BrnQ may be used to transport a branched-chain amino acid, e.g., leucine, into a cell such as a host cell.
In some embodiments, a host cell comprises a BrnQ protein and/or a heterologous polynucleotide encoding such a protein. In some embodiments, a host cell comprises a heterologous polynucleotide encoding a BrnQ protein comprising an amino acid sequence that is at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) identical to a BrnQ protein as described in this application, e.g., SEQ ID NO: 35. In some embodiments, the BrnQ protein comprises the amino acid sequence set forth in UniProtKB—B7MD59.
UniProtKB—B7MD59 has the amino acid sequence:
In some embodiments, SEQ ID NO: 35 is encoded by the nucleic acid sequence:
Variants of enzymes and proteins described in this disclosure (e.g., LeuDH, KivD, or Adh and including variants to nucleic acid and amino acid sequences) are also encompassed by the present disclosure. A variant may share at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity with a reference sequence, including all values in between.
Unless otherwise noted, the term “sequence identity,” as known in the art, refers to a relationship between the sequences of two polypeptides or polynucleotides, as determined by sequence comparison (alignment). In some embodiments, sequence identity is determined across the entire length of a sequence (e.g., LeuDH, KivD, or Adh sequence). In some embodiments, sequence identity is determined over a region (e.g., a stretch of amino acids or nucleic acids, e.g., the sequence spanning an active site) of a sequence (e.g., LeuDH, KivD, or Adh sequence).
Identity can also refer to the degree of sequence relatedness between two sequences as determined by the number of matches between strings of two or more residues (e.g., nucleic acid or amino acid residues). Identity measures the percent of identical matches between two or more sequences with gap alignments (if any) addressed by a particular mathematical model or computer program (e.g., algorithms).
Identity of related polypeptides or nucleic acid sequences can be readily calculated by any of the methods known to one of ordinary skill in the art. The “percent identity” of two sequences (e.g., nucleic acid or amino acid sequences) may, for example, be determined using the algorithm of Karlin and Altschul Proc. Natl. Acad. Sci. USA 87:2264-68, 1990, modified as in Karlin and Altschul Proc. Natl. Acad. Sci. USA 90:5873-77, 1993. Such an algorithm is incorporated into the NBLAST® and XBLAST® programs (version 2.0) of Altschul et al., J. Mol. Biol. 215:403-10, 1990. BLAST® protein searches can be performed, for example, with the XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to the proteins described in this application. Where gaps exist between two sequences, Gapped BLAST® can be utilized, for example, as described in Altschul et al., Nucleic Acids Res. 25(17):3389-3402, 1997. When utilizing BLAST® and Gapped BLAST® programs, the default parameters of the respective programs (e.g., XBLAST® and NBLAST®) can be used, or the parameters can be adjusted appropriately as would be understood by one of ordinary skill in the art.
Another local alignment technique which may be used, for example, is based on the Smith-Waterman algorithm (Smith, T. F. & Waterman, M. S. (1981) “Identification of common molecular subsequences.” J. Mol. Biol. 147:195-197). A general global alignment technique which may be used, for example, is the Needleman—Wunsch algorithm (Needleman, S. B. & Wunsch, C. D. (1970) “A general method applicable to the search for similarities in the amino acid sequences of two proteins.” J. Mol. Biol. 48:443-453), which is based on dynamic programming.
More recently, a Fast Optimal Global Sequence Alignment Algorithm (FOGSAA) was developed that purportedly produces global alignment of nucleic acid and amino acid sequences faster than other optimal global alignment methods, including the Needleman-Wunsch algorithm. In some embodiments, the identity of two polypeptides is determined by aligning the two amino acid sequences, calculating the number of identical amino acids, and dividing by the length of one of the amino acid sequences. In some embodiments, the identity of two nucleic acids is determined by aligning the two nucleotide sequences and calculating the number of identical nucleotide and dividing by the length of one of the nucleic acids.
For multiple sequence alignments, computer programs including Clustal Omega (Sievers et al., Mol Syst Biol. 2011 Oct. 11; 7:539) may be used.
In preferred embodiments, a sequence, including a nucleic acid or amino acid sequence, is found to have a specified percent identity to a reference sequence, such as a sequence disclosed in this application and/or recited in the claims when sequence identity is determined using the algorithm of Karlin and Altschul Proc. Natl. Acad. Sci. USA 87:2264-68, 1990, modified as in Karlin and Altschul Proc. Natl. Acad. Sci. USA 90:5873-77, 1993 (e.g., BLAST®, NBLAST®, XBLAST® or Gapped BLAST® programs, using default parameters of the respective programs).
In some embodiments, a sequence, including a nucleic acid or amino acid sequence, is found to have a specified percent identity to a reference sequence, such as a sequence disclosed in this application and/or recited in the claims when sequence identity is determined using the Smith-Waterman algorithm (Smith, T. F. & Waterman, M. S. (1981) “Identification of common molecular subsequences.” J. Mol. Biol. 147:195-197) or the Needleman-Wunsch algorithm (Needleman, S. B. & Wunsch, C. D. (1970) “A general method applicable to the search for similarities in the amino acid sequences of two proteins.” J. Mol. Biol. 48:443-453) using default parameters.
In some embodiments, a sequence, including a nucleic acid or amino acid sequence, is found to have a specified percent identity to a reference sequence, such as a sequence disclosed in this application and/or recited in the claims when sequence identity is determined using a Fast Optimal Global Sequence Alignment Algorithm (FOGSAA) using default parameters.
In some embodiments, a sequence, including a nucleic acid or amino acid sequence, is found to have a specified percent identity to a reference sequence, such as a sequence disclosed in this application and/or recited in the claims when sequence identity is determined using Clustal Omega (Sievers et al., Mol Syst Biol. 2011 Oct. 11; 7:539) using default parameters.
As used in this disclosure, a residue (such as a nucleic acid residue or an amino acid residue) in sequence “X” is referred to as corresponding to a position or residue (such as a nucleic acid residue or an amino acid residue) “Z” in a different sequence “Y” when the residue in sequence “X” is at the counterpart position of “Z” in sequence “Y” when sequences X and Y are aligned using amino acid sequence alignment tools known in the art, such as, for example, Clustal Omega or BLAST®.
As used in this disclosure, variant sequences may be homologous sequences. As used in this disclosure, homologous sequences are sequences (e.g., nucleic acid or amino acid sequences) that share a certain percent identity (e.g., at least 5%, at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% percent identity, including all values in between). Homologous sequences include but are not limited to paralogous or orthologous sequences. Paralogous sequences arise from duplication of a gene within a genome of a species, while orthologous sequences diverge after a speciation event.
In some embodiments, a polypeptide variant (e.g., LeuDH, KivD, or Adh enzyme variant) comprises a domain that shares a secondary structure (e.g., alpha helix, beta sheet) with a reference polypeptide (e.g., a reference LeuDH, KivD, or Adh enzyme). In some embodiments, a polypeptide variant (e.g., LeuDH, KivD, or Adh enzyme variant) shares a tertiary structure with a reference polypeptide (e.g., a reference LeuDH, KivD, or Adh enzyme). As a non-limiting example, a variant polypeptide (e.g., LeuDH, KivD, or Adh enzyme variant) may have low primary sequence identity (e.g., less than 80%, less than 75%, less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, less than 40%, less than 35%, less than 30%, less than 25%, less than 20%, less than 15%, less than 10%, or less than 5% sequence identity) compared to a reference polypeptide, but share one or more secondary structures (e.g., including but not limited to loops, alpha helices, or beta sheets), or have the same tertiary structure as a reference polypeptide. For example, a loop may be located between a beta sheet and an alpha helix, between two alpha helices, or between two beta sheets. Homology modeling may be used to compare two or more tertiary structures.
Any suitable method, including circular permutation (Yu and Lutz, Trends Biotechnol. 2011 January; 29(1):18-25), may be used to produce such variants. In circular permutation, the linear primary sequence of a polypeptide can be circularized (e.g., by joining the N-terminal and C-terminal ends of the sequence) and the polypeptide can be severed (“broken”) at a different location. Thus, the linear primary sequence of the new polypeptide may have low sequence identity (e.g., less than 80%, less than 75%, less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, less than 40%, less than 35%, less than 30%, less than 25%, less than 20%, less than 15%, less than 10%, less or less than 5%, including all values in between) as determined by linear sequence alignment methods (e.g., Clustal Omega or BLAST). Topological analysis of the two polypeptides, however, may reveal that their tertiary structure is similar. Without being bound by a particular theory, a variant polypeptide created through circular permutation of a reference polypeptide and with a tertiary structure similar to the reference polypeptide can share similar functional characteristics (e.g., enzymatic activity, enzyme kinetics, substrate specificity or product specificity). In some instances, circular permutation may alter the secondary structure, tertiary structure or quaternary structure and produce an enzyme with different functional characteristics (e.g., increased or decreased enzymatic activity, different substrate specificity, or different product specificity). See, e.g., Yu and Lutz, Trends Biotechnol. 2011 January; 29(1):18-25.
It should be appreciated that in a protein that has undergone circular permutation, the linear amino acid sequence of the protein would differ from a reference protein that has not undergone circular permutation. However, one of ordinary skill in the art would be able to readily determine which residues in the protein that has undergone circular permutation correspond to residues in the reference protein that has not undergone circular permutation by, for example, aligning the sequences and detecting conserved motifs, and/or by comparing the structures or predicted structures of the proteins, e.g., by homology modeling. Variants described in this application include circularly permutated variants of sequences described in this application.
In some embodiments, an algorithm that determines the percent identity between a sequence of interest and a reference sequence described in this application accounts for the presence of circular permutation between the sequences. The presence of circular permutation may be detected using any method known in the art, including, for example, RASPODOM (Weiner et al., Bioinformatics. 2005 Apr. 1; 21(7):932-7). In some embodiments, the presence of circulation permutation is corrected for (e.g., the domains in at least one sequence are rearranged) prior to calculation of the percent identity between a sequence of interest and a sequence described in this application. The claims of this application should be understood to encompass sequences for which percent identity to a reference sequence is calculated after taking into account potential circular permutation of the sequence.
Functional variants of the recombinant LeuDH, KivD, or Adh enzyme disclosed in this application are also encompassed by the present disclosure. For example, functional variants may bind one or more of the same substrates or produce one or more of the same products. Functional variants may be identified using any method known in the art. For example, the algorithm of Karlin and Altschul Proc. Natl. Acad. Sci. USA 87:2264-68, 1990 described above may be used to identify homologous proteins with known functions.
Putative functional variants may also be identified by searching for polypeptides with functionally annotated domains. Databases including Pfam (Sonnhammer et al., Proteins. 1997 July; 28(3):405-20) may be used to identify polypeptides with a particular domain.
Homology modeling may also be used to identify amino acid residues that are amenable to mutation without affecting function. A non-limiting example of such a method may include use of position-specific scoring matrix (PSSM) and an energy minimization protocol.
Position-specific scoring matrix (PSSM) uses a position weight matrix to identify consensus sequences (e.g., motifs). PSSM can be conducted on nucleic acid or amino acid sequences. Sequences are aligned and the method takes into account the observed frequency of a particular residue (e.g., an amino acid or a nucleotide) at a particular position and the number of sequences analyzed. See, e.g., Stormo et al., Nucleic Acids Res. 1982 May 11; 10(9):2997-3011. The likelihood of observing a particular residue at a given position can be calculated. Without being bound by a particular theory, positions in sequences with high variability may be amenable to mutation (e.g., PSSM score ≥0) to produce functional homologs.
PSSM may be paired with calculation of a Rosetta energy function, which determines the difference between the wild-type and the single-point mutant. The Rosetta energy function calculates this difference as (ΔΔGcalc). With the Rosetta function, the bonding interactions between a mutated residue and the surrounding atoms are used to determine whether a mutation increases or decreases protein stability. For example, a mutation that is designated as favorable by the PSSM score (e.g. PSSM score ≥0), can then be analyzed using the Rosetta energy function to determine the potential impact of the mutation on protein stability. Without being bound by a particular theory, potentially stabilizing mutations are desirable for protein engineering (e.g., production of functional homologs). In some embodiments, a potentially stabilizing mutation has a ΔΔGcalc value of less than −0.1 (e.g., less than −0.2, less than −0.3, less than −0.35, less than −0.4, less than −0.45, less than −0.5, less than −0.55, less than −0.6, less than −0.65, less than −0.7, less than −0.75, less than −0.8, less than −0.85, less than −0.9, less than −0.95, or less than −1.0) Rosetta energy units (R.e.u.). See, e.g., Goldenzweig et al., Mol Cell. 2016 Jul. 21; 63(2):337-346. Doi: 10.1016/j.molcel.2016.06.012.
In some embodiments, a LeuDH, KivD, or Adh enzyme coding sequence comprises a mutation at 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 or more than 100 positions corresponding to a reference (e.g., LeuDH, KivD, or Adh enzyme) coding sequence. In some embodiments, the LeuDH, KivD, or Adh enzyme coding sequence comprises a mutation in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 or more codons of the coding sequence relative to a reference (e.g., LeuDH, KivD, or Adh enzyme) coding sequence. As will be understood by one of ordinary skill in the art, a mutation within a codon may or may not change the amino acid that is encoded by the codon due to degeneracy of the genetic code. In some embodiments, the one or more mutations in the coding sequence do not alter the amino acid sequence of the coding sequence (e.g., LeuDH, KivD, or Adh enzyme) relative to the amino acid sequence of a reference polypeptide (e.g., LeuDH, KivD, or Adh enzyme).
In some embodiments, the one or more mutations in a recombinant LeuDH, KivD, or Adh enzyme sequence alters the amino acid sequence of the polypeptide (e.g., LeuDH, KivD, or Adh enzyme) relative to the amino acid sequence of a reference polypeptide (e.g., LeuDH, KivD, or Adh enzyme). In some embodiments, the one or more mutations alters the amino acid sequence of the recombinant polypeptide (e.g., LeuDH, KivD, or Adh enzyme) relative to the amino acid sequence of a reference polypeptide (e.g., LeuDH, KivD, or Adh enzyme) and alters (enhances or reduces) an activity of the polypeptide relative to the reference polypeptide.
The activity (e.g., specific activity) of any of the recombinant polypeptides described in this disclosure (e.g., LeuDH, KivD, or Adh enzyme) may be measured using routine methods. As a non-limiting example, a recombinant polypeptide's activity may be determined by measuring its substrate specificity, product(s) produced, the concentration of product(s) produced, or any combination thereof. As used in this disclosure, “specific activity” of a recombinant polypeptide refers to the amount (e.g., concentration) of a particular product produced for a given amount (e.g., concentration) of the recombinant polypeptide per unit time.
The skilled artisan will also realize that mutations in a recombinant polypeptide (e.g., LeuDH, KivD, or Adh enzyme) coding sequence may result in conservative amino acid substitutions to provide functionally equivalent variants of the foregoing polypeptides, e.g., variants that retain the activities of the polypeptides. As used in this disclosure, a “conservative amino acid substitution” refers to an amino acid substitution that does not alter the relative charge or size characteristics or functional activity of the protein in which the amino acid substitution is made.
In some instances, an amino acid is characterized by its R group (see, e.g., Table 1). For example, an amino acid may comprise a nonpolar aliphatic R group, a positively charged R group, a negatively charged R group, a nonpolar aromatic R group, or a polar uncharged R group. Non-limiting examples of an amino acid comprising a nonpolar aliphatic R group include alanine, glycine, valine, leucine, methionine, and isoleucine. Non-limiting examples of an amino acid comprising a positively charged R group include lysine, arginine, and histidine. Non-limiting examples of an amino acid comprising a negatively charged R group include aspartate and glutamate. Non-limiting examples of an amino acid comprising a nonpolar, aromatic R group include phenylalanine, tyrosine, and tryptophan. Non-limiting examples of an amino acid comprising a polar uncharged R group include serine, threonine, cysteine, proline, asparagine, and glutamine.
Variants can be prepared according to methods for altering polypeptide sequence known to one of ordinary skill in the art such as are found in references which compile such methods, e.g., Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Fourth Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 2012, or Current Protocols in Molecular Biology, F. M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York, 2010.
Non-limiting examples of functionally equivalent variants of polypeptides may include conservative amino acid substitutions in the amino acid sequences of proteins disclosed in this application. As used in this disclosure “conservative substitution” is used interchangeably with “conservative amino acid substitution” and refers to any one of the amino acid substitutions provided in Table 1.
In some embodiments, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more than 20 residues can be changed when preparing variant polypeptides. In some embodiments, amino acids are replaced by conservative amino acid substitutions.
Amino acid substitutions in the amino acid sequence of a polypeptide to produce a recombinant polypeptide (e.g., LeuDH, KivD, or Adh enzyme) variant having a desired property and/or activity can be made by alteration of the coding sequence of the polypeptide (e.g., LeuDH, KivD, or Adh enzyme). Similarly, conservative amino acid substitutions in the amino acid sequence of a polypeptide to produce functionally equivalent variants of the polypeptide typically are made by alteration of the coding sequence of the recombinant polypeptide (e.g., LeuDH, KivD, or Adh enzyme).
Mutations (e.g., substitutions) can be made in a nucleotide sequence by a variety of methods known to one of ordinary skill in the art. For example, mutations can be made by PCR-directed mutation, site-directed mutagenesis according to the method of Kunkel (Kunkel, Proc. Nat. Acad. Sci. U.S.A. 82: 488-492, 1985), by chemical synthesis of a gene encoding a polypeptide, by gene editing techniques, or by insertions, such as insertion of a tag (e.g., a HIS tag or a GFP tag).
Aspects of the present disclosure relate to recombinant enzymes, functional modifications and variants thereof, as well as uses relating thereto. For example, the enzymes and cells described in this application may be used to promote leucine consumption, e.g., by converting leucine to isopentanol. The methods may comprise using a host cell comprising one or more enzymes disclosed in this application, a cell lysate, isolated enzymes, or any combination thereof. Methods comprising recombinant expression of polynucleotides encoding an enzyme disclosed in this application in a host cell are encompassed by the present disclosure. Methods comprising administering a host cell comprising at least one BCAA pathway enzyme (e.g., LeuDH, KivD, or Adh enzyme) to a subject in need thereof are encompassed by the present disclosure. In vitro methods comprising reacting one or more branched-chain amino acids (BCAAs) in a reaction mixture with a BCAA pathway enzyme disclosed in this application are also encompassed by the present disclosure. In some embodiments, the BCAA pathway enzyme is an LeuDH, KivD, or Adh enzyme, or a combination thereof.
A nucleic acid encoding any one or more of the recombinant polypeptides (e.g., LeuDH, KivD, Adh, and/or BrnQ) is encompassed by the disclosure and may be comprised within a host cell. In some embodiments, the nucleic acid is in the form of an operon. In some embodiments, at least one ribosome binding site is present between one or more of the coding sequences present in the nucleic acid.
In some embodiments, LeuDH, KivD, Adh, and/or BrnQ nucleic acid sequences encompassed by the disclosure are nucleic acid sequences that hybridize to a LeuDH, KivD, Adh, and/or BrnQ nucleic acid sequence provided in this disclosure under high or medium stringency conditions and that are biologically active. For example, nucleic acids that hybridize under high stringency conditions of 0.2 to 1×SSC at 65° C. followed by a wash at 0.2×SSC at 65° C. to a nucleic acid encoding LeuDH, KivD, Adh, and/or BrnQ can be used. Nucleic acids that hybridize under low stringency conditions of 6×SSC at room temperature followed by a wash at 2×SSC at room temperature to a nucleic acid encoding LeuDH, KivD, Adh, and/or BrnQ can be used. Other hybridization conditions include 3×SSC at 40° C. or 50° C., followed by a wash in 1 or 2×SSC at 20° C., 30° C., 40° C., 50° C., 60° C., or 65° C.
Hybridizations can be conducted in the presence of formaldehyde, e.g., 10%, 20%, 30% 40% or 50%, which further increases the stringency of hybridization. Theory and practice of nucleic acid hybridization is described, e.g., in S. Agrawal (ed.) Methods in Molecular Biology, volume 20; and Tijssen (1993) Laboratory Techniques in biochemistry and molecular biology-hybridization with nucleic acid probes, e.g., part I chapter 2 “Overview of principles of hybridization and the strategy of nucleic acid probe assays,” Elsevier, New York provide a basic guide to nucleic acid hybridization. Exemplary proteins may have at least about 50%, 70%, 80%, 90%, preferably at least about 95%, even more preferably at least about 98% and most preferably at least 99% homology or identity with a LeuDH, KivD, or Adh protein or a domain thereof, e.g., the catalytic domain. Other exemplary proteins may be encoded by a nucleic acid that is at least about 90%, preferably at least about 95%, even more preferably at least about 98% and most preferably at least 99% homology or identity with a LeuDH, KivD, or Adh nucleic acid, e.g., those described in this application.
A nucleic acid encoding any one or more of the recombinant polypeptides (e.g., LeuDH, KivD, Adh and/or BrnQ) described in this application may be incorporated into any appropriate vector through any method known in the art. For example, the vector may be an expression vector, including but not limited to a viral vector (e.g., a lentiviral, retroviral, adenoviral, or adeno-associated viral vector), any vector suitable for transient expression, any vector suitable for constitutive expression, or any vector suitable for inducible expression (e.g., a galactose-inducible or doxycycline-inducible vector).
In some embodiments, a vector replicates autonomously in the cell. In some embodiments, a vector integrates into a chromosome within a cell. A vector can contain one or more endonuclease restriction sites that are cut by a restriction endonuclease to insert and ligate a nucleic acid containing a gene described in this application to produce a recombinant vector that is able to replicate in a cell. Vectors are typically composed of DNA, although RNA vectors are also available. Cloning vectors include, but are not limited to: plasmids, fosmids, phagemids, virus genomes and artificial chromosomes. As used in this application, the terms “expression vector” or “expression construct” refer to a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a host cell (e.g., microbe), such as a yeast cell. In some embodiments, the nucleic acid sequence of a gene described in this application is inserted into a cloning vector such that it is operably joined to regulatory sequences and, in some embodiments, expressed as an RNA transcript. In some embodiments, the vector contains one or more markers, such as a selectable marker as described in this application, to identify cells transformed or transfected with the recombinant vector. In some embodiments, the nucleic acid sequence of a gene described in this application is codon-optimized. Codon optimization may increase production of the gene product by at least 10%, at least 15%, at least 20%, at least 25%, at least 30%, at least 35%, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or 100%, including all values in between) relative to a reference sequence that is not codon-optimized.
In some embodiments, nucleic acid sequences described in this application are expressed in plasmids. For example, nucleic acid sequences described in this application may be expressed in cloning plasmids. Nucleic acid sequences described in this application may be expressed in plasmids for transient expression. Nucleic acid sequences described in this application may also be expressed in plasmids for incorporation of the nucleic acid sequences into genomic DNA.
A coding sequence and a regulatory sequence are said to be “operably joined” or “operably linked” when the coding sequence and the regulatory sequence are covalently linked and the expression or transcription of the coding sequence is under the influence or control of the regulatory sequence. If the coding sequence is to be translated into a functional protein, the coding sequence and the regulatory sequence are said to be operably joined if induction of a promoter in the 5′ regulatory sequence permits the coding sequence to be transcribed and if the nature of the linkage between the coding sequence and the regulatory sequence does not (1) result in the introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region to direct the transcription of the coding sequence, or (3) interfere with the ability of the corresponding RNA transcript to be translated into a protein.
In some embodiments, the nucleic acid encoding any one or more of the proteins described in this application is under the control of regulatory sequences (e.g., enhancer sequences). In some embodiments, a nucleic acid is expressed under the control of a promoter. The promoter can be a native promoter, e.g., the promoter of the gene in its endogenous context, which provides normal regulation of expression of the gene.
Alternatively, a promoter can be a promoter that is different from the native promoter of the gene, e.g., the promoter is different from the promoter of the gene in its endogenous context. In some embodiments, the promoter is a eukaryotic promoter. Non-limiting examples of eukaryotic promoters include TDH3, PGK1, PKC1, PDC1, TEF1, TEF2, RPL18B, SSA1, TDH2, PYK1,TPI1 GAL1, GAL10, GALT, GAL3, GAL2, MET3, MET25, HXT3, HXT7, ACT1, ADH1, ADH2, CUP1-1, ENO2, and SOD1, as would be known to one of ordinary skill in the art (see, e.g., Addgene website: blog.addgene.org/plasmids-101-the-promoter-region). In some embodiments, the promoter is a prokaryotic promoter (e.g., bacteriophage or bacterial promoter). Non-limiting examples of bacteriophage promoters include Pls1con, T3, T7, SP6, and PL. Non-limiting examples of bacterial promoters include Pbad, PmgrB, Ptrc2, PCI857, Plac/ara, Plac/fnr, Ptac, Ptet, Pcmt, and Pm.
In some embodiments, the promoter is an inducible promoter. As used in this application, an “inducible promoter” is a promoter controlled by the presence or absence of a molecule. This may be used, for example, to controllably induce the expression of an enzyme. In some embodiments, where an inducible promoter is linked to a LeuDH, a KivD and/or a Adh, the expression of LeuDH, KivD and/or Adh may be induced or not induced at certain times. For example, in some embodiments, expression may not be induced at certain times so that leucine consumption would be limited (e.g., during cell growth). Non-limiting examples of inducible promoters include chemically regulated promoters and physically regulated promoters. For chemically regulated promoters, the transcriptional activity can be regulated by one or more compounds, such as alcohol, tetracycline, galactose, a steroid, a metal, or other compounds. For physically regulated promoters, transcriptional activity can be regulated by a phenomenon such as light or temperature. Non-limiting examples of tetracycline-regulated promoters include anhydrotetracycline (aTc)-responsive promoters and other tetracycline-responsive promoter systems (e.g., a tetracycline repressor protein (tetR), a tetracycline operator sequence (tetO) and a tetracycline transactivator fusion protein (tTA)). Non-limiting examples of steroid-regulated promoters include promoters based on the rat glucocorticoid receptor, human estrogen receptor, moth ecdysone receptors, and promoters from the steroid/retinoid/thyroid receptor superfamily. Non-limiting examples of metal-regulated promoters include promoters derived from metallothionein (proteins that bind and sequester metal ions) genes. Non-limiting examples of pathogenesis-regulated promoters include promoters induced by salicylic acid, ethylene or benzothiadiazole (BTH). Non-limiting examples of temperature/heat-inducible promoters include heat shock promoters. Non-limiting examples of light-regulated promoters include light responsive promoters from plant cells. In certain embodiments, the inducible promoter is a galactose-inducible promoter. In some embodiments, the inducible promoter is induced by one or more physiological conditions (e.g., pH, temperature, radiation, osmotic pressure, saline gradients, cell surface binding, or concentration of one or more extrinsic or intrinsic inducing agents). Non-limiting examples of an extrinsic inducer or inducing agent include amino acids and amino acid analogs, saccharides and polysaccharides, nucleic acids, protein transcriptional activators and repressors, cytokines, toxins, petroleum-based compounds, metal containing compounds, salts, ions, enzyme substrate analogs, hormones or any combination thereof.
In some embodiments, the promoter is a constitutive promoter. As used in this application, a “constitutive promoter” refers to an unregulated promoter that allows continuous transcription of a gene. Non-limiting examples of a constitutive promoter include TDH3, PGK1, PKC1, PDC1, TEF1, TEF2, RPL18B, SSA1, TDH2, PYK1,TPI1, HXT3, HXT7, ACT1, ADH1, ADH2, ENO2, and SOD1.
Other inducible promoters or constitutive promoters known to one of ordinary skill in the art are also contemplated in this application.
The precise nature of the regulatory sequences needed for gene expression may vary between species or cell types, but generally include, as necessary, 5′ non-transcribed and 5′ non-translated sequences involved with the initiation of transcription and translation respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. In particular, such 5′ non-transcribed regulatory sequences will include a promoter region which includes a promoter sequence for transcriptional control of the operably joined gene. Regulatory sequences may also include enhancer sequences or upstream activator sequences. The vectors disclosed in this application may include 5′ leader or signal sequences. Regulatory sequences may also include a terminator sequence. In some embodiments, a terminator sequence marks the end of a gene in DNA during transcription. The choice and design of one or more appropriate vectors suitable for inducing expression of one or more genes described in this application in a heterologous organism is within the ability and discretion of one of ordinary skill in the art.
Expression vectors containing the necessary elements for expression are commercially available and known to one of ordinary skill in the art (see, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, Fourth Edition, Cold Spring Harbor Laboratory Press, 2012).
The disclosed methods and compositions and host cells are exemplified with E. coli cells (e.g., E. coli Nissle 1917), but are, in some embodiments, applicable to other host cells.
Suitable host cells include, but are not limited to: yeast cells, bacterial cells, algal cells, plant cells, fungal cells, insect cells, and animal cells, including mammalian cells. In one illustrative embodiment, suitable host cells include E. coli (e.g., Shuffle™ competent E. coli available from New England BioLabs in Ipswich, Mass. or E. coli Nissle 1917 available from German Collection of Microorganisms and Cell Cultures (DSMZ Braunschweig, E. coli DSM 6601)).
Suitable yeast host cells include, but are not limited to: Candida, Hansenula, Saccharomyces, Schizosaccharomyces, Pichia, Kluyveromyces, and Yarrowia. In some embodiments, the yeast cell is Hansenula polymorpha, Saccharomyces cerevisiae, Saccaromyces carlsbergensis, Saccharomyces diastaticus, Saccharomyces norbensis, Saccharomyces kluyveri, Schizosaccharomyces pombe, Pichia pastoris, Pichia finlandica, Pichia trehalophila, Pichia kodamae, Pichia membranaefaciens, Pichia opuntiae, Pichia thermotolerans, Pichia salictaria, Pichia quercuum, Pichia pijperi, Pichia stipitis, Pichia methanolica, Pichia angusta, Kluyveromyces lactis, Candida albicans, or Yarrowia lipolytica.
In some embodiments, the yeast strain is an industrial polyploid yeast strain. Other non-limiting examples of fungal cells include cells obtained from Aspergillus spp., Penicillium spp., Fusarium spp., Rhizopus spp., Acremonium spp., Neurospora spp., Sordaria spp., Magnaporthe spp., Allomyces spp., Ustilago spp., Botrytis spp., and Trichoderma spp.
In certain embodiments, the host cell is an algal cell such as Chlamydomonas (e.g., C. Reinhardtii) and Phormidium (P. sp. ATCC29409).
In other embodiments, the host cell is a prokaryotic cell. Suitable prokaryotic cells include gram positive, gram negative, and gram-variable bacterial cells. The host cell may be a species of, but not limited to: Agrobacterium, Alicyclobacillus, Anabaena, Anacystis, Acinetobacter, Acidothermus, Arthrobacter, Azobacter, Bacillus, Bifidobacterium, Brevibacterium, Butyrivibrio, Buchnera, Campestris, Campylobacter, Clostridium, Corynebacterium, Chromatium, Coprococcus, Escherichia, Enterococcus, Enterobacter, Erwinia, Fusobacterium, Faecalibacterium, Francisella, Flavobacterium, Geobacillus, Haemophilus, Helicobacter, Klebsiella, Lactobacillus, Lactococcus, Ilyobacter, Micrococcus, Microbacterium, Mesorhizobium, Methylobacterium, Methylobacterium, Mycobacterium, Neisseria, Pantoea, Pseudomonas, Prochlorococcus, Rhodobacter, Rhodopseudomonas, Rhodopseudomonas, Roseburia, Rhodospirillum, Rhodococcus, Scenedesmus, Streptomyces, Streptococcus, Synecoccus, Saccharomonospora, Saccharopolyspora, Staphylococcus, Serratia, Salmonella, Shigella, Thermoanaerobacterium, Tropheryma, Tularensis, Temecula, Thermosynechococcus, Thermococcus, Ureaplasma, Xanthomonas, Xylella, Yersinia, and Zymomonas.
In some embodiments, the bacterial host strain is an industrial strain. Numerous bacterial industrial strains are known and suitable for the methods and compositions described in this application.
In some embodiments, the bacterial host cell is of the Agrobacterium species (e.g., A. radiobacter, A. rhizogenes, A. rubi), the Arthrobacterspecies (e.g., A. aurescens, A. citreus, A. globformis, A. hydrocarboglutamicus, A. mysorens, A. nicotianae, A. paraffineus, A. protophonniae, A. roseoparaffinus, A. sulfureus, A. ureafaciens), the Bacillus species (e.g., B. thuringiensis, B. anthracis, B. megaterium, B. subtilis, B. lentus, B. circulars, B. pumilus, B. lautus, B. coagulans, B. brevis, B. firmus, B. alkaophius, B. licheniformis, B. clausii, B. stearothermophilus, B. halodurans and B. amyloliquefaciens. In particular embodiments, the host cell will be an industrial Bacillus strain including but not limited to B. subtilis, B. pumilus, B. licheniformis, B. megaterium, B. clausii, B. stearothermophilus and B. amyloliquefaciens. In some embodiments, the host cell will be an industrial Clostridium species (e.g., C. acetobutylicum, C. tetani E88, C. lituseburense, C. saccharobutylicum, C. perfringens, C. beijerinckii). In some embodiments, the host cell will be an industrial Corynebacterium species (e.g., C. glutamicum, C. acetoacidophilum). In some embodiments, the host cell will be an industrial Escherichia species (e.g., E. coli). In some embodiments, the host cell will be an industrial Erwinia species (e.g., E. uredovora, E. carotovora, E. ananas, E. herbicola, E. punctata, E. terreus). In some embodiments, the host cell will be an industrial Pantoea species (e.g., P. citrea, P. agglomerans). In some embodiments, the host cell will be an industrial Pseudomonas species, (e.g., P. putida, P. aeruginosa, P. mevalonii). In some embodiments, the host cell will be an industrial Streptococcus species (e.g., S. equisimiles, S. pyogenes, S. uberis). In some embodiments, the host cell will be an industrial Streptomyces species (e.g., S. ambofaciens, S. achromogenes, S. avermitilis, S. coelicolor, S. aureofaciens, S. aureus, S. fungicidicus, S. griseus, S. lividans). In some embodiments, the host cell will be an industrial Zymomonas species (e.g., Z. mobilis, Z. lipolytica), and the like.
The present disclosure is also suitable for use with a variety of animal cell types, including mammalian cells, for example, human (including 293, HeLa, WI38, PER.C6 and Bowes melanoma cells), mouse (including 3T3, NS0, NS1, Sp2/0), hamster (CHO, BHK), monkey (COS, FRhL, Vero), and hybridoma cell lines.
In various embodiments, strains that may be used in the practice of the disclosure including both prokaryotic and eukaryotic strains, and are readily accessible to the public from a number of culture collections such as American Type Culture Collection (ATCC), Deutsche Sammlung von Mikroorganismen and Zellkulturen GmbH (DSM), Centraalbureau Voor Schimmelcultures (CBS), and Agricultural Research Service Patent Culture Collection, Northern Regional Research Center (NRRL). The present disclosure is also suitable for use with a variety of plant cell types.
The term “cell,” as used in this application, may refer to a single cell or a population of cells, such as a population of cells belonging to the same cell line or strain. Use of the singular term “cell” should not be construed to refer explicitly to a single cell rather than a population of cells. The host cell may comprise genetic modifications relative to a wild-type counterpart.
A vector encoding any one or more of the recombinant polypeptides (e.g., LeuDH, KivD, Adh enzyme and/or BrnQ) described in this application may be introduced into a suitable host cell using any method known in the art. Host cells may be cultured under any conditions suitable as would be understood by one of ordinary skill in the art. For example, any media, temperature, and incubation conditions known in the art may be used. For host cells carrying an inducible vector, cells may be cultured with an appropriate inducible agent to promote expression.
Any of the cells disclosed in this application can be cultured in media of any type (rich or minimal) and any composition prior to, during, and/or after contact and/or integration of a nucleic acid. The conditions of the culture or culturing process can be optimized through routine experimentation as would be understood by one of ordinary skill in the art. In some embodiments, the selected media is supplemented with various components. In some embodiments, the concentration and amount of a supplemental component is optimized. In some embodiments, other aspects of the media and growth conditions (e.g., pH, temperature, etc.) are optimized through routine experimentation. In some embodiments, the frequency that the media is supplemented with one or more supplemental components, and the amount of time that the cell is cultured, is optimized.
Culturing of the cells described in this application can be performed in culture vessels known and used in the art. In some embodiments, an aerated reaction vessel (e.g., a stirred tank reactor) is used to culture the cells. In some embodiments, a bioreactor or fermentor is used to culture the cell. Thus, in some embodiments, the cells are used in fermentation. As used in this application, the terms “bioreactor” and “fermentor” are interchangeably used and refer to an enclosure, or partial enclosure, in which a biological, biochemical and/or chemical reaction takes place, involving a living organism or part of a living organism. A “large-scale bioreactor” or “industrial-scale bioreactor” is a bioreactor that is used to generate a product on a commercial or quasi-commercial scale. Large scale bioreactors typically have volumes in the range of liters, hundreds of liters, thousands of liters, or more.
In some embodiments, a bioreactor comprises a cell (e.g., a bacterial cell) or a cell culture (e.g., a bacterial cell culture), such as a cell or cell culture described in this application. In some embodiments, a bioreactor comprises a spore and/or a dormant cell type of an isolated microbe (e.g., a dormant cell in a dry state).
Non-limiting examples of bioreactors include: stirred tank fermentors, bioreactors agitated by rotating mixing devices, chemostats, bioreactors agitated by shaking devices, airlift fermentors, packed-bed reactors, fixed-bed reactors, fluidized bed bioreactors, bioreactors employing wave induced agitation, centrifugal bioreactors, roller bottles, and hollow fiber bioreactors, roller apparatuses (for example benchtop, cart-mounted, and/or automated varieties), vertically-stacked plates, spinner flasks, stirring or rocking flasks, shaken multi-well plates, MD bottles, T-flasks, Roux bottles, multiple-surface tissue culture propagators, modified fermentors, and coated beads (e.g., beads coated with serum proteins, nitrocellulose, or carboxymethyl cellulose to prevent cell attachment).
In some embodiments, the bioreactor includes a cell culture system where the cell (e.g., bacterial cell) is in contact with moving liquids and/or gas bubbles. In some embodiments, the cell or cell culture is grown in suspension. In other embodiments, the cell or cell culture is attached to a solid phase carrier. Non-limiting examples of a carrier system includes microcarriers (e.g., polymer spheres, microbeads, and microdisks that can be porous or non-porous), cross-linked beads (e.g., dextran) charged with specific chemical groups (e.g., tertiary amine groups), 2D microcarriers including cells trapped in nonporous polymer fibers, 3D carriers (e.g., carrier fibers, hollow fibers, multicartridge reactors, and semi-permeable membranes that can comprising porous fibers), microcarriers having reduced ion exchange capacity, encapsulation cells, capillaries, and aggregates. In some embodiments, carriers are fabricated from materials such as dextran, gelatin, glass, or cellulose.
In some embodiments, industrial-scale processes are operated in continuous, semi-continuous or non-continuous modes. Non-limiting examples of operation modes are batch, fed batch, extended batch, repetitive batch, draw/fill, rotating-wall, spinning flask, and/or perfusion mode of operation. In some embodiments, a bioreactor allows continuous or semi-continuous replenishment of the substrate stock, for example a carbohydrate source and/or continuous or semi-continuous separation of the product, from the bioreactor.
In some embodiments, the bioreactor or fermentor includes a sensor and/or a control system to measure and/or adjust reaction parameters. Non-limiting examples of reaction parameters include biological parameters (e.g., growth rate, cell size, cell number, cell density, cell type, or cell state, etc.), chemical parameters (e.g., pH, redox-potential, concentration of reaction substrate and/or product, concentration of dissolved gases, such as oxygen concentration and CO2 concentration, nutrient concentrations, metabolite concentrations, concentration of an oligopeptide, concentration of an amino acid, concentration of a vitamin, concentration of a hormone, concentration of an additive, serum concentration, ionic strength, concentration of an ion, relative humidity, molarity, osmolarity, concentration of other chemicals, for example buffering agents, adjuvants, or reaction by-products), physical/mechanical parameters (e.g., density, conductivity, degree of agitation, pressure, and flow rate, shear stress, shear rate, viscosity, color, turbidity, light absorption, mixing rate, conversion rate, as well as thermodynamic parameters, such as temperature, light intensity/quality, etc.). Sensors to measure the parameters described in this application are well known to one of ordinary skill in the relevant mechanical and electronic arts. Control systems to adjust the parameters in a bioreactor based on the inputs from a sensor described in this application are well known to one of ordinary skill in the art in bioreactor engineering.
In some embodiments, the method involves batch fermentation (e.g., shake flask fermentation). General considerations for batch fermentation (e.g., shake flask fermentation) include the level of oxygen and glucose. For example, batch fermentation (e.g., shake flask fermentation) may be oxygen and glucose limited, so in some embodiments, the capability of a strain to perform in a well-designed fed-batch fermentation is underestimated. Also, the final product may display some differences from the substrate in terms of solubility, toxicity, cellular accumulation and secretion and in some embodiments can have different fermentation kinetics.
In some embodiments, the cells of the present disclosure are adapted to consume leucine in vivo. In some embodiments, the cells are adapted to produce one or more enzymes for leucine consumption via conversion to isopentanol (e.g., LeuDH, KivD, and/or Adh). In such embodiments, the enzyme can catalyze reactions for the consumption of leucine by bioconversion in an in vitro or ex vivo process.
Any of the proteins or enzymes of the present disclosure may be expressed in a host cell. As used in this application, a host cell is a cell that can be used to express at least one heterologous polynucleotide (e.g., encoding a protein or enzyme as described in this application). The term “heterologous” with respect to a polynucleotide, such as a polynucleotide comprising a gene, is used interchangeably with the term “exogenous” and the term “recombinant” and refers to: a polynucleotide that has been artificially supplied to a biological system; a polynucleotide that has been modified within a biological system, or a polynucleotide whose expression or regulation has been manipulated within a biological system. A heterologous polynucleotide that is introduced into or expressed in a host cell may be a polynucleotide that comes from a different organism or species than the host cell, or may be a synthetic polynucleotide, or may be a polynucleotide that is also endogenously expressed in the same organism or species as the host cell. For example, a polynucleotide that is endogenously expressed in a host cell may be considered heterologous when it is situated non-naturally in the host cell; expressed recombinantly in the host cell, either stably or transiently; modified within the host cell; selectively edited within the host cell; expressed in a copy number that differs from the naturally occurring copy number within the host cell; or expressed in a non-natural way within the host cell, such as by manipulating regulatory regions that control expression of the polynucleotide. In some embodiments, a heterologous polynucleotide is a polynucleotide that is endogenously expressed in a host cell but whose expression is driven by a promoter that does not naturally regulate expression of the polynucleotide. In other embodiments, a heterologous polynucleotide is a polynucleotide that is endogenously expressed in a host cell and whose expression is driven by a promoter that does naturally regulate expression of the polynucleotide, but the promoter or another regulatory region is modified. In some embodiments, the promoter is recombinantly activated or repressed. For example, gene-editing based techniques may be used to regulate expression of a polynucleotide, including an endogenous polynucleotide, from a promoter, including an endogenous promoter. See, e.g., Chavez et al., Nat Methods. 2016 July; 13(7): 563-567. A heterologous polynucleotide may comprise a wild-type sequence or a mutant sequence as compared with a reference polynucleotide sequence.
Any suitable host cell may be used to produce any of the recombinant polypeptides (e.g., LeuDH, KivD, and/or Adh) disclosed in this application, including eukaryotic cells or prokaryotic cells.
The present disclosure provides compositions, including pharmaceutical compositions, comprising a host cell described in this application (e.g., a host cell comprising a heterologous polynucleotide encoding at least one enzyme selected from the group consisting of LeuDH, KivD, and Adh) or one or more enzymes described in this application (e.g., LeuDH, KivD, and/or Adh), and optionally a pharmaceutically acceptable excipient.
In certain embodiments, a host cell described in this application is provided in an effective amount in a composition, such as a pharmaceutical composition. In certain embodiments, one or more enzymes described in this application are provided in an effective amount in a composition, such as a pharmaceutical composition. In certain embodiments, the effective amount is a therapeutically effective amount. In certain embodiments, the effective amount is a prophylactically effective amount. In some embodiments, the effective amount is an amount that is sufficient to treat or ameliorate one or more symptoms of MSUD.
In certain embodiments, the subject is an animal. In certain embodiments, the subject is a human. In other embodiments, the subject is a non-human animal. In certain embodiments, the subject is a mammal. In certain embodiments, the subject is a non-human mammal. In some embodiments, the subject is a non-mammal. In certain embodiments, the subject is a domesticated animal, such as a dog, cat, cow, pig, horse, sheep, chicken or goat. In certain embodiments, the subject is a companion animal, such as a dog or cat. In certain embodiments, the subject is a livestock animal, such as a cow, pig, horse, sheep, chicken, or goat. In certain embodiments, the subject is a zoo animal. In another embodiment, the subject is a research animal, such as a rodent (e.g., mouse, rat), dog, pig, or non-human primate.
Compositions, such as pharmaceutical compositions, described in this application can be prepared by any method known in the art. In general, such preparatory methods include bringing a compound described in this application (e.g., the “active ingredient”) into association with a carrier or excipient, and/or one or more other accessory ingredients, and then, if necessary and/or desirable, shaping, and/or packaging the product into a desired single- or multi-dose unit.
In some aspects, the disclosure provides methods of using host cells. In some embodiments, the disclosure provides a method comprising culturing a host cell described in this application (e.g., a host cell comprising a heterologous polynucleotide encoding at least one enzyme selected from the group consisting of LeuDH, KivD, and Adh). Methods for culturing cells are described elsewhere in this application. In some embodiments, the disclosure provides a method of producing isopentanol from leucine comprising culturing a host cell described in this application (e.g., a host cell comprising a heterologous polynucleotide encoding LeuDH, KivD, and Adh). In some embodiments, the production and culturing occurs in vivo, e.g., in a human subject that has been administered the host cell. In some embodiments, the production occurs ex vivo, e.g., in an in vitro cell culture environment. Compositions, cells, enzymes, and methods described in this application are also applicable to industrial settings, including any application wherein there may be a buildup of branched-chain amino acids (e.g., leucine, isoleucine, and valine).
The present invention is further illustrated by the following Examples, which in no way should be construed as limiting. The entire contents of all of the references (including literature references, issued patents, published patent applications, and co pending patent applications) cited throughout this application are hereby expressly incorporated by reference. If a reference incorporated in this application contains a term whose definition is incongruous or incompatible with the definition of same term as defined in the present disclosure, the meaning ascribed to the term in this disclosure shall govern. However, mention of any reference, article, publication, patent, patent publication, and patent application cited in this application is not, and should not be taken as an acknowledgment or any form of suggestion that they constitute valid prior art or form part of the common general knowledge in any country in the world.
In order that the invention described in this application may be more fully understood, the following examples are set forth. The examples described in this application are offered to illustrate the systems and methods provided in this application and are not to be construed in any way as limiting their scope.
Machine-learning-based bioinformatics tools were used to identify enzyme candidates for each of the three desired activities (leucine dehydrogenase, 1.4.1.9; ketoisovalerate decarboxylase, 4.1.1.1; and alcohol dehydrogenase 1.1.1.1) in public sequence databases (SwissProt and TrEMBL, together known as UniProt). For LeuDH and Adh, sequence diversity was maximized using previously developed algorithms. For KivD, a stratified sampling approach was used. The total number of enzyme candidates were 1175 LeuDH sequences, 1296 KivD sequences and 1177 Adh sequences.
For LeuDH and Adh, molecular models of the enzyme—transition state complex were built using Rosetta software, and systematic mutations of the active site residues to each of the 20 amino acids were designed.
DNA sequences for all LeuDH, KivD, and Adh enzymes were codon optimized for expression in E. coli. Coding sequences were synthesized in an inducible E. coli expression vector under the control of the T7 promoter.
To improve the leucine-consuming branched-chain amino acid (BCAA) pathway, experiments were performed to identify LeuDH, KivD, and Adh enzymes with superior activity relative to parent enzymes in a prototype strain (1980, also known as SYN1980), which parent strain included Bacillus cereus LeuDH, Lactococcus lactis KivD, and Saccharomyces cerevisiae ADH2. The prototype strain also included BrnQ from E. coli, which is a transporter for branched-chain amino acids that can transport branched-chain amino acids, such as leucine, into the cell. The parent LeuDH enzyme exhibited substrate promiscuity, deaminating valine and isoleucine in addition to leucine. To improve specific consumption of leucine by the BCAA pathway, an additional goal for the pathway design was to identify LeuDH enzymes with increased specificity for leucine (Leu) relative to valine (Val) and isoleucine (Ile).
Two complementary approaches were used to design a library for each enzyme family (LeuDH, KivD, and Adh): metagenomic sourcing and rational design (Table 2). For each enzyme, a metagenomic library of >1000 enzymes was designed to sample the full metagenomic sequence space available in sequence databases (
For each of the enzyme libraries screened, strains harboring library plasmids were transformed into E. coli T7 expression host cells. 5 μL/well of thawed glycerol stocks were stamped into 500 μL/well of LB+100 ug/mL Carbenicillin (LB-Carb100) in half-height deepwell plates, which were sealed with AeraSeals. Samples were incubated at 37° C. and shaken at 1000 RPM in 80% humidity overnight. 50 μL/well of the resulting precultures were stamped into 450 μL/well of LB-Carb100+1 mM IPTG in half-height deepwell plates, which were sealed with AeraSeals. Samples were incubated at 30° C. and shaken at 1000 RPM in 80% humidity overnight. 250 μL/well of the resulting production cultures were stamped into deepwell plates containing 500 uL of phosphate buffered saline (PBS) and centrifuged for 10 minutes at 4000*G. Supernatant was removed and the resulting cell pellet was resuspended in 200 μL of BugBuster Protein Extraction Reagent+1 μL/mL purified Benzonase+1 μL/6 mL purified Lysozyme. Samples were incubated for 10 minutes at room temperature to generate the cell lysates used in in vitro enzyme assays.
10 μL of lysate for the LeuDH library strains was transferred to a half-area flat-bottom plate containing 90 μL/well assay buffer (20 mM amino acid [L-Leucine, L-Valine, or L-Isoleucind 200 mM Glycine, 200 mM KCl, 0.4 mM NAD, pH 10.5). Optical measurements were taken on a plate reader, with absorbance readings taken at 340 nm for 10 minutes. The resulting kinetic data was used to resolve the maximum rate of NAD+ reduction, a proxy for LeuDH activity.
10 μL of lysate for the KivD library strains was transferred to a half-area flat-bottom plate containing 90 μL/well assay buffer (100 mM PIPES-KOH, 100 mM Potassium glutamate, 1 mM Dithiothreitol, 0.4 mM NAD, 1.5 mM Thiamine pyrophosphate, 10 mM Magnesium glutamate, 20 mM ketoisocaproate (KIC), pH 7.5). A coupling enzyme was used to indirectly measure KivD activity on KIC. Optical absorbance measurements were taken over 10 minutes. The resulting kinetic data was used to determine KivD activity.
10 μL of lysate for the Adh library strains was transferred to a half-area flat-bottom plate containing 90 μL/well assay buffer (50 mM MOPS buffer, 0.4 mM NADH, and 30 mM isovaleraldehyde, pH 7.0). Optical absorbance measurements were taken on a plate reader at 340 nm for 10 minutes. The resulting kinetic data was used to resolve the maximum rate of NADH oxidation, a proxy for ADH activity.
To measure LeuDH selectivity (specific deamination of L-Leu in the presence L-Ile and L-Val), lysate was diluted four-fold in lysis buffer, and 10 μL/well of the newly diluted lysate was stamped into 90 μL/well of a modified assay buffer from above, featuring 0.5 mM of each amino acid (L-leucine, L-isoleucine, L-valine), 200 mM Glycine, 200 mM Potassium chloride, and 4 mM NAD. The reaction was quenched at different timepoints and submitted for LC-MS quantification of leucine, isoleucine, and valine.
To screen the 3ט1300-member enzyme libraries, high-throughput (HTP) methods were developed to screen for LeuDH, KivD, and Adh enzyme activities in E. coli cell lysates. In brief, strains were cultivated in 96-deepwell plates to induce protein production, with positive and negative control strains included in each plate. Cells were lysed, and enzyme activity was measured in cell lysates using the enzyme-specific spectrophotometric assays described herein. Enzyme assays were executed on a fully automated robotic workcell. For each enzyme family, the full library (˜1300 members each) was measured in biological duplicate, and 50-200 enzymes with the highest activity in each enzyme family were selected as primary “hits” for that family. The primary hits were re-screened in a secondary screen with additional replication (4 biological replicates) to validate the enzyme rankings.
A total of 1378 LeuDH enzymes were first screened for the ability to deaminate Leu. An initial round of screening identified 220 enzymes (Table 4) with activity similar to or better than the parent LeuDH enzyme from B. subtilis. These primary hits were further analyzed in a secondary screen (
Activity was calculated as: Enzyme Activity divided by Background Enzyme Activity minus 1. Controls were set to 0, and strains with values >0 were considered as potential hits. The value represents a fractional improvement over the control. As a non-limiting example, strains with a 50% improvement would be indicated in Table 4 with a value of 0.5.
To determine if any of the primary LeuDH hits exhibited increased specificity for Leu over Ile and Val, all 220 primary hits were also screened for activity on Val and Be. Specificity was measured as the ratio of activity on Leu to the activity on Be or Val. As shown in
A trade-off of Leu specificity for Leu activity was observed in this library, where the most specific LeuDH enzymes were not the most active LeuDH enzymes. By comparing specificity for Leu/Ile to Leu/Val, hits with increased specificity for Leu relative to both Leu and Val were identified (
A total of 1248 KivD enzymes were screened for the decarboxylase activity on ketoisocaproate. An initial round of screening identified 55 enzymes (Table 5) with higher activity than the parent KivD enzyme from S. aureus, which did not exhibit activity greater than the background lysate decarboxylase activity in this assay and was equated to the non-zero measurable background activity. These primary KivD hits were further analyzed in a secondary screen (
A total of 1215 Adh enzymes were screened for the ability to reduce isovaleraldehyde to isopentanol. An initial round of screening identified 55 enzymes (Table 6) with higher activity than the parent ADH2 enzyme from S. cerevisiae, which did not exhibit activity greater than the background lysate alcohol dehydrogenase activity in this assay and was equated to the non-zero measurable background activity. Because activity of the ADH2 enzyme for S. cerevisiae was indistinguishable from the background activity of the lysate, an Equus caballus Adh with activity higher than the background activity was used as a positive control for the screen. These primary hits were further analyzed in a secondary screen (
To measure LeuDH selectivity (specific deamination of L-Leu in the presence L-Ile and L-Val), lysate was diluted four-fold in lysis buffer, and 10 μL/well of the newly diluted lysate was stamped into 90 μL/well of a modified assay buffer from above, featuring 0.5 mM of each amino acid (L-leucine, L-isoleucine, L-valine), 200 mM Glycine, 200 mM Potassium chloride, and 4 mM NAD. The reaction was quenched at different time points and submitted for LC-MS quantification of leucine, isoleucine, and valine.
LeuDH catalyzes the deamination of Leu, Val and Be, and as a consequence all substrates have the potential to act as competitors in an in vivo context where substrate pools are mixed. In order to better predict the performance of the top LeuDH hits with regard to mixed-substrate pools, the selectivity of LeuDH enzymes for Leu (i.e., the preference of LeuDH for Leu when Leu, Val, and Ile are all present in the reaction mixture) was measured. A total of 21 LeuDH enzymes were screened in cell lysate assays similar to the HTP screen, except that the reaction mixture contained Leu, Val, and Ile at 1:1:1 molar ratio. Rate of Leu, Val, and Ile disappearance was monitored in the reaction mixture.
To improve the overall Leu consumption of the BCAA pathway, multiple enzymes for each step that demonstrated superior performance relative to the parent enzyme were selected. For LeuDH, 6 hits were selected based on two criteria: enzyme activity on Leu and specificity for Leu relative to Val and Ile. Because LeuDH selectivity analysis was run in parallel to operon assembly, the selectivity data set did not factor into LeuDH selection. For KivD and ADH, 3 hits were selected for each enzyme family based on in vitro enzyme activity. In total, 12 enzymes were advanced to the final operon design (Table 3). The operon was composed of four coding sequences for enzymes in the following order: LeuDH-KivD-Adh-BrnQ. A preferred operon for Leu consumption was selected and further tested as described below.
Cetobacterium ceti
Hymenobacter daecheongensis
Hymenobacter sp. CRA2
Arenimonas sp SCN 70-307
Candidatus kapabacteria sp. 59-99
Peptococcaceae bacterium CEB 3
Candida auris
Bacillus sp. FJ AT-1801
Erwinia iniecta
Tortispora caseinolytica NRRL Y-
Rhizobiales bacterium NRL2
Alcanivorax dieselolei
Branched-chain amino acid (BCAA) pathway operon plasmids were transformed into E. coli Nissle strain 1917, which was purchased from the German Collection of Microorganisms and Cell Cultures (DSMZ Braunschweig, E. coli DSM 6601). Transformed cells were thawed on ice and cell density was measured by light absorption at 600 nm (OD600). OD600 of 1.0 was assumed to be equal to 109 cells/mL in this method. A volume was calculated to target 1 mL of 2×109 cells/mL cell resuspension, and the cells were transferred into a 96-deep well plate and washed once with cold PBS. After centrifugation (4000 rpm, 4° C., 10 min), the PBS was discarded, and the cell pellets were then resuspended in 1 mL of 1×M9+50 mM MOPS+0.5% glucose (MMG) buffer. Eight hundred (800) μL of each sample was transferred into a new 96-deep well plate and 800 μL of MMG containing 16 mM leucine was added, mixed well by pipetting. A sample (200 μL) assigned as time zero was collected at this moment. The plate was then covered by a breathable membrane and moved to an anaerobic chamber to incubate at 37° C. Samples were also collected at 2 hours and 4 hours during incubation in the anaerobic chamber. The samples were centrifuged for 10 minutes at 4000 rpm at 4° C. immediately after collection. 100 μL of the supernatant was transferred into a new 96-well plate and stored at −80° C. for future analysis.
Leucine was quantitated in bacterial supernatant by liquid chromatography coupled to tandem mass spectrometry (LC-MS/MS) using either an Ultimate 3000 UHPLC-TSQ Quantum or a Vanquish UHPLC-TSQ Altis system. Samples were extracted with 9 parts 2:1 acetonitrile:water containing 1 μg/mL leucine-d3 as an internal standard, vortexed, and centrifuged. Supernatants were diluted with 9 parts 0.1% formic acid and analyzed concurrently with standards processed as above from 0.8 to 1000 μg/mL. Samples were separated on a Phenominex Synergi 4 um Hydro-RP 80A, 75×2 mm using a 0.1% formic acid (A), 0.1% formic acid/acetonitrile (B) at 0.3 mL/min and 50 degrees C. After a 2 μL injection and an initial 5% B hold from 0 to 0.5 minutes, analytes were gradient eluted from 5 to 90% B over 0.5 to 1.5 minutes followed by high organic wash and aqueous equilibration steps. Analytes were detected using Selected Reacting Monitoring (SRM) of compound specific collision induced fragments in electrospray positive ion mode (leucine: 132>86, isoleucine: leucine-d3: 135>89). SRM chromatograms were integrated, and the unknown/internal standard peak area ratios were used to calculate concentrations against the standard curve.
The top Leu consuming operons identified through HTP screening were transformed into E. coli Nissle 1917 (and labeled as strain 5941, 5942 and 5943) and compared to the prototype strain 1980. Strain 5941 contains the LeuDH enzyme of Cetobacterium ceti, the KivD enzyme of Erwinia iniecta, and the Adh enzyme of Alcanivorax dieselolei. Strain 5942 has the LeuDH enzyme of Cetobacterium ceti, the KivD enzyme of Erwinia iniecta, and the Adh enzyme of Rhizobiales bacterium NRL2. Strain 5943 has LeuDH enzyme of Cetobacterium ceti, the KivD enzyme of Erwinia iniecta, and the Adh enzyme of Rhizobiales bacterium NRL2. The operons further contain BrnQ of E. coli. The prototype strain contains Bacillus cereus LeuDH, Lactococcus lactis KivD, Saccharomyces cerevisiae ADH2, as well as E. coli BrnQ.
Samples from the top Leu consuming operons and the prototype strain were analyzed for Leu consumption (
As shown in Table 4, mutants of UniProt P0A392 (SEQ ID NO: 27) from Bacillus cereus were generated and tested to determine whether the mutants showed improved activity or enzyme expression relative to UniProt P0A392 (SEQ ID NO: 27). The LeuDH activity assay described in Example 2 was used. Point mutations at the following unique positions were observed to improve either activity or enzyme expression: 42, 43, 44, 67, 71, 76, 78, 113, 115, 116, 136, 293, 296, 297, and 300.
The following point mutations in UniProt P0A392 (SEQ ID NO: 27) were observed to improve either activity or protein expression: A115N, A115Q, A115S, A115T, A115V, A297C, A297D, A297E, A297F, A297H, A297K, A297L, A297M, A297N, A297Q, A297R, A297T, A297W, A297Y, E116A, E116L, E116M, E116N, E116R, E116S, E116V, E116W, G43E, G43F, G43T, G43W, G43Y, G44H, G44I, G44K, G44Y, 1113F, 1113M, 1113Q, 1113V, 1113W, 1113Y, L300A, L300C, L300D, L300F, L300H, L300K, L300M, L300N, L300Q, L300R, L300S, L300T, L300W, L300Y, L42A, L42Q, L42T, L76E, L76F, L76H, L761, L76K, L76M, L76R, L76S, L76T, L76W, L76Y, L78C, L78F, L78H, L78K, L78Q, L78V, L78Y, M67A, M67E, M67K, M67Q, M67S, M67T, N71C, N71D, N71H, N71K, N71M, N71T, T136E, T136F, T136L, T136R, T136S, T136Y, V293A, V293C, V293Q, V293S, V293T, V296A, V296C, V296E, V296I, V296K, V296L, V296N, V296S, and V296T.
Bioinformatics analysis was conducted on mutants of SEQ ID NO: 27 and sequences from a metagenomic library that were hits. A list of unique residues found in hits is provided below in Table 7. The corresponding position in SEQ ID NO: 27 is shown. A hit is a LeuDH that has increased activity (greater than 0) relative to SEQ ID NO: 27. For each position in the multiple sequence alignment, individual residue identities were binned into hits and non-hits, and the set difference was calculated. These are residues that are unique to the hit set, either via the systematic point mutation library or the metagenomic sequences.
Bioinformatics analysis was conducted on hit KivD enzymes that showed increased activity relative to SEQ ID NO: 29. A list of unique residues found in hits is provided in Table 8. For each position in the multiple sequence alignment, individual residue identities were binned into hits and non-hits, and the set difference was calculated. These are residues that were unique to the hit set. The corresponding position in SEQ ID NO: 29 is indicated in Table 8.
UniProt Q684J7, from Lactococcus lactis, is a microbe widely used in the production of buttermilk and cheese. While not the named reaction for natural enzymes, KivD catalyzes the decarboxylation of 4-methyl-2-oxopentanoate to form isopentanol. It was found that hits from the KivD enzyme library have broadened substrate specificity beyond their natural substrate, which is α-ketoisovalerate.
Bioinformatics analysis was conducted on hit ADH enzymes that showed increased activity relative to SEQ ID NO: 31. A list of unique residues found in hits is provided in Table 9. For each position in the multiple sequence alignment, individual residue identities were binned into hits and non-hits, and the set difference was calculated. These are residues that were unique to the hit set. The corresponding position in SEQ ID NO: 31 is indicated in Table 9.
The performance and molar balance closure of the isopentanol pathway in strain 5941 was assessed in AMBR® 15 bioreactors. Strain 5941 comprises the LeuDH enzyme of SEQ ID NO: 2, the KivD enzyme of SEQ ID NO: 18, and the Adh enzyme of SEQ ID NO: 24. The reactors were filled to 17 mL with M9 media with 0.5% glucose, 10 mM Leu, 10 mM Val, and 5 mM Ile. Conditions were controlled with 0% dissolved oxygen and pH at 7.0. Activated biomass was inoculated to an OD600 of 1, and samples of the supernatant were taken over time to monitor metabolite concentrations.
The extracellular concentration profiles of pathway intermediates are shown in
The assay was performed in an AMBR15f, microbioreactor system from Sartorius. The vessels were filled with 17mls of 1×m9 media salts, supplemented with 2.0 mm MgSO4, 0.1 mM CaCl, 5% glucose, 10 mM L-leucine, 5 mM L-isoleucine, and 10 mM valine. The vessels were filled 18 hrs prior to inoculation, to enable both the pH and DO optodes to hydrate. The temperature in the reactors was kept at 37° C., the pH was maintained at 7 using 2N NaOH, and the dissolved oxygen was kept at 0 using a 0.14vvm N2 flow rate. The agitation was set to 500 RPM to enable good mixing throughout the experiment. The bioreactors were inoculated to an OD600 of 1, from activated biomass supplied by Synlogic. The bioreactors were sampled at 0, 30, 90, 150, and 180 minutes post inoculation. Samples were immediately centrifuged at 15000×g for 30secs in a microcentrifuge and the supernatant was removed for analysis. Supernatants were stored at −20° C. until ready for analysis.
Analytics were developed for two methods. One method involved liquid chromatography mass spectrometry (LCMS) for the quantification of leucine (Leu), ketoisocaproate acid (Leu acid), and isovaleraldehyde (Leu aldehyde). This method was also validated and used for quantification of valine and isoleucine (and their respective acid and aldehyde products). The second method involved gas chromatography mass spectrometry (GCMS) for the quantification of isopentanol (Leu alcohol). Together, these analytical methods allowed for quantitation of all pathway intermediates for strain 5941. The GCMS method was also validated and used for quantification of valine and isoleucine alcohol products.
LCMS analysis was performed on a Thermo Ultimate 3000 UPLC system with a Thermo Q-Exactive quadrupole-orbitrap mass detector and a Thermo Accucore PFP column (2.1×100 mm, 2.6 μm packing) using the following elution solvents: A=0.1% formic acid and 0.1% TFA in water; B=0.1% formic acid in acetonitrile. The gradient was at 0.5 mL/min of 1% B in A for 60 seconds, followed by a linear ramp from 1% to 40% B in A over 270 seconds. The column was then flushed with 95% B in A for 60 seconds, and re-equilibrated with 1% B in A for 180 seconds. MS acquisition was from 0.8 to 5.3 minutes.
Column effluent was introduced into the mass spectrometer via a standard Thermo ESI source with positive mode ionization at +3800V, vaporizer temperature of 400° C., and ion transfer tube temperature of 375° C. Thermo reports gas flow rates in arbitrary units probably approximating L/min at STP. Set points were: sheath gas, 60; aux gas, 30; sweep gas, 1. To increase data acquisition rate, orbitrap resolution was set to 17,500. Quadrupole resolution was 1 m/z.
This method also derivatizes both aldehydes and keto acids, improving the stability of those analytes. Numerous derivatizing agents were explored, and it was found that 2-(Dimethylamino)ethylhydrazine in methanol resulted in the best sensitivity in positive mode. A buffer of 0.5M acetic acid and 0.5M sodium acetate in methanol was used for the quantification of LEU ACID and LEU ALDEHYDE, while also measuring non-derivatized LEU.
GC-MS analysis was performed on an Agilent GCMS/MSD with a Gerstel autosampler, using a J&W DB-WAX GC Column (15m) and chloroform as the extraction solvent. Front injector was set at 250° C. and a flow rate of 1 mL/min. The oven temperature held at 40° C. for 1 minute, followed by a ramp to 130° C. (15° C./min), and then ramped up to 200° C. (65° C./min). Ms acquisition scan window was at 40-150 mz, with the MS source and MS quad at 250C and 200C respectively.
To facilitate high throughput and automation, a Gerstel autosampler was used to inject the extracted bottom chloroform layer in a 96 well plate format with the aqueous ambr15 culture matrix on top acting as an overlay to prevent product evaporation. To account for any other potential alcohol product evaporation, 2-heptanol was added to the chloroform as an internal.
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described in this disclosure. Such equivalents are intended to be encompassed by the following claims.
All references, including patent documents, disclosed in this application are incorporated by reference in their entirety, particularly for the disclosure referenced in this disclosure.
This application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application Ser. No. 62/865,129, filed Jun. 21, 2019, entitled “BIOSYNTHESIS OF ENZYMES FOR USE IN TREATMENT OF MAPLE SYRUP URINE DISEASE (MSUD),” and U.S. Provisional Application Ser. No. 62/864,875, filed Jun. 21, 2019, entitled “OPTIMIZED BACTERIA ENGINEERED TO TREAT DISORDERS INVOLVING THE CATABOLISM OF LEUCINE, ISOLEUCINE, AND/OR VALINE,” the disclosure of each which is incorporated by reference herein in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/038813 | 6/19/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62864875 | Jun 2019 | US | |
62865129 | Jun 2019 | US |