VITAMIN C PRODUCTION IN MICROORGANISMS AND PLANTS

Information

  • Patent Application
  • 20020012979
  • Publication Number
    20020012979
  • Date Filed
    May 25, 1999
    25 years ago
  • Date Published
    January 31, 2002
    23 years ago
Abstract
A biosynthetic method for producing vitamin C (ascorbic acid, L-ascorbic acid, or AA) is disclosed. Such a method includes fermentation of a genetically modified microorganism or plant to produce L-ascorbic acid. In particular, the present invention relates to the use of microorganisms and plants having at least one genetic modification to increase the action of an enzyme involved in the ascorbic acid biosynthetic pathway. Included is the use of nucleotide sequences encoding epimerases, including the endogenous GDP-D-mannose:GDP-L-galactose epimerase from the L-ascorbic acid pathway and homologues thereof for the purposes of improving the biosynthetic production of ascorbic acid. The present invention also relates to genetically modified microorganisms, such as strains of microalgae, bacteria and yeast useful for producing L-ascorbic acid, and to genetically modified plants, useful for producing consumable plant food products.
Description


FIELD OF THE INVENTION

[0002] The present invention relates to vitamin C (L-ascorbic acid) production using genetically modified microorganisms and plants. In particular, the present invention relates to the use of nucleotide sugar epimerase enzymes for the biological production of ascorbic acid in plants and microorganisms.



BACKGROUND OF THE INVENTION

[0003] Nearly all forms of life, both plant and animal, either synthesize ascorbic acid (vitamin C) or require it as a nutrient. Ascorbic acid was first identified to be useful as a dietary supplement for humans and animals for the prevention of scurvy. Ascorbic acid, however, also affects human physiological functions such as the adsorption of iron, cold tolerance, the maintenance of the adrenal cortex, wound healing, the synthesis of polysaccharides and collagen, the formation of cartilage, dentine, bone and teeth, the maintenance of capillaries, and is useful as an antioxidant.


[0004] For use as a dietary supplement, ascorbic acid can be isolated from natural sources, such as rosehips, synthesized chemically through the oxidation of L-sorbose, or produced by the oxidative fermentation of calcium D-gluconate by Acetobacter suboxidans. Considine, “Ascorbic Acid,” Van Nostrand's Scientific Encyclopedia, Vol. 1, pp. 237-238, (1989). Ascorbic acid (predominantly intracellular) has also been obtained through the fermentation of strains of the microalga, Chlorella pyrenoidosa. See U.S. Pat. No. 5,001,059 by Skatrud, which is assigned to the assignee of the present application. It is believed that ascorbic acid is produced inside the chloroplasts of photosynthetic microorganisms and functions to neutralize energetic electrons produced during photosynthesis. Accordingly, ascorbic acid production is known in photosynthetic organisms as a protective mechanism.


[0005] Therefore, products and processes which improve the ability to biosynthetically produce ascorbic acid are desirable and beneficial for the improvement of human health.



SUMMARY OF THE INVENTION

[0006] One embodiment of the present invention relates to a method for producing ascorbic acid or esters thereof in a microorganism. The method includes the steps of: (a) culturing a microorganism having a genetic modification to increase the action of an enzyme selected from the group of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase; and (b) recovering the ascorbic acid or esters produced by the microorganism. Preferably, the genetic modification is a genetic modification to increase the action of an enzyme selected from the group of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase. In one embodiment of the method of the present invention, the microorganism further includes a genetic modification to decrease the action of an enzyme having GDP-D-mannose as a substrate, other than GDP-D-mannose:GDP-L-galactose epimerase. Such a genetic modification can include, for example, a genetic modification to decrease the action of GDP-D-mannose-dehydrogenase.


[0007] In one embodiment, the genetic modification is a genetic modification to increase the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, which can include GDP-D-mannose:GDP-L-galactose epimerase. In one embodiment, the epimerase binds NADPH. In one embodiment of this method, the genetic modification includes transformation of the microorganism with a recombinant nucleic acid molecule that expresses the epimerase. Such an epimerase can have a tertiary structure that substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. Preferably, the epimerase has a structure having an average root mean square deviation of less than about 2.5 Å, and more preferably less than about 1 Å, over at least about 25% of Ca positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.


[0008] In one embodiment, the epimerase comprises a substrate binding site having a tertiary structure that substantially conforms to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. Such a substrate binding site preferably has a tertiary structure with an average root mean square deviation of less than about 2.5 Å over at least about 25% of Cα positions of the tertiary structure of a substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.


[0009] In another embodiment, the epimerase comprises a catalytic site having a tertiary structure that substantially conforms to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. Such a catalytic site preferably has a tertiary structure with an average root mean square deviation of less than about 1 Å over at least about 25% of Cα positions of the tertiary structure of a catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. The catalytic site preferably includes the amino acid residues serine, tyrosine and lysine and in one embodiment, the tertiary structure positions of the amino acid residues serine, tyrosine and lysine substantially conform to tertiary structure positions of residues Ser107, Tyr136 and Lys140, respectively, as represented by atomic coordinates in Brookhaven Protein Data Bank Accession Code 1bws.


[0010] In yet another embodiment of this method, the epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in the amino acid sequence align with 100% identity with at least about 50%, and in another embodiment with at least about 75%, and in yet another embodiment with at least about 90% of non-Xaa residues in SEQ ID NO:11. In another embodiment, the epimerase comprises an amino acid sequence having at least 4 contiguous amino acid residues that are 100% identical to at least 4 contiguous amino acid residues of an amino acid sequence selected from the group of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO:10. In yet another embodiment, the recombinant nucleic acid molecule comprises a nucleic acid sequence comprising at least about 12 contiguous nucleotides having 100% identity with at least about 12 contiguous nucleotides of a nucleic acid sequence selected from the group of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9.


[0011] In yet another embodiment of this method of the present invention, the epimerase comprises an amino acid sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly. In yet another embodiment, the recombinant nucleic acid molecule comprises a nucleic acid sequence that is at least about 15% identical, and in another embodiment, at least about 20% identical, and in another embodiment, at least about 25% identical, to a nucleic acid sequence selected from the group of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method with Lipman-Pearson standard default parameters.


[0012] In yet another embodiment of this method of the present invention, the recombinant nucleic acid molecule comprises a nucleic acid sequence that hybridizes under stringent hybridization conditions to a nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. The nucleic acid sequence encoding the GDP-4-keto-6-deoxy-D-mannose epimerase/reductase includes nucleic acid sequences selected from the group of SEQ ID NO:1, SEQ ID NO:3 and SEQ ID NO:5, and the GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can include an amino acid sequence selected from the group of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.


[0013] In one embodiment of the method of the present invention, the microorganism is selected from the group of bacteria, fungi and microalgae. In one embodiment, the microorganism is acid-tolerant. Preferred bacteria include, but are not limited to Azotobacter and Pseudomonas. Preferred fungi include, but are not limited to, yeast, including, but not limited to Saccharomyces yeast. Preferred microalgae include, but are not limited to, microalgae of the genera Prototheca and Chlorella, with microalgae of the genus Prototheca being particularly preferred.


[0014] In yet another embodiment of the method of the present invention, the microorganism is acid-tolerant and the step of culturing is conducted at a pH of less than about 6.0, and more preferably, at a pH of less than about 5.5, and even more preferably, at a pH of less than about 5.0. The step of culturing can be conducted in a fermentation medium that comprises a carbon source other than D-mannose in one embodiment, and in another embodiment, the step of culturing is conducted in a fermentation medium that comprises glucose as a carbon source.


[0015] In yet another embodiment of the present method, the step of culturing is conducted in a fermentation medium that is magnesium (Mg) limited. Preferably, the step of culturing is conducted in a fermentation medium that is Mg limited during a cell growth phase. In one embodiment, the fermentation medium includes less than about 0.5 g/L of Mg during a cell growth phase, and more preferably, less than about 0.2 g/L of Mg during a cell growth phase, and even more preferably, less than about 0.1 g/L of Mg during a cell growth phase.


[0016] Another embodiment of the present invention relates to a microorganism for producing ascorbic acid or esters thereof. The microorganism has a genetic modification to increase the action of an enzyme selected from the group of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase. Preferably, the genetic modification is a genetic modification to increase the action of an enzyme selected from the group of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase, and even more preferably, to increase the action of GDP-D-mannose:GDP-L-galactose epimerase.


[0017] In one embodiment, the microorganism has been genetically modified to express a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein the epimerase has a tertiary structure having an average root mean square deviation of less than about 2.5 A over at least about 25% of Cα positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. In another embodiment, the microorganism has been genetically modified to express a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein the epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in the amino acid sequence align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO:11. Preferred microorganisms are disclosed as for the method discussed above.


[0018] Yet another embodiment of the present invention relates to a plant for producing ascorbic acid or esters thereof. Such a plant has a genetic modification to increase the action of an enzyme selected from the group of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase. In a preferred embodiment, the genetic modification is a genetic modification to increase the action of an enzyme selected from the group of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase, and in a more preferred embodiment, the genetic modification is a genetic modification to increase the action of GDP-D-mannose:GDP-L-galactose epimerase.


[0019] In one embodiment, the plant further comprises a genetic modification to decrease the action of an enzyme having GDP-D-mannose as a substrate other than GDP-D-mannose:GDP-L-galactose epimerase. Such a genetic modification includes a genetic modification to decrease the action of GDP-D-mannose-dehydrogenase. Such a plant also includes a plant that has been genetically modified to express a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein the epimerase has a tertiary structure having an average root mean square deviation of less than about 2.5 Å over at least about 25% of Cα positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. In another embodiment, such a plant has been genetically modified to express a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein the epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in the amino acid sequence align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO:11.


[0020] In one embodiment, a plant for producing ascorbic acid or esters thereof according to the present invention is a microalga. Preferred microalgae include, but are not limited to microalgae of the genera Prototheca and Chlorella, with microalga of the genus Prototheca being particularly preferred. In another embodiment, the plant is a higher plant, with consumable higher plants being more preferred.







BRIEF DESCRIPTION OF THE FIGURES

[0021]
FIG. 1A is a schematic drawing of the pathway from glucose to GDP-D-mannose in plants.


[0022]
FIG. 1B is a schematic drawing of the pathway from GDP-D-mannose to L-galactose-1-phosphate in plants.


[0023]
FIG. 1C is a schematic drawing of the pathway from L-galactose to L-ascorbic acid in plants.


[0024]
FIG. 2A is a schematic drawing of selected carbon flow from glucose in Prototheca.


[0025]
FIG. 2B is a schematic drawing of selected carbon flow from glucose in Prototheca.


[0026]
FIG. 3 is a schematic drawing that shows the lineage of mutants derived from Prototheca moriformis ATCC 75669, and their ability to produce L-ascorbic acid.


[0027]
FIG. 4 is a bar graph illustrating the conversion of substrates by resting cells of strain NA45-3 following growth in media containing various magnesium concentrations and resuspension in media containing various magnesium concentrations.


[0028]
FIG. 5 is a line graph showing the relationship between specific ascorbic acid formation in cultures of Prototheca strains and the specific activity of GDP-D-mannose:GDP-L-galactose epimerase in extracts prepared from cells harvested from the same cultures.


[0029]
FIG. 6 is a line graph showing the relationship between specific epimerase activity and the degree of magnesium limitation in two strains, ATCC 75669 and EMS13-4.


[0030]
FIG. 7 depicts the overall catalytic mechanism of GDP-D-mannose:GDP-L-galactose epimerase proposed by Barber (1979, J. Biol. Chem. 254:7600-7603).


[0031]
FIG. 8A depicts the catalytic mechanism of GDP-D-mannose-4,6-dehydratase (converts GDP-D-mannose to GDP-4-keto-6-deoxy-D-mannose).


[0032]
FIG. 8B depicts the catalytic mechanism of GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (converts GDP-4-keto-6-deoxy-D-mannose to GDP-L-fucose) (Chang, et al., 1988, J. Biol. Chem. 263:1693-1697; Barber, 1980, Plant Physiol. 66: 326-329).







DETAILED DESCRIPTION OF THE INVENTION

[0033] The present invention relates to a biosynthetic method and production microorganisms and plants for producing vitamin C (ascorbic acid, L-ascorbic acid, or AA). Such a method includes fermentation of a genetically modified microorganism to produce L-ascorbic acid. In particular, the present invention relates to the use of nucleotide sequences encoding epimerases, including the endogenous GDP-D-mannose:GDP-L-galactose epimerase from the L-ascorbic acid pathway, as well as epimerases having structural homology (e.g., by nucleotide/amino acid sequence and/or tertiary structure of the encoded protein) to GDP-4-keto-6-deoxy-D-mannose epimerase/reductases, or UDP-galactose 4-epimerases, for the purposes of improving the biosynthetic production of ascorbic acid. The present invention also relates to genetically modified microorganisms, such as strains of microalgae, bacteria and yeast useful for producing L-ascorbic acid, and to genetically modified plants, useful for producing consumable plant food products.


[0034] One embodiment of the present invention relates to a method to produce L-ascorbic acid by fermentation of a genetically modified microorganism. This method includes the steps of (a) culturing in a fermentation medium a microorganism having a genetic modification to increase the action of an enzyme selected from the group of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone dehydrogenase; and (b) recovering L-ascorbic acid or esters thereof. The various enzymes in this list represent the enzymes involved in the vitamin C biosynthetic pathway in plants. It is uncertain at this time whether the enzyme represented by GDP-L-galactose phosphorylase is actually a phosphorylase or a pyrophosphorylase (i.e., GDP-L-galactose pyrophosphorylase). Therefore, use of the term “GDP-L-galactose phosphorylase” herein refers to either GDP-L-galactose phosphorylase or GDP-L-galactose pyrophosphorylase. In one aspect of the invention, this method includes the step of culturing in a fermentation medium a microorganism having a genetic modification to increase the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. This aspect of the present invention is discussed in detail below.


[0035] Another embodiment of the present invention relates to a genetically modified microorganism for producing L-ascorbic acid or esters thereof. Another embodiment of the present invention relates to a genetically modified plant for producing L-ascorbic acid or esters thereof. Both genetically modified microorganisms (e.g., bacteria, yeast, microalgae) and plants (e.g., higher plants, microalgae) have a genetic modification to increase the action of an enzyme selected from the group of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase. In a preferred embodiment, both genetically modified microorganisms (e.g., bacteria, yeast, microalgae) and plants (e.g., higher plants, microalgae) have a genetic modification to increase the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. In one embodiment, the genetic modification includes the transformation of the microorganism or plant with the epimerase as described above.


[0036] To produce significantly high yields of L-ascorbic acid by the method of the present invention, a plant and/or microorganism is genetically modified to enhance production of L-ascorbic acid. As used herein, a genetically modified plant (such as a higher plant or microalgae) or microorganism, such as a microalga (Prototheca, Chlorella), Escherichia coli, or a yeast, is modified (i.e., mutated or changed) within its genome and/or by recombinant technology (i.e., genetic engineering) from its normal (i.e., wild-type or naturally occurring) form. In a preferred embodiment, a genetically modified plant or microorganism according to the present invention has been modified by recombinant technology. Genetic modification of a plant or microorganism can be accomplished using classical strain development and/or molecular genetic techniques, include genetic engineering techniques. Such techniques are generally disclosed herein and are additionally disclosed, for example, in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press; Roessler, 1995, Plant Lipid Metabolism, pp. 46-48; and Roessler et al., 1994, in Bioconversion for Fuels, Himmel et al. eds., American Chemical Society, Washington D.C., pp 255-70). These references are incorporated by reference herein in their entirety.


[0037] In some embodiments, a genetically modified plant or microorganism can include a natural genetic variant as well as a plant or microorganism in which nucleic acid molecules have been inserted, deleted or modified, including by mutation of endogenous genes (e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such a manner that the modifications provide the desired effect within the plant or microorganism. As discussed above, a genetically modified plant or microorganism includes a plant or microorganism that has been modified using recombinant technology.


[0038] As used herein, genetic modifications which result in a decrease in gene expression, an increase in inhibition of gene expression or inhibition of a gene product (i.e., the protein encoded by the gene), a decrease in the function of the gene, or a decrease in the function of the gene product can be referred to as inactivation (complete or partial), deletion, interruption, blockage, down-regulation, or decreased action of a gene. For example, a genetic modification in a gene which results in a decrease in the function of the protein encoded by such gene can be the result of a complete deletion of the gene encoding the protein (i.e., the gene does not exist, and therefore the protein does not exist), a mutation in the gene encoding the protein which results in incomplete or no translation of the protein (e.g., the protein is not expressed), or a mutation in the gene which decreases or abolishes the natural function of the protein (e.g., a protein is expressed which has decreased or no enzymatic activity).


[0039] Genetic modifications which result in an increase in gene expression or function can be referred to as amplification, overproduction, overexpression, activation, enhancement, addition, up-regulation or increased action of a gene. Additionally, a genetic modification to a gene which modifies the expression, function, or activity of the gene can have an impact on the action of other genes and their expression products within a given metabolic pathway (e.g., by inhibition or competition). In this embodiment, the action (e.g., activity) of a particular gene and/or its product can be affected (i.e., upregulated or downregulated) by a genetic modification to another gene within the same metabolic pathway, or to a gene within a different metabolic pathway which impacts the pathway of interest by competition, inhibition, substrate formation, etc.


[0040] In general, a plant or microorganism having a genetic modification that affects L-ascorbic acid production has at least one genetic modification, as discussed above, which results in a change in the L-ascorbic acid production pathway as compared to a wild-type plant or microorganism grown or cultured under the same conditions. Such a modification in an L-ascorbic acid production pathway changes the ability of the plant or microorganism to produce L-ascorbic acid. According to the present invention, a genetically modified plant or microorganism preferably has an enhanced ability to produce L-ascorbic acid compared to a wild-type plant or microorganism cultured under the same conditions.


[0041] The present invention is based on the present inventors' discovery of the biosynthetic pathway for L-ascorbic acid (vitamin C) in plants and microorganisms. Prior to the present invention, the metabolic pathway by which plants produce L-ascorbic acid, was not completely elucidated. The present inventors have demonstrated that L-ascorbic acid production in plants, including L-ascorbic acid-producing microorganisms (e.g., microalgae), is a pathway which uses GDP-D-mannose and involves sugar phosphates and NDP-sugars. In addition, the present inventors have made the surprising discovery that both L-galactose and L-galactono-γ-lactone can be rapidly converted into L-ascorbic acid in L-ascorbic acid-producing microalgae, including Prototheca and Chlorella pyrenoidosa. The entire pathway for L-ascorbic acid production in plants is set forth in FIGS. 1A-1C. More particularly, FIG. 1A shows that the production of L-ascorbic acid in plants proceeds through the production of mannose intermediates to GDP-D-mannose, followed by the conversion of GDP-D-mannose to GDP-L-galactose by GDP-D-mannose:GDP-L-galactose epimerase (also known as GDP-D-mannose-3,5-epimerase) (FIG. 1B), and then by the subsequent progression to L-galactose-1-P, L-galactose, L-galactonic acid (optional), L-galactono-γ-lactone, and L-ascorbic acid (FIG. 1C). FIG. 1B also illustrates alternate pathways for the use of various intermediates, such as GDP-D-mannose. Certain aspects of this pathway have been independently described in a publication (Wheeler, et al., 1998, Nature 393:365-369), incorporated herein by reference in its entirety.


[0042] Points within the L-ascorbic acid production pathway which can be targeted by genetic modification to affect the production of L-ascorbic acid can generally be categorized into at least one of the following pathways: (a) pathways affecting the production of GDP-D-mannose (e.g., pathways for converting a carbon source into GDP-D-mannose); (b) pathways for converting GDP-D-mannose into other compounds, (c) pathways associated with or downstream of the action of GDP-D-mannose:GDP-L-galactose epimerase, (d) pathways which compete for substrates involved in the production of any of the intermediates within the L-ascorbic acid production pathway, and in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1-phosphate, L-galactose, L-galactono-γ-lactone, and/or L-ascorbic acid; and (e) pathways which inhibit production of any of the intermediates within the L-ascorbic acid production pathway, and in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1-phosphate, L-galactose, L-galactono-γ-lactone, and/or L-ascorbic acid.


[0043] A genetically modified plant or microorganism useful in a method of the present invention typically has at least one genetic modification in the L-ascorbic acid production pathway which results in an enhanced production of L-ascorbic acid. In one embodiment, a genetically modified plant or microorganism has at least one genetic modification that results in: (a) an enhanced production of GDP-D-mannose; (b) an inhibition of pathways which convert GDP-D-mannose into compounds other than GDP-L-galactose; (c) an enhancement of action of the GDP-D-mannose:GDP-L-galactose epimerase; (d) an enhancement of the action of enzymes downstream of the GDP-D-mannose:GDP-L-galactose epimerase; (e) an inhibition of pathways which compete for substrates involved in the production of any of the intermediates within the L-ascorbic acid production pathway, and in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1-phosphate, L-galactose, L-galactono-γ-lactone, and/or L-ascorbic acid; and (e) an inhibition of pathways which inhibit production of any of the intermediates within the L-ascorbic acid production pathway, and in particular, with GDP-D-mannose, GDP-L-galactose, L-galactose-1-phosphate, L-galactose, L-galactono-γ-lactone, and/or L-ascorbic acid.


[0044] An enhanced production of GDP-D-mannose by genetic modification of the plant or microorganism can be achieved by, for example, overexpression of enzymes such as hexokinase, glucose phosphate isomerase, phosphomannose isomerase (PMI), phosphomannomutase (PMM) and/or GDP-D-mannose pyrophosphorylase (GMP). Inhibition of pathways which convert GDP-D-mannose to compounds other than GDP-L-galactose can be achieved, for example, by modifications which inhibit polysaccharide synthesis, GDP-D-rhamnose synthesis, GDP-L-fucose synthesis and/or GDP-D-mannuronic acid synthesis. An increase in the action of the GDP-D-mannose:GDP-L-galactose epimerase and of enzymes downstream of the epimerase in the L-ascorbic acid production pathway can be achieved by genetic modifications which include, but are not limited to: overexpression of the epimerase gene (i.e, by overexpression of a recombinant nucleic acid molecule encoding the epimerase gene or a homologue thereof (discussed in detail below), and/or by mutation of the endogenous or recombinant gene to enhance expression of the gene) and/or overexpression of genes downstream of the epimerase which encode subsequent enzymes in the L-ascorbic acid pathway. Finally, metabolic pathways which compete with or inhibit the L-ascorbic acid production pathway can be inhibited by deleting or mutating enzymes, substrates or products which either inhibit or compete for an enzyme, substrate or product in the L-ascorbic acid pathway.


[0045] As discussed above, a genetically modified plant or microorganism useful in the method of the present invention can have at least one genetic modification (e.g., mutation in the endogenous gene or addition of a recombinant gene) in a gene encoding an enzyme involved in the L-ascorbic acid production pathway. Such genetic modifications preferably increase (i.e., enhance) the action of such enzymes such that L-ascorbic acid is preferentially produced as compared to other possible end products in related metabolic pathways. Such genetic modifications include, but are not limited to, overexpression of the gene encoding such enzyme, and deletion, mutation, or downregulation of genes encoding competitors or inhibitors of such enzyme. Preferred enzymes for which the action of the gene encoding such enzyme can be genetically modified include: hexokinase, glucose phosphate isomerase, phosphomannose isomerase (PMI), phosphomannomutase (PMM), GDP-D-mannose pyrophosphorylase (GMP), GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase. More preferably, a genetically modified plant or microorganism useful in the present invention has a genetic modification which increases the action of an enzyme selected from the group of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and/or L-galactono-γ-lactone dehydrogenase. Even more preferably, a genetically modified plant or microorganism useful in the present invention has a genetic modification which increases the action of GDP-D-mannose:GDP-L-galactose epimerase. These enzymes and the reactions catalyzed by such enzymes are illustrated in FIGS. 1A-1C.


[0046] Prior to the present invention, without knowing the L-ascorbic acid biosynthetic (i.e., production) pathway, previous mutagenesis and screening efforts were limited in that only non-lethal mutations could be detected. One embodiment of the present invention relates to elimination of a key competing enzyme that diverts carbon flow from L-ascorbic acid synthesis. If such enzyme is absolutely required for growth on glucose, then mutants lacking the enzyme (and, therefore, having increased carbon flow to L-ascorbic acid) would have been nonviable and not have been detected during prior screening efforts. One such enzyme is phosphofructokinase (PFK) (See FIG. 2A). PFK is required for growth on glucose, and is the major step drawing carbon away from L-ascorbic acid biosynthesis (FIG. 2A). Elimination of PFK would render the cells nonviable on glucose-based media. Selection of a conditional mutant where PFK was inactivated by temperature shift, for example, may allow development of a L-ascorbic acid process where cell growth is achieved under permissive fermentation conditions, and L-ascorbic acid production (from glucose) is initiated by a shift to non-permissive condition. In this example, the temperature shift would eliminate carbon flow from glucose to glycolysis via PFK, thereby shunting carbon into the L-ascorbic acid branch of metabolism. This approach has application not only in natural L-ascorbic acid producing organisms, but also in L-ascorbic acid recombinant systems (genetically engineered plant or microorganisms) as discussed herein.


[0047] Knowing the identity and mechanism of the rate-limiting pathway enzymes in the L-ascorbic acid production pathway allows for design of specific inhibitors of the enzymes that are also growth inhibitory. Selection of mutants resistant to the inhibitors allows for the isolation of strains that contain L-ascorbic acid-pathway enzymes with more favorable kinetic properties. Therefore, one embodiment of the present invention is to identify inhibitors of the enzymes that are also growth inhibitory. These inhibitors are then used to select genetic mutants that overcome this inhibition and produce L-ascorbic acid at high levels. In this embodiment, the resultant plant or microorganism is a non-recombinant strain which can then be further modified by recombinant technology, if desired. In recombinant L-ascorbic acid producing strains, random mutagenesis and screening can be used as a final step to increase L-ascorbic acid production.


[0048] In yet another embodiment genetic modifications are made to an L-ascorbic acid producing organism directly. This allows one to build upon a base of data acquired during prior classical strain improvement efforts, and perhaps more importantly, allows one to take advantage of undefined beneficial mutations that occurred during classical strain improvement. Furthermore, fewer problems are encountered when expressing native, rather than heterologous, genes. The most advanced system for development of genetic systems for microalgae has been developed for Chlamydomonas reinhardtii. Preferably, development of such a genetically modified production organism would include: isolation of mutant(s) with a specific nutritional requirement for use with a cloned selectable marker gene (similar to the ura3 mutants used in yeast and fungal systems); a cloned selectable marker such as URA3 or alternatively, identification and cloning of a gene that specifies resistance to a toxic compound (this would be analogous to the use of antibiotic resistance genes in bacterial systems, and, as is the case in yeast and other fungi, a means of inserting/removing the marker gene repeatedly would be required, unless several different selectable markers were developed); a transformation system for introducing DNA into the production organism and achieving stable transformation and expression; and, a promoter system (preferably several) for high-level expression of cloned genes in the organism.


[0049] Another embodiment of the present invention, discussed in detail below, is to place key genes or allelic variants and homologues thereof from L-ascorbic acid producing organisms (i.e., higher plants and microalgae) into a plant or microorganism that is more amenable to molecular genetic manipulation, including endogenous L-ascorbic acid producing microorganisms and suitable plants. For example, it is possible to identify a suitable non-pathogenic organism based on the requirement of growth (on glucose) at low pH (i.e., acid-tolerant organisms, discussed in detail below).


[0050] One suitable candidate for recombinant production in any suitable host organism is the gene (nucleic acid molecule) encoding GDP-D-mannose:GDP-L-galactose epimerase and homologues of the GDP-D-mannose:GDP-L-galactose epimerase, as well as any other epimerase that has structural homology at the primary (i.e., sequence) or tertiary (i.e., three dimensional) level, to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, or to a UDP-galactose 4-epimerase. Many microorganisms produce GDP-D-mannose as a precursor to exopolysaccharide and glycoprotein production, even though such organisms may not make L-ascorbic acid. This aspect of the present invention is discussed in detail below.


[0051] Referring to FIGS. 1A-1C, at least some of the enzymes from glucose-6-phosphate to GDP-D-mannose are present in many organisms. In fact, the entire sequence is present in bacteria such as Azotobacter vinelandii and Pseudomonas aeruginosa, and make up the early steps in the biosynthesis of the exopolysaccharide alginate. In this regard, it is possible that the only thing preventing these organisms from producing L-ascorbic acid could be the lack of GDP-D-mannose:GDP-L-galactose epimerase. The presence of PMI, PMM and GMP (see FIG. 1A) in so many organisms is important for two reasons. First, these organisms themselves could serve as alternate hosts for L-ascorbic acid production, by building on the existing early pathway enzymes and adding the required cloned genes (the epimerase and possibly others). Second, the genes encoding PMI, PMM and GMP can be cloned into a new organism where, together with the cloned epimerase, they would encode the overall pathway from glucose-6-phosphate to GDP-L-galactose.


[0052] In order to screen genomic DNA or cDNA libraries from different organisms and to isolate nucleic acid molecules encoding these enzymes such as the GDP-D-mannose:GDP-L-galactose epimerase, one can use any of a variety of standard molecular and biochemical techniques. For example, the GDP-D-mannose:GDP-L-galactose epimerase can be purified from an organism such as Prototheca, the N-terminal amino acid sequence can be determined (including, if necessary, the sequence of internal peptide fragments), and this information can be used to design degenerate primers for amplifying a gene fragment from the organism's DNA. This fragment would then be used to probe the library, and subsequently fragments that hybridize to the probe would be cloned in that organism or another suitable production organism. There is ample precedent for plant enzymes being expressed in an active form in bacteria, such as E. coli. Alternatively, yeast are also a suitable candidate for developing a heterologous system for L-ascorbic acid production.


[0053] It is to be understood that the present invention discloses a method comprising the use of a microorganism with an ability to produce commercially useful amounts of L-ascorbic acid in a fermentation process (i.e., preferably an enhanced ability to produce L-ascorbic acid compared to a wild-type microorganism cultured under the same conditions). This method is achieved by the genetic modification of one or more genes encoding a protein involved in an L-ascorbic acid pathway which results in the production (expression) of a protein having an altered (e.g., increased or decreased) function as compared to the corresponding wild-type protein. Preferably, such genetic modification is achieved by recombinant technology. It will be appreciated by those of skill in the art that production of genetically modified plants or microorganisms having a particular altered function as described elsewhere herein (e.g., an enhanced ability to produce GDP-D-mannose:GDP-L-galactose epimerase), such as by transformation of the plant or microorganism with a nucleic acid molecule which encodes a particular enzyme, can produce many organisms meeting the given functional requirement, albeit by virtue of a variety of different genetic modifications. For example, different random nucleotide deletions and/or substitutions in a given nucleic acid sequence may all give rise to the same phenotypic result (e.g., decreased enzymatic activity of the protein encoded by the sequence). The present invention contemplates any such genetic modification which results in the production of a plant or microorganism having the characteristics set forth herein.


[0054] A microorganism to be used in the fermentation method of the present invention is preferably a bacterium, a fungus, or a microalga which has been genetically modified according to the disclosure above. More preferably, a microorganism useful in the present invention is a microalga which is capable of producing L-ascorbic acid, although the present invention includes microorganisms which are genetically engineered to produce L-ascorbic acid using the knowledge of the key components of the pathway and the guidance provided herein. Even more preferably, a microorganism useful in the present invention is an acid-tolerant microorganism, such as microalgae of the genera Prototheca and Chlorella. Acid-tolerant yeast and bacteria are also known in the art. Acid-tolerant microorganisms are discussed in detail below. Particularly preferred microalgae include microalgae of the genera, Prototheca and Chlorella, with Prototheca being most preferred. All known species of Prototheca produce L-ascorbic acid. Production of ascorbic acid by microalgae of the genera Prototheca and Chlorella is described in detail in U.S. Pat. No. 5,792,631, issued Aug. 11, 1998, and in U.S. Pat. No. 5,900,370, issued May 4, 1999, both of which are incorporated herein by reference in their entirety. Preferred bacteria for use in the present invention include, but are not limited to, Azotobacter, Pseudomonas, and Escherichia, although acid-tolerant bacteria are more preferred. Preferred fungi for use in the present invention include yeast, and more preferably, yeast of the genus, Saccharomyces. A microorganism for use in the fermentation method of the present invention can also be referred to as a production organism. According to the present invention, microalgae can be referred to herein either as microorganisms or as plants.


[0055] A preferred plant to genetically modify according to the present invention is preferably a plant suitable for consumption by animals, including humans. More preferably, such a plant is a plant that naturally produces L-ascorbic acid, although other plants can be genetically modified to produce L-ascorbic acid using the guidance provided herein.


[0056] The L-ascorbic acid production pathways of the microalgae Prototheca and Chlorella pyrenoidosa will be addressed as specific embodiments of the present invention are described below. It will be appreciated that other plants and, in particular, other microorganisms, have similar L-ascorbic acid pathways and genes and proteins having similar structure and function within such pathways. It will also be appreciated that plants and microorganisms which do not naturally produce L-ascorbic acid can be modified according to the present invention to produce L-ascorbic acid. As such, the principles discussed below with regard to Prototheca and Chlorella pyrenoidosa are applicable to other plants and microorganisms, including genetically modified plants and microorganisms.


[0057] In one embodiment of the present invention, the action of an enzyme in the L-ascorbic acid production pathway is increased by amplification of the expression (i.e., overexpression) of an enzyme in the pathway, and particularly, the GDP-D-mannose:GDP-L-galactose epimerase, homologues of the epimerase, and/or enzymes downstream of the epimerase. Overexpression of an enzyme can be accomplished, for example, by introduction of a recombinant nucleic acid molecule encoding the enzyme. It is preferred that the gene encoding an enzyme in the L-ascorbic acid production pathway be cloned under control of an artificial promoter. The promoter can be any suitable promoter that will provide a level of enzyme expression required to maintain a sufficient level of L-ascorbic acid in the production organism. Preferred promoters are constitutive (rather than inducible) promoters, since the need for addition of expensive inducers is therefore obviated. The gene dosage (copy number) of a recombinant nucleic acid molecule according to the present invention can be varied according to the requirements for maximum product formation. In one embodiment, the recombinant nucleic acid molecule encoding a gene in the L-ascorbic acid production pathway is integrated into the chromosomes of the microorganism.


[0058] It is another embodiment of the present invention to provide a microorganism having one or more enzymes in the L-ascorbic acid production pathway with improved affinity for its substrates. An enzyme with improved affinity for its substrates can be produced by any suitable method of genetic modification or protein engineering. For example, computer-based protein engineering can be used to design an epimerase protein with greater stability and better affinity for its substrate. See for example, Maulik et al., 1997, Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc., which is incorporated herein by reference in its entirety.


[0059] Recombinant nucleic acid molecules encoding proteins in the L-ascorbic acid production pathway can be modified to enhance or reduce the function (i.e., activity) of the protein, as desired to increase L-ascorbic acid production, by any suitable method of genetic modification. For example, a recombinant nucleic acid molecule encoding an enzyme can be modified by any method for inserting, deleting, and/or substituting nucleotides, such as by error-prone PCR. In this method, the gene is amplified under conditions that lead to a high frequency of misincorporation errors by the DNA polymerase used for the amplification. As a result, a high frequency of mutations are obtained in the PCR products. The resulting gene mutants can then be screened for enhanced substrate affinity, enhanced enzymatic activity, or reduced/increased inhibitory ability by testing the mutant genes for the ability to confer increased L-ascorbic acid production onto a test microorganism, as compared to a microorganism carrying the non-mutated recombinant nucleic acid molecule.


[0060] Another embodiment of the present invention includes a microorganism in which competitive side reactions are blocked, including all reactions for which GDP-D-mannose is a substrate other than the production of L-ascorbic acid. In a preferred embodiment, a microorganism having complete or partial inactivation (decrease in the action of) of genes encoding enzymes which compete with the GDP-D-mannose:GDP-L-galactose epimerase for the GDP-D-mannose substrate is provided. Such enzymes include GDP-D-mannase and/or GDP-D-mannose-dehydrogenase. As used herein, inactivation of a gene can refer to any modification of a gene which results in a decrease in the activity (i.e., expression or function) of such a gene, including attenuation of activity or complete deletion of activity.


[0061] As discussed above, a particularly preferred aspect of the method to produce L-ascorbic acid by fermentation of a genetically modified microorganism of the present invention includes the step of culturing in a fermentation medium a microorganism having a genetic modification to increase the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. According to the present invention, such an epimerase can include the endogenous GDP-D-mannose:GDP-L-galactose epimerase from the L-ascorbic acid pathway, described above, as well as any other epimerase that has structural homology at the primary (i.e., sequence) or tertiary (i.e., three dimensional) level, to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, or to a UDP-galactose 4-epimerase. Such structural homology is discussed in detail below. Preferably, such an epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose. In one embodiment, the genetic modification includes transformation of the microorganism with a recombinant nucleic acid molecule that expresses such an epimerase.


[0062] Therefore, the epimerase encompassed in the method and organisms of the present invention includes the endogenous epimerase which operates in the naturally occurring ascorbic acid biosynthetic pathway (referred to herein as GDP-D-mannose:GDP-L-galactose epimerase), GDP-4-keto-6-deoxy-D-mannose epimerase/reductases, and any other epimerase which is capable of catalyzing the conversion of GDP-D mannose to GDP-L-galactose and which is structurally homologous to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase. An epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose according the present invention can be identified by biochemical and functional characteristics as well as structural characteristics. For example, an epimerase according to the present invention is capable of acting on GDP-D-mannose as a substrate, and more particularly, such an epimerase is capable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose. It is to be understood that such capabilities need not necessarily be the normal or natural function of the epimerase as it acts in its endogenous (i.e., natural) environment. For example, GDP-4-keto-6-deoxy-D-mannose epimerase/reductase in its natural environment under normal conditions, catalyzes the conversion of GDP-D-mannose to GDP-L-fucose and does not act directly on GDP-D-mannose (See FIG. 8A, B), however, such an epimerase is encompassed by the present invention for use in catalyzing the conversion of GDP-D-mannose to GDP-L-galactose for production of ascorbic acid, to the extent that it is capable of, or can be modified to be capable of, catalyzing the conversion of GDP-D-mannose to GDP-L-galactose. Therefore, the present invention includes epimerases which have the desired enzyme activity for use in production of ascorbic acid, are capable of having such desired enzyme activity, and/or are capable of being modified or induced to have such desired enzyme activity.


[0063] In one embodiment, an epimerase according to the present invention includes an epimerase that catalyzes the reaction depicted in FIG. 7. In another embodiment, an epimerase according to the present invention includes an epimerase that catalyzes the first of the reactions depicted in FIG. 8B. In one embodiment, an epimerase according to the present invention binds to NADPH. In another embodiment, an epimerase according to the present invention is NADPH-dependent for enzyme activity.


[0064] As discussed above, the present inventors have discovered that a key enzyme in L-ascorbic acid biosynthesis in plants and microorganisms is GDP-D-mannose:GDP-L-galactose epimerase (refer to FIGS. 1A-1C). One embodiment of the invention described herein is directed to the manipulation of this enzyme and structural homologues of this enzyme to increase L-ascorbic acid production in genetically engineered plants and/or microorganisms. More particularly, the GDP-D-mannose:GDP-L-galactose epimerase of the L-ascorbic acid pathway and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases are believed to be structurally homologous at both the sequence and tertiary structure level; a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is believed to be capable of functioning in the L-ascorbic acid biosynthetic pathway; and a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or homologue thereof may be superior to a GDP-D-mannose-GDP-L-galactose epimerase for increasing L-ascorbic acid production in genetically engineered plants and/or microorganisms. Furthermore, the present inventors disclose the use of a nucleotide sequence encoding all or part of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase as a probe to identify the gene encoding GDP-D-mannose:GDP-L-galactose epimerase. Similarly, the present inventors disclose the use of a nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose epimerase/reductase to design oligonucleotide primers for use in a PCR-based strategy for identifying and cloning a gene encoding GDP-D-mannose:GDP-L-galactose epimerase.


[0065] Without being bound by theory, the present inventors believe that the following evidence supports the novel concept that the GDP-D-mannose:GDP-L-galactose epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases have significant structural homology at the level of sequence and/or tertiary structure, and that the GDP-4-keto-6-deoxy-D-mannose epimerase/reductases and/or homologues thereof would be useful for production of ascorbic acid and/or for isolating the endogenous GDP-D-mannose:GDP-L-galactose epimerase.


[0066] Although prior to the present invention, it was not known that the GDP-D-mannose:GDP-L-galactose epimerase enzyme (also known as GDP-D-mannose-3,5-epimerase) plays a critical role in L-ascorbic acid biosynthesis, this enzyme was previously described to catalyze the overall reversible reaction between GDP-D-mannose and GDP-L-galactose (Barber, 1971, Arch. Biochem. Biophys. 147:619-623; Barber, 1975, Arch. Biochem. Biophys. 167:718-722; Barber, 1979, J. Biol. Chem. 254:7600-7603; Hebda, et al., 1979, Arch. Biochem. Biophys. 194:496-502; Barber and Hebda, 1982, Meth. Enzymol., 83:522-525). Despite these studies, GDP-D-mannose:GDP-L-galactose epimerase has never been well characterized nor has the gene encoding this enzyme been cloned and sequenced. Since the original work by Barber, GDP-D-mannose:GDP-L-galactose epimerase activity has been detected in the colorless microalga Prototheca moriformis by the assignee of the present application, and in Arabidopsis thaliana and pea embryonic axes (Wheeler, et al., 1998, ibid.).


[0067] Barber (1979, J. Biol. Chem. 254:7600-7603) proposed a mechanism for GDP-D-mannose:GDP-L-galactose epimerase partially purified from the green microalga Chlorella pyrenoidosa. The overall conversion of GDP-D-mannose to GDP-L-galactose was proposed to proceed by oxidation of the hexosyl moiety at C-4 to a keto intermediate, ene-diol formation, and inversion of the configurations at C-3 and C-5 upon rehydration of the double bonds and stereospecific reduction of the keto group. The proposed mechanism is depicted in FIG. 7.


[0068] Based on Barber's work, Feingold and Avigad (1980, In The Biochemistry of Plants, Vol. 3: Carbohydrates; Structure and Function, P. K. Stompf and E. E. Conn, eds., Academic Press, NY) elaborated further on the proposed mechanism for GDP-D-mannose:GDP-L-galactose epimerase. This mechanism is based on the assumption that the epimerase contains tightly bound NAD+, and transfer of a hydride ion from C-4 of the substrate (GDP-D-mannose) to enzyme-associated NAD+ converts the enzyme to the reduced (NADH)form, generating enzyme-bound GDP-4-keto-D-mannose. The latter would then undergo epimerization by an ene-diol mechanism. The final product (GDP-L-galactose) would be released from the enzyme after stereospecific transfer of the hydride ion originally removed from C-4, simultaneously regenerating the oxidized form of the enzyme.


[0069] L-fucose (6-deoxy-L-galactose) is a component of bacterial lipopolysaccharides, mammalian and plant glycoproteins and polysaccharides of plant cell walls. L-fucose is synthesized de novo from GDP-D-mannose by the sequential action of GDP-D-mannose-4,6-dehydratase (an NAD(P)-dependent enzyme), and a bifunctional GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (NADPH-dependent), also referred to in scientific literature as GDP-fucose synthetase (Rizzi, et al., 1998, Structure 6:1453-1465; Somers, et al., 1998, Structure 6:1601-1612). This pathway for L-fucose biosynthesis appears to be ubiquitous (Rizzi, et al., 1998, Structure 6:1453-1465). The mechanisms for GDP-D-mannose-4,6-dehydratase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase are shown in FIG. 8A, B (Chang, et al., 1988, J. Biol. Chem. 263:1693-1697; Barber, 1980, Plant Physiol. 66:326-329).


[0070] Comparison of FIGS. 7 and 8A, B reveals that Barber's proposed mechanism for GDP-D-mannose:GDP-L-galactose epimerase is analogous to the reaction mechanism for GDP-4-keto-6-deoxy-D-mannose epimerase/reductase. The same mechanism has also been demonstrated for the epimerization reaction that occurs in the biosynthesis of two TDP-6-deoxy hexoses, TDP-L-rhamnose and TDP-6-deoxy-L-talose, from TDP-D-glucose (Liu and Thorson, 1994, Ann. Rev. Microbiol. 48:223-256). In the latter cases, however, the final reduction at C-4 is catalyzed by NADPH-dependent reductases that are separate from the epimerase enzyme. These reductases have opposite stereospecificity, providing either TDP-L-rhamnose or TDP-6-deoxy-L-talose (Liu and Thorson, 1994, Ann. Rev. Microbiol. 48:223-256).


[0071] In all of the mechanisms described above, NAD(P)H is required for the final reduction at C-4 (refer to FIG. 8B). In the work of Hebda, et al. (1979, Arch. Biochem. Biophys. 194:496-502), it was reported that GDP-D-mannose:GDP-L-galactose epimerase from C. pyrenoidosa did not require NAD, NADP or NADH for activity. Strangely, NADPH was not tested. Based on the analogous mechanisms shown in FIGS. 7 and 8A, B, the present inventors believe that it is likely that GDP-D-mannose:GDP-L-galactose epimerase from C. pyrenoidosa requires NADPH for the final reduction step. Why activity was detected in vitro without NADPH addition is not known, but tight binding of NADPH to the enzyme could explain this observation. On the other hand, if the proposed mechanism of Feingold and Avigad (1980, in The Biochemistry of Plants, Vol. 3, p. 101-170: Carbohydrates; Structure and Function, P. K. Stompf and E. E. Conn, ed., Academic Press, NY) is correct, the reduced enzyme-bound cofactor generated in the first oxidation step of the epimerase reaction would serve as the source of electrons for the final reduction of the keto group at C-4 back to the alcohol. Thus no addition of exogenous reduced cofactor would be required for activity in vitro.


[0072] Recently, a human gene encoding the bifunctional GDP-4-keto-6-deoxy-D-mannose epimerase/reductase was cloned and sequenced (Tonetti, et al., 1996, J. Biol. Chem. 271-27274-27279). This amino acid sequence of the human GDP-4-keto-6-deoxy-D-mannose epimerase/reductase shows significant homology (29% identity) to the E. coli GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (Tonetti, et al., 1998, Acta Cryst. D54:684-686; Somers, et al., 1998, Structure 6:1601-1612, both of which are incorporated herein by reference in their entireties). Tonetti et al. and Somers et al. additionally disclosed the tertiary (three dimensional) structure of the E. coli GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (also known as GDP-fucose synthetase), and noted significant structural homology with another epimerase, UDP-galactose 4-epimerase (GalE). These epimerases also share significant homology at the sequence level. Since no gene encoding a GDP-D-mannose:GDP-L-galactose epimerase has been cloned and sequenced, homology with genes encoding GDP-4-keto-6-deoxy-D-mannose epimerase/reductases or with genes encoding a UDP-galactose 4-epimerase has not been demonstrated. However, based on the similarity of the reaction products for GDP-D-mannose:GDP-L-galactose epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (i.e., GDP-L-galactose and GDP-6-deoxy-L-galactose [i.e., GDP-L-fucose], respectively) and the common catalytic mechanisms (FIGS. 7 and 8A, B) the present inventors believe that the genes encoding the enzymes will have a high degree of sequence homology, as well as tertiary structural homology.


[0073] Significant structural homology between GDP-D-mannose:GDP-L-galactose epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductases may allow a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, or a homologue thereof, to function in the L-ascorbic acid biosynthetic pathway, and a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase could potentially be even better than a GDP-D-mannose-GDP-L-galactose epimerase for increasing L-ascorbic acid production in genetically engineered plants and/or microorganisms. Furthermore, a nucleotide sequence encoding all or part of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can be used as a probe to identify the gene encoding GDP-D-mannose:GDP-L-galactose epimerase. Likewise, the nucleotide sequence of the gene encoding GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can be used to design oligonucleotide primers for use in a PCR-based strategy for identifying and cloning a gene encoding GDP-D-mannose:GDP-L-galactose epimerase.


[0074] The ability to substitute GDP-4-keto-6-D-mannose epimerase/reductase for GDP-D-mannose:GDP-L-galactose epimerase to enhance L-ascorbic acid biosynthesis in plants or microorganisms depends on the ability of GDP-4-keto-6-deoxy-D-mannose epimerase/reductase to act directly on GDP-D-mannose to form GDP-L-galactose. Evidence supporting this possibility already exists. Arabidopsis thaliana murl mutants are defective in GDP-D-mannose-4,6-dehydratase activity (Bonin, et al., 1997, Proc. Natl. Acad. Sci. 94:2085-2090). These mutants are thus blocked in GDP-L-fucose biosynthesis, and consequently have less than 2% of the normal amounts of L-fucose in the primary cell walls of aerial portions of the plant (Zablackis, et al., 1996, Science 272:1808-1810). The murl mutants are more brittle than wild-type plants, are slightly dwarfed and have an apparently normal life cycle (Zablackis, et al., 272:1808-1810). When murl mutants are grown in the presence of exogenous L-fucose, the L-fucose content in the plant is restored to the wild-type state (Bonin, et al., 1997, Proc. Natl. Acad. Sci. 94:2085-2090). It was discovered (Zablackis, et al., 1996, Science 272:1808-1810) that murl mutants contain, in the hemicellulose xyloglucan component of the primary cell wall, L-galactose in place of the normal L-fucose. L-galactose is not normally found in the xyloglucan component, but in murl mutants L-galactose partly replaces the terminal L-fucosyl residue. Bonin, et al. (1997, Proc. Natl. Acad. Sci. 94:2085-2090) hypothesized that in the absence of a functional GDP-D-mannose-4,6-dehydratase in the murl mutants, the GDP-4-keto-6-deoxy-D-mannose epimerase/reductase normally involved in L-fucose synthesis may be able to use GDP-D-mannose directly, forming GDP-L-galactose. Another possibility, however, is that the enzymes involved in L-ascorbic acid biosynthesis in A. thaliana are responsible for forming GDP-L-galactose in the murl mutant. If this were true, it would suggest that in the wild-type plant, some mechanism exists that prevents GDP-L-galactose formed in the L-ascorbic acid pathway from entering cell wall biosynthesis and substituting for (competing with) GDP-L-fucose for incorporation into the xyloglucan component (since L-galactose is not present in the primary cell wall of the wild-type plant).


[0075] Because of the similar reaction mechanisms of GDP-D-mannose:GDP-L-galactose epimerase and GDP-4-keto-6-deoxy-D-mannose epimerase/reductase, and because of the evidence that GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can act directly on GDP-D-mannose to form GDP-L-galactose, the present inventors believe that genes encoding all epimerases and epimerase/reductases that act on GDP-D-mannose have high homology. As such, one aspect of the present invention relates to the use of any epimerase (and nucleic acid sequences encoding such epimerase) having significant homology (at the primary, secondary and/or tertiary structure level) to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or to a UDP-galactose 4-epimerase for the purpose of improving the biosynthetic production of L-ascorbic acid.


[0076] Therefore, as described above, one embodiment of the present invention relates to a method for producing ascorbic acid or esters thereof in a microorganism, which includes culturing a microorganism having a genetic modification to increase the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Also included in the present invention are genetically modified microorganisms and plants in which the genetic modification increases the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose.


[0077] According to the present invention, an increase in the action of the GDP-D-mannose:GDP-L-galactose epimerase in the L-ascorbic acid production pathway can be achieved by genetic modifications which include, but are not limited to overexpression of the GDP-D-mannose:GDP-L-galactose epimerase gene, a homologue of such gene, or of any recombinant nucleic acid sequence encoding an epimerase that is homologous in primary (nucleic acid or amino acid sequence) or tertiary (three dimensional protein) structure to a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase, such as by overexpression of a recombinant nucleic acid molecule encoding the epimerase gene or a homologue thereof, and/or by mutation of the endogenous or recombinant gene to enhance expression of the gene.


[0078] According to the present invention, an epimerase that has a tertiary structure that is homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is an epimerase that has a tertiary structure that substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws (Table 12). In another embodiment, an epimerase that has a tertiary structure that is homologous to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is an epimerase that has a tertiary structure that substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1GFS. As used herein, a “tertiary structure” or “three dimensional structure” of a protein, such terms being interchangeable, refers to the components and the manner of arrangement of the components in three dimensional space to constitute the protein. The use of the term “substantially conforms” refers to at least a portion of a tertiary structure of an epimerase which is sufficiently spatially similar to at least a portion of a specified three dimensional configuration of a particular set of atomic coordinates (e.g., those represented by Brookhaven Protein Data Bank Accession Code 1bws) to allow the tertiary structure of at least said portion of the epimerase to be modeled or calculated (i.e., by molecular replacement) using the particular set of atomic coordinates as a basis for estimating the atomic coordinates defining the three dimensional configuration of the epimerase.


[0079] More particularly, a tertiary structure that substantially conforms to a given set of atomic coordinates is a structure having an average root-mean-square deviation (RMSD) of less than about 2.5 Å, and more preferably, less than about 2 Å, and, in increasing preference, less than about 1.5 Å, less than about 1 Å, less than about 0.5 Å, and most preferably, less than about 0.3 Å, over at least about 25% of the Cα positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. In other embodiments, a structure that substantially conforms to a given set of atomic coordinates is a structure wherein such structure has the recited average root-mean-square deviation (RMSD) value over at least about 50% of the Cα positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, such structure has the recited average root-mean-square deviation (RMSD) value over at least about 75% of the Cα positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, such structure has the recited average root-mean-square deviation (RMSD) value over about 100% of the Cα positions as compared to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. Methods to calculate RMSD values are well known in the art. Various software programs for determining the tertiary structural homology between one or more proteins are known in the art and are publicly available, such as QUANTA (Molecular Simulations Inc.).


[0080] A preferred epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose according to the method and genetically modified organisms of the present invention includes an epimerase that comprises a substrate binding site having a tertiary structure that substantially conforms to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. Preferably, the tertiary structure of the substrate binding site of the epimerase has an average root-mean-square deviation (RMSD) of less than about 2.5 Å, and more preferably, less than about 2 Å, and, in increasing preference, less than about 1.5 Å, less than about 1 Å, less than about 0.5 Å, and most preferably, less than about 0.3 Å, over at least about 25% of the Cα positions as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. In other embodiments, the tertiary structure of the substrate binding site of the epimerase has the recited average root-mean-square deviation (RMSD) value over at least about 50% of the Cα positions as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, the tertiary structure of the substrate binding site of the epimerase has the recited average root-mean-square deviation (RMSD) value over at least about 75% of the Cα positions as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, the tertiary structure of the substrate binding site of the epimerase has the recited average root-mean-square deviation (RMSD) value over about 100% of the Cα positions as compared to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. The tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1GFS is discussed in detail in Somers et al., 1998, ibid.


[0081] Another preferred epimerase according to the present invention includes an epimerase that comprises a catalytic site having a tertiary structure that substantially conforms to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. Preferably, the tertiary structure of the catalytic site of the epimerase has an average root-mean-square deviation (RMSD) of less than about 2.5 Å, and more preferably, less than about 2 Å, and, in increasing preference, less than about 1.5 Å, less than about 1 Å, less than about 0.5 Å, and most preferably, less than about 0.3 Å, over at least about 25% of the Cα positions as compared to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws. In other embodiments, the tertiary structure of the catalytic site of the epimerase has the recited average root-mean-square deviation (RMSD) value over at least about 50% of the Cα positions as compared to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, the tertiary structure of the catalytic site of the epimerase has the recited average root-mean-square deviation (RMSD) value over at least about 75% of the Cα positions as compared to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws, and in another embodiment, the tertiary structure of the catalytic site of the epimerase has the recited average root-mean-square deviation (RMSD) value over 100% of the Cα positions as compared to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.


[0082] In one embodiment, an epimerase encompassed by the present invention includes an epimerase that has a catalytic site which includes amino acid residues: serine, tyrosine and lysine. In a preferred embodiment, the tertiary structure positions of the amino acid residues serine, tyrosine and lysine substantially conform to the tertiary structure position of residues Ser107, Tyr136 and Lys140, respectively, as represented by atomic coordinates in Brookhaven Protein Data Bank Accession Code 1bws. The tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws is discussed in detail in Rizzi et al., 1998, ibid. Additionally, the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by the atomic coordinates having Brookhaven Protein Data Bank Accession Code 1GFS is discussed in detail in Somers et al., 1998, ibid.


[0083] In an even more preferred embodiment, the above definition of “substantially conforms” can be further defined to include atoms of amino acid side chains. As used herein, the phrase “common amino acid side chains” refers to amino acid side chains that are common to both the structures which substantially conforms to a given set of atomic coordinates and the structure that is actually represented by such atomic coordinates. Preferably, a tertiary structure that substantially conforms to a given set of atomic coordinates is a structure having an average root-mean-square deviation (RMSD) of less than about 2.5 Å, and more preferably, less than about 2 Å, and, in increasing preference, less than about 1.5 Å, less than about 1 Å, less than about 0.5 Å, and most preferably, less than about 0.3 Å over at least about 25% of the common amino acid side chains as compared to the tertiary structure represented by the given set of atomic coordinates. In another embodiment, a structure that substantially conforms to a given set of atomic coordinates is a structure having the recited average root-mean-square deviation (RMSD) value over at least about 50% of the common amino acid side chains as compared to the tertiary structure represented by the given set of atomic coordinates, and in another embodiment, such structure has the recited average root-mean-square deviation (RMSD) value over at least about 75% of the common amino acid side chains as compared to the tertiary structure represented by the given set of atomic coordinates, and in another embodiment, such a structure has the recited average root-mean-square deviation (RMSD) value over 100% of the common amino acid side chains as compared to the tertiary structure represented by the given set of atomic coordinates.


[0084] A tertiary structure of an epimerase which substantially conforms to a specified set of atomic coordinates can be modeled by a suitable modeling computer program such as MODELER (A. Sali and T. L. Blundell, J. Mol. Biol., vol. 234:779-815, 1993 as implemented in the Insight II Homology software package (Insight II (97.0), MSI, San Diego)), using information, for example, derived from the following data: (1) the amino acid sequence of the epimerase; (2) the amino acid sequence of the related portion(s) of the protein represented by the specified set of atomic coordinates having a three dimensional configuration; and, (3) the atomic coordinates of the specified three dimensional configuration. Alternatively, a tertiary structure of an epimerase which substantially conforms to a specified set of atomic coordinates can be modeled using data generated from analysis of a crystallized structure of the epimerase. A tertiary structure of an epimerase which substantially conforms to a specified set of atomic coordinates can also be calculated by a method such as molecular replacement. Methods of molecular replacement are generally known by those of skill in the art (generally described in Brunger, Meth. Enzym., vol. 276, pp. 558-580, 1997; Navaza and Saludjian, Meth. Enzym., vol. 276, pp. 581-594, 1997; Tong and Rossmann, Meth. Enzym., vol. 276, pp. 594-611, 1997; and Bentley, Meth. Enzym., vol. 276, pp. 611-619, 1997, each of which are incorporated by this reference herein in their entirety) and are performed in a software program including, for example, XPLOR (Brunger, et al., Science, vol. 235, p. 458, 1987). In addition, a structure can be modeled using techniques generally described by, for example, Sali, Current opinions in Biotechnology, vol. 6, pp. 437-451, 1995, and algorithms can be implemented in program packages such as Homology 95.0 (in the program Insight II, available from Biosym/MSI, San Diego, Calif.). Use of Homology 95.0 requires an alignment of an amino acid sequence of a known structure having a known three dimensional structure with an amino acid sequence of a target structure to be modeled. The alignment can be a pairwise alignment or a multiple sequence alignment including other related sequences (for example, using the method generally described by Rost, Meth. Enzymol., vol. 266, pp. 525-539, 1996) to improve accuracy. Structurally conserved regions can be identified by comparing related structural features, or by examining the degree of sequence homology between the known structure and the target structure. Certain coordinates for the target structure are assigned using known structures from the known structure. Coordinates for other regions of the target structure can be generated from fragments obtained from known structures such as those found in the Protein Data Bank maintained by Brookhaven National Laboratory, Upton, N.Y. Conformation of side chains of the target structure can be assigned with reference to what is sterically allowable and using a library of rotamers and their frequency of occurrence (as generally described in Ponder and Richards, J. Mol. Biol., vol. 193, pp. 775-791, 1987). The resulting model of the target structure, can be refined by molecular mechanics (such as embodied in the program Discover, available from Biosym/MSI) to ensure that the model is chemically and conformationally reasonable.


[0085] According to the present invention, an epimerase that has a nucleic acid sequence that is homologous at the primary structure level (i.e., is a homologue of) to a nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase includes any epimerase encoded by a nucleic acid sequence that is at least about 15%, and preferably at least about 20%, and more preferably at least about 25%, and even more preferably, at least about 30% identical to a nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase, and preferably to a nucleic acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9. Similarly, an epimerase that has an amino acid sequence that is homologous to an amino acid sequence of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase includes any epimerase having an amino acid sequence that is at least about 15%, and preferably at least about 20%, and more preferably at least about 25%, and even more preferably, at least about 30% identical to an amino acid sequence of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase or a UDP-galactose 4-epimerase, and preferably to an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ ID NO:10.


[0086] According to one embodiment of the present invention, homology or percent identity between two or more nucleic acid or amino acid sequences is performed using methods known in the art for aligning and/or calculating percentage identity. To compare the homology/percent identity between two or more sequences as set forth above, for example, a module contained within DNASTAR (DNASTAR, Inc., Madison, Wis.) can be used. In particular, to calculate the percent identity between two nucleic acid or amino acid sequences, the Lipman-Pearson method, provided by the MegAlign module within the DNASTAR program, is preferably used, with the following parameters, also referred to herein as the Lipman-Pearson standard default parameters:


[0087] (1) Ktuple=2;


[0088] (2) Gap penalty=4;


[0089] (3) Gap length penalty=12.


[0090] Using the Lipman-Pearson method with these parameters, for example, the percent identity between the amino acid sequence for E. coli GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (SEQ ID NO:4) and human GDP-4-keto-6-deoxy-D-mannose epimerase/reductase (FX) (SEQ ID NO:6) is 27.7%, which is comparable to the 27% identity described for these enzymes in Tonetti et al., 1998, Acta Cryst. D54:684-686.


[0091] According to another embodiment of the present invention, to align two or more nucleic acid or amino acid sequences, for example to generate a consensus sequence or evaluate the similarity at various positions between such sequences, a CLUSTAL alignment program (e.g., CLUSTAL, CLUSTAL V, CLUSTAL W), also available as a module within the DNASTAR program, can be used using the following parameters, also referred to herein as the CLUSTAL standard default parameters:


[0092] Multiple Alignment Parameters (i.e., for more than 2 sequences):


[0093] (1) Gap penalty=10;


[0094] (2) Gap length penalty=10;


[0095] Pairwise Alignment Parameters (i.e., for two sequences):


[0096] (1) Ktuple=1;


[0097] (2) Gap penalty=3;


[0098] (3) Window=5;


[0099] (4) Diagonals saved=5.


[0100] According to the present invention, a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase can be a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from any organism, including Arabidopsis thaliana, Escherichia coli, and human. A nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from Arabidopsis thaliana is represented herein by SEQ ID NO:1. SEQ ID NO:1 encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence represented herein as SEQ ID NO:2. A nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from Escherichia coli is represented herein by SEQ ID NO:3. SEQ ID NO:3 encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence represented herein as SEQ ID NO:4. A nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase from homo sapiens is represented herein by SEQ ID NO:5. SEQ ID NO:5 encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase having an amino acid sequence represented herein as SEQ ID NO:6.


[0101] According to the present invention, a UDP-galactose 4-epimerase can be a UDP-galactose 4-epimerase from any organism, including Escherichia coli and human. A nucleic acid sequence encoding a UDP-galactose 4-epimerase from Escherichia coli is represented herein by SEQ ID NO:7. SEQ ID NO:7 encodes a UDP-galactose 4-epimerase having an amino acid sequence represented herein as SEQ ID NO:8. A nucleic acid sequence encoding a UDP-galactose 4-epimerase from homo sapiens is represented herein by SEQ ID NO:9. SEQ ID NO:9 encodes a UDP-galactose 4-epimerase having an amino acid sequence represented herein as SEQ ID NO:10.


[0102] In a preferred embodiment, an epimerase encompassed by the present invention has an amino acid sequence that aligns with the amino acid sequence of SEQ ID NO:11, for example using a CLUSTAL alignment program, wherein amino acid residues in the amino acid sequence of the epimerase align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO:11, and preferably at least about 75% of non-Xaa residues in SEQ ID NO:11, and more preferably, at least about 90% of non-Xaa residues in SEQ ID NO:11, and even more preferably 100% of non-Xaa residues in SEQ ID NO:11. The percent identity of residues aligning with 100% identity with non-Xaa residues can be simply calculated by dividing the number of 100% identical matches at non-Xaa residues in SEQ ID NO:11 by the total number of non-Xaa residues in SEQ ID NO:11. A preferred nucleic acid sequence encoding an epimerase encompassed by the present invention include a nucleic acid sequence encoding an epimerase having an amino acid sequence with the above described identity to SEQ ID NO:11. Such an alignment using a CLUSTAL alignment program is based on the same parameters as previously disclosed herein. SEQ ID NO:11 represents a consensus amino acid sequence of an epimerase which was derived by aligning at least portions of amino acid sequences SEQ ID NO: 4, SEQ ID NO: 6 and SEQ ID NO: 8, as described in Somers et al., 1998, Structure 6:1601-1612, and can be approximately duplicated using CLUSTAL.


[0103] In another embodiment, an epimerase encompassed by the present invention includes an epimerase that has a catalytic site which includes amino acid residues: serine, tyrosine and lysine. Preferably, such serine, tyrosine and lysine residues are located at positions in the epimerase amino acid sequence which align using a CLUSTAL alignment program with positions Ser105, Tyr134 and Lys138 of consensus sequence SEQ ID NO:11, with positions Ser109, Tyr138 and Lys142 of sequence SEQ ID NO:2, with positions Ser107, Tyr136 and Lys140 of SEQ ID NO:4, with positions Ser114, Tyr143 and Lys147 of sequence SEQ ID NO:6, with positions Ser124, Tyr149 and Lys153 of sequence SEQ ID NO:8 or with positions Ser132, Tyr157 and Lys161 of sequence SEQ ID NO:10.


[0104] In another embodiment, an epimerase that has an amino acid sequence that is homologous to an amino acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase includes any epimerase that has an amino acid motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly, which is found, for example in positions 8 through 14 of the consensus sequence SEQ ID NO:11, in positions 12 through 18 of SEQ ID NO:2, in positions 10 through 16 of SEQ ID NO:4, in positions 14 through 20 of SEQ ID NO:6, in positions 7 through 13 of SEQ ID NO:8, and in positions 9 through 15 of SEQ ID NO:10. Such a motif can be identified by its alignment with the same motif in the above-identified amino acid sequences using a CLUSTAL alignment program. Preferably, such motif is located within the first 25 N-terminal amino acids of the amino acid sequence of the epimerase.


[0105] In yet another embodiment, an epimerase encompassed by the present invention includes an epimerase that has a substrate binding site which includes amino acid residues that align using a CLUSTAL alignment program with at least 50% of amino acid positions Asn177, Ser178, Arg187, Arg209, Lys283, Asn165, Ser107, Ser108, Cys109, Asn133, Tyr136 and His179 of SEQ ID NO:4. Alignment with positions Ser107, Tyr136, Asn165, Arg209, is preferably with 100% identity (i.e., exact match of residue, under parameters for alignment).


[0106] In another embodiment of the present invention, an epimerase encompassed by the present invention comprises at least 4 contiguous amino acid residues having 100% identity with at least 4 contiguous amino acid residues of an amino acid sequence selected from the group of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 or SEQ ID NO:10, as determined using a Lipman-Pearson method with Lipman-Pearson standard default parameters or by comparing an alignment using a CLUSTAL program with CLUSTAL standard default parameters. According to the present invention, the term “contiguous” means to be connected in an unbroken sequence. For a first sequence to have “100% identity” with a second sequence means that the first sequence exactly matches the second sequence with no gaps between nucleotides or amino acids.


[0107] In another embodiment of the present invention, an epimerase encompassed by the present invention is encoded by a nucleic acid sequence that comprises at least 12 contiguous nucleic acid residues having 100% identity with at least 12 contiguous nucleic acid residues of a nucleic acid sequence selected from the group of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:10, as determined using a Lipman-Pearson method with Lipman-Pearson standard default parameters or by comparing an alignment using a CLUSTAL program with CLUSTAL standard default parameters.


[0108] In another embodiment of the present invention, an epimerase encompassed by the present invention is encoded by a nucleic acid sequence that hybridizes under stringent hybridization conditions to a nucleic acid sequence selected from the group of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9. As used herein, stringent hybridization conditions refer to standard hybridization conditions under which nucleic acid molecules are used to identify similar nucleic acid molecules. Such standard conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press, 1989. Sambrook et al., ibid., is incorporated by reference herein in its entirety (see specifically, pages 9.31-9.62). In addition, formulae to calculate the appropriate hybridization and wash conditions to achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, for example, in Meinkoth et al., 1984, Anal. Biochem. 138, 267-284; Meinkoth et al., ibid., is incorporated by reference herein in its entirety.


[0109] More particularly, stringent hybridization and washing conditions, as referred to herein, refer to conditions which permit isolation of nucleic acid molecules having at least about 70% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction, more particularly at least about 75%, and most particularly at least about 80%. Such conditions will vary, depending on whether DNA:RNA or DNA:DNA hybrids are being formed. Calculated melting temperatures for DNA:DNA hybrids are 10° C. less than for DNA:RNA hybrids. In particular embodiments, stringent hybridization conditions for DNA:DNA hybrids include hybridization at an ionic strength of 6X SSC (0.9 M Na+) at a temperature of between about 20° C. and about 35° C., more preferably, between about 28° C. and about 40° C., and even more preferably, between about 35° C. and about 45° C. In particular embodiments, stringent hybridization conditions for DNA:RNA hybrids include hybridization at an ionic strength of 6X SSC (0.9 M Na+) at a temperature of between about 30° C. and about 45° C., more preferably, between about 38° C. and about 50° C., and even more preferably, between about 45° C. and about 55° C. These values are based on calculations of a melting temperature for molecules larger than about 100 nucleotides, 0% formamide and a G+C content of about 40%. Alternatively, Tm can be calculated empirically as set forth in Sambrook et al., supra, pages 9.31 to 9.62.


[0110] In another embodiment of the present invention, an epimerase encompassed by the present invention is encoded by a nucleic acid sequence that comprises a nucleic acid sequence selected from the group of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 or SEQ ID NO:9 or a fragment thereof, wherein the fragment encodes a protein that is capable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose, such as under physiological conditions. In another embodiment, an epimerase encompassed by the present invention comprises an amino acid sequence selected from the group of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10 or a fragment thereof, wherein the fragment is capable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose. It is to be understood that the nucleic acid sequence encoding the amino acid sequences identified herein can vary due to degeneracies. As used herein, nucleotide degeneracies refers to the phenomenon that one amino acid can be encoded by different nucleotide codons.


[0111] One embodiment of the present invention relates to a method to identify an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose. Preferably, such a method is useful for identifying the GDP-D-mannose:GDP-L-galactose epimerase which catalyzes the conversion of GDP-D-mannose to GDP-L-galactose in the endogenous (i.e., naturally occurring L-ascorbic acid biosynthetic pathway of microorganisms and/or plants). Such a method can include the steps of: (a) contacting a source of nucleic acid molecules with an oligonucleotide at least about 12 nucleotides in length under stringent hybridization conditions, wherein the oligonucleotide is identified by its ability to hybridize under stringent hybridization conditions to a nucleic acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3 and SEQ ID NO:5; and, (b) identifying nucleic acid molecules from the source of nucleic acid molecules which hybridize under stringent hybridization conditions to the oligonucleotide. Nucleic acid molecules identified by this method can then be isolated from the source using standard molecular biology techniques. Preferably, the source of nucleic acid molecules is obtained from a microorganism or plant that has an ascorbic acid production pathway. Such a source of nucleic acid molecules can be any source of nucleic acid molecules which can be isolated from an organism and/or which can be screened by hybridization with an oligonucleotide such as a probe or a PCR primer. Such sources include genomic and cDNA libraries and isolated RNA.


[0112] In order to screen cDNA libraries from different organisms and to isolate nucleic acid molecules encoding enzymes such as the GDP-D-mannose:GDP-L-galactose epimerase and related epimerases, one can use any of a variety of standard molecular and biochemical techniques. For example, oligonucleotide primers, preferably degenerate primers, can be designed using the most conserved regions of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase nucleic acid sequence, and such primers can be used in a polymerase chain reaction (PCR) protocol to amplify the same or related epimerases, including the GDP-D-mannose:GDP-L-galactose epimerase from the ascorbic acid pathway, from nucleic acids (e.g., genomic or cDNA libraries) isolated from a desired organism (e.g., a microorganism or plant having an L-ascorbic acid pathway). Similarly, oligonucleotide probes can be designed using the most conserved regions of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase nucleic acid sequence and such probe can be used to identify and isolate nucleic acid molecules, such as from a genomic or cDNA library, that hybridize under conditions of low, moderate, or high stringency with the probe.


[0113] Alternatively, the GDP-D-mannose:GDP-L-galactose epimerase can be purified from an organism such as Prototheca, the N-terminal amino acid sequence can be determined (including the sequence of internal peptide fragments), and this information can be used to design degenerate primers for amplifying a gene fragment from the organism cDNA. This fragment would then be used to probe the cDNA library, and subsequently fragments that hybridize to the probe would be cloned in that organism or another suitable production organism. There is ample precedent for plant enzymes being expressed in an active form in bacteria, such as E. coli. Alternatively, yeast are also a suitable candidate for developing a heterologous system for L-ascorbic acid production.


[0114] As discussed above in general for increasing the action of an enzyme in the L-ascorbic acid pathway according to the present invention, in one embodiment of the present invention, the action of an epimerase that catalyzes the conversion of GDP-D-mannose to GDP-L-galactose is increased by amplification of the expression (i.e., overexpression) of such an epimerase. Overexpression of an epimerase can be accomplished, for example, by introduction of a recombinant nucleic acid molecule encoding the epimerase. It is preferred that the gene encoding an epimerase according to the present invention be cloned under control of an artificial promoter. The promoter can be any suitable promoter that will provide a level of epimerase expression required to maintain a sufficient level of L-ascorbic acid in the production organism. Preferred promoters are constitutive (rather than inducible) promoters, since the need for addition of expensive inducers is therefore obviated. The gene dosage (copy number) of a recombinant nucleic acid molecule according to the present invention can be varied according to the requirements for maximum product formation. In one embodiment, the recombinant nucleic acid molecule encoding an epimerase according to the present invention is integrated into the chromosome of the microorganism.


[0115] It is another embodiment of the present invention to provide a microorganism having one or more epimerases according to the present invention with improved affinity for its substrate. An epimerase with improved affinity for its substrate can be produced by any suitable method of genetic modification or protein engineering. For example, computer-based protein engineering can be used to design an epimerase protein with greater stability and better affinity for its substrate. See for example, Maulik et al., 1997, Molecular Biotechnology: Therapeutic Applications and Strategies, Wiley-Liss, Inc., which is incorporated herein by reference in its entirety.


[0116] As noted above, in the method for production of L-ascorbic acid of the present invention, a microorganism having a genetically modified L-ascorbic acid production pathway is cultured in a fermentation medium for production of L-ascorbic acid. An appropriate, or effective, fermentation medium refers to any medium in which a genetically modified microorganism of the present invention, when cultured, is capable of producing L-ascorbic acid. Such a medium is typically an aqueous medium comprising assimilable carbon, nitrogen and phosphate sources. Such a medium can also include appropriate salts, minerals, metals and other nutrients. One advantage of genetically modifying a microorganism as described herein is that although such genetic modifications can significantly alter the production of L-ascorbic acid, they can be designed such that they do not create any nutritional requirements for the production organism. Thus, a minimal-salts medium containing glucose as the sole carbon source can be used as the fermentation medium. The use of a minimal-salts-glucose medium for the L-ascorbic acid fermentation will also facilitate recovery and purification of the L-ascorbic acid product.


[0117] In one mode of operation of the present invention, the carbon source concentration, such as the glucose concentration, of the fermentation medium is monitored during fermentation. Glucose concentration of the fermentation medium can be monitored using known techniques, such as, for example, use of the glucose oxidase enzyme test or high pressure liquid chromatography, which can be used to monitor glucose concentration in the supernatant, e.g., a cell-free component of the fermentation medium. As stated previously, the carbon source concentration should be kept below the level at which cell growth inhibition occurs. Although such concentration may vary from organism to organism, for glucose as a carbon source, cell growth inhibition occurs at glucose concentrations greater than at about 60 g/L, and can be determined readily by trial. Accordingly, when glucose is used as a carbon source the glucose concentration in the fermentation medium is maintained in the range of from about 1 g/L to about 100 g/L, more preferably in the range of from about 2 g/L to about 50 g/L, and yet more preferably in the range of from about 5 g/L to about 20 g/L. Although the carbon source concentration can be maintained within desired levels by addition of, for example, a substantially pure glucose solution, it is preferred to maintain the carbon source concentration of the fermentation medium by addition of aliquots of the original fermentation medium. The use of aliquots of the original fermentation medium are desirable because the concentrations of other nutrients in the medium (e.g. the nitrogen and phosphate sources) can be maintained simultaneously. Likewise, the trace metals concentrations can be maintained in the fermentation medium by addition of aliquots of the trace metals solution.


[0118] In an embodiment of the fermentation process of the present invention, a fermentation medium is prepared as described above. This fermentation medium is inoculated with an actively growing culture of genetically modified microorganisms of the present invention in an amount sufficient to produce, after a reasonable growth period, a high cell density. Typical inoculation cell densities are within the range of from about 0.1 g/L to about 15 g/L, preferably from about 0.5 g/L to about 10 g/L and more preferably from about 1 g/L to about 5 g/L, based on the dry weight of the cells. The cells are then grown to a cell density in the range of from about 10 g/L to about 100 g/L preferably from about 20 g/L to about 80 g/L, and more preferably from about 50 g/L to about 70 g/L. The residence times for the microorganisms to reach the desired cell densities during fermentation are typically less than about 200 hours, preferably less than about 120 hours, and more preferably less than about 96 hours.


[0119] The microorganisms useful in the method of the present invention can be cultured in conventional fermentation modes, which include, but are not limited to, batch, fed-batch, and continuous. It is preferred, however, that the fermentation be carried out in fed-batch mode. In such a case, during fermentation some of the components of the medium are depleted. It is possible to initiate fermentation with relatively high concentrations of such components so that growth is supported for a period of time before additions are required. The preferred ranges of these components are maintained throughout the fermentation by making additions as levels are depleted by fermentation. Levels of components in the fermentation medium can be monitored by, for example, sampling the fermentation medium periodically and assaying for concentrations. Alternatively, once a standard fermentation procedure is developed, additions can be made at timed intervals corresponding to known levels at particular times throughout the fermentation. As will be recognized by those in the art, the rate of consumption of nutrient increases during fermentation as the cell density of the medium increases. Moreover, to avoid introduction of foreign microorganisms into the fermentation medium, addition is performed using aseptic addition methods, as are known in the art. In addition, a small amount of anti-foaming agent may be added during the fermentation.


[0120] The present inventors have determined that high levels of magnesium in the fermentation medium inhibits the production of L-ascorbic acid due to repression of enzymes early in the production pathway, although enzymes late in the pathway (i.e., from L-galactose to L-ascorbic acid) are not negatively affected (See Examples). Therefore, in a preferred embodiment of the method of the present invention, the step of culturing is carried out in a fermentation medium that is magnesium (Mg2+) limited. Even more preferably, the fermentation is magnesium limited during the cell growth phase. Preferably, the fermentation medium comprises less than about 0.5 g/L of Mg2+ during the cell growth phase of fermentation, and even more preferably, less than about 0.2 g/L of Mg2+, and even more preferably, less than about 0.1 g/L of Mg2+.


[0121] The temperature of the fermentation medium can be any temperature suitable for growth and ascorbic acid production, and may be modified according to the growth requirements of the production microorganism used. For example, prior to inoculation of the fermentation medium with an inoculum, the fermentation medium can be brought to and maintained at a temperature in the range of from about 20° C. to about 45° C., preferably to a temperature in the range of from about 25° C. to about 40° C., and more preferably in the range of from about 30° C. to about 38° C.


[0122] It is a further embodiment of the present invention to supplement and/or control other components and parameters of the fermentation medium, as necessary to maintain and/or enhance the production of L-ascorbic acid by a production organism. For example, in one embodiment, the pH of the fermentation medium is monitored for fluctuations in pH. In the fermentation method of the present invention, the pH is preferably maintained at a pH of from about pH 6.0 to about pH 8.0, and more preferably, at about pH 7.0. In the method of the present invention, if the starting pH of the fermentation medium is pH 7.0, the pH of the fermentation medium is monitored for significant variations from pH 7.0, and is adjusted accordingly, for example, by the addition of sodium hydroxide. In a preferred embodiment of the present invention, genetically modified microorganisms useful for production of L-ascorbic acid include acid-tolerant microorganisms. Such microorganisms include, for example, microalgae of the genera Prototheca and Chlorella (See U.S. Pat. No. 5,792,631, ibid. and U.S. Pat. No. 5,900,370, ibid.).


[0123] The production of ascorbic acid by culturing acid-tolerant microorganisms provides significant advantages over known ascorbic acid production methods. One such advantage is that such organisms are acidophilic, allowing fermentation to be carried out under low pH conditions, with the fermentation medium pH typically less than about 6. Below this pH, extracellular ascorbic acid produced by the microorganism during fermentation is relatively stable because the rate of oxidation of ascorbic acid in the fermentation medium by oxygen is reduced. Accordingly, high productivity levels can be obtained for producing L-ascorbic acid with acid-tolerant microorganisms according to the methods of the present invention. In addition, control of the dissolved oxygen content to very low levels to avoid oxidation of ascorbic acid is unnecessary. Moreover, this advantage allows for the use of continuous recovery methods because extracellular medium can be treated to recover the ascorbic acid product.


[0124] Thus, the present method can be conducted at low pH when acid-tolerant microorganisms are used as production organisms. The benefit of this process is that at low pH, extracellular ascorbic acid produced by the organism is degraded at a reduced rate than if the fermentation medium was at higher pH. For example, prior to inoculation of the fermentation medium with an inoculum, the pH of the fermentation medium can be adjusted, and further monitored during fermentation. Typically, the pH of the fermentation medium is brought to and maintained below about 6, preferably below 5.5, and more preferably below about 5. The pH of the fermentation medium can be controlled by the addition of ammonia to the fermentation medium. In such cases when ammonia is used to control pH, it also conveniently serves as a nitrogen source in the fermentation medium.


[0125] The fermentation medium can also be maintained to have a dissolved oxygen content during the course of fermentation to maintain cell growth and to maintain cell metabolism for L-ascorbic acid formation. The oxygen concentration of the fermentation medium can be monitored using known methods, such as through the use of an oxygen probe electrode. Oxygen can be added to the fermentation medium using methods known in the art, for example, through agitation and aeration of the medium by stirring or shaking. Preferably, the oxygen concentration in the fermentation medium is in the range of from about 20% to about 100% of the saturation value of oxygen in the medium based upon the solubility of oxygen in the fermentation medium at atmospheric pressure and at a temperature in the range of from about 30° C. to about 40° C. Periodic drops in the oxygen concentration below this range may occur during fermentation, however, without adversely affecting the fermentation.


[0126] The genetically modified microorganisms of the present invention are engineered to produce significant quantities of extracellular L-ascorbic acid. Extracellular L-ascorbic acid can be recovered from the fermentation medium using conventional separation and purification techniques. For example, the fermentation medium can be filtered or centrifuged to remove microorganisms, cell debris and other particulate matter, and L-ascorbic acid can be recovered from the cell-free supernate by conventional methods, such as, for example, ion exchange, chromatography, extraction, solvent extraction, membrane separation, electrodialysis, reverse osmosis, distillation, chemical derivatization and crystallization.


[0127] One such example of L-ascorbic acid recovery is provided in U.S. Pat. No. 4,595,659 by Cayle, incorporated herein in its entirety be reference, which discloses the isolation of L-ascorbic acid from an aqueous fermentation medium by ion exchange resin adsorption and elution, which is followed by decoloration, evaporation and crystallization. Further, isolation of the structurally similar isoascorbic acid from fermentation medium by a continuous multi-bed extraction system of anion-exchange resins is described by K. Shimizu, Agr. Biol. Chem. 31:346-353 (1967), which is incorporated herein in its entirety by reference.


[0128] Intracellular L-ascorbic acid produced in accordance with the present invention can also be recovered and used in a variety of applications. For example, cells from the microorganisms can be lysed and the ascorbic acid which is released can be recovered by a variety of known techniques. Alternatively, intracellular ascorbic acid can be recovered by washing the cells to extract the ascorbic acid, such as through diafiltration.


[0129] Development of a microorganism with enhanced ability to produce L-ascorbic acid by genetic modification can be accomplished using both classical strain development and molecular genetic techniques, and particularly, recombinant technology (genetic engineering). In general, the strategy for creating a microorganism with enhanced L-ascorbic acid production is to (1) inactivate or delete at least one, and preferably more than one of the competing or inhibitory pathways in which production of L-ascorbic acid is negatively affected (e.g., inhibited), and more significantly to (2) amplify the L-ascorbic acid production pathway by increasing the action of a gene(s) encoding an enzyme(s) involved in the pathway.


[0130] In one embodiment, the strategy for creating a microorganism with enhanced L-ascorbic acid production is to amplify the L-ascorbic acid production pathway by increasing the action of GDP-D-mannose:GDP-L-galactose epimerase, as discussed above. Such strategy includes genetically modifying the endogenous GDP-D-mannose:GDP-L-galactose epimerase such that L-ascorbic acid production is increased, and/or expressing/overexpressing a recombinant epimerase that catalyzes the conversion of GDP-D-mannose to GDP-L-galactose, which includes expression of recombinant GDP-D-mannose:GDP-L-galactose epimerase and/or homologues thereof, and of other recombinant epimerases such as GDP-4-keto-6-deoxy-D-mannose epimerase reductase and epimerases that share structural homology with such epimerase as discussed in detail above.


[0131] It is to be understood that a production organism can be genetically modified by recombinant technology in which a nucleic acid molecule encoding a protein involved in the L-ascorbic acid production pathway disclosed herein is transformed into a suitable host which is a different member of the plant kingdom from which the nucleic acid molecule was derived. For example, it is an embodiment of the present invention that a recombinant nucleic acid molecule encoding a GDP-D-mannose:GDP-L-galactose epimerase from a higher plant can be transformed into a microalgal host in order to overexpress the epimerase and enhance production of L-ascorbic acid in the microalgal production organism.


[0132] As previously discussed herein, in one embodiment, a genetically modified microorganism can be a microorganism in which nucleic acid molecules have been deleted, inserted or modified, such as by insertion, deletion, substitution, and/or inversion of nucleotides, in such a manner that such modifications provide the desired effect within the microorganism. A genetically modified microorganism is preferably modified by recombinant technology, such as by introduction of an isolated nucleic acid molecule into a microorganism. For example, a genetically modified microorganism can be transfected with a recombinant nucleic acid molecule encoding a protein of interest, such as a protein for which increased expression is desired. The transfected nucleic acid molecule can remain extrachromosomal or can integrate into one or more sites within a chromosome of the transfected (i.e., recombinant) host cell in such a manner that its ability to be expressed is retained. Preferably, once a host cell of the present invention is transfected with a nucleic acid molecule, the nucleic acid molecule is integrated into the host cell genome. A significant advantage of integration is that the nucleic acid molecule is stably maintained in the cell. In a preferred embodiment, the integrated nucleic acid molecule is operatively linked to a transcription control sequence (described below) which can be induced to control expression of the nucleic acid molecule.


[0133] A nucleic acid molecule can be integrated into the genome of the host cell either by random or targeted integration. Such methods of integration are known in the art. For example, an E. coli strain ATCC 47002 contains mutations that confer upon it an inability to maintain plasmids which contain a ColE1 origin of replication. When such plasmids are transferred to this strain, selection for genetic markers contained on the plasmid results in integration of the plasmid into the chromosome. This strain can be transformed, for example, with plasmids containing the gene of interest and a selectable marker flanked by the 5′- and 3′-termini of the E. coli lacZ gene. The lacZ sequences target the incoming DNA to the lacZ gene contained in the chromosome. Integration at the lacZ locus replaces the intact lacZ gene, which encodes the enzyme β-galactosidase, with a partial lacZ gene interrupted by the gene of interest. Successful integrants can be selected for β-galactosidase negativity.


[0134] A genetically modified microorganism can also be produced by introducing nucleic acid molecules into a recipient cell genome by a method such as by using a transducing bacteriophage. The use of recombinant technology and transducing bacteriophage technology to produce several different genetically modified microorganism of the present invention is known in the art.


[0135] According to the present invention, a gene, for example the GDP-D-mannose:GDP-L-galactose epimerase gene, includes all nucleic acid sequences related to a natural epimerase gene such as regulatory regions that control production of the epimerase protein encoded by that gene (such as, but not limited to, transcription, translation or post-translation control regions) as well as the coding region itself. In another embodiment, a gene, for example the GDP-D-mannose:GDP-L-galactose epimerase gene, can be an allelic variant that includes a similar but not identical sequence to the nucleic acid sequence encoding a given GDP-D-mannose:GDP-L-galactose epimerase gene. An allelic variant of a GDP-D-mannose:GDP-L-galactose epimerase gene which has a given nucleic acid sequence is a gene that occurs at essentially the same locus (or loci) in the genome as the gene having the given nucleic acid sequence, but which, due to natural variations caused by, for example, mutation or recombination, has a similar but not identical sequence. Allelic variants typically encode proteins having similar activity to that of the protein encoded by the gene to which they are being compared. Allelic variants can also comprise alterations in the 5′ or 3′ untranslated regions of the gene (e.g., in regulatory control regions). Allelic variants are well known to those skilled in the art and would be expected to be found within a given microorganism or plant and/or among a group of two or more microorganisms or plants.


[0136] In accordance with the present invention, an isolated nucleic acid molecule is a nucleic acid molecule that has been removed from its natural milieu (i.e., that has been subject to human manipulation). As such, “isolated” does not reflect the extent to which the nucleic acid molecule has been purified. An isolated nucleic acid molecule can include DNA, RNA, or derivatives of either DNA or RNA. There is no limit, other than a practical limit, on the maximal size of a nucleic acid molecule in that the nucleic acid molecule can include a portion of a gene, an entire gene, or multiple genes, or portions thereof.


[0137] An isolated nucleic acid molecule of the present invention can be obtained from its natural source either as an entire (i.e., complete) gene or a portion thereof capable of forming a stable hybrid with that gene. An isolated nucleic acid molecule can also be produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules include natural nucleic acid molecules and homologues thereof, including, but not limited to, natural allelic variants and modified nucleic acid molecules in which nucleotides have been inserted, deleted, substituted, and/or inverted in such a manner that such modifications provide the desired effect within the microorganism. A structural homologue of a nucleic acid sequence has been described in detail above. Preferably, a homologue of a nucleic acid sequence encodes a protein which has an amino acid sequence that is sufficiently similar to the natural protein amino acid sequence that a nucleic acid sequence encoding the homologue is capable of hybridizing under stringent conditions to (i.e., with) a nucleic acid molecule encoding the natural protein (i.e., to the complement of the nucleic acid strand encoding the natural protein amino acid sequence). A nucleic acid molecule homologue encodes a protein homologue. As used herein, a homologue protein includes proteins in which amino acids have been deleted (e.g., a truncated version of the protein, such as a peptide), inserted, inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, amidation and/or addition of glycosylphosphatidyl inositol) in such a manner that such modifications provide the desired effect on the protein and/or within the microorganism (e.g., increased or decreased action of the protein).


[0138] A nucleic acid molecule homologue can be produced using a number of methods known to those skilled in the art (see, for example, Sambrook et al., ibid.). For example, nucleic acid molecules can be modified using a variety of techniques including, but not limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site-directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, restriction enzyme cleavage of a nucleic acid fragment, ligation of nucleic acid fragments, PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, synthesis of oligonucleotide mixtures and ligation of mixture groups to “build” a mixture of nucleic acid molecules and combinations thereof. Nucleic acid molecule homologues can be selected from a mixture of modified nucleic acids by screening for the function of the protein encoded by the nucleic acid and/or by hybridization with a wild-type gene.


[0139] Although the phrase “nucleic acid molecule” primarily refers to the physical nucleic acid molecule and the phrase “nucleic acid sequence” primarily refers to the sequence of nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding a gene involved in an L-ascorbic acid production pathway.


[0140] Knowing the nucleic acid sequences of certain nucleic acid molecules of the present invention allows one skilled in the art to, for example, (a) make copies of those nucleic acid molecules and/or (b) obtain nucleic acid molecules including at least a portion of such nucleic acid molecules (e.g., nucleic acid molecules including full-length genes, full-length coding regions, regulatory control sequences, truncated coding regions). Such nucleic acid molecules can be obtained in a variety of ways including traditional cloning techniques using oligonucleotide probes to screen appropriate libraries or DNA and PCR amplification of appropriate libraries or DNA using oligonucleotide primers. Preferred libraries to screen or from which to amplify nucleic acid molecule include bacterial and yeast genomic DNA libraries, and in particular, microalgal genomic DNA libraries. Techniques to clone and amplify genes are disclosed, for example, in Sambrook et al., ibid.


[0141] The present invention includes a recombinant vector, which includes at least one isolated nucleic acid molecule of the present invention, inserted into any vector capable of delivering the nucleic acid molecule into a host microorganism of the present invention. Such a vector can contain nucleic acid sequences that are not naturally found adjacent to the isolated nucleic acid molecules to be inserted into the vector. The vector can be either RNA or DNA and typically is a plasmid. Recombinant vectors can be used in the cloning, sequencing, and/or otherwise manipulating of nucleic acid molecules. One type of recombinant vector, referred to herein as a recombinant molecule and described in more detail below, can be used in the expression of nucleic acid molecules. Preferred recombinant vectors are capable of replicating in a transformed bacterial cells, yeast cells, and in particular, in microalgal cells.


[0142] Transformation of a nucleic acid molecule into a cell can be accomplished by any method by which a nucleic acid molecule can be inserted into the cell. Transformation techniques include, but are not limited to, transfection, electroporation, microinjection and biolistics.


[0143] A recombinant cell is preferably produced by transforming a host cell with one or more recombinant molecules, each comprising one or more nucleic acid molecules operatively linked to an expression vector containing one or more transcription control sequences. The phrase, operatively linked, refers to insertion of a nucleic acid molecule into an expression vector in a manner such that the molecule is able to be expressed when transformed into a host cell. As used herein, an expression vector is a DNA or RNA vector that is capable of transforming a host cell and of effecting expression of a specified nucleic acid molecule. Preferably, the expression vector is also capable of replicating within the host cell. In the present invention, expression vectors are typically plasmids. Expression vectors of the present invention include any vectors that function (i.e., direct gene expression) in a yeast host cell, a bacterial host cell, and preferably a microalgal host cell.


[0144] Nucleic acid molecules of the present invention can be operatively linked to expression vectors containing regulatory sequences such as transcription control sequences, translation control sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell and that control the expression of nucleic acid molecules of the present invention. In particular, recombinant molecules of the present invention include transcription control sequences. Transcription control sequences are sequences which control the initiation, elongation, and termination of transcription. Particularly important transcription control sequences are those which control transcription initiation, such as promoter, enhancer, operator and repressor sequences. Suitable transcription control sequences include any transcription control sequence that can function in yeast or bacterial cells or preferably, in microalgal cells. A variety of such transcription control sequences are known to those skilled in the art.


[0145] It may be appreciated by one skilled in the art that use of recombinant DNA technologies can improve expression of transformed nucleic acid molecules by manipulating, for example, the number of copies of the nucleic acid molecules within a host cell, the efficiency with which those nucleic acid molecules are transcribed, the efficiency with which the resultant transcripts are translated, and the efficiency of post-translational modifications. Recombinant techniques useful for increasing the expression of nucleic acid molecules of the present invention include, but are not limited to, operatively linking nucleic acid molecules to high-copy number plasmids, integration of the nucleic acid molecules into the host cell chromosome, addition of vector stability sequences to plasmids, substitutions or modifications of transcription control signals (e.g., promoters, operators, enhancers), substitutions or modifications of translational control signals, modification of nucleic acid molecules of the present invention to correspond to the codon usage of the host cell, deletion of sequences that destabilize transcripts, and use of control signals that temporally separate recombinant cell growth from recombinant enzyme production during fermentation. The activity of an expressed recombinant protein of the present invention may be improved by fragmenting, modifying, or derivatizing nucleic acid molecules encoding such a protein.


[0146] The following experimental results are provided for the purposes of illustration and are not intended to limit the scope of the invention.



EXAMPLES


Example 1

[0147] The present example describes the elucidation of the pathway from glucose to L-ascorbic acid through GDP-D-mannose in plants.


[0148] Since the present inventors have previously shown that Prototheca makes L-ascorbic acid (AA) from glucose, it was worthwhile to examine cultures for some of the early conversion products of glucose. In the past, the present inventors had concentrated on pathways from glucose to organic acids, based on the published pathway of L-ascorbic acid synthesis in animals and proposed pathways in plants. The present inventors demonstrate herein that the pathway from glucose to L-ascorbic acid involves not organic acids, but rather sugar phosphates and nucleotide diphosphate sugars (NDP-sugars).


[0149] Prior to the present invention, it was known that all cells synthesize polysaccharides by first forming NDP-sugars. The sugar moiety is then incorporated into polymer, while the cleaved NDP is recycled. A variety of polysaccharides are known, and are usually named based on the relative proportions of the sugar residues in the polymers. For example, a “galactomannan” contains mostly galactose, and to a lesser degree, mannose residues. The “biopolymer” from Prototheca strains isolated by the present inventors was analyzed and found to be 80% D-galactose, 18% rhamnose (D- or L-configuration not determined), and 2% L-arabinose. The present inventors provide evidence herein of how the respective NDP-sugars that make up the Prototheca biopolymer are formed, and what correlations exist between L-ascorbic acid synthesis and the formation of the NDP-sugar forms of the sugar residues found in the biopolymer.


[0150] The common NDP-sugar UDP-glucose is shown in FIG. 2B. This is formed in plants from glucose-1-P by the action of UDP-D-glucose pyrophosphorylase. UDP-glucose can be epimerized in plants to form UDP-D-galactose, using UDP-D-glucose-4-epimerase. UDP-D-galactose can also be formed by phosphorylation of D-galactose by galactokinase to form D-galactose-1-P, which can be converted to UDP-D-galactose by UDP-D-galactose pyrophosphorylase. These known routes were believed to account for the D-galactose in the Prototheca biopolymer. The UDP-L-arabinose can be formed by known reactions beginning with the oxidation of UDP-D-glucose to UDP-D-glucuronic acid (by UDP-D-glucose dehydrogenase), decarboxylation to UDP-D-xylose, and epimerization to UDP-L-arabinose. This accounts for the arabinose residues in the biopolymer. UDP-L-rhamnose is known to be formed from UDP-D-glucose, thus all three of the sugar moieties in the Prototheca biopolymer can be accounted for by a pathway through glucose-1-P and UDP-glucose. Alternatively, if the rhamnose in the biopolymer is D-rhamnose, it is not formed via UDP-D-glucose, but by oxidation of GDP-D-mannose (See FIG. 1).


[0151] GDP-D-rhamnose is formed by converting glucose, in turn, to D-glucose-6-P, D-fructose-6-P, D-mannose-6-P, D-mannose-1-P, GDP-D-mannose, and GDP-D-rhamnose. It was of interest to the present inventors that this route passes through GDP-D-mannose. Exogenous mannose is known to be converted to D-mannose-6-P in plants, and can enter the path above. D-mannose is converted to L-ascorbic acid by Prototheca cells cultured by the present inventors as well or better than glucose (see Example 4). The mechanism of conversion, in Chlorella pyrenoidosa, of GDP-D-mannose to GDP-L-galactose by GDP-D-mannose:GDP-L-galactose epimerase, has been known for years (See, Barber, 1971, Arch. Biochem. Biophys. 147:619-623, incorporated herein by reference in its entirety). The present inventors have discovered herein that L-galactose and L-galactono-γ-lactone are rapidly converted to L-ascorbic acid by strains of Prototheca and Chlorella pyrenoidosa. Prior to the present invention, it was known that L-galactono-γ-lactone is converted to L-ascorbic acid in several plant systems, but the synthesis steps prior to this step were unknown. Based on the published literature and the present experimental evidence, the present inventors have determined that the L-ascorbic acid biosynthetic pathway in plants passes through GDP-D-mannose and involves sugar phosphates and NDP-sugars. The proposed pathway is shown in FIG. 1. Salient points relevant to the design and production of genetically modified microorganisms useful in the present method include:


[0152] 1. The enzymes leading from D-glucose to D-fructose-6-P are well known enzymes in the first, uncommitted steps of glycolysis.


[0153] 2. The enzymes involved in the conversion of D-fructose-6-P to GDP-D-mannose have been well characterized in plants, yeast, and bacteria, particularly Azotobacter vinelandii and Pseudomonas aeruginosa, which convert GDP-D-mannose to GDP-D-mannuronic acid, which is the precursor for alginate (See for example, Sa-Correia et al., 1987, J. Bacteriol. 169:3224-3231; Koplin et al., 1992, J. Bacteriol. 174:191-199; Oesterhelt et al., 1996, Plant Science 121:19-27; Feingold et al., 1980, The Biochemistry of Plants: Vol 3: Carbohydrates, structure and function, P. K. Stampf & E. E. Conn, eds., Academic Press, New York, pp. 101-170; Smith et al., 1992, Mol. Cell Biol. 12:2924-2930; Boles et al., 1994, Eur. J. Med. 220:83-96; Hashimoto et al., 1997, J. Biol. Chem. 272:16308-16314, all of which are incorporated herein by reference in their entirety).


[0154] 3. Barber (1971, supra, and 1975) identified in Chlorella pyrenoidosa the enzyme activities for the conversion of GDP-D-mannose to GDP-L-galactose and L-galactose-1-P.


[0155] 4. The present inventors have shown herein the rapid conversion of L-galactose and L-galactono-γ-lactone to L-ascorbic acid by Prototheca cells.


[0156] 5. L-galactono-y-lactone and L-galactonic acid can be interconverted in solution by changing the pH of the solution; addition of base shifts the equilibrium to L-galactonic acid, while addition of acid shifts the equilibrium to the lactone. Cells may have an enzymatic means for this conversion in addition to this non-enzymatic route.


[0157] 6. In plants, GDP-L-fucose is also formed from GDP-D-mannose, presumably for incorporation into polysaccharide. Roberts (1971) fed labeled D-mannose to corn root tips and found the label in polysaccharide, specifically in the residues of D-mannose, L-galactose, and L-fucose. No label was detected in D-glucose, D-galactose, L-arabinose, or D-xylose. Prototheca and C. pyrenoidosa cells have the ability to convert L-fucose (6-deoxy-L-galactose) to a dipyridyl-positive product that was shown by HPLC not to be L-ascorbic acid. The present inventors believe that it is was the 6-deoxy analog of L-ascorbic acid.



Example 2

[0158] This example shows that in Prototheca, like other plants (Loewus, F. A. 1988. In: J. Priess (ed.), The Biochemistry of Plants, 14:85-107. New York, Academic Press) and the green microalga Chlorella pyrenoidosa (Renstrom, et al., 1983. Plant Sci. Lett. 28:299-305), ascorbic acid (AA) production from glucose proceeds by a biosynthetic pathway that allows retention of the configuration of the carbon skeleton of glucose.


[0159] Cultures of the strain UV77-247 were grown to moderate cell density in shake flasks with 1-13C-labeled glucose as 10% of the total glucose (40 g/L) . Incubation was as per the standard Mg-limited screen (see Example 3). The culture supernates were clarified, deionized to remove salts, lyophilized, and subjected to nuclear magnetic resonance (nmr) analysis to determine where in the AA molecule the 13C was located. In each case, approximately 85% of the label was found at the C-1 position of AA, with most of the remaining label at the C-6 position. This strongly indicated that AA is synthesized from glucose by a pathway that retains the carbon chain configuration, i. e., C-1 of glucose becomes C-1 of AA. This has typically been observed in plants (Loewus, F. A. 1988. Ascorbic acid and its metabolic products. In: The Biochemistry of Plants, ed. J. Priess, 14:85-107. New York, Academic Press). Animals (Mapson, L. W. and F. A. Isherwood 1956. Biochem. J. 64:151-157; Loewus, F. A. 1960. J. Biol. Chem. 235(4) :937-939) and protists such as Euglena (Shigeoka, S., et al., 1979. J. Nutr. Sci. Vitaminol. 25:299-307), on the other hand, synthesize AA by a pathway that involves the inversion of configuration, i. e., C-1 of glucose becomes C-6 of AA. Demonstration of the inversion/non-inversion nature of the pathway was an important step in determining the pathway of AA biosynthesis since the two types of pathways require different types of enzymatic reactions. The label found at C-6 of AA is thought to be due to metabolism of glucose and subsequent gluconeogenesis. The metabolism of glucose in glycolysis proceeds through triose-phosphate intermediates. After this, the C-1 and C-6 carbons of glucose become biochemically equivalent. Hexose phosphates can be regenerated from the triose phosphates by gluconeogenesis, which is essentially a reversal of the degradative pathway. Consequently, metabolism of C-1-labeled glucose to triose phosphates with subsequent gluconeogenesis would result in the formation of hexose phosphate molecules labeled at either or both C-1 and C-6. If those hexose phosphates were precursors to AA, one would expect the AA to be similarly labeled. Consistent with this type of “isotopic mixing” is the observation that sucrose obtained from 1-13C-labeled glucose was labeled at positions 1, 6, 1′ and 6′.


[0160] Glucose can also be metabolized by the pentose phosphate pathway, the overall balanced equation for which is:


3Glucose-6-phosphate→2Fructose-6-phosphate+Glyceraldehyde-3-phosphate+3 CO2


[0161] Based on the known biochemistry, it would then be expected that the label at each of the carbons in glucose (Table 1 left column) would appear at the positions for the other molecules shown, and that these patterns would be reflected in the AA formed from C-2- and C-3-labeled glucose.
1TABLE 1Predicted Carbon Labeling of Metabolites of Glucose in the PentosePhosphate PathwayLabeled GlucosePosition of Labeled Carbon in:CarbonCO2F6P(1)F6P(2)G3P1+21, 31322, 3444155526663


[0162] AA recovered from cultures fed glucose labeled at C-2 or C-3 was also analyzed for its labeling patterns (Table 2).
2TABLE 2Labeling Pattern in AA after Cells were Fed 2-13C and 3-13C-glucoseCarbonIsotopic enhancement after growth on:Position in AAC-2 labeled glucoseC-3 labeled glucose11.00.4210.00.930.59.9402.852.20.2600


[0163] The data above again suggest a pathway from glucose to AA that proceeds by retention of configuration. As in the experiments with C-1 labeled glucose, approximately one-fifth of the label is present in “mirror image” position to the glucose label (C-5 for C-2 labeled glucose and C-4 for C-3 labeled glucose), indicating levels of gluconeogenesis consistent with those previously observed.


[0164] The small, but significant amount of enhancement observed in other positions is consistent with flux through the pentose phosphate pathway. As predicted above, carbon flux through this pathway would result in isotopic enhancement at positions 1 and 3 when cells were grown on 2-13C glucose and enhancement at position 2 when cells were grown on 3-13C glucose. This is indeed observed. That there is twice as much enhancement at C-1 as there is at C-3 after growth on 2-13C glucose is also predicted. These data indicate a small but measurable amount of carbon flux through the pentose phosphate pathway.



Example 3

[0165] This example shows the methods for generating, screening and isolating mutants of Prototheca with altered AA productivities compared to the starting strain ATCC 75669.


[0166] ATCC No. 75669, identified as Prototheca moriformis RSP1385 (unicellular green microalga), was deposited on Feb. 8, 1994, with the American Type Culture Collection (ATCC), Rockville, Md., 20852, U.S.A., under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure. Initial screening of Prototheca species and strains was reported in U.S. Pat. No. 5,900,370, ibid. Table 3 lists the formulations of the media for growth and maintenance of the strains. Glucose for fermentors was supplied as glucose monohydrate and calculated on an anhydrous basis. The recipe for the trace metals solution is given in Table 4. The standard growth temperature was 35° C. All organisms were cultured axenically.
3TABLE 3Media for Growth and Maintenance of Prototheca StrainsAll quantities are in g/L unless otherwise specifiedAgarLiquidFerro-Stan-Stand-Mg-zinedardIngredientardlimitingSlantsPlatesPlatesPotassium phosphate1.31.32.00.272.0monobasicPotassium phosphate3.83.82.01.42.0dibasicTrisodium citrate7.77.72.61.32.6dihydrateMagnesium sulfate0.400.020.40.010.4heptahydrateAmmonium sulfate3.73.71.01.01.0Trace Metals Solution2 mL2 mL2 mL2 mL2 mLFerrous sulfate1.5 mg4.5 mg1.5 mg1.5 mgheptahydrateCalcium chloride0.25dihydrateManganous sulfate0.08monohydrateYeast extract2.5Agar151515(Noble)pH before autoclaving7.27.27.27.27.2Autoclave, then addCopper sulfate, penta-2 mLhydrate, 100 g/L40 g/L Ferrozine8.8 mLin 5 mM phosphate(pH 7.5 final)Ferric ammonium sulfate3.8 mLdodecahydrate, 40 g/L50% glucose with40 mL60 mL10 mL10 mL10 mL25 mg/L thiamine HCl


[0167]

4





TABLE 4










Trace Metals Solution













mL Indiv. Stock



Molecular
Conc. of Individ.
per liter


Compound
Weight
Solutions, g/L
of Working Stock














Distilled Water


823



Hydrochloric Acid

Conc.
20


Cobalt Chloride
237.9
24.0
6.5


hexahydrate


Boric acid
 61.8
38.1
24


Zinc sulfate
287.5
35.3
50


heptahydrate


Manganous sulfate
169.0
24.6
50


monohydrate


Sodium molybdate
242.0
23.8
2.0


dihydrate


Calcium chloride
147.0

11.4 g


dihydrate


Vanadyl sulfate
199.0
10.0
8.0


dihydrate


Nickel nitrate
290.8
 5.0
8.0


hexahydrate


Sodium selenite
173.0
 5.0
8.0










[0168] Mutant isolates were generated by treatment with one or more of the following agents: nitrous acid (NA); ethyl methane sulfonate (EMS); or ultraviolet light (UV). Typically, glucose-depleted cells grown in standard liquid medium were washed and resuspended in 25 mM phosphate buffer, pH 7.2, diluted to approximately 107 colony-forming units per mL (cfu/mL), exposed to the mutagen to achieve about 99% kill, incubated 4-8 hours in the dark, and spread onto standard agar medium, or agar media containing differential agents.


[0169] Some mutant colonies on standard agar medium were picked randomly and subcultured to master plates. Other isolation plates were inverted over chloroform to lyse cells on the surface of the colonies and allow them to release AA. Released AA was detected by spraying the treated plates with a solution of 2,6-dichrorophenol-indophenol (1.25 g/L in 70% EtOH). The ability of AA to reduce this blue redox dye to its colorless form is the basis for a standard assay of AA (Omaye, et al., 1979. Meth. Enzymol. 62:3-11.). Colonies derived from mutagenized cells were saved to master plates for further evaluation if their clear halos were significantly larger than the halos typical of the other mutants in that group. Other mutagenized cells were spread onto plates containing an AA detection system incorporated directly into the agar. This system is based on the ability of AA to reduce ferric iron to ferrous iron. The compound ferrozine (3-(2-pyridyl)-5,6-bis(4-phenylsulfonic acid)-1,2,4-triazine) was present in the agar to complex with the ferrous iron and give a violet color reaction. The ferrozine agar formulation is shown in Table 3. Colonies giving the darkest color reactions were master-plated. When screening for non-AA-producing strains (blocked mutants), white colonies were chosen against a background of relatively dark colonies.


[0170] For primary screening of tube cultures, cells were inoculated from master plates into 4 mL of Mg-limiting medium in 16×125 mm test tubes, and tubes were shaken in a slanted position on a rotary shaker at 300 rpm for four days. After both three and four days of incubation aliquots were removed for AA assay and cell density determination. Those for AA assay were centrifuged at 1500×g for 5 min and the resulting supernates were removed for either calorimetric assay or high pressure liquid chromatography (HPLC). Promising isolates were retested in tube culture. Those passing the tube screen were tested in shake flasks.


[0171] For secondary screening of flask cultures, cells were inoculated into 50 mL of standard flask medium in 250 mL baffled shake flasks, and incubated on a rotary shaker at 180 rpm until glucose depletion (24-48 hours). A second series of flasks of Mg-sufficient standard medium was inoculated from the first set to a cell density of 0.15 A620, and incubated for 24 hours. A third series of Mg-limiting flask medium was inoculated from the second set by a 1/50 dilution and incubated for 96 hours. Flasks were sampled for AA analysis and cell density measurements during this time as required. Aliquots for supernatant AA analysis were centrifuged at 5000×g for 5 min. Aliquots for total whole broth AA analysis were first extracted for 15 min with an equal volume of 5% trichloroacetic acid (TCA) before centrifugation. Aliquots of the resulting supernates were removed for either colorimetric assay or HPLC analysis.


[0172] For colorimetric assay of AA, a modification of the method of Omaye, et al. (1979. Meth. Enzymol. 62:3-11) was used. Twenty-five μL aliquots of culture supernates were added to wells of 96-well microplates, and 125 μL of color reagent was added. The color reagent consisted of four parts 0.5% aqueous 2,2′-dipyridyl and one part 8.3 mM ferric ammonium sulfate in 27% (v/v) o-phosphoric acid, the two components being mixed immediately before use. After one hour, the absorbance at 520 nm was read. AA concentration was calculated by comparison of the absorbances of AA standards.


[0173] HPLC analysis was based on that of Running, et al., (1994). Supernates were chromatographed on a Bio-Rad HPX-87H organic acid column (Bio-Rad Laboratories, Richmond, Calif.) with 13 mM nitric acid as solvent, at a flow rate of 0.7 mL/min at room temperature. Detection was at either 254 nm using a Waters 441 detector (Millipore Corp., Milford, Mass.), or at 245 nm using a Waters 481 detector. This system can distinguish between the L- and D-isomers of AA.


[0174] For dry weight determinations of cell density, 5 mL whole broth samples were centrifuged at 5000×g for 5 min, washed once with distilled water, and the pellet was washed into a tared aluminum weighing pan. Cells were dried for 8-24 h at 105° C. Cell weight was calculated by difference.


[0175] Table 5 shows the abilities of various mutants of Prototheca to synthesize AA.
5TABLE 5AA Synthesizing Ability of Various Prototheca Mutants in Flask ScreenSpecific AA Formation, mg AA per L/Culture A620,during Mg-limited IncubationStrain2 Days Incubation4 Days IncubationATCC 756692235EMS13-479166UV213-100UV218-100UV244-100UV244-155868UV77-2475683UV140-167100UV164-691131NA21-142778UV82-2100UV127-105095SP2-334


[0176] The genealogy of these isolates is presented graphically in the “family tree” of FIG. 3. ATCC No. ______, identified as Prototheca moriformis EMS13-4 (unicellular green microalga), was deposited on May 25, 1999, with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110, U.S.A., under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure. ATCC No. ______, identified as Prototheca moriformis UV127-10 (unicellular green microalga), was deposited on May 25, 1999, with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110, U.S.A., under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure. ATCC No. ______, identified as Prototheca moriformis SP2-3 (unicellular green microalga), was deposited on May 25, 1999, with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110, U.S.A., under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure.



Example 4

[0177] The following example shows that both growing and resting cells of Prototheca can rapidly convert L-galactose and L-galactono-γ-lactone to AA, and that conversion of D-mannose to AA by Prototheca is more rapid than conversion of D-glucose.


[0178] Shake flask cultures of the mutant strain UV77-247 were grown to glucose depletion in standard liquid medium (Table 3). Cells were washed twice and resuspended in complete medium with the glucose substituted by one of the compounds listed below. Cell suspensions were incubated for 24 hours at 35° C. with shaking, and the entire suspension was extracted with TCA as above and assayed for AA.


[0179] Tables 6-8 show that both growing and resting cells of strain UV77-247 can rapidly convert L-galactose and L-galactono-γ-lactone to AA. In these experiments, D-fructose and D-galactose were converted to AA at the same rate as D-glucose, suggesting that they are metabolized to AA through the same route as D-glucose. None of the organic acids suggested in the literature to be intermediates in the biosynthesis of AA were converted to AA, including sorbosone, which has been proposed as an intermediate by Saito et al. (1990 Plant Physiol. 94:1496-1500).
6TABLE 6Conversion of Compounds by Resting Cells of Strain UV77-247AA Relative to NoSubstrate (50 mM)Total AA, mg/LSubstrate ControlL-galactose965623L-galactono-γ-lactone818476D-fructose590248D-glucosone589247D-glucose584242D-galactose542200D-glucose (10 mM)38846D-gluconolactone38240D-gulono-γ-lactone36624D-glucuronate36422L-sorbosone3420None34202-keto-D-gluconic acid341−1D-isoascorbic acid (10 mM)330−12D-glucuronolactone329−13D-gluconic acid309−33D-galacturonic acid297−45L-idonate296−46


[0180] Since strain UV77-247 converted L-galactose and L-galactono-γ-lactone to AA much more rapidly than it did glucose, it suggests that these compounds are intermediates in the AA biosynthetic pathway and that they are “downstream” from glucose.


[0181] The data in Tables 7 and 8 also show that growing and resting cells of UV77-247 consistently convert D-mannose to AA at a rate greater than that of glucose.
7TABLE 7Conversion of Compounds to AA by Resting Cells of Strain UV77-247Ascorbic Acid, mg/LCompound25.5 h30 h47 hL-galactose667718620L-galactono-γ-lactone644681749D-glucosone465462354D-mannose448462399D-fructose402408367d-glucose395404351D-galactose352361337none287288258


[0182]

8





TABLE 8










Conversion of Compounds to AA by Growing Cells of Strain UV77-247











Ascorbic Acid,





mg/L
A620
AA/A620









Compound
25.5h
44h














L-galactose
249
506
4.5
112


D-mannose
228
488
5.6
87


L-galactono-γ-lactone
214
342
5.0
68


D-glucose
178
398
5.9
67


D-fructose
181
383
5.9
65


D-glucosone
176
362
5.7
64


D-galactose
185
380
5.9
64


none
182
249
4.4
57


D-gluconic acid (K)
178
262
5.0
52


L-idonate (Na)
182
232
4.7
49


2-keto-D-gluconic acid
182
255
5.3
48


2-deoxy-D-glucose
181
227
4.8
47


D-glucuronic acid lactone
165
218
5.0
44


D-glucuronic acid (Na)
173
241
5.6
43


L-gulono-γ-lactone
152
195
5.0
39


L-sorbosone
178
160
4.7
34


D-glucono-δ-lactone
130
190
5.7
33


D-galacturonic acid
130
180
6.0
30










[0183] These cells converted L-galactose, L-galactono-γ-lactone and D-mannose to AA more rapidly than they did glucose, suggesting that mannose exerts its effect in the biosynthetic pathway “downstream” from glucose.



Example 5

[0184] Using the methods described above, a collection of mutants was assembled. The specific AA formation for representative mutants are shown in Table 5. The genealogy of these isolates is presented graphically in the “family tree” of FIG. 3.


[0185] These isolates were tested for their ability to convert compounds which could be converted to AA by strain UV77-247. Testing was done as in Example 4. Results are shown in Table 9.
9TABLE 9Conversion of Compounds to AA by Resting Cellsof Mutant Strains of Prototheca of Varying Abilities to Synthesize AAAbsolute AA, mg/LL-L-gal-Fruc-StrainBufferGlucosegalactoseγ-lact.MannosetoseEMS13-45397191173139NDUV127-1045140213140128143 SP2-319192041462427NA21-146180147158118115 UV82-2115161831751817UV213-116151701351716UV218-116181361761921UV244-116161641621616UV244-15267730219489UV244-16286453535366ND = Not Determined


[0186] These data suggest that the mutational blocks in those strains which convert fructose and mannose to AA poorly are before (“upstream” from) L-galactose and L-galactono-γ-lactone in the pathway.



Example 6

[0187] The following example shows that magnesium inhibits early steps in the production of AA.


[0188] To address the question of whether magnesium actually inhibits AA synthesis, strain NA45-3 (ATCC 209681) was grown in magnesium (Mg)-limited and Mg-sufficient medium. ATCC No. 209681, identified as Prototheca moriformis NA45-3 (Source: repeated mutagenesis of ATCC No. 75669; Eucaryotic alga. Division Chlorophyta, Class Chlorophyceae, Order Chlorococcales), was deposited on Mar. 13, 1998, with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110, U.S.A., under the terms of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure. Cells from both cultures were harvested and resuspended in the cell-free supernate from the Mg-limited culture, and to half of each cell suspension additional magnesium was added in order to bring the level in the suspension to the Mg-sufficient level. The four conditions under which assays were run were as follows.
10TABLE 10Conditions Used to Test the Effect of Magnesium on AA ProductionMagnesium concentration, g/L, during:ConditionGrowthAssay 1 Mg > 1 Mg0.020.02 1 Mg > 10 Mg0.020.210 Mg > 1 Mg0.20.0210 Mg > 10 Mg0.20.2


[0189] Substrates previously shown to lead to the formation of AA, namely D-glucose, D-glucosone, D-fructose. D-galactose, D-mannose, and L-galactono-γ-lactone, were added at 20 g/L to the four cell suspensions. Accumulation of AA after 24 hours was measured and compared to a control in which no substrate was added. The results of this study are shown graphically in FIG. 4.


[0190] When cells growing under magnesium-limited conditions were incubated with substrates in low-magnesium broth (1 Mg>1 Mg condition), all showed significant and similar accumulation of AA over the control condition. When the same cells were incubated in high magnesium broth (1 Mg>10 Mg condition), the accumulation of AA was reduced about 40% for all substrates except D-mannose and L-galactono-γ-lactone, suggesting that 1) the rate-limiting step in the conversion of D-glucose, D-glucosone, D-fructose, and D-galactose to AA is inhibited by magnesium or 2) magnesium stimulates an enzyme which results in the conversion of these compounds to some other compound(s), reducing the amount of substrate available for AA synthesis. On the other hand, conversion of D-mannose and L-galactono-γ-lactone appeared to be unaffected by the presence of magnesium in the resuspension buffer, indicating that either 1) magnesium-inhibited enzymes are not involved in the conversion of these substrates to AA or 2) D-mannose and L-galactono-γ-lactone enter the pathway far enough downstream from the point where they can be siphoned off by side reactions involving Mg-requiring enzymes.


[0191] When cells were grown under magnesium-sufficient conditions, very little AA accumulation from any of the D-sugars was observed, regardless of the level of magnesium in the resuspension broth. Accumulation of AA from L-galactono-γ-lactone, however, was enhanced over that observed when cells are grown in Mg-limited conditions. This suggests that enzymes early in the pathway are repressed under Mg-sufficient conditions. Thus, the D-substrates all behaved similarly, with the exception of the apparent lack of magnesium inhibition of D-mannose conversion to AA. This would suggest that D-mannose enters the AA biosynthetic pathway at a point other than the other D-sugars.


[0192]
FIGS. 2A and 2B represent some of the fates of glucose in plants. The first enzymatic step in this scheme which commits carbon to glycolysis is the conversion of fructose-6-P to fructose-1,6-diP by phosphofructokinase (PFK). This reaction is essentially irreversible, and leads to the well known TCA cycle and oxidative phosphorylation, with concomitant ATP and NADH/NADPH generation. PFK has an absolute requirement for magnesium. If magnesium is limiting, this reaction could slow and eventually stop, blocking the flow of carbon through glycolysis and beyond, and would result in cessation of cell division even in the presence of excess glucose. One would expect fructose-6-P to accumulate under these conditions, fueling AA synthesis by the pathway shown in FIGS. 1 and 2.



Example 7

[0193] The following example shows the correlation in Prototheca between AA production and the activity levels of the enzymes in the AA pathway.



Phosphomannose Isomerase (PMI) Assay

[0194] PMI activity was first assayed (See FIG. 1). Ten strains representing a range of AA productivities were grown according to the standard protocol to measure AA-synthesizing ability. Cells were harvested 96 hours into magnesium-limited incubation, washed and resuspended in buffer containing 50 mM Tris/10 mM MgCl2, pH 7.5. The suspended cells were broken in a French press, spun at 30,000×g for 30 minutes, and desalted through Sephadex G-25 (Pharmacia PD-10 columns). Reactions were carried out in the reverse direction by adding various volumes of extracts to solutions of Tris/Mg buffer containing 0.15 U phosphoglucose isomerase (EC 5.3.1.9), 0.5 U glucose-6-phosphate dehydrogenase (EC 1.1.1.49), and 1.0 mm NADP. Reactions were initiated by addition of 3 mM (final) mannose-6-phosphate. Final reaction volume was 1.0 mL. All components were dissolved in Tris/Mg buffer. Activities were taken as the change in A340/min. From these activities was subtracted the activities measured in identical reaction mixtures lacking the M-6-P substrate. Specific activities were calculated by normalizing the activities for protein concentration in the reactions. Protein in the original extracts was determined by the method of Bradford, using a kit from Bio-Rad Laboratories (Hercules, Calif.). All enzymes and nucleotides were purchased from Sigma Chemical Co. (St. Louis, Mo.).



Phosphomannomutase (PMM) Assay

[0195] Phosphomannomutase was measured in a similar manner in the same strains, but these assay reaction mixtures also contained 0.25 mM glucose-1,6-diphosphate, 0.5 U commercially available PMI, and the reactions were started with the addition of 3.0 mM (final) mannose-1-phosphate rather than mannose-6-phosphate.



Phosphofructokinase (PFK) Assay

[0196] To shed light on the possibility that the enhancement of AA concentration in cultures which were limited for magnesium was due to a diversion of carbon from normal metabolism by a reduced activity of the first committed step in glycolysis (PFK) the strains were also assayed to confirm the presence of this enzyme activity. Cells were cultured, washed and broken as above. Extracts were centrifuged at 100,000×g for 90 min before desalting. Reactions were carried out in the forward direction by adding various volumes of extracts to solutions of Tris/Mg buffer containing 1.5 mM dithiothreitol, 0.86 U aldolase (EC 4.1.2.13), 1.4 U α-glycerophosphate dehydrogenase (EC 1.1.1.8), 14 U triosephosphate isomerase (EC 5.3.1.1), 0.11 mM NADH, and 1.0 mM ATP. Reactions were initiated by addition of 5 mM (final) fructose-6-phosphate. Final reaction volume was 1.0 mL. All components were dissolved in Tris/Mg buffer. Activities were taken as the change in A340/min. From these activities were subtracted the activities measured in identical reaction mixtures lacking the F-6-P substrate. Specific activities were calculated by normalizing the activities for protein concentration in the reaction. Protein in the original extracts was determined as above.



GDP-D-mannose Pyrophosphorylase (GMP) Assay

[0197] These same mutant strains were assayed for the next enzyme in the proposed pathway, GMP. Strains were grown both according to the standard Mg-limiting protocol (harvested 43-48 hours into magnesium-limited incubation) and in standard Mg-sufficient medium (harvesting all cells before glucose depletion). Washed cell pellets were resuspended in 50 mM phosphate buffer, pH 7.0, containing 20% (v/v) glycerol and 0.1 M sodium chloride (3 mL buffer/g wet cells), and broken in a French press. Crude extracts were spun at 15,000×g for 15 minutes. Reactions were carried out in the forward direction by adding various volumes of extracts to solutions of 50 mM phosphate/4 mM MgCl2 buffer, pH 7.0, containing 1 mM GTP. Reactions were initiated by addition of 1 mM (final) mannose-1-phosphate. Final reaction volume was 0.1 mL. Reaction mixtures were incubated at 30° C. for 10 min, filtered through a 0.45 μm PVDF syringe filter, and analyzed for GDP-mannose by HPLC. A Supelcosil SAX1 column (4.6×250 mm) was used with a solvent gradient (1 mL/min) of: A—6 mM potassium phosphate, pH 3.6; B—500 mM potassium phosphate, pH 4.5. The gradient was: 0-3 min, 100% A; 3-10 min, 79% A; 10-15 min, 29% A. Column temperature was 30° C. Two assays that showed enzyme activity proportional to the amount of protein were averaged. Control no-substrate and no-extract reactions were also run. Specific activity was calculated by normalizing the activity for protein concentration in the reaction. Protein in the original extracts was determined as above.



GDP-D-mannose:GDP-L-galactose Epimerase Assay

[0198] Further tests measured the activities of the next enzyme in the proposed pathway, GDP-D-mannose:GDP-L-galactose epimerase. Strains were grown according to the standard protocol, harvested 43-48 hours into magnesium-limited incubation, washed, and resuspended in buffer containing 50 mM MOPS/S mM EDTA, pH 7.2. Washed pellets were broken in a French press, and spun at 20,000×g for 20 min. Protein determinations were made as above and a dilution series of each was made, ranging from 0.4 to 2.2 mg protein/mL. 50 μL aliquots of these dilutions were added to 10 μL aliquots of 6.3 mM GDP-D-mannose in which a portion of this substrate was universally labeled with 14C in the mannose moiety. This substrate had an activity of 16 μCi/mL before dilution into the reaction mixture. Reactions were stopped after 10 min by transferring 20 μL of the mixture into microfuge tubes containing 20 μL of 250 mM trifluoroacetic acid (TFA) containing 1.0 g/L each D-mannose and L-galactose. These tubes were sealed and boiled for 10 min, cooled, spun for 60 sec in a Beckman Microfuge E, and 5 μL of each hydrolysate was spotted on 20×20 cm plastic-backed EM Science Silica gel 60 thin-layer chromatography plates (#5748/7), with 1 cm lanes created by scoring with a blunt stylus. After drying, plates were twice chromatographed for 2.5 hours in ethyl acetate:isopropanol:water, 65:22.3:12.7 (plates were dried between runs). Spots of free sugars were visualized by spraying dried plates with 0.5% p-anisaldehyde in a 62% ethanolic solution of 0.89 M sulfuric acid and 0.17 mM glacial acetic acid, and heating at 105° C. for about 15 min. Spots of L-galactose and D-mannose were cut from the plates and counted in a scintillation counter (Beckman model 2800). For time-zero control counts, 16.7 μL of each extract dilution was added to 23.3 μL of the labeled substrate above, which had been diluted 1:7 with the TFA/mannose/galactose solution.


[0199] Table 11 summarizes the results of the five enzyme assays for the strains tested, along with their specific AA formations.
11TABLE 11Specific Enzyme Activities (mU)* of Selected Mutant Prototheca StrainsGMPAA SpecificMg-StrainForm, mg/gPMIPMMPFKMg-limitedsufficientEpimeraseUVI 64-678.40.79EMS13-473.710.869.613.52.66.80.78UV140-169.90.78NA45-361.40.58UV77-24744.40.52UV127-1040.111.145.824.44.35.90.39UV244-1524.514.341.53.15.30.42NA21-1423.612.160.347.42.47.60.27ATCC 7566921.90.28UV244-165.016.585.64.35.2SP2-32.017.747.064.52.07.50.03UV218-10.415.972.12.77.00.83UV213-10.119.747.732.63.26.70.60UV82-210.014.670.630.44.17.50.15UV244-10.018.251.15.5120.15Units: PMI and PMM, nmoles NADP reduced per mm/mg protein; PFK, nmoles NADH oxidized per min/mg protein; GMP, nmoles GDP-D-mannose formed per mm/mg protein; epimerase, nmoles GDP-L-galactose formed per min/mg protein.


[0200] The only enzyme which showed a strong correlation between activity and the ability to synthesize AA was the GDP-D-mannose:GDP-L-galactose epimerase. This correlation is depicted in FIG. 5. All of the strains which produced measurable amounts of AA had measurable amounts of epimerase activity. The converse was not true: four of the strains which synthesize little or no AA had significant epimerase activities. These strains are candidates for having mutations which affect enzymatic steps downstream from the epimerase. Since all of the strains tested can synthesize AA from L-galactose and L-galactono-γ-lactone (see Examples 4 and 5), the genetic lesion(s) in these four mutants must lie between GDP-L-galactose and free L-galactose.



Example 8

[0201] The next example shows the relationship between GDP-D-mannose:GDP-L-galactose epimerase activity and the degree of magnesium limitation in two strains, the original unmutagenized parent strain ATCC 75669, and one of the best AA producers, EMS13-4 (ATCC ______).


[0202] Four flasks of each strain were grown according to the standard protocol. One culture of each was harvested 24 hours into magnesium-limited incubation, and every 24 hours thereafter for a total of four days. One flask of each strain was also harvested 24 hours into magnesium sufficient incubation. All cultures had glucose remaining when harvested. FIG. 6 shows graphically the AA productivity and epimerase activity in EMS13-4 and ATCC 75669 as the cultures became Mg-limited. Epimerase activity in EMS13-4 was significantly greater than that in ATCC 75669 at all time points. There was also a concurrent rapid rise in both AA productivity and epimerase activity in EMS13-4 as the cultures became increasingly Mg-limited. While there was a moderate increase in AA productivity in ATCC 75669 as Mg became more limiting, there was no effect on epimerase activity.



Example 9

[0203] The following example shows the results of epimerase assays performed with extracts of two E. coli strains into which were cloned the E. coli gene for GDP-4-keto-6-deoxy-D-mannose epimerase/reductase.


[0204] The E. coli K12 wca gene cluster is responsible for cholanic acid production; wcaG encodes a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase.


[0205] The E. coli wcaG sequence (nucleotides 4 through 966 of SEQ ID NO:3) was amplified by PCR from E. coli W3110 genomic DNA using primers WG EcoRI 5 (5′ TAGAATTCAGTAAACAACGAGTTTTTATTGCTGG 3′; SEQ ID NO:12) and WG Xhol 3 (5′ AACTCGAGTTACCCCCAAAGCGGTCTTGATTC 3′; SEQ ID NO:13). The 973-bp PCR product was ligated into the vector pPCR-Script SK(+) (Stratagene, LaJolla, Calif.). The 973-bp ExoRII/XhoI fragment was moved from this plasmid into the ExoRII/XhoI sites of pGEX-5X-1 (Amersham Pharmacia Biotech, Piscataway, N.J.), creating plasmid pSW67-1. Plasmid pGEX-5X-1 is a GST gene fusion vector which adds a 26-kDa GST moiety onto the N-terminal end of the protein of interest. E. coli BL21(DE3) was transformed with pSW67-1 and pGEX-5X-1, resulting in strains BL21(DE3)/pSW67-1 and BL21(DE3)/pGEX-5X-1.


[0206] The E. coli wcaG sequence (nucleotides 1 through 966 of SEQ ID NO:3) was also amplified by PCR from E. coli W3110 genomic DNA using primers WG EcoRI 5-2 (5′ CTGGAGTCGAATTCATGAGTAAACAACGAG 3′; SEQ ID NO:14) and WG PstI 3 (5′ AACTGCAGTTACCCCCGAAAGCGGTCTTGATTC 3′; SEQ ID NO:15). The 976-bp PCR product was ligated into a pPCR-Script (Stratagene). The 976-bp ExoRII/PstI fragment was moved from this plasmid into the ExoRII/PstI sites of expression vector pKK223-3 (Amersham Pharmacia Biotech), creating plasmid pSW75-2. E. coli JM105 was transformed with pKK223-3 and pSW75-2, resulting in strains JM105/pKK223-3 and JM105/pSW75-2.


[0207] All six strains were grown in duplicate at 37° C. with shaking in 2X YTA medium until an optical density of 0.8-1.0 at 600 nm was reached (about three hours). 2X YTA contains 16 g/L tryptone, 10 g/L yeast extract, 5 g/L sodium chloride and 100 mg/L ampicillin. One of each culture was induced by adding isopropyl β-D-thiogalactopyranoside (IPTG) to 1 mM final concentration. All 12 cultures were incubated for an additional four hours, washed in 0.9% NaCl, and the cells were frozen at −80° C. Prior to pelleting the cells for preparation of extracts, a portion of each culture was used for a plasmid DNA miniprep to confirm the presence of the appropriate plasmids in these strains. A protein preparation of each culture was also run on SDS gels to confirm expression of a protein of the appropriate size where expected. Frozen pellets were thawed, resuspended in 2.5 mL MOPS/EDTA buffer, pH 7.2, broken in a French Press (10,000 psi), spun for 20 min at 20,000×g, assayed for protein as above and diluted to 0.01, 0.1, 1.0 and 3 mg/mL protein.


[0208] Induction of the strain BL21(DE3)/pGEX-5X-1 resulted in high-level expression of a 26-kDa protein indicating the synthesis of the native GST protein. Induction of strain BL21(DE3)/pSW67-1 resulted in high-level expression of a 62-kDa protein, indicating the synthesis of the native GST protein (26K) fused to the wcaG gene product (36K). An aliquot of the fusion protein was treated with the protease Factor Xa (New England Biolabs, Beverly, Mass.), which cleaves near the GST/wcaG junction. Induction of the strain JM105/pSW75-2 resulted in high level expression of a 36-kDa protein, indicating the synthesis of the wcaG gene product. No such protein was detected in JM105/pKK223-3 (vector only).


[0209] Next, it was of interest to test extracts in the standard epimerase assay described in Example 7 to determine if any of the extracts containing the wcaG product could bring about the conversion of GDP-D-mannose to GDP-L-galactose. The extracts to be assayed are:


[0210] BL21(DE3) Group


[0211] 1. BL21(DE3) uninduced


[0212] 2. BL21(DE3) induced with 1 mM IPTG


[0213] 3. BL21(DE3)/pGEX-5X-1 uninduced


[0214] 4. BL21(DE3)/pGEX-5X-1 induced with 1 mM IPTG


[0215] 5. BL21(DE3)/pSW67-1 uninduced


[0216] 6. BL21(DE3)/pSW67-1 induced with 1 mM IPTG; fusion protein intact


[0217] 7. BL21(DE3)/pSW67-1 induced with 1 mM IPTG; GST moiety cleaved


[0218] JM105 Group


[0219] 1. JM105 uninduced


[0220] 2. JM105 induced with 1 mM IPTG


[0221] 3. JM105/pKK223-3 uninduced


[0222] 4. JM105/pKK223-3 induced with 1 mM IPTG


[0223] 5. JM105/pSW75-2 uninduced


[0224] 6. JM105/pSW75-2 induced with 1 mM IPTG


[0225] Extracts 1 and 7 from the BL21(DE3) group and extracts 1 and 6 from the JM105 group were tested for GDP-D-mannose:GDP-L-galactose epimerase-like activity in a pilot experiment. In this initial experiment, no epimerase activity was detected in any of the extracts. At this time, such a result can be attributed to a number of possibilities. First, it is possible that the wcaG gene product is incapable of catalyzing the conversion of GDP-D-mannose to GDP-L-galactose, although this conclusion can not be reached until several other parameters are tested. Second, it is possible that under the assay conditions which are satisfactory to measure activity for the endogenous GDP-D-mannose:GDP-L-galactose epimerase, the wcaG gene product does not have GDP-D-mannose:GDP-L-galactose epimerase-like activity. Therefore, alternate conditions should be tested. Additionally, confirmation experiments should be performed to confirm the accuracy of the pilot conditions. Third, although the BL21(DE3) and the JM105 clones produce proteins of the expected size, the constructs have not been sequenced to confirm the proper coding sequence for the wcaG gene product and thereby rule out PCR or cloning errors which may render the wcaG gene product inactive. Fourth, the protein formed from the cloned sequence is full-length, but inactive, for example, due to incorrect tertiary structure (folding). Fifth, the gene is overexpressed, resulting in accumulation of insoluble and inactive protein products (inclusion bodies). Future experiments will attempt to determine whether the constructs have or can be induced to have the ability to catalyze the conversion of GDP-D-mannose to GDP-L-galactose, and to use the sequences to isolate the endogenous GDP-D-mannose:GDP-L-galactose epimerase.


[0226] Table 12 provides the atomic coordinates for Brookhaven Protein Data Bank Accession Code 1bws:
12TABLE 12HEADEREPIMERASE/REDUCTASE27-SEP-981BWSTITLECRYSTAL STRUCTURE OF GDP-4-KETO-6-DEOXY-D-MANNOSETITLE2EPIMERASE/REDUCTASE FROM ESCHERICHIA COLI A KEY ENZYME INTITLE3THE BIOSYNTHESIS OF GDP-L-FUCOSECOMPNDMOL ID: 1;COMPND2MOLECULE: GDP-4-KETO-6-DEOXY-D-MANNOSE EPIMERASE/REDUCTASE;COMPND3CHAIN: A;COMPND4ENGINEERED: YES;COMPND5BIOLOGICAL UNIT: HOMODIMERSOURCEMOL ID: 1;SOURCE2ORGANISM SCIENTIFIC: ESCHERICHIA COLI;SOURCE3EXPRESSION SYSTEM: ESCHERICHIA COLIKEYWDSEPIMERASE/REDUCTASE, GDP-L-FUCOSE BIOSYNTHESISEXPDTAX-RAY DIFFRACTIONAUTHORDE M. RIZZITONETTIFLORAREVDAT113-JAN-99 1BWS 0JRNLAUTH DE D .RIZZITONETTIVIGEVANISTURLABISSOFLORAJRNLTITL GDP-4-KETO-6-DEOXYD-MANNOSE EPIMERASE/REDUCTASEJRNLTITL 2 FROM ESCHERICHIA COLI, A KEY ENZYME IN THEJRNLTITL 3 BIOSYNTHESIS OF GDP-L-FUCOSE, DISPLAYS THEJRNLTITL 4 STRUCTURAL CHARACTERISTICS OF THE RED PROTEINJRNLTITL 5 HOMOLOGY SUPERFAMILYJRNLREF STRUCTURE (LONDON)1998JRNLREFN  9999REMARK1REMARK2REMARK2RESOLUTION. 2.2 ANGSTROMSREMARK3REMARK3REFINEMENT.REMARK3PROGRAM: TNTREMARK3AUTHORS: TRONRUD, TEN EYCK, MATTHEWSREMARK3REMARK3DATA USED IN REFINEMENT.REMARK3RESOLUTION RANGE HIGH (ANGSTROMS): 2.2REMARK3RESOLUTION RANGE LOW (ANGSTROMS): 15.0REMARK3DATA CUTOFF (SIGMA(F)): 0.0REMARK3COMPLETENESS FOR RANGE (%): 99.7REMARK3NUMBER OF REFLECTIONS: 24481REMARK3REMARK3USING DATA ABOVE SIGMA CUTOFF.REMARK3CROSS-VALIDATION METHOD: NONEREMARK3FREE R VALUE TEST SET SELECTION: NULLREMARK3R VALUE (WORKING + TEST SET): NULLREMARK3R VALUE (WORKING SET): NONEREMARK3FREE R VALUE: NULLREMARK3FREE R VALUE TEST SET SIZE (%): NONEREMARK3FREE R VALUE TEST SET COUNT: NULLREMARK3REMARK3USING ALL DATA, NO SIGMA CUTOFF.REMARK3R VALUE (WORKING + TEST SET, NO CUTOFF): NULLREMARK3R VALUE (WORKING SET, NO CUTOFF): 0.202REMARK3FREE R VALUE (NO CUTOFF): 0.287REMARK3FREE R VALUE TEST SET SIZE (%, NO CUTOFF): NULLREMARK3FREE R VALUE TEST SET COUNT (NO CUTOFF): NULLREMARK3TOTAL NUMBER OF REFLECTIONS (NO CUTOFF): NULLREMARK3REMARK3NUMBER OF NON-HYDROGEN ATOMS USED IN REFINEMENT.REMARK3PROTEIN ATOMS: 2527REMARK3NUCLEIC ACID ATOMS: NULLREMARK3OTHER ATOMS: 109REMARK3REMARK3WILSON B VALUE (FROM FCALC, A**2): NULLREMARK3REMARK3EMS DEVIATIONS FROMIDEAL VALUES.EMSWEIGHTCOUNTREMARK3BOND LENGTHS (A): 0.016; NULL; NULLREMARK3BOND ANGLES (DEGREES): 1.65; NULL; NULLREMARK3TORSION ANGLES (DEGREES): NULL; NULL; NULLREMARK3PSEUDOROTATION ANGLES (DEGREES): NULL; NULL; NULLREMARK3TRIGONAL CARBON PLANES (A): NULL; NULL; NULLREMARK3GENERAL PLANES (A): NULL; NULL; NULLREMARK3ISOTROPIC THERMAL FACTORS (A**2): NULL; NULL; NULLREMARK3NON-BONDED CONTACTS (A): NULL; NULL; NULLREMARK3REMARK3INCORRECT CHIRAL-CENTERS (COUNT): NULLREMARK3REMARK3BULK SOLVENT MODELING.REMARK3METHOD USED: NULLREMARK3KSOL: NULLREMARK3ESOL: NULLREMARK3REMARK3RESTRAINT LIBRARIES.REMARK3STEREOCHEMISTRY : NULLREMARK3ISOTROPIC THERMAL FACTOR RESTRAINTS : NULLREMARK3REMARK3OTHER REFINEMENT REMARKS: NULLREMARK4REMARK41BWS COMPLIES WITH FORMAT V. 2.2, 16-DEC-1996REMARK5REMARK5WARNINGREMARK51BWS: THIS IS LAYER 1 RELFASE.REMARK5REMARK5PLEASE NOTE THAT THIS ENTRY WAS RELEASED AFTER DEPOSITORREMARK5CHECKING AND APPROVAL BUT WITHOUT PDB STAFF INTERVENTION.REMARK5AN AUXILIARY FILE, AUX1BWS.RPT, IS AVAILABLE FROM THEREMARK5PDB FTP SERVER AND IS ACCESSIBLE THROUGH THE 3DB BROWSER.REMARK5THE FILE CONTAINS THE OUTPUT OF THE PROGRAM WHAT CHECK ANDREMARK5OTHER DIAGNOSTICS.REMARK5REMARK5NOMENCLATURE IN THIS ENTRY, INCLUDING HET RESIDUE NAMESREMARK5AND HET ATOM NAMES, HAS NOT BEEN STANDARDIZED BY THE PDBREMARK5PROCESSING STAFF.A LAYER 2 ENTRY WILL BE RELEASED SHORTLYREMARK5AFTER THIS STANDARDIZATION IS COMPLETED AND APPROVED BY THEREMARK5DEPOSITOR. THE LAYER 2 ENTRY WILL BE TREATED AS AREMARK5CORRECTION TO THIS ONE, WITH THE APPROPRIATE REVDAT RECORD.REMARK5REMARK5FURTHER INFORMATION INCLUDING VALIDATION CRITERIA USED INREMARK5CHECKING THIS ENTRY AND A LIST OF MANDATORY DATA FIELDSREMARK5ARE AVAILABLE FROM THE PDB WEB SITE ATREMARK5HTTP://WWW.PDB.BNL.GOV/.REMARK200REMARK200EXPERIMENTAL DETAILSREMARK200EXPERIMENT TYPE: X-RAY DIFFRACTIONREMARK200DATE OF DATA COLLECTION: AUG-1997REMARK200TEMPERATURE (KELVIN): 120REMARK200PH: 6.5REMARK200NUMBER OF CRYSTALS USED:1REMARK200REMARK200SYNCHROTRON (Y/N)NREMARK200RADIATION SOURCE: NONEREMARK200BEAMLINE: NULLREMARK200X-RAY GENERATOR MODEL: RIGAKU RU200REMARK200MONOCHROMATIC OR LAUE (M/L): MREMARK200WAVELENGTH OR RANGE (A): 1.5418REMARK200MONOCHROMATOR: NULLREMARK200OPTICS: NULLREMARK200REMARK200DETECTOR TYPE: IMAGE PLATEREMARK200DETECTOR MANUFACTURER: RAXISREMARK200INTENSITY-INTEGRATION SOFTWARE: MOSFLMREMARK200DATA SCALING SOFTWARE: SCALAREMARK200REMARK200NUMBER OF UNIQUE REFLECTIONS: 24481REMARK200RESOLUTION RANGE HIGH (A): 2.2REMARK200RESOLUTION RANGE LOW (A): 15.0REMARK200REJECTION CRITERIA (SIGMA(I)): NONEREMARK200REMARK200OVERALL.REMARK200COMPLETENESS FOR RANGE (%): 99.7REMARK200DATA REDUNDANCY: 4.3REMARK200R MERGE (I): 0.057REMARK200R SYM (I): NONEREMARK200<I/SIGMA(I)> FOR THE DATA SET: 13.6REMARK200REMARK200IN THE HIGHEST RESOLUTION SHELL.REMARK200HIGHEST RESOLUTION SHELL, RANGE HIGH (A) : NULLREMARK200HIGHEST RESOLUTION SHELL, RANGE LOW (A) : NULLREMARK200COMPLETENESS FOR SHELL (%): NULLREMARK200DATA REDUNDANCY IN SHELL: NULLREMARK200R MERGE FOR SHELL (I): NULLREMARK200R SYM FOR SHELL (I): NULLREMARK200<I/SIGMA(I)> FOR SHELL: NULLREMARK200REMARK200DIFFRACTION PROTOCOL: NULLREMARK200METHOD USED TO DETERMINE THE STRUCTURE: MIRREMARK200SOFTWARE USED: NULLREMARK200STARTING MODEL: NULLREMARK200REMARK200REMARK: NULLREMARK280REMARK280CRYSTALREMARK280SOLVENT CONTENT, VS (%): NULLREMARK280MATTHEWS COEFFICIENT, VM (ANGSTROMS**3/DA): NULLREMARK280REMARK280CRYSTALLIZATION CONDITIONS: NULLREMARK290REMARK290CRYSTALLOGRAPHIC SYNMETRYREMARK290SYMMETRY OPERATORS FOR SPACE GROUP: P 32 2 1REMARK290REMARK290SYMOPSYMMETRYREMARK290NNWHMMOPERATORREMARK2901555X,Y,ZREMARK2902555−Y,X−Y,Z+2/3REMARK2903555Y−X,−X,Z+1/3REMARK2904555Y,X,−ZREMARK2905555X−Y,−Y,1/3−ZREMARK2906555−X,Y−X,2/3−ZREMARK290REMARK290WHERE NNN —> OPERATOR NUMBERREMARK290MMM —> TRANSLATION VECTORREMARK290REMARK290CRYSTALLOGRAPHIC SYMMETRY TRANSFORMATIONSREMARK290THE FOLLOWING TRANSFORMATIONS OPERATE ON THE ATOM/HETATMREMARK290RECORDS IN THIS ENTRY TO PRODUCE CRYSTALLOGRAPHICALLYREMARK290RELATED MOLECULES.REMARK290SMTRY111.0000000.0000000.0000000.00000REMARK290SMTRY210.0000001.0000000.0000000.00000REMARK290SMTRY310.0000000.0000001.0000000.00000REMARK290SMTRY12−0.500045−0.8659740.0000000.00000REMARK290SMTRY220.866077−0.4999550.0000000.00000REMARK290SMTRY320.0000000.0000001.00000050.58553REMARK290SMTRY13−0.4999550.8659740.0000000.00000REMARK290SMTRY23−0.866077−0.5000450.0000000.00000REMARK290SMTRY330.0000000.0000001.00000025.29276REMARK290SMTRY14−0.5000450.8659220.0000000.00000REMARK290SMTRY240.8660770.5000450.0000000.00000REMARK290SMTRY340.0000000.0000001.0000000.00000REMARK290SMTRY151.0000000.0001040.0000000.00000REMARK290SMTRY250.0000001.0000000.0000000.00000REMARK290SMTRY350.0000000.0000001.00000025.29276REMARK290SMTRY16−0.4999550.8660260.0000000.00000REMARK290SMTRY26−0.8660770.4999550.0000000.00000REMARK290SMTRY360.0000000.0000001.00000050.58553REMARK290REMARK290REMARK: NULLREMARK465REMARK465MISSING RESIDUESREMARK465THE FOLLOWING RESIDUES WERE NOT LOCATED IN THEREMARK465EXPERIMENT. (M = MODEL NUMBER; RES = RESIDUE NAME; C = CHAINREMARK465IDENTIFIER; SSSEQ = SEQUENCE NUMBER; I = INSERTION CODE):REMARK465REMARK465M RES C SSSEQIREMARK465MET A1REMARK465SER A2REMARK465ASP A317REMARK465ARG A318REMARK465PHE A319REMARK465ARG A320REMARK465GLY A321REMARK800REMARK800SITEREMARK800SITE IDENTIFIER: CATREMARK800SITE DESCRIPTION:REMARK800CATALYTIC RESIDUEREMARK800REMARK800SITE IDENTIFIER: CATREMARK800SITE DESCRIPTION:REMARK800CATALYTIC RESIDUEREMARK800REMARK800SITE IDENTIFIER: CATREMARK800SITE DESCRIPTION:REMARK800CATALYTIC RESIDUEREMARK800DBREF1BWS A3316SWSP32055FCL ECOLISEQRES1A321METSERLYSGLNARGVALPHEILEALAGLYHISARGGLYSEQRES2A321METVALGLYSERALAILEARGARGGLNLEUGLUGLNARSSEQRES3A321GLYASPVALGLULEUVALLEUARSTHRARGASPGLULEOSEQRES4A321ASNLEULEUASPSERARGALAVALHISASPPHEPHEALASEQRES5A321SERGLUARSILEASPGLNVALTYRLEUALAALAALALYSSEQRES6A321VALGLYGLYILEVALALAASNASNTHRTYRPROALAASPSEQRES7A321PHEILETYRGLNASNMETMETILEGLUSERASNILEILESEQRES8A321HISALAALAHISGLNASNASPVALASNLYSLEULEUPHESEQRES9A321LEUSLYSERSERCYSILETYRPROLYSLEUALALYSGLNSEQRES10A321PROMETALAGLUSERGLULEULEUGLNGLYTHRLEUGLUSEQRES11A321PROTERASNGLUPROTYRALAILEALALYSILEALASLYSEQRES12A321ILELYSLEUCYSGLUSERTYRASNARGGLNTYRGLYARGSEQRES13A321ASPTYRARSSERVALMETPROTHEASNLEUTYRGLYPROSEQRES14A321HISASPASNPHEHISPROSERASNSERHISVALILEPROSEQRES15A321ALALEULEUARGARGPHEHISGLUALATHRALAGLNASNSEQRES16A321ALAPROASPVALVALVALTRPGLYSERGLYTHRPROMETSEQRES17A321ARSGLUPHELEUHISVALASPASPMETALAALAALASERSEQRES18A321ILEHISVALMETGLULEUALAHISGLUVALTRPLEUGLUSEQRES19A321ASNTHRGLNPROMETLEUSERHISILEASNVALGLYTHRSEQRES20A321SLYVALASPCYSTHRILEARGASPVALALAGLNTHRILESEQRES21A321ALALYSVALVALGLYTYRLYSGLYARGVALVALPHEASPSEQRES22A321ALASERLYSPROASPGLYTHRPROARGLYSLEULEUASPSEQRES23A321VALTHRARGLEUHISGLNLEUGLYTRPTYRHISGLUILESEQRES24A321SERLEUGLUALAGLYLEUALASERTHRTYRGLNTRPPHESEQRES25A321LEUGLUASNGLNASPARGPHEARGGLYHETNDP 10HETNAMNDP NICOTINAMIDE ADENINE DINUCLEOTIDE PHOSPHATEHETSYNNDP NADPFORMUL2NDPC21 H23 N7 O17 P3 3-FORMUL3HOH*109(E2 O1)HELIX11MET A14GLN A25112HELIX22SER A44GLU A54111HELIX33ILE A69THR A7416HELIX44PRO A76ASN A97122HELIX55SER A108ILE A11053HELIX66GLU A121GLU A12353HELIX77GLU A134TYR A154121HELIX88VAL A180ALA A193114HELIX99VAL A214GLU A226113HELIX1010HIS A229GLUA23416HELIX1111ILE A253VAL A264112HELIX1212THR A288GLN A29215HELIX1313LEU A301GLU A314114SHEET1A6 VAL A29VAL A320SHEET2A6 GLN A4ALA A91NGLN A4OGLU A30SHEET3A6 GLN A58LEU A611NGLN A58OPHE A7SHEET4A6 LYS A101LEU A1051NLYS A101OVAL A59SHEET5A6 ASP A157PRO A1631NASP A157OLEU A102SHEET6A6 ILE A243VAL A2451NILE A243OMET A162SHEET1B2 ASN A165TYR A1670SHEET2B2 PHE A211HIS A2131NLEU A212OASN A165SHEET1C2 ASP A198TRP A2020SHEET2C2 ARG A269ASP A2731NARG A269OVAL A199SITE1CAT1 TYR136SITE2CAT1 LYS140SITE3CAT1 SER107CRYST1104.200104.20075.88090.0090.00120.00P32216ORIGX11.0000000.0000000.0000000.00000ORIGX20.0000001.0000000.0000000.00000ORIGX30.0000000.0000001.0000000.00000SCALE10.0095970.0055410.0000000.00000SCALE20.0000000.0110810.0000000.00000SCALE30.0000000.0000000.0131790.00000HETATM1OHOH155.652−16.80622.5351.008.73OHETATM2OHOH358.494−10.63918.7401.0013.17OHETATM3OHOH458.230−11.71527.7701.0019.07OHETATM4OHOH557.252−3.75930.1071.0011.21OHETATM5OHOH658.298−10.01125.5271.0015.74OHETATM6OHOH749.3216.58338.8151.0019.33OHETATM7OHOH853.785−4.26222.4641.0010.94OHETATM8OHOH1074.6522.8889.1411.0017.80OHETATM9OHOH1149.7610.82632.8961.0022.02OHETATM10OHOH1255.530−11.16228.5261.0011.39OHETATM11OHOH1375.0277.03427.3531.0016.30OHETATM12OHOH1449.994−2.31411.0321.0021.33OHETATM13OHOH1561.323−8.95929.6571.0022.84OHETATM14OHOH1661.029−11.56029.1311.0021.24OHETATM15OHOH1750.6845.88110.1301.0015.88OHETATM16OHOH1864.506−6.30232.9891.0021.05OHETATM17OHOH1957.856−16.39825.0851.0022.86OHETATM18OHOH2038.97926.53619.0701.0021.08OHETATM19OHOH2138.04233.48721.9091.0019.01OHETATM20OHOH2438.17235.77520.8271.0033.46OHETATM21OHOH2570.916−11.12815.2441.0031.37OHETATM22OHOH2654.20519.36028.3961.0035.76OHETATM23OHOH2750.4362.65416.7831.0012.25OHETATM24OHOH2869.69219.10838.9791.0049.77OHETATM25OHOH2956.432−8.87719.3031.0022.52OHETATM26OHOH3060.8323.41542.3491.0017.39OHETATM27OHOH3153.889−12.70629.7641.0022.40OHETATM28OHOH3237.88726.37328.0581.0018.09OHETATM29OHOH3349.20111.17326.8671.0033.95OHETATM30OHOH3446.762−0.27831.3941.0020.63OHETATM31OHOH3541.73127.56843.3021.0027.39OHETATM32OHOH3666.82711.20228.9291.0013.23OHETATM33OHOH3746.83414.39640.8191.0046.02OHETATM34OHOH3861.3421.06443.8681.0026.68OHETATM35OHOH4270.59716.42237.8371.0019.26OHETATM36OHOH4472.275−9.08933.4071.0022.11OHETATM37OHOH4542.68534.46133.9551.0017.32OHETATM38OHOH4653.48013.39438.3641.0020.19OHETATM39OHOH4756.08521.75744.7441.0033.50OHETATM40OHOH4835.74132.69123.5171.0019.49OHETATM41OHOH4940.45836.70034.3121.0034.53OHETATM42OHOH5075.4407.26729.9481.0018.07OHETATM43OHOH5147.47618.34720.8511.0034.16OHETATM44OHOH5352.837−16.34419.5871.0025.92OHETATM45OHOH5546.4159.07320.1081.0031.91OHETATM46OHOH5745.91235.17036.1331.0035.55OHETATM47OHOH5860.247−2.88041.9191.0016.85OHETATM48OHOH6064.9746.08624.5011.0032.16OHETATM49OHOH6152.1034.6834.9781.0035.72OHETATM50OHOH6250.88840.15436.4631.0038.35OHETATM51OHOH6344.37331.23337.3361.0020.07OHETATM52OHOH6457.28027.75742.4511.0021.74OHETATM53OHOH6558.40923.76945.5171.0058.42OHETATM54OHOH6668.690−11.76435.3351.0057.07OHETATM55OHOH6742.74625.15323.4651.0027.05OHETATM56OHOH6853.638−16.45732.2921.0031.71OHETATM57OHOH6933.39041.71631.4081.0029.92OHETATM58OHOH7057.76817.89742.4341.0025.75OHETATM59OHOH7175.6479.16411.7661.0035.13OHETATM60OHOH7262.03233.29244.7491.0046.18OHETATM61OHOH7347.31014.31234.2851.0031.18OHETATM62OHOH7479.660−3.94715.9131.0034.63OHETATM63OHOH7546.9295.3434.5501.0023.14OHETATM64OHOH7673.47512.03928.4121.0027.26OHETATM65OHOH7746.297−6.98230.0321.0043.41OHETATM66OHOH7868.528−3.42240.8691.0038.47OHETATM67OHOH7962.080−1.44842.8031.0024.60OHETATM68OHOH8065.33018.15040.7261.0041.00OHETATM69OHOH8151.77516.12837.6071.0025.11OHETATM70OHOH8354.26628.68243.3131.0027.61OHETATM71OHOH8573.291−15.47920.6031.0037.54OHETATM72OHOH8634.76021.47928.5441.0043.87OHETATM73OHOH8737.32624.13129.6771.0024.47OHETATM74OHOH8865.16820.1486.7351.0026.10OHETATM75OHOH8959.19612.08913.6301.0025.24OHETATM76OHOH9166.576−6.23540.2791.0043.11OHETATM77OHOH9337.33929.39425.5151.0027.56OHETATM78OHOH9452.339-17.01442.2711.0048.96OHETATM79OHOH9540.51132.92731.7171.0022.46OHETATM80OHOH9678.58013.12134.1381.0027.98OHETATM81OHOH9765.09015.70434.8761.0018.96OHETATM82OHOH9984.5622.95127.1811.0035.92OHETATM83OHOH10050.3869.7619.6461.0023.18OHETATM84OHOH10167.649−0.85138.7641.0024.99OHETATM85OHOH10244.0014.29334.3151.0031.13OHETATM86OHOH10359.386−5.07126.2111.0029.10OHETATM87OHOH10477.3644.74541.5061.0035.32OHETATM88OHOH10559.03421.20132.4141.0023.43OHETATM89OHOH10642.46334.69814.3271.0038.86OHETATM90OHOH10770.21714.29220.8641.0042.39OHETATM91OHOH10876.9998.13025.8621.0032.91OHETATM92OHOH10949.76629.93722.1731.0042.52OHETATM93OHOH11072.47313.53638.8231.0033.32OHETATM94OHOH11164.328−12.08438.6081.0037.99OHETATM95OHOH11260.16116.38242.6821.0035.68OHETATM96OHOH11347.60213.63927.0161.0026.01OHETATM97OHOH11564.60611.64440.1071.0030.33OHETATM98OHOH11661.231−15.13727.2551.0038.76OHETATM99OHOH11765.324−11.22335.0981.0030.45OHETATM100OHOH11956.60217.21944.9321.0036.53OHETATM101OHOH12037.56419.86023.1351.0031.27OHETATM102OHOH12164.8455.05721.1321.0045.57OHETATM103OHOH12363.39116.80126.8981.0038.46OHETATM104OHOH12442.5676.13432.6351.0031.56OHETATM105OHOH12572.48513.23635.0591.0029.61OHETATM106OHOH12665.2293.65044.0321.0036.86OHETATM107OHOH12737.0897.14831.0831.0039.58OHETATM108OHOH12873.32710.54612.1231.0034.97OHETATM109OHOH12974.45010.29926.5981.0030.80OHETATM110AO5*NDPA167.52413.05526.6921.0036.42OHETATM111AC5*NDPA168.08912.29725.6141.009.30CHETATM112AC4*NDPA169.60112.12425.8581.0027.73CHETATM113AO4*NDPA170.19311.25824.8481.0022.87OHETATM114AC3*NDPA170.48413.39025.8731.0017.83CHETATM115AO3*NDPA171.19213.43627.0661.0016.11OHETATM116AC2*NDPA171.37313.22024.6261.0011.46CHETATM117AO2*NDPA172.62313.88624.6551.0031.96OHETATM118AC1*NDPA171.51011.70224.6561.0019.02CHETATM119O3NDPA165.33613.59026.1291.0020.59OHETATM120NO5*NDPA163.53611.94326.4481.0028.99OHETATM121NC5*NDPA164.32810.84325.9571.0024.89CHETATM122NC4*NDPA163.4679.64625.6861.0031.79CHETATM123NO4*NDPA162.8379.33726.9081.0028.82OHETATM124NC3*NDPA162.3409.83724.6651.0011.50CHHTATM125NO3*NDPA162.8919.40223.4611.0028.60OHETATM126NC2*NDPA161.1528.99625.1381.0028.11CHETATM127NO2*NDPA160.8817.66224.7151.0024.30OHETATM128NC1*NDPA161.5478.87526.5801.0035.35CHETATM129AP2*NDPA173.10415.06923.8231.0032.96PHETATM130AOP1NDPA174.50015.30824.3081.0037.84OHETATM131AOP2NDPA172.79714.92522.3481.0036.66OHETATM132AOP3NDPA172.16316.21723.9581.0031.97OHETATM133APNDPA166.66014.25726.3931.0026.17XXHETATM134AO1NDPA166.88614.79525.0471.0015.31XXHETATM135AO2NDPA166.43915.20727.5211.0034.39XXHETATM136AN9NDPA171.82011.22423.3531.0013.63XXHETATM137AC8NDPA171.10411.31622.2001.0012.41XXHETATM138AN7NDPA171.75810.83521.1611.0015.71XXHETATM139AC5NDPA172.93310.31321.7101.0016.17XXHETATM140AC6NDPA174.0539.65721.1401.0031.35XXHETATM141AN6NDPA174.1659.46419.8191.0012.59XXHETATM142AN1NDPA175.0789.28021.9421.0017.56XXHETATM143AC2NDPA174.9719.57823.2511.0015.44XXHETATM144AN3NDPA174.02710.30223.8891.0024.82XXHETATM145AC4NDPA173.03610.65323.0471.0017.48XXHETATM146NPNDPA164.18313.10627.1911.0025.47NHETATM147NO1NDPA163.14214.16927.2531.0028.69NHETATM148NO2NDPA164.83712.64328.4921.0024.32NHETATM149NN1NDPA160.5989.77527.1091.0023.63NHETATM150NC2NDPA160.14310.90526.442−99.0078.36NHETATM151NC3NDPA159.07011.64827.007−99.00100.00NHETATM152NC7NDPA158.49713.01726.528−99.00100.00NHETATM153NO7NDPA159.35813.70325.972−99.00100.00NHETATM154NN7NDPA157.20713.40026.912−99.0084.38NHETATM155NC4NDPA158.44211.14628.137−99.00100.00NHETATM156NC5NDPA158.9129.96328.754−99.00100.00NHETATM157NC6NDPA159.9519.26628.147−99.00100.00NATOM158NLYSA376.227−5.63244.3151.0061.49NATOM159CALYSA376.152−4.30243.6841.0058.00CATOM160CLYSA375.985−4.42142.1711.0052.79CATOM161OLYSA376.921−4.73741.4191.0044.76OATOM162CELYSA377.359−3.41744.0301.0059.74CATOM163CGLYSA377.011−1.94444.3141.0050.87CATOM164CDLYSA378.208−1.16144.8941.0061.21CATOM165CELYSA377.855−0.37746.1861.00100.00CATOM166NZLYSA378.857−0.40147.3431.0070.61NATOM167NGLNA474.746−4.24241.7471.0045.15NATOM168CAGLNA474.408−4.32640.3471.0037.18CATOM169CGLNA474.983−3.16639.5611.0034.93CATOM170OGLNA475.127−2.05040.0871.0028.48OATOM171CEGLNA472.915−4.44540.2211.0034.65CATOM172CGGLNA472.456−5.85440.5841.0031.82CATOM173CDGLNA472.570−6.78839.4051.0079.25CATOM174OE1GLNA472.165−6.45238.2861.00100.00OATOM175NE2GLNA473.206−7.92539.6231.0080.24NATOM176NARGA575.475−3.49538.3751.0027.16NATOM177CAARGA576.146−2.54637.4831.0039.16CATOM178CARGA575.191+321 2.01836.4331.0038.22CATOM179OARGA574.938−2.69835.4381.0032.44OATOM180CBARGA577.398−3.16336.8261.0041.76CATOM181CGARGA578.692−2.95437.6631.0037.34CATOM182CDARGA580.015−3.23636.8761.0032.99CATOM183NEARGA581.036−2.20337.1251.0025.71NATOM184CZARGA581.617−1.48836.1691.0032.53CATOM185NE1ARGA581.293−1.70434.9041.0040.07NATOM186NH2ARGA582.516−0.55136.4741.00100.00NATOM187NVALA674.743−0.77336.6591.0032.08NATOM188CAVALA673.715−0.08235.8811.0028.89CATOM189CVALA674.1611.02134.8971.0029.37CATOM190OVALA674.7452.04135.2741.0022.50OATOM191CBVALA672.5770.37836.8131.0023.52CATOM192CG1VALA671.3660.96036.0061.0020.29CATOM193CG2VALA672.108−0.85237.6441.0018.45CATOM194NPHEA773.9480.74933.6151.0022.92NATOM195CAPHEA774.2671.71032.5731.0027.15CATOM196CPHEA772.9752.42332.1921.0020.24CATOM197OPHEA771.9941.78831.8151.0020.71OATOM198CBPHEA774.8641.00431.3741.0018.98CATOM199CGPHEA774.9161.83630.1151.0021.83CATOM200CD1PHEA775.5213.08730.1081.0019.36CATOM201CD2PHEA774.4831.28428.8861.0023.50CATOM202CE1PHEA775.6143.82828.9021.0027.52CATOM203CE2PHEA774.5481.99627.6851.0019.33CATOM204CZPHEA775.1283.25527.6731.0018.59CATOM205NILEA872.9593.72732.4541.0018.75NATOM206CAILEA871.8444.58832.1121.0014.25CATOM207CILEA872.3375.35130.9091.0011.22CATOM208OILEA873.2596.16530.9981.0017.76OATOM209CEILEA871.5075.60533.2121.0014.15CATOM210CG1ILEA871.3564.94934.5821.008.24CATOM211CG2ILEA870.1836.34232.8741.0016.85CATOM212CD1ILEA871.0915.96135.7071.0010.32CATOM213NALAA971.8964.90629.7521.0016.42NATOM214CAALAA972.2565.55928.5131.0018.74CATOM215CALAA971.5306.91328.5111.0028.45CATOM216OALAA970.4117.03229.0451.0022.39OATOM217CBALAA971.8084.73127.3111.0014.43CATOM218NGLYA1072.1997.92227.9401.0020.06NATOM219CAGLYA1071.7069.28427.9111.0018.62CATOM220CGLYA1071.4079.81929.3051.0016.40CATOM221OGLYA1070.37910.44829.4811.0017.36OATOM222NHISA1172.2959.58130.2721.0010.32NATOM223CAHISA1172.0689.96631.6881.0013.90CATOM224CHISA1172.00811.50431.9161.0021.52CATOM225OHISA1171.70011.99432.9831.0013.22OATOM226CEHISA1173.1539.35032.5811.0014.88CATOM227CGHISA1174.5029.94832.3261.0023.73CATOM228ND1HISA1175.2399.64831.1971.0024.90NATOM229CD2HISA1175.16710.95232.9561.0016.35CATOM230CE1HISA1176.31710.40731.1701.0022.54CATOM231NE2HISA1176.27111.24032.1971.0017.56NATOM232NARGA1272.31012.28830.9081.0022.31NATOM233CAARGA1272.14713.69331.1221.0018.90CATOM234CARGA1270.85114.24430.4951.0026.34CATOM235OARGA1270.57215.42630.6041.0025.37OATOM236CEARGA1273.35214.41830.5871.0025.93CATOM237CGARGA1274.58213.94331.2791.0053.87CATOM238CDARGA1275.75714.61930.6991.0032.53CATOM239NEARGA1276.35915.57631.6051.0069.90NATOM240CZARGA1276.97116.67531.1781.00100.00CATOM241NH1ARGA1277.00116.94829.8671.00100.00NATOM242NH2ARGA1277.52617.50832.0561.00100.00NATOM243NGLYA1370.07813.42029.8001.0018.25NATOM244CAGLYA1368.80213.90429.2581.0016.50CATOM245CGLYA1367.84914.14430.4281.0018.88CATOM246OGLYA1368.20213.90231.6241.0014.04OATOM247NMETA1466.65314.63230.1031.0016.00NATOM248CAMETA1465.68814.98131.1281.0013.49CATOM249CMETA1465.29313.76031.9011.0014.02CATOM250OMETA1465.40813.71333.1451.0017.06OATOM251CEMETA1464.44215.60530.5241.0011.57CATOM252CGMETA1463.32015.62831.5591.0020.77CATOM253SDMETA1461.92616.76631.1101.0029.165ATOM254CEMETA1462.52717.10829.5741.0030.68CATOM255NVALA1564.79812.76931.1581.0025.23NATOM256CAVALA1564.43911.46831.7381.0020.90CATOM257CVALA1565.65410.71332.3781.0017.26CATOM258OVALA1565.59010.23933.5241.0018.41OATOM259CBVALA1563.75210.55030.6801.0023.25CATOM260CG1VALA1563.3309.25331.3101.0015.71CATOM261CG2VALA1562.52811.19330.1831.0013.40CATOM262NGLYA1666.78410.64231.6651.0020.39NATOM263CAGLYA1667.9419.90432.1861.0019.54CATOM264CGLYA1668.52210.43233.4921.0029.29CATOM265OGLYA1668.8969.65934.4341.0016.91OATOM266NSERA1768.64211.75533.4991.0012.53NATOM267CASERA1769.15412.46034.6501.0021.93CATOM268CSERA1768.20912.21435.8181.0013.35CATOM269OSERA1768.67711.95736.9151.0024.19OATOM270CBSERA1769.37813.94234.3331.0015.52CATOM271OGSERA1768.15314.61934.3721.0022.95OATOM272NALAA1866.89612.14335.5901.0017.52NATOM273CAALAA1865.99111.82836.7291.0013.14CATOM274CALAA1866.22010.39337.3071.0019.29CATOM275OALAA1866.14910.15038.5221.0016.94OATOM276CBALAA1864.46012.04636.3341.0014.33CATOM277NILEA1966.4849.43236.4301.0020.80NATOM278CAILEA1966.7058.07836.9001.0018.08CATOM279CILEA1967.9758.09037.7301.0016.09CATOM280OILEA1968.0187.53038.8201.0020.73OATOM281CBILEA1966.8047.07935.7101.0017.58CATOM282CG1ILEA1965.4446.81235.1621.0010.09CATOM283CG2ILEA1967.3095.66636.1331.0021.60CATOM284CD1ILEA1965.5286.36133.7411.0019.05CATOM285NARGA2068.9848.77137.1981.0018.13NATOM286CAARGA2070.2868.89737.8361.0020.25CATOM287CARGA2070.2319.49139.2421.0030.62CATOM288OARGA2070.9579.09140.1291.0033.00OATOM289CBARGA2071.2019.74336.9571.0011.71CATOM290CGARGA2072.6109.78137.4491.0023.79CATOM291CDARGA2072.88111.10738.0601.0036.76CATOM292NEARGA2074.29711.44338.0621.0048.34NATOM293CZARGA2074.99011.84136.9881.00100.00CATOM294NH1ARGA2074.39311.93135.8081.00100.00NATOM295NH2ARGA2076.28912.13937.0761.00100.00NATOM296NARGA2169.36810.46139.4391.0022.10NATOM297CAARGA2169.21611.05240.7501.0017.45CATOM298CARGA2168.72110.00741.7301.0026.71CATOM299OARGA2169.14710.00142.8851.0030.27OATOM300CBARGA2168.14212.14440.7081.0017.93CATOM301CGARGA2168.68213.52240.3211.0027.57CATOM302CDARGA2167.58614.59940.1301.0023.02CATOM303NEARGA2167.61915.00038.7431.0055.12NATOM304CZARGA2166.53815.10337.9951.0010.55CATOM305NH1ARGA2165.34314.97438.5521.0029.80NATOM306NH2ARGA2166.66515.43536.7151.0061.45NATOM307NGLNA2267.7139.22341.3451.0027.48NATOM308CAGLNA2267.1678.25742.3131.0024.79CATOM309CGLNA2268.1377.12742.5471.0031.37CATOM310OGLNA2268.3946.72443.6851.0027.47OATOM311CBGLNA2265.8187.70641.8941.0017.11CATOM312CGGLNA2264.9218.74541.2431.0066.14CATOM313CDGLNA2263.4258.45641.3971.0041.27CATOM314OE1GLNA2263.0027.32941.7621.0029.34OATOM315NE2GLNA2262.6109.46441.0461.0020.12NATOM316NLEUA2368.6976.65241.4481.0027.99NATOM317CALEUA2369.6495.57541.5001.0024.48CATOM318CLEUA2370.8285.97142.3341.0028.87CATOM319OLEUA2371.2885.21843.1651.0030.79OATOM320CBLEUA2370.0365.10740.0891.0022.72CATOM321CGLEUA2368.9664.07239.6581.0026.16CATOM322CD1LEUA2369.2713.08338.4811.0024.80CATOM323CD2LEUA2368.4273.28440.8351.0022.91CATOM324NGLUA2471.2797.19242.1531.0028.77NATOM325CAGLUA2472.4197.67542.9091.0033.79CATOM326CGLUA2472.3637.38844.4121.0035.94CATOM327OGLUA2473.3817.14045.0311.0039.07OATOM328CBGLUA2472.6479.16542.6531.0036.21CATOM329CGGLUA2474.0689.48242.2431.0042.54CATOM330CDGLUA2474.15810.68941.3331.0089.51CATOM331OE1GLUA2473.38611.66341.5491.0043.21OATOM332OE2GLUA2474.99410.64640.3981.0066.28OATOM333NGLNA2571.1827.42245.0001.0045.70NATOM334CAGLNA2571.0397.15246.4321.0047.57CATOM335CGLNA2570.8875.66946.7401.0067.34CATOM336OGLNA2570.2855.28647.7261.0074.06OATOM337CBGLNA2569.7837.84246.9051.0051.85CATOM338CGGLNA2569.5009.08446.1091.0044.91CATOM339CDGLNA2568.4199.91346.7421.00100.00CATOM340OE1GLNA2568.2719.94747.9721.00100.00OATOM341NE2GLNA2567.62410.60245.9111.00100.00NATOM342NARGA2671.3224.83145.8251.0075.37NATOM343CAARGA2671.1823.40746.0261.0074.87CATOM344CARGA2672.5682.79146.1471.0074.08CATOM345OARGA2673.4402.99745.2891.0077.00OATOM346CBARGA2670.3902.79044.8851.0052.44CATOM347CGARGA2668.9162.92745.0701.0043.51CATOM348CDARGA2668.4281.75245.8641.0040.70CATOM349NEARGA2667.2001.17645.3381.0042.33NATOM350CZARGA2667.1260.50844.1961.0032.07CATOM351NH1ARGA2668.2150.32443.4861.0044.02NATOM352NH2ARGA2665.9680.01743.7711.0077.32NATOM353NGLYA2772.7782.11447.2661.0046.30NATOM354CAGLYA2774.0601.53147.5491.0046.82CATOM355CGLYA2774.1400.16546.9231.0055.45CATOM356OGLYA2775.204−0.45346.8771.0064.43OATOM357NASPA2873.017−0.31546.4281.0040.98NATOM358CAASPA2873.016−1.64745.8611.0040.35CATOM359CASPA2873.266−1.53644.4001.0039.55CATOM360OASPA2873.109−2.51843.6541.0048.80OATOM361CBASPA2871.680−2.33546.1271.0047.80CATOM362CGASPA2870.503−1.37346.0641.0035.34CATOM363OD1ASPA2870.705−0.14046.0951.0039.23OATOM364OD2ASPA2869.383−1.87045.8721.0069.86OATOM365NVALA2973.651−0.32943.9961.0031.03NATOM366CAVALA2973.881−0.05042.5911.0028.44CATOM367CVALA2975.1660.67642.2811.0028.00CATOM368OVALA2975.5051.69942.8921.0034.83OATOM369CBVALA2972.6960.76042.0001.0030.68CATOM370CG1VALA2972.9351.08840.5491.0023.65CATOM371CG2VALA2971.416−0.02842.1561.0027.95CATOM372NGLUA3075.8240.21941.2301.0030.76NATOM373CAGLUA3076.9950.92440.7361.0028.38CATOM374CGLUA3076.6781.47139.3321.0031.03CATOM375OGLUA3076.3680.72038.3971.0026.64OATOM376CBGLUA3078.1990.00640.7221.0031.84CATOM377CGGLUA3079.3550.53941.5331.0089.26CATOM378CDGLUA3080.6670.26440.8581.00100.00CATOM379OE1GLUA3081.082−0.92240.8721.0088.94OATOM380OE2GLUA3081.2021.20640.2191.00100.00OATOM381NLEUA3176.6652.78939.2071.0022.24NATOM382CALEUA3176.2693.39137.9451.0029.37CATOM383CLEUA3177.4043.50736.9411.0025.79CATOM384OLEUA3178.4853.96937.2561.0029.41OATOM385CBLEUA3175.6324.76038.1911.0030.20CATOM386CGLEUA3174.3294.76338.9941.0029.37CATOM387CD1LEUA3173.8416.14339.2401.0023.43CATOM388CD2LEUA3173.2753.96238.2811.0023.04CATOM389NVALA3277.1463.10035.7111.0021.94NATOM390CAVALA3278.1433.26534.6851.0025.48CATOM391CVALA3277.5354.24233.6691.0038.76CATOM392OVALA3276.4293.99933.1801.0029.70OATOM393CBVALA3278.5171.90234.0551.0034.25CATOM394CG1VALA3279.5872.07932.9701.0030.56CATOM395CG2VALA3279.0030.95035.1391.0025.27CATOM396NLEUA3378.2195.37533.4571.0030.19NATOM397CALEUA3377.7326.46332.6211.0022.71CATOM398CLEUA3378.7276.97931.6451.0029.55CATOM399OLEUA3379.8967.15231.9881.0030.09OATOM400CBLEUA3377.4237.63533.5141.0019.75CATOM401CGLEUA3376.7297.20034.7791.0019.38CATOM402CD1LEUA3376.8148.34435.7621.0027.24CATOM403CD2LEUA3375.2716.91334.4441.0022.07CATOM404NARGA3478.2397.42130.4961.0015.09NATOM405CAARGA3479.1548.00829.5411.0026.04CATOM406CAARGA3478.4699.17328.9161.0036.57CATOM407OARGA3477.2889.13028.6511.0038.59OATOM408CBARGA3479.4867.04828.3981.0022.89CATOM409CGARGA3480.5796.08128.7061.0023.29CATOM410CDARGA3481.3706.57529.8601.0052.06CATOM411NEARGA3481.7835.45830.7111.0080.25NATOM412CZARGA3482.6464.53030.3231.0041.94CATOM413NH1ARGA3483.1734.59629.1041.0053.02NATOM414NH2ARGA3482.9833.54731.1481.0025.56NATOM415NTHRA3579.24810.15628.5391.0031.58NATOM416CATHRA3578.70311.28227.8331.0029.33CATOM417CTHRA3578.71910.95126.3401.0032.53CATOM418OTHRA3579.3509.94425.9621.0028.08OATOM419CBTHRA3579.52712.52728.1451.0037.49CATOM420OG1THRA3580.64412.42927.5601.0031.91OATOM421CG2THRA3579.62712.64229.6511.0019.38CATOM422NARGA3678.03211.78025.5291.0030.02NATOM423CAARGA3678.00211.63924.0561.0029.37CATOM424CARGA3679.40611.76523.5031.0031.46CATOM425OARGA3679.77211.01222.5911.0036.56OATOM426CBARGA3677.05412.65023.3541.0037.34CATOM427CGARGA3676.93712.46521.846−99.0049.47CATOM428CDARGA3676.02013.51521.232−99.0063.09CATOM429NEARGA3675.52813.12419.915−99.0075.23NATOM430CZARGA3674.38113.54919.391−99.0091.44CATOM431NH1ARGA3673.60514.37520.079−99.0079.32NATOM432NH2ARGA3674.00913.14418.185−99.0078.73NATOM433NASPA3780.21712.67724.0631.0041.30NATOM434CAASPA3781.60612.71023.6011.0044.91CATOM435CASPA3782.41011.48124.0431.0024.99CATOM436OASPA3783.21110.97823.2611.0042.22OATOM437CBASPA3782.34714.04823.718−99.0047.07CATOM438CGASPA3781.88114.88724.876−99.0062.99CATOM439OD1ASPA3780.67914.83925.204−99.0064.45OATOM440OD2ASPA3782.71115.63825.429−99.0069.84OATOM441NGLUA3882.12910.95025.2351.0019.39NATOM442CAGLUA3882.7909.71725.6821.0027.84CATOM443CGLUA3882.2038.52724.9011.0037.14CATOM444OGLUA3882.8737.51124.6991.0035.04OATOM445CBGLUA3882.6919.43527.2071.0025.18CATOM446CGGLUA3883.11610.54928.1831.0037.45CATOM447CDGLUA3882.80710.21229.6551.0021.13CATOM448OE1GLUA3881.6239.99730.0141.0055.97OATOM449OE2GLUA3883.7579.97830.4191.0098.78OATOM450NLEUA3980.9488.61024.4781.0025.52NATOM451CALEUA3980.4407.48323.7391.0018.17CATOM452CLEUA3979.2917.76422.8251.0020.34CATOM453OLEUA3978.1527.81023.2591.0026.35OATOM454CBLEUA3980.1236.31324.6571.0014.56CATOM455CGLEUA3979.4105.07524.0581.0019.52CATOM456CD1LEUA3980.2054.39222.9941.0018.84CATOM457CD2LEUA3978.8904.05125.0841.0017.41CATOM458NASNA4079.5987.86021.5431.0016.73NATOM459CAASNA4078.5487.97120.5401.0021.55CATOM460CASNA4077.7986.64920.3081.0024.53CATOM461OASNA4078.3285.72019.6881.0019.96OATOM462CBASNA4079.1308.36719.2161.0018.45CATOM463CGASNA4078.0548.72718.2251.0042.19CATOM464OD1ASNA4078.3279.09317.0801.0038.89OATOM465ND2ASNA4076.8278.73018.6971.0023.71NATOM466NLEUA4176.5436.62220.7541.0021.08NATOM467CALEUA4175.6495.46520.6501.0015.03CATOM468CLEUA4175.2255.06819.2131.0018.22CATOM469OLEUA4174.6813.97118.9801.0025.72OATOM470CELEUA4174.4265.70521.5321.0015.65CATOM471CGLEUA4174.8226.02922.9741.0021.90CATOM472CD1LEUA4173.6046.41323.7491.0020.59CATOM473CD2LEUA4175.4814.79623.6091.0017.97CATOM474NLEUA4275.5425.91618.2381.0012.45NATOM475CALEUA4275.2565.60716.8311.0015.99CATOM476CLEUA4276.2904.68016.2801.0026.18CATOM477OLEUA4276.0664.03915.2571.0022.41OATOM478CELEUA4275.2826.87315.9841.0017.85CATOM479CGLEUA4274.1807.85416.3991.0030.70CATOM480CD1LEUA4274.3189.18415.7041.0024.31CATOM481CD2LEUA4272.7647.24116.2081.0031.13CATOM482NASPA4377.4624.70516.9111.0026.87NATOM483CAASPA4378.5793.87516.4861.0019.29CATOM484CASPA4378.5832.51917.1631.0013.33CATOM485OASPA4379.0512.34818.2971.0018.75OATOM486CBASPA4379.8704.58016.7761.0031.06CATOM487CGASPA4381.0833.75816.3801.0030.68CATOM488OD1ASPA4380.9712.55116.0821.0032.36OATOM489OD2ASPA4382.1874.30816.4991.0037.83OATOM490NSERA4478.1391.54416.3771.0016.89NATOM491CASERA4477.9780.17316.7891.0017.67CATOM492CSERA4479.237−0.46317.3921.0020.40CATOM493OSERA4479.206−1.12618.4441.0026.27OATOM494CESERA4477.504−0.61715.5811.0013.85CATOM495OGSERA4476.800−1.74016.0631.0043.83OATOM496NARGA4580.335−0.30116.6821.0015.63NATOM497CAARGA4581.616−0.78817.1541.0019.94CATOM498CARGA4581.910−0.22518.5211.0029.48CATOM499OARGA4582.244−0.93719.4571.0027.65OATOM500CBARGA4582.684−0.26116.2031.0027.46CATOM501CGARGA4583.463−1.33815.4951.0092.03CATOM502CDARGA4584.854−1.41816.0771.00100.00CATOM503NEARGA4585.636−2.53315.5271.00100.00NATOM504CZARGA4586.092−3.57016.2361.00100.00CATOM505NH1ARGA4585.791−3.69517.5471.00100.00NATOM506NH2ARGA4586.773−4.54415.6421.00100.00NATOM507NALAA4681.7721.09018.6291.0031.04NATOM508CAALAA4682.0451.74319.8811.0024.72CATOM509CALAA4681.1111.17620.8991.0017.73CATOM510OALAA4681.5120.82522.0271.0022.73OATOM511CBALAA4681.8393.22119.7511.0027.16CATOM512NVALA4779.8351.11920.5311.0017.54NATOM513CAVALA4778.8780.60821.5081.0021.41CATOM514CVALA4779.262−0.81221.9141.0030.25CATOM515CVALA4779.192−1.20223.0971.0015.85OATOM516CBVALA4777.4700.66820.9891.0018.59CATOM517CG1VALA4776.5030.04222.0121.0016.88CATOM518CG2VALA4777.1152.09620.7561.0016.28CATOM519NHISA4879.692−1.58520.9201.0021.00NATOM520CAHISA4880.028−2.96921.1921.0020.17CATOM521CHISA4881.268−3.07922.1171.0032.98CATOM522OHISA4881.289−3.85023.1021.0028.20OATOM523CBHISA4880.063−3.80119.8551.0014.93CATOM524CGHISA4878.686−4.17219.3381.0026.67CATOM525ND1HISA4878.085−5.39419.6001.0028.83NATOM526CD2HISA4877.758−3.44818.6591.0025.56CATOM527CE1HISA4876.887−5.43019.0431.0020.08CATOM528NE2HISA4876.660−4.26018.4751.0025.22NATOM529NASPA4982.217−2.17021.9021.0022.62NATOM530CAASPA4983.455−2.16922.6741.0024.23CATOM531CASPA4983.171−1.89924.1221.0038.72CATOM532OASPA4983.708−2.55125.0271.0035.44OATOM533CBASPA4984.396−1.11222.1271.0030.29CATOM534CGASPA4984.991−1.50320.7751.0052.45CATOM535OD1ASPA4985.007−2.72620.4491.0042.67OATOM536OD2ASPA4985.416−0.58720.0291.0073.76OATOM537NPHEA5082.294−0.92924.3241.0032.19NATOM538CAPHEA5081.902−0.55025.6491.0029.76CATOM539CPHEA5081.299−1.76526.3591.0030.31CATOM540OPHEA5081.715−2.12427.4491.0029.22OATOM541CBPHEA5080.8920.61025.5761.0023.82CATOM542CGPHEA5080.1370.84326.8591.0019.13CATOM543CD1PHEA5080.7401.51527.9311.0020.14CATOM544CD2PHEA5078.8350.36027.0181.0013.99CATOM545CE1PHEA5080.0341.74229.1291.0025.81CATOM546CE2PHEA5078.1140.55328.2121.0022.84CATOM547CZPHEA5078.6981.27629.2591.0023.40CATOM548NPHEA5180.280−2.36725.7681.0021.75NATOM549CAPHEA5179.655−3.45126.4571.0022.61CATOM550CPHEA5180.646−4.60326.6121.0034.01CATOM551OPHEA5180.550−5.40127.5901.0025.28OATOM552CBPHEA5178.389−3.89825.7511.0022.63CATOM553CGPHEA5177.158−3.14026.1701.0027.58CATOM554CD1PHEA5176.426−3.52527.2801.0021.78CATOM555CD2PHEA5176.663−2.10025.3801.0019.55CATOM556CE1PHEA5175.267−2.79627.6621.0028.34CATOM557CE2PHEA5175.492−1.40325.7341.0014.47CATOM558CZPHEA5174.797−1.74426.8781.0014.55CATOM559NALAA5281.576−4.70625.6591.0026.43NATOM560CAALAA5282.587−5.79325.7141.0029.44CATOM561CALAA5283.687−5.56026.7681.0043.76CATOM562OALAA5284.502−6.44627.0221.0040.33OATOM563CBALAA5283.228−6.04924.3441.0024.25CATOM564NSERA5383.702−4.38227.3851.0031.96NATOM565CASERA5384.705−4.09028.3771.0021.06CATOM566CSERA5384.196−3.62529.7091.0026.41CATOM567OSERA5384.985−3.49230.6111.0036.12OATOM568CBSERA5385.709−3.08827.8431.0014.22CATOM569OGSERA5385.140−1.80727.7901.0056.90OATOM570NGLUA5482.892−3.43129.8741.0022.38NATOM571CAGLUA5482.380−2.89331.1391.0017.27CATOM572CGLUA5481.584−3.73532.1181.0026.32CATOM573OGLUA5481.229−3.28133.1911.0037.43OATOM574CBGLUA5481.677−1.56330.9061.0027.30CATOM575CGGLUA5482.573−0.54330.2621.0044.77CATOM576CDGLUA5483.669−0.14231.1941.0086.31CATOM577OE1GLUA5483.392−0.23232.4281.0050.11OATOM578OE2GLUA5484.7850.19830.6921.0050.99OATOM579NARGA5581.268−4.97131.8041.0029.63NATOM580CAARGA5580.636−5.74832.8541.0033.32CATOM581CARGA5579.347−5.14933.3781.0038.45CATOM582OARGA5579.214−4.89734.5761.0040.18OATOM583CBARGA5581.621−5.87534.0451.0057.61CATOM584CGARGA5582.666−7.02833.9601.00100.00CATOM585CDARGA5582.805−7.80535.3051.00100.00CATOM586NEARGA5582.838−9.27035.1461.00100.00NATOM587CZARGA5583.206−10.12936.1021.00100.00CATOM588NH1ARGA5583.583−9.68137.3011.00100.00NATOM589NH2ARGA5583.208−11.44035.8551.00100.00NATOM590NILEA5678.367−5.02932.4911.0042.25NATOM591CAILEA5677.064−4.43432.7941.0025.49CATOM592CILEA5675.982−5.47433.2441.0020.18CATOM593OILEA5675.897−6.57932.7041.0024.74OATOM594CBILEA5676.672−3.51231.5311.0026.89CATOM595CG1ILEA5677.643−2.30131.4421.0018.30CATOM596CG2ILEA5675.214−3.01631.5491.0019.84CATOM597CD1ILEA5677.998−1.93630.0261.0060.42CATOM598NASPA5775.166−5.13334.2371.0016.84NATOM599CAASPA5774.040−5.99934.6301.0016.33CATOM600CASPA5772.676−5.45134.1231.0028.40CATOM601OASPA5771.836−6.19833.6571.0025.50OATOM602CBASPA5774.009−6.19436.1641.0016.94CATOM603CGASPA5775.369−6.72036.7031.0034.27CATOM604OD1ASPA5775.875−7.72936.1411.0031.76OATOM605OD2ASPA5776.040−6.00737.4991.0028.36OATOM606NGLNA5872.443−4.15234.2201.0028.91NATOM607CAGLNA5871.183−3.59033.7551.0025.68CATOM608CGLNA5871.425−2.36432.8811.0023.21CATOM609OGLNA5872.403−1.62033.0671.0018.16OATOM610CBGLNA5870.342−3.15134.9461.0033.14CATOM611CGGLNA5869.798−4.24135.8071.0030.00CATOM612CDGLNA5869.226−3.71237.1051.0027.18CATOM613OE1GLNA5868.722−2.60137.1611.0031.20OATOM614NE2GLNA5869.455−4.43638.1861.0016.89NATOM615NVALA5970.496−2.13831.9611.0018.35NATOM616CAVALA5970.562−0.99831.0451.0015.59CATOM617CVALA5969.238−0.24031.0391.0026.28CATOM618OVALA5968.178−0.82030.7621.0019.51OATOM619CBVALA5970.707‘1.45629.6011.0015.32CATOM620CG1VALA5970.477−0.27428.6491.0011.93CATOM621CG2VALA5972.080−2.11129.3641.0015.83CATOM622NTYRA6069.3061.06431.2931.0021.71NATOM623CATYRA6068.1131.92731.1971.0021.40CATOM624CTYRA6068.2892.75629.9281.0018.69CATOM625OTYRA6069.2503.53229.7961.0015.51OATOM626CBTYRA6068.0212.81732.4131.0017.24CATOM627CGTYRA6067.4932.13133.6581.0019.71CATOM628CD1TYRA6068.3451.58334.5861.0021.14CATOM629CD2TYRA6066.1542.22333.9911.0020.16CATOM630CE1TYRA6067.8351.08035.7941.0019.11CATOM631CE2TYRA6065.6481.69835.1631.0010.77CATOM632CZTYRA6066.4761.09436.0541.0020.07CATOM633OHTYRA6065.9210.58537.2481.0016.04OATOM634NLEUA6167.4912.45228.9161.0017.46NATOM635CALEUA6167.6853.05327.5851.0020.17CATOM636CLEUA6167.0034.41227.4091.0023.36CATOM637OLEUA6165.9254.52626.7991.0014.86OATOM638CBLEUA6167.2672.06026.4851.0014.78CATOM639CGLEUA6168.1172.14225.2081.0015.52CATOM640CD1LEUA6167.8151.01024.1091.007.75CATOM641CD2LEUA6168.0873.54124.5801.0015.20CATOM642NALAA6267.6565.43427.9561.0020.35NATOM643CAALAA6267.1206.78427.9631.0018.55CATOM644CALAA6267.7797.73926.9491.0018.57CATOM645OALAA6267.4558.92426.9201.0024.31OATOM646CBALAA6267.0717.37729.4391.0011.69CATOM647NALAA6368.6817.23126.1011.0014.09NATOM648CAALAA6369.2498.09525.0521.0012.84CATOM649CALAA6368.3108.00523.8771.0027.00CATOM650OALAA6367.8456.91623.5111.0024.51OATOM651CBALAA6370.6657.66024.6341.004.89CATOM652NALAA6468.0769.14823.2621.0021.05NATOM653CAALAA6467.2029.28622.0861.0013.50CATOM654CALAA6467.43510.66421.4161.0028.08CATOM655OALAA6467.98711.60022.0211.0026.63OATOM656CBALAA6465.6429.17122.5181.007.63CATOM657NLYSA6566.95310.78120.1821.0023.98NATOM658CALYSA6566.96612.01219.4091.0020.47CATOM659CLYSA6565.48812.44319.5511.0024.37CATOM660OLYSA6564.59411.80718.9761.0020.29OATOM661CELYSA6567.31711.65817.9511.0025.59CATOM662CGLYSA6566.80812.63016.9231.0027.54CATOM663CDLYSA6567.51813.92617.1691.0021.08CATOM664CELYSA6567.31614.90516.0291.0055.15CATOM665NZLYSA6567.87616.26316.3921.0081.63NATOM666NVALA6665.22813.36220.4851.0022.47NATOM667CAVALA6663.87313.85020.7551.0018.99CATOM668CVALA6663.71115.34320.3941.0031.44CATOM669OVALA6664.66516.10720.4601.0034.61OATOM670CEVALA6663.44013.62322.2041.0016.66CATOM671CG1VALA6664.26912.62322.8691.0015.01CATOM672CG2VALA6663.37914.90422.9501.0019.21CATOM673NGLYA6762.51415.75519.9941.0018.03NATOM674CAGLYA6762.29817.14919.6141.0014.90CATOM675CGLYA6760.79217.51819.5851.0032.35CATOM676OGLYA6759.92216.66619.8881.0018.88OATOM677NGLYA6860.50318.78719.2561.0023.21NATOM678CAGLYA6859.13219.28819.1831.0023.83CATOM679CGLYA6858.54019.13717.7711.0019.31CATOM680OGLYA6859.16518.55016.8701.0030.64OATOM681NILEA6957.34319.68417.5881.0015.20NATOM682CAILEA6956.59519.63216.3171.0016.80CATOM683CILEA6957.38720.15315.1121.0019.33CATOM684OILEA6957.42519.51914.0611.0014.66OATOM685CEILEA6955.25720.43216.4801.0030.11CATOM686CG1ILEA6954.27119.68317.3851.0024.27CATOM687CG2ILEA6954.61020.74915.1811.0047.53CATOM688CD1ILEA6953.25920.60818.0561.0085.71CATOM689NVALA7058.01021.32715.2691.0023.03NATOM690CAVALA7058.79721.91314.1831.0019.34CATOM691CVALA7059.98321.01113.8401.0024.42CATOM692OVALA7060.33520.82912.6621.0024.14OATOM693CBVALA7059.30423.40414.4671.0021.37CATOM694CG1VALA7060.13723.90713.2811.0017.79CATOM695CG2VALA7058.13624.41014.6781.0015.74CATOM696NALAA7160.62120.45014.8611.0019.68NATOM697CAALAA7161.78219.61714.5721.0016.57CATOM698CALAA7161.42718.28913.9101.0023.36CATOM699OALAA7161.98017.92312.8491.0021.84OATOM700CBALAA7162.68519.43915.8051.009.36CATOM701NASNA7260.46317.59814.5111.0016.80NATOM702CAASNA7259.99816.35713.9231.0018.84CATOM703CASNA7259.60816.53912.4401.0023.87CATOM704OASNA7259.91915.69611.5931.0021.52OATOM705CBASNA7258.83515.80614.7381.008.60CATOM706CGASNA7259.30915.01315.9111.0023.75CATOM707OD1ASNA7259.55813.80915.8101.0023.98OATOM708ND2ASNA7259.57215.70116.9961.009.96NATOM709NASNA7358.93117.64712.1381.0023.07NATOM710CAASNA7358.52117.97110.7611.0026.05CATOM711CASNA7359.66518.4549.8171.0026.95CATOM712OASNA7359.61318.2768.5691.0022.13OATOM713CBASNA7357.38319.00110.8001.0014.86CATOM714CGASNA7356.01518.34910.9871.0019.88CATOM715OD1ASNA7355.62017.46810.2171.0027.02OATOM716ND2ASNA7355.32218.73212.0511.0020.78NATOM717NTHRA7460.71019.02910.4191.0018.69NATOM718CATHRA7461.84519.5409.6571.0010.07CATOM719CTHRA7462.96818.5489.3751.0021.00CATOM720OTHRA7463.53718.5618.2891.0011.75OATOM721CBTHRA7462.41120.74610.3061.0029.10CATOM722OG1THRA7461.37021.71410.4571.0023.24OATOM723CG2THRA7463.54121.2999.4521.0021.63CATOM724NTYRA7563.23017.63610.3101.0017.10NATOM725CATYRA7564.26716.62010.1121.009.07CATOM726CTYRA7563.73315.20310.3181.006.17CATOM727OTYRA7564.14314.54211.2671.0015.58OATOM728CBTYRA7565.30216.82511.1881.0011.89CATOM729CGTYRA7565.77918.23411.2521.0027.12CATOM730CD1TYRA7566.71218.69610.3211.0028.46CATOM731CD2TYRA7565.23419.15112.1731.0024.83CATOM732CE1TYRA7567.11720.04510.3051.0028.34CATOM733CE2TYRA7565.65220.52312.1801.0021.00CATOM734CZTYRA7566.59320.94011.2341.0045.42CATOM735OHTYRA7567.06622.23011.2151.0035.37OATOM736NPROA7662.75914.7759.5321.0013.30NATOM737CAPROA7662.18513.4389.7421.0014.64CATOM738CPROA7663.20912.2649.6181.0014.40CATOM739OPROA7663.15711.33510.4091.0020.54OATOM740CBPROA7661.05513.3668.7091.007.83CATOM741CGPROA7661.44714.3887.6171.0012.61CATOM742CDPROA7662.06815.5048.4551.0011.18CATOM743NALAA7764.16312.3398.6811.0015.25NATOM744CAALAA7765.20611.3128.5381.006.79CATOM745CALAA7766.05311.1669.8201.0017.22CATOM746OALAA7766.30610.06910.2921.0018.74OATOM747CEALAA7766.09711.6017.3301.009.04CATOM748NASPA7866.46612.26710.4241.0010.92NATOM749CAASPA7867.25612.19111.6591.0011.87CATOM750CASPA7866.57211.48612.8271.0016.09CATOM751OASPA7867.21210.74113.6011.0018.07OATOM752CBASPA7867.57813.60912.0881.0019.16CATOM753CGASPA7868.42414.32511.0681.0026.82CATOM754OD1ASPA7868.83613.69410.0441.0033.93OATOM755OD2ASPA7868.67315.51411.3161.0032.06OATOM756NPHEA7965.27911.77112.9751.0014.70NATOM757CAPHEA7964.47111.19214.0441.0020.69CATOM758CPHEA7964.2249.70713.8761.0020.22CATOM759OPHEA7964.2698.98714.8621.0022.37OATOM760CBPHEA7963.14411.93314.2191.0027.38CATOM761CGPHEA7963.26413.21814.9901.0028.59CATOM762CD1PHEA7963.13713.23016.3861.0027.49CATOM763CD2PHEA7963.50914.41514.3251.0028.20CATOM764CE1PHEA7963.28114.41317.1091.0021.76CATOM765CE2PHEA7963.62515.59315.0371.0031.48CATOM766CZPHEA7963.50915.58216.4391.0026.31CATOM767NILEA8063.9429.24912.6501.0010.79NATOM768CAILEA8063.8287.79512.4101.0018.12CATOM769CILEA8065.1977.05212.4321.0010.97CATOM770OILEA8065.4066.09013.1951.008.92OATOM771CBILEA8062.9447.40811.1481.0017.41CATOM772CG1ILEA8062.6515.88611.1051.0010.16CATOM773CG2ILEA8063.5837.8889.9011.0017.46CATOM774CD1ILEA8061.7225.4109.9801.007.30CATOM775NTYRA8166.1517.53911.6581.0011.18NATOM776CATYRA8167.4886.90211.6301.0015.06CATOM777CTYRA8168.2376.78212.9591.0016.83CATOM778OTYRA8168.7145.70213.3831.0016.74OATOM779CBTYRA8168.3847.59910.6161.009.43CATOM780CGTYRA8169.7496.96610.5411.0022.54CATOM781CD1TYRA8169.9635.8249.7471.0022.37CATOM782CD2TYRA8170.8187.46611.2991.0018.07CATOM783CE1TYRA8171.2025.1639.7461.0015.02CATOM784CE2TYRA8172.0806.89311.2011.0017.37CATOM785CZTYRA8172.2555.69810.4721.0024.27CATOM786OHTYRA8173.4915.06310.4091.0019.57OATOM787NGLNA8268.3857.91813.6121.0011.39NATOM788CAGLNA8269.1937.93014.8101.0012.23CATOM789CGGLNA8268.5447.08915.8341.0014.18CATOM790OGLNA8269.1806.41516.6311.0011.35OATOM791CBGLNA8269.2809.35415.2911.0018.73CATOM792CGGLNA8269.98610.20914.2501.0013.54CATOM793CDGLNA8270.28511.61714.7361.0026.00CATOM794OE1GLNA8270.41011.85015.9271.0022.99OATOM795NE2GLNA8270.40412.56113.8081.0016.59NATOM796NASNA8367.2357.18115.8691.0011.35NATOM797CAASNA8366.5496.40816.8601.0013.71CATOM798CASNA8366.6234.90216.5571.0021.43CATOM799OASNA8366.8314.10117.4631.0012.10OATOM800CBASNA8365.1326.94517.0741.0013.51CATOM801CGASNA8365.1318.24517.8711.0028.91CATOM802OD1ASNA8365.6288.26318.9901.0022.28OATOM803ND2ASNA8364.7569.35417.2371.0020.17NATOM804NMETA8466.5924.51715.2901.0015.63NATOM805CAMETA8466.7043.10115.0071.0015.66CATOM806CMETA8468.0542.58815.3481.0014.66CATOM807OMETA8468.1481.51415.9021.0011.45OATOM808CBMETA8466.4182.81513.5631.0017.59CATOM809CGMETA8464.9112.89413.2201.0014.40CATOM810SDMETA8464.6382.81111.3871.0015.99SATOM811CEMETA8465.1641.10510.9521.008.90CATOM812NMETA8569.0983.33815.0241.0011.20NATOM813CAMETA8570.4682.87915.3211.0011.67CATOM814CMETA8570.7792.83116.7741.0013.04CATOM815OMETA8571.3591.89317.2651.0015.26OATOM816CBMETA8571.5253.79814.6931.0015.07CATOM817CGMETA8571.5303.72613.1731.0032.01CATOM818SDMETA8571.9182.02712.4871.0037.79SATOM819CEMETA8573.3791.80113.3201.0015.94CATOM820NILEA8670.4713.89217.4811.0013.92NATOM821CAILEA8670.7603.89318.9121.0012.58CATOM822CILEA8670.1592.66219.5911.0021.61CATOM823OILEA8670.8131.98120.3621.0018.68OATOM824CBILEA8670.2255.18919.6061.0011.84CATOM825CG1ILEA8670.9786.42919.1191.0019.78CATOM826CG2ILEA8670.4355.13221.1121.006.59CATOM827CD1ILEA8670.5057.69419.7721.0020.37CATOM828NGLUA8768.8932.38319.3161.0018.78NATOM829CAGLUA8768.2631.23719.9301.0014.00CATOM830CGLUA8768.797−0.11619.4541.0015.93CATOM831OGLUA8769.017−0.99120.2681.0011.04OATOM832CBGLUA8766.7341.32419.9001.0014.89CATOM833CGGLUA8766.0851.32718.5381.0028.96CATOM834CDGLUA8764.6351.92218.5441.0011.12CATOM835OE1GLUA8764.3072.80119.3761.0025.46OATOM836OE2GLUA8763.8451.54717.6631.0029.87OATOM837NSERA8869.054−0.25918.1551.0016.18NATOM838CASERA8869.650−1.48217.5691.0019.52CATOM839CSERA8871.029−1.79218.1601.0022.54CATOM840OSERA8871.313−2.92918.5921.0013.80OATOM841CBSERA8869.815−1.32616.0231.0014.61CATOM842OGSERA8868.551−1.20115.3551.0015.41OATOM843NASNA8971.884−0.77318.1431.0022.63NATOM844CAASNA8973.227−0.86918.6931.0027.23CATOM845CASNA8973.195−1.36320.1341.0021.34CATOM846OASNA8973.795−2.38420.4761.0023.68OATOM847CBASNA8973.9800.48718.5971.0013.71CATOM848CGASNA8974.4400.82517.1681.0020.40CATOM849OD1ASNA8974.305−0.00616.2551.0014.93OATOM850ND2ASNA8974.9372.06716.9601.0013.32NATOM851NILEA9072.488−0.64620.9791.0016.55NATOM852CAILEA9072.437−1.01422.3981.0021.51CATOM853CILEA9071.876−2.42122.7291.0026.50CATOM854OILEA9072.384−3.15923.5901.0019.71OATOM855CBILEA9071.6700.07023.2331.0013.32CATOM856CG1ILEA9072.5391.29923.4011.0011.05CATOM857CG2ILEA9071.371−0.44524.6371.007.54CATOM858CD1ILEA9071.7492.59723.6681.0020.71CATOM859NILEA9170.755−2.73322.1141.0014.98NATOM860CAILEA9170.047−3.95322.4421.0021.33CATOM861CILEA9170.927−5.09821.9941.0026.27CATOM862OILEA9171.211−6.01122.7511.0026.56OATOM863CBILEA9168.556−3.93021.8141.0020.39CATOM864CG1ILEA9167.692−2.88622.5521.0013.51CATOM865CG2ILEA9167.841−5.31621.8451.0011.31CATOM866CD1ILEA9166.320−2.64821.9071.0016.23CATOM867NHISA9271.446−4.98320.7851.0024.12NATOM868CAHISA9272.293−6.01520.2431.0026.71CATOM869CHISA9273.609−6.25121.0711.0029.30CATOM870OHISA9273.983−7.36621.4431.0018.58OATOM871CBHISA9272.561−5.68218.7751.0022.23CATOM872CGHISA9273.366−6.72018.0771.0026.32CATOM873ND1HISA9272.798−7.71117.3071.0027.19NATOM874CD2HISA9274.699−6.97818.1061.0021.95CATOM875CE1HISA9273.755−8.48716.8261.0023.66CATOM876NE2HISA9274.918−8.06217.2961.0017.36NATOM877NALAA9374.328−5.18721.3331.0015.66NATOM878CAALAA9375.530−5.30122.1101.0011.88CATOM879CALAA9375.222−5.90023.5121.0028.78CATOM880OALAA9375.912−6.79024.0371.0025.23OATOM881CBALAA9376.139−3.95922.2211.006.30CATOM882NALAA9474.142−5.44224.1131.0018.82NATOM883CAALAA9473.777−5.97125.3991.0015.61CATOM884CALAA9473.593−7.50325.3011.0028.39CATOM885OALAA9474.133−8.26326.0991.0021.67OATOM886CBALAA9472.449−5.27925.9111.0018.46CATOM887NHISA9572.814−7.96624.3291.0026.35NATOM888CAHISA9572.551−9.39624.2711.0024.89CATOM889CHISA9573.845−10.17624.1401.0022.81CATOM890OHISA9574.077−11.13624.8651.0021.44OATOM891CBHISA9571.571−9.77823.1291.0022.39CATOM892CGHISA9571.554−11.25022.8311.0028.73CATOM893ND1HISA9570.979−12.18223.6821.0022.83NATOM894CD2HISA9572.159−11.96421.8451.0025.22CATOM895CE1HISA9571.171−13.39723.1961.0022.72CATOM896NE2HISA9571.911−13.29622.1011.0024.80NATOM897NGLNA9674.709−9.65823.2811.0019.97NATOM898CAGLNA9675.960−10.29922.9171.0022.27CATOM899CGLNA9676.877−10.35324.0861.0026.58CATOM900OGLNA9677.836−11.09324.0881.0024.17OATOM901CSGLNA9676.642−9.49221.8181.0023.38CATOM902CGGLNA9677.043−10.29920.5961.0061.06CATOM903CDGLNA9678.033−9.55719.6751.0075.83CATOM904OE1GLNA9678.999−8.94120.1311.0056.89OATOM905NE2GLNA9677.815−9.66818.3661.00100.00NATOM906NASNA9776.652−9.50025.0601.0022.15NATOM907CAASNA9777.537−9.53626.2081.0014.74CATOM908CASNA9776.732−10.02227.3871.0029.78CATOM909OASNA9777.049−9.76228.5641.0027.09OATOM910CBASNA9778.241−8.20126.4621.0012.93CATOM911CGASNA9779.260−7.89725.4071.0024.91CATOM912OD1ASNA9780.331−8.51825.3751.0057.17OATOM913ND2ASNA9778.839−7.13524.3921.0034.88NATOM914NASPA9875.666−10.73227.0551.0027.98NATOM915CAASPA9874.907−11.36128.0891.0029.25CATOM916CASPA9874.400−10.37929.1641.0037.53CATOM917OASPA9874.505−10.63430.3671.0036.42OATOM918CBASPA9875.791−12.45028.7001.0036.37CATOM919CGASPA9875.016−13.71229.0531.0088.62CATOM920OD1ASPA9873.775−13.74928.8771.0082.53OATOM921OD2ASPA9875.656−14.67029.5421.00100.00OATOM922NVALA9973.879−9.23528.7301.0027.13NATOM923CAVALA9973.157−8.35129.6351.0021.57CATOM924CVALA9971.706−8.86829.5301.0016.15CATOM925OVALA9971.159−9.08828.4221.0019.47OATOM926CBVALA9973.264−6.90029.2061.0024.18CATOM927CG1VALA9972.517−6.01530.1981.0014.58CATOM928CG2VALA9974.720−6.51529.2251.0030.10CATOM929NASNA10071.149−9.26230.6621.0017.39NATOM930CAASNA10069.852−9.92530.6131.0025.77CATOM931CASNA10068.648−9.03430.9101.0024.95CATOM932OASNA10067.498−9.37730.5821.0020.88OATOM933CBASNA10069.846−11.15731.5271.0014.98CATOM934CGASNA10068.724−12.11231.1801.0020.38CATOM935OD1ASNA10068.737−12.70930.1001.0029.59OATOM936ND2ASNA10067.716−12.24032.0761.0016.35NATOM937NLYSA10168.941−7.92331.5841.0017.91NATOM938CALYSA10167.970−6.91631.9941.0025.43CATOM939CLYSA10168.107−5.51031.3231.0025.29CATOM940OLYSA10169.151−4.85031.3771.0019.88OATOM941CBLYSA10167.996−6.80733.5211.0029.28CATOM942CGLYSA10167.464−8.05434.2051.009.31CATOM943CDLYSA10167.218−7.71935.6681.0038.93CATOM944CELYSA10166.206−6.56935.8851.0013.38CATOM945NZLYSA10164.750−7.00635.8251.0015.26NATOM946NLEUA10267.013−5.04330.7321.0022.22NATOM947CALEUA10267.003−3.74430.0921.0015.40CATOM948CLEUA10265.612−3.11530.1561.0018.55CATOM949OLEUA10264.590−3.81130.1021.0018.92OATOM950CBLEUA10267.465−3.89828.6361.0011.23CATOM951CGLEUA10267.553−2.71127.6511.0015.51CATOM952CD1LEUA10268.628−2.98526.5591.009.65CATOM953CD2LEUA10266.162−2.40726.9951.0013.10CATOM954NLEUA10365.595−1.79830.3181.0017.05NATOM955CALEUA10364.356−1.03630.2651.0016.23CATOM956CLEUA10364.346−0.07229.0461.0019.65CATOM957OLEUA10365.2150.78928.8751.0019.68OATOM958CBLEUA10364.099−0.28931.5621.0012.28CATOM959CGLEUA10362.6860.25931.5941.0014.13CATOM960CD1LEUA10361.645−0.82231.9021.0010.31CATOM961CD2LEUA10362.6461.36032.6011.0012.30CATOM962NPHEA10463.417−0.33328.1401.0016.41NATOM963CAPHEA10463.2150.48626.9561.0018.32CATOM964CPHEA10462.1261.54627.2491.0021.85CATOM965OPHEA10461.1681.27127.9921.0018.36OATOM966CEPHEA10462.796−0.38625.7931.009.86CATOM967CGPHEA10462.7320.34824.5081.0016.81CATOM968CD1PHEA10463.8940.71423.8401.0025.04CATOM969CD2PHEA10461.5110.79524.0051.0022.59CATOM970CE1PHEA10463.8361.44822.6191.0031.26CATOM971CE2PHEA10461.4491.53522.8141.0015.59CATOM972CZPHEA10462.6251.89522.1391.0011.67CATOM973NLEUA10562.3412.76226.7341.0020.33NATOM974CALEUA10561.4163.89726.9041.0018.10CATOM975CLEUA10560.7114.23725.6341.0017.04CATOM976OLEUA10561.3154.68024.6651.0018.83OATOM977CBLEUA10562.1785.14627.2141.0017.49CATOM978CGLEUA10562.4345.54428.6441.0027.17CATOM979CD1LEUA10562.6304.34929.5741.0019.16CATOM980CD2LEUA10563.6886.34728.5291.0023.59CATOM981NGLYA10659.4074.15325.6521.0020.66NATOM982CAGLYA10658.6794.53624.4551.0021.03CATOM983CGLYA10658.0805.93524.5971.0017.32CATOM984OGLYA10658.6906.85825.1131.0026.89OATOM985NSERA10756.8316.04724.2191.0022.05NATOM986CASERA10756.1777.31724.2881.0022.12CATOM987CSERA10754.6867.21223.9231.0019.06CATOM988OSERA10754.3146.54522.9631.0027.42OATOM989CBSERA10756.8828.23223.3001.0020.99CATOM990OGSERA10755.9479.13322.7761.0042.85OATOM991NSERA10853.8267.89024.6711.0027.42NATOM992CASERA10852.3827.94724.3391.0026.43CATOM993CSERA10852.1448.25922.8421.0030.97CATOM994OSERA10851.2427.70922.2171.0033.46OATOM995CBSERA10851.7109.07225.1441.0019.87CATOM996OGSERA10852.49510.26625.0711.0070.88OATOM997NCYSA10952.9279.18022.2781.0024.73NATOM998CACYSA10952.7289.54920.8801.0025.61CATOM999CCYSA10952.9708.48219.8151.0021.29CATOM1000OCYSA10952.9678.73718.6231.0031.31OATOM1001CBCYSA10953.36910.89920.5441.0039.55CATOM1002SGCYSA10955.15311.07720.8471.0049.24SATOM1003NILEA11053.1017.26420.2581.0018.31NATOM1004CAILEA11053.3296.15019.3791.0028.10CATOM1005CILEA11051.9775.48919.0821.0015.38CATOM1006OILEA11051.8954.59218.2681.0016.52OATOM1007CBILEA11054.1545.15320.2061.0040.45CATOM1008CG1ILEA11055.6045.51020.1361.0039.02CATOM1009CG2ILEA11053.8793.71519.8751.0061.33CATOM1010CD1ILEA11056.4294.33820.5491.0082.74CATOM1011NTYRA11150.9515.84219.8541.0014.91NATOM1012CATYRA11149.6305.22719.6781.0013.96CATOM1013CTYRA11148.9565.83118.4591.0020.40CATOM1014OTYRA11149.3026.93318.0561.0011.71OATOM1015CBTYRA11148.7635.46820.9211.009.63CATOM1016CGTYRA11149.1174.55022.0651.0014.94CATOM1017CD1TYRA11148.9853.15921.9381.009.73CATOM1018CD2TYRA11149.7555.03823.2161.0014.96CATOM1019CE1TYRA11149.3442.27323.0141.006.53CATOM1020CE2TYRA11150.1464.15524.2721.0013.66CATOM1021CZTYRA11149.8732.78724.1711.0017.86CATOM1022OHTYRA11150.2661.92725.1571.0011.37OATOM1023NPROA11247.9745.14517.8721.0022.56NATOM1024CAPROA11247.2795.74316.7211.0023.44CATOM1025CPROA11246.5897.11116.9881.0017.82CATOM1026OPROA11246.1977.45318.1151.0019.72OATOM1027CBPROA11246.2904.64416.2521.0015.69CATOM1028CGPROA11246.8953.34316.7691.0022.83CATOM1029CDPROA11247.5933.73318.0861.0016.10CATOM1030NLYSA11346.4187.86615.9151.0019.48NATOM1031CALYSA11345.7939.16715.9941.0023.50CATOM1032CLYSA11344.3969.07716.6551.0034.28CATOM1033OLYSA11344.0469.88717.5241.0046.14OATOM1034CBLYSA11345.6759.73514.5931.0030.04CATOM1035CGLYSA11346.21911.12414.4771.0043.78CATOM1036CDLYSA11345.38111.94113.5151.00100.00CATOM1037CELYSA11344.36112.83614.2501.00100.00CATOM1038NZLYSA11343.48013.62513.3041.00100.00NATOM1039NLEUA11443.5918.10316.2501.0026.33NATOM1040CALEUA11442.2677.95716.8331.0020.65CATOM1041CLEUA11442.0836.79217.7601.0018.44CATOM1042OLEUA11441.0026.27817.9181.0034.04OATOM1043CBLEUA11441.1948.00215.7801.0024.37CATOM1044CGLEUA11441.5879.12214.8301.0040.86CATOM1045CD1LEUA11440.9918.79713.5041.0049.29CATOM1046CD2LEUA11441.13910.51215.3001.0026.85CATOM1047NALAA11543.1036.47318.5271.0029.00NATOM1048CAALAA11542.9205.44619.5281.0025.66CATOM1049CALAA11541.7225.72720.4541.0028.76CATOM1050OALAA11541.3646.85520.6821.0024.12OATOM1051CEALAA11544.1775.27220.3261.0016.86CATOM1052NLYSA11641.1374.67520.9981.0030.21NATOM1053CALYSA11640.0364.79221.9281.0025.85CATOM1054CLYSA11640.6685.24823.1951.0014.18CATOM1055OLYSA11641.7504.78123.5351.0023.51OATOM1056CELYSA11639.3693.41522.1161.0022.05CATOM1057CGLYSA11639.0533.03223.5241.0055.38CATOM1058CDLYSA11637.9631.95523.5491.00100.00CATOM1059CELYSA11637.1201.95324.8351.00100.00CATOM1060NZLYSA11635.7671.31024.6301.00100.00NATOM1061NGLNA11740.0216.20823.8561.0018.23NATOM1062CAGLNA11740.4566.75725.1801.0021.01CATOM1063CGLNA11739.6956.17826.3831.0030.96CATOM1064OGLNA11738.4836.00926.3451.0027.66OATOM1065CBGLNA11740.2158.26325.1791.0011.32CATOM1066CGGLNA11740.8498.91223.9481.0012.12CATOM1067CDGLNA11742.4048.82323.9541.0024.10CATOM1068OE1GLNA11743.0418.62822.8961.0047.88OATOM1069NE2GLNA11743.0018.95325.1311.0014.24NATOM1070NPROA11840.3745.99227.4991.0030.02NATOM1071CAPROA11841.8266.19427.6551.0026.44CATOM1072CPROA11842.4505.05026.8991.0024.37CATOM1073OPROA11841.7924.02726.7261.0025.34OATOM1074CBPROA11842.0555.99429.1671.0023.89CATOM1075CGPROA11840.8475.24029.6541.0023.20CATOM1076CDPROA11839.6955.51928.7091.0015.79CATOM1077NMETA11943.6845.22826.4321.0016.00NATOM1078CAMETA11944.3724.21525.6441.0010.80CATOM1079CMETA11945.0623.08326.4441.0023.61CATOM1080OMETA11946.0133.28127.2091.0018.02OATOM1081CBMETA11945.3844.89424.7911.0013.52CATOM1082CGMETA11944.8016.01423.9891.0018.52CATOM1083SDMETA11946.1577.05423.2711.0026.27SATOM1084CEMETA11946.2646.52421.8451.0033.79CATOM1085NALAA12044.5591.87526.2711.0026.64NATOM1086CAALAA12045.1770.71226.8841.0029.17CATOM1087CALAA12046.3560.30825.9841.0023.21CATOM1088OALAA12046.4390.75924.8331.0020.19OATOM1089CBALAA12044.169−0.41926.9441.0026.02CATOM1090NGLUA12147.238−0.55326.5071.0012.30NATOM1091CAGLUA12148.427−1.00925.7881.009.45CATOM1092CGLUA12148.070−1.69724.4501.0011.68CATOM1093OGLUA12148.828−1.67023.4501.0014.84OATOM1094CBGLUA12149.321−1.88326.7151.0016.74CATOM1095CGGLUA12150.132−1.12227.7631.0018.14CATOM1096CDGLUA12149.458−1.00029.1371.0013.00CATOM1097OE1GLUA12148.252−1.29429.2761.0020.79OATOM1098OE2GLUA12150.123−0.52130.0801.0017.86OATOM1099NSERA12246.887−2.27324.4091.0011.79NATOM1100CASERA12246.427−2.97723.2181.0012.16CATOM1101CSERA12246.030−2.05822.1001.0011.70CATOM1102OSERA12245.717−2.52921.0101.0013.91OATOM1103CBSERA12245.186−3.78123.5681.0021.50CATOM1104OGSERA12244.143−2.90823.9761.0028.52OATOM1105NGLUA12346.041−0.75422.3411.0014.65NATOM1106CAGLUA12345.7830.20221.2431.0017.15CATOM1107CGLUA12346.9590.31320.2401.0011.48CATOM1108OGLUA12346.8210.84419.1411.0011.19OATOM1109CBGLUA12345.4811.60021.8051.0021.66CATOM1110CGGLUA12344.1271.69422.5231.0024.68CATOM1111CDGLUA12342.9841.37421.5851.0035.56CATOM1112OE1GLUA12343.0191.86520.4261.0041.73OATOM1113OE2GLUA12342.1580.49721.9401.00100.00OATOM1114NLEUA12448.134−0.18520.6181.0014.02NATOM1115CALEUA12449.296−0.08219.7401.0015.32CATOM1116CLEUA12449.082−0.75418.4581.0017.76CATOM1117OLEUA12448.752−1.91718.4451.0018.91OATOM1118CBLEUA12450.564−0.68020.3621.0018.07CATOM1119CGLEUA12451.922−0.22219.8031.0021.52CATOM1120CD1LEUA12452.0801.25820.1171.0020.35CATOM1121CD2LEUA12453.042−0.91920.5501.0014.07CATOM1122NLEUA12549.514−0.07117.4091.0018.44NATOM1123CALEUA12549.445−0.56416.0521.0019.92CATOM1124CLEUA12548.034−0.75415.5091.0025.56CATOM1125OLEUA12547.854−1.18814.3641.0018.26OATOM1126CELEUA12550.355−1.80015.8401.0020.79CATOM1127CGLEUA12551.890−1.51115.7781.0017.21CATOM1128CD1LEUA12552.744−2.64916.3161.0019.95CATOM1129CD2LEUA12552.334−1.21914.3381.005.81CATOM1130NGLNA12647.027−0.32716.2761.0021.97NATOM1131CAGLNA12645.652−0.50415.7901.0019.97CATOM1132CGLNA12645.2130.44714.7241.0028.31CATOM1133OGLNA12644.0760.39114.2931.0047.49OATOM1134CEGLNA12644.652−0.40416.9111.0019.87CATOM1135CGGLNA12644.949−1.31218.0481.0018.39CATOM1136CDGLNA12644.319−2.62617.8351.0066.80CATOM1137OE1GLNA12644.064−3.37618.7921.0040.75OATOM1138NE2GLNA12644.015−2.95216.5651.0071.74NATOM1139NGLYA12746.0801.33014.2701.0028.29NATOM1140CAGLYA12745.6272.26013.2521.0023.31CATOM1141CGLYA12746.6623.31512.9531.0022.90CATOM1142OGLYA12747.7553.25413.4741.0025.30OATOM1143NTHRA12846.3114.21912.0461.0019.51NATOM1144CATHRA12847.1495.31411.5881.0022.12CATOM1145CTHRA12847.7056.21912.6951.0022.60CATOM1146OTHRA12847.0616.46113.7311.0018.58OATOM1147CBTHRA12846.3926.18210.5441.0035.98CATOM1148OG1THRA12846.5335.5949.2391.0058.05OATOM1149CG2THRA12846.9427.63910.5421.0043.41CATOM1150NLEUA12948.9076.71512.4251.0018.32NATOM1151CALEUA12949.6747.53413.3561.0016.76CATOM1152CLEUA12949.5048.95912.9671.004.89CATOM1153OLEUA12949.2329.26011.8141.0016.14OATOM1154CBLEUA12951.2057.19113.2611.0017.91CATOM1155CGLEUA12951.7695.80413.7521.0018.21CATOM1156CD1LEUA12953.1325.37913.1931.0012.12CATOM1157CD2LEUA12951.6835.53215.2511.003.89CATOM1158NGLUA13049.8169.82713.9171.0010.23NATOM1159CAGLUA13049.91211.26813.6911.0013.22CATOM1160CGLUA13051.12811.54412.7751.0023.44CATOM1161OGLUA13052.24911.16213.0901.0021.23OATOM1162CBGLUA13050.15011.97915.0351.0018.48CATOM1163CGGLUA13050.75413.37614.8861.0077.44CATOM1164CDGLUA13049.83314.32814.1211.00100.00CATOM1165OR1GLUA13048.58814.20514.3401.0036.19OATOM1166OE2GLUA13050.34715.16113.2951.0021.03OATOM1167NPROA13150.92012.21911.6481.0021.35NATOM1168CAPROA13152.02312.40910.7311.0014.78CATOM1169CPROA13153.20113.13211.2651.0014.98CATOM1170OPROA13154.32512.84710.8531.0020.99OATOM1171CBPROA13151.41313.1549.5521.0014.76CATOM1172CGPROA13150.07113.4859.9491.0020.99CATOM1173CDPROA13149.64112.62611.0471.0017.25CATOM1174NTHRA13252.98614.09512.1591.0018.77NATOM1175CATHRA13254.13114.83812.6891.0016.44CATOM1176CTHRA13255.10213.95113.4081.0021.91CATOM1177OTHRA13256.31714.08813.2341.0024.17OATOM1178CBTHRA13253.71615.90713.6061.0023.45CATOM1179OG1THRA13252.97616.88312.8501.0031.15OATOM1180CG2THRA13254.96916.51914.3411.009.28CATOM1181NASNA13354.55112.97014.1221.0028.59NATOM1182CAASNA13355.35912.00714.8751.0026.38CATOM1183CASNA13355.66610.68214.2071.0014.85CATOM1184OASNA13356.4469.88414.7551.0018.67OATOM1185CEASNA13354.66111.69916.1681.0023.70CATOM1186CGASNA13354.48012.89416.9681.0050.55CATOM1187OD1ASNA13353.35413.27217.2521.0040.07OATOM1188ND2ASNA13355.56813.63817.1631.0040.36NATOM1189NGLUA13455.10010.46913.0221.009.98NATOM1190CAGLUA13455.2379.21012.3651.009.66CATOM1191CGLUA13456.6488.53012.2741.0013.86CATOM1192OGLUA13456.8147.38812.7061.0022.89OATOM1193CBGLUA13454.4489.20011.0701.0017.55CATOM1194CGGLUA13454.7507.93010.2271.0020.89CATOM1195CDGLUA13453.9267.8688.9701.0013.59CATOM1196OE1GLUA13452.6787.7389.0851.0035.28OATOM1197OE2GLUA13454.4978.0487.8691.0013.44OATOM1198NPROA13557.6809.22211.7891.0015.72NATOM1199CAPROA13559.0148.60011.6991.0018.91CATOM1200CPROA13559.5448.17413.0731.0018.68CATOM1201OPROA13560.0727.06913.2711.0015.69OATOM1202CBPROA13559.8969.75511.1691.0013.84CATOM1203CGPROA13559.03610.51410.3501.009.78CATOM1204CDPROA13557.59410.39510.9081.0014.43CATOM1205NTYRA13659.4499.11713.9941.008.64NATOM1206CATYRA13659.8738.91515.3241.0013.27CATOM1207CTYRA13659.0567.72815.9071.0016.84CATOM1208OTYRA13659.5786.90316.6581.0012.90OATOM1209CBTYRA13659.60410.23416.1001.0015.51CATOM1210CGTYRA13659.91210.16817.6141.0018.26CATOM1211CD1TYRA13661.20010.06218.0721.0020.53CATOM1212CD2TYRA13658.90410.15018.5681.0017.38CATOM1213CE1TYRA13661.4849.95919.4401.0030.44CATOM1214CE2TYRA13659.18410.08419.9531.009.85CATOM1215CZTYRA13660.4769.94920.3771.0020.65CATOM1216OHTYRA13660.7929.87321.7341.0024.41OATOM1217NALAA13757.7607.68715.6381.007.19NATOM1218CAALAA13756.9236.63316.2271.0012.68CATOM1219CALAA13757.3455.26515.7371.0015.21CATOM1220OALAA13757.4254.27216.4881.0014.58OATOM1221CBALAA13755.5176.84915.8711.0011.40CATOM1222NILEA13857.5675.21314.4471.008.93NATOM1223CAILEA13857.9543.97113.8311.0011.77CATOM1224CILEA13859.2463.49414.4921.0016.20CATOM1225OILEA13859.3072.37714.9701.0013.79OATOM1226CBILEA13858.0644.17212.3161.0017.85CATOM1227CG1ILEA13856.6804.47311.7571.0028.21CATOM1228CG2ILEA13858.6742.98611.6021.009.81CATOM1229CD1ILEA13855.6953.37611.9701.0018.17CATOM1230NALAA13960.2434.36114.6251.0011.54NATOM1231CAALAA13961.4943.93715.2881.0013.22CATOM1232CALAA13961.2563.36416.6751.0018.73CATOM1233OALAA13961.7912.31817.0311.0020.44OATOM1234CBALAA13962.4345.07315.3901.0013.62CATOM1235NLYSA14060.3974.03317.4481.0016.36NATOM1236CALYSA14060.0833.60018.8151.0015.14CATOM1237CLYSA14059.3922.26218.8241.0015.18CATOM1238OLYSA14059.8241.34619.4751.0021.42OATOM1239CBLYSA14059.1934.60619.5251.0017.86CATOM1240CGLYSA14059.9255.80620.1521.0021.11CATOM1241CDLYSA14061.2085.47820.9581.0016.75CATOM1242CELYSA14061.6646.73521.8351.0010.06CATOM1243NZLYSA14062.6886.49622.9211.0014.40NATOM1244NILEA14158.3562.11618.0271.0011.49NATOM1245CAILEA14157.7030.82817.9771.0017.92CATOM1246CILEA14158.729−0.28217.5771.0013.46CATOM1247OILEA14158.730−1.37418.1481.0013.92OATOM1248CBILEA14156.4970.92517.0191.0022.59CATOM1249CG1ILEA14155.4661.90617.5571.0017.61CATOM1250CG2ILEA14155.863−0.41116.7001.0010.49CATOM1251CD1ILEA14154.5302.32716.4491.0013.43CATOM1252NALAA14259.6370.02816.6501.0010.29NATOM1253CAALAA14260.657−0.93116.2281.007.15CATOM1254CALAA14261.456−1.30117.4561.0016.58CATOM1255OALAA14261.839−2.45417.6211.0013.04OATOM1256CBALAA14261.604−0.28815.1301.004.44CATOM1257NGLYA14361.703−0.30718.3161.009.56NATOM1258CAGLYA14362.448−0.52519.5271.005.15CATOM1259CGLYA14361.770−1.55520.4301.0016.36CATOM1260OGLYA14362.392−2.48220.9671.0014.11OATOM1261NILEA14460.476−1.41820.5641.0020.33NATOM1262CAILEA14459.725−2.31421.4071.0015.35CATOM1263CILEA14459.706−3.73220.8591.0019.84CATOM1264OILEA14459.836−4.70021.6081.0017.93OATOM1265CEILEA14458.317−1.81921.5591.0010.60CATOM1266CG1ILEA14458.311−0.61022.5161.009.80CATOM1267CG2ILEA14457.410−2.92822.1221.009.60CATOM1268CD1ILEA14457.0220.07622.5171.0018.32CATOM1269NLYSA14559.520−3.84119.5561.007.20NATOM1270CALYSA14559.459−5.13918.9261.007.64CATOM1271CLYSA14560.840−5.78818.9311.0015.32CATOM1272OLYSA14560.923−6.98918.9811.0014.76OATOM1273CBLYSA14558.891−5.00117.5161.0011.25CATOM1274CGLYSA14557.414−4.58117.4891.0012.13CATOM1275CDLYSA14556.642−5.43418.4951.0025.23CATOM1276CELYSA14555.189−4.99518.6921.0013.64CATOM1277NZLYSA14554.441−6.11119.3921.0011.94NATOM1278NLEUA14661.934−5.01118.9861.0026.98NATOM1279CALEUA14663.261−5.64219.1671.0019.72CATOM1280CLEUA14663.262−6.31620.5421.0018.20CATOM1281OLEUA14663.590−7.51120.7031.0019.86OATOM1282CELEUA14664.398−4.61819.1501.0013.56CATOM1283CGLEUA14664.895−4.25817.7591.0021.84CATOM1284CD1LEUA14665.672−2.94517.8171.0017.94CATOM1285CD2LEUA14665.745−5.39717.1021.0016.10CATOM1286NCYSA14762.931−5.52321.5481.007.91NATOM1287CACYSA14762.875−6.06422.8931.009.14CATOM1288CCYSA14762.072−7.37822.9451.0022.72CATOM1289OCYSA14762.568−8.40123.3831.0016.90OATOM1290CECYSA14762.232−5.05823.8091.0012.63CATOM1291SGCYSA14763.411−3.82324.3161.0015.02SATOM1292NGLUA14860.823−7.35222.5081.0020.03NATOM1293CAGLUA14860.016−8.55522.5671.0016.09CATOM1294CGLUA14860.685−9.71521.8021.0022.61CATOM1295OGLUA14860.651−10.88822.2261.0012.05OATOM1296CEGLUA14858.597−8.26822.0461.0014.66CATOM1297CGGLUA14857.864−7.18922.8401.0011.45CATOM1298CDGLUA14856.471−6.82122.2771.0011.75CATOM1299OE1GLUA14856.117−7.05521.0801.0011.65OATOM1300OE2GLUA14855.728−6.23123.0811.0022.56OATOM1301NSERA14961.368−9.37720.7151.0015.57NATOM1302CASERA14961.938−10.42819.8871.0010.21CATOM1303CSERA14963.040−11.24520.5021.0015.83CATOM1304OSERA14963.102−12.45820.2911.0012.72OATOM1305CBSERA14962.270−9.93618.4881.009.44CATOM1306OGSERA14961.053−9.65017.7821.0015.91OATOM1307NTYRA15063.910−10.54621.2241.0018.44NATOM1308CATYRA15065.065−11.10021.9481.0020.50CATOM1309CTYRA15064.514−11.84823.1581.0021.87CATOM1310OTYRA15064.939−12.94923.4861.0031.39OATOM1311CBTYRA15066.005−9.95022.4251.0013.71CATOM1312CGTYRA15066.994−9.50921.3651.0014.13CATOM1313CD1TYRA15066.611−8.67320.3171.0014.64CATOM1314CD2TYRA15068.288−10.00021.3601.0018.32CATOM1315CE1TYRA15067.487−8.39019.2781.0011.91CATOM1316CE2TYRA15069.198−9.68220.3451.0011.10CATOM1317CZTYRA15068.804−8.90019.3261.0020.95CATOM1318OHTYRA15069.739−8.68518.3331.0027.73OATOM1319NASNA15163.536−11.24923.8011.0014.83NATOM1320CAASNA15162.903−11.88924.9371.0023.62CATOM1321CASNA15162.417−13.24424.4101.0028.53CATOM1322OASNA15162.630−14.24825.0721.0025.89OATOM1323CBASNA15161.655−11.11325.4391.0020.95CATOM1324CGASNA15161.988−9.86726.2841.0015.07CATOM1325OD1ASNA15161.126−9.02026.4661.0026.72OATOM1326ND2ASNA15163.231−9.70926.7001.006.31NATOM1327NARGA15261.731−13.24923.2591.0019.91NATOM1328CAARGA15261.129−14.46522.6871.0017.62CATOM1329CARGA15262.090−15.52322.1881.0021.34CATOM1330OARGA15261.959−16.68722.5421.0015.44OATOM1331CBARGA15260.086−14.14821.6101.0015.30CATOM1332CGARGA15258.672−13.75422.1571.0017.22CATOM1333CDARGA15257.652−13.29721.0491.009.11CATOM1334NEARGA15257.161−14.41920.2411.0021.05NATOM1335CZARGA15257.159−14.44718.9121.0028.61CATOM1336NH1ARGA15257.590−13.38718.2211.0021.98NATOM1337NH2ARGA15256.717−15.52818.2621.0026.11NATOM1338NGLNA15363.098−15.10421.4341.0016.54NATOM1339CAGLNA15364.044−16.03620.8421.009.74CATOM1340CGLNA15365.082−16.44321.8071.0016.70CATOM1341OGLNA15365.529−17.54521.7631.0024.35OATOM1342CBGLNA15364.789−15.37219.7141.008.99CATOM1343CGGLNA15365.935−16.22519.1161.004.63CATOM1344CDGLNA15366.315−15.63717.7621.0014.17CATOM1345OE1GLNA15365.611−14.76317.2541.0012.53OATOM1346NE2GLNA15367.466−16.02417.2281.0013.38NATOM1347NTYRA15465.566−15.51822.6081.0014.35NATOM1348CATYRA15466.677−15.83923.4831.0012.16CATOM1349CTYRA15466.323−15.93024.9541.0019.06CATOM1350OTYRA15467.185−16.20725.7771.0025.59OATOM1351CBTYRA15467.829−14.81623.3261.0016.89CATOM1352CGTYRA15468.418−14.73321.9431.0017.53CATOM1353CD1TYRA15469.259−15.72621.4671.0018.91CATOM1354CD2TYRA15468.080−13.71221.0911.0013.97CATOM1355CE1TYRA15469.782−15.68620.1901.0010.98CATOM1356CE2TYRA15468.621−13.63919.8061.0023.81CATOM1357CZTYRA15469.488−14.63419.3801.0023.08CATOM1358OHTYRA15470.002−14.61918.1181.0023.87OATOM1359NGLYA15565.080−15.68625.3131.0012.08NATOM1360CAGLYA15564.747−15.70226.7311.0015.80CATOM1361CGLYA15565.323−14.49827.5801.0033.97CATOM1362OGLYA15565.491−14.64028.7891.0025.76OATOM1363NARGA15665.564−13.31826.9811.0025.91NATOM1364CAARGA15666.066−12.14627.7341.0014.13CATOM1365CARGA15664.971−11.48628.5811.0016.23CATOM1366OARGA15663.802−11.91928.5831.0022.61OATOM1367CBARGA15666.601−11.12426.7501.0013.16CATOM1368CGARGA15667.875−11.57026.0991.0015.18CATOM1369CDARGA15668.930−11.41827.1211.0026.42CATOM1370NEARGA15670.200−11.91226.6331.0021.25NATOM1371CZARGA15671.092−12.55527.3861.0042.25CATOM1372NH1ARGA15670.870−12.79528.6791.0020.02NATOM1373NH2ARGA15672.221−12.96626.8431.0020.88NATOM1374NASPA15765.343−10.44629.3211.0016.00NATOM1375CAASPA15764.370−9.74930.1661.0016.20CATOM1376CASPA15764.444−8.24529.8411.0019.20CATOM1377OASPA15764.865−7.42930.6501.0010.71OATOM1378CEASPA15764.609−10.06131.6521.0016.50CATOM1379CGASPA15763.489−9.56032.566 1.0026.45CATOM1380OD1ASPA15762.433−9.06032.1081.0026.82OATOM1381OD2ASPA15763.673−9.65333.7841.0021.88OATOM1382NTYRA15864.038−7.92128.6201.0019.41NATOM1383CATYRA15864.099−6.56428.0831.0018.96CATOM1384CTYRA15862.688−5.97728.1271.0022.62CATOM1385OTYRA15861.854−6.29627.2821.0010.12OATOM1386CBTYRA15864.562−6.66126.6311.0016.34CATOM1387CGTYRA15865.982−7.16626.4841.0012.04CATOM1388CD1TYRA15866.789−7.41527.6211.0013.76CATOM1389CD2TYRA15866.544−7.34925.2181.0016.35CATOM1390CE1TYRA15868.135−7.78627.4821.008.18CATOM1391CE2TYRA15867.886−7.73225.0601.0013.73CATOM1392CZTYRA15868.676−7.94226.1861.0024.45CATOM1393OHTYRA15869.993−8.33825.9971.0014.36OATOM1394NARGA15962.423−5.20029.1751.0023.53NATOM1395CAARGA15961.105−4.60329.4831.0021.15CATOM1396CARGA15960.930−3.17228.8781.0023.55CATOM1397OARGA15961.911−2.56628.4241.0018.12OATOM1398CBARGA15960.891−4.60831.0341.0021.68CATOM1399CGARGA15960.986−6.02931.7221.0016.41CATOM1400CDARGA15961.135−6.05233.2331.0018.10CATOM1401NEARGA15961.305−7.40233.7721.0019.25NATOM1402CZARGA15961.164−7.72035.0581.0036.67CATOM1403NH1ARGA15960.886−6.77635.9621.0015.32NATOM1404NH2ARGA15961.309−8.98635.4481.0011.79NATOM1405NSERA16059.689−2.66128.8591.0024.44NATOM1406CASERA16059.312−1.39328.2001.0021.59CATOM1407CSERA16058.242−0.57728.9501.0025.07CATOM1408OSERA16057.257−1.12729.4541.0017.02OATOM1409CBSERA16058.719−1.74726.7971.0013.05CATOM1410OGSERA16059.782−1.89725.8851.0037.57OATOM1411NVALA16158.3780.74228.9271.0021.01NATOM1412CAVALA16157.3691.64429.5091.009.70CATOM1413CVALA16157.0682.74728.5041.0016.77CATOM1414OVALA16157.9553.14927.7291.0016.33OATOM1415CBVALA16157.8062.24830.8621.0017.94CATOM1416CG1VALA16157.8731.18531.9841.0016.16CATOM1417CG2VALA16159.1372.99230.7501.0021.10CATOM1418NMETA16255.7943.14728.4431.0022.46NATOM1419CAMETA16255.2964.18527.5131.0019.23CATOM1420CMETA16254.8805.31228.3971.0025.19CATOM1421OMETA16253.7885.26928.9611.0018.35OATOM1422CBMETA16253.9793.79626.8501.0015.55CATOM1423CGMETA16254.0132.63025.9491.0037.79CATOM1424SDMETA16254.3543.10024.2351.0052.07SATOM1425CEMETA16256.1933.13424.4101.0036.30CATOM1426NPROA16355.7306.31328.5211.0018.43NATOM1427CAPROA16355.3907.47229.3371.0017.76CATOM1428CPROA16354.3008.38428.6671.0021.23CATOM1429OPROA16354.2088.44827.4331.0015.20OATOM1430CBPROA16356.7278.19629.4231.0011.43CATOM1431CGPROA16357.3527.87428.0311.0013.99CATOM1432CDPROA16357.0866.40127.9491.0012.24CATOM1433NTHRA16453.4789.06029.4781.0013.95NATOM1434CATHRA16452.58110.12128.9631.0025.82CATOM1435CTHRA16453.40611.44128.7811.0019.67CATOM1436OTHRA16454.63311.39328.8681.0013.97OATOM1437CBTHRA16451.37310.39129.9031.0025.51CATOM1438OG1THRA16450.47011.32129.2671.0014.77OATOM1439CG2THRA16451.81810.88631.2981.009.06CATOM1440NASNA16552.75112.58928.5561.0014.99NATOM1441CAASNA16553.44813.90128.4811.007.83CATOM1442CASNA16554.16714.06429.8241.0011.21CATOM1443OASNA16553.55413.92930.8941.0017.66OATOM1444CEASNA16552.43415.06128.4161.0014.48CATOM1445CGASNA16551.49214.94127.2621.0023.70CATOM1446OD1ASNA16551.93914.80026.1291.0022.37OATOM1447ND2ASNA16550.17314.92527.5391.0027.22NATOM1448NLEUA16655.41814.49029.7771.008.23NATOM1449CALEUA16656.18714.60430.9941.0014.40CATOM1450CLEUA16656.62916.01731.1201.0025.05CATOM1451OLEUA16656.62416.71830.1251.0025.09OATOM1452CELEUA16657.46013.74330.8701.0017.48CATOM1453CGLEUA16657.42312.21830.6521.0016.63CATOM1454CD1LEUA16658.83711.63931.0001.0022.52CATOM1455CD2LEUA16656.33611.53931.5141.007.46CATOM1456NTYRA16757.14616.39132.3001.0019.78NATOM1457CATYRA16757.67817.76032.5111.0018.58CATOM1458CTYRA16758.53417.76333.7671.0015.53CATOM1459OTYRA16758.47416.85234.5751.0016.71OATOM1460CBTYRA16756.50918.77832.6651.0018.33CATOM1461CGTYRA16755.67118.56133.9311.0014.23CATOM1462CD1TYRA16754.62417.61833.9771.0013.35CATOM1463CD2TYRA16755.98419.25835.1061.0016.52CATOM1464CE1TYRA16753.88917.44635.1461.0021.17CATOM1465CE2TYRA16755.30219.08436.2641.008.26CATOM1466CZTYRA16754.22818.20336.2961.0023.56CATOM1467OHTYRA16753.52618.07837.5041.0022.81OATOM1468NGLYA16859.33418.79733.9521.0016.59NATOM1469CAGLYA16860.15818.81735.1521.0018.21CATOM1470CGLYA16861.53419.42834.8801.0013.69CATOM1471OGLYA16861.74620.02833.8371.0016.52OATOM1472NPROA16962.47319.26335.8171.0020.33NATOM1473CAPROA16963.80119.82235.6561.0016.07CATOM1474CPROA16964.43019.35334.3871.0027.18CATOM1475OPROA16964.30518.18633.9811.0021.23OATOM1476CBPROA16964.59519.20636.8051.0017.28CATOM1477CGPROA16963.64918.91937.8301.0019.89CATOM1478CDPROA16962.26318.77237.1891.0022.47CATOM1479NHISA17065.22620.23533.8291.0019.48NATOM1480CAHISA17065.95219.87732.6381.0025.56CATOM1481CHISA17065.09619.70731.4281.0029.15CATOM1482OHISA17065.55319.09130.4791.0029.71OATOM1483CBHISA17066.78318.60032.8451.0028.94CATOM1484CGHISA17067.70318.67134.0341.0033.88CATOM1485ND1HISA17068.97519.20333.9691.0025.46NATOM1486CD2HISA17067.51818.29835.3261.0034.77CATOM1487CE1HISA17069.53119.15135.1661.0025.63CATOM1488NE2HISA17068.67318.60336.0081.0031.72NATOM1489NASPA17163.88120.24531.4401.0021.52NATOM1490CAASPA17163.04120.26730.2181.0028.63CATOM1491CASPA17163.63021.45929.3591.0041.94CATOM1492OASPA17164.53422.17129.8351.0029.69OATOM1493CBASPA17161.55220.55830.6021.0026.40CATOM1494CGASPA17160.55220.09729.5401.0022.32CATOM1495OD1ASPA17160.89020.06728.3251.0032.03OATOM1496OD2ASPA17159.42719.71929.9161.0042.13OATOM1497NASNA17263.14121.71228.1371.0042.08NATOM1498CAASNA17263.61622.89327.3881.0035.95CATOM1499CASNA17262.66524.05627.6741.0033.71CATOM1500OASNA17261.58624.10227.1041.0032.69OATOM1501CBASNA17263.63222.66725.8691.0041.60CATOM1502CGASNA17263.80723.98725.0861.0039.09CATOM1503OD1ASNA17262.97324.34724.2591.0083.94OATOM1504ND2ASNA17264.85524.74025.4181.0065.07NATOM1505NPHEA17363.02124.95328.5831.0031.93NATOM1506CAPHEA17362.08226.03028.9441.0048.24CATOM1507CPHEA17361.98927.26028.0451.0069.01CATOM1508OPHEA17362.27828.39528.4651.0058.79OATOM1509CBPHEA17362.22526.45930.3901.0043.43CATOM1510CGPHEA17361.86725.39931.3561.0034.19CATOM1511CD1PHEA17362.81024.48831.7511.0024.68CATOM1512CD2PHEA17360.62125.35431.9251.0024.84CATOM1513CE1PHEA17362.52423.54832.6821.0023.64CATOM1514CE2PHEA17360.30524.36632.8041.0031.32CATOM1515CZPHEA17361.26323.45733.1921.0024.30CATOM1516NHISA17461.51027.03626.8311.0068.16NATOM1517CAHISA17461.40128.10925.8711.0064.53CATOM1518CHISA17459.97328.22125.4001.0071.58CATOM1519OHISA17459.30927.18625.2491.0073.20OATOM1520CBHISA17462.41827.87024.7361.0071.71CATOM1521CGHISA17463.83527.86825.2291.0092.29CATOM1522ND1HISA17464.92127.53924.4401.00100.00NATOM1523CD2HISA17464.33828.13326.4631.00100.00CATOM1524CE1HISA17466.03227.62825.1601.00100.00CATOM1525NE2HISA17465.70527.98126.3931.00100.00NATOM1526NPROA17559.46929.46125.2621.0065.71NATOM1527CAPROA17558.10929.65824.7701.0055.72CATOM1528CPROA17558.23329.29723.2671.0075.83CATOM1529OPROA17557.22429.22622.5541.0069.59OATOM1530CEPROA17557.86631.14225.0261.0049.14CATOM1531CGPROA17559.25831.79024.9011.0042.23CATOM1532CDPROA17560.28630.69525.1091.0049.59CATOM1533NSERA17659.48028.95422.8791.0085.09NATOM1534CASERA17659.95428.47421.5481.0081.18CATOM1535CSERA17659.66026.96521.3431.0073.90CATOM1536OSERA17659.61726.45820.2131.0057.03OATOM1537CBSERA17661.49328.66621.4471.0071.32CATOM1538OGSERA17662.04829.34922.5781.0051.93OATOM1539NASNA17759.52026.27622.4801.0066.23NATOM1540CAASNA17759.27424.84722.6191.0056.41CATOM1541CASNA17757.81024.49722.3531.0060.91CATOM1542OASNA17756.91425.21522.8111.0055.58OATOM1543CBASNA17759.61924.46924.0651.0050.45CATOM1544CGASNA17759.56222.97024.3191.0066.57CATOM1545OD1ASNA17759.09522.21623.4761.00100.00OATOM1546ND2ASNA17760.09922.54625.4641.0035.61NATOM1547NSERA17857.58323.38721.6271.0057.10NATOM1548CASERA17856.23422.85321.2791.0050.50CATOM1549CSERA17855.55722.15922.4911.0076.24CATOM1550OSERA17854.57521.40022.3041.0099.63OATOM1551CBSERA17856.31621.80020.1181.0010.17CATOM1552OGSERA17857.39722.11219.2171.0071.69OATOM1553NHISA17956.13422.28423.6941.0037.39NATOM1554CAHISA17955.56921.58724.8551.0030.96CATOM1555CHISA17954.96122.61625.7671.0021.93CATOM1556OHISA17955.64123.59826.1381.0025.17OATOM1557CBHISA17956.63420.68325.5751.0036.20CATOM1558CGHISA17956.97319.41924.8351.0042.90CATOM1559ND1HISA17956.97319.33523.4571.0049.52NATOM1560CD2HISA17957.32318.19025.2781.0052.42CATOM1561CE1HISA17957.28318.10923.0841.0044.78CATOM1562NE2HISA17957.50017.39324.1681.0050.49NATOM1563NVALA16053.66122.45426.0381.0019.14NATOM1564CAVALA18052.88623.44926.7891.0029.03CATOM1565CVALA18053.37323.89028.1421.0031.29CATOM1566OVALA18053.34825.07528.4471.0019.55OATOM1567CBVALA18051.40323.11526.9141.0035.47CATOM1568CG1VALA18050.63024.39927.2171.0035.84CATOM1569CG2VALA18050.92322.55025.6631.0036.11CATOM1570NILEA18153.68422.93529.0051.0026.57NATOM1571CAILEA18154.13823.28530.3601.0024.49CATOM1572CILEA18155.37124.21330.3611.0016.51CATOM1573OILEA18155.32625.31530.9091.0024.42OATOM1574CBILEA18154.28522.01831.2641.0020.20CATOM1575CG1ILEA18152.87821.42831.5281.0018.22CATOM1576CG2ILEA18155.01422.31532.5811.0013.37CATOM1577CD1ILEA18152.86720.08632.2861.008.03CATOM1578NPROA18256.45223.77929.7181.0022.21NATOM1579CAPROA18257.66424.60529.6401.0022.07CATOM1580CPROA18257.37925.85228.8281.0024.18CATOM1581OPROA18257.81126.94929.2101.0018.35OATOM1582CBPROA18258.68223.72528.8901.0024.97CATOM1583CGPROA18257.92522.47328.4711.0025.77CATOM1584CDPROA18256.72722.35929.4011.0018.23CATOM1585NALAA18356.62825.70727.7291.0021.45NATOM1586CAALAA18356.26126.89626.9431.0021.66CATOM1587CALAA18355.46427.90027.8111.0026.10CATOM1588OALAA18355.77329.09127.8561.0019.50OATOM1589CBALAA18355.47326.51325.7031.0013.26CATOM1590NLEUA18454.47227.38928.5431.0023.34NATOM1591CALEUA18453.64228.21529.4011.0019.05CATOM1592CLEUA18454.31228.69330.6551.0021.91CATOM1593OLEUA18454.01729.77131.1581.0019.71OATOM1594CBLEUA18452.30927.55329.7151.0014.41CATOM1595CGLEUA18451.34227.59528.5251.0023.42CATOM1596CD1LEUA18449.91827.24428.9281.0031.06CATOM1597CD2LEUA18451.38028.89627.6901.0021.73CATOM1598NLEUA18555.17827.87931.2131.0018.39NATOM1599CALEUA18555.83328.33232.4171.0016.39CATOM1600CLEUA18556.66929.52831.9851.0023.67CATOM1601OLEUA18556.68130.59032.6441.0029.38OATOM1602CBLEUA18556.72327.23333.0151.0015.05CATOM1603CGLEUA18556.02126.34834.0411.0015.56CATOM1604CD1LEUA18556.81925.02234.3011.0021.06CATOM1605CD2LEUA18555.72227.11335.3211.0011.02CATOM1606NARGA18657.33729.39730.8521.0017.09NATOM1607CAARGA18658.13730.52330.4291.0018.82CATOM1608CARGA18657.30831.75230.0691.0029.00CATOM1609OARGA18657.62932.88030.4761.0023.91OATOM1610CBARGA18659.02630.14629.2811.0022.06CATOM1611CGARGA18659.65331.36528.6521.0038.46CATOM1612CDARGA18660.82531.80429.4621.0083.66CATOM1613NEARGA18662.01231.86128.6311.0070.77NATOM1614CZARGA18663.05832.62228.9041.0091.68CATOM1615NH1ARGA18663.05333.38629.9951.0056.56NATOM1616NH2ARGA18664.09832.63928.0821.00100.00NATOM1617NARGA18756.23431.54429.3101.0020.96NATOM1618CAARGA18755.36132.66228.9411.0019.32CATOM1619CARGA18754.76533.45330.1421.0028.41CATOM1620OARGA18754.82334.70030.1931.0017.23OATOM1621CBARGA18754.27032.22327.9571.0017.05CATOM1622CGARGA18754.81331.54626.7201.0061.42CATOM1623CDARGA18753.69631.24425.7571.0044.57CATOM1624NEARGA18753.03332.47225.3541.0029.47NATOM1625CZARGA18751.83132.53424.7901.0017.82CATOM1626NH1ARGA18751.13631.42724.5441.0024.95NATOM1627NH2ARGA18751.34133.71624.4471.0037.77NATOM1628NPHEA18854.19232.73431.1011.0023.48NATOM1629CAPHEA18853.60433.39932.2591.0021.24CATOM1630CPHEA18854.63834.08033.0951.0021.39CATOM1631OPHEA18854.39435.12633.6261.0023.90OATOM1632CBPHEA18852.72332.46633.0771.0019.95CATOM1633CGPHEA18851.38932.21532.4351.0022.28CATOM1634CD1PHEA18850.44033.22932.3751.0019.42CATOM1635CD2PHEA18851.14431.03831.7341.0023.82CATOM1636CE1PHEA18849.19133.02631.7421.0024.77CATOM1637CE2PHEA18849.93630.82631.0571.0020.17CATOM1638CZPHEA18848.94531.81531.0681.0023.14CATOM1639NHISA18955.83133.51333.1181.0024.15NATOM1640CAHISA18956.93334.12233.8371.0028.79CATOM1641CHISA18957.30335.50633.3151.0028.58CATOM1642OHISA18957.48036.46334.0831.0020.07OATOM1643CBHISA18958.14833.26833.6411.0031.38CATOM1644CGHISA18959.36433.84434.2901.0029.98CATOM1645ND1HISA18959.54833.83335.6581.0031.00NATOM1646CD2HISA18960.44934.46433.7661.0021.79CATOM1647CE1HISA18960.72234.37135.9451.0024.04CATOM1648NE2HISA18961.25734.81534.8211.0019.53NATOM1649NGLUA19057.53935.56132.0061.0028.43NATOM1650CAGLUA19057.87636.81631.3241.0027.72CATOM1651CGLUA19056.72537.82931.4371.0032.56CATOM1652OGLUA19056.94938.99531.7171.0027.06OATOM1653CBGLUA19058.12236.52929.8491.0028.55CATOM1654CGGLUA19059.15035.46129.6141.0035.29CATOM1655CDGLUA19060.55335.94129.8921.0099.81CATOM1656OH1GLUA19060.91336.03731.0851.0086.56OATOM1657OE2GLUA19061.29336.16728.9101.00100.00OATOM1658NALAA19155.49337.39131.1961.0032.67NATOM1659CAALAA19154.34938.28631.3111.0025.30CATOM1660CALAA19154.28738.79532.7421.0036.20CATOM1661OALAA19153.92039.92433.0141.0027.52OATOM1662CBALAA19153.05537.56331.0001.0016.48CATOM1663NTHRA19254.54937.92733.6931.0029.39NATOM1664CATHRA19254.39538.38635.0411.0019.08CATOM1665CTHRA19255.42039.49435.2981.0044.78CATOM1666OTHRA19255.09440.55035.8391.0040.58OATOM1667CBTHRA19254.51537.23535.9831.0018.99CATOM1668OG1THRA19253.41036.34835.7551.0034.36OATOM1669CG2THRA19254.46137.73837.4251.0021.15CATOM1670NALAA19356.61739.31234.7571.0048.58NATOM1671CAALAA19357.70540.28634.9051.0050.59CATOM1672CALAA19357.49641.61334.1451.0054.42CATOM1673OALAA19357.95242.69834.5531.0048.28OATOM1674CBALAA19359.04739.64034.4961.0051.78CATOM1675NGLNA19456.81041.53033.0221.0043.16NATOM1676CAGLNA19456.58642.72232.2421.0038.03CATOM1677CGLNA19455.26443.38932.5761.0040.85CATOM1678OGLNA19454.83044.28431.8451.0051.20OATOM1679CBGLNA19456.59942.35830.7501.0035.96CATOM1680CGGLNA19457.91041.69230.2901.00100.00CATOM1681CDGLNA19457.71540.66129.1581.00100.00CATOM1682OE1GLNA19456.61940.54628.5791.00100.00OATOM1683NE2GLNA19458.78239.90428.8481.00100.00NATOM1684NGLYA19554.58342.94933.6301.0032.29NATOM1685CAGLYA19553.23643.46433.8641.0036.26CATOM1686CGLYA19552.29943.33232.5931.0045.33CATOM1687OGLYA19551.51544.24232.3461.0045.16OATOM1688NGLYA19652.40542.24531.7881.0036.33NATOM1689CAGLYA19651.51541.96530.6081.0019.06CATOM1690CGLYA19650.03741.95831.1171.0022.49CATOM1691OGLYA19649.72441.47932.2231.0033.09OATOM1692NPROA19749.14442.65730.4311.0029.22NATOM1693CAPROA19747.79042.73230.9531.0025.29CATOM1694CPROA19747.09141.41330.6741.0024.64CATOM1695OPROA19746.19240.99131.4111.0024.75OATOM1696CBPROA19747.16243.91130.1761.0026.31CATOM1697CGPROA19748.18844.40729.2521.0026.56CATOM1698CDPROA19749.30743.45429.2031.0030.25CATOM1699NASPA19847.57240.72329.6581.0016.88NATOM1700CAASPA19847.06739.41829.4051.0021.65CATOM1701CASPA19848.04638.52228.6771.0031.28CATOM1702OASPA19849.06238.97828.1721.0034.57OATOM1703CBASPA19845.73939.50728.6691.0032.80CATOM1704CGASPA19845.86840.05527.2561.0046.13CATOM1705OD1ASPA19846.98240.23026.7251.0057.45OATOM1706OD2ASPA19844.81740.27126.6401.0067.61OATOM1707NVALA19947.71337.23428.6141.0038.67NATOM1708CAVALA19948.49936.22627.9011.0027.79CATOM1709CVALA19947.46235.46927.0651.0025.88CATOM1710OVALA19946.46035.02327.5981.0024.22OATOM1711CBVALA19949.16335.22928.9051.0024.37CATOM1712CG1VALA19949.87434.04728.1601.0020.28CATOM1713CG2VALA19950.12135.94229.8351.0022.25CATOM1714NVALA20047.66135.38625.7571.0023.72NATOM1715CAVALA20046.70134.69424.9031.0023.99CATOM1716CVALA20047.16733.28624.4991.0022.85CATOM1717OVALA20048.32133.10824.1881.0029.77OATOM1718CBVALA20046.35835.54823.6801.0023.11CATOM1719CG1VALA20045.56134.73722.5981.0016.25CATOM1720CG2VALA20045.65236.82324.1301.0027.86CATOM1721NVALA20146.29632.27824.6321.0027.39NATOM1722CAVALA20146.58830.89324.2651.009.63CATOM1723CVALA20145.65330.52923.1651.0019.63CATOM1724OVALA20144.45230.75523.3121.0017.61OATOM1725CBVALA20146.30629.95225.4261.0019.95CATOM1726CG1VALA20146.70328.51925.0541.0020.85CATOM1727CG2VALA20147.08630.43926.6611.0016.73CATOM1728NTRPA20246.21030.08022.0301.0014.36NATOM1729CATRPA20245.42229.69320.8651.0018.97CATOM1730CTRPA20244.49528.57221.3131.0036.22CATOM1731OTRPA20244.93427.69422.0571.0031.46OATOM1732CBTRPA20246.29229.05519.8231.0019.14CATOM1733CGTRPA20247.24329.89419.0661.0033.65CATOM1734CD1TRPA20248.39129.46318.4291.0035.28CATOM1735CD2TRPA20247.12631.28218.7721.0039.90CATOM1736NE1TRPA20248.94130.48117.6931.0037.86NATOM1737CE2TRPA20248.22831.62417.9221.0038.35CATOM1738CE3TRPA20246.20632.28119.1381.0039.39CATOM1739CZ2TRPA20248.38032.88417.3671.0036.15CATOM1740CZ3TRPA20246.35633.54218.5781.0039.60CATOM1741CH2TRPA20247.42833.82817.6841.0040.99CATOM1742NGLYA20343.24528.56420.8421.0025.59NATOM1743CAGLYA20342.33227.48321.1691.0013.09CATOM1744CGLYA20341.26027.81322.1931.0021.12CATOM1745OGLYA20341.34028.81522.8861.0022.86OATOM1746NSERA20440.27026.91922.2621.0016.88NATOM1747CASERA20439.16326.97923.1921.0018.36CATOM1748CSERA20439.56126.66424.6591.0022.07CATOM1749OSERA20438.88827.09625.6041.0034.39OATOM1750CBSERA20438.05325.99822.7401.009.99CATOM1751OGSERA20438.23724.69523.2911.0016.37OATOM1752NGLYA20540.56225.81324.8541.0012.42NATOM1753CAGLYA20540.96325.41126.2081.0011.64CATOM1754CGLYA20540.20824.17826.7111.0019.49CATOM1755OGLYA20540.42223.72327.8381.0013.59OATOM1756NTHRA20639.29223.68325.8811.0015.38NATOM1757CATHRA20638.43222.59426.2811.0010.80CATOM1758CTHRA20639.05621.22126.1541.0026.39CATOM1759OTHRA20638.56420.26726.7371.0023.28OATOM1760CBTHRA20637.12422.56225.4601.0012.86CATOM1761OG1THRA20637.43822.39524.0821.0013.12OATOM1762CG2THRA20636.34823.84025.6201.0010.62CATOM1763NPROA20740.10121.08325.3541.0021.10NATOM1764CAPROA20740.65819.74325.1751.0018.15CATOM1765CPROA20741.31619.18126.4231.0021.75CATOM1766OPROA20741.95119.92527.2151.0020.65OATOM1767CBPROA20741.63819.90924.0131.0017.51CATOM1768CGPROA20741.14621.21323.3071.0021.45CATOM1769CDPROA20740.69822.06224.4311.0023.44CATOM1770NMETA20841.11217.87626.6241.0015.60NATOM1771CAMETA20841.69417.16727.7751.0022.94CATOM1772CMETA20843.05816.42727.5791.0021.90CATOM1773OMETA20843.24815.67726.6331.0023.16OATOM1774CBMETA20840.64516.27328.3861.0032.86CATOM1775CGMETA20839.63017.05729.2231.0046.17CATOM1776SDMETA20838.30115.99029.8261.0057.85SATOM1777CEMETA20837.99915.02828.3431.0058.23CATOM1778NARGA20944.02216.68128.4561.0017.75NATOM1779CAARGA20945.31816.04228.3241.0019.88CATOM1780CARGA20945.87115.53429.6391.0016.92CATOM1781OARGA20945.43315.94630.6971.0016.58OATOM1782CBARGA20946.34016.96327.6581.0021.07CATOM1783CGARGA20945.98017.47826.2751.0022.57CATOM1784CDARGA20945.83316.35725.2821.0028.26CATOM1785NEARGA20945.58616.81923.9061.0023.15NATOM1786CZARGA20944.42016.74223.2671.0034.52CATOM1787NH1ARGA20943.33616.26723.8901.0018.03NATOM1788NE2ARGA20944.33917.17522.0121.0029.78NATOM1789NGLUA21046.87814.67529.5471.0020.87NATOM1790CAGLUA21047.53014.07930.7201.0017.37CATOM1791CGLUA21049.03114.49030.8511.0020.96CATOM1792OGLUA21049.74814.62229.8411.0022.44OATOM1793CBGLUA21047.40012.56230.5711.0016.26CATOM1794CGGLUA21047.80711.78531.8091.0019.91CATOM1795CDGLUA21048.05710.30431.5311.0027.81CATOM1796OE1GLUA21048.1119.91930.3431.0017.29OATOM1797OE2GLUA21048.2689.54032.4941.0021.63OATOM1798NPHEA21149.50414.71232.0841.0014.02NATOM1799CAPHEA21150.88715.15932.3531.0017.48CATOM1800CPHEA21151.45814.41433.5311.0033.62CATOM1801OPHEA21150.71614.03134.4431.0027.96OATOM1802CBPHEA21150.93316.67732.6441.0017.78CATOM1803CGPHEA21150.30317.49031.5411.0021.49CATOM1804CD1PHEA21151.00917.67630.3201.0017.36CATOM1805CD2PHEA21148.93317.84431.6181.0015.09CATOM1806CE1PHEA21150.39918.33429.2371.0016.37CATOM1807CE2PHEA21148.28818.49130.5331.009.61CATOM1808CZPHEA21149.05318.75629.3441.0012.71CATOM1809NLEUA21252.76114.16133.4951.0023.76NATOM1810CALEUA21253.40513.44834.6031.0021.24CATOM1811CLEUA21254.77214.05334.8981.0014.00CATOM1812OLEUA21255.51914.39833.9851.0013.99OATOM1813CBLEUA21253.54811.95434.2941.0021.52CATOM1814CGLEUA21254.03311.03935.4061.0021.09CATOM1815CD1LEUA21252.86610.63436.2801.0020.84CATOM1816CD2LEUA21254.7689.82934.8321.0013.18CATOM1817NHISA21355.02314.30236.1751.009.60NATOM1818CAHISA21356.29014.86436.5551.0013.66CATOM1819CHISA21357.38013.82836.2931.0020.37CATOM1820OHISA21357.23812.61436.5421.0016.08OATOM1821CBHISA21356.28015.25038.0021.0018.72CATOM1822CGHISA21357.49116.01738.4081.0021.22CATOM1823ND1HISA21358.70315.40638.6561.0024.29NATOM1824CD2HISA21357.71617.35338.4991.0023.67CATOM1825CE1HISA21359.61516.33138.9171.0019.13CATOM1826NE2HISA21359.04117.52338.8471.0021.99NATOM1827NVALA21458.45914.29535.6981.0021.07NATOM1828CAVALA21459.53213.38335.3611.0019.23CATOM1829CVALA21460.06712.52336.5511.0027.20CATOM1830OVALA21460.60411.44436.3591.0022.23OATOM1831CEVALA21460.62514.12534.5661.0011.84CATOM1832CG1VALA21461.39015.19935.4851.008.52CATOM1833CG2VALA21461.56013.09733.9021.0012.39CATOM1834NASPA21559.89312.98437.7901.0025.29NATOM1835CAASPA21560.40612.22838.9361.0018.19CATOM1836CASPA21559.53011.02339.2301.0013.85CATOM1837OASPA21559.9889.98139.6661.0017.44OATOM1838CBASPA21560.57513.12940.1551.0016.27CATOM1839CGASPA21561.85913.97940.0681.0030.73CATOM1840OD1ASPA21562.78213.61439.3081.0023.02OATOM1841OD2ASPA21561.95715.02940.7301.0026.00OATOM1842NASPA21658.27611.13638.8631.0020.08NATOM1843CAASPA21657.37810.01739.0161.0018.78CATOM1844CASPA21657.7619.08337.8941.0023.56CATOM1845OASPA21657.7157.88038.0261.0020.79OATOM1846CBASPA21655.91210.45738.8211.0017.18CATOM1847CGASPA21655.19310.75740.1621.0038.03CATOM1848OD1ASPA21655.50310.11941.2231.0026.02OATOM1849OD2ASPA21654.24911.58740.1241.0025.41OATOM1850NMETA21758.0929.65336.7551.0018.11NATOM1851CAMETA21758.3948.78535.6361.0022.41CATOM1852CMETA21759.5727.94235.9921.0027.54CATOM1853OMETA21759.5796.75235.7101.0020.86OATOM1854CBMETA21758.6379.59234.3451.0021.24CATOM1855CGMETA21759.4788.91833.2871.0016.37CATOM1856SDMETA21758.9627.41232.4731.0030.51SATOM1857CEMETA21757.4657.60832.3911.0019.57CATOM1858NALAA21860.5618.56236.6231.0019.09NATOM1859CAALAA21861.7747.84137.0021.0013.65CATOM1860CALAA21861.4366.77838.0281.0022.61CATOM1861OALAA21861.9345.67037.9671.0019.36OATOM1862CBALAA21862.8098.78037.5791.0012.23CATOM1863NALAA21960.6057.10939.0001.0019.34NATOM1864CAALAA21960.3106.10540.0231.0018.01CATOM1865CALAA21959.6304.90139.4131.0023.57CATOM1866OALAA21959.7813.77739.8981.0022.71OATOM1867CBALAA21959.3876.67841.0831.0010.11CATOM1868NALAA22058.7535.17438.4541.0018.99NATOM1869CAALAA22057.9054.15837.8551.0014.12CATOM1870CALAA22058.7533.21337.0341.0025.33CATOM1871OALAA22058.5842.00637.1141.0020.63OATOM1872CBALAA22056.7964.79837.0231.008.53CATOM1873NSERA22159.7703.77236.3791.0023.92NATOM1874CASERA22160.7023.01135.5561.0018.38CATOM1875CSERA22161.5371.98936.3531.0020.90CATOM1876OSERA22161.6830.79935.9831.0019.84OATOM1877CBSERA22161.6043.98534.8041.0010.67CATOM1878OGSERA22160.8474.74433.8671.0015.61OATOM1879NILEA22262.0832.47637.4631.0018.12NATOM1880CAILEA22262.8661.64438.3811.0021.56CATOM1881CILEA22262.0200.55439.0681.0029.10CATOM1882OILEA22262.504−0.56639.3071.0019.03OATOM1883CBILEA22263.4672.51639.4321.0024.56CATOM1884CG1ILEA22264.4653.47338.7651.0032.13CATOM1885CG2ILEA22264.1291.67140.5001.0028.26CATOM1886CD1ILEA22264.9734.58539.6491.0015.61CATOM1887NHISA22360.7720.90739.3841.0019.34NATOM1888CAHISA22359.829−0.03139.9961.0020.46CATOM1889CHISA22359.599−1.09738.9641.0024.82CATOM1890OHISA22359.723−2.28339.2701.0024.66OATOM1891CBHISA22358.4650.63740.3591.0019.53CATOM1892CGHISA22357.373−0.33340.7591.0028.64CATOM1893ND1HISA22357.021−0.56442.0821.0024.16NATOM1894CD2HISA22356.497−1.06240.0041.0030.39CATOM1895CE1HISA22355.983−1.39942.1121.0030.39CATOM1896NE2HISA22355.652−1.72740.8691.0028.13NATOM1897NVALA22459.354−0.68437.7251.0022.06NATOM1898CAVALA22459.111−1.65736.6521.0019.15CATOM1899CVALA22460.350−2.49036.3331.0025.89CATOM1900OVALA22460.282−3.70936.2501.0022.37OATOM1901CBVALA22458.559−1.02235.3771.0022.59CATOM1902CG1VALA22458.512−2.05034.2311.0022.61CATOM1903CG2VALA22457.161−0.49135.6501.0023.44CATOM1904NMETA22561.499−1.83836.2551.0027.83NATOM1905CAMETA22562.710−2.57736.0041.0023.69CATOM1906CMETA22562.896−3.67837.0711.0031.95CATOM1907OMETA22563.290−4.80536.7851.0024.33OATOM1908CBMETA22563.902−1.60436.0561.0021.34CATOM1909CGMETA22565.295−2.29635.9991.0017.83CATOM1910SDMETA22565.750−2.95834.3061.0023.33SATOM1911CEMETA22567.080−1.89633.7851.0016.46CATOM1912NGLUA22662.644−3.31938.3161.0019.54NATOM1913CAGLUA22662.988−4.16139.4281.0021.58CATOM1914CGLUA22661.999−5.20039.9181.0030.77CATOM1915OGLUA22662.308−6.01240.7801.0029.39OATOM1916CBGLUA22663.613−3.32340.5471.0020.47CATOM1917CGGLUA22664.937−2.67340.1221.0023.03CATOM1918CDGLUA22665.504−1.80941.2081.0032.62CATOM1919OE1GLUA22664.721−1.45542.1221.0026.12OATOM1920OE2GLUA22666.711−1.47941.1521.0017.67OATOM1921NLEUA22760.837−5.24839.2951.0034.11NATOM1922CALEUA22759.883−6.29639.6421.0035.26CATOM1923CLEUA22760.537−7.64439.3201.0027.91CATOM1924OLEUA22761.291−7.76638.3401.0019.89OATOM1925CBLEUA22758.693−6.23638.6781.0036.48CATOM1926CGLEUA22757.381−5.56938.9551.0040.30CATOM1927CD1LEUA22757.697−4.19439.3821.0042.04CATOM1928CD2LEUA22756.610−5.57737.6471.0046.21CATOM1929NALAA22860.026−8.68839.9551.0027.15NATOM1930CAALAA22860.425−10.05139.6161.0025.26CATOM1931CALAA22859.801−10.43538.2791.0027.93CATOM1932OALAA22858.624−10.09337.9341.0031.26OATOM1933CBALAA22860.003−11.05240.7031.0022.05CATOM1934NHISA22960.624−11.16037.5391.0027.05NATOM1935CAHISA22960.275−11.60536.2221.0024.42CATOM1936CHISA22958.905−12.26036.1841.0021.74CATOM1937OHISA22958.015−11.85135.3981.0022.22OATOM1938CBHISA22961.351−12.52035.6981.0017.71CATOM1939CGHISA22961.284−12.70134.2201.0027.24CATOM1940ND1HISA22961.060−11.65033.3501.0034.38NATOM1941CD2HISA22961.292−13.82133.4651.0031.45CATOM1942CB1HISA22960.992−12.11332.1151.0030.50CATOM1943NE2HISA22961.124−13.42732.1591.0035.23NATOM1944NGLUA23058.681−13.16137.1401.0020.24NATOM1945CAGLUA23057.425−13.89537.2091.0029.41CATOM1946CGLUA23056.181−13.05137.3411.0022.20CATOM1947OGLUA23055.159−13.35936.6791.0017.78OATOM1948CBGLUA23057.464−14.99738.2741.0038.51CATOM1949CGGLUA23058.085−14.58239.5671.0063.09CATOM1950CDGLUA23057.036−14.47340.6611.00100.00CATOM1951OE1GLUA23055.859−14.87240.4001.00100.00OATOM1952OE2GLUA23057.409−14.00341.7681.0081.48OATOM1953NVALA23156.272−12.00438.1821.0016.53NATOM1954CAVALA23155.202−11.02938.3561.0020.23CATOM1955CVALA23155.009−10.16437.1021.0024.45CATOM1956OVALA23153.864−9.83436.7051.002 1.00OATOM1957CBVALA23155.541−10.05739.4261.0028.61CATOM1958CG1VALA23154.362−9.09839.6101.0029.78CATOM1959CG2VALA23155.881−10.75740.6771.0028.96CATOM1960NTRPA23256.133−9.79836.4861.0017.17NATOM1961CATRPA23256.052−9.04435.2621.0021.52CATOM1962CTRPA23255.388−9.84434.1561.0020.53CATOM1963OTRPA23254.588−9.30633.3801.0024.31OATOM1964CBTRPA23257.438−8.64434.8011.0029.88CATOM1965CGTRPA23257.430−7.84333.5001.0027.65CATOM1966CD1TRPA23257.184−6.46433.3561.0025.42CATOM1967CD2TRPA23257.714−8.33632.1691.0027.75CATOM1968NE1TRPA23257.325−6.09532.0331.0022.53NATOM1969CB2TRPA23257.655−7.20331.2791.0025.11CATOM1970CB3TRPA23258.037−9.60331.6401.0022.72CATOM1971CZ2TRPA23257.917−7.31629.8791.0017.23CATOM1972CZ3TRPA23258.238−9.72030.2231.0025.97CATOM1973CH2TRPA23258.154−8.58129.3681.0022.07CATOM1974NLEUA23355.749−11.12134.0181.0023.80NATOM1975CALEUA23355.141−11.94932.9371.0024.78CATOM1976CLEUA23353.652−12.11833.1221.0024.51CATOM1977OLEUA23352.865−12.07532.1631.0028.50OATOM1978CBLEUA23355.765−13.34832.8201.0026.20CATOM1979CGLEUA23357.250−13.50532.5031.0019.39CATOM1980CD1LEUA23357.745−14.85033.0231.0019.90CATOM1981CD2LEUA23357.561−13.28731.0171.0016.01CATOM1982NGLUA23453.298−12.34334.3721.0025.45NATOM1983CAGLUA23451.929−12.52334.8221.0030.04CATOM1984CGLUA23451.128−11.31934.3671.0035.69CATOM1985OGLUA23449.926−11.39034.0521.0028.25OATOM1986CBGLUA23452.007−12.46836.3441.0037.30CATOM1987CGGLUA23450.908−13.13337.1181.0045.39CATOM1988CDGLUA23451.112−12.88138.6011.00100.00CATOM1989OE1GLUA23452.240−13.13739.1041.0099.09OATOM1990OE2GLUA23450.211−12.25739.2111.00100.00OATOM1991NASNA23551.802−10.18434.3641.0025.04NATOM1992CAASNA23551.109−8.98633.9921.0026.17CATOM1993CASNA23551.280−8.49432.5711.0030.46CATOM1994OASNA23550.824−7.39332.2591.0022.90OATOM1995CBASNA23551.427−7.89534.9811.0029.23CATOM1996CGASNA23550.878−8.19736.3421.0039.27CATOM1997OD1ASNA23549.722−7.88236.6281.0029.06OATOM1998ND2ASNA23551.653−8.93437.1401.0040.22NATOM1999NTHRA23651.935−9.26831.7081.0020.97NATOM2000CATHRA23652.108−8.79530.3441.0022.30CATOM2001CTHRA23651.867−9.94329.4191.0029.74CATOM2002OTHRA23651.551−11.03329.8951.0021.23OATOM2003CBTHRA23653.545−8.30630.1611.0022.73CATOM2004OG1THRA23654.422−9.32530.6361.0021.23OATOM2005CG2THRA23653.801−7.04831.0411.0019.69CATOM2006NGLNA23752.003−9.69928.1091.0022.23NATOM2007CAGLNA23752.097−10.78327.1221.0016.69CATOM2008CGLNA23753.335−10.50726.3311.0021.02CATOM2009OGLNA23753.729−9.36226.2041.0022.19OATOM2010CBGLNA23750.913−10.99926.1891.008.23CATOM2011CGGLNA23749.639−11.09626.9041.0021.04CATOM2012CDGLNA23748.907−9.86226.6061.0062.07CATOM2013OE1GLNA23748.437‘9.71225.4601.0059.32OATOM2014NE2GLNA23749.220−8.84727.3881.0037.82NATOM2015NPROA23854.002‘11.57925.9171.0028.76NATOM2016CAPROA23855.275−11.43825.2461.0030.28CATOM2017CPROA23855.194−10.64323.9581.0029.08CATOM2018OPROA23856.181−10.02923.6001.0015.95OATOM2019CBPROA23855.733−12.87925.0111.0022.54CATOM2020CGPROA23854.898−13.71025.8861.0018.92CATOM2021CDPROA23853.626−12.99826.0681.0011.75CATOM2022NMETA23954.041−10.63523.2861.0017.26NATOM2023CAMETA23953.924−9.80722.1041.0017.85CATOM2024CMETA23953.109−8.50922.3621.0018.63CATOM2025OMETA23952.792−7.74121.4191.0016.82OATOM2026CBMETA23953.460−10.58820.8811.0015.22CATOM2027CGMETA23954.536−11.53420.2611.0012.90CATOM2028SDMETA23953.994−12.53418.8081.0017.49SATOM2029CEMETA23954.350−11.35717.4221.0013.12CATOM2030NLEUA24052.847−8.25223.6461.0018.55NATOM2031CALEUA24052.159−7.03724.1311.0016.68CATOM2032CLEUA24052.774−6.73325.4931.0011.82CATOM2033OLEUA24052.124−6.80326.5491.0013.84OATOM2034CBLEUA24050.645−7.24924.2401.0016.91CATOM2035CGLEUA24049.646−6.12023.8521.0022.29CATOM2036CD1LEUA24048.968−5.48825.0331.0025.51CATOM2037CD2LEUA24050.070−5.05922.8151.0028.07CATOM2038NSERA24154.076−6.46725.4561.0013.09NATOM2039CASERA24154.842−6.31526.6821.0024.20CATOM2040CSERA24154.947−4.93827.3771.0030.52CATOM2041OSERA24155.363−4.85428.5471.0017.02OATOM2042CBSERA24156.247−6.90026.4951.0014.04CATOM2043OGSERA24157.062−6.14425.5981.0013.95OATOM2044NHISA24254.661−3.86126.6591.0017.87NATOM2045CAHISA24254.894−2.54827.2211.0013.55CATOM2046CHISA24253.990−2.25428.3731.0013.70CATOM2047OHISA24252.974−2.88528.5391.0013.29OATOM2048CBHISA24254.826−1.43026.1301.0016.05CATOM2049CGHISA24253.595−1.50425.2721.0018.88CATOM2050ND1HISA24252.591−0.55325.3261.0023.24NATOM2051CD2HISA24253.165−2.46124.4131.0013.19CATOM2052CE1HISA24251.629−0.88724.4831.0017.44CATOM2053NE2HISA24251.962−2.03123.9011.0019.54NATOM2054NILEA24354.310−1.20329.0951.0015.84NATOM2055CAILEA24353.492−0.80930.1921.0019.10CATOM2056CILEA24353.3360.71430.1911.0023.23CATOM2057OILEA24354.3121.40630.3851.0012.10OATOM2058CBILEA24354.166−1.27331.4821.0024.62CATOM2059CG1ILEA24354.014−2.78331.5761.0025.60CATOM2060CG2ILEA24353.497−0.66532.7351.0017.37CATOM2061CD1ILEA24354.725−3.36532.7141.0014.62CATOM2062NASNA24452.1121.21730.0131.0016.43NATOM2063CAASNA24451.8242.68930.0381.0018.99CATOM2064CASNA24452.2523.29231.3481.0018.83CATOM2065OASNA24451.9652.72732.4051.0019.58OATOM2066CBASNA24450.3042.98729.9101.0015.67CATOM2067CGASNA24449.7682.70228.5171.0014.57CATOM2068OD1ASNA24450.5462.58327.5801.0013.64OATOM2069ND2ASNA24448.4432.49128.3931.0010.16NATOM2070NVALA24552.8004.49931.3261.0013.50NATOM2071CAVALA24553.1595.13432.6021.0013.49CATOM2072CVALA24552.5286.56632.6441.0016.25CATOM2073OVALA24552.7867.40531.7701.0015.20OATOM2074CBVALA24554.7545.16332.8101.0021.07CATOM2075CG1VALA24555.1546.08533.9371.0015.08CATOM2076CG2VALA24555.2803.81733.1431.0015.82CATOM2077NGLYA24651.6966.84333.6491.0014.03NATOM2078CAGLYA24651.0278.13633.7071.0016.87CATOM2079CGLYA24650.1468.20334.9391.0026.95CATOM2080OGLYA24650.3237.40135.8501.0023.04OATOM2081NTHRA24749.2079.16134.9631.0021.44NATOM2082CATHRA24748.2329.27636.0631.0021.39CATOM2083CTHRA24746.8688.67735.6731.0024.08CATOM2084OTHRA24746.0698.30636.5081.0021.03OATOM2085CBTHRA24747.98810.73036.4041.0022.24CATOM2086OG1THRA24747.40911.38935.2651.0018.62OATOM2087CG2THRA24749.27511.37836.7241.0018.99CATOM2088NGLYA24846.5838.65134.3841.0024.95NATOM2089CAGLYA24845.3198.14333.9241.0022.61CATOM2090CGLYA24844.2239.16034.2261.0021.42CATOM2091OGLYA24843.0598.86634.1371.0025.70OATOM2092NVALA24944.61510.38634.5211.0030.72NATOM2093CAVALA24943.67311.46434.8271.0026.09CATOM2094CVALA24943.74712.59633.7861.0032.70CATOM2095OVALA24944.85313.00633.3871.0026.92OATOM2096CBVALA24944.02012.08536.2141.0038.59CATOM2097CG1VALA24943.22513.32436.4701.0036.11CATOM2098CG2VALA24943.78211.08337.3061.0041.30CATOM2099NASPA25042.58113.12533.3971.0027.95NATOM2100CAASPA25042.48814.23232.4391.0020.64CATOM2101CASPA25042.61115.58133.1551.0027.63CATOM2102OASPA25042.18815.78334.3081.0026.23OATOM2103CBASPA25041.07514.30231.8271.0023.89CATOM2104CGASPA25040.76813.18030.8501.0039.52CATOM2105OD1ASPA25041.28313.18429.6881.0039.96OATOM2106OD2ASPA25039.76712.50131.1531.0045.34OATOM2107NCYSA25143.02916.56632.3881.0020.12NATOM2108CACYSA25142.96217.90632.8511.0027.20CATOM2109CCYSA25142.91818.77931.5771.0026.47CATOM2110OCYSA25143.69918.56030.6331.0019.45OATOM2111CBCYSA25144.14818.15733.7781.0034.86CATOM2112SGCYSA25145.12919.61933.4531.0029.47SATOM2113NTHRA25241.93219.67331.4941.0014.85NATOM2114CATHRA25241.83420.58830.3351.0021.21CATOM2115CTHRA25242.99921.59230.2361.0020.53CATOM2116OTHRA25243.65721.92631.2491.0015.24OATOM2117CBTHRA25240.50621.40730.3291.0032.08CATOM2118OG1THRA25240.46022.30431.4471.0019.26OATOM2119CG2THRA25239.30920.49530.3721.0013.91CATOM2120NILEA25343.22822.09529.0241.0014.81NATOM2121CAILEA25344.26423.11828.8121.0016.90CATOM2122CILEA25343.93424.38329.6271.0023.41CATOM2123OILEA25344.83425.01230.2471.0015.27OATOM2124CBILEA25344.40423.45227.3021.0024.05CATOM2125CG1ILEA25344.86222.20026.5611.0027.33CATOM2126CG2ILEA25345.47324.47927.0771.009.22CATOM2127CD1ILEA25345.66221.27627.4521.0049.56CATOM2128NARGA25442.63724.70929.7071.0019.56NATOM2129CAARGA25442.22825.86530.5221.0019.41CATOM2130CARGA25442.71225.71331.9701.0018.10CATOM2131OARGA25443.31126.61632.5151.0013.89OATOM2132CBARGA25440.70426.10130.4801.0015.98CATOM2133CGARGA25440.28227.37831.2551.009.96CATOM2134CDARGA25438.80927.70231.2181.0024.79CATOM2135NEARGA25438.49828.41429.9971.0029.42NATOM2136CZARGA25438.69329.72329.7941.0059.85CATOM2137NH1ARGA25439.19430.52730.7321.0042.58NATOM2138NH2ARGA25438.37730.24528.6201.0018.44NATOM2139NASPA25542.40624.56432.5861.0020.22NATOM2140CAASPA25542.79524.20533.9741.0016.48CATOM2141CASPA25544.32124.37234.0691.0022.43CATOM2142OASPA25544.86824.89735.0601.0018.53OATOM2143CBASPA25542.47822.68634.1571.0019.17CATOM2144CGASPA25542.14422.24635.6101.0047.08CATOM2145OD1ASPA25541.78023.09036.4291.0049.66OATOM2146OD2ASPA25542.02021.01635.8801.0048.12OATOM2147NLEUA25645.01423.80933.0781.0015.98NATOM2148CALEUA25646.46523.84433.0691.0021.76CATOM2149CLEUA25647.02025.27533.0761.0016.79CATOM2150OLEUA25647.82525.69733.9461.0015.24OATOM2151CBLEUA25646.96723.05631.8591.0023.33CATOM2152CGLEUA25648.49123.10031.7651.0026.80CATOM2153CD1LEUA25649.17122.33432.9841.0017.13CATOM2154CD2LEUA25649.04022.72430.3461.0015.42CATOM2155NALAA25746.52026.04832.1401.0013.77NATOM2156CAALAA25746.93827.43632.0251.0012.70CATOM2157CALAA25746.65628.23733.2671.0010.73CATOM2158OALAA25747.45129.07333.6721.0020.33OATOM2159CBALAA25746.20828.07330.8341.0013.34CATOM2160NGLNA25845.47028.08033.8351.0012.40NATOM2161CAGLNA25845.10228.91134.9811.008.39CATOM2162CGLNA25845.87928.48036.1661.0013.48CATOM2163OGLNA25846.17829.28137.0291.0022.96OATOM2164CBGLNA25843.61428.76135.3051.0016.12CATOM2165CGGLNA25842.67429.09634.1301.0030.19CATOM2166CDGLNA25842.57430.58533.7811.0037.29CATOM2167OE1GLNA25842.91131.47134.6101.0021.24OATOM2168NE2GLNA25842.02130.87632.5721.0015.94NATOM2169NTHRA25946.17927.18236.2321.0016.21NATOM2170CATHRA25946.98226.67837.3361.0016.85CATOM2171CTHRA25948.41027.18637.2331.0020.56CATOM2172OTHRA25949.00227.62138.2141.0021.44OATOM2173CBTHRA25947.06625.19237.3611.0027.56CATOM2174OG1THRA25945.75224.62037.5091.0020.92OATOM2175CG2THRA25947.93624.79638.5451.0012.85CATOM2176NILEA26048.95227.17036.0281.0019.96NATOM2177CAILEA26050.29227.70435.8391.0023.01CATOM2178CILEA26050.31329.18036.2251.0031.73CATOM2179OILEA26051.21129.62736.9931.0025.90OATOM2180CBILEA26050.83527.45634.3901.0022.46CATOM2181CG1ILEA26051.15325.94034.2321.0024.12CATOM2182CG2ILEA26052.09928.36134.1061.0013.47CATOM2183CD1ILEA26051.50125.44332.8101.0012.58CATOM2184NALAA26149.28029.91035.7641.0015.35NATOM2185CAALAA26149.17731.35536.0481.0016.00CATOM2186CALAA26149.31631.60437.5501.0020.58CATOM2187OALAA26150.10432.44337.9871.0016.09OATOM2188CBALAA26147.83231.95835.4871.0013.65CATOM2189NLYSA26248.55130.84338.3231.0011.50NATOM2190CALYSA26248.57830.90539.7701.0010.13CATOM2191CLYSA26249.96830.46040.2961.0028.08CATOM2192OLYSA26250.50331.08441.2051.0029.37OATOM2193CBLYSA26247.45330.03240.3351.0012.50CATOM2194CGLYSA26247.33229.96241.8881.0016.51CATOM2195CDLYSA26246.09229.09242.3711.0046.61CATOM2196CELYSA26246.34427.55542.6611.0099.70CATOM2197NZLYSA26245.15726.70343.2001.0036.59NATOM2198NVALA26350.58929.44339.7051.0017.44NATOM2199CAVALA26351.91529.03940.1711.0016.72CATOM2200CVALA26352.99730.17039.9971.0032.12CATOM2201OVALA26353.87130.41240.8341.0021.18OATOM2202CSVALA26352.38927.70939.4761.0016.35CATOM2203CG1VALA26353.92027.51839.6471.0011.83CATOM2204CG2VALA26351.64626.52240.0931.0014.99CATOM2205NVALA26452.91330.89938.9091.0021.75NATOM2206CAVALA26453.91731.87738.6531.0019.81CATOM2207CVALA26453.71933.20839.3771.0035.79CATOM2208OVALA26454.63234.03239.4821.0028.99OATOM2209CSVALA26454.05932.01437.1751.0024.27CATOM2210CG1VALA26454.72833.26936.8221.0033.58CATOM2211CG2VALA26454.84030.80836.6741.0023.01CATOM2212NGLYA26552.55033.37839.9691.0025.30NATOM2213CAGLYA26552.24134.62040.6361.0024.14CATOM2214CGLYA26551.73035.69439.6321.0035.03CATOM2215OGLYA26551.77336.91139.9621.0033.71OATOM2216NTYRA26651.29435.25738.4281.0026.25NATOM2217CATYRA26650.69836.15137.3731.0026.55CATOM2218CTYRA26649.36436.74537.8181.0031.01CATOM2219OTYRA26648.53236.06738.4561.0027.99OATOM2220CBTYRA26650.50135.46336.0081.0024.31CATOM2221CGTYRA26649.99436.38134.8841.0028.64CATOM2222CD1TYRA26650.67037.58234.5421.0035.05CATOM2223CD2TYRA26648.86036.03834.1181.0022.60CATOM2224CE1TYRA26650.21238.43433.4721.0020.73CATOM2225CE2TYRA26648.42836.85933.0121.0020.91CATOM2226CZTYRA26649.08838.06232.7351.0023.85CATOM2227OHTYRA26648.62238.85131.7101.0033.40OATOM2228NLYSA26749.21738.04337.6041.0025.72NATOM2229CALYSA26747.98838.69738.0091.0030.77CATOM2230CLYSA26747.21739.28036.7981.0028.85CATOM2231OLYSA26746.17939.89436.9491.0031.17OATOM2232CBLYSA26748.27939.74139.0921.0027.13CATOM2233CGLYSA26748.72839.12840.4031.0023.18CATOM2234CDLYSA26748.42040.09641.5621.0030.98CATOM2235CELYSA26747.93339.35842.8201.0048.52CATOM2236NZLYSA26747.00538.20842.5051.00100.00NATOM2237NGLYA26847.71639.05435.5941.0022.67NATOM2238CAGLYA26847.01939.51834.3941.0021.38CATOM2239CGLYA26845.85638.56834.0851.0031.03CATOM2240OGLYA26845.45537.72834.9111.0019.71OATOM2241NARGA26945.38738.64532.8491.0030.40NATOM2242CAARGA26944.26337.84632.3991.0026.47CATOM2243CARGA26944.68036.70531.4891.0022.35CATOM2244OARGA26945.37836.92630.5241.0022.75OATOM2245CBARGA26943.29738.75331.6261.0022.65CATOM2246CGARGA26942.20139.39032.4631.0024.21CATOM2247CDARGA26940.93639.46531.5681.0083.45CATOM2248NEARGA26940.11340.67631.7621.00100.00NATOM2249CZARGA26938.80840.75131.4311.00100.00CATOM2250NH1ARGA26938.20139.69130.9211.0099.93NATOM2251NH2ARGA26938.09441.86531.6631.00100.00NATOM2252NVALA27044.19535.49431.7581.0019.87NATOM2253CAVALA27044.46834.38930.8561.0024.82CATOM2254CVALA27043.31934.45629.8241.0022.51CATOM2255OVALA27042.14534.50130.1811.0025.79OATOM2256CBVALA27044.43632.97931.5711.0024.03CATOM2257CG1VALA27044.57631.86130.5331.0020.72CATOM2258CG2VALA27045.50632.84932.6391.0011.27CATOM2259NVALA27143.66034.40928.5541.0025.18NATOM2260CAVALA27142.66634.49227.4871.0028.32CATOM2261CVALA27142.81933.37026.4421.0024.89CATOM2262OVALA27143.92333.11525.9801.0021.98OATOM2263CBVALA27142.90135.81326.7361.0029.25CATOM2264CG1VALA27142.25635.77325.3701.0031.91CATOM2265CG2VALA27142.42136.98927.5651.0018.72CATOM2266NPHEA27241.71632.75826.0191.0026.14NATOM2267CAPHEA27241.75231.74724.9631.0024.34CATOM2268CPHEA27241.23632.26623.6231.0028.95CATOM2269OPHEA27240.15532.82623.5821.0022.01OATOM2270CBPHEA27240.96030.50625.3911.0020.97CATOM2271CGPHEA27241.76429.57026.2431.0021.77CATOM2272CD1PHEA27241.94029.84227.6101.0014.60CATOM2273CD2PHEA27242.50428.55025.6561.0022.19CATOM2274CE1PHEA27242.76329.04128.4341.0017.89CATOM2275CE2PHEA27243.33627.72626.4541.0027.64CATOM2276CZPHEA27243.47827.97927.8511.0025.14CATOM2277NASPA27342.01232.11422.5421.0029.45NATOM2278CAASPA27341.55732.53621.2141.0022.33CATOM2279CASPA27340.89631.36520.4931.0025.67CATOM2280OASPA27341.53930.57019.7931.0017.81OATOM2281CBASPA27342.67233.11420.3431.0021.45CATOM2282CGASPA27342.13133.62618.9901.0026.89CATOM2283OD1ASPA27340.97533.24918.5981.0027.76OATOM2284OD2ASPA27342.83834.42118.3271.0030.06OATOM2285NALAA27439.58931.28420.6491.0015.59NATOM2286CAALAA27438.93230.12820.1281.0023.75CATOM2287CALAA27438.85330.16818.6531.0032.30CATOM2288OALAA27438.28429.25618.0291.0029.37OATOM2289CBALAA27437.56729.90520.7771.0018.87CATOM2290NSERA27539.37231.24318.0811.0021.10NATOM2291CASERA27539.34331.28816.6311.0026.90CATOM2292CSERA27540.39030.30016.1161.0043.37CATOM2293OSERA27540.42129.94914.9271.0046.32OATOM2294CBSERA27539.54732.68316.0741.0015.19CATOM2295OGSERA27540.90433.07016.0781.0028.71OATOM2296NLYSA27641.19229.78017.0371.0022.98NATOM2297CALYSA27642.17828.79116.6381.0023.28CATOM2298CLYSA27641.64527.40516.9761.0029.73CATOM2299OLYSA27640.99227.20618.0101.0025.10OATOM2300CSLYSA27643.54429.05117.2751.0019.19CATOM2301CGLYSA27643.95730.49617.2181.0032.11CATOM2302CDLYSA27644.06230.85215.7981.0022.43CATOM2303CELYSA27644.93032.06715.5701.0023.18CATOM2304NZLYSA27645.45432.11714.1521.0029.42NATOM2305NPROA27741.89226.47616.0551.0036.04NATOM2306CAPROA27741.44625.08716.1701.0035.93CATOM2307CPROA27742.02224.33217.3631.0029.30CATOM2308OPROA27743.10324.65017.8851.0030.54OATOM2309CBPROA27741.97524.45314.8781.0039.65CATOM2310CGPROA27743.24925.26114.5661.0042.90CATOM2311CDPROA27742.78726.67014.8921.0037.84CATOM2312NASPA27841.27323.33917.8091.0022.35NATOM2313CAASPA27841.74522.50118.9031.0022.16CATOM2314CASPA27842.18421.18918.2721.0019.66CATOM2315OASPA27841.90520.91717.1171.0023.49OATOM2316CBASPA27840.63622.24119.9711.0015.09CATOM2317CGASPA27840.21623.50320.7021.0022.86CATOM2318OD1ASPA27841.11324.25421.0961.0025.18OATOM2319OD2ASPA27838.99923.78720.8121.0039.55OATOM2320NGLYA27942.84620.35519.0441.0030.65NATOM2321CAGLYA27943.22919.03418.5461.0033.78CATOM2322CGLYA27942.11518.09918.9441.0038.10CATOM2323OGLYA27940.96318.51719.0681.0047.52OATOM2324NTHRA28042.41916.83919.1771.0029.44NATOM2325CATHRA28041.32815.99019.5871.0026.68CATOM2326CTHRA28040.88916.43920.9721.0023.52CATOM2327OTHRA28041.67017.06721.7131.0023.62OATOM2328CBTHRA28041.69514.49219.5401.0040.78CATOM2329OG1THRA28042.88914.27220.2961.0025.56OATOM2330CG2THRA28041.89314.05418.0951.0037.71CATOM2331NPROA28139.67216.06321.3461.0025.54NATOM2332CAPROA28139.12916.45422.6281.0025.72CATOM2333CPROA28139.77615.77823.8001.0026.02CATOM2334OPROA28139.75216.31424.9151.0022.68OATOM2335CBPROA28137.65015.99022.5591.0028.89CATOM2336CGPROA28137.41715.54021.2011.0029.39CATOM2337CDPROA28138.76115.13820.6461.0026.82CATOM2338NARGA28240.28114.56723.5871.0027.88NATOM2339CAARGA28240.80613.81724.7201.0034.08CATOM2340CARGA28241.97712.91824.3841.0027.62CATOM2341OARGA28241.91312.18223.4251.0023.83OATOM2342CBARGA28239.67613.01725.4051.0020.89CATOM2343CGARGA28240.03512.46726.7751.0022.81CATOM2344CDARGA28238.76211.92527.4421.0026.77CATOM2345NEARGA28238.96311.34528.7811.0036.48NATOM2346CZARGA28238.51810.13929.1641.0037.74CATOM2347NH1ARGA28237.8139.36028.3461.0028.45NATOM2348NH2ARGA28238.7549.70030.3841.0027.25NATOM2349NLYSA28343.01612.96325.2231.0028.91NATOM2350CALYSA28344.21712.17125.0511.0024.32CATOM2351CLYSA28344.79611.76626.4041.0029.57CATOM2352OLYSA28345.26212.62627.1381.0033.16OATOM2353CBLYSA28345.22613.00824.2871.0021.93CATOM2354CGLYSA28346.11112.25123.3161.0032.38CATOM2355CDLYSA28346.52613.17122.1431.0095.77CATOM2356CELYSA28345.71012.93720.8361.00100.00CATOM2357NZLYSA28346.41813.33219.5351.00100.00NATOM2358NLEUA28444.74710.46726.7341.0023.37NATOM2359CALEUA28445.3279.90527.9971.0016.08CATOM2360CLEUA28445.4638.38628.0471.0020.46CATOM2361OLEUA28444.6797.65527.4461.0025.45OATOM2362CBLEUA28444.64110.38729.2841.0016.30CATOM2363CGLEUA28443.3349.70029.7141.0025.97CATOM2364CD1LEUA28442.88110.08931.1521.0022.11CATOM2365CD2LEUA28442.2039.95328.6931.0023.92CATOM2366NLEUA28546.4537.93928.8201.0018.51NATOM2367CALEUA28546.7926.52729.0031.0016.77CATOM2368CLEUA28545.8805.86530.0061.0030.75CATOM2369OLEUA28545.5766.43931.0581.0022.02OATOM2370CBLEUA28548.2296.38929.5851.0015.85CATOM2371CGLEUA28549.3076.97028.6721.0021.51CATOM2372CD1LEUA28550.7036.70529.1221.0015.15CATOM2373CD2LEUA28549.0516.36827.3301.0016.94CATOM2374NASPA28645.5654.59929.7341.0026.62NATOM2375CAASPA28644.9453.72630.6981.0010.90CATOM2376CASPA28646.1283.05531.4981.0020.54CATOM2377OASPA28646.9912.37230.9381.0023.38OATOM2378CBASPA28644.0732.70229.9701.0014.65CATOM2379CGASPA28643.4091.69930.9431.0024.60CATOM2380OD1ASPA28643.9321.43732.0831.0024.60OATOM2381OD2ASPA28642.3161.23130.5831.0026.03OATOM2382NVALA28746.2303.31732.7911.0015.44NATOM2383CAVALA28747.3542.81633.5561.0015.58CATOM2384CVALA28746.9731.69534.5211.0016.48CATOM2385OVALA28747.6131.47335.5721.0016.63OATOM2386CBVALA28748.1014.00634.2601.0029.84CATOM2387CG1VALA28748.5345.08533.2241.0018.39CATOM2388CG2VALA28747.1734.67035.2581.0037.79CATOM2389NTHRA28845.9040.99234.1521.0022.27NATOM2390CATHRA28845.428−0.15234.9561.0019.34CATOM2391CTHRA28846.561−1.17735.2271.0027.47CATOM2392OTHRA28846.778−1.58636.3651.0024.87OATOM2393CBTHRA28844.288−0.90934.2441.0022.86CATOM2394OG1THRA28843.120−0.09634.1061.0024.84OATOM2395CG2THRA28843.916−2.11335.0241.0025.08CATOM2396NARGA28947.290−1.58534.1791.0026.08NATOM2397CAARGA28948.428−2.50634.3191.0016.92CATOM2398CARGA28949.405−2.03735.4081.0022.96CATOM2399OARGA28949.847−2.79036.2751.0023.03OATOM2400CBARGA28949.208−2.60732.9761.0012.43CATOM2401CGARGA28948.934−3.80432.1031.0029.39CATOM2402CDARGA28950.016−4.10231.0371.0025.88CATOM2403NEARGA28949.441−4.99630.0201.0017.26NATOM2404CZARGA28950.053−5.45928.9301.0038.82CATOM2405NH1ARGA28951.306−5.15328.6601.0013.51NATOM2406NH2ARGA28949.400−6.26228.0961.0037.68NATOM2407NLEUA29049.815−0.78635.3061.0026.60NATOM2408CALEUA29050.809−0.25436.2191.0025.42CATOM2409CLEUA29050.324−0.37637.6561.0024.17CATOM2410OLEUA29051.072−0.75938.5741.0019.94OATOM2411CBLEUA29051.0001.21935.8761.0024.66CATOM2412CGLEUA29052.2812.01936.0661.0024.67CATOM2413CD1LEUA29051.9923.47936.5041.0029.25CATOM2414CD2LEUA29053.4501.33536.7881.0015.82CATOM2415NHISA29149.0930.07537.8681.0030.10NATOM2416CAHISA29148.5130.07439.2121.0034.17CATOM2417CHISA29148.411−1.36739.7301.0043.41CATOM2418OHISA29148.621−1.65440.9291.0038.81OATOM2419CBHISA29147.1130.67439.1431.0028.01CATOM2420CGHISA29147.0972.15338.9841.0029.68CATOM2421ND1HISA29148.2422.92139.0151.0035.63NATOM2422CD2HISA29146.0683.02438.8551.0031.18CATOM2423CE1HISA29147.9264.19738.8451.0024.20CATOM2424NE2HISA29146.6124.28938.7471.0021.92NATOM2425NGLNA29248.048−2.26038.8211.0030.71NATOM2426CAGLNA29247.950−3.65439.1811.0034.82CATOM2427CGLNA29249.287−4.19739.6221.0036.93CATOM2428OGLNA29249.323−5.04040.5101.0027.56OATOM2429CBGLNA29247.322−4.48738.0691.0028.23CATOM2430CGGLNA29245.798−4.40538.1711.0081.15CATOM2431CDGLNA29245.023‘4.95436.9631.00100.00CATOM2432OE1GLNA29245.597−5.41035.9511.0099.65OATOM2433NE2GLNA29243.687−4.89537.0731.0040.86NATOM2434NLEUA29350.375−3.65839.0581.0031.75NATOM2435CALEUA29351.750−4.07239.3831.0022.67CATOM2436CLEUA29352.238−3.32340.6131.0028.64CATOM2437OLEUA29353.420−3.37741.0171.0022.27OATOM2438CBLEUA29352.665−3.76938.2051.0025.57CATOM2439CGLEUA29352.497−4.70337.0161.0035.11CATOM2440CD1LEUA29353.306−4.17035.8361.0028.25CATOM2441CD2LEUA29352.965−6.11037.4391.0047.81CATOM2442NGLYA29451.316−2.51041.1111.0033.08NATOM2443CAGLYA29451.488−1.79342.3471.0024.90CATOM2444CGLYA29452.272−0.51242.3261.0029.31CATOM2445OGLYA29453.070−0.24943.2231.0025.25OATOM2446NTRPA29552.0000.34741.3681.0027.83NATOM2447CATRPA29552.6871.62341.3851.0019.45CATOM2448CTRPA29551.6842.73141.0811.0025.79CATOM2449OTRPA29550.7652.52740.2971.0020.43OATOM2450CBTRPA29553.9611.61440.5241.0012.85CATOM2451CGTRPA29554.7502.91140.6181.0023.04CATOM2452CD1TRPA29555.8973.16141.3681.0023.68CATOM2453CD2TRPA29554.4154.15939.9791.0020.72CATOM2454NE1TRPA29556.2584.49341.2441.0018.67NATOM2455CE2TRPA29555.3895.11340.3731.0020.95CATOM2456CE3TRPA29553.4064.55039.1021.0021.47CATOM2457CZ2TRPA29555.3386.43939.9581.0017.58CATOM2456CZ3TRPA29553.4035.87338.6321.0021.57CATOM2459CH2TRPA29554.3686.78739.0581.0019.45CATOM2460NTYRA29651.7093.79741.8841.0025.17NATOM2461CATYRA29650.7204.88341.7311.0024.90CATOM2462CTYRA29651.5176.17841.8571.0030.85CATOM2463OTYRA29652.3636.27242.7451.0021.27OATOM2464CBTYRA29649.6544.81342.8401.0025.16CATOM2465CGTYRA29648.6853.65142.7441.0023.04CATOM2466CD1TYRA29649.0782.34343.0881.0031.62CATOM2467CD2TYRA29647.3803.85342.2891.0026.02CATOM2468CE1TYRA29648.2031.26842.9351.0024.42CATOM2469CE2TYRA29646.4932.77042.1271.0024.81CATOM2470CZTYRA29646.9021.48342.4641.0039.41CATOM2471OHTYRA29645.9840.43442.3371.0066.19OATOM2472NHISA29751.3247.12340.9241.0020.95NATOM2473CAHISA29752.1308.34340.9381.0026.86CATOM2474CHISA29751.9479.17542.2101.0035.01CATOM2475OHISA29750.8859.13242.8741.0026.92OATOM2476CBHISA29751.8199.19239.7331.0025.77CATOM2477CGHISA29750.4899.84239.8031.0031.16CATOM2478ND1HISA29749.3149.14539.6331.0034.21NATOM2479CD2HISA29750.13511.09440.1671.0025.83CATOM2480CE1HISA29748.2909.97239.7761.0024.14CATOM2481NE2HISA29748.76111.16440.0871.0023.35NATOM2482NGLUA29852.9839.92642.5541.0024.98NATOM2483CAGLUA29852.95710.68343.7981.0027.65CATOM2484CGLUA29852.83112.18743.7411.0036.86CATOM2485OGLUA29852.43312.79244.7181.0043.61OATOM2486CBGLUA29854.15310.31944.6861.0022.02CATOM2487CGGLUA29854.0048.94345.2851.0036.42CATOM2488CDGLUA29854.9998.66446.4061.00100.00CATOM2489OE1GLUA29856.2238.56146.1521.0044.79OATOM2490OE2GLUA29854.5268.47047.5471.00100.00OATOM2491NILEA29953.23212.80042.6391.0023.49NATOM2492CAILEA29953.26814.24442.5621.0013.25CATOM2493CILEA29952.01614.84841.9061.0027.05CATOM2494OILEA29951.68114.53040.7571.0026.73OATOM2495CBILEA29954.58614.71141.8621.0015.93CATOM2496CG1ILEA29955.83614.18342.6061.0023.83CATOM2497CG2ILEA29954.59616.21341.5411.0017.37CATOM2498CD1ILEA29957.23214.22141.7871.0021.32CATOM2499NSERA30051.32315.71642.6481.0018.55NATOM2500CASERA30050.17716.44942.0911.0019.58CATOM2501CSERA30050.71417.41541.0421.0017.29CATOM2502OSERA30051.82417.94141.1781.0021.06OATOM2503CBSERA30049.54217.30743.1811.0016.78CATOM2504OGSERA30050.54817.96943.9231.0075.80OATOM2505NLEUA30149.87017.75540.0751.0016.13NATOM2506CALEUA30150.24618.67539.0141.0017.70CATOM2507CLEUA30150.68919.96439.6461.0020.11CATOM2508OLEUA30151.71420.56839.3031.0020.46OATOM2509CBLEUA30148.99018.98138.1971.0017.92CATOM2510CGLEUA30149.18220.03037.1121.0025.15CATOM2511CD1LEUA30150.23319.55236.0861.0018.82CATOM2512CD2LEUA30147.85420.17736.4361.0025.88CATOM2513NGLUA30249.84520.39840.5541.0027.01NATOM2514CAGLUA30250.05321.63641.2801.0037.72CATOM2515CGLUA30251.41021.61841.9961.0029.99CATOM2516OGLUA30252.24522.51441.7981.0027.15OATOM2517CBGLUA30248.89921.84142.2751.0043.10CATOM2518CGGLUA30249.06123.06143.1741.0090.85CATOM2519CDGLUA30248.45124.32442.5801.00100.00CATOM2520OE1GLUA30247.56624.20941.7061.00100.00OATOM2521OE2GLUA30248.80825.43243.0361.0064.50OATOM2522NALAA30351.64620.59142.8011.008.72NATOM2523CAALAA30352.93720.45543.4591.0015.03CATOM2524CALAA30354.10220.35542.4501.0019.85CATOM2525OALAA30355.10421.09042.5531.0022.24OATOM2526CBALAA30352.93819.25844.4101.0018.97CATOM2527NGLYA30453.95319.47241.4671.0013.05NATOM2528CAGLYA30454.97019.32140.4481.008.94CATOM2529CGLYA30455.23920.62139.6951.0020.31CATOM2530OGLYA30456.39420.90039.3221.0014.30OATOM2531NLEUA30554.19121.38339.3611.0010.76NATOM2532CALEUA30554.48322.62238.6111.0020.29CATOM2533CLEUA30555.28123.66939.4561.0028.92CATOM2534OLEUA30556.19424.38538.9741.0017.69OATOM2535CBLEUA30553.20223.24538.0331.0024.03CATOM2536CGLEUA30552.35722.64736.8801.0027.66CATOM2537CD1LEUA30550.97523.38436.7891.0013.44CATOM2538CD2LEUA30553.07922.72435.5431.0018.39CATOM2539NALAA30654.90423.75740.7241.0019.94NATOM2540CAALAA30655.54424.66041.6551.0024.79CATOM2541CALAA30657.03524.38041.7431.0027.51CATOM2542OALAA30657.85225.28041.6621.0029.68OATOM2543CBALAA30654.93724.47143.0021.0017.87CATOM2544NSERA30757.37823.13742.0111.0018.46NATOM2545CASERA30758.79322.75642.1621.0016.31CATOM2546CSERA30759.54722.88540.8321.0022.66CATOM2547OSERA30760.74223.21240.7861.0028.47OATOM2548CBSERA30758.85121.30442.6221.0020.47CATOM2549OGSERA30758.51720.45441.5261.0029.03OATOM2550NTHRA30858.84922.63139.7351.0027.31NATOM2551CATHRA30859.45822.73838.4131.0022.89CATOM2552CTHRA30859.75724.21638.1071.0026.06CATOM2553OTHRA30860.81924.54637.5911.0029.89OATOM2554CBTHRA30858.53622.11537.3181.0018.72CATOM2555OG1THRA30858.35620.71437.5451.0020.17OATOM2556CG2THRA30859.09422.33035.9231.0012.37CATOM2557NTYRA30958.84625.11838.4531.0028.20NATOM2558CATYRA30959.11026.54938.2411.0031.09CATOM2559CTYRA30960.38327.05939.0451.0016.31CATOM2560OTYRA30961.17927.85838.5771.0016.91OATOM2561CBTYRA30957.81927.37338.5331.0031.19CATOM2562CGTYRA30957.94428.89538.3921.0014.57CATOM2563CD1TYRA30958.39729.45737.2241.0017.51CATOM2564CD2TYRA30957.57529.75739.4421.0024.99CATOM2565CE1TYRA30958.52730.80137.1001.0018.41CATOM2566CE2TYRA30957.74431.12939.3511.0019.04CATOM2567CZTYRA30958.21231.64138.1641.0029.13CATOM2568OHTYRA30958.30033.00437.9661.0028.22OATOM2569NGLNA31060.56026.57940.2601.0015.41NATOM2570CAGLNA31061.70526.96441.0871.0022.35CATOM2571CGLNA31063.00126.49240.4461.0031.46CATOM2572OGLNA31064.00927.19140.4421.0033.42OATOM2573CBGLNA31061.58726.33542.4821.0017.67CATOM2574CGGLNA31062.57926.92143.4611.0057.58CATOM2575CDGLNA31062.28728.37043.7821.0065.14CATOM2576OE1GLNA31061.13428.75444.0001.0041.94OATOM2577NE2GLNA31063.33029.19443.8011.0099.09NATOM2578NTRPA31162.95725.32139.8301.0028.76NATOM2579CATRPA31164.14624.82239.1631.0026.29CATOM2580CTRPA31164.47425.76938.0401.0017.91CATOM2581OTRPA31165.59926.19337.8801.0022.89OATOM2582CBTRPA31163.93823.38338.6431.0027.53CATOM2583CGTRPA31165.17622.78438.1191.0017.82CATOM2584CD1TRPA31166.13222.09038.8261.0020.21CATOM2585CD2TRPA31165.65222.88136.7841.0017.99CATOM2586NE1TRPA31167.19721.77637.9921.0020.39NATOM2587CE2TRPA31166.93322.28436.7461.0019.57CATOM2588CE3TRPA31165.14123.46135.6211.0020.26CATOM2589CZ2TRPA31167.68622.23635.5991.0014.25CATOM2590CZ3TRPA31165.90123.44634.5011.0018.59CATOM2591CH2TRPA31167.16922.83134.4941.0016.86CATOM2592NPHEA31263.46926.10937.2561.0017.47NATOM2593CAPHEA31263.66527.06436.1791.0020.14CATOM2594CPHEA31264.22428.37136.7331.0018.33CATOM2595OPHEA31265.08029.02436.1041.0024.76OATOM2596CBPHEA31262.32827.31835.4581.0029.51CATOM2597CGPHEA31262.32828.54434.6031.0028.52CATOM2598CD1PHEA31262.88328.50833.3381.0030.53CATOM2599CD2PHEA31261.82529.75835.1041.0029.31CATOM2600CE1PHEA31262.93629.66032.5541.0034.73CATOM2601CE2PHEA31261.90030.90434.3621.0038.40CATOM2602CZPHEA31262.43230.86033.0631.0040.73CATOM2603NLEUA31363.69728.78737.8761.0022.46NATOM2604CALEUA31364.17030.02538.5161.0028.47CATOM2605CLEUA31365.62729.82738.8981.0037.53CATOM2606OLEUA31366.45230.69338.6291.0034.20OATOM2607CBLEUA31363.37530.41039.7831.0020.44CATOM2608CGLEUA31361.95530.89739.5551.0016.29CATOM2609CD1LEUA31361.49931.39940.8711.0015.94CATOM2610CD2LEUA31361.95931.96138.5241.0014.44CATOM2611NGLUA31465.95328.68539.5081.0030.70NATOM2612CAGLUA31467.35328.43239.8751.0024.15CATOM2613CGLUA31468.29128.14938.7031.0036.34CATOM2614OGLUA31469.48528.04738.8901.0043.10OATOM2615CBGLUA31467.45927.36640.9471.0019.90CATOM2616CGGLUA31466.63427.75442.1411.0027.37CATOM2617CDGLUA31466.45026.66643.1821.0031.09CATOM2618OE1GLUA31467.15725.64843.0851.0059.60OATOM2619OE2GLUA31465.63426.87244.1251.0046.20OATOM2620NASNA31567.77828.11437.4791.0040.17NATOM2621CAASNA31568.63727.80236.3431.0037.76CATOM2622CASNA31568.38328.57835.1121.0043.75CATOM2623OASNA31568.59128.00134.0471.0039.15OATOM2624CBASNA31568.42526.36035.8841.0033.74CATOM2625CGASNA31569.02825.38336.8011.0053.18CATOM2626OD1ASNA31568.45625.08737.8351.0049.13OATOM2627ND2ASNA31570.23924.92636.4791.0097.72NATOM2628NGLNA31667.85229.80335.1971.0049.87NATOM2629CAGLNA31667.62730.55033.9571.0077.90CATOM2630CGLNA31668.79731.44833.5251.00100.00CATOM2631OGLNA31669.27231.38732.3751.0051.33OATOM2632CBGLNA31666.28031.27633.9021.0075.89CATOM2633CGGLNA31665.68331.58935.2311.0080.97CATOM2634CDGLNA31665.23333.03635.3501.0054.58CATOM2635OE1GLNA31664.88133.69934.3671.0046.46OATOM2636NE2GLNA31665.25733.53836.5661.0033.46NTER2637GLNA316CONECT110111CONECT111110112CONECT112111113114CONECT113112118CONECT114112115116CONECT115114CONECT116114117118CONECT117116129CONECT118113116CONECT120121CONECT121120122CONECT122121123124CONECT123122128CONECT124122125126CONECT125124CONECT126124127128CONECT127126CONECT128123126CONECT129117130131132CONECT130129CONECT131129CONECT132129MASTER208O11310O36263612225END


[0227] While various embodiments of the present invention have been described in detail, it is apparent that modifications and adaptations of those embodiments will occur to those skilled in the art. However, it is to be expressly understood that such modifications and adaptations are within the spirit and scope of the present invention, as set forth in the following claims.


Claims
  • 1. A method for producing ascorbic acid or esters thereof in a microorganism, comprising culturing a microorganism having a genetic modification to increase the action of an enzyme selected from the group consisting of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone dehydrogenase; and recovering said ascorbic acid or esters thereof.
  • 2. A method, as claimed in claim 1, wherein said genetic modification is a genetic modification to increase the action of an enzyme selected from the group consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone dehydrogenase.
  • 3. A method, as claimed in claim 1, wherein said genetic modification is a genetic modification to increase the action of an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose.
  • 4. A method, as claimed in claim 3, wherein said genetic modification is a genetic modification to increase the action of GDP-D-mannose:GDP-L-galactose epimerase.
  • 5. The method of claim 3, wherein said genetic modification comprises transformation of said microorganism with a recombinant nucleic acid molecule that expresses said epimerase.
  • 6. The method of claim 5, wherein said epimerase has a tertiary structure that substantially conforms to the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 7. The method of claim 5, wherein said epimerase has a structure having an average root mean square deviation of less than about 2.5 Å over at least about 25% of Cα positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 8. The method of claim 5, wherein said epimerase has a tertiary structure having an average root mean square deviation of less than about 1 Å over at least about 25% of Cα positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 9. The method of claim 5, wherein said epimerase comprises a substrate binding site having a tertiary structure that substantially conforms to the tertiary structure of the substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 10. The method of claim 9, wherein said substrate binding site has a tertiary structure with an average root mean square deviation of less than about 2.5 Å over at least about 25% of Cα positions of the tertiary structure of a substrate binding site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 11. The method of claim 5, wherein said epimerase comprises a catalytic site having a tertiary structure that substantially conforms to the tertiary structure of the catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 12. The method of claim 11, wherein said catalytic site has a tertiary structure with an average root mean square deviation of less than about 2.5 Å over at least about 25% of Cα positions of the tertiary structure of a catalytic site of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 13. The method of claim 11, wherein said catalytic site comprises the amino acid residues serine, tyrosine and lysine.
  • 14. The method of claim 13, wherein tertiary structure positions of said amino acid residues serine, tyrosine and lysine substantially conform to tertiary structure positions of residues Ser107, Tyr136 and Lys140, respectively, as represented by atomic coordinates in Brookhaven Protein Data Bank Accession Code 1bws.
  • 15. The method of claim 5, wherein said epimerase binds NADPH.
  • 16. The method of claim 5, wherein said epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in said amino acid sequence align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO:11.
  • 17. The method of claim 5, wherein said epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in said amino acid sequence align with 100% identity with at least about 75% of non-Xaa residues in SEQ ID NO:11.
  • 18. The method of claim 5, wherein said epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in said amino acid sequence align with 100% identity with at least about 90% of non-Xaa residues in SEQ ID NO:11.
  • 19. The method of claim 5, wherein said epimerase comprises an amino acid sequence having at least 4 contiguous amino acid residues that are 100% identical to at least 4 contiguous amino acid residues of an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO:10.
  • 20. The method of claim 5, wherein said recombinant nucleic acid molecule comprises a nucleic acid sequence comprising at least about 12 contiguous nucleotides having 100% identity with at least about 12 contiguous nucleotides of a nucleic acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9.
  • 21. The method of claim 5, wherein said epimerase comprises an amino acid sequence having a motif: Gly-Xaa-Xaa-Gly-Xaa-Xaa-Gly.
  • 22. The method of claim 5, wherein said recombinant nucleic acid molecule comprises a nucleic acid sequence that is at least about 15% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method with Lipman-Pearson standard default parameters.
  • 23. The method of claim 5, wherein said recombinant nucleic acid molecule comprises a nucleic acid sequence that is at least about 20% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID 5 NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method with Lipman-Pearson standard default parameters.
  • 24. The method of claim 5, wherein said recombinant nucleic acid molecule comprises a nucleic acid sequence that is at least about 25% identical to a nucleic acid sequence selected from the group consisting of SEQ ID NO:1, SEQ ID 5 NO:3, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9, as determined using a Lipman-Pearson method with Lipman-Pearson standard default parameters.
  • 25. The method of claim 5, wherein said recombinant nucleic acid molecule comprises a nucleic acid sequence that hybridizes under stringent hybridization conditions to a nucleic acid sequence encoding a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase.
  • 26. The method of claim 25, wherein said nucleic acid sequence encoding said GDP-4-keto-6-deoxy-D-mannose epimerase/reductase is selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3 and SEQ ID NO:5.
  • 27. The method of claim 25, wherein said GDP-4-keto-6-deoxy-D-mannose epimerase/reductase comprises an amino acid sequence selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6.
  • 28. A method, as claimed in claim 1, wherein said microorganism is selected from the group consisting of bacteria, fungi and microalgae.
  • 29. A method, as claimed in claim 1, wherein said microorganism is acid-tolerant.
  • 30. A method, as claimed in claim 1, wherein said microorganism is a bacterium.
  • 31. A method, as claimed in claim 30, wherein said bacterium is selected from the group consisting of Azotobacter and Pseudomonas.
  • 32. A method, as claimed in claim 1, wherein said microorganism is a fungus.
  • 33. A method, as claimed in claim 32, wherein said microorganism is a yeast.
  • 34. A method, as claimed in claim 33, wherein said yeast is selected from the group consisting of Saccharomyces yeast.
  • 35. A method, as claimed in claim 1, wherein said microorganism is a microalga.
  • 36. A method, as claimed in claim 35, wherein said microalga is selected from the group consisting of microalgae of the genera Prototheca and Chlorella.
  • 37. A method, as claimed in claim 36, wherein said microalga is selected from the genus Prototheca.
  • 38. A method, as claimed in claim 1, wherein said microorganism further comprises a genetic modification to decrease the action of an enzyme having GDP-D-mannose as a substrate, other than GDP-D-mannose:GDP-L-galactose epimerase.
  • 39. A method, as claimed in claim 38, wherein said genetic modification to decrease the action of an enzyme having GDP-D-mannose as a substrate, other than GDP-D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action of GDP-D-mannose-dehydrogenase.
  • 40. A method, as claimed in claim 1, wherein said microorganism is acid-tolerant and said step of culturing is conducted at a pH of less than about 6.0.
  • 41. A method, as claimed in claim 1, wherein said microorganism is acid-tolerant and said step of culturing is conducted at a pH of less than about 5.5.
  • 42. A method, as claimed in claim 1, wherein said microorganism is acid-tolerant and said step of culturing is conducted at a pH of less than about 5.0.
  • 43. A method, as claimed in claim 1, wherein said step of culturing is conducted in a fermentation medium that is magnesium (Mg) limited.
  • 44. A method, as claimed in claim 1, wherein said step of culturing is conducted in a fermentation medium that is Mg limited during a cell growth phase.
  • 45. A method, as claimed in claim 1, wherein said step of culturing is conducted in a fermentation medium that comprises less than about 0.5 g/L of Mg during a cell growth phase.
  • 46. A method, as claimed in claim 1, wherein said step of culturing is conducted in a fermentation medium that comprises less than about 0.2 g/L of Mg during a cell growth phase.
  • 47. A method, as claimed in claim 1, wherein said step of culturing is conducted in a fermentation medium that comprises less than about 0.1 g/L of Mg during a cell growth phase.
  • 48. A method, as claimed in claim 1, wherein said step of culturing is conducted in a fermentation medium that comprises a carbon source other than D-mannose.
  • 49. A method, as claimed in claim 1, wherein said step of culturing is conducted in a fermentation medium that comprises glucose as a carbon source.
  • 50. A microorganism for producing ascorbic acid or esters thereof, wherein said microorganism has a genetic modification to increase the action of an enzyme selected from the group consisting of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone dehydrogenase.
  • 51. A microorganism, as claimed in claim 50, wherein said genetic modification is a genetic modification to increase the action of an enzyme selected from the group consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone dehydrogenase.
  • 52. A microorganism, as claimed in claim 50, wherein said genetic modification is a genetic modification to increase the action of GDP-D-mannose:GDP-L-galactose epimerase.
  • 53. A microorganism, as claimed in claim 50, wherein said microorganism has been genetically modified to express a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein said epimerase has a tertiary structure having an average root mean square deviation of less than about 2.5 Å over at least about 25% of Ca positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 54. A microorganism, as claimed in claim 50, wherein said microorganism is selected from the group consisting of bacteria, fungi and microalgae.
  • 55. A microorganism, as claimed in claim 50, wherein said microorganism is a bacterium.
  • 56. A microorganism, as claimed in claim 55, wherein said bacterium is selected from the group consisting of Azotobacter and Pseudomonas.
  • 57. A microorganism, as claimed in claim 50, wherein said microorganism is a fungus.
  • 58. A microorganism, as claimed in claim 57, wherein said microorganism is a yeast.
  • 59. A microorganism, as claimed in claim 58, wherein said yeast is selected from the group consisting of Saccharomyces yeast.
  • 60. A plant for producing ascorbic acid or esters thereof, wherein said plant has a genetic modification to increase the action of an enzyme selected from the group consisting of hexokinase, glucose phosphate isomerase, phosphomannose isomerase, phosphomannomutase, GDP-D-mannose pyrophosphorylase, GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone dehydrogenase.
  • 61. A plant, as claimed in claim 60, wherein said genetic modification is a genetic modification to increase the action of an enzyme selected from the group consisting of GDP-D-mannose:GDP-L-galactose epimerase, GDP-L-galactose phosphorylase, L-galactose-1-P-phosphatase, L-galactose dehydrogenase, and L-galactono-γ-lactone dehydrogenase.
  • 62. A plant, as claimed in claim 60, wherein said genetic modification is a genetic modification to increase the action of GDP-D-mannose:GDP-L-galactose epimerase.
  • 63. A plant, as claimed in claim 60, wherein said plant has been genetically modified to express a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein said epimerase has a tertiary structure having an average root mean square deviation of less than about 2.5 Å over at least about 25% of Cα positions of the tertiary structure of a GDP-4-keto-6-deoxy-D-mannose epimerase/reductase represented by atomic coordinates having Brookhaven Protein Data Bank Accession Code 1bws.
  • 64. A plant, as claimed in claim 60, wherein said plant further comprises a genetic modification to decrease the action of an enzyme having GDP-D-mannose as a substrate other than GDP-D-mannose:GDP-L-galactose epimerase.
  • 65. A plant, as claimed in claim 60, wherein said genetic modification to decrease the action of an enzyme having GDP-D-mannose as a substrate other than GDP-D-mannose:GDP-L-galactose epimerase is a genetic modification to decrease the action of GDP-D-mannose-dehydrogenase.
  • 66. A plant, as claimed in claim 60, wherein said plant is a microalga.
  • 67. A plant, as claimed in claim 66, wherein said plant is selected from the group consisting of microalgae of the genera Prototheca and Chlorella.
  • 68. A plant, as claimed in claim 66, wherein said microalga is selected from the genus Prototheca.
  • 69. A plant, as claimed in claim 60, wherein said plant is a higher plant.
  • 70. A plant, as claimed in claim 60, wherein said plant is a consumable higher plant.
  • 71. A microorganism for producing ascorbic acid or esters thereof, wherein said microorganism has been genetically modified to express a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein said epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in said amino acid sequence align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO:11.
  • 72. A plant for producing ascorbic acid or esters thereof, wherein said plant has been genetically modified to express a recombinant nucleic acid molecule encoding an epimerase that catalyzes conversion of GDP-D-mannose to GDP-L-galactose, wherein said epimerase comprises an amino acid sequence that aligns with SEQ ID NO:11 using a CLUSTAL alignment program, wherein amino acid residues in said amino acid sequence align with 100% identity with at least about 50% of non-Xaa residues in SEQ ID NO:11.
CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] The present application claims priority under 35 U.S.C. §119(e) from U.S. Provisional Application Ser. No. 60/088,549, filed Jun. 8, 1998; from U.S. Provisional Application Ser. No. 60/125,073, filed Mar. 17, 1999; and from U.S. Provisional Application Ser. No. 60/125,054, filed Mar. 18, 1999. Each of U.S. Provisional Application Ser. Nos. 60/088,549, 60/125,073 and 60/125,054 is incorporated herein by reference in its entirety.

Provisional Applications (3)
Number Date Country
60088549 Jun 1998 US
60125073 Mar 1999 US
60125054 Mar 1999 US