Nitrogen fixation using refactored NIF clusters

Information

  • Patent Grant
  • 11479516
  • Patent Number
    11,479,516
  • Date Filed
    Wednesday, October 5, 2016
    8 years ago
  • Date Issued
    Tuesday, October 25, 2022
    2 years ago
Abstract
The invention relates to methods for promoting fixed nitrogen from atmospheric nitrogen, and related products. Endophytic bacteria having an exogenous nif cluster promote fixed nitrogen for cereal plants.
Description
BACKGROUND OF INVENTION

Availability of nitrogen is one of the principal elements limiting growth and development of crops, particularly in agricultural soils for plant production of food, feed, fiber and fuel. Excessive use of synthetic fertilizer to meet the food demands of growing population poses an environmental threat in that it can cause harmful algal blooms and disrupt beneficial soil microbial community [1]. On the other hand, over-farming in many developing countries with a scant supply of fertilizer damages agricultural land and makes small farmers suffer from the poor yield of their crops [2].


Successful endophytic colonization of plants by human-pathogenic bacteria such as Salmonella enterica, Pseudomonas aeruginosa, Burkholderia cepacia, Escherichia coli O157:H7 has been demonstrated [3-5]. Salmonella can recognize plants as a suitable host and colonize in root tissues of alfalfa and barley [6,7].


SUMMARY OF INVENTION

The invention, in various aspects, relates to a method for providing fixed nitrogen from atmospheric nitrogen, comprising delivering a modified bacteria having an exogenous nif cluster to a cereal plant, or to soil where a cereal plant or seed is growing or is to be planted, wherein the modified bacteria provides fixed nitrogen.


In some embodiments, the nif cluster is a native nif cluster. In some embodiments, the nif cluster is a refactored nif cluster.


In other embodiments, the modified bacteria is a gamma-proteobacteria. In some embodiments, the modified bacteria is a Salmonella typhimurium.


In some embodiments, the Salmonella typhimurium strain is selected from SL1344, LT2, and DW01.


In other embodiments, the modified bacteria is a E. coli, optionally of strain H7:0157.


In other embodiments, the nif cluster is a Klebsiella wild-type nif cluster, a Pseudomonas Stutzi nif cluster, or a paenibacillus cluster. In some embodiments, the nif cluster is a refactored nif clusters.


In some embodiments, the cereal plant is selected from wheat, rye, barley, triticale, oats, millet, sorghum, teff, fonio, buckwheat, quinoa, corn and rice.


In some embodiments, the invention further comprises an exogenous gene encoding a plant growth-stimulating peptide.


In some embodiments, the exogenous gene encoding the plant growth-stimulating peptide is regulated by a type 3 secretion system (T3SS).


In some embodiments, the plant growth stimulating peptide is directly delivered into root or stem tissues.


Aspects of the invention include a method, comprising delivering a modified non-pathogenic bacteria having exogenous genes for enabling plant endosymbiosis to a cereal plant, or to soil where a cereal plant or seed is growing or is to be planted.


In some embodiments, the non-pathogenic bacteria is E. coli.


In some embodiments, the genes encode effectors or apparatus for a secretion system.


In other embodiments, the apparatus for a secretion system is type 3 secretion system (T3SS).


Aspects of the invention include compositions comprising an agriculturally suitable or compatible carrier, and a gamma-proteobacteria having an exogenous nif cluster present on or in the agriculturally suitable or compatible carrier.


In some embodiments, the proteobacteria is a Salmonella typhimurium or E. coli.


In other embodiments, the nif cluster is a native nif cluster.


In some embodiments, the nif cluster is a refactored nif cluster.


In some embodiments, the invention further comprises an exogenous gene encoding a plant growth-stimulating peptide.


In some embodiments, the agriculturally suitable or compatible carrier is selected from the group consisting of seeds, seed coats, granular carriers, soil, solid carriers, liquid slurry carriers, and liquid suspension carriers. In other embodiments the agriculturally suitable carrier includes a wetting agents, a synthetic surfactant, a water-in-oil emulsion, a wettable powder, granules, gels, agar strips or pellets, thickeners, microencapsulated particles, or liquids such as aqueous flowables or aqueous suspensions.


In other embodiments the exogenous nif cluster or gene includes a controller. The controller may be a nucleic acid encoding an IPTG inducible T7 RNA polymerase. Alternatively the controller may be a partitioning system encoded by the two par operons (parCBA and parDE). In some embodiments the partitioning system is a RK2 par system.


A seed or seedling of a cereal plant having a modified bacteria associated with an external surface of the seed or seedling is provided in other aspects of the invention. In some embodiments the modified bacteria has an exogenous nif cluster.


In other aspects the invention is a cereal plant having a modified bacteria in the plant, wherein the modified bacteria has an exogenous nif cluster.


The nif cluster may be a native nif cluster or a refactored nif cluster. In some embodiments the nif cluster is a Klebsiella wild-type nif cluster, a Pseudomonas Stutzi nif cluster, or a paenibacillus cluster. In some embodiments the modified bacteria is a gamma-proteobacteria such as a Salmonella typhimurium, optionally a Salmonella typhimurium strain selected from SL1344, LT2, and DW01 or an E. coli, optionally of strain H7:0157.


The cereal plant in some embodiments is selected from wheat, rye, barley, triticale, oats, millet, sorghum, teff, fonio, buckwheat, quinoa, corn and rice.


Optionally the seed or seedling further includes an exogenous gene encoding a plant growth-stimulating peptide. The exogenous gene encoding the plant growth-stimulating peptide, in some embodiments, is regulated by a type 3 secretion system (T3SS).


In some embodiments the exogenous gene is in root or stem tissues of the cereal plant.


In some embodiments the modified bacteria may be provided in form of solutions, dispersions, sclerotia, gel, layer, cream, coating, or dip.


In some embodiments the plant, parts of plants or the area surrounding the plants is selected from leaf, seed, branches, soil, stems, roots. In some embodiments the modified bacteria is associated with (i.e. admixed, in physical contact with or present near) the plant, parts of plants or the area surrounding the plants or is incorporated therein. In some embodiments the seeds are inoculated or coated with the modified bacteria. In certain embodiments, the modified bacteria is disposed in an amount effective to be detectable within a target tissue of the mature agricultural plant selected from a fruit, a seed, a leaf, or a root, or portion thereof.


In other embodiments, the plant, the seed or seedling comprises at least about 100 CFU, for example, at least about 200 CFU, at least about 300 CFU, at least about 500 CFU, at least about 1,000 CFU, at least about 3,000 CFU, at least about 10,000 CFU, at least about 30,000 CFU, at least about 100,000 CFU or more, of the modified bacteria on its exterior surface.


In another embodiment, the modified bacteria is disposed on an exterior surface or within a tissue of the plant, the seed or seedling in an amount effective to be detectable in an amount of at least about 100 CFU, for example, at least about 200 CFU, at least about 300 CFU, at least about 500 CFU, at least about 1,000 CFU, at least about 3,000 CFU, at least about 10,000 CFU, at least about 30,000 CFU, at least about 100,000 CFU.


Each of the limitations of the invention can encompass various embodiments of the invention. It is, therefore, anticipated that each of the limitations of the invention involving any one element or combinations of elements can be included in each aspect of the invention. This invention is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the drawings. The invention is capable of other embodiments and of being practiced or of being carried out in various ways.





BRIEF DESCRIPTION OF DRAWINGS

The accompanying drawings are not intended to be drawn to scale. In the drawings, each identical or nearly identical component that is illustrated in various figures is represented by a like numeral. For purposes of clarity, not every component may be labeled in every drawing. In the drawings:



FIG. 1 shows nitrogenase activity in Salmonella strains. Nitrogenase activity of native and refactored nif clusters in diverse Salmonella strains were measured by acetylene reduction assay. Non-detectable ethylene production was indicated by asterisks.



FIG. 2 shows endophytic colonization of Zea mays B73 by enteric bacteria. Internal colonization of maize roots by either S. typhi ATCC 14028 or E. coli MG1655 (a control) was investigated. While there is no CFU of E. coli MG1655 from the crushed maize roots which is surface sterilized, S. typhi ATCC 14028 cells were recovered from inside the root tissues. Error bars represent standard deviation (n=6 for MG1655 and n=10 for ATCC14028).



FIG. 3 shows ethylene production in maize plant seedlings. The box plot shows the distribution of ethylene production between the plant seedlings inoculated with the engineered S. typhi ATCC 14028 and the control wild-type S. typhi ATCC14028 (no nif cluster). Dots represent ethylene production from individual plants in a group (n=33 (control), 39 (native nif), 39 (refactored nif)). The box extends from 25% to 75% quartile. The central line represents the median of the ethylene production in a group. The whiskers represent 75% quartile plus 1.5 times the interquartile range (upper whiskers) and 25% quartile minus 1.5 times the interquartile range (lower whiskers). Asterisk indicates statistically significant difference between the refactored nif and the control group (Student's t-test, ***P<0.0001).



FIGS. 4A and 4B are graphs showing the stability of genetic systems in the Salmonella strains obtained from the surface-sterilized roots. FIG. 4A shows a controller device composed of a sensor, T7 RNA polymerase and a selective marker showed no loss from the genome-based expression system. FIG. 4B shows the RK2 par system on the nif plasmid based on the pBBR1 origin of replication leads to an increase in the plasmid stability.



FIG. 5 shows a schematic of a controller for mini-Tn7 insertion (pR6K-T7RM).





DETAILED DESCRIPTION

Endophytic bacteria that are symbiotic with host plants can be genetically engineered to deliver proteins to the host and thereby regulate properties of plants. In non-cereal plants bacteria can be used to provide fixed nitrogen, reducing the need for nitrogen rich fertilizer. In cereal plants, however, bacterial systems for providing fixed nitrogen have never been developed despite many attempts over the years to develop such systems. A method for manipulating endophytic bacteria such that they are capable of providing fixed nitrogen to cereal plants has been discovered according to the invention. Endophytes may occupy the intracellular or extracellular spaces of plant tissue, including the leaves, stems, flowers, fruits, seeds, or roots.


The methods of the invention are useful for several purposes such as reducing fertilization needs, reducing fertilization pollution, providing an eco-friendly crop production, enhanced crop production, improved oil content in plants, improved protein content in plants, the reduction of nitrogen contamination of water, and the enrichment of the carbon content relative to nitrogen and carbon in relation to a soil's organic phase.


A limiting factor for crop productivity of agricultural crops is the nitrogen content in soil and water. The supply of this element has dwindled over time as crop demands increased. Nitrogen is one of the primary nutrients essential to all forms of life, including plants. However, nitrogen must first be converted to a form that plants can utilize. Biological Nitrogen Fixation (BNF) is the conversion of atmospheric nitrogen (N2) to ammonia (NH3) using the enzyme nitrogenase. This reaction consumes a tremendous amount of energy as N2 contains a triple bond. The bond energy in a nitrogen molecule is about 225 kcal/mol. Few BNFs are performed in nature as a result of a symbiotic relationship between plants and several bacterial species that make up a “nitrogenase enzymatic complex.”


The bacterial species that produce the nitrogenase enzymatic complex include diazotrophs such as cyanobacteria, azotobacteraceae, rhizobia, and frankia. However, only a few plant species can live in a symbiotic relationship with diazotrophs. For example, the pea plant from the legume family lives in symbiosis with bacteria from the rhizobia family. In particular, rhizobia bacteria penetrate the pea plant's roots creating root nodules that contain bacteria that fix nitrogen (to ammonia) while the plant donates carbon (sugar). Improving either the symbiosis, or extending the host range would therefore be beneficial for plant survival, but achieving this goal includes many challenges including the complexity of the process and lack of basic knowledge.


Biological nitrogen fixation is carried out by a complex of three proteins (nitrogenase), encoded by nifH, rufD and nifK, which are assembled and activated by an additional 17 genes [8]. Transferring a nif cluster to a new host is challenging because of the fact that the pathway is very sensitive to small changes in gene expression and the regulatory control in many organisms is not well established [8,9]. As shown in the Examples, a refactoring method was applied to a 16 gene nif cluster from Klebsiella oxytoca M5a1 to engineer a system for regulating nif. The method modularized the gene cluster into a set of well-characterized genetic parts. Refactoring can be used as a platform for large-scale part substitutions that facilitate the swapping of regulation to that which will function in a new host. Refactoring also is valuable in eliminating the response to signals that repress the native nif cluster, including ammonia and oxygen.


Quite surprisingly, it was discovered that nif clusters, both wild type and refactored nif, transferred into endophytic bacteria enable the bacteria to provide fixed nitrogen in cereal plants. This is the first demonstration that the transfer of native and synthetic nif clusters into endophytic bacteria can be used to provide fixed nitrogen to crops. The experiments presented in the Examples below demonstrate that genetic sensors connected to refactored nif clusters successfully regulated nitrogen fixation pathway at three different Salmonella strains in response to a chemical signal. The refactored nif clusters allows the testing of large populations of enteric bacteria isolated from plants for efficient symbiosis that delivers nitrogen to crops.


Synthetic nucleic acids encoding wild type and refactored nif clusters can be used to produce genetically modified bacteria. The modified bacteria useful according to the invention are endophytes which are endosymbionts. Endosymbionts do not cause apparent disease in plants for some or all of its life cycle. Bacterial endophytes may belong to a broad range of taxa, including α-Proteobacteria, β-Proteobacteria, γ-Proteobacteria, Firmicutes, and Actinobacteria. It is particularly preferred according to methods of the invention to use γ-Proteobacteria.


In some embodiments, examples of endophytic bacteria that are γ-Proteobacteria include but are not limited to Salmonella spp., Yersinia pestis, Vibrio cholerae, Pseudomonas aeruginosa, Escherichia coli, Xanthomonas axonopodis pv. citri and Pseudomonas syringae pv. actinidiae. In preferred embodiments γ-Proteobacteria include Salmonella and Escherichia coli.


The modified bacteria of the invention, are used to promote fixed nitrogen from atmospheric nitrogen. The term “plant” as used herein refers to cereal plants. The term includes all parts of a plant such as germinating seeds, emerging seedlings and vegetation including all below ground portions (such as the roots) and above ground portions. Cereals are the cultivated forms of grasses (Poaceae) and include for example wheat (inclusive spelt, einkorn, emmer, kamut, durum and triticale), rye, barley, rice, wild rice, maize (corn), millet, sorghum, teff, fonio and oats. The term cereal plants also includes pseudocereals, such as amaranth, quinoa and buckwheat.


Additionally, the modified bacteria can be genetically engineered to deliver other factors such as plant growth-stimulating peptides directly into root or stem tissues. For instance, genes expressing proteins that affect plants can be engineered into a type 3 secretion system (T3SS). Synthetic control will be able to be regulated by expressing of T3SS in bacteria. Methods of engineering bacteria in this manner are described in Widmaier, D. M. et al. [3].


Thus, the methods according to the invention can also involve genetically modifying bacteria to further treat the cereal plants. The term “genetically modified bacteria” refers to bacteria whose genetic material has been modified by the use of recombinant DNA techniques to include an inserted sequence of DNA that is not native to that bacterial genome or to exhibit a deletion of DNA that was native to that species' genome. Often, a particular genetically modified bacteria will be one that has obtained its genetic modification(s) by a recombinant DNA technique. Typically, one or more genes have been integrated into the genetic material of a genetically modified bacteria. The gene may be inserted into the T3SS region.


A nif cluster is a collection of genes encoding enzymes involved in the fixation of atmospheric nitrogen into a form of nitrogen available to living organisms. The primary enzyme encoded by the nif genes is the nitrogenase complex which is in charge of converting atmospheric nitrogen (N2) to other nitrogen forms such as ammonia which the organism can use for various purposes. Besides the nitrogenase enzyme, the nif genes also encode a number of regulatory proteins involved in nitrogen fixation. The nif genes are found in both free-living nitrogen-fixing bacteria and in symbiotic bacteria associated with various plants. The expression of the native nif genes are induced as a response to low concentrations of fixed nitrogen and oxygen concentrations (the low oxygen concentrations are actively maintained in the root environment of host plants). Refactored nif clusters can be designed to be regulated by exogenous factors and/or constitutively regulated.


As used herein, a “genetic cluster” refers to a set of two or more genes that encode gene products. A target, naturally occurring, or wild type genetic cluster is one which serves as the original model for the refactoring. In some embodiments, the gene products are enzymes. In some embodiments, the gene cluster that is refactored is the nif nitrogen fixation pathway.


Each genetic cluster is organized into transcriptional units which are composed of a plurality of modular units. A modular unit is a discreet nucleic acid sequence that is made up of one or more genetic components. A genetic component may include anything typically found in a genetic fragment. For instance a genetic component includes but is not limited to genes, regulatory elements, spacers, non-coding nucleotides. Some or all of these are found within each modular unit. Within the modular unit one or more of the synthetic regulatory elements may be genetically linked to one or more protein coding sequences of the genetic cluster.


While multiple modular units may be composed of the same gene and regulatory elements, the units may differ from one another in terms of the orientation, position, number etc. of the gene and regulatory elements. Other modular units may have some elements in common with other modular units but include some different elements. Yet other modular units may be completely distinct and do not overlap with other modular units. The great diversity of the modular units is what leads to the diversity of the assembled genetic clusters in a library.


The modular units within the genetic cluster are arranged such that the plurality of distinct non-naturally occurring genetic clusters are distinct from a naturally occurring genetic cluster based on the number, the order, and/or the orientation of particular genetic components. The number of genetic components within a modular unit may be easily varied. For instance, one modular unit may have a single promoter or terminator, whereas another modular unit may have 5 promoters and 2 terminators. The variation that may be achieved by manipulation of this factor is significant. Additionally the order of the components within a modular unit may be varied dramatically. Multiple sets of modular units may be generated where a single order of two components may be switched. This factor would also generate significant diversity. Switching the orientation of a component in the modular unit is also a viable way of generating diversity. While it may be expected that switching the orientation of one or more genetic components might interfere with functionality it has been demonstrated herein that genetic nif clusters having different orientations are actually functional.


The refactoring process involves several levels of restructuring genetic clusters. For example, the codons of essential genes in a genetic cluster, such as the nif cluster, are changed to create a DNA sequence divergent from the wild-type (WT) gene. This may be achieved through codon optimization. Recoded genes may be computationally scanned to identify internal regulators. These regulatory components may then be removed. They are organized into operons and placed under the control of synthetic parts (promoters, ribosome binding sites, and terminators) that are functionally separated by spacer parts. Finally, a controller consisting of genetic sensors and circuits that regulate the conditions and dynamics of gene expression may be added.


The genetic components in the refactored genetic cluster typically will include at least one synthetic regulatory element. A synthetic regulatory element is any nucleic acid sequence which plays a role in regulating gene expression and which differs from the naturally occurring regulatory element. It may differ for instance by a single nucleotide from the naturally occurring element. Alternatively it may include one or more non-natural nucleotides. Alternatively it may be a totally different element. In each case, it may be considered to be an exogenous regulatory element (i.e. not identical to the naturally occurring version). Thus, a “regulatory element” refers to a nucleic acid having nucleotide sequences that influence transcription or translation initiation or rate, or stability and/or mobility of a transcription or translation product. Regulatory regions include, without limitation, promoter sequences, ribosome binding sites, ribozymes, enhancer sequences, response elements, protein recognition sites, inducible elements, protein binding sequences, 5′ and 3′ untranslated regions (UTRs), transcriptional start sites, transcription terminator sequences, polyadenylation sequences, introns, and combinations thereof.


In some embodiments, the regulatory sequence will increase the expression of a gene. In other embodiments, the regulatory sequence will decrease the expression of a gene. In some embodiments the regulatory sequence may be a protein-binding sequence, for example a transcription factor binding site. In some embodiments, the regulatory sequence may be a polymerase-binding site. In some embodiments, the regulatory sequence is a terminator. The terminator may require an additional factor to indicated the end of the sequence for transcription, for example a rho-dependent terminator. In some embodiments, a regulatory sequence is a sequence that binds a ribosome, such as a ribosome-binding site (RBS). In some embodiments, the regulatory sequence indicates where translation will begin. It will be evident to one of ordinary skill in the art that regulatory sequences differ in their strength of regulation. For example, there exist strong promoter sequences, gene expression from which is higher than gene expression from a weak promoter sequence. Similarly, there exist strong RBS sequences that recruit and bind ribosomes with higher affinity than a RBS sequence that is characterized as weak. In some embodiments, the regulatory sequence may be an inducible or conditional regulatory sequence. In some embodiments, the regulatory sequence will exist 5′ or upstream of a protein-coding sequence. In other some embodiments, the regulatory sequence will exist 3′ or downstream of a protein-coding sequence. In still other embodiments, the regulatory sequence may be present within a protein-coding sequence. Any given protein-coding sequence may be regulated by one or more regulatory sequences. Non-limiting examples of regulatory sequences include the bacteriophage T7 promoter, sigma 70 promoter, sigma 54 promoter, lac promoter, rho-dependent terminator, stem-loop/rho-independent terminator.


“Exogenous” with respect to a nucleic acid indicates that the nucleic acid is part of a recombinant nucleic acid construct, or is not in its natural environment. For example, an exogenous nucleic acid can be a sequence from one species introduced into another species, i.e., a heterologous nucleic acid. Typically, such an exogenous nucleic acid is introduced into the other species via a recombinant nucleic acid construct. An exogenous nucleic acid also can be a sequence that is native to an organism and that has been reintroduced into cells of that organism. An exogenous nucleic acid that includes a native sequence can often be distinguished from the naturally occurring sequence by the presence of non-natural sequences linked to the exogenous nucleic acid, e.g., non-native regulatory sequences flanking a native sequence in a recombinant nucleic acid construct. In addition, stably transformed exogenous nucleic acids typically are integrated at positions other than the position where the native sequence is found. The exogenous elements may be added to a construct, for example using genetic recombination. Genetic recombination is the breaking and rejoining of DNA strands to form new molecules of DNA encoding a novel set of genetic information.


“Expression” refers to the process of converting genetic information of a polynucleotide into RNA through transcription, which is catalyzed by an enzyme, RNA polymerase, and into protein, through translation of mRNA on ribosomes.


Promoters may be constitutive or inducible. Examples of constitutive promoters include, without limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) [see, e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter, the dihydrofolate reductase promoter, the β-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1α promoter [Invitrogen].


Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have been described and can be readily selected by one of skill in the art. Examples of inducible promoters regulated by exogenously supplied promoters include the zinc-inducible sheep metallothionine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al, Proc. Natl. Acad. Sci. USA, 89:5547-5551 (1992)], the tetracycline-inducible system [Gossen et al, Science, 268:1766-1769 (1995), see also Harvey et al, Curr. Opin. Chem. Biol., 2:512-518 (1998)], the RU486-inducible system [Wang et al, Nat. Biotech., 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al, J. Clin. Invest., 100:2865-2872 (1997)]. Still other types of inducible promoters which may be useful in this context are those which are regulated by a specific physiological state, e.g., temperature, acute phase, a particular differentiation state of the cell, or in replicating cells only.


The regulatory elements may be in some instances tissue-specific. Tissue-specific regulatory sequences (e.g., promoters, enhancers, etc.) are well known in the art. Exemplary tissue-specific regulatory sequences include, but are not limited to the following tissue specific promoters: a liver-specific thyroxin binding globulin (TB G) promoter, an insulin promoter, a glucagon promoter, a somatostatin promoter, a pancreatic polypeptide (PPY) promoter, a synapsin-1 (Syn) promoter, a creatine kinase (MCK) promoter, a mammalian desmin (DES) promoter, a α-myosin heavy chain (a-MHC) promoter, or a cardiac Troponin T (cTnT) promoter. Other exemplary promoters include Beta-actin promoter, hepatitis B virus core promoter, Sandig et al., Gene Ther., 3:1002-9 (1996); alpha-fetoprotein (AFP) promoter, Arbuthnot et al., Hum. Gene Ther., 7:1503-14 (1996)), bone osteocalcin promoter (Stein et al., Mol. Biol. Rep., 24:185-96 (1997)); bone sialoprotein promoter (Chen et al., J. Bone Miner. Res., 11:654-64 (1996)), CD2 promoter (Hansal et al., J. Immunol., 161:1063-8 (1998); immunoglobulin heavy chain promoter; T cell receptor α-chain promoter, neuronal such as neuron-specific enolase (NSE) promoter (Andersen et al., Cell. Mol. Neurobiol., 13:503-15 (1993)), neurofilament light-chain gene promoter (Piccioli et al., Proc. Natl. Acad. Sci. USA, 88:5611-5 (1991)), and the neuron-specific vgf gene promoter (Piccioli et al., Neuron, 15:373-84 (1995)), among others which will be apparent to the skilled artisan.


In some instances the modular units or genetic clusters may be designed to lack in restriction recognition sites. Restriction endonucleases cleave DNA with extremely high sequence specificity and due to this property they have become indispensable tools in molecular biology and molecular medicine. Over three thousand restriction endonucleases have been discovered and characterized from a wide variety of bacteria and archae. Comprehensive lists of their recognition sequences and cleavage sites can be found at REBASE.


As used herein the term “isolated nucleic acid molecule” refers to a nucleic acid that is not in its natural environment, for example a nucleic acid that has been (i) extracted and/or purified from a cell, for example, an algae, yeast, plant or mammalian cell by methods known in the art, for example, by alkaline lysis of the host cell and subsequent purification of the nucleic acid, for example, by a silica adsorption procedure; (ii) amplified in vitro, for example, by polymerase chain reaction (PCR); (iii) recombinantly produced by cloning, for example, a nucleic acid cloned into an expression vector; (iv) fragmented and size separated, for example, by enzymatic digest in vitro or by shearing and subsequent gel separation; or (v) synthesized by, for example, chemical synthesis. In some embodiments, the term “isolated nucleic acid molecule” refers to (vi) an nucleic acid that is chemically markedly different from any naturally occurring nucleic acid. In some embodiments, an isolated nucleic acid can readily be manipulated by recombinant DNA techniques well known in the art. Accordingly, a nucleic acid cloned into a vector, or a nucleic acid delivered to a host cell and integrated into the host genome is considered isolated but a nucleic acid in its native state in its natural host, for example, in the genome of the host, is not. An isolated nucleic acid may be substantially purified, but need not be. For example, a nucleic acid that is isolated within a cloning or expression vector is not pure in that it may comprise only a small percentage of the material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term is used herein.


Methods to deliver expression vectors or expression constructs into cells are well known to those of skill in the art. Nucleic acids, including expression vectors, can be delivered to prokaryotic and eukaryotic cells by various methods well known to those of skill in the relevant biological arts. Methods for the delivery of nucleic acids to a cell in accordance to some aspects of this invention, include, but are not limited to, different chemical, electrochemical and biological approaches, for example, heat shock transformation, electroporation, transfection, for example liposome-mediated transfection, DEAE-Dextran-mediated transfection or calcium phosphate transfection. In some embodiments, a nucleic acid construct, for example an expression construct comprising a fusion protein nucleic acid sequence, is introduced into the host cell using a vehicle, or vector, for transferring genetic material. Vectors for transferring genetic material to cells are well known to those of skill in the art and include, for example, plasmids, artificial chromosomes, and viral vectors. Methods for the construction of nucleic acid constructs, including expression constructs comprising constitutive or inducible heterologous promoters, knockout and knockdown constructs, as well as methods and vectors for the delivery of a nucleic acid or nucleic acid construct to a cell are well known to those of skill in the art.


In one embodiment, a genetic clusters includes a nucleotide sequence that is at least about 85% or more homologous or identical to the entire length of a naturally occurring genetic cluster sequence, e.g., at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50% or more of the full length naturally occurring genetic cluster sequence). In some embodiments, the nucleotide sequence is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homologous or identical to a naturally occurring genetic cluster sequence. In some embodiments, the nucleotide sequence is at least about 85%, e.g., is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homologous or identical to a genetic cluster sequence, in a fragment thereof or a region that is much more conserved, such as an essential, but has lower sequence identity outside that region.


Calculations of homology or sequence identity between sequences (the terms are used interchangeably herein) are performed as follows. To determine the percent identity of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). The length of a reference sequence aligned for comparison purposes is at least 80% of the length of the reference sequence, and in some embodiments is at least 90% or 100%. The nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein nucleic acid “identity” is equivalent to nucleic acid “homology”). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences.


In many cases the nucleic acids described herein having naturally occurring nucleotides and are not modified. In some instances, the nucleic acids may include non-naturally occurring nucleotides and/or substitutions, i.e. Sugar or base substitutions or modifications.


One or more substituted sugar moieties include, e.g., one of the following at the 2′ position: OH, SH, SCH3, F, OCN, OCH3OCH3, OCH3O(CH2)n CH3, O(CH2)n NH2 or O(CH2)n CH3 where n is from 1 to about 10; Ci to C10 lower alkyl, alkoxyalkoxy, substituted lower alkyl, alkaryl or aralkyl; Cl; Br; CN; CF3; OCF3; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; SOCH3; SO2 CH3; ONO2; NO2; N3; NH2; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; substituted silyl; an RNA cleaving group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties of a nucleic acid; or a group for improving the pharmacodynamic properties of a nucleic acid and other substituents having similar properties. Similar modifications may also be made at other positions on the nucleic acid, particularly the 3′ position of the sugar on the 3′ terminal nucleotide and the 5′ position of 5′ terminal nucleotide. Nucleic acids may also have sugar mimetics such as cyclobutyls in place of the pentofuranosyl group.


Nucleic acids can also include, additionally or alternatively, nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include adenine (A), guanine (G), thymine (T), cytosine (C) and uracil (U). Modified nucleobases include nucleobases found only infrequently or transiently in natural nucleic acids, e.g., hypoxanthine, 6-methyladenine, 5-Me pyrimidines, particularly 5-methylcytosine (also referred to as 5-methyl-2′ deoxycytosine and often referred to in the art as 5-Me-C), 5-hydroxymethylcytosine (HMC), glycosyl HMC and gentobiosyl HMC, isocytosine, pseudoisocytosine, as well as synthetic nucleobases, e.g., 2-aminoadenine, 2-(methylamino)adenine, 2-(imidazolylalkyl)adenine, 2-(aminoalklyamino)adenine or other heterosubstituted alkyladenines, 2-thiouracil, 2-thiothymine, 5-bromouracil, 5-hydroxymethyluracil, 5-propynyluracil, 8-azaguanine, 7-deazaguanine, N6 (6-aminohexyl)adenine, 6-aminopurine, 2-aminopurine, 2-chloro-6-aminopurine and 2,6-diaminopurine or other diaminopurines. See, e.g., Kornberg, “DNA Replication,” W. H. Freeman & Co., San Francisco, 1980, pp 75-′7′7; and Gebeyehu, G., et al. Nucl. Acids Res., 15:4513 (1987)). A “universal” base known in the art, e.g., inosine, can also be included.


In the context of the present disclosure, hybridization means base stacking and hydrogen bonding, which may be Watson-Crick, Hoogsteen or reversed Hoogsteen hydrogen bonding, between complementary nucleoside or nucleotide bases. For example, adenine and thymine are complementary nucleobases which pair through the formation of hydrogen bonds. Complementary, as the term is used in the art, refers to the capacity for precise pairing between two nucleotides. For example, if a nucleotide at a certain position of an nucleic acid is capable of hydrogen bonding with a nucleotide at the same position of a second nucleic acid, then the two nucleic acids are considered to be complementary to each other at that position. The nucleic acids are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleotides that can hydrogen bond with each other through their bases. Thus, “specifically hybridizable” and “complementary” are terms which are used to indicate a sufficient degree of complementarity or precise pairing such that stable and specific binding occurs between the nucleic acids. 100% complementarity is not required.


Various aspects of the embodiments described above may be used alone, in combination, or in a variety of arrangements not specifically discussed in the embodiments described in the foregoing and is therefore not limited in its application to the details and arrangement of components set forth in the foregoing description or illustrated in the drawings. For example, aspects described in one embodiment may be combined in any manner with aspects described in other embodiments.


Use of ordinal terms such as “first,” “second,” “third,” etc., in the claims to modify a claim element does not by itself connote any priority, precedence, or order of one claim element over another or the temporal order in which acts of a method are performed, but are used merely as labels to distinguish one claim element having a certain name from another element having a same name (but for use of the ordinal term) to distinguish the claim elements.


The present invention is further illustrated by the following Examples, which in no way should be construed as further limiting. The entire contents of all of the references (including literature references, issued patents, published patent applications, and co pending patent applications) cited throughout this application are hereby expressly incorporated by reference.


As shown in the examples, the refactoring approach has been applied to the nif gene cluster from Klebsiella oxytoca encoding the nitrogen fixation pathway for converting atmospheric N2 to ammonia. The native gene cluster consists of 20 genes in seven operons and is encoded in 23.5 kb of DNA. The refactored gene cluster may share little DNA sequence identity with the wild type (WT).


When the nif cluster is a native nif cluster, it may have the DNA sequence of any naturally occurring nif cluster. For example it may have the sequence of a naturally occurring nif cluster from Klebsiella oxytoca (SEQ ID NO. 4) Pseudomonas stutzi nif cluster (SEQ ID NO. 5) and paenibacillus nif cluster. Refactored nif clusters may be any refactored nif cluster which is active in producing the proteins involved in promoting N2 conversion to other nitrogen forms.


The following exemplary DNA sequences of nif clusters are useful according to the invention.










refactored nif cluster v1.0  



(SEQ ID NO. 1)



taatacgactcactatagggagaacaataaactaacataaggaggataaatatgaccatgcgtcagtgcgcgatt






tatggcaaaggtggtattggcaaaagcacgacgacccagaacttggtggcggccctggccgagatgggtaaaaag





gttatgattgtgggttgcgacccgaaggccgacagcacgcgcctgattctgcacgcgaaagcacaaaacacgatt





atggagatggctgccgaggttggtagcgtggaggatctggagctggaggacgttctgcaaattggttacggtgat 





gttcgttgcgcagagagcggtggtccggaaccaggtgtcggctgtgcgggtcgtggtgtaattaccgctatcaat 





ttcctggaagaagagggtgcgtacgaagatgatctggatttcgttttctacgatgtgctgggtgatgtcgtgtgc 





ggtggttttgcaatgccgattcgcgagaataaggcacaagaaatttacattgtctgtagcggcgagatgatggca 





atgtacgctgctaacaacatcagcaagggtattgttaaatacgcaaaaagcggtaaggttcgcttgggtggtttg 





atttgcaacagccgtcagaccgaccgtgaggacgaactgatcatcgccctggctgagaaactgggcacccaaatg 





atccacttcgtgccacgcgataatattgttcaacgtgcagaaatccgccgtatgaccgtcattgagtatgacccg 





gcatgcaagcaagcgaacgagtaccgcaccttggcacagaaaatcgtgaacaacaccatgaaggttgttccgacg 





ccgtgtacgatggacgagctggagagcctgctgatggagttcggcattatggaggaggaggacaccagcattatc 





ggtaagaccgcagcggaggagaatgcggcataagcgtgcgtacaccttaatcaccgcttcatgctaaggtcctgg 





ctgcatgcaaaaattcacatccctatctagcggaggagccggatgatgactaatgctactggcgaacgtaacctg 





gcactgattcaagaagtactggaagtgttcccggaaaccgcgcgcaaagagcgccgtaaacacatgatggtttct 





gacccggaaatggaatctgtgggtaaatgcatcatctctaatcgcaaatctcagccgggtgtcatgactgttcgt 





ggctgtgcgtacgcaggttctaaaggtgtcgtattcggcccgatcaaagatatggcgcatatctctcatggcccg 





gtaggctgtggccagtactctcgcgcgggacgtcgtaactactacacggvcgtttctggcgttgactctttcggc 





acgctgaacttcacctctgacttccaggaacgtgacatcgttttcggtggcgataaaaagctgtccaaactgatc 





gaagaaatggaactgctgttcccgctgactaaaggcattactatccaaagcgaatgtccggtgggtctgatcggt 





gatgacatcagcgcggtcgcaaacgcatcttccaaagccctggataagccggtgatcccggttcgttgcgagggc 





ttccgcggcgtttctcagtctctgggtcatcacatcgcaaacgatgttgtgcgtgactggattctgaacaaccgt 





gaaggtcagccttttgaaaccaccccttatgacgttgcgattattggcgactataacatcggcggcgacgcctgg 





gcatcccgcatcctgctggaggagatgggtctgcgtgttgtcgcacagtggtctggcgatggcaccctggttgaa 





atggaaaacaccccgtttgttaaactgaacctggttcactgctaccgctccatgaactacattgcccgtcacatg 





gaagaaaaacatcagatcccttggatggaatacaacttcttcggtccgactaaaatcgcagaatccctgcgtaaa 





atcgccgatcagtttgatgataccattcgcgcgaacgctgaagcagtaattgcgcgctacgaaggccagatggca 





gcaatcattgctaagtaccgtccgcgcctggaaggtcgtaaagtgctgctgtacatgggtggtctgcgtccacgt 





catgtgatcggtgcctacgaggacctgggcatggagatcatcgcagcgggttacgaatttgcacacaacgacgac 





tatgatcgtacgctgccagacctgaaagaaggtacgctgctgtttgacgacgccagctcttatgaactggaagcc 





ttcgtgaaagcgctgaaaccagacctgatcggctccggcatcaaggaaaaatacattttccagaaaatgggcgtg 





ccgttccgccagatgcactcctgggactactccggtccgtaccacggctacgacggtttcgctatcttcgctcgt 





gacatggatatgaccctgaataacccagcgtggaatgaactgaccgcaccgtggctgaaatctgcataacaaaca 





ccccatgtcgatactgaacgaatcgacgcacactcccttccttgcaatctcatactctcaaaaattaggcgaggt 





aacatgtctcaaactatcgataaaatcaactcttgttacccgctgttcgagcaggacgaatatcaggaactgttc 





cgtaacaaacgtcagctggaagaagcgcacgacgcacagcgcgtgcaggaagtgttcgcatggaccaccaccgcg 





gaatacgaagctctgaacttccagcgcgaagccctgacggttgatccggcgaaagcgtgccagcctctgggtgcg 





gttctgtgcagcctgggttttgccaacaccctgccgtatgtccacggttcccagggctgcgtagcctacttccgt 





acctatttcaaccgccactttaaagaaccaatcgcgtgcgtgtccgacagcatgacggaggacgcggcagttttc 





ggtggtaacaacaacatgaacctgggcctgcaaaatgcttccgcactgtacaaaccggaaatcatcgcagtgtct 





accacctgcatggcagaggttattggtgatgatctgcaagcatttattgccaacgcaaagaaagacggtttcgtt 





gacagctctatcgcggttccgcacgctcataccccgtccttcatcggttctcacgtaactggttgggacaacatg 





ttcgaaggcttcgcaaaaacttttaccgcagactatcaaggccaaccgggtaaactgccgaagctgaacctggtg 





accggctttgaaacctacctgggcaactttcgtgtcctgaagcgcatgatggagcagatggcggttccgtgttct 





ctgctgtctgacccgtctgaggttctggacactccagcggacggccactatcgcatgtattctggtggcaccact 





cagcaggaaatgaaagaggccccagacgcgattgacaccctgctgctgcaaccgtggcagctgctgaaaagcaag 





aaagttgttcaggaaatgtggaaccagccggcaacggaagttgcaatcccgctgggtctggcagctactgacgaa 





ctgctgatgaccgtgtcccaactgagcggcaaaccaatcgcggatgctctgaccctggaacgcggtcgcctggtg 





gacatgatgctggacagccacacgtggctgcatggcaagaaatttggcctgtacggtgacccggacttcgtaatg 





ggcctgacccgtttcctgctggaactgggctgcgagccgactgttatcctgtctcacaacgctaacaaacgttgg 





cagaaggccatgaacaaaatgctggatgcgagcccatacggccgtgatagcgaagtgttcatcaactgcgacctg 





tggcatttccgctctctgatgtttacgcgtcagccggatttcatgatcggtaactcttacggcaaattcatccag 





cgtgacactctggccaaaggcaaagcgtttgaagtgccgctgattcgtctgggctttccgctgttcgaccgtcac 





cacctgcaccgccagaccacctggggttacgaaggcgcgatgaacatcgtaactactctggtaaacgcagtactg 





gaaaagctggacagcgatacttcccagctgggcaaaaccgactattctttcgatctggttcgttaacctgattgt 





atccgcatctgatgctaccgtggttgagttaccatactcactcccggaggtacttctatgtctgacaatgatacc 





ctgttttggcgcatgctggcgctgtttcagtcgctgccggatttgcagccggctcaaatcgtcgattggctggcg 





caggaatccggcgaaaccctgacgccggagcgccttgccaccctgacccaaccgcaactcgcggcgtcgttccca 





tccgcgacggcagtgatgagcccggctcgctggagccgcgttatggcttctctgcaaggcgccctcccagcccac 





ttgcgcatcgtacgtccggcgcagcgtaccccgcaactgctcgccgcgttttgcagccaagacggccttgttatc 





aatggtcatttcggccagggtcgtctgttcttcatttacgcctttgacgagcagggcggctggctgtatgacttg 





cgccgctatccgagcgcaccgcaccagcaggaagcgaatgaggtgcgtgctcgtctgattgaagattgccagctg 





ctgttctgccaggagattggcggtccggcagcagcgcgtctgatccgccaccgcatccatccgatgaaggcgcag 





ccgggtactacgattcaggcgcagtgtgaagctatcaacaccctgctggccggtcgcctgccgccgtggctcgcc 





aaacgtttgaaccgtgataacccgctggaagagcgtgtgttttaacatttttgccttgcgacagacctcctactt 





agattgccacactattcaatacatcactggaggttattacaaatgaagggtaacgagattcttgctctgctggac 





gaaccggcctgtgaacacaaccataaacagaaatccggctgtagcgccccaaagccgggtgcgacggcggctggc 





tgcgctttcgatggtgcgcagatcaccctgctcccgattgcggacgttgcccacctcgtgcatggcccaatcggt 





tgcgcaggtagctcttgggacaaccgtggcagcgcctccagcggtccgaccctgaatcgtttgggctttaccact 





gacttgaatgaacaagatgtgatcatgggtcgcggcgagcgtcgcctgttccacgctgtgcgccatattgtcacc 





cgttaccacccagcggcagtattcatctacaatacgtgcgtgccggctatggaaggcgatgacctggaggccgtg 





tgtcaggcagcccagactgcgaccggcgtcccggtaatcgcaattgatgcggctggcttctacggttcgaagaac 





ctgggcaaccgtccggcaggcgatgtcatggttaaacgcgtcattggccaacgtgagccagcgccgtggccggag 





agcaccctgtttgccccggagcaacgtcatgacattggcttgatcggtgagttcaacattgcgggcgagttttgg 





cacattcagccgctgcttgatgagctgggtatccgcgttttgggttcgctcagcggcgatggtcgtttcgccgag 





attcaaaccatgcaccgtgcccaggcgaacatgctggtgtgcagccgtgctctgatcaatgttgcgcgtgctctg 





gaacagcgctatggcaccccgtggtttgaaggctcgttctatggtatccgcgcgaccagcgacgccctgcgccag 





ttagcggcgctgctgggcgatgacgacctccgtcagcgcaccgaggcgctgatcgcgcgtgaagaacaggcggct 





gagctggccctgcaaccgtggcgtgaacagctgcgtggccgcaaggccctgctctacacgggtggtgtcaaaagc 





tggtctgtggtgtccgcgcttcaggatctgggtatgaccgtggttgccacgggcacgcgtaagagcacggaagag 





gataaacagcgcatccgcgaattgatgggcgaagaggccgtgatgcttgaagaaggcaacgcacgtaccttattg 





gatgtagtttatcgctatcaagcagacctgatgattgccggtggccgcaacatgtataccgcctacaaagcgcgc 





ttgccgttcctggacatcaaccaggaacgcgagcacgcgtttgcgggctaccaaggcatcgtgaccttagcgcgc 





cagctgtgccaaacgattaacagcccgatctggccgcagactcattcccgcgcaccgtggcgctaatgtcacgct 





aggaggcaattctataagaatgcacactgcacctaaacctaccacacctggaagaagtaattatggcagacattt 





tccgcactgataagccgttggctgtgtcgccgatcaagaccggccagccgctgggtgcgatcctggcgtccctgg 





gtatcgagcactcgattccgctggtacatggcgcgcagggctgttcggcttttgccaaggttttctttatccagc 





acttccacgatccggtcccgctgcaaagcacggcaatggacccgaccagcaccatcatgggcgctgatggtaaca 





tcttcaccgcgctggacactctctgccaacgcaataacccgcaagcaattgtgctgctgagcaccggcctctccg 





aggcgcagggcagcgacatttcccgtgtagtgcgtcagttccgtgaagaatatccgcgtcataaaggcgtggcga 





ttctgactgttaacaccccggacttttacggtagcatggagaacggcttttccgctgtcctggagtctgtgattg 





aacagtgggttccgccagccccacgtccggcgcagcgcaatcgtcgcgtcaatcttttggtgagccatctctgta 





gcccaggcgatattgagtggctgcgccgttgcgtcgaggccttcggtctgcaaccgatcattctgccggatctgg 





ctcagagcatggacggccaccttgctcagggtgacttttcgccgctgacgcagggcggcacgccgttgcgccaaa 





tcgagcagatgggccagagcctttgctcttttgcgattggcgtcagcctgcaccgtgcgagcagcctgctggctc 





cgcgttgtcgtggcgaagtcatcgccttgccgcacctcatgaccttggaacgctgcgacgcctttatccatcagt 





tggcgaaaatcagcggtcgcgccgttccggagtggctggaacgccagcgcggtcagctgcaagacgccatgatcg 





attgccacatgtggctgcaaggccagcgcatggcgattgccgccgaaggcgacctgctggcagcgtggtgcgatt 





tcgcgaactctcaaggtatgcagccgggtccactggttgctccgacgggtcatccgagcctgcgtcagttgccgg 





tggagcgcgtggtgccgggtgatctggaggatcttcagaccctcttatgcgcacatccggccgacttactggtgg 





cgaactcccacgcccgtgatttagcagagcaattcgccctgccgctggtgcgcgcaggcttcccgctgtttgaca 





aactgggcgaatttcgtcgtgttcgccagggttatagcggtatgcgtgataccctgttcgagttggcgaacctga 





tccgtgaacgccatcatcatctggctcattatcgcagcccgctgcgccagaacccagaatcctcgttgtctacgg 





gtggcgcgtacgcagcggattaactagagattaaagaggagaaattaagcatgaaaactatggacggtaacgctg 





cggctgcatggattagctacgcctttaccgaagtggctgcgatctacccgattacgccgagcaccccgatggcgg 





aaaatgtggacgaatgggctgcgcagggcaagaagaacctcttcggccagccggtgcgcctgatggagatgcagt 





cggaagcgggtgcagcaggtgctgtgcatggcgccttgcaagctggcgcactgacgaccacctacaccgcgtcgc 





agggcctgttgctgatgatcccaaacatgtacaaaatcgcgggtgaactgctgccgggtgtctttcatgtttcgg 





cacgcgcactggccaccaatagcctcaacatctttggcgatcatcaggatgtaatggcggtgcgccaaacgggct 





gcgcgatgttggccgagaataacgtccagcaagttatggatttgtccgcggtagcccacttggcagcgatcaaag 





gtcgcattccgttcgtgaacttcttcgatggctttcgcaccagccacgaaatccagaagatcgaggttctggaat 





atgaacagctggccaccttgttggatcgtccggccctggacagcttccgccgtaacgcccttcacccggaccacc 





cggtcatccgtggcaccgcccagaacccggacatctacttccaggaacgtgaggccggtaaccgtttctatcagg 





cgctcccggatattgtggaatcttacatgacccagatttctgccctgactggtcgcgagtatcacctgtttaact 





acactggtgctgcggatgcggagcgcgtgatcatcgcgatgggctctgtctgtgacaccgtccaagaggtggttg 





acacgctgaatgcagcgggtgagaaagttggtctgctctccgttcatcttttccgcccgttttcgttagcgcact 





tcttcgcccaactgccgaaaactgtacagcgtatcgcagtattggaccgtacgaaagagccaggtgctcaagcag 





agccgctgtgcctcgatgtgaagaatgccttttaccaccatgacgatgccccgttgattgtgggtggtcgctatg 





ccttgggcggtaaggacgtgttgccgaacgatattgcggccgtgtttgataacctgaacaaaccgctgccgatgg 





acggcttcacgctgggtatcgtggacgatgttaccttcacctctctcccgccagcgcagcagaccctggcggttt 





ctcacgacggcatcacggcatgtaagttttggggcatgggctccgacggcacggttggtgcgaacaagtccgcga 





tcaagattatcggcgacaaaacgccactgtatgcgcaagcgtacttttcctacgactcgaagaagagcggtggta 





ttaccgtcagccatctgcgttttggtgatcgcccgatcaactccccgtatttgatccatcgcgcggatttcatct 





cgtgcagccagcaaagctatgttgaacgctacgatctgctggatggccttaaaccgggtggcacctttctgctga 





actgctcctggagcgatgccgaactggagcaacatctgccggtcggtttcaaacgttatctggcacgcgagaata 





tccacttctacactctcaacgctgtggacatcgcccgtgagcttggtttgggtggccgtttcaacatgctgatgc 





aggctgccttcttcaaactggccgcgatcattgacccgcagactgctgcggactatctgaagcaggctgttgaga 





aaagctatggcagcaaaggtgcggcggtcatcgagatgaaccagcgtgccatcgagcttggcatggccagcctgc 





accaggtgacgatcccggcacattgggccaccctggatgagccagcggcgcaggcgtccgcgatgatgccggact 





ttatccgcgacatcctgcaaccgatgaaccgtcagtgcggcgaccagcttccggtgtcggcttttgtcggcatgg 





aagatggcaccttcccgtccggcacggccgcatgggagaaacgtggcatcgcccttgaggtgccagtctggcagc 





cggaaggctgcacgcagtgcaaccagtgcgccttcatttgtccgcacgccgcgattcgtccggcgttgttgaatg 





gcgaagagcatgatgctgccccggttggcctgctgagcaaaccggcacaaggcgctaaagaatatcactatcatc 





tggcgattagcccgctggactgctccggctgtggcaactgcgttgacatttgtccagctcgtggcaaagcgttga 





agatgcagtctctggatagccaacgccagatggctccggtgtgggattatgcgctggcgctgaccccgaagtcta 





acccgtttcgtaaaaccaccgtcaaaggctcgcagttcgaaaccccgctgctggagtttagcggtgcgtgcgctg 





gttgtggcgaaacgccgtatgcgcgcctcattacccagctgtttggcgaccgcatgctgattgccaatgccaccg 





gctgttccagcatctggggcgcatctgcgccgagcatcccgtataccaccaatcatcgtggtcatggtccggcct 





gggcgaatagcctgtttgaggacaatgccgaatttggtttaggtatgatgctgggcggtcaagctgtgcgtcaac 





agatcgcggacgatatgacggctgcgttagcgctcccggtttccgatgagctgagcgacgcgatgcgccagtggt 





tggcgaaacaggacgagggtgaaggcacgcgtgagcgtgcggaccgtctgagcgagcgcttagccgcggagaaag 





agggcgttccgctgttagagcagctgtggcaaaatcgtgattactttgtgcgtcgcagccagtggattttcggcg 





gtgacggctgggcctatgatattggcttcggtggcctggaccacgtcctcgccagcggtgaggatgtgaacattc 





tggtatttgacaccgaagtctactcgaacaccggcggtcaaagcagcaaatcgaccccggtcgcggccatcgcca 





agttcgcggctcagggcaagcgcacccgcaagaaagacctgggtatgatggcgatgagctacggcaacgtctatg 





tagcccaggtggcgatgggtgcggataaagatcaaactctgcgcgccattgcggaagctgaagcgtggccaggcc 





cgtcgctggtgattgcgtatgcggcctgcatcaatcatggcctgaaggccggtatgcgttgcagccaacgtgagg 





cgaagcgcgctgttgaggcgggctactggcacctgtggcgttatcacccgcagcgcgaagcggaaggcaagacgc 





cgtttatgttagatagcgaagaaccggaagagtcgttccgtgactttctgttgggtgaggtgcgctacgcatccc 





tgcacaagaccaccccgcacctcgccgatgcccttttcagccgtaccgaagaagatgcgcgtgcgcgctttgcgc 





aataccgtcgcctggctggcgaagagtaataatactctaaccccatcggccgtcttaggggttttttgtccgtgg 





ttgagtcagcgtcgagcacgcggctaatacgactcactagagagagacgcgacttccagagaagaagactactga 





cttgagcgttccctctctgtaatacatcaaatcaatcataggagggctaaaatgacctcttgttcgtcgttttct 





ggcggtaaagcgtgccgtccggccgatgactccgcgctgactccgctggtggccgacaaggcagctgcgcacccg 





tgctatagccgccacggccatcaccgcttcgcgcgtatgcacctgccagtcgctccggcctgcaacttacaatgc 





aactactgcaaccgcaagttcgattgcagcaatgaaagccgtccgggcgtgtcctctaccctgctgacgccggaa 





caggctgtggtgaaggtgcgccaggtcgcccaagctatcccgcagctgtcggtggtcggtattgctggtccgggc 





gatccgcttgcgaatatcgcccgcaccttccgtaccttggagcttattcgcgaacagttgccggacctgaaactg 





tgcctgagcaccaacggcttggtgctgccagatgccgttgatcgtctgctcgatgtgggcgtggatcacgttacc 





gtcaccattaacaccctggacgcagaaatcgcagcgcaaatctacgcgtggttgtggctggatggcgaacgctac 





tccggtcgcgaagccggcgaaattctcattgcccgccagctggaaggcgtacgtcgcctgaccgcgaaaggtgtg 





ctcgtcaagatcaacagcgtattgattccgggcatcaatgacagcggcatggcgggtgttagccgtgcgctgcgc 





gcgtctggtgcgttcatccacaacatcatgccactgattgcgcgtccggagcatggcactgttttcggtctgaac 





ggccagccggaaccggacgcggaaaccctggcggcgacgcgctcccgctgcggcgaggttatgccacaaatgacc 





cactgccaccagtgccgtgccgacgcgattggcatgcttggtgaggatcgctcgcaacagtttacgcaattaccg 





gctccggagtccctcccggcctggctgccgatcctgcatcagcgtgctcagttgcatgcgagcatcgccacgcgc 





ggtgagagcgaagccgatgacgcctgcctggtggccgttgcgtcgagccgtggcgatgtaattgactgccatttc 





ggccatgccgaccgtttctatatctatagcctgtctgcggctggtatggttctggttaacgaacgtttcaccccg 





aaatactgccagggtcgcgatgactgcgagccgcaggacaatgccgcacgctttgctgccatccttgagttgctg 





gcggacgtcaaagcggtgttttgtgtgcgtatcggccataccccgtggcaacagctggagcaggaaggcatcgaa 





ccgtgcgtggatggcgcctggcgtccggtatccgaggtcctgccggcatggtggcagcagcgccgtggtagctgg 





ccggctgcattgccgcacaaaggcgttgcgtaaactacgagatttgaggtaaaccaaataagcacgtagtggcat 





taaagaggagaaattaagcatgccgccattggactggttgcgtcgtttgtggttactctatcacgccggcaaagg 





cagctttccgcttcgtatgggcttgtcgccgcgtgactggcaagctctgcgccgtcgcctgggcgaggtggaaac 





gccgctggatggcgaaaccctgacccgtcgccgtctgatggcggagctgaatgcgacccgcgaagaagaacgcca 





gcagctgggtgcctggctggccggttggatgcaacaggatgccggtccgatggcgcagattatcgcagaggtgag 





cctggcgttcaaccatctctggcaggaccttggcctcgcgagccgcgctgaactgcgtctgctgatgtctgactg 





cttcccgcagctggttgttatgaacgagcacaacatgcgctggaagaaattcttttaccgccagcgttgcctgct 





gcaacagggcgaagtcatctgtcgcagcccgtcttgcgatgaatgctgggaacgttctgcgtgctttgagtaata 





catatcgggggggtaggggttttttgtgtctgtagcacgtgcatctaatacgactcactaatgggagagacaaga 





gtctcaattataaggaggctttactacatggcgaacatcggcatcttctttggtacggataccggcaaaacccgc 





aagattgcgaagatgattcacaaacagctgggcgagctggccgatgccccggttaacatcaatcgtaccactttg 





gatgactttatggcttacccagtcctgttgctcggcacgccgacgcttggtgatggtcaactgccgggcttagag 





gcgggctgcgagagcgaaagctggtctgagtttatctccggtctggatgacgcttccctgaagggcaaaaccgtg 





gcgctgtttggcctgggcgaccagcgtggttacccggacaacttcgtgtcgggtatgcgtccgctgttcgacgcg 





ctgagcgcccgtggcgcccagatgattggtagctggccgaacgaaggttatgagtttagcgcatcgtccgcgctg 





gaaggcgaccgcttcgtcggcttggtgctggatcaagacaatcagttcgaccagaccgaagcgcgcctggcgtct 





tggcttgaagagatcaaacgcaccgttctgtaataatacatatcgggggggtaggggttttttgtggtcattaca 





acggttattaatacgactcactagagagagaaacatagcgttccatgagggctagaattacctaccggcctcaga 





tactgacaaataaaccagcgaaggaggttcctaatgtggaactacagcgagaaagtcaaggaccatttcttcaat 





ccgcgcaacgcgcgtgttgtggataacgcaaatgcggtgggcgacgtcggcagcttatcttgtggcgatgctctc 





cgcttgatgctgcgcgtggacccgcagagcgaaatcatcgaagaagcgggctttcagaccttcggctgcggcagc 





gcgattgcgtcgtccagcgcactgacggagctgatcatcggtcacaccctggcggaagcgggtcagatcaccaac 





cagcagatcgccgactatctggacggcttaccgccggaaaagatgcactgctctgtaatgggccaggaagctctt 





cgtgcggccattgctaactttcgcggtgaatcgctggaagaggagcatgacgagggtaagctgatctgcaagtgc 





ttcggcgtcgatgaaggccatattcgccgtgctgtccagaacaacggtcttacgactctggccgaggtgatcaat 





tacaccaaggcaggtggcggttgtaccagctgccatgagaaaatcgagctggccctggccgagattctcgcccaa 





cagccgcaaaccaccccggcagttgcgtccggtaaagatccgcactggcagagcgtcgtggataccatcgctgaa 





ctgcgtccacatatccaagcggacggtggtgacatggcgctgttgtccgtgacgaaccaccaagtgactgtttcg 





ctgtcgggcagctgttctggctgcatgatgaccgacatgaccctggcgtggctgcaacagaaattgatggagcgt 





accggctgctatatggaagttgttgccgcctaacattgtaatagccaccaaaagagtgatgatagtcatgggtga 





tacccgtagaccattctgaaatcgaaggaggttttccatgaaacaagtgtacctggacaacaacgcgaccacccg 





cctggacccgatggttctggaagcgatgatgccgtttctcacggatttctatggcaatccgtccagcatccatga 





cttcggcatcccggcacaagcggcgctggaacgtgcgcaccagcaagctgcggcactgctgggcgcagagtaccc 





gtctgaaatcattttcacgagctgtgcgaccgaggccactgcaaccgccattgcgtcggccatcgcgttattgcc 





ggaacgccgcgaaatcatcacctcggtagtggagcacccggctacgctggcggcgtgcgagcacctggaacgcca 





aggctatcgcatccatcgcattgcggtggatagcgaaggtgcgctggacatggcccagttccgtgcagcgctctc 





gccgcgtgtcgcgttggtgagcgtgatgtgggccaacaacgaaaccggcgtgctgttcccgattggcgaaatggc 





cgagcttgcccacgagcagggcgctctgttccactgcgatgccgttcaggtcgttggcaaaatcccaattgctgt 





tggccagacgcgcatcgacatgctgtcttgctccgcgcacaagtttcatggtccgaagggtgttggttgcttgta 





cttacgtcgtggcacgcgctttcgtccgctgcttcgcggtggccatcaagaatatggtcgccgtgccggcactga 





gaatatctgtggcatcgtcggcatgggcgctgcgtgcgaactggcgaacatccatctgccgggtatgacccatat 





tggccagttacgcaatcgcctggagcaccgtctgctcgccagcgtgccgtccgtgatggttatgggcggtggtca 





gccgcgtgtaccgggtactgtcaacctggcgttcgagtttatcgaaggtgaagcgatcctgctcttgctgaacca 





ggctggcattgccgcaagctccggctccgcgtgtacctctggcagcttggagccgagccatgtgatgcgcgccat 





gaacattccatacaccgcggctcacggcaccattcgttttagcctgagccgttatacgcgcgagaaagagatcga 





ctacgtcgttgcgaccctcccgccaatcattgatcgtctgcgtgccttgtccccgtattggcagaatggtaagcc 





gcgtccggcagatgcagtctttaccccggtttacggttaagagttactggccctgatttctccgcttctaatacc 





gcacagcgactaggagcctaactcgccacaaggaaacatatggagcgcgtcttgatcaacgatactaccctgcgt 





gatggcgaacaatctccgggcgtagcgtttcgtacctccgagaaagttgccatcgcggaggcactgtacgctgcg 





ggtatcaccgcgatggaagtcggcactccggcgatgggtgatgaagagatcgcccgcattcagctggtgcgtcgt 





caactgccggacgcgacgcttatgacctggtgccgtatgaacgctctggaaatccgtcagagcgcggatctgggt 





attgactgggtggatatctcgatcccagcatccgacaagctgcgtcagtacaagctgcgtgagccgctggccgtg 





ctgctggagcgccttgcgatgtttatccatctggcccacacgttaggcctcaaagtatgtattggttgcgaggat 





gcgagccgtgcgtctggtcagaccctgcgcgccattgccgaggtggcccagcaatgcgcggctgcgcgcttgcgt 





tacgctgacaccgtgggcctgctggacccgttcaccaccgcagcccagatcagcgccctgcgtgacgtttggtcg 





ggcgagatcgagatgcatgctcacaatgatctgggcatggctaccgcgaacacgctggcggcagtttcggctggc 





gccacgtcggtgaacactaccgtcctcggtctgggtgaacgtgcaggcaacgcagccctggaaaccgttgcgctg 





ggcctggaacgctgcctgggcgtggaaaccggcgtccatttcagcgcgctcccagcgctctgtcagcgcgtcgcg 





gaggctgcacagcgcgcaatcgacccgcaacagccgctggtgggtgaattggttttcacccacgagtctggtgtt 





cacgttgcggcgctgctgcgcgacagcgaatcctatcaatctattgccccaagcctcatgggccgtagctaccgt 





ctggtgctcggcaagcattcgggtcgtcaggctgtcaacggtgttttcgaccagatgggttaccacctgaatgcg 





gcgcagatcaatcagttgctgccggccattcgccgcttcgccgagaattggaaacgctctccgaaagactacgaa 





ctggttgcgatctatgacgaattgtgcggtgaatccgcccttcgtgctcgcggctaagactcaacacgctaggga 





cgtgaagtcgattccttcgatgcagaaggcgagaactagatttaagggccattatagatggagtggttttaccag 





attccgggtgtagacgaattgcgcagcgctgaatccttctttcagttcttcgcggttccataccagccggaactg 





ctgggccgctgctcgcttccggtgttagcgacgttccaccgtaaactgcgtgcggaggtcccgctgcaaaaccgt 





ctggaggacaatgatcgtgcgccgtggctcttggcgcgccgcctcctggccgaatcttatcagcagcaatttcag 





gagagcggcacctaatcgagaaacaaggcagttccgggctgaaagtagcgccgggacaagtcccgtattataacc 





gcctaggaggtgttggatgcgcccgaaattcaccttctctgaagaggtccgcgtagttcgcgcgattcgtaatga 





tggcaccgtggcgggttttgcgccaggtgcgctgctggttcgtcgcggttcgacgggctttgtgcgtgactgggg 





tgtgttcctgcaagaccagatcatctatcaaatccactttccggaaaccgaccgcattatcggctgtcgcgagca 





ggagttaatcccgattacccagccgtggttggctggtaacctccagtatcgtgacagcgtcacgtgccaaatggc 





actggctgtcaacggtgacgtggttgtgagcgccggtcaacgtggccgtgtggaggccactgatcgtggcgaact 





tggcgattcctacaccgtggacttcagcggccgttggttccgcgttccggtccaggccatcgcgctgattgaaga 





gcgcgaagaataaacgccacgcgtagtgagacatacacgttcgttgggttcactcagagactgaagttattaccc 





aggaggtctataatgaatccgtggcagcgctttgcccgtcaacgccttgctcgcagccgctggaaccgtgatccg 





gctgctctcgacccagccgataccccagcgttcgagcaggcgtggcagcgtcaatgccatatggaacaaaccatc 





gtagcgcgtgtcccggaaggcgatattccggctgccttactggaaaacatcgcggccagcctggcgatctggctg 





gacgagggtgacttcgctccgccggagcgcgctgcgattgtgcgtcatcatgcacgtctggagctggcgtttgcc 





gacattgcccgccaggcaccgcaaccggatctgagcacggttcaagcgtggtatctgcgtcaccagactcaattc 





atgcgtccggagcagcgtctgacccgtcacctgctcctgacggtcgataatgatcgcgaggcggtgcatcaacgc 





atccttggcctgtatcgtcagatcaacgcgagccgtgacgccttcgccccactggcacagcgccactctcattgc 





ccgtccgccttggaagaaggccgtctgggctggatctcccgtggtctgctgtacccgcagctcgaaaccgcgttg 





tttagcctggcggaaaacgcactgtcgctgccgattgcgtcggaattgggttggcacctgttatggtgcgaggcc 





attcgtccggcagccccgatggagccgcaacaggcccttgaatctgcgcgcgactacttgtggcagcagagccag 





cagcgccaccagcgtcaatggctggagcagatgatttcccgccaaccgggcctgtgtggttaatagcataacccc 





ttggggcctctaaacgggtcttgaggggttttttgt 





refactored nif cluster v2.1 


(SEQ ID NO. 2)



taatacgactcactattgggagatACAAATATATAATATATTTAAGGAGGTTTCATATATGACCATCCGTCAGTG 






CGCGATTTATGGCAAAGGTGGTATTGCCAAAAGCACGACGACCCAGAACTTGGTCGCCGCCGTGGCCGAGATGGG 





TAAAAAGGTTATGATTGTGGGTTGCGACCCGAAGGCCGACAGCACGCGCCTGATTCTCCACGCGAAAGGACAAAA 





CACGATTATGGAGATGGCTGCCGAGGTTGCTAGCGTGGAGGATCTGGAGCTGGAGGACGTTCTGCAAATTGGTTA 





CGGTGATGTTCGTTGCGCAGAGAGCGGTGGTGCGGAACCAGGTGTCGGCTGTGGGGGTCGTGGTGTAATTACCGC 





TATCAATTTCCTGGAAGAAGAGGGTGCGTACGAAGATGATCTGGATTTCGTTTTCTACGATGTGCTGGGTGATGT





CGTGTGCGGTGGTTTTGCAATGCCGATTCGCGAGAATAAGGCACAAGAAATTTACATTGTCTGTAGCGGCGAGAT





GATGGCAATGTACGCTGCTAACAACATCAGCAAGGGTATTGTTAAATACGCAAAAAGCGGTAAGGTTCGCTTGGG





TGGTTTGATTTGCAACAGCCGTCAGACCGACCGTGAGGACGAACTGATCATCGCCCTGGCTGAGAAACTGGGCAC





CCAAATGATCCACTTCGTGCCACGCGATAATATTGTTCAACGTGCAGAAATCCGCCGTATGACCGTCATTGAGTA





TGACCCGGCATGCAAGCAAGCGAACGAGTACCGCACCTTGGCACAGAAAATCGTGAACAACACCATGAAGGTTGT





TCCGACGCCGTGTACGATGGACGAGCTGGAGAGCCTGCTGATGGAGTTCGGCATTATGGAGGAGGAGGACACCAG





CATTATCGGTAAGACCGCAGCGGAGGAGAATGCGGCATAATACTCGAACCCCTAGCCCGCTCTTATCGGGCGGCT





AGGGGTTTTTTGTCGAAGAACAGATATGAAAGTGTTAGAACTGTAATACGACTCACTATAGGTAGAGCGTGCGTA





CACCTTAATCACCGCTTCATGCTAAGGTCCTGGCTGCATGCAAAAATTCACATTTTTATCTAGCGGAGGAGCCGG





atgatgactaatgctactggcgaacgtaacctggcactgattcaagaagtactggaagtgttcccggaaaccgcg 





cgcaaagagcgccgtaaacacatgatggtttctgacccgGaaatgGaatctgtgggtaaatgcatcatctctaat 





cgcaaatctcagccgggtgtcatgactgttcgtggctgtgcgtacgcaggttctaaaggtgtcgtattcggcccg 





atcaaagatatggcgcatatctctcatggcccggTaggctgtggccagtactctcgcgcggGacgtcgtaactac 





tacacgggcgtttctggcgttgactctttcggcacgctgaacttcacctctgacttccaggaacgtgacatcgtt 





ttcggtggcgataaaaagctgtccaaactgatcgaagaaatggaactgctgttcccgctgactaaaggcattact 





atccaaagcgaatgtccggtgggtctgatcggtgatgacatcagcgcggtcgcaaacgcatcttccaaagccctg 





gataagccggtgatcccggttcgttgcgagggcttccgcggcgtttctcagtctctgggtcatcacatcgcaaac 





gatgttgtgcgtgactggattctgaacaaccgtgaaggtcagccttttgaaaccaccccttatgacgttgcgatt 





attggcgactataacatcggcggcgacgcctgggcatcccgcatcctgctggaggagatgggtctgcgtgttgtc 





gcacagtggtctggcgatggcaccctggttgaaatggaaaacaccccgtttgttaaactgaacctggttcactgc 





taccgctccatgaactacattgcccgtcacatggaagaaaaacatcagatcccttggatggaatacaacttcttc 





ggtccgactaaaatcgcagaatccctgcgtaaaatcgccgatcagtttgatgataccattcgcgcgaacgctgaa 





gcagtaattgcgcgctacgaaggccagatggcagcaatcattgctaagtaccgtccgcgcctggaaggtcgtaaa 





gtgctgctgtacatgggtggtctgcgtccacgtcatgtgatcggtgcctacgaggacctgggcatggagatcatc 





gcagcgggttacgaatttgcacacaacgacgactatgatcgtacgctgccagacctgaaagaaggtacgctgctg 





tttgacgacgccagctcttatgaactggaagccttcgtgaaagcgctgaaaccagacctgatcggctccggcatc 





aaggaaaaatacattttccagaaaatgggcgtgccgttccgccagatgcactcctgggactactccggtccgtac 





cacggctacgacggtttcgctatcttcgctcgtgacatggatatgaccctgaataacccagcgtggaatgaactg 





accgcaccgtggctgaaatctgcataaCAAACACCGCATGTCGATACTGAACGAATCGACCCACACTCGCTTCCT 





TGCAATCTCATACTGTCAAAAATTAGGCGAGGTAACatgtctcaaactatcgataaaatcaactcttgttacccg 





ctgttcgagcaggacgaatatcaggaactgttccgtaacaaacgtcagctggaagaagcgcacgacgcacagcgc 





gtgcaggaagtgttcgcatggaccaccaccgcggaatacgaagctctgaacttccagcgcgaagccctgacggtt 





gatccggcgaaagcgtgccagcctctgggtgcggttctgtgcagcctgggttttgccaacaccctgccgtatgtc 





cacggttcccagggctgcgtagcctacttccgtacctatttcaaccgccactttaaagaaccaatcgcgtgcgtg 





tccgacagcatgacggaggacgcggcagttttcggtggtaacaacaacatgaacctgggcctgcaaaatgcttcc 





gcactgtacaaaccggaaatcatcgcagtgtctaccacctgcatggcagaggttattggtgatgatctgcaagca 





tttattgccaacgcaaagaaagacggtttcgttgacagctctatcgcggttccgcacgctcataccccgtccttc 





atcggttctcacgtaactggttgggacaacatgttcgaaggcttcgcaaaaacttttaccgcagactatcaaggc 





caaccgggtaaactgccgaagctgaacctggtgaccggctttgaaacctacctgggcaactttcgtgtcctgaag 





cgcatgatggagcagatggcggttccgtgttctctgctgtctgacccgtctgaggttctggacactccagcggac 





ggccactatcgcatgtattctggtggcaccactcagcaggaaatgaaagaggccccagacgcgattgacaccctg 





ctgctgcaaccgtggcagctgctgaaaagcaagaaagttgttcaggaaatgtggaaccagccggcaacggaagtt 





gcaatcccgctgggtctggcagctactgacgaactgctgatgaccgtgtcccaactgagcggcaaaccaatcgcg 





gatgctctgaccctggaacgcggtcgcctggtggacatgatgctggacagccacacgtggctgcatggcaagaaa 





tttggcctgtacggtgacccggacttcgtaatgggcctgacccgtttcctgctggaactgggctgcgagccgact 





gttatcctgtctcacaacgctaacaaacgttggcagaaggccatgaacaaaatgctggatgcgagcccatacggc 





cgtgatagcgaagtgttcatcaactgcgacctgtggcatttccgctctctgatgtttacgcgtcagccggatttc 





atgatcggtaactcttacggcaaattcatccagcgtgacactctggccaaaggcaaagcgtttgaagtgccgctg 





attcgtctgggctttccgctgttcgaccgtcaccacctgcaccgccagaccacctggggttacgaaggcgcgatg 





aacatcgtaactactctggtaaacgcagtactggaaaagctggacagcgatacttcccagctgggcaaaaccgac 





tattctttcgatctggttcgttaaCCTGATTGTATCCGCATCTGATGCTACCGTGGTTGAGTTACCATACTCACT 





CCCGGAGGTACTTCTATGTCTGACAATGATACCCTGTTTTGGCGCATGCTGGCGCTGTTTCAGTCGCTGCCGGAT 





TTGCACCCGGCTCAAATCGTCGATTGGCTGGCGCAGGAATCCGGCGAAACCCTGACGCCGGAGCCCCTTGCCACC 





CTGACCCAACCGCAACTCGCGGCGTCGTTCCCATCCGCGACGGCAGTGATGAGCCCGGCTCGCTCGAGCCGCGTT 





ATGGCTTCTCTGCAAGGCGCCCTCCCAGCCCACTTGCGCATCGTACGTCCGGCGCAGCGTACCCCGCAACTGCTC





GCCGCGTTTTGCAGCCAAGACGGCCTTGTTATCAATGGTCATTTCGGCCAGGGTCGTCTGTTCTTCATTTACGCC





TTTGACGAGCAGGGCGGCTGGCTGTATGACTTGCGCCGCTATCCGAGCGCACCGCACCAGCAGGAAGCGAATGAG





GTGCGTGCTCGTCTGATTGAAGATTGCCAGCTGCTGTTCTGCCAGGAGATTGGCGGTCCGGCAGCAGCGCGTCTG





ATCCGCCACCGCATCCATCCGATGAAGGCGCAGCCGGGTACTACGATTCAGGCGCAGTGTGAAGCTATCAACACC





CTGCTGGCCGGTCGCCTGCCGCCGTGGCTCGCCAAACGTTTGAACCGTGATAACCCGCTGGAAGAGCGTGTGTTT





TAACATTTTTGCCTTGCGACAGACCTCCTACTTAGATTGCCACACTATTCAATTCATCACTGGAGGTTATTACAA





ATGAACGGTAACGAGATTCTTGCTCTGCTGGACCAACCGGCCTGTGAACACAACCATAAACAGAAATCCGGCTGT 





AGCGCCCCAAAGCCGGGTGCGACGGCGGCTGGCTGCGCTTTCGATGGTGCGCAGATCACCCTGCTCCCGATTGCG





GACGTTGCCCACCTCGTGCATGGCCCAATCGGTTGCGCAGGTAGCTCTTGGGACAACCGTGGCAGCGCCTCCAGC





GGTCCGACCCTGAATCGTTTGGGCTTTACCACTGACTTGAATGAACAAGATGTGATCATGGGTCGCGGCGAGCGT





CGCCTGTTCCACGCTGTGCGCCATATTGTCACCCGTTACCACCCAGCGGCAGTATTCATCTACAATACGTGCGTG





CCGGCTATGGAAGGCGATGACCTGGAGGCCGTGTGTCAGGCAGCCCAGACTGCGACCGGCGTCCCGGTAATCGCA





ATTGATGCGGCTGGCTTCTACGGTTCGAAGAACCTGGGCAACCGTCCGGCAGGCGATGTCATGGTTAAACGCGTC





ATTGGCCAACGTGAGCCAGCGCCGTGGCCGGAGAGCACCCTGTTTGCCCCGGAGCAACGTCATGACATTGGCTTG





ATCGGTGAGTTCAACATTGCGGGCGAGTTTTGGCACATTCAGCCGCTGCTTGATGAGCTGGGTATCCGCGTTTTG





GGTTCGCTCAGCGGCGATGGTCGTTTCGCCGAGATTCAAACCATGCACCGTGCCCAGGCGAACATGCTGGTGTGC





AGCCGTGCTCTGATCAATGTTGCGCGTGCTCTGCAACAGCGCTATGGCACCCCGTGGTTTGAAGCCTCGTTCTAT 





GGTATCCGCGCGACCAGCGACGCCCTGCGCCAGTTAGCGGCGCTGCTGGGCGATGACGACCTCCGTCAGCGCACC





GAGGCGCTGATCGCGCGTGAAGAACAGGCGGCTGAGCTGGCCCTGCAACCGTGGCGTGAACAGCTGCGTGGCCGC





AAGGCCCTGCTCTACACGGGTGGTGTCAAAAGCTGGTCTGTGGTGTCCGCGCTTCAGGATCTGGGTATGACCGTG





GTTGCCACGGGCACGCGTAAGAGCACGGAAGAGGATAAACAGCGCATCCGCGAATTGATGGGCGAAGAGGCCGTG





ATGCTTGAAGAAGGCAACGCACGTACCTTATTGGATGTAGTTTATCGCTATCAAGCAGACCTGATGATTGCCGGT





GGCCGCAACATGTATACCGCCTACAAAGCGCGCTTGCCGTTCCTGGACATCAACCAGGAACGCGAGCACGCGTTT





GCGGGCTACCAAGGCATCGTGACCTTAGCGCGCCAGCTGTGCCAAACGATTAACAGCCCGATCTGGCCGCAGACT





CATTCCCGCGCACCGTGGCGCTAATGTCACGCTAGGAGGCAATTCTATAAGAATGCACACTGCACCTAAACCTAC





CACACCTGGAAGAAGTAATTATGGCAGACATTTTCCGCACTGATAAGCCGTTGGCTGTGTCGCCGATCAAGACCG





GCCAGCCGCTGGGTGCGATCCTGGCGTCCCTGGGTATCGAGCACTCGATTCCGCTGGTACATGGCGCGCAGGGCT





GTTCGGCTTTTGCCAAGGTTTTCTTTATCCAGCACTTCCACGATCCGGTCCCGCTGCAAAGCACGGCAATGGACC





CGACCAGCACCATCATGGGCGCTGATGGTAACATCTTCACCGCGCTGGACACTCTCTGCCAACGCAATAACCCGC





AAGCAATTGTGCTGCTGAGCACCGGCCTCTCCGAGGCGCAGGGCAGCGACATTTCCCGTGTAGTGCGTCAGTTCC





GTGAAGAATATCCGCGTCATAAAGGCGTGGCGATTCTGACTGTTAACACCCCGGACTTTTACGGTAGCATGGAGA





ACGGCTTTTCCGCTGTCCTGGAGTCTGTGATTGAACAGTGGGTTCCGCCAGCCCCACGTCCGGCGCAGCGCAATC





GTCGCGTCAATCTTTTGGTGAGCCATCTCTGTAGCCCAGGCGATATTGAGTGGCTGCGCCGTTGCGTCGAGGCCT





TCGGTCTGCAACCGATCATTCTGCCGGATCTGGCTCAGAGCATGGACGGCCACCTTGCTCAGGGTGACTTTTCGC





CGCTGACGCAGGGCGGCACGCCGTTGCGCCAAATCGAGCAGATGGGCCAGAGCCTTTGCTCTTTTGCGATTGGCG





TCAGCCTGCACCGTGCGAGCAGCCTGCTGGCTCCGCGTTGTCGTGGCGAAGTCATCGCCTTGCCGCACCTCATGA





CCTTCCAACGCTGCCACCCCTTTATCCATCAGTTGCCGAAAATCACCGCTCCCGCCGTTCCGGACTGGCTGGAAC 





CCCAGCGCGGTCAGCTGCAAGACGCCATGATCGATTGCCACATGTGGCTGCAAGGCCAGCGCATGGCGATTGCCG 





CCGAAGGCGACCTGCTGCCAGCGTGGTGCGATTTCGCGAACTCTCAAGGTATGCAGCCGGGTCCACTGGTTGCTC 





CGACGGGTCATCCGAGCCTGCGTCAGTTGCCGGTGGAGCGCGTGGTGCCGGGTGATCTGGAGGATCTTCAGACCC 





TCTTATGCGCACATCCGCCCGACTTACTGGTGGCGAACTCCCACGCCCGTGATTTAGCAGAGCAATTCGCCCTGC 





CGCTGGTGCGCGCAGGCTTCCCGCTGTTTGACAAACTGGGCGAATTTCGTCGTGTTCGCCAGGGTTATAGCGGTA 





TGCGTGATACCCTGTTCCAGTTGGCGAACCTGATCCGTGAACGCCATCATCATCTGGCTCATTATCGCAGCCCGC 





TGCGCCAGAACCCAGAATCCTCGTTGTCTACGGGTGGCGCGTACGCAGCGGATTAActagagattaaTATggaga 





aattaagcATGAAAACTATGGACGGTAACGCTGCGGCTGCATGGATTAGCTACGCCTTTACCGAAGTGGCTGCGA 





TCTACCCGATTACGCCGAGCACCCCGATGGCGGAAAATGTGGACGAATGGGCTGCGCAGGGCAAGAAGAACCTCT 





TCGGCCAGCCGGTGCGCCTGATGGAGATGCAGTCGGAAGCGGGTGCAGCAGGTGCTGTGCATGGCGCCTTGCAAG 





CTGGCGCACTGACGACCACCTACACCGCGTCGCAGGGCCTGTTGCTGATGATCCCAAACATGTACAAAATCGCGG 





GTGAACTGCTGCCGGGTCTCTTTCATGTTTCGGCACGCGCACTGGCCACCAATAGCCTCAACATCTTTGGCGATC 





ATCAGGATGTAATGGCGCTGCGCCAAACGGGCTGCGCGATGTTGGCCGAGAATAACGTCCAGCAAGTTATGGATT 





TGTCCGCGGTAGCCCACTTGGCAGCGATCAAAGGTCGCATTCCGTTCGTGAACTTCTTCGATGGCTTTCGCACCA 





CCCACGAAATCCAGAAGATCGAGGTTCTCGAATATGAACAGCTGGCCACCTTGTTGGATCGTCCGGCCCTGGACA 





GCTTCCGCCGTAACGCCGTTCACCCGGACCACCCGGTCATCCGTGGCACCGCCCAGAACCCGGACATCTACTTCC 





AGGAACGTGAGGCCGGTAACCGTTTCTATCAGGCGCTCCCGGATATTGTGGAATCTTACATGACCCAGATTTCTG 





CCCTGACTGGTCGCGAGTATCACCTGTTTAACTACACTGGTGCTGCGGATGCGGAGCGCGTGATCATCGCGATGG 





GCTCTGTCTGTGACACCCTCCAAGAGGTGGTTGACACGCTGAATGCAGCGGGTGAGAAAGTTGGTCTGCTCTCCG 





TTCATCTTTTCCGCCCGTTTTCGTTAGCGCACTTCTTCGCCCAACTGCCGAAAACTGTACAGCGTATCGCAGTAT 





TGGACCGTACGAAAGAGCCAGGTGCTCAAGCAGAGCCGCTGTGCCTCGATGTGAAGAATGCCTTTTACCACCATG 





ACGATGCCCCGTTGATTGTGGGTGGTCGCTATGCCTTGGGCGGTAAGGACGTGTTGCCGAACGATATTGCGGCCG 





TGTTTGATAACCTGAACAAACCGCTGCCGATGGACGGCTTCACGCTGGGTATCGTGGACGATGTTACCTTCACCT 





CTCTCCCGCCAGCGCAGCAGACCCTGGCGGTTTCTCACGACGGCATCACGGCATGTAAGTTTTGGGGCATGGGCT 





CCGACGGCACGGTTGGTCCGAACAAGTCCGCGATCAAGATTATCGGCGACAAAACGCCACTGTATGCGCAAGCGT 





ACTTTTCCTACGACTCGAAGAAGAGCGGTGGTATTACCGTCAGCCATCTGCGTTTTGGTGATCGCCCGATCAACT 





CCCCGTATTTGATCCATCGCGCGGATTTCATCTCGTGCAGCCAGCAAAGCTATGTTGAACGCTACGATCTGCTGG 





ATGGCCTTAAACCGGGTCGCACCTTTCTGCTGAACTGCTCCTGGAGCGATGCCGAACTGGAGCAACATCTGCCGG 





TCGGTTTCAAACGTTATCTGGCACGCGAGAATATCCACTTCTACACTCTCAACGCTGTGGACATCGCCCGTGAGC 





TTGGTTTGGGTGGCCGTTTCAACATGCTGATGCAGGCTGCCTTCTTCAAACTGGCCGCGATCATTGACCCGCAGA 





CTGGTGGGGACTATCTGAAGCAGGCTGTTGAGAAAAGCTATGGCAGCAAAGGTGGGGCGGTCATCGAGATGAACC 





AGCGTGCCATCGAGCTTCGCATGGCCAGCCTGCACCAGGTGACGATCCCGGCACATTGGGCCACCCTGGATGAGC 





CAGCGGCGCAGGCGTCCCCGATGATGCCGGACTTTATCCGCGACATCCTGCAACCGATGAACCGTCAGTGCGGCG 





ACCAGCTTCCGGTGTCGCCTTTTGTCGGCATGGAAGATGGCACCTTCCCGTCCGGCACGGCCGCATGGGAGAAAC 





GTGGCATCGCCCTTGAGCTGCCAGTCTGGCAGCCGGAAGGCTGCACGCAGTGCAACCAGTGCGCCTTCATTTGTC 





CGCACGCCGCGATTCGTCCGGCGTTGTTGAATGGCGAAGAGCATGATGCTGCCCCGGTTGGCCTGCTGAGCAAAC 





CGGCACAAGGCGCTAAACAATATCACTATCATCTGGCGATTAGCCCGCTGGACTGCTCCGGCTGTGGCAACTGCG 





TTGACATTTGTCCAGCTCGTGGCAAAGCGTTGAAGATGCAGTCTCTGGATAGCCAACGCCAGATGGCTCCGGTGT





GGGATTATGCGCTGGCGCTGACCCCGAAGTCTAACCCGTTTCGTAAAACCACCGTCAAAGGCTCGCAGTTCGAAA 





CCCCGCTGCTGGAGTTTAGCGGTGCGTGCGCTGGTTGTGGCGAAACGCCGTATGCGCGCCTCATTACCCAGCTGT 





TTGGCGACCGCATGCTGATTGCCAATGCCACCGGCTGTTCCAGCATCTGGGGCGCATCTGCGCCGAGCATCCCGT 





ATACCACCAATCATCGTCGTCATGGTCCGGCCTGGGCGAATAGCCTGTTTGAGGACAATGCCGAATTTGGTTTAG 





GTATGATGCTGGGCGGTCAAGCTGTGCGTCAACAGATCGCGGACGATATGACGGCTGCGTTAGCGCTCCCGGTTT 





CCGATGAGCTGAGCGACCCGATGCGCCAGTGGTTGGCGAAACAGGACGAGGGTGAAGGCACGCGTGAGCGTGCGG 





ACCGTCTGAGCGAGCGCTTAGCCGCGGAGAAAGAGGGCGTTCCGCTGTTAGAGCAGCTGTGGCAAAATCGTGATT 





ACTTTGTGCGTCGCAGCCAGTGGATTTTCGGCGGTGACGGCTGGGCCTATGATATTGGCTTCGGTGGCCTGGACC 





ACGTCCTCGCCAGCGGTCAGGATGTGAACATTCTGGTATTTGACACCGAAGTCTACTCGAACACCGGCGGTCAAA 





GCAGCAAATCGACCCCGCTCGCGGCCATCGCCAAGTTCGCGGCTCAGGGCAAGCGCACCCGCAAGAAAGACCTGG 





GTATGATGGCGATGAGCTACGGCAACGTCTATGTAGCCCAGGTGGCGATGGGTGCGGATAAAGATCAAACTCTGC 





GCGCCATTGCGGAAGCTCAAGCGTGGCCAGGCCCGTCGCTGGTGATTGCGTATGCGGCCTGCATCAATCATGGCC 





TGAAGGCCGGTATGCGTTGCAGCCAACGTGAGGCGAAGCGCGCTGTTGAGGCGGGCTACTGGCACCTGTGGCGTT 





ATCACCCGCAGCGCGAACCGGAAGGCAAGACGCCGTTTATGTTAGATAGCGAAGAACCGGAAGAGTCGTTCCGTG 





ACTTTCTGTTGGGTGAGCTGCGCTACGCATCCCTGCACAAGACCACCCCGCACCTCGCCGATGCCCTTTTCAGCC 





GTACCGAAGAAGATGCGCGTGCGCGCTTTGCGCAATACCGTCGCCTGGCTGGCGAAGAGTAATAATACTCTAACC 





CCATCGGCCGTCTTAGGCGTTTTTTGTCCGTGGttagttagttagcccttagtgactcTAATACGACTCACTAGA 





GAGAGACGCGACTTCCACAGAAGAAGACTACTGACTTGAGCGTTCCCTCTCTGTAATACATCAAATCAATCATAG 





GAGGGCTAAAATGACCTCTTGTTCGTCGTTTTCTGGCGGTAAAGCGTGCCGTCCGGCCGATGACTCCGCGCTGAC 





TCCGCTGGTGGCCGACAAGGCAGCTGCGCACCCGTGCTATAGCCGCCACGGCCATCACCGCTTCGCGCGTATGCA 





CCTGCCAGTCGCTCCGGCCTGCAACTTACAATGCAACTACTGCAACCGCAAGTTCGATTGCAGCAATGAAAGCCG 





TCCGCTGGTGGCCGACAAGGCAGCTGCGCACCCGTGCTATAGCCGCCACGGCCATCACCGCTTCGCGCGTATGCA





GCAGCTGtcgGTGGTCGCTATTGCTGGTCCGGGCGATCCGCTTGCGAATATCGCCCGCACCTTCCGTACCTTGGA 





GCTTATTCGCGAACAGTTGCCGGACCTGAAACTGTGCCTGAGCACCAACGGCTTGGTGCTGCCAGATGCCGTTGA 





TCGTCTGCTCGATGTGGCCGTGGATCACGTTACCGTCACCATTAACACCCTGGACGCAGAAATCGCAGCGCAAAT 





CTACGCGTGGTTGTGGCTGGATGGCGAACGCTACTCCGGTCGCGAAGCCGGCGAAATTCTCATTGCCCGCCAGCT 





GGAAGGCGTACGTCGCCTGACCGCGAAAGGTGTGCTCGTCAAGATCAACAGCGTATTGATTCCGGGCATCAATGA 





CAGCGGCATGGCGGGTGTTAGCCGTGCGCTGCGCGCGTCTGGTGCGTTCATCCACAACATCATGCCACTGATTGC 





GCGTCCGGAGCATGGCACTGTTTTCGGTCTGAACGGCCAGCCGGAACCGGACGCGGAAACCCTGGCGGCGACGCG 





CTCCCGCTGCGGCGAGGTTATGCCACAAATGACCCACTGCCACCAGTGCCGTGCCGACGCGATTGGCATGCTTGG 





TGAGGATCGCTCGCAACAGTTTACGCAATTACCGGCTCCGGAGTGCCTCCGGGCCTGGCTGCCGATCCTGCATCA 





GCGTGCTCAGTTGCATGCGAGCATCGCCACGCGCGGTGAGAGCGAAGCCGATGACGCCTGCCTGGTGGCCGTTGC 





GTCGAGCCGTGGCGATGTAATTGACTGCCATTTCGGCCATGCCGACCGTTTCTATATCTATAGCCTGTCTGCGGC 





TGGTATCGTTCTGGTTAACGAACGTTTCACCCCGAAATACTGCCAGGGTCGCGATGACTGCGAGCCGCAGGACAA 





TGCCGCACGCTTTGCTGCCATCCTTGAGTTGCTGGCGGACGTCAAAGCGGTGTTTTGTGTGCGTATCGGCCATAC 





CCCGTGGCAACAGCTGGAGCAGGAAGGCATCGAACCGTGCGTGGATGGCGCCTGGCGTCCGGTATCCGAGGTCCT





GCCGGCATGGTGGCAGCAGCGCCGTGGTAGCTGGCCGGCTGCATTGCCGCACAAAGGCGTTGCGTAAACTACGAG 





ATTTGAGGTAAACCAAATAAGCACGTAGTGGCATTAAAGAGGAGAAATTAAGCATGCCGCCATTGGACTGGTTGC 





GTCGTTTGTGGTTACTCTATCACGCCGGCAAAGGCAGCTTTCCGCTTCGTATGGGCTTGTCGCCGCGTGACTGGC 





AAGCTCTGCGCCGTCGCCTGGGCGAGGTGGAAACGCCGCTGGATGGCGAAACCCTGACCCGTCGCCGTCTGATGG 





CGGAGCTGAATCCGACCCGCGAAGAAGAACGCCAGCAGCTGGCTGCCTGGCTGGCCGGTTGGATGCAACAGGATG 





CCGGTCCGATGGCGCAGATTATCGCAGAGGTGAGCCTGGCGTTCAACCATCTCTGGCAGGACCTTGGCCTCGCGA 





GCCGCGCTGAACTGCGTCTGCTGATGTCTGACTGCTTCCCGCAGCTGGTTGTTATGAACGAGCACAACATGCGCT 





GGAAGAAATTCTTTTACCGCCAGCGTTGCCTGCTGCAACAGGGCGAAGTCATCTGTCGCAGCCCGTCTTGCGATG 





AATGCTGGGAACGTTCTCCGTGCTTTGAGTAATACATATCGGGGGCGTAGGGGTTTTTTGTGTCTGTAGCACGTG 





CATCTAATACGACTCACTAATGGGAGAGACAAGAGTCTCAATTATAAGGAGGCTTTACTACATGGCGAACATCGG 





CATCTTCTTTGGTACGGATACCGGCAAAACCCGCAAGATTGCGAAGATGATTCACAAACAGCTGGGCGAGCTGGC 





CGATGCCCCGGTTAACATCAATCGTACCACTTTGGATGACTTTATGGCTTACCCAGTCCTGTTGCTCGGCACGCC 





GACGCTTGGTGATGGTCAACTGCCGGGCTTAGAGGCGGGCTGCGAGAGCGAAAGCTGGTCTGAGTTTATCTCCGG 





TCTGGATGACGCTTCCCTGAAGGGCAAAACCGTGGCGCTGTTTGGCCTGGGCGACCAGCGTGGTTACCCGGACAA 





CTTCGTGTCGGGTATGCGTCCGCTGTTCGACGCGCTGAGCGCCCGTGGCGCCCAGATGATTGGTAGCTGGCCGAA 





CGAAGGTTATGAGTTTAGCGCATCGTCCGCGCTGGAAGGCGACCGCTTCGTCGGCTTGGTGCTGGATCAAGACAA 





TCAGTTCGACCAGACCGAAGCGCGCCTGGCGTCTTGGCTTGAAGAGATCAAACGCACCGTTCTGTAATAATACAT





ATCGGGGGGGTAGGGGTTTTTTGTGGTCATTACAACGGTTATggtctcaggagtaatacgactcactagagagag 





aggtcgcggacccggccgatccgggggcctcaaagccgcctcaccagatactgacaaataaaccagcgaaggagg 





ttcctaatgtggaactacagcgagaaagtcaaggaccatttcttcaatccgcgcaacgcgcgtgttgtggataac 





gcaaatgcggtgggcgacgtcggcagcttatcttgtggcgatgctctccgcttgatgctgcgcgtggacccgcag 





agcgaaatcatcgaagaagcgggctttcagaccttcggctgcggcagcgcgattgcgtcgtccagcgcactgacg 





gagctgatcatcggtcacaccctggcggaagcgggtcagatcaccaaccagcagatcgccgactatctggacggc 





ttaccgccggaaaagatgcactgctctgtaatgggccaggaagctcttcgtgcggccattgctaactttcgcggt 





gaatcgctggaagaggagcatgacgagggtaagctgatctgcaagtgcttcggcgtcgatgaaggccatattcgc 





cgtgctgtccagaacaacggtcttacgacgctggccgaggtgatcaattacaccaaggcaggtggcggttgtacc 





agctgccatgagaaaatcgagctggccctggccgagattctcgcccaacagccgcaaaccaccccggcagttgcg 





tccggtaaagatccgcactggcagagcgtcgtggataccatcgctgaactgcgtccacatatccaagcggacggt 





ggtgacatggcgctgttgtccgtgacgaaccaccaagtgactgtttcgctgtcgggcagctgttctggctgcatg 





atgaccgacatgaccctggcgtggctgcaacagaaattgatggagcgtaccggctgctatatggaagttgttgcc 





gcctaagaccgcgcgccccgtcagagcaatgcgtataccagctctcctgtcagcagaatggctccagtacatcta 





acggggcagtatccgcggcaagtcctagtccaatcgatacccgtagaccattctgaaatcgaaggaggttttcca 





tgaaacaagtgtacctggacaacaacgcgaccacccgcctggacccgatggttctggaagcgatgatgccgtttc 





tcacggatttctatggcaatccgtccagcatccatgacttcggcatcccggcacaagcggcgctggaacgtgcgc 





accagcaagctgcggcactgctgggcgcagagtacccgtctgaaatcattttcacgagctgtgcgaccgaggcca 





ctgcaaccgccattgcgtcggccatcgcgttattgccggaacgccgcgaaatcatcacctcggtagtggagcacc 





cggctacgctggcggcgtgcgagcacctggaacgccaaggctatcgcatccatcgcattgcggtggatagcgaag 





gtgcgctggacatggcccagttccgtgcagcgctctcgccgcgtgtcgcgttggtgagcgtgatgtgggccaaca 





acgaaaccggcgtgctgttcccgattggcgaaatggccgagcttgcccacgagcagggcgctctgttccactgcg 





atgccgttcaggtcgttggcaaaatcccaattgctgttggccagacgcgcatcgacatgctgtcttgctccgcgc 





acaagtttcatggtccgaagggtgttggttgcttgtacttacgtcgtggcacgcgctttcgtccgctgcttcgcg 





gtggccatcaagaatatggtcgccgtgccggcactgagaatatctgtggcatcgtcggcatgggcgctgcgtgcg 





aactggcgaacatccatctgccgggtatgacccatattggccagttacgcaatcgcctggagcaccgtctgctcg 





ccagcgtgccgtccgtgatggttatgggcggtggtcagccgcgtgtaccgggtactgtcaacctggcgttcgagt 





ttatcgaaggtgaagcgatcctgctcttgctgaaccaggctggcattgccgcaagctccggctccgcgtgtacct 





ctggcagcttggagccgagccatgtgatgcgcgccatgaacattccatacaccgcggctcacggcaccattcgtt 





ttagcctgagccgttatacgcgcgagaaagagatcgactacgtcgttgcgaccctcccgccaatcattgatcgtc 





tgcgtgccttgtccccgtattggcagaatggtaagccgcgtccggcagatgcagtctttaccccggtttacggtt 





aagcgactaggagcctaactcgccacaaggaaacatatggagcgcgtcttgatcaacgatactaccctgcgtgat 





ggcgaacaatctccgggcgtagcgtttcgtacctccgagaaagttgccatcgcggaggcactgtacgctgcgggt 





atcaccgcgatggaagtcggcactccggcgatgggtgatgaagagatcgcccgcattcagctggtgcgtcgtcaa 





ctgccggacgcgacgcttatgacctggtgccgtatgaacgctctggaaatccgtcagagcgcggatctgggtatt 





gactgggtggatatctcgatcccagcatccgacaagctgcgtcagtacaagctgcgtgagccgctggccgtgctg 





ctggagcgccttgcgatgtttatccatctggcccacacgttaggcctcaaagtatgtattggttgcgaggatgcg 





agccgtgcgtctggtcagaccctgcgcgccattgccgaggtggcccagcaatgcgcggctgcgcgcttgcgttac 





gctgacaccgtgggcctgctggacccgttcaccaccgcagcccagatcagcgccctgcgtgacgtttggtcgggc 





gagatcgagatgcatgctcacaatgatctgggcatggctaccgcgaacacgctggcggcagtttcggctggcgcc 





acgtcggtgaacactaccgtcctcggtctgggtgaacgtgcaggcaacgcagccctggaaaccgttgcgctgggc 





ctggaacgctgcctgggcgtggaaaccggcgtccatttcagcgcgctcccagcgctctgtcagcgcgtcgcggag 





gctgcacagcgcgcaatcgacccgcaacagccgctggtgggtgaattggttttcacccacgaatctggtgttcac 





gttgcggcgctgctgcgcgacagcgaatcctatcaatctattgccccaagcctcatgggccgtagctaccgtctg 





gtgctcggcaagcattcgggtcgtcaggctgtcaacggtgttttcgaccagatgggttaccacctgaatgcggcg 





cagatcaatcagttgctgccggccattcgccgcttcgccgagaattggaaacgctctccgaaagactacgaactg 





gttgcgatctatgacgaattgtgcggtgaatccgcccttcgtgctcgcggctaaccgatagtttcaagagaaagg 





gagtagaaacagaatggagtggttttaccagattccgggtgtagacgaattgcgcagcgctgaatccttctttca 





gttcttcgcggttccataccagccggaactgctgggccgctgctcgcttccggtgttagcgacgttccaccgtaa 





actgcgtgcggaggtcccgctgcaaaaccgtctggaggacaatgatcgtgcgccgtggctcttggcgcgccgcct 





cctggccgaatcttatcagcagcaatttcaggagagcggcacctaattcaccagcccgaatcaatataggtcata 





caatgcgcccgaaattcaccttctctgaagaggtccgcgtagttcgcgcgattcgtaatgatggcaccgtggcgg 





gttttgcgccaggtgcgctgctggttcgtcgcggttcgacgggctttgtgcgtgactggggtgtgttcctgcaag 





accagatcatctatcaaatccactttccggaaaccgaccgcattatcggctgtcgcgagcaggagttaatcccga 





ttacccagccgtggttggctggtaacctccagtatcgtgacagcgtcacgtgccaaatggcactggctgtcaacg 





gtgacgtggttgtgagcgccggtcaacgtggccgtgtggaggccactgatcgtggcgaacttggcgattcctaca 





ccgtggacttcagcggccgttggttccgcgttccggtccaggccatcgcgctgattgaagagcgcgaagaataat 





cagagactgaagttattacccaggaggtctataatgaatccgtggcagcgctttgcccgtcaacgccttgctcgc 





agccgctggaaccgtgatccggctgctctcgacccagccgataccccagcgttcgagcaggcgtggcagcgtcaa 





tgccatatggaacaaaccatcgtagcgcgtgtcccggaaggcgatattccggctgccttactggaaaacatcgcg 





gccagcctggcgatctggctggacgagggtgacttcgctccgccggagcgcgctgcgattgtgcgtcatcatgca 





cgtctggagctggcgtttgccgacattgcccgccaggcaccgcaaccggatctgagcacggttcaagcgtggtat 





ctgcgtcaccagacgcaattcatgcgtccggagcagcgtctgacccgtcacctgctcctgacggtcgataatgat 





cgcgaggcggtgcatcaacgcatccttggcctgtatcgtcagatcaacgcgagccgtgacgccttcgccccactg 





gcacagcgccactctcattgcccgtccgccttggaagaaggccgtctgggctggatctcccgtggtctgctgtac 





ccgcagctcgaaaccgcgttgtttagcctggcggaaaacgcactgtcgctgccgattgcgtcggaattgggttgg 





cacctgttatggtgcgaggccattcgtccggcagccccgatggagccgcaacaggcccttgaatctgcgcgcgac 





tacttgtggcagcagagccagcagcgccaccagcgtcaatggctggagcagatgatttcccgccaaccgggcctg 





tgtggttaaTACCATAACCCGttggggcctctaaacgggtcttgaggggttttttgt 






Paenibacillus WLY78 nif cluster 



(SEQ ID NO. 3)



gtagggcgcattaatgcagctggactagtGAATTGAGGATAAATGTCAGGGATTTCATGGAGAAGTGAATTGACT 






GTATTTGTCCCTGTCTCTAAGATGTAATTATATTCCAGACAAAAACAGAGATTTATGTAAGGGAATATAACGTAG 





AGAGGAGGGAATGAATGGACTCTTTAGCTGATCTCTCGGAAACCCCCTTAGCATTAGAAACTCTCAGACGACATC





CCTGTTATAACGAAGAGGCACATCGCTATTTTGCGCGCATTCATCTTCCAGTAGCCCCGGCATGCAATATTCAGT





GCCATTATTGCAACCGCAAATTCGATTGCGTCAATGAAAGCCGTCCCGGCGTTGTTAGTGAACTGCTTACGCCGG





AGCAGGCGGCGAGCAAGACCTATGGCGTAGCGGCACAGCTGATGCAGCTGTCCGTTGTCGGCATTGCGGGACCTG





GAGATCCGCTGGCCAATGCGGAGGCAACCTTCGATACCTTCCGCCGGGTCCGTGAGACAGTTAAGGACGTCATAT





TCTGTCTCAGCACGAATGGCCTTACTTTGATCAGGCATATCGACAGGATTGTAGAGTTGGGTATTTCGCATGTCA





CGATCACGATCAATGCTGTAGATCCAGTGGTGGGGAGCCGCATTTATGGATGGGTCTACGATGAAGGAAAACGCT





ATGCGGGTGAGGAGGCCGCACGACTGTTGATTGACCGCCAGCTGGCAGGCTTGAAGATGCTGGCTTCGAGAGGTG





TATTGTGCAAGGTGAACTCGGTGCTGATTCCCGAAGTCAATGATGCCCATCTGCCGGAGGTAGCGAGGGTGGTCA





AGGAGCACGGCGCGGTGCTGCACAACATTATGCCGCTCATCATCGCACCTGGTAGCCGATATGAGCAGGAAGGGA





TGCGGGCACCCCGTCCCCGTCTGGTCCGGCAGCTGCAGGAGCAATGTGCTGAAGCGGGAGCTGTCATTATGCGCC





ATTGCCGTCAGTGCAGGGCGGATGCGATTGGACTGCTGGGCGAGGATCGCAATCAGGATTTTACATGGGAGAACA





TTGCTGCTGCTCCTCCCATGGATGAAGAGGCAAGGGCACAATTTCAGAAAGAACTGGATGAGAAGGTGAGAGTGA





GAATGGAACGCAAGGAGGGACAATCGCACCACAAACAACCGTCAACCGGTGCTGGTTGTAGCTGCCCGTTATCTG





GGGATAAGCCTGAAGCGAGCTTTACCTCAAAGCCGGTCTTAATCGCTGTGGCTAGTCGTGGTGGAGGGAAGGTGA 





ATCAGCATTTCGGTCGTGCCAAGGAATTTATGATCTATGAAAGCGACGGGACCATCGTAAATTTCATAGGCATTC 





GTAAGGTGCAATCCTACTGTCACGGGAAAGCCGATTGCAATGGAGATAAGGCCGAGACGATCAAGGAGATCCTTT 





CCATGGTACATGATTGTGCATTGCTCCTGTCGTCCGGCATAGGCGAAGCCCCCAAAGAGGCATTGCAGGAAGCGG 





GCGTGCTGCCTATTGTGTGCGGCGGCGATATTGAGGAGTCCGTTCTGGAATATCTAAAATTTCTGCGTTATATGT 





ATCCTGTGCAGACGGGTAAGGGAAGTAAGCGTAATAAGGGAGTTAAGGGCAATCATTCGGATTTACCCATTGAAC 





ATTTTGGAGGCTGAGAAAATATGAGACAAATTGCGTTTTACGCTAAGGGCGGTATCGGCAAATCGACAACCTCGC 





AGAATACACTGCCTCAACTTGCGACCAAATTCAAACAAAAAATTATGATCGTAGGCTGTGATCCCAAGGCAGACT 





CCACCCCTCTTATTTTGAATACCAAGCCCCAACACACCGTACTCCATCTCCCACCTCAAAGCCGTACCCTCCACC 





ACTTGCAACTGCAGCATCTTCTCCAGAACCCCTTCGCTGATATTCTCAACCTCGAATCCCGCCGCCCACACCCCC 





CTCTCCCCTCTCCACCACCCCCTATCATCACACCCATTAATTTTCTCGACCAAGACCCCCCCTACCAACGCCTCC 





ATTTCGTTTCCTACCATCTACTGGGGGACGTCGTGTCCGCGGCGTTCGCCATGCCGATCCGGGAGAACAACGCCC 





AGGAAATCTACATCGTATGCTCAGGCGAGATGATGGCTATGTACGCTGCCAACAATATTGCGCGCGGGATCTTGA 





AGTATGCCAACAGCGGCGGGGTGCGTTTGGGCGGCTTAATCTGCAACAGCCGGAATACGGACCTGGAAGCGGAAT 





TGATCACAGAGCTTGCAAGAAGATTGAACACGCAGATGATCCACTTTTTGCCGCGTGACAATGTTGTGCAGCACG 





CTGAGCTGCGCCGTATGACCGTTACCCAATATAACCCGGAACATAAGCAGGCTGCGGAGTATGAAGAGCTGGCAG 





GTAAGATTTTGAATAATGACATGCTAACGGTTCCCACGCCCATTTCCATGGAAGATCTGGAGGATCTATTGATGG 





AATTCGGCATTATTGAGGATGAAGAAACCGCAATTAACAAAGCTGAGGCGTCCGGGCAGTAGGCTGTAGCCAGAA 





GGCTTAATGACGGAACCATCGTGTAATGATGGGAGGAGCTGAACGCGCAGCTCGCAGGAGGGAGGAATAGGCCAA 





ATGAGCAGTATTGTGGATAAGGGTAAGCAGATCGTAGAGGAGATACTGGAGGTATATCCCAAGAAGGCCAAGAAG 





GATCGGACCAACCATTTTGAGATCGCGGATGAGGAGCTTGTGAACTGCGGAACCTGTTCCATCAAGTCCAACATG 





AAATCACGGCCTGGCGTCATGACAGCAAGGGGCTGTGCTTATCCAGGCTCCAAGGGTGTGGTATGGGGCCCGATT 





AAAGACATGGTGCACATTAGCCATGGTCCCATCGGCTGCGGACAGTACAGTTGGGGTACCCGACGCAATTATGCG 





AATGGGATATTGGGAATCGATAATTTTACCGCCATGCAGATTACAAGCAATTTTCAGGAAAAAGATATCGTGTTC 





GGTGGAGATAAGAAGTTGGAGGTGATCTGCAGGGAAATTAAGGAGATGTTCCCGCTGGCTAAGGGTATCTCCGTG





CAATCTGAATGTCCGGTCGGACTGATTGGTGATGATATCGGGGCCGTGGCCAAGAAGATGACAGAGGAGCTGGGC





ATTCCGGTCATTCCTGTACGCTGTGAGGGCTTTCGCGGGGTGAGTCAGTCTCTGGGCCATCACATTGCCAATGAT





GCTATCCGCGATTTTCTAATGGGGCGCCGAGAACTGAAGGAGTGCGGGCCTTATGATGTCTCCATTATCGGAGAC





TACAATATCGGCGGTGATGCCTGGGCGTCGCGCATTTTGCTGGAGGAAATGGGACTGCGGGTCATAGCGCAGTGG





TCGGGTGACGGTACGATCAATGAGCTGGGGATTGCGCATAAATCCAAGCTCAACCTGATCCATTGTCATCGTTCC





ATGAATTATATGTGCACAACAATGGAGCAGGAATACGGAATTCCCTGGATGGAATATAACTTCTTCGGCCCGACC





AAGACGATGGAGAGCCTCAGAGCGATTGCTGCCCGCTTCGACGAGACGATTCAGGAAAAATGTGAGCAGGTCATC





GCCCAATATATGCCGCAGATGGAGGCGGTCATCCGTAAATATCGCCCACGTCTGGAAGGTAAAAAGGTGATGCTT





CTGATTGGCGGGCTGCGGGCAAGGCATACCATCGGGGCCTATGAGGATCTGGGTATGGAAATTGTGGCTACAGGC





TATGAATTTGCCCATAAGGATGATTACGAAAAGACGTTTCCCGATGTAAAAGAAGGCACCATTCTGTACGATGAT





CCAACGGCATATGAGCTGGAGGAACTGGCCCAGCGGCTGAATATTGACTTAATGGGCGCCGGAGTCAAGGAGAAA





TACGTGTATCACAAAATGGGCATTCCCTTCCGTCAAATGCACTCCTGGGATTACAGCGGGCCTTATCATGGTTTT





GACGGCTTTAAGATTTTTGCACGTGATATGGATATGACCATAAACAGTCCAGTATGGAGCCTGCTGCCCTCACGG





CAGACTGCGGAGGTGCCGGTATGAGCGAGCGTCCGAATATTGTCGATCACAATCAGCTGTTTCGGCAGGATAAAT





ATGTGCGCCAGCGTGAAGAAAAACGAGCCTTCGAGGCCCCATGTTCGCCGGAGGAGGTTACCGACACCCTGGAGT





ACACCAAGACCAAGGAATACAAAGACAAGAATTTTGCCCGTACAGCCGTAGTCGTGAATCCGGCCAAGGCTTGTC





AGCCGCTGGGAGCGGTTATGGCTGCACTGGGCTTCGAAAAAACGCTCCCGTTCATTCATGGTTCACAGGGCTGTA





CGGCTTATTTTCGCAGTCATCTTGCCCGCCACTTCAAAGAGCCTGTTCCTGCCGTCTCCACCTCGATGACCGAGG





ATGCCGCCGTATTCGGCGGCATGCGCAACCTCATTGACGGTATAGAGAACTGCATTGCCTTGTATCAGCCGGAGA





TGATTGCGGTATGCACGACCTGTATGGCAGAGGTGATCGGGGATGATCTGTCTGCCTTCCTGGCCAATGCCCGTC





AGGAGGGAGTCCTTCCTGAGGATATGCCAGTTCCTTTTGCCAATACCCCCAGCTTCTCTGGTTCACATATTACAG 





GCTATGACGCCATGCTGCGCTCTGTACTGGAGACGCTGTATAACAAGTCAGGCCGGACGGCGCAGCCTGGTCATG





AATTGAAGCTGAATGTACTGCTCGGGTTTGACGGGTATACGGGCAATTTTGCGGAAATGCGGCGCATGCTGGGGA





TGTTCGGCGCTACGTATACCATTCTGGGTGACCACAGCAGTAATTTTGATTCAGGGGCCACTGGAGAGTACAGCT





ACTATTACGGGGGAACGCCGCTTGAGGATGTGCCTAAGGCCGCAGATGCTGCCGGCACGTTGGCGATTCAGCAGT





ACTCTCTTCGTAAAACACTAGGCTATATGAAGCAAACCTGGGGGCAGCAGGTGTCCTCCATCTCCACACCGCTGG





GCATCCGGGCTACAGATCGCTTGCTTGAGGAGATTAGCCGCCTGTCTGGAAGGGAAATTCCCGAGGCATTGAAGC





AGGAGCGCGCCCGAATTGTGGATGCCATGATGGATTCACATGCTTATCTGCACGGCAAACGAGTGGCTATGGCAG





GAGACCCGGACATGCTCATCGGCTTGATTGGCTTTTGTCTGGAGCTGGGCATGGAGCCGGTGCATATTGTTTGCT





CCAATGGGGACCGAAAATTTGAGAAGGAAGCAGAGCTTCTGCTGAAGTCCAGCCCTTACGGTGCAGAAGCCACGG





TTCATTCCGGTCAGGATTTGTGGCATATGCGTTCGCTGCTGTTCCAGGACCCGGTGGACCTGGCTATTGGCAGCT





CCCATCTGAAGTTTGCAGCGAAAGAGGCGGAAATTCCTTTGCTTCGTGTAGGCTTTCCGATCTTCGACAGGCATC





ATGTCATGGAGGAGCAGGCTCCGGATCATAGCTTTGATCTGGTGCGCTAATTGCTGTATCGCGTAGAAGGAAGTT





GACAGCTTGGCTTGTGATTTCAATGGATTCTATCTGAAATAAGGGGGTGTGTGGATGGAGCCGGCTGTGTCTAAC





GGAAGGCTGGAGGTATCCTGCGGCAATAAAATTCCCAAAAGCACGCCCTGTCCCCGGCCTGTGCCGGGAGAGGCT





TCGGGTGGCTGCTCCTTTGACGGGGCCCAGATTACACTGATCCCCATTGCAGATGCGGCTCATCTGGTGCACGGG





CCAATTGCGTGTCTCGGCAATAGCTGGGAGAGCAGAGGCAGTCTGTCCAGCGGCCCAGAGCTGTCGGCTTATGGC





TTCACTACTGATCTTGGAGAACAGGACATCATTTTTGGTAGTGAACAGAAGCTGCATGAATCGATCCGCTACATT





GTCAGCCGCTTTGCTCCTCCCGCTGTGTTTGTCTATACCACATGTGTCACAGCCCTCACTGGTGAAGATATCGAG





GGGGTTTGCAAGGCTGAATCGGAGCGGCTGGGGACGCCGATCATTCCGGTGAACAGTCCGGGATTTGTGGGCAGT





AAGAATCTCGGAACCCGGCTGGCCGGAGATGTGCTGTTCCAGCATATTATCGGCAGCACCGAGCCGGAACAGACA





ACCTCCCATGATATCAATCTCATTGGGGAATACAATATTGCGGGCGAGATGTGGCATATCGAGCGGCTGATGCAG





CAGGCGGGAATGAGTATCCTGTCCCGAATTACCGGGGACGGTCGGTTCCGCGAGGTGGGCTGGGCGCACCGTGCC





AAGGCCAACATGGTCGTATGCAGCCGGGCTTTGCTGGGTCTGGCAGTCCAAATGGAGCGTAAATACGGCATTCCT





TATTTTGAAGGTTCATTTTATGGAGCAAAGGAGACGAGTTATTCCTTGCGGCAGATGGCTTACCTGACCGGAGAT





CGTGATGTGGAGCGACGGGTGGATAAGCTGGCCGCACGGGAGGAAATGAGGCTATCGCTGGAGCTGGAGCCCTAC





CGCAAGCAGCTGAAAGGAAAGCGGGCAGTGCTCTATACCGGGGGTGTAAAGAGCTGGTCTGTCATTACGGCTTTG





CAGGAGCTGGGCATAAAGGTGGTTGGTGTAGGCACGAACAAGAGCACTGCCGAGGATGTATCCCGGATTGCTGAC





CGTATCGGGGATGATGCAGAATACATCCCGGAAGGAGGCGCCCGGCAGATTCTCAAGACCGTACGGAGCCGCAAG





GCAGACATGGTCATTGCCGGAGGCCGGAACATGTATATGGCGCTTAAGGAACAGATTCCTTTTGTGGACATCAAT





CAAGAGCGGCACAAAGCCTATGCGGGCTATGACGGGCTGTTGTCTCTGGCGAAACAGCTTGTGCATACGCTGCAG





CATCCAGTATGGGAGCTGACCGCCAAATTGGCTCCATGGGAGGAGGAGACGGAATTTGCTGATTAAATCCGCCAC





GAAGCCTGTCAGTGTCAACCCGCTCAAGGTAGGACAGCCTTTGGGCGGCGTGCTGGCTCTGCAGGGGATGTATCG





CTCAATGCCTTTGCTGCACGGCGCTCAGGGCTGCTCGGCCTTCTCCAAGGCGCTGCTGACTCGCCATTTTCGAGA





GCCGATTGCCGTTCAGACCTCTGCGTTGCAAGAGATGGACGTTATATTTGATGCAGACCGGAATCTGGAGGAGGC





GCTGGATCATATCTGGTCCAAACACCATCCAGATGTCATCGGCGTTATCAGCACGGCCCTCACTGAGGTGGCAGG





CGTTGACTTTCAGTCTAGGGTAAAGGCGTTCAAGCGAGAACGGGCATTGAAGGACAGTCTGCTGTTTTCTGTATC





GCTGCCTGATTTTCACGGCTCTCTGGAGACGGGCTACAGCAGTACAGTAGAGTCACTAATGGATGCCGTACTCGG





GTTGGCCGGGGGCAAGTCCCCCAAAAAACAGCGCCGGACGCAGGTCAATCTGCTGCCGGCTTCTTATCTGACTGC





CGGAGATGTCATGGAAATCAAGGATATTATCGCTTCCTTCGGCCTGGAGGTTATTACGCTCCCCGATATTTCCAC





TTCCTTGTCCGGTCACCTGCTGACAGGCTTTTCCCCTTTGACGAGAGGGGGGACTCCGCTGGATTCAGCCTGCCA





GATGCTGGAGTCTTCCTGCACCATTGCCATTGGCGCGAGCATGGAGCGTCCGGCGCGCAGGCTGACTCATGCTGC





AGGTATTCCCTACCACTTGTTCGCTGGTCTGTCTGGCTTGGCCGCGAGTGATGCGTTCATACATTTTCTGCAGAA





AATCAGCCGCGAGCCAGCCCCCGTTCGCTTCCGTTGGCAGCGTGAAAATCTGTTGGACAGCATGCTGGATGCCCA





TTTCTATTATTCTGGCGCTTCGGCTGTAGTGGCGCTAGAACCGGATCATATGCTGTCGACCGCAGCCTGGCTGGA





GGAGATGGGAGTGGAACTGAAGCGGCTAATTACACCCTGCAGCACGCCCGCACTGCAAAAGACAGAACGGGAAGT





GGAGATGGGAGTGGAACTGAAGCGGCTAATTACACCCTGCAGCACGCCCGCACTGCAAAAGACAGAACGGGAAGT





CTGGATCGGTGACCTGGATGATGCAGAGGAGAGCGCGCAGGGTGTTGATTTGTGGATCAGCAACTCACATGGAAG





AAAGGGAGCGGCACGGGCTGGGGCCTCATTCGTACCGGCAGGCTTGCCGGTGTATGACGAGCTAGGCGCCCACAC





ATCCGTAAGCGTCGGATACCGTGGAACCATGGAGTGGGTGAACAAAGTAGGCAATGTATTGCTTGCCGAGAGGGG





GAGGGGAGGATGAAGGTTGCATTTGCGACGGAAGACGGCGTGCTTGTGAATGCTCATTTTGGGCAGAGTCCCATG





TTCACTATATTCGAAATCCGGCACTCAGGCGTCCAGTTCCTGGAGCATCGGCGGATAGCCCTGGGGAGCGATGAG





AATGAGGCGGGCAAGATCGCCAGCCGAATTGGCCTGATCGAGGATTGTGCCTTGATCTTCCTGGTACAGATTGGC





GCTTCCGCCGCCGCACAGGTTACCAAGCGGACCATTATGCCTGTGAAGGTGGCCTTCGGTAGCACCATTGAGGAG





CAGGTCCAGCGTCTCCAGAATATGCTGACTCGCAATCCGCCCATGTGGCTTGCCAAAATCCTGCATGCTGAGGAG





GGCAGCGGCAAAGCCGAATCATGAGCCCTCCTGTAAGGAAGAGCAACCATATAGGGTATTAAGATCCTGCAGACC





GAATATCTTAAAGGCGGGAGCCGCACATGGAGGGGGTGGACGAATGGTACAACTGCTGGAAGACAGTAGATACGG





ACGCCAGTTGAAGCTGCTGGGAGTGGAAGGTCAGAACAGGCTAAAGCAGGCTACGGTTATGGTTGCAGGCATCGG





AGGATTGGGAGGGGCAGCGGCCATGTACCTGGCCGCTGCCGGAGTAGGAAAGCTGATATTGGCCCATGAGGGCGT





AATCCATCTGCCCGATATGAACCGGCAGGTGCTGATGGACAGCGGACGAATCGGGGAGGAACGGATGGAGACGGC





ATTACAGCATTTGCATCGTATCAATCCGGAGACCGAGCTTGAGGGCCACGCCCACAGAATCACGGAAGAATCCTC





TGGACCATGGGTAGAAGCGTCGGATATCGTGATTGATGCACGATATGACTTTCCGGAAAGATATGCGCTGAACAG





ACTATGTGTTCGACATGGAAGACCGATGATAGAAGCGGCCATGTACGCCTATGAAGTATCATTGATGACCATTGA





TCCCGGTAAGACGGCATGCCTGGAATGTCTTTACCCGGAAGGCGGACAGCCTTGGGAACCTCTGGGATTCCCGGT





CCTGGGAGCCACCTCCGGCTTGATTGGCTGCATGGCTGCACTGGAAGCGGTCAAATGGATTACAGATGCGGGCAA





TCTGTTCACTGACCGCATGTACCGTATGAATGTGCTGGATATGAGCAGCTGCACCATAGCGGTCAAACGCAACCC





GCGTTGTCCGTGCTGCGGAACGGGAGGGGATACAGATGAGTCGGTTGCATATTTGTGATACGACACTTCGTGACG





GAGAACAGGCTCCGGGCGTTGCCTTTTCAGCCGAGGAAAAAACTGAAATTGCCATCATGCTGGACTCGGCGGGGG





TGGAGCAGGCTGAGATCGGAATTCCGGCAATGGGAAAGACGGAGTGCAGGTCTATTGCCAGGATTGCTGCTCTCG





GACTTCAGATGAAGCTAATGACCTGGAATCGCGCGGTGTTCACGGATATTGATGCAACTGAATCGACAGGTGTCG





GCTGGGCCCATATTTCGGTTCCCGTGTCGACGGTGCAGATGAAGTCCAAGCTGGGTATGAATCCTGAGCAGGTGA





CGGAGCTGATCCGCAAGTCTGTCGATTACGCTCTGTGTAAAGGATTGACTGTTTCCGTAGGCTTTGAGGATGCTT





CAAGGGCAGATGACCTGTTCCTTGAGCAGTTGGCGAATCAGCTCTATAGGGATGGCATCCGGCGCTTCAGATATG





CCGATACGCTGTCCGTTCACCATCCCGCTGCCATAGCTGCCCGTATAGACAGGCTTGTATCGCGCGTGCCACAGG





ATGTGGAGCTTGAGATTCACTGTCATAATGATTATGGCCTGGCGCTTGCCAATACCCTGGCAGCTTTGCAAGCGG





GAGCTGTCTGGGCCAGTACCACGGTGTCGGGACTTGGGGAAAGGGCAGGTAATACCGCGCTGGAGGAGGTGGTGA





TGTCGTGGAGGGACCTATATCAAGGAACCTGCAGCGTCCGTCCCGAACTGCTGAACCCGCTGGCTGCACTGGTGT





CCAAAGCCTCCAACCGAATCATTCCTGAAGGCAAGCCCATTGTGGGAGACATGGTATTCGCCCATGAATCCGGCA





TACATATCAACGGTCTGCTAAAGGAGCGCGCCGCCTATCAGGCGCTTGATCCGACTGAGCTGGGCACTGACCATT





CCTTCGTACTCGGCAAGCATTCGGGCAGAAGTGCAGTTCAATATATGCTGGAGCAGGAAGGAATCGAGGCAGGCT





CCGGTGAAATCAAGTTCCTGCTGGAGCGGCTTCGCCTAGTCGGTGAAGATCCCAAGCGTGTCATCCATAGCGCGG





ATTTAAGACGCTGGCTGCAGTATTATCCGGCAGAGCTGCCGAAATAACCGAAAAAGCGTTCCCGTCCGGTAAGTG





TGACCGTGACTGGAACGCTTT 






Klebsiella oxytoca M5a1 nif cluster 



(SEQ ID NO. 4)



GAATTCTAGACTGCTGGATACGCTGCTTAAGGTCATGCAGCAGGAGAACTAAAGGCCCGCTACTCCTCGCCGGCC






AGCCGCCGATACTGGGCAAAGCGGGCCCGCGCGTCCTCCTCGGTTCGGCTAAAGAGCGCATCCGCCAGATGCGGC





GTCGTTTTGTGCAGCGAGGCGTAGCGCACTTCGCCAAGCAAAAAGTCGCGGAAGCTCTCCTCCGGCTCTTCGGAA





TCGAGCATAAACGGCGTCTTACCTTCCGCTTCCCGCTGCGGATGATAGCGCCACAGGTGCCAGTATCCCGCCTCA





ACCGCCCGTTTCGCCTCGCGCTGGCTGCAGCGCATACCGGCTTTCAGCCCGTGGTTAATGCAGGCGGCGTAGGCA





ATCACCAGCGACGGTCCCGGCCAGGCTTCGGCCTCGGCGATCGCCCGTAGGGTCTGATCTTTATCAGCGCCCATC





GCGACCTGGGCCACGTACACATTGCCGTAGCTCATCGCCATCATGCCGAGATCTTTTTTCCGCGTGCGTTTGCCC





TGCGCGGCAAACTTCGCGATGGCCGCCACCGGGGTCGATTTAGACGACTGGCCGCCGGTATTGGAGTAAACCTCG





GTGTCAAACACCAGAATATTGACGTCTTCCCCGCTCGCCAGCACGTGATCGAGACCGCCGAAGCCGATATCGTAG





GCCCAGCCGTCGCCGCCGAAAATCCACTGCGAACGACGAACAAAATAGTCGCGGTTCTGCCACAGCTGCTCCAAC





AGCGGCACGCCCTCTTTTTCCGCCGCCAGCCGTTCGCTGAGCCGGTCCGCGCGCTCGCGGGTGCCCTCGCCTTCA





TCCTGCTTCGCCAGCCACTGGCGCATTGCGTCGCTAAGTTCGTCGCTGACCGGTAGCGCCAGCGCGGCGGTCATA





TCATCGGCGATTTGTTGACGCACCGCCTGGCCGCCGAGCATCATGCCGAGGCCAAACTCCGCATTATCCTCAAAC





AGCGAGTTCGCCCATGCCGGGCCATGGCCGCGGTGGTTGGTGGTATAGGGAATCGACGGCGCGCTGGCTCCCCAG





ATAGAAGAGCAGCCGGTGGCGTTAGCGATCAGCATCCGGTCGCCAAACAGCTGGGTTATCAGGCGGGCATAAGGC





GTTTCACCGCATCCCGCGCAGGCGCCGGAAAACTCCAGCAGCGGGGTTTCAAACTGGCTGCCTTTGACCGTCGTC





TTACGAAACGGATTGCTCTTCGGCGTCAGCGCCAGCGCATAGTCCCAGACCGGCGCCATCTGACGCTGGCTATCG





AGAGACTGCATTTTTAACGCCTTGCCGCGCGCGGGACAGATATCCACGCAGTTGCCGCAGCCGGAACAATCCAGC





GGCGAGATAGCCAGATGGTAGTGATACTCCTTCGCTCCCTGCGCGGGTTTGCTCAGCAGCCCAACCGGCGCGGCG





TCATGCTCTTCGCCGTTGAGCAGCGCCGGGCGGATCGCCGCATGCGGGCAGATAAAGGCGCACTGGTTACACTGC





GTGCAGCCCTCCGGCTGCCAGACCGGCACTTCCAGCGCGATCCCGCGTTTCTCCCACGCGGCGGTGCCCGAAGGA





AAGGTCCCGTCCTCCATACCGACGAACGCGCTCACCGGCAGCTGGTCGCCGCACTGGCGGTTCATCGGCTGCAGA





ATATCGCGGATGAAATCCGGCATCATGGCTGATGCTTGCGCCGCGGGTTCATCCAGCGTCGCCCAGTGCGCCGGA





ATCGTCACCTGATGCAGCGAGGCCATGCCCAGCTCGATCGCCCGCTGGTTCATCTCAATCACCGCCGCCCCTTTG





CTGCCGTAGCTTTTTTCAACCGCCTGCTTGAGGTAATCCGCCGCGGTCTGCGGGTCGATAATCGCCGCCAGCTTA





AAGAACGCCGCCTGCATCAGCATATTAAAGCGCCCGCCCAGCCCGAGCTCGCGGGCGATATCCACGGCGTTCAGG





GTATAAAAATGGATATTTTCCCGCGCCAGATAGCGTTTAAAGCCGACCGGCAGATGCTGCTCCAGCTCCGCATCG





GACCAGCTGCAGTTGAGTAAAAAGGTCCCGCCCGGCTTTAATCCGTCCAGCAGATCGTAGCGCTCAACGTAGGAC





TGCTGCGAACAGGAGATAAAATCGGCCCGATGGATCAGGTAGGGCGAATTGATCGGCCGGTCGCCGAAGCGTAAA





TGTGAAACGGTAATGCCGCCGGATTTTTTCGAGTCATAAGAAAAGTAGGCCTGCGCGTAGAGCGGCGTTTTATCG





CCGATAATTTTGATCGCGCTTTTATTGGCCCCGACGGTGCCGTCCGAGCCCATGCCCCAAAATTTACAGGCGGTG





ATGCCGTCATGCGAGACCGCCAGCGTCTGCTGGCGCGGCGGTAACGAAGTAAAGGTTACATCATCGACAATCCCG





AGGGTAAACCCGTCCATCGGCAGCGGTTTATTGAGGTTATCAAAGACGGCCGCGATATCGTTGGGCAGAACATCC





TTCCCGCCAAGCGCATAGCGGCCGCCGACGATTAGCGGCGCATCGTCGTGGTGGTAGAAGGCGTTTTTCACATCC





AGGCACAGCGGTTCAGCCTGAGCGCCGGGCTCTTTGGTACGGTCAAGGACGGCAATCCGCTGCACGGTTTTCGGC





AGCTGGGCGAAGAAGTGGGCCAGCGAAAAAGGGCGAAACAGATGCACGCTGAGCAGCCCGACCTTCTCTCCCGCC





GCGTTCAGCGTATCCACCACTTCCTGAACGGTATCGCAGACCGATCCCATTGCGATAATCACCCGTTCGGCATCC





GCCGCGCCGGTATAGTTAAACAGATGATACTCCCGGCCGGTGAGCGCGCTGATTTGCGTCATATAGCTTTCGACA





ATGTCGGGCAGCGCCTGATAAAAACGGTTGCCCGCCTCCCGCTCCTGGAAGTAGATATCCGGGTTCTGCGCCGTT





CCGCGGATGACCGGATGATCCGGATGCAGCGCGTTACGGCGGAAGCTGTCGAGCGCGGGCCGGTCCAGCAGCGTC





GCCAGCTGCTCATATTCCAACACCTCGATTTTTTGAATTTCGTGCGAGGTGCGAAAACCGTCGAAGAAGTTAACA





AACGGGATGCGTCCCTTAATCGCCGCCAGATGCGCCACCGCCGACAAATCCATCACCTGCTGCACGTTGTTCTCC





GCCAGCATCGCGCAGCCGGTCTGGCGGACCGCCATCACATCCTGGTGATCGCCAAAAATATTCAGCGAATTGGTC





AGCAGCAGCCCCTGGGAGGCCGTATAGGTGGTGGTGAGCGCCCCGGCCTGCAGCGCGCCGTGGACCGCGCCTGCC





GCGCCGGCCTCCGACTGCATCTCCATTAAGCGCACCGGCTGGCCAAAAAGGTTCTTTTTCCCCTGCGCCGCCCAC





TCGTCGACGTTTTCCGCCATCGGCGTGGAGGGGGTTATGGGGTAAATCGCCGCGACCTCGGTAAAGGCATAAGAG





ATCCAGGCCGCCGCGGCGTTGCCATCCATTGTTTTCATTTTTCCGGACATTGTTCAATCCTCGAAGGTGAGAGGC





ATCTTCGCCGCCTCAAATAAGCGGCAAACCCAGTTGTTGCCTCAAGCACAGCCTGTGCCAGCTCGCGGATGACAG





AAGAGTTAGCGCGAATTCAACGCGTTATGAAGAGAGTCGCCGCGCAGCGCGCCAAGAGATTGCGTGGAATAAGAC





ACAGGGGGCGACAAGCTGTTGAACAGGCGACAAAGCGCCACCATGGCCCCGGCAGGCGCAATTGTTCTGTTTCCC





ACATTTGGTCGCCTTATTGTGCCGTTTTGTTTTACGTCCTGCGCGGCGACAAATAACTAACTTCATAAAAATCAT





AAGAATACATAAACAGGCACGGCTGGTATGTTCCCTGCACTTCTCTGCTGGCAAACACTCAACAACAGGAGAAGT





CACCATGACCATGCGTCAATGCGCTATTTACGGTAAAGGCGGTATCGGTAAATCCACCACCACGCAGAACCTCGT





CGCCGCGCTGGCGGAGATGGGTAAGAAAGTGATGATCGTCGGCTGCGATCCGAAGGCGGACTCCACCCGTCTGAT





TCTGCACGCCAAAGCACAGAACACCATTATGGAGATGGCCGCGGAAGTCGGCTCGGTCGAGGACCTCGAACTCGA





AGACGTGCTGCAAATTGGCTACGGCGATGTGCGCTGCGCGGAATCCGGCGGCCCGGAGCCAGGCGTCGGCTGCGC





GGGACGCGGCGTGATCACGGCGATCAACTTTCTTGAAGAAGAAGGCGCCTACGAGGACGATCTCGATTTCGTGTT





CTATGACGTGCTCGGCGACGTGGTCTGCGGCGGCTTCGCCATCAAGATCCGCGAAAACAAAGCCCAGGAGATCTA





CATCGTCTGCTCCGGCGAAATGATGGCGATGTACGCGGCCAACAATATCTCCAAAGGGATCGTTAAATACGCCAA





ATCCGGCAAGGTGCGCCTCGGCGGCCTGATCTGTAACTCACGTCAGACCGACCGTGAAGACGAACTGATTATTGC





CCTGGCGGAAAAGCTCGGTACCCAGATGATCCACTTTGTGCCCCGCGACAACATCGTGCAGCGCGCGGAGATCCG





CCGCATGACGGTTATCGAGTACGACCCCGCCTGTAAACAGGCCAACGAATACCGCACCCTGGCGCAGAAGATCGT





CAACAACACCATGAAAGTGGTGCCGACGCCCTGCACCATGGATGAGCTGGAATCGCTGCTGATGGAGTTCGGCAT





CATGGAAGAGGAAGACACCAGCATCATTGGCAAAACCGCCGCCGAAGAAAACGCGGCCTGAGCACAGGACAATTA





TGATGACCAACGCAACGGGCGAACGTAATCTGGCGCTGATCCAGGAAGTCCTGGAGGTGTTCCCGGAAACCGCGC





GAAAAGAGCGCAGAAAGCACATGATGGTCAGCGATCCGGAAATGGAGAGCGTCGGCAAGTGCATTATCTCTAACC





GCAAATCACAACCCGGCGTAATGACCGTACGCGGCTGCGCCTACGCCGGTTCCAAAGGGGTGGTATTTGGGCCGA





TTAAGGATATGGCCCATATTTCGCACGGACCGGTCGGCTGCGGCCAGTATTCCCGCGCCGGACGACGCAACTACT





ACACCGGAGTCAGCGGCGTCGATAGCTTCGGCACGCTGAACTTCACCTCTGATTTTCAGGAGCGCGACATCGTCT





TCGGCGGCGATAAAAAGCTCAGCAAGCTGATTGAAGAGATGGAGTTGCTGTTCCCGCTCACCAAAGGGATCACCA





TTCAGTCGGAATGCCCGGTGGGGCTGATCGGTGATGATATCAGCGCGGTGGCCAACGCCAGCAGCAAGGCGCTGG





ATAAACCGGTGATCCCGGTACGCTGCGAAGGCTTTCGCGGCGTGTCGCAGTCTCTGGGGCACCATATCGCCAACG





ACGTGGTGCGCGACTGGATCCTGAACAATCGCGAAGGACAGCCGTTTGAAACCACCCCTTACGATGTGGCGATCA





TCGGCGACTACAACATCGGCGGCGACGCCTGGGCCTCGCGCATTCTGCTGGAAGAGATGGGGCTACGGGTAGTCG





CGCAGTGGTCCGGCGACGGCACGCTGGTGGAGATGGAGAATACCCCATTCGTCAAGCTGAACCTGGTTCACTGCT





ACCGTTCGATGAACTATATCGCCCGCCATATGGAGGAGAAACATCAGATTCCGTGGATGGAGTACAACTTCTTCG





GGCCGACCAAAATCGCCGAATCGCTGCGCAAAATCGCCGACCAGTTCGACGATACCATTCGCGCGAACGCCGAAG





CGGTGATCGCCCGGTATGAGGGGCAGATGGCGGCGATTATCGCCAAATATCGCCCGCGCCTGGAGGGGCGTAAGG





TGCTGCTCTATATGGGCGGCCTGCGGCCGCGCCACGTTATTGGCGCCTATGAGGATCTCGGGATGGAGATCATCG





CCGCCGGCTACGAGTTTGCCCATAACGATGATTACGACCGCACCCTGCCGGATCTGAAAGAGGGCACGCTGCTGT





TCGATGACGCCAGCAGCTACGAGCTGGAAGCGTTCGTCAAGGCGCTGAAGCCCGACCTTATCGGCTCCGGCATCA





AGGAAAAATATATCTTCCAGAAAATGGGCGTGCCGTTCCGCCAGATGCACTCGTGGGACTATTCCGGCCCGTACC





ACGGCTACGATGGTTTCGCCATTTTCGCCCGCGATATGGATATGACCCTGAACAACCCGGCGTGGAACGAACTGA





CCGCTCCGTGGCTGAAGTCTGCGTGATTGCCCACTCACTGTCCCGTCTGTTCACCGATTTGTGGCGCGGGAGGAG





AACACCATGAGCCAAACGATTGATAAAATTAATAGCTGTTATCCGCTATTCGAACAGGATGAATACCAGGAGCTG





TTCCGCAATAAGCGGCAGCTGGAAGAGGCGCACGATGCGCAGCGCGTGCAGGAGGTCTTTGCCTGGACCACCACC





GCCGAGTATGAAGCGCTGAATTTCCAGCGCGAGGCGCTGACCGTTGACCCGGCGAAAGCCTGCCAGCCGCTTGGC





GCGGTGCTTTGCTCGCTGGGATTTGCCAACACCCTGCCGTATGTGCACGGCTCTCAGGGGTGCGTGGCCTACTTT





CGCACCTATTTTAACCGCCATTTCAAAGAGCCGATCGCCTGCGTCTCCGACTCGATGACCGAAGACGCGGCGGTC





TTCGGCGGCAACAACAATATGAACCTGGGCCTGCAGAACGCCAGCGCGCTGTACAAACCGGAGATCATTGCGGTG





TCCACCACCTGCATGGCGGAAGTTATCGGCGATGACCTGCAGGCGTTTATCGCCAACGCTAAAAAAGATGGCTTC





GTCGACAGCAGCATCGCCGTGCCCCACGCCCATACGCCAAGCTTTATCGGCAGCCACGTCACCGGCTGGGATAAC





ATGTTTGAAGGCTTCGCCAAAACCTTCACTGCGGACTACCAGGGGCAGCCGGGCAAATTGCCGAAGCTCAATCTG





GTGACCGGCTTTGAAACCTATCTCGGCAACTTCCGCGTATTAAAGCGGATGATGGAACAGATGGCGGTGCCGTGC





AGCCTGCTCTCCGATCCGTCGGAAGTTCTCGACACGCCCGCCGACGGCCACTATCGGATGTATTCCGGCGGCACC





ACGCAGCAGGAGATGAAAGAGGCCCCTGACGCCATCGATACGCTGCTCCTGCAGCCGTGGCAGCTGCTGAAGAGC





AAAAAAGTGGTGCAGGAGATGTGGAACCAGCCCGCCACCGAGGTCGCCATTCCGCTGGGGCTGGCCGCCACCGAT





GAACTGCTGATGACCGTCAGCCAGCTTAGCGGCAAGCCGATTGCCGACGCCCTCACCCTTGAGCGCGGCCGGCTG





GTTGACATGATGCTCGACTCCCACACCTGGCTGCACGGCAAGAAGTTTGGCCTGTACGGCGATCCGGACTTCGTG





ATGGGCCTCACCCGCTTCCTGCTGGAGCTGGGCTGCGAGCCAACGGTGATCCTGAGCCATAACGCCAACAAACGC





TGGCAAAAAGCGATGAACAAAATGCTCGATGCCTCGCCGTACGGGCGCGATAGCGAAGTGTTTATCAACTGCGAT





TTGTGGCACTTCCGTTCGCTGATGTTCACCCGTCAGCCGGACTTTATGATCGGCAACTCCTACGGCAAGTTTATC





CAGCGCGATACCCTGGCGAAGGGTAAAGCCTTTGAAGTGCCGCTTATCCGCCTCGGCTTTCCGCTGTTCGACCGC





CACCATCTGCACCGCCAGACAACCTGGGGTTATGAAGGGGCGATGAACATTGTGACGACGCTGGTGAACGCCGTG





CTGGAGAAACTGGATAGCGATACCAGCCAGCTGGGCAAAACCGATTACAGCTTCGATCTCGTCCGTTAACCATCA





GGTGCCCCGCGTCATGCGGGGCCAGGAGGGAGTATGCCCATCGTGATTTTCCGTGAGCGCGGCGCGGACCTGTAC





GCCTATATCGCGAAACAGGATCTGGAAGCGCGAGTGATCCAGATTGAGCATAACGACGCTGAACGCTGGGGCGGC





GCGATTTCGCTGGAGGGGGGACGCCGCTACTACGTGCATCCGCAGCCGGGGCGTCCCGTCTTTCCGATAAGCCTG





CGCGCGAGGCGCAATACCTTGATATAAGGAGCTAGTGATGTCCGACAACGATACCCTATTCTGGCGTATGCTGGC





GCGATTTCGCTGGAGGGGGGACGCCGCTACTACGTGCATCCGCAGCCGGGGCGTCCCGTCTTTCCGATAAGCCTG





CGCGCGAGGCGCAATACCTTGATATAAGGAGCTAGTGATGTCCGACAACGATACCCTATTCTGGCGTATGCTGGC





GACGCCAGAGCGTCTGGCGACCCTGACCCAGCCGCAGCTGGCCGCCAGCTTTCCCTCCGCGACGGCGGTGATGTC





CCCCGCTCGCTGGTCGCGGGTGATGGCGAGCCTGCAGGGCGCGCTGCCCGCCCATTTACGCATCGTTCGCCCTGC





CCAGCGCACGCCGCAGCTGCTGGCGGCATTTTGCTCCCAGGATGGGCTGGTGATTAACGGCCATTTCGGCCAGGG





ACGACTGTTTTTTATCTACGCGTTCGATGAACAAGGCGGCTGGTTGTACGATCTGCGCCGCTATCCCTCCGCCCC





CCACCAGCAGGAGGCCAACGAAGTGCGCGCCCGGCTTATTGAGGACTGTCAGCTGCTGTTTTGCCAGGAGATAGG





CGGGCCCGCCGCCGCGCGGCTGATCCGCCATCGCATCCACCCGATGAAAGCGCAGCCCGGGACGACGATTCAGGC





ACAGTGCGAGGCGATCAATACGCTGCTGGCCGGCCGTTTGCCGCCGTGGCTGGCGAAGCGGCTTAACAGGGATAA





CCCTCTGGAAGAACGCGTTTTTTAATCCCTGTTTTGTGCTTGTTGCCCGCTGACCCCGCGGGCTTTTTTTCGCGT





ATGGACGCTCTTCCCCACGTTACGCTCAGGGGAATATTCCGTTCACGGTTGTTCCGGGCTTCTTGATGCGCCTAA





CCCCCTCGCTGCCAGCCTTTCATCAACAAATAGCCATCCCAGCGCGATAGGTCATAAAGCATCACATGCCGCCAT





CCCTTGTCCGATTGTTGGCTTTGTCGCAAAGCCAACAACCTCTTTTCTTTAAAAATCAAGGCTCCGCTTCTGGAG





CGCGAATTGCATCTTCCCCCTCATCCCCCACCGTCAACGAGGTCACTATGAAGGGAAATGAAATTCTGGCGCTGC





TGGATGAACCGGCCTGTGAACACAACCATAAACAAAAATCCGGCTGCAGCGCGCCCAAACCCGGCGCCACCGCCG





GCGGCTGCGCGTTCGACGGCGCGCAGATAACCCTGCTGCCCATCGCCGACGTGGCGCATCTGGTCCACGGCCCCA





TCGGCTGCGCCGGAAGCTCATGGGATAACCGCGGCAGCGCCAGCTCCGGCCCCACCCTTAATCGGCTCGGGTTCA





CCACCGATCTCAACGAACAGGACGTGATTATGGGCCGCGGCGAACGCCGCTTGTTTCACGCCGTGCGCCATATCG





TCACCCGCTATCATCCGGCGGCGGTCTTTATCTACAACACCTGCGTACCGGCCATGGAGGGCGATGACCTGGAAG





CGGTATGCCAGGCCGCGCAGACCGCCACCGGCGTACCGGTTATCGCTATTGACGCCGCCGGTTTCTACGGCAGTA





AAAATCTCGGTAACCGGCTGGCGGGCGACGTCATGGTCAAACGGGTCATCGGCCAGCGCGAGCCCGCCCCCTGGC





CGGAGAGCACGCTCTTTGCCCCGGAGCAGCGTCACGATATTGGCCTGATTGGCGAATTCAATATTGCCGGCGAGT





TCTGGCATATTCAGCCGCTGCTCGACGAACTGGGGATCCGCGTGCTCGGCAGCCTCTCCGGTGATGGCCGCTTCG





CCGAGATCCAGACCATGCACCGGGCGCAGGCCAATATGCTGGTCTGCTCGCGGGCGTTAATTAACGTCGCCAGAG





CCCTGGAGCAGCGCTACGGCACGCCGTGGTTCGAAGGCAGCTTTTACGGGATCCGCGCCACCTCTGACGCCCTGC





GCCAGCTGGCGGCGCTGCTGGGCGACGACGACCTTCGCCAGCGCACCGAAGCGCTGATTGCGCGGGAGGAACAGG





CGGCGGAACTGGCGCTACAGCCGTGGCGCGAACAGCTGCGCGGCCGCAAAGCGCTGCTCTATACCGGCGGGGTGA





AATCCTGGTCGGTGGTATCGGCGCTGCAGGATTTGGGCATGACCGTGGTGGCAACCGGCACGCGTAAATCCACCG





AAGAGGATAAACAGCGGATCCGCGAGCTGATGGGCGAAGAGGCGGTAATGCTGGAAGAGGGCAACGCCCGCACGC





TgctggatgtggtctATCGCTATCAGGCCGACCTGATGATTGCCGGCGGACGCAATATGTACACCGCCTATAAAG





CCAGGCTGCCGTTTCTCGATATCAATCAGGAGCGCGAACACGCCTTCGCTGGCTATCAGGGGATCGTCACCCTCG





CCCGCCAGCTGTGTCAGACCATCAACAGCCCCATCTGGCCGCAAACCCATTCTCGCGCCCCGTGGCGCTAAGGAG





CTCACCATGGCAGACATTTTCCGCACCGATAAGCCGCTGGCGGTCAGCCCCATCAAAACCGGCCAGCCGCTCGGC





GCAATCCTCGCCAGCCTCGGGATCGAACACAGCATCCCTCTGGTCCACGGCGCGCAGGGGTGCAGCGCCTTCGCC





AAAGTCTTTTTTATTCAACATTTCCACGACCCGGTTCCCCTGCAGTCGACGGCGATGGACCCCACGTCGACGATT





ATGGGCGCGGACGGCAATATTTTTACCGCCCTGGATACCCTCTGCCAGCGCAACAATCCGCAGGCTATCGTACTG





CTCAGCACCGGGCTGTCGGAGGCCCAGGGCAGCGATATTTCCCGCGTGGTTCGCCAGTTTCGCGAAGAGTATCCC





CGGCATAAGGGGGTGGCGATATTGACGGTTAACACGCCGGATTTTTATGGCTCCATGGAGAACGGCTTCAGCGCG





GTGTTAGAGAGCGTCATTGAGCAGTGGGTGCCGCCGGCGCCGCGCCCGGCTCAGCGCAATCGCCGGGTCAATCTG





CTGGTCAGCCATCTCTGTTCGCCGGGCGATATCGAGTGGCTGCGCCGATGCGTCGAAGCCTTTGGTCTGCAGCCG





ATAATCCTGCCGGACCTGGCGCAATCGATGGACGGCCACCTGGCGCAGGGCGATTTCTCGCCGCTGACCCAGGGC





GGGACGCCGCTGCGCCAGATAGAGCAGATGGGGCAAAGCCTGTGCAGCTTCGCCATTGGCGTCTCCCTTCATCGC





GCCTCATCGCTGCTGGCCCCGCGCTGCCGCGGCGAGGTTATCGCCCTGCCGCACCTGATGACCCTCGAACGCTGC





GACGCCTTTATTCATCAACTGGCGAAAATTTCCGGACGCGCCGTTCCCGAGTGGCTGGAACGCCAGCGCGGCCAG





CTACAGGATGCGATGATCGACTGCCATATGTGGCTCCAGGGCCAGCGCATGGCGATAGCGGCGGAAGGCGATTTG





CTGGCGGCGTGGTGTGATTTCGCCAACAGCCAGGGGATGCAGCCCGGCCCGCTGGTGGCCCCTACCGGTCATCCC





AGCCTGCGCCAGCTGCCGGTGGAACGGGTGGTGCCGGGGGATCTGGAGGATCTGCAAACCCTGCTGTGCGCGCAT





CCCGCCGACCTGCTGGTGGCGAACTCGCACGCCCGCGACCTGGCGGAGCAGTTTGCGCTGCCGCTGGTGCGCGCG





GGTTTTCCGCTCTTTGACAAGCTCGGCGAATTCCGCCGGGTGCGACAGGGGTATAGCGGGATGCGCGATACGCTG





TTTGAGCTGGCAAACCTGATACGCGAGCGTCACCACCACCTCGCCCACTACCGATCGCCGCTGCGCCAGAACCCC





GAATCGTCACTCTCCACAGGAGGCGCTTATGCCGCCGATTAACCGTCAGTTTGATATGGTCCACTCCGATGAGTG





GTCTATGAAGGTCGCCTTCGCCAGCTCCGACTATCGTCACGTCGATCAGCACTTCGGCGCTACCCCGCGGCTGGT





GGTGTACGGCGTCAAGGCGGATCGGGTCACTCTCATCCGGGTGGTTGATTTCTCGGTCGAGAACGGCCACCAGAC





GGAGAAGATCGCCAGGCGGATCCACGCCCTGGAGGATTGCGTCACGCTGTTCTGCGTGGCGATTGGCGACGCGGT





TTTTCGCCAGCTGTTGCAGGTGGGCGTGCGTGCCGAACGCGTTCCCGCCGACACCACCATCGTCGGCTTACTGCA





GGAGATTCAGCTCTACTGGTACGACAAAGGGCAGCGCAAAAATACGCGCCAGCGCGACCCGGAGCGCTTTACCCG





TCTGCTGCAGGAGCAGGAGTGGCATGGGGATCCGGACCCGCGCCGCTAGCCGTGTCGTTTCTGTGACAAAGCCCA





CAAAACATCGCGACACTGTAGGACGAACCTTGTCAGGACTAATACACAACCATTTGAAAAATATTAATTTTATTC





TCTGGTATCGCAATTGCTAGTTCGTTATCGCCACCGCGCTTCCGCGGTGAACCGCGCCCCGGCGTTTTCCGTCAA





CATCCCTGGAGCTGACAGCATGTGGAATTACTCCGAGAAAGTGAAAGACCATTTTTTTAACCCCCGCAATGCGCG





CGTGGTGGACAACGCCAACGCGGTAGGCGACGTCGGTTCGTTAAGCTGCGGCGACGCCCTGCGCCTGATGCTGCG





CGTCGACCCGCAAAGCGAAATCATTGAGGAGGCGGGCTTccagaccttcggctgCGGCAGCGCCATCGCCTCCTC





CTCCGCGCTGACGGAGCTGATTATCGGCCATACCCTCGCCGAAGCCGGGCAGATAACCAATCAGCAGATTGCCGA





TTATCTCGACGGACTGCCGCCGGAGAAAATGCACTGCTCGGTGATGGGCCAGGAGGCCCTGCGCGCGGCCATCGC





CAACTTTCGCGGCGAAAGCCTTGAAGAGGAGCACGACGAGGGCAAGCTGATCTGCAAATGCTTCGGCGTCGATGA





AGGGCATATTCGCCGCGCGGTACAGAACAACGGGCTGACCACCCTTGCCGAGGTGATCAACTACACCAAAGCGGG





CGGCGGCTGCACCTCTTGCCACGAAAAAATCGAGCTGGCCCTGGCGGAGATCCTCGCCCAGCAGCCGCAGACGAC





GCCAGCCGTGGCCAGCGGCAAAGATCCGCACTGGCAGAGCGTCGTCGATACCATCGCAGAACTGCGGCCGCATAT





TCAGGCCGACGGCGGCGATATGGCGCTACTCAGCGTCACCAACCACCAGGTGACCGTCAGCCTCTCCGGCAGCTG





TAGCGGCTGCATGATGACCGATATGACCCTGGCCTGGCTGCAGCAAAAACTGATGGAACGTACCGGCTGTTATAT





GGAAGTGGTGGCGGCCTGAGCCGCGGTTAACTGACCCAAGGGGGACAAGATGAAACAGGTTTATCTCGATAACAA





CGCCACCACCCGTCTGGACCCGATGGTCCTGGAAGCGATGATGCCCTTTTTGACCGATTTTTACGGCAACCCCTC





GTCGATACACGATTTTGGCATTCCGGCCCAGGCGGCTCTGGAACGCGCGCATCAGCAGGCTGCGGCGCTGCTGGG





CGCGGAGTATCCCAGCGAGATCATCTTTACCTCCTGCGCCACCGAAGCCACCGCCACCGCCATCGCCTCGGCGAT





CGCCCTGCTGCCTGAGCGTCGCGAAATCATCACCAGCGTGGTCGAACATCCGGCGACGCTGGCGGCCTGCGAGCA





CCTGGAGCGCCAGGGCTACCGGATTCATCGCATCGCGGTGGATAGCGAGGGGGCGCTGGACATGGCGCAGTTCCG





CGCGGCGCTCAGCCCGCGCGTCGCGTTGGTCAGCGTGATGTGGGCGAATAACGAAACCGGGGTGCTTTTCCCGAT





CGGCGAAATGGCGGAGCTGGCCCATGAACAAGGGGCGCTGTTTCACTGCGATGCGGTGCAGGTGGTCGGGAAAAT





ACCGATCGCCGTGGGCCAGACCCGCATCGATATGCTCTCCTGCTCGGCGCATAAGTTCCACGGGCCAAAAGGCGT





AGGCTGTCTTTATCTGCGGCGGGGAACGCGCTTTCGCCCGCTGCTGCGCGGCGGTCACCAGGAGTACGGTCGGCG





AGCCGGGACAGAAAATATCTGCGGAATCGTCGGCATGGGCGCGGCCTGCGAGCTGGCGAATATTCATCTGCCGGG





AATGACGCATATCGGCCAATTGCGCAACAGGCTGGAGCATCGCCTGCTGGCCAGCGTGCCGTCGGTCATGGTGAT





GGGCGGCGGCCAGCCGCGGGTGCCCGGCACGGTGAATCTGGCCTTTGAGTTTATTGAAGGTGAAGCCATTCTGCT





GCTGTTAAACCAGGCCGGGATCGCCGCCTCCAGCGGCAGCGCCTGCACCTCAGGCTCGCTGGAACCCTCCCACGT





GATGCGGGCGATGAATATCCCCTACACCGCCGCCCACGGCACCATCCGCTTTTCTCTCTCGCGCTACACCCGGGA





GAAAGAGATCGATTACGTCGTCGCCACGCTGCCGCCGATTATCGACCGGCTGCGCGCGCTGTCGCCCTACTGGCA





GAACGGCAAGCCGCGCCCGGCGGACGCCGTATTCACGCCGGTTTACGGCTAAGGCGGAGGTGGCTGATGGAACGC





GTGCTGATTAACGATACCACCCTGCGCGACGGCGAGCAGAGCCCCGGCGTCGCCTTTCGCACCAGCGAAAAGGTC





GCCATTGCCGAGGCGCTTTACGCCGCAGGAATAACGGCGATGGAGGTCGGCACCCCGGCGATGGGCGACGAGGAG





ATCGCGCGGATCCAGCTGGTGCGTCGCCAGCTGCCCGACGCGACCCTGATGACCTGGTGTCGGATGAACGCGCTG





GAGATCCGCCAGAGCGCCGATCTGGGCATCGACTGGGTGGATATCTCGATTCCGGCTTCGGATAAGCTGCGGCAG





TACAAACTGCGCGAGCCGCTGGCGGTGCTGCTGGAGCGGCTGGCGATGTTTATCCATCTTGCGCATACCCTCGGC





CTGAAGGTATGCATCGGCTGCGAGGACGCCTCGCGGGCCAGCGGCCAGACCCTGCGCGCTATCGCCGAGGTCGCG





CAGCAATGCGCCGCCGCCCGCCTGCGCTATGCCGATACGGTCGGCCTGCTCGACCCTTTTACCACCGCGGCGCAA





ATCTCGGCCCTGCGCGACGTCTGGTCCGGCGAAATCGAAATGCATGCCCATAACGATCTGGGTATGGCGACCGCC





AATACGCTGGCGGCGGTAAGCGCCGGGGCCACCAGCGTGAATACGACGGTCCTCGGTCTCGGCGAGCGGGCGGGC





AACGCGGCGCTGGAAACCGTCGCGCTGGGCCTTGAACGCTGCCTGGGCGTGGAGACCGGCGTGCATTTTTCGGCG





CTGCCCGCGCTCTGTCAGAGGGTCGCGGAAGCCGCGCAGCGCGCCATCGACCCGCAGCAGCCGCTGGTCGGCGAG





CTGGTGTTTACCCATGAGTCAGGTGTCCACGTGGCGGCGCTGCTGCGCGACAGCGAGAGCTACCAGTCCATCGCC





CCTTCCCTGATGGGCCGCAGCTACCGGCTGGTGCTGGGCAAACACTCCGGGCGTCAGGCGGTCAACGGCGTTTTT





GACCAGATGGGCTATCACCTCAACGCCGCGCAGATTAACCAGCTGCTGCCCGCCATCCGCCGCTTCGCCGAGAAC





TGGAAGCGCAGCCCGAAAGATTACGAGCTGGTGGCTATCTACGACGAGCTGTGCGGTGAATCCGCTCTGCGGGCG





AGGGGGTAATGATGGAGTGGTTTTATCAAATTCCCGGCGTGGACGAACTTCGCTCCGCCGAATCTTTTTTTCAGT





TTTTCGCCGTCCCCTATCAGCCCGAGCTGCTTGGCCGCTGCAGCCTGCCGGTGCTGGCAACGTTTCATCGCAAAC





TCCGCGCGGAGGTGCCGCTGCAAAACCGGCTCGAGGATAACGACCGCGCGCCCTGGCTGCTGGCGCGAAGACTGC





TCGCGGAGAGCTATCAGCAACAGTTTCAGGAGAGCGGAACATGAGACCGAAATTCACCTTTAGCGAAGAGGTCCG





CGTCGTACGCGCGATTCGTAACGACGGCACCGTGGCGGGCTTCGCGCCCGGCGCGCTGCTGGTCAGGCGCGGCAG





CACCGGCTTTGTGCGCGACTGGGGCGTTTTTTTGCAAGATCAGATTATCTACCAGATCCACTTTCCGGAAACCGA





TCGGATCATCGGCTGCCGCGAGCAGGAGCTGATCCCCATCACCCAGCCGTGGCTGGCCGGAAATTTGCAATACAG





GGATAGCGTGACCTGCCAGATGGCGCTCGCGGTCAACGGCGATGTGGTCGTGAGCGCCGGCCAGCGGGGACGCGT





TGAGGCTACCGATCGGGGANAGCTCGGCGACAGCTACACCGTCGACTTTAGCGGCCGCTGGTTCAGGGTCCCGGT





GCAGGCCATCGCCCTTATAGAGGAAAGAGAAGAATGAACCCATGGCAACGTTTTGCCCGGCAGCGGCTGGCGCGC





AGCCGCTGGAATCGCGATCCGGCGGCCCTGGATCCGGCCGATACGCCGGCTTTTGAACAGGCCTGGCAACGCCAG





TGCCATATGGAGCAGACGATCGTCGCGCGGGTCCCTGAAGGCGATATTCCGGCGGCGTTGCTGGAGAATATCGCT





GCCTCCCTTGCCATCTGGCTCGACGAGGGGGATTTTGCGCCGCCCGAGCGCGCTGCCATCGTGCGCCATCACGCC





CGGCTGGAACTCGCCTTCGCCGATATCGCCCGCCAGGCGCCGCAGCCGGATCTCTCCACGGTACAGGCATGGTAT





CTGCGCCACCAGACGCAGTTTATGCGCCCGGAACAGCGTCTGACCCGCCATTTACTGCTGACGGTCGATAACGAC





CGCGAAGCCGTGCACCAGCGGATCCTCGGCCTGTATCGGCAAATCAACGCCTCGCGGGACGCTTTCGCGCCGCTG





GCCCAGCGCCATTCCCACTGCCCGAGCGCGCTGGAAGAGGGTCGTTTAGGCTGGATTAGCCGTGGCCTGCTCTAT





CCGCAGCTCGAGACCGCGCTGTTTTCACTGGCGGAAAACGCGCTAAGCCTTCCCATCGCCAGCGAACTGGGCTGG





CATCTTTTATGGTGCGAAGCGATTCGCCCCGCCGCGCCCATGGAGCCGCAGCAGGCGCTGGAGAGCGCGCGCGAT





TATCTTTGGCAGCAGAGCCAGCAGCGCCATCAGCGCCAGTGGCTGGAACAGATGATTTCCCGTCAGCCGGGACTG





TGCGGGTAGCCTCGGCGGCTACCCGTTAACGCCTACAGCACGGTGCGTTTAATCTCCTCAAGCCAGCTCGCCAGA





CGCGCTTCGGTCTGGTCGAACTGGTTATCCTGATCCAGCACCAGCCCAACAAAGCGGTCGCCTTCCAGCGCCGAG





GACGCGCTGAATTCATAACCCTCATTTGGCCAGCTGCCAATCATCTGCGCGCCGCGCGCGCTCAGGGCGTCGAAC





AGCGGGCGCATCCCGCTGACGAAGTTGTCCGGATAGCCTCTCTGATCGCCGAGGCCGAACAGCGCCACGGTTTTC





CCTTTCAGGCTGGCGTCGTCGAGGCCGCTGATAAATTCGCTCCATGACTCGCTTTCGCATCCGGCCTCCAGCCCC





GGCAGCTGGCCGTCGCCGAGCGTCGGCGTGCCCAGCAGCAGCACCGGATAGGCCATAAAGTCGTCCAGCGTCGTG





CGGTTAATGTTGACCGGGGCATCCGCCAGCTCGCCCAGTTGCTTATGGATCATTTTCGCGATTTTGCGGGTTTTA





CCGGTATCGGTGCCAAAGAAAATACCAATGTTCGCCATGTTGCGCTCCTGTCGGAAAAGGGGGTTGAAAATACGC





GTTCTCGCAGGGGTATTGCGAAGGCTGTGCCAGGTTGCTTTGCACTACCGCGGCCCATCCCTGCCCCAAAACGAT





CGCTTCAGCCCTCTCCCGCCGCGCGCGGCGGGGCTGGCGGGGCGCTTAAAATGCAAAAAGCGCCTGCTTTTCCCC





TACCGGATCAATGTTTCTGCACATCACGCCGATAAGGGCGCACGGTTTGCATGGTTATCACCGTTCGGAAAACAC





CGCGGCGTCCCTGTCACGGTGTCGGACAAATTGTCATAACTGCGACACAGGAGTTTGCGATGACCCTGAATATGA





TGCTCGATAACGCCGTACCCGAGGCGATTGCCGGTGCGCTGACTCAACAACATCCGGGGCTGTTTTTTACAATGG





TCGAACAGGCATCGGTAGCGATTTCCCTCACCGATGCCCGGGCGAATATTATCTACGCCAACCCGGCGTTTTGCC





GCCAGACTGGATACTCGCTGGCGCAATTGCTCAATCAAAACCCGCGCCTGCTGGCCAGCAGCCAGACGCCGCGCG





AGATCTACCAGGAGATGTGGCAAACCCTGCTCCAGCGCCAGCCGTGGCGCGGTCAGCTAATTAATCAGCGCCGCG





ACGGCGGCCTGTATCTGGTAGATATCGATATCACGCCGGTGCTGAATCCGCAGGGCGAGCTGGAGCATTATCTGG





CGATGCAGCGGGATATCAGCGTCAGCTATACCCTGGAACAGCGGCTGCGCAATCATATGACGCTAATGGAAGCGG





TGCTCAATAACATCCCCGCCGCCGTGGTCGTGGTCGATGAGCAGGATCGGGTGGTGATGGATAATCTCGCCTACA





AAACGTTCTGCGCGGACTGCGGCGGGAAAGAGCTGCTGGTCGAGCTCCAGGTTTCCCCGCGCAAAATGGGGCCCG





GCGCGGAGCAAATCCTGCCGGTGGTGGTTCGCGGCGCGGTCCGCTGGCTGTCGGTAACCTGCTGGGCGCTGCCCG





GCGTGAGTGAAGAAGCCAGCCGCTACTTCGTCGACAGCGCCCCGGCGCGCACGCTGATGGTGATCGCCGACTGTA





CCCAGCAGCGCCAGCAGCAGGAGCAGGGCCGGCTCGACCGTCTGAAACAGCAAATGACCGCCGGTAAGCTGCTGG





CCGCGATTCGCGAGTCGCTGGACGCGGCGCTGATTCAGCTTAATTGCCCAATCAATATGCTGGCGGCGGCCCGCC





GGCTGAACGGCGAAGGCAGCGGCAACGTGGCGCTGGACGCGGCGTGGCGCGAAGGTGAAGAGGCCATGGCGCGCC





TGCAGCGCTGCCGCCCTTCTCTTGAGCTGGAAAGCAATGCCGTCTGGCCGCTTCAGCCCTTTTTTGACGACCTGT





ACGCCCTCTACCGCACCCGCTTTGACGATCGCGCGCGGCTGCAGGTGGACATGGCATCGCCGCATCTGGTCGGCT





TCGGCCAGCGTACCCAGCTGCTGGCCTGCTTGAGTTTATGGCTCGACCGGACGCTGGCCCTCGCCGCCGAGCTGC





CCTCCGTACCGCTGGAGATCGAGCTTTACGCCGAAGAGGACGAGGGCTGGCTCTCTTTGTATCTCAACGACAATG





TCCCGCTGCTGCAGGTGCGCTACGCCCACTCCCCCGATGCCCTAAACTCTCCCGGCAAAGGGATGGAGCTGCGGC





TGATCCAAACGCTGGTCGCCTACCACCGCGGCGCGATTGAACTGGCTTCGCGACCGCAGGGAGGCACCAGCCTGG





TTCTGCGTTTCCCGCTCTTTAATACCCTGACCGGAGGTGAGCAATGATCCATAAATCCGATTCGGACACCACCGT





CAGACGTTTCGATCTCTCCCAGCAGTTTACCGCCATGCAGCGGATAAGCGTGGTCCTGAGTCGCGCCACCGAAGC





GAGCAAAACCCTGCAGGAGGTTCTGAGCGTGCTACATAACGATGCCTTTATGCAGCACGGGATGATTTGCCTGTA





CGACAGCCAGCAGGAGATCCTGAGCATCGAAGCGCTGCAGCAAACGGAAGATCAGACGCTGCCCGGCAGTACGCA





AATTCGCTACCGGCCGGGGGAAGGATTAGTCGGTACCGTGCTGGCGCAGGGCCAGTCGCTGGTGCTGCCGCGCGT





CGCCGACGACCAGCGTTTTCTCGATCGTCTGAGCCTGTACGACTATGACCTGCCGTTTATCGCCGTTCCGCTGAT





GGGCCCCCACTCCCGGCCCATCGGCGTACTGGCGGCGCAGCCGATGGCGCGTCAGGAAGAGCGGCTGCCCGCCTG





CACGCGCTTTCTCGAAACCGTCGCCAATCTGATCGCCCAGACGATTCGCCTGATGATCCTGCCAACCTCCGCCGC





GCAGGCGCCGCAGCAGAGCCCCAGAATAGAGCGCCCGCGCGCCTGTACCCCTTCGCGCGGTTTCGGCCTGGAAAA





TATGGTCGGTAAAAGCCCGGCGATGCGGCAGATTATGGATATTATTCGTCAGGTTTCCCGCTGGGATACCACGGT





GCTGGTACGCGGCGAGAGCGGCACCGGGAAAGAGCTCATCGCCAACGCCATCCACCATAATTCTCCGCGCGCCGC





CGCGGCGTTCGTCAAATTTAACTGCGCGGCGCTGCCGGACAACCTGCTGGAGAGCGAGCTGTTTGGTCATGAGAA





AGGCGCGTTTACCGGCGCGGTGCGCCAGCGGAAAGGCCGCTTTGAGCTGGCGGACGGCGGCACCTTATTCCTCGA





TGAGATCGGCGAAAGCAGCGCCTCGTTTCAGGCTAAGCTACTGCGTATTCTGCAAGAGGGGGAGATGGAGCGCGT





CGGCGGCGACGAAACCCTGCGGGTCAACGTGCGCATTATCGCGGCGACCAACCGCCATCTGGAAGAGGAGGTGCG





GCTGGGTCATTTCCGCGAGGATCTATACTACCGCCTGAACGTAATGCCTATCGCGCTGCCGCCGCTGCGCGAGCG





CCAGGAGGATATCGCCGAGCTGGCGCACTTTCTGGTGCGAAAAATCGCCCACAGCCAGGGGCGAACGCTGCGCAT





CAGCGATGGGGCGATTCGCCTGCTGATGGAGTACAGCTGGCCGGGAAACGTGCGCGAACTGGAAAACTGTCTCGA





ACGTTCGGCGGTGCTGTCGGAAAGCGGCCTGATAGACCGGGACGTGATTCTGTTCAACCATCGCGATAACCCGCC





GAAAGCGCTCGCCAGCAGCGGCCCGGCGGAGGACGGCTGGCTCGATAACAGCCTCGACGAGCGCCAGCGGCTGAT





CGCCGCCCTGGAAAAAGCGGGCTGGGTGCAGGCCAAAGCGGCGCGGCTGCTCGGCATGACCCCGCGCCAGGTGGC





GTATCGCATTCAGATTATGGATATCACCATGCCGCGACTGTGAAGCCTTATGTGAGATTCAGGACATTGTCGCCA





GCGCGGCGGAATTGCGACAATTCAGGGACGCGGGTTGCCGGTTAAAAAGTCTACTTTTCATGCGGTTGCGAAATT





AACCTCTGGTACAGCATTTGCAGCAGGAAGGTATCGCCCAACCACGAAGGTACGACCATGACTTCCTGCTCCTCT





TTTTCTGGCGGCAAAGCCTGCCGCCCGGCGGATGACAGCGCATTGACGCCGCTTGTGGCCGATAAAGCTGCCGCG





CACCCCTGCTACTCTCGCCATGGGCATCACCGTTTCGCGCGGATGCATCTGCCCGTCGCGCCCGCCTGCAATTTG





CAGTGCAACTACTGTAATCGCAAATTCGATTGCAGCAACGAGTCCCGCCCCGGGGTATCGTCAACGCTGCTGACG





CCTGAACAGGCGGTCGTGAAAGTGCGTCAGGTCGCGCAGGCGATCCCGCAGCTTTCGGTGGTGGGCATCGCCGGG





CCCGGCGATCCGCTCGCCAATATCGCCCGCACCTTTCGCACCCTGGAGCTGATCCGCGAACAGCTGCCGGACCTG





AAATTATGCCTGTCGACCAACGGACTGATGCTGCCTGACGCGGTGGACCGCCTGCTGGATGTCGGCGTTGACCAC





GTCACGGTCACCATTAACACCCTCGACGCGGAGATTGCCGCGCAAATCTACGCCTGGCTATGGCTGGACGGCGAA





CGCTACAGCGGGCGCGAAGCGGGAGAGATCCTGATTGCCCGTCAGCTTGAGGGCGTACGCAGGCTGACCGCCAAA





GGCGTGCTGGTGAAAATAAATTCGGTGCTGATCCCCGGTATCAACGATAGCGGCATGGCCGACGTGAGCCGCGCG





CTGCGGGCCAGCGGCGCGTTTATCCATAATATTATGCCGCTGATCGCCAGGCCGGAGCACGGCACGGTGTTTGGC





CTCAACGGCCAGCCGGAGCCGGACGCCGAGACGCTCGCCGCCACCCGCAGCCGGTGCGGCGAAGTGATGCCGCAG





ATGACCCACTGCCACCAGTGTCGCGCCGACGCCATTGGGATGCTCGGCGAAGACCGCAGCCAGCAGTTTACCCAG





CTTCCGGCGCCAGAGAGTCTCCCGGCCTGGCTGCCGATCCTCCACCAGCGCGCGCAGCTGCACGCCAGCATTGCG





ACCCGCGGCGAATCTGAAGCCGATGACGCCTGCCTGGTCGCCGTGGCGTCAAGCCGCGGGGACGTCATTGATTGT





CACTTTGGTCACGCCGACCGGTTCTACATTTACAGCCTCTCGGCCGCCGGTATGGTGCTGGTCAACGAGCGCTTT





ACGCCCAAATATTGTCAGGGGCGCGATGACTGCGAGCCGCAGGATAACGCAGCCCGGTTTGCGGCGATCCTCGAA





CTGCTGGCGGACGTTAAAGCCGTATTCTGCGTGCGTATCGGCCATACGCCGTGGCAACAGCTGGAACAGGAAGGC





ATTGAACCCTGCGTTGACGGCGCGTGGCGGCCGGTCTCCGAAGTGCTGCCCGCGTGGTGGCAACAGCGTCGGGGG





AGCTGGCCTGCCGCGTTGCCGCATAAGGGGGTCGCCTGATGCCGCCGCTCGACTGGTTGCGGCGCTTATGGCTGC





TGTACCACGCGGGGAAAGGCAGCTTTCCGCTGCGCATGGGGCTTAGCCCGCGCGATTGGCAGGCGCTGCGGCGGC





GCCTGGGCGAGGTGGAAACGCCGCTCGACGGCGAGACGCTCACCCGTCGCCGCCTGATGGCGGAGCTCAACGCCA





CCCGCGAAGAGGAGCGCCAGCAGCTGGGCGCCTGGCTGGCGGGCTGGATGCAGCAGGATGCCGGGCCGATGGCGC





AGATTATCGCCGAGGTTTCGCTGGCGTTTAACCATCTCTGGCAGGATCTTGGTCTGGCATCGCGCGCCGAATTGC





GCCTGCTGATGAGCGACTGCTTTCCACAGCTGGTGGTGATGAACGAACACAATATGCGCTGGAAAAAGTTCTTTT





ATCGTCAGCGCTGTTTGCTGCAACAGGGGGAAGTTATCTGCCGTTCGCCAAGCTGCGACGAGTGCTGGGAACGCA





GCGCCTGTTTTGAGTAGCCGTTTCCCGAAGGGGGCGCTGCAAACAAAAAAGCCGGAGGTTTCCCTCCGGCTTTTC





ACATCATCAAATGTGATTATGCGACGTCTTCGTACTGCGGCACCGGGTTGCGGAAGCTTTTGGTCAC






Pseudomonas stutzeri A1501 nitcluster 



(SEQ ID NO. 5)



gttaggttggcctgaattcggtgtgtatcccccggagatcagcttcgcctcggcacgctcagcctgcactcgccc 






cagcctagctttccgccgcaagtgcggcatcgagtcgcgccaccaggctgccgtcggcttccaggccgaggatga 





tgtcgcaaccgccgaccagctcgccacgcaggaacagctgcgggtaggtcggccactgcgagatcttcggcagct 





tctcgcggatatgcggtgccagtagcacgttgaccgtggcgaacggccggccgctgttcttcaatgcctccaccg 





cggcgcgggagaaaccgcactccggcacgcccggcgtgcccttcatgtacagcagcaccggatgctcggcgagtt 





gctggcgtatgcgtgcttcggtatcgagaacttgcatgcgttcactccattgccagggtgcagggggagttgtag 





gcgcaggggctggcatgggcccgctgtgggcgatccttccaggcctcgtagccgccgtccaggctgtagcaattg 





atgaagccgaaatcgctgaacagctgtgccatgtcacggctggcatgaccgtgctcgcaacagatgatcagatgg 





acgtgatttggcgtgctcttgagcagcgtgcgcaagttcagctcgctgaggcgggtggcgcgcgggtcatggccc 





tggcagtaggcgcgggcatcgcgcatgtccagcagcatggtgttttcggtcgccaacagccgctgggcctgctcg 





acgctgatgcgttggtagtcgctcattgctcttctccaaaacaatcgtgataggtcgggcaggcttcacaagagg 





gggagcggcatacgtagccgccgtcctgttcgcatagttgtttgtagaggaacttcttccagcgcatgtcctggg 





tattgcgccgggccagctgcgggaagttgtgcatcagcagcgcgtacagctgcgcccgcgaggccaggccgaggt 





cgcgccacaggtgttcgccaccgaggcaggcggcagcgacgatggctgccatcgccggttcgccgtggtcgtcct 





ggccgcccagcagcaggtcgtgcagcgcctgccattcttcccggcgcagcgccagcaactcttcgcgcaaggcgt 





cgcgctcgccgagcaaggcctcgtcggcgctacgggatggtggtcgcagcccatggcgcgtcaggagctcggcgt 





actgcgccgcgtcgagcccgaggtgctgcggcaggcaactacggccttcgcgctgggcgcggatgatctgcgcta 





gccaggccgggttgtcgttgactgcgacctccaggcacaacgcggcccggctcattgcggcgtgctcgcggcggg 





gctgcaaccgagcaccgagctggtgcccagcgaacagccgctgatgctcggtgccagcgctggcagccggccttt 





gcacggcagccaggcattgatgccgaccaggtcgaagtcgaccatgtagtgcatgccgcagcggatcagcgggcg 





gcggatcaacagcggctgggccaccatcagctccagtgcctgttcggcgctcagttcgctgacatccagctcgcc 





gtacttgatcgccggggccgacgggttgaaccactcggccaccggcagccggccgaagaacggccgcaggcgttc 





cggcgtccaggcctcgcgcagcaggtcgcgcacttccagctcgatgcccgctgaacgcagcagctccttctgcag 





gcggttggtggcgcaaccgggcttctcgtagaagatgatgcaggacatggcaacctcctcaacgggcctggatgg 





cgcgcatggcctcggccaaccgctccggcggaatgcccgtcagcgagccggctgggttggccgggctgccgtcgg 





cgagcaggatcgcgccttcgatggggcagatgctcgcgcactgttgctcggcgtagtcgccatcgcattcggtgc 





acttgtgcgcgctgatgcggaagtacgcagtgccggggctgatcgcctcgctcgggcagacgtccacgcaggccc 





agcagttgacgcaggattcgacgatttgcagtgccatactccacctcctcatgccatcaggcattgctccgctgc 





gcccaccgacgcatcgagacggccattggcgatcatttcctggtacacctccagcacggcttcctcgatgggctc 





catggcgtgctcgccattgggctggatgccggcggcttccagctcgccccagggttcgaagccgatcttcgagca 





gagcaccgcctcgcagcccttgagcgcgcggatgctgcccgacagcgcactgtccttgtcgccgcagctgtcgtt 





gccgacgcagtactgctcgaccttgcggtggccgatgaagcgcaccccggccggcgaggcctcgtagacgaggaa 





ttcgcgggcatggccgaagtgctggttgaccaggccgccgccgctggtggccacggccatcagtaccgggcgatg 





gcccttgtccactgtgccggtgagctgcgcagcgctgggggtggccaggcgcgccttcttcgccgcgcgttcgtc 





cagctcctccttgatcgccgcgtggatggcggcgcgcttgaccatcgccgcctcgtagtcgacgtccatgctctc 





gatcttgtcgagggtgaactcgtcgccgcggtcctcgccgagcaggcccaccgcgtcggcgcggcactggcggca 





gtggcgcatcatgttcatgtcgccggcacaggcgtcctgcaggtcctgcagttcctccggctccgggctgcgctg 





gcccatcacgccatagaaggtgccgtgctcggcctcggcgatcagcggcatgacgttgtgcaggaaggcgccctt 





ggccttgacgatgcggctgacctctttcaggtgctcatcgttgacgccggggatcagcaccgagttgaccttcac 





caggatgccacgctcgaccagcatctccaggcccttctgctgccgttcgatgaggatcttggccgccttgcgccc 





acggatgcgcttgttgttccagtagatccaggggtagatctcggcgccgatgtccgggtccacgcagttgatggt 





gatggtcacgtggtcgatgttgtgcttggccagctcgtcgacgcagtcgggcagggccaggccgttggtggagac 





gcacagcttgatgtccggcgcctgctcggacagcatgcgaaaggtctcgaaggtgcgctgcgggttggccagcgg 





gtcgcccgggccggcgatgccgagcacggtcatctgcgggatggtcgccgccaccgccttgaccttcttcaccgc 





ttgcaccggctccagcagctcggacaccacgcccgggcgcgattcgttggcgcagtcgtacttgcggttgcagta 





gtggcactggatgttgcaggccggcgccaccgccacatgcatgcgcgcgaagtagtggtgcgcctcctcggagta 





gcaggggtggttgtgcactttctcgcggatgtgctcgggcaggtgcgcgagctggtcatccgagctgccacagga 





acccgctgaacaaccgcccccggcggttggcccggcctcgctctggcccagtacgttcagttccatgttcggtct 





ccgaatagaggtctgtccccggtacctgcagcaaggcttgtgcctgttttcaaatcattgtttcagaacgaattt 





ttcagaaagcgggcggaattcgttgtttcgcaacgaacaaagtggcggggccgggcggggcggctgtcgcaaagg 





cgacaagctgcgcacgcccggttcccgggctgtcgcgacccggtgctccagacgattgcgcatggcgggccgcga 





tccgcaccagcgccccggcccgctggtgccgggctactcctcgaggcgcccgctggcgtcgcgatcgcgcacgta 





atggtgggtgagcggaaacgccggcagccaggactcgcggcggacggtccagagctcgtaggtgggcatcagctg 





gtcgggggcatccagggctcccaggctcacttcgatttcgtccgcggtgcgtgccctgctagtgatgcgtagtgg 





gcacaaggcttcgcgggaagcgccatgcatggggaacgctgcgccgcccgacccgggagtcgggcgggtcgttca 





gatcttgcgcatatgaatgttcagcgtctgcactcggtaggcgatctgccggggcgtcatgccgagcaggcgggc 





ggccttggcctggacccagccggcctgttccagcgcggcgatgacgcgctcgcggtcgtcgaggctgtcgtcggc 





gaggtcgacttcggggaccggcgccagcggcgtggcgtcgtggtcgaggccggtgagggagaccacgtcgcggct 





gatggtgccatcctcgctcatgatggccgagcgttccaggcagttttccagttcgcgcacgttgcccggccagcg 





gtggctcatcagcagacgcagggcgctgtcggtcagcttgagtttgcgaccctgctggcgggcgatcttgtcgag 





gaggaattcggccagttccgggatgtcggcgctgcgctcgcgcagcggcgggacgcggatggccatgacgttgag 





gcggtagtagaggtcttcgcggaacttgccttgctccacctcgtgctccaggtcgcggttggtggcggcgacgat 





gcgcacgttgaccttcaccgtctggctgccgccgacgcgctccagctcgccttcctgcagcacgcgcagcagctt 





ggcctggaacatcggcgagatctcgccgatctcgtcgaggaacagggtgccgccgtcggcctgttcgaaacgtcc 





cttgcgctgcttcacggcgccggtgaaggcgcctttctcgtgaccgaacagttccgattcgagcagggtttccgg 





tagcgcggcgcagttcaggcgtaccagcggctggtgagcgcgcggtgagttgtagtggatggcgctggcgatcag 





ctccttgccggtgccggattcgccgaggatcagcacggtgctgttccacttggcgacccgtcgaacctggtcgaa 





aacccggcgcatggaggcggtgtggcccaccaccatgttctcgaagccgtacttggcgcggacttcgcggcgtag 





ctcgtcgcgctcgtcgaccacttcctggccgtcctcgaggttcaccaccaggcgcacggtctgcgccagtaggcg 





ggcgacgatttccatcaaacgggtgcgttcgggcatcagctcgtcggcgcggcggtcgggctgggcagccagcac 





gccgatggtggtgccgtcgacggccttgatcggcacggcgatgaagggcaggtccatgtcgtacagcgccagtcg 





gtcgagaaagcgcggttcggcgtcgatacgcccgagcaccacgctgttgccatgcttgaggatgttgccgaacac 





gccttcgccgatgcggtagcgggtgctttcgcaggcccgtaccacggtttcggagtcgctgtgcacggcgcccac 





ctgcaggctgccgtccttcgggttgcagatggagaccagcccgtgcagcaggccgaggtcttcgtgcagcacggc 





gaggatctcggccagcagttcctcgatgggccggccgcggttaaggatgcgggcgatctgcgccagcgcctgcag 





ttgggcatccagcagttcgttgcgggttggcgcgctggggcgttcggcgaatgtggcgttcatgcgagcttcccc 





tgtcagctggccgagaagggcagttcgacgacgatcctgcagccctggtcgtagccgctatcgatatgcaccgtg 





ccggcatgctcggtgacggtttcctgcaccatggccaggcccatgccgcgaccggtcttgtgcggcggcttggtg 





ctgaagaagggttcgaataccttgagcgccagctccggcgcgatgcccgggccgctgtcggcgatctccaggcgc 





accacccgctggccctggacacgggtgacgatcgacagcgtgcgcgggttgtcctggttctggctcatggcctcg 





atggcgttttccagcagctgcttgatcatgctgcgcagccggccttcggcgcccatcacccagggtaggcgcagc 





gccggctgccagtcgacgacgatgccctgggcgagcaactggtcggtcatcaggctgaccacttcgcggatcagc 





tggttgatgttgaccggcacgcagccgccggcccggcgctgcggaatcgagccgctgaggctttccagcgcatcc 





atgccagcctggctggcttcgcgcatggcgctgagcaccggatcgccctcggcgctgtcgcccaggcgtcgttcg 





agcatgcgcagcgccgcactgatcaggttgaccgggccctgcaggcggtggatggcgccgttgaaggtttcgcgc 





atgccgtcgagcagctcttcctcggccatcagcaccttcagggcgttgagctgggaggcctgctgttgctggcgc 





agcccggtgatgtcgttgaccgtcagcagcaggtagttttcctcgcccgggtcgaagaagtcgtcggcgcgttcg 





ccttcgatgaggatggcgcggccatggcaggacagccagcgcggtgtgtggccgccgaggtcgaaggtgacttcc 





ttgccggtgaaggcctggccatgcgccttcagcgcctcgatggcgccgccgaggttgtcctgcagcaggctcacc 





agttgcgcgggcgtggcctggtcgccgagttccgcggccaggcggttgaagctggggttggacaggcggatgcgc 





agggcgtgatcgagcaccacgatggccgccggtgcgctgtcgaccaccgcctcgatgatcagccgctggttgctg 





acgcgctgttcgagcttgtgctggtcgctgctgtcgcggtgcatgcccaggtaatggatggtccgctcgtgctcg 





tcgagcaccggcgccacggtcagctcggcgaggtagcagctgtcgtccttgcgccggttgaccagcatgccggac 





caggcctttttctgcgccaggcggctccagagcgcctggtagaccagccgcggggtggtgccgttggacagcacc 





gattcgttcttgccgatcacctcgctgctgtcgtagccggtgatggcgctgaaggcgcggttggcatagaggatg 





ttggccttcagatcggtgatggaaatggcgatcggcgcgtgctccacggcttgctggaacacttcgggcgccaat 





ccatcggacgcagcgggttgccccgcgtcgcgctcgggggtggcctgggtcatgtgcatgtcctcatcgatgcgg 





cgaagccgacgtctgtgcgccggtatccgttgcaaagccatacggttagggggctgttgccgttcgcgagctgcg 





aatgaaacggcaacagaccccttagggttttgcaaaccgcgtgccgtcggtcacattccttgccgacagccctgc 





ggagccgtaaatacgctgtgcagatggatttctgccccgacaggtgccgctgggctgttgcaaaacccacaggga 





ggcgcgcgcacttctcccggcctgtcgcaaaccccacaaagtccgtcgcgccagcgtcgccaggggttgcgctat 





cacgggattcgttgatctgcatcaacgaatcccgggctctcggggcgctccgggacgcccggcggggcgtggcat 





gcttgatgcaaaacccctcacaacaaggcctttgcccgacaacggtgcaagcgctgccaataggctgggaggggt 





tatggaatatgcgctgtttctgatcggcaccgtgctggtcaacaacgtggtgctggtctacttcctcggcctgtg 





tccgttcatgggggtctccggcaagctcgacccctcgctgggcatgggcttggcgacgaccctggtgatgaccct 





gggcggcgtcagcagctggctgctagaacgctacgtgctgcagccgctgggcatcggctttttgcgcatcctctc 





ctacatcctggtgatcgccggcctggtgcagctgatcgagatgatcatccgcagggttagcccgccgctgtatcg 





ctcgctgggcatctacctgccgctgatcaccaccaactgcgccgtgctgggcgtgccgctgatcagcgtgcgcga 





aggccacaggctggccgaggcggggctgttcggcctgggctcggcgctgggcttcaccctggtcatggtgatctt 





cgccggcttgcgcgagcgcctggcgctggccagcgtgccggcggccttcgccggcgcaccgatcgctttcgtcac 





cgccgggttgctggcgatggctttcatgggcttcgccggcctgatctgaaacgcacgccgccggcgaggctggcg 





aaggaggagcaatgctggacgcaattctggttcttgcactgatgggcctgctgctcggcggcggcctcggtctgg 





cggcgcgctatctggcggtttcgcaggagaacccgctgatcaaggaaatcgaggcgctgctgcccggcagccagt 





gcgggcaatgcggctatccgggttgcagtgcggcggccgacgccttggtcgagggcagcgccgcggtcacctgct 





gcccgcccggcggggccgcgctggccgagcgcctggccgaactgctcggcgtgccgctggacgccagtgcgctcg 





ccgcgcccatgctggcgcgcatcgacgccgccgagtgcaccggctgcacgcgttgcttccgcgcctgcccgaccg 





acgccatcgtcggcgccaacgggcagatccattgcgtgttgagcaatgcctgcattggctgcagcaaatgcctgg 





aggcctgcccggaggactgcatcgccctcgcgccccagacactgacgctggaccactggcgctgggccaaaccca 





gggccgcctgatttcgcctgatgaacaggggcgtcagaccccgggagtcgacaatgttcaacctcgcgcattttc 





gcggcggcatccatcccgccgcccacaaggaccgctcggccgccctcggcatcgccgtgcagccgctgccgccgc 





gcctgtacctgccgtttcgccagcatgccggggccgaggccttgccgctggtgaaggcgggcgagcgggtgctca 





agggccagctgctggccggctcgcccactgagctctcggcgccgatccatgcgccgagttccgggcgcatcctct 





cgatcgggccgatcgacgcgccgcatccgtcggggctgcaggtcaacggtgtggtcctcgaatgcgatggcgagg 





agcgctggatcgagctagacgtaccggccgaccccttcgccgaggacccgcagcggctcgcccagcgcgtcgccg 





atgccggcgtggtcgggctcggcggggcgatcttcccggccgcggtgaagctcaagcagggcgcccggcacgaga 





tcaagaccgtgctggtcaacggcagcgagtgcgagccgtacctgagctgcgacgaccggctgatgcgcgagcgcg 





ccgaggcggtggtcgatggcgcgcggctgatccagcacatcctgcgtgcctacagcatcgtcatcgccatcgagg 





acaacaagccggcggcgctggcggccatgcgtgctgcgagcgagccctacggcgccatcgaggtggtggcggtgc 





cggcgctctacccgatgggctcggccaagcagctgatccgccaggtcaccggccgcgaggtgccggccggcgggc 





gcagtaccgacgtcggcgtgctggtacacaacgccggcacggtgtatgcgatccagcaggcgctgcgccacggcc 





gcccgttgatctcgcgggtggtgacggtggctggtggttgcgtgagcaacccgcgcaacatcgagactctgatcg 





gcaccccggtgcaggcgctgttcgaaagctgcggcggactgctgcgcgagccgcagcaactgctgctcggcgggc 





cgatgatgggcatgctgctgccatccacggcggtgccggtgatcaagggcgccaccgggctgctggcgctcgacc 





acggcgaagtgccgcgcagcgacagcgcgccgtgcatccgctgcgcgcgctgcgtcgacgcctgtccgatgggcc 





tggctccgctggagatggccgcgcgcacccgcgtcgacgatttcgacggcgccagcgaatacggcctgcgcgact 





gcatcctctgtggctgctgcgcctatgtctgcccctcgcacattcccttggtgcagtacttccagtacgccgtcg 





gccagcaggacgagcgccgcagcgccgcgcgcaagaacgattacgtcaagcagcttgccgaggcacgggcggcgc 





gcttggccgaggaggaagcggccaaggcggcggccaaggcggcgaagaaacgcaaggcggcggcgccggccgcca 





gcgaggtatcgccatgagcgcgcagggtatcgcggcggggccgttcgcccatgatcgctcctcggtcgaccgcat 





catgctgcacgtctgcctggcgttgctgccgacgacggcctggggcctgtatctgttcggctggccggcgatcta 





cctgtggctgctgacctgcgccagcgcggtggcctgcgaggccgcctgcctgtacctgctcggccggccgctgcg 





ccgcctgctggacggcagcgcactgctcagcggctggctgttggcactgacgctgccgccctgggcgccctggtg 





gatcgccgtcggtggcagcatgttcgccatcggcattggcaagcagctgtacggcggcgtcgggcagaacgtgtt 





caacccggcgatgctggcgcgggtggcgctgctgatcgccttcccgctgcagatgaccacctgggccctgccttt 





gccgctgggtacggagggcgcgcccggctggctcgaaggcctgcgcatcaccttcgccggtggggcgctggccga 





tggcctgagcggcgccaccgcgctgggccacctgcagaccgagctgaccctggggcacagtgccgcgcagatcct 





cgacgggcatttcgcgttgctgccggcctttctcggctacagcggcggcagcctcggcgagacctcggagctgct 





gatcctgctcggcgggctctggctgctggcactgcgcatcatccactgggagatcccgctgggcatgctgctgac 





ggtgggcgcgctggcggcgctggcgaaccagatcgacccgcaggtacatggcggcgggctgttccacctgacctc 





gggcggcttgctgctcggcgcgttgttcatcgccaccgatccggtgacctcgccgatcagccgcagtggccggct 





gatcttcgccatcggttgcggcgcgctggtcttcgtcattcgcagctggggcaatttccccgaagccgtggcgtt 





cgccgtgttgctcatgaacgccctggtgccgctgatcgaccgcgtctgccggccgcgtgcctatggccgcaacgc 





gcgcggcaagccgctggtggcggcgaagtggacccgccaggtgaaggaggtcgacaaggtatgaacgagctgacc 





cagacgccgcccgtggcagacggcaacgaaccgccgctcacccgacccggcctggtcgagacctggcgcgagcgg 





gtttcctaccaggcgctgtcgctgggcttggtctgcgccctggtggccgtggcgctgctgctcggcaaccagctg 





acccaccagcggattgtcgacgccgagcggcaggaccgcctcgccgtgctgcgccaggtgctgccgcaggcgctc 





tacgacaacgatccgctggccgatgccttcaacgtcgaggatgccgagctgggcctgatcgaggtgtacccggcg 





cggcgcgcggggcaactgacggccaccgccttccagatcagcaccgtcggctacggcggcccgatcgtccagttc 





atcgccctcgacagcgaaggccgcatcctcggcgtgcgggtgctcagccacaaggaaacccctggcctggcggac 





aagatcgaagtcacccgcagcgactggatcaaggccttcgacggcctgtcgctggccagcacaccgctggatcag 





tgggcggtgaagaaggacggtggccagttcgaccagttcgccggcgccaccatcaccccgcgggccatcgtcaag 





ggcgtgctccgggcgctcgagttccaggcccgccagtccaccgcccagtccaaccaggagactcggccatgagca 





gccaatgcggatcagcggatgtcacggcgcccaagcccaaggggctgttcaactacttcagctcggcgctgtggg 





actacaacgtcgccctggtgcagatgctcgcgttgtgcccggcgctggcggtgaccaccaccgctaccaacggcc 





tgggcatgggcctggccaccaccctggtgctgatgatcaccaatgcgatcatttccgcgctgcgccacagcattt 





cgccggcggtgcgcaacccgctgatgatcggcatcatcgccggcgtggtgaccctcatcgacatggcgatcaatg 





cctggatgcacgaactgtacaaggtgctggggctgttcatcgccttgatcgtgaccaactgcgcggtactcggcc 





gtgccgaatcgttctgcagccgcaacccggtgctgccctcgatcctcgacggcgccggcatgggcatcggcttca 





cctgggtactggtggtgatcggcgggatacgcgagatcctcggcagcggcacgttgttcgcccaggcctcgctgc 





tgctcggtgagcacttccgctggctggagatcaccgtcctgcccggcttccagggcatcctgctggcgatcctgc 





cgcccggggcgttcattgttctgggcttcgtgctggcgttcaagcgagtagttgatcgccggcgcgccgagcgac 





ggatcaggacccatggcgaactggtagtgttgcagtgagcccggccgaggagcgaagcagacgatgaagatttcc 





gttgtatacgccgcaccccggcagcccctgctgttcgattgccgggtggcggaaggctccagcgtggccgaggcc 





atcgagcactccggggtgctgcgctactgcccggacatcgacctgagcaagcaaaaggtcggggtctacggcaag 





ttcgtcaaactcgacagcccgctgaaggagggcgatcgggtggaaatctaccaacgcatcacgcgcgtgctggat 





gaagacgacgatgacgacgactgacagccgccgcggatgaccatagccgagagaggagcgaccgatgaacagcca 





gcccccgagcatgaaccgtgaaaccgcattacgcatcgcactggccgcccgggcattgcccgaggtgggcgtcgg 





ccggttgctggatatcctgcaccagcggatcgatggagaactgaacgaagagagcctgcagcgcgtgaccgtcac 





cgacctcaagacggcgttcgccagcgccgacggcgaggaggatggcgaggacatcggcatcggcctgccggcgct 





gaaggaagcggtgcgcatcctctggggcgaaggcgtcggcgacgacctgccgcagccggaggtcctggaccgcgt 





gccggaaggctcgatccgggtggccatcgcctccaacaacggcgagcgcctggacggccatttcggctcgtgcct 





gcgttttctgatctaccagatcggcctcgacagcctggcgctggtggacgtgcgctcggcgctggagaccgagtt 





cgccgaggatcgcaatggcgcgcgtgccgagctgatcggcgactgccaggtgctctatgtggtctccatcggcgg 





tccggcggcggccaaggtggtcaagaccggcctgtacccgatcaagaaggccggtggcgaggcccggcagattct 





cgccgacctgcagaccgtcatggccggcaacccgccgccgtggctggccaagctgctgggcgtgagcgccgagca 





gcgagtgcgcttcgaccgctccgacgacgaggcggcctgggcatgagcgatgtgcgcaggctggtcgccgtggcc 





atcgaccgccagggcaaggtcgccggtcacgccggtcgggcgcaccactggcaggtgtacgacatctggcccggc 





gaggcgccggaatccgtctatcgcctggcgctggacgaacaggcctgcctgcacgagtggcatgtcagcgcgcaa 





ccggaacgccatccgctgcacgcggtggacgtggcgatcgccgccagcgccggcgacggcgtggtgcgtcgcctg 





ggcgagcgcggcgtgacgctgttgaccaccgccgagagcgacccggaacatgccgttaaagcctggctcgccggc 





agcctgccgccaggcttgccgcacgaggagccgggctgcggcggcgaggggcaccggcatccctgagcgtgcggg 





gatgggacggatggcaaccccaggctgggtcgagccgcgcagcggcgaagcccaacgtcgtgcgggctcaagccc 





gtgcaaccggcattgttcgtgagaacaccatgggcggatgtggcgcctgatgatccgcgatgttgggcttcgctt 





cgctcaacccaacctacggcaccggggcgataggcaaaaaaactcccctgggagcgcaggggagtggctcatcgc 





caatatggggatgtcaaaccgttgcacgtgacccgggctgcgcccgggctctgcgagcccagggcaacctagggt 





ggaatcgagccccatgctggccaagcccaatacgcccctgggtggttcagatcggcccgcgcgcctcgcgacgat 





gggcgacggtgcagccaagggcggcctcgtagctcagcgtctccagcttcggccggtagtcgcgcagcgcggcgt 





agacggtgagtaccttgtcctcggcgctctggcccttgaagtcttccttgtccaggcgcaggtcgtcggccaggg 





tgttccacaccgcgccttcctcgtgcgcctgcagctgcagggtcatgccgtcgagctcggtgtacagcgactggc 





cgaggttctccacgtagaagatcaccggtgcgccgtcggccaggtcgcggcgatagcgttcgaggaacagcggca 





ggtgggccagttcgatcagggcgccggccatcatcgccagtttcccctgctgctcgagcatgaccgcctgcttga 





ccacggcggtcggcggcaggcggcggccggaggcgtcggcgatctcgccttcttcgagcaccaggtcgccgcgga 





aggcgtaggtgtccggcgtcgtggtcgcgccattgacgctgacgttgcacagcaggttttgcgtttcgtaggttt 





tcaacagcatggtcatggtctctcgtggaaaaaatggtcaggcgacttgtggggcgccctgggtcaggccaagca 





ggtcgtgccaatcggtctcgaccagttccagttgcttgcgcaggcagaccctgcggtcctttgcgctgggcgagg 





ggatcagcgcccagccggcgggcgcggctgcatcgtggatcaggtcgctcatcagcatgcgctcggtgtcgcggt 





tgaactgcatgccgaggaacaggtagcggcattccttgcgctgcagcttgagccagccgggaatggcgaaaccgc 





ccatcagttcggtgatgtagtcgacgaagtcggcgtcgctggcgacgtaggaagggctcggcagcggcgtgccca 





gcggcttgaacagcaccggtagcgcgggattgacctcgctctggtcgatgcgctggtagcggcctgcgtcgaagc 





gatagatgtcgaagcggtaggggctggcggcgatgcgcgcggcgccgaccaccagggtatgcggacggtcggcat 





agcagcgctgcagctgggtgtcgcgattgcaatcgatgacgtagggcaggttcagcccggccagccagcggtgca 





gcggcgcgggcgtccagttgtcgccgccgtaggtctgagtcaggaagcgctcgatgaagctgcggcccttcttgt 





tctccaggtgcatggcggcacgcgggaattcgtacatcagccgcggcgccatcggctgcccgccgttcatggcga 





ggatcaggctttcattgtcggccggcatcggctgacctgtgtcgcggtcgaccacgccgcccagcacaccggggc 





ccagatagggcaccagttcatgggcggcgaggcggtcggcgatttcctgcaaaggatcgttcacggcaaatctcc 





tgcggccagtggatttaccgatagccgatcgcaataaccgagccagccgggagcgtgcatgcaaccccttgatat 





atggggctttgaatgcggcgatagttgccgttcaggtgttttcgaaagtatcgaacgcgacaattgtcatgttcg 





caacagttgccgaaagtgtggaaaaccggcgcttggcccggccgatctttttgtcgccattgcaacagtcaggcc 





tgtcggttgttaactatcgaaccgccgaaggatgttgctagtaattaaattattctaattaaaacaagtgcttag 





attattttagaaacgctggcacaaaggctgctattgccctgttgcgcaggcttgttcgtgcctatagcccacgtc 





aagtggtaacgaaacctgaggaacttaattatggcaatgcgtcaatgcgctatttacgggaagggtggaatcggc 





aaatccaccacgacccagaacctcgtggcggccctggccgaactcggcaagaaggtcatgatcgtcggctgcgac 





cccaaggccgactccactcgcctgatcctgcactccaaggcgcagaacaccatcatggaaatggccgccgaggcc 





ggtaccgtggaagacctggaactcgaggacgtgctcaagaccggctacggcgacatcaagtgcgtcgagtcgggc 





ggtccggagccgggcgtgggctgcgccggtcgcggcgtgatcaccgcgatcaacttcctcgaagaggaaggcgcc 





tacgaggatgacctggacttcgtcttctacgacgtgctcggcgacgtggtctgtggcggcttcgccatgcccatc 





cgcgagaacaaggcccaggagatctacgtggtctgctccggcgagatgatggcgatgtatgccgccaacaacatc 





tgcaagggcatcgtgaagtacgccaactccggcagcgtgcggctcggcgggctgatctgcaacagccgcaacacc 





gaccgcgaggacgagctgatcatggccctggccgacaagctgggctcgcagatgatccacttcgtcccgcgcgac 





aacgtcgtgcagcgcgccgaaatccgccgcatgaccgtcatcgagtacgaccccgccgccaagcaggccgacgaa 





taccggaccctggcgaagaagatcgtcgagaacaagaaactggtcatccccaccccgatcagcatggacgagctg 





gaagccttgcttatggagttcgggatcatggacgaggaagacatgaccatcgtcggcaagaccgccgccgaggaa 





gtcgttgcctgatcgcttcagcagaacggggcagggcggatgggccctgccggggtgtcgcaccgtgcctggcac 





ggtgcggtgcgcccgtgacccgcacatgaacgcaagaggaggtcaatcatgaccggtatgtcccgcgaagaggtg 





gaatccctcatccaggaagtcctggaagtctatccggagaaggcccgcaaggaccgcgccaagcacttgtcgccc 





aacgacccggcgcttgagcaatcgaagaaatgcatcacttccaacaagaaatcccagccgggtctgatgaccatc 





cggggctgcgcctacgccggctcgaagggtgtggtctgggggccgatcaaggacatgatccacatttcccacggg 





ccggtgggctgtggccagtactcgcgcgccgggcggcgcaactactacatcggtaccaccggggtgaacgccttt 





gtgaccatgaacttcacctcggatttccaggagaaggacatcgtcttcggcggcgacaagaagctggccaagctg 





atcgacgagatcgagacgctgttcccgctgaacaagggcatctccgtgcagtccgaatgccccatcggcctgatc 





ggcgacgacattgaggcggtcgccaagaagaaggccgccgagcacgaaaccaccgtggtaccggtgcgctgcgaa 





ggtttccgcggggtgtcgcagtccctcggccaccacatagccaacgacgccatccgcgactgggtgctggacaag 





cgcgacgatgacaccagcttcgagaccacgccctacgacgtttccatcatcggtgactacaacatcggcggcgat 





gcctggtcctcgcgcatcctgctcgaggaaatgggcctgcgcgtggtcgcgcagtggtccggcgacggcacgatt 





tccgagatggaactgacgcccaaggtcaagctcaacctggtgcactgctaccgctcgatgaactacatctcgcgg 





cacatggaagagaagtacggcattccgtggatggagtacaacttcttcggcccaaccaagaccgccgagtcgctg 





cgggccatcgccgagcatttcgacgacagcatcaaggccaagtgcgagcaagtgatcgccaagtaccagtcggag 





tgggaggcggtgatcgccaagtatcgcccgcgcctggaaggcaagcgcgtgatgctctacgtcggcggcctgcgt 





ccgcgccacgtgatcggcgcctacgaggacctgggcatggaagtggtcggcaccggctacgagttcggccacaac 





gacgactacgaccgcaccctcaaggaaatgggcaacgccacgctgctctacgacgacgtcaccggctacgagttc 





gaggagttcgtcaagcgcatcaagcccgacctgatcggctccggcatcaaggaaaaatacatcttccagaagatg 





ggcattccgttccgccagatgcactcctgggattattccggcccgtaccacggctttgacggcttcgccatcttc 





gcccgtgacatggacatgaccctgaacaacccgtgctggaagaagctgcaggcgccctggcagaaggccgaggaa 





tcggccgagaaggtcgccgccagcgcctgatggtccgcagtcgtacgcaacgtccgcggcggccggcgcaggccg 





gtcgctgccgacatccgtgatcgccgttcacagatgagtgaggcgaaggagagagtcatgagccagcaagtcgat 





aacatcaaacccagctatccgctgttccgcgacgaagactacaaggacatgcttgccaagaagcgcgatgccttc 





gaggagaagcatccgcaggacaagatcgacgaagtcttccagtggaccaccacccaggaataccaggagctcaac 





ttccagcgcgaagccctgaccgtgaacccggccaaggcctgccagccgctgggctcggtgctctgcgccctgggc 





tttgagaagaccatgccctacgtgcatggctcgcagggttgcgtcgcctacttccgtacctacttcaaccggcat 





ttcaaggaacccatctcctgcgtgtcggactccatgactgaagatgcggcggtgttcggcggccagcagaacatg 





aaggacggcctggccaactgcaaggccacctacaagccggacatgatcgccgtgtccaccacctgcatggccgag 





gtcatcggcgacgacctcaacgccttcatcaacaactcgaagaaggagggcttcatccccgaggactacccggtc 





ccctatgcccacaccccgagcttcgtcggcagccacgtcaccggctgggacaacatgttcgagggcatcgcccgc 





tacttcaccctcaatcacatggacgacaaggtggtcggtagcaaccacaagatcaacgtcgttcccggcttcgag 





acctacctgggcaacttccgcgtgatcaagcgcatgctcaaggaaatggacgtcggctacagcctgctctccgac 





ccggaagaagtgctcgataccccggccgacggccagttccgcatgtactccggcggcaccacccaggacgagatc 





aaggatgcgcccaacgccctgaacaccctgctgctgcaaccctggcagttggaaaagaccaagaagttcgtcgaa 





ggcacctggaagcacgagacgcccaagctgagcatccccatgggcctggactggaccgacgagttcctgatgaag 





gtcagcgagatcaccggccagccgatccctgaaagcctggccaaggagcgcggccgcctggtcgacatgatgacc 





gactcgcacacctggctgcacggcaagcgcttcgcgctctggggcgatccggacttcgtcatgggcatggccaag 





ttcctcctggagctgggcgccgagccggtgcacatcctcgcccacaacggcaacaagcgctggaagaaggccatg 





gacgcgatcctggagtcctcgccctacggcaagaactgcaccgtgtacatcggcaaggatctctggcacatgcgc 





tcgctggtgttcaccgacaagccggacttcatgatcggcaatagctacggcaagttcatccagcgcgacacgctg 





cacaagggcaaggaattcgaggtgccgctgatccgtctcggcttcccgatcttcgaccgccaccacctgcatcgc 





cagaccaccctgggctacgaaggcgccatgcagatgctgaccaccctcgtcaatgccgtgctcgagcgcctcgac 





gacgagacccgcggcatgcagagcaccgactacaactacgacctggttcgttgaccgctagcggggagggcgacc 





tccccatcctggccggccgacgcaccgcaatggtcgtcggccggccagccctattttcaggaagcctcccatgcc 





cagtgtcatgatcagccgtaacaagaatggccagctgaccttctacatcgccaagaaggaccaggaagaaatcgt 





cgtcagcctggaacacgacagccccgagcgctggggcggcgaagtcgccctggccgatggctccagctactacct 





cgaacccctctcggcaccgccgaaactgccgatcaccctgcgcgccaaacgggccggcgagggctgaacgatggc 





gcccagcaacggacgggctccgctgccggctcacctggccctgcgcatcgccctggcggcgcgcgagctgaacgg 





cgtggataccgggcaactgctgcgcaccctgctcagcgtcaccggcgagccgatcaccgaagcgcggctggccag 





gctgcgcctaaaccgcctgcgcaaccgcctgctgagcagcgtcgacgggccaccgccggtgctcagcgagcggca 





attgcagcgtgcgctcggcctgctcaaggggcgtggcgtgcgaatgcccgaggaaccgttgccggccatcgagcc 





ctatcgcgaaggcgagttgccggattcgatccgcatcgcctgcacctccgacggcggcgagcgcctggacggcag 





cttcggcagctgcgcgcgctttctcgtctaccagatctcgccgagcgccagccgcctgatcgacctgcgcgagcc 





ggggccggccgcgccccacgaggatcgccatgcccgccgcgccgaactgctgcacgactgccagctgctctacac 





cctgagcatcggcgggccggcggcggccaaggtgattcgcgtcggcacccacccggtcaaggtcatgcggccgat 





cccggcccgcgagatcgtcgaggaactgcaacaggtactggccagtgcgccgccgccctggctggccaaggctat 





gggcagcgagccggcaccccgcgtttccatgtctgaaaaagaggacaccccatgatcagtcagacccagctcgac 





gcggtcatccgccaggccgagaacggcccgctgaacgaggcgctgctcgccaggctgcgcagcgagcaccctggt 





atccacttcacctgttgcatggacgacgacgtggtggtcaacgccaagccggttgccgagcggccggggttcaac 





gtctatctggtcaactccagccagcactgctcggtgctgagcaacgacctggacgccgcctcgggcatcgtcctg 





gccgaagtcatcgccgattagagagcgcccatgcagaacgacggtagcgaggacattatccccctggcggactgc 





cgcgattgcagctttcgcggcgacctgctgcccagcggccgctgcacgccgggcgaccgctgcgtagcgatccac 





agcggccggcagatcgaccgtttcttccggcagaatccgcagctggccgtacactacctggccgatccgttctgg 





gagcggcgcgccatcgccgtgcgctacgccccggtggaggcgctgctgtcgatgatccacgacgtcgacgaggcg 





gtgcgtcgtgccgtcgcctaccgcctgccgcgcgagcgcctgggcgaactcatgcgcgacccggatcgggaagta 





cgcatcaccgtcgccgaccgcctgccggccgagcagctggaacggatggctgccgacccggattacctagtgcgc 





gcctacgtggtccagcgcatcgccccagggcggctgttccgcttcatccgcgacgaggaccgccaggtgcgcaag 





ttcgtcgcccagcgtctgcctgaggaaagcctcggactgatggtcaccgaccccgaaccggaagtccgccgcctg 





gttgccgcgcgcctgcatggccaggacgtgctggaaatgctccacgaccccgactggacggtacgcctggccgcc 





gtggaaaacgccccgctcgaggccctgcgcgagctgaacgaagacgatcccgaagtccaggctgcgatcgcgcaa 





cggttggggtaggttgggtggacgcccgacccgagatgatgctttttaggctttggtaggcctgccggcctgcat 





cgccgcgagggcgcgcctcccacaggtccgcaggctgcttgctgcctttgtgagcccgaccacggggcgatgctt 





ttcgctagggtgggccgggcggcgttccgcttcagcccaccaatcaagccagcgatcgcgaaggatgctggtggg 





ctgatgcccaccctacggatccgtaccgcccgacccggcctacggggccactcgccgaatcctttgttgcgaacc 





cgacatctgggcgcgtttgcgacaattttatttcaatgaaaatcatataaatcaatgagttaatttttggtacag 





gcattgcactcacctcgttgcgcataaccacgacgaccggagggtgcgatgaaagccaaggacattgccgagctg 





ctcgacgagcccgcctgcacgcacaacaagaaggagaagtccggctgcgccaagccggcgccgggcgccaccgat 





ggcggctgcgccttcgacggcgcgcagatcgcgctgctgccgatcgccgatgtggcgcacatcgtccatgggccc 





atcgcctgcgccggcagttcctgggacaaccgcggcacccgctccagcggcccgcagttgtaccgcatcggcatg 





accaccgatctctccgagcaggacgtgatcatggggcgcgccgagaagcgcctgttccacgccatccgccaggcg 





gtggagagctacgcgccgccggcggtgttcgtctacaacacctgcgtgccggcgctgatcggcgacgacctcgac 





gccgtgtgcaaggccgccagcgagcatttcgccaccccggtggtgccggtggacggcgccggtttctacggtacc 





aagaacctcggcaaccgcatcgccggcgatgccatggtcaagcacgtgatcggcacccgcgagcccgacccgctg 





ccggccggcgccgagcgcgccggtattcgcgtgcacgacgtcaacctgatcggtgaatacaacatcgccggcgag 





ttctggcacgtgctgccgctgctcgacgagctgggcctgcgcgtgctctgcacgctgtcgggcgatgcgcgtttt 





cgcgaggtgcagaccatgcaccgcgccgaggtgaacatgatggtctgctccaaggccatgctcaatgtcgcgcgc 





aagctgcaggagcgcttcggcacgccctggttcgagggcagcttctacggcatcaccgacacctcgcaggcgctg 





cgcgacttcgcccggctgatcggcgacgacgacctcgccgcgcgcaccgaagcgctgatcgcgcgcgaggaagcg 





aggattcgcgcggcgctggagccctggcgcgaacgcctggccggcaagcgcgtgctgctctacaccggcggggtc 





aagtcctggtcggtgatctccgcgctgcaggacctgggcatgaaggtggtcgccaccggcaccaagaaatccacc 





gaggaggacaaggcgcgcatccgcgagctgatgggcgacgacgtcaagatgctcgacgagggcaacccgcgcgcg 





ttgttgcgcacggtggaggaataccgcgccgacatcctcatcgccggcggtcgcaacatgtacaccgcgctcaag 





gggcgcatccccttcctcgacatcaaccaggaacgcgaattcggctatgccggctacgacggcatgcgggaactg 





gtgcgccagctgtgcctgaccctcgagagcccggtgtggccggcggtgcgccagccggcgccgtgggagcggccc 





gcgtcggccgaggcacaaccccgcacgctggcgaacgcctgaggaggtcgcgatggcacagatcatcaaccgcaa 





caaggcgctggcggtcagcccgctgaaggccagccagaccatgggtgccgcgctggccttcctcggcctggcgcg 





cagcatgccgttgctgcacggttcgcagggctgcacggcgttcgccaaggtgttcttcgtccggcacttccgcga 





gccggtgccgttgcagaccacggcgatggatcaggtcagctcggtgatgggcgccgacgagaacgtggtcgaggc 





gctgcgcaccatttgcgacaagcagcatccagcggtgatcggcctgctcagcaccgcgctggcggagacccaggg 





ctgcgacctgcacagcgccgtgcatcagttccgccgcgaatatcccgagtacggcgacgtggccgtggtgtcggt 





gaacagcccggacttcagcggttgcttcgagagcggtttcgccgccgcgctcaaggcgatgatcgaggcgctggt 





gcccgagcgccgtgaccaggtcggccagcggccgcgccaggtcaacgtgctgtgcagcgccagcctgacacccgg 





cgacctggaattcgtcgccgagagcatcgagagcttcggcctgcggccgttgctgatccccgacctgtccggctc 





gctggacggccatctcgacgaggcggccttcaacccgctgaccaccggcgggctgaccctcgacgagttggccag 





tgccgggcagagcgccgccaccctggtgatcggccagagcctgaccgccgccgccgatgcgctggccgcccgcag 





cggcgtaccggaccggcgtttcggcctgctgctgggcctggaggcggtggatgcctggttgatggcgctgagcga 





gatcagcggcaacccggtgccggagcgctggcagcgccagcgccggcaactgcaggacgccatgctcgataccca 





tttcatgctcggcgacgcgcgtctgggcatcgccgccgaccccgacctgctgctcggtttctccaccctggcgcg 





cggcatgggcgcgcaactggtggccgccgtggtgccggcgcgcgcgccggcgctggccgatgcgccgctggcgcg 





catccaggtcggtgacctggaggacctggagcaggccgcccgcgacggtggtgcccaactgctgctcggcaacag 





ccacgcgctggccagcgccgaccgcctgggcattccgctgctgcgcgtgggctttccgcagtacgacctgctggg 





cggcttccagcgctgctggagcggttaccgggccagcgcgcaggcgctgttcgacctggccaacctgctcaccga 





acaccatcagggtatcgcgccgtatcgctcgatctatgcgcagaagcccgcctccgaccattcgcaatggagcca 





ctgagccatggccagccccatccgacaactgcaggtactcgacggcgagaacgacggcacgctgctcaaggtggc 





cttcgcctcgtccgatcggcgcacggtcgaccagcatttcggttcgtcgcggtcgttcgtgttctacggcatcga 





ccccgagcgggccgagctgcaatcggtggtggaattcggcgagctcgaccaggacggcaacgaggacaagctggc 





ggccaagctggaactgctcgatggctgcatcgcggtgtactgccgcgcctgcggcgcctcggcggtacgccagct 





gctggcgatcggcgtgcagccggtcaaggtcagcgaggccgagggcatcgccgaactgatcgaaacgctgcaggc 





cgagctgcgcgaaggcccttcggcctggctggccaaggcgatccggcgtacccgtggcacgccggaccagcaacg 





tttcgaggccatggccggcgaggcctgggacgaatagcccgacacccgcaatcgaggacagcgttatgtatgcag 





aagaacaacaggcggtcgttcgcgacgacgccccggccctgcaggacccggtgatcaagcagatggtggtgcaac 





tgcgcgccatggacagctacggcacctacgacacctggagcgacgcgcgggtgctcgacccgctggtgctgaccc 





gcgagcggcgccgcgcgatccccatcgtcggcgatccggacgaggtcaccctgtcgcgggtcaaggccttctaca 





acgccctggcgcagatgatcgagcgcgagaccgggctgctcgcggtaccggtgatcaacatcacccacgagggct 





tcggccgcgcgctgatcctggtcggcaagctggtggcgctggacaagaccctgcgcgacgtccatcgcttcggct 





tcgaatcgctcgaggcgttgtcgctcgacgcgcagaagctgctgggcaaggcgaccgcgctggtcgccgagcacc 





gtacggtcgccgagttgtaaggggagacgagccgatgaccgaagaggaactcaaggcgttgaagaaggaagtcag 





ccagaagaagcgcatcgccaccgaatgggcgtcgcagatccacgacctggtcgaggaccggctgctgatcgatta 





ccggcaattgccggaactggcgacgcaggcacaccaggcctgcctcgactgggccgaggccaacgcccggctgga 





agcggccggcaacgcctgaccgccaatacagagcgggcccgagcccgccgtatccctaaccgtaggccgccgcca 





tgccattggcgggcaggagatgacagatggaagcagtgataaccgggcgtacgcgcggtggcgccgaatgggtgc 





cgcagttcgtcaccgccgtcgatgcgcagaagtgcatcggttgcgggcgttgctacaaggtgtgcccgcgcgacg 





tgttcgagctggtggagcgctccggcatggtgggcgaggacgacgacctctacgacgaggacgacgagatgatgg 





tcatggccatcgccgacggcctcgactgcatcggctgcaaggcctgttcggcggtctgcccgaaacaatgccata 





cccatcaggccctggccggctgaggagctgctgacatgccaagacccgactaccacatcttcctctgcctgcagc 





gccgcgccgaggggcacccgcgcggcagttgtgctgcgaagggcggcgaagccctgttcgacgccttctcccagg 





ccctgatccggcgcaacctgatcggccgcatcgccttgaccggcaccggctgcctggggccctgccaggccggcg 





ccaatgtgctgatctacccgggcgcattgatgtacagctgggtggagccggcggatgtcgacagcatcctcacgc 





atctgctcgaaggcgagcccttcgccgacaagctcacccccgcggagctctggtgaggcatgggtgaagtgctgt 





tgctggagcccgaacgggcgttcttttccgaccgcacgccgaccgggctgcgctacctgctgaacagcgcgcgcg 





gcctcgagcatccggcggcggtcgaagccctgctgctggaggcccggcagcgctggagcgaggagccggacgcgc 





atgtcggcctgtacaagttctactttctccaggcccgctacgcggaggccgaagccgccgtatgggaagccctgc 





ggcgggccgcggcctgtgccggcttcagccgcaactaccggcgcctgcaccctgccagcgccgactggcagacac 





gccgcggtgccacgcggttgtacctgttcagcctcaaggcgctgggcgtgatccgcctgcgccgtggcaaggtgg 





acaacgcgcggcgggtgctggagaagctgctggagctcgatccgggcaacgagatcggcggcgaggcgttcctgc 





agatcgcccgcgccttcgaggaggaaaactgatggcggcatcgttcgaagcacgcctgcaggcggcgcggccgct 





gttcggcgaaatccagcgcgcgctgcaggattgcctgcagcgttcggccatccgcctgcaactgcccgacgagcg 





tgaaccgtcgcgcagcgaagtgcgggtcgacccgttcgatcgcagcgaatgcttctacagcgaatggcgcagcgc 





ccagggcgatttcctcggcagcatgcagatcaacggcgacggtcaggtctatgccgagttcgacgtgctgctgaa 





gcacccgcacgagccggcctggctggtggaggcggtcgccgcctggggttggccgggggcgctgaaaagcgagtt 





gcgcctgctgccggcgctcgatcatgaatgagctctacgactggctgctggccagcgccgcgcaggcgcggaccg 





tcgaacatctgtgcctggggttgaactggacactggccgaagtcgacggcaaccagggcttcgccttcagcccgc 





gccaggtgccgcgcacgctcggctggtcgggcacactcgccggccagggcaacgccgcgctgctgccctggctgc 





tgtcgtggaacagcgccgaagccgcggtcggcctggccgtgctcaatgccagcgtgaacacggcggcgggctgcc 





agcgcgaggcgcaggcactgcgcacgcaggcaccggggcatctgcaggtgttcgcacatttccgtccacggctgg 





cgggccagcgggtcgtggtgatcggccattatcccggcctcgaacggctctggcaggaccagccctaccagtgcc 





tggagcgccagcagcaggagggcgacctgcccgattgcgccgccgagtacctgctgcccgaggccgactgggtgt 





tcgtcagcgcgagcagcatcgccaacaagaccttgccgcgcctgctcgagctgtcgcgccaggcccaggtggtgc 





tgatggggccgagcctgccctggctggacggttggcggcgcttcggcgtggactacctggccggggttcgcgtgc 





tcgacccggacggcgtgcggcgggtgattgccgagggtggcggtacgcggctgttcgccgggccggtggagtatg 





ccttgatggcgctcgggaaatgatggggtctcacggccggctgggctggcggatgctgatctgtcacaagcaccc 





ggtcagcgcgcgcctgcatttcctcgtgccgcagcgcggcggggtggtcttgccgcagccccttccggccctcgc 





ggtattcgccgaaccgccgatgcagggcgatctgctggtccatcctgcgggcgctctgcgcagcctgcagcgcga 





cctggggatcgagaaaccgctggagctggtggccgattaccgggtcggcctcgaagtgtcgggcggggttctgcc 





ggtattcctcgccgcactggacgggcacgatcggtgccgggcggccatcggaacccactggatcgaactgacgca 





gagcatcggcatgccctggctggaccgcgaactgctcaggcgggcctatgaagtgctgatcgggtgaagcgtagg 





cgcgtggatcgggcggtcgcctagcctgaatttccagacatatggacgccacccatcctactgcaccgaaaagca 





tcgccccgagggcgggccccccacaaaagcagccagcagcaccgagccccccgtgggcgcgccctcgcggtgatg 





caggccggtaggcctgccaaagactgaaaagcatcgccccgagggcgggcctcccacaaaagcagccagcagcac 





cgagcccccgtgggcgcgctctcgcggcgatgcaggccggtaggcctgccaaagactgaaaagcatcgccccggg 





gtcgggcctccacaaagcagtcccgtagggtgggccgggtggcgttccgcttcagcccacccattccaggcaatg 





ggcgtcatcgaagtgggctgaagcccaccctgctgctgcgtgccgaaatgtaacctcgtgacggatgcgcggacc 





gatggctgacgtgttggcgctcagccacctcccgcacctcaggcgcgcagcagcgccttggccatcttcggcgac 





agctgggcttcgctgaactgtggctcgttcggcggatagagcaggtcctcgatgatgctgtagccgtgttccttg 





ccgagggcgatcacgtcgcggaccttttcacaggccttgagtttttcgccgagcgccgggtcgttcagggcttgg 





ttcgagaaggcttggatttccttgatggacatagggttctctctgttgcgatgactggaaccagcgccgaacggc 





tggcgaggcatgccatagcaacatcgatgcctgagatcattccattgaatatcaatggcttatgaggttttgacg 





agctgccgattgtcgtattggcgacaatcggacaacagccgggctcaacccagcagggccacggccttgatctgt 





gcccacagcggcagcccgggagcgatgcccaactggtcggccgagcggcgagtgatgcgcgccagcagcggcgtg 





ccgccggcatccaggcgcaccagcacgtgggccggggtatctgccgcggccagcgcttcgactcgcgccggcagc 





aggttggtgatgctgctgccctcggcacgggtcagcgccaggctgacgtcgcgggcatgcacgcgaaagcgcagg 





cgctggccgagcgcttccggccgctgcgccaccagtacctcgccgccggggaaggtcaggcgggtcagatggtag 





gcgtcgtcgtgttcggccacgtgggattcgaccaccacgccggcgtcctcgccgagggcggtgggcaggtccagt 





cgtgccagggtttcgcgcaggccgccggcggctaccgcccggccctggtcgagcaacaccacgtgatcggccagc 





cgcgccacttcgtccggcgaatggctgacgtagagcagcgggatgtcgagttcgtcgtgcaggcgttccagatag 





ggcaggatttcgttcttgcgcttgaggtccagcgccgccagcggttcgtccatcagcagcaggcgcgggctggtg 





agcagggcgcgggcgatgccgacgcgctggcgctcacccccggacagcgttcccggcaggcgctccagcaggtgg 





tcgatacccagcaggttcaccacatggtcccagtccacccggcgctgggcggccttgacccgacgcaggccgtat 





tcgaggttgcgccgtgccgtgaggtgcgggaacaggctggcttcctggaatacataacccagggcgcgcgcgtgc 





gtcgggacgaacagcccgcgcgcactgtcctgccagcgttcgccgttgacttccaggtacgcctcgccggcgcgc 





tccaggccggcgacgcagcgcaggcaggtggtcttgcccgagcccgaatggccgaacagcgccgtcacgccgcgg 





ccaggcagggcgaggtcgacgtccagttcgaagccgggccaggtcaggcggaagcgggcgtggatctgcccggcg 





gttggtgagtcgttcatgcacgagtcccttcaattgaggccggacttgaaacggcggctggagtacagcgccagc 





agcacgcagaaggagaacgccagcatgccgccggccagccagtgggcctgggcgtactccatggcctcgacgtgg 





tcgaagatctgtaccgagaccgtgcgggtgacaccggggatgttgccgccgatcatcaacaccacgccgaactcg 





ccgacggtatgggcgaagccgaggatcgaggcggtgacgaagcccggccgcgccagcggcagtaccacgctgaag 





aaggtgtcccagggactggcgcgcagggtggcggctacttccagcgggcgctcaccgatggcttcgaaggcgttc 





tgcaggggttgcacgacgaagggcatggagtaaagcaccgagcccaccaccagaccggcgaaggtaaagggcagc 





agaccgaggccgaggctctgggtcagctggccaaccaggccgttagggcccatggcggtgagcagatagaagccc 





agcacggtcggcggcaacaccagtggcagtgccaccactgcgccgaccggccccttgagcggcgaatgagtacgc 





gccagccaccatgccagcggcgtgccgatcagcaacagcagtgcggtggtgaggctggccagcttgaaggtcagc 





cagatagctgcgaaatcgacgctgtcgagcatcatcgcggttcagtccagctcatagccgtaggcgcgaatcagc 





gcggcggcggtatcgcccctgaggtagtcgagcagcgcctgtgccgccgggttgccctcgccatggcgaagcagc 





agggcgtcctggcggatcgccgcgtgctggtcggccggcaccacccaggccgagccgcgggcgatgcggccgtcc 





tcggtcacctgggacagcgcgacgaagcccagctcggcattgccgctggcgacgaactggtgggcctgggcaatg 





ttctcgccctgcacgaagcgtggctgcagccgttcgcgcaggcccaggcggtctaaggtttccagtgccgcggcg 





ccgtagggggcggttttgggattggccagggccaggtgacggaagtcgccgtcggcgaggatgcgcccctgcgga 





tcgacataaccctcgcgcgccgaccacagcaccaggctgccgatggcataggtgaagcggctaccggagacgccg 





gaaccctcgtcctcgagtcgtgccggtgtgctgtcgtcggccgccagcaggatgtcgaagggcgcgccattgttg 





atttgcgcgtagaacttgccggtggcgccgaaggccagcacggcgcggtggccggtgtcgcgggcgaaggcggcg 





gcgattttctgcattggcgcggtgaagttggccgccacggccacctgcacgtcgtcggcgatggcggttagtggc 





aggcagagcagcagggcggcgcagaaacggcggacagaatgcatggcgactcctttcaatcgacggcgatgatga 





cgtgggatgccttgatcagcgcggtgcagggctggcccagggccaggccgagctcttcggcgctctcgttggtga 





tcacggcgctgagggtgcggttgcccggcagcagcagcttgacctcgcagttcaccgcgcccggcatcagcgcgc 





tgatggtgccggtgaggcgattgcgggcgctgatcttcacgtcaggatcgggcgagagcagcacgaagctggcct 





tgatcagcgccatggcggtattgctgggcgccaactgcagttcgtcgatgctgtcgttggtcagcgtggcgctga 





tgcacaggcctgcgccgatgtccaggcgcaggctgccgttgacggcccccttgtcgacggcggtgatacggccgc 





ggaattgattgcgtgcgctggtcttcatggcgatggccctcagcagccggtcgatgtcgtcgaagccttcgatgc 





cctcggcgacctgggcgagaaagcgctcgtattcggcctgcatgcgccgccatacgtggagcatctcgcggccga 





agtcggtcaggcgcgtgccgccgccctgggcgccgccggcagagcagatcaccaacggccgctcggacaggttgt 





tcatggcatccactgcatcccaggcggccttgtagctcagcttgatggccttggcggcgcggctgatggaaccgg 





tggcctcgatctgctccagcaggtcgatgcgcttgccgcccagatagcctttctcgccccggttgaaccagagct 





ggccgtcgatgcgcaggggtaggtccgcttcgttcatgtcgtttcctcgggctccggctctgggcctggagcaag 





caagaatgcatccaggtctgtgttttcaaataaatccatgaaaatcaaaaagttaatgctttcatggaggccccg 





tgagctgtctggaagatgacattgtgtgatgcgctatatcgttttgtatatagcgctacagaggtattccggccc 





gcccgaggaaccgcggcctggtgtgtcgcaaagccgacattgcgccccatgcgtaccgttcgcgacagcgggaag 





gtcgtgcgatgaatctatatgtatttgaaaaataattgtttttcagcttggcaaggctgggcatgggcgttgcag 





aagtacctgtgccgggtggccagatcgccgccacagccgaggagacatgccgatgattaccctgactgaaagcgc 





caagagtgcgattaaccgcttcatcagcaacgccgacaaacccaccgccggcttgcgcatccgcgtcgagggcgg 





cggctgtgcggggctgaagtacagcctgaagctggaagagcaaggcctcgacggggaccagcaggtcgactgcgg 





cgccttcaccgtgctgatcgacgacgccagcgcaccgctgctcgacggcgtgaccatggacttcgtcgacagcat 





ggaaggcagcggcttcaccttcgtcaacccgaacgccagcagcggttgcagctgcggcaagtccttcgcctgcta 





agcgccattcgaggcggccggccacgaccggccacccagcattcaccgggagatcagccgtcatgtgggattatt 





cggaaaaggtcaaagaacacttctacaacccgaagaacgccggcgccgtggccgaggccaatgccgtcggtgacg 





tcggctcgctgagctgcggcgatgccctgcggctgtcgctgaaggtcgatccggacaccgacgtgattctcgacg 





ccggcttccagaccttcggctgcggctcggcaatcgcatcgagctcggcgctgaccgagatgatcaaggggctga 





ccgtcgacgaggcgctgaagatcagcaaccaggacatcgccgacttcctcgacggcctgccgccggagaagatgc 





actgttcggtgatgggtcgcgaggccttgcaggcggcggtggccaactaccgcggcgaaaccctcgaggacgacc 





acgaggaaggcgcgctggtgtgcaagtgcttcgccatcgacgaggtgatggtgcgcgagaccatccgcgccaacc 





ggctctccagcgtcgaggacgtgaccaactacaccaaggccggcggcggttgctcgtcctgccacgaaggcatcg 





agcggttgctggtcgaggaactggccgcgcgcggcgagatcttcgttccggccggtaccggcgccaaggcggcga 





agaaggccaaggcgccgctggtgaccctggaaaccccgccggcggctccgcaggcggcgcccaccgcgccgcgca 





tgaccaccctgcagcgcatccgccgcatcgaacgcgtgctcgaatcgatccgcccgaccctgcagcgcgaccacg 





gcgacgtcgagctgctggatgtcgagggcaagaacatctacgtcaagctgaccggcgcctgcaccggctgccaga 





tggccagcatgacgttgtccggcatccagcagcggctgatcgaggaactcggcgagttcgtcaaggtggtcccgg 





tcagctccccggcccacagcgcgatggcggaggtgtgagatgagcggcatctatctcgacaacaacgcgaccacc 





cgtgtcgatgacgaagtggtgcaggccatgctgccgttcttcaccgagcagttcggcaacccctcgtcgatgcac 





agcttcggcaaccaagtcggcatggcgctgaagaaggcgcggcagagcgtgcagcggctgctcggtgccgagtac 





gactcggaaatcgtgttcacctcctgcggcaccgaggccgattccaccgcgatcctctcggcgctcaaggcccag 





cccgagcgcaagacgatcatcaccacggtggtcgagcacccggcgatcctcagcctgtgcgactacctggccgag 





gacggctacaccgtgcacaagctcaaggtggacaagaagggccgcctggatctggacgagtacgccgcgctgctc 





gacgacgacgtggccatcgtctcggtgatgtgggccaacaacgagaccggcacgctgttcccggtggagcagatg 





gcgcagatggccgacgatgccggggtcatgttccatagcgatgcggtgcaggcggtcggcaaggtgccgatgaac 





ctcaagggcagcgccatccacatgctctcgctgtccggccacaagctgcatgcgcccaagggcgtcggggtgctc 





tacctgcgccgcggcacgcgcttccggccgttgctgcgcggtggccaccaggagcgcgggcgccgcgccggcacc 





gagaacgcggcctcgatcatcggcctgggggtcgccgccgagcgcgcgctggccttcatggaacacgagaacacc 





gaggtccgccgcctgcgcgacaagctcgaggccggcattctcgccgccgtgccctacgccttcgtcaccggcgat 





ccgggcaatcgcctgccgaacaccgccaacatcgccttcgaatacatcgagggcgaggccatcctgctgctgctg 





aacaaggtcggcatcgccgcctccagcggttcggcatgcacctctgggtcgcttgagccgtcccacgttatgcgt 





gcgatggacattccctatacggcggcccacggcagcgtgcgcttctcgctgtcgcgctacaccaccgaggagcag 





atcgactacgtgatccgcgaggtgccgccgatcatcgcccagttgcgcaagctgtcgccctactggagtggcaac 





ggcccggccgaggcagtgggcgactcgttcgaaccggtctacgcctgaccgccgcttgaccgcggccccatcgcc 





gaggaggttcagcatgtctatcgtgatcgacgacaccaccctgcgtgacggcgaacagagcgccggggtcgcctt 





cagcgccgaggagaagctcgccatcgcccgtgctctggcacagctcggcgtgccggagctggagatcggcattcc 





cagcatgggcgaggaggagtgcgaggtgatgcgcgccatcgccgggctggccctgccggtgcggcttctggcctg 





gtgccggttgtgcgacgctgacctgctggccgccggcggcaccggcgtcggcatggtcgacctgtcgctaccggt 





ctcggacctgatgctgcagcacaagcttggccgcgaccgcgactgggcgttgcgcgaggccgcgcgactggtggg 





cgctgcgcgcgacgccggcctggaggtgtgcctgggctgcgaggacgcctcgcgcgccgatccggagttcatcgt 





ccgcgtggcggaagtcgcccaggccgccggtgcgcgacggctgcgcttcgccgatacggtgggagtaatggagcc 





attcgcgatgcacgcgcgcttccgctttctcgccgagcgcctggatctggagctggaagtgcacgcccacgacga 





cttcggcctggccacagccaacaccctggcagccgtgcgcggaggtgccacgcatatcaacaccacggtcaacgg 





cctcggcgagcgcgccggtaatgccgcgctggaggaatgcgcgctggcgctcaagcacctccacggcatcgactg 





cggtatcgacgtgcgcggcattccctcgatctcggcgctggtggagcaggcctccgggcgccaggtggcctggca 





gaagagcgtggtcggcgccggggtgttcacccacgaggcgggtatccatgtcgacgggctgctcaagcaccggcg 





caactacgaggggctcaaccccgacgagctcgggcgcagccacagcctggtgctgggcaagcattccggcgcgca 





catggttgagctgagctaccgcgagctgggtatcgagctgcagcagtggcagagccgcgcgctgctcggctgcat 





ccgccgtttttccacgcagaccaagcgcagtcctcagagcgccgacctgcagggtttctaccagcagctgtgcga 





acagggcctggccctggccggaggtgccgcatgagcctgtaccgagaatgccgcgacgacgtccgttgcgtgttc 





cagcgcgaccccgcggcgcgctccacgctggaggtgctgaccacctatccgggcgtgcacgcaatcatgctctac 





cgcttcgcgcatcgcctgtggcgacgcgagtggcgctatgccgcgcgtctgttgagtttcgccggacggctgctg 





agcaacgtcgatatccaccccggcgcccgcatcggtgcgcgcttcttcattgaccatggcgctggggtggtgatc 





ggcgaaaccgccgagatcggcgacgacgtcaccctctatcacggtgtgaccctgggcggaaccagctggcgcaag 





ggcaagcgccacccgaccctgggcgacggcgtgctggtcggcgccggggcgaagatcctcgggccgatcagcatc 





ggtgctaatgcccgggttggcgccaactcagtggtggtgcagaacgtgccggacgggtgcacggtggtcggtatc 





cccggcaaggtggtgcgcctgcgcgaggccggccggcccaacgtgtatggcatcgatctcgaccattacctgatt 





cccgacccggtgggcaaggccatcgcctgtctgctggagcgcctggacaacctggaaaggcaggtcgagcagggc 





ggcctggtcgccgccggcagccagcagcggcgctaccaggaatgccagccggacaacagcctgtgtgaaaacgat 





tgtccggccatggccgggcgctgacggagcacgcccatggacctgcagaatttcgacggcgccggcctgtatttc 





gacgagccgcgccagccgcgcgtcgcggcgctgctggacgaggcgtcggcgcagtacgccaccggcactgcggag 





cagccgctgctggcggcgcaggcgctggcgccgggcgatctcagcgtgctggtcgggctctatcgcttctacttc 





taccagcatcgtcatgccgatgccctggccatcgccgcgcaggtcctgcaggtggtcgcgccgcgcctggggctg 





ccctgtgactggcgtgcgctcgataccgactgcctggcacgcgtggcgcccggcgccatcggcctgctgcgtttt 





catctgctggcgctcaagggcgccggttacctgagcctgcgcctgggcctgttcggcgagggcaaggcgatgctg 





agcaaggtcgccgagctcgatgcggacaatcgcctcggcgcgcgcctgctgctcgatgttttggcggccaacagc 





gccgccattttcacctttccccctgctgccaccgtggagacacgcccatgagcgaacaagccgccgaaccgaacc 





tggacgggcccttggacgaggcgctggaagagctggtatcggccgaggatttcctgaacttcttcggcgtgccct 





tcgtgccgtcggtggtgcaggtcaaccgcctgcacatcatgcagcgctatcacgactacctgtgtcaggccggcg 





atatcgagcacctgcaggatgccgtgcggtacgcggtgtatcgcaagctgctggtacgtgcctacgaggatttcg 





tcgcctccgatgcgcagaccgaaaaggtcttcaaggtcttccacatgcacgagccgcagacgaccttcgtgccca 





tcgatcaactgctgggctgacccgcgggaggtgagcgccatgagtctgccgctctacgaatatggccaggccgtc 





aggctgatccgcaacgtacgcaacgacggcacctaccccggcaaggacaccggcgccctgctgatgcgccgcggc 





gcggtgggttgcgtctacgacgtcggcacctacctgcaggatcagctgatctaccgcgtgcatttcctcgatcag 





ggctgcacggtgggctgccgcgaggaggagctgattcccgcgtcggacccttggatacccaacctgttcgagttc 





cgcgaccaggtggtcgccacccgcagcctggccgtgcgcggcgaggtggtggtggagcagggccgcaccggcagc 





atcgagaaggtgctgcgcgacctgcccggcggcatccagtaccacgtctatttcggcgacggccgggtgcttcag 





gtgcccgagacgagcctggcctgggccgacgcgcaggcgggagacgagcatgagcattgatctggtcatcggcaa 





ggatgcccgctaccagctgctgaaggtcgcccacgagcgtttcggctgtgccccggccgccctcagttcgcaaca 





gcgtgaacaggccgagcgcatcatcggtcgccagctgcagctggagaacgccgtgctgcacagcgccgaggcctg 





cggtgtggtgatcccggacgagcaggtcgccgatgcctgggccgagatcgccgcccgctacgaggacccgctcgc 





gctgcacaaggcgctagacgacagtggtctggacgaagccggcctgcgccagctgctggcccgcgaactcaaggt 





cgaaacggtgctgcagcgtgtctgcgccgggctgccggaaatcaccgatacagatgtcagcctgtactacttcaa 





tcatccggagcgcttcgtgcggcccgccacgcgactggcgcgacagatcctgattaccgtcaacgaggatttccc 





ggaaaacagccggaccagcgcttggcgccgcatcaacctgatcgccgagcgcctgctgcgcaagccgcagcgctt 





cgccgagcaggcgctcaagcattccgagtgcccttcggcgatggagggcggaagcctcggcctgatacgccccgg 





cgtgctctatccgcagctggaagcctgcctgttcgccttgcgcgcaggcgagatcggcccggtggtggagacgcc 





actgggctttcacctgctgttctgcgaggagatccatccggcgggccatttgtcgctgcaggaggtcttgccgca 





cctgcgcgagaagctccgcgcccgtcaatacgagcggcaccagcgcgcatggctggccggtttgctgcagtccgc 





cccaacctcaccggagtcgctgccatgactgataccgacaagccctgctgttcgttctgcggcgcggaaaaatca 





ccgacggtacccttgatcgcgggtaacgaaggccggatctgcgaggcctgcgtcaagctggcccaccaggtggtg 





accagctgggggcagcggcgccaggcccagcaactggcgccgcaactgctcacgccggcggcctacatgcagcat 





ctggacgagtcggtgatcggccaggacgaggccaaggaaaccctggcggtggcggtctacaaccactacctgcgc 





ctgctcaactgcacccgcgagccggtctgccaactgggcggaacggtcgagctggagaagtccaacatcctcatg 





gccggcccttcgggcaccggcaagaccctgctggtgcgcaccctggcgcgcatcctcggcgtgcccttcgcctcg 





gctgatgccaccaccctgacccaggccggctacgtcggcgacgacgtcgacagcatcatcgcccgcctgctggaa 





gccgccggtggcgatgtgcagaaggcgcaatggggcatcgtctatatcgacgaggtggacaagctggcacggcgt 





ggcgggggcggcacggcggtgcgcgacatctccggcgaaggcgtgcagcaggcgctgctcaagctggtcgagggt 





agcgaggtgcgcatcggcaaggggggccggcgtggcgaacacggcgaggagcaggtggtggatacgcgcaacatc 





ctgttcatcgccggtggcgcctttccgggcctggaaaccctggtcggcagccgtgtgcatccgcgtggcagcgcg 





atcggcttccatgcgcggccgcagcagcaggcaccgtcgatcaacgagctgctggcggcgctgctgccggacgac 





ctccatgagttcgggctgatccccgagttcatcggtcgcttcccgatcatcaccttcctccgcgagttggaccac 





gcgacgctgctgcgcatcctcagcgaaccgcgcaatgcgctggtcaagcagtaccagcaactgttcgcctaccag 





ggcgtgaagctggagttcagcgaggcggcgctcggccacatagccgaccaggcgctgctgcgccgcaccggcgcg 





cgcgggctgcgcgcggtcatggagagcgcgctgcagcgcaccatgttcgagatgccggcgcagccgcagctgcgc 





agttgcctgctcgacctcgacgaggagggccgcgaactggtggtgctcaggcagttcgacgagtatgccgaagcg 





caacctgccgacagccgggcggccgcggcgtcctggcagcgttccctgctggtggtggatggctagtgtcgcatt 





gccgacagcggcatgccgctgtcggcggccggtttgtgtggtttgcgacaggtaatgttcatgaaaaggctttgt 





tttcattggcttataagaatccagcggctggcgtgtttcctgctatgagtcttttgccgagtgggtatgtgggcc 





cgcggtgtttcattcatccaaacagcaatgaggtggcgtgatggccaggatcggacttttcttcggcagcaacac 





gggcaagacgcgcaaggtcgccaagatgatcaagaagcgcttcgacgacgacaccctggctgatccgctcaacgt 





caaccgcacgagcgccgcagacttcgccggctattcgcacctgatcctcggcacgccgaccctgggcgagggctt 





gctgccggggctgagcgccgattgcgagaacgaaagctgggaggaattcctgccgcagatcgaggggctggattt 





caccggcaagaccgtggccatcttcggcctcggcgatcaggtcggctacgccgacgagtttctcgatgcgatggg 





cgaactgcacgaattcttcagcgagcgcggcgccaccatggtcggcgagtggccgaccacgggctacgaattcac 





ccactccgaagcggtggtggacggcaagttcgtcgggctggcgctggacttggacaaccagagcaacctcaccga 





ggagcggctgggcgcctggttgcgacagatcgctccggccttcgaactgccgctgtgaccatgcgttgagcttcg 





ctgcacgcccggccccgacctacgcctcgtaatccgtaggttgggttgatacgcgcagcatcgaagcccaacgcg 





ctccgaagcgcagctcggcggcgatctgtgcgaaccgttgCccggcggccgtgGGCGGAGTAGCGTGATCGCGAA 





CCGAGGAAGGAGATTCGCC  





(SEQ ID NO. 6)



GGCGTATCACGAGGCCCTTTCGTCTTCACCTCGAGAAAATTTATCAAAAAGAGTGTTGACTTGTGAGCGGATAAC






AATGATACTTAGATTCAATTGTGAGCGGATAACAATTTCACACATCTAGAGCTAATCTTCTCGTACTCATGACGC





AAGTAATGAACACGATTAACATCGCTAAGAACGACTTCTCTGACATCGAACTGGCTGCTATCCCGTTCAACACTC





TGGCTGACCATTACGGTGAGCGTTTAGCTCGCGAACAGTTGGCCCTTGAGCATGAGTCTTACGAGATGGGTGAAG





CACGCTTCCGCAAGATGTTTGAGCGTCAACTTAAAGCTGGTGAGGTTGCGGATAACGCTGCCGCCAAGCCTCTCA





TCACTACCCTACTCCCTAAGATGATTGCACGCATCAACGACTGGTTTGAGGAAGTGAAAGCTAAGCGCGGCAAGC





GCCCGACAGCCTTCCAGTTCCTGCAAGAAATCAAGCCGGAAGCCGTAGCGTACATCACCATTAAGACCACTCTGG





CTTGCCTAACCAGTGCTGACAATACAACCGTTCAGGCTGTAGCAAGCGCAATCGGTCGGGCCATTGAGGACGAGG





CTCGCTTCGGTCGTATCCGTGACCTTGAAGCTAAGCACTTCAAGAAAAACGTTGAGGAACAACTCAACAAGCGCG





TAGGGCACGTCTACAAGAAAGCATTTATGCAAGTTGTCGAGGCTGACATGCTCTCTAAGGGTCTACTCGGTGGCG





AGGCGTGGTCTTCGTGGCATAAGGAAGACTCTATTCATGTAGGAGTACGCTGCATCGAGATGCTCATTGAGTCAA





CCGGAATGGTTAGCTTACACCGCCAAAATGCTGGCGTAGTAGGTCAAGACTCTGAGACTATCGAACTCGCACCTG





AATACGCTGAGGCTATCGCAACCCGTGCAGGTGCGCTGGCTGGCATCTCTCCGATGTTCCAACCTTGCGTAGTTC





CTCCTAAGCCGTGGACTGGCATTACTGGTGGTGGCTATTGGGCTAACGGTCGTCGTCCTCTGGCGCTGGTGCGTA





CTCACAGTAAGAAAGCACTGATGCGCTACGAAGACGTTTACATGCCTGAGGTGTACAAAGCGATTAACATTGCGC





AAAACACCGCATGGAAAATCAACAAGAAAGTCCTAGCGGTCGCCAACGTAATCACCAAGTGGAAGCATTGTCCGG





TCGAGGACATCCCTGCGATTGAGCGTGAAGAACTCCCGATGAAACCGGAAGACATCGACATGAATCCTGAGGCTC





TCACCGCGTGGAAACGTGCTGCCGCTGCTGTGTACCGCAAGGACAAGGCTCGCAAGTCTCGCCGTATCAGCCTTG





AGTTCATGCTTGAGCAAGCCAATAAGTTTGCTAACCATAAGGCCATCTGGTTCCCTTACAACATGGACTGGCGCG





GTCGTGTTTACGCTGTGTCAATGTTCAACCCGCAAGGTAACGATATGACCAAAGGACTGCTTACGCTGGCGAAAG





GTAAACCAATCGGTAAGGAAGGTTACTACTGGCTGAAAATCCACGGTGCAAACTGTGCGGGTGTCGACAAGGTTC





CGTTCCCTGAGCGCATCAAGTTCATTGAGGAAAACCACGAGAACATCATGGCTTGCGCTAAGTCTCCACTGGAGA





ACACTTGGTGGGCTGAGCAAGATTCTCCGTTCTGCTTCCTTGCGTTCTGCTTTGAGTACGCTGGGGTACAGCACC





ACGGCCTGAGCTATAACTGCTCCCTTCCGCTGGCGTTTGACGGGTCTTGCTCTGGCATCCAGCACTTCTCCGCGA





TGCTCCGAGATGAGGTAGGTGGTCGCGCGGTTAACTTGCTTCCTAGTGAAACCGTTCAGGACATCTACGGGATTG





TTGCTAAGAAAGTCAACGAGATTCTACAAGCAGACGCAATCAATGGGACCGATAACGAAGTAGTTACCGTGACCG





ATGAGAACACTGGTGAAATCTCTGAGAAAGTCAAGCTGGGCACTAAGGCACTGGCTGGTCAATGGCTGGCTTACG





GTGTTACTCGCAGTGTGACTAAGAGTTCAGTCATGACGCTGGCTTACGGGTCCAAAGAGTTCGGCTTCCGTCAAC





AAGTGCTGGAAGATACCATTCAGCCAGCTATTGATTCCGGCAAGGGTCTGATGTTCACTCAGCCGAATCAGGCTG





CTGGATACATGGCTAAGCTGATTTGGGAATCTGTGAGCGTGACGGTGGTAGCTGCGGTTGAAGCAATGAACTGGC





TTAAGTCTGCTGCTAAGCTGCTGGCTGCTGAGGTCAAAGATAAGAAGACTGGAGAGATTCTTCGCAAGCGTTGCG





CTGTGCATTGGGTAACTCCTGATGGTTTCCCTGTGTGGCAGGAATACAAGAAGCCTATTCAGACGCGCTTGAACC





TGATGTTCCTCGGTCAGTTCCGCTTACAGCCTACCATTAACACCAACAAAGATAGCGAGATTGATGCACACAAAC





AGGAGTCTGGTATCGCTCCTAACTTTGTACACAGCCAAGACGCTAGCCACCTTCGTAAGACTGTAGTGTGGGCAC 





ACGAGAAGTACCGAATCGAATCTTTTGCACTGATTCACGACTCCTTCGGTACGATTCCGGCTGACGCTGCGAACC 





TGTTCAAAGCACTGCGCGAAACTATCGTTGACACATATGAGTCTTGTGATGTACTGGCTGATTTCTACGACCAGT 





TCGCTGACCAGTTGCACGAGTCTCAATTGGACAAAATGCCAGCACTTCCGGCTAAAGGTAACTTGAACCTCCGTG 





ACATCTTAGAGTCGGACTTCGCGTTCGCGTAAcagatctcatcaccatcaccatcactaagcttaattagctgag 





cttggactcctgttgatagatccagtaatgacctcagaactccatctggatttgttcagaacgctcggttgccgc 





cgggcgttttttattggtgagaatccaagctagcttggcgagatccttgcagcacatccccctttcgccagctgg 





cgtaatagcgaagaggcccgcaccgatcgcaggccaaccagataagtgaaatctagttccaaactattttgtcat 





ttttaattttcgtattagcttacgacgctacacccagttcccatctattttgtcactcttccctaaataatcctt 





aaaaactccatttccacccctcccagttcccaactattttgtccgcccacagcggggcatttttcttcctgttat 





gtttgggcgctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcc 





tcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaacaccacttcaagaa 





ctctgtagcaccgcctacatacctcgctctgctaatcctgttaccagccggttgtcagccgttaagtgttcctgt 





gtcactcaaaattgctttgagaggctctaagggcttctcagtgcgttacatccctggcttgttgtccacaaccgt 





taaaccttaaaagctttaaaagccttatatattcttttttttcttataaaacttaaaaccttagaggctatttaa 





gttgctgatttatattaattttattgttcaaacatgagagcttagtacgtgaaacatgagagcttagtacgttag 





ccatgagagcttagtacgttagccatgagggtttagttcgttaaacatgagagcttagtacgttaaacttgagag 





cttagtacgtgaaacatgagagcttagtacgtactatcaacaggttgaactgcccatgttctttcctgcgttatc 





agagcttatcggccagcctcgcagagcaggattcccgttgagcaccgccaggtgcgaataagggacagtgaagaa 





ggaacacccgctcgcgggtgggcctacttcacctatcctgcccggctgacgccgttggatacaccaaggaaagtc 





tacacgaaccctttggcaaaatcctgtatatcgtgcgaaaaaggatggatataccgaaaaaatcgctataatgac 





cccgaagcagggttatgcagcggaaagtataccttaacatgttctttcctgcgttatcccctgattctgtggata 





accgtattaccgcctgcggttgagtaataaatggatgccctgcgtaagcgggtgtgggcggacaataaagtctta 





aactgaacaaaatagatctaaactatgacaataaagtcttaaactagacagaatagttgtaaactgaaatcagtc 





cagttatgctgtgaaaaagcatactggacttttgttatggctaaagcaaactcttcattttctgaagtgcaaatt 





gcccgtcgtattaaagaggggcgtggggttcgaggtcgacggtatcgataagctagcttaattagctgagcttgg 





aagtacctattccgaagttcctattctctagaaagtataggaacttcagcggaaaaggacaattgtcTCACCTCC 





AGGTGGCCCGGCTCCATGCACCGCGACGCAACGCGGGGAGGCAGACAAGGTATAGGGCGGCGCCTACAATCCATG





CCAACCCGTTCCATGTGCTCGCCGAGGCGGCATAAATCGCCGTGACGATCAGCGGTCCAGTGATCGAAGTTAGGC





TGGTAAGAGCCGCGAGCGATCCTTGAAGCTGTCCCTGATGGTCGTCATCTACCTGCCTGGACAGCATGGCCTGCA





ACGCGGGCATCCCGATGCCGCCGGAAGCGAGAAGAATCATAATGGGGAAGGCCATCCAGCCTCGCGTCGCGAACG





CCAGCAAGACGTAGCCCAGCGCGTCGGCCGCCATGCCGGCGATAATGGCCTGCTTCTCGCCGAAACGTTTGGTGG





CGGGACCAGTGACGAAGGCTTGAGCGAGGGCGTGCAAGATTCCGAATACCGCAAGCGACAGGCCGATCATCGTCG





CGCTCCAGCGAAAGCGGTCCTCGCCGAAAATGACCCAGAGCGCTGCCGGCACCTGTCCTACGAGTTGCATGATAA





AGAAGACAGTCATAAGTGCGGCCACAATGGTCATGCCCCGCGCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTC





TCAAGGGCATCGGACGGCGCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGT





TGAGCACCGCCGCCGCAAGGAATGGTGCGTGCAGGGAGATGGCGCCCAACAGTCCCCCGGCCACGGGGCCTGCCA





CCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCCGATCTTCCCCATCGGTGATGTCGGCGA





TATAGGCGCCAGCAACCGCACCTGTGGCGCCGGTGATGCCGGCCACGATGCGTCCGGCGTAGAGAATCCACAGGA





CGGGTGTGGTCGCCATGATCGCGTAGTCGATAGTGGCTCCAAGTAGCGAAGCGAGCAGGACTGGGCGGCGGCCAA





AGCGGTCGGACAGTGCTCCGAGAACGGGTGCGCATAGAAATTGCATCAACGCATATAGCGCTAGCAGCACGCCAT





AGTGACTGGCGATGCTGTCGGAATGGACGATATCCCGCAAGAGGCCCGGCAGTACCGGCATaaccaagcctatgc





ctacagcatccagggtgacggtgccgaggatgacgatgagcgcattgttagatttcatacacggtgcctgactgc 





gttagcaatttaactgtgataaactaccgcattacagtttatcgatgataagctgtcaagaagttcctattccga 





agttcctattctctagaaagtataggaacttctgcatttacgttgacaccatAATAAAAAAGCCCCCCGAATGAT 





CTTCCGGGGGCtcactgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcg 





cggggagaggcggtttgcgtattgggcgccagggtggtttttcttttcaccagtgagactggcaacagctgattg 





cccttcaccgcctggccctgagagagttgcagcaagcggtccacgctggtttgccccagcaggcgaaaatcctgt 





ttgatggtggttaacggcgggatataacatgagctatcttcggtatcgtcgtatcccactaccgagatatccgca 





ccaacgcgcagcccggactcggtaatggcgcgcattgcgcccagcgccatctgatcgttggcaaccagcatcgca 





gtgggaacgatgccctcattcagcatttgcatggtttgttgaaaaccggacatggcactccagtcgccttcccgt 





tccgctatcggctgaatttgattgcgagtgagatatttatgccagccagccagacgcagacgcgccgagacagaa 





cttaatgggcccgctaacagcgcgatttgctggtgacccaatgcgaccagatgctccacgcccagtcgcgtaccg 





tcctcatgggagtaaataatactgttgatgggtgtctggtcagagacatcaagaaataacgccggaacattagtg 





caggcagcttccacagcaatggcatcctggtcatccagcggatagttaatgatcagcccactgacgcgttgcgcg 





agaagattgtgcaccgccgctttacaggcttcgacgccgcttcgttctaccatcgacaccaccacgctggcaccc 





agttgatcggcgcgagatttaatcgccgcgacaatttgcgacggcgcgtgcagggccagactggaggtggcaacg 





ccaatcagcaacgactgtttgcccgccagttgttgtgccacgcggttgggaatgtaattcagctccaccatcgcc 





gcttccactttttcccgcgttttcgcagaaacgtggctggcctggttcaccacgcgggaaacggtcatataagag 





acaccggcatactctgcgacatcgtataacgttactggtttcacattcaccaccctgaattgactctcttccggg 





cgctatcatgccataccgcgaaaggttttgcaccattcgatggtgtcaacgtaaatgcatgccgcttcgccttcg 





cgcgcgaattgcaggtaccatttatcagggttattgtctcatgagcggatacatatttgaatgtatttagaaaaa 





taaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtctaagaaaccattattatcatgac 





attaacctataaaaata 






The modified bacteria described herein are capable of colonizing a host plant. In certain cases, the modified bacteria can be applied to the plant, by foliar application, foliar sprays, stem injections, soil drenches, immersion, root dipping, seed coating or encapsulation using known techniques.


Successful colonization can be confirmed by detecting the presence of the bacterial population within the plant. For example, after applying the bacteria to the seeds, high titers of the bacteria can be detected in the roots and shoots of the plants that germinate from the seeds. In addition, significant quantities of the bacteria can be detected in the rhizosphere of the plants. Therefore, in one embodiment, the endophytic microbe population is disposed in an amount effective to colonize the plant. Colonization of the plant can be detected, for example, by detecting the presence of the endophytic microbe inside the plant. This can be accomplished by measuring the viability of the microbe after surface sterilization of the seed or the plant: endophytic colonization results in an internal localization of the microbe, rendering it resistant to conditions of surface sterilization.


In some cases, the modified bacteria is mixed with an agriculturally suitable or compatible carrier. The carrier can be a solid carrier or liquid carrier. The carrier may be any one or more of a number of carriers that confer a variety of properties, such as increased stability, wettability, or dispersability. Wetting agents such as natural or synthetic surfactants, which can be nonionic or ionic surfactants, or a combination thereof can be included in a composition of the invention. Water-in-oil emulsions can also be used to formulate a composition that includes the modified bacteria of the present invention. Suitable formulations that may be prepared include wettable powders, granules, gels, agar strips or pellets, thickeners, and the like, microencapsulated particles, and the like, liquids such as aqueous flowables, aqueous suspensions, water-in-oil emulsions, etc. The formulation may include grain or legume products, for example, ground grain or beans, broth or flour derived from grain or beans, starch, sugar, or oil.


In some embodiments, the agricultural carrier may be soil or plant growth medium. Other agricultural carriers that may be used include fertilizers, plant-based oils, humectants, or combinations thereof. Alternatively, the agricultural carrier may be a solid, such as diatomaceous earth, loam, silica, alginate, clay, bentonite, vermiculite, seed cases, other plant and animal products, or combinations, including granules, pellets, or suspensions. Mixtures of any of the aforementioned ingredients are also contemplated as carriers, such as but not limited to, pesta (flour and kaolin clay), agar or flour-based pellets in loam, sand, or clay, etc. Formulations may include food sources for the cultured organisms, such as barley, rice, or other biological materials such as seed, plant parts, sugar cane bagasse, hulls or stalks from grain processing, ground plant material or wood from building site refuse, sawdust or small fibers from recycling of paper, fabric, or wood. Other suitable formulations will be known to those skilled in the art.


In one embodiment, the formulation can comprise a tackifier or adherent. Such agents are useful for combining the modified bacteria with carriers that can contain other compounds (e.g., control agents that are not biologic), to yield a coating composition. Such compositions help create coatings around the plant or seed to maintain contact between the microbe and other agents with the plant or plant part. In one embodiment, adherents are selected from the group consisting of: alginate, gums, starches, lecithins, formononetin, polyvinyl alcohol, alkali formononetinate, hesperetin, polyvinyl acetate, cephalins, Gum Arabic, Xanthan Gum, Mineral Oil, Polyethylene Glycol (PEG), Polyvinyl pyrrolidone (PVP), Arabino-galactan, Methyl Cellulose, PEG 400, Chitosan, Polyacrylamide, Polyacrylate, Polyacrylonitrile, Glycerol, Triethylene glycol, Vinyl Acetate, Gellan Gum, Polystyrene, Polyvinyl, Carboxymethyl cellulose, Gum Ghatti, and polyoxyethylene-polyoxybutylene block copolymers.


The formulation can also contain a surfactant. Non-limiting examples of surfactants include nitrogen-surfactant blends such as Prefer 28 (Cenex), Surf-N(US), Inhance (Brandt), P-28 (Wilfarm) and Patrol (Helena); esterified seed oils include Sun-It II (AmCy), MSO (UAP), Scoil (Agsco), Hasten (Wilfarm) and Mes-100 (Drexel); and organo-silicone surfactants include Silwet L77 (UAP), Silikin (Terra), Dyne-Amic (Helena), Kinetic (Helena), Sylgard 309 (Wilbur-Ellis) and Century (Precision).


In certain cases, the formulation includes a microbial stabilizer. Such an agent can include a desiccant. As used herein, a “desiccant” can include any compound or mixture of compounds that can be classified as a desiccant regardless of whether the compound or compounds are used in such concentrations that they in fact have a desiccating effect on the liquid inoculant. Such desiccants are ideally compatible with the modified bacteria used, and should promote the ability of the microbial population to survive application on the seeds and to survive desiccation. Examples of suitable desiccants include one or more of trehalose, sucrose, glycerol, and methylene glycol. Other suitable desiccants include, but are not limited to, non reducing sugars and sugar alcohols (e.g., mannitol or sorbitol).


The formulations may also include one or more agents such as a fungicide, an antibacterial agent, an herbicide, a nematicide, an insecticide, a plant growth regulator, a rodenticide, and a nutrient. Such agents are ideally compatible with the agricultural seed or seedling onto which the formulation is applied (e.g., it should not be deleterious to the growth or health of the plant).


When the formulation is a liquid solution or suspension, the modified bacteria can be mixed or suspended in aqueous solutions. Suitable liquid diluents or carriers include aqueous solutions, petroleum distillates, or other liquid carriers.


A formulation that is a solid composition can be prepared by dispersing the modified bacteria in or on an appropriately divided solid carrier, such as peat, wheat, bran, vermiculite, clay, talc, bentonite, diatomaceous earth, fuller's earth, or pasteurized soil. When such formulations are used as wettable powders, biologically compatible dispersing agents such as nonionic, anionic, amphoteric, or cationic dispersing and emulsifying agents can be used.


Solid carriers useful in aspects of the invention include, for example, mineral carriers such as kaolin clay, pyrophyllite, bentonite, montmorillonite, diatomaceous earth, acid white soil, vermiculite, and pearlite, and inorganic salts such as ammonium sulfate, ammonium phosphate, ammonium nitrate, urea, ammonium chloride, and calcium carbonate. Also, organic fine powders such as wheat flour, wheat bran, and rice bran may be used. The liquid carriers include vegetable oils such as soybean oil and cottonseed oil, glycerol, ethylene glycol, polyethylene glycol, propylene glycol, polypropylene glycol, etc.


The modified bacteria herein can be combined with one or more of the agents described herein to yield a formulation suitable for combining with a plant, a seed or seedling. The modified bacteria can be obtained from growth in culture, for example, using a synthetic growth medium. In addition, the microbe can be cultured on solid media, for example on petri dishes, scraped off and suspended into the preparation. Microbes at different growth phases can be used. For example, microbes at lag phase, early-log phase, mid-log phase, late-log phase, stationary phase, early death phase, or death phase can be used.


In some embodiments the invention also includes containers or equipment with the modified bacteria, with or without the plants, seeds or seedlings. For instance, the invention may include a bag comprising at least 1,000 seeds having modified bacteria. The bag further comprises a label describing the seeds and/or said modified bacteria.


The population of seeds may be packaged in a bag or container suitable for commercial sale. Such a bag contains a unit weight or count of the seeds comprising the modified bacteria as described herein, and further comprises a label. In one embodiment, the bag or container contains at least 1,000 seeds, for example, at least 5,000 seeds, at least 10,000 seeds, at least 20,000 seeds, at least 30,000 seeds, at least 50,000 seeds, at least 70,000 seeds, at least 80,000 seeds, at least 90,000 seeds or more. In another embodiment, the bag or container can comprise a discrete weight of seeds, for example, at least 1 lb, at least 2 lbs, at least 5 lbs, at least 10 lbs, at least 30 lbs, at least 50 lbs, at least 70 lbs or more. The bag or container comprises a label describing the seeds and/or said modified bacteria. The label can contain additional information, for example, the information selected from the group consisting of: net weight, lot number, geographic origin of the seeds, test date, germination rate, inert matter content, and the amount of noxious weeds, if any. Suitable containers or packages include those traditionally used in plant seed commercialization.


A substantially uniform population of seeds comprising the modified bacteria is provided in other aspects of the invention. In some embodiments, at least 10%, for example, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95% or more of the seeds in the population, contains the modified bacteria in an amount effective to colonize a plant. In other cases, at least 10%, for example, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95% or more of the seeds in the population, contains at least 100 CFU on its surface, for example, at least 200 CFU, at least 300 CFU, at least 1,000 CFU, at least 3,000 CFU, at least 10,000 CFU, at least 30,000 CFU, at least 100,000 CFU, at least 300,000 CFU, or at least 1,000,000 CFU per seed or more.


Alternatively a substantially uniform population of plants is provided. The population comprises at least 100 plants, for example, at least 300 plants, at least 1,000 plants, at least 3,000 plants, at least 10,000 plants, at least 30,000 plants, at least 100,000 plants or more. The plants are grown from the seeds comprising the modified bacteria as described herein. The increased uniformity of the plants can be measured in a number of different ways.


In some embodiments, there is an increased uniformity with respect to the modified bacteria within the plant population. For example, in one embodiment, a substantial portion of the population of plants, for example at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95% or more of the seeds or plants in a population, contains a threshold number of the modified bacteria. The threshold number can be at least 100 CFU, for example at least 300 CFU, at least 1,000 CFU, at least 3,000 CFU, at least 10,000 CFU, at least 30,000 CFU, at least 100,000 CFU or more, in the plant or a part of the plant. Alternatively, in a substantial portion of the population of plants, for example, in at least 1%, at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95% or more of the plants in the population, the modified bacteria that is provided to the seed or seedling represents at least 10%, least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 99%, or 100% of the total microbe population in the plant/seed.


This invention is not limited in its application to the details of construction and the arrangement of components set forth in the following description or illustrated in the drawings. The invention is capable of other embodiments and of being practiced or of being carried out in various ways. Also, the phraseology and terminology used herein is for the purpose of description and should not be regarded as limiting. The use of “including,” “comprising,” or “having,” “containing,” “involving,” and variations thereof herein, is meant to encompass the items listed thereafter and equivalents thereof as well as additional items.


EXAMPLES
Example 1: Nitrogen Fixation in Salmonella Using Refactored Nif Clusters

Methodology


Nitrogenase Activity Assay in Bacteria.


Acetylene reduction assay was used to measure nitrogenase activity of bacteria in free-living conditions. Cultures were initiated by inoculating a single colony into 1 mL of LB medium with appropriate antibiotics in a 15 mL culture tube. Cultures grown with shaking at 250 rpm at 37° C. for 12 h were diluted 100-fold in 1 mL of minimal medium plus 17.1 mM NH4Ac with appropriate antibiotics in 96-well deep well plates. The plates were incubated with shaking at 900 rpm at 30° C. for 20 h. Cultures were diluted an OD600 of 0.5 in 2 mL of nitrogen-free minimal medium supplemented with appropriate antibiotics and inducers in 10 mL glass vials with PTFE-silicone septa screw caps (Supelco Analytical, Bellefonte, Pa., cat. #SU860103). Headspace in the bottles was replaced with 100% argon gas using a vacuum manifold equipped with a copper catalyst oxygen trap. Acetylene freshly generated from CaC2 in a Burris bottle was injected to 10% (vol/vol) into each culture vial to begin the reaction. Cultures were allowed to grow for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.3 mL of 4 M NaOH to each vial. Ethylene production was analyzed by gas chromatography on an Agilent 7890A GC system (Agilent Technologies, Inc. Santa Clara, Calif. USA) equipped with a PAL headspace autosampler and flame ionization detector as follows. 0.25 mL headspace preincubated to 35° C. for 30 s was injected and separated for 5 min on a GS-CarbonPLOT column (0.32 mm×30 m, 3 micron; Agilent) at 60° C. and a He flow rate of 1.8 ml/min. Detection occurred in a FID heated to 300° C. with a gas flow of 35 ml/min H2 and 400 ml/min air. Acetylene and ethylene were detected at 3.0 min and 3.7 min after injection, respectively. Ethylene production was quantified by integrating the 3.7 min peak using Agilent GC/MSD ChemStation Software.


Seed Sterilization, Germination and Inoculation of Bacteria.


For surface-sterilization, Zea mays B73 seeds (U.S. National Plant Germplasm System, IA) first were washed with 70% ethanol and immersed in 2% sodium hypochlorite solution (25% commercial bleach) for 15 min with shaking at 50 rpm and subsequently washed three times with sterile water. Surface-sterilized seeds were placed on 1% Bacto agar plate supplemented with 1 μM of gibberellic acid (Sigma-Aldrich, MO) and incubated under dark at room temperature up to 6 days before germination. A regular weight germination paper (Ancor Paper Co., Mn) soaked in 10 mL of sterile water was placed on the bottom of nitrogen-free Fahräeus agar plate. The germinated seeds were transplanted at the top of the germination paper in Fahräeus agar plate (4 seedling/plate). After establishing rooting system for 2 days, maize roots were flooded with 50 mL of bacteria (OD600=1) resuspended in sterile water and incubated at room temperature. Bacteria were removed by pipetting after 1 h of incubation. The plant growth was continued under 24 h constant light at 26° C. for additional two weeks before the assays.


Internal Colonization Assay


Two weeks post-inoculation, only plant roots were retained by removing leave and seeds from the seedling using a razor blade. To determine internal colonization, each root was immersed in 20 mL of 1.6% sodium hypochlorite solution (20% commercial bleach) in 50 mL falcon tube and vortexed vigorously for 1 min followed by four times washes with 25 mL of sterile water. The surface sterilized roots were vortexed in 5 mL of PBS for 1 min following the last wash and subsequently plated on LB agar plate to quantify residual bacteria. The sterilized roots were crushed using a mortar and pestle in 5 mL of PBS for 5 min and the extracts were serially diluted in PBS and plated on LB agar plates with or without a selective marker to determine the presence of bacteria and the plasmid stability. The plates were incubated at 37° C. for 24 h before analyzing colony forming unit (CFU).


Nitrogenase Activity Assay in Plants


Acetylene reduction assay was used to measure nitrogenase activity of maize seedlings. Two weeks post-inoculation of bacteria, the intact seedlings were transferred into 30 mL volume anaerobic culture tubes (Chemglass Life Sciences, NJ) containing 2 mL of nitrogen-free Fahräeus medium sealed with a rubber stopper without headspace replacement. For the maize seedlings inoculated with the bacteria strain carrying the refactored cluster, 25 mL of 0.5 M IPTG was applied on seedling roots grown 13 days after inoculation of bacteria, after which the seedlings were incubated under constant light for 12 h before transfer into anaerobic culture tubes containing 2 mL of nitrogen-free Fahräeus medium with 10 mM IPTG. Acetylene freshly generated from CaC2 in a Burris bottle was injected to 7% (vol/vol) into each culture tube to start the reaction. The reaction was continued under a light regimen of 18 h of light and 6 h of dark at 28° C. up to 4 days. Ethylene production was quantified by gas chromatography. 0.5 mL of headspace was sampled and analyzed in a manner identical to that described above.


Results


Transfer of nif Clusters into Salmonella Strains.


Transfer of native and refactored nif clusters of Klebsiella was proven to be functional in K. oxytoca M5al and E. coli such as K12 MG1655. However, it hasn't been shown that heterologous expression of nif clusters would be active in other enteric bacteria that can colonize into crop cereals. We have collected pathogenic Salmonella strains that can infect various hosts ranging from humans to plants. We transferred native and refactored nif clusters into diverse Salmonella strains to test nitrogen fixation in a free living condition. Also, together with the refactored cluster, the controller plasmid encoding a sensor and circuit that drives the expression of the entire nif cluster in response to IPTG was introduced into Salmonella strains.


Particularly, S. typhi strains containing the native or refactored nif cluster showed higher nitrogenase activity among diverse Salmonella strains. Salmonella dublin, newport and pomona only exhibited nitrogenase activity from the native nif cluster to a lesser extent than those of the nitrogen fixing S. typhi strains (FIG. 1).


Internal Colonization of Zea mays B73 Roots by S. typhi


To determine whether a Salmonella strain can be a bacterial endophyte in maize plants, we inoculated bacteria onto the roots of Zea mays B73 that is an important commercial crop variety. S. typhi ATCC 14028 showing one of the highest nitrogenase activity by heterologous nif expression was selected for internal colonization assay. 14 days post-inoculation, internal colonization by S. typhi ATCC 14028 was analyzed using the roots of plant seedlings. No CFU of S. typhi ATCC 14028 was detected after surface sterilization of the roots. To assess internally colonized bacteria cells, the surface sterilized roots of each plant seedling were crushed in PBS and plated on LB plates. We detected endophytic colonization of ˜106 CFU/plant by S. typhi ATCC 14028 from the crushed root extracts, but no CFU by E. coli MG1655 in the same setting (FIG. 2). This shows that S. typhi ATCC 14028 can colonize Zea mays B73 internally.


Nitrogenase Activity in Maize Plants


14 days post-inoculation, we analyzed nitrogenase activity from the plant seedlings infected with the genetically modified S. typhi ATCC 14028 strains by acetylene reduction assay. More than 30 plants from each group were analyzed. 18% and 51% of the plants inoculated with S. typhi ATCC 14028 carrying the native nif cluster and the refactored nif cluster, respectively, displayed increased ethylene production compared to those plants inoculated with S. typhi ATCC 14028 expressing no nif cluster (FIG. 3). The refactored nif cluster as compared to the native nif cluster resulted in less variation in acetylene reduction in plants. This suggests that the expression of refactored nif cluster is more consistent in our setting conferred by the synthetic controller system that regulates the expression of the refactored nif cluster by an externally added inducer than that of the native nif cluster whose regulation is still under the control of complex native biological signals.


Improvement of Stability of Genetic Systems


Plasmid-based engineering of the clusters and controllers relies on plasmid stability during cell division. Such selective pressure for plasmid stability as antibiotic use can be easily applied and maintained in an in vitro setup. However, plasmids are cured from the host bacteria over time without selective antibiotic pressure in an in vivo setup.


In order to increase stability of the genetic system in bacteria, two engineering strategies were used. First, we introduced a controller that encodes an IPTG inducible T7 RNA polymerase and a selective marker into a target genome using the mini-Tn7 system [Choi, K. H., (2005). A Tn7-based broad-range bacterial cloning and expression system. Nature methods, 2(6), 443-448.]. It has been demonstrated that the transposition with the mini-Tn7 system is broad-host range and site-specific. Genome integration occurs at the Tn7 attachment site (attTn7) located downstream of the essential gene glmS. Salmonella contains a single glmS gene that ensures a single-copy insertion of an introduced genetic system. A new controller plasmid pR6K-T7RW designed for genome integration consists of a T7 RNA polymerase and a selection marker flanked by two Tn7 ends (Tn7L and Tn7R). To minimize interference by transcriptional read-through from the upstream glmS expression, a constitutive promoter-driven selection marker and a sensor protein lad are oriented opposite to the glmS. A T7 RNA polymerase read-through was blocked by a terminator between the device and the genome. We transformed a controller plasmid pR6K-T7RW and a helper plasmid pTNS3 encoding the TnsABCD transposase into Salmonella ATCC14028. The insertion site of a controller device was verified by PCR. We identified that the device is integrated 25 bp downstream of the glmS stop codon in Salmonella. We tested plasmid stability based on a selective marker in the internally colonized Salmonella strains containing either a genome-based controller or a plasmid-based controller two weeks after inoculation of germinated maize seeds. There was no marker loss from the genome-based system, whereas only about 20% of strains from the plasmid-based system were retained on the plates supplemented with antibiotics, indicating that the controller device on the Salmonella genome was stable without selective pressure over two weeks in the plant seedlings (FIG. 4 A).


The nif clusters were constructed on a broad-host range plasmid pBBR1 such that the optimal expression levels of the nif genes in diverse contexts can be rapidly accessed by swapping genetic parts of the clusters on a plasmid. To keep the versatility and engineerablity of a plasmid-based nif system, we sought to explore an alternative to genome-based engineering while ensuring the stability of the nif clusters on the plasmid. The partitioning system encoded by the two par operons (parCBA and parDE) contributes to stable maintenance of a plasmid RK2 [Easter, C. L., Schwab, H., & Helinski, D. R. (1998). Role of the parCBA operon of the broad-host-range plasmid RK2 in stable plasmid maintenance. Journal of bacteriology, 180(22), 6023-6030.]. However, the transferability of the function of the RK2 par system has not been tested on other types of plasmids. We integrated the RK2 par system into the nif plasmids built upon a plasmid pBBR1 and analyzed plasmid stability in the Salmonella strain from the colonized roots. The nif plasmid stability without the par system decreased to 4% in the absence of a selective pressure after 14 days of inoculation into the plants. On the other hand, adding the par system on the nif plasmids resulted in plasmid stability of 96% under the identical conditions, which suggesting the RK2 par system works as a module to improve the stability of other plasmid types (FIG. 4 B). These engineering efforts can be modular standards as a means to provide the stability of complex multigene systems in the bacteria that are supposed to be released into the environment.


REFERENCES



  • 1. Tilman, D., Balzer, C., Hill, J. & Befort, B. L. Global food demand and the sustainable intensification of agriculture. PNAS 108, 20260-20264 (2011).

  • 2. Mueller, N. D. et al. Closing yield gaps through nutrient and water management. Nature 490, 254-257 (2012).

  • 3. Haapalainen, M., van Gestel, K., Pirhonen, M. & Taira, S. Soluble plant cell signals induce the expression of the type III secretion system of Pseudomonas syringae and upregulate the production of pilus protein HrpA. Mol. Plant Microbe Interact. 22, 282-290 (2009).

  • 4. Holden, N., Pritchard, L. & Toth, I. Colonization outwith the colon: plants as an alternative environmental reservoir for human pathogenic enterobacteria. FEMS Microbiol. Rev. 33, 689-703 (2009).

  • 5. Plotnikova, J. M., Rahme, L. G. & Ausubel, F. M. Pathogenesis of the human opportunistic pathogen Pseudomonas aeruginosa PA14 in Arabidopsis. Plant Physiol. 124, 1766-1774 (2000).

  • 6. Brandl, M. T., Cox, C. E. & Teplitski, M. Salmonella interactions with plants and their associated microbiota. Phytopathology 103, 316-325 (2013).

  • 7. Kutter, S., Hartmann, A. & Schmid, M. Colonization of barley (Hordeum vulgare) with Salmonella enterica and Listeria spp. FEMS Microbiol. Ecol. 56, 262-271 (2006).

  • 8. Temme, K., Zhao, D. & Voigt, C. A. Refactoring the nitrogen fixation gene cluster from Klebsiella oxytoca. PNAS 109, 7085-7090 (2012).

  • 9. Smanski, M. J. et al. Functional optimization of gene clusters by combinatorial design and assembly. Nat Biotech 32, 1241-1249 (2014).

  • 10. Chan, L. Y., Kosuri, S. & Endy, D. Refactoring bacteriophage T7. Mol Syst Biol 1, 2005.0018 (2005).

  • 11. Jaschke, P. R., Lieberman, E. K., Rodriguez, J., Sierra, A. & Endy, D. A fully decompressed synthetic bacteriophage øX174 genome assembled and archived in yeast. Virology 434, 278-284 (2012).

  • 12. Wang, X. et al. Using Synthetic Biology to Distinguish and Overcome Regulatory and Functional Barriers Related to Nitrogen Fixation. PLoS ONE 8, e68677 (2013).

  • 13. Widmaier, D. M. et al. Engineering the Salmonella type III secretion system to export spider silk monomers. Mol. Syst. Biol. 5, 309 (2009).



EQUIVALENTS

Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be encompassed by the following claims.


All references, including patent documents, disclosed herein are incorporated by reference in their entirety.

Claims
  • 1. A method for providing fixed nitrogen from atmospheric nitrogen to a cereal plant, or to soil where a cereal plant or seed is growing or is to be planted, comprising delivering genetically engineered bacteria having a refactored exogenous nif cluster to a cereal plant, or to soil where a cereal plant or seed is growing or is to be planted, wherein the genetically engineered bacteria comprise transgenic bacteria, wherein the genetically engineered bacteria comprise bacteria of a species which does not natively contain a nif cluster, wherein the genetically engineered bacteria become established in the cereal plant and provide the cereal plant with fixed nitrogen, wherein the refactored exogenous nif cluster comprises at least one of: codon- optimized nif cluster genes; operons under the control of synthetic parts; operons separated by spacer parts; and a controller.
  • 2. The method of claim 1, wherein the genetically engineered bacteria are gamma-proteobacteria.
  • 3. The method of claim 1, wherein the refactored exogenous_nif cluster is a Klebsiella nif cluster, a Pseudomonas stutzi nif cluster, or a Paenibacillus nif cluster.
  • 4. The method of claim 1, wherein the cereal plant is selected from wheat, rye, barley, triticale, oats, millet, sorghum, teff, fonio, buckwheat, quinoa, corn and rice.
  • 5. The method of claim 1, wherein the genetically engineered bacteria further comprise an exogenous gene encoding a plant growth-stimulating peptide.
  • 6. The method of claim 5, wherein secretion of the plant growth-stimulating peptide from the genetically engineered bacteria is regulated by a type 3 secretion system (T3SS).
  • 7. The method of claim 5, wherein the plant growth stimulating peptide is directly delivered into root or stem tissues.
  • 8. The method of claim 1, wherein the genetically engineered bacteria comprise a system for stable plasmid maintenance.
  • 9. The method of claim 1, wherein the controller is a nucleic acid encoding an IPTG inducible T7 RNA polymerase.
  • 10. The method of claim 8, wherein the system for stable plasmid maintenance is a partitioning system encoded by the two par operons (parCBA and parDE).
  • 11. The method of claim 1, wherein the genetically engineered bacteria comprise a partitioning system, wherein the partitioning system is an RK2 par system.
  • 12. The method of claim 1, wherein the refactored exogenous nif cluster does not comprise an internal regulator.
  • 13. The method of claim 12, wherein at least one of the operons comprises a synthetic regulatory element selected from the group consisting of: a nucleotide sequence that increases or decreases transcription or translation rate, stability, or mobility of a transcription or translation product; a ribozyme; an enhancer sequence; a response element; a protein recognition site; a protein binding sequence; a 5′ untranslated region; a 3′ untranslated region; a transcription terminator sequence; and a polyadenylation sequence.
  • 14. The method of claim 1, wherein the refactored exogenous nif cluster is from an organism of a different species than the genetically engineered bacteria.
  • 15. The method of claim 1, wherein the genetically engineered bacteria are endophytes.
RELATED APPLICATION

This application is national stage filing under U.S.C. § 371 of PCT International Application PCT/US2016/055429entitled, “NITROGEN FIXATION USING REFACTORED NIF CLUSTERS” filed Oct. 5, 2016, which claims the benefit under 35 U.S.C. § 119(e) of U.S. provisional application No. 62/237,426, entitled “NITROGEN FIXATION IN SALMONELLA USING REFACTORED NIF CLUSTERS”, filed Oct. 5, 2015, which are herein incorporated by reference herein in their entirety.

GOVERNMENT SUPPORT

This invention was made with government support under IOS1331098 awarded by the National Science Foundation. The government has certain rights in the invention.

PCT Information
Filing Document Filing Date Country Kind
PCT/US2016/055429 10/5/2016 WO
Publishing Document Publishing Date Country Kind
WO2017/062412 4/13/2017 WO A
US Referenced Citations (199)
Number Name Date Kind
1520545 Murphy Dec 1924 A
4782022 Puhler et al. Nov 1988 A
4832728 Allan et al. May 1989 A
5071743 Slilaty et al. Dec 1991 A
5116506 Williamson et al. May 1992 A
5188960 Payne et al. Feb 1993 A
5229291 Nielsen et al. Jul 1993 A
5354670 Nickoloff et al. Oct 1994 A
5427785 Ronson et al. Jun 1995 A
5610044 Lam et al. Mar 1997 A
5780270 Lesley Jul 1998 A
5789166 Bauer et al. Aug 1998 A
5877012 Estruch et al. Mar 1999 A
5880275 Fischhoff et al. Mar 1999 A
5916029 Smith et al. Jun 1999 A
6033861 Schafer et al. Mar 2000 A
6033874 Baum et al. Mar 2000 A
6083499 Narva et al. Jul 2000 A
6107279 Estruch et al. Aug 2000 A
6114148 Seed et al. Sep 2000 A
6127180 Narva et al. Oct 2000 A
6137033 Estruch et al. Oct 2000 A
6218188 Cardineau et al. Apr 2001 B1
6248535 Danenberg et al. Jun 2001 B1
6326351 Donovan et al. Dec 2001 B1
6340593 Cardineau et al. Jan 2002 B1
6391548 Bauer et al. May 2002 B1
6399330 Donovan et al. Jun 2002 B1
6548289 Beynon et al. Apr 2003 B1
6548291 Narva et al. Apr 2003 B1
6596509 Bauer Jul 2003 B1
6624145 Narva et al. Sep 2003 B1
6673610 Miyawaki et al. Jan 2004 B2
6713063 Malvar et al. Mar 2004 B1
6713285 Bauer et al. Mar 2004 B2
6773900 Short et al. Aug 2004 B2
6841358 Locht et al. Jan 2005 B1
6949626 Donovan et al. Sep 2005 B2
6962705 Malvar et al. Nov 2005 B2
7064249 Corbin et al. Jun 2006 B2
7070982 Malvar et al. Jul 2006 B2
7084331 Isawa et al. Aug 2006 B2
7105332 Abad et al. Sep 2006 B2
7132265 Bauer et al. Nov 2006 B2
7244820 Miles et al. Jul 2007 B2
7329736 Abad et al. Feb 2008 B2
7378499 Abad et al. May 2008 B2
7385107 Donovan et al. Jun 2008 B2
7449552 Abad et al. Nov 2008 B2
7462760 Abad et al. Dec 2008 B2
7470427 Cocking Dec 2008 B2
7476781 Abad et al. Jan 2009 B2
7485451 Vandergheynst et al. Feb 2009 B2
7491698 Hey et al. Feb 2009 B2
7491869 Abad et al. Feb 2009 B2
7504229 Donovan et al. Mar 2009 B2
7615686 Miles et al. Nov 2009 B2
7803943 Mao et al. Sep 2010 B2
7858849 Cerf et al. Dec 2010 B2
7923602 Carozzi et al. Apr 2011 B2
8076142 Huang et al. Dec 2011 B2
8084416 Sampson et al. Dec 2011 B2
8084418 Hey et al. Dec 2011 B2
8137665 Cocking Mar 2012 B2
8236757 Carozzi et al. Aug 2012 B2
8237020 Miles et al. Aug 2012 B2
8268584 Harwood et al. Sep 2012 B1
8304604 Lira et al. Nov 2012 B2
8304605 Lira et al. Nov 2012 B2
8319019 Abad et al. Nov 2012 B2
8334366 Hughes et al. Dec 2012 B1
8334431 Sampson et al. Dec 2012 B2
8377671 Cournac et al. Feb 2013 B2
8481026 Woodruff et al. Jul 2013 B1
8513494 Wu et al. Aug 2013 B2
8530411 Cerf et al. Sep 2013 B2
8575433 Cerf et al. Nov 2013 B2
8686233 Cerf et al. Apr 2014 B2
8759619 Sampson et al. Jun 2014 B2
8795965 Zhang Aug 2014 B2
8802933 Abad et al. Aug 2014 B2
8802934 Abad et al. Aug 2014 B2
9150851 Wigley et al. Oct 2015 B2
9321697 Das et al. Apr 2016 B2
9487451 Doty et al. Nov 2016 B2
9512431 Mirsky et al. Dec 2016 B2
9657298 Soto, Sr. et al. May 2017 B2
9796957 Barney et al. Oct 2017 B2
9957509 Mirsky et al. May 2018 B2
9975817 Temme et al. May 2018 B2
9994557 Davidson et al. Jun 2018 B2
10384983 Temme et al. Aug 2019 B2
10525318 Dougherty Jan 2020 B2
10556839 Temme et al. Feb 2020 B2
10662432 Mirsky et al. May 2020 B2
10919814 Temme et al. Feb 2021 B2
10934226 Temme et al. Mar 2021 B2
10968446 Zhao et al. Apr 2021 B2
20040197916 Carozzi et al. Oct 2004 A1
20040197917 Carozzi et al. Oct 2004 A1
20040210964 Carozzi et al. Oct 2004 A1
20040210965 Carozzi et al. Oct 2004 A1
20040216186 Carozzi et al. Oct 2004 A1
20040235663 Cocking Nov 2004 A1
20040241847 Okuyama et al. Dec 2004 A1
20040250311 Carozzi et al. Dec 2004 A1
20050081262 Cook et al. Apr 2005 A1
20050266541 Dillon Dec 2005 A1
20060033867 Krisko et al. Feb 2006 A1
20060096918 Semmens May 2006 A1
20060112447 Bogdanova et al. May 2006 A1
20060127988 Wood et al. Jun 2006 A1
20060191034 Baum Aug 2006 A1
20060243011 Someus Nov 2006 A1
20070249018 Vemuri et al. Oct 2007 A1
20080295207 Baum et al. Nov 2008 A1
20080311632 Figge et al. Dec 2008 A1
20090105076 Stewart et al. Apr 2009 A1
20090137390 Triplett May 2009 A1
20090144852 Tomso et al. Jun 2009 A1
20090152195 Rodgers et al. Jun 2009 A1
20090162477 Nadel Jun 2009 A1
20090258404 Mikkelsen et al. Oct 2009 A1
20090308121 Reddy et al. Dec 2009 A1
20100005543 Sampson et al. Jan 2010 A1
20100017914 Hart et al. Jan 2010 A1
20100028870 Welch et al. Feb 2010 A1
20100184038 Boddy et al. Jul 2010 A1
20100197592 Heinrichs Aug 2010 A1
20100267147 Qiao Oct 2010 A1
20100298211 Carozzi et al. Nov 2010 A1
20110023184 Desai et al. Jan 2011 A1
20110064710 Benson et al. Mar 2011 A1
20110104690 Yu et al. May 2011 A1
20110263488 Carozzi et al. Oct 2011 A1
20120015806 Paikray et al. Jan 2012 A1
20120107889 Doty et al. May 2012 A1
20120192605 McSpadden Aug 2012 A1
20120266332 Kuykendall Oct 2012 A1
20120278954 Bowen et al. Nov 2012 A1
20120284813 Olivier et al. Nov 2012 A1
20120311745 Meade et al. Dec 2012 A1
20120311746 Meade et al. Dec 2012 A1
20120317681 Meade et al. Dec 2012 A1
20120317682 Meade et al. Dec 2012 A1
20120324605 Meade et al. Dec 2012 A1
20120324606 Meade et al. Dec 2012 A1
20120331589 Meade et al. Dec 2012 A1
20120331590 Meade et al. Dec 2012 A1
20130116170 Graser et al. May 2013 A1
20130126428 Jones et al. May 2013 A1
20130167268 Narva et al. Jun 2013 A1
20130167269 Narva et al. Jun 2013 A1
20140011261 Wang et al. Jan 2014 A1
20140155283 Venkateswaran Jun 2014 A1
20140182018 Lang et al. Jun 2014 A1
20140223598 Sampson et al. Aug 2014 A1
20140223599 Sampson et al. Aug 2014 A1
20140230504 Finlayson et al. Aug 2014 A1
20140273226 Wu Sep 2014 A1
20140301990 Gregory et al. Oct 2014 A1
20140329326 Mirsky Nov 2014 A1
20140336050 Soto, Sr. Nov 2014 A1
20150080261 Wigley et al. Mar 2015 A1
20150101373 Munusamy Apr 2015 A1
20150128670 Das May 2015 A1
20150237807 Valiquette Aug 2015 A1
20150239789 Kang et al. Aug 2015 A1
20150315570 Zhao et al. Nov 2015 A1
20160174570 Vujanovic et al. Jun 2016 A1
20160264929 Barney et al. Sep 2016 A1
20160292355 Lou et al. Oct 2016 A1
20160295868 Jones et al. Oct 2016 A1
20170086402 Meadows-Smith et al. Mar 2017 A1
20170119690 Hansen et al. May 2017 A1
20170152519 Mirsky Jun 2017 A1
20170267997 Nicol et al. Sep 2017 A1
20170367349 Gruver et al. Dec 2017 A1
20180002243 Temme Jan 2018 A1
20180020671 Bioconsortia Jan 2018 A1
20180065896 Ibema et al. Mar 2018 A1
20180073028 Mirsky Mar 2018 A1
20180273437 Temme et al. Sep 2018 A1
20180290942 Voigt et al. Oct 2018 A1
20180297905 Temme et al. Oct 2018 A1
20180297906 Temme et al. Oct 2018 A1
20190039964 Temme et al. Feb 2019 A1
20190144352 Temme et al. May 2019 A1
20200087221 Temme et al. Mar 2020 A1
20200115715 Mirsky et al. Apr 2020 A1
20200299637 Voigt et al. Sep 2020 A1
20200308594 Tamsir et al. Oct 2020 A1
20200331820 Tamsir et al. Oct 2020 A1
20210009483 Temme et al. Jan 2021 A1
20210163374 Bioch et al. Jun 2021 A1
20210214282 Temme et al. Jul 2021 A1
20210315212 Rezaei et al. Oct 2021 A1
20220017911 Temme et al. Jan 2022 A1
20220079163 Reisinger et al. Mar 2022 A1
Foreign Referenced Citations (95)
Number Date Country
636565 May 1993 AU
2051071 Mar 1993 CA
1289852 Apr 2001 CN
1500801 Jun 2004 CN
1552846 Dec 2004 CN
1746304 Mar 2006 CN
101880676 Nov 2010 CN
102041241 May 2011 CN
102417882 Apr 2012 CN
102690808 Sep 2012 CN
103451130 Dec 2013 CN
104136599 Nov 2014 CN
104204211 Dec 2014 CN
0256889 Feb 1988 EP
0292984 Nov 1988 EP
339830 Nov 1989 EP
1535913 Jun 2005 EP
2186890 May 2010 EP
3322679 May 2018 EP
2910230 Jun 2008 FR
S63-501924 Aug 1988 JP
H01225483 Sep 1989 JP
H02-131581 May 1990 JP
2009-232721 Oct 2009 JP
2014096996 May 2014 JP
2015037385 Feb 2015 JP
2015042633 Mar 2015 JP
2015-518023 Jun 2015 JP
2015113274 Jun 2015 JP
2015-519352 Jul 2015 JP
WO 1987004182 Jul 1987 WO
WO 9305154 Mar 1993 WO
WO 9810088 Mar 1998 WO
WO 9909834 Mar 1999 WO
WO 0057183 Sep 2000 WO
WO 0107567 Feb 2001 WO
WO 2004074462 Sep 2004 WO
WO 2005021585 Mar 2005 WO
WO 2005038032 Apr 2005 WO
WO 2006005100 Jan 2006 WO
WO 2006083891 Aug 2006 WO
WO 2006098225 Sep 2006 WO
WO 2006119457 Nov 2006 WO
WO 2007027776 Mar 2007 WO
WO 2009060012 May 2009 WO
WO 2009091557 Jul 2009 WO
WO 2010080184 Jul 2010 WO
WO 2011099019 Aug 2011 WO
WO 2011099024 Aug 2011 WO
WO 2011103247 Aug 2011 WO
WO 2011103248 Aug 2011 WO
WO 2011154960 Dec 2011 WO
WO 2012139004 Oct 2012 WO
WO 2012154651 Nov 2012 WO
WO 2012174271 Dec 2012 WO
WO 2013076687 May 2013 WO
WO 2013132518 Sep 2013 WO
WO 2014042517 Mar 2014 WO
WO 2014071182 May 2014 WO
WO 2014201044 Dec 2014 WO
WO 2016016629 Feb 2016 WO
WO 2016016630 Feb 2016 WO
WO 2016100727 Jun 2016 WO
WO 2016146955 Sep 2016 WO
WO 2016178580 Nov 2016 WO
WO 2016179046 Nov 2016 WO
WO 2016181228 Nov 2016 WO
WO 2016191828 Dec 2016 WO
WO 2017011602 Jan 2017 WO
WO 2017042833 Mar 2017 WO
WO 2017062412 Apr 2017 WO
WO 2017069717 Apr 2017 WO
WO 2017112827 Jun 2017 WO
WO 2017203440 Nov 2017 WO
WO 2018081543 May 2018 WO
WO 2018132774 Jul 2018 WO
WO 2018133774 Jul 2018 WO
WO 2019032926 Feb 2019 WO
WO 2019084342 May 2019 WO
WO 2019140125 Jul 2019 WO
WO 2020006064 Jan 2020 WO
WO 2020006246 Jan 2020 WO
WO 2020014498 Jan 2020 WO
WO 2020023630 Jan 2020 WO
WO 2020061363 Mar 2020 WO
WO 2020092940 May 2020 WO
WO 2020118111 Jun 2020 WO
WO 2020146372 Jul 2020 WO
WO 2020163251 Aug 2020 WO
WO 2020190363 Sep 2020 WO
WO 2020191201 Sep 2020 WO
WO 2020219893 Oct 2020 WO
WO 2020219932 Oct 2020 WO
WO 2021113352 Jun 2021 WO
WO 2021146209 Jul 2021 WO
Non-Patent Literature Citations (409)
Entry
US 8,476,226 B2, 07/2013, Lira (withdrawn)
Zhang et al. (World J Microbiol Biotechnol (2015) 31:921-927). (Year: 2015).
Temme, et al. (Proceedings of the National Academy of Sciences 109.18 (2012): 7085-7090). (Year: 2012).
Lee et al. (Planta (2009) 229:747-755). (Year: 2009).
Cornelis (Nature Reviews Microbiology 4.11 (2006): 811). (Year: 2006).
Kent et al. (Appl. Environ. Microbiol. 64.5 (1998): 1657-1662). (Year: 1998).
Extended European Search Report dated Feb. 20, 2019 for Application No. EP 16854192.8.
International Search Report and Written Opinion mailed Dec. 30, 2016 for Application No. PCT/US2016/055429.
International Preliminary Report on Patentability dated Apr. 19, 2018 for Application No. PCT/US2016/055429.
[No. Author Listed], 40 CFR 725.3 U.S. Government Publishing Office. Jul. 1, 2010. Retrieved from https://www.gpo.gov/fdsys/pkg/CFR-2010-title40-vol30/pdf/CFR-2010-title40-vol30-sec725-3.pdf 3 pages.
[No. Author Listed], T7 RNA Polymerase Expression System for Bacillus megaterium; T7 RNAP Expression System Handbook, Jan. 2010, © MoBiTec GmbH. 18 pages.
[No. Author Listed], BLAST. Basic local alignment search tool. Available at http://blast.ncbi.nIm.nih.gov/Blast.cgi. Accessed on Oct. 10, 2016. 2 pages.
[No. Author Listed], EMBOSS. EMBOSS Needle: Pairwise Sequence Alignment (Nucleotide). Available at http://www.ebi.ac.uk/Tools/psa/emboss_needle/nucleotide.html. Accessed on Oct. 10, 2016. 2 pages.
[No. Author Listed], EMBOSS. EMBOSS Water: Pairwise Sequence Alignment (Nucleotide). Available at http://www.ebi.ac.uk/Tools/psa/emboss_water/nucleotide.html. Accessed on Oct. 10, 2016. 2 pages.
Alper et al., Tuning genetic control through promoter engineering. Proc Natl Acad Sci U S A. Sep. 6, 2005; 102(36):12678-83. Epub Aug. 25, 2005. Erratum in: Proc Natl Acad Sci U S A. Feb. 21, 2006;103(8):3006.
Altschul et al. Basic local alignment search tool. J Mol Biol 215(3):403-410 (1990).
Altschul, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389-3402 (1977).
An et al. Constitutive expression of the nifA gene activates associative nitrogen fixation of Enterobacter gergoviae 57-7, an opportunistic endophytic diazotroph. Journal of Applied Microbiology 103(3):613-620 (Sep. 1, 2007). First published Feb. 7, 2007.
Andersen, et al. Herpesvirus-mediated gene delivery into the rat brain: specificity and efficiency of the neuron-specific enolase promoter. Cell Mol Neurobiol. Oct. 1993;13(5):503-15.
Andersen, et al. Energetics of biological nitrogen fixation: determination of the ratio of formation of H2 to NH4+ catalysed by nitrogenase of Klebsiella pneumoniae in vivo. J Gen Microbial. Nov. 1977; 103(1):107-22.
Andrews et al. Use of Nitrogen Fixing Bacteria Inoculants as a Substitute for Nitrogen Fertiliser for Dryland Graminaceous Crops: Progress Made, Mechanisms of Action and Future Potential. Symbiosis. 2003;34:1-21.
Arbuthnot, et al. In vitro and in vivo hepatoma cell-specific expression of a gene transferred with an adenoviral vector. Hum Gene Ther. Aug. 20, 1996;7(13): 1503-14.
Arsene, et al., Modulation of NifA activity by PII in Azospirillum brasilense: Evidence for a Regulatory role of the NifA N-Terminal Domain. J Biotechnol. Aug. 1996;178(16):4830-8.
Austin, et al. Characterisation of the Klebsiella pneumoniae nitrogen-fixation regulatory proteins NIFA and NIFL in vitro. Eur J Biochem. Jan. 26, 1990;187(2):353-60.
Bageshwar, et al. An Environmentally Friendly Engineered Azotobacter Strain That Replaces a Substantial Amount of Urea Fertilizer while Sustaining the Same Wheat Yield. Appl Environ Microbiol. Aug. 1, 2017; 83(15):e00590-17, 14 pages.
Bali, et al., Excretion of Ammonium by a nifL Mutant of Azotobacter vine landii fixing Nitrogen. Appl Environ Microbiol. May 1992;58(5):1711-8.
Barney, et al., Gene deletions resulting in increased nitrogen release by azotobacter vinelandii: application of a novel nitrogen biosensor. Appl. Environ. Microbiol. 2015; 81 (13):4316-4328. Published online Apr. 17, 2015.
Barney, et al., Transcriptional analysis of an Ammonium-excreting stain of azotobacter vinelandii deregulated for nitrogen fixation. Appl. Environ. Microbial. 2017; 83(20): 1-22.
Barrango et al., Exploiting CRISPR-Cas immune systems for genome editing in bacteria. Curr Opin Biotechnol. 2016;37:61-8.
Beringer, et al., Genetic engineering and nitrogen fixation. Biotech Gen Eng Rev. 1984; 1(1):65-88.
Bikard et al., The synthetic integron: an in vivo genetic shuffling device. Nucleic Acids Res. Aug. 2010;38(15):e153. doi:10.1093/nar/gkq511. Epub Jun. 9, 2010.
Bilitchenko et al., Eugene-a domain specific language for specifying and constraining synthetic biological parts, devices, and systems. PLoS One. Apr. 29, 2011;6(4):e18882. doi: 10.1371/journal.pone.0018882. 12 pages.
Blanco, et al. Sequence and molecular analysis of the nifL gene of Azotobacter vinelandii. Mol Microbial. Aug. 1993;9(4):869-79.
Boshart, et al. A very strong enhancer is located upstream of an immediate early gene of human cytomegalovirus. Cell. Jun. 1985;41(2):521-30.
Bosworth, et al. Alfalfa yield response to inoculation with recombinant strains of Rhizobium meliloti with an extra copy of dctABD and/or modified nifA expression. Appl Environ Microbial. Oct. 1994;60(10):3815-32.
Brandl, et al. Salmonella interactions with plants and their associated microbiota. Phytopathology. 2013;103:316-25.
Brewin, et al., The Basis of Ammonium release in nifL Mutants of Azotobacter vinelandii. J Bacteriol. Dec. 1999;181(23):7356-62.
Buchanan-Wollaston, et al. Role of the nifA gene product in the regulation of nif expression in Klebsiella pneumoniae. Nature. Dec. 2, 19814;294(5843):776-8.
Buddrus-Schiemann, et al. Root colonization by Pseudomonas sp. DSMZ 13134 and impact on the indigenous rhizosphere bacterial community of barley. Microb Ecol. Aug. 2010;60(2):381-93. doi: 10.1007/s00248-010-9720-8. Epub Jul. 20, 2010.
Chan et al., Refactoring bacteriophage T7. Mol Sys Biol. Sep. 13, 2005;1(1): E1-10. oi:10.1038/msb4100025.
Chen, et al. Expression of rat bone sialoprotein promoter in transgenic mice. J Bone Miner Res. May 1996;11(5):654-64.
Chen, et al., Complete genome sequence of Kosakonia sacchari type strain SP1T. Stand Genomic Sci. Jun. 15, 2014; 9(3): 1311-1318.
Chiang, et al., Mutagenic Oligonucleotide-directed PCR Amplification (Mod-PCR): An Efficient Method for Generating Random Base Substitution Mutations in a DNA sequence element. PCR Method Appl. 1993; 2:210-217.
Choi, et al. A Tn7-based broad-range bacterial cloning and expression system. Nat Methods. Jun. 2005;2(6):443-8.
Choudhary, et al. Interactions of Bacillus spp. and Plants—With Special Reference to Induced Systemic Resistance (ISR). Microbiol Res. 2009;164(5):493-513. doi: 10.1016/j.micres.2008.08.007.
Cobb et al., Directed evolution: an evolving and enabling synthetic biology tool. Curr Opin Chem Biol. Aug. 2012;16(3-4):285-91. doi:10.1016/j.cbpa.2012.05.186. Epub Jun. 4, 2012.
Cohen, J.D., In vitro Tomato Fruit Cultures Demonstrate a Role for lndole-3-acetic Acid in Regulating Fruit Ripening. J Amer Soc Hort Sci. 1996;121(3):520-4.
Colnaghi et al., Lethality of glnD null mutations in Azotobacter vinelandii is suppressible by prevention of glutamine synthetase adenylylation. Microbiology. May 2001;147(5):1267-76.
Colnaghi et al., Strategies for increased ammonium production in free-living or plant associated nitrogen fixing bacteria. Plant and Soil, 1997;194:145-54.
Conniff, R., Microbes help grow better crops. Sci Amer. Sep. 1, 2013. 7 pages.
Contreras, et al. The product of the nitrogen fixation regulatory gene nfrX of Azotobacter vinelandii is functionally and structurally homologous to the uridylyltransferase encoded by glnD in enteric bacteria. J Bacterial. Dec. 1991;173(24):7741-9.
Curatti, et al., Genes required for rapid expression of nitrogenase activity in Azotobacter vinelandii. PNAS. 2005;102(18):6291-6.
Datsenko et al., One-step inactivation of chromosomal genes in Escherichia coli K-12 using PCR products. PNAS. Jun. 6, 2000;97(12):6640-5.
Debruijn et al., The Cloning and characterization of the glnF (ntrA) Gene of Klebsiella pneumoniae: Role of glnF (ntrA) in the Regulation of Nitrogen Fixation (nif) and other Nitrogen assimilation genes. Mol Genet. 1983;192:342-53.
Delaux et al., Tracing the evolutionary path to nitrogen-fixing crops. Curr Opin Plant Biol. 2015;26:95-9.
Dent et al., Establishing symbiotic nitrogen fixation in cereals and other non-legume crops: The greener nitrogen revolution. Agric & Food Secur. 2017;6(7):1-9.
Desnoues et al., Nitrogen fixation genetics and regulation in a Pseudomonas stutzeri strain associated with rice. Microbiology.2003;149:2251-62. doi: 10.1099/mic.0.26270-0.
Dixon et al., Genetic regulation of biological nitrogen fixation. Nat Rev. 2004;2:621-31.
Dong et al., Kinetics and Strain Specificity of Rhizosphere and Endophytic Colonization by Enteric Bacteria on Seedlings of Medicago sativa and Medicago truncatula. Appl Environ Microbiol. Mar. 2003;69(3):1783-90. doi: 10.1128/AEM.69.3.1783-1790.2003.
Dos Santos et al., Distribution of nitrogen fixation and nitrogenase-like sequences amongst microbial genomes. BMC Genomics.2012;13:162, 12 pages.
Du et al., Customized optimization of metabolic pathways by combinatorial transcriptional engineering. Nucl Acids Res. Oct. 2012;40(18):e142. doi: 10.1093/nar/pks549. Epub Jun. 19, 2012. 10 pages.
Easter et al., Role of the parCBA Operon of the Broad-Host-Range Plasmid RK2 in Stable Plasmid Maintenance. J Bacteriol. Nov. 1998;180(22):6023-30.
Egener et al., Identification of NifL-like protein in a diazotroph of the b-subgroup of the proteobacteria, Azoarcus sp. strain BH72. Microbiol. 2002;148:3203-12.
Engler, et al. A one pot, one step, precision cloning method with high throughput capability. PLoS One. 2008;3(11):e3647. doi: 10.1371/journal.pone.0003647. Epub Nov. 5, 2008. 7 pages.
Engler, et al. Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes. PLoS One. 2009;4(5):e5553. doi: 10.1371/journal.pone.0005553. Epub May 14, 2009. 9 pages.
Ferrieres, et al. The yjbEFGH locus in Escherichia coli K-12 is an operon encoding proteins involved in exopolysaccharide production. Microbiology. Apr. 2007;153(4):1070-80.
Fischbach et al., Prokaryotic gene clusters: A rich toolbox for synthetic biology. Biotechnol J. 2010;15(12):1277-96.
Fox et al., Major cereal crops benefit from biological nitrogen fixation when inoculated with the nitrogen-fixing bacterium Pseudomonas protegens Pf-5 X940. Environ Microbiol. 2016;18(10):3522-34.
Frasch et al., Design-based re-engineering of biosynthetic gene clusters: plug-and-play in practice. Curr Opin Biotechnol. Dec. 2013;24(6): 1144-50. doi: 10.1016/j.copbio.2013.03.006. Epub Mar. 27, 2013.
Gebeyehu et al., Novel biotinylated nucleotide-analogs for labeling and colorimetric detection of DNA. Nucl Acids Res. 1987;15:4513-34.
Geddes et al., Use of plant colonizing bacteria as chassis for transfer of N2-fixation to cereals. Curr Opin Biotechnol. 2015;32:216-22.
Gibson, Physical Environment and Symbiotic Nitrogen Fixation. Aust J Biol Sci. 1963; 16:28-42.
Gossen et al. Tight control of gene expression in mammalian cells by tetracycline-responsive promoters. PNAS USA. Jun. 1992;89:5547-51.
Gossen et al. Transcriptional activation by tetracyclines in mammalian cells. Sci. Jun. 23, 1995;268(5218):1766-9.
Govantes et al., Mechanism of coordinated synthesis of the antagonistic regulatory proteins NifL and NifA of Klebsiella pneumoniae. J Bacteriol. Dec. 1996;178(23):6817-23.
Guo et al., Discovery of Reactive Microbiota-Derived Metabolites that Inhibit Host Proteases. Cell. Jan. 26, 2017;168(3):517-26. doi:10.1016/j.cell.2016.12.021. Epub Jan. 19, 2017.
Haapalainen et al., Soluble plant cell signals induce the expression of the type III secretion system of Pseudomonas syringae and upregulate the production of pilus protein HrpA. Mol Plant Microbe Interact. 2009;22:282-90.
Hale et al., An efficient stress-free strategy to displace stable bacterial plasmids. BioTechniques 2010;48:223-8.
Hansal et al., Induction of antigen-specific hyporesponsiveness by transplantation of hemopoietic cells containing an MHC class I transgene regulated by a lymphocyte-specific promoter. J Immunol. Aug. 1, 1998; 161(3):1063-8.
Harvey et al., Inducible control of gene expression: prospects for gene therapy. Curr Opin Chem Biol. Aug. 1998;2(4):512-8.
Herlache et al., Characterization of the Agrobacterium vitis pehA gene and comparison of the encoded polygalacturonase with the homologous enzymes from Erwinia carotovora and Ralstonia solanacearum. Appl Environ Microbiol. Jan. 1997;63(1):338-46.
Hidaka et al., Promotion of the Growth of Rice by Inoculation of Nitrogen-Fixing-Activity-Enhanced Bacteria to the Rhizosphere. Curr Plant Sci Biotechnol Agri. 1999;38:445.
Holden et al., Colonization outwith the colon: plants as an alternative environmental reservoir for human pathogenic enterobacteria. FEMS Microbiol Rev. 2009;33:689-703.
Hunter, “Genetically Modified Lite” placates public but not activists. EMBO Reports. 2014;15(2):138-41.
Iniguez et al., Nitrogen Fixation in Wheat Provided by Klebsiella pneumoniae 342. MPMI. 2004; 17(10):1078-85.
Jaschke et al., A fully decompressed synthetic bacteriophage øX174 genome assembled and archived in yeast. Virology. 2012;434:278-84.
Kant et al., Understanding plant response to nitrogen limitation for the improvement of crop nitrogen use efficiency. J Exper Botany. 2011;62(4):1499-509.
Karlin et al., Applications and statistics for multiple high-scoring segments in molecular sequences. PNAS USA. Jun. 15, 1993;90(12):5873-7.
Karlin et al., Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. PNAS USA. Mar. 1990;87(6):2264-8.
Kerby et al., Photoproduction of ammonium by immobilized mutant strains of Anabaena variabilis. Applied Microbiology and Biotechnology. Apr. 1986, vol. 24, Issue 1, pp. 42-46.
Kim, et al. Constitutive expression of nitrogenase system in Klebsiella oxytoca by gene targeting mutation to the chromosomal nifLA operon. J Biotechnol. Jun. 1989;10(3-4):293-301.
Kornberg, DNA Replication. Stanford University. 1980. pp. 75-77. 5 pages.
Kurzweil, Plant Bacteria breakthrough enables crops worldwide to take nitrogen from the air. Plant Bacteria Breakthrough Enables Crops Worldwide Take Nitrogen From Air. Aug. 1, 2013. 4 pages.
Kutter et al., Colonization of barley (Hordeum vulgare) with Salmonella enterica and Listeria spp. FEMS Microbiol Ecol. 2006;56:262-71.
Lauritsen et al., A versatile one-step CRISPR-Cas9 based approach to plasmid-curing. Microb Cell Fact. 2017;16(135):1-10.
Leang et al., Genome-wide analysis of the RpoN regulon in Geobacter sulfurreducens. BMC Genomics. Jul. 22, 2009;10:331. doi: 10.1186/1471-2164-10-331.
Liang et al., Minimal effect of gene clustering on expression in Escherichia coli. Genetics. Feb. 2013;193(2):453-65. doi: 10.1534/genetics. 112.147199. Epub Dec. 5, 2012.
Lim et al., Fundamental relationship between operon organization and gene expression. PNAS USA. Jun. 28, 2011; 108(26):10626-31. doi: 10.1073/pnas.1105692108. Epub Jun. 13, 2011.
Liu et al., Whole genome analysis of halotolerant and alkalotolerant plant growth-promoting rhizobacterium Klebsiella sp. D5A. Sci Rep. May 24, 2016;6:1-10.
MacNeil et al., Fine-structure mapping and complementation analysis of nif (nitrogen fixation) genes in Klebsiella pneumoniae. J Bacterial. Oct. 1978;136(1):253-66.
MacNeil et al., Mutations in nif genes that cause Klebsiella pneumoniae to be derepressed for nitrogenase synthesis in the presence of ammonium. J Bacterial. Nov. 1980;144(2):744-51.
Magari et al., Pharmacologic control of a humanized gene therapy system implanted into nude mice. J Clin Invest. Dec. 1, 1997;100(11):2865-72.
Marroqui et al., Enhanced Symbiotic Performance by Rhizobium tropici Glycogen Synthase Mutants. J Bacteriol. Feb. 2001;183(3):854-64.
Martinez-Noel et al., NifB and NifEN protein levels are regulated by ClpX2 under nitrogen fixation conditions in Azotobacter vinelandii. Mol Microbiol. Mar. 2011;79(5):1182-93. doi: 10.1111/j.1365-2958.2011.07540.x. Epub Jan. 25, 2011.
Marx et al., Broad-host-range cre-lox system for antibiotic marker recycling in gram-negative bacteria. Biotechniques. Nov. 2002;33(5):1062-7.
Masepohl et al., Organization and regulation of genes encoding the molybdenum nitrogenase and the alternative nitrogenase in Rhodobacter capsulatus. Arch Microbial. 1996;165:80-90.
Matsubayashi et al., Peptide hormones in plants. Annu Rev Plant Biol. 2006;57:649-74.
Medema et al., Computational tools for the synthetic design of biochemical pathways. Nat Rev Microbiol. Jan. 2, 20123;10(3):191-202. doi: 10.1038/nrmicro2717.
Mengel, Roots, growth and nutrient uptake. Purdue Univ. Dept of Agronomy. May 1995. Publication No. AGRY-95-08. 8 pages.
Mirsky, Refactoring the Salmonella Type Ill Secretion System. University of California, San Francisco Dissertation. Doctor of Philosophy in Biophysics. Apr. 12, 2012. 59 pages.
Mitra, Regulation of nifLA operon in Azotobacter vinelandii. Jawaharlal Nehru University Thesis. Doctor of Philosophy in Biotechnology. 2000. 163 pages.
Moon et al., Genetic programs constructedfrom layered logic gates in single cells. Nature. Nov. 8, 2012;491(7423):249-53. doi: 10.1038/naturell516. Epub Oct. 7, 2012.
Mueller et al., Closing yield gaps through nutrient and water management. Nature. 2012;490:254-57.
Mus et al., Symbiotic Nitrogen Fixation and the Challenges to Its Extension to Nonlegumes. Appl Environ Microbial. Jul. 1, 2016; 82(13): 3698-710. doi: 10.1128/AEM.01055-16. Epub Jun. 13, 2016. Pre-Epub Apr. 15, 2016.
Nassar et al., Promotion of plant growth by an auxin-producing isolate of the yeast Williopsis saturnus endophytic in maize (Zea mays L.) roots. Biol Fertil Soils. 2005;42:97-108.
Nelissen et al., Translational research: from pot to plot. Plant Biotechnol J. 2014;12:277-85.
Nestmann, Mutagenesis by nitrosoguanidine, ethyl methanesulfonate, and mutator gene mutH in continuous cultures of Escherichia coli. Sci Direct. Jun. 1975;28(3):323-30.
Nichkawade, Studies on upstream regulatory sequence of the nifLA promoter of Klebsiella pnuemoniae. Jawaharlal Nehru University Thesis. Doctor of Philosophy in Biotechnology. 1996. 166 pages.
Nielsen, Transgenic organisms—time for conceptual diversification? Nat Biotechnol. 2003;21:227-8.
No et al., Ecdysone-inducible gene expression in mammalian cells and transgenic mice. PNAS USA. Apr. 1996;93(8):3346-51.
Okubo et al., Effects of Elevated Carbon Dioxide, Elevated Temperature, and Rice Growth Stage on the Community Structure of Rice Root-Associated Bacteria. Microbes Environ. Jun. 2014; 29(2):184-90. doi: 10.1264/jsme2.ME14011. Epub May 31, 2014.
Ortiz-Marquez et al., Association with an Ammonium-excreting bacterium allows diazotrophic culture of oil-rich Eukaryotic microalagae. Appl Microbial. 2012;78(7):2345-52.
Pfleger, et al. Combinatorial engineering of intergenic regions in operons tunes expression of multiple genes. Nat Biotechnol. Aug. 2006;24(8):1027-32. Epub Jul. 16, 2006.
Piccioli et al., Neuroantibodies: ectopic expression of a recombinant anti-substance P antibody in the central nervous system of transgenic mice. Neuron. Aug. 1995;15(2):373-84.
Piccioli et al., Neuroantibodies: molecular cloning of a monoclonal antibody against substance P for expression in the central nervous system. PNAS USA. Jul. 1, 1991;88(13): 5611-5.
Plotnikova et al., Pathogenesis of the human opportunistic pathogen Pseudomonas aeruginosa PA14 in Arabidopsis. Plant Physiol. 2000;124:1766-74.
Qiu et al., Construction of genetically engineered strains of Enterobacter cloacae (nifl-(-) A-(c)). Acta Phytophysiologica Sinica. Jan. 1, 1999;25(3):269-73.
Ran et al., Genome erosion in a nitrogen-fixing vertically transmitted endosymbiotic multicellular cyanobacterium. PLoS One. Jul. 8, 2010;5(7):ell486, 11 pages, doi: 10.1371/journal.pone.0011486.
Roberts et al., Regulation and characterization of protein products coded by the nif (nitrogen fixation) genes of Klebsiella pneumoniae. J Bacterial. Oct. 1978;136(1):267-79.
Rogers et al., Synthetic biology approaches to engineering the nitrogen symbiosis in cereals. J Exper Botany. 2014;65(8):1939-46.
Rommens et al., Intergeneric transfer and functional expression of the tomato disease resistance gene Pto. Plant Cell. Oct. 1995;7(10):1537-44.
Roncato-Maccari et al., Endophytic Herbaspirillum seropedicae expresses nif genes in gramineous plants. FEMS Microbiology Ecology. 2003;45:39-47.
Rosenblueth et al., Bacterial Endophytes and Their Interaction with Hosts. Mol Plant Microbe Interact. Aug. 2006;19(8):827-37.
Rosenblueth et al., Nitrogen Fixation in Cereals. Frontiers in Microbiol. Aug. 9, 2018;9:1794, 13 pages.
Saikia et al., Biological nitrogen fixation with non-legumes: An achievable target or a dogma? Curr Sci. 2007;92(3):317-22.
Sandig et al., HBV-derived promoters direct liver-specific expression of an adenovirally transduced LDL receptor gene. Gene Ther. Nov. 1996;3(11):1002-9.
Santi et al., Biological nitrogen fixation in non-legume plants. Annals of Botany. 2013;111:743-67.
Schmitz et al., Iron is required to relieve inhibitory effects on Nifl on transcriptional activation by NifA in Klebsiella pneumoniae. J Bacterial. Aug. 1996;178(15):4679-87.
Schouten et al., Do cisgenic plants warrant less stringent oversight? Nat Biotechnol. 2006;24:753.
Service, Genetically engineered microbes make their own fertilizer, could feed the world's poorest. Science. Apr. 2017. 2 pages. doi:I0.H26/science.aallOOO.
Setten et al., Engineering Pseudomonas protegens Pf-5 for Nitrogen Fixation and its application to improve plant growth under nitrogen-deficient conditions. PLOS One. 2013;8(5):e63666. 14 pages.
Shamseldin, The role of different genes involved in symbiotic nitrogen fixation—review. Global J Biotechnol Biochem. 2013;8(4):84-94.
Sibold et al., A nif mutant of Klebsiella pneumoniae fixing nitrogen in the presence of ammonia. FEMS Microbiol Lett. Jan. 1, 1981;10(1):37-41.
Siddavattam et al., Regulation of nif Gene expression in Enterobacter agglomerans: Nucleotide sequence of the nifLA operon and influence of temperature and ammonium on its transcription. Mol Gen Genet. Dec. 20, 1995;249(6):629-36.
Singh et al., An L-methionine-D,L-sulfoximine-resistant mutant of the cyanobacterium Nostoc muscorum showing inhibitor-resistant γ-glutamyl-transferase, defective glutamine synthetase and producing extracellular ammonia during N2 fixation. FEBS Letters. Apr. 5, 1983;154(1):10-4.
Sleight et al., Designing and engineering evolutionary robust genetic circuits. J Biol Engin. 2010;4(12):1-20.
Smanski et al., Synthetic biology to access and expand nature's chemical diversity. Nat Rev Microbiol. Mar. 2016;14(3):135-49. doi: 10.1038/nrmicro.2015.24.
Smanski, et al. Functional optimization of gene clusters by combinatorial design and assembly. Nat Biotechnol. Dec. 2014;32(12):1241-9. doi: 10.1038/nbt.3063. Epub Nov. 24, 2014, 12 pages.
Souza, et al., The N-Terminus of the NIFA protein of herbaspirillum seropedicae is probably involved in sensing of ammonia. Proceedings of the 10th International Congress on Nitrogen Fixation, St. Petersburg, Russia. 1995:260, 1 page.
Spiller et al., Isolation and characterization of nitrogenase-derepressed mutant strains of cyanobacterium Anabaena variabilis. J Bacteriol. Feb. 1986;165(2):412-9.
Steenhoudt et al., Azospirillum, a free-living nitrogen-fixing bacterium closely associated with grasses: genetic, biochemical and ecological aspects. FEMS Microbial Rev. 2000;24:487-506.
Stein et al., The osteocalcin gene: a model for multiple parameters of skeletal-specific transcriptional control. Mol Biol Rep. Aug. 1997;24(3):185-96.
Stemmer, DNA shuffling by random fragmentation and reassembly: In vitro recombination for molecular evolution. PNAS USA. Oct. 1994;91:10747-51.
Stemple, Tilling—a high-throughput harvest for functional genomics. Nat Rev Genet. Feb. 2004;5:145-50. doi:10.1038/nrgl273.
Stephanopoulos, Challenges in engineering microbes for biofuels production. Science. Feb. 9, 2007;315(5813):801-4.
Subtil et al., Secretion of Predicted Inc Proteins of Chlamydia pneumoniae by a Heterologous Type III Machinery. Mol Microbiol. Feb. 2001;39(3):792-800. doi: 10.1046/j.1365-2958.2001.02272.x.
Swain et al., Nitrogen fixation and its improvement through genetic engineering. J Global Biosciences. 2013;2(5):98-112.
Temme et al., Modular control of multiple pathways using engineered orthogonal T7 polymerases. Nucleic Acids Res. Sep. 1, 2012;40(17):8773-81. Epub Jun. 28, 2012.
Temme, et al., Refactoring the nitrogen fixation gene cluster from Klebsiella oxytoca. PNAS, May 1, 2012;109(18):7085-90.
Temme, Designing and Engineering Complex Behavior in Living Machines. University of California, San Francisco Dissertation. Doctor of Philosophy in Bioengineering. Oct. 1, 2011. 74 pages.
Thomas, et al. Ammonium Excretion by an 1-Methionine-dl-Sulfoximine-Resistant Mutant of the Rice Field Cyanobacterium Anabaena siamensis. Appl Environ Microbiol. Nov. 1990; 56(11):3499-504.
Tilman et al., Global food demand and the sustainable intensification of agriculture. PNAS. Oct. 12, 2011;108(50):20260-4.
Triplett, Diazotrophic endophytes: progress and prospects for nitrogen fixation in monocots. Plant and Soil. 1996;186:29-38.
Tritt et al., An Integrated Pipeline for de Novo Assembly of Microbial Genomes. PLOS one. Sep. 13, 2012;7(9):e42304. doi: 10.1371/journal.pone.0042304. 9 pages.
Ueda et al., Remarkable N2-Fixing Bacterial Diversity Detected in Rice Roots by Molecular Evolutionary Analysis of nifH Gene Sequences. J Bacteriol. Mar. 1995;177(5):1414-7.
Vernon et al., Analysis of 16S rRNA gene sequences and circulating cell-free DNA from plasma of chronic fatigue syndrome and non-fatigued subjects. BMC Microbiol. Dec. 23, 2002;2:39, 6 pages.
Villa et al., Azotobacter vinelandii siderophore can provide nitrogen to support the culture of the green algae neochloris oleoabundans and scenedesmus. FEMS Microbiol Lett. 2014;351(1):70-7.
Voigt et al., Genetic parts to program bacteria. Curr Opin Biotechnol. 2006;17(5):548-57.
Voigt, Gaining Access: Rebuilding Genetics from the Ground Up. MIT. Department of Biological Engineering. Mar. 14, 2011. 20 pages.
Wang et al., Positive and negative regulation of gene expression in eukaryotic cells with an inducible transcriptional regulator. Gene Ther. May 1997;4:432-41.
Wang et al., A minimal nitrogen fixation gene cluster from Paenibacillus sp. WLY78 enables expression of active nitrogenase in Escheichia coli. PLoS Genet. Oct. 17, 2013;9(10):e1003865. 11 pages. doi:10.1371/journal.pgen.1003865.
Wang et al., Using Synthetic biology to distinguish and overcome regulatory and functional barriers related to nitrogen fixation. PLoS One. 2013;8(7):e68677. 11 pages.
Watanabe et al., Chapter 15. Plasmid-borne gene cluster assemblage and heterologous biosynthesis of nonribosomal peptides in Escherichia coli. Methods Enzymol. 2009;458:379-99. doi: 10.1016/S0076-6879(09)04815-0.
Weber et al., A modular cloning system for standardized assembly of multigene constructs. PLoS One. Feb. 18, 2011;6(2):el6765, 11 pages, doi: 10.1371/journal.pone.0016765.
Welch et al., Design Parameters to Control Synthetic Gene Expression in Escherichia coli. PLoS One. Sep. 2009;4(9):e7002, 10 pages.
Werner et al., Fast track assembly of multigene constructs using Golden Gate cloning and the MoClo system. Bioeng Bugs. Jan. 1, 2012;3(1):38-43. doi: 10.1371/journal.pone.0016765. Epub Jan. 1, 2012.
Widmaier et al., Engineering the Salmonella type III secretion system to export spider silk monomers. Mol Syst Biol. 2009;5:309, 9 pages.
Wootton et al., Statistics of local complexity in amino acid sequences and sequence databases. Computers & Chemistry. Jun. 1993;17(2):149-63.
Yoshida et al., Atmospheric dinitrogen fixation in the flooded rice rhizosphere as determined by the N-15 isotope technique. Soil Sci Plant Nutr. 1980;26(4):551-9. doi: 10.1080/00380768.1980.10431242.
Young et al., Relationships between corn plants and nitrogen fixing bacteria on an organic farm. Ceres Trust. Dec. 31, 2012. 9 pages.
Zehr et al., New Nitrogen-Fixing Microorganisms Detected in Oligotrophic Oceans by Amplification of Nitrogenase (nifH) Genes. Appl Environ Microbial. Sep. 1998;64(9):3444-50.
Zhang et al., GlnD is Essential for NifA Activation, NtrB/NtrC-Regulated Gene Expression, and Posttranslational Regulation of Nitrogenase Activity in the Photosynthetic, Nitrogen-Fixing Bacterium Rhodospirillum rubrum. J Bacteriol. Feb. 2005;187(4):1254-65.
Zhang et al., Involvement of the ammonium transporter AmtB in nitrogenase regulation and ammonium excretion in Pseudomonas stutzeri A 1501. Res Microbial. 2012;163:332-9.
Brazilian Office Action dated Mar. 10, 2020 for Application No. BR112018006800-4.
European Office Action dated Oct. 16, 2019 for Application No. EP 16854192.8.
Clancy et al., The Domains Carrying the Opposing Activities in Adenylyltransferase are Separated by a Central Regulatory Domain. FEBS J. 2007;274(11):2865-77.
U.S. Appl. No. 14/440,183, filed May 1, 2015, Zhao et al.
U.S. Appl. No. 15/954,557, filed Apr. 16, 2018, Temme et al.
U.S. Appl. No. 15/954,558, filed Apr. 16, 2018, Temme et al.
EP 16854192.8, Feb. 20, 2019, Extended European Search Report.
PCT/US2016/055429, Dec. 30, 2016, International Search Report and Written Opinion.
PCT/US2016/055429, Apr. 19, 2018, International Preliminary Report on Patentability.
Andrianantoandro et al., Synthetic biology: new engineering rules for an emerging discipline. Mol Syst Biol. 2006;2:2006.0028. doi: 10.1038/msb4100073. Epub May 16, 2006.
Batista et al., Manipulating nitrogen regulation in diazotrophic bacteria for agronomic benefit. Biochem Soc Trans. Apr. 1, 2019;47(2):603-14.
Biggins et al., Metabolites from the induced expression of cryptic single operons found in the genome of Burkholderia pseudomallei. J Am Chem Soc. Feb. 16, 2011; 133(6):1638-41. doi: 10.1021/jal087369. Epub Jan. 19, 2011.
Bonde et al., MODEST: a web-based design tool for oligonucleotide-mediated genome engineering and recombineering. Nucleic Acids Res. Jul. 2014;42(Web Server issue):W408-15. doi: 10.1093/nar/gku428. Epub May 16, 2014.
Brandl et al., Salmonella interactions with plants and their associated microbiota. Phytopathology. Apr. 2013;103(4):316-25. doi: 10.1094/PHYTO-11-12-0295-RVW.
Burris, Nitrogenases. J Biol Chem. May 25, 1991;266(15):9339-42.
Cardinale et al., Contextualizing context for synthetic biology identifying causes of failure of synthetic biological systems. Biotechnol. J. 7:856-866 (2012).
Carr et al., Enhanced multiplex genome engineering through co-operative oligonucleotide coselection. Nucleic Acids Res., 2012, 40(17):el32.
U.S. Appl. No. 17/204,219, filed Mar. 17, 2021, Zhao et al.
[No Author Listed] GM Crop Database. Center for Environmental Risk Assessment (CERA), 2010, retrieved from <http://ucbiotech.org/biotech_info/PDFs/Center_for_Environmental_Risk_Assessment_CERA_2011_GM_Crop_Database.pdf>, 1 page.
[No Author Listed] Escherichia coli as a Model Organism and Its Application in Biotechnology. IntechOpen, 2020, retrieved on Mar. 31, 2020, retrieved from https://www.intechopen.com/books/-i-escherichia-coli-i-recent-advances-on-physiology-pathogenesis-and-biotechnological-applications/-i-escherichi%E2%80%A6>, 15 pages.
[No Author Listed] Zehr Lab NifH database, retrieved from URL <https://wwwzehr.pmc.ucsc.edu/nifH_Database_Public/>, Apr. 4, 2014, 1 page.
Amalraj et al., Effect of Polymeric Additives, Adjuvants, Surfactants on Survival, Stability and Plant Growth Promoting Ability of Liquid Bioinoculants. J. Plant Physiol Pathol, 2013, 1:2, 6 pages.
Ambrosio et al., Metabolic engineering of a diazotrophic bacterium improves ammonium release and biofertilization of plants and microalgae, Metab Eng., Mar. 2017, 40:59-68.
Anderson et al., BglBricks: A flexible standard for biological part assembly, J Biological Engineering, 2010;4:1, 12 pages.
Arnold et al., Nucleotide sequence of a 24,206-base-pair DNA fragment carrying the entire nitrogen fixation gene cluster of Klebsiella pneumoniae. J Mol Biol. Oct. 5, 1988;203(3):715-38. doi: 10.1016/0022-2836(88)90205-7.
Arriel-Elias et al., Shelf life enhancement of plant growth promoting rhizobacteria using a simple formulation screening method. African J Microbiology Research, Feb. 2018, 12(5):115-126.
Ausubel et al., Glutamine Synthetase Mutations Which Affect Expression of Nitrogen Fixation Genes in Klebsiella pneumoniae, J Bacteriol, Nov. 1979, 140(2):597-606.
Batzer et al., Enhanced evolutionary PCR using oligonucleotides with inosine at the 3'-terminus. Nucleic Acids Res. Sep. 2, 19915;19(18):5081. doi: 10.1093/nar/19.18.5081.
Baum et al., Control of coleopteran insect pests through RNA interference, Nature Biotechnology, Nov. 2007, 25(11): 1322-1326.
Bayer et al., Synthesis of methyl halides from biomass using engineered microbes. J Am Chem Soc. May 1, 20093;131(18):6508-15. doi: 10.1021/ia809461u.
Bender et al., Regulatory mutations in the Klebsiella aerogenes structural gene for glutamine synthetase, J Bacteriol., Oct. 1977, 132(1):100-105.
Benyon et al., The nif promoters of Klebsiella pneumoniae have a characteristic primary structure. Cell. Sep. 1983;34(2):665-71. doi: 10.1016/0092-8674(83)90399-9.
Berninger et al., Maintenance and assessment of cell viability in formulation of non-sporulating bacterial inoculants. Microb. Biotechnol., Mar. 2018, 11(2):277-301 (2018); doi: 10.1111/1751-7915.12880.
Bittner et al., RpoS and RpoN are involved in the growth-dependent regulation of rfaH transcription and 0 antigen expression in Salmonella enterica serovar typhi, Microbial Pathogenesis, Jan. 2004;36(1): 19-24.
Bloch et al., Biological nitrogen fixation in maize: optimizing nitrogenase expression in a rootassociated diazotroph. J Experimental Botany, Jul. 2020, 71(15):4591-4603.
Bosmans et al., Sea anemone venom as a source of insecticidal peptides acting on voltage-gated Na+ channels, Toxicon, Mar. 2007, 49(4):550-560.
Boyle et al., Tools for genome-wide strain design and construction. Curr Opin Biotechnol. Oct. 2012;23(5):666-71.
Buck et al., Frameshifts close to the Klebsiella pneumoniae nifH promoter prevent multicopy inhibition by hybrid nifH plasmids. Mol Gen Genet. May 1987;207(2-3):492-8. doi: 10.1007/BF00331620.
Buckley et al., NifH Sequence Database. Retrieved from <https://blogs.cornell.edu/buckley/nifh-sequence-database/>. Buckley Lab. Available on or before Jan. 10, 2018. 2 pages.
Chakroun et al., Bacterial Vegetative Insecticidal Proteins (Vip) from Entomopathogenic Bacteria, Microbiol Mol Biol Rev., Mar. 2016, 80(2):329-50.
Chen et al., Characterization of 582 natural and synthetic terminators and quantification of their design constraints, Nat. Methods, 2013, 10:659-664.
Chin, Programming and engineering biological networks, Curr Opin Struct Biol 16:551-556 (2006).
Colby, Calculating Synergistic and Antagonistic Responses of Herbicide Combinations. Weeds, Jan. 1967, 15(1):20-22, 4 pages.
Colebatch et al., Symbiotic nitrogen fixation research in the postgenomics era, New Phytologist., 2002, 153(1):37-42.
Compant et al., A review on the plant microbiome: Ecology, functions, and emerging trends in microbial application, Journal of Advanced Research, Sep. 2019, 19:29-37.
Costerton et al., Microbial Biofilms. Annu. Rev. Microbial., Oct. 1995, 49:711-745.
Crameri et al., Molecular evolution of an arsenate detoxification pathway by DNA shuffling, Nat. Biotechnol., 1997, 15:436-438.
Crickmore et al., Revision of the Nomenclature for the Bacillus thuringiensis Pesticidal Crystal Proteins, Microbiol Mol Biol Rev., Sep. 1998, 62(3):807-813.
Crook et al., Re-engineering multicloning sites for function and convenience, Nucl. Acids Res., 2011, 39:e92, 10 pages.
Czar et al., Gene synthesis demystified, Trends Biotechnol, 2009, 27(2):63-72.
Da Silva et al., Survival of endophytic bacteria in polymer-based inoculants and efficiency of their aplication to sugarcane/Plant Soil, May 2012, 356:231-243.
Dandekar et al., Conservation of gene order: a fingerprint of proteins that physically interact, Trends Biochem. Sci., 1998, 23:324-328.
Das et al., Microbial assay of N2 fixation rate, a simple alternate for acetylene reduction assay, MethodsX, 2018, 5:909-914.
Davin-Regli et al., Enterobacter aerogenes and Enterobacter cloacae; versatile bacterial pathogens confronting antibiotic treatment, Front Microbiol, 2015, 6:392, 10 pages.
De Freitas, Yield and N assimilation of winter wheat (Triticum aestivum L., var. Norstar) inoculated with rhizobacteria, Pedobiologia, Jan. 2000, 44(2):97-104.
De Raad et al., A solid-phase platform for combinatorial and scarless multipart gene assembly, ACS Synth. Biol., 2013, 2:316-326.
Dykxhoorn et al., A set of compatible tac promoter expression vectors, Gene, 1996, 177(1-2):133-136.
Endy et al., Foundations for engineering biology, Nature, 2005, 438:449-453.
Enkh-Amgalan et al., Molecular evolution of the nif gene cluster carrying nifl1 and nifl2 genes in the Gram-positive phototrophic bacterium Heliobacterium chlorum, International Journal of Systematic and Evolutionary Microbiology, 2006, 56:65-74.
Estrem et al., Identification of an UP element consensus sequence for bacterial promoters, PNAS, 1998, 95(11):9761-9766.
Eyraud et al., Expression and Biological Activity of the Cystine Knot Bioinsecticide PA1b (Pea Albumin 1 Subunit b), PLOS One, Dec. 2013, 8(12):e81619, 9 pages.
Fani et al., Molecular evolution of nitrogen fixation: the evolutionary history of the niID, nifK, nifE, and nifN gene, J Mol Evol., 2000;51(1):1-11.
Feher et al., In the fast lane: large-scale bacterial genome engineering, J Biotechnol., Jul. 2012, 160(1-2):72-9.
Fischbach et al., The evolution of gene collectives: how natural selection drives chemical innovation, Proc. Natl. Acad. Sci. USA, 2008, 105:4601-4608.
Fontana et al., RNA folding and combinatory landscapes, Phys. Rev. E., 1993, 47:2083-2099.
Forner et al., Treatment of hepatocellular carcinoma, Crit Rev Oncol Hematol., Nov. 2006, 60(2):89-98.
Gaby et al., A comprehensive aligned nifH gene database: a multipurpose tool for studies of nitrogen- fixing bacteria, Database, 2014, 2014:bau001, 8 pages.
Gamer et al., A T7 RNA polymerase-dependent gene expression system for Bacillus megaterium, Appl Micro Biol Biotechnol., Apr. 2009, 82(6): 1195-203.
GenBank Accession No. CP007215.3, Kosakonia sacchari SP1 chromosome, complete genome, Sep. 19, 2017, 729 pages.
GenBank Accession No. CP016337.1 Kosakonia sacchari strain BO-1 chromosome, complete Genome. Jul. 11, 2016, 1119 pages.
Georg et al., cis-antisense RNA, another level of gene regulation in bacteria, Microbiol Mol Biol Rev, 2011,75(2):286-300.
Gibson et al., Chemical synthesis of the mouse mitochondrial genome, Nat. Methods, 2010, 7:901-903.
Gibson et al., Enzymatic assembly of DNA molecules up to several hundred kilobases, Nat. Methods, 2009, 6(5):343-345.
Gosink et al., The product of the Klebsiella pneumoniae nifX gene is a negative regulator of the nitrogen fixation (nit) regulon, J Bacteriology, 1990, 172(3):1441-1447.
Gottelt et al., Deletion of a regulatory gene within the cpk gene cluster reveals novel antibacterial activity in Streptomyces coelicolor A3(2), Microbiology, 2010, 156:2343-2353.
Guell et al., Bacterial transcriptomics: what is beyond the RNA horiz-ome?, Nature reviews Microbiology, 2011, 9(9):658-669.
Guell et al., Transcriptome complexity in a genome-reduced bacterium, Science, 2009, 326:1268-1271.
Hernandez et al., Biochemical analysis of the recombinant Fur (ferric uptake regulator) protein from Anabaena PCC 7119: factors affecting its oligomerization state, Biochem J., 2002, 366:315-322.
Hoeschle-Zeledon et al., Regulatory challenges for biological control.The CGIAR Systemwide Program on Integrated Pest Management, Jan. 2013, SP-IPM Secretariat, International Institute Tropical Agriculture (IITA), Ibadan, Nigeria, 53 pages.
Hu et al., Assembly of nitrogenase MoFe protein, Biochemistry, 2008, 47(13):3973-3981.
Huynen et al., Smoothness within ruggedness: the role of neutrality in adaptation, Proc. Natl. Acad. Sci. USA, 1996, 93:397-401.
Iber, A quantitative study of the benefits of co-regulation using the spoIIA operon as an example, Mol. Sys. Biol., 2006, 2:1-6.
Idalia et al., Escherichia coli as a model organism and its application in biotechnology, Recent Advances on Physiology, Pathogenesis, and Biotechnological Applications, Chapter 13, 2017, pp. 253-274.
Ishihama, Prokaryotic genome regulation: multifactor promoters, multitarget regulators and hierarchic networks, FEMS Microbial Rev, 2010, 34(5):628-645.
Ivanova et al., Artificial Regulation of Genes, Of the coding proteins of the nitrogenase complex Rhizobial bacteria, Natural Sciences, 2014, 13(174):36-39.
Izquierdo et al., Distribution of Extensive nifH Gene Diversity Across Physical Soil Microenvironments, Microbial Ecology, 2006, 51(4):441-452.
Jacob et al., Solid-state NMR studies of Klebsiella pneumoniae grown under nitrogen-fixing conditions, J Biol Chem, 1987, 262(1):254-259.
Jacoby et al., The Role of Soil Microorganisms in Plant Mineral Nutrition-Current Knowledge and Future Directions, Frontiers in Plant Scients, 2017, 8(19):1-19.
Jahn et al., Extraction of Extracellular Polymeric Substances (EPS) from Biofilms Using a Cation Exchange Resin. Wat. Sci. Tech., 1995, 32(8):157-164.
Janczarek et al., Multiple copies of rosR and pssA genes enhance exopolysaccharide production, symbiotic competitiveness and clover nodulation in Rhizobium leguminosarum bv. trifolii, Antonie Van Leeuwenhoek, Nov. 2009, 96(4):471-86.
Jensen, The Escherichia coli K-12 wild types W3110 and MG1655 have an rph frameshift mutation that leads to pyrimidine starvation due to low pyre expression levels, J. Bacteriol., 1993, 175:3401-3407.
Johnson et al., Properties of overlapping genes are conserved across microbial genomes, Genome Res, 2004, 14(11):2268-2272.
Joseph et al., Recent developments of the synthetic biology toolkit for Clostridum, Frontiers in microbology, 2018, 9(154):1-13.
Kabaluk et al., The use and regulation of microbial pesticides in representative jurisdictions Worldwide. IOBC Global, 2010, 99 pages.
Kalir et al., Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria, Science, 2001, 292(5524):2080-2083.
Kaneko et al., Complete genomic structure of the cultivated rice endophyte Azospirillum sp. B510, DNA Res., 2010, 17:37-50.
Katsnelson, Engineered bacteria could boost corn yields: Gene-edited microbe offer continuous nitrogen fixation, Chemical & Engineering News, Dec. 28, 2021, retrieved from URL https://cen.acs.org/food/agriculture/Engineered-bacteria-boost-corn-yields/99/web/2021/12>, 3 pages.
Kececiglu et al., Of mice and men: Algorithms for evolutionary distances between genomes with translocation, SODA: Proceedings of the sixth annual ACM-SIAM symposium on Discrete algorithms, 1995, 10 pages.
Kelly et al., Measuring the activity of BioBrick promoters using an in vivo reference standard, J Biol Eng, 2009, 3:4, 13 pages.
Kim et al., “A 20 nucleotide upstream element is essential for the nopaline synthase (nos) promoter activity,” Plant Mol Biol., Jan. 1994, 24(1): 105-17.
King et al., Spider-Venom Peptides: Structure, Pharmacology, and Potential for Control of Insect Pests, Annu. Rev. Entomol., 2013, 58:475-96.
Kingsford et al., Rapid, accurate, computational discovery of Rho-independent transcription terminators illuminates their relationship to DNA uptake, Genome Bio. 2007, 8(2):R22, 12 pages.
Kitano, Systems biology: a brief overview, Science, 2002, 295(5560): 1662-1664.
Klose et al., “Glutamate at the site of phosphorylation of nitrogen-regulatory protein NTRC mimics aspartyl-phosphate and activates the protein,” J Mol Biol., Jul. 1993, 232(1):67-78.
Knight, Idempotent Vector Design for Standard Assembly of Biobricks, MIT Artificial Intelligence Laboratory, The TTL Data Book for Design Engineers, 2003, 11 pages.
Kovacs et al., Stochasticity in protein levels drives colinearity of gene order in metabolic operons of Escherichia coli, PLoS Biol, 2009, 7(5):e1000115, 9 pages.
Kranz et al., “Ammonia-constitutive nitrogen fixation mutants of Rhodobacter capsulatus,” Gene, Nov. 1988, 71(1):65-74.
Kumar et al., Metabolic regulation of Escherichia coli and its gdhA, glnL, gltB, D mutants under different carbon and nitrogen limitations in the continuous culture, Microbial Cell Factories, Jan. 2010, 9(8): 1-17.
Lenski et al., Effects of Segregation and Selection on Instability of Plasmid pACYC184 in Escherichia coli B, Journal of Bacteriology, Nov. 1987, 169(11):5314-5316.
Levican et al., Comparative genomic analysis of carbon and nitrogen assimilation mechanisms in three indigenous bioleaching bacteria: predictions and validations, BMC Genomics, 2008, 9:581, 19 pages.
Levin-Karp et al., Quantifying translational coupling in E. coli synthetic operons using RBS modulation and fluorescent reporters, ACS Synth. Biol., 2013, 2:327-336.
Li et al., “Human Enhancers Are Fragile and Prone to Deactivating Mutations,” Mol Biol Evol., Aug. 2015, 32(8):2161-80.
Lin et al., PC, a Novel Oral Insecticidal Toxin from Bacillus bombysepticus Involved in Host Lethality via APN and BtR-175, Scientific Reports, Jun. 2015, 5:11101, 14 pages.
Liu et al., Phenazine-1-carboxylic acid biosynthesis in Pseudomonas Chlororaphis GP72 is positively regulated by the sigma factor RpoN, World J Microbiology and Biotechnology, Jan. 2008, 24(9): 1961-1966.
Lombo et al., The mithramycin gene cluster of Streptomyces argillaceus contains a positive regulatory gene and two repeated DNA sequences that are located at both ends of the cluster, J. Bacterial., 1999, 181:642-647.
Lowman et al., Strategies for enhancement of switchgrass (Panicum virgatum L.) performance under limited nitrogen supply based on utilization of N-fixing bacterial endophytes. Plant and Soil, Aug. 2016, 405(1):47-63, 17 pages.
Lucks et al., Toward scalable parts families for predictable design of biological circuits, Curr. Opin. Microbiol., 2008, 11:567-573.
Ma et al., Effect of nicotine from tobacco root exudates on chemotaxis, growth, biocontrol efficiency, and colonization byPseudomonas aeruginosaNXHG29, Antonie van Leeuwenhoek, 2018, 111(7):1237-1257.
Mabrouk et al., Chapter 6: Potential of Rhizobia in Improving Nitrogen Fixation and Yields of Legumes, Symbiosis, May 30, 2018, IntechOpen, pp. 1-16, retrieved on Jan. 12, 2021, retrieved from URL<https://www.intechopen.com/books/symbiosis/potential-of-rhizobia-in-improving-B351nitrogen-fixation-and-yields-of-legumes> 2 pages, Abstract.
Magasanik, Genetic control of nitrogen assimilation in bacteria, Ann. Rev. Genet, 1982, 16:135-68.
Mandal et al., Gene regulation by riboswitches, Nat Rev Mol Cell Biol, 2004, 5(6):451-463.
Mao et al., Silencing a cotton bollworm P450 monooxygenase gene by plant-mediated RNAi impairs larval tolerance of gossypol. Nature Biotechnology, Nov. 2007, 25(11): 1307-1313.
Martinelli et al., Structure-function studies on jaburetox, a recombinant insecticidal peptide derived from jack bean (Canavalia ensiformis) urease, Biochimica et Biophysica Acta, Mar. 2014, 1840(3):935-44.
Mason et al., Cryptic Growth in Klebsiella-Pneumoniae, Appl Microbiol Biot, 1987, 25(6):577-584.
Medema et al., Exploiting plug-and-play synthetic biology for drug discovery and production in microorganisms, Nat. Rev. Microbiol., 2011, 9:131-137.
Mirzahoseini et al., Heterologous Proteins Production in Escherichia coli: An Investigation on the Effect of Codon Usage and Expression Host Optimization, Cell Journal (Yakhteh), Dec. 2011, 12(4):453, 7 pages.
Miyazaki, Creating random mutagenesis libraries by megaprimer PCR of whole plasmid (MEGA WHOP), Methods Mol Biol, 2003, 231:23-28.
Mus et al., “Diazotrophic Growth Allows Azotobacter vinelandii To Overcome the Deleterious Effects of a glnE Deletion,” Appl Environ Microbiol., Jun. 2017, 83(13):e00808-17.
Muse et al., The nac (Nitrogen Assimilation Control) Gene from Escherichia coli, J Bacteriology, Mar. 1998, 180(5):1166-1173.
Mutalik et al., Quantitative estimation of activity and quality for collections of functional genetic elements, Nat. Methods, 2013, 10:347-353.
Nagy et al., Nanofibrous solid dosage form of living bacteria prepared by electrospinning. eXPRESS Polymer Letters, 2014, 8(5):352-361.
Naimov et al., Solubilization, Activation, and Insecticidal Activity of Bacillus thuringiensis Serovar thompsoni HD542 Crystal Proteins, Applied and Environmental Microbiology, Dec. 2008, 74(23):7145-7151.
Nielsen et al., Conceptual model for production and composition of exopolymers in biofilms. Wat. Sci. Tech., 1997, 36(1): 11-19.
Nielsen et al., Extraction of EPS. Wingender et al. (eds.), Microbial Extracellular Polymeric Substances, 1999, 24 pages.
Noskov et al., Assembly of large, high G+C bacterial DNA fragments in yeast, ACS Synth. Biol., 2012, 1:267-273.
Oh et al., Organization of nif gene cluster in Frankia sp. EuIK1 strain, a symbiont of Elaeagnus umbellata, Arch. Microbiol., 2012, 194:29-34.
Ohta et al., Associative N2-fixation of rice with soiol microorganisms. Soil and Microorganisms. 1985;27:17-27.
Ohtsuka et al., An alternative approach to deoxyoligonucleotides as hybridization probes by insertion of deoxyinosine at ambiguous codon positions, J. Biol. Chem., 1985, 260:2605-2608.
Orme-Johnson, Molecular basis of biological nitrogen fixation, Annu Rev Biophys Biophys Chem, 1985, 14:419-459.
Ortiz-Marquez et al., Association with an Ammonium-excreting bacterium allows diazotrophic culture of oil-rich Eukaryotic microalagae, Appl. Microbial., 2012, 78(7):2345-2352.
Pakula et al., “Genetic analysis of protein stability and function,” Annu Rev Genet, 1989, 23:289-310.
Parker et al., Pore-forming protein toxins: from structure to function, Progress in Biophysics & Molecular Biology, 2005, 88:91-142.
Patil et al., Liquid formulations of Acetobacter diazotrophicus L 1 and Herbaspirillum seropedicae J24 and their field trials on wheat. International J Environmental Science, 2012, 3(3): 1116-1129, 4 pages (Abstract Only).
Philippe et al., Improvement of pCVD442, a suicide plasmid for gene allele exchange in bacteria, Plasmid, 2004, 51(3):246-255.
Pickens et al., Metabolic engineering for the production of natural products, Annu. Rev. Chem. Biomol. Eng., 2011, 2:211-236.
Poliner et al., Nontransgenic Marker-Free Gene Disruption by an Episomal CRISPR System in the Oleaginous Microalga, Nannochloropsis oceanica CCMP1779. Plant J. Jul. 2019; 99(1): 112-127.
Price et al., Operon formation is driven by coregulation and not by horizontal gene transfer, Genome Res., 2005, 15:809-819.
Price et al., The life-cycle of operons, PLoS Genet., 2006, 2:e96, 15 pages.
Purcell et al., Cholesterol oxidase: a potent insecticidal protein active against boll weevil larvae, Biochem Biophys Res Commun, Nov. 1993, 196(3): 1406-13.
Purnick et al., The second wave of synthetic biology: from modules to systems, Nat Rev Mol Cell Biol, 2009, 10(6):410-422.
Pyne et al., Coupling the CRISPR/Cas9 System with Lambda Red Recombineering Enables Simplified Chromosomal Gene Replacement in Escherichia coli, Applied and Environmental Microbioloy, Aug. 2015, 81(15):5103-5144.
Qaim et al., Yield Effects of Genetically Modified Crops in Developing Countries. Science, Feb. 2003, 299(5608):900-2.
Rakhee et al., Extracellular polymeric substances of the marine fouling diatom amphora rostrata Wm.Sm. Biofouling, 2001, 17(2):117-127, 12 pages.
Ramirez et al., Burkholderia and Paraburkholderia are Predominant Soybean Rhizobial Genera in Venezuelan Soils. Different Climatic and Topographical Regions, Microbes and Environments. Mar. 2019, 34(1):43-58.
Ramon et al., Single-step linker-based combinatorial assembly of promoter and gene cassettes for pathway engineering. Biotechnol. Lett., 2011, 33:549-555.
Resendis-Antonio et al., Systems biology of bacterial nitrogen fixation: High-throughput technology and its integrative description with constraint-based modeling. BMC Syst Biol., 2011,5:120, 15 pages.
Riedel et al., Nitrogen fixation by Klebsiella pneumoniae is inhibited by certain multicopy hybrid nif plasmids. J Bacterial, 1983, 153(1):45-56.
Robledo et al., Rhizobium cellulase CelC2 is essential for primary symbiotic infection of legume host roots. Proc Natl Acad Sci USA, May 2008, 105(19):7064-9.
Robledo et al., Role of Rhizobium endoglucanase CelC2 in cellulose biosynthesis and biofilm formation on plant roots and abiotic surfaces. Microb Cell Fact., Sep. 2012, 11:125, 12 pages.
Robson et al., Azotobacter Genomes: The Genome of Azotobacter chroococcum NCIMB 8003 (ATCC 4412). Plos One. 2015; 10(6): e0127997.
Rojas-Tapias et al., Preservation of Azotobacter chroococcum vegetative cells in dry polymers. Univ. Sci., 2015, 20(2):201-207.
Rong et al., Promoter specificity determinants of T7 RNA polymerase, Proc. Natl. Acad. Sci. USA, 1998;95(2):515-519.
Rossolini et al., Use of Deoxyinosine-Containing Primers vs Degenerate Primers for Polymerase Chain Reaction Based on Ambiguous Sequence Information, Mol. Cell. Probes, 1994, 8:91-98.
Rubio et al., Maturation of Nitrogenase: a Biochemical Puzzle. J. Bacteriology, 2005; 187(2):405-414.
Ryu et al., Control of nitrogen fixation in bacteria that associate with cereals. Nat. Microbiol., Feb. 2020, 5(2):314-330, 31 pages.
Saleh et al., Involvement of gacS and rpoS in enhancement of the plant growth-promoting capabilities of Enterobacter cloacae CAL2 and UW4, Canadian Journal of Microbiology, Aug. 2001, 47(8):698-705.
Salis et al., Automated design of synthetic ribosome binding sites to control protein expression. Nat Biotechnol, 2009, 27(10):946-950.
Sanahuja et al., Bacillus thuringiensis: a century of research, development and commercial applications. Plant Biotechnology J, Apr. 2011, 9(3):283-300.
Sandoval et al., Strategy for directing combinatorial genome engineering in Escherichia coli, Proc Natl Acad Sci USA, Jun. 2012, 109(26): 10540-5.
Sanjuan et al., Multicopy plasmids carrying the klebsiella pneumoniae nifA gene enhance rhizobium meliloti nodulation competitiveness on alfalfa. Mol Plant Microbe Int. 1991;4(4):365-9.
Sanyal et al., The etiology of hepatocellular carcinoma and consequences for treatment. Oncologist, 2010, 15(Suppl 4):14-22.
Schmidt-Dannert et al., Molecular breeding of carotenoid biosynthetic pathways. Nat. Biotechnol., 2000, 18:750-753.
Schuler et al., Insect-resistant transgenic plants, Trends in Biotechnology, Apr. 1998, 16(4):168-175.
Schuler et al., Potential side effects of insect-resistant transgenic plants on arthropod natural Enemies. Trends Biotechnol., May 1999, 17(5):210-216.
Shetty et al., Engineering BioBrick vectors from BioBrick parts. J Biol Eng, 2008, 2:5, 12 pages.
Sibold et al., Constitutive expression of nitrogen fixation (nif) genes of Klebsiella pneumoniae due to a DNA duplication. Embo J., 1982, 1(12):1551-8.
Simon et al., Perturbation of niff expression in Klebsiella pneumoniae has limited effect on nitrogen fixation, J Bacteriol, 1996, 178(10):2975-2977.
Singer et al., “Genes and Genomes,” Moscow: Mir, 1998, 1:33, 4 pages (with machine translation).
Sivaraman et al., Codon choice in genes depends on flanking sequence information—implications for theoretical reverse translation, Nucleic Acids Res, 2008, 36(3):e16, 8 pages.
Sleight et al., Randomized BioBrick assembly: a novel DNA assembly method for randomizing and optimizing genetic circuits and metabolic pathways, ACS Synth. Biol., 2013, 2(9):506-518.
Smanski et al., Engineered Streptomyces platensis strains that overproduce antibiotics platensimycin and platencin, Antimicrob. Agents Chemother., 2009, 53:1299-12304.
Sorek et al., Prokaryotic transcriptomics: a new view on regulation, physiology, and pathogenicity, Nat. Rev. Genet., 2010, 11:9-16.
Staron et al., The Third Pillar of Bacterial Signal Transduction: Classification of the Extracytoplasmic Function (ECF) Sigma Factor Protein Family, Mol Microbiol, 2009, 14(3): 557-81.
Stewart et al., In situ studies on nitrogen fixation with the acetylene reduction technique, Science, 1967, 158(3800):536.
Stucken et al., The smallest known genomes of multicellular and toxic cyanobacteria: comparison, minimal gene sets for linked traits and the evolutionary implications, PLoS One, 2010, 5:e9235, 15 pages.
Suh et al., Functional expression of the FeMo-cofactor-specific biosynthetic genes nifEN as a NifE-N fusion protein synthesizing unit in Azotobacter vinelandii, Biochem. Biophys. Res. Comm., 2002, 299:233-240.
Suzuki et al., Immune-mediated motor polyneuropathy after hematopoietic stem cell transplantation, Bone Marrow Transplant., Aug. 2007, 40(3):289-91.
Tamsir et al., Robust multicellular computing using genetically encoded NOR gates and chemical ‘wires’, Nature, 2011, 469(7329):212-215.
Tan, A synthetic biology challenge: making cells compute, Mol Biosyst, 2007, 3:343-353.
Temme et al., Induction and relaxation dynamics of the regulatory network controlling the type III secretion system encoded within Salmonella pathogenicity island 1, J Mol Biol, 2008, 377(1):47-61.
Thiel et al., Characterization of genes for a second Modependent nitrogenase in the cyanobacterium Anabaena variabilis, J. Bact., 1997, 179:5222-5225.
Tijssen, Laboratory Techniques In Biochemistry And Molecular Biology, Elsevier, 1993, 24:65 pages.
Uozumi et al., Cloning and Expression of the nif A Gene of Klebsiella oxytoca in K. pneumoniae and Azospirillum lipoferum, Agricultural and Biological Chemistry, 1986, 50(6): 1539-1544.
Van Dongen, Performance criteria for graph clustering and Markov cluster experiments, CWI, 2000, 36 pages.
Van Heeswijk et al., Nitrogen Assimilation in Escherichia coli: Putting Molecular Data into a Systems Perspective, Microbiology and Molecular Biology Reviews, Dec. 2013, 77(4):628-695.
Villalobos et al., Gene Designer: a synthetic biology tool for constructing artificial ONA segments, BMC Bioinformatics, 2006, 7:285, 8 pages.
Wang et al., Biofilm formation enables free-living nitrogen-fixing rhizobacteria to fix nitrogen under aerobic conditions. ISME Journal, Jul. 2017, 11:1602-1613.
Wang et al., Ligand-inducible and liver-specific target gene expression in transgenic mice, Nat Biotechnol., Mar. 1997, 15(3):239-43.
Wang et al., Positive and negative regulation of gene expression in eukaryotic cells with an inducible transcriptional regulator, Gene Ther., May 1997, 4(5):432-441.
Wang et al., Programming cells by multiplex genome engineering and accelerated evolution, Nature, Aug. 2009, 460(7257):894-8.
Wang et al., Roles of poly-3-hydroxybutyrate (PHB) and glycogen in symbiosis of Sinorhizobium meliloti with Medicago sp. Microbiology, Feb. 2007, 153(2):388-398.
Watanabe et al., Total biosynthesis of antitumor nonribosomal peptides in Escherichia coli, Nature Chemical Biology, 2006, 2:423-428.
Wei et al., Endophytic nitrogen-fixing Klebsiella variicola strain DX120E promotes sugarcane growth, Biology and fertility of soils, 2014, 50:657-666.
Wells, Additivity of mutational effects in proteins, Biochemistry, 1990, 29:8509-8517.
Wen et al., Enabling Biological Nitrogen Fixation for Cereal Crops in Fertilized Fields. ACS Synth. Biol., Dec. 2021, 10(12):3264-3277.
Wenzel et al., Recent developments towards the heterologous expression of complex bacterial natural product biosynthetic pathways, Curr Opin Biotechnol, 2005, 16(6):594-606.
Wimpenny et al., Community structure and co-operation in biofilms.59th Symposium of the Society for General Microbiology, Allison et al. (eds.), Sep. 2000, 23 pages.
Witkowski et al., Conversion of a β-Ketoacyl synthase to a malonyl decarboxylase by replacement of the active-site cysteine with glutamine. Biochemistry, Sep. 1999, 38(36):11643-50.
Woolbright et al., Novel insight into mechanisms of cholestatic liver injury, World J Gastroenterol., Sep. 2012, 18(36):4985-93.
Wu et al., Effects of biofertilizer containing N-fixer, P and K solubilizers and AM fungi on maize growth: a greenhouse trial. Geodernna, Mar. 2005, 125(1-2):155-166.
Wu et al., Multivariate modular metabolic engineering of Escherichia coli to produce resveratrol from L-tyrosine. J. Biotechnol., 2013, 167:404-411.
Wu et al., Root exudates from two tobacco cultivars affect colonization of Ralstonia solanacearum and the disease index. European J Plant Pathology. 2014;141(4):667-677.
Xie et al., Interaction between NifL and NifA in the nitrogen-fixing Pseudomonas stutzeri A1501. Microbiology (Reading), Dec. 2006, 152(Pt 12):3535-3542.
Xu et al., ePathBrick: a synthetic biology platform for engineering metabolic pathways in E. coli., ACS Synth. Biol, 2012, 1:256-266.
Yan et al., Global transcriptional analysis of nitrogen fixation and ammonium repression in rootassociated Pseudomonas stutzeri A1501, BMC Genomics, Jan. 2010, 11(11):1-13.
Yao et al., Complementation analysis of heterologous nifA genes to nifA mutants of Sinorhizobium pallida. Chinese Science Bulletin, Oct. 2006, 51(19):2258-2264, 2 pages.
Yarza et al., Uniting the classification of cultured and uncultured bacteria and archaea using 16S rRNA gene sequences, Nature Rev. Micro., 2014, 12:635-345.
Ye et al., Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction, BMC Bioinformatics., Jun. 2012, 13(134): 1-11.
Yokobayashi et al., Directed evolution of a genetic circuit, Proc Natl Acad Sci USA, 2002, 99(26):16587-16591.
Yu et al., Recombineering Pseudomonas protegens CHA0: An innovative approach that improves nitrogen fixation with impressive bactericidal potency.Microbiological Research, Jan. 2019,218:58-65.
Zaslaver et al., Optimal gene partition into operons correlates with gene functional order, Phys Biol, 2006, 3(3):183-189.
Zazopoulos et al., A genomics-guided approach for discovering and expressing cryptic metabolic pathways, Nat Biotechnol, 2003, 21(2):187-190.
Zhang et al., Mutagenesis and functional characterization of the four domains of G1nD, a bifunctional nitrogen sensor protein. J Bacteriology, Jun. 2010, 192(11):2711-2721.
Zhang et al., Mutagenesis and Functional Characterization of the glnB, glnA, and nifA Genes from the Photosynthetic Bacterium Rhodospirillum rubrum, Journal of Bacteriology, Feb. 2000, 182(4):983-992.
Zhao et al., Evidence for nifU and nifS participation in the biosynthesis of the iron-molybdenum cofactor of nitrogenase, J. Biol. Chem., 2007, 282(51):37016-37025.
Zomer et al., PPP: Perform Promoter Prediction, retrieved from URL <http://bioinformatics.biol.rug.nl/websoftware/ppp/ppp_start.php>, 2011, 2 pages.
Related Publications (1)
Number Date Country
20180290942 A1 Oct 2018 US
Provisional Applications (1)
Number Date Country
62237426 Oct 2015 US