Riboswitches, Structure-Based Compound Design with Riboswitches, and Methods and Compositions for Use of and with Riboswitches

Abstract
Riboswitches and modified versions of riboswitches can be employed as designer genetic switches that are controlled by specific effector compounds. Such effector compounds that activate a riboswitch are referred to herein as trigger molecules. The natural switches are targets for antibiotics and other small molecule therapies. In addition, the architecture of riboswitches allows actual pieces of the natural switches to be used to construct new non-immunogenic genetic control elements, for example the aptamer (molecular recognition) domain can be swapped with other non-natural aptamers (or otherwise modified) such that the new recognition domain causes genetic modulation with user-defined effector compounds. The changed switches become part of a therapy regimen-turning on, or off, or regulating protein synthesis. Newly constructed genetic regulation networks can be applied in such areas as living biosensors, metabolic engineering of organisms, and in advanced forms of gene therapy treatments. Compounds can be used to stimulate, active, inhibit and/or inactivate riboswitches. Atomic structures of riboswitches can be used to design new compounds to stimulate, active, inhibit and/or inactivate riboswitches.
Description
FIELD OF THE INVENTION

The disclosed invention is generally in the field of gene expression and specifically in the area of regulation of gene expression.


BACKGROUND OF THE INVENTION

Precision genetic control is an essential feature of living systems, as cells must respond to a multitude of biochemical signals and environmental cues by varying genetic expression patterns. Most known mechanisms of genetic control involve the use of protein factors that sense chemical or physical stimuli and then modulate gene expression by selectively interacting with the relevant DNA or messenger RNA sequence. Proteins can adopt complex shapes and carry out a variety of functions that permit living systems to sense accurately their chemical and physical environments. Protein factors that respond to metabolites typically act by binding DNA to modulate transcription initiation (e.g. the lac repressor protein; Matthews, K. S., and Nichols, J. C., 1998, Prog. Nucleic Acids Res. Mol. Biol. 58, 127-164) or by binding RNA to control either transcription termination (e.g. the PyrR protein; Switzer, R. L., et al., 1999, Prog. Nucleic Acids Res. Mol. Biol. 62, 329-367) or translation (e.g. the TRAP protein; Babitzke, P., and Gollnick, P., 2001, J. Bacteriol. 183, 5795-5802). Protein factors respond to environmental stimuli by various mechanisms such as allosteric modulation or post-translational modification, and are adept at exploiting these mechanisms to serve as highly responsive genetic switches (e.g. see Ptashne, M., and Gann, A. (2002). Genes and Signals. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.).


In addition to the widespread participation of protein factors in genetic control, it is also known that RNA can take an active role in genetic regulation. Recent studies have begun to reveal the substantial role that small non-coding RNAs play in selectively targeting mRNAs for destruction, which results in down-regulation of gene expression (e.g. see Hannon, G. J. 2002, Nature 418, 244-251 and references therein). This process of RNA interference takes advantage of the ability of short RNAs to recognize the intended mRNA target selectively via Watson-Crick base complementation, after which the bound mRNAs are destroyed by the action of proteins. RNAs are ideal agents for molecular recognition in this system because it is far easier to generate new target-specific RNA factors through evolutionary processes than it would be to generate protein factors with novel but highly specific RNA binding sites.


Although proteins fulfill most requirements that biology has for enzyme, receptor and structural functions, RNA also can serve in these capacities. For example, RNA has sufficient structural plasticity to form numerous ribozyme domains (Cech & Golden, Building a catalytic active site using only RNA. In: The RNA World R. F. Gesteland, T. R. Cech, J. F. Atkins, eds., pp. 321-350 (1998); Breaker, In vitro selection of catalytic polynucleotides. Chem. Rev. 97, 371-390 (1997)) and receptor domains (Osborne & Ellington, Nucleic acid selection and the challenge of combinatorial chemistry. Chem. Rev. 97, 349-370 (1997); Hermann & Patel, Adaptive recognition by nucleic acid aptamers. Science 287, 820-825 (2000)) that exhibit considerable enzymatic power and precise molecular recognition. Furthermore, these activities can be combined to create allosteric ribozymes (Soukup & Breaker, Engineering precision RNA molecular switches. Proc. Natl. Acad. Sci. USA 96, 3584-3589 (1999); Seetharaman et al., Immobilized riboswitches for the analysis of complex chemical and biological mixtures. Nature Biotechnol. 19, 336-341 (2001)) that are selectively modulated by effector molecules.


These properties of RNA are consistent with speculation (Gold et al., From oligonucleotide shapes to genomic SELEX: novel biological regulatory loops. Proc. Natl. Acad. Sci. USA 94, 59-64 (1997); Gold et al., SELEX and the evolution of genomes. Curr. Opin. Gen. Dev. 7, 848-851 (1997); Nou & Kadner, Adenosylcobalamin inhibits ribosome binding to btuB RNA. Proc. Natl. Acad. Sci. USA 97, 7190-7195 (2000); Gelfand et al., A conserved RNA structure element involved in the regulation of bacterial riboflavin synthesis genes. Trends Gen. 15, 439-442 (1999); Miranda-Rios et al., A conserved RNA structure (thi box) is involved in regulation of thiamin biosynthetic gene expression in bacteria. Proc. Natl. Acad. Sci. USA 98, 9736-9741 (2001); Stormo & Ji, Do mRNAs act as direct sensors of small molecules to control their expression? Proc. Natl. Acad. Sci. USA 98, 9465-9467 (2001)) that certain mRNAs might employ allosteric mechanisms to provide genetic regulatory responses to the presence of specific metabolites. Although a thiamine pyrophosphate (TPP)-dependent sensor/regulatory protein had been proposed to participate in the control of thiamine biosynthetic genes (Webb & Downs, Characterization of thiL, encoding thiamin-monophosphate kinase, in Salmonella typhimurium. J. Biol. Chem. 272, 15702-15707 (1997)), no such protein factor has been shown to exist.


Transcription of the lysC gene of B. subtilis is repressed by high concentrations of lysine (Kochhar, S., and Paulus, H. 1996, Microbiol. 142:1635-1639; Mäder, U., et al., 2002, J. Bacteriol. 184:4288-4295; Patte, J. C. 1996. Biosynthesis of lysine and threonine. In: Escherichia coli and Salmonella: Cellular and Molecular Biology, F. C. Neidhardt, et al., eds., Vol. 1, pp. 528-541. ASM Press, Washington, D.C.; Patte, J.-C., et al., 1998, FEMS Microbiol. Lett. 169:165-170), but no protein factor had been identified that served as the genetic regulator (Liao, H.-H., and Hseu, T.-H. 1998, FEMS Microbiol. Lett. 168:31-36). The lysC gene encodes aspartokinase II, which catalyzes the first step in the metabolic pathway that converts L-aspartic acid into L-lysine (Belitsky, B. R. 2002. Biosynthesis of amino acids of the glutamate and aspartate families, alanine, and polyamines. In: Bacillus subtilis and its Closest Relatives: from Genes to Cells. A. L. Sonenshein, J. A. Hoch, and R. Losick, eds., ASM Press, Washington, D.C.).


BRIEF SUMMARY OF THE INVENTION

It has been discovered that certain natural mRNAs serve as metabolite-sensitive genetic switches wherein the RNA directly binds a small organic molecule. This binding process changes the conformation of the mRNA, which causes a change in gene expression by a variety of different mechanisms. Modified versions of these natural “riboswitches” (created by using various nucleic acid engineering strategies) can be employed as designer genetic switches that are controlled by specific effector compounds. Such effector compounds that activate a riboswitch are referred to herein as trigger molecules. The natural switches are targets for antibiotics and other small molecule therapies. In addition, the architecture of riboswitches allows actual pieces of the natural switches to be used to construct new non-immunogenic genetic control elements, for example the aptamer (molecular recognition) domain can be swapped with other non-natural aptamers (or otherwise modified) such that the new recognition domain causes genetic modulation with user-defined effector compounds. The changed switches become part of a therapy regimen-turning on, or off, or regulating protein synthesis. Newly constructed genetic regulation networks can be applied in such areas as living biosensors, metabolic engineering of organisms, and in advanced forms of gene therapy treatments.


Disclosed are isolated and recombinant riboswitches, recombinant constructs containing such riboswitches, heterologous sequences operably linked to such riboswitches, and cells and transgenic organisms harboring such riboswitches, riboswitch recombinant constructs, and riboswitches operably linked to heterologous sequences. The heterologous sequences can be, for example, sequences encoding proteins or peptides of interest, including reporter proteins or peptides. Preferred riboswitches are, or are derived from, naturally occurring riboswitches.


Also disclosed are the crystalline atomic structures of riboswitches. These structures are useful in modeling and assessing the interaction of a riboswitch with a binding ligand. They are also useful in methods of identifying compounds that interact with the riboswitch.


Also disclosed are chimeric riboswitches containing heterologous aptamer domains and expression platform domains. That is, chimeric riboswitches are made up an aptamer domain from one source and an expression platform domain from another source. The heterologous sources can be from, for example, different specific riboswitches or different classes of riboswitches. The heterologous aptamers can also come from non-riboswitch aptamers. The heterologous expression platform domains can also come from non-riboswitch sources.


Also disclosed are compositions and methods for selecting and identifying compounds that can activate, deactivate or block a riboswitch. Activation of a riboswitch refers to the change in state of the riboswitch upon binding of a trigger molecule. A riboswitch can be activated by compounds other than the trigger molecule and in ways other than binding of a trigger molecule. The term trigger molecule is used herein to refer to molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch. Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques). Non-natural trigger molecules can be referred to as non-natural trigger molecules.


Deactivation of a riboswitch refers to the change in state of the riboswitch when the trigger molecule is not bound. A riboswitch can be deactivated by binding of compounds other than the trigger molecule and in ways other than removal of the trigger molecule. Blocking of a riboswitch refers to a condition or state of the riboswitch where the presence of the trigger molecule does not activate the riboswitch.


Also disclosed are methods of identifying a compound that interacts with a riboswitch comprising modeling the atomic structure of the riboswitch with a test compound and determining if the test compound interacts with the riboswitch. This can be done by determining the atomic contacts of the riboswitch and test compound. Furthermore, analogs of a compound known to interact with a riboswitch can be generated by analyzing the atomic contacts, then optimizing the atomic structure of the analog to maximize interaction. These methods can be used with a high throughput screen.


Also disclosed are compounds, and compositions containing such compounds, that can activate, deactivate or block a riboswitch. Also disclosed are compositions and methods for activating, deactivating or blocking a riboswitch. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Compounds can be used to activate, deactivate or block a riboswitch. The trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch. Compounds other than the trigger molecule generally can be used to deactivate or block a riboswitch. Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch. A riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch.


Also disclosed are compositions and methods for altering expression of an RNA molecule, or of a gene encoding an RNA molecule, where the RNA molecule includes a riboswitch, by bringing a compound into contact with the RNA molecule. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Thus, subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA. Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA. Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.


Also disclosed are compositions and methods for regulating expression of an RNA molecule, or of a gene encoding an RNA molecule, by operably linking a riboswitch to the RNA molecule. A riboswitch can be operably linked to an RNA molecule in any suitable manner, including, for example, by physically joining the riboswitch to the RNA molecule or by engineering nucleic acid encoding the RNA molecule to include and encode the riboswitch such that the RNA produced from the engineered nucleic acid has the riboswitch operably linked to the RNA molecule. Subjecting a riboswitch operably linked to an RNA molecule of interest to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA.


Also disclosed are compositions and methods for regulating expression of a naturally occurring gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene is essential for survival of a cell or organism that harbors it, activating, deactivating or blocking the riboswitch can result in death, stasis or debilitation of the cell or organism. For example, activating a naturally occurring riboswitch in a naturally occurring gene that is essential to survival of a microorganism can result in death of the microorganism (if activation of the riboswitch turns off or represses expression). This is one basis for the use of the disclosed compounds and methods for antimicrobial and antibiotic effects.


Also disclosed are compositions and methods for regulating expression of an isolated, engineered or recombinant gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. The gene or RNA can be engineered or can be recombinant in any manner. For example, the riboswitch and coding region of the RNA can be heterologous, the riboswitch can be recombinant or chimeric, or both. If the gene encodes a desired expression product, activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product. If the gene encodes an inducer or repressor of gene expression or of another cellular process, activation, deactivation or blocking of the riboswitch can result in induction, repression, or de-repression of other, regulated genes or cellular processes. Many such secondary regulatory effects are known and can be adapted for use with riboswitches. An advantage of riboswitches as the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules.


Also disclosed are compositions and methods for altering the regulation of a riboswitch by operably linking an aptamer domain to the expression platform domain of the riboswitch (which is a chimeric riboswitch). The aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain. Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain. Generally, any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch.


Also disclosed are compositions and methods for inactivating a riboswitch by covalently altering the riboswitch (by, for example, crosslinking parts of the riboswitch or coupling a compound to the riboswitch). Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule.


Also disclosed are methods of identifying compounds that activate, deactivate or block a riboswitch. For examples, compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner. For example, the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound. As another example, the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. As can be seen, assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.


Identification of compounds that block a riboswitch can be accomplished in any suitable manner. For example, an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation of the riboswitch.


Also disclosed are biosensor riboswitches. Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA. An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. Also disclosed are methods of detecting compounds using biosensor riboswitches. The method can include bringing into contact a test sample and a biosensor riboswitch and assessing the activation of the biosensor riboswitch. Activation of the biosensor riboswitch indicates the presence of the trigger molecule for the biosensor riboswitch in the test sample.


Also disclosed are compounds made by identifying a compound that activates, deactivates or blocks a riboswitch and manufacturing the identified compound. This can be accomplished by, for example, combining compound identification methods as disclosed elsewhere herein with methods for manufacturing the identified compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.


Also disclosed are compounds made by checking activation, deactivation or blocking of a riboswitch by a compound and manufacturing the checked compound. This can be accomplished by, for example, combining compound activation, deactivation or blocking assessment methods as disclosed elsewhere herein with methods for manufacturing the checked compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound. Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch.


Also disclosed are methods for selecting, designing or deriving new riboswitches and/or new aptamers that recognize new trigger molecules. Such methods can involve production of a set of aptamer variants in a riboswitch, assessing the activation of the variant riboswitches in the presence of a compound of interest, selecting variant riboswitches that were activated (or, for example, the riboswitches that were the most highly or the most selectively activated), and repeating these steps until a variant riboswitch of a desired activity, specificity, combination of activity and specificity, or other combination of properties results. Also disclosed are riboswitches and aptamer domains produced by these methods.


The disclosed riboswitches, including the derivatives and recombinant forms thereof, generally can be from any source, including naturally occurring riboswitches and riboswitches designed de novo. Any such riboswitches can be used in or with the disclosed methods. However, different types of riboswitches can be defined and some such sub-types can be useful in or with particular methods (generally as described elsewhere herein). Types of riboswitches include, for example, naturally occurring riboswitches, derivatives and modified forms of naturally occurring riboswitches, chimeric riboswitches, and recombinant riboswitches. A naturally occurring riboswitch is a riboswitch having the sequence of a riboswitch as found in nature. Such a naturally occurring riboswitch can be an isolated or recombinant form of the naturally occurring riboswitch as it occurs in nature. That is, the riboswitch has the same primary structure but has been isolated or engineered in a new genetic or nucleic acid context. Chimeric riboswitches can be made up of, for example, part of a riboswitch of any or of a particular class or type of riboswitch and part of a different riboswitch of the same or of any different class or type of riboswitch; part of a riboswitch of any or of a particular class or type of riboswitch and any non-riboswitch sequence or component. Recombinant riboswitches are riboswitches that have been isolated or engineered in a new genetic or nucleic acid context.


Different classes of riboswitches refer to riboswitches that have the same or similar trigger molecules or riboswitches that have the same or similar overall structure (predicted, determined, or a combination). Riboswitches of the same class generally, but need not, have both the same or similar trigger molecules and the same or similar overall structure. Riboswitch classes include glycine-responsive riboswitches, guanine-responsive riboswitches, adenine-responsive riboswitches, lysine-responsive riboswitches, thiamine pyrophosphate-responsive riboswitch, adenosylcobalamin-responsive riboswitches, flavin mononucleotide-responsive riboswitches, and a S-adenosylmethionine-responsive riboswitches.


Additional advantages of the disclosed method and compositions will be set forth in part in the description which follows, and in part will be understood from the description, or can be learned by practice of the disclosed method and compositions. The advantages of the disclosed method and compositions will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention as claimed.





BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate several embodiments of the disclosed method and compositions and together with the description, serve to explain the principles of the disclosed method and compositions.



FIGS. 1A, 1B and 1C show the G box RNA of the xpt-pbuX mRNA in B. subtilis responds allosterically to guanine. FIG. 1A shows the consensus sequence and secondary model for the G box RNA domain that resides in the 5′ UTR of genes that are largely involved in purine metabolism (SEQ ID NO: 1). Phylogenetic analysis is consistent with the formation of a three-stem (P1 through P3) junction. Nucleotides depicted shown as lower case letters and capitals are present in greater than 90% and 80% of the representatives examined, respectively. Encircled nucleotides exhibit base complementation, which might indicate the formation of a pseudoknot. FIG. 1B shows sequence and ligand-induced structural alterations of the 5′-UTR of the xpt-pbuX transcriptional unit (SEQ ID NO:2). The putative anti-terminator interaction is represented by the boxes. Nucleotides that undergo structural alteration as determined by in-line probing (from C) are identified with squares. The 93 xpt fragment (boxed) of the 201 xpt RNA retains guanine-binding function. Asterisks denote alterations to the RNA sequence that facilitate in vitro transcription (5′ terminus) or that generate a restriction site (3′ terminus). Nucleotide numbers begin at the first nucleotide of the natural transcription start site. The translation start codon begins at position 186. FIG. 1C shows guanine and related purines selectively induce structural modulation of the 93 xpt mRNA fragment. Precursor RNAs (Pre; 5′ 32P-labeled) were subjected to in-line probing by incubation for 40 hr in the absence (−) or presence of guanine, hypoxanthine, xanthine and adenine as indicated by G, H, X and A, respectively. Lanes designated NR, T1 and OH contain RNA that was not reacted, subjected to partial digestion with RNase T1 (G-specific cleavage), or subjected to partial alkaline digestion, respectively. Selected bands corresponding to G-specific cleavage are identified. Regions 1 through 4 identify major sites of ligand-induced modulation of spontaneous RNA cleavage.



FIGS. 2A, 2B and 2C show a molecular discrimination by the guanine-binding aptamer of the xpt-pbuX mRNA. FIG. 2A shows the chemical structures and apparent KD values for guanine, hypoxanthine and xanthine (active natural regulators of xpt-pbuX genetic expression in B. subtilis) versus that of adenine (inactive). Differences in chemical structure relative to guanine are encircled. KD values were established as shown in FIG. 2 with the 201 xpt RNA. Numbers on guanine represent the positions of the ring nitrogen atoms. FIG. 2B shows chemical structures and KD values for various analogs of guanine reveal that all alterations of this purine cause a loss of binding affinity. Open circles identify KD values that most likely are significantly higher than indicated, as concentrations of analog above 500 μM were not examined in this analysis. The apparent KD values of G, H, X and A as indicated are plotted as triangles for comparison. FIG. 2C shows a schematic representation of the molecular recognition features of the guanine aptamer in 201 xpt. Hydrogen bond formation at position 9 of guanine is expected because guanosine (KD>100 μM) and inosine (KD>100 μM), which are 9-ribosyl derivatives of guanine and hypoxanthine, respectively, do not exhibit measurable binding.



FIGS. 3A, 3B and 3C show guanine- and adenine-specific riboswitches. FIG. 3A shows sequence and structural features of the two guanine-specific (purE and xpt) and three adenine-specific aptamer domains that are examined in this study BS2-purE, BS3-xpt, BS5-ydhL, CP4-add, VV1-add, which are represented by SEQ ID NOS:3-7, respectively. P1 through P3 identify the three base-paired stems comprising the secondary structure of the aptamer domain. Lowercase nucleotides identify positions whose base identity is conserved in greater than 90% of representatives in the phylogeny. The arrow identifies a nucleotide within the conserved core of the aptamer that is a determinant of ligand specificity. BS, CP and VV designate B. subtilis, Clostridium perfringens and Vibrio vulnificus, respectively. FIG. 3B shows sequence and secondary structure of the xpt and ydhL aptamers (SEQ ID NO:8). Encircled nucleotides identify positions within the ydhL aptamer that differ from those in the xpt aptamer. The sequence disclosed in FIG. 3C is SEQ ID NO:9. Nucleotides in xpt are numbered as described in Example 6 of U.S. Application Publication No. 2005-0053951. Other notations are as described in A.



FIGS. 4A and 4B show the specificity of molecular recognition by the adenine aptamer from ydhL. FIG. 4a Top: Chemical structures of adenine, guanine and other purine analogs that exhibit measurable binding to the 80 ydhL RNA. Chemical changes relative to 2,6-DAP, which is the tightest-binding compound, are encircled. Bottom left: Plot of the apparent KD values for various purines. Bottom right: Model for the chemical features on adenine that serve as molecular recognition contacts for ydhL. Note that the importance of N7 and N9 has not been determined. Encircled arrow indicated that a contact could exist if a hydrogen bond donor is appended to C2. FIG. 4b shows chemical structures of various purines that are not bound by the 80 ydhL RNA (KD values poorer than 300 μM).



FIGS. 5A, 5B, 5C, and SD show secondary and tertiary structures of the guanine riboswitch-hypoxanthine complex. FIG. 5A shows left, secondary structure of the xpt-pbuX guanine-binding domain of the guanine riboswitch of B. subtilis (SEQ ID NO:213). Nucleotides conserved in more than 90% of known guanine riboswitches are shown in red; the numbering is consistent with that of the full-length mRNA. Colored boxes correspond to structural features shown in FIGS. 6 and 7. Right, tertiary architecture of the hypoxanthine-bound form. Key tertiary interactions between the loops are shown as thick broken lines; a water-mediated triple is indicated by the circled ‘w’. FIG. 5B shows gene repression by the guanine riboswitch in the 5′ untranslated region of mRNA (SEQ ID NO:214). Initial transcription generates a binding domain that is primed to bind guanine (G) rapidly if it is at a sufficiently high concentration. Hypoxanthine (HX, top right) stabilizes the guanine-binding domain and particularly the P1 helix, forcing the mRNA to form a terminator element that halts transcription. In the absence of ligand (bottom right), an antiterminator forms, facilitating continued transcription. FIG. 5C shows ribbon representation of the three-dimensional structure of the RNA-hypoxanthine complex. The hypoxanthine ligand is shown in red, with its surface represented by dots. FIG. 5D shows the top view of the complex, emphasizing the close packing of the P2 and P3 helices.



FIGS. 6A, 6B, and 6C show recognition of hypoxanthine (HX) by the guanine-binding domain. FIG. 6A: Stereo view of the hypoxanthine-binding pocket in the three-way junction. FIG. 6B: Hydrogen-bonding interactions (grey broken lines) between hypoxanthine and the RNA. The final model (shown in stick representation) is superimposed on a simulated annealing 2Fo-Fc omit map (orange cage), in which the atoms shown were excluded from the map calculation. FIG. 6C: Molecular surface representation of the binding pocket of the guanine riboswitch bound to hypoxanthine (left), compared with the theophylline-binding aptamer bound to theophylline (centre) and the E. coli purR repressor bound to hypoxanthine (right).



FIGS. 7A, 7B, and 7C show stabilization of the tertiary architecture. FIG. 7A: One of two base quartets that form the core of the loop-loop contact. The carbon atoms are colored as in FIG. 5. FIG. 7B: Side view of the loop-loop interaction, emphasizing the arrangement of base pairs and quartets. The bases of the quartet shown in A are colored blue, with the hydrogen bonding between A65 and U34 shown for orientation; the bases of the other quartet are colored green. The A35A64 pair is shown in yellow, with hydrogen bonds emphasizing its interactions with the 2′-hydroxyl group of U34. The capping G62U63 pair is shown in red.



FIGS. 8A and 8B show an estimation of the affinity of the riboswitch for hypoxanthine. Shown are the isothermal titration calorimetry curves for the wild-type guanine-binding domain (FIG. 8A) and for the guanine-binding domain lacking the tertiary interaction (FIG. 8B) with hypoxanthine at 30° C. in buffer containing 10 mM K+-HEPES (pH 7.5), 100 mM KCl and 5 mM MgCl2.



FIGS. 9A and 9B show schematic representations of two types of riboswitch binding assays that can be used with high throughput screens. FIG. 9A: a ribozyme-based assay can be used that exploits inherent action of a self-cleaving ribozyme. X represents the compound being tested, GlcN6P is glucosamine-6-phosphate. F and Q represent fluorophore and quencher moieties, respectfully (e.g., TAMRA and CY3). FIG. 9B: A molecular beacon assay can be used for non-ribozyme riboswitches such as the guanine-binding RNA. Notations are described in A.



FIG. 10 shows cobalt hexamine ions bound to the guanine riboswitch. The RNA (grey) on the left is shown in the same perspective as in FIG. 5B, with bound hypoxanthine in red and Co(NH3)63+ shown in green and blue. The RNA on the right is rotated 180° with respect to the left view.



FIG. 11 shows secondary structure of RNA GR-minimal (SEQ ID NO:215). This RNA has been designed to test the effect of the tertiary interaction formed by the loops L2 and L3 upon ligand binding. In this construct the L2 and L3 loops have been ablated along with part of P2 and P3, and replaced by extremely stable UUCG tetraloops.



FIGS. 12A and 12B show structure based design of anti-riboswitch compounds. FIG. 12A: Atomic-resolution model of the guanine riboswitch bound to the guanine analog hypoxanthine (HX). Guanine binding can include two added contacts made between the exocyclic amine position 2 of the purine ring and the oxygen atoms of C60 and U37. Blue shading identifies the channels that are present, which can allow modification of the guanine ring. Along these channels, and in the vicinity of their termini, are opportunities for new contacts to be made between the aptamer and specific guanine derivatives. FIG. 12B: Representatives of guanine analogs (total of 26; named G-001 though G-26) were synthesized to exploit the channels in the guanine riboswitch structure. Most compounds in this collection bind to the riboswitch with dissociation constants of lower than 1 nanomolar (see FIG. 13). These compounds can be further modified to acquire new contacts between the compounds and functional groups near the channels.



FIG. 13 shows representative polyacrylamide gel electrophoresis analysis of in-line probing studies that reveal allosteric modulation of the guanine aptamer by various analogs. NR, T1 and OH identify no reaction, nuclease T1 partial digestion, and alkaline partial digestion respectively. In-line probing assays were conducted for ˜1 day, and a dampening of spontaneous RNA cleavage product bands in the J1/2, J2/3 and J3/1 regions are indicative of ligand binding. This dampening occurs even when 1 nM of compound is added, indicating that the dissociation constant is equal to or better than this value.



FIG. 14 shows a GFP reporter assay for guanine riboswitch function in vivo. In the absence of guanine, a full-length mRNA is transcribed and GFP is expressed. At high concentrations of guanine, or an active guanine analog, a transcriptional terminator is formed, repressing the expression of GFP. A similar system has been created wherein the β-galactosidase gene is under riboswitch control.



FIG. 15 shows time-kill assay for G-014. G-014 reduces the number of viable B. subtilis cells over time at a rate equal to carbenicillin.





DETAILED DESCRIPTION OF THE INVENTION

The disclosed methods and compositions can be understood more readily by reference to the following detailed description of particular embodiments and the Examples included therein and to the Figures and their previous and following description.


Certain natural mRNAs serve as metabolite-sensitive genetic switches wherein the RNA directly binds a small organic molecule. This binding process changes the conformation of the m-RNA, which causes a change in gene expression by a variety of different mechanisms. Modified versions of these natural “riboswitches” (created by using various nucleic acid engineering strategies) can be employed as designer genetic switches that are controlled by specific effector compounds (referred to herein as trigger molecules). The natural switches are targets for antibiotics and other small molecule therapies. In addition, the architecture of riboswitches allows actual pieces of the natural switches to be used to construct new non-immunogenic genetic control elements, for example the aptamer (molecular recognition) domain can be swapped with other non-natural aptamers (or otherwise modified) such that the new recognition domain causes genetic modulation with user-defined effector compounds. The changed switches become part of a therapy regimen—turning on, or off, or regulating protein synthesis. Newly constructed genetic regulation networks can be applied in such areas as living biosensors, metabolic engineering of organisms, and in advanced forms of gene therapy treatments.


Messenger RNAs are typically thought of as passive carriers of genetic information that are acted upon by protein- or small RNA-regulatory factors and by ribosomes during the process of translation. It was discovered that certain mRNAs carry natural aptamer domains and that binding of specific metabolites directly to these RNA domains leads to modulation of gene expression. Natural riboswitches exhibit two surprising functions that are not typically associated with natural RNAs. First, the mRNA element can adopt distinct structural states wherein one structure serves as a precise binding pocket for its target metabolite. Second, the metabolite-induced allosteric interconversion between structural states causes a change in the level of gene expression by one of several distinct mechanisms. Riboswitches typically can be dissected into two separate domains: one that selectively binds the target (aptamer domain) and another that influences genetic control (expression platform). It is the dynamic interplay between these two domains that results in metabolite-dependent allosteric control of gene expression.


Distinct classes of riboswitches have been identified and are shown to selectively recognize activating compounds (referred to herein as trigger molecules). For example, coenzyme B12, glycine, thiamine pyrophosphate (TPP), and flavin mononucleotide (FMN) activate riboswitches present in genes encoding key enzymes in metabolic or transport pathways of these compounds. The aptamer domain of each riboswitch class conforms to a highly conserved consensus sequence and structure. Thus, sequence homology searches can be used to identify related riboswitch domains. Riboswitch domains have been discovered in various organisms from bacteria, archaea, and eukarya.


One class of riboswitches that recognizes guanine and discriminates against most other purine analogs has been discovered. Representative RNAs that carry the consensus sequence and structural features of guanine riboswitches are located in the 5′-untranslated region (UTR) of numerous genes of prokaryotes, where they control expression of proteins involved in purine salvage and biosynthesis. Three representatives of this phylogenetic collection bind adenine with values for apparent dissociation constant (apparent KD) that are several orders of magnitude better than for guanine. The preference for adenine is due to a single nucleotide substitution in the core of the riboswitch, wherein each representative most likely recognizes its corresponding ligand by forming a Watson/Crick base pair. In addition, the adenine-specific riboswitch associated with the ydhL gene of Bacillus subtilis functions as a genetic ‘ON’ switch, wherein adenine binding causes a structural rearrangement that precludes formation of an intrinsic transcription terminator stem. Guanine-sensing riboswitches are a class of RNA genetic control elements that modulate gene expression in response to changing concentrations of this compound.


The 5′-untranslated sequence of the Escherichia coli btuB mRNA assumes a more proactive role in metabolic monitoring and genetic control. The mRNA serves as a metabolite-sensing genetic switch by selectively binding coenzyme B12 without the need for proteins. This binding event establishes a distinct RNA structure that is likely to be responsible for inhibition of ribosome binding and consequent reduction in synthesis of the cobalamin transport protein BtuB. This discovery, along with related observations described herein, supports the hypothesis that metabolic monitoring through RNA-metabolite interactions is a widespread mechanism of genetic control.


RNA structure probing data indicate that the thiamine pyrophosphate (TPP) riboswitch operates as an allosteric sensor of its target compound, wherein binding of TPP by the aptamer domain stabilizes a conformational state within the aptamer and within the neighboring expression platform that precludes translation. The diversity of expression platforms appears to be expansive. The thiM RNA uses a Shine-Dalgarno (SD)-blocking mechanism to control translation. In contrast, the thiC RNA controls gene expression both at transcription and translation, and therefore might make use of a somewhat more complex expression platform that converts the TPP binding event into a transcription termination event and into inhibition of translation of completed mRNAs.


Numerous other riboswitches are known that can be used together or as part of a chimeric riboswitch along with glycine-sensing riboswitches and their components. Examples of such riboswitches and their use are described in U.S. Application Publication No. 2005-0053951, which is hereby incorporated by reference in its entirety and in particular for its description of the structure and operation of particular riboswitches.


A. General Organization of Riboswitch RNAs

Bacterial riboswitch RNAs are genetic control elements that are located primarily within the 5′-untranslated region (5′-UTR) of the main coding region of a particular mRNA. Structural probing studies (discussed further below) reveal that riboswitch elements are generally composed of two domains: a natural aptamer (T. Hermann, D. J. Patel, Science 2000, 287, 820; L. Gold, et al., Annual Review of Biochemistry 1995, 64, 763) that serves as the ligand-binding domain, and an ‘expression platform’ that interfaces with RNA elements that are involved in gene expression (e.g. Shine-Dalgarno (SD) elements; transcription terminator stems). These conclusions are drawn from the observation that aptamer domains synthesized in vitro bind the appropriate ligand in the absence of the expression platform (see Examples 2, 3 and 6 of U.S. Application Publication No. 2005-0053951). Moreover, structural probing investigations suggest that the aptamer domain of most riboswitches adopts a particular secondary- and tertiary-structure fold when examined independently, that is essentially identical to the aptamer structure when examined in the context of the entire 5 leader RNA. This implies that, in many cases, the aptamer domain is a modular unit that folds independently of the expression platform (see Examples 2, 3 and 6 of U.S. Application Publication No. 2005-0053951).


Ultimately, the ligand-bound or unbound status of the aptamer domain is interpreted through the expression platform, which is responsible for exerting an influence upon gene expression. The view of a riboswitch as a modular element is further supported by the fact that aptamer domains are highly conserved amongst various organisms (and even between kingdoms as is observed for the TPP riboswitch), (N. Sudarsan, et al., RNA 2003, 9, 644) whereas the expression platform varies in sequence, structure, and in the mechanism by which expression of the appended open reading frame is controlled. For example, ligand binding to the TPP riboswitch of the tenA mRNA of B. subtilis causes transcription termination (A. S. Mironov, et al., Cell 2002, 111, 747). This expression platform is distinct in sequence and structure compared to the expression platform of the TPP riboswitch in the thiM mRNA from E. coli, wherein TPP binding causes inhibition of translation by a SD blocking mechanism (see Example 2 of U.S. Application Publication No. 2005-0053951). The TPP aptamer domain is easily recognizable and of near identical functional character between these two transcriptional units, but the genetic control mechanisms and the expression platforms that carry them out are very different.


Aptamer domains for riboswitch RNAs typically range from ˜70 to 170 nt in length (FIG. 11 of U.S. Application Publication No. 2005-0053951). This observation was somewhat unexpected given that in vitro evolution experiments identified a wide variety of small molecule-binding aptamers, which are considerably shorter in length and structural intricacy (T. Hermann, D. J. Patel, Science 2000, 287, 820; L. Gold, et al., Annual Review of Biochemistry 1995, 64, 763; M. Famulok, Current Opinion in Structural Biology 1999, 9, 324). Although the reasons for the substantial increase in complexity and information content of the natural aptamer sequences relative to artificial aptamers remains to be proven, this complexity is believed required to form RNA receptors that function with high affinity and selectivity. Apparent KD values for the ligand-riboswitch complexes range from low nanomolar to low micromolar. It is also worth noting that some aptamer domains, when isolated from the appended expression platform, exhibit improved affinity for the target ligand over that of the intact riboswitch. (˜10 to 100-fold) (see Example 2 of U.S. Application Publication No. 2005-0053951). Presumably, there is an energetic cost in sampling the multiple distinct RNA conformations required by a fully intact riboswitch RNA, which is reflected by a loss in ligand affinity. Since the aptamer domain must serve as a molecular switch, this might also add to the functional demands on natural aptamers that might help rationalize their more sophisticated structures.


B. Riboswitch Regulation of Transcription Termination in Bacteria

Bacteria primarily make use of two methods for termination of transcription. Certain genes incorporate a termination signal that is dependent upon the Rho protein, (J. P. Richardson, Biochimica et Biophysica Acta 2002, 1577, 251). while others make use of Rho-independent terminators (intrinsic terminators) to destabilize the transcription elongation complex (I. Gusarov, E. Nudler, Molecular Cell 1999, 3, 495; E. Nudler, M. E. Gottesman, Genes to Cells 2002, 7, 755). The latter RNA elements are composed of a GC-rich stem-loop followed by a stretch of 6-9 uridyl residues. Intrinsic terminators are widespread throughout bacterial genomes (F. Lillo, et al., 2002, 18, 971), and are typically located at the 3′-termini of genes or operons. Interestingly, an increasing number of examples are being observed for intrinsic terminators located within 5′-UTRs.


Amongst the wide variety of genetic regulatory strategies employed by bacteria there is a growing class of examples wherein RNA polymerase responds to a termination signal within the 5′-UTR in a regulated fashion (T. M. Henkin, Current Opinion in Microbiology 2000, 3, 149). During certain conditions the RNA polymerase complex is directed by external signals either to perceive or to ignore the termination signal. Although transcription initiation might occur without regulation, control over mRNA synthesis (and of gene expression) is ultimately dictated by regulation of the intrinsic terminator. Presumably, one of at least two mutually exclusive mRNA conformations results in the formation or disruption of the RNA structure that signals transcription termination. A trans-acting factor, which in some instances is a RNA (F. J. Grundy, et al., Proceedings of the National Academy of Sciences of the United States of America 2002, 99, 11121; T. M. Henkin, C. Yanofsky, Bioessays 2002, 24, 700) and in others is a protein (J. Stulke, Archives of Microbiology 2002, 177, 433), is generally required for receiving a particular intracellular signal and subsequently stabilizing one of the RNA conformations. Riboswitches offer a direct link between RNA structure modulation and the metabolite signals that are interpreted by the genetic control machinery. A brief overview of the FMN riboswitch from a B. subtilis mRNA is provided below to illustrate this mechanism.


The xpt-pbuX operon (Christiansen, L. C., et al., 1997, J. Bacteriol. 179, 2540-2550) is controlled by a riboswitch that exhibits high affinity and high selectivity for guanine. This class of riboswitches is present in the 5′-untranslated region (5′-UTR) of five transcriptional units in B. subtilis, including that of the 12-gene pur operon. Direct binding of guanine by mRNAs serves as a critical determinant of metabolic homeostasis for purine metabolism in certain bacteria. Furthermore, the discovered classes of riboswitches, which respond to seven distinct target molecules, control at least 68 genes in Bacillus subtilis that are of fundamental importance to central metabolic pathways.


Also disclosed are guanine riboswitches that have been identified in B. subtilis. The crystal structure at 1.95 Å resolution of the purine-binding domain of the guanine riboswitch from the xpt-pbuX operon of B. subtilis bound to hypoxanthine, a prevalent metabolite in the bacterial purine salvage pathway, has been elucidated. This structure reveals a complex RNA fold involving several phylogenetically conserved nucleotides that create a binding pocket that almost completely envelops the ligand. Hypoxanthine functions to stabilize this structure and to promote the formation of a downstream transcriptional terminator element, thereby providing a mechanism for directly repressing gene expression in response to an increase in intracellular concentrations of metabolite.


Certain mRNAs involved in thiamine biosynthesis bind to thiamine (vitamin B1) or its bioactive pyrophosphate derivative (TPP) without the participation of protein factors. The mRNA-effector complex adopts a distinct structure that sequesters the ribosome-binding site and leads to a reduction in gene expression. This metabolite-sensing mRNA system provides an example of a genetic “riboswitch” (referred to herein as a riboswitch) whose origin might predate the evolutionary emergence of proteins. It has been discovered that the mRNA leader sequence of the btuB gene of Escherichia coli can bind coenzyme B12 selectively, and that this binding event brings about a structural change in the RNA that is important for genetic control (see Example 1 of U.S. Application Publication No. 2005-0053951). It was also discovered that mRNAs that encode thiamine biosynthetic proteins also employ a riboswitch mechanism (see Example 2 of U.S. Application Publication No. 2005-0053951).


A riboswitch class was discovered in bacteria that is selectively triggered by glycine. A representative of these glycine-sensing RNAs from Bacillus subtilis operates as a rare genetic on switch for the gcvT operon, which codes for proteins that form the glycine cleavage system. Most glycine riboswitches integrate two ligand-binding domains that function cooperatively to more closely approximate a two-state genetic switch. This advanced form of riboswitch may have evolved to ensure that excess glycine is efficiently used to provide carbon flux through the citric acid cycle and maintain adequate amounts of the amino acid for protein synthesis. Thus, riboswitches perform key regulatory roles and exhibit complex performance characteristics that previously had been observed only with protein factors.


Although the specific natural riboswitches disclosed herein are the first examples of mRNA elements that control genetic expression by metabolite binding, it is expected that this genetic control strategy is widespread in biology. It has been suggested (White III, Coenzymes as fossils of an earlier metabolic state. J. Mol. Evol. 7, 101-104 (1976); White III, In: The Pyridine Nucleotide Coenzymes. Acad. Press, NY pp. 1-17 (1982); Benner et al., Modern metabolism as a palimpsest of the RNA world. Proc. Natl. Acad. Sci. USA 86, 7054-7058 (1989)) that TPP, coenzyme B12 and FMN emerged as biological cofactors during the RNA world (Joyce, The antiquity of RNA-based evolution. Nature 418, 214-221 (2002)). If these metabolites were being biosynthesized and used before the advent of proteins, then certain riboswitches might be modern examples of the most ancient form of genetic control. A search of genomic sequence databases has revealed that sequences corresponding to the TPP aptamer exist in organisms from bacteria, archaea and eukarya-largely without major alteration. Although new metabolite-binding mRNAs are likely to emerge as evolution progresses, it is possible that the known riboswitches are molecular fossils from the RNA world.


It is to be understood that the disclosed method and compositions are not limited to specific synthetic methods, specific analytical techniques, or to particular reagents unless otherwise specified, and, as such, can vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting.


Materials

Disclosed are materials, compositions, and components that can be used for, can be used in conjunction with, can be used in preparation for, or are products of the disclosed methods and compositions. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that while specific reference to each of various individual and collective combinations and permutation of these compounds can not be explicitly disclosed, each is specifically contemplated and described herein. For example, if a riboswitch or aptamer domain is disclosed and discussed and a number of modifications that can be made to a number of molecules including the riboswitch or aptamer domain are discussed, each and every combination and permutation of riboswitch or aptamer domain and the modifications that are possible are specifically contemplated unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules D, E, and F and an example of a combination molecule, A-D is disclosed, then even if each is not individually recited, each is individually and collectively contemplated. Thus, in this example, each of the combinations A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are specifically contemplated and should be considered disclosed from disclosure of A, B, and C; D, E, and F; and the example combination A-D. Likewise, any subset or combination of these is also specifically contemplated and disclosed. Thus, for example, the sub-group of A-F, B-F, and C-E are specifically contemplated and should be considered disclosed from disclosure of A, B, and C; D, E, and F; and the example combination A-D. This concept applies to all aspects of this application including, but not limited to, steps in methods of making and using the disclosed compositions. Thus, if there are a variety of additional steps that can be performed it is understood that each of these additional steps can be performed with any specific embodiment or combination of embodiments of the disclosed methods, and that each such combination is specifically contemplated and should be considered disclosed.


A. Riboswitches

Riboswitches are expression control elements that are part of an RNA molecule to be expressed and that change state when bound by a trigger molecule. Riboswitches typically can be dissected into two separate domains: one that selectively binds the target (aptamer domain) and another that influences genetic control (expression platform domain). It is the dynamic interplay between these two domains that results in metabolite-dependent allosteric control of gene expression. Disclosed are isolated and recombinant riboswitches, recombinant constructs containing such riboswitches, heterologous sequences operably linked to such riboswitches, and cells and transgenic organisms harboring such riboswitches, riboswitch recombinant constructs, and riboswitches operably linked to heterologous sequences. The heterologous sequences can be, for example, sequences encoding proteins or peptides of interest, including reporter proteins or peptides. Preferred riboswitches are, or are derived from, naturally occurring riboswitches.


The disclosed riboswitches, including the derivatives and recombinant forms thereof, generally can be from any source, including naturally occurring riboswitches and riboswitches designed de novo. Any such riboswitches can be used in or with the disclosed methods. However, different types of riboswitches can be defined and some such sub-types can be useful in or with particular methods (generally as described elsewhere herein). Types of riboswitches include, for example, naturally occurring riboswitches, derivatives and modified forms of naturally occurring riboswitches, chimeric riboswitches, and recombinant riboswitches. A naturally occurring riboswitch is a riboswitch having the sequence of a riboswitch as found in nature. Such a naturally occurring riboswitch can be an isolated or recombinant form of the naturally occurring riboswitch as it occurs in nature. That is, the riboswitch has the same primary structure but has been isolated or engineered in a new genetic or nucleic acid context. Chimeric riboswitches can be made up of, for example, part of a riboswitch of any or of a particular class or type of riboswitch and part of a different riboswitch of the same or of any different class or type of riboswitch; part of a riboswitch of any or of a particular class or type of riboswitch and any non-riboswitch sequence or component. Recombinant riboswitches are riboswitches that have been isolated or engineered in a new genetic or nucleic acid context.


Riboswitches can have single or multiple aptamer domains. Aptamer domains in riboswitches having multiple aptamer domains can exhibit cooperative binding of trigger molecules or can not exhibit cooperative binding of trigger molecules (that is, the aptamers need not exhibit cooperative binding). In the latter case, the aptamer domains can be said to be independent binders. Riboswitches having multiple aptamers can have one or multiple expression platform domains. For example, a riboswitch having two aptamer domains that exhibit cooperative binding of their trigger molecules can be linked to a single expression platform domain that is regulated by both aptamer domains. Riboswitches having multiple aptamers can have one or more of the aptamers joined via a linker. Where such aptamers exhibit cooperative binding of trigger molecules, the linker can be a cooperative linker.


Aptamer domains can be said to exhibit cooperative binding if they have a Hill coefficient n between x and x−1, where x is the number of aptamer domains (or the number of binding sites on the aptamer domains) that are being analyzed for cooperative binding. Thus, for example, a riboswitch having two aptamer domains (such as glycine-responsive riboswitches) can be said to exhibit cooperative binding if the riboswitch has Hill coefficient between 2 and 1. It should be understood that the value of x used depends on the number of aptamer domains being analyzed for cooperative binding, not necessarily the number of aptamer domains present in the riboswitch. This makes sense because a riboswitch can have multiple aptamer domains where only some exhibit cooperative binding.


Different classes of riboswitches refer to riboswitches that have the same or similar trigger molecules or riboswitches that have the same or similar overall structure (predicted, determined, or a combination). Riboswitches of the same class generally, but need not, have both the same or similar trigger molecules and the same or similar overall structure. Riboswitch classes include glycine-responsive riboswitches, guanine-responsive riboswitches, adenine-responsive riboswitches, lysine-responsive riboswitches, thiamine pyrophosphate-responsive riboswitch, adenosylcobalamin-responsive riboswitches, flavin mononucleotide-responsive riboswitches, and a S-adenosylmethionine-responsive riboswitches.


Also disclosed are chimeric riboswitches containing heterologous aptamer domains and expression platform domains. That is, chimeric riboswitches are made up an aptamer domain from one source and an expression platform domain from another source. The heterologous sources can be from, for example, different specific riboswitches, different types of riboswitches, or different classes of riboswitches. The heterologous aptamers can also come from non-riboswitch aptamers. The heterologous expression platform domains can also come from non-riboswitch sources.


Riboswitches can be modified from other known, developed or naturally-occurring riboswitches. For example, switch domain portions can be modified by changing one or more nucleotides while preserving the known or predicted secondary, tertiary, or both secondary and tertiary structure of the riboswitch. For example, both nucleotides in a base pair can be changed to nucleotides that can also base pair. Changes that allow retention of base pairing are referred to herein as base pair conservative changes.


Modified or derivative riboswitches can also be produced using in vitro selection and evolution techniques. In general, in vitro evolution techniques as applied to riboswitches involve producing a set of variant riboswitches where part(s) of the riboswitch sequence is varied while other parts of the riboswitch are held constant. Activation, deactivation or blocking (or other functional or structural criteria) of the set of variant riboswitches can then be assessed and those variant riboswitches meeting the criteria of interest are selected for use or further rounds of evolution. Useful base riboswitches for generation of variants are the specific and consensus riboswitches disclosed herein. Consensus riboswitches can be used to inform which part(s) of a riboswitch to vary for in vitro selection and evolution.


Also disclosed are modified riboswitches with altered regulation. The regulation of a riboswitch can be altered by operably linking an aptamer domain to the expression platform domain of the riboswitch (which is a chimeric riboswitch). The aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain. Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain. Generally, any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch.


Also disclosed are inactivated riboswitches. Riboswitches can be inactivated by covalently altering the riboswitch (by, for example, crosslinking parts of the riboswitch or coupling a compound to the riboswitch). Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule.


Also disclosed are biosensor riboswitches. Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA. An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. Biosensor riboswitches can be used in various situations and platforms. For example, biosensor riboswitches can be used with solid supports, such as plates, chips, strips and wells.


Also disclosed are modified or derivative riboswitches that recognize new trigger molecules. New riboswitches and/or new aptamers that recognize new trigger molecules can be selected for, designed or derived from known riboswitches. This can be accomplished by, for example, producing a set of aptamer variants in a riboswitch, assessing the activation of the variant riboswitches in the presence of a compound of interest, selecting variant riboswitches that were activated (or, for example, the riboswitches that were the most highly or the most selectively activated), and repeating these steps until a variant riboswitch of a desired activity, specificity, combination of activity and specificity, or other combination of properties results.


Particularly useful aptamer domains can form a stem structure referred to herein as the P1 stem structure (or simply P1). The P1 stems of a variety of riboswitches are shown in FIG. 11 of U.S. Application Publication No. 2005-0053951. The hybridizing strands in the P1 stem structure are referred to as the aptamer strand (also referred to as the P1a strand) and the control strand (also referred to as the P1b strand). The control strand can form a stem structure with both the aptamer strand and a sequence in a linked expression platform that is referred to as the regulated strand (also referred to as the P1c strand). Thus, the control strand (P1b) can form alternative stem structures with the aptamer strand (P1a) and the regulated strand (P1c). Activation and deactivation of a riboswitch results in a shift from one of the stem structures to the other (from P1a/P1b to P1b/P1c or vice versa). The formation of the P1b/P1c stem structure affects expression of the RNA molecule containing the riboswitch. Riboswitches that operate via this control mechanism are referred to herein as alternative stem structure riboswitches (or as alternative stem riboswitches). Some glycine-responsive riboswitches having two aptamers utilize this mechanism using a P1 stem in the second aptamer.


In general, any aptamer domain can be adapted for use with any expression platform domain by designing or adapting a regulated strand in the expression platform domain to be complementary to the control strand of the aptamer domain. Alternatively, the sequence of the aptamer and control strands of an aptamer domain can be adapted so that the control strand is complementary to a functionally significant sequence in an expression platform. For example, the control strand can be adapted to be complementary to the Shine-Dalgarno sequence of an RNA such that, upon formation of a stem structure between the control strand and the SD sequence, the SD sequence becomes inaccessible to ribosomes, thus reducing or preventing translation initiation. Note that the aptamer strand would have corresponding changes in sequence to allow formation of a P1 stem in the aptamer domain. In the case of riboswitches having multiple aptamers exhibiting cooperative binding, one the P1 stem of the activating aptamer (the aptamer that interacts with the expression platform domain) need be designed to form a stem structure with the SD sequence.


As another example, a transcription terminator can be added to an RNA molecule (most conveniently in an untranslated region of the RNA) where part of the sequence of the transcription terminator is complementary to the control strand of an aptamer domain (the sequence will be the regulated strand). This will allow the control sequence of the aptamer domain to form alternative stem structures with the aptamer strand and the regulated strand, thus either forming or disrupting a transcription terminator stem upon activation or deactivation of the riboswitch. Any other expression element can be brought under the control of a riboswitch by similar design of alternative stem structures.


For transcription terminators controlled by riboswitches, the speed of transcription and spacing of the riboswitch and expression platform elements can be important for proper control. Transcription speed can be adjusted by, for example, including polymerase pausing elements (e.g., a series of uridine residues) to pause transcription and allow the riboswitch to form and sense trigger molecules. For example, with the FMN riboswitch, if FMN is bound to its aptamer domain, then the antiterminator sequence is sequestered and is unavailable for formation of an antiterminator structure (FIG. 12 of U.S. Application Publication No. 2005-0053951). However, if FMN is absent, the antiterminator can form once its nucleotides emerge from the polymerase. RNAP then breaks free of the pause site only to reach another U-stretch and pause again. The transcriptional terminator then forms only if the terminator nucleotides are not tied up by the antiterminator.


Disclosed are regulatable gene expression constructs comprising a nucleic acid molecule encoding an RNA comprising a riboswitch operably linked to a coding region, wherein the riboswitch regulates expression of the RNA, wherein the riboswitch and coding region are heterologous. The riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain and the expression platform domain are heterologous. The riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain comprises a P1 stem, wherein the P1 stem comprises an aptamer strand and a control strand, wherein the expression platform domain comprises a regulated strand, wherein the regulated strand, the control strand, or both have been designed to form a stem structure. The riboswitch can comprise two or more aptamer domains and an expression platform domain, wherein at least one of the aptamer domains and the expression platform domain are heterologous. The riboswitch can comprise two or more aptamer domains and an expression platform domain, wherein at least one of the aptamer domains comprises a P1 stem, wherein the P1 stem comprises an aptamer strand and a control strand, wherein the expression platform domain comprises a regulated strand, wherein the regulated strand, the control strand, or both have been designed to form a stem structure.


Disclosed are riboswitches, wherein the riboswitch is a non-natural derivative of a naturally-occurring riboswitch. The riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain and the expression platform domain are heterologous. The riboswitch can be derived from a naturally-occurring guanine-responsive riboswitch, adenine-responsive riboswitch, lysine-responsive riboswitch, thiamine pyrophosphate-responsive riboswitch, adenosylcobalamin-responsive riboswitch, flavin mononucleotide-responsive riboswitch, glycine-responsive riboswitch, or a S-adenosylmethionine-responsive riboswitch. The riboswitch can be activated by a trigger molecule, wherein the riboswitch produces a signal when activated by the trigger molecule.


Table 5 discloses examples of various guanine riboswitches. The alignment of sequences in Table 5 is in Stockholm format as output by the cmalign program of the INFERNAL software package. The last two entries in Table 5 reflect the consensus sequence and structure for the motif as reflected by a computer algorithm described by Eddy, S. R. (2003). INFERNAL. Version 0.55. Distributed by the author. Dept. of Genetics, Washington University School of Medicine. St. Louis, Mo. FIG. 41 of U.S. Application Publication No. 2005-0053951 describes consensus riboswitch sequences, natural riboswitch sequences, and riboswitch sequence alignment. Thermodynamic parameters of various nucleobases bound to adenine riboswitch RNA are shown in Table 7. Thermodynamic parameters of various nucleobases bound to guanine riboswitch RNA are shown in Table 8.


Numerous riboswitches and riboswitch constructs are described and referred to herein. It is specifically contemplated that any specific riboswitch or riboswitch construct or group of riboswitches or riboswitch constructs can be excluded from some aspects of the invention disclosed herein. For example, fusion of the xpt-pbuX riboswitch with a reporter gene could be excluded from a set of riboswitches fused to reporter genes. As another example any combination of the riboswitches listed in Table 5 can be specifically included or specifically excluded form any aspect of the disclosed methods and compositions.


1. Aptamer Domains


Aptamers are nucleic acid segments and structures that can bind selectively to particular compounds and classes of compounds. Riboswitches have aptamer domains that, upon binding of a trigger molecule result in a change in the state or structure of the riboswitch. In functional riboswitches, the state or structure of the expression platform domain linked to the aptamer domain changes when the trigger molecule binds to the aptamer domain. Aptamer domains of riboswitches can be derived from any source, including, for example, natural aptamer domains of riboswitches, artificial aptamers, engineered, selected, evolved or derived aptamers or aptamer domains. Aptamers in riboswitches generally have at least one portion that can interact, such as by forming a stem structure, with a portion of the linked expression platform domain. This stem structure will either form or be disrupted upon binding of the trigger molecule.


Consensus aptamer domains of a variety of natural riboswitches are shown in FIG. 11 of U.S. Application Publication No. 2005-0053951 and elsewhere herein. These aptamer domains (including all of the direct variants embodied therein) can be used in riboswitches. The consensus sequences and structures indicate variations in sequence and structure. Aptamer domains that are within the indicated variations are referred to herein as direct variants. These aptamer domains can be modified to produce modified or variant aptamer domains. Conservative modifications include any change in base paired nucleotides such that the nucleotides in the pair remain complementary. Moderate modifications include changes in the length of stems or of loops (for which a length or length range is indicated) of less than or equal to 20% of the length range indicated. Loop and stem lengths are considered to be “indicated” where the consensus structure shows a stem or loop of a particular length or where a range of lengths is listed or depicted. Moderate modifications include changes in the length of stems or of loops (for which a length or length range is not indicated) of less than or equal to 40% of the length range indicated. Moderate modifications also include and functional variants of unspecified portions of the aptamer domain. Unspecified portions of the aptamer domains are indicated by solid lines in FIG. 11 of U.S. Application Publication No. 2005-0053951.


The P1 stem and its constituent strands can be modified in adapting aptamer domains for use with expression platforms and RNA molecules. Such modifications, which can be extensive, are referred to herein as P1 modifications. P1 modifications include changes to the sequence and/or length of the P1 stem of an aptamer domain.


The aptamer domains shown in FIG. 11 of U.S. Application Publication No. 2005-0053951 (including any direct variants) are particularly useful as initial sequences for producing derived aptamer domains via in vitro selection or in vitro evolution techniques.


Aptamer domains of the disclosed riboswitches can also be used for any other purpose, and in any other context, as aptamers. For example, aptamers can be used to control ribozymes, other molecular switches, and any RNA molecule where a change in structure can affect function of the RNA.


2. Expression Platform Domains


Expression platform domains are a part of riboswitches that affect expression of the RNA molecule that contains the riboswitch. Expression platform domains generally have at least one portion that can interact, such as by forming a stem structure, with a portion of the linked aptamer domain. This stem structure will either form or be disrupted upon binding of the trigger molecule. The stem structure generally either is, or prevents formation of, an expression regulatory structure. An expression regulatory structure is a structure that allows, prevents, enhances or inhibits expression of an RNA molecule containing the structure. Examples include Shine-Dalgarno sequences, initiation codons, transcription terminators, and stability and processing signals.


B. Trigger Molecules

Trigger molecules are molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch. Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques).


C. Compounds

Also disclosed are compounds, and compositions containing such compounds, that can activate, deactivate or block a riboswitch. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Compounds can be used to activate, deactivate or block a riboswitch. The trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch. Compounds other than the trigger molecule generally can be used to deactivate or block a riboswitch. Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch. A riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch.


Also disclosed are compounds for altering expression of an RNA molecule, or of a gene encoding an RNA molecule, where the RNA molecule includes a riboswitch. This can be accomplished by bringing a compound into contact with the RNA molecule. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Thus, subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA. Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA. Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.


Also disclosed are compounds for regulating expression of an RNA molecule, or of a gene encoding an RNA molecule. Also disclosed are compounds for regulating expression of a naturally occurring gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene is essential for survival of a cell or organism that harbors it, activating, deactivating or blocking the riboswitch can in death, stasis or debilitation of the cell or organism.


Also disclosed are compounds for regulating expression of an isolated, engineered or recombinant gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene encodes a desired expression product, activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product. If the gene encodes an inducer or repressor of gene expression or of another cellular process, activation, deactivation or blocking of the riboswitch can result in induction, repression, or de-repression of other, regulated genes or cellular processes. Many such secondary regulatory effects are known and can be adapted for use with riboswitches. An advantage of riboswitches as the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules.


Also disclosed are methods of identifying compounds that activate, deactivate or block a riboswitch. For example, compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner. For example, the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound. As another example, the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. As can be seen, assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.


Identification of compounds that block a riboswitch can be accomplished in any suitable manner. For example, an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation of the riboswitch.


Compounds can also be identified using the atomic crystalline structure of a riboswitch. An example of such a crystalline atomic structure of a natural guanine-responsive riboswitch can be found in FIG. 5. The atomic coordinates of the atomic structure are listed in Table 6. In FIG. 5, the riboswitch is shown bound to hypoxanthine. In this example, the crystal structure at 1.95 Å resolution of the purine-binding domain of the guanine riboswitch from the xpt-pbuX operon of B. subtilis bound to hypoxanthine, a prevalent metabolite in the bacterial purine salvage pathway is shown. This structure reveals a complex RNA fold involving several phylogenetically conserved nucleotides that create a binding pocket that almost completely envelops the ligand. Hypoxanthine functions to stabilize this structure and to promote the formation of a downstream transcriptional terminator element, thereby providing a mechanism for directly repressing gene expression in response to an increase in intracellular concentrations of metabolite.


Compounds can be identified using the crystalline structure of a riboswitch by, for example, modeling the atomic structure of the riboswitch with a test compound; and determining if the test compound interacts with the riboswitch. This can be done by using a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch. Compounds can also be identified by, for example, assessing the fit between the riboswitch and a compound known to bind the riboswitch (such as the trigger molecule), identify sites where the compound can be changed with little or no obvious adverse effects on binding of the compound, and incorporating one or more such alterations to produce a new compound. The method of identifying compounds that interact with a riboswitch can also involve production of the compounds so identified.


Typically the method first utilizes a 3-dimensional structure of the riboswitch with a compound, also referred to as a “known compound” or “known target”. Any of the trigger molecules and compounds disclosed herein can be used as such a known compound. The structure of the riboswitch can be determined using any known means, such as crystallography or solution NMR spectroscopy. That structure can also be obtained through computer molecular modeling simulation programs, such as AutoDock. The methods can involve determining the amount of binding, such as determining the binding energy, between a riboswitch, and a potential compound for that riboswitch. An active compound is a compound that has some activity against a riboswitch, such as inhibiting the riboswitch's activity or enhancing the riboswitch's activity. In addition, the potential compound can be an analog, which has some structural relationship to a known compound for the molecule. Any of the trigger molecules, known compounds, and compounds disclosed herein can be used as the basis of or to derive a potential compound.


The identity or relationship of the structure, properties, interaction or binding parameters, and the like of the known compound and potential compound can be viewed in number of ways. For example, any of the measures or interaction parameters that can be measured or assessed using the structural model, and such measures and parameters obtained for a known compound and a potential compound can be compared. One can look at the identity between the entire known compound and the potential compound. One can also look at the identity between the potential compound, such as an analog, and the know compound only in the domain where the potential compound interacts with the riboswitch. One can also look at the identity between the potential compound and the known compound at the level of a sub-domain, such as only those moieties or atoms in the potential compound which are within 7 Å, 6 Å, 5 Å, 4 Å, 3 Å, or 2 Å of a moiety or atom which is in contact with the riboswitch in the known compound. Generally, the more specific the sub-domain the higher the identity will be between the moieties of the potential compound and the known compound. For example, there can be 30% or greater, 35% or greater, 40% or greater, 45% or greater, 50% or greater, 55% or greater, 60% or greater, 65% or greater, 70% or greater, 75% or greater, 80% or greater, 85% or greater, 90% or greater, 95% or greater identity between the known compound and potential compound as a whole, 50% or greater, 55% or greater, 60% or greater, 65% or greater, 70% or greater, 75% or greater, 80% or greater, 85% or greater, 90% or greater, 95% or greater identity between the binding domain of the known compound and the potential compound, and 70% or greater, 75% or greater, 80% or greater, 85% or greater, 90% or greater, 95% or greater identity between the moieties or atoms of the potential compound that correspond to the moieties or atoms of the known compound which are within 5 Å of a moiety or atom which interacts with the riboswitch. Another sub-domain is a sub-domain of moieties or atoms which actually contact the riboswitch. In this case the identity can be, for example, greater than 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or higher.


Typically, the potential compounds exist in a family of potential compounds, i.e. a set of analogs, all of which have some structural relationship to the known compound for the riboswitch. A family consisting of any number of members can be screened. The maximum number of members in the family is only limited by the amount of computer power available to screen each member in a desired amount of time. The methods can involve at least one template structure of the riboswitch and a target, often this would be with a known target. It is not required that this structure be existent, as it can be generated, in some cases during the disclosed methods, using standard structure determination techniques. It is preferred that a real structure exist at the time the methods are employed.


The methods can also involve modeling the structure of the potential compound, using information from the structure of the known compound. This modeling can be performed in any way, and as described herein.


The conformation and position of the potential compound can be held fixed during the calculations; that is, it can be assumed that the riboswitch binds in exactly the same orientation to the potential compound as it does to a known compound.


Then, a binding energy (or other property or parameter) can be determined between the riboswitch and the potential compound, and if the binding energy (or other property or parameter) meets certain criteria, then the potential compound can be designated as an actual compound, i.e. one that is likely to interact with the riboswitch. Although the following refers to the use of binding energy, it should be understood that any property or parameter involving the interaction or modeling of a compound and a riboswitch can be used. The criterion can be that the computed binding energy of the riboswitch with the potential compound is similar to, or more favorable than, the computed binding energy of the same riboswitch with a known compound. For example, an actual compound can be a compound where the computed binding energy as discussed herein is, for example, at least 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 100%, 101%, 102%, 103%, 104%, 105%, 106%, 107%, 108%, 109%, 110%, 120%, 130%, 140%, 150%, 200%, 250%, 300%, 350%, 400%, 450%, 500%, 600%, 700%, 800%, 900%, 1000%, or greater than that of the known compound binding energy. An actual compound can also be a compound which after ordering all potential compounds in terms of the strength of their binding energies, are the compounds which are in the top 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1% of computed binding strengths, of for example, a set of potential compounds where the set is at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 125, 150, 175, 200, 225, 250, 275, 300, 350, 400, 500, 700, or a 1000 potential compounds.


It is also understood that once a potential compound is identified, as disclosed herein, traditional testing and analysis can be performed, such as performing a biological assay using the riboswitch and the actual compound to further define the ability of the actual compound to interact with and/or modulate the riboswitch. The disclosed methods can include the step of assaying the activity of the riboswitch and compound, as well as performing, for example, combinatorial chemistry studies using libraries based on the riboswitch, for example.


Energy calculations can be based on, for example, molecular or quantum mechanics. Molecular mechanics approximates the energy of a system by summing a series of empirical functions representing components of the total energy like bond stretching, van der Waals forces, or electrostatic interactions. Quantum mechanics methods use various degrees of approximation to solve the Schrödinger equation. These methods deal with electronic structure, allowing for the characterization of chemical reactions.


Potential compounds of the riboswitch can be identified. This can be accomplished by selecting potential compounds with a given similarity to the known compound. For example, compounds in the same family as the known compound can be selected.


To prepare each riboswitch for calculation, atoms can be built in that were unresolved or absent from the crystal structures of the potential compound. This can be done, for example, using the PRODRG webserver http://www.davapc1.bioch.dundee.ac.uk./programs/prodrg, or standard molecular modeling programs such as InsightII, Quanta (both at www.accelrys.com), CNS (Brunger et al., Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D 54, 905-921 (1998)), or any other molecular modeling system capable of preparing the riboswitch structure.


The binding energy (or other property or parameter) of the potential compound and riboswitch can then be calculated. There are numerous means for carrying this out. For example, the sampling of sidechain positions and the computation of the binding thermodynamics can be accomplished using an empirical function that models the energy of the potential compound-molecule as a sum of electrostatic and van der Waals interactions between all pairs of atoms within the model. Any other computational method for scoring the binding energy of the potential compound with the riboswitch can be used (H. Gohlke, & G. Klebe. Approaches to the description and prediction of the binding affinity of small-molecule ligands to macromolecular receptors. Angew. Chem. Int. Ed. 41, 2644-4676 (2002)). Examples of such scoring methods include, but are not limited to, those implemented in programs such as AutoDock (G. M. Morris et al. Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J. Comput. Chem. 19, 1639-1662 (1998)), Gold (G. Jones et al. Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. J. Mol. Biol. 245, 43-53 (1995)), Chem-Score (M. D. Eldridge et al. J. Comput.-Aided Mol. Des. 11, 425-445 (1997)) and Drug-Score (H. Gohlke et al. Knowledge-based scoring function to predict protein-ligand interactions. J. Mol. Biol. 295, 337-356 (2000)).


Rotamer libraries are known to those of skill in the art and can be obtained from a variety of sources, including the internet. Rotamers are low energy side-chain conformations. The use of a library of rotamers allows for the modeling of a structure to try the most likely side-chain conformations, saving time and producing a structure that is more likely to be correct. The use of a library of rotamers can be restricted to those residues that are within a given region of the potential compound, for example, at the binding site, or within a specified distance of the compound. The latter distance can be set at any desired length, for example, the potential compound can be 2, 3, 4, 5, 6, 7, 8, or 9 Å from any atom of the molecule.


Electrostatic interactions between every pair of atoms can be calculated, for example, using a Coulombic model with the formula:






E
elec=332.08q1q2/∈r.


where q1 and q2 are partial atomic charges, r is the distance between them, and ∈ is the dielectric constant.


Partial atomic charges can be taken from existing parameter sets that have been developed to describe charge distributions in molecules. Example parameter sets include, but are not limited to, PARSE (D. A. Sitkoff et al. Accurate calculation of hydration free-energies using macroscopic solvent models. J. Phys. Chem. 98, 1978-1988 (1994)), CHARMM (MacKerell et al. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J. Phys. Chem. B 102, 3586-3616, 1998) and AMBER (W. D. Cornell et al. A 2nd generation force-field for the simulation of proteins, nucleic-acids, and organic-molecules. J. Am. Chem. Soc. 117. 5179-5195 (1995)). Partial charges for atoms can be assigned either by analogy with those of similar functional groups, or by empirical assignment methods such as that implemented in the PRODRG server (D. M. F. van Aalten et al. PRODRG, a program for generating molecular topologies and unique molecular descriptors from coordinates of small molecules. J. Comput.-Aided Mol. Design. 10, 255-262 (1996)), or by the use of standard quantum mechanical calculation methods (for example, C. I. Bayly et al. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges—the RESP model. J. Phys. Chem. 97, 10269-10280, (1993)).


The electrostatic interaction can also be calculated by more elaborate methodologies that incorporate electrostatic desolvation effects. These can include explicit solvent and implicit solvent models: in the former, water molecules are directly included in the calculations, whereas in the latter, the effects of water are described by a dielectric continuum approach. Specific examples of implicit solvent methods for calculating electrostatic interactions include but are not limited to: Poisson-Boltzmann based methods and Generalized Born methods (M. Feig & C. L. Brooks. Recent advances in the development and application of implicit solvent models in biomolecule simulations. Curr. Opin. Struct. Biol. 14, 217-224 (2004)).


van der Waals and hydrophobic interactions between pairs of atoms (where both atoms are either sulfur or carbon) can be calculated using a simple Lennard-Jones formalism with the following equation:






E
vdw=ε{σatt12/r12−σatt6/r6}.


where ε is an energy, r is the distance between the two atoms and σatt is the distance at which the energy of interaction is zero.


van der Waals interactions between pairs of atoms (where one or both atoms are neither sulfur nor carbon) can be calculated using a simple repulsive energy term:






E
vdw=ε{σrep12/r12}.


where ε is an energy, r is the distance between the two atoms and σrep determines the distance at which the repulsive interaction is equal to ε.


Hydrophobic interactions between atoms can also be calculated using a variety of other methods known to those skilled in the art. For example, the energetic contribution can be calculated as being proportional to the amount of solvent accessible surface area of the ligand and receptor that is buried when the complex is formed. Such contributions can be expressed in terms of interactions between pairs of atoms, such as in the method proposed by Street & Mayo (A. G. Street & S. L. Mayo. Pairwise calculation of protein solvent-accessible surface areas. Folding & Design 3, 253-258 (1998)). Any other implementation of a formalism for describing hydrophobic or van der Waals or other energetic contributions can be included in the calculations.


Binding energies can be calculated for each potential compound-riboswitch interaction. For example, Monte Carlo sampling can be conducted in the presence and absence of the riboswitch, and the average energy in each simulation calculated. A binding energy for the riboswitch with the potential compound can then be calculated as the difference between the two calculated average energies.


The computed binding energy of a potential compound with the riboswitch can be compared with the computed binding energy of a known compound with the riboswitch to determine if the potential compound is likely to be an actual compound. These results can then be confirmed using experimental data, wherein the actual interaction between the riboswitch and compound can be measured. Examples of methods that can be used to determine an actual interaction between the riboswitch and the compound include but are not limited to: equilibrium dialysis measurements (wherein binding of a radioactive form of the compound to the riboswitch is detected), enzyme inhibition assays (wherein the activity of the riboswitch can be monitored in the presence and absence of the compound), and chemical shift perturbation measurements (wherein binding of the riboswitch to the potential compound is monitored by observing changes in NMR chemical shifts of atoms).


In the methods disclosed above, the riboswitch can be a guanine riboswitch, for example. This riboswitch can be selected, for example, from riboswitches in Table 5.


After the atomic crystalline structure of the riboswitch has been modeled with a potential compound, further testing can be carried out to determine the actual interaction between the riboswitch and the compound. For example, multiple different approaches can be used to detect binding RNAs, including allosteric ribozyme assays using gel-based and chip-based detection methods, and in-line probing assays. High throughput testing can also be accomplished by using, for example, fluorescent detection methods. For example, the natural catalytic activity of a glucosamine-6-phosphate sensing riboswitch that controls gene expression by activating RNA-cleaving ribozyme can be used. This ribozyme can be reconfigured to cleave separate substrate molecules with multiple turnover kinetics. Therefore, a fluorescent group held in proximity to a quenching group can be uncoupled (and therefore become more fluorescent) if a compound triggers ribozyme function. Second, molecular beacon technology can be employed. This creates a system that suppresses fluorescence if a compound prevents the beacon from docking to the riboswitch RNA. Either approach can be applied to any of the riboswitch classes by using RNA engineering strategies described herein.


Also disclosed herein are analogs that interact with the guanine riboswitch disclosed herein. Examples of such analogs can be found in FIG. 12B. Many of the 26 compounds synthesized and tested bind the guanine riboswitch with constants that are equal to or better than that of guanine (−5 nM). The fact that appendages with highly variable chemical composition exhibit function shows that numerous variations of these chemical scaffolds can be generated and tested for function in vitro and inside cells. Specifically, further modified versions of these compounds can have improved binding to the guanine riboswitch by making new contacts to other functional groups in the RNA structure. Furthermore, modulation of bioavailability, toxicity, and synthetic ease (among other characteristics) can be tunable by making modifications in these two regions of the scaffold, as the structural model for the riboswitch shows many modifications are possible at these sites.


High-throughput screening can also be used to reveal entirely new chemical scaffolds that also bind to riboswitch RNAs either with standard or non-standard modes of molecular recognition. Since riboswitches are the first major form of natural metabolite-binding RNAs to be discovered, there has been little effort made previously to create binding assays that can be adapted for high-throughput screening. Multiple different approaches can be used to detect metabolite binding RNAs, including allosteric ribozyme assays using gel-based and chip-based detection methods, and in-line probing assays. Also disclosed are compounds made by identifying a compound that activates, deactivates or blocks a riboswitch and manufacturing the identified compound. This can be accomplished by, for example, combining compound identification methods as disclosed elsewhere herein with methods for manufacturing the identified compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.


Also disclosed are compounds made by checking activation, deactivation or blocking of a riboswitch by a compound and manufacturing the checked compound. This can be accomplished by, for example, combining compound activation, deactivation or blocking assessment methods as disclosed elsewhere herein with methods for manufacturing the checked compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound. Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch.


As used herein, the term “substituted” is contemplated to include all permissible substituents of organic compounds. In a broad aspect, the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, and aromatic and nonaromatic substituents of organic compounds. Illustrative substituents include, for example, those described below. The permissible substituents can be one or more and the same or different for appropriate organic compounds. For the purposes of this disclosure, the heteroatoms, such as nitrogen, can have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valences of the heteroatoms. This disclosure is not intended to be limited in any manner by the permissible substituents of organic compounds. Also, the terms “substitution” or “substituted with” include the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, e.g., a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc.


“A1,” “A2,” “A3,” and “A4” are used herein as generic symbols to represent various specific substituents. These symbols can be any substituent, not limited to those disclosed herein, and when they are defined to be certain substituents in one instance, they can, in another instance, be defined as some other substituents.


The term “alkyl” as used herein is a branched or unbranched saturated hydrocarbon group of 1 to 24 carbon atoms, such as methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, t-butyl, pentyl, hexyl, heptyl, octyl, nonyl, decyl, dodecyl, tetradecyl, hexadecyl, eicosyl, tetracosyl, and the like. The alkyl group can also be substituted or unsubstituted. The alkyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below. The term “lower alkyl” is an alkyl group with 6 or fewer carbon atoms, e.g., methyl, ethyl, propyl, isopropyl, butyl, sec-butyl, iso-butyl, tert-butyl, pentyl, hexyl, and the like.


Throughout the specification “alkyl” is generally used to refer to both unsubstituted alkyl groups and substituted alkyl groups; however, substituted alkyl groups are also specifically referred to herein by identifying the specific substituent(s) on the alkyl group. For example, the term “halogenated alkyl” specifically refers to an alkyl group that is substituted with one or more halide, e.g., fluorine, chlorine, bromine, or iodine. The term “alkoxyalkyl” specifically refers to an alkyl group that is substituted with one or more alkoxy groups, as described below. The term “alkylamino” specifically refers to an alkyl group that is substituted with one or more amino groups, as described below, and the like. When “alkyl” is used in one instance and a specific term such as “halogenated alkyl” is used in another, it is not meant to imply that the term “alkyl” does not also refer to specific terms such as “halogenated alkyl” and the like.


This practice is also used for other groups described herein. That is, while a term such as “cycloalkyl” refers to both unsubstituted and substituted cycloalkyl moieties, the substituted moieties can, in addition, be specifically identified herein; for example, a particular substituted cycloalkyl can be referred to as, e.g., an “alkylcycloalkyl.” Similarly, a substituted alkoxy can be specifically referred to as, e.g., a “halogenated alkoxy,” a particular substituted alkenyl can be, e.g., an “alkenylalcohol,” and the like. Again, the practice of using a general term, such as “cycloalkyl,” and a specific term, such as “alkylcycloalkyl,” is not meant to imply that the general term does not also include the specific term.


The term “alkoxy” as used herein is an alkyl group bonded through a single, terminal ether linkage; that is, an “alkoxy” group can be defined as —OA where A2 is alkyl as defined above.


The term “alkenyl” as used herein is a hydrocarbon group of from 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon double bond. Asymmetric structures such as (A1A2)C═C(A3A4) are intended to include both the E and Z isomers. This can be presumed in structural formulae herein wherein an asymmetric alkene is present, or it can be explicitly indicated by the bond symbol C═C. The alkenyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below.


The term “alkynyl” as used herein is a hydrocarbon group of 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon triple bond. The alkynyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below.


The term “aryl” as used herein is a group that contains any carbon-based aromatic group including, but not limited to, benzene, naphthalene, phenyl, biphenyl, phenoxybenzene, and the like. The term “aryl” also includes “heteroaryl,” which is defined as a group that contains an aromatic group that has at least one heteroatom incorporated within the ring of the aromatic group. Examples of heteroatoms include, but are not limited to, nitrogen, oxygen, sulfur, and phosphorus. Likewise, the term “non-heteroaryl,” which is also included in the term “aryl,” defines a group that contains an aromatic group that does not contain a heteroatom. The aryl group can be substituted or unsubstituted. The aryl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein. The term “biaryl” is a specific type of aryl group and is included in the definition of aryl. Biaryl refers to two aryl groups that are bound together via a fused ring structure, as in naphthalene, or are attached via one or more carbon-carbon bonds, as in biphenyl.


The term “cycloalkyl” as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms. Examples of cycloalkyl groups include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, etc. The term “heterocycloalkyl” is a cycloalkyl group as defined above where at least one of the carbon atoms of the ring is substituted with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus. The cycloalkyl group and heterocycloalkyl group can be substituted or unsubstituted. The cycloalkyl group and heterocycloalkyl group can be substituted with one or more groups including, but not limited to, alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein.


The term “cycloalkenyl” as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms and containing at least one double bound, i.e., C═C. Examples of cycloalkenyl groups include, but are not limited to, cyclopropenyl, cyclobutenyl, cyclopentenyl, cyclopentadienyl, cyclohexenyl, cyclohexadienyl, and the like. The term “heterocycloalkenyl” is a type of cycloalkenyl group as defined above, and is included within the meaning of the term “cycloalkenyl,” where at least one of the carbon atoms of the ring is substituted with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus. The cycloalkenyl group and heterocycloalkenyl group can be substituted or unsubstituted. The cycloalkenyl group and heterocycloalkenyl group can be substituted with one or more groups including, but not limited to, alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein.


The term “cyclic group” is used herein to refer to either aryl groups, non-aryl groups (i.e., cycloalkyl, heterocycloalkyl, cycloalkenyl, and heterocycloalkenyl groups), or both. Cyclic groups have one or more ring systems that can be substituted or unsubstituted. A cyclic group can contain one or more aryl groups, one or more non-aryl groups, or one or more aryl groups and one or more non-aryl groups.


The term “aldehyde” as used herein is represented by the formula —C(O)H. Throughout this specification “C(O)” is a short hand notation for C═O.


The terms “amine” or “amino” as used herein are represented by the formula NA1A2A3, where A1, A2, and A3 can be, independently, hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.


The term “carboxylic acid” as used herein is represented by the formula


—C(O)OH. A “carboxylate” as used herein is represented by the formula —C(O)O.


The term “ester” as used herein is represented by the formula —OC(O)A1 or


—C(O)OA1, where A1 can be an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.


The term “ether” as used herein is represented by the formula A1OA2, where A1 and A2 can be, independently, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.


The term “ketone” as used herein is represented by the formula A1C(O)A2, where A1 and A2 can be, independently, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.


The term “halide” as used herein refers to the halogens fluorine, chlorine, bromine, and iodine.


The term “hydroxyl” as used herein is represented by the formula —OH.


The term “sulfo-oxo” as used herein is represented by the formulas —S(O)A1 (i.e., “sulfonyl”), A1S(O)A2 (i.e., “sulfoxide”), —S(O)2A1, A1SO2A2 (i.e., “sulfone”),


—OS(O)2A1, or —OS(O)2OA1, where A1 and A2 can be hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above. Throughout this specification “S(O)” is a short hand notation for S═O.


The term “sulfonylamino” or “sulfonamide” as used herein is represented by the formula —S(O)2NH—.


The term “thiol” as used herein is represented by the formula —SH.


As used herein, “Rn” where n is some integer can independently possess one or more of the groups listed above. For example, if R10 contains an aryl group, one of the hydrogen atoms of the aryl group can optionally be substituted with a hydroxyl group, an alkoxy group, an amine group, an alkyl group, a halide, and the like. Depending upon the groups that are selected, a first group can be incorporated within second group or, alternatively, the first group can be pendant (i.e., attached) to the second group. For example, with the phrase “an alkyl group comprising an amino group,” the amino group can be incorporated within the backbone of the alkyl group. Alternatively, the amino group can be attached to the backbone of the alkyl group. The nature of the group(s) that is (are) selected will determine if the first group is embedded or attached to the second group.


Unless stated to the contrary, a formula with chemical bonds shown only as solid lines and not as wedges or dashed lines contemplates each possible isomer, e.g., each enantiomer and diastereomer, and a mixture of isomers, such as a racemic or scalemic mixture.


Certain materials, compounds, compositions, and components disclosed herein can be obtained commercially or readily synthesized using techniques generally known to those of skill in the art. For example, the starting materials and reagents used in preparing the disclosed compounds and compositions are either available from commercial suppliers such as Aldrich Chemical Co., (Milwaukee, Wis.), Acros Organics (Morris Plains, N.J.), Fisher Scientific (Pittsburgh, Pa.), or Sigma (St. Louis, Mo.) or are prepared by methods known to those skilled in the art following procedures set forth in references such as Fieser and Fieser's Reagents for Organic Synthesis, Volumes 1-17 (John Wiley and Sons, 1991); Rodd's Chemistry of Carbon Compounds, Volumes 1-5 and Supplementals (Elsevier Science Publishers, 1989); Organic Reactions, Volumes 1-40 (John Wiley and Sons, 1991); March's Advanced Organic Chemistry, (John Wiley and Sons, 4th Edition); and Larock's Comprehensive Organic Transformations (VCH Publishers Inc., 1989).


Compounds useful with guanine-responsive riboswitches (and riboswitches derived from guanine-responsive riboswitches) include compounds represented by Formula I







where the compound can bind a guanine-responsive riboswitch or derivative thereof, where, when the compound is bound to a guanine-responsive riboswitch or derivative, R1 and R2 serve as a hydrogen bond donor, R7 serves as a hydrogen bond acceptor, R9 serves as a hydrogen bond donor, R10 serves as a hydrogen bond acceptor, and where each independently represent a single or double bond. In some other examples, R3 can be a hydrogen bond acceptor.


It is to be understood that while a particular moiety or group can be referred to herein as a hydrogen bond donor or acceptor, this terminology is used to merely categorize the various substituents for ease of reference. Such language should not be interpreted to mean that a particular moiety actually participates in hydrogen bonding with the riboswitch or some other compound. It is possible that, for example, a moiety referred to herein as a hydrogen bond acceptor (or donor) could solely or additionally be involved in hydrophobic, ionic, van de Waals, or other type of interaction with the riboswitch or other compound.


It is also understood that certain groups disclosed herein can be referred to herein as both a hydrogen bond acceptor and a hydrogen bond donor. For example, —OH can be a hydrogen bond donor by donating the hydrogen atom; —OH can also be a hydrogen bond acceptor through one or more of the nonbonded electron pairs on the oxygen atom. Thus, throughout the specification various moieties can be a hydrogen bond donor and acceptor and can be referred to as such.


Suitable hydrogen bond donors that can be present in the disclosed compounds, for example as one or more of R1, R2, and R9, are moieties that contain a polar hydrogen bond, such as when a hydrogen atom is bonded to a more electronegative atom like C, N, O, or S. Examples of suitable hydrogen bond donors for R1, R2, and R9 include, but are not limited to, the following moieties: —NR11—, —CHR11—, ═CR11—, and —C(═NR11)—, where R11 is —H, —NH2, —OH, —SH, —CO2H, substituted or unsubstituted alkyl, alkoxy, aryl, aryloxy, or benzyloxy, —NHalkyl, —NHalkoxy, —NHC(O)alkyl, —NHCO2alkyl, —NHC(O)NH2, —NH—NH2, —NH—NHalkyl, —NH—NHalkoxy, —NH—SO2alkyl, —NH—SO2—R12, —NHCO2CH2—R12, —NH—OR12, —N+H2—R12, —NH—NH—R12, and —NH—NH—CH2—R12, where R12 can be:







where n is from 1 to 5, and R13 can be one or more of —H, —NH2, —OH, alkoxy, —N-morpholino, or halide.


Suitable hydrogen bond acceptors that can be present in the disclosed compounds, for example as one or more of R3, R7, and R10, are moieties that contain a nonbonded electron pair. Nonbonded electron pairs typically exist on N, O, S, and halogen atoms. Examples of suitable hydrogen bond acceptors for R3, R7 include, but are not limited to, N, O, S, and SO2. For example, the groups R1 and R7 taken together can be represented as —CH═N—, —CH2—O—, —CH2—S—, or —CH2SO2—. R3 can be N, O, or S, which when referring to Formula I results in the moieties —N═R2, —O—R2, and —S—R2, respectively, where R2 is as defined above.


For the moiety R10, a suitable hydrogen bond acceptor can be O or S, as in O═R6 or S═R6, where R6 is C. Still further examples of hydrogen bond acceptors for R10 include, but are not limited to, —OH, —SH, —NH2, —CO2H, -alkoxy, -aryloxy, -benzyloxy, —halide, —NHalkyl, —NHalkoxy, —NHC(O)alkyl, —NHCO2alkyl, —NHCO2CH2—R12, —NHC(O)NH2, —NH—NH2, —NH—NHalkyl, —NH—NHalkoxy, —SO2alkyl, —SO2aryl, —NH—SO2alkyl, —NH—SO2—R12, —NH—OR12, —NH—R12, —NH —NH—NH—CH2—R12, or —NH—CH2—R12, where R12 is as defined above.


Yet another example of a suitable hydrogen bond acceptor for R10 is NR14, as in R14N═R6, where R6 is C. In these examples, R14 can be —H, —NH2—, —OH, —SH, —CO2H, —CO2alkyl, —CO2aryl, —C(O)NH2, substituted or unsubstituted alkyl, alkoxy, alkoxy, aryloxy, or benzyloxy, —NHalkyl, —NHalkoxy, —NHC(O)alkyl, —NHCO2alkyl, —NHC(O)NH2, —SO2alkyl, —SO2aryl, —NH—SO2alkyl, —NH—SO2—R12, —NH—OR12, or —NH—CH2—R12, where R12 is as defined above.


Some specific compounds disclosed herein can be represented by Formula II:







where R7 is N or CH;


R10 is ═O, ═S, ═NH, ═NOH, ═Nalkyl, ═Nalkoxyl, ═N-aryl, ═Naryloxy, ═N-benzyl, ═Nbenzyloxy, ═N—NH2, ═N—NHOH, ═N—NHalkyl, ═N—NHalkoxy, ═N—NHaryl, ═N—NHaryloxy, ═N—NHbenzyl, ═N—NHbenzyloxy, ═N—NH-(p-amino-phenyl), ═N—NH-(p-methoxyphenyl), ═N—NH-(p-N-morpholino-phenyl); and


R2 is ═CR15—, where R15 is —NH2, —NHNH2, —NHOH, —NHalkyl, —NHalkoxy, —NHaryl, —NHaryloxy, —NHbenzyl, —NHbenzyloxy, —N+H2aryl, —N+H2-(p-N-morpholino-phenyl), —N+H2-(p-aminophenyl), —N+H2-(p-methoxyphenyl), —NHCO2alkyl, —NHCO2benzyl, —NHNHalkyl, —NHNHaryl, —NHNHbenzyl, or —NHC(O)alkyl. Additional compounds can be represented by Formula III:







where R10 is —H, —OH, —SH, -alkoxy, halide, —NH2, —NHOH, —NHalkyl, —NHalkoxy, —NHaryl, —NHaryloxy, —NHbenzyl, —NHbenzyoxy, —NHC(O)alkyl, —NHCO2alkyl, NHCO2benzyl, —NHNH2, —NHNHalkyl, —NHNHaryl, or —NHNHbenzyl; and R2 and R7 are as defined above.


Further specific examples of compounds are illustrated in FIG. 12B.


Additional compounds useful with guanine-responsive riboswitches (and riboswitches derived from guanine-responsive riboswitches) include compounds having the formula







where the compound can bind a guanine-responsive riboswitch or derivative thereof, where, when the compound is bound to a guanine-responsive riboswitch or derivative, R7 serves as a hydrogen bond acceptor, R10 serves as a hydrogen bond donor, R11 serves as a hydrogen bond acceptor, R12 serves as a hydrogen bond donor, where R13 is H, H2 or is not present, where R1, R2, R3, R4, R5, R6, R8, and R9 are each independently C, N, O, or S, and where each independently represent a single or double bond.


Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not guanine, hypoxanthine, xanthine, or N2-methylguanine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a guanine-responsive riboswitch.


It should be understood that particular contacts and interactions (such as hydrogen bond donation or acceptance) described herein for compounds interacting with riboswitches are preferred but are not essential for interaction of a compound with a riboswitch. For example, compounds can interact with riboswitches with less affinity and/or specificity than compounds having the disclosed contacts and interactions. Further, different or additional functional groups on the compounds can introduce new, different and/or compensating contacts with the riboswitches. For example, for guanine riboswitches, large functional groups can be used at, for example, R10 and R2. Such functional groups can have, and can be designed to have, contacts and interactions with other part of the riboswitch. Such contacts and interactions can compensate for contacts and interactions of the trigger molecules and core structure.


Compounds useful with adenine-responsive riboswitches (and riboswitches derived from adenine-responsive riboswitches) include compounds having the formula







where the compound can bind an adenine-responsive riboswitch or derivative thereof, where, when the compound is bound to an adenine-responsive riboswitch or derivative, R1, R3 and R7 serve as hydrogen bond acceptors, and R10 and R11 serve as hydrogen bond donors, where R12 is H, H2 or is not present, where R1, R2, R3, R4, R5, R6, R8, and R9 are each independently C, N, O, or S, and where each independently represent a single or double bond.


Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not adenine, 2,6-diaminopurine, or 2-amino purine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate an adenine-responsive riboswitch.


Compounds useful with lysine-responsive riboswitches (and riboswitches derived from lysine-responsive riboswitches) include compounds having the formula







where the compound can bind a lysine-responsive riboswitch or derivative thereof, where R2 and R3 are each positively charged, where R1 is negatively charged, where R4 is C, N, O, or S, and where each independently represent a single or double bond. Also contemplated are compounds as defined above where R2 and R3 are each NH3+ and where R1 is O.


Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not lysine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a lysine-responsive riboswitch.


Compounds useful with TPP-responsive riboswitches (and riboswitches derived from lysine-responsive riboswitches) include compounds having the formula







where the compound can bind a TPP-responsive riboswitch or derivative thereof, where R1 is positively charged, where R2 and R3 are each independently C, O, or S, where R4 is CH3, NH2, OH, SH, H or not present, where R5 is CH3, NH2, OH, SH, or H, where R6 is C or N, and where each independently represent a single or double bond. Also contemplated are compounds as defined above where R1 is phosphate, diphosphate or triphosphate.


Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not TPP, TP or thiamine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a TPP-responsive riboswitch.


D. Constructs, Vectors and Expression Systems

The disclosed riboswitches can be used in with any suitable expression system. Recombinant expression is usefully accomplished using a vector, such as a plasmid. The vector can include a promoter operably linked to riboswitch-encoding sequence and RNA to be expression (e.g., RNA encoding a protein). The vector can also include other elements required for transcription and translation. As used herein, vector refers to any carrier containing exogenous DNA. Thus, vectors are agents that transport the exogenous nucleic acid into a cell without degradation and include a promoter yielding expression of the nucleic acid in the cells into which it is delivered. Vectors include but are not limited to plasmids, viral nucleic acids, viruses, phage nucleic acids, phages, cosmids, and artificial chromosomes. A variety of prokaryotic and eukaryotic expression vectors suitable for carrying riboswitch-regulated constructs can be produced. Such expression vectors include, for example, pET, pET3d, pCR2.1, pBAD, pUC, and yeast vectors. The vectors can be used, for example, in a variety of in vivo and in vitro situation.


Viral vectors include adenovirus, adeno-associated virus, herpes virus, vaccinia virus, polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HIV backbone. Also useful are any viral families which share the properties of these viruses which make them suitable for use as vectors. Retroviral vectors, which are described in Verma (1985), include Murine Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of MMLV as a vector. Typically, viral vectors contain, nonstructural early genes, structural late genes, an RNA polymerase III transcript, inverted terminal repeats necessary for replication and encapsidation, and promoters to control the transcription and replication of the viral genome. When engineered as vectors, viruses typically have one or more of the early genes removed and a gene or gene/promoter cassette is inserted into the viral genome in place of the removed viral DNA.


A “promoter” is generally a sequence or sequences of DNA that function when in a relatively fixed location in regard to the transcription start site. A “promoter” contains core elements required for basic interaction of RNA polymerase and transcription factors and can contain upstream elements and response elements.


“Enhancer” generally refers to a sequence of DNA that functions at no fixed distance from the transcription start site and can be either 5′ (Laimins, 1981) or 3′ (Lusky et al., 1983) to the transcription unit. Furthermore, enhancers can be within an intron (Banerji et al., 1983) as well as within the coding sequence itself (Osborne et al., 1984). They are usually between 10 and 300 bp in length, and they function in cis. Enhancers function to increase transcription from nearby promoters. Enhancers, like promoters, also often contain response elements that mediate the regulation of transcription. Enhancers often determine the regulation of expression.


Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human or nucleated cells) can also contain sequences necessary for the termination of transcription which can affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor protein. The 3′ untranslated regions also include transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation region. One benefit of this region is that it increases the likelihood that the transcribed unit will be processed and transported like mRNA. The identification and use of polyadenylation signals in expression constructs is well established. It is preferred that homologous polyadenylation signals be used in the transgene constructs.


The vector can include nucleic acid sequence encoding a marker product. This marker product is used to determine if the gene has been delivered to the cell and once delivered is being expressed. Preferred marker genes are the E. Coli lacZ gene which encodes β-galactosidase and green fluorescent protein.


In some embodiments the marker can be a selectable marker. When such selectable markers are successfully transferred into a host cell, the transformed host cell can survive if placed under selective pressure. There are two widely used distinct categories of selective regimes. The first category is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to grow independent of a supplemented media. The second category is dominant selection which refers to a selection scheme used in any cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells which have a novel gene would express a protein conveying drug resistance and would survive the selection. Examples of such dominant selection use the drugs neomycin, (Southern and Berg, 1982), mycophenolic acid, (Mulligan and Berg, 1980) or hygromycin (Sugden et al., 1985).


Gene transfer can be obtained using direct transfer of genetic material, in but not limited to, plasmids, viral vectors, viral nucleic acids, phage nucleic acids, phages, cosmids, and artificial chromosomes, or via transfer of genetic material in cells or carriers such as cationic liposomes. Such methods are well known in the art and readily adaptable for use in the method described herein. Transfer vectors can be any nucleotide construction used to deliver genes into cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part of recombinant retrovirus or adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)). Appropriate means for transfection, including viral vectors, chemical transfectants, or physico-mechanical methods such as electroporation and direct diffusion of DNA, are described by, for example, Wolff, J. A., et al., Science, 247, 1465-1468, (1990); and Wolff, J. A. Nature, 352, 815-818, (1991).


1. Viral Vectors


Preferred viral vectors are Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HIV backbone. Also preferred are any viral families which share the properties of these viruses which make them suitable for use as vectors. Preferred retroviruses include Murine Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of MMLV as a vector. Retroviral vectors are able to carry a larger genetic payload, i.e., a transgene or marker gene, than other viral vectors, and for this reason are a commonly used vector. However, they are not useful in non-proliferating cells. Adenovirus vectors are relatively stable and easy to work with, have high titers, and can be delivered in aerosol formulation, and can transfect non-dividing cells. Pox viral vectors are large and have several sites for inserting genes, they are thermostable and can be stored at room temperature. A preferred embodiment is a viral vector which has been engineered so as to suppress the immune response of the host organism, elicited by the viral antigens. Preferred vectors of this type will carry coding regions for Interleukin 8 or 10.


Viral vectors have higher transaction (ability to introduce genes) abilities than do most chemical or physical methods to introduce genes into cells. Typically, viral vectors contain, nonstructural early genes, structural late genes, an RNA polymerase III transcript, inverted terminal repeats necessary for replication and encapsidation, and promoters to control the transcription and replication of the viral genome. When engineered as vectors, viruses typically have one or more of the early genes removed and a gene or gene/promoter cassette is inserted into the viral genome in place of the removed viral DNA. Constructs of this type can carry up to about 8 kb of foreign genetic material. The necessary functions of the removed early genes are typically supplied by cell lines which have been engineered to express the gene products of the early genes in trans.


i. Retroviral Vectors


A retrovirus is an animal virus belonging to the virus family of Retroviridae, including any types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are described by Verma, I. M., Retroviral vectors for gene transfer. In Microbiology-1985, American Society for Microbiology, pp. 229-232, Washington, (1985), which is incorporated by reference herein. Examples of methods for using retroviral vectors for gene therapy are described in U.S. Pat. Nos. 4,868,116 and 4,980,286; PCT applications WO 90/02806 and WO 89/07136; and Mulligan, (Science 260:926-932 (1993)); the teachings of which are incorporated herein by reference.


A retrovirus is essentially a package which has packed into it nucleic acid cargo. The nucleic acid cargo carries with it a packaging signal, which ensures that the replicated daughter molecules will be efficiently packaged within the package coat. In addition to the package signal, there are a number of molecules which are needed in cis, for the replication, and packaging of the replicated virus. Typically a retroviral genome, contains the gag, pol, and env genes which are involved in the making of the protein coat. It is the gag, pol, and env genes which are typically replaced by the foreign DNA that it is to be transferred to the target cell. Retrovirus vectors typically contain a packaging signal for incorporation into the package coat, a sequence which signals the start of the gag transcription unit, elements necessary for reverse transcription, including a primer binding site to bind the tRNA primer of reverse transcription, terminal repeat sequences that guide the switch of RNA strands during DNA synthesis, a purine rich sequence 5′ to the 3′ LTR that serve as the priming site for the synthesis of the second strand of DNA synthesis, and specific sequences near the ends of the LTRs that enable the insertion of the DNA state of the retrovirus to insert into the host genome. The removal of the gag, pol, and env genes allows for about 8 kb of foreign sequence to be inserted into the viral genome, become reverse transcribed, and upon replication be packaged into a new retroviral particle. This amount of nucleic acid is sufficient for the delivery of a one to many genes depending on the size of each transcript. It is preferable to include either positive or negative selectable markers along with other genes in the insert.


Since the replication machinery and packaging proteins in most retroviral vectors have been removed (gag, pol, and env), the vectors are typically generated by placing them into a packaging cell line. A packaging cell line is a cell line which has been transfected or transformed with a retrovirus that contains the replication and packaging machinery, but lacks any packaging signal. When the vector carrying the DNA of choice is transfected into these cell lines, the vector containing the gene of interest is replicated and packaged into new retroviral particles, by the machinery provided in cis by the helper cell. The genomes for the machinery are not packaged because they lack the necessary signals.


ii. Adenoviral Vectors


The construction of replication-defective adenoviruses has been described (Berkner et al., J. Virology 61:1213-1220 (1987); Massie et al., Mol. Cell. Biol. 6:2872-2883 (1986); Haj-Ahmad et al., J. Virology 57:267-274 (1986); Davidson et al., J. Virology 61:1226-1239 (1987); Zhang “Generation and identification of recombinant adenovirus by liposome-mediated transfection and PCR analysis” BioTechniques 15:868-872 (1993)). The benefit of the use of these viruses as vectors is that they are limited in the extent to which they can spread to other cell types, since they can replicate within an initial infected cell, but are unable to form new infectious viral particles. Recombinant adenoviruses have been shown to achieve high efficiency gene transfer after direct, in vivo delivery to airway epithelium, hepatocytes, vascular endothelium, CNS parenchyma and a number of other tissue sites (Morsy, J. Clin. Invest. 92:1580-1586 (1993);.Kirshenbaum, J. Clin. Invest. 92:381-387 (1993); Roessler, J. Clin. Invest. 92:1085-1092 (1993); Moullier, Nature Genetics 4:154-159 (1993); La Salle, Science 259:988-990 (1993); Gomez-Foix, J. Biol. Chem. 267:25129-25134 (1992); Rich, Human Gene Therapy 4:461-476 (1993); Zabner, Nature Genetics 6:75-83 (1994); Guzman, Circulation Research 73:1201-1207 (1993); Bout, Human Gene Therapy 5:3-10 (1994); Zabner, Cell 75:207-216 (1993); Caillaud, Eur. J. Neuroscience 5:1287-1291 (1993); and Ragot, J. Gen. Virology 74:501-507 (1993)). Recombinant adenoviruses achieve gene transduction by binding to specific cell surface receptors, after which the virus is internalized by receptor-mediated endocytosis, in the same manner as wild type or replication-defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970); Brown and Burlingham, J. Virology 12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449 (1985); Seth, et al., J. Virol. 51:650-655 (1984); Seth, et al., Mol. Cell. Biol. 4:1528-1533 (1984); Varga et al., J. Virology 65:6061-6070 (1991); Wickham et al., Cell 73:309-319 (1993)).


A preferred viral vector is one based on an adenovirus which has had the E1 gene removed and these virons are generated in a cell line such as the human 293 cell line. In another preferred embodiment both the E1 and E3 genes are removed from the adenovirus genome.


Another type of viral vector is based on an adeno-associated virus (AAV). This defective parvovirus is a preferred vector because it can infect many cell types and is nonpathogenic to humans. AAV type vectors can transport about 4 to 5 kb and wild type AAV is known to stably insert into chromosome 19. Vectors which contain this site specific integration property are preferred. An especially preferred embodiment of this type of vector is the P4.1 C vector produced by Avigen, San Francisco, Calif., which can contain the herpes simplex virus thymidine kinase gene, HSV-tk, and/or a marker gene, such as the gene encoding the green fluorescent protein, GFP.


The inserted genes in viral and retroviral usually contain promoters, and/or enhancers to help control the expression of the desired gene product. A promoter is generally a sequence or sequences of DNA that function when in a relatively fixed location in regard to the transcription start site. A promoter contains core elements required for basic interaction of RNA polymerase and transcription factors, and can contain upstream elements and response elements.


2. Viral Promoters and Enhancers


Preferred promoters controlling transcription from vectors in mammalian host cells can be obtained from various sources, for example, the genomes of viruses such as: polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably cytomegalovirus, or from heterologous mammalian promoters, e.g. beta actin promoter. The early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction fragment which also contains the SV40 viral origin of replication (Fiers et al., Nature, 273: 113 (1978)). The immediate early promoter of the human cytomegalovirus is conveniently obtained as a HindIII E restriction fragment (Greenway, P. J. et al., Gene 18: 355-360 (1982)). Of course, promoters from the host cell or related species also are useful herein.


Enhancer generally refers to a sequence of DNA that functions at no fixed distance from the transcription start site and can be either 5′ (Laimins, L. et al., Proc. Natl. Acad. Sci. 78: 993 (1981)) or 3′ (Lusky, M. L., et al., Mol. Cell. Bio. 3: 1108 (1983)) to the transcription unit. Furthermore, enhancers can be within an intron (Banerji, J. L. et al., Cell 33: 729 (1983)) as well as within the coding sequence itself (Osborne, T. F., et al., Mol. Cell. Bio. 4: 1293 (1984)). They are usually between 10 and 300 bp in length, and they function in cis. Enhancers function to increase transcription from nearby promoters. Enhancers also often contain response elements that mediate the regulation of transcription. Promoters can also contain response elements that mediate the regulation of transcription. Enhancers often determine the regulation of expression of a gene. While many enhancer sequences are now known from mammalian genes (globin, elastase, albumin, α-fetoprotein and insulin), typically one will use an enhancer from a eukaryotic cell virus. Preferred examples are the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.


The promoter and/or enhancer can be specifically activated either by light or specific chemical events which trigger their function. Systems can be regulated by reagents such as tetracycline and dexamethasone. There are also ways to enhance viral vector gene expression by exposure to irradiation, such as gamma irradiation, or alkylating chemotherapy drugs.


It is preferred that the promoter and/or enhancer region be active in all eukaryotic cell types. A preferred promoter of this type is the CMV promoter (650 bases). Other preferred promoters are SV40 promoters, cytomegalovirus (full length promoter), and retroviral vector LTF.


It has been shown that all specific regulatory elements can be cloned and used to construct expression vectors that are selectively expressed in specific cell types such as melanoma cells. The glial fibrillary acetic protein (GFAP) promoter has been used to selectively express genes in cells of glial origin.


Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human or nucleated cells) can also contain sequences necessary for the termination of transcription which can affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor protein. The 3′ untranslated regions also include transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation region. One benefit of this region is that it increases the likelihood that the transcribed unit will be processed and transported like mRNA. The identification and use of polyadenylation signals in expression constructs is well established. It is preferred that homologous polyadenylation signals be used in the transgene constructs. In a preferred embodiment of the transcription unit, the polyadenylation region is derived from the SV40 early polyadenylation signal and consists of about 400 bases. It is also preferred that the transcribed units contain other standard sequences alone or in combination with the above sequences improve expression from, or stability of, the construct.


3. Markers


The vectors can include nucleic acid sequence encoding a marker product. This marker product is used to determine if the gene has been delivered to the cell and once delivered is being expressed. Preferred marker genes are the E. Coli lacZ gene which encodes β-galactosidase and green fluorescent protein.


In some embodiments the marker can be a selectable marker. Examples of suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR), thymidine kinase, neomycin, neomycin analog G418, hydromycin, and puromycin. When such selectable markers are successfully transferred into a mammalian host cell, the transformed mammalian host cell can survive if placed under selective pressure. There are two widely used distinct categories of selective regimes. The first category is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to grow independent of a supplemented media. Two examples are: CHO DHFR cells and mouse LTK cells. These cells lack the ability to grow without the addition of such nutrients as thymidine or hypoxanthine. Because these cells lack certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing nucleotides are provided in a supplemented media. An alternative to supplementing the media is to introduce an intact DHFR or TK gene into cells lacking the respective genes, thus altering their growth requirements. Individual cells which were not transformed with the DHFR or TK gene will not be capable of survival in non-supplemented media.


The second category is dominant selection which refers to a selection scheme used in any cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells which would express a protein conveying drug resistance and would survive the selection. Examples of such dominant selection use the drugs neomycin, (Southern P. and Berg, P., J. Molec. Appl. Genet. 1: 327 (1982)), mycophenolic acid, (Mulligan, R. C. and Berg, P. Science 209: 1422 (1980)) or hygromycin, (Sugden, B. et al., Mol. Cell. Biol. 5: 410-413 (1985)). The three examples employ bacterial genes under eukaryotic control to convey resistance to the appropriate drug G418 or neomycin (geneticin), xgpt (mycophenolic acid) or hygromycin, respectively. Others include the neomycin analog G418 and puramycin.


E. Biosensor Riboswitches

Also disclosed are biosensor riboswitches. Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA. An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.


F. Reporter Proteins and Peptides

For assessing activation of a riboswitch, or for biosensor riboswitches, a reporter protein or peptide can be used. The reporter protein or peptide can be encoded by the RNA the expression of which is regulated by the riboswitch. The examples describe the use of some specific reporter proteins. The use of reporter proteins and peptides is well known and can be adapted easily for use with riboswitches. The reporter proteins can be any protein or peptide that can be detected or that produces a detectable signal. Preferably, the presence of the protein or peptide can be detected using standard techniques (e.g., radioimmunoassay, radio-labeling, immunoassay, assay for enzymatic activity, absorbance, fluorescence, luminescence, and Western blot). More preferably, the level of the reporter protein is easily quantifiable using standard techniques even at low levels. Useful reporter proteins include luciferases, green fluorescent proteins and their derivatives, such as firefly luciferase (FL) from Photinus pyralis, and Renilla luciferase (RL) from Renilla reniformis.


G. Conformation Dependent Labels

Conformation dependent labels refer to all labels that produce a change in fluorescence intensity or wavelength based on a change in the form or conformation of the molecule or compound (such as a riboswitch) with which the label is associated. Examples of conformation dependent labels used in the context of probes and primers include molecular beacons, Amplifluors, FRET probes, cleavable FRET probes, TaqMan probes, scorpion primers, fluorescent triplex oligos including but not limited to triplex molecular beacons or triplex FRET probes, fluorescent water-soluble conjugated polymers, PNA probes and QPNA probes. Such labels, and, in particular, the principles of their function, can be adapted for use with riboswitches. Several types of conformation dependent labels are reviewed in Schweitzer and Kingsmore, Curr. Opin. Biotech. 12:21-27 (2001).


Stem quenched labels, a form of conformation dependent labels, are fluorescent labels positioned on a nucleic acid such that when a stem structure forms a quenching moiety is brought into proximity such that fluorescence from the label is quenched. When the stem is disrupted (such as when a riboswitch containing the label is activated), the quenching moiety is no longer in proximity to the fluorescent label and fluorescence increases. Examples of this effect can be found in molecular beacons, fluorescent triplex oligos, triplex molecular beacons, triplex FRET probes, and QPNA probes, the operational principles of which can be adapted for use with riboswitches.


Stem activated labels, a form of conformation dependent labels, are labels or pairs of labels where fluorescence is increased or altered by formation of a stem structure. Stem activated labels can include an acceptor fluorescent label and a donor moiety such that, when the acceptor and donor are in proximity (when the nucleic acid strands containing the labels form a stem structure), fluorescence resonance energy transfer from the donor to the acceptor causes the acceptor to fluoresce. Stem activated labels are typically pairs of labels positioned on nucleic acid molecules (such as riboswitches) such that the acceptor and donor are brought into proximity when a stem structure is formed in the nucleic acid molecule. If the donor moiety of a stem activated label is itself a fluorescent label, it can release energy as fluorescence (typically at a different wavelength than the fluorescence of the acceptor) when not in proximity to an acceptor (that is, when a stem structure is not formed). When the stem structure forms, the overall effect would then be a reduction of donor fluorescence and an increase in acceptor fluorescence. FRET probes are an example of the use of stem activated labels, the operational principles of which can be adapted for use with riboswitches.


H. Detection Labels

To aid in detection and quantitation of riboswitch activation, deactivation or blocking, or expression of nucleic acids or protein produced upon activation, deactivation or blocking of riboswitches, detection labels can be incorporated into detection probes or detection molecules or directly incorporated into expressed nucleic acids or proteins. As used herein, a detection label is any molecule that can be associated with nucleic acid or protein, directly or indirectly, and which results in a measurable, detectable signal, either directly or indirectly. Many such labels are known to those of skill in the art. Examples of detection labels suitable for use in the disclosed method are radioactive isotopes, fluorescent molecules, phosphorescent molecules, enzymes, antibodies, and ligands.


Examples of suitable fluorescent labels include fluorescein isothiocyanate (FITC), 5,6-carboxymethyl fluorescein, Texas red, nitrobenz-2-oxa-1,3-diazol-4-yl (NBD), coumarin, dansyl chloride, rhodamine, amino-methyl coumarin (AMCA), Eosin, Erythrosin, BODIPY®, Cascade Blue®, Oregon Greene, pyrene, lissamine, xanthenes, acridines, oxazines, phycoerythrin, macrocyclic chelates of lanthanide ions such as Quantum Dye™, fluorescent energy transfer dyes, such as thiazole orange-ethidium heterodimer, and the cyanine dyes Cy3, Cy3.5, Cy5, Cy5.5 and Cy7. Examples of other specific fluorescent labels include 3-Hydroxypyrene 5,8,10-Tri Sulfonic acid, 5-Hydroxy Tryptamine (5-HT), Acid Fuchsin, Alizarin Complexon, Alizarin Red, Allophycocyanin, Aminocoumarin, Anthroyl Stearate, Astrazon Brilliant Red 4G, Astrazon Orange R, Astrazon Red 6B, Astrazon Yellow 7 GLL, Atabrine, Auramine, Aurophosphine, Aurophosphine G, BAO 9 (Bisaminophenyloxadiazole), BCECF, Berberine Sulphate, Bisbenzamide, Blancophor FFG Solution, Blancophor SV, Bodipy F1, Brilliant Sulphoflavin FF, Calcien Blue, Calcium Green, Calcofluor RW Solution, Calcofluor White, Calcophor White ABT Solution, Calcophor White Standard Solution, Carbostyryl, Cascade Yellow, Catecholamine, Chinacrine, Coriphosphine O, Coumarin-Phalloidin, CY3.18, CY5.18, CY7, Dans (1-Dimethyl Amino Naphaline 5 Sulphonic Acid), Dansa (Diamino Naphtyl Sulphonic Acid), Dansyl NH—CH3, Diamino Phenyl Oxydiazole (DAO), Dimethylamino-5-Sulphonic acid, Dipyrrometheneboron Difluoride, Diphenyl Brilliant Flavine 7GFF, Dopamine, Erythrosin ITC, Euchrysin, FIF (Formaldehyde Induced Fluorescence), Flazo Orange, Fluo 3, Fluorescamine, Fura-2, Genacryl Brilliant Red B, Genacryl Brilliant Yellow 10GF, Genacryl Pink 3G, Genacryl Yellow 5GF, Gloxalic Acid, Granular Blue, Haematoporphyrin, Indo-1, Intrawhite Cf Liquid, Leucophor PAF, Leucophor SF, Leucophor WS, Lissamine Rhodamine B200 (RD200), Lucifer Yellow CH, Lucifer Yellow VS, Magdala Red, Marina Blue, Maxilon Brilliant Flavin 10 GFF, Maxilon Brilliant Flavin 8 GFF, MPS (Methyl Green Pyronine Stilbene), Mithramycin, NBD Amine, Nitrobenzoxadidole, Noradrenaline, Nuclear Fast Red, Nuclear Yellow, Nylosan Brilliant Flavin E8G, Oxadiazole, Pacific Blue, Pararosaniline (Feulgen), Phorwite AR Solution, Phorwite BKL, Phorwite Rev, Phorwite RPA, Phosphine 3R, Phthalocyanine, Phycoerythrin R, Polyazaindacene Pontochrome Blue Black, Porphyrin, Primuline, Procion Yellow, Pyronine, Pyronine B, Pyrozal Brilliant Flavin 7GF, Quinacrine Mustard, Rhodamine 123, Rhodamine 5 GLD, Rhodamine 6G, Rhodamine B, Rhodamine B 200, Rhodamine B Extra, Rhodamine BB, Rhodamine BG, Rhodamine WT, Serotonin, Sevron Brilliant Red 2B, Sevron Brilliant Red 4G, Sevron Brilliant Red B, Sevron Orange, Sevron Yellow L, SITS (Primuline), SITS (Stilbene Isothiosulphonic acid), Stilbene, Snarf 1, sulpho Rhodamine B Can C, Sulpho Rhodamine G Extra, Tetracycline, Thiazine Red R, Thioflavin S, Thioflavin TCN, Thioflavin 5, Thiolyte, Thiozol Orange, Tinopol CBS, True Blue, Ultralite, Uranine B, Uvitex SFC, Xylene Orange, and XRITC.


Useful fluorescent labels are fluorescein (5-carboxyfluorescein-N-hydroxysuccinimide ester), rhodamine (5,6-tetramethyl rhodamine), and the cyanine dyes Cy3, Cy3.5, Cy5, Cy5.5 and Cy7. The absorption and emission maxima, respectively, for these fluors are: FITC (490 nm; 520 nm), Cy3 (554 nm; 568 nm), Cy3.5 (581 nm; 588 nm), Cy5 (652 nm: 672 nm), Cy5.5 (682 nm; 703 nm) and Cy7 (755 nm n; 778 nm), thus allowing their simultaneous detection. Other examples of fluorescein dyes include 6-carboxyfluorescein (6-FAM), 2′,4′,1,4,-tetrachlorofluorescein (TET), 2′,4′,5′,7′,1,4-hexachlorofluorescein (HEX), 2′,7′-dimethoxy-4′,5′-dichloro-6-carboxyrhodamine (JOE), 2′-chloro-5′-fluoro-7′,8′-fused phenyl-1,4-dichloro-6-carboxyfluorescein (NED), and 2′-chloro-7′-phenyl-1,4-dichloro-6-carboxyfluorescein (VIC). Fluorescent labels can be obtained from a variety of commercial sources, including Amersham Pharmacia Biotech, Piscataway, N.J.; Molecular Probes, Eugene, Oreg.; and Research Organics, Cleveland, Ohio.


Additional labels of interest include those that provide for signal only when the probe with which they are associated is specifically bound to a target molecule, where such labels include: “molecular beacons” as described in Tyagi & Kramer, Nature Biotechnology (1996) 14:303 and EP 0 070 685 B1. Other labels of interest include those described in U.S. Pat. No. 5,563,037; WO 97/17471 and WO 97/17076.


Labeled nucleotides are a useful form of detection label for direct incorporation into expressed nucleic acids during synthesis. Examples of detection labels that can be incorporated into nucleic acids include nucleotide analogs such as BrdUrd (5-bromodeoxyuridine, Hoy and Schimke, Mutation Research 290:217-230 (1993)), aminoallyldeoxyuridine (Henegariu et al., Nature Biotechnology 18:345-348 (2000)), 5-methylcytosine (Sano et al., Biochim. Biophys. Acta 951:157-165 (1988)), bromouridine (Wansick et al., J. Cell Biology 122:283-293 (1993)) and nucleotides modified with biotin (Langer et al., Proc. Natl. Acad. Sci. USA 78:6633 (1981)) or with suitable haptens such as digoxygenin (Kerkhof, Anal. Biochem. 205:359-364 (1992)). Suitable fluorescence-labeled nucleotides are Fluorescein-isothiocyanate-dUTP, Cyanine-3-dUTP and Cyanine-5-dUTP (Yu et al., Nucleic Acids Res., 22:3226-3232 (1994)). A preferred nucleotide analog detection label for DNA is BrdUrd (bromodeoxyuridine, BrdUrd, BrdU, BUdR, Sigma-Aldrich Co). Other useful nucleotide analogs for incorporation of detection label into DNA are AA-dUTP (aminoallyl-deoxyuridine triphosphate, Sigma-Aldrich Co.), and 5-methyl-dCTP (Roche Molecular Biochemicals). A useful nucleotide analog for incorporation of detection label into RNA is biotin-16-UTP (biotin-16-uridine-5′-triphosphate, Roche Molecular Biochemicals). Fluorescein, Cy3, and Cy5 can be linked to dUTP for direct labeling. Cy3.5 and Cy7 are available as avidin or anti-digoxygenin conjugates for secondary detection of biotin- or digoxygenin-labeled probes.


Detection labels that are incorporated into nucleic acid, such as biotin, can be subsequently detected using sensitive methods well-known in the art. For example, biotin can be detected using streptavidin-alkaline phosphatase conjugate (Tropix, Inc.), which is bound to the biotin and subsequently detected by chemiluminescence of suitable substrates (for example, chemiluminescent substrate CSPD: disodium, 3-(4-methoxyspiro-[1,2,-dioxetane-3-2′-(5′-chloro)tricyclo [3.3.1.13,7]decane]-4-yl)phenyl phosphate; Tropix, Inc.). Labels can also be enzymes, such as alkaline phosphatase, soybean peroxidase, horseradish peroxidase and polymerases, that can be detected, for example, with chemical signal amplification or by using a substrate to the enzyme which produces light (for example, a chemiluminescent 1,2-dioxetane substrate) or fluorescent signal.


Molecules that combine two or more of these detection labels are also considered detection labels. Any of the known detection labels can be used with the disclosed probes, tags, molecules and methods to label and detect activated or deactivated riboswitches or nucleic acid or protein produced in the disclosed methods. Methods for detecting and measuring signals generated by detection labels are also known to those of skill in the art. For example, radioactive isotopes can be detected by scintillation counting or direct visualization; fluorescent molecules can be detected with fluorescent spectrophotometers; phosphorescent molecules can be detected with a spectrophotometer or directly visualized with a camera; enzymes can be detected by detection or visualization of the product of a reaction catalyzed by the enzyme; antibodies can be detected by detecting a secondary detection label coupled to the antibody. As used herein, detection molecules are molecules which interact with a compound or composition to be detected and to which one or more detection labels are coupled.


I. Sequence Similarities

It is understood that as discussed herein the use of the terms homology and identity mean the same thing as similarity. Thus, for example, if the use of the word homology is used between two sequences (non-natural sequences, for example) it is understood that this is not necessarily indicating an evolutionary relationship between these two sequences, but rather is looking at the similarity or relatedness between their nucleic acid sequences. Many of the methods for determining homology between two evolutionarily related molecules are routinely applied to any two or more nucleic acids or proteins for the purpose of measuring sequence similarity regardless of whether they are evolutionarily related or not.


In general, it is understood that one way to define any known variants and derivatives or those that might arise, of the disclosed riboswitches, aptamers, expression platforms, genes and proteins herein, is through defining the variants and derivatives in terms of homology to specific known sequences. This identity of particular sequences disclosed herein is also discussed elsewhere herein. In general, variants of riboswitches, aptamers, expression platforms, genes and proteins herein disclosed typically have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to a stated sequence or a native sequence. Those of skill in the art readily understand how to determine the homology of two proteins or nucleic acids, such as genes. For example, the homology can be calculated after aligning the two sequences so that the homology is at its highest level.


Another way of calculating homology can be performed by published algorithms. Optimal alignment of sequences for comparison can be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by inspection.


The same types of homology can be obtained for nucleic acids by for example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which are herein incorporated by reference for at least material related to nucleic acid alignment. It is understood that any of the methods typically can be used and that in certain instances the results of these various methods can differ, but the skilled artisan understands if identity is found with at least one of these methods, the sequences would be said to have the stated identity.


For example, as used herein, a sequence recited as having a particular percent homology to another sequence refers to sequences that have the recited homology as calculated by any one or more of the calculation methods described above. For example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using the Zuker calculation method even if the first sequence does not have 80 percent homology to the second sequence as calculated by any of the other calculation methods. As another example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using both the Zuker calculation method and the Pearson and Lipman calculation method even if the first sequence does not have 80 percent homology to the second sequence as calculated by the Smith and Waterman calculation method, the Needleman and Wunsch calculation method, the Jaeger calculation methods, or any of the other calculation methods. As yet another example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using each of calculation methods (although, in practice, the different calculation methods will often result in different calculated homology percentages).


J. Hybridization and Selective Hybridization

The term hybridization typically means a sequence driven interaction between at least two nucleic acid molecules, such as a primer or a probe and a riboswitch or a gene. Sequence driven interaction means an interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives in a nucleotide specific manner. For example, G interacting with C or A interacting with T are sequence driven interactions. Typically sequence driven interactions occur on the Watson-Crick face or Hoogsteen face of the nucleotide. The hybridization of two nucleic acids is affected by a number of conditions and parameters known to those of skill in the art. For example, the salt concentrations, pH, and temperature of the reaction all affect whether two nucleic acid molecules will hybridize.


Parameters for selective hybridization between two nucleic acid molecules are well known to those of skill in the art. For example, in some embodiments selective hybridization conditions can be defined as stringent hybridization conditions. For example, stringency of hybridization is controlled by both temperature and salt concentration of either or both of the hybridization and washing steps. For example, the conditions of hybridization to achieve selective hybridization can involve hybridization in high ionic strength solution (6×SSC or 6×SSPE) at a temperature that is about 12-25° C. below the Tm (the melting temperature at which half of the molecules dissociate from their hybridization partners) followed by washing at a combination of temperature and salt concentration chosen so that the washing temperature is about 5° C. to 20° C. below the Tm. The temperature and salt conditions are readily determined empirically in preliminary experiments in which samples of reference DNA immobilized on filters are hybridized to a labeled nucleic acid of interest and then washed under conditions of different stringencies. Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA hybridizations. The conditions can be used as described above to achieve stringency, or as is known in the art (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987 which is herein incorporated by reference for material at least related to hybridization of nucleic acids). A preferable stringent hybridization condition for a DNA:DNA hybridization can be at about 68° C. (in aqueous solution) in 6×SSC or 6×SSPE followed by washing at 68° C. Stringency of hybridization and washing, if desired, can be reduced accordingly as the degree of complementarity desired is decreased, and further, depending upon the G-C or A-T richness of any area wherein variability is searched for. Likewise, stringency of hybridization and washing, if desired, can be increased accordingly as homology desired is increased, and further, depending upon the G-C or A-T richness of any area wherein high homology is desired, all as known in the art.


Another way to define selective hybridization is by looking at the amount (percentage) of one of the nucleic acids bound to the other nucleic acid. For example, in some embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the limiting nucleic acid is bound to the non-limiting nucleic acid. Typically, the non-limiting nucleic acid is in for example, 10 or 100 or 1000 fold excess. This type of assay can be performed at under conditions where both the limiting and non-limiting nucleic acids are for example, 10 fold or 100 fold or 1000 fold below their kd, or where only one of the nucleic acid molecules is 10 fold or 100 fold or 1000 fold or where one or both nucleic acid molecules are above their kd.


Another way to define selective hybridization is by looking at the percentage of nucleic acid that gets enzymatically manipulated under conditions where hybridization is required to promote the desired enzymatic manipulation. For example, in some embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the nucleic acid is enzymatically manipulated under conditions which promote the enzymatic manipulation, for example if the enzymatic manipulation is DNA extension, then selective hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the nucleic acid molecules are extended. Preferred conditions also include those suggested by the manufacturer or indicated in the art as being appropriate for the enzyme performing the manipulation.


Just as with homology, it is understood that there are a variety of methods herein disclosed for determining the level of hybridization between two nucleic acid molecules. It is understood that these methods and conditions can provide different percentages of hybridization between two nucleic acid molecules, but unless otherwise indicated meeting the parameters of any of the methods would be sufficient. For example if 80% hybridization was required and as long as hybridization occurs within the required parameters in any one of these methods it is considered disclosed herein.


It is understood that those of skill in the art understand that if a composition or method meets any one of these criteria for determining hybridization either collectively or singly it is a composition or method that is disclosed herein.


K. Nucleic Acids

There are a variety of molecules disclosed herein that are nucleic acid based, including, for example, riboswitches, aptamers, and nucleic acids that encode riboswitches and aptamers. The disclosed nucleic acids can be made up of for example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and other molecules are discussed herein. It is understood that for example, when a vector is expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U. Likewise, it is understood that if a nucleic acid molecule is introduced into a cell or cell environment through for example exogenous delivery, it is advantageous that the nucleic acid molecule be made up of nucleotide analogs that reduce the degradation of the nucleic acid molecule in the cellular environment.


So long as their relevant function is maintained, riboswitches, aptamers, expression platforms and any other oligonucleotides and nucleic acids can be made up of or include modified nucleotides (nucleotide analogs). Many modified nucleotides are known and can be used in oligonucleotides and nucleic acids. A nucleotide analog is a nucleotide which contains some type of modification to either the base, sugar, or phosphate moieties. Modifications to the base moiety would include natural and synthetic modifications of A, C, G, and T/U as well as different purine or pyrimidine bases, such as uracil-5-yl, hypoxanthin-9-yl (I), and 2-aminoadenin-9-yl. A modified base includes but is not limited to 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Additional base modifications can be found for example in U.S. Pat. No. 3,687,808, Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, Crooke, S. T. and Lebleu, B. ed., CRC Press, 1993. Certain nucleotide analogs, such as 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5-propynylcytosine. 5-methylcytosine can increase the stability of duplex formation. Other modified bases are those that function as universal bases. Universal bases include 3-nitropyrrole and 5-nitroindole. Universal bases substitute for the normal bases but have no bias in base pairing. That is, universal bases can base pair with any other base. Base modifications often can be combined with for example a sugar modification, such as 2′-O-methoxyethyl, to achieve unique properties such as increased duplex stability. There are numerous United States patents such as U.S. Pat. Nos. 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; and 5,681,941, which detail and describe a range of base modifications. Each of these patents is herein incorporated by reference in its entirety, and specifically for their description of base modifications, their synthesis, their use, and their incorporation into oligonucleotides and nucleic acids.


Nucleotide analogs can also include modifications of the sugar moiety. Modifications to the sugar moiety would include natural modifications of the ribose and deoxyribose as well as synthetic modifications. Sugar modifications include but are not limited to the following modifications at the 2′ position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl can be substituted or unsubstituted C1 to C10, alkyl or C2 to C10 alkenyl and alkynyl. 2′ sugar modifications also include but are not limited to —O[(CH2)nO]mCH3, —O(CH2)nOCH3, —O(CH2)nNH2, —O(CH2)nCH3, —O(CH2)n-ONH2, and —O(CH2)nON[(CH2)nCH3)]2, where n and m are from 1 to about 10.


Other modifications at the 2′ position include but are not limited to: C1 to C10 lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, OCN, Cl, Br, CN, CF3, OCF3, SOCH3, SO2 CH3, ONO2, NO2, N3, NH2, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. Similar modifications can also be made at other positions on the sugar, particularly the 3′ position of the sugar on the 3′ terminal nucleotide or in 2′-5′ linked oligonucleotides and the 5′ position of 5′ terminal nucleotide. Modified sugars would also include those that contain modifications at the bridging ring oxygen, such as CH2 and S. Nucleotide sugar analogs can also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. There are numerous United States patents that teach the preparation of such modified sugar structures such as U.S. Pat. Nos. 4,981,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 5,658,873; 5,670,633; and 5,700,920, each of which is herein incorporated by reference in its entirety, and specifically for their description of modified sugar structures, their synthesis, their use, and their incorporation into nucleotides, oligonucleotides and nucleic acids.


Nucleotide analogs can also be modified at the phosphate moiety. Modified phosphate moieties include but are not limited to those that can be modified so that the linkage between two nucleotides contains a phosphorothioate, chiral phosphorothioate, phosphorodithioate, phosphotriester, aminoalkylphosphotriester, methyl and other alkyl phosphonates including 3′-alkylene phosphonate and chiral phosphonates, phosphinates, phosphoramidates including 3′-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates. It is understood that these phosphate or modified phosphate linkages between two nucleotides can be through a 3′-5′ linkage or a 2′-5′ linkage, and the linkage can contain inverted polarity such as 3′-5′ to 5′-3′ or 2′-5′ to 5′-2′. Various salts, mixed salts and free acid forms are also included. Numerous United States patents teach how to make and use nucleotides containing modified phosphates and include but are not limited to, U.S. Pat. Nos. 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; and 5,625,050, each of which is herein incorporated by reference its entirety, and specifically for their description of modified phosphates, their synthesis, their use, and their incorporation into nucleotides, oligonucleotides and nucleic acids.


It is understood that nucleotide analogs need only contain a single modification, but can also contain multiple modifications within one of the moieties or between different moieties.


Nucleotide substitutes are molecules having similar functional properties to nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). Nucleotide substitutes are molecules that will recognize and hybridize to (base pair to) complementary nucleic acids in a Watson-Crick or Hoogsteen manner, but which are linked together through a moiety other than a phosphate moiety. Nucleotide substitutes are able to conform to a double helix type structure when interacting with the appropriate target nucleic acid.


Nucleotide substitutes are nucleotides or nucleotide analogs that have had the phosphate moiety and/or sugar moieties replaced. Nucleotide substitutes do not contain a standard phosphorus atom. Substitutes for the phosphate can be for example, short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts. Numerous United States patents disclose how to make and use these types of phosphate replacements and include but are not limited to U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which is herein incorporated by reference its entirety, and specifically for their description of phosphate replacements, their synthesis, their use, and their incorporation into nucleotides, oligonucleotides and nucleic acids.


It is also understood in a nucleotide substitute that both the sugar and the phosphate moieties of the nucleotide can be replaced, by for example an amide type linkage (aminoethylglycine) (PNA). U.S. Pat. Nos. 5,539,082; 5,714,331; and 5,719,262 teach how to make and use PNA molecules, each of which is herein incorporated by reference. (See also Nielsen et al., Science 254:1497-1500 (1991)).


Oligonucleotides and nucleic acids can be comprised of nucleotides and can be made up of different types of nucleotides or the same type of nucleotides. For example, one or more of the nucleotides in an oligonucleotide can be ribonucleotides, 2′-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2′-O-methyl ribonucleotides; about 10% to about 50% of the nucleotides can be ribonucleotides, 2′-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2′-O-methyl ribonucleotides; about 50% or more of the nucleotides can be ribonucleotides, 2′-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2′-O-methyl ribonucleotides; or all of the nucleotides are ribonucleotides, 2′-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2′-O-methyl ribonucleotides. Such oligonucleotides and nucleic acids can be referred to as chimeric oligonucleotides and chimeric nucleic acids.


L. Solid Supports

Solid supports are solid-state substrates or supports with which molecules (such as trigger molecules) and riboswitches (or other components used in, or produced by, the disclosed methods) can be associated. Riboswitches and other molecules can be associated with solid supports directly or indirectly. For example, analytes (e.g., trigger molecules, test compounds) can be bound to the surface of a solid support or associated with capture agents (e.g., compounds or molecules that bind an analyte) immobilized on solid supports. As another example, riboswitches can be bound to the surface of a solid support or associated with probes immobilized on solid supports. An array is a solid support to which multiple riboswitches, probes or other molecules have been associated in an array, grid, or other organized pattern.


Solid-state substrates for use in solid supports can include any solid material with which components can be associated, directly or indirectly. This includes materials such as acrylamide, agarose, cellulose, nitrocellulose, glass, gold, polystyrene, polyethylene vinyl acetate, polypropylene, polymethacrylate, polyethylene, polyethylene oxide, polysilicates, polycarbonates, teflon, fluorocarbons, nylon, silicon rubber, polyanhydrides, polyglycolic acid, polylactic acid, polyorthoesters, functionalized silane, polypropylfumerate, collagen, glycosaminoglycans, and polyamino acids. Solid-state substrates can have any useful form including thin film, membrane, bottles, dishes, fibers, woven fibers, shaped polymers, particles, beads, microparticles, or a combination. Solid-state substrates and solid supports can be porous or non-porous. A chip is a rectangular or square small piece of material. Preferred forms for solid-state substrates are thin films, beads, or chips. A useful form for a solid-state substrate is a microtiter dish. In some embodiments, a multiwell glass slide can be employed.


An array can include a plurality of riboswitches, trigger molecules, other molecules, compounds or probes immobilized at identified or predefined locations on the solid support. Each predefined location on the solid support generally has one type of component (that is, all the components at that location are the same). Alternatively, multiple types of components can be immobilized in the same predefined location on a solid support. Each location will have multiple copies of the given components. The spatial separation of different components on the solid support allows separate detection and identification.


Although useful, it is not required that the solid support be a single unit or structure. A set of riboswitches, trigger molecules, other molecules, compounds and/or probes can be distributed over any number of solid supports. For example, at one extreme, each component can be immobilized in a separate reaction tube or container, or on separate beads or microparticles.


Methods for immobilization of oligonucleotides to solid-state substrates are well established. Oligonucleotides, including address probes and detection probes, can be coupled to substrates using established coupling methods. For example, suitable attachment methods are described by Pease et al., Proc. Natl. Acad. Sci. USA 91(11):5022-5026 (1994), and Khrapko et al., Mol Biol (Mosk) (USSR) 25:718-730 (1991). A method for immobilization of 3′-amine oligonucleotides on casein-coated slides is described by Stimpson et al., Proc. Natl. Acad. Sci. USA 92:6379-6383 (1995). A useful method of attaching oligonucleotides to solid-state substrates is described by Guo et al., Nucleic Acids Res. 22:5456-5465 (1994).


Each of the components (for example, riboswitches, trigger molecules, or other molecules) immobilized on the solid support can be located in a different predefined region of the solid support. The different locations can be different reaction chambers. Each of the different predefined regions can be physically separated from each other of the different regions. The distance between the different predefined regions of the solid support can be either fixed or variable. For example, in an array, each of the components can be arranged at fixed distances from each other, while components associated with beads will not be in a fixed spatial relationship. In particular, the use of multiple solid support units (for example, multiple beads) will result in variable distances.


Components can be associated or immobilized on a solid support at any density. Components can be immobilized to the solid support at a density exceeding 400 different components per cubic centimeter. Arrays of components can have any number of components. For example, an array can have at least 1,000 different components immobilized on the solid support, at least 10,000 different components immobilized on the solid support, at least 100,000 different components immobilized on the solid support, or at least 1,000,000 different components immobilized on the solid support.


M. Kits

The materials described above as well as other materials can be packaged together in any suitable combination as a kit useful for performing, or aiding in the performance of, the disclosed method. It is useful if the kit components in a given kit are designed and adapted for use together in the disclosed method. For example disclosed are kits for detecting compounds, the kit comprising one or more biosensor riboswitches. The kits also can contain reagents and labels for detecting activation of the riboswitches.


N. Mixtures

Disclosed are mixtures formed by performing or preparing to perform the disclosed method. For example, disclosed are mixtures comprising riboswitches and trigger molecules.


Whenever the method involves mixing or bringing into contact compositions or components or reagents, performing the method creates a number of different mixtures. For example, if the method includes 3 mixing steps, after each one of these steps a unique mixture is formed if the steps are performed separately. In addition, a mixture is formed at the completion of all of the steps regardless of how the steps were performed. The present disclosure contemplates these mixtures, obtained by the performance of the disclosed methods as well as mixtures containing any disclosed reagent, composition, or component, for example, disclosed herein.


O. Systems

Disclosed are systems useful for performing, or aiding in the performance of, the disclosed method. Systems generally comprise combinations of articles of manufacture such as structures, machines, devices, and the like, and compositions, compounds, materials, and the like. Such combinations that are disclosed or that are apparent from the disclosure are contemplated. For example, disclosed and contemplated are systems comprising biosensor riboswitches, a solid support and a signal-reading device.


P. Data Structures and Computer Control

Disclosed are data structures used in, generated by, or generated from, the disclosed method. Data structures generally are any form of data, information, and/or objects collected, organized, stored, and/or embodied in a composition or medium. Riboswitch structures and activation measurements stored in electronic form, such as in RAM or on a storage disk, is a type of data structure.


The disclosed method, or any part thereof or preparation therefor, can be controlled, managed, or otherwise assisted by computer control. Such computer control can be accomplished by a computer controlled process or method, can use and/or generate data structures, and can use a computer program. Such computer control, computer controlled processes, data structures, and computer programs are contemplated and should be understood to be disclosed herein.


Methods

Disclosed are methods for activating, deactivating or blocking a riboswitch. Such methods can involve, for example, bringing into contact a riboswitch and a compound or trigger molecule that can activate, deactivate or block the riboswitch. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Compounds can be used to activate, deactivate or block a riboswitch. The trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch. Compounds other than the trigger molecule generally can be used to deactivate or block a riboswitch. Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch. Thus, the disclosed method of deactivating a riboswitch can involve, for example, removing a trigger molecule (or other activating compound) from the presence or contact with the riboswitch. A riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch.


Also disclosed are methods for altering expression of an RNA molecule, or of a gene encoding an RNA molecule, where the RNA molecule includes a riboswitch, by bringing a compound into contact with the RNA molecule. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Thus, subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA. Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA. Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.


Also disclosed are methods for regulating expression of an RNA molecule, or of a gene encoding an RNA molecule, by operably linking a riboswitch to the RNA molecule. A riboswitch can be operably linked to an RNA molecule in any suitable manner, including, for example, by physically joining the riboswitch to the RNA molecule or by engineering nucleic acid encoding the RNA molecule to include and encode the riboswitch such that the RNA produced from the engineered nucleic acid has the riboswitch operably linked to the RNA molecule. Subjecting a riboswitch operably linked to an RNA molecule of interest to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA.


Also disclosed are methods for regulating expression of a naturally occurring gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene is essential for survival of a cell or organism that harbors it, activating, deactivating or blocking the riboswitch can result in death, stasis or debilitation of the cell or organism. For example, activating a naturally occurring riboswitch in a naturally occurring gene that is essential to survival of a microorganism can result in death of the microorganism (if activation of the riboswitch turns off or represses expression). This is one basis for the use of the disclosed compounds and methods for antimicrobial and antibiotic effects.


Also disclosed are methods for regulating expression of an isolated, engineered or recombinant gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. The gene or RNA can be engineered or can be recombinant in any manner. For example, the riboswitch and coding region of the RNA can be heterologous, the riboswitch can be recombinant or chimeric, or both. If the gene encodes a desired expression product, activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product. If the gene encodes an inducer or repressor of gene expression or of another cellular process, activation, deactivation or blocking of the riboswitch can result in induction, repression, or de-repression of other, regulated genes or cellular processes. Many such secondary regulatory effects are known and can be adapted for use with riboswitches. An advantage of riboswitches as the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules.


Also disclosed are methods for altering the regulation of a riboswitch by operably linking an aptamer domain to the expression platform domain of the riboswitch (which is a chimeric riboswitch). The aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain. Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain. Generally, any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch.


Also disclosed are methods for inactivating a riboswitch by covalently altering the riboswitch (by, for example, crosslinking parts of the riboswitch or coupling a compound to the riboswitch). Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule.


Also disclosed are methods for selecting, designing or deriving new riboswitches and/or new aptamers that recognize new trigger molecules. Such methods can involve production of a set of aptamer variants in a riboswitch, assessing the activation of the variant riboswitches in the presence of a compound of interest, selecting variant riboswitches that were activated (or, for example, the riboswitches that were the most highly or the most selectively activated), and repeating these steps until a variant riboswitch of a desired activity, specificity, combination of activity and specificity, or other combination of properties results. Also disclosed are riboswitches and aptamer domains produced by these methods.


Techniques for in vitro selection and in vitro evolution of functional nucleic acid molecules are known and can be adapted for use with riboswitches and their components. Useful techniques are described by, for example, A. Roth and R. R. Breaker (2003) Selection in vitro of allosteric ribozymes. In: Methods in Molecular Biology Series—Catalytic Nucleic Acid Protocols (Sioud, M., ed.), Humana, Totowa, N.J.; R. R. Breaker (2002) Engineered Allosteric Ribozymes as Biosensor Components. Curr. Opin. Biotechnol. 13:31-39; G. M. Emilsson and R. R. Breaker (2002) Deoxyribozyrnes: New Activities and New Applications. Cell. Mol. Life. Sci. 59:596-607; Y. Li, R. R. Breaker (2001) In vitro Selection of Kinase and Ligase Deoxyribozymes. Methods 23:179-190; G. A. Soukup, R. R. Breaker (2000) Allosteric Ribozymes. In: Ribozymes: Biology and Biotechnology. R. K. Gaur and G. Krupp eds. Eaton Publishing; G. A. Soukup, R. R. Breaker (2000) Allosteric Nucleic Acid Catalysts. Curr. Opin. Struct. Biol. 10:318-325; G. A. Soukup, R. R. Breaker (1999) Nucleic Acid Molecular Switches. Trends Biotechnol. 17:469-476; R. R. Breaker (1999) In vitro Selection of Self-cleaving Ribozymes and Deoxyribozymes. In: Intracellular Ribozyme Applications: Principles and Protocols. L. Couture, J. Rossi eds. Horizon Scientific Press, Norfolk, England; R. R. Breaker (1997) In vitro Selection of Catalytic Polynucleotides. Chem. Rev. 97:371-390; and references cited therein; each of these publications being specifically incorporated herein by reference for their description of in vitro selections and evolution techniques.


Also disclosed are methods for selecting and identifying compounds that can activate, deactivate or block a riboswitch. Activation of a riboswitch refers to the change in state of the riboswitch upon binding of a trigger molecule. A riboswitch can be activated by compounds other than the trigger molecule and in ways other than binding of a trigger molecule. The term trigger molecule is used herein to refer to molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch. Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques). Non-natural trigger molecules can be referred to as non-natural trigger molecules.


Also disclosed herein is a method of identifying a compound that interacts with a riboswitch comprising: modeling the atomic structure the riboswitch with a test compound; and determining if the test compound interacts with the riboswitch. Determining if the test compound interacts with the riboswitch can be accomplished by, for example, determining a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch, as described elsewhere herein. Determining if the test compound interacts with the riboswitch can be accomplished by, for example, determining one or more predicted bonds, one or more predicted interactions, or a combination, of the test compound with the model of the riboswitch. The predicted interactions can be selected from the group consisting of, for example, van der Waals interactions, hydrogen bonds, electrostatic interactions, hydrophobic interactions, or a combination, as described above. In one example, the riboswitch is a guanine riboswitch.


Atomic contacts can be determined when interaction with the riboswitch is determined, thereby determining the interaction of the test compound with the riboswitch. Analogs of the test compound can be identified, and it can be determined if the analogs of the test compound interact with the riboswitch.


Also disclosed are methods of killing or inhibiting bacteria, comprising contacting the bacteria with a compound disclosed herein or identified by the methods disclosed herein.


Also disclosed is a method of identifying a compound that interacts with a riboswitch comprising: identifying the crystal structure of the riboswitch, modeling the riboswitch with a test compound, and determining if the test compound interacts with the riboswitch. Deactivation of a riboswitch refers to the change in state of the riboswitch when the trigger molecule is not bound. A riboswitch can be deactivated by binding of compounds other than the trigger molecule and in ways other than removal of the trigger molecule. Blocking of a riboswitch refers to a condition or state of the riboswitch where the presence of the trigger molecule does not activate the riboswitch.


Also disclosed are methods of identifying compounds that activate, deactivate or block a riboswitch. For examples, compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner. For example, the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound. As another example, the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. As can be seen, assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.


In addition to the methods disclosed elsewhere herein, identification of compounds that block a riboswitch can be accomplished in any suitable manner. For example, an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation of the riboswitch.


Also disclosed are methods of detecting compounds using biosensor riboswitches. The method can include bringing into contact a test sample and a biosensor riboswitch and assessing the activation of the biosensor riboswitch. Activation of the biosensor riboswitch indicates the presence of the trigger molecule for the biosensor riboswitch in the test sample. Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA. An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.


Biosensor riboswitches can be used to monitor changing conditions because riboswitch activation is reversible when the concentration of the trigger molecule falls and so the signal can vary as concentration of the trigger molecule varies. The range of concentration of trigger molecules that can be detected can be varied by engineering riboswitches having different dissociation constants for the trigger molecule. This can easily be accomplished by, for example, “degrading” the sensitivity of a riboswitch having high affinity for the trigger molecule. A range of concentrations can be monitored by using multiple biosensor riboswitches of different sensitivities in the same sensor or assay.


Also disclosed are compounds made by identifying a compound that activates, deactivates or blocks a riboswitch and manufacturing the identified compound. This can be accomplished by, for example, combining compound identification methods as disclosed elsewhere herein with methods for manufacturing the identified compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.


Also disclosed are compounds made by checking activation, deactivation or blocking of a riboswitch by a compound and manufacturing the checked compound. This can be accomplished by, for example, combining compound activation, deactivation or blocking assessment methods as disclosed elsewhere herein with methods for manufacturing the checked compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound. Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch.


Disclosed is a method of detecting a compound of interest, the method comprising bringing into contact a sample and a riboswitch, wherein the riboswitch is activated by the compound of interest, wherein the riboswitch produces a signal when activated by the compound of interest, wherein the riboswitch produces a signal when the sample contains the compound of interest. The riboswitch can change conformation when activated by the compound of interest, wherein the change in conformation produces a signal via a conformation dependent label. The riboswitch can change conformation when activated by the compound of interest, wherein the change in conformation causes a change in expression of an RNA linked to the riboswitch, wherein the change in expression produces a signal. The signal can be produced by a reporter protein expressed from the RNA linked to the riboswitch.


Disclosed is a method comprising (a) testing a compound for inhibition of gene expression of a gene encoding an RNA comprising a riboswitch, wherein the inhibition is via the riboswitch, and (b) inhibiting gene expression by bringing into contact a cell and a compound that inhibited gene expression in step (a), wherein the cell comprises a gene encoding an RNA comprising a riboswitch, wherein the compound inhibits expression of the gene by binding to the riboswitch.


Also disclosed is a method of identifying riboswitches, the method comprising assessing in-line spontaneous cleavage of an RNA molecule in the presence and absence of a compound, wherein the RNA molecule is encoded by a gene regulated by the compound, wherein a change in the pattern of in-line spontaneous cleavage of the RNA molecule indicates a riboswitch.


A. Identification of Antimicrobial Compounds

Riboswitches are a new class of structured RNAs that have evolved for the purpose of binding small organic molecules. The natural binding pocket of riboswitches can be targeted with metabolite analogs or by compounds that mimic the shape-space of the natural metabolite. Riboswitches are: (1) found in numerous Gram-positive and Gram-negative bacteria including Bacillus anthracis, (2) fundamental regulators of gene expression in these bacteria, (3) present in multiple copies that would be unlikely to evolve simultaneous resistance, and (4) not yet proven to exist in humans. This combination of features make riboswitches attractive targets for new antimicrobial compounds. Further, the small molecule ligands of riboswitches provide useful sites for derivitization to produce drug candidates. Distribution of some riboswitches is shown in Table 1 of U.S. Application Publication No. 2005-0053951.


Once a class of riboswitch has been identified and its potential as a drug target assessed (by, for example, determining how many genes in a target organism are regulated by that class of riboswitch), candidate molecules can be identified.


It has been determined that two compounds that have been known for many years to kill bacteria function by binding to riboswitches. Specifically, the compound aminoethylcysteine (an analog of lysine) binds to lysine riboswitches of bacteria and prevents expression of genes required for lysine biosynthesis. Similarly, the compound pyrithiamine (an analog of thiamine) is di-phosphorylated by bacterial cells to yield pyrithiamine pyrophosphate. It his latter compound mimics thiamine pyrophosphate (TPP) and is toxic to bacteria because it binds to TPP riboswitches. Therefore, targeting riboswitches via application of modern drug discovery strategies can lead to the discovery of new classes of antibiotic compounds. Structure-based drug design methods can be used as disclosed herein to generate metabolite analogs that bind to riboswitches and trigger their allosteric action.


The emergence of drug-resistant stains of bacteria highlights the need for the identification of new classes of antibiotics. Anti-riboswitch drugs represent a mode of anti-bacterial action that is of considerable interest for the following reasons. Riboswitches control the expression of genes that are critical for fundamental metabolic processes. Therefore manipulation of these gene control elements with drugs yields new antibiotics. Riboswitches also carry RNA structures that have evolved to selectively bind metabolites, and therefore these RNA receptors make good drug targets as do protein enzymes and receptors. Furthermore, it has been shown that two antimicrobial compounds (discussed above) kill bacteria by deactivating the antibiotics resistance to emerge through mutation of the RNA target. There are at least 11 classes of well-conserved riboswitches in many bacteria, providing numerous drug targets.


Many organisms from both Gram-positive and Gram-negative lineages have numerous classes of riboswitches (see U.S. Application Publication No. 2005-0053951). Major disease-causing organisms such as Staphylococcus and major bioterror-related organisms such as Bacillus anthracis are both rich with riboswitches. As disclosed herein, the atomic-resolution structure model for a guanine riboswitch has been elucidated, which enables the use of structure-based design methods for creating riboswitch-binding compounds. Specifically, the model for the binding site of the guanine riboswitch shows that two channels are present that would permit ligand modification (FIG. 12A). 26 guanine analogs have been generated with chemical modifications at either the N2 or the O6 positions of guanine, and nearly all tested so far bind to the riboswitch with sub-nanomolar dissociation constants. FIG. 12B depicts the structures of guanine analogs synthesized with modified N2 and O6 positions. Most of these compounds take advantage of the molecular recognition “blind spots” in the binding site model or the aptamer domain form a B. subtilis guanine riboswitch. The successful compounds can be used as a scaffold upon which further chemical variation can be introduced to create non-toxic, bioavailable, high affinity, anti-riboswitch compounds.


The following provides an illustration of this using the SAM riboswitch (see Example 7 of U.S. Application Publication No. 2005-0053951).SAM analogs that substitute the reactive methyl and sulfonium ion center with stable sulfur-based linkages (YBD-2 and YBD3) are recognized with adequate affinity (low to mid-nanomolar range) by the riboswitch to serve as a platform for synthesis of additional SAM analogs. In addition, a wider range of linkage analogs (N- and C-based linkages) can be synthesized and tested to provide the optimal platform upon which to make amino acid and nucleoside derivations.


Sulfoxide and sulfone derivatives of SAM can be used to generate analogs. Established synthetic protocols described in Ronald T. Borchardt and Yih Shiong Wu, Potential inhibitor of S-adenosylmethionine-dependent methyltransferase. 1. Modification of the amino acid portion of S-adenosylhomocysteine. J. Med. Chem. 17, 862-868, 1974, can be used, for example. These and other analogs can be synthesized and assayed for binding sequentially or in small groups. Additional SAM analogs can be designed during the progression of compound identification based on the recognition determinants that are established in each round. Simple binding assays can be conducted on B. subtilis and B. anthracis riboswitch RNAs as described elsewhere herein. More advanced assays can also be used.


The most promising SAM analog lead compounds must enter bacterial cells and bind riboswitches while remaining metabolically inert. In addition, useful SAM analogs must be bound tightly by the riboswitch, but must also fail to compete for SAM in the active sites of protein enzymes, or there is a risk of generating an undesirable toxic effect in the patient's cells. As a preliminary assessment of these issues, compounds can be tested for their ability to disrupt B. subtilis growth, but fail to affect E. coli cultures (which use SAM but lack SAM riboswitches). To screen for lead compound candidates, parallel bacterial cultures can be grown as follows:


1. B. subtilis can be cultured in glucose minimal media in the absence of exogenously supplied SAM analogs.


2. B. subtilis can be cultured in glucose minimal media in the presence of exogenously supplied SAM analogs (high doses can be selected, to be followed by repeated experiments designed to test a concentration range of the putative drug compound).


3. E. coli can be cultured in glucose minimal media in the presence of exogenously supplied SAM analogs (high doses will be selected, to be followed by repeated experiments designed to test a concentration range of the putative drug compound).


Fitness of the various cultures can be compared by measurement of cellular doubling times. A range of concentrations for the drug compounds can be tested using cultures grown in microtiter plates and analyzed using a microplate reader from another laboratory. Culture 1 is expected to grow well. Drugs that inhibit culture 2 may or may not inhibit growth of culture 3. Drugs that similarly inhibit both culture 2 and culture 3 upon exposure to a wide range of drug concentrations can reflect general toxicity induced by the exogenous compound (i.e., inhibition of many different cellular processes, in addition or in place of riboswitch inhibition). Successful drug candidates identified in this screen will inhibit E. coli only at very high doses, if at all, and will inhibit B. subtilis at much (>10-fold) lower concentrations.


As derivation points on SAM are identified, efficient identification of lead drug compounds will require larger-scale screening of appropriate SAM analogs or generic chemical libraries. A high-throughput screen can be created by one or two different methods using nucleic acid engineering principles. Adaptation of both fluorescent sensor designs outlined below to formats that are compatible with high-throughput screening assays can be accommodated by using immobilization methods or solution-based methods.


One way to create a reporter is to add a third function to the riboswitch by adding a domain that catalyzes the release of a fluorescent tag upon SAM binding to the riboswitch domain. In the final reporter construct, this catalytic domain can be linked to the yitJ SAM riboswitch through a communication module that relays the ligand binding event by allowing the correct folding of the catalytic domain for generating the fluorescent signal. This can be accomplished as outlined below.


SAM RiboReporter Pool Design: A DNA template for in vitro transcription to RNA was constructed by PCR amplification using the appropriate DNA template and primer sequences. In this construct, stem II of the hammerhead (stem P1 of the SAM aptamer) has been randomized to present more than 250 million possible sequence combinations, wherein some inevitably will permit function of the ribozyme only when the aptamer is occupied by SAM or a related high-affinity analog. Each molecule in the population of constructs is identical in sequence except at the random domain where multiple copies of every possible combination of sequence will be represented in the population.


SAM RiboReporter Selection: The in vitro selection protocol can be a repetitive iteration of the following steps:


1. Transcribe RNA in vitro by standard methods. Include [α-32P] UTP to incorporate radioactivity throughout the RNA.


2. Purify full length RNA on denaturing PAGE by standard methods.


3. Incubate full length RNA (˜100 pmoles) in negative selection buffer containing sufficient magnesium for catalytic activity (20 mM) but no SAM. Incubate 4 h at room temperature (˜23° C.), with thermocycling or alkaline denaturation as needed to preclude the emergence of selfish molecules.


4. Purify full length RNA on denaturing PAGE and discard RNAs that react in the absence of SAM.


5. Incubate in positive selection buffer containing 20 mM Mg2+ and SAM (pH 7.5 at 23° C.). Incubate 20 min at room temperature.


6. Purify cleaved RNA on denaturing PAGE to recover switches that bound SAM and allowed self-cleavage of the RNA.


7. Reverse transcribe RNA to DNA.


8. PCR amplify DNA with primers that reintroduced cleaved portion of RNA.


The concentration of SAM in step 4 can be 100 μM initially and can be reduced as the selection proceeds. The progress of recovering successful communication modules can be assessed by the amount of cleavage observed on the purification gel in step 6. The selection endpoint can be either when the population approaches 100% cleavage in 10 nM SAM (conditions for maximal activity of the parental ribozyme and riboswitch) or when the population approaches a plateau in activity that does not improve over multiple rounds. The end population can then be sequenced. Individual communication module clones can be assayed for generation of a fluorescent signal in the screening construct in the presence of SAM.


A fluorescent signal can also be generated by riboswitch-mediated triggering of a molecular beacon. In this design, riboswitch conformational changes cause a folded molecular beacon tagged with both a fluor and a quencher to unfold and force the fluor away from the quencher by forming a helix with the riboswitch. This mechanism is easy to adapt to existing riboswitches, as this method can take advantage of the ligand-mediated formation of terminator and anti-terminator stems that are involved in transcription control.


To use riboswitches to report ligand binding by binding a molecular beacon, the appropriate construct must be determined empirically. The optimum length and nucleotide composition of the molecular beacon and its binding site on the riboswitch can be tested systematically to result in the highest signal-to-noise ratio. The validity of the assay can be determined by comparing apparent relative binding affinities of different SAM analogs to a molecular beacon-coupled riboswitch (determined by rate of fluorescent signal generation) to the binding constants determined by standard in-line probing.


SPECIFIC EMBODIMENTS

Disclosed is the atomic structure of a natural guanine-responsive riboswitch comprising an atomic structure as depicted in FIG. 6. The atomic coordinates of the atomic structure can comprise the atomic coordinates listed in Table 6 for atoms depicted in FIG. 6. The atomic coordinates of the atomic structure can comprise the atomic coordinates listed in Table 6.


Also disclosed is a method of identifying a compound that interacts with a riboswitch comprising: (a) modeling the atomic structure of claim 1 with a test compound; and (b) determining if the test compound interacts with the riboswitch.


Also disclosed is a method of identifying compounds that interact with a riboswitch comprising contacting the riboswitch with a test compound, wherein a fluorescent signal is generated upon interaction of the riboswitch with the test compound.


Also disclosed is a method of killing bacteria, comprising contacting the bacteria with an analog of a compound that interacts with the riboswitch.


Also disclosed is a method of killing bacteria, comprising contacting the bacteria with a compound that interacts with the riboswitch.


Also disclosed is a method of identifying a compound that interacts with a riboswitch comprising: (a) identifying the crystal structure of the riboswitch; (b) modeling the riboswitch with a test compound; and (c) determining if the test compound interacts with the riboswitch.


Also disclosed is a regulatable gene expression construct comprising a nucleic acid molecule encoding an RNA comprising a riboswitch operably linked to a coding region, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch regulates expression of the RNA, wherein the riboswitch and coding region are heterologous.


Also disclosed is a method of detecting a compound of interest, the method comprising bringing into contact a sample and a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch is activated by the compound of interest, wherein the riboswitch produces a signal when activated by the compound of interest, wherein the riboswitch produces a signal when the sample contains the compound of interest.


Also disclosed is a method comprising (a) testing a compound for inhibition of gene expression of a gene encoding an RNA comprising a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the inhibition is via the riboswitch, and (b) inhibiting gene expression by bringing into contact a cell and a compound that inhibited gene expression in step (a), wherein the cell comprises a gene encoding an RNA comprising the riboswitch, wherein the compound inhibits expression of the gene by binding to the riboswitch.


Determining if the test compound interacts with the riboswitch can comprise determining a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch. Determining if the test compound interacts with the riboswitch can comprise determining one or more predicted bonds, one or more predicted interactions, or a combination, of the test compound with the model of the riboswitch. The riboswitch can be a guanine riboswitch. The guanine riboswitch can be a riboswitch in Table 5. Atomic contacts can be determined by modeling the riboswitch with a test compound, thereby determining the interaction of the test compound with the riboswitch.


The method can further comprise the steps of: (c) identifying analogs of the test compound; (d) determining if the analogs of the test compound interact with the riboswitch. The compound can be hypoxanthine. A gel-based assay can be used to determine if the test compound interacts with the riboswitch. A chip-based assay can be used to determine if the test compound interacts with the riboswitch.


The test compound can interact via van der Waals interactions, hydrogen bonds, electrostatic interactions, hydrophobic interactions, or a combination. The riboswitch can comprise an RNA cleaving ribozyme. A fluorescent signal can be generated when a nucleic acid comprising a quenching moiety is cleaved. Molecular beacon technology can be employed to generate the fluorescent signal. The method can be carried out using a high throughput screen.


The riboswitch can be a guanine riboswitch. The guanine riboswitch can be a riboswitch in Table 5. The riboswitch can be activated by a trigger molecule, wherein the riboswitch produces a signal when activated by the trigger molecule. The riboswitch can change conformation when activated by a compound of interest, wherein the change in conformation produces a signal via a conformation dependent label. The riboswitch can change conformation when activated by a compound of interest, wherein the change in conformation causes a change in expression of an RNA linked to the riboswitch, wherein the change in expression produces a signal. The signal can be produced by a reporter protein expressed from the RNA linked to the riboswitch.


EXAMPLES
A. Example 1
Structure of a Natural Guanine-Responsive Riboswitch Complexed with the Metabolite Hypoxanthine

To understand how the natural biosensor functions, the structure of the guanine-binding domain bound to hypoxanthine has been solved by X-ray crystallography (Table 2 and FIG. 12A), a biologically relevant ligand of the guanine-responsive riboswitch. In the hypoxanthine-bound state, the RNA adopts a three-dimensional fold in which the terminal loops (L2 and L3) form a series of interconnecting hydrogen bonds (see pairing scheme in FIG. 5) to bring the P2 and P3 helices parallel to each other (FIG. 5C). Unfavorable electrostatic interactions, a result of the juxtaposition of regions of the ribose-phosphate backbone, are neutralized through the binding of several cations between the two backbones (FIG. 10). Anchoring the global helical arrangement of the RNA are numerous tertiary contacts around the three-way junction, dominated by the J2/3 loop (FIG. 5C) interacting with bound hypoxanthine, the P1 helix, and the J1/2 and J3/1 loops.


The purine-binding pocket is created by conserved nucleotides in and around the three-way junction element. These nucleotides help to define the purine-binding pocket through the formation of two sets of base triples above and below (FIG. 5A). The 3′ side of the pocket is flanked by a water-mediated U22-A52A73 base triple and an A23G46-C53 triple; in both cases, the Watson-Crick face of the adenosine interacts with the minor groove of a Watson-Crick pair (FIG. 6A). The other side is created by sequential base triples between conserved Watson-Crick pairs at the top of helix P1 (U20-A76 and A21-U75) and the Watson-Crick faces of U49 and C50, respectively, which fasten the J2/3 loop to the P1 helix. This extensive use of base triples to create a ligand-binding site is very similar to in vitro selected RNA aptamers that recognize planar ring systems, as exemplified by the structures of the theophylline (Zimmermann, G. R., Jenison, R. D., Wick, C. L., Simmorre, J.-P. & Pardi, A. Interlocking structural motifs mediate molecular discrimination by a theophylline-binding RNA. Nature Struct. Biol. 4, 644-649 (1997)), FMN (Fan, P., Suri, A. K., Fiala, R., Live, D. & Patel, D. J. Molecular recognition in the FMN-RNA aptamer complex. J. Mol. Biol. 258, 480-500 (1996)) and malachite green (Baugh, C., Grate, D. & Wilson, C. 2.8 Å crystal structure of the malachite green aptamer. J. Mol. Biol. 301, 117-128 (2000)) binders. Thus, artificially selected RNAs use some of the same principles for creating binding sites for small-molecule ligands as their naturally occurring counterpart.


Hypoxanthine is bound through an extensive series of hydrogen bonds with nucleotides U22, U47, U51 and C74 (FIG. 6B), forming a base quadruple that stacks directly on the P1 helix. The structure clearly shows that the mRNA contacts all of the functional groups in the ligand, thereby explaining the specificity for hypoxanthine observed in biochemical studies (Mandal, M. & Breaker, R. R. Gene regulation by riboswitches. Nature Rev. Mol. Cell. Biol. 5, 451-463 (2004)). In addition, guanine binding can be readily rationalized, because there is room in the structure to accommodate an exocyclic amine at the 2-position of the bound purine. This additional functional group can form hydrogen bonds with the carbonyl oxygens at the 2-position of C74 and U51, consistent with the tenfold higher affinity of this riboswitch for guanine over hypoxanthine.


One of the most marked features is how the ligand is almost completely enveloped by the RNA (FIG. 6C): 97.8% of the surface of hypoxanthine is inaccessible to bulk solvent in the complex. The almost complete use of a ligand for recognition by an RNA is unprecedented among structurally characterized aptamers, although selection strategies that do not involve immobilization of the ligand on a solid support (Koizumi, M., Soukup, G. A., Kerr, J. N. & Breaker, R. R. Allosteric selection of ribozymes that respond to the second messengers cGMP and cAMP. Nature Struct. Biol. 6, 1062-1071 (1999)) hold promise for the development of RNAs that are capable of a similar degree of ligand burial. This finding also implies that the local binding site must undergo a substantial conformational change upon ligand binding, because it is not possible for hypoxanthine to gain access to a preformed binding site, which is a common feature of many RNA-ligand interactions (Leulliot, N. & Varani, G. Current topics in RNA-protein recognition: control of specificity and biological function through induced fit and conformational capture. Biochemistry 40, 7947-7956 (2001); Williamson, J. R. Induced fit in RNA-protein recognition. Nature Struct. Biol. 7, 834-837 (2000)). Finally, this mode of purine recognition also easily explains its ability to change specificity from guanine to adenine through a single point mutation at nucleotide 74 from cytosine to uracil (Mandal, M. & Breaker, R. R. Adenine riboswitches and gene activation by disruption of a transcription terminator. Nature Struct. Mol. Biol. 11, 29-35 (2004)). Although the Watson-Crick pairing preference changes from guanine to adenine, the other interactions between the purine and the RNA are unchanged.


The tertiary architecture is stabilized through a unique loop-loop interaction capping helices P2 and P3 that is defined by two previously unobserved types of base quadruple. Each quadruple comprises a Watson-Crick pair with a noncanonical pair docked into its minor groove (G38-C60 and G37-C61 interacting with the A33A66 and U34A65 pairs, respectively; FIG. 7A). This arrangement bears a strong similarity to how adenosines pack into an A-form helix in the commonly found type I/II A-minor triple motif (Doherty, E. A., Batey, R. T., Masquida, B. & Doudna, J. A. A universal mode of helix packing in RNA. Nature Struct. Biol. 8, 339-343 (2001); Nissen, P., Ippolito, J. A., Ban, N., Moore, P. B. & Steitz, T. A. RNA tertiary interactions in the large ribosomal subunit: the A-minor motif. Proc. Natl. Acad. Sci. USA 98, 4899-4903 (2001)). Mutation of any one of these eight nucleotides would disrupt the intricate hydrogen-bonding network that cements the core of the loop-loop interaction, explaining their strict phylogenetic conservation. Stacked on these quadruples are two noncanonical pairs, including a side-by-side interaction between G62 and U63 akin to the A-platform motif (Cate, J. H. et al. RNA tertiary structure mediation by adenosine platforms. Science 273, 1696-1699 (1996)) and the bulged-G motif (Correll, C. C., Beneken, J., Planting a, M. J., Lubbers, M. & Chan, Y. L. The common and the distinctive features of the bulged-G motif based on a 1.04 Å resolution RNA structure. Nucleic Acids Res. 31, 6806-6818 (2003)) (FIG. 7B). The G62U63 pair is further stabilized through hydrogen bonding to the backbone of the opposite loop.


The interaction between the two terminal loops is essential for ligand binding by the guanine riboswitch. Differences in the stability of the RNA in the absence and presence of guanine, as determined by in-line probing experiments, indicate that this element of the RNA tertiary structure forms independently of guanine or hypoxanthine. Replacement of the wild-type loops with stable UUCG tetraloops (Molinaro, M. & Tinoco, I. Jr Use of ultra stable UNCG tetraloop hairpins to fold RNA structures: thermodynamic and spectroscopic applications. Nucleic Acids Res. 23, 3056-3063 (1995); FIG. 11), which eliminates the tertiary interaction, abolishes the ability of the riboswitch to recognize hypoxanthine; this shows that it is crucial for promoting a high-affinity interaction (FIG. 8). Thus, although this tertiary interaction does not contact the ligand directly, it is significant in globally organizing the riboswitch for purine recognition. Similarly, natural sequences of various hammerhead ribozymes contain loop-loop interactions that significantly accelerate their rate of cleavage under physiological conditions (De la Pena, M., Gago, S. & Flores, R. Peripheral regions of natural hammerhead ribozymes greatly increase their self-cleavage activity. EMBO J. 22, 5561-5570 (2003); Khvorova, A., Lescoute, A., Westhof, E. & Jayasena, S. D. Sequence elements outside the hammerhead ribozyme catalytic core enable intracellular activity. Nature Struct. Biol. 10, 708-712 (2003)). In each, the tertiary interaction constrains the RNA in a way that allows the three-way junction containing the functional site to respond to physiological concentrations of ligand or Mg2+ ions.


In high Mg2+ ion concentrations (20 mM MgCl2), the guanine riboswitch has a very high affinity for guanine and hypoxanthine (an observed dissociation constant (Kd) of 5 nM and 50 nM, respectively). In Escherichia coli, however, repression of transcription of the xpt-pbuX operon by the purR repressor occurs in response to 1-10 μM concentrations of purine. To determine whether the riboswitch responds to similar concentrations of purine, which probably reflect physiological levels, its affinity for hypoxanthine was determined by isothermal titration calorimetry at varying ionic conditions (Table 1). At a more physiological ionic strength (0.25-1 mM Mg2+), the RNA showed an affinity for hypoxanthine (observed Kd=3-4 μM) similar to that of the purR repressor protein (observed Kd=9 μM for the E. coli variant) (Choi, K. Y. & Zalkin, H. Structural characterization and corepressor binding of the Escherichia coli purine repressor. J. Bacteriol. 174, 6207-6214 (1992)). Thus, both RNA- and protein-based regulatory mechanisms seem to be tuned to respond to similar concentrations of intracellular metabolite.









TABLE 1







Thermodynamic parameters for hypoxanthine binding at 30° C.










MgCl2





(mM)
Kd (μM)*
ΔHobs (kcal mol−1)*
ΔSobs (cal mol−1 K−1)





20
0.732 ± 0.034
−33.5 ± 0.43
−82


 5.0
 1.34 ± 0.094
−28.4 ± 0.48
−67


 1.0
2.99 ± 0.19
−35.4 ± 0.93
−91


 0.25
4.00 ± 0.19
−41.9 ± 0.39
−113 


 0†
ND
ND
ND





*The reported errors represent the s.e.m. of the nonlinear least squares fit to the data. ND, no detectable binding.


†This reaction contained 2 mM Na2-EDTA.






The structure further indicates how RNA directly transduces intracellular metabolite concentration into changes in gene expression through a proposed Rho-independent transcriptional regulation mechanism (Mandal, M., Boese, B., Barrick, J. E., Winkler, W. C. & Breaker, R. R. Riboswitches control fundamental biochemical pathways in Bacillus subtilis and other bacteria. Cell 113, 577-586 (2003); Johansen, L. E., Nygaard, P., Lassen, C., Agerso, Y. & Saxild, H. H. Definition of a second Bacillus subtilis pur regulon comprising the pur and xpt-pbuX operons plus pbuG, nupG (yxjA), and pbuE (ydhL). J. Bacteriol. 185, 5200-5209 (2003)). Transcription of the initial 90 nucleotides results in the formation of the guanine-binding domain, with the L2 and L3 loops interacting to begin to form the tertiary structure of the RNA (FIG. 5B). This partially organizes the three-way junction motif for efficient ligand binding, although the junction must be unstructured to some degree to allow access of the purine to the binding pocket. At sufficiently high concentrations of guanine or hypoxanthine, the nucleobase binds the pocket, stabilizing the short P1 helix through stacking interactions and base triples with J2/3 and preventing incorporation of P1 nucleotides into an antiterminator element. The mRNA can then form a classic Rho-independent terminator stem-loop, and transcription stops. By contrast, in low intracellular concentrations of guanine or hypoxanthine, the 3′ side of the isolated P1 helix is readily conscripted to form a stable antiterminator element, facilitating continued transcription. Thus, hypoxanthine is the keystone for the riboswitch, enabling the riboswitch to direct mRNA folding along two different pathways through its ability to stabilize one conformation over another, resulting in an effective biosensor of intracellular guanine, hypoxanthine and xanthine concentrations.


1. Methods


Crystals: RNA was synthesized and purified by a native affinity-tag purification method (Kieft, J. S. & Batey, R. T. A general method for rapid and nondenaturing purification of RNAs. RNA 10, 988-995 (2004)) and exchanged it into a buffer containing 10 mM K+-HEPES (pH 7.5) and 1 mM hypoxanthine. Crystals were grown by mixing this solution in a 1:1 ratio with mother liquor (containing 25% PEG 3,000 (w/v), 200 mM ammonium acetate and 10 mM cobalt hexamine) and incubating it for 2-3 weeks at room temperature.


Data collection and processing: A single wavelength anomalous diffraction experiment was carried out at the CuKα wavelength on crystals cryoprotected in mother liquor plus 25% 2-methyl-2,4-pentanediol; clear diffraction was observed to at least 1.8 Å resolution. The data was indexed, integrated, and scaled using D*TREK (Pflugrath, J. W. The finer things in X-ray diffraction data collection. Acta Crystallogr. D 55, 1718-1725 (1999)), identified heavy atom sites with SOLVE (Terwilliger, T. SOLVE and RESOLVE: automated structure solution, density modification and model building. J. Synchrotron Radiat. 11, 49-52 (2004)), and carried out refinement with CNS (Brunger, A. T. et al. Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D 54, 905-921 (1998)) to obtain a model with final Rxtal and Rfree values of 17.8% and 22.8%, respectively. The model contains all RNA atoms except A82, for which electron density was not observed, along with the hypoxanthine ligand.


RNA synthesis and purification: A 68 nucleotide construct containing the sequence for the guanine riboswitch of the pbuX-xpt operon of B. subtilis was constructed using overlapping DNA oligonucleotides (Integrated DNA Technologies) and standard PCR methods. The resulting DNA fragment, which contained EcoRI and NgoMIV restriction sites at the 5′ and 3′ ends, respectively, was ligated into pRAV12 (Kieft, J. S. & Batey, R. T. A general method for rapid and nondenaturing purification of RNAs. RNA 10, 988-995 (2004)), a plasmid vector designed for the native purification of RNA; the resulting vector was sequence verified. DNA template for in vitro transcription was generated by PCR from the resulting vector using primers directed against the T7 RNA polymerase promoter at the 5′ end and the 3′ side of the purification affinity tag (5′, GCGCGCGAATTCTAATACGACTCACTATAG (SEQ ID NO:10; 3′, GGATCCTGCCCAGGGCTG; SEQ ID NO:11). RNA was transcribed in a 12.5 mL reaction containing 30 mM Tris-HCl (pH 8.0), 10 mM DTT, 0.1% Triton X-100, 0.1 mM spermidine-HCl, 8 mM each NTP, 40 mM MgCl2, 50 μg/mL T7 RNA polymerase, and 1 mL of ˜0.5 μM template (Doudna, J. A. Preparation of homogeneous ribozyme RNA for crystallization. Methods Mol Biol 74, 365-70 (1997)), supplemented with 1 unit/mL inorganic pyrophosphatase to suppress formation of insoluble magnesium pyrophosphate. The reaction was incubated for 1.5 hr at 37° C. Native purification of the RNA was performed as described. Following elution of the RNA from the affinity column, it was immediately concentrated using a 10,000 MWCO centrifugal filter device (Amicon, Ultra-15). After all of the RNA had been concentrated to 500 μL, it was exchanged against three 15 mL aliquots of a buffer containing 10 mM K+-HEPES, pH 7.5 and 1 mM hypoxanthine using the centrifugal concentrator. The final RNA was concentrated to 450 μM, as judged by it absorbance at 260 nm and a calculated extinction coefficient based upon its base composition.


RNA crystallization: Crystals of the guanine riboswitch were obtained using the hanging drop method in which the RNA solution was mixed in a 1:1 ratio with a reservoir solution containing 10 mM cobalt hexamine, 200 mM ammonium acetate and 25% PEG 2K. The crystallization trays were incubated at room temperature (23° C.), with crystals obtaining their maximum size (0.05×0.05 x 0.2 mm) in 7-14 days. Cryoprotection of the crystals was performed by adding 30 μL of a solution comprising the mother liquor plus 25% (v/v) 2-methyl-2,4 pentanediol (MPD) for five minutes and flash-frozen in liquid nitrogen. Diffraction data was collected on a home X-ray source (Rigaku MSC) using CuKα radiation; collection of anomalous data was achieved by an inverse-beam experiment. The data was indexed, integrated and scaled with CrystalClear (Rigaku MSC) and D*TREK (Pflugrath, J. W. The finer things in X-ray diffraction data collection. Acta Crystallogr D Biol Crystallogr 55, 1718-1725 (1999)). The crystals belong to the C2 spacegroup (a=132.30 Å, b=35.25 Å, c=42.23 Å, d=90.95°) and contain one molecule per asymmetric unit (refer to Table 2 for all crystallographic statistics). All data used in subsequent phasing and refinement was collected from one individual crystal.


Phasing and structure determination. Phases were determined using a single wavelength anomalous diffraction (SAD) experiment (Dauter, Z., Dauter, M. & Dodson, E. Jolly SAD. Acta Crystallogr D Biol Crystallogr 58, 494-506 (2002); Rice, L. M., Earnest, T. N. & Brunger, A. T. Single-wavelength anomalous diffraction phasing revisited. Acta Crystallogr D Biol Crystallogr 56 (Pt 11), 1413-20 (2000)) and diffraction data extending to 1.95 Å resolution. In this experiment, cobalt was treated as the heavy atom derivative, which has weak anomalous signal at CuKα wavelength (f′=−2.464, f″=3.608). With SOLVE (Terwilliger, T. SOLVE and RESOLVE: automated structure solution, density modification and model building. J Synchrotron Radiat 11, 49-52 (2004)), a 10 heavy atom solution was found, with a figure of merit of 0.38 and a score of 40.9. Phases were determined using this heavy atom model and its mirror image using CNS (Brunger, A. T. et al. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr D Biol Crystallogr 54 (Pt 5), 905-21 (1998)) and improved with density modification. Only one of the heavy atom models yielded an electron density map in which there was clear backbone connectivity and base stacking; this map was sufficiently clear to be able to trace the majority of the RNA. Iterative rounds of model building in 0 (Jones, T. A., Zou, J. Y., Cowan, S. W. & Kjeldgaard. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr A 47 (Pt 2), 110-9 (1991)) and refinement in CNS were performed while monitoring Rfree to ensure that it improved after each round. Following building of all nucleotides in the RNA except for the 3′-terminal A82, two rounds of water picking were performed with a total of 332 water molecules placed into the model, using the restrictions that each solvent must be in hydrogen bonding distance to another atom and have a B-factor of less than 80. During this phase of model building, 12 cobalt hexamine ions were identified on the basis of having inner-sphere atoms with clear octahedral coordination geometry and their positions verified by an anomalous difference map; a single spermidine and an acetate ion were also placed within the model at this point. Sugar puckers were constrained to be C3′-endo, except for residues 22, 34, 35, 47, 49 and 62 which were restrained as C2′-endo.


Isothermal Titration Calorimetry. RNA for isothermal titration calorimetry (ITC) (Leavitt, S. & Freire, E. Direct measurement of protein binding energetics by isothermal titration calorimetry. Curr Opin Struct Biol 11, 560-6 (2001); Pierce, M., Raman, C. S. & Nall, B. T. Isothermal titration calorimetry of protein-protein interactions. Methods 19, 213-21 (1999)) was transcribed and purified as described above and exhaustively dialyzed against buffer containing 10 mM K+-HEPES, pH 7.5, 100 mM KCl and varying concentrations of MgCl2 at 4° C. for 24-48 hours. Following dialysis, the buffer was used to prepare a solution of hypoxanthine at a concentration that was approximately 10-fold higher than the RNA (typically about 120 □M and 12 □M, respectively). All experiments were performed with a Microcal MCS ITC instrument at 30° C. Following degassing of the RNA and hypoxanthine solutions, a titration of 29 injections of 10 μL of hypoxanthine into the RNA sample was performed, such that a final molar ratio of between 2:1 and 3:1 hypoxanthine:RNA was achieved (Recht, M. I. & Williamson, J. R. Central domain assembly: thermodynamics and kinetics of S6 and S18 binding to an S15-RNA complex. J Mol Biol 313, 35-48 (2001)). Titration data was analyzed using Origin ITC software (Microcal Software Inc.) and fit to a single-site binding model.









TABLE 2





Crystallography statistics
















Data Collection



Spacegroup:
C2


a, b, c
132.30, 35.25, 42.23 Å


β
90.95°


Resolutiona:
20-1.95 Å (2.02-1.95 Å)


Wavelength:
1.5418


% Completeness:
92.9% (85.3%)


Measured reflections:
76,950


Unique reflections:
25,789


Average redundancy:
2.9 (2.45)


I/σ:
21.5 (6.3)


Rsymb:
3.7% (13.5%)


Phasing


Phasing Powerc:
1.62 (0.95)


Rcullisd:
0.67


Figure of Merit


SOLVE:
0.38


CNS, after dens. mod.:
0.86


Refinement


Resolution:
20-1.95 Å (2.02-1.95 Å)


Number of reflections:


Working:
23,356 (84.1%)


Test set:
2,430 (8.8%)


Rxtale:
17.8 (24.8)


Rfree:
22.8 (31.8)


r.m.s.d. bonds:
0.0095 Å


r.m.s.d. angles:
1.70°


cross-validated Luzzati coordinate error:
0.25 Å


cross-validated Sigma-a coordinate error:
0.23 Å


Average B-factor:
22.4 Å2






aNumbers in parenthesis correspond to the highest resolution shell.




bRmerge = Σ|I − <I>|/ΣI, where I is the observed intensity and <I> is the average intensity of multiple measurements of symmetry related reflections.




cPhasing power = <|FH|>/<||FP + FH| − |FPH||> reported for all reflections.




dRcullis = Σ||FPH ± FP| − FH(calc)|/Σ|FPH| reported for all reflections.




eRxtal = Σ|Fo − |Fc||/Σ|Fo|, Rxtal from the working set and Rfree from the test set.







B. Example 2
Use of Riboswitch Analogs as Antimicrobials

Atomic-resolution models were generated for both a guanine riboswitch aptamer and a related aptamer for adenine (Serganov A, Yuan Y R, Pikovskaya O, Polonskaia A, Malinina L, Phan A T, Hobartner C, Micura R, Breaker R R, Patel D J. (2004) Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mRNAs. Chem. Biol. 11:1729-1741). These models are essentially identical to that proposed by Batey et al. (Batey, R. T., Gilbert, S. D., Montange, R. K. (2004) Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine. 18:432:411-415.), and therefore shows that the structural model used to design the compounds depicted in FIG. 12B is accurate. Furthermore, the similarity between the guanine and adenine riboswitch aptamers makes it clear that the same approaches used to design guanine analogs can be used to inhibit the adenine riboswitch by similarly-designed adenine analogs.


The ability of the guanine analogs depicted in FIG. 12B to bind the guanine riboswitch in vitro has been tested, as well as their antibacterial activity toward Bacillus subtilis. Of these analogs, six inhibit B. subtilis growth in media lacking guanine (Table 3). One of these, compound G-014, has been tested for inhibitory activity toward B. subtilis in rich media and toward several clinically-relevant pathogens (Table 4). Importantly, G-014 inhibits most of these pathogens, including Bacillus anthracis and Methicillin-resistant Staphylococcus aureus with similar efficacy as vancomycin.


To more fully probe the antibacterial mechanism of these guanine analogs, two gene reporter systems were developed. In one system, a guanine riboswitch has been fused to Green Fluorescent Protein (GFP), such that high concentrations of guanine or a riboswitch-binding guanine analog inhibit the expression of GFP (FIG. 14). Thus, a decreased level of GFP fluorescence indicates binding to and modulation of the guanine riboswitch inside bacteria. A similar system has been constructed with a guanine riboswitch upstream of β-galactosidase. Again, a decreased level of β-galactosidase activity indicates that a compound represses the guanine riboswitch in vivo. Table 3 summarizes the result of these assays for each of the guanine analogs. A correlation between compounds that kill cells and those that control gene expression has also been established. Similar systems can be used involving any suitable reporter gene, such as genes that encode a suitable reporter protein or a suitable reporter RNA (such as a ribozyme).


It has also been determined that G-014 is bactericidal against Bacillus subtilis. In a direct comparison, G-014 reduces the number of viable colony forming units (“CFU”) at a rate equal to carbenicillin (FIG. 15).









TABLE 3







The activity of each guanine analog1.













Kd for binding to



β-gal reporter



the riboswitch in
MIC B. subtilis
MIC B. subtills
GFP reporter
activity


Compound
vitro (nM)
(μM, minimal)
(μM, rich)
activity
(relative)2















G-001
>100,000
>100
ND
ND
1.0


G-002
<1
>100
ND
May repress
0.73


G-003
<1
>100
ND
No repression
0.49



G-004


<1


500


ND


Represses


0.62



G-005
~10
>100
ND
No repression
0.99
























G-008
<1
>100
ND
ND
ND


G-009
<1
>100
ND
ND
ND


G-010
<1
>100
ND
ND
0.97


G-011
<1
>100
ND
ND
0.78


G-012
<1
>100
ND
ND
ND


G-013
<1
>100
ND
may repress
1.0



G-014


<1


6.8 ± 0.9


62.5


Can't detect


Represses



G-015
<1
>100
ND
ND
ND



G-016


<1


inhibits


ND


Represses


0.65



G-017
<1
>100
ND
ND
ND


G-018
<1
>100
ND
ND
ND


G-019
<1
>100
ND
ND
ND


G-020
<1
>100
ND
ND
ND


G-021
<1
>100
ND
ND
ND


G-022
ND
>100
ND
ND
ND



G-023


ND


inhibits


ND


ND


ND



G-024
ND
>100
ND
ND
ND



G-025


ND


inhibits


ND


ND


0.25



G-026
ND
>100
ND
ND
ND






1IND indicates that this value has not yet been determined. Compounds in bold type inhibit bacterial growth.




2Relative β-gal reporter activity indicates Miller Units with the compound divided by Miller Units without the compound. 1.0 = no repression, 0 = complete repression.














TABLE 4







The antibacterial activity of G-014 toward clinically relevant pathogens.










Minimal




inhibitory concentrations (μg/mL)














Compound
SA1
MRSA
EF
BA
SP
HI2
MS2

















G-014
8
4
32
4
2
64
256


Vancomycin
1
1
2
1.5
0.25
64
24






1SA, Staphylococcus aureus; MRSA, methicillin-resistant Staphylococcus aureus; EF, Enterococcus faecalis; SP, Streptococcus pneumoniae; HI, Haemophilus influenza; MS, Mycobacterium smegmatis.




2These pathogens do not have a guanine riboswitch.














TABLE 5





Purine Riboswitches

















# STOCKHOLM 1.0




# = GS NC_003869.1/586365-586467
organism Thermoanaerobacter tengcongensis MB4





# = GS NC_002570.2/1593074-1592972
organism Bacillus halodurans C-125





# = GS NC_002570.2/676475-676577
organism Bacillus halodurans C-125





# = GS NC_000964.2/4004541-4004643
organism Bacillus subtilis subsp. subtilis str. 168





# = GS NC_003366.1/513088-512986
organism Clostridium perfringens str. 13





# = GS NC_000964.2/693775-693877
organism Bacillus subtilis subsp. subtilis str. 168





# = GS NC_002570.2/650309-650411
organism Bacillus halodurans C-125





# = GS NC_006270.2/2294931-2294831
organism Bacillus licheniformis ATCC 14580





# = GS NC_006322.1/2295788-2295688
organism Bacillus licheniformis ATCC 14580





# = GS NC_004193.1/1103943-1104045
organism Oceanobacillus iheyensis HTE831





# = GS NC_003909.8/1650407-1650509
organism Bacillus cereus ATCC 10987





# = GS NC_003997.3/1497571-1497673
organism Bacillus anthracis str. Ames





# = GS NC_005945.1/1497645-1497747
organism Bacillus anthracis str. Sterne





# = GS NC_005957.1/1521880-1521982
organism Bacillus thuringiensis serovar konkukian str. 97-27





# = GS NC_006274.1/1532482-1532584
organism Bacillus cereus ZK





# = GS NC_007530.2/1497694-1497796
organism Bacillus anthracis str. ‘Ames Ancestor’





# = GS NZ_AAAC02000001.1/1985987-1986089
organism Bacillus anthracis str. A2012





# = GS NZ_AAEK01000008.1/96466-96364
organism Bacillus cereus G9241





# = GS NZ_AAEN01000011.1/67143-67245
organism Bacillus anthracis str. CNEVA-9066





# = GS NZ_AAEO01000025.1/66855-66957
organism Bacillus anthracis str. A1055





# = GS NZ_AAEP01000035.1/69365-69467
organism Bacillus anthracis str. Vollum





# = GS NZ_AAEQ01000029.1/70848-70950
organism Bacillus anthracis str. Kruger B





# = GS NZ_AAERO1000023.1/58182-58284
organism Bacillus anthracis str. Western North America USA6153





# = GS NZ_AAES01000034.1/69121-69223
organism Bacillus anthracis str. Australia 94





# = GS NC_004722.1/1515239-1515341
organism Bacillus cereus ATCC 14579





# = GS NC_000964.2/697711-697813
organism Bacillus subtilis subsp. subtilis str. 168





# = GS NC_002976.3/54816-54919
organism Staphylococcus epidermidis RP62A





# = GS NC_004461.1/2433047-2432944
organism Staphylococcus epidermidis ATCC 12228





# = GS NC_003030.1/2824953-2824855
organism Clostridium acetobutylicum ATCC 824





# = GS NC_003909.8/296465-296567
organism Bacillus cereus ATCC 10987





# = GS NC_003997.3/262601-262703
organism Bacillus anthracis str. Ames





# = GS NC_004722.1/261558-261660
organism Bacillus cereus ATCC 14579





# = GS NC_005945.1/262614-262716
organism Bacillus anthracis str. Sterne





# = GS NC_005957.1/268537-268639
organism Bacillus thuringiensis serovar konkukian str. 97-27





# = GS NC_006274.1/267835-267937
organism Bacillus cereus ZK





# = GS NC_007530.2/262601-262703
organism Bacillus anthracis str. ‘Ames Ancestor’





# = GS NZ_AAAC02000001.1/796036-796138
organism Bacillus anthracis str. A2012





# = GS NZ_AAEK01000017.1/88397-88499
organism Bacillus cereus G9241





# = GS NZ_AAEN01000023.1/16980-17082
organism Bacillus anthracis str. CNEVA-9066





# = GS NZ_AAE001000030.1/17090-17192
organism Bacillus anthracis str. A1055





# = GS NZ_AAEP01000043.1/12962-13064
organism Bacillus anthracis str. Vollum





# = GS NZ_AAEQ01000040.1/4962-4860
organism Bacillus anthracis str. Kruger B





# = GS NZ_AAER01000030.1/4374-4272
organism Bacillus anthracis str. Western North America USA6153





# = GS NZ_AABS01000040.1/14760-14862
organism Bacillus anthracis str. Australia 94





# = GS NC_006270.2/693198-693300
organism Bacillus licheniformis ATCC 14580





# = GS NC_006322.1/692981-693083
organism Bacillus licheniformis ATCC 14580





# = GS NC_004722.1/259601-259703
organism Bacillus cereus ATCC 14579





# = GS NC_000964.2/2319410-2319310
organism Bacillus subtilis subsp. subtilis str. 168





# = GS NC_003030.1/2905050-2904948
organism Clostridium acetobutylicum ATCC 824





# = GS NC_003210.1/611119-611017
organism Listeria monocytogenes EGD-e





# = GS NZ_AADQ01000001.1/2431-2533
organism Listeria monocytogenes str. 1/2a F6854





# = GS NC_003212.1/610156-610054
organism Listeria innocua Clip11262





# = GS NC_003366.1/422820-422922
organism Clostridium perfringens str. 13





# = GS NC_002973.5/617846-617744
organism Listeria monocytogenes str. 4b F2365





# = GS NZ_AADR01000111.1/1598-1496
organism Listeria monocytogenes str. 4b H7858





# = GS NC_003909.8/294509-294611
organism Bacillus cereus ATCC 10987





# = GS NC_005957.1/266577-266679
organism Bacillus thuringiensis serovar konkukian str. 97-27





# = GS NC_006274.1/265875-265977
organism Bacillus cereus ZK





# = GS NC_003366.1/2618421-2618323
organism Clostridium perfringens str. 13





# = GS NC_002973.5/1940027-1939926
organism Listeria monocytogenes str. 4b F2365





# = GS NC_003212.1/2013345-2013244
organism Listeria innocua Clip11262





# = GS NZ_AADQ01000095.1/1690-1791
organism Listeria monocytogenes str. 1/2a F6854





# = GS NZ_AADR01000082.1/3415-3516
organism Listeria monocytogenes str. 4b H7858





# = GS NC_003366.1/2871201-2871101
organism Clostridium perfringens str. 13





# = GS NC_002745.2/430797-430900
organism Staphylococcus aureus subsp. aureus N315





# = GS NC_002758.2/430754-430857
organism Staphylococcus aureus subsp. aureus Mu50





# = GS NC_002951.2/460059-460162
organism Staphylococcus aureus subsp. aureus COL





# = GS NC_002952.2/441050-441153
organism Staphylococcus aureus subsp. aureus MRSA252





# = GS NC_002953.3/409191-409294
organism Staphylococcus aureus subsp. aureus MS5A476





# = GS NC_003923.1/410546-410649
organism Staphylococcus aureus subsp. aureus MW2





# = GS NC_006582.1/1115665-1115767
organism Bacillus clausii KSM-K16





# = GS NC_006510.1/274257-274359
organism Geobacillus kaustophilus HTA426





# = GS NC_003210.1/1958922-4958821
organism Listeria monocytogenes EGD-e





# = GS NC_006270.2/697054-697156
organism Bacillus licheniformis ATCC 14580





# = GS NC_006322.1/696838-696940
organism Bacillus licheniformis ATCC 14580





# = GS NC_006510.1/282580-282682
organism Geobacillus kaustophilus HTA426





# = GS NC_003997.3/260641-260743
organism Bacillus anthracis str. Ames





# = GS NC_005945.1/260654-260756
organism Bacillus anthracis str. Sterne





# = GS NC_007530.2/260641-260743
organism Bacillus anthracis str. ‘Ames Ancestor’





# = GS NZ_AAEK01000017.1/86437-86539
organism Bacillus cereus G9241





# = GS NZ_AAEN01000023.1/15020-15122
organism Bacillus anthracis str. CNEVA-9066





# = GS NZ_AAEO01000030.1/15130-15232
organism Bacillus anthracis str. A1055





# = GS NZ_AAEP01000043.1/11002-11104
organism Bacillus anthracis str. Vollum





# = GS NZ_AAEQ01000040.1/6922-6820
organism Bacillus anthracis str. Kruger B





# = GS NZ_AAER01000030.1/6334-6232
organism Bacillus anthracis str. Western North America USA6153





# = GS NZ_AAES01000040.1/12800-12902
organism Bacillus anthracis str. Australia 94





# = GS NC_004193.1/760473-760575
organism Oceanobacillus iheyensis HTE831





# = GS NC_006270.2/4024210-4024312
organism Bacillus licheniformis ATCC 14580





# = GS NC_006322.1/4024324-4024426
organism Bacillus licheniformis ATCC 14580





# = GS NC_004193.1/786767-786868
organism Oceanobacillus iheyensis HTE831





# = GS NC_002662.1/1159509-1159607
organism Lactococcus lactis subsp. lactis Il1403





# = GS NC_000964.2/625993-625893
organism Bacillus subtilis subsp. subtilis str. 168





# = GS NC_003909.8/382630-382528
organism Bacillus cereus ATCC 10987





# = GS NC_003997.3/342356-342254
organism Bacillus anthracis str. Ames





# = GS NC_005945.1/342369-342267
organism Bacillus anthracis str. Sterne





# = GS NC_005957.1/356354-356252
organism Bacillus thuringiensis serovar konkukian str. 97-27





# = GS NC_006274.1/357462-357360
organism Bacillus cereus ZK





# = GS NC_007530.2/342356-342254
organism Bacillus anthracis str. ‘Ames Ancestor’





# = GS NZ_AAAC02000001.1/859268-859166
organism Bacillus anthracis str. A2012





# = GS NZ_AABK01000051.1/8150-8252
organism Bacillus cereus G9241





# = GS NZ_AAEN01000023.1/83441-83339
organism Bacillus anthracis str. CNEVA-9066





# = GS NZ_AAEO01000030.1/96886-96784
organism Bacillus anthracis str. A1055





# = GS NZ_AAEP01000046.1/48810-48708
organism Bacillus anthracis str. Vollum





# = GS NZ_AAEQ01000034.1/49483-49381
organism Bacillus anthracis str. Kruger B





# = GS NZ_AAER01000042.1/191339-191441
organism Bacillus anthracis str. Western North America USA6153





# = GS NZ_AAES01000043.1/48479-48377
organism Bacillus anthracis str. Australia 94





# = GS NZ_AAAC02000001.1/794076-794178
organism Bacillus anthracis str. A2012





# = GS NC_003030.1/1002176-1002275
organism Clostridium acetobutylicum ATCC 824





# = GS NC_006582.1/1554717-1554819
organism Bacillus clausii KSM-K16





# = GS NC_003909.8/336194-336296
organism Bacillus cereus ATCC 10987





# = GS NC_003997.3/295331-295433
organism Bacillus anthracis str. Ames





# = GS NC_005945.1/295344-295446
organism Bacillus anthracis str. Sterne





# = GS NC_005957.1/309524-309626
organism Bacillus thuringiensis serovar konkukian str. 97-27





# = GS NC_006274.1/309094-309196
organism Bacillus cereus ZK





# = GS NC_007530.2/295331-295433
organism Bacillus anthracis str. ‘Ames Ancestor’





# = GS NZ_AAAC02000001.1/812243-812345
organism Bacillus anthracis str. A2012





# = GS NZ_AAEK01000064.1/23153-23051
organism Bacillus cereus G9241





# = GS NZ_AAEN01000023.1/36414-36516
organism Bacillus anthracis str. CNEVA-9066





# = GS NZ_AAEO01000030.1/49824-49926
organism Bacillus anthracis str. A1055





# = GS NZ_AAEP01000046.1/1785-1887
organism Bacillus anthracis str. Vollum





# = GS NZ_AAEQ01000034.1/2457-2559
organism Bacillus anthracis str. Kruger B





# = GS NZ_AAER01000042.1/238364-238262
organism Bacillus anthracis str. Western North America USA6153





# = GS NZ_AAES01000043.1/1489-1591
organism Bacillus anthracis str. Australia 94





# = GS NC_004193.1/769686-769787
organism Oceanobacillus iheyensis HTE831





# = GS NC_004722.1/343847-343745
organism Bacillus cereus ATCC 14579





# = GS NZ_AAAW03000042.1/15038-14936
organism Desulfitobacterium hafniense DCB-2





# = GS NZ_AADT03000005.1/34629-34731
organism Moorella thermoacetica ATCC 39073





# = GS NC_002570.2/648442-648544
organism Bacillus halodurans C-125





# = GS NC_004722.1/298774-298876
organism Bacillus cereus ATCC 14579





# = GS NC_006510.1/272473-272575
organism Geobacillus kaustophilus HTA426





# = GS NZ_AADW02000004.1/225764-225662
organism Exiguobacterium sp. 255-15





# = GS NC_006274.1/3685852-3685750
organism Bacillus cereus ZK





# = GS NZ_AAGO01000011.1/2345-2247
organism Lactococcus lactis subsp. cremoris SK11





# = GS NZ_AAAK03000113.1/5724-5822
organism Enterococcus faecium DO





# = GS NZ_AADW02000005.1/186378-186480
organism Exiguobacterium sp. 255-15





# = GS NC_003909.8/3578068-3577966
organism Bacillus cereus ATCC 10987





# = GS NC_003997.3/3605298-3605196
organism Bacillus anthracis str. Ames





# = GS NC_004722.1/3766254-3766152
organism Bacillus cereus ATCC 14579





# = GS NC_005945.1/3605993-3605891
organism Bacillus anthracis str. Sterne





# = GS NC_005957.1/3626969-3626867
organism Bacillus thuringiensis serovar konkukian str. 97-27





# = GS NC_007530.2/3605425-3605323
organism Bacillus anthracis str. ‘Ames Ancestor’





# = GS NZ_AAAC02000001.1/4059572-4059470
organism Bacillus anthracis str. A2012





# = GS NZ_AAEN01000012.1/139994-140096
organism Bacillus anthracis str. CNEVA-9066





# = GS NZ_AAEO01000020.1/51249-51351
organism Bacillus anthracis str. A1055





# = GS NZ_AAEP01000042.1/12489-12387
organism Bacillus anthracis str. Vollum





# = GS NZ_AAEQ01000019.1/51159-51261
organism Bacillus anthracis str. Kruger B





# = GS NZ_AAER01000035.1/118314-118212
organism Bacillus anthracis str. Western North America U9A6153





# = GS NZ_AAES01000032.1/51320-51422
organism Bacillus anthracis str. Australia 94





# = GS NC_006371.1/1538896-1538796
organism Photobacterium profundum SS9





# = GS NC_004460.1/504371-504471
organism Vibrio vulnificus CMCP6





# = GS NC_005140.1/1130553-1130653
organism Vibrio vulnificus YJ016





# = GS NC_004116.1/1094305-1094209
organism Streptococcus agalactiae 2603V/R





# = GS NC_004368.1/1163490-1163394
organism Streptococcus agalactiae NEM316





# = GS NZ_AADT03000005.1/42245-42347
organism Moorella thermoacetica ATCC 39073





# = GS NC_004557.1/2551392-2551293
organism Clostridium tetani E88





# = GS NC_003028.1/1754786-1754882
organism Streptococcus pneumoniae TIGR4





# = GS NC_003098.1/1634825-1634921
organism Streptococcus pneumoniae R6





# = GS NZ_AAGY01000085.1/5640-5736
organism Streptococcus pneumoniae TIGR4





# = GS NZ_AAEK01000001.1/183902-184006
organism Bacillus cereus G9241





# = GS NC_003454.1/1645820-1645721
organism Fusobacterium nucleatum subsp. nucleatum ATCC 25586





# = GS NZ_AADW02000027.1/2980-2880
organism Exiguobacterium sp. 255-15





# = GS NZ_AADW02000005.1/179959-180061
organism Exiguobacterium sp. 255-15





# = GS NC_006582.1/1039390-1039492
organism Bacillus clausii KSM-K16





# = GS NC_002737.1/930749-930845
organism Streptococcus pyogenes M1 GAS





# = GS NC_003485.1/910599-910695
organism Streptococcus pyogenes MGAS8232





# = GS NC_004070.1/846557-846653
organism Streptococcus pyogenes MGAS315





# = GS NC_004606.1/977077-977173
organism Streptococcus pyogenes SSI-1





# = GS NC_006086.1/857664-857760
organism Streptococcus pyogenes MGAS10394





# = GS NZ_AAFV01000199.1/507-411
organism Streptococcus pyogenes M49 591





# = GS NC_004605.1/1369721-1369821
organism Vibrio parahaemolyticus RIMD 2210633





# = GS NZ_AAAW03000004.1/66862-66959
organism Desulfitobacterium hafniense DCB-2





# = GS NC_004567.1/2968830-2968731
organism Lactobacillus plantarum WCFS1





# = GS NZ_AAEV01000003.1/13765-13667
organism Pediococcus pentosaceus ATCC 25745





# = GS NC_002570.2/806873-806971
organism Bacillus halodurans C-125





# = GS NZ AAEK01000052.1/27552-27452
organism Bacillus cereus G9241





# = GS NC_006814.1/237722-237626
organism Lactobacillus acidophilus NCFM





# = GS NC_005363.1/3414604-3414703
organism Bdellovibrio bacteriovorus HD100





# = GS NZ_AADT03000021.1/9410-9513
organism Moorella thermoacetica ATCC 39073





# = GS NC_004668.1/2288426-2288328
organism Enterococcus faecalis V583





# = GS NC_006449.1/1182949-1183044
organism Streptococcus thermophilus CNRZ1066





# = GS NZ_AAGS01000026.1/9921-10016
organism Streptococcus thermophilus LMD-9





# = GS NZ_AAGQ01000089.1/170-269
organism Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-



365





# = GS NZ_AABF02000293.1/1-98
organism Fusobacterium nucleatum subsp. vincentii ATCC 49256





# = GS NC_006448.1/1185082-1185177
organism Streptococcus thermophilus LMG 18311





# = GS NC_004567.1/2410478-2410577
organism Lactobacillus plantarum WCFS1





# = GS NZ_AABJ03000010.1/47704-47802
organism Oenococcus oeni PSU-1





# = GS NZ_AADT03000002.1/89184-89083
organism Moorella thermoacetica ATCC 39073





# = GS NC_006814.1/1971978-1972076
organism Lactobacillus acidophilus NCFM





# = GS NZ_AAA002000016.1/1579-1480
organism Lactobacillus gasseri





# = GS NC_005362.1/1949387-1949485
organism Lactobacillus johnsonii NCC 533





# = GS NZ_AAA002000035.1/3299-3201
organism Lactobacillus gasseri





# = GS NC_005362.1/263146-263049
organism Lactobacillus johnsonii NCC 533





# = GS NC_005363.1/2004933-2004835
organism Bdellovibrio bacteriovorus HD100





# = GS NZ_AADT03000002.1/98293-98192
organism Moorella thermoacetica ATCC 39073





# = GS NZ_AABH02000036.1/17403-17500
organism Leuconostoc mesenteroides subsp. mesenteroides ATCC



8293





# = GS NC_006055.1/484139-484044
organism Mesoplasma florum L1





# = GS NZ_AABH02000272.1/211-308
organism Leuconostoc mesenteroides subsp. mesenteroides ATCC



8293





# = GS NZ_AABJ03000003.1/47905-47810
organism Oenococcus oeni PSU-1





# = GS NZ_AABH02000038.1/17559-17463
organism Leuconostoc mesenteroides subsp. mesenteroides ATCC



8293





# = GS NC_006055.1/397027-397123
organism Mesoplasma florum L1





# = GS NC_003869.1/586365-586467
taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacter-



iales; Thermoanaerobacteriaceae; Thermoanaerobacter;



tengcongensis





# = GS NC_002570.2/1593074-1592972
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; halodurans





# = GS NC_002570.2/676475-676577
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; halodurans





# = GS NC_000964.2/4004541-4004643
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; subtilis





# = GS NC_003366.1/513088-512986
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Clostridiaceae; Clostridium; perfringens





# = GS NC_000964.2/693775-693877
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; subtilis





# = GS NC_002570.2/650309-650411
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; halodurans





# = GS NC_006270.2/2294931-2294831
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; licheniformis





# = GS NC_006322.1/2295788-2295688
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; licheniformis





# = GS NC_004193.1/1103943-1104045
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Oceanobacillus; iheyensis





# = GS NC_003909.8/1650407-1650509
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_003997.3/1497571-1497673
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005945.1/1497645-1497747
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005957.1/1521880-1521982
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; thuringiensis





# = GS NC_006274.1/1532482-1532584
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_007530.2/1497694-1497796
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAAC02000001.1/1985987-1986089
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEK01000008.1/96466-96364
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NZ_AAEN01000011.1/67143-67245
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEO01000025.1/66855-66957
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEP01000035.1/69365-69467
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEQ01000029.1/70848-70950
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAER01000023.1/58182-58284
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAES01000034.1/69121-69223
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_004722.1/1515239-1515341
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_000964.2/697711-697813
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; subtilis





# = GS NC_002976.3/54816-54919
taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus;



epidermidis





# = GS NC_004461.1/2433047-2432944
taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus;



epidermidis





# = GS NC_003030.1/2824953-2824855
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Clostridiaceae; Clostridium; acetobutylicum





# = GS NC_003909.8/296465-296567
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_003997.3/262601-262703
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_004722.1/261558-261660
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_005945.1/262614-262716
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005957.1/268537-268639
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; thuringiensis





# = GS NC_006274.1/267835-267937
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_007530.2/262601-262703
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAAC02000001.1/796036-796138
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEK01000017.1/88397-88499
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NZ_AAEN01000023.1/16980-17082
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEO01000030.1/17090-17192
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEP01000043.1/12962-13064
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEQ01000040.1/4962-4860
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAER01000030.1/4374-4272
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAES01000040.1/14760-14862
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_006270.2/693198-693300
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; licheniformis





# = GS NC_006322.1/692981-693083
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; licheniformis





# = GS NC_004722.1/259601-259703
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_000964.2/23l9410-2319310
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; subtilis





# = GS NC_003030.1/2905050-2904948
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Clostridiaceae; Clostridium; acetobutylicum





# = GS NC_003210.1/611119-611017
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; monocytogenes





# = GS NZ_AADQ01000001.1/2431-2S33
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; monocytogenes





# = GS NC_003212.1/610156-610054
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; innocua





# = GS NC_003366.1/422820-422922
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Clostridiaceae; Clostridium; perfringens





# = GS NC_002973.5/617846-617744
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; monocytogenes





# = GS NZ_AADR01000111.1/1598-1496
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; monocytogenes





# = GS NC_003909.8/294509-294611
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = Gs NC_005957.1/266577-266679
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; thuringiensis





# = GS NC_006274.1/265875-265977
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_003366.1/2618421-2618323
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Clostridiaceae; Clostridium; perfringens





# = GS NC_002973.5/1940027-1939926
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; monocytogenes





# = GS NC_003212.1/2013345-2013244
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; innocua





# = GS NZ_AADQ01000095.1/1690-1791
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; monocytogenes





# = GS NZ_AADR01000082.1/3415-3516
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; monocytogenes





# = GS NC_003366.1/2871201-2871101
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Clostridiaceae; Clostridium; perfringens





# = GS NC_002745.2/430797-430900
taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus;



aureus





# = GS NC_002758.2/430754-430857
taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus;



aureus





# = GS NC_002951.2/460059-460162
taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus;



aureus





# = GS NC_002952.2/441050-441153
taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus;



aureus





# = GS NC_002953.3/409191-409294
taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus;



aureus





# = GS NC_003923.1/410546-410649
taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus;



aureus





# = GS NC_006582.1/1115665-1115767
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; clausii





# = GS NC_006510.1/274257-274359
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Geobacillus; kaustophilus





# = GS NC_003210.1/1958922-1958821
taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae;



Listeria; monocytogenes





# = GS NC_006270.2/697054-697156
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; licheniformis





# = GS NC_006322.1/696838-696940
taxonomy Bacteria; Firmidutes; Bacillales; Bacillaceae;



Bacillus; licheniformis





# = GS NC_006510.1/282580-282682
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Geobacillus; kaustophilus





# = GS NC_003997.3/260641-260743
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005945.1/260654-260756
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_007530.2/260641-260743
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEK01000017.1/86437-86539
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NZ_AAEN01000023.1/15020-15122
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEO01000030.1/15130-15232
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEP01000043.1/11002-11104
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEQ01000040.1/6922-6820
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAER01000030.1/6334-6232
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAES01000040.1/12800-12902
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_004193.1/760473-760575
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Oceanobacillus; iheyensis





# = GS NC_006270.2/4024210-4024312
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; licheniformis





# = GS NC_006322.1/4024324-4024426
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; licheniformis





# = GS NC_004193.1/786767-786868
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Oceanobacillus; iheyensis





# = GS NC_002662.1/1159509-1159607
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Lactococcus; lactis





# = GS NC_000964.2/625993-625893
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; subtilis





# = GS NC_003909.8/382630-382528
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_003997.3/342356-342254
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005945.1/342369-342267
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005957.1/356354-356252
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; thuringiensis





# = GS NC_006274.1/357462-357360
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_007530.2/342356-342254
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAAC02000001.1/859268-859166
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEK01000051.1/8150-8252
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NZ_AAEN01000023.1/83441-83339
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEO01000030.1/96886-96784
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEP01000046.1/48810-48708
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEQ01000034.1/49483-49381
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAER01000042.1/191339-191441
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAES01000043.1/48479-48377
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAAC02000001.1/794076-794178
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_003030.1/1002176-1002275
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Clostridiaceae; Clostridium; acetobutylicum





# = GS NC_006582.1/1554717-1554819
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; clausii





# = GS NC_003909.8/336194-336296
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_003997.3/295331-295433
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005945.1/295344-295446
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005957.1/309524-309626
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; thuringiensis





# = GS NC_006274.1/309094-309196
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;



Bacillus; cereus group; Bacillus; cereus





# = GS NC_007530.2/295331-295433
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;



Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAAC02000001.1/812243-812345
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEK01000064.1/23153-23051
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NZ_AAEN01000023.1/36414-36516
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEO01000030.1/49824-49926
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEP01000046.1/1785-1887
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEQ01000034.1/2457-2559
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAER01000042.1/238364-238262
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAES01000043.1/1489-1591
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_004193.1/769686-769787
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Oceanobacillus; iheyensis





# = GS NC_004722.1/343847-343745
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NZ_AAAW03000042.1/15038-14936
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Peptococcaceae; Desulfitobacterium; hafniense





# = GS NZ_AADT03000005.1/34629-34731
taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacter-



iales; Thermoanaerobacteriaceae; Moorella group; Moorella;



thermoacetica





# = GS NC_002570.2/648442-648544
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; halodurans





# = GS NC_004722.1/298774-298876
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_006510.1/272473-272575
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Geobacillus; kaustophilus





# = GS NZ_AADW02000004.1/225764-225662
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Exiguobacterium; sp.





# = GS NC_006274.1/3685852-3685750
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NZ_AAGO01000011.1/2345-2247
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Lactococcus; lactis





# = GS NZ_AAAK03000113.1/5724-5822
taxonomy Bacteria; Firmicutes; Lactobacillales;



Enterococcaceae; Enterococcus; faecium





# = GS NZ_AADW02000005.1/186378-186480
taxonomy Bacteria; Firmidutes; Bacillales; Bacillaceae;



Exiguobacterium; sp.





# = GS NC_003909.8/3578068-3577966
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_003997.3/3605298-3605196
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_004722.1/3766254-3766152
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_005945.1/3605993-3605891
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_005957.1/3626969-3626867
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; thuringiensis





# = GS NC_007530.2/3605425-3605323
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAAC02000001.1/4059572-4059470
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEN01000012.1/139994-140096
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEO01000020.1/51249-51351
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEP01000042.1/12489-12387
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAEQ01000019.1/51159-51261
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAER01000035.1/118314-118212
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NZ_AAES01000032.1/51320-51422
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; anthracis





# = GS NC_006371.1/1538896-1538796
taxonomy Bacteria; Proteobacteria; Gammaproteobacteria;



Vibrionales; Vibrionaceae; Photobacterium; profundum





# = GS NC_004460.1/504371-504471
taxonomy Bacteria; Proteobacteria; Gammaproteobacteria;



Vibrionales; Vibrionaceae; Vibrio; vulnificus





# = GS NC_005140.1/1130553-1130653
taxonomy Bacteria; Proteobacteria; Gammaproteobacteria;



Vibrionales; Vibrionaceae; Vibrio; vulnificus





# = GS NC_004116.1/1094305-1094209
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; agalactiae





# = GS NC_004368.1/1163490-1163394
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; agalactiae





# = GS NZ_AADT03000005.1/42245-42347
taxonomy Bacteria; Firmicutes; Clostridia;



Thermoanaerobacteriales; Thermoanaerobacteriaceae; Moorella



group; Moorella; thermoacetica





# = GS NC_004557.1/2551392-2551293
taxonomy Bacteria; Firmicutes; Clostridia;



Clostridiales; Clostridiaceae; Clostridium; tetani





# = GS NC_003028.1/1754786-1754882
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pneumoniae





# = GS NC_003098.1/1634825-1634921
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pneumoniae





# = GS NZ_AAGY01000085.1/5640-5736
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pneumoniae





# = GS NZ_AAEK01000001.1/183902-184006
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_003454.1/1645820-1645721
taxonomy Bacteria; Fusobacteria; Fusobacterales;



Fusobacteriaceae; Fusobacterium; nucleatum





# = GS NZ_AADW02000027.1/2980-2880
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Exiguobacterium; sp.





# = GS NZ_AADW02000005.1/179959-180061
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Exiguobacterium; sp.





# = GS NC_006582.1/1039390-1039492
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; clausii





# = GS NC_002737.1/930749-930845
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pyogenes





# = GS NC_003485.1/910599-910695
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pyogenes





# = GS NC_004070.1/846557-846653
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pyogenes





# = GS NC_004606.1/977077-977173
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pyogenes





# = GS NC_006086.1/857664-857760
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pyogenes





# = GS NZ_AAFV01000199.1/507-411
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; pyogenes





# = GS NC_004605.1/1369721-1369821
taxonomy Bacteria; Proteobacteria; Gammaproteobacteria;



Vibrionales; Vibrionaceae; Vibrio; parahaemolyticus





# = GS NZ_AAAW03000004.1/66862-66959
taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;



Peptococcaceae; Desulfitobacterium; hafniense





# = GS NC_004567.1/2968830-2968731
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; plantarum





# = GS NZ_AAEV01000003.1/13765-13667
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Pediococcus; pentosaceus





# = GS NC_002570.2/806873-806971
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; halodurans





# = GS NZ_AAEK01000052.1/27552-27452
taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae;



Bacillus; Bacillus; cereus group; Bacillus; cereus





# = GS NC_006814.1/237722-237626
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; acidophilus





# = GS NC_005363.1/3414604-3414703
taxonomy Bacteria; Proteobacteria; Deltaproteobacteria;



Bdellovibrionales; Bdellovibrionaceae; Bdellovibrio;



bacteriovorus





# = GS NZ_AADT03000021.1/9410-9513
taxonomy Bacteria; Firmicutes; Clostridia;



Thermoanaerobacteriales; Thermoanaerobacteriaceae; Moorella



group; Moorella; thermoacetica





# = GS NC_004668.1/2288426-2288328
taxonomy Bacteria; Firmicutes; Lactobacillales;



Enterococcaceae; Enterococcus; faecalis





# = GS NC_006449.1/1182949-1183044
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; thermophilus





# = GS NZ_AAGS01000026.1/9921-10016
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; thermophilus





# = GS NZ_AAGQ01000089.1/170-269
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; delbrueckii





# = GS NZ_AABF02000293.1/1-98
taxonomy Bacteria; Fusobacteria; Fusobacteria



(class); Fusobacterales; Fusobacteriaceae; Fusobacterium;



nucleatum





# = GS NC_006448.1/1185082-1185177
taxonomy Bacteria; Firmicutes; Lactobacillales;



Streptococcaceae; Streptococcus; thermophilus





# = GS NC_004567.1/2410478-2410577
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; plantarum





# = GS NZ_AABJ03000010.1/47704-47802
taxonomy Bacteria; Firmicutes; Lactobacillales;



Qenococcus; oeni





# = GS NZ_AADT03000002.1/89184-89083
taxonomy Bacteria; Firmicutes; Clostridia;



Thermoanaerobacteriales; Thermoanaerobacteriaceae; Moorella



group; Moorella; thermoacetica





# = GS NC_006814.1/1971978-1972076
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; acidophilus





# = GS NZ_AAA002000016.1/1579-1480
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; gasseri





# = GS NC_005362.1/1949387-1949485
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; johnsonii





# = GS NZ_AAA002000035.1/3299-3201
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; gasseri





# = GS NC_005362.1/263146-263049
taxonomy Bacteria; Firmicutes; Lactobacillales;



Lactobacillaceae; Lactobacillus; johnsonii





# = GS NC_005363.1/2004933-2004835
taxonomy Bacteria; Proteobacteria; Deltaproteobacteria;



Bdellovibrionales; Bdellovibrionaceae; Bdellovibrio;



bacteriovorus





# = GS NZ_AADT03000002.1/98293-98192
taxonomy Bacteria; Firmicutes; Clostridia;



Thermoanaerobacteriales; Thermoanaerobacteriaceae; Moorella



group; Moorella; thermoacetica





# = GS NZ_AABH02000036.1/17403-17500
taxonomy Bacteria; Firmicutes; Lactobacillales;



Leuconostoc; mesenteroides





# = GS NC_006055.1/484139-484044
taxonomy Bacteria; Firmicutes; Mollicutes;



Entomoplasmatales; Entomoplasmataceae; Mesoplasma; florum





# = GS NZ_AABH02000272.1/211-308
taxonomy Bacteria; Firmicutes; Lactobacillales;



Leuconostoc; mesenteroides





# = Gs NZ_AABJ03000003.1/47905-47810
taxonomy Bacteria; Firmicutes; Lactobacillales;



Oenococcus; oeni





# = GS NZ_AABH02000038.1/17559-17463
taxonomy Bacteria; Firmicutes; Lactobacillales;



Leuconostoc; mesenteroides





# = GS NC_006055.1/397027-397123
taxonomy Bacteria; Firmicutes; Mollicutes;



Entomoplasmatales; Entomoplasmataceae; Mesoplasma; florum












NC_003869.1/586365-586467
SEQ ID NO: 12









AAAAAUUUAAUAAGA.AG.CACUCAUAUAAUCCCGAGA.AU.AUGGCUCGGGA.GUCUCUACCGAACAACC..GUAAAUUGUUC.G.ACUAUGAGUGAAAGU.



GUACCUAGGG












NC_002570.2/1593074-1592972
SEQ ID NO: 13









AUUUACAUUAAAAAA.AG.CACUCGUAUAAUCGCGGGA.AU.AGGGCCCGCAA.GUUUCUACCAGGCUGCC..GUAAACAGCCU.G.ACUACGAGUGAUACU.



UUGACAUAGA












NC_002570.2/676475-676577
SEQ ID NO: 14









CGUUCUUUAUAUAAA.GU.ACCUCAUAUAAUCUUGGGA.AU.AUGGCCCAAAA.GUUUCUACCUGCUGACC..GUAAAUCGGCG.G.ACUAUGGGGAAAGAU.



UUUGGAUCUU












NC_000964.2/4004541-4004643
SEQ ID NO: 15









CAUCUUAGAAAAAGA.CA.UUCUUGUAUAUGAUCAGUA.AU.AUGGUCUGAUU.GUUUCUACCUAGUAACC..GUAAAAAACUA.G.ACUACAAGAAAGUUU.



GAAUAAAUUU












NC_003366.1/513088-512986
SEQ ID NO: 16









UAAGUGUAUUAAAUU.UU.AACUCGUAUAUAAUCGGUA.AU.AUGGUCCGAAA.GUUUCUACCUGCUAACC..GUAAAAUAGCA.G.ACUACGAGGAGUUGU.



ACUAUAAAUU












NC_000964.2/693775-693877
SEQ ID NO: 17









AGAAAUCAAAUAAGA.UG.AAUUCGUAUAAUCGCGGGA.AU.AUGGCUCGCAA.GUCUCUACCAAGCUACC..GUAAAUGGCUU.G.ACUACGUAAACAUUU.



CUUUCGUUUG












NC_002570.2/650309-650411
SEQ ID NO: 18









AAUAAAUCGAAAACA.UC.AUUUCGUAUAAUGGCAGGA.AU.AGGGCCUGCGA.GUUUCUACCAAGCUACC..GUAAAUAGCUU.G.ACUACGAAAAUAAUG.



GGUUUUUUAC












NC_006270.2/2294931-2294831
SEQ ID NO: 19









AAUUUGAUACAUUAU.AU.CACUCAUAUAAUCGCGUGG.AU.AUGGCACGCAA.GUUUCUACCGGGCA-CC..GUAAA-UGUCC.G.ACUAUGAGUGGGCGA.



UAAGAAAACG












NC_006322.1/2295788-2295688
SEQ ID NO: 20









AAUUUGAUACAUUAU.AU.CACUCAUAUAAUCGCGUGG.AU.AUGGCACGCAA.GUUUCUACCGGGCA-CC..GUAAA-UGUCC.G.ACUAUGAGUGGGCGA.



UAAGAAAACG












NC_004193.1/1103943-1104045
SEQ ID NO: 21









AAACCUUAUAUAUAG.UU.UUUUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGGUUUACC..GUAAAUGAACC.G.ACUAUGGAAAAGCGG.



AAAAUUCGAU












NC_003909.8/1650407-1650509
SEQ ID NO: 22









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NC_003997.3/1497571-1497673
SEQ ID NO: 23









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NC_005945.1/1497645-1497747
SEQ ID NO: 24









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NC_005957.1/1521880-1521982
SEQ ID NO: 25









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NC_006274.1/1532482-1532584
SEQ ID NO: 26









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NC_007530.2/1497694-1497796
SEQ ID NO: 27









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NZ_AAAC02000001.1/1985987-1986089
SEQ ID NO: 28









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NZ_AAEK01000008.1/96466-96364
SEQ ID NO: 29









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NZ_AAEN01000011.1/67143-67245
SEQ ID NO: 30









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NZ_AAE001000025.1/66855-66957
SEQ ID NO: 31









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NZ_AAEP01000035.1/69365-69467
SEQ ID NO: 32









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NZ_AAEQ01000029.1/70848-70950
SEQ ID NO: 33









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NZ_AAER01000023.1/58182-58284
SEQ ID NO: 34









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NZ_AAES01000034.1/69121-69223
SEQ ID NO: 35









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAUUU












NC_004722.1/1515239-1515341
SEQ ID NO: 36









AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.



GAAUAUAAUU












NC_000964.2/697711-697813
SEQ ID NO: 37









CAUGAAAUCAAAACA.CG.ACCUCAUAUAAUCUUGGGA.AU.AUGGCCCAUAA.GUUUCUACCCGGCAACC..GUAAAUUGCCG.G.ACUAUGCAGGAAAGU.



GAUCGAUAAA












NC_002976.3/54816-54919
SEQ ID NO: 38









CAUAAAAUAAUUUAU.AU.GACUCAUAUAAUCUAGAGA.AU.AUGGCUUUAGAaGUUUCUACCGUGUCGCC..AUAAACGACAC.G.ACUAUGAGUAACAAU.



CCAAUACAUU












NC_004461.1/2433047-2432944
SEQ ID NO: 39









CAUAAAAUAAUUUAU.AU.GACUCAUAUAAUCUAGAGA.AU.AUGGCUUUAGAaGUUUCUACCGUGUCGCC..AUAAACGACAC.G.ACUAUGAGUAACAAU.



CCAAUACAUU












NC_003030.1/2824953-2824855
SEQ ID NO: 40









AAUCGUUAAUAUAGU.UU.AACUCAUAUAU-UUCCUGA.AU.AUGGCAGGAU-.GUUUCUACAAGGAA-CC..UUAAA-UUUCU.U.ACUAUGAGUGAUUUG.



UUUGUAUGCA












NC_003909.8/296465-296567
SEQ ID NO: 41









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NC_003997.3/262601-262703
SEQ ID NO: 42









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NC_004722.1/261558-261660
SEQ ID NO: 43









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NC_005945.1/262614-262716
SEQ ID NO: 44









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NC_005957.1/268537-268639
SEQ ID NO: 45









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NC_006274.1/267835-267937
SEQ ID NO: 46









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NC_007530.2/262601-262703
SEQ ID NO: 47









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NZ_AAAC02000001.1/796036-796138
SEQ ID NO: 48









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NZ_AAEKO1000017.1/88397-88499
SEQ ID NO: 49









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NZ_AAEN01000023.1/16980-17082
SEQ ID NO: 50









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NZ_AAE001000030.1/17090-17192
SEQ ID NO: 51









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NZ_AAEP01000043.1/12962-13064
SEQ ID NO: 52









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NZ_AAEQ01000040.1/4962-4860
SEQ ID NO: 53









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NZ_AAER01000030.1/4374-4272
SEQ ID NO: 54









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NZ_AAES01000040.1/14760-14862
SEQ ID NO: 55









GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.G.ACUACGAGGCGUUUU.



UAUAAAGGUG












NC_006270.2/693198-693300
SEQ ID NO: 56









AGUAAUUUAAAAAAG.AC.UUGUCGUAUAAUCAUGGGG.AU.AUGGCCCAUAA.GUUUCUACCAAGCUACC..GUAAAUAGCUU.G.ACUACGCUUGUAUAC.



AAUAUUUUAU












NC_006322.1/692981-693083
SEQ ID NO: 57









AGUAAUUUAAAAAAG.AC.UUGUCGUAUAAUCAUGGGG.AU.AUGGCCCAUAA.GUUUCUACCAAGCUACC..GUAAAUAGCUU.G.ACUACGCUUGUAUAC.



AAUAUUUUAU












NC_004722.1/259601-259703
SEQ ID NO: 58









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAUG.AU.AUGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NC_000964.2/2319410-2319310
SEQ ID NO: 59









UUACAAUAUAAUAGG.AA.CACUCAUAUAAUCGCGUGG.AU.AUGGCACGCAA.GUUUCUACCGGGCA-CC..GUAAA-UGUCC.G.ACUAUGGGUGAGCAA.



UGGAACCGCA












NC_003030.1/2905050-2904948
SEQ ID NO: 60









GAAAAGUAAUAACAU.AU.UACCCGUAUAUGCUUAGAA.AU.AUGGUCUAAGC.GUCUCUACCGGACUGCC..GUAAAUUGUCU.G.ACUAUGGGUGUUUAU.



AAGUAUUUUA












NC_003210.1/611119-611017
SEQ ID NO: 61









AAUCCGCUACAAUAA.UA.UAGUCGUAUAAGUUCGGUA.AU.AUGGACCGUUC.GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.



AAAAUUUAAU












NZ_AADQO1000001.1/2431-2533
SEQ ID NO: 62









AAUCCGCUACAAUAA.UA.UAGUCGUAUAAGUUCGGUA.AU.AUGGACCGUUC.GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.



AAAAUUUAAU












NC_003212.1/610156-610054
SEQ ID NO: 63









AAUCGUCUACAAUAA.UA.AAGUCGUAUAAGUUCGGUA.AU.AUGGACCGUUC.GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.



AAAAUUUAAU












NC_003366.1/422820-422922
SEQ ID NO: 64









UAUGUACUUAUAUAA.GU.AUAUCGUAUAUGCUCGACG.AU.AUGGGUUGAGU.GUUUCUACUAGGAGGCC..GUAAACAUCCU.A.ACUACGAAUAUAUAG.



GUGAUUUCUA












NC_002973.5/617846-617744
SEQ ID NO: 65









AAUCCGCUACAAUAA.UA.AAGUCGUAUAAGUUCGGUA.AU.AUGGACCGUUC.GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.



AAAAUUUAAU












NZ_AADR01000111.1/1598-1496
SEQ ID NO: 66









AAUCCGCUACAAUAA.UA.AAGUCGUAUAAGUUCGGUA.AU.AUGGACCGUUC.GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.



AAAAUUUAAU












NC_003909.8/294509-294611
SEQ ID NO: 67









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AUGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NC_005957.1/266577-266679
SEQ ID NO: 68









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AUGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NC_006274.1/265875-265977
SEQ ID NO: 69









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AUGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NC_003366.1/2618421-2618323
SEQ ID NO: 70









AAAACGGAAUAUAAA.CA.AACUCGUAUAA-GCUUUGA.AU.AAGGCAAGGC-.GUUUCUACCGGAAA-CC..UUAAA-UUUCC.G.UCUAUGAGUGAAUUU.



GAUAUACUAU












NC_002973.5/1940027-1939926
SEQ ID NO: 71









AUAACUUAAAACCGA.AA.UACUUAUAUAAUAGUUGCG.AU.-UGGGCGACGA.GUUUCUACCUGGUUACC..GUAAAUAACCG.G.ACUAUGAGUAGUUUG.



UAUAAAGAAG












NC_003212.1/2013345-2013244
SEQ ID NO: 72









AUAACUUAAAACCGA.AA.UACUUAUAUAAUAGUUGCG.AU.-UGGGCGACGA.GUUUCUACCUGGUUACC..GUAAAUAACCG.G.ACUAUGAGUAGUUUG.



UAUAAAGAAG












NZ_AADQ01000095.1/1690-1791
SEQ ID NO: 73









AUAACUUAAAACCGA.AA.UACUUAUAUAAUAGUUGCG.AU.-UGGGCGACGA.GUUUCUACCUGGUUACC..GUAAAUAACCG.G.ACUAUGAGUAGUUUG.



UAUAAAGAAG












NZ_AADR01000082.1/3415-3516
SEQ ID NO: 74









AUAACUUAAAACCGA.AA.UACUUAUAUAAUAGUUGCG.AU.-UGGGCGACGA.GUUUCUACCUGGUUACC..GUAAAUAACCG.G.ACUAUGAGUAGUUUG.



UAUAAAGAAG












NC_003366.1/2871201-2871101
SEQ ID NO: 75









AUAAAAAAAUAAAUU.UU.GCUUCGUAUAACUCUAAUG.AU.AUGGAUUAGAG.GUCUCUACCAAGAA-CC..GAGAA-UUCUU.G.AUUACGAAGAAAGCU.



UAUUUGCUUU












NC_002745.2/430797-430900
SEQ ID NO: 76









GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA.AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC.



ACAAUACUAG












NC_002758.2/430754-430857
SEQ ID NO: 77









GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA.AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC.



ACAAUACUAG












NC_002951.2/460059-460162
SEQ ID NO: 78









GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA.AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC.



ACAAUACUAG












NC_002952.2/441050-441153
SEQ ID NO: 79









GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA.AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC.



ACAAUACUAG












NC_002953.3/409191-409294
SEQ ID NO: 80









GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA.AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC.



ACAAUACUAG












NC_003923.1/410546-410649
SEQ ID NO: 81









GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA.AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC.



ACAAUACUAG












NC_006582.1/1115665-1115767
SEQ ID NO: 82









AAGUUAAAAACGAAA.AC.ACCUCAUAUAUACUCGGGA.AU.AUGGCUCGAAC.GUUUCUACCCGGCAACC..GUAAAUUGCCG.G.ACUAUGAGGGGAAGU.



CAUUACGCGC












NC_006510.1/274257-274359
SEQ ID NO: 83









AUGAAUAUUGUUGAA.UU.CCGUCGUAUAAUCCCGGGA.AU.AUGGCUCGGGA.GUUUCUACCAAGCUACC..GUAAAUAGCUU.G.ACUACGAGGGAUGCG.



GGAUCGGAGA












NC_003210.1/1958922-1958821
SEQ ID NO: 84









AUAACUUAAAACCGA.AA.UACUUGUAUAAUAGUUGCG.AU.-UGGGCGACGA.GUUUCUACCUGGUUACC..GUAAAUAACCG.G.ACUAUGAGUAGUUUG.



UAUAAAGAAG












NC_006270.2/697054-697156
SEQ ID NO: 85









CAUGACAUGAAAACA.CA.UCCUCAUAUAAUCUUGGGA.AU.AUGGCCCAUAA.GUCUCUACCCGAUGACC..GUAAAUCAUCG.G.ACUAUGCAGGAAAGU.



GGACAAUAAA












NC_006322.1/696838-696940
SEQ ID NO: 86









CAUGACAUGAAAACA.CA.UCCUCAUAUAAUCUUGGGA.AU.AUGGCCCAUAA.GUCUCUACCCGAUGACC..GUAAAUCAUCG.G.ACUAUGCAGGAAAGU.



GGACAAUAAA












NC_006510.1/282580-282682
SEQ ID NO: 87









AUAGUGUAUGAGAAG.AU.CCCUCAUAUAAUUUUGGGA.AU.AUGGCCCAAAA.GUUUCUACCCAAUCACC..GUAAAUGAUUG.G.ACUAUGAGGGAAAGG.



AUCGGUUUUG












NC_003997.3/260641-260743
SEQ ID NO: 88









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NC_005945.1/260654-260756
SEQ ID NO: 89









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NC_007530.2/260641-260743
SEQ ID NO: 90









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NZ_AAEK01000017.1/86437-86539
SEQ ID NO: 91









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NZ_AAEN01000023.1/15020-15122
SEQ ID NO: 92









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NZ_AAEO01000030.1/15130-15232
SEQ ID NO: 93









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NZ_AAEP01000043.1/11002-11104
SEQ ID NO: 94









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NZ_AAEQ01000040.1/6922-6820
SEQ ID NO: 95









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NZ_AAER01000030.1/6334-6232
SEQ ID NO: 96









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NZ_AAES01000040.1/12800-12902
SEQ ID NO: 97









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.



UGUCUUAUAU












NC_004193.1/760473-760575
SEQ ID NO: 98









CAAUUUUUAUCCAAU.GC.CUUUCGUAUAUCCUCGAUA.AU.AUGGUUCGAAA.GUAUCUACCGGGUCACC..GUAAAUGAUCU.G.ACUAUGAAGGCAGAA.



GCAGGUUCGG












NC_006270.2/4024210-4024312
SEQ ID NO: 99









AAAUAAUAGAAGCCC.AC.UUCUUGUAUAUAAUCAGUA.AU.AGGGUCUGAUU.GUUUCUACCUGGCAACC..GUAAAUCGCCA.G.ACUACAAGGAAGUUU.



GAAUAGAUUU












NC_006322.1/4024324-4024426
SEQ ID NO: 100









AAAUAAUAGAAGCCC.AC.UUCUUGUAUAUAAUCAGUA.AU.AGGGUCUGAUU.GUUUCUACCUGGCAACC..GUAAAUCGCCA.G.ACUACAAGGAAGUUU.



GAAUAGAUUU












NC_004193.1/786767-786868
SEQ ID NO: 101









CCGACAAUUGAAAAU.GA.ACCUCAUAUAAAUUUGAGA.AU.AUGGCUCAGAA.GUUUCUACCCAGC-ACC..GUAAAUGGCUG.G.ACUAUGAGGGAAGAU.



GGAUCAUUUC












NC_002662.1/1159509-1159607
SEQ ID NO: 102









UAGUCUAUAAUAGAA.CA.AUCUUAUUUAU-ACCUAGG.AU.AUGGCUGGGC-.GUUUCUACCUCGUA-CC..GUAAA-UGCGA.G.ACAAUAAGGAAAUUC.



GAUUUUUUAG












NC_000964.2/625993-625893
SEQ ID NO: 103









AAUUAAAUAGCUAUU.AU.CACUUGUAUAACCUCAAUA.AU.AUGGUUUGAGG.GUGUCUACCAGGAA-CC..GUAAA-AUCCU.G.AUUACAAAAUUUGUU.



UAUGACAUUU












NC_003909.8/382630-382528
SEQ ID NO: 104









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NC_003997.3/342356-342254
SEQ ID NO: 105









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NC_005945.1/342369-342267
SEQ ID NO: 106









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NC_005957.1/356354-356252
SEQ ID NO: 107









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NC_006274.1/357462-357360
SEQ ID NO: 108









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NC_007530.2/342356-342254
SEQ ID NO: 109









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAACO2000001.1/859268-859166
SEQ ID NO: 110









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAEK01000051.1/8150-8252
SEQ ID NO: 111









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAEN01000023.1/83441-83339
SEQ ID NO: 112









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAEO01000030.1/9688G-96784
SEQ ID NO: 113









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAEP01000046.1/48810-48708
SEQ ID NO: 114









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAEQ01000034.1/49483-49381
SEQ ID NO: 115









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAER01000042.1/191339-191441
SEQ ID NO: 116









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAES01000043.1/48479-48377
SEQ ID NO: 117









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAAC02000001.1/794076-794178
SEQ ID NO: 118









AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.ARGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAWGGCAGUG.



UGUCUUAUAU












NC_003030.1/1002176-1002275
SEQ ID NO: 119









UAUAUAAAAAACUAA.AU.UUCUCGUAUAC-ACCGGUA.AU.AUGGUCCGGAA.GUUUCUACCUGCUG-CC..AUAAA-UAGCA.G.ACUACGGGGUGUUAU.



UGAUAAUAUA












NC_006582.1/1554717-1554819
SEQ ID NO: 120









UAAACGAACAAAGCA.UC.AGCUCGUAUAAUAGCGGUA.AU.AUGGUCCGCGA.GUCUCUACCAGGCUGCC..GAUAACGGCCU.G.ACUACGAGUGGUCUU.



UUUCAGUUGU












NC_003909.8/336194-336296
SEQ ID NO: 121









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NC_003997.3/295331-295433
SEQ ID NO: 122









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NC_005945.1/295344-295446
SEQ ID NO: 123









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NC_005957.1/309524-309626
SEQ ID NO: 124









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NC_006274.1/309094-309196
SEQ ID NO: 125









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NC_007530.2/295331-295433
SEQ ID NO: 126









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NZ_AAAC02000001.1/512243-812345
SEQ ID NO: 127









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NZ_AAEK01000064.1/23153-23051
SEQ ID NO: 128









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NZ_AAEN01000023.1/36414-36516
SEQ ID NO: 129









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NZ_AAEO01000030.1/49824-49926
SEQ ID NO: 130









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NZ_AAEP01000046.1/1785-1887
SEQ ID NO: 131









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NZ_AAEQ01000034.1/24572559
SEQ ID NO: 132









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NZ_AAER01000042.1/238364-238262
SEQ ID NO: 133









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NZ_AAES01000043.1/1489-1591
SEQ ID NO: 134









AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NC_004193.1/769686-769787
SEQ ID NO: 135









UGAUGUAAUUGAAUA.GA.AAUGCGUAUAAUUAAGGGG.AU.AUGGCCC-ACA.GUUUCUACCAGACCACC..GUAAAUGGUUU.G.ACUACGCAGUAAUUA.



UAUUUGUAUC












NC_004722.1/343847-343745
SEQ ID NO: 136









UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUUGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.



AAUGAAUAUU












NZ_AAAW03000042.1/15038-14936
SEQ ID NO: 137









AAAAAAUUUAAAUAG.GG.CGUUCAUAUAAUCGCGGAG.AU.AGGGUCCGCAA.GUUUCUACCGGGCUGCC..GUAAAUGGCCU.G.ACUAUGAGCGAAACU.



GUGCCCAGGG












NZ_AADT03000005.1/34629-34731
SEQ ID NO: 138









AAAAAUAAUUUAACC.CU.GCUUCGUAUAUUCCCGGAA.AU.GCGGUCCGGGA.GUUUCUACCAGGCAACC..GUAAAUUGCCC.G.GCUACGAAGGUUAUU.



CUUCGUCGUC












NC_002570.2/648442-648544
SEQ ID NO: 139









ACAUGUAGAUAUCAU.CC.CUUUCGUAUAUACUUGGAG.AU.AAGGUCCAGGA.GUUUCUACCAGAUCACC..GUAAAUGAUCU.G.ACUAUGAAGGUGGAA.



UGGCUCGAUA












NC_004722.1/298774-298876
SEQ ID NO: 140









AGAAACAAUAAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.



CACUCGGUAU












NC_006510.1/272473-272575
SEQ ID NO: 141









UACGGAUAGACGAAA.GC.CCUUCAUAUAAGCGCAAGA.AU.AUGGCUUGCGC.GUCUCUACCGGGCCGCC..GUAAACGGCCC.G.ACUAUGAAGGCAGAA.



GACGCUGCUA












NZ_AADW02000004.1/225764-225662
SEQ ID NO: 142









GGAAGAUUGAAUAUA.AC.ACCUCGUAUAAUAGCAGGG.AU.AUGGCUUGCAA.GUUUCUACCCGACGACC..CUAAAUCGUUG.G.ACUAUGGGGUAUAUG.



GAUGUUCGUC












NC_006274.1/3685852-3685750
SEQ ID NO: 143









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGAAAAA.



GUGUGUAACA












NZ_AAG001000011.1/2345-2247
SEQ ID NO: 144









UUGAGCUGCUAUAAA.CA.AUCUUAUUUAU-ACCUAGA.AU.AUGGCUGGGC-.GUUUCUACCUCGUA-CC..GUAAA-UGCGA.G.ACAAUAAGGAAAUUC.



GAUUUUUCAA












NZ_AAAK03000113.1/5724-5822
SEQ ID NO: 145









UAUUAGUUUUACAUA.CC.UACUUAUAUAU-CGUCAUA.AU.AUGGAUGACA-.GUUUCUAGCCAGUA-CC..GUAAA-UGCUG.G.ACUAUAAGUAAAAGA.



UUGGCUAUUU












NZ_AADW02000005.1/186378-186480
SEQ ID NO: 146









CUAAAAAACAAAAAA.UA.AUGCCGUAUAAUUCUGGGG.AU.AUGGCCCGGAA.GUCUCUACAGGAACACC..UUAAAGGUUCC.U.ACUACGGCGUGCACU.



GAUUUCCGGU












NC_003909.8/3578068-3577966
SEQ ID NO: 147









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NC_003997.3/3605298-3605196
SEQ ID NO: 148









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NC_004722.1/3766254-3766152
SEQ ID NO: 149









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NC_005945.1/3605993-3605891
SEQ ID NO: 150









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NC_005957.2.13626969-3626867
SEQ ID NO: 151









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NC_007530.2/3605425-3605323
SEQ ID NO: 152









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NZ_AAAC02000001.1/4059572-4059470
SEQ ID NO: 153









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NZ_AAEN01000012.1/139994-140096
SEQ ID NO: 154









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NZ_AAEO01000020.1/51249-51351
SEQ ID NO: 155









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NZ_AAEP01000042.1/12489-12387
SEQ ID NO: 156









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NZ_AAEQ01000019.1/51159-51261
SEQ ID NO: 157









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NZ_AAER01000035.1/118314-118212
SEQ ID NO: 158









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NZ_AAES01000032.1/51320-51422
SEQ ID NO: 159









CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.



GUGUGUAACA












NC_006371.1/1538896-1538796
SEQ ID NO: 160









UAAUAAUGGGUAAAG.UU.ACUUCGUAUAACCCCACUU.AU.AGGGUGUGGGG.GUCUCUACCAGAAU-CC..GUAAA-AUUCU.G.AUUACGAAGAGUUGA.



GUGAUUGAUC












NC_004460.1/504371-504471
SEQ ID NO: 161









GACUUUCGGCGAUCA.AC.GCUUCAUAUAAUCCUAAUG.AU.AUGGUUUGGGA.GUUUCUACCAAGAG-CC..UUAAA-CUCUU.G.AUUAUGAAGUCUGUC.



GCUUUAUCCG












NC_005140.1/1130553-1130653
SEQ ID NO: 162









GACUUUCGGCGAUCA.AC.GCUUCAUAUAAUCCUAAUG.AU.AUGGUUUGGGA.GUUUCUACCAAGAG-CC..UUAAA-CUCUU.G.AUUAUGAAGUCUGUC.



GCUUUAUCCG












NC_004116.1/1094305-1094209
SEQ ID NO: 163









CAAUUAAAUAUAUGA.UU.UACUUAUUUAU-GCUGAGG.AU.-UGGCUUAGC-.GUCUCUACAAGACA-CC..GU-AA-UGUCU.A.ACAAUAAGUAAGCUA.



AUAAAUAGCU












NC_004368.1/1163490-1163394
SEQ ID NO: 164









CAAUUAAAUAUAUGA.UU.UACUUAUUUAU-GCUGAGG.AU.-UGGCUUAGC-.GUCUCUACAAGACA-CC..GU-AA-UGUCU.A.ACAAUAAGUAAGCUA.



AUAAAUAGCU












NZ_AADT03000005.1/42245-42347
SEQ ID NO: 165









CAAUAAAGCCAGUCU.UA.ACUUCGUAUAUCCCCGGCA.AU.AGGGACCGGGG.GUUUCUACCAGGCAACC..GCAAAUUGCCC.G.GCUACGAAGGUAUAU.



CUUCGUUGCU
T












NC_004557.1/2551392-2551293
SEQ ID NO: 166









CUCUAUAAUAAAUUA.UU.GACUCAUAUAU-CCCCUUA.AU.AAGGUAGGGA-.GUAUCUACCAGAAG-CC..UUAAA-CUUCU.G.ACUAUGAGUGAAUAAa



GCAUUGUCAU












NC_003028.1/1754786-1754882
SEQ ID NO: 167









AAAAUUGAAUAUCGU.UU.UACUUGUUUAU-GUCGUGA.AU.-UGGCACGAC-.GUUUCUACAAGGUG-CC..GG-AA-CACCU.A.ACAAUAAGUAAGUCA.



GCAGUGAGAU












NC_003098.1/1634825-1634921
SEQ ID NO: 168









AAAAUUGAAUAUCGU.UU.UACUUGUUUAU-GUCGUGA.AU.-UGGCACGAC-.GUUUCUACAAGGUG-CC..GG-AA-CACCU.A.ACAAUAAGUAAGUCA.



GCAGUGAGAU












NZ_AAGY01000085.1/5640-5736
SEQ ID NO: 169









AAAAUUGAAUAUCGU.UU.UACUUGUUUAU-GUCGUGA.AU.-UGGCACGAC-.GUUUCUACAAGGUG-CC..GG-AA-CACCU.A.ACAAUAAGUAAGUCA.



GCAGUGAGAU












NZ_AAEK01000001.1/183902184006
SEQ ID NO: 170









AUAAUUUUACACAUU.AU.CACUCGUAUAUACUCGGUA.AU.AUGGUCCGAGC.GUUUCUACCUAGUUCCCaaUGAAAGAACUG.G.ACUACGGGUUAAAGU.



AUUCGGUCGC












NC_003454.1/1645820-1645721
SEQ ID NO: 171









UAAAUAAUUUUAAUA.AA.AAUUCGUAUAA-GCCUAAU.AU.AUGGAAGGGU-.GUCCCUAC-GGUUAACC..AUAAAUUAACC.A.GCUACGAAAAAUGUU.



UUACUGUGUU












NZ_AADW02000027.1/2980-2880
SEQ ID NO: 172









GAAUAAACGUAUAGC.AA.CGCUCGUAUAAUAGUGGGG.AU.-UGGCCCACGA.GUCUCUACCGGAUCGCC..GU-AACGAUCC.G.ACUACGGGUGGUGAG.



UUACUGCUCU












NZ_AADW02000005.1/179959-180061
SEQ ID NO: 173









AAAUCAUACAUGCAU.CU.CCUUUGUAUAUACUCGCGA.AU.AUGGCGUGAGA.GUCUCUACCGGGUCACC..UUAAACGACCU.G.ACUAUGAAGGAGCAG.



ACCCUUCGUA












NC_006582.1/1039390-1039492
SEQ ID NO: 174









AAUGUCCAAUAGGAA.AA.UACCCGUAUAAUUGCAGGA.AU.AAGGCCUGCAC.GUUUCUACCGAGCCACC..GUAAAUGGCUU.G.ACUACGGCAUGAUAA.



AUGGAGCGCA












NC_002737.1/930749-930845
SEQ ID NO: 175









UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA.AU.-UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.



UUAGGCUUGC












NC_003485.1/910599-910695
SEQ ID NO: 176









UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA.AU.-UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.



UUAGGCUUGC












NC_004070.1/846557-846653
SEQ ID NO: 177









UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA.AU.-UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.



UUAGGCUUGC












NC_004606.1/977077-977173
SEQ ID NO: 178









UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA.AU.-UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.



UUAGGCUUGC












NC_006086.1/857664-857760
SEQ ID NO: 179









UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA.AU.-UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.



UUAGGCUUGC












NZ_AAFV01000199.1/507-411
SEQ ID NO: 180









UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA.AU.-UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.



UUAGGCUUGC












NC_004605.1/1369721-1369821
SEQ ID NO: 181









UAUAAUCGCAAGCGU.UU.GCUUCGUAUAACCCCAAUG.AU.AUGGUUUGGGG.GUCUCUACCAGUUC-CC..GCAAA-GUGCU.G.AUUACGAAGAGUUGA.



GAUCACUGUG












NZ_AAAW03000004.1/66862-66959
SEQ ID NO: 182









UGAACUGAAUAAUAA.AA.UUUUUAUAUAA-GUUCAUA.AU.-GGGUUGAAC-.GUCUCUACCAACUA-CC..GUAAA-UAGUU.G.AUUAUAAAAAUUUCG.



AACGGAAUGA












NC_004567.1/2968830-2968731
SEQ ID NO: 183









UUAUCAAUACAACUA.AU.UGCCUAUAUAAU-GCCAUG.AU.AUGGAUGGCGA.GUUUCUACCCAGUG-CC..GUAAA-CACUG.G.ACUAUAAGCGAAUUG.



AGUCGACGGG












NZ_AAEV01000003.1/13765-13667
SEQ ID NO: 184









ACAAUCAAAUAAAAC.UU.AGCCUAUAUAAUGUC-UUA.AU.CUGGU-UGACA.GUUUCUACCCAACG-CC..GUAAA-UGUUG.G.ACUAUAGGAAAACUA.



ACUCUUAUGU












NC_002570.2/806873-806971
SEQ ID NO: 185









UUAAUCGAGCUCAAC.AC.UCUUCGUAUAUCCUC-UCA.AU.AUGGGAUGAGG.GUCUCUAC-AGGUA-CC..GUAAA-UACCU.A.GCUACGAAAAGAAUG.



CAGUUAAUGU












NZ_AAEK010000S2.1/27552-27452
SEQ ID NO: 186









AUUAAUUACAUAUGA.GA.AUCAUGUAUAACUCCAAGA.AU.AUGGCUUGGGG.GUCUCUACCAGGAA-CC..AAUAA-CUCCU.G.ACUACAAAAUGCGUA.



UUAUAGCGUU












NC_006814.1/237722-237626
SEQ ID NO: 187









AUUAAGCCAAUACAA.AU.AUCU-AUAUAU-CGUCGAA.AU.AAGGUCGACA-.GUUUCUACCCACUA-CC..GUAAA-UGGUG.G.ACUAU-AGGUAAACG.



AAAUUAAGAA












NC_005363.1/3414604-3414703
SEQ ID NO: 188









AAAUAACUUCAUAGUgUU.UCCCCGUAUAU-GUUGCGA.AU.AGGGCGCAGC-.GUUUCUACCAGGCA-CC..UCAAA-UGCCU.G.ACUAUGGAGGUUCUU.



UGUGAAGUUG












NZ_AAUT03000021.1/9410-9513
SEQ ID NO: 189









AAAAUAAAUAUGGCA.AU.GGCCUGUAUAAUUGGGGGA.AU.AGGGCUCCCAA.GUUUCUACCGGGCAACC..GUAAAUUGCCCuG.GCUACAGCCUGAAAG.



UGUACCUCAG












NC_004668.1/2288426-2288328
SEQ ID NO: 190









UCCUAGAGAAACAUA.GA.AGCUUGUAUAA-GGUCAAA.AU.ACGGUUGACC-.GUCUCUACCCAGCA-CC..GUAAA-UGCUG.G.UCUAUAAGUGAAGAA.



GAGCGGAUAG












NC_006449.1/1182949-1183044
SEQ ID NO: 191









CAAACAAUGAGAACA.-U.UACUUAUUUAU-GUCACGA.AU.-GGGCGUGAC-.GUUUCUACAAGGUG-CC..GU-AA-CACCU.A.ACAAUAAGUAAGCUA.



AUUUAGUCAU












NZ_AAGS01000026.1/9921-10016
SEQ ID NO: 192









CAAACAAUGAGAACA.-U.UACUUAUUUAU-GUCACGA.AU.-GGGCGUGAC-.GUUUCUACAAGGUG-CC..GU-AA-CACCU.A.ACAAUAAGUAAGCUA.



AUUUAGUCAU












NZ_AAGQ01000089.1/170-269
SEQ ID NO: 193









UAACUUAACAAAUGU.UUcUGCUUAUAUAU-CGCUGCG.AU.ACGGGUAGCA-.GUCUCUACCCGGAG-CC..GUAAA-CUCCG.G.ACUAUAGGUAAAGAA.



GGGCCGGUAU












NZ_AAEF02000293.1/1-98
SEQ ID NO: 194









UAGAUAUUAAAUAAA.--.AAUUCGUAUAA-GCCUAAU.AU.AUGGAAAGGU-.GUCCCUAC-GGUUAACC..GUAAAUUAACC.A.GCUACGAAAAAUGUU.



UUUGCUGUAU












NC_006448.1/1185082-1185177
SEQ ID NO: 195









CAAACAAUGAGAACA.-U.UACUUAUUUAU-GUCACGA.AU.-GGGCGUGAC-.GUUUCUACAAGGUG-CC..GU-AA-CUCCU.A.ACAAUAAGUAAGCUA.



AUUUAGUCAU












NC_004567.1/2410478-2410577
SEQ ID NO: 196









UUCAAAUAAGUGGUA.AU.UGCCUAUAUAAU-GUCAUG.AU.AUGGUUGACGA.GUUUCUACCCAACC-CC..GUAAA-GGUUG.G.ACUAUAAGCAAACGA.



GGUCAUCCG












NZ_AABJ03000010.1/47704-47802
SEQ ID NO: 197









AUUCAGAAUGUUGAA.AA.AGCUUAUAUAUGGUCGU-A.AU.AAGG-AUGACC.GUUUCUACCCGGAG-CC..ACAAA-CUCAG.G.ACUAUAAGCAAUUAA.



GUACUUGUGC












NZ_AADT03000002.1/89184-89083
SEQ ID NO: 198









GAGUCUUCUUUUAGG.UU.UCUUCGUAUAGUCCCGGAG.AU.-UGGUCCGGGG.GUUUCUACCAGGUGACC..GG-AAUCACCUuG.GCUACGAAGGGUUAU.



UUCCUUUGUG












NC_006814.1/1971978-1972076
SEQ ID NO: 199









AUACUUAACAAUCAA.GU.UAUCUAUAUAU-CGUCGAA.AU.AAGGUCGACA-.GUAUCUACCCUGAG-CC..AUAAA-UUCAG.G.ACUAUAGGUAUCAGA.



CGUCAUAAUU












NZ_AAAO02000016.1/1579-1480
SEQ ID NO: 200









CACUCUAGAUAUCAA.AA.UAUCUAUAUAU-CGUCGUA.AU.AAGGUCGACA-.GUUUCUACCCGGAA-CCa.AUUAA-UUCUG.G.ACUAUAGGUAAUCGA.



UGUCAUAAGU












NC_005362.1/1949387-1949485
SEQ ID NO: 201









AUACUUAACAAUCAA.GU.UAUCUAUAUAU-CGUCGAA.AU.AAGGUCGACA-.GUAUCUACCCUGAG-CC..AUAAA-UUCAG.G.ACUAUAGGUAUCAGA.



CGUCAUAAAU












NZ_AAA002000035.1/3299-3201
SEQ ID NO: 202









AUACUUAACAAUCAA.GU.UAUCUAUAUAU-CGUCGAA.AU.AAGGUCGACA-.GUAUCUACCCUGAG-CC..AUAAA-UUCAG.G.ACUAUAGGUAUCAGA.



CGUCAUAAAU












NC_005362.1/263146-263049
SEQ ID NO: 203









CACUCUAGAUAUCAA.AU.AUCU-AUAUAU-CGUCGUA.AU.AAGGUCGACA-.GUUUCUACCCGGAA-CCa.AUUAA-UUCUG.G.ACUAU-AGGUAAUCG.



AUGUCAUAAG












NC_005363.1/2004933-2004835
SEQ ID NO: 204









ACUACUAACCGCGGU.UA.CACAAAUAUAA-GUCGGAG.AU.AGGGUCUGAC-.GUUUCUACCUGCCA-CC..GUAAA-GGGCA.G.UCUAUUUGGAGCGAA.



UAUAUGUUGA












NZ_AADT03000002.1/98293-98192
SEQ ID NO: 205









GACGAAUAUUACUAU.UA.GACUCGUAUAACCCCGGCG.AU.-GGGGCCGGGG.GUCUCUACCAGGUGACC..GG-AAUCACCUcG.GCUACGAGGGUGAGC.



GGCAGCUGGU












NZ_AABH02000036.1/17403-17500
SEQ ID NO: 206









CAACUAAAGAAGUUA.UU.UGCAGAUAUAU-CGUUGGA.AA.ACGGCCAACA-.GUUUCUACCACGCC-CC..-AAAA-GUCGU.G.ACUAUCCGCAAAUGU.



UUUUGACGAU












NC_006055.1/484139-484044
SEQ ID NO: 207









UAUAAAAUUAAUAUG.AA.AACUUGUAUAAUCCUUC--.AU.AUCGGGAAGGA.GUCUCUACCUAACA-CC..---AA-UGUUA.G.AUUAUGAGUUUUAUG.



GUUUUCGCUA












NZ_AABH02000272.1/211-308
SEQ ID NO: 208









CAACUAAAGAAGUUA.UU.UGCAGAUAUAU-CGUUGGA.AA.ACGGCCAACA-.GUUUCUACCACGCC-CC..-AAAA-GUCGU.G.ACUAUCCGCAAAUGU.



UUCUGACGAU












NZ_AABJ03000003.1/47905-47810
SEQ ID NO: 209









AAGAUAAAUAGCAAC.CA.AGCAGGUAUAU-CGUCGGA.UA.AUGGCUGACA-.GUUUCUACCCAACA-CC..---AA-UGUUG.G.ACUAUCUGUGGAUGU.



CUUUUUGGCG












NZ_AABH02000038.1/17559-17463
SEQ ID NO: 210









UAUUCUAUGUGAAAU.UU.AGCUGAUAUAGUAUCGA--.AUaAUGG-UCGAUU.GUUUCUAGCCAGCA-CC..---CA-UGCUG.GaACUAUCAUAAACAUG.



UUAUUUAAUU












NC_006055.1/397027-397123
SEQ ID NO: 211









AAUAAUUAAAUAUAA.AA.AACUUAUACAU-GACAACAuAU.-UGGGUUGUC-.GAC-CUGCCUCUGGACC..-U--AUCCUUA.G.ACUAUAAGCGUGAGG.



UUUUUUUACA












# = GC RF
SEQ ID NO: 212









aAAauuaaAaAAAaA.au.aacUCgUAUAAucucgggA.AU.AUGGcccgaga.GUUUCUACCaggcaaCC..GUAAAuugccu.G.ACUAcGAguaAauuu.



uauuUaUuuu





# = GC SS cons


:::::::::::::::.::.((((((((,,,<<<<<<<_._._>>>>>>>.,,,,,,,,<<<<<<<_.._>>>>>>.>.,,)))))))):::::.:::::::::


://
















TABLE 6





Atomic Coordinates of Guanine Riboswitch


















HEADER
RIBONUCLEIC ACID
05-AUG-04
1U8D








TITLE
GUANINE RIBOSWITCH BOUND TO HYPOXANTHINE


COMPND
MOL_ID: 1;









COMPND
2
 MOLECULE: XPT-PBUX MRNA;


COMPND
3
 CHAIN: A;


COMPND
4
 ENGINEERED: YES;


COMPND
5
 OTHER_DETAILS: G-BOX MRNA


SOURCE

MOL_ID: 1;


SOURCE
2
 SYNTHETIC: YES;


SOURCE
3
 OTHER_DETAILS: RNA WAS PREPARED BY IN VITRO TRANSCRIPTION.


SOURCE
4
 THE SEQUENCE OF THIS RNA CAN BE FOUND NATURALLY IN


SOURCE
5
BACILLUS SUBTILIS (BACTERIA)


KEYWDS

RNA-LIGAND COMPLEX, DOUBLE HELIX, BASE TRIPLES, BASE


KEYWDS
2
 QUADRUPLES, MRNA


EXPDTA

X-RAY DIFFRACTION


AUTHOR

R. T. BATEY, S. D. GILBERT, R. K. MONTANGE


REVDAT
1
  23-NOV-04 1U8D   0









JRNL
 AUTH
R. T. BATEY, S. D. GILBERT, R. K. MONTANGE


JRNL
 TITL
STRUCTURE OF A NATURAL GUANINE-RESPONSIVE


JRNL
 TITL 2
RIBOSWITCH COMPLEXED WITH THE METABOLITE


JRNL
 TITL 3
HYPOXANTHINE


JRNL
 REF
NATURE         V. 432   411 2004


JRNL
 REFN
ASTM NATUAS UK ISSN 0028-0836









REMARK
1



REMARK
2


REMARK
2
 RESOLUTION. 1.95 ANGSTROMS.


REMARK
3










REMARK
3
 REFINEMENT.



REMARK
3
  PROGRAM
: CNS 1.1


REMARK
3
  AUTHORS
: BRUNGER, ADAMS, CLORE, DELANO, GROS, GROSSE-


REMARK
3

: KUNSTLEVE, JIANG, KUSZEWSKI, NILGES, PANNU,


REMARK
3

: READ, RICE, SIMONSON, WARREN


REMARK
3









REMARK
3
 REFINEMENT TARGET: ENGH & HUBER


REMARK
3


REMARK
3
 DATA USED IN REFINEMENT.











REMARK
3
  RESOLUTION RANGE HIGH
(ANGSTROMS)
: 1.95


REMARK
3
  RESOLUTION RANGE LOW
(ANGSTROMS)
: 19.41


REMARK
3
  DATA CUTOFF
(SIGMA(F))
: 0.000


REMARK
3
  DATA CUTOFF
HIGH (ABS(F))
: 1344227.320


REMARK
3
  DATA CUTOFF
LOW (ABS(F))
: 0.0000


REMARK
3
  COMPLETENESS (WORKING + TEST)
(%)
: 92.8


REMARK
3
  NUMBER OF REFLECTIONS

: 25786


REMARK
3









REMARK
3
 FIT TO DATA USED IN REFINEMENT.










REMARK
3
  CROSS-VALIDATION METHOD
: THROUGHOUT


REMARK
3
  FREE R VALUE TEST SET SELECTION
: RANDOM











REMARK
3
  R VALUE
(WORKING SET)
: 0.178










REMARK
3
  FREE R VALUE
: 0.228











REMARK
3
  FREE R VALUE TEST SET SIZE
(%)
: 9.400










REMARK
3
  FREE R VALUE TEST SET COUNT
: 2430


REMARK
3
  ESTIMATED ERROR OF FREE R VALUE
: 0.005


REMARK
3









REMARK
3
 FIT IN THE HIGHEST RESOLUTION BIN.










REMARK
3
  TOTAL NUMBER OF BINS USED
: 10











REMARK
3
  BIN RESOLUTION RANGE HIGH
(A)
: 1.95


REMARK
3
  BIN RESOLUTION RANGE LOW
(A)
: 2.02











REMARK
3
  BIN COMPLETENESS
(WORKING + TEST) (%)
: 85.20


REMARK
3
  REFLECTIONS IN BIN
(WORKING SET)
: 2105


REMARK
3
  BIN R VALUE
(WORKING SET)
: 0.2480


REMARK
3
  BIN FREE R VALUE

: 0.3180











REMARK
3
  BIN FREE R VALUE TEST SET SIZE
(%)
: 10.40










REMARK
3
  BIN FREE R VALUE TEST SET COUNT
: 245


REMARK
3
  ESTIMATED ERROR OF BIN FREE R VALUE
: 0.020









REMARK
3



REMARK
3
 NUMBER OF NON-HYDROGEN ATOMS USED IN REFINEMENT.










REMARK
3
  PROTEIN ATOMS
: 0


REMARK
3
  NUCLEIC ACID ATOMS
: 1426


REMARK
3
  HETEROGEN ATOMS
: 108


REMARK
3
  SOLVENT ATOMS
: 333


REMARK
3











REMARK
3
 B VALUES.




REMARK
3
  FROM WILSON PLOT
(A**2)
: 14.90


REMARK
3
  MEAN B VALUE
(OVERALL, A**2)
: 22.60









REMARK
3
  OVERALL ANISOTROPIC B VALUE.


REMARK
3
  B11 (A**2) : 0.26000


REMARK
3
  B22 (A**2) : −0.88000


REMARK
3
  B33 (A**2) : 0.62000


REMARK
3
  B12 (A**2) : 0.00000


REMARK
3
  B13 (A**2) : −0.48000


REMARK
3
  B23 (A**2) : 0.00000


REMARK
3


REMARK
3
 ESTIMATED COORDINATE ERROR.











REMARK
3
  ESD FROM LUZZATI PLOT
(A)
: 0.19


REMARK
3
  ESD FROM SIGMAA
(A)
: 0.14


REMARK
3
  LOW RESOLUTION CUTOFF
(A)
: 5.00


REMARK
3









REMARK
3
 CROSS-VALIDATED ESTIMATED COORDINATE ERROR.











REMARK
3
  ESD FROM C-V LUZZATI PLOT
(A)
: 0.25


REMARK
3
  ESD FROM C-V SIGMAA
(A)
: 0.23


REMARK
3









REMARK
3
 RMS DEVIATIONS FROM IDEAL VALUES.











REMARK
3
  BOND LENGTHS
(A)
: 0.009


REMARK
3
  BOND ANGLES
(DEGREES)
: 1.60


REMARK
3
  DIHEDRAL ANGLES
(DEGREES)
: 19.10


REMARK
3
  IMPROPER ANGLES
(DEGREES)
: 1.83









REMARK
3



REMARK
3
 ISOTROPIC THERMAL MODEL : RESTRAINED


REMARK
3











REMARK
3
 ISOTROPIC THERMAL FACTOR RESTRAINTS.
RMS
SIGMA












REMARK
3
  MAIN-CHAIN BOND
(A**2)
: 0.810
; 1.500


REMARK
3
  MAIN-CHAIN ANGLE
(A**2)
: 0.000
; 2.000


REMARK
3
  SIDE-CHAIN BOND
(A**2)
: 1.520
; 2.000


REMARK
3
  SIDE-CHAIN ANGLE
(A**2)
: 1.970
; 2.500


REMARK
3









REMARK
3
 BULK SOLVENT MODELING.










REMARK
3
  METHOD USED
: FLAT MODEL


REMARK
3
  KSOL
: 0.36


REMARK
3
  BSOL
: 48.44


REMARK
3









REMARK
3
 NCS MODEL: NULL


REMARK
3












REMARK
3
 NCS RESTRAINTS.

RMS
SIGMA/WEIGHT


REMARK
3
  GROUP 1 POSITIONAL
(A)
: NULL
; NULL


REMARK
3
  GROUP 1 B-FACTOR
(A**2)
: NULL
; NULL


REMARK
3










REMARK
3
 PARAMETER FILE 1
: DNA-RNA_REP.PARAM


REMARK
3
  PARAMETER FILE 2
: ION.PARAM


REMARK
3
  PARAMETER FILE 3
: HPA4.PARAM


REMARK
3
  PARAMETER FILE 4
: COHEX_REP.PARAM


REMARK
3
  PARAMETER FILE 5
: WATER.PARAM


REMARK
3
  PARAMETER FILE 6
: NULL


REMARK
3
  TOPOLOGY FILE 1
: DNA-RNA.TOP


REMARK
3
  TOPOLOGY FILE 2
: ION.TOP


REMARK
3
  TOPOLOGY FILE 3
: HPA4.TOP


REMARK
3
  TOPOLOGY FILE 4
: COHEX_REP.TOP


REMARK
3
  TOPOLOGY FILE 5
: WATER.TOP


REMARK
3
  TOPOLOGY FILE 6
: NULL


REMARK
3









REMARK
3
  OTHER REFINEMENT REMARKS: NULL


REMARK
4


REMARK
4
 1U8D COMPLIES WITH FORMAT V. 2.3, 09-JULY-1998


REMARK
100


REMARK
100
 THIS ENTRY HAS BEEN PROCESSED BY THE NUCLEIC ACID DATABASE


REMARK
100
 ON 13-AUG-2004.


REMARK
100
 THE NDB ID CODE IS UR0039.


REMARK
105


REMARK
105
 THE PROTEIN DATA BANK HAS ADOPTED THE SACCHARIDE CHEMISTS


REMARK
105
 NOMENCLATURE FOR ATOMS OF THE DEOXYRIBOSE/RIBOSE MOIETY


REMARK
105
 RATHER THAN THAT OF THE NUCLEOSIDE CHEMISTS. THE RING


REMARK
105
 OXYGEN ATOM IS LABELLED O4* INSTEAD OF O1*.


REMARK
200


REMARK
200
 EXPERIMENTAL DETAILS










REMARK
200
  EXPERIMENT TYPE
: X-RAY DIFFRACTION


REMARK
200
  DATE OF DATA COLLECTION
: NULL











REMARK
200
  TEMPERATURE
(KELVIN)
: 100.0


REMARK
200
  PH

: 7.50










REMARK
200
  NUMBER OF CRYSTALS USED
: 1


REMARK
200











REMARK
200
  SYNCHROTRON
(Y/N)
: N


REMARK
200
  RADIATION SOURCE

: ROTATING ANODE


REMARK
200
  BEAMLINE

: NULL


REMARK
200
  X-RAY GENERATOR MODEL

: RIGAKU


REMARK
200
  MONOCHROMATIC OR LAUE
(M/L)
: M


REMARK
200
  WAVELENGTH OR RANGE
(A)
: 1.5418










REMARK
200
  MONOCHROMATOR
: NI FILTER


REMARK
200
  OPTICS
: NULL


REMARK
200


REMARK
200
  DETECTOR TYPE
: NULL


REMARK
200
  DETECTOR MANUFACTURER
: NULL


REMARK
200
  INTENSITY-INTEGRATION SOFTWARE
: R-AXIS


REMARK
200
  DATA SCALING SOFTWARE
: R-AXIS


REMARK
200


REMARK
200
  NUMBER OF UNIQUE REFLECTIONS
: 28013











REMARK
200
  RESOLUTION RANGE HIGH
(A)
: 1.800


REMARK
200
  RESOLUTION RANGE LOW
(A)
: 20.000


REMARK
200
  REJECTION CRITERIA
(SIGMA(I))
: 3.000


REMARK
200











REMARK
200
 OVERALL.




REMARK
200
  COMPLETENESS FOR RANGE
(%)
: NULL


REMARK
200
  DATA REDUNDANCY

: 2.900


REMARK
200
  R MERGE
(I)
: 0.05500


REMARK
200
  R SYM
(I)
: 0.03700










REMARK
200
  <I/SIGMA(I)> FOR THE DATA SET
: 21.5000









REMARK
200



REMARK
200
 IN THE HIGHEST RESOLUTION SHELL.


REMARK
200
  HIGHEST RESOLUTION SHELL, RANGE HIGH (A): 1.80


REMARK
200
  HIGHEST RESOLUTION SHELL, RANGE LOW (A): 1.86











REMARK
200
  COMPLETENESS FOR SHELL
(%)
: 100.0


REMARK
200
  DATA REDUNDANCY IN SHELL

: NULL


REMARK
200
  R MERGE FOR SHELL
(I)
: NULL


REMARK
200
  R SYM FOR SHELL
(I)
: NULL


REMARK
200
  <I/SIGMA(I)> FOR SHELL

: NULL


REMARK
200









REMARK
200
 DIFFRACTION PROTOCOL: SINGLE WAVELENGTH


REMARK
200
 METHOD USED TO DETERMINE THE STRUCTURE: SAD


REMARK
200
 SOFTWARE USED: SOLVE


REMARK
200
 STARTING MODEL: NULL


REMARK
200


REMARK
200
 REMARK: NULL


REMARK
280


REMARK
280
 CRYSTAL


REMARK
280
 SOLVENT CONTENT, VS  (%): 46.00


REMARK
280
 MATTHEWS COEFFICIENT, VM (ANGSTROMS**3/DA): 2.29


REMARK
280


REMARK
280
 CRYSTALLIZATION CONDITIONS: 10 MM COBALT HEXAMMINE, 200 MM


REMARK
280
  AMMONIUM ACETATE, 25% PEG 2K, PH 7.5, VAPOR DIFFUSION, HANGING


REMARK
280
  DROP, TEMPERATURE 23 K


REMARK
290


REMARK
290
 CRYSTALLOGRAPHIC SYMMETRY


REMARK
290
 SYMMETRY OPERATORS FOR SPACE GROUP: C 1 2 1


REMARK
290











REMARK
290

SYMOP
SYMMETRY


REMARK
290

NNNMMM
OPERATOR


REMARK
290

1555
X, Y, Z


REMARK
290

2555
−X, Y, −Z


REMARK
290

3555
½ + X, ½ + Y, Z


REMARK
290

4555
½ − X, ½ + Y, −Z


REMARK
290











REMARK
290

WHERE
NNN -> OPERATOR NUMBER


REMARK
290


MMM -> TRANSLATION VECTOR


REMARK
290









REMARK
290
 CRYSTALLOGRAPHIC SYMMETRY TRANSFORMATIONS


REMARK
290
 THE FOLLOWING TRANSFORMATIONS OPERATE ON THE ATOM/HETATM


REMARK
290
 RECORDS IN THIS ENTRY TO PRODUCE CRYSTALLOGRAPHICALLY


REMARK
290
 RELATED MOLECULES.















REMARK
290
  SMTRY1
1
1.000000
0.000000
0.000000
0.00000



REMARK
290
  SMTRY2
1
0.000000
1.000000
0.000000
0.00000


REMARK
290
  SMTRY3
1
0.000000
0.000000
1.000000
0.00000


REMARK
290
  SMTRY1
2
−1.000000
0.000000
0.000000
0.00000


REMARK
290
  SMTRY2
2
0.000000
1.000000
0.000000
0.00000


REMARK
290
  SMTRY3
2
0.000000
0.000000
−1.000000
0.00000


REMARK
290
  SMTRY1
3
1.000000
0.000000
0.000000
66.14900


REMARK
290
  SMTRY2
3
0.000000
1.000000
0.000000
17.62500


REMARK
290
  SMTRY3
3
0.000000
0.000000
1.000000
0.00000


REMARK
290
  SMTRY1
4
−1.000000
0.000000
0.000000
66.14900


REMARK
290
  SMTRY2
4
0.000000
1.000000
0.000000
17.62500


REMARK
290
  SMTRY3
4
0.000000
0.000000
−1.000000
0.00000









REMARK
290



REMARK
290
 REMARK: NULL


REMARK
300


REMARK
300
 BIOMOLECULE: 1


REMARK
300
 THIS ENTRY CONTAINS THE CRYSTALLOGRAPHIC ASYMMETRIC UNIT


REMARK
300
 WHICH CONSISTS OF 1 CHAIN(S). SEE REMARK 350 FOR


REMARK
300
 INFORMATION ON GENERATING THE BIOLOGICAL MOLECULE(S).


REMARK
350


REMARK
350
 GENERATING THE BIOMOLECULE


REMARK
350
 COORDINATES FOR A COMPLETE MULTIMER REPRESENTING THE KNOWN


REMARK
350
 BIOLOGICALLY SIGNIFICANT OLIGOMERIZATION STATE OF THE


REMARK
350
 MOLECULE CAN BE GENERATED BY APPLYING BIOMT TRANSFORMATIONS


REMARK
350
 GIVEN BELOW. BOTH NON-CRYSTALLOGRAPHIC AND


REMARK
350
 CRYSTALLOGRAPHIC OPERATIONS ARE GIVEN.


REMARK
350


REMARK
350
 BIOMOLECULE: 1


REMARK
350
 APPLY THE FOLLOWING TO CHAINS: A















REMARK
350
  BIOMT1
1
1.000000
0.000000
0.000000
0.00000



REMARK
350
  BIOMT2
1
0.000000
1.000000
0.000000
0.00000


REMARK
350
  BIOMT3
1
0.000000
0.000000
1.000000
0.00000


REMARK
465









REMARK
465
 MISSING RESIDUES


REMARK
465
 THE FOLLOWING RESIDUES WERE NOT LOCATED IN THE


REMARK
465
 EXPERIMENT. (M = MODEL NUMBER; RES = RESIDUE NAME; C = CHAIN


REMARK
465
 IDENTIFIER; SSSEQ = SEQUENCE NUMBER; I = INSERTION CODE.)


REMARK
465










REMARK
465
M RES
C SSSEQI


REMARK
465
A
A 82


REMARK
500









REMARK
500
 GEOMETRY AND STEREOCHEMISTRY


REMARK
500
 SUBTOPIC: CLOSE CONTACTS


REMARK
500


REMARK
500
 THE FOLLOWING ATOMS THAT ARE RELATED BY CRYSTALLOGRAPHIC


REMARK
500
 SYMMETRY ARE IN CLOSE CONTACT. AN ATOM LOCATED WITHIN 0.15


REMARK
500
 ANGSTROMS OF A SYMMETRY RELATED ATOM IS ASSUMED TO BE ON A


REMARK
500
 SPECIAL POSITION AND IS, THEREFORE, LISTED IN REMARK 375


REMARK
500
 INSTEAD OF REMARK 500. ATOMS WITH NON-BLANK ALTERNATE


REMARK
500
 LOCATION INDICATORS ARE NOT INCLUDED IN THE CALCULATIONS.


REMARK
500


REMARK
500
 DISTANCE CUTOFF:


REMARK
500
 2.2 ANGSTROMS FOR CONTACTS NOT INVOLVING HYDROGEN ATOMS


REMARK
500
 1.6 ANGSTROMS FOR CONTACTS INVOLVING HYDROGEN ATOMS


REMARK
500


















REMARK
500
 ATM1
RES
C
SSEQI
ATM2
RES
C
SSEQI
SSYMOP
DISTANCE


REMARK
500
O1P
G
A
15
O1P
G
A
15
2556
1.81
























SEQRES
1
A
68
G
G
A
C
A
U
A
U
A
A
U
C
G



SEQRES
2
A
68
C
G
U
G
G
A
U
A
U
G
G
C
A


SEQRES
3
A
68
C
G
C
A
A
G
U
U
U
C
U
A
C


SEQRES
4
A
68
C
G
G
G
C
A
C
C
G
U
A
A
A


SEQRES
5
A
68
U
G
U
C
C
G
A
C
U
A
U
G
U


SEQRES
6
A
68
C
C
A













HET
SPD

95

10



HET
ACT

96

4


HET
NCO

101

7


HET
NCO

102

7


HET
NCO

103

7


HET
NCO

104

7


HET
NCO

105

7


HET
NCO

106

7


HET
NCO

107

7


HET
NCO

108

7


HET
NCO

109

7


HET
NCO

110

7


HET
NCO

111

7


HET
NCO

112

7


HET
HPA

90

10








HETNAM
SPD SPERMIDINE


HETNAM
ACT ACETATE ION


HETNAM
NCO COBALT HEXAMMINE ION


HETNAM
HPA HYPOXANTHINE


HETSYN
SPD N-(2-AMINO-PROPYL)-1,4-DIAMINOBUTANE; PA(34)










FORMUL
2
SPD
C7 H19 N3


FORMUL
3
ACT
C2 H3 O2 1−


FORMUL
4
NCO
12(H18 N6 CO1 3+)


FORMUL
16
HPA
C5 H4 N4 O1


FORMUL
17
HOH
*333(H2 O1)
















CRYST1
132.298
35.250
42.225
90.00
90.95
90.00
C 1 2 1
4













ORIGX1
1.000000
0.000000
0.000000
0.00000



ORIGX2
0.000000
1.000000
0.000000
0.00000


ORIGX3
0.000000
0.000000
1.000000
0.00000


SCALE1
0.007559
0.000000
0.000125
0.00000


SCALE2
0.000000
0.028369
0.000000
0.00000


SCALE3
0.000000
0.000000
0.023686
0.00000


















ATOM
1
O3P
G
A
15
2.257
5.451
23.138
1.00
79.41
O


ATOM
2
P
G
A
15
2.002
5.937
21.715
1.00
79.84
P


ATOM
3
O1P
G
A
15
0.530
5.848
21.317
1.00
79.33
O


ATOM
4
O2P
G
A
15
2.597
7.317
21.458
1.00
79.53
O


ATOM
5
O5*
G
A
15
2.809
4.906
20.739
1.00
77.83
O


ATOM
6
C5*
G
A
15
4.171
4.541
21.029
1.00
75.69
C


ATOM
7
C4*
G
A
15
4.238
3.119
21.543
1.00
73.85
C


ATOM
8
O4*
G
A
15
2.948
2.746
22.099
1.00
73.98
O


ATOM
9
C3*
G
A
15
4.556
2.024
20.538
1.00
72.87
C


ATOM
10
O3*
G
A
15
5.945
2.038
20.158
1.00
70.52
O


ATOM
11
C2*
G
A
15
3.933
0.791
21.200
1.00
73.00
C


ATOM
12
O2*
G
A
15
4.650
0.231
22.285
1.00
73.25
O


ATOM
13
C1*
G
A
15
2.648
1.397
21.766
1.00
72.98
C


ATOM
14
N9
G
A
15
1.581
1.433
20.767
1.00
72.54
N


ATOM
15
C8
G
A
15
0.950
2.560
20.284
1.00
72.23
C


ATOM
16
N7
G
A
15
0.045
2.296
19.382
1.00
71.52
N


ATOM
17
C5
G
A
15
0.072
0.912
19.261
1.00
71.53
C


ATOM
18
C6
G
A
15
−0.692
0.051
18.433
1.00
71.10
C


ATOM
19
O6
G
A
15
−1.579
0.355
17.620
1.00
71.14
O


ATOM
20
N1
G
A
15
−0.339
−1.284
18.619
1.00
70.53
N


ATOM
21
C2
G
A
15
0.626
−1.734
19.493
1.00
70.46
C


ATOM
22
N2
G
A
15
0.829
−3.059
19.517
1.00
69.85
N


ATOM
23
N3
G
A
15
1.341
−0.941
20.279
1.00
70.84
N


ATOM
24
C4
G
A
15
1.015
0.361
20.111
1.00
71.66
C


ATOM
25
P
G
A
16
6.984
0.956
20.748
1.00
68.91
P


ATOM
26
O1P
G
A
16
6.845
0.908
22.231
1.00
68.62
O


ATOM
27
O2P
G
A
16
8.306
1.282
20.155
1.00
68.38
O


ATOM
28
O5*
G
A
16
6.531
−0.448
20.141
1.00
66.18
O


ATOM
29
C5*
G
A
16
7.070
−1.646
20.684
1.00
62.29
C


ATOM
30
C4*
G
A
16
6.512
−2.866
19.993
1.00
59.49
C


ATOM
31
O4*
G
A
16
5.058
−2.824
20.037
1.00
58.98
O


ATOM
32
C3*
G
A
16
6.820
−2.981
18.513
1.00
57.81
C


ATOM
33
O3*
G
A
16
8.106
−3.541
18.297
1.00
56.43
O


ATOM
34
C2*
G
A
16
5.709
−3.902
18.025
1.00
57.51
C


ATOM
35
O2*
G
A
16
5.944
−5.257
18.332
1.00
56.72
O


ATOM
36
C1*
G
A
16
4.530
−3.420
18.863
1.00
57.12
C


ATOM
37
N9
G
A
16
3.712
−2.448
18.155
1.00
55.86
N


ATOM
38
C8
G
A
16
3.719
−1.080
18.289
1.00
55.10
C


ATOM
39
N7
G
A
16
2.851
−0.488
17.514
1.00
55.02
N


ATOM
40
C5
G
A
16
2.239
−1.529
16.825
1.00
54.19
C


ATOM
41
C6
G
A
16
1.206
−1.512
15.851
1.00
53.76
C


ATOM
42
O6
G
A
16
0.597
−0.540
15.389
1.00
52.60
O


ATOM
43
N1
G
A
16
0.894
−2.796
15.416
1.00
53.12
N


ATOM
44
C2
G
A
16
1.494
−3.945
15.859
1.00
53.54
C


ATOM
45
N2
G
A
16
1.070
−5.089
15.307
1.00
53.43
N


ATOM
46
N3
G
A
16
2.446
−3.975
16.775
1.00
53.95
N


ATOM
47
C4
G
A
16
2.767
−2.742
17.208
1.00
54.84
C


ATOM
48
P
A
A
17
8.823
−3.354
16.870
1.00
55.82
P


ATOM
49
O1P
A
A
17
10.149
−4.017
16.909
1.00
55.72
O


ATOM
50
O2P
A
A
17
8.728
−1.914
16.506
1.00
55.26
O


ATOM
51
O5*
A
A
17
7.903
−4.185
15.880
1.00
53.03
O


ATOM
52
C5*
A
A
17
7.817
−5.594
15.991
1.00
51.32
C


ATOM
53
C4*
A
A
17
7.075
−6.140
14.809
1.00
49.43
C


ATOM
54
O4*
A
A
17
5.666
−5.818
14.928
1.00
48.16
O


ATOM
55
C3*
A
A
17
7.494
−5.502
13.505
1.00
48.31
C


ATOM
56
O3*
A
A
17
8.659
−6.134
13.012
1.00
48.66
O


ATOM
57
C2*
A
A
17
6.266
−5.722
12.634
1.00
47.38
C


ATOM
58
O2*
A
A
17
6.231
−7.032
12.106
1.00
45.76
O


ATOM
59
C1*
A
A
17
5.138
−5.512
13.653
1.00
46.63
C


ATOM
60
N9
A
A
17
4.655
−4.134
13.716
1.00
45.43
N


ATOM
61
C8
A
A
17
5.126
−3.118
14.513
1.00
45.03
C


ATOM
62
N7
A
A
17
4.471
−1.988
14.375
1.00
45.06
N


ATOM
63
C5
A
A
17
3.505
−2.278
13.419
1.00
44.35
C


ATOM
64
C6
A
A
17
2.480
−1.507
12.837
1.00
43.36
C


ATOM
65
N6
A
A
17
2.232
−0.230
13.146
1.00
43.22
N


ATOM
66
N1
A
A
17
1.698
−2.105
11.915
1.00
43.31
N


ATOM
67
C2
A
A
17
1.930
−3.390
11.607
1.00
43.50
C


ATOM
68
N3
A
A
17
2.852
−4.218
12.085
1.00
44.12
N


ATOM
69
C4
A
A
17
3.616
−3.596
12.998
1.00
44.78
C


ATOM
70
P
C
A
18
9.428
−5.494
11.763
1.00
48.10
P


ATOM
71
O1P
C
A
18
10.761
−6.124
11.593
1.00
48.55
O


ATOM
72
O2P
C
A
18
9.319
−4.016
11.886
1.00
47.97
O


ATOM
73
O5*
C
A
18
8.513
−5.918
10.541
1.00
47.08
O


ATOM
74
C5*
C
A
18
8.419
−5.064
9.438
1.00
44.02
C


ATOM
75
C4*
C
A
18
7.134
−5.285
8.695
1.00
42.22
C


ATOM
76
O4*
C
A
18
5.989
−5.055
9.551
1.00
41.33
O


ATOM
77
C3*
C
A
18
6.973
−4.294
7.574
1.00
40.44
C


ATOM
78
O3*
C
A
18
7.752
−4.737
6.489
1.00
38.62
O


ATOM
79
C2*
C
A
18
5.468
−4.277
7.368
1.00
40.56
C


ATOM
80
O2*
C
A
18
5.017
−5.381
6.605
1.00
39.99
O


ATOM
81
C1*
C
A
18
4.986
−4.366
8.819
1.00
40.15
C


ATOM
82
N1
C
A
18
4.886
−3.032
9.427
1.00
39.26
N


ATOM
83
C2
C
A
18
3.854
−2.179
9.053
1.00
39.01
C


ATOM
84
O2
C
A
18
3.040
−2.556
8.196
1.00
39.82
O


ATOM
85
N3
C
A
18
3.765
−0.960
9.635
1.00
38.83
N


ATOM
86
C4
C
A
18
4.660
−0.585
10.547
1.00
37.75
C


ATOM
87
N4
C
A
18
4.524
0.621
11.092
1.00
36.78
N


ATOM
88
C5
C
A
18
5.730
−1.431
10.939
1.00
37.85
C


ATOM
89
C6
C
A
18
5.804
−2.636
10.360
1.00
39.12
C


ATOM
90
P
A
A
19
8.667
−3.688
5.725
1.00
36.72
P


ATOM
91
O1P
A
A
19
9.480
−4.427
4.735
1.00
37.50
O


ATOM
92
O2P
A
A
19
9.331
−2.852
6.746
1.00
36.38
O


ATOM
93
O5*
A
A
19
7.574
−2.817
4.973
1.00
35.32
O


ATOM
94
C5*
A
A
19
6.619
−3.455
4.146
1.00
32.02
C


ATOM
95
C4*
A
A
19
5.610
−2.459
3.661
1.00
30.54
C


ATOM
96
O4*
A
A
19
4.737
−2.066
4.755
1.00
30.35
O


ATOM
97
C3*
A
A
19
6.211
−1.148
3.184
1.00
30.44
C


ATOM
98
O3*
A
A
19
6.700
−1.256
1.853
1.00
28.92
O


ATOM
99
C2*
A
A
19
5.026
−0.208
3.307
1.00
29.76
C


ATOM
100
O2*
A
A
19
4.113
−0.427
2.254
1.00
30.88
O


ATOM
101
C1*
A
A
19
4.384
−0.698
4.606
1.00
29.58
C


ATOM
102
N9
A
A
19
4.840
0.046
5.783
1.00
27.74
N


ATOM
103
C8
A
A
19
5.802
−0.299
6.700
1.00
26.96
C


ATOM
104
N7
A
A
19
5.955
0.583
7.666
1.00
26.99
N


ATOM
105
C5
A
A
19
5.036
1.577
7.358
1.00
26.41
C


ATOM
106
C6
A
A
19
4.684
2.780
8.000
1.00
25.24
C


ATOM
107
N6
A
A
19
5.209
3.195
9.153
1.00
25.00
N


ATOM
108
N1
A
A
19
3.739
3.547
7.415
1.00
25.67
N


ATOM
109
C2
A
A
19
3.176
3.116
6.284
1.00
24.91
C


ATOM
110
N3
A
A
19
3.401
1.998
5.602
1.00
25.13
N


ATOM
111
C4
A
A
19
4.353
1.265
6.194
1.00
26.86
C


ATOM
112
P
U
A
20
7.901
−0.307
1.370
1.00
27.68
P


ATOM
113
O1P
U
A
20
8.299
−0.722
0.006
1.00
29.66
O


ATOM
114
O2P
U
A
20
8.908
−0.246
2.444
1.00
27.22
O


ATOM
115
O5*
U
A
20
7.230
1.122
1.256
1.00
25.69
O


ATOM
116
C5*
U
A
20
6.245
1.373
0.283
1.00
22.92
C


ATOM
117
C4*
U
A
20
5.719
2.780
0.423
1.00
21.41
C


ATOM
118
O4*
U
A
20
5.042
2.907
1.710
1.00
20.04
O


ATOM
119
C3*
U
A
20
6.777
3.874
0.487
1.00
19.50
C


ATOM
120
O3*
U
A
20
7.259
4.265
−0.783
1.00
20.05
O


ATOM
121
C2*
U
A
20
6.027
5.000
1.173
1.00
18.58
C


ATOM
122
O2*
U
A
20
5.109
5.617
0.301
1.00
17.80
O


ATOM
123
C1*
U
A
20
5.220
4.231
2.212
1.00
19.20
C


ATOM
124
N1
U
A
20
5.900
4.191
3.517
1.00
18.08
N


ATOM
125
C2
U
A
20
5.668
5.253
4.365
1.00
18.46
C


ATOM
126
O2
U
A
20
4.951
6.205
4.059
1.00
15.17
O


ATOM
127
N3
U
A
20
6.303
5.174
5.580
1.00
16.68
N


ATOM
128
C4
U
A
20
7.127
4.167
6.028
1.00
18.60
C


ATOM
129
O4
U
A
20
7.615
4.248
7.167
1.00
15.59
O


ATOM
130
C5
U
A
20
7.333
3.101
5.087
1.00
17.76
C


ATOM
131
C6
U
A
20
6.725
3.154
3.888
1.00
17.60
C


ATOM
132
P
A
A
21
8.804
4.707
−0.950
1.00
21.07
P


ATOM
133
O1P
A
A
21
8.958
5.041
−2.388
1.00
19.56
O


ATOM
134
O2P
A
A
21
9.726
3.742
−0.339
1.00
17.87
O


ATOM
135
O5*
A
A
21
8.914
6.057
−0.104
1.00
18.44
O


ATOM
136
C5*
A
A
21
8.311
7.240
−0.587
1.00
16.49
C


ATOM
137
C4*
A
A
21
8.458
8.398
0.384
1.00
15.52
C


ATOM
138
O4*
A
A
21
7.781
8.097
1.635
1.00
16.13
O


ATOM
139
C3*
A
A
21
9.885
8.660
0.849
1.00
16.12
C


ATOM
140
O3*
A
A
21
10.578
9.415
−0.119
1.00
15.84
O


ATOM
141
C2*
A
A
21
9.670
9.462
2.123
1.00
14.81
C


ATOM
142
O2*
A
A
21
9.390
10.824
1.853
1.00
16.92
O


ATOM
143
C1*
A
A
21
8.420
8.794
2.696
1.00
16.03
C


ATOM
144
N9
A
A
21
8.745
7.898
3.805
1.00
14.49
N


ATOM
145
C8
A
A
21
9.200
6.611
3.807
1.00
14.35
C


ATOM
146
N7
A
A
21
9.500
6.161
5.002
1.00
14.14
N


ATOM
147
C5
A
A
21
9.192
7.227
5.846
1.00
13.55
C


ATOM
148
C6
A
A
21
9.288
7.400
7.255
1.00
13.46
C


ATOM
149
N6
A
A
21
9.760
6.467
8.083
1.00
13.17
N


ATOM
150
N1
A
A
21
8.881
8.588
7.778
1.00
14.41
N


ATOM
151
C2
A
A
21
8.419
9.532
6.935
1.00
14.69
C


ATOM
152
N3
A
A
21
8.294
9.483
5.600
1.00
16.23
N


ATOM
153
C4
A
A
21
8.702
8.288
5.122
1.00
13.95
C


ATOM
154
P
U
A
22
11.833
8.778
−0.860
1.00
17.19
P


ATOM
155
O1P
U
A
22
12.143
9.766
−1.930
1.00
14.62
O


ATOM
156
O2P
U
A
22
11.572
7.360
−1.223
1.00
16.11
O


ATOM
157
O5*
U
A
22
12.955
8.795
0.274
1.00
17.34
O


ATOM
158
C5*
U
A
22
14.086
7.879
0.260
1.00
15.80
C


ATOM
159
C4*
U
A
22
14.668
7.761
1.654
1.00
13.76
C


ATOM
160
O4*
U
A
22
14.980
9.104
2.130
1.00
13.53
O


ATOM
161
C3*
U
A
22
13.575
7.229
2.579
1.00
14.34
C


ATOM
162
O3*
U
A
22
14.069
6.400
3.635
1.00
16.58
O


ATOM
163
C2*
U
A
22
12.950
8.479
3.204
1.00
14.31
C


ATOM
164
O2*
U
A
22
12.419
8.301
4.510
1.00
12.30
O


ATOM
165
C1*
U
A
22
14.140
9.444
3.207
1.00
13.55
C


ATOM
166
N1
U
A
22
13.832
10.876
3.147
1.00
14.29
N


ATOM
167
C2
U
A
22
14.055
11.619
4.286
1.00
13.16
C


ATOM
168
O2
U
A
22
14.534
11.153
5.327
1.00
15.12
O


ATOM
169
N3
U
A
22
13.706
12.925
4.186
1.00
13.49
N


ATOM
170
C4
U
A
22
13.172
13.569
3.099
1.00
13.63
C


ATOM
171
O4
U
A
22
12.789
14.728
3.228
1.00
12.03
O


ATOM
172
C5
U
A
22
12.990
12.747
1.948
1.00
12.63
C


ATOM
173
C6
U
A
22
13.320
11.453
2.009
1.00
13.42
C


ATOM
174
P
A
A
23
14.502
4.869
3.397
1.00
16.90
P


ATOM
175
O1P
A
A
23
13.773
4.291
2.214
1.00
17.04
O


ATOM
176
O2P
A
A
23
14.351
4.224
4.726
1.00
16.03
O


ATOM
177
O5*
A
A
23
16.057
4.949
3.092
1.00
16.99
O


ATOM
178
C5*
A
A
23
16.937
5.684
3.940
1.00
16.50
C


ATOM
179
C4*
A
A
23
18.220
5.970
3.200
1.00
16.03
C


ATOM
180
O4*
A
A
23
17.896
6.559
1.914
1.00
15.15
O


ATOM
181
C3*
A
A
23
19.123
7.000
3.865
1.00
15.88
C


ATOM
182
O3*
A
A
23
20.007
6.350
4.779
1.00
16.04
O


ATOM
183
C2*
A
A
23
19.932
7.535
2.689
1.00
14.37
C


ATOM
184
O2*
A
A
23
21.038
6.689
2.437
1.00
14.07
O


ATOM
185
C1*
A
A
23
18.924
7.447
1.539
1.00
15.13
C


ATOM
186
N9
A
A
23
18.313
8.717
1.157
1.00
14.71
N


ATOM
187
C8
A
A
23
18.027
9.134
−0.113
1.00
14.98
C


ATOM
188
N7
A
A
23
17.513
10.334
−0.171
1.00
15.52
N


ATOM
189
C5
A
A
23
17.440
10.727
1.159
1.00
14.12
C


ATOM
190
C6
A
A
23
16.984
11.892
1.762
1.00
13.27
C


ATOM
191
N6
A
A
23
16.517
12.944
1.074
1.00
13.02
N


ATOM
192
N1
A
A
23
17.029
11.962
3.109
1.00
13.36
N


ATOM
193
C2
A
A
23
17.522
10.921
3.791
1.00
12.32
C


ATOM
194
N3
A
A
23
17.996
9.768
3.328
1.00
12.80
N


ATOM
195
C4
A
A
23
17.924
9.737
1.988
1.00
13.37
C


ATOM
196
P
A
A
24
19.792
6.521
6.367
1.00
16.08
P


ATOM
197
O1P
A
A
24
20.740
5.538
7.026
1.00
18.75
O


ATOM
198
O2P
A
A
24
18.370
6.463
6.689
1.00
17.24
O


ATOM
199
O5*
A
A
24
20.280
8.006
6.656
1.00
14.02
O


ATOM
200
C5*
A
A
24
19.497
9.123
6.288
1.00
12.57
C


ATOM
201
C4*
A
A
24
19.315
10.041
7.475
1.00
13.24
C


ATOM
202
O4*
A
A
24
18.369
9.426
8.402
1.00
14.03
O


ATOM
203
C3*
A
A
24
20.554
10.296
8.339
1.00
12.98
C


ATOM
204
O3*
A
A
24
21.369
11.351
7.801
1.00
13.11
O


ATOM
205
C2*
A
A
24
19.907
10.750
9.647
1.00
13.47
C


ATOM
206
O2*
A
A
24
19.453
12.099
9.622
1.00
11.96
O


ATOM
207
C1*
A
A
24
18.667
9.851
9.720
1.00
13.92
C


ATOM
208
N9
A
A
24
18.806
8.671
10.584
1.00
14.61
N


ATOM
209
C8
A
A
24
18.841
7.342
10.230
1.00
14.46
C


ATOM
210
N7
A
A
24
18.885
6.526
11.250
1.00
15.41
N


ATOM
211
C5
A
A
24
18.905
7.372
12.348
1.00
13.99
C


ATOM
212
C6
A
A
24
18.939
7.129
13.722
1.00
13.84
C


ATOM
213
N6
A
A
24
18.928
5.899
14.267
1.00
13.39
N


ATOM
214
N1
A
A
24
18.969
8.201
14.543
1.00
11.61
N


ATOM
215
C2
A
A
24
18.945
9.429
14.008
1.00
11.99
C


ATOM
216
N3
A
A
24
18.904
9.784
12.740
1.00
14.75
N


ATOM
217
C4
A
A
24
18.885
8.694
11.949
1.00
12.94
C


ATOM
218
P
U
A
25
22.931
11.117
7.493
1.00
14.31
P


ATOM
219
O1P
U
A
25
23.384
9.774
7.967
1.00
13.12
O


ATOM
220
O2P
U
A
25
23.680
12.330
7.933
1.00
15.11
O


ATOM
221
O5*
U
A
25
22.966
11.182
5.904
1.00
12.37
O


ATOM
222
C5*
U
A
25
22.889
9.987
5.117
1.00
15.75
C


ATOM
223
C4*
U
A
25
22.394
10.314
3.715
1.00
14.54
C


ATOM
224
O4*
U
A
25
21.019
10.786
3.786
1.00
14.76
O


ATOM
225
C3*
U
A
25
23.144
11.444
3.031
1.00
15.39
C


ATOM
226
O3*
U
A
25
24.331
10.949
2.403
1.00
15.75
O


ATOM
227
C2*
U
A
25
22.114
11.959
2.027
1.00
15.07
C


ATOM
228
O2*
U
A
25
22.016
11.123
0.889
1.00
14.04
O


ATOM
229
C1*
U
A
25
20.817
11.819
2.831
1.00
14.18
C


ATOM
230
N1
U
A
25
20.412
13.042
3.549
1.00
10.89
N


ATOM
231
C2
U
A
25
19.972
14.094
2.797
1.00
12.58
C


ATOM
232
O2
U
A
25
19.917
14.051
1.569
1.00
14.81
O


ATOM
233
N3
U
A
25
19.592
15.206
3.514
1.00
12.69
N


ATOM
234
C4
U
A
25
19.611
15.359
4.880
1.00
14.17
C


ATOM
235
O4
U
A
25
19.271
16.439
5.374
1.00
13.94
O


ATOM
236
C5
U
A
25
20.084
14.211
5.598
1.00
13.87
C


ATOM
237
C6
U
A
25
20.460
13.117
4.917
1.00
13.05
C


ATOM
238
P
C
A
26
25.693
11.790
2.524
1.00
17.71
P


ATOM
239
O1P
C
A
26
26.754
11.081
1.752
1.00
17.41
O


ATOM
240
O2P
C
A
26
25.922
12.133
3.943
1.00
15.48
O


ATOM
241
O5*
C
A
26
25.352
13.145
1.775
1.00
15.22
O


ATOM
242
C5*
C
A
26
25.046
13.137
0.391
1.00
14.95
C


ATOM
243
C4*
C
A
26
24.479
14.477
−0.021
1.00
17.14
C


ATOM
244
O4*
C
A
26
23.155
14.681
0.542
1.00
15.78
O


ATOM
245
C3*
C
A
26
25.258
15.671
0.494
1.00
18.08
C


ATOM
246
O3*
C
A
26
26.425
15.908
−0.282
1.00
19.26
O


ATOM
247
C2*
C
A
26
24.233
16.785
0.350
1.00
17.99
C


ATOM
248
O2*
C
A
26
24.097
17.196
−0.990
1.00
17.37
O


ATOM
249
C1*
C
A
26
22.960
16.069
0.819
1.00
17.28
C


ATOM
250
N1
C
A
26
22.770
16.251
2.269
1.00
17.50
N


ATOM
251
C2
C
A
26
22.207
17.449
2.714
1.00
16.12
C


ATOM
252
O2
C
A
26
21.837
18.289
1.868
1.00
17.31
O


ATOM
253
N3
C
A
26
22.088
17.671
4.038
1.00
17.53
N


ATOM
254
C4
C
A
26
22.516
16.762
4.915
1.00
18.28
C


ATOM
255
N4
C
A
26
22.401
17.049
6.222
1.00
17.84
N


ATOM
256
C5
C
A
26
23.082
15.519
4.494
1.00
18.25
C


ATOM
257
C6
C
A
26
23.174
15.301
3.169
1.00
17.66
C


ATOM
258
P
G
A
27
27.781
16.387
0.456
1.00
19.54
P


ATOM
259
O1P
G
A
27
28.835
15.402
0.093
1.00
19.84
O


ATOM
260
O2P
G
A
27
27.508
16.662
1.882
1.00
21.05
O


ATOM
261
O5*
G
A
27
28.072
17.770
−0.238
1.00
16.72
O


ATOM
262
C5*
G
A
27
28.214
17.833
−1.641
1.00
18.44
C


ATOM
263
C4*
G
A
27
27.858
19.222
−2.122
1.00
17.66
C


ATOM
264
O4*
G
A
27
26.419
19.395
−1.976
1.00
18.44
O


ATOM
265
C3*
G
A
27
28.429
20.382
−1.312
1.00
16.35
C


ATOM
266
O3*
G
A
27
29.799
20.676
−1.631
1.00
12.94
O


ATOM
267
C2*
G
A
27
27.483
21.494
−1.746
1.00
16.92
C


ATOM
268
O2*
G
A
27
27.805
21.847
−3.096
1.00
18.43
O


ATOM
269
C1*
G
A
27
26.130
20.767
−1.672
1.00
19.53
C


ATOM
270
N9
G
A
27
25.521
20.820
−0.331
1.00
17.32
N


ATOM
271
C8
G
A
27
25.623
19.857
0.640
1.00
18.25
C


ATOM
272
N7
G
A
27
25.058
20.199
1.770
1.00
16.05
N


ATOM
273
C5
G
A
27
24.519
21.457
1.529
1.00
15.29
C


ATOM
274
C6
G
A
27
23.826
22.345
2.410
1.00
15.83
C


ATOM
275
O6
G
A
27
23.506
22.160
3.589
1.00
13.24
O


ATOM
276
N1
G
A
27
23.515
23.548
1.791
1.00
13.74
N


ATOM
277
C2
G
A
27
23.815
23.862
0.493
1.00
15.62
C


ATOM
278
N2
G
A
27
23.476
25.116
0.104
1.00
13.99
N


ATOM
279
N3
G
A
27
24.422
23.032
−0.352
1.00
15.80
N


ATOM
280
C4
G
A
27
24.768
21.859
0.238
1.00
17.14
C


ATOM
281
P
C
A
28
30.774
21.297
−0.490
1.00
15.32
P


ATOM
282
O1P
C
A
28
32.151
21.289
−1.085
1.00
13.28
O


ATOM
283
O2P
C
A
28
30.549
20.564
0.797
1.00
16.81
O


ATOM
284
O5*
C
A
28
30.216
22.799
−0.308
1.00
14.13
O


ATOM
285
C5*
C
A
28
29.979
23.621
−1.457
1.00
12.20
C


ATOM
286
C4*
C
A
28
29.208
24.893
−1.109
1.00
11.20
C


ATOM
287
O4*
C
A
28
27.842
24.579
−0.760
1.00
13.22
O


ATOM
288
C3*
C
A
28
29.693
25.749
0.056
1.00
11.65
C


ATOM
289
O3*
C
A
28
30.795
26.536
−0.369
1.00
12.61
O


ATOM
290
C2*
C
A
28
28.443
26.579
0.321
1.00
12.57
C


ATOM
291
O2*
C
A
28
28.217
27.545
−0.701
1.00
11.37
O


ATOM
292
C1*
C
A
28
27.367
25.500
0.202
1.00
11.49
C


ATOM
293
N1
C
A
28
27.155
24.774
1.465
1.00
11.12
N


ATOM
294
C2
C
A
28
26.362
25.365
2.449
1.00
10.90
C


ATOM
295
O2
C
A
28
25.841
26.462
2.217
1.00
12.61
O


ATOM
296
N3
C
A
28
26.165
24.721
3.633
1.00
9.42
N


ATOM
297
C4
C
A
28
26.703
23.521
3.841
1.00
9.67
C


ATOM
298
N4
C
A
28
26.467
22.940
5.029
1.00
11.69
N


ATOM
299
C5
C
A
28
27.503
22.869
2.845
1.00
10.98
C


ATOM
300
C6
C
A
28
27.715
23.537
1.681
1.00
9.28
C


ATOM
301
P
G
A
29
31.848
27.078
0.701
1.00
12.54
P


ATOM
302
O1P
G
A
29
33.001
27.676
−0.012
1.00
10.32
O


ATOM
303
O2P
G
A
29
32.074
26.030
1.739
1.00
13.92
O


ATOM
304
O5*
G
A
29
31.048
28.236
1.453
1.00
10.81
O


ATOM
305
C5*
G
A
29
30.651
29.422
0.784
1.00
9.98
C


ATOM
306
C4*
G
A
29
29.953
30.336
1.761
1.00
8.40
C


ATOM
307
O4*
G
A
29
28.693
29.737
2.169
1.00
11.49
O


ATOM
308
C3*
G
A
29
30.712
30.532
3.055
1.00
8.64
C


ATOM
309
O3*
G
A
29
31.663
31.571
2.913
1.00
7.56
O


ATOM
310
C2*
G
A
29
29.598
30.920
4.004
1.00
10.60
C


ATOM
311
O2*
G
A
29
29.216
32.260
3.812
1.00
11.11
O


ATOM
312
C1*
G
A
29
28.480
29.976
3.555
1.00
11.07
C


ATOM
313
N9
G
A
29
28.524
28.699
4.271
1.00
11.15
N


ATOM
314
C8
G
A
29
29.049
27.500
3.837
1.00
12.35
C


ATOM
315
N7
G
A
29
28.902
26.525
4.710
1.00
10.34
N


ATOM
316
C5
G
A
29
28.251
27.122
5.781
1.00
8.19
C


ATOM
317
C6
G
A
29
27.837
26.583
7.032
1.00
7.73
C


ATOM
318
O6
G
A
29
27.914
25.425
7.442
1.00
6.94
O


ATOM
319
N1
G
A
29
27.287
27.558
7.854
1.00
7.67
N


ATOM
320
C2
G
A
29
27.125
28.881
7.518
1.00
8.16
C


ATOM
321
N2
G
A
29
26.558
29.664
8.459
1.00
7.86
N


ATOM
322
N3
G
A
29
27.484
29.396
6.356
1.00
8.66
N


ATOM
323
C4
G
A
29
28.038
28.469
5.540
1.00
10.04
C


ATOM
324
P
U
A
30
33.134
31.407
3.520
1.00
8.25
P


ATOM
325
O1P
U
A
30
33.970
32.458
2.893
1.00
11.63
O


ATOM
326
O2P
U
A
30
33.566
29.974
3.446
1.00
11.64
O


ATOM
327
O5*
U
A
30
32.909
31.761
5.054
1.00
8.57
O


ATOM
328
C5*
U
A
30
32.803
33.103
5.491
1.00
7.65
C


ATOM
329
C4*
U
A
30
32.354
33.142
6.956
1.00
7.93
C


ATOM
330
O4*
U
A
30
31.051
32.507
7.112
1.00
7.66
O


ATOM
331
C3*
U
A
30
33.220
32.376
7.932
1.00
7.36
C


ATOM
332
O3*
U
A
30
34.411
33.098
8.229
1.00
8.48
O


ATOM
333
C2*
U
A
30
32.276
32.271
9.116
1.00
8.63
C


ATOM
334
O2*
U
A
30
32.159
33.539
9.769
1.00
8.93
O


ATOM
335
C1*
U
A
30
30.972
31.878
8.399
1.00
9.06
C


ATOM
336
N1
U
A
30
30.909
30.406
8.210
1.00
8.03
N


ATOM
337
C2
U
A
30
30.479
29.640
9.285
1.00
9.41
C


ATOM
338
O2
U
A
30
30.140
30.142
10.342
1.00
9.53
O


ATOM
339
N3
U
A
30
30.484
28.277
9.076
1.00
8.15
N


ATOM
340
C4
U
A
30
30.879
27.615
7.918
1.00
7.87
C


ATOM
341
O4
U
A
30
30.916
26.387
7.900
1.00
8.06
O


ATOM
342
C5
U
A
30
31.293
28.471
6.852
1.00
6.19
C


ATOM
343
C6
U
A
30
31.293
29.810
7.035
1.00
9.31
C


ATOM
344
P
G
A
31
35.776
32.309
8.547
1.00
11.17
P


ATOM
345
O1P
G
A
31
36.843
33.324
8.585
1.00
11.89
O


ATOM
346
O2P
G
A
31
35.907
31.144
7.632
1.00
14.22
O


ATOM
347
O5*
G
A
31
35.546
31.762
10.018
1.00
12.60
O


ATOM
348
C5*
G
A
31
35.387
32.670
11.107
1.00
11.40
C


ATOM
349
C4*
G
A
31
34.796
31.951
12.296
1.00
11.05
C


ATOM
350
O4*
G
A
31
33.529
31.364
11.898
1.00
11.06
O


ATOM
351
C3*
G
A
31
35.585
30.763
12.826
1.00
11.89
C


ATOM
352
O3*
G
A
31
36.673
31.190
13.649
1.00
11.52
O


ATOM
353
C2*
G
A
31
34.499
30.057
13.613
1.00
12.14
C


ATOM
354
O2*
G
A
31
34.173
30.764
14.789
1.00
14.85
O


ATOM
355
C1*
G
A
31
33.321
30.161
12.626
1.00
12.01
C


ATOM
356
N9
G
A
31
33.369
29.020
11.714
1.00
9.31
N


ATOM
357
C8
G
A
31
33.747
28.982
10.391
1.00
8.70
C


ATOM
358
N7
G
A
31
33.733
27.763
9.889
1.00
9.68
N


ATOM
359
C5
G
A
31
33.312
26.959
10.944
1.00
7.23
C


ATOM
360
C6
G
A
31
33.144
25.544
11.029
1.00
7.98
C


ATOM
361
O6
G
A
31
33.310
24.683
10.140
1.00
5.42
O


ATOM
362
N1
G
A
31
32.739
25.152
12.313
1.00
6.14
N


ATOM
363
C2
G
A
31
32.548
25.997
13.371
1.00
10.35
C


ATOM
364
N2
G
A
31
32.176
25.420
14.577
1.00
7.45
N


ATOM
365
N3
G
A
31
32.699
27.315
13.297
1.00
9.43
N


ATOM
366
C4
G
A
31
33.073
27.720
12.071
1.00
9.46
C


ATOM
367
P
G
A
32
37.923
30.221
13.919
1.00
12.74
P


ATOM
368
O1P
G
A
32
38.781
31.042
14.790
1.00
14.11
O


ATOM
369
O2P
G
A
32
38.517
29.617
12.692
1.00
12.26
O


ATOM
370
O5*
G
A
32
37.356
28.990
14.770
1.00
12.62
O


ATOM
371
C5*
G
A
32
36.720
29.166
16.023
1.00
10.29
C


ATOM
372
C4*
G
A
32
36.261
27.823
16.550
1.00
10.54
C


ATOM
373
O4*
G
A
32
35.214
27.266
15.693
1.00
12.36
O


ATOM
374
C3*
G
A
32
37.339
26.771
16.499
1.00
12.35
C


ATOM
375
O3*
G
A
32
38.203
26.883
17.614
1.00
14.68
O


ATOM
376
C2*
G
A
32
36.538
25.487
16.525
1.00
11.24
C


ATOM
377
O2*
G
A
32
35.994
25.214
17.807
1.00
13.27
O


ATOM
378
C1*
G
A
32
35.393
25.857
15.578
1.00
10.96
C


ATOM
379
N9
G
A
32
35.730
25.540
14.191
1.00
9.74
N


ATOM
380
C8
G
A
32
36.070
26.428
13.200
1.00
9.21
C


ATOM
381
N7
G
A
32
36.330
25.848
12.058
1.00
9.05
N


ATOM
382
C5
G
A
32
36.132
24.497
12.304
1.00
10.24
C


ATOM
383
C6
G
A
32
36.241
23.372
11.427
1.00
10.39
C


ATOM
384
O6
G
A
32
36.572
23.356
10.238
1.00
11.57
O


ATOM
385
N1
G
A
32
35.943
22.178
12.079
1.00
8.19
N


ATOM
386
C2
G
A
32
35.627
22.076
13.413
1.00
9.95
C


ATOM
387
N2
G
A
32
35.463
20.831
13.865
1.00
8.24
N


ATOM
388
N3
G
A
32
35.508
23.119
14.242
1.00
9.53
N


ATOM
389
C4
G
A
32
35.776
24.287
13.623
1.00
10.32
C


ATOM
390
P
A
A
33
39.686
26.342
17.495
1.00
15.61
P


ATOM
391
O1P
A
A
33
40.371
26.864
18.691
1.00
19.56
O


ATOM
392
O2P
A
A
33
40.255
26.557
16.151
1.00
17.09
O


ATOM
393
O5*
A
A
33
39.561
24.752
17.619
1.00
14.10
O


ATOM
394
C5*
A
A
33
39.033
24.156
18.788
1.00
14.10
C


ATOM
395
C4*
A
A
33
38.951
22.650
18.615
1.00
13.63
C


ATOM
396
O4*
A
A
33
37.996
22.286
17.569
1.00
13.30
O


ATOM
397
C3*
A
A
33
40.222
21.955
18.171
1.00
13.78
C


ATOM
398
O3*
A
A
33
41.120
21.848
19.264
1.00
13.31
O


ATOM
399
C2*
A
A
33
39.672
20.607
17.712
1.00
12.75
C


ATOM
400
O2*
A
A
33
39.299
19.776
18.802
1.00
12.98
O


ATOM
401
C1*
A
A
33
38.400
21.050
16.987
1.00
12.61
C


ATOM
402
N9
A
A
33
38.609
21.236
15.550
1.00
12.88
N


ATOM
403
C8
A
A
33
38.628
22.381
14.794
1.00
11.71
C


ATOM
404
N7
A
A
33
38.816
22.157
13.502
1.00
10.33
N


ATOM
405
C5
A
A
33
38.927
20.782
13.415
1.00
9.80
C


ATOM
406
C6
A
A
33
39.116
19.908
12.324
1.00
10.29
C


ATOM
407
N6
A
A
33
39.234
20.309
11.050
1.00
9.60
N


ATOM
408
N1
A
A
33
39.178
18.588
12.588
1.00
9.49
N


ATOM
409
C2
A
A
33
39.048
18.179
13.852
1.00
9.84
C


ATOM
410
N3
A
A
33
38.857
18.900
14.964
1.00
11.60
N


ATOM
411
C4
A
A
33
38.811
20.204
14.669
1.00
11.27
C


ATOM
412
P
U
A
34
42.687
21.969
19.023
1.00
15.81
P


ATOM
413
O1P
U
A
34
43.252
22.026
20.387
1.00
17.18
O


ATOM
414
O2P
U
A
34
42.941
23.110
18.071
1.00
16.71
O


ATOM
415
O5*
U
A
34
43.104
20.594
18.319
1.00
15.71
O


ATOM
416
C5*
U
A
34
42.712
19.342
18.883
1.00
13.18
C


ATOM
417
C4*
U
A
34
42.857
18.181
17.894
1.00
13.06
C


ATOM
418
O4*
U
A
34
42.094
18.430
16.686
1.00
12.24
O


ATOM
419
C3*
U
A
34
44.300
18.001
17.421
1.00
13.49
C


ATOM
420
O3*
U
A
34
44.642
16.633
17.345
1.00
14.82
O


ATOM
421
C2*
U
A
34
44.381
18.646
16.051
1.00
13.02
C


ATOM
422
O2*
U
A
34
45.215
17.992
15.121
1.00
13.60
O


ATOM
423
C1*
U
A
34
42.953
18.456
15.570
1.00
12.35
C


ATOM
424
N1
U
A
34
42.536
19.469
14.602
1.00
12.11
N


ATOM
425
C2
U
A
34
42.498
19.079
13.286
1.00
11.31
C


ATOM
426
O2
U
A
34
42.658
17.918
12.930
1.00
12.26
O


ATOM
427
N3
U
A
34
42.267
20.088
12.393
1.00
12.69
N


ATOM
428
C4
U
A
34
42.050
21.410
12.687
1.00
11.38
C


ATOM
429
O4
U
A
34
41.968
22.223
11.766
1.00
12.81
O


ATOM
430
C5
U
A
34
42.032
21.717
14.082
1.00
11.68
C


ATOM
431
C6
U
A
34
42.270
20.742
14.979
1.00
11.42
C


ATOM
432
P
A
A
35
45.682
15.997
18.379
1.00
15.65
P


ATOM
433
O1P
A
A
35
45.216
16.428
19.709
1.00
14.33
O


ATOM
434
O2P
A
A
35
47.048
16.295
17.943
1.00
15.44
O


ATOM
435
O5*
A
A
35
45.445
14.439
18.178
1.00
15.24
O


ATOM
436
C5*
A
A
35
44.211
13.803
18.550
1.00
13.53
C


ATOM
437
C4*
A
A
35
44.300
12.292
18.297
1.00
12.70
C


ATOM
438
O4*
A
A
35
44.299
12.032
16.878
1.00
13.93
O


ATOM
439
C3*
A
A
35
45.610
11.724
18.820
1.00
13.12
C


ATOM
440
O3*
A
A
35
45.420
10.462
19.457
1.00
14.06
O


ATOM
441
C2*
A
A
35
46.543
11.684
17.607
1.00
13.02
C


ATOM
442
O2*
A
A
35
47.425
10.590
17.583
1.00
12.53
O


ATOM
443
C1*
A
A
35
45.544
11.534
16.454
1.00
12.55
C


ATOM
444
N9
A
A
35
45.906
12.201
15.209
1.00
12.31
N


ATOM
445
C8
A
A
35
45.633
13.489
14.841
1.00
13.01
C


ATOM
446
N7
A
A
35
46.117
13.819
13.666
1.00
16.12
N


ATOM
447
C5
A
A
35
46.737
12.661
13.222
1.00
15.27
C


ATOM
448
C6
A
A
35
47.416
12.343
12.024
1.00
15.95
C


ATOM
449
N6
A
A
35
47.607
13.213
11.010
1.00
14.94
N


ATOM
450
N1
A
A
35
47.895
11.084
11.900
1.00
16.37
N


ATOM
451
C2
A
A
35
47.706
10.216
12.909
1.00
17.57
C


ATOM
452
N3
A
A
35
47.086
10.398
14.078
1.00
16.34
N


ATOM
453
C4
A
A
35
46.615
11.653
14.167
1.00
15.44
C


ATOM
454
P
U
A
36
45.278
10.371
21.058
1.00
17.57
P


ATOM
455
O1P
U
A
36
46.459
11.056
21.623
1.00
15.59
O


ATOM
456
O2P
U
A
36
45.073
8.928
21.307
1.00
18.33
O


ATOM
457
O5*
U
A
36
43.954
11.187
21.425
1.00
15.41
O


ATOM
458
C5*
U
A
36
42.696
10.541
21.773
1.00
14.72
C


ATOM
459
C4*
U
A
36
41.978
11.395
22.807
1.00
14.61
C


ATOM
460
O4*
U
A
36
42.775
11.453
23.991
1.00
14.11
O


ATOM
461
C3*
U
A
36
41.774
12.824
22.315
1.00
13.18
C


ATOM
462
O3*
U
A
36
40.517
13.272
22.831
1.00
10.05
O


ATOM
463
C2*
U
A
36
42.798
13.639
23.103
1.00
13.16
C


ATOM
464
O2*
U
A
36
42.458
14.971
23.369
1.00
11.86
O


ATOM
465
C1*
U
A
36
42.962
12.785
24.367
1.00
12.76
C


ATOM
466
N1
U
A
36
44.246
12.864
25.044
1.00
12.63
N


ATOM
467
C2
U
A
36
44.272
13.420
26.310
1.00
12.49
C


ATOM
468
O2
U
A
36
43.273
13.855
26.869
1.00
11.09
O


ATOM
469
N3
U
A
36
45.512
13.445
26.897
1.00
12.26
N


ATOM
470
C4
U
A
36
46.708
12.972
26.346
1.00
12.77
C


ATOM
471
O4
U
A
36
47.745
13.082
26.975
1.00
13.05
O


ATOM
472
C5
U
A
36
46.588
12.412
25.033
1.00
12.67
C


ATOM
473
C6
U
A
36
45.390
12.385
24.437
1.00
12.34
C


ATOM
474
P
G
A
37
39.141
13.060
22.048
1.00
13.09
P


ATOM
475
O1P
G
A
37
38.083
13.377
23.042
1.00
10.03
O


ATOM
476
O2P
G
A
37
39.096
11.783
21.316
1.00
10.97
O


ATOM
477
O5*
G
A
37
39.187
14.193
20.930
1.00
11.94
O


ATOM
478
C5*
G
A
37
38.955
15.558
21.225
1.00
10.67
C


ATOM
479
C4*
G
A
37
38.971
16.337
19.929
1.00
10.54
C


ATOM
480
O4*
G
A
37
40.254
16.171
19.252
1.00
10.54
O


ATOM
481
C3*
G
A
37
37.973
15.801
18.925
1.00
8.49
C


ATOM
482
O3*
G
A
37
36.724
16.406
19.212
1.00
9.71
O


ATOM
483
C2*
G
A
37
38.576
16.253
17.605
1.00
9.01
C


ATOM
484
O2*
G
A
37
38.393
17.651
17.433
1.00
8.99
O


ATOM
485
C1*
G
A
37
40.060
15.983
17.850
1.00
10.41
C


ATOM
486
N9
G
A
37
40.497
14.629
17.473
1.00
9.32
N


ATOM
487
C8
G
A
37
40.487
13.516
18.280
1.00
9.31
C


ATOM
488
N7
G
A
37
40.939
12.442
17.686
1.00
10.48
N


ATOM
489
C5
G
A
37
41.259
12.859
16.408
1.00
10.37
C


ATOM
490
C6
G
A
37
41.782
12.126
15.316
1.00
12.23
C


ATOM
491
O6
G
A
37
42.036
10.920
15.257
1.00
11.04
O


ATOM
492
N1
G
A
37
42.008
12.948
14.197
1.00
9.95
N


ATOM
493
C2
G
A
37
41.748
14.286
14.145
1.00
9.72
C


ATOM
494
N2
G
A
37
42.121
14.931
12.993
1.00
8.78
N


ATOM
495
N3
G
A
37
41.191
14.972
15.147
1.00
9.91
N


ATOM
496
C4
G
A
37
40.995
14.206
16.248
1.00
10.87
C


ATOM
497
P
G
A
38
35.372
15.646
18.858
1.00
12.29
P


ATOM
498
O1P
G
A
38
34.265
16.448
19.436
1.00
9.96
O


ATOM
499
O2P
G
A
38
35.501
14.191
19.191
1.00
11.57
O


ATOM
500
O5*
G
A
38
35.356
15.699
17.258
1.00
12.26
O


ATOM
501
C5*
G
A
38
35.114
16.920
16.540
1.00
10.01
C


ATOM
502
C4*
G
A
38
35.319
16.694
15.053
1.00
8.68
C


ATOM
503
O4*
G
A
38
36.699
16.352
14.806
1.00
7.01
O


ATOM
504
C3*
G
A
38
34.534
15.539
14.472
1.00
9.46
C


ATOM
505
O3*
G
A
38
33.210
15.990
14.155
1.00
12.58
O


ATOM
506
C2*
G
A
38
35.376
15.169
13.243
1.00
10.28
C


ATOM
507
O2*
G
A
38
35.157
15.985
12.097
1.00
9.67
O


ATOM
508
C1*
G
A
38
36.804
15.398
13.759
1.00
9.91
C


ATOM
509
N9
G
A
38
37.412
14.173
14.300
1.00
9.70
N


ATOM
510
C8
G
A
38
37.174
13.612
15.533
1.00
11.32
C


ATOM
511
N7
G
A
38
37.702
12.417
15.664
1.00
11.54
N


ATOM
512
C5
G
A
38
38.362
12.199
14.469
1.00
10.27
C


ATOM
513
C6
G
A
38
39.035
11.045
13.994
1.00
11.93
C


ATOM
514
O6
G
A
38
39.201
9.950
14.575
1.00
8.78
O


ATOM
515
N1
G
A
38
39.522
11.236
12.701
1.00
9.87
N


ATOM
516
C2
G
A
38
39.369
12.387
11.958
1.00
11.31
C


ATOM
517
N2
G
A
38
39.901
12.384
10.710
1.00
10.24
N


ATOM
518
N3
G
A
38
38.739
13.460
12.390
1.00
9.09
N


ATOM
519
C4
G
A
38
38.251
13.297
13.636
1.00
11.34
C


ATOM
520
P
C
A
39
31.929
15.084
14.534
1.00
12.06
P


ATOM
521
O1P
C
A
39
32.388
13.880
15.288
1.00
14.53
O


ATOM
522
O2P
C
A
39
31.165
14.878
13.278
1.00
12.02
O


ATOM
523
O5*
C
A
39
30.903
16.101
15.235
1.00
17.76
O


ATOM
524
C5*
C
A
39
31.093
16.713
16.478
1.00
18.70
C


ATOM
525
C4*
C
A
39
30.971
18.261
16.413
1.00
13.42
C


ATOM
526
O4*
C
A
39
32.106
18.778
15.688
1.00
12.71
O


ATOM
527
C3*
C
A
39
29.833
19.187
15.926
1.00
14.10
C


ATOM
528
O3*
C
A
39
28.693
19.255
16.804
1.00
10.19
O


ATOM
529
C2*
C
A
39
30.564
20.538
16.055
1.00
11.50
C


ATOM
530
O2*
C
A
39
30.790
20.868
17.412
1.00
10.17
O


ATOM
531
C1*
C
A
39
31.968
20.185
15.563
1.00
12.13
C


ATOM
532
N1
C
A
39
32.175
20.579
14.172
1.00
12.50
N


ATOM
533
C2
C
A
39
32.272
21.952
13.896
1.00
12.48
C


ATOM
534
O2
C
A
39
32.208
22.763
14.848
1.00
8.22
O


ATOM
535
N3
C
A
39
32.436
22.355
12.625
1.00
9.69
N


ATOM
536
C4
C
A
39
32.522
21.454
11.639
1.00
11.76
C


ATOM
537
N4
C
A
39
32.692
21.917
10.379
1.00
8.85
N


ATOM
538
C5
C
A
39
32.443
20.047
11.889
1.00
9.77
C


ATOM
539
C6
C
A
39
32.267
19.658
13.164
1.00
11.59
C


ATOM
540
P
A
A
40
27.311
20.017
16.359
1.00
12.95
P


ATOM
541
O1P
A
A
40
26.296
19.624
17.367
1.00
14.20
O


ATOM
542
O2P
A
A
40
26.956
19.926
14.935
1.00
12.21
O


ATOM
543
O5*
A
A
40
27.609
21.582
16.535
1.00
13.28
O


ATOM
544
C5*
A
A
40
27.658
22.200
17.801
1.00
12.99
C


ATOM
545
C4*
A
A
40
27.672
23.716
17.632
1.00
12.56
C


ATOM
546
O4*
A
A
40
28.892
24.107
16.964
1.00
12.75
O


ATOM
547
C3*
A
A
40
26.579
24.281
16.733
1.00
12.50
C


ATOM
548
O3*
A
A
40
25.367
24.448
17.450
1.00
12.35
O


ATOM
549
C2*
A
A
40
27.182
25.611
16.312
1.00
13.51
C


ATOM
550
O2*
A
A
40
27.126
26.577
17.360
1.00
9.74
O


ATOM
551
C1*
A
A
40
28.629
25.197
16.084
1.00
11.10
C


ATOM
552
N9
A
A
40
28.846
24.755
14.708
1.00
11.69
N


ATOM
553
C8
A
A
40
28.858
23.479
14.199
1.00
11.33
C


ATOM
554
N7
A
A
40
29.111
23.431
12.916
1.00
9.06
N


ATOM
555
C5
A
A
40
29.278
24.761
12.557
1.00
8.72
C


ATOM
556
C6
A
A
40
29.583
25.382
11.335
1.00
10.32
C


ATOM
557
N6
A
A
40
29.762
24.716
10.185
1.00
9.54
N


ATOM
558
N1
A
A
40
29.688
26.735
11.325
1.00
9.70
N


ATOM
559
C2
A
A
40
29.463
27.395
12.451
1.00
9.00
C


ATOM
560
N3
A
A
40
29.160
26.925
13.661
1.00
7.01
N


ATOM
561
C4
A
A
40
29.092
25.584
13.644
1.00
9.47
C


ATOM
562
P
C
A
41
23.973
24.307
16.696
1.00
15.74
P


ATOM
563
O1P
C
A
41
22.929
24.350
17.731
1.00
16.51
O


ATOM
564
O2P
C
A
41
23.995
23.181
15.724
1.00
13.62
O


ATOM
565
O5*
C
A
41
23.875
25.634
15.801
1.00
13.56
O


ATOM
566
C5*
C
A
41
23.881
26.916
16.422
1.00
12.13
C


ATOM
567
C4*
C
A
41
24.095
28.017
15.398
1.00
11.36
C


ATOM
568
O4*
C
A
41
25.434
27.933
14.850
1.00
9.89
O


ATOM
569
C3*
C
A
41
23.230
28.007
14.151
1.00
10.98
C


ATOM
570
O3*
C
A
41
21.887
28.429
14.399
1.00
12.66
O


ATOM
571
C2*
C
A
41
24.025
28.958
13.272
1.00
9.59
C


ATOM
572
O2*
C
A
41
23.974
30.285
13.781
1.00
10.24
O


ATOM
573
C1*
C
A
41
25.435
28.418
13.518
1.00
9.79
C


ATOM
574
N1
C
A
41
25.772
27.308
12.587
1.00
7.61
N


ATOM
575
C2
C
A
41
26.181
27.650
11.338
1.00
8.93
C


ATOM
576
O2
C
A
41
26.289
28.873
11.070
1.00
7.63
O


ATOM
577
N3
C
A
41
26.456
26.679
10.436
1.00
8.78
N


ATOM
578
C4
C
A
41
26.342
25.386
10.777
1.00
9.30
C


ATOM
579
N4
C
A
41
26.656
24.451
9.845
1.00
7.23
N


ATOM
580
C5
C
A
41
25.917
24.995
12.077
1.00
7.65
C


ATOM
581
C6
C
A
41
25.654
25.986
12.948
1.00
9.18
C


ATOM
582
P
G
A
42
20.679
27.762
13.566
1.00
13.08
P


ATOM
583
O1P
G
A
42
19.441
28.269
14.161
1.00
16.14
O


ATOM
584
O2P
G
A
42
20.913
26.306
13.504
1.00
15.00
O


ATOM
585
O5*
G
A
42
20.851
28.371
12.110
1.00
13.10
O


ATOM
586
C5*
G
A
42
20.747
29.779
11.925
1.00
11.92
C


ATOM
587
C4*
G
A
42
21.247
30.185
10.564
1.00
10.63
C


ATOM
588
O4*
G
A
42
22.635
29.810
10.398
1.00
11.41
O


ATOM
589
C3*
G
A
42
20.564
29.562
9.364
1.00
12.03
C


ATOM
590
O3*
G
A
42
19.320
30.211
9.085
1.00
13.22
O


ATOM
591
C2*
G
A
42
21.592
29.850
8.281
1.00
11.77
C


ATOM
592
O2*
G
A
42
21.543
31.210
7.918
1.00
12.69
O


ATOM
593
C1*
G
A
42
22.898
29.564
9.023
1.00
11.16
C


ATOM
594
N9
G
A
42
23.287
28.165
8.856
1.00
10.90
N


ATOM
595
C8
G
A
42
23.171
27.140
9.773
1.00
11.82
C


ATOM
596
N7
G
A
42
23.564
25.981
9.301
1.00
11.44
N


ATOM
597
C5
G
A
42
23.980
26.263
8.002
1.00
10.91
C


ATOM
598
C6
G
A
42
24.522
25.409
7.014
1.00
10.75
C


ATOM
599
O6
G
A
42
24.774
24.198
7.101
1.00
11.56
O


ATOM
600
N1
G
A
42
24.797
26.094
5.830
1.00
10.06
N


ATOM
601
C2
G
A
42
24.571
27.437
5.625
1.00
11.20
C


ATOM
602
N2
G
A
42
24.867
27.908
4.395
1.00
10.66
N


ATOM
603
N3
G
A
42
24.079
28.254
6.555
1.00
10.51
N


ATOM
604
C4
G
A
42
23.812
27.602
7.710
1.00
11.10
C


ATOM
605
P
C
A
43
18.199
29.425
8.233
1.00
16.98
P


ATOM
606
O1P
C
A
43
16.981
30.235
8.305
1.00
18.69
O


ATOM
607
O2P
C
A
43
18.159
28.002
8.664
1.00
16.43
O


ATOM
608
O5*
C
A
43
18.772
29.450
6.752
1.00
18.08
O


ATOM
609
C5*
C
A
43
19.022
30.687
6.085
1.00
19.11
C


ATOM
610
C4*
C
A
43
19.566
30.428
4.708
1.00
19.02
C


ATOM
611
O4*
C
A
43
20.885
29.830
4.791
1.00
19.55
O


ATOM
612
C3*
C
A
43
18.780
29.428
3.877
1.00
21.07
C


ATOM
613
O3*
C
A
43
17.631
30.042
3.313
1.00
24.02
O


ATOM
614
C2*
C
A
43
19.809
29.042
2.827
1.00
20.29
C


ATOM
615
O2*
C
A
43
20.015
30.061
1.875
1.00
21.85
O


ATOM
616
C1*
C
A
43
21.077
28.962
3.680
1.00
19.45
C


ATOM
617
N1
C
A
43
21.322
27.589
4.156
1.00
18.04
N


ATOM
618
C2
C
A
43
21.977
26.691
3.289
1.00
16.83
C


ATOM
619
O2
C
A
43
22.310
27.073
2.145
1.00
15.78
O


ATOM
620
N3
C
A
43
22.223
25.438
3.712
1.00
15.70
N


ATOM
621
C4
C
A
43
21.839
25.054
4.931
1.00
15.08
C


ATOM
622
N4
C
A
43
22.109
23.808
5.295
1.00
14.50
N


ATOM
623
C5
C
A
43
21.157
25.932
5.821
1.00
14.50
C


ATOM
624
C6
C
A
43
20.923
27.181
5.397
1.00
16.57
C


ATOM
625
P
A
A
44
16.292
29.177
3.069
1.00
26.38
P


ATOM
626
O1P
A
A
44
15.341
30.100
2.418
1.00
26.88
O


ATOM
627
O2P
A
A
44
15.887
28.460
4.292
1.00
26.22
O


ATOM
628
O5*
A
A
44
16.749
28.038
2.049
1.00
26.94
O


ATOM
629
C5*
A
A
44
16.902
28.330
0.686
1.00
26.45
C


ATOM
630
C4*
A
A
44
17.567
27.192
−0.053
1.00
25.81
C


ATOM
631
O4*
A
A
44
18.835
26.841
0.575
1.00
24.67
O


ATOM
632
C3*
A
A
44
16.920
25.825
−0.196
1.00
25.05
C


ATOM
633
O3*
A
A
44
15.807
25.831
−1.095
1.00
24.16
O


ATOM
634
C2*
A
A
44
18.110
25.083
−0.805
1.00
24.43
C


ATOM
635
O2*
A
A
44
18.398
25.556
−2.099
1.00
26.31
O


ATOM
636
C1*
A
A
44
19.261
25.596
0.065
1.00
23.69
C


ATOM
637
N9
A
A
44
19.494
24.674
1.169
1.00
22.64
N


ATOM
638
C8
A
A
44
19.153
24.773
2.489
1.00
22.43
C


ATOM
639
N7
A
A
44
19.450
23.707
3.189
1.00
22.73
N


ATOM
640
C5
A
A
44
20.033
22.855
2.261
1.00
23.13
C


ATOM
641
C6
A
A
44
20.536
21.549
2.358
1.00
22.92
C


ATOM
642
N6
A
A
44
20.506
20.829
3.484
1.00
25.71
N


ATOM
643
N1
A
A
44
21.061
20.993
1.248
1.00
23.09
N


ATOM
644
C2
A
A
44
21.058
21.702
0.115
1.00
22.58
C


ATOM
645
N3
A
A
44
20.591
22.928
−0.108
1.00
23.04
N


ATOM
646
C4
A
A
44
20.089
23.452
1.022
1.00
22.28
C


ATOM
647
P
A
A
45
14.684
24.664
−1.004
1.00
27.94
P


ATOM
648
O1P
A
A
45
13.634
25.021
−1.982
1.00
27.49
O


ATOM
649
O2P
A
A
45
14.311
24.380
0.411
1.00
26.25
O


ATOM
650
O5*
A
A
45
15.413
23.346
−1.552
1.00
23.99
O


ATOM
651
C5*
A
A
45
15.896
23.306
−2.888
1.00
22.10
C


ATOM
652
C4*
A
A
45
16.650
22.030
−3.146
1.00
20.80
C


ATOM
653
O4*
A
A
45
17.782
21.931
−2.238
1.00
18.05
O


ATOM
654
C3*
A
A
45
15.868
20.747
−2.898
1.00
19.29
C


ATOM
655
O3*
A
A
45
15.065
20.459
−4.029
1.00
19.25
O


ATOM
656
C2*
A
A
45
16.997
19.740
−2.727
1.00
18.96
C


ATOM
657
O2*
A
A
45
17.625
19.395
−3.953
1.00
20.46
O


ATOM
658
C1*
A
A
45
17.993
20.564
−1.910
1.00
18.32
C


ATOM
659
N9
A
A
45
17.789
20.387
−0.475
1.00
17.77
N


ATOM
660
C8
A
A
45
17.252
21.249
0.459
1.00
16.87
C


ATOM
661
N7
A
A
45
17.270
20.777
1.682
1.00
15.45
N


ATOM
662
C5
A
A
45
17.848
19.517
1.545
1.00
15.55
C


ATOM
663
C6
A
A
45
18.183
18.528
2.467
1.00
14.82
C


ATOM
664
N6
A
A
45
18.021
18.657
3.781
1.00
14.77
N


ATOM
665
N1
A
A
45
18.728
17.384
1.995
1.00
14.86
N


ATOM
666
C2
A
A
45
18.938
17.265
0.678
1.00
15.61
C


ATOM
667
N3
A
A
45
18.684
18.132
−0.288
1.00
15.02
N


ATOM
668
C4
A
A
45
18.143
19.255
0.218
1.00
16.02
C


ATOM
669
P
G
A
46
13.590
19.837
−3.836
1.00
21.76
P


ATOM
670
O1P
G
A
46
13.111
19.621
−5.232
1.00
20.58
O


ATOM
671
O2P
G
A
46
12.796
20.615
−2.886
1.00
20.12
O


ATOM
672
O5*
G
A
46
13.859
18.416
−3.180
1.00
17.63
O


ATOM
673
C5*
G
A
46
14.553
17.429
−3.908
1.00
16.53
C


ATOM
674
C4*
G
A
46
14.840
16.245
−3.025
1.00
16.28
C


ATOM
675
O4*
G
A
46
15.704
16.691
−1.942
1.00
15.33
O


ATOM
676
C3*
G
A
46
13.638
15.650
−2.304
1.00
14.19
C


ATOM
677
O3*
G
A
46
12.948
14.718
−3.148
1.00
15.95
O


ATOM
678
C2*
G
A
46
14.333
14.927
−1.152
1.00
15.24
C


ATOM
679
O2*
G
A
46
14.970
13.723
−1.593
1.00
11.22
O


ATOM
680
C1*
G
A
46
15.417
15.944
−0.780
1.00
14.15
C


ATOM
681
N9
G
A
46
15.039
16.881
0.277
1.00
15.14
N


ATOM
682
C8
G
A
46
14.429
18.103
0.129
1.00
13.16
C


ATOM
683
N7
G
A
46
14.249
18.721
1.264
1.00
14.91
N


ATOM
684
C5
G
A
46
14.764
17.852
2.223
1.00
13.65
C


ATOM
685
C6
G
A
46
14.850
17.980
3.646
1.00
14.21
C


ATOM
686
O6
G
A
46
14.457
18.922
4.373
1.00
13.60
O


ATOM
687
N1
G
A
46
15.451
16.861
4.225
1.00
14.00
N


ATOM
688
C2
G
A
46
15.898
15.760
3.534
1.00
13.23
C


ATOM
689
N2
G
A
46
16.436
14.773
4.271
1.00
13.07
N


ATOM
690
N3
G
A
46
15.821
15.634
2.214
1.00
12.14
N


ATOM
691
C4
G
A
46
15.250
16.709
1.631
1.00
13.14
C


ATOM
692
P
U
A
47
11.385
14.951
−3.517
1.00
19.08
P


ATOM
693
O1P
U
A
47
11.197
14.163
−4.756
1.00
17.31
O


ATOM
694
O2P
U
A
47
11.026
16.379
−3.510
1.00
17.84
O


ATOM
695
O5*
U
A
47
10.608
14.251
−2.313
1.00
18.59
O


ATOM
696
C5*
U
A
47
10.648
12.828
−2.158
1.00
16.76
C


ATOM
697
C4*
U
A
47
9.701
12.351
−1.061
1.00
16.21
C


ATOM
698
O4*
U
A
47
10.107
12.861
0.238
1.00
17.46
O


ATOM
699
C3*
U
A
47
8.276
12.864
−1.282
1.00
17.26
C


ATOM
700
O3*
U
A
47
7.372
11.850
−0.863
1.00
19.08
O


ATOM
701
C2*
U
A
47
8.151
14.098
−0.386
1.00
16.32
C


ATOM
702
O2*
U
A
47
6.881
14.327
0.178
1.00
17.79
O


ATOM
703
C1*
U
A
47
9.107
13.723
0.745
1.00
15.55
C


ATOM
704
N1
U
A
47
9.692
14.810
1.530
1.00
16.06
N


ATOM
705
C2
U
A
47
9.596
14.735
2.916
1.00
14.37
C


ATOM
706
O2
U
A
47
9.141
13.791
3.505
1.00
16.51
O


ATOM
707
N3
U
A
47
10.080
15.822
3.587
1.00
14.95
N


ATOM
708
C4
U
A
47
10.653
16.933
3.042
1.00
13.66
C


ATOM
709
O4
U
A
47
10.944
17.868
3.777
1.00
15.44
O


ATOM
710
C5
U
A
47
10.761
16.919
1.612
1.00
15.07
C


ATOM
711
C6
U
A
47
10.287
15.883
0.923
1.00
14.65
C


ATOM
712
P
U
A
48
6.564
10.960
−1.926
1.00
19.85
P


ATOM
713
O1P
U
A
48
5.831
10.040
−1.031
1.00
17.53
O


ATOM
714
O2P
U
A
48
7.437
10.406
−2.967
1.00
19.96
O


ATOM
715
O5*
U
A
48
5.542
12.008
−2.571
1.00
18.50
O


ATOM
716
C5*
U
A
48
4.823
12.913
−1.720
1.00
20.70
C


ATOM
717
C4*
U
A
48
3.830
13.767
−2.497
1.00
19.87
C


ATOM
718
O4*
U
A
48
2.984
12.978
−3.363
1.00
21.40
O


ATOM
719
C3*
U
A
48
2.919
14.504
−1.528
1.00
22.14
C


ATOM
720
O3*
U
A
48
3.091
15.908
−1.719
1.00
21.98
O


ATOM
721
C2*
U
A
48
1.483
14.183
−1.964
1.00
21.79
C


ATOM
722
O2*
U
A
48
0.647
15.315
−2.023
1.00
26.06
O


ATOM
723
C1*
U
A
48
1.688
13.540
−3.338
1.00
21.49
C


ATOM
724
N1
U
A
48
0.705
12.541
−3.764
1.00
19.91
N


ATOM
725
C2
U
A
48
−0.172
12.916
−4.772
1.00
21.91
C


ATOM
726
O2
U
A
48
−0.152
14.035
−5.305
1.00
19.91
O


ATOM
727
N3
U
A
48
−1.064
11.951
−5.142
1.00
19.50
N


ATOM
728
C4
U
A
48
−1.172
10.684
−4.623
1.00
21.21
C


ATOM
729
O4
U
A
48
−2.052
9.936
−5.040
1.00
20.77
O


ATOM
730
C5
U
A
48
−0.228
10.375
−3.587
1.00
20.54
C


ATOM
731
C6
U
A
48
0.653
11.297
−3.207
1.00
21.02
C


ATOM
732
P
U
A
49
4.393
16.672
−1.154
1.00
24.27
P


ATOM
733
O1P
U
A
49
4.112
18.081
−1.444
1.00
21.67
O


ATOM
734
O2P
U
A
49
5.651
16.051
−1.658
1.00
22.02
O


ATOM
735
O5*
U
A
49
4.355
16.404
0.420
1.00
21.65
O


ATOM
736
C5*
U
A
49
3.224
16.736
1.195
1.00
22.80
C


ATOM
737
C4*
U
A
49
3.533
16.647
2.677
1.00
22.95
C


ATOM
738
O4*
U
A
49
3.474
15.273
3.129
1.00
24.98
O


ATOM
739
C3*
U
A
49
4.962
17.109
2.990
1.00
23.23
C


ATOM
740
O3*
U
A
49
4.958
17.936
4.127
1.00
22.14
O


ATOM
741
C2*
U
A
49
5.807
15.844
3.138
1.00
22.97
C


ATOM
742
O2*
U
A
49
6.804
15.917
4.151
1.00
21.66
O


ATOM
743
C1*
U
A
49
4.739
14.866
3.606
1.00
21.87
C


ATOM
744
N1
U
A
49
4.895
13.418
3.466
1.00
20.97
N


ATOM
745
C2
U
A
49
4.504
12.676
4.556
1.00
18.94
C


ATOM
746
O2
U
A
49
4.042
13.182
5.570
1.00
19.40
O


ATOM
747
N3
U
A
49
4.657
11.327
4.423
1.00
19.76
N


ATOM
748
C4
U
A
49
5.143
10.659
3.335
1.00
19.27
C


ATOM
749
O4
U
A
49
5.181
9.441
3.367
1.00
19.94
O


ATOM
750
C5
U
A
49
5.533
11.493
2.231
1.00
19.78
C


ATOM
751
C6
U
A
49
5.398
12.822
2.336
1.00
21.36
C


ATOM
752
P
C
A
50
6.149
18.965
4.371
1.00
25.95
P


ATOM
753
O1P
C
A
50
5.898
20.166
3.536
1.00
23.44
O


ATOM
754
O2P
C
A
50
7.459
18.286
4.288
1.00
25.26
O


ATOM
755
O5*
C
A
50
5.932
19.376
5.884
1.00
22.51
O


ATOM
756
C5*
C
A
50
4.862
20.228
6.238
1.00
24.64
C


ATOM
757
C4*
C
A
50
4.500
19.987
7.664
1.00
23.18
C


ATOM
758
O4*
C
A
50
3.946
18.653
7.796
1.00
22.19
O


ATOM
759
C3*
C
A
50
5.714
19.971
8.569
1.00
23.50
C


ATOM
760
O3*
C
A
50
6.031
21.297
8.951
1.00
23.96
O


ATOM
761
C2*
C
A
50
5.211
19.146
9.739
1.00
22.22
C


ATOM
762
O2*
C
A
50
4.390
19.925
10.581
1.00
23.29
O


ATOM
763
C1*
C
A
50
4.379
18.081
9.020
1.00
20.79
C


ATOM
764
N1
C
A
50
5.148
16.857
8.718
1.00
18.62
N


ATOM
765
C2
C
A
50
5.579
16.061
9.782
1.00
18.06
C


ATOM
766
O2
C
A
50
5.294
16.403
10.944
1.00
18.10
O


ATOM
767
N3
C
A
50
6.286
14.924
9.529
1.00
18.43
N


ATOM
768
C4
C
A
50
6.541
14.559
8.278
1.00
16.95
C


ATOM
769
N4
C
A
50
7.187
13.390
8.096
1.00
16.78
N


ATOM
770
C5
C
A
50
6.127
15.362
7.156
1.00
18.20
C


ATOM
771
C6
C
A
50
5.435
16.496
7.425
1.00
18.70
C


ATOM
772
P
U
A
51
7.545
21.802
8.901
1.00
26.70
P


ATOM
773
O1P
U
A
51
7.531
23.156
9.484
1.00
26.93
O


ATOM
774
O2P
U
A
51
8.082
21.612
7.532
1.00
25.00
O


ATOM
775
O5*
U
A
51
8.299
20.823
9.912
1.00
25.55
O


ATOM
776
C5*
U
A
51
8.050
20.887
11.311
1.00
24.16
C


ATOM
777
C4*
U
A
51
8.585
19.653
12.005
1.00
21.45
C


ATOM
778
O4*
U
A
51
7.971
18.463
11.431
1.00
22.38
O


ATOM
779
C3*
U
A
51
10.072
19.376
11.858
1.00
22.73
C


ATOM
780
O3*
U
A
51
10.845
20.123
12.799
1.00
23.34
O


ATOM
781
C2*
U
A
51
10.136
17.895
12.196
1.00
19.59
C


ATOM
782
O2*
U
A
51
9.952
17.713
13.569
1.00
19.61
O


ATOM
783
C1*
U
A
51
8.878
17.367
11.512
1.00
19.68
C


ATOM
784
N1
U
A
51
9.123
16.859
10.157
1.00
18.48
N


ATOM
785
C2
U
A
51
9.636
15.574
10.020
1.00
17.36
C


ATOM
786
O2
U
A
51
9.930
14.862
10.967
1.00
16.91
O


ATOM
787
N3
U
A
51
9.794
15.153
8.726
1.00
16.05
N


ATOM
788
C4
U
A
51
9.507
15.862
7.571
1.00
16.11
C


ATOM
789
O4
U
A
51
9.679
15.322
6.462
1.00
15.90
O


ATOM
790
C5
U
A
51
8.995
17.179
7.796
1.00
16.34
C


ATOM
791
C6
U
A
51
8.832
17.627
9.048
1.00
16.75
C


ATOM
792
P
A
A
52
12.365
20.517
12.450
1.00
25.19
P


ATOM
793
O1P
A
A
52
12.846
21.398
13.550
1.00
27.50
O


ATOM
794
O2P
A
A
52
12.491
20.983
11.045
1.00
24.46
O


ATOM
795
O5*
A
A
52
13.148
19.125
12.569
1.00
22.18
O


ATOM
796
C5*
A
A
52
13.100
18.336
13.756
1.00
18.74
C


ATOM
797
C4*
A
A
52
13.709
16.971
13.493
1.00
17.82
C


ATOM
798
O4*
A
A
52
12.882
16.221
12.560
1.00
17.99
O


ATOM
799
C3*
A
A
52
15.082
17.021
12.845
1.00
17.21
C


ATOM
800
O3*
A
A
52
16.060
17.141
13.881
1.00
17.77
O


ATOM
801
C2*
A
A
52
15.156
15.672
12.120
1.00
16.59
C


ATOM
802
O2*
A
A
52
15.493
14.600
12.980
1.00
13.63
O


ATOM
803
C1*
A
A
52
13.704
15.476
11.671
1.00
16.93
C


ATOM
804
N9
A
A
52
13.417
15.925
10.298
1.00
18.02
N


ATOM
805
C8
A
A
52
12.870
17.137
9.918
1.00
16.91
C


ATOM
806
N7
A
A
52
12.661
17.242
8.623
1.00
15.76
N


ATOM
807
C5
A
A
52
13.113
16.029
8.111
1.00
14.73
C


ATOM
808
C6
A
A
52
13.146
15.509
6.798
1.00
13.71
C


ATOM
809
N6
A
A
52
12.694
16.170
5.730
1.00
10.55
N


ATOM
810
N1
A
A
52
13.660
14.265
6.626
1.00
13.71
N


ATOM
811
C2
A
A
52
14.109
13.603
7.708
1.00
14.81
C


ATOM
812
N3
A
A
52
14.126
13.984
8.990
1.00
14.94
N


ATOM
813
C4
A
A
52
13.603
15.217
9.127
1.00
15.57
C


ATOM
814
P
C
A
53
17.368
18.050
13.670
1.00
17.51
P


ATOM
815
O1P
C
A
53
18.117
18.008
14.931
1.00
19.79
O


ATOM
816
O2P
C
A
53
17.010
19.365
13.065
1.00
18.18
O


ATOM
817
O5*
C
A
53
18.238
17.298
12.563
1.00
17.18
O


ATOM
818
C5*
C
A
53
18.901
16.077
12.842
1.00
15.96
C


ATOM
819
C4*
C
A
53
19.332
15.446
11.543
1.00
15.86
C


ATOM
820
O4*
C
A
53
18.121
15.192
10.776
1.00
13.93
O


ATOM
821
C3*
C
A
53
20.160
16.337
10.613
1.00
15.87
C


ATOM
822
O3*
C
A
53
21.559
16.258
10.903
1.00
17.45
O


ATOM
823
C2*
C
A
53
19.874
15.668
9.269
1.00
14.61
C


ATOM
824
O2*
C
A
53
20.537
14.416
9.169
1.00
13.51
O


ATOM
825
C1*
C
A
53
18.388
15.358
9.402
1.00
14.62
C


ATOM
826
N1
C
A
53
17.495
16.398
8.857
1.00
15.30
N


ATOM
827
C2
C
A
53
16.863
16.150
7.629
1.00
16.59
C


ATOM
828
O2
C
A
53
17.047
15.036
7.072
1.00
15.08
O


ATOM
829
N3
C
A
53
16.064
17.117
7.081
1.00
15.46
N


ATOM
830
C4
C
A
53
15.882
18.269
7.728
1.00
15.94
C


ATOM
831
N4
C
A
53
15.106
19.193
7.161
1.00
17.55
N


ATOM
832
C5
C
A
53
16.499
18.534
8.995
1.00
18.55
C


ATOM
833
C6
C
A
53
17.293
17.579
9.514
1.00
17.04
C


ATOM
834
P
C
A
54
22.305
17.435
11.741
1.00
20.49
P


ATOM
835
O1P
C
A
54
21.325
18.517
12.085
1.00
24.43
O


ATOM
836
O2P
C
A
54
23.512
17.794
10.991
1.00
22.51
O


ATOM
837
O5*
C
A
54
22.926
16.655
12.998
1.00
23.22
O


ATOM
838
C5*
C
A
54
22.304
16.618
14.248
1.00
20.44
C


ATOM
839
C4*
C
A
54
22.299
15.199
14.853
1.00
16.62
C


ATOM
840
O4*
C
A
54
21.619
14.289
13.954
1.00
13.18
O


ATOM
841
C3*
C
A
54
23.510
14.352
15.283
1.00
15.61
C


ATOM
842
O3*
C
A
54
24.117
14.814
16.513
1.00
15.53
O


ATOM
843
C2*
C
A
54
22.766
13.054
15.620
1.00
13.43
C


ATOM
844
O2*
C
A
54
22.037
13.231
16.823
1.00
10.21
O


ATOM
845
C1*
C
A
54
21.731
12.976
14.489
1.00
12.97
C


ATOM
846
N1
C
A
54
22.153
12.031
13.438
1.00
11.15
N


ATOM
847
C2
C
A
54
22.102
10.675
13.741
1.00
11.28
C


ATOM
848
O2
C
A
54
21.721
10.338
14.877
1.00
10.76
O


ATOM
849
N3
C
A
54
22.477
9.762
12.803
1.00
11.24
N


ATOM
850
C4
C
A
54
22.895
10.172
11.606
1.00
11.60
C


ATOM
851
N4
C
A
54
23.244
9.240
10.718
1.00
10.41
N


ATOM
852
C5
C
A
54
22.965
11.562
11.266
1.00
11.08
C


ATOM
853
C6
C
A
54
22.582
12.450
12.207
1.00
9.08
C


ATOM
854
P
G
A
55
25.594
14.295
16.976
1.00
13.05
P


ATOM
855
O1P
G
A
55
25.962
15.246
18.039
1.00
12.26
O


ATOM
856
O2P
G
A
55
26.496
14.069
15.850
1.00
11.41
O


ATOM
857
O5*
G
A
55
25.399
12.866
17.663
1.00
10.95
O


ATOM
858
C5*
G
A
55
24.831
12.746
18.965
1.00
14.14
C


ATOM
859
C4*
G
A
55
24.802
11.304
19.405
1.00
11.91
C


ATOM
860
O4*
G
A
55
23.968
10.508
18.509
1.00
13.05
O


ATOM
861
C3*
G
A
55
26.149
10.594
19.367
1.00
11.99
C


ATOM
862
O3*
G
A
55
26.929
10.898
20.521
1.00
13.10
O


ATOM
863
C2*
G
A
55
25.728
9.140
19.334
1.00
11.08
C


ATOM
864
O2*
G
A
55
25.292
8.701
20.587
1.00
10.72
O


ATOM
865
C1*
G
A
55
24.527
9.203
18.390
1.00
11.23
C


ATOM
866
N9
G
A
55
24.931
8.998
17.004
1.00
10.75
N


ATOM
867
C8
G
A
55
25.171
9.961
16.045
1.00
7.33
C


ATOM
868
N7
G
A
55
25.353
9.455
14.847
1.00
9.34
N


ATOM
869
C5
G
A
55
25.283
8.079
15.039
1.00
7.65
C


ATOM
870
C6
G
A
55
25.397
7.019
14.114
1.00
8.58
C


ATOM
871
O6
G
A
55
25.565
7.082
12.885
1.00
7.74
O


ATOM
872
N1
G
A
55
25.286
5.775
14.741
1.00
7.05
N


ATOM
873
C2
G
A
55
25.090
5.584
16.085
1.00
9.83
C


ATOM
874
N2
G
A
55
25.007
4.306
16.495
1.00
12.00
N


ATOM
875
N3
G
A
55
24.974
6.572
16.964
1.00
9.85
N


ATOM
876
C4
G
A
55
25.068
7.781
16.375
1.00
9.88
C


ATOM
877
P
G
A
56
28.506
11.118
20.370
1.00
13.76
P


ATOM
878
O1P
G
A
56
28.941
11.806
21.603
1.00
13.53
O


ATOM
879
O2P
G
A
56
28.773
11.722
19.054
1.00
14.23
O


ATOM
880
O5*
G
A
56
29.116
9.645
20.366
1.00
13.32
O


ATOM
881
C5*
G
A
56
29.082
8.849
21.551
1.00
13.87
C


ATOM
882
C4*
G
A
56
29.183
7.374
21.197
1.00
10.21
C


ATOM
883
O4*
G
A
56
28.057
7.023
20.333
1.00
11.28
O


ATOM
884
C3*
G
A
56
30.397
6.922
20.404
1.00
11.41
C


ATOM
885
O3*
G
A
56
31.527
6.711
21.273
1.00
9.71
O


ATOM
886
C2*
G
A
56
29.878
5.605
19.841
1.00
8.73
C


ATOM
887
O2*
G
A
56
29.794
4.659
20.902
1.00
8.39
O


ATOM
888
C1*
G
A
56
28.462
6.009
19.418
1.00
10.13
C


ATOM
889
N9
G
A
56
28.486
6.581
18.071
1.00
10.46
N


ATOM
890
C8
G
A
56
28.551
7.917
17.730
1.00
8.60
C


ATOM
891
N7
G
A
56
28.622
8.113
16.441
1.00
9.93
N


ATOM
892
C5
G
A
56
28.582
6.835
15.897
1.00
8.16
C


ATOM
893
C6
G
A
56
28.626
6.411
14.551
1.00
7.93
C


ATOM
894
O6
G
A
56
28.702
7.101
13.539
1.00
9.17
O


ATOM
895
N1
G
A
56
28.583
5.011
14.447
1.00
7.02
N


ATOM
896
C2
G
A
56
28.536
4.139
15.508
1.00
9.44
C


ATOM
897
N2
G
A
56
28.557
2.786
15.209
1.00
7.72
N


ATOM
898
N3
G
A
56
28.484
4.537
16.776
1.00
8.70
N


ATOM
899
C4
G
A
56
28.504
5.880
16.892
1.00
8.72
C


ATOM
900
P
G
A
57
32.995
6.619
20.673
1.00
13.57
P


ATOM
901
O1P
G
A
57
33.919
6.615
21.837
1.00
11.27
O


ATOM
902
O2P
G
A
57
33.141
7.686
19.611
1.00
15.12
O


ATOM
903
O5*
G
A
57
33.076
5.220
19.904
1.00
11.64
O


ATOM
904
C5*
G
A
57
32.985
3.994
20.606
1.00
10.61
C


ATOM
905
C4*
G
A
57
32.946
2.834
19.636
1.00
10.65
C


ATOM
906
O4*
G
A
57
31.764
2.920
18.802
1.00
10.66
O


ATOM
907
C3*
G
A
57
34.095
2.749
18.637
1.00
11.12
C


ATOM
908
O3*
G
A
57
35.211
2.092
19.227
1.00
12.09
O


ATOM
909
C2*
G
A
57
33.485
1.856
17.579
1.00
10.15
C


ATOM
910
O2*
G
A
57
33.432
0.527
18.073
1.00
11.67
O


ATOM
911
C1*
G
A
57
32.080
2.447
17.495
1.00
10.13
C


ATOM
912
N9
G
A
57
32.032
3.577
16.566
1.00
9.94
N


ATOM
913
C8
G
A
57
31.948
4.915
16.870
1.00
8.87
C


ATOM
914
N7
G
A
57
31.840
5.680
15.806
1.00
9.77
N


ATOM
915
C5
G
A
57
31.896
4.788
14.736
1.00
9.04
C


ATOM
916
C6
G
A
57
31.841
5.018
13.333
1.00
7.94
C


ATOM
917
O6
G
A
57
31.697
6.098
12.719
1.00
7.09
O


ATOM
918
N1
G
A
57
31.968
3.833
12.617
1.00
7.83
N


ATOM
919
C2
G
A
57
32.123
2.586
13.170
1.00
9.14
C


ATOM
920
N2
G
A
57
32.295
1.559
12.302
1.00
10.54
N


ATOM
921
N3
G
A
57
32.131
2.356
14.477
1.00
8.97
N


ATOM
922
C4
G
A
57
32.030
3.490
15.192
1.00
8.38
C


ATOM
923
P
C
A
58
36.691
2.608
18.919
1.00
16.58
P


ATOM
924
O1P
C
A
58
37.600
1.734
19.679
1.00
19.45
O


ATOM
925
O2P
C
A
58
36.781
4.087
19.087
1.00
16.41
O


ATOM
926
O5*
C
A
58
36.862
2.269
17.367
1.00
16.52
O


ATOM
927
C5*
C
A
58
36.965
0.921
16.950
1.00
15.10
C


ATOM
928
C4*
C
A
58
36.833
0.823
15.451
1.00
15.33
C


ATOM
929
O4*
C
A
58
35.542
1.354
15.026
1.00
13.38
O


ATOM
930
C3*
C
A
58
37.823
1.632
14.626
1.00
15.47
C


ATOM
931
O3*
C
A
58
39.082
0.999
14.519
1.00
18.74
O


ATOM
932
C2*
C
A
58
37.111
1.673
13.286
1.00
11.17
C


ATOM
933
O2*
C
A
58
37.125
0.418
12.637
1.00
13.01
O


ATOM
934
C1*
C
A
58
35.686
1.967
13.742
1.00
11.98
C


ATOM
935
N1
C
A
58
35.440
3.426
13.858
1.00
10.35
N


ATOM
936
C2
C
A
58
35.230
4.131
12.682
1.00
10.97
C


ATOM
937
O2
C
A
58
35.275
3.491
11.594
1.00
7.46
O


ATOM
938
N3
C
A
58
34.999
5.473
12.736
1.00
8.91
N


ATOM
939
C4
C
A
58
34.990
6.105
13.916
1.00
10.48
C


ATOM
940
N4
C
A
58
34.733
7.422
13.921
1.00
7.52
N


ATOM
941
C5
C
A
58
35.227
5.405
15.148
1.00
10.36
C


ATOM
942
C6
C
A
58
35.431
4.071
15.068
1.00
9.90
C


ATOM
943
P
A
A
59
40.407
1.880
14.428
1.00
19.54
P


ATOM
944
O1P
A
A
59
41.476
0.909
14.728
1.00
23.14
O


ATOM
945
O2P
A
A
59
40.304
3.121
15.233
1.00
20.05
O


ATOM
946
O5*
A
A
59
40.518
2.324
12.905
1.00
18.97
O


ATOM
947
C5*
A
A
59
40.522
1.349
11.874
1.00
17.12
C


ATOM
948
C4*
A
A
59
40.269
1.997
10.535
1.00
14.95
C


ATOM
949
O4*
A
A
59
38.970
2.626
10.543
1.00
12.74
O


ATOM
950
C3*
A
A
59
41.236
3.099
10.117
1.00
15.97
C


ATOM
951
O3*
A
A
59
42.370
2.521
9.460
1.00
17.11
O


ATOM
952
C2*
A
A
59
40.403
3.881
9.110
1.00
13.15
C


ATOM
953
O2*
A
A
59
40.358
3.242
7.841
1.00
15.00
O


ATOM
954
C1*
A
A
59
39.025
3.807
9.760
1.00
12.41
C


ATOM
955
N9
A
A
59
38.757
4.934
10.638
1.00
9.57
N


ATOM
956
C8
A
A
59
38.679
4.950
12.014
1.00
9.34
C


ATOM
957
N7
A
A
59
38.272
6.102
12.500
1.00
10.38
N


ATOM
958
C5
A
A
59
38.113
6.898
11.371
1.00
9.84
C


ATOM
959
C6
A
A
59
37.677
8.220
11.208
1.00
10.49
C


ATOM
960
N6
A
A
59
37.293
8.991
12.234
1.00
9.11
N


ATOM
961
N1
A
A
59
37.636
8.724
9.946
1.00
8.89
N


ATOM
962
C2
A
A
59
37.996
7.927
8.925
1.00
9.88
C


ATOM
963
N3
A
A
59
38.421
6.652
8.958
1.00
9.56
N


ATOM
964
C4
A
A
59
38.454
6.200
10.225
1.00
11.25
C


ATOM
965
P
C
A
60
43.798
3.248
9.508
1.00
19.04
P


ATOM
966
O1P
C
A
60
44.656
2.375
8.681
1.00
20.33
O


ATOM
967
O2P
C
A
60
44.215
3.641
10.866
1.00
19.75
O


ATOM
968
O5*
C
A
60
43.620
4.636
8.731
1.00
18.48
O


ATOM
969
C5*
C
A
60
43.338
4.688
7.333
1.00
18.85
C


ATOM
970
C4*
C
A
60
42.905
6.090
6.953
1.00
17.29
C


ATOM
971
O4*
C
A
60
41.667
6.421
7.652
1.00
16.10
O


ATOM
972
C3*
C
A
60
43.873
7.165
7.423
1.00
17.86
C


ATOM
973
O3*
C
A
60
44.971
7.328
6.520
1.00
15.95
O


ATOM
974
C2*
C
A
60
42.968
8.392
7.489
1.00
15.75
C


ATOM
975
O2*
C
A
60
42.660
8.899
6.212
1.00
13.38
O


ATOM
976
C1*
C
A
60
41.697
7.779
8.067
1.00
14.86
C


ATOM
977
N1
C
A
60
41.611
7.830
9.544
1.00
13.02
N


ATOM
978
C2
C
A
60
41.114
8.986
10.149
1.00
11.98
C


ATOM
979
O2
C
A
60
40.879
9.970
9.439
1.00
9.87
O


ATOM
980
N3
C
A
60
40.914
8.997
11.493
1.00
10.09
N


ATOM
981
C4
C
A
60
41.219
7.909
12.218
1.00
11.18
C


ATOM
982
N4
C
A
60
40.938
7.915
13.561
1.00
10.47
N


ATOM
983
C5
C
A
60
41.802
6.761
11.628
1.00
12.15
C


ATOM
984
C6
C
A
60
41.979
6.759
10.306
1.00
12.05
C


ATOM
985
P
C
A
61
46.349
7.925
7.060
1.00
17.42
P


ATOM
986
O1P
C
A
61
47.356
7.713
5.977
1.00
20.07
O


ATOM
987
O2P
C
A
61
46.636
7.464
8.434
1.00
19.05
O


ATOM
988
O5*
C
A
61
46.133
9.494
7.201
1.00
17.65
O


ATOM
989
C5*
C
A
61
45.764
10.291
6.081
1.00
17.42
C


ATOM
990
C4*
C
A
61
45.277
11.640
6.565
1.00
16.64
C


ATOM
991
O4*
C
A
61
44.139
11.438
7.426
1.00
14.79
O


ATOM
992
C3*
C
A
61
46.263
12.391
7.446
1.00
17.63
C


ATOM
993
O3*
C
A
61
47.139
13.091
6.584
1.00
20.87
O


ATOM
994
C2*
C
A
61
45.331
13.335
8.203
1.00
16.43
C


ATOM
995
O2*
C
A
61
44.852
14.369
7.365
1.00
14.86
O


ATOM
996
C1*
C
A
61
44.138
12.417
8.454
1.00
13.94
C


ATOM
997
N1
C
A
61
44.153
11.729
9.750
1.00
13.10
N


ATOM
998
C2
C
A
61
43.580
12.368
10.847
1.00
12.09
C


ATOM
999
O2
C
A
61
43.175
13.525
10.710
1.00
13.24
O


ATOM
1000
N3
C
A
61
43.489
11.711
12.021
1.00
11.67
N


ATOM
1001
C4
C
A
61
43.975
10.468
12.138
1.00
12.40
C


ATOM
1002
N4
C
A
61
43.831
9.835
13.322
1.00
12.14
N


ATOM
1003
C5
C
A
61
44.626
9.812
11.054
1.00
12.01
C


ATOM
1004
C6
C
A
61
44.688
10.474
9.885
1.00
12.81
C


ATOM
1005
P
G
A
62
48.728
12.846
6.660
1.00
24.16
P


ATOM
1006
O1P
G
A
62
49.270
13.445
5.410
1.00
23.67
O


ATOM
1007
O2P
G
A
62
48.943
11.426
6.942
1.00
23.26
O


ATOM
1008
O5*
G
A
62
49.205
13.690
7.911
1.00
26.62
O


ATOM
1009
C5*
G
A
62
50.589
13.855
8.182
1.00
30.96
C


ATOM
1010
C4*
G
A
62
50.774
14.353
9.582
1.00
33.50
C


ATOM
1011
O4*
G
A
62
50.528
13.270
10.506
1.00
33.47
O


ATOM
1012
C3*
G
A
62
52.218
14.771
9.809
1.00
34.85
C


ATOM
1013
O3*
G
A
62
52.203
15.955
10.570
1.00
37.99
O


ATOM
1014
C2*
G
A
62
52.906
13.588
10.482
1.00
34.23
C


ATOM
1015
O2*
G
A
62
53.808
13.966
11.496
1.00
36.44
O


ATOM
1016
C1*
G
A
62
51.721
12.929
11.174
1.00
33.41
C


ATOM
1017
N9
G
A
62
51.722
11.510
11.492
1.00
31.26
N


ATOM
1018
C8
G
A
62
52.117
10.448
10.718
1.00
31.91
C


ATOM
1019
N7
G
A
62
51.933
9.296
11.308
1.00
31.27
N


ATOM
1020
C5
G
A
62
51.407
9.628
12.552
1.00
30.56
C


ATOM
1021
C6
G
A
62
50.995
8.807
13.633
1.00
31.34
C


ATOM
1022
O6
G
A
62
51.018
7.569
13.717
1.00
32.16
O


ATOM
1023
N1
G
A
62
50.498
9.572
14.697
1.00
30.15
N


ATOM
1024
C2
G
A
62
50.401
10.940
14.706
1.00
26.83
C


ATOM
1025
N2
G
A
62
49.859
11.501
15.789
1.00
25.94
N


ATOM
1026
N3
G
A
62
50.792
11.704
13.717
1.00
28.92
N


ATOM
1027
C4
G
A
62
51.277
10.988
12.677
1.00
30.49
C


ATOM
1028
P
U
A
63
52.954
17.235
10.045
1.00
41.73
P


ATOM
1029
O1P
U
A
63
53.058
17.081
8.559
1.00
41.63
O


ATOM
1030
O2P
U
A
63
54.175
17.334
10.869
1.00
43.38
O


ATOM
1031
O5*
U
A
63
51.997
18.439
10.460
1.00
39.87
O


ATOM
1032
C5*
U
A
63
50.763
18.720
9.780
1.00
36.71
C


ATOM
1033
C4*
U
A
63
49.836
19.408
10.747
1.00
35.04
C


ATOM
1034
O4*
U
A
63
49.068
18.411
11.462
1.00
32.03
O


ATOM
1035
C3*
U
A
63
50.675
20.143
11.781
1.00
35.26
C


ATOM
1036
O3*
U
A
63
50.995
21.511
11.420
1.00
38.70
O


ATOM
1037
C2*
U
A
63
50.130
19.755
13.165
1.00
33.49
C


ATOM
1038
O2*
U
A
63
49.712
20.770
14.049
1.00
37.02
O


ATOM
1039
C1*
U
A
63
49.029
18.735
12.834
1.00
30.93
C


ATOM
1040
N1
U
A
63
48.986
17.488
13.615
1.00
27.66
N


ATOM
1041
C2
U
A
63
48.353
17.544
14.818
1.00
24.66
C


ATOM
1042
O2
U
A
63
47.901
18.573
15.248
1.00
22.79
O


ATOM
1043
N3
U
A
63
48.278
16.352
15.499
1.00
23.74
N


ATOM
1044
C4
U
A
63
48.798
15.145
15.102
1.00
23.73
C


ATOM
1045
O4
U
A
63
48.743
14.175
15.859
1.00
24.68
O


ATOM
1046
C5
U
A
63
49.463
15.178
13.841
1.00
24.93
C


ATOM
1047
C6
U
A
63
49.531
16.320
13.160
1.00
25.63
C


ATOM
1048
P
A
A
64
49.882
22.684
11.491
1.00
37.56
P


ATOM
1049
O1P
A
A
64
50.561
23.954
11.785
1.00
39.16
O


ATOM
1050
O2P
A
A
64
48.748
22.257
12.335
1.00
40.01
O


ATOM
1051
O5*
A
A
64
49.353
22.820
9.997
1.00
34.07
O


ATOM
1052
C5*
A
A
64
48.031
23.290
9.741
1.00
28.17
C


ATOM
1053
C4*
A
A
64
47.481
22.620
8.508
1.00
24.51
C


ATOM
1054
O4*
A
A
64
47.732
21.200
8.571
1.00
23.04
O


ATOM
1055
C3*
A
A
64
45.992
22.796
8.296
1.00
22.46
C


ATOM
1056
O3*
A
A
64
45.849
23.906
7.425
1.00
20.90
O


ATOM
1057
C2*
A
A
64
45.604
21.505
7.587
1.00
20.89
C


ATOM
1058
O2*
A
A
64
45.901
21.599
6.225
1.00
22.90
O


ATOM
1059
C1*
A
A
64
46.571
20.494
8.203
1.00
20.25
C


ATOM
1060
N9
A
A
64
46.130
19.781
9.396
1.00
17.85
N


ATOM
1061
C8
A
A
64
45.524
20.267
10.530
1.00
16.44
C


ATOM
1062
N7
A
A
64
45.377
19.367
11.477
1.00
16.36
N


ATOM
1063
C5
A
A
64
45.897
18.211
10.916
1.00
16.10
C


ATOM
1064
C6
A
A
64
46.056
16.912
11.416
1.00
17.55
C


ATOM
1065
N6
A
A
64
45.703
16.547
12.661
1.00
17.95
N


ATOM
1066
N1
A
A
64
46.602
15.982
10.594
1.00
16.51
N


ATOM
1067
C2
A
A
64
46.978
16.359
9.358
1.00
17.36
C


ATOM
1068
N3
A
A
64
46.889
17.563
8.781
1.00
17.52
N


ATOM
1069
C4
A
A
64
46.333
18.447
9.623
1.00
16.65
C


ATOM
1070
P
A
A
65
44.523
24.788
7.455
1.00
26.00
P


ATOM
1071
O1P
A
A
65
44.761
25.977
6.601
1.00
22.85
O


ATOM
1072
O2P
A
A
65
44.095
24.965
8.870
1.00
23.33
O


ATOM
1073
O5*
A
A
65
43.412
23.897
6.726
1.00
21.03
O


ATOM
1074
C5*
A
A
65
43.457
23.677
5.321
1.00
18.24
C


ATOM
1075
C4*
A
A
65
42.507
22.569
4.928
1.00
13.85
C


ATOM
1076
O4*
A
A
65
42.806
21.347
5.676
1.00
12.93
O


ATOM
1077
C3*
A
A
65
41.079
22.964
5.312
1.00
13.37
C


ATOM
1078
O3*
A
A
65
40.259
22.339
4.329
1.00
13.50
O


ATOM
1079
C2*
A
A
65
40.840
22.298
6.671
1.00
12.65
C


ATOM
1080
O2*
A
A
65
39.528
21.867
6.921
1.00
13.66
O


ATOM
1081
C1*
A
A
65
41.701
21.055
6.509
1.00
12.78
C


ATOM
1082
N9
A
A
65
42.072
20.249
7.661
1.00
12.77
N


ATOM
1083
C8
A
A
65
42.096
20.565
8.995
1.00
12.34
C


ATOM
1084
N7
A
A
65
42.398
19.546
9.767
1.00
12.41
N


ATOM
1085
C5
A
A
65
42.597
18.501
8.875
1.00
12.05
C


ATOM
1086
C6
A
A
65
42.910
17.149
9.070
1.00
10.90
C


ATOM
1087
N6
A
A
65
43.092
16.609
10.272
1.00
10.50
N


ATOM
1088
N1
A
A
65
43.027
16.366
7.977
1.00
11.90
N


ATOM
1089
C2
A
A
65
42.832
16.924
6.765
1.00
12.11
C


ATOM
1090
N3
A
A
65
42.525
18.188
6.460
1.00
11.24
N


ATOM
1091
C4
A
A
65
42.420
18.925
7.577
1.00
11.53
C


ATOM
1092
P
A
A
66
39.022
23.077
3.643
1.00
14.28
P


ATOM
1093
O1P
A
A
66
39.543
23.558
2.325
1.00
15.32
O


ATOM
1094
O2P
A
A
66
38.338
24.024
4.542
1.00
14.63
O


ATOM
1095
O5*
A
A
66
38.069
21.846
3.338
1.00
13.61
O


ATOM
1096
C5*
A
A
66
37.465
21.085
4.393
1.00
11.48
C


ATOM
1097
C4*
A
A
66
37.723
19.608
4.177
1.00
10.84
C


ATOM
1098
O4*
A
A
66
39.051
19.241
4.669
1.00
10.26
O


ATOM
1099
C3*
A
A
66
36.836
18.669
4.967
1.00
10.58
C


ATOM
1100
O3*
A
A
66
35.542
18.590
4.389
1.00
8.43
O


ATOM
1101
C2*
A
A
66
37.616
17.369
4.831
1.00
10.04
C


ATOM
1102
O2*
A
A
66
37.536
16.826
3.529
1.00
11.06
O


ATOM
1103
C1*
A
A
66
39.042
17.870
5.094
1.00
10.87
C


ATOM
1104
N9
A
A
66
39.240
17.798
6.548
1.00
10.17
N


ATOM
1105
C8
A
A
66
39.155
18.790
7.496
1.00
10.03
C


ATOM
1106
N7
A
A
66
39.261
18.348
8.733
1.00
8.56
N


ATOM
1107
C5
A
A
66
39.441
16.982
8.588
1.00
6.54
C


ATOM
1108
C6
A
A
66
39.556
15.933
9.533
1.00
10.49
C


ATOM
1109
N6
A
A
66
39.492
16.109
10.872
1.00
8.52
N


ATOM
1110
N1
A
A
66
39.716
14.683
9.059
1.00
10.25
N


ATOM
1111
C2
A
A
66
39.726
14.497
7.720
1.00
9.28
C


ATOM
1112
N3
A
A
66
39.599
15.400
6.739
1.00
6.99
N


ATOM
1113
C4
A
A
66
39.470
16.634
7.247
1.00
7.89
C


ATOM
1114
P
U
A
67
34.254
18.628
5.328
1.00
7.15
P


ATOM
1115
O1P
U
A
67
33.113
18.369
4.451
1.00
8.19
O


ATOM
1116
O2P
U
A
67
34.265
19.889
6.140
1.00
10.54
O


ATOM
1117
O5*
U
A
67
34.515
17.404
6.352
1.00
9.66
O


ATOM
1118
C5*
U
A
67
33.879
16.134
6.178
1.00
9.63
C


ATOM
1119
C4*
U
A
67
34.826
15.139
5.537
1.00
8.56
C


ATOM
1120
O4*
U
A
67
36.010
14.944
6.353
1.00
11.36
O


ATOM
1121
C3*
U
A
67
34.231
13.746
5.426
1.00
10.38
C


ATOM
1122
O3*
U
A
67
33.422
13.688
4.257
1.00
9.95
O


ATOM
1123
C2*
U
A
67
35.473
12.885
5.331
1.00
9.97
C


ATOM
1124
O2*
U
A
67
36.048
13.024
4.062
1.00
9.19
O


ATOM
1125
C1*
U
A
67
36.395
13.578
6.337
1.00
11.42
C


ATOM
1126
N1
U
A
67
36.318
13.066
7.718
1.00
10.80
N


ATOM
1127
C2
U
A
67
36.833
11.817
7.968
1.00
10.50
C


ATOM
1128
O2
U
A
67
37.312
11.117
7.101
1.00
10.56
O


ATOM
1129
N3
U
A
67
36.765
11.408
9.272
1.00
9.20
N


ATOM
1130
C4
U
A
67
36.239
12.104
10.335
1.00
13.14
C


ATOM
1131
O4
U
A
67
36.155
11.549
11.435
1.00
12.19
O


ATOM
1132
C5
U
A
67
35.717
13.400
10.002
1.00
12.31
C


ATOM
1133
C6
U
A
67
35.767
13.822
8.734
1.00
11.37
C


ATOM
1134
P
G
A
68
32.112
12.788
4.239
1.00
10.90
P


ATOM
1135
O1P
G
A
68
31.468
12.906
2.896
1.00
11.43
O


ATOM
1136
O2P
G
A
68
31.302
13.053
5.442
1.00
13.26
O


ATOM
1137
O5*
G
A
68
32.721
11.308
4.326
1.00
11.48
O


ATOM
1138
C5*
G
A
68
33.371
10.739
3.203
1.00
11.16
C


ATOM
1139
C4*
G
A
68
33.998
9.411
3.573
1.00
9.62
C


ATOM
1140
O4*
G
A
68
34.942
9.616
4.645
1.00
11.01
O


ATOM
1141
C3*
G
A
68
33.037
8.390
4.142
1.00
11.11
C


ATOM
1142
O3*
G
A
68
32.398
7.725
3.054
1.00
10.48
O


ATOM
1143
C2*
G
A
68
33.998
7.461
4.874
1.00
9.29
C


ATOM
1144
O2*
G
A
68
34.735
6.677
3.982
1.00
10.62
O


ATOM
1145
C1*
G
A
68
34.963
8.466
5.487
1.00
8.97
C


ATOM
1146
N9
G
A
68
34.546
8.820
6.844
1.00
8.81
N


ATOM
1147
C8
G
A
68
33.966
9.976
7.304
1.00
8.53
C


ATOM
1148
N7
G
A
68
33.723
9.947
8.594
1.00
8.57
N


ATOM
1149
C5
G
A
68
34.195
8.708
9.005
1.00
7.34
C


ATOM
1150
C6
G
A
68
34.222
8.109
10.303
1.00
6.86
C


ATOM
1151
O6
G
A
68
33.892
8.610
11.385
1.00
8.26
O


ATOM
1152
N1
G
A
68
34.707
6.806
10.256
1.00
5.47
N


ATOM
1153
C2
G
A
68
35.136
6.164
9.113
1.00
7.70
C


ATOM
1154
N2
G
A
68
35.520
4.887
9.242
1.00
7.97
N


ATOM
1155
N3
G
A
68
35.166
6.729
7.911
1.00
8.25
N


ATOM
1156
C4
G
A
68
34.676
7.991
7.934
1.00
8.75
C


ATOM
1157
P
U
A
69
30.971
7.049
3.252
1.00
13.77
P


ATOM
1158
O1P
U
A
69
30.555
6.595
1.924
1.00
11.85
O


ATOM
1159
O2P
U
A
69
30.093
7.968
3.998
1.00
12.81
O


ATOM
1160
O5*
U
A
69
31.231
5.779
4.195
1.00
11.34
O


ATOM
1161
C5*
U
A
69
32.060
4.700
3.787
1.00
11.10
C


ATOM
1162
C4*
U
A
69
32.278
3.754
4.958
1.00
9.72
C


ATOM
1163
O4*
U
A
69
33.037
4.436
6.006
1.00
8.84
O


ATOM
1164
C3*
U
A
69
31.010
3.319
5.663
1.00
8.43
C


ATOM
1165
O3*
U
A
69
30.409
2.218
4.976
1.00
10.14
O


ATOM
1166
C2*
U
A
69
31.556
2.908
7.017
1.00
8.62
C


ATOM
1167
O2*
U
A
69
32.248
1.685
6.931
1.00
7.78
O


ATOM
1168
C1*
U
A
69
32.576
4.009
7.273
1.00
8.22
C


ATOM
1169
N1
U
A
69
32.004
5.162
7.979
1.00
10.18
N


ATOM
1170
C2
U
A
69
31.922
5.065
9.360
1.00
10.69
C


ATOM
1171
O2
U
A
69
32.209
4.017
9.964
1.00
11.15
O


ATOM
1172
N3
U
A
69
31.478
6.210
10.003
1.00
9.29
N


ATOM
1173
C4
U
A
69
31.088
7.405
9.411
1.00
9.74
C


ATOM
1174
O4
U
A
69
30.787
8.391
10.131
1.00
10.59
O


ATOM
1175
C5
U
A
69
31.145
7.399
7.972
1.00
8.22
C


ATOM
1176
C6
U
A
69
31.587
6.303
7.317
1.00
8.57
C


ATOM
1177
P
C
A
70
28.841
1.987
5.039
1.00
11.98
P


ATOM
1178
O1P
C
A
70
28.566
1.076
3.893
1.00
13.06
O


ATOM
1179
O2P
C
A
70
28.126
3.300
5.133
1.00
13.69
O


ATOM
1180
O5*
C
A
70
28.566
1.234
6.428
1.00
12.93
O


ATOM
1181
C5*
C
A
70
29.111
−0.064
6.679
1.00
12.54
C


ATOM
1182
C4*
C
A
70
29.095
−0.367
8.163
1.00
10.94
C


ATOM
1183
O4*
C
A
70
29.877
0.624
8.879
1.00
12.72
O


ATOM
1184
C3*
C
A
70
27.735
−0.325
8.854
1.00
11.39
C


ATOM
1185
O3*
C
A
70
27.050
−1.575
8.672
1.00
10.71
O


ATOM
1186
C2*
C
A
70
28.156
−0.164
10.305
1.00
10.22
C


ATOM
1187
O2*
C
A
70
28.682
−1.378
10.815
1.00
10.73
O


ATOM
1188
C1*
C
A
70
29.319
0.816
10.169
1.00
10.56
C


ATOM
1189
N1
C
A
70
28.918
2.231
10.310
1.00
10.87
N


ATOM
1190
C2
C
A
70
28.815
2.754
11.585
1.00
10.52
C


ATOM
1191
O2
C
A
70
28.888
1.981
12.549
1.00
13.89
O


ATOM
1192
N3
C
A
70
28.618
4.069
11.750
1.00
9.90
N


ATOM
1193
C4
C
A
70
28.472
4.864
10.678
1.00
11.60
C


ATOM
1194
N4
C
A
70
28.339
6.173
10.891
1.00
9.69
N


ATOM
1195
C5
C
A
70
28.469
4.342
9.351
1.00
9.75
C


ATOM
1196
C6
C
A
70
28.697
3.027
9.213
1.00
10.87
C


ATOM
1197
P
C
A
71
25.506
−1.596
8.258
1.00
11.56
P


ATOM
1198
O1P
C
A
71
25.252
−2.969
7.708
1.00
11.38
O


ATOM
1199
O2P
C
A
71
25.231
−0.392
7.422
1.00
13.37
O


ATOM
1200
O5*
C
A
71
24.680
−1.449
9.614
1.00
10.61
O


ATOM
1201
C5*
C
A
71
24.617
−2.546
10.513
1.00
10.38
C


ATOM
1202
C4*
C
A
71
24.552
−2.068
11.939
1.00
10.89
C


ATOM
1203
O4*
C
A
71
25.639
−1.140
12.164
1.00
10.86
O


ATOM
1204
C3*
C
A
71
23.336
−1.255
12.362
1.00
11.86
C


ATOM
1205
O3*
C
A
71
22.249
−2.098
12.693
1.00
11.40
O


ATOM
1206
C2*
C
A
71
23.864
−0.579
13.618
1.00
11.64
C


ATOM
1207
O2*
C
A
71
23.975
−1.513
14.691
1.00
9.30
O


ATOM
1208
C1*
C
A
71
25.263
−0.193
13.149
1.00
10.57
C


ATOM
1209
N1
C
A
71
25.248
1.161
12.548
1.00
9.67
N


ATOM
1210
C2
C
A
71
25.243
2.271
13.429
1.00
10.84
C


ATOM
1211
O2
C
A
71
25.179
2.052
14.671
1.00
11.11
O


ATOM
1212
N3
C
A
71
25.283
3.534
12.917
1.00
8.57
N


ATOM
1213
C4
C
A
71
25.271
3.710
11.589
1.00
9.43
C


ATOM
1214
N4
C
A
71
25.262
4.948
11.132
1.00
6.44
N


ATOM
1215
C5
C
A
71
25.247
2.602
10.671
1.00
6.94
C


ATOM
1216
C6
C
A
71
25.240
1.353
11.196
1.00
9.46
C


ATOM
1217
P
G
A
72
20.758
−1.542
12.563
1.00
13.41
P


ATOM
1218
O1P
G
A
72
19.842
−2.631
12.899
1.00
12.96
O


ATOM
1219
O2P
G
A
72
20.611
−0.780
11.263
1.00
11.63
O


ATOM
1220
O5*
G
A
72
20.616
−0.447
13.712
1.00
11.41
O


ATOM
1221
C5*
G
A
72
20.561
−0.840
15.068
1.00
11.53
C


ATOM
1222
C4*
G
A
72
20.555
0.380
15.949
1.00
11.87
C


ATOM
1223
O4*
G
A
72
21.770
1.139
15.712
1.00
14.46
O


ATOM
1224
C3*
G
A
72
19.437
1.364
15.661
1.00
14.43
C


ATOM
1225
O3*
G
A
72
18.278
0.935
16.376
1.00
16.44
O


ATOM
1226
C2*
G
A
72
20.016
2.654
16.228
1.00
13.97
C


ATOM
1227
O2*
G
A
72
19.907
2.695
17.643
1.00
15.21
O


ATOM
1228
C1*
G
A
72
21.489
2.520
15.826
1.00
12.65
C


ATOM
1229
N9
G
A
72
21.759
3.170
14.550
1.00
12.14
N


ATOM
1230
C8
G
A
72
21.856
2.616
13.294
1.00
9.16
C


ATOM
1231
N7
G
A
72
22.026
3.517
12.353
1.00
11.34
N


ATOM
1232
C5
G
A
72
22.061
4.729
13.040
1.00
10.76
C


ATOM
1233
C6
G
A
72
22.202
6.078
12.562
1.00
11.86
C


ATOM
1234
O6
G
A
72
22.341
6.488
11.391
1.00
8.38
O


ATOM
1235
N1
G
A
72
22.159
6.993
13.612
1.00
11.84
N


ATOM
1236
C2
G
A
72
21.999
6.663
14.947
1.00
13.77
C


ATOM
1237
N2
G
A
72
21.950
7.693
15.815
1.00
9.84
N


ATOM
1238
N3
G
A
72
21.887
5.422
15.393
1.00
11.48
N


ATOM
1239
C4
G
A
72
21.919
4.522
14.396
1.00
11.62
C


ATOM
1240
P
A
A
73
16.809
1.286
15.834
1.00
19.26
P


ATOM
1241
O1P
A
A
73
15.885
0.363
16.510
1.00
22.32
O


ATOM
1242
O2P
A
A
73
16.775
1.381
14.361
1.00
19.69
O


ATOM
1243
O5*
A
A
73
16.569
2.740
16.433
1.00
17.72
O


ATOM
1244
C5*
A
A
73
15.307
3.388
16.323
1.00
17.97
C


ATOM
1245
C4*
A
A
73
15.431
4.809
16.822
1.00
17.32
C


ATOM
1246
O4*
A
A
73
16.427
5.494
16.024
1.00
16.54
O


ATOM
1247
C3*
A
A
73
14.189
5.661
16.679
1.00
17.30
C


ATOM
1248
O3*
A
A
73
13.363
5.484
17.811
1.00
18.59
O


ATOM
1249
C2*
A
A
73
14.773
7.060
16.630
1.00
17.35
C


ATOM
1250
O2*
A
A
73
15.216
7.464
17.913
1.00
16.54
O


ATOM
1251
C1*
A
A
73
16.012
6.819
15.770
1.00
15.89
C


ATOM
1252
N9
A
A
73
15.791
6.944
14.334
1.00
13.91
N


ATOM
1253
C8
A
A
73
15.673
5.940
13.408
1.00
13.93
C


ATOM
1254
N7
A
A
73
15.535
6.370
12.180
1.00
14.00
N


ATOM
1255
C5
A
A
73
15.532
7.754
12.312
1.00
13.79
C


ATOM
1256
C6
A
A
73
15.406
8.788
11.378
1.00
15.44
C


ATOM
1257
N6
A
A
73
15.244
8.581
10.068
1.00
13.32
N


ATOM
1258
N1
A
A
73
15.451
10.062
11.840
1.00
14.22
N


ATOM
1259
C2
A
A
73
15.592
10.262
13.158
1.00
14.69
C


ATOM
1260
N3
A
A
73
15.706
9.369
14.134
1.00
14.25
N


ATOM
1261
C4
A
A
73
15.679
8.119
13.634
1.00
14.11
C


ATOM
1262
P
C
A
74
11.774
5.640
17.658
1.00
19.05
P


ATOM
1263
O1P
C
A
74
11.240
5.400
19.022
1.00
21.21
O


ATOM
1264
O2P
C
A
74
11.309
4.816
16.524
1.00
18.62
O


ATOM
1265
O5*
C
A
74
11.567
7.165
17.261
1.00
17.10
O


ATOM
1266
C5*
C
A
74
11.953
8.213
18.140
1.00
16.41
C


ATOM
1267
C4*
C
A
74
11.888
9.522
17.409
1.00
18.55
C


ATOM
1268
O4*
C
A
74
12.836
9.487
16.307
1.00
18.29
O


ATOM
1269
C3*
C
A
74
10.569
9.816
16.709
1.00
19.36
C


ATOM
1270
O3*
C
A
74
9.611
10.304
17.650
1.00
19.83
O


ATOM
1271
C2*
C
A
74
11.008
10.851
15.678
1.00
18.77
C


ATOM
1272
O2*
C
A
74
11.287
12.128
16.243
1.00
19.03
O


ATOM
1273
C1*
C
A
74
12.342
10.257
15.224
1.00
15.75
C


ATOM
1274
N1
C
A
74
12.235
9.391
14.042
1.00
14.65
N


ATOM
1275
C2
C
A
74
12.221
9.987
12.789
1.00
15.39
C


ATOM
1276
O2
C
A
74
12.225
11.231
12.722
1.00
14.36
O


ATOM
1277
N3
C
A
74
12.192
9.204
11.677
1.00
14.56
N


ATOM
1278
C4
C
A
74
12.165
7.877
11.801
1.00
13.56
C


ATOM
1279
N4
C
A
74
12.187
7.143
10.683
1.00
13.38
N


ATOM
1280
C5
C
A
74
12.130
7.241
13.078
1.00
15.04
C


ATOM
1281
C6
C
A
74
12.167
8.030
14.162
1.00
12.74
C


ATOM
1282
P
U
A
75
8.046
10.090
17.379
1.00
20.97
P


ATOM
1283
O1P
U
A
75
7.326
10.728
18.515
1.00
20.97
O


ATOM
1284
O2P
U
A
75
7.770
8.678
17.051
1.00
19.40
O


ATOM
1285
O5*
U
A
75
7.759
10.948
16.067
1.00
19.78
O


ATOM
1286
C5*
U
A
75
7.940
12.353
16.067
1.00
19.76
C


ATOM
1287
C4*
U
A
75
7.692
12.920
14.689
1.00
19.28
C


ATOM
1288
O4*
U
A
75
8.781
12.515
13.804
1.00
19.26
O


ATOM
1289
C3*
U
A
75
6.437
12.422
13.980
1.00
19.85
C


ATOM
1290
O3*
U
A
75
5.251
13.115
14.372
1.00
21.73
O


ATOM
1291
C2*
U
A
75
6.784
12.675
12.521
1.00
18.07
C


ATOM
1292
O2*
U
A
75
6.648
14.044
12.198
1.00
18.55
O


ATOM
1293
C1*
U
A
75
8.269
12.294
12.503
1.00
18.72
C


ATOM
1294
N1
U
A
75
8.444
10.876
12.155
1.00
16.03
N


ATOM
1295
C2
U
A
75
8.528
10.591
10.821
1.00
14.55
C


ATOM
1296
O2
U
A
75
8.474
11.463
9.960
1.00
16.34
O


ATOM
1297
N3
U
A
75
8.689
9.269
10.515
1.00
14.24
N


ATOM
1298
C4
U
A
75
8.798
8.233
11.389
1.00
14.08
C


ATOM
1299
O4
U
A
75
9.094
7.130
10.954
1.00
15.54
O


ATOM
1300
C5
U
A
75
8.693
8.599
12.769
1.00
14.83
C


ATOM
1301
C6
U
A
75
8.516
9.890
13.096
1.00
15.06
C


ATOM
1302
P
A
A
76
3.836
12.353
14.321
1.00
22.14
P


ATOM
1303
O1P
A
A
76
2.880
13.223
15.033
1.00
23.41
O


ATOM
1304
O2P
A
A
76
4.009
10.938
14.753
1.00
21.70
O


ATOM
1305
O5*
A
A
76
3.415
12.341
12.788
1.00
21.20
O


ATOM
1306
C5*
A
A
76
3.300
13.552
12.057
1.00
20.43
C


ATOM
1307
C4*
A
A
76
2.982
13.261
10.610
1.00
21.53
C


ATOM
1308
O4*
A
A
76
4.176
12.824
9.881
1.00
20.76
O


ATOM
1309
C3*
A
A
76
1.979
12.147
10.385
1.00
21.60
C


ATOM
1310
O3*
A
A
76
0.644
12.615
10.600
1.00
24.70
O


ATOM
1311
C2*
A
A
76
2.288
11.751
8.942
1.00
20.85
C


ATOM
1312
O2*
A
A
76
1.813
12.741
8.049
1.00
18.38
O


ATOM
1313
C1*
A
A
76
3.818
11.834
8.924
1.00
18.33
C


ATOM
1314
N9
A
A
76
4.448
10.557
9.291
1.00
18.89
N


ATOM
1315
C8
A
A
76
4.743
10.092
10.543
1.00
18.84
C


ATOM
1316
N7
A
A
76
5.292
8.890
10.552
1.00
19.05
N


ATOM
1317
C5
A
A
76
5.370
8.549
9.213
1.00
17.84
C


ATOM
1318
C6
A
A
76
5.873
7.410
8.545
1.00
18.46
C


ATOM
1319
N6
A
A
76
6.430
6.353
9.151
1.00
16.69
N


ATOM
1320
N1
A
A
76
5.789
7.395
7.198
1.00
18.58
N


ATOM
1321
C2
A
A
76
5.262
8.451
6.577
1.00
18.00
C


ATOM
1322
N3
A
A
76
4.779
9.581
7.089
1.00
17.69
N


ATOM
1323
C4
A
A
76
4.855
9.566
8.424
1.00
18.36
C


ATOM
1324
P
U
A
77
−0.490
11.589
11.092
1.00
27.25
P


ATOM
1325
O1P
U
A
77
−1.718
12.394
11.304
1.00
28.56
O


ATOM
1326
O2P
U
A
77
0.014
10.721
12.181
1.00
27.58
O


ATOM
1327
O5*
U
A
77
−0.735
10.633
9.847
1.00
26.15
O


ATOM
1328
C5*
U
A
77
−1.346
11.123
8.674
1.00
27.17
C


ATOM
1329
C4*
U
A
77
−1.337
10.063
7.612
1.00
28.07
C


ATOM
1330
O4*
U
A
77
0.040
9.755
7.267
1.00
27.42
O


ATOM
1331
C3*
U
A
77
−1.901
8.720
8.045
1.00
28.00
C


ATOM
1332
O3*
U
A
77
−3.318
8.669
7.947
1.00
29.45
O


ATOM
1333
C2*
U
A
77
−1.240
7.775
7.055
1.00
28.20
C


ATOM
1334
O2*
U
A
77
−1.847
7.832
5.782
1.00
26.90
O


ATOM
1335
C1*
U
A
77
0.157
8.381
6.958
1.00
27.65
C


ATOM
1336
N1
U
A
77
1.088
7.745
7.899
1.00
27.42
N


ATOM
1337
C2
U
A
77
1.698
6.593
7.466
1.00
27.52
C


ATOM
1338
O2
U
A
77
1.493
6.121
6.373
1.00
28.51
O


ATOM
1339
N3
U
A
77
2.553
6.005
8.360
1.00
28.43
N


ATOM
1340
C4
U
A
77
2.855
6.437
9.623
1.00
27.99
C


ATOM
1341
O4
U
A
77
3.638
5.774
10.307
1.00
28.56
O


ATOM
1342
C5
U
A
77
2.183
7.651
10.018
1.00
29.06
C


ATOM
1343
C6
U
A
77
1.340
8.252
9.153
1.00
27.47
C


ATOM
1344
P
G
A
78
−4.128
7.571
8.789
1.00
29.69
P


ATOM
1345
O1P
G
A
78
−5.574
7.922
8.682
1.00
29.99
O


ATOM
1346
O2P
G
A
78
−3.493
7.464
10.125
1.00
29.83
O


ATOM
1347
O5*
G
A
78
−3.864
6.206
8.025
1.00
28.85
O


ATOM
1348
C5*
G
A
78
−4.288
6.057
6.687
1.00
31.49
C


ATOM
1349
C4*
G
A
78
−3.746
4.786
6.103
1.00
33.11
C


ATOM
1350
O4*
G
A
78
−2.290
4.821
6.141
1.00
32.92
O


ATOM
1351
C3*
G
A
78
−4.069
3.537
6.904
1.00
34.56
C


ATOM
1352
O3*
G
A
78
−5.382
3.042
6.685
1.00
36.42
O


ATOM
1353
C2*
G
A
78
−2.993
2.572
6.437
1.00
33.46
C


ATOM
1354
O2*
G
A
78
−3.300
2.058
5.155
1.00
33.17
O


ATOM
1355
C1*
G
A
78
−1.788
3.506
6.335
1.00
32.77
C


ATOM
1356
N9
G
A
78
−0.943
3.484
7.528
1.00
31.48
N


ATOM
1357
C8
G
A
78
−0.885
4.406
8.552
1.00
30.74
C


ATOM
1358
N7
G
A
78
0.017
4.111
9.453
1.00
30.41
N


ATOM
1359
C5
G
A
78
0.574
2.921
9.000
1.00
30.92
C


ATOM
1360
C6
G
A
78
1.605
2.117
9.547
1.00
30.88
C


ATOM
1361
O6
G
A
78
2.260
2.301
10.569
1.00
31.93
O


ATOM
1362
N1
G
A
78
1.852
0.992
8.763
1.00
31.31
N


ATOM
1363
C2
G
A
78
1.200
0.683
7.596
1.00
31.42
C


ATOM
1364
N2
G
A
78
1.576
−0.452
6.973
1.00
31.76
N


ATOM
1365
N3
G
A
78
0.244
1.429
7.073
1.00
31.87
N


ATOM
1366
C4
G
A
78
−0.017
2.521
7.822
1.00
30.78
C


ATOM
1367
P
U
A
79
−6.043
2.064
7.771
1.00
35.59
P


ATOM
1368
O1P
U
A
79
−7.437
1.843
7.318
1.00
38.84
O


ATOM
1369
O2P
U
A
79
−5.793
2.598
9.125
1.00
34.83
O


ATOM
1370
O5*
U
A
79
−5.233
0.710
7.607
1.00
37.13
O


ATOM
1371
C5*
U
A
79
−5.230
0.014
6.370
1.00
40.35
C


ATOM
1372
C4*
U
A
79
−4.500
−1.284
6.529
1.00
42.63
C


ATOM
1373
O4*
U
A
79
−3.087
−1.022
6.688
1.00
41.91
O


ATOM
1374
C3*
U
A
79
−4.878
−2.028
7.794
1.00
45.01
C


ATOM
1375
O3*
U
A
79
−6.059
−2.771
7.564
1.00
49.49
O


ATOM
1376
C2*
U
A
79
−3.652
−2.897
8.039
1.00
42.79
C


ATOM
1377
O2*
U
A
79
−3.632
−4.039
7.212
1.00
43.51
O


ATOM
1378
C1*
U
A
79
−2.530
−1.946
7.614
1.00
41.31
C


ATOM
1379
N1
U
A
79
−1.962
−1.177
8.732
1.00
38.39
N


ATOM
1380
C2
U
A
79
−0.932
−1.753
9.457
1.00
38.66
C


ATOM
1381
O2
U
A
79
−0.492
−2.866
9.215
1.00
38.18
O


ATOM
1382
N3
U
A
79
−0.438
−0.978
10.475
1.00
36.92
N


ATOM
1383
C4
U
A
79
−0.855
0.280
10.830
1.00
36.93
C


ATOM
1384
O4
U
A
79
−0.255
0.884
11.715
1.00
36.60
O


ATOM
1385
C5
U
A
79
−1.931
0.800
10.037
1.00
36.98
C


ATOM
1386
C6
U
A
79
−2.432
0.070
9.043
1.00
37.15
C


ATOM
1387
P
C
A
80
−6.888
−3.348
8.809
1.00
52.93
P


ATOM
1388
O1P
C
A
80
−8.178
−3.837
8.240
1.00
53.20
O


ATOM
1389
O2P
C
A
80
−6.901
−2.351
9.924
1.00
51.64
O


ATOM
1390
O5*
C
A
80
−6.006
−4.597
9.252
1.00
53.85
O


ATOM
1391
C5*
C
A
80
−6.151
−5.164
10.536
1.00
56.78
C


ATOM
1392
C4*
C
A
80
−4.996
−6.079
10.826
1.00
58.70
C


ATOM
1393
O4*
C
A
80
−3.764
−5.451
10.400
1.00
58.63
O


ATOM
1394
C3*
C
A
80
−4.785
−6.404
12.289
1.00
60.66
C


ATOM
1395
O3*
C
A
80
−5.653
−7.443
12.718
1.00
64.54
O


ATOM
1396
C2*
C
A
80
−3.302
−6.748
12.328
1.00
59.49
C


ATOM
1397
O2*
C
A
80
−3.012
−8.056
11.868
1.00
59.82
O


ATOM
1398
C1*
C
A
80
−2.746
−5.710
11.351
1.00
57.90
C


ATOM
1399
N1
C
A
80
−2.483
−4.432
12.019
1.00
56.16
N


ATOM
1400
C2
C
A
80
−1.428
−4.330
12.923
1.00
55.40
C


ATOM
1401
O2
C
A
80
−0.739
−5.332
13.160
1.00
55.60
O


ATOM
1402
N3
C
A
80
−1.184
−3.140
13.516
1.00
54.79
N


ATOM
1403
C4
C
A
80
−1.948
−2.085
13.236
1.00
54.67
C


ATOM
1404
N4
C
A
80
−1.659
−0.931
13.830
1.00
54.49
N


ATOM
1405
C5
C
A
80
−3.038
−2.167
12.329
1.00
55.04
C


ATOM
1406
C6
C
A
80
−3.268
−3.348
11.748
1.00
55.31
C


ATOM
1407
P
C
A
81
−6.452
−7.275
14.101
1.00
67.04
P


ATOM
1408
O1P
C
A
81
−7.534
−8.288
14.135
1.00
67.56
O


ATOM
1409
O2P
C
A
81
−6.793
−5.832
14.228
1.00
66.62
O


ATOM
1410
O5*
C
A
81
−5.356
−7.637
15.200
1.00
67.52
O


ATOM
1411
C5*
C
A
81
−4.468
−8.729
14.996
1.00
68.78
C


ATOM
1412
C4*
C
A
81
−3.176
−8.504
15.741
1.00
70.00
C


ATOM
1413
O4*
C
A
81
−2.545
−7.295
15.239
1.00
70.26
O


ATOM
1414
C3*
C
A
81
−3.315
−8.246
17.236
1.00
70.40
C


ATOM
1415
O3*
C
A
81
−3.607
−9.405
18.044
1.00
71.08
O


ATOM
1416
C2*
C
A
81
−2.027
−7.493
17.557
1.00
70.66
C


ATOM
1417
O2*
C
A
81
−0.891
−8.331
17.691
1.00
70.46
O


ATOM
1418
C1*
C
A
81
−1.866
−6.637
16.299
1.00
70.42
C


ATOM
1419
N1
C
A
81
−2.395
−5.267
16.429
1.00
70.24
N


ATOM
1420
C2
C
A
81
−1.749
−4.383
17.301
1.00
69.97
C


ATOM
1421
O2
C
A
81
−0.778
−4.792
17.959
1.00
69.96
O


ATOM
1422
N3
C
A
81
−2.199
−3.115
17.406
1.00
69.86
N


ATOM
1423
C4
C
A
81
−3.253
−2.719
16.687
1.00
69.98
C


ATOM
1424
N4
C
A
81
−3.655
−1.451
16.812
1.00
70.37
N


ATOM
1425
C5
C
A
81
−3.940
−3.603
15.805
1.00
69.65
C


ATOM
1426
C6
C
A
81
−3.483
−4.857
15.708
1.00
70.02
C


TER
1427

C
A
81


HETATM
1428
N1
SPD

95
30.635
15.348
6.786
1.00
32.95
N


HETATM
1429
C2
SPD

95
29.601
15.137
7.638
1.00
34.77
C


HETATM
1430
C3
SPD

95
28.160
14.794
7.150
1.00
32.97
C


HETATM
1431
C4
SPD

95
27.154
15.912
6.737
1.00
35.47
C


HETATM
1432
C5
SPD

95
27.896
17.206
6.216
1.00
34.05
C


HETATM
1433
N6
SPD

95
28.356
18.280
7.154
1.00
35.93
N


HETATM
1434
C7
SPD

95
28.129
18.216
8.696
1.00
37.54
C


HETATM
1435
C8
SPD

95
26.699
18.358
9.285
1.00
36.82
C


HETATM
1436
C9
SPD

95
25.563
18.963
8.358
1.00
39.41
C


HETATM
1437
N10
SPD

95
24.963
18.146
7.432
1.00
38.63
N


HETATM
1438
C
ACT

96
35.752
18.872
10.859
1.00
11.75
C


HETATM
1439
O
ACT

96
35.666
18.447
12.039
1.00
12.77
O


HETATM
1440
OXT
ACT

96
35.855
20.114
10.438
1.00
13.73
O


HETATM
1441
CH3
ACT

96
35.725
17.822
9.762
1.00
10.36
C


HETATM
1442
CO
NCO

101
33.779
17.811
0.520
1.00
12.16
CO


HETATM
1443
N1
NCO

101
35.628
18.020
1.035
1.00
11.26
N


HETATM
1444
N2
NCO

101
31.912
17.605
0.016
1.00
11.81
N


HETATM
1445
N3
NCO

101
34.248
16.189
−0.528
1.00
12.00
N


HETATM
1446
N4
NCO

101
33.306
19.417
1.565
1.00
12.05
N


HETATM
1447
N5
NCO

101
33.488
16.682
2.134
1.00
11.77
N


HETATM
1448
N6
NCO

101
34.079
18.927
−1.063
1.00
10.05
N


HETATM
1449
CO
NCO

102
30.972
15.538
21.040
1.00
18.89
CO


HETATM
1450
N1
NCO

102
32.077
13.978
20.540
1.00
18.01
N


HETATM
1451
N2
NCO

102
29.861
17.079
21.546
1.00
20.34
N


HETATM
1452
N3
NCO

102
29.408
14.674
20.281
1.00
18.96
N


HETATM
1453
N4
NCO

102
32.540
16.407
21.793
1.00
19.16
N


HETATM
1454
N5
NCO

102
30.622
14.700
22.802
1.00
21.31
N


HETATM
1455
N6
NCO

102
31.348
16.357
19.288
1.00
18.20
N


HETATM
1456
CO
NCO

103
30.079
10.969
13.083
1.00
9.34
CO


HETATM
1457
N1
NCO

103
31.143
9.318
12.937
1.00
10.62
N


HETATM
1458
N2
NCO

103
29.016
12.607
13.231
1.00
10.47
N


HETATM
1459
N3
NCO

103
28.629
9.958
13.953
1.00
9.90
N


HETATM
1460
N4
NCO

103
31.530
11.994
12.228
1.00
11.47
N


HETATM
1461
N5
NCO

103
30.887
11.312
14.848
1.00
10.93
N


HETATM
1462
N6
NCO

103
29.293
10.591
11.335
1.00
8.96
N


HETATM
1463
CO
NCO

104
35.738
25.339
6.924
1.00
10.52
CO


HETATM
1464
N1
NCO

104
37.192
23.991
6.966
1.00
9.12
N


HETATM
1465
N2
NCO

104
34.292
26.667
6.917
1.00
9.97
N


HETATM
1466
N3
NCO

104
35.290
24.803
5.079
1.00
10.32
N


HETATM
1467
N4
NCO

104
36.200
25.850
8.782
1.00
10.04
N


HETATM
1468
N5
NCO

104
34.462
23.982
7.610
1.00
10.02
N


HETATM
1469
N6
NCO

104
36.989
26.673
6.255
1.00
9.93
N


HETATM
1470
CO
NCO

105
38.319
9.181
18.354
1.00
17.15
CO


HETATM
1471
N1
NCO

105
38.565
7.212
18.263
1.00
16.21
N


HETATM
1472
N2
NCO

105
38.063
11.131
18.455
1.00
16.52
N


HETATM
1473
N3
NCO

105
37.166
9.118
16.756
1.00
15.37
N


HETATM
1474
N4
NCO

105
39.472
9.231
19.943
1.00
15.14
N


HETATM
1475
N5
NCO

105
36.747
8.927
19.488
1.00
13.92
N


HETATM
1476
N6
NCO

105
39.894
9.428
17.221
1.00
16.03
N


HETATM
1477
CO
NCO

106
29.797
11.611
−0.683
1.00
29.10
CO


HETATM
1478
N1
NCO

106
30.133
9.996
0.412
1.00
26.22
N


HETATM
1479
N2
NCO

106
29.459
13.224
−1.786
1.00
27.55
N


HETATM
1480
N3
NCO

106
27.947
10.996
−0.979
1.00
27.31
N


HETATM
1481
N4
NCO

106
31.647
12.200
−0.388
1.00
26.94
N


HETATM
1482
N5
NCO

106
29.213
12.598
0.926
1.00
26.78
N


HETATM
1483
N6
NCO

106
30.382
10.638
−2.307
1.00
27.86
N


HETATM
1484
CO
NCO

107
21.064
21.824
13.859
1.00
66.74
CO


HETATM
1485
N1
NCO

107
19.817
20.309
13.966
1.00
65.15
N


HETATM
1486
N2
NCO

107
22.316
23.350
13.744
1.00
64.75
N


HETATM
1487
N3
NCO

107
20.426
22.219
12.039
1.00
64.82
N


HETATM
1488
N4
NCO

107
21.696
21.423
15.684
1.00
65.00
N


HETATM
1489
N5
NCO

107
19.682
23.014
14.604
1.00
65.37
N


HETATM
1490
N6
NCO

107
22.448
20.636
13.116
1.00
64.84
N


HETATM
1491
CO
NCO

108
11.892
21.884
3.085
1.00
43.83
CO


HETATM
1492
N1
NCO

108
12.540
20.757
4.557
1.00
41.58
N


HETATM
1493
N2
NCO

108
11.237
23.007
1.611
1.00
41.85
N


HETATM
1494
N3
NCO

108
10.087
21.837
3.873
1.00
39.63
N


HETATM
1495
N4
NCO

108
13.697
21.930
2.306
1.00
42.07
N


HETATM
1496
N5
NCO

108
12.302
23.497
4.132
1.00
42.42
N


HETATM
1497
N6
NCO

108
11.489
20.271
2.035
1.00
42.59
N


HETATM
1498
CO
NCO

109
22.085
1.990
6.248
1.00
22.54
CO


HETATM
1499
N1
NCO

109
23.660
1.327
5.263
1.00
21.69
N


HETATM
1500
N2
NCO

109
20.512
2.641
7.231
1.00
23.54
N


HETATM
1501
N3
NCO

109
21.995
0.289
7.247
1.00
23.66
N


HETATM
1502
N4
NCO

109
22.190
3.682
5.266
1.00
24.99
N


HETATM
1503
N5
NCO

109
23.250
2.740
7.635
1.00
24.19
N


HETATM
1504
N6
NCO

109
20.916
1.236
4.862
1.00
24.71
N


HETATM
1505
CO
NCO

110
11.907
4.743
−4.202
1.00
79.10
CO


HETATM
1506
N1
NCO

110
13.089
3.173
−4.157
1.00
78.25
N


HETATM
1507
N2
NCO

110
10.722
6.317
−4.246
1.00
77.68
N


HETATM
1508
N3
NCO

110
10.338
3.559
−4.135
1.00
78.03
N


HETATM
1509
N4
NCO

110
13.479
5.919
−4.271
1.00
78.27
N


HETATM
1510
N5
NCO

110
11.939
4.818
−2.231
1.00
77.98
N


HETATM
1511
N6
NCO

110
11.872
4.671
−6.166
1.00
77.97
N


HETATM
1512
CO
NCO

111
41.881
27.688
12.825
1.00
41.44
CO


HETATM
1513
N1
NCO

111
42.431
25.789
12.719
1.00
39.32
N


HETATM
1514
N2
NCO

111
41.334
29.578
12.935
1.00
39.83
N


HETATM
1515
N3
NCO

111
40.110
27.135
13.502
1.00
38.72
N


HETATM
1516
N4
NCO

111
43.650
28.247
12.141
1.00
39.72
N


HETATM
1517
N5
NCO

111
42.556
27.780
14.672
1.00
39.67
N


HETATM
1518
N6
NCO

111
41.206
27.586
10.976
1.00
39.64
N


HETATM
1519
CO
NCO

112
15.544
−2.470
12.503
0.64
33.78
CO


HETATM
1520
N1
NCO

112
15.053
−4.150
13.387
0.64
33.08
N


HETATM
1521
N2
NCO

112
16.035
−0.799
11.617
0.64
32.88
N


HETATM
1522
N3
NCO

112
13.989
−1.592
13.306
0.64
33.14
N


HETATM
1523
N4
NCO

112
17.106
−3.351
11.708
0.64
31.89
N


HETATM
1524
N5
NCO

112
16.630
−1.966
14.059
0.64
31.87
N


HETATM
1525
N6
NCO

112
14.456
−2.963
10.949
0.64
33.59
N


HETATM
1526
N1
HPA

90
11.850
10.680
9.276
1.00
14.10
N


HETATM
1527
C2
HPA

90
11.445
12.017
9.429
1.00
15.38
C


HETATM
1528
N3
HPA

90
11.023
12.794
8.399
1.00
15.35
N


HETATM
1529
C4
HPA

90
11.032
12.144
7.215
1.00
13.23
C


HETATM
1530
C5
HPA

90
11.464
10.697
6.984
1.00
14.15
C


HETATM
1531
C6
HPA

90
11.894
9.936
8.082
1.00
14.68
C


HETATM
1532
O6
HPA

90
12.274
8.774
8.128
1.00
17.66
O


HETATM
1533
N7
HPA

90
11.319
10.452
5.646
1.00
14.57
N


HETATM
1534
C8
HPA

90
10.837
11.601
5.068
1.00
10.65
C


HETATM
1535
N9
HPA

90
10.654
12.637
5.980
1.00
13.29
N


HETATM
1536
O
HOH

301
27.846
−2.938
13.825
1.00
9.01
O


HETATM
1537
O
HOH

302
25.859
31.056
15.513
1.00
15.15
O


HETATM
1538
O
HOH

304
31.973
24.578
6.159
1.00
8.16
O


HETATM
1539
O
HOH

305
26.612
9.170
11.460
1.00
9.38
O


HETATM
1540
O
HOH

306
27.263
29.190
17.094
1.00
12.73
O


HETATM
1541
O
HOH

307
27.839
31.120
11.251
1.00
8.67
O


HETATM
1542
O
HOH

308
15.358
10.206
7.797
1.00
12.64
O


HETATM
1543
O
HOH

309
36.106
21.180
7.822
1.00
7.98
O


HETATM
1544
O
HOH

310
38.999
22.794
9.431
1.00
11.94
O


HETATM
1545
O
HOH

311
27.651
7.462
8.332
1.00
11.92
O


HETATM
1546
O
HOH

312
32.982
0.834
9.462
1.00
8.08
O


HETATM
1547
O
HOH

313
32.773
15.822
11.031
1.00
11.55
O


HETATM
1548
O
HOH

314
28.650
−0.366
14.062
1.00
14.04
O


HETATM
1549
O
HOH

315
28.295
12.130
16.348
1.00
14.09
O


HETATM
1550
O
HOH

316
35.130
20.473
16.933
1.00
11.57
O


HETATM
1551
O
HOH

317
31.081
33.340
12.189
1.00
12.95
O


HETATM
1552
O
HOH

318
34.682
19.005
19.405
1.00
15.48
O


HETATM
1553
O
HOH

319
33.063
10.330
19.697
1.00
15.28
O


HETATM
1554
O
HOH

320
33.163
29.459
16.840
1.00
13.11
O


HETATM
1555
O
HOH

321
25.367
21.927
13.760
1.00
16.06
O


HETATM
1556
O
HOH

322
37.325
28.769
8.362
1.00
10.37
O


HETATM
1557
O
HOH

323
23.452
−4.001
6.017
1.00
18.50
O


HETATM
1558
O
HOH

324
9.471
20.115
5.676
1.00
20.63
O


HETATM
1559
O
HOH

325
31.413
13.389
17.912
1.00
14.48
O


HETATM
1560
O
HOH

326
30.447
21.096
−4.264
1.00
19.06
O


HETATM
1561
O
HOH

327
29.636
28.818
15.501
1.00
7.36
O


HETATM
1562
O
HOH

328
39.664
24.428
12.189
1.00
15.97
O


HETATM
1563
O
HOH

329
31.754
11.671
9.425
1.00
10.07
O


HETATM
1564
O
HOH

330
31.792
8.455
15.807
1.00
6.64
O


HETATM
1565
O
HOH

331
31.505
23.268
17.911
1.00
11.89
O


HETATM
1566
O
HOH

332
18.453
12.462
12.598
1.00
17.54
O


HETATM
1567
O
HOH

333
18.443
26.704
16.319
1.00
25.32
O


HETATM
1568
O
HOH

334
32.785
19.986
8.347
1.00
15.97
O


HETATM
1569
O
HOH

335
41.786
5.546
14.867
1.00
15.87
O


HETATM
1570
O
HOH

336
7.409
12.505
5.270
1.00
17.89
O


HETATM
1571
O
HOH

337
21.882
4.512
9.287
1.00
19.12
O


HETATM
1572
O
HOH

338
31.187
10.298
17.815
1.00
16.86
O


HETATM
1573
O
HOH

339
26.840
32.305
5.286
1.00
19.04
O


HETATM
1574
O
HOH

340
15.679
12.782
11.026
1.00
14.61
O


HETATM
1575
O
HOH

341
25.408
5.517
8.271
1.00
14.76
O


HETATM
1576
O
HOH

342
30.043
11.328
7.160
1.00
8.77
O


HETATM
1577
O
HOH

343
23.127
14.905
8.144
1.00
14.88
O


HETATM
1578
O
HOH

344
38.365
6.804
15.177
1.00
18.36
O


HETATM
1579
O
HOH

345
35.022
29.298
5.821
1.00
14.35
O


HETATM
1580
O
HOH

346
30.511
24.456
3.612
1.00
14.15
O


HETATM
1581
O
HOH

347
30.481
34.557
3.130
1.00
11.78
O


HETATM
1582
O
HOH

348
26.554
9.913
5.375
1.00
18.81
O


HETATM
1583
O
HOH

349
31.209
21.766
3.243
1.00
18.82
O


HETATM
1584
O
HOH

350
1.741
13.549
1.745
1.00
20.27
O


HETATM
1585
O
HOH

351
29.321
2.066
1.345
1.00
21.19
O


HETATM
1586
O
HOH

352
4.877
8.231
0.914
1.00
21.48
O


HETATM
1587
O
HOH

353
21.713
1.577
10.203
1.00
20.65
O


HETATM
1588
O
HOH

354
16.006
7.690
6.859
1.00
14.39
O


HETATM
1589
O
HOH

355
32.374
14.264
8.770
1.00
13.50
O


HETATM
1590
O
HOH

356
44.956
7.126
13.453
1.00
18.02
O


HETATM
1591
O
HOH

357
34.420
0.993
5.499
1.00
15.80
O


HETATM
1592
O
HOH

358
34.107
10.687
13.188
1.00
21.21
O


HETATM
1593
O
HOH

359
43.241
16.647
21.460
1.00
20.68
O


HETATM
1594
O
HOH

360
29.177
23.123
7.231
1.00
19.78
O


HETATM
1595
O
HOH

361
20.913
10.719
−2.240
1.00
38.20
O


HETATM
1596
O
HOH

362
17.211
12.442
7.676
1.00
12.64
O


HETATM
1597
O
HOH

363
23.238
7.190
7.238
1.00
16.38
O


HETATM
1598
O
HOH

364
23.098
23.481
10.458
1.00
16.85
O


HETATM
1599
O
HOH

365
19.273
4.003
10.474
1.00
14.55
O


HETATM
1600
O
HOH

366
29.119
2.067
20.617
1.00
16.11
O


HETATM
1601
O
HOH

367
13.755
11.592
−2.619
1.00
19.43
O


HETATM
1602
O
HOH

368
27.008
15.371
13.742
1.00
16.03
O


HETATM
1603
O
HOH

370
40.117
15.541
24.855
1.00
17.75
O


HETATM
1604
O
HOH

371
42.204
24.722
15.562
1.00
20.80
O


HETATM
1605
O
HOH

372
33.334
27.064
4.009
1.00
14.27
O


HETATM
1606
O
HOH

373
34.369
22.764
−0.160
1.00
16.16
O


HETATM
1607
O
HOH

374
26.150
9.913
8.539
1.00
17.96
O


HETATM
1608
O
HOH

375
40.783
11.103
6.916
1.00
14.78
O


HETATM
1609
O
HOH

376
35.928
21.012
1.466
1.00
26.24
O


HETATM
1610
O
HOH

377
16.646
10.599
−2.661
1.00
19.58
O


HETATM
1611
O
HOH

378
25.562
13.304
6.313
1.00
18.69
O


HETATM
1612
O
HOH

379
20.343
−1.589
8.728
1.00
18.93
O


HETATM
1613
O
HOH

380
17.478
4.521
8.063
1.00
17.20
O


HETATM
1614
O
HOH

381
37.683
27.819
10.864
1.00
19.86
O


HETATM
1615
O
HOH

382
50.069
8.354
17.181
1.00
25.65
O


HETATM
1616
O
HOH

383
22.215
19.277
−1.414
1.00
21.60
O


HETATM
1617
O
HOH

384
26.899
12.793
23.127
1.00
21.66
O


HETATM
1618
O
HOH

385
27.310
20.284
5.282
1.00
23.43
O


HETATM
1619
O
HOH

386
35.477
14.521
1.832
1.00
16.09
O


HETATM
1620
O
HOH

387
36.421
5.374
5.731
1.00
18.50
O


HETATM
1621
O
HOH

388
35.657
13.089
21.646
1.00
27.43
O


HETATM
1622
O
HOH

389
31.262
−0.028
15.535
1.00
26.71
O


HETATM
1623
O
HOH

390
12.258
13.376
14.333
1.00
23.39
O


HETATM
1624
O
HOH

391
41.547
24.260
9.781
1.00
18.53
O


HETATM
1625
O
HOH

392
45.074
23.301
12.058
1.00
34.28
O


HETATM
1626
O
HOH

393
26.074
11.518
13.046
1.00
13.39
O


HETATM
1627
O
HOH

395
5.587
8.611
15.039
1.00
30.05
O


HETATM
1628
O
HOH

396
16.781
−2.191
17.148
1.00
29.65
O


HETATM
1629
O
HOH

398
39.000
−1.725
12.841
1.00
27.22
O


HETATM
1630
O
HOH

399
45.596
20.807
13.713
1.00
22.64
O


HETATM
1631
O
HOH

400
39.024
5.351
6.718
1.00
27.45
O


HETATM
1632
O
HOH

401
19.128
6.103
17.172
1.00
23.23
O


HETATM
1633
O
HOH

402
−2.414
15.769
−5.977
1.00
28.56
O


HETATM
1634
O
HOH

404
35.863
1.066
10.034
1.00
21.22
O


HETATM
1635
O
HOH

405
45.781
6.088
10.831
1.00
32.51
O


HETATM
1636
O
HOH

406
20.680
25.716
18.205
1.00
23.14
O


HETATM
1637
O
HOH

407
26.449
23.755
−4.077
1.00
19.44
O


HETATM
1638
O
HOH

408
13.404
6.712
6.444
1.00
20.71
O


HETATM
1639
O
HOH

409
33.912
22.171
4.658
1.00
18.92
O


HETATM
1640
O
HOH

410
39.629
15.715
13.211
1.00
93.76
O


HETATM
1641
O
HOH

411
35.297
11.656
18.352
1.00
21.88
O


HETATM
1642
O
HOH

412
10.954
4.182
2.008
1.00
23.12
O


HETATM
1643
O
HOH

413
28.947
21.113
11.642
1.00
22.81
O


HETATM
1644
O
HOH

414
5.170
7.273
12.633
1.00
27.36
O


HETATM
1645
O
HOH

415
28.471
16.583
18.095
1.00
22.56
O


HETATM
1646
O
HOH

416
4.654
4.945
23.880
1.00
52.57
O


HETATM
1647
O
HOH

419
38.860
4.458
16.740
1.00
33.74
O


HETATM
1648
O
HOH

420
−2.549
7.590
−3.638
1.00
24.47
O


HETATM
1649
O
HOH

421
45.977
14.324
21.673
1.00
21.22
O


HETATM
1650
O
HOH

422
21.196
23.126
8.094
1.00
24.30
O


HETATM
1651
O
HOH

423
24.720
30.789
3.238
1.00
17.59
O


HETATM
1652
O
HOH

424
25.564
3.138
6.622
1.00
21.50
O


HETATM
1653
O
HOH

425
34.235
23.186
16.786
1.00
20.56
O


HETATM
1654
O
HOH

426
26.555
15.848
20.797
1.00
34.81
O


HETATM
1655
O
HOH

427
16.137
10.314
16.975
1.00
35.39
O


HETATM
1656
O
HOH

428
15.237
5.341
9.720
1.00
26.74
O


HETATM
1657
O
HOH

429
5.843
16.504
13.479
1.00
20.66
O


HETATM
1658
O
HOH

430
34.267
35.258
2.809
1.00
18.39
O


HETATM
1659
O
HOH

431
34.478
8.946
16.706
1.00
37.42
O


HETATM
1660
O
HOH

432
23.052
6.563
4.267
1.00
23.73
O


HETATM
1661
O
HOH

433
24.461
21.831
8.001
1.00
19.72
O


HETATM
1662
O
HOH

434
16.811
5.911
−0.991
1.00
39.78
O


HETATM
1663
O
HOH

435
18.936
8.756
17.266
1.00
16.77
O


HETATM
1664
O
HOH

436
17.061
32.451
9.214
1.00
32.73
O


HETATM
1665
O
HOH

437
25.185
15.727
10.155
1.00
23.94
O


HETATM
1666
O
HOH

438
36.819
34.970
6.044
1.00
23.13
O


HETATM
1667
O
HOH

439
36.371
2.956
7.585
1.00
22.07
O


HETATM
1668
O
HOH

440
7.686
18.108
−0.709
1.00
31.93
O


HETATM
1669
O
HOH

442
28.154
5.986
5.626
1.00
34.66
O


HETATM
1670
O
HOH

443
43.765
7.991
3.710
1.00
29.75
O


HETATM
1671
O
HOH

444
34.138
11.979
15.697
1.00
19.26
O


HETATM
1672
O
HOH

445
16.154
2.876
6.317
1.00
24.75
O


HETATM
1673
O
HOH

446
3.198
15.740
15.476
1.00
47.56
O


HETATM
1674
O
HOH

447
16.359
21.320
4.320
1.00
27.58
O


HETATM
1675
O
HOH

448
43.253
8.885
16.567
1.00
16.92
O


HETATM
1676
O
HOH

449
35.697
6.317
18.291
1.00
34.21
O


HETATM
1677
O
HOH

450
26.118
7.952
1.424
1.00
47.42
O


HETATM
1678
O
HOH

451
11.782
19.566
7.278
1.00
31.23
O


HETATM
1679
O
HOH

452
13.180
12.084
−5.228
1.00
27.14
O


HETATM
1680
O
HOH

454
37.285
10.836
3.417
1.00
29.24
O


HETATM
1681
O
HOH

456
49.054
14.720
18.870
1.00
34.45
O


HETATM
1682
O
HOH

457
29.111
15.812
11.618
1.00
40.55
O


HETATM
1683
O
HOH

458
31.663
19.001
19.522
1.00
17.15
O


HETATM
1684
O
HOH

459
39.304
19.991
21.460
1.00
29.51
O


HETATM
1685
O
HOH

460
6.990
6.202
−4.001
1.00
40.02
O


HETATM
1686
O
HOH

461
8.794
6.163
15.456
1.00
31.20
O


HETATM
1687
O
HOH

462
31.882
26.863
16.881
1.00
25.16
O


HETATM
1688
O
HOH

463
26.477
−0.242
3.289
1.00
24.32
O


HETATM
1689
O
HOH

464
−3.607
9.559
4.991
1.00
25.17
O


HETATM
1690
O
HOH

465
3.250
20.726
2.705
1.00
41.31
O


HETATM
1691
O
HOH

466
22.033
33.298
4.411
1.00
41.90
O


HETATM
1692
O
HOH

467
46.878
8.195
18.105
1.00
39.35
O


HETATM
1693
O
HOH

468
39.497
14.637
4.088
1.00
29.50
O


HETATM
1694
O
HOH

469
9.001
4.669
12.208
1.00
33.99
O


HETATM
1695
O
HOH

470
−0.445
6.689
3.926
1.00
27.30
O


HETATM
1696
O
HOH

471
17.189
28.967
12.756
1.00
24.27
O


HETATM
1697
O
HOH

472
−5.836
−10.333
16.980
1.00
45.19
O


HETATM
1698
O
HOH

473
45.688
15.274
4.933
1.00
26.44
O


HETATM
1699
O
HOH

474
24.616
0.817
16.978
1.00
26.14
O


HETATM
1700
O
HOH

475
25.860
28.821
0.421
1.00
30.41
O


HETATM
1701
O
HOH

476
18.624
3.432
13.266
1.00
19.31
O


HETATM
1702
O
HOH

478
17.089
26.985
10.853
1.00
34.08
O


HETATM
1703
O
HOH

479
15.210
30.506
6.340
1.00
38.53
O


HETATM
1704
O
HOH

480
36.478
33.125
3.386
1.00
43.68
O


HETATM
1705
O
HOH

481
9.899
3.051
8.224
1.00
36.06
O


HETATM
1706
O
HOH

482
18.088
0.524
11.711
1.00
38.00
O


HETATM
1707
O
HOH

483
4.856
11.207
18.945
1.00
35.35
O


HETATM
1708
O
HOH

484
31.875
14.345
0.745
1.00
27.67
O


HETATM
1709
O
HOH

485
42.954
25.586
19.172
1.00
34.55
O


HETATM
1710
O
HOH

486
19.707
24.093
−3.819
1.00
36.45
O


HETATM
1711
O
HOH

487
21.776
5.844
18.169
1.00
31.67
O


HETATM
1712
O
HOH

488
26.072
21.572
10.831
1.00
26.77
O


HETATM
1713
O
HOH

489
0.375
5.438
11.888
1.00
36.91
O


HETATM
1714
O
HOH

490
28.879
15.319
3.965
1.00
63.69
O


HETATM
1715
O
HOH

491
39.794
17.388
1.508
1.00
40.63
O


HETATM
1716
O
HOH

492
40.065
0.717
19.459
1.00
39.86
O


HETATM
1717
O
HOH

493
49.237
16.037
4.776
1.00
50.04
O


HETATM
1718
O
HOH

494
6.602
16.058
−4.236
1.00
32.86
O


HETATM
1719
O
HOH

495
14.502
4.269
−0.179
1.00
37.62
O


HETATM
1720
O
HOH

496
15.468
20.981
14.363
1.00
39.44
O


HETATM
1721
O
HOH

497
24.964
16.434
−3.116
1.00
23.19
O


HETATM
1722
O
HOH

498
40.994
29.185
18.880
1.00
37.92
O


HETATM
1723
O
HOH

499
26.306
17.999
12.975
1.00
43.45
O


HETATM
1724
O
HOH

500
−1.452
−5.851
6.853
1.00
35.82
O


HETATM
1725
O
HOH

501
10.897
22.806
6.598
1.00
56.17
O


HETATM
1726
O
HOH

502
35.628
12.681
13.339
1.00
92.79
O


HETATM
1727
O
HOH

503
56.988
18.311
10.058
1.00
60.09
O


HETATM
1728
O
HOH

504
42.131
21.097
23.339
1.00
45.34
O


HETATM
1729
O
HOH

505
47.093
7.981
15.216
1.00
42.30
O


HETATM
1730
O
HOH

506
10.635
18.180
−1.367
1.00
28.88
O


HETATM
1731
O
HOH

507
25.312
18.277
3.464
1.00
36.67
O


HETATM
1732
O
HOH

508
53.997
11.193
14.846
1.00
48.50
O


HETATM
1733
O
HOH

509
17.232
19.642
−6.737
1.00
39.43
O


HETATM
1734
O
HOH

511
48.389
18.219
19.003
1.00
37.53
O


HETATM
1735
O
HOH

512
36.441
5.846
21.503
1.00
17.36
O


HETATM
1736
O
HOH

513
27.771
19.877
−5.622
1.00
28.79
O


HETATM
1737
O
HOH

514
49.143
9.733
9.698
1.00
44.63
O


HETATM
1738
O
HOH

515
22.020
29.095
0.023
1.00
38.04
O


HETATM
1739
O
HOH

516
10.150
18.223
−5.419
1.00
45.55
O


HETATM
1740
O
HOH

518
42.433
18.492
3.496
1.00
20.34
O


HETATM
1741
O
HOH

519
12.974
−6.995
9.968
1.00
55.96
O


HETATM
1742
O
HOH

520
41.002
30.190
16.057
1.00
46.34
O


HETATM
1743
O
HOH

521
28.471
5.080
1.598
1.00
38.47
O


HETATM
1744
O
HOH

522
1.319
16.118
8.887
1.00
40.75
O


HETATM
1745
O
HOH

524
47.980
24.698
12.703
1.00
59.46
O


HETATM
1746
O
HOH

525
2.015
17.058
12.074
1.00
47.46
O


HETATM
1747
O
HOH

526
34.792
8.932
23.014
1.00
17.72
O


HETATM
1748
O
HOH

527
−4.125
3.139
11.120
1.00
35.85
O


HETATM
1749
O
HOH

528
14.446
21.653
8.904
1.00
29.07
O


HETATM
1750
O
HOH

529
19.539
24.993
11.728
1.00
32.07
O


HETATM
1751
O
HOH

530
15.285
2.359
12.400
1.00
45.20
O


HETATM
1752
O
HOH

531
35.293
32.877
16.427
1.00
41.71
O


HETATM
1753
O
HOH

532
31.338
11.703
22.624
1.00
30.38
O


HETATM
1754
O
HOH

533
10.807
3.899
5.533
1.00
31.89
O


HETATM
1755
O
HOH

534
23.897
23.200
−2.832
1.00
31.97
O


HETATM
1756
O
HOH

535
4.746
5.239
−2.472
1.00
30.20
O


HETATM
1757
O
HOH

536
24.434
26.832
−2.017
1.00
31.20
O


HETATM
1758
O
HOH

537
43.950
0.544
13.447
1.00
41.33
O


HETATM
1759
O
HOH

538
23.116
8.555
0.668
1.00
35.22
O


HETATM
1760
O
HOH

539
9.978
15.068
14.401
1.00
30.79
O


HETATM
1761
O
HOH

540
2.001
9.284
12.960
1.00
43.03
O


HETATM
1762
O
HOH

541
3.144
3.592
12.702
1.00
57.96
O


HETATM
1763
O
HOH

542
9.889
9.611
−3.909
1.00
31.84
O


HETATM
1764
O
HOH

543
47.304
17.648
6.312
1.00
33.07
O


HETATM
1765
O
HOH

544
4.895
−8.536
10.635
1.00
53.30
O


HETATM
1766
O
HOH

546
39.433
25.709
8.086
1.00
18.28
O


HETATM
1767
O
HOH

547
30.374
−4.058
14.167
1.00
15.55
O


HETATM
1768
O
HOH

548
46.029
15.729
24.092
1.00
25.93
O


HETATM
1769
O
HOH

549
40.466
−1.095
17.221
1.00
30.08
O


HETATM
1770
O
HOH

550
24.133
0.727
19.719
1.00
26.39
O


HETATM
1771
O
HOH

551
3.446
9.427
−2.135
1.00
31.76
O


HETATM
1772
O
HOH

552
19.319
13.000
−2.575
1.00
41.76
O


HETATM
1773
O
HOH

553
31.891
31.580
18.610
1.00
24.77
O


HETATM
1774
O
HOH

554
18.491
2.334
4.660
1.00
22.10
O


HETATM
1775
O
HOH

555
1.314
11.279
0.038
1.00
26.52
O


HETATM
1776
O
HOH

556
28.146
13.256
10.339
1.00
21.95
O


HETATM
1777
O
HOH

557
30.223
20.804
5.827
1.00
36.82
O


HETATM
1778
O
HOH

558
17.905
13.328
15.220
1.00
20.71
O


HETATM
1779
O
HOH

559
29.909
−0.095
18.799
1.00
21.94
O


HETATM
1780
O
HOH

560
31.631
17.664
9.008
1.00
19.67
O


HETATM
1781
O
HOH

562
24.042
20.954
19.484
1.00
31.11
O


HETATM
1782
O
HOH

563
37.392
16.228
−0.497
1.00
26.68
O


HETATM
1783
O
HOH

564
−1.946
8.648
−0.841
1.00
39.16
O


HETATM
1784
O
HOH

565
47.915
9.279
22.936
1.00
35.60
O


HETATM
1785
O
HOH

566
25.748
7.266
4.529
1.00
28.82
O


HETATM
1786
O
HOH

567
21.116
14.888
−2.109
1.00
36.75
O


HETATM
1787
O
HOH

568
23.715
20.126
5.645
1.00
40.35
O


HETATM
1788
O
HOH

569
33.531
−0.771
13.511
1.00
45.31
O


HETATM
1789
O
HOH

570
20.908
18.749
15.697
1.00
34.23
O


HETATM
1790
O
HOH

571
39.015
32.529
4.501
1.00
54.73
O


HETATM
1791
O
HOH

572
41.051
0.455
7.130
1.00
54.62
O


HETATM
1792
O
HOH

573
30.698
17.740
5.106
1.00
35.84
O


HETATM
1793
O
HOH

574
6.859
5.063
17.712
1.00
37.55
O


HETATM
1794
O
HOH

575
39.059
8.657
3.005
1.00
59.67
O


HETATM
1795
O
HOH

576
38.160
32.276
16.708
1.00
37.88
O


HETATM
1796
O
HOH

577
20.383
21.318
−4.691
1.00
39.17
O


HETATM
1797
O
HOH

578
41.157
6.777
17.217
1.00
35.75
O


HETATM
1798
O
HOH

579
27.327
21.257
8.150
1.00
42.39
O


HETATM
1799
O
HOH

580
34.592
24.586
1.886
1.00
30.17
O


HETATM
1800
O
HOH

581
28.618
8.077
−0.712
1.00
54.91
O


HETATM
1801
O
HOH

582
30.108
20.873
8.937
1.00
56.50
O


HETATM
1802
O
HOH

583
29.146
8.970
6.121
1.00
38.73
O


HETATM
1803
O
HOH

584
51.666
17.680
15.607
1.00
47.50
O


HETATM
1804
O
HOH

585
14.071
22.825
6.240
1.00
47.08
O


HETATM
1805
O
HOH

586
43.840
6.576
20.454
1.00
36.44
O


HETATM
1806
O
HOH

587
42.165
8.711
19.272
1.00
37.83
O


HETATM
1807
O
HOH

588
38.236
13.980
1.049
1.00
39.71
O


HETATM
1808
O
HOH

589
45.548
22.632
16.215
1.00
37.00
O


HETATM
1809
O
HOH

590
30.906
16.821
2.292
1.00
76.55
O


HETATM
1810
O
HOH

591
11.602
22.880
−3.030
1.00
36.77
O


HETATM
1811
O
HOH

592
12.805
8.355
−4.098
1.00
49.06
O


HETATM
1812
O
HOH

593
−2.315
6.894
1.774
1.00
37.69
O


HETATM
1813
O
HOH

594
23.656
18.274
20.007
1.00
26.98
O


HETATM
1814
O
HOH

595
10.845
12.955
18.898
1.00
29.83
O


HETATM
1815
O
HOH

596
18.262
4.377
19.245
1.00
39.17
O


HETATM
1816
O
HOH

597
49.235
6.180
11.692
1.00
70.41
O


HETATM
1817
O
HOH

598
14.400
−3.218
15.489
1.00
41.28
O


HETATM
1818
O
HOH

599
24.049
25.032
20.146
1.00
33.82
O


HETATM
1819
O
HOH

600
23.974
20.498
−3.811
1.00
44.21
O


HETATM
1820
O
HOH

601
39.360
6.106
20.658
1.00
34.40
O


HETATM
1821
O
HOH

602
31.294
32.672
16.263
1.00
25.66
O


HETATM
1822
O
HOH

603
44.234
4.286
13.888
1.00
41.49
O


HETATM
1823
O
HOH

604
21.106
8.490
19.650
1.00
66.45
O


HETATM
1824
O
HOH

605
28.263
15.750
23.071
1.00
67.72
O


HETATM
1825
O
HOH

606
17.345
24.004
15.476
1.00
73.54
O


HETATM
1826
O
HOH

607
28.369
11.841
−4.260
1.00
38.80
O


HETATM
1827
O
HOH

608
18.542
3.496
0.954
1.00
48.61
O


HETATM
1828
O
HOH

609
−6.831
−8.495
19.160
1.00
46.57
O


HETATM
1829
O
HOH

610
22.075
20.486
9.491
1.00
58.37
O


HETATM
1830
O
HOH

611
19.748
7.055
−2.211
1.00
48.25
O


HETATM
1831
O
HOH

612
44.625
20.192
21.807
1.00
80.14
O


HETATM
1832
O
HOH

613
38.559
9.253
5.778
1.00
32.81
O


HETATM
1833
O
HOH

614
18.465
0.369
7.409
1.00
51.65
O


HETATM
1834
O
HOH

615
37.639
9.337
22.161
1.00
48.87
O


HETATM
1835
O
HOH

616
16.809
1.980
9.496
1.00
72.40
O


HETATM
1836
O
HOH

617
38.304
20.108
24.081
1.00
25.44
O


HETATM
1837
O
HOH

618
19.482
11.517
17.092
1.00
70.05
O


HETATM
1838
O
HOH

619
15.255
−5.143
10.342
1.00
36.51
O


HETATM
1839
O
HOH

620
29.264
10.712
2.893
1.00
32.87
O


HETATM
1840
O
HOH

621
15.822
7.785
−3.226
1.00
43.27
O


HETATM
1841
O
HOH

622
23.813
11.730
−2.619
1.00
36.87
O


HETATM
1842
O
HOH

623
−3.594
5.850
12.136
1.00
66.99
O


HETATM
1843
O
HOH

624
18.735
22.790
5.719
1.00
40.97
O


HETATM
1844
O
HOH

625
55.575
16.141
8.287
1.00
49.74
O


HETATM
1845
O
HOH

626
53.255
10.141
17.681
1.00
68.52
O


HETATM
1846
O
HOH

627
−1.530
−0.489
4.060
1.00
49.32
O


HETATM
1847
O
HOH

628
47.166
12.933
3.227
1.00
42.68
O


HETATM
1848
O
HOH

629
51.745
12.846
4.265
1.00
32.94
O


HETATM
1849
O
HOH

630
8.335
13.886
−5.452
1.00
42.98
O


HETATM
1850
O
HOH

631
1.415
18.892
4.450
1.00
52.27
O


HETATM
1851
O
HOH

632
8.813
4.607
−6.147
1.00
61.80
O


HETATM
1852
O
HOH

633
14.358
18.909
−7.627
1.00
59.54
O


HETATM
1853
O
HOH

634
42.560
12.882
4.673
1.00
46.06
O


HETATM
1854
O
HOH

635
13.134
−0.048
6.444
1.00
49.92
O


HETATM
1855
O
HOH

636
8.623
23.116
−0.236
1.00
51.89
O


HETATM
1856
O
HOH

637
19.455
18.632
6.820
1.00
29.08
O


HETATM
1857
O
HOH

638
0.328
4.658
14.591
1.00
62.40
O


HETATM
1858
O
HOH

639
49.562
9.193
6.008
1.00
39.48
O


HETATM
1859
O
HOH

640
4.000
−2.106
−0.040
1.00
57.30
0


HETATM
1860
O
HOH

641
16.678
23.799
11.363
1.00
65.19
0


HETATM
1861
O
HOH

642
8.358
0.710
9.304
1.00
43.70
0


HETATM
1862
O
HOH

643
41.615
2.908
17.876
1.00
52.92
0


HETATM
1863
O
HOH

644
29.021
19.152
2.611
1.00
26.91
0


HETATM
1864
O
HOH

645
44.197
15.580
2.167
1.00
76.22
0


HETATM
1865
O
HOH

646
22.571
6.260
−1.309
1.00
50.65
0


HETATM
1866
O
HOH

647
1.473
1.753
3.469
1.00
40.93
0


HETATM
1867
O
HOH

648
46.959
7.034
20.908
1.00
37.66
0


HETATM
1868
O
HOH

649
24.330
17.915
17.109
1.00
53.90
0













CONECT
1428
1429






CONECT
1429
1428
1430


CONECT
1430
1429
1431


CONECT
1431
1430
1432


CONECT
1432
1431
1433


CONECT
1433
1432
1434


CONECT
1434
1433
1435


CONECT
1435
1434
1436


CONECT
1436
1435
1437


CONECT
1437
1436


CONECT
1438
1439
1440
1441


CONECT
1439
1438


CONECT
1440
1438


CONECT
1441
1438


CONECT
1442
1443
1444
1445
1446


CONECT
1442
1447
1448


CONECT
1443
1442


CONECT
1444
1442


CONECT
1445
1442


CONECT
1446
1442


CONECT
1447
1442


CONECT
1448
1442


CONECT
1449
1450
1451
1452
1453


CONECT
1449
1454
1455


CONECT
1450
1449


CONECT
1451
1449


CONECT
1452
1449


CONECT
1453
1449


CONECT
1454
1449


CONECT
1455
1449


CONECT
1456
1457
1458
1459
1460


CONECT
1456
1461
1462


CONECT
1457
1456


CONECT
1458
1456


CONECT
1459
1456


CONECT
1460
1456


CONECT
1461
1456


CONECT
1462
1456


CONECT
1463
1464
1465
1466
1467


CONECT
1463
1468
1469


CONECT
1464
1463


CONECT
1465
1463


CONECT
1466
1463


CONECT
1467
1463


CONECT
1468
1463


CONECT
1469
1463


CONECT
1470
1471
1472
1473
1474


CONECT
1470
1475
1476


CONECT
1471
1470


CONECT
1472
1470


CONECT
1473
1470


CONECT
1474
1470


CONECT
1475
1470


CONECT
1476
1470


CONECT
1477
1478
1479
1480
1481


CONECT
1477
1482
1483


CONECT
1478
1477


CONECT
1479
1477


CONECT
1480
1477


CONECT
1481
1477


CONECT
1482
1477


CONECT
1483
1477


CONECT
1484
1485
1486
1487
1488


CONECT
1484
1489
1490


CONECT
1485
1484


CONECT
1486
1484


CONECT
1487
1484


CONECT
1488
1484


CONECT
1489
1484


CONECT
1490
1484


CONECT
1491
1492
1493
1494
1495


CONECT
1491
1496
1497


CONECT
1492
1491


CONECT
1493
1491


CONECT
1494
1491


CONECT
1495
1491


CONECT
1496
1491


CONECT
1497
1491


CONECT
1498
1499
1500
1501
1502


CONECT
1498
1503
1504


CONECT
1499
1498


CONECT
1500
1498


CONECT
1501
1498


CONECT
1502
1498


CONECT
1503
1498


CONECT
1504
1498


CONECT
1505
1506
1507
1508
1509


CONECT
1505
1510
1511


CONECT
1506
1505


CONECT
1507
1505


CONECT
1508
1505


CONECT
1509
1505


CONECT
1510
1505


CONECT
1511
1505


CONECT
1512
1513
1514
1515
1516


CONECT
1512
1517
1518


CONECT
1513
1512


CONECT
1514
1512


CONECT
1515
1512


CONECT
1516
1512


CONECT
1517
1512


CONECT
1518
1512


CONECT
1519
1520
1521
1522
1523


CONECT
1519
1524
1525


CONECT
1520
1519


CONECT
1521
1519


CONECT
1522
1519


CONECT
1523
1519


CONECT
1524
1519


CONECT
1525
1519


CONECT
1526
1527
1531


CONECT
1527
1526
1528


CONECT
1528
1527
1529


CONECT
1529
1528
1530
1535


CONECT
1530
1529
1531
1533


CONECT
1531
1526
1530
1532


CONECT
1532
1531


CONECT
1533
1530
1534


CONECT
1534
1533
1535


CONECT
1535
1529
1534




















MASTER
249
0
15
0
0
0
0
6
1867
1
120
6



END
















TABLE 7







Thermodynamic parameters associated with various nucleobases bound to the


adenine riboswitch RNA (AR)











#

Kd
ΔH
ΔS


of exp.
Analog
(μM)
(kcal/mol)
(Cal/mol · deg)





5
2,6-Diaminopurine*
0.017 ± 0.006
−40.3 ± 2.91
−97.6 ± 4.00


2
2-Aminopurine
0.251 ± 0.026
−31.8 ± 2.64
−73.9 ± 9.26


2
Adenine
0.469 ± 0.041
−34.5 ± 0.05
−84.8 ± 0.30


2
Purine
110 ± 3 
−25.4 ± 0.99
−65.4 ± 3.00


2
2-Fluoroadenine
4.02 ± 1.07
−37.4 ± 3.41
−98.7 ± 11.8


2
2,4,5,6-Tetraaminopyrimidine
24.9 ± 4.25
−43.2 ± 3.50
 −122 ± 12.0


2
2,4,6-Triaminopyrimidine
18.3 ± 1.67
−37.7 ± 0.23
 −102 ± 0.70


1
2,4-Diaminopyrimidine
n.d.


1
Histamine
n.d.


1
L-Histidinol
n.d.


2
6-Bromoguanine
 2.7 ± 1.16
−50.4 ± 0.16
 −141 ± 1.90


2
6-Chloroguanine
0.721 ± 0.044
−44.3 ± 7.2 
−98.6 ± 4.39


1
2,6-Dichloropurine
n.d.


4
O-Methylguanine
n.d.


1
Kinetin
n.d.


1
6-n-Hexylaminopurine
n.d.


2
6-Benzylaminopurine
n.d.


2
6-Methylpurine
25.2 ± 1.63
−31.5 ± 0.42
−82.9 ± 1.25


2
O-Benzylguanine
n.d.





All experiments were done in 50 mM K+HEPES, pH 7.5, 100 mM KC1, 10 mM MgCl2 at 30° C.


n.d. designates reactions where no detectable binding was indicated.


*The heat capacity (ΔCp) for this reaction is −1.10 kcal mol−1 K−1, as measured between 298 and 323 K.













TABLE 8







Thermodynamic parameters associated with various nucleobases bound to the


guanine riboswitch RNA (GR)











# of

Kd
ΔH
ΔS


exp
Analog
(μM)
(kcal/mol)
(cal/mol · deg)





2
Guanine
0.004 ± 0.003
−53.5 ± 6.98
−137 ± 24.8


2
Hypoxanthine
0.759 ± 0.066
−31.3 ± 0.01
 −75.3 ± 0.145


2
Xanthine
50.5 ± 5.8*
 −34.6 ± 10.4*
 −94.4 ± 34.5*


2
7-Deazaguanine
0.049 ± 0.013
−42.0 ± 3.72
−105 ± 11.7


2
8-Azaguanine
0.0487 ± 0.1  
−46.9 ± 9.87
−121 ± 32.6


1
4,6-Dihyroxypyrimidine
n.d


1
5,6Diamino2,4Dihydroxyupyrimidine
n.d.


2
6-Bromoguanine
2.12 ± 0.56
−39.0 ± 7.79
−103 ± 26.2


2
6-Chloroguanine
0.856 ± 0.014
−43.6 ± 13.1
−116 ± 43.2


1
2,6-Dichloropurine
n.d.


1
O-Methylguanine
0.003
−18.7
−23.3


2
2-Aminopurine
4.36 ± 1.2 
−46.4 ± 2.66
−129 ± 8.20


1
O-Benzylguanine
n.d.


2
6-Thioguanine
0.0737 ± 0.01 
−40.6 ± 14.5
−101 ± 47.5


2
6-Methylthioguanine
0.0418 ± 0.003 
−24.5 ± 0.20
−46.9 ± 0.24 





All experiments were done in 50 mM K+HEPES, 100 mM KCI, 10 mM MgCl2 at 30° C.


n.d. designates reactions where no detectable binding was indicated.


*Software was not able to fit data and report parameters unless n was held at 1.






It is understood that the disclosed method and compositions are not limited to the particular methodology, protocols, and reagents described as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims.


It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to “a riboswitch” includes a plurality of such riboswitches, reference to “the riboswitch” is a reference to one or more riboswitches and equivalents thereof known to those skilled in the art, and so forth.


“Optional” or “optionally” means that the subsequently described event, circumstance, or material may or may not occur or be present, and that the description includes instances where the event, circumstance, or material occurs or is present and instances where it does not occur or is not present.


Ranges may be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, also specifically contemplated and considered disclosed is the range from the one particular value and/or to the other particular value unless the context specifically indicates otherwise. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms another, specifically contemplated embodiment that should be considered disclosed unless the context specifically indicates otherwise. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint unless the context specifically indicates otherwise. Finally, it should be understood that all of the individual values and sub-ranges of values contained within an explicitly disclosed range are also specifically contemplated and should be considered disclosed unless the context specifically indicates otherwise. The foregoing applies regardless of whether in particular cases some or all of these embodiments are explicitly disclosed.


Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of skill in the art to which the disclosed method and compositions belong. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present method and compositions, the particularly useful methods, devices, and materials are as described. Publications cited herein and the material for which they are cited are hereby specifically incorporated by reference. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such disclosure by virtue of prior invention. No admission is made that any reference constitutes prior art. The discussion of references states what their authors assert, and applicants reserve the right to challenge the accuracy and pertinency of the cited documents. It will be clearly understood that, although a number of publications are referred to herein, such reference does not constitute an admission that any of these documents forms part of the common general knowledge in the art.


Throughout the description and claims of this specification, the word “comprise” and variations of the word, such as “comprising” and “comprises,” means “including but not limited to,” and is not intended to exclude, for example, other additives, components, integers or steps.


Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the method and compositions described herein. Such equivalents are intended to be encompassed by the following claims.

Claims
  • 1. The atomic structure of a natural guanine-responsive riboswitch comprising an atomic structure as depicted in FIG. 6.
  • 2. The atomic structure of claim 1, wherein the atomic coordinates of the atomic structure comprise the atomic coordinates listed in Table 6 for atoms depicted in FIG. 6.
  • 3. The atomic structure of claim 1, wherein the atomic coordinates of the atomic structure comprise the atomic coordinates listed in Table 6.
  • 4. A method of identifying a compound that interacts with a riboswitch comprising: (a) modeling the atomic structure of claim 1 with a test compound; and(b) determining if the test compound interacts with the riboswitch.
  • 5. The method of claim 4, wherein determining if the test compound interacts with the riboswitch comprises determining a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch.
  • 6. The method of claim 4, wherein determining if the test compound interacts with the riboswitch comprises determining one or more predicted bonds, one or more predicted interactions, or a combination, of the test compound with the model of the riboswitch.
  • 7. The method of claim 4, wherein the riboswitch is a guanine riboswitch.
  • 8. The method of claim 7, wherein the guanine riboswitch is a riboswitch in Table 5.
  • 9. The method of claim 4, wherein atomic contacts are determined in step (b), thereby determining the interaction of the test compound with the riboswitch.
  • 10. The method of claim 9, further comprising the steps of: (c) identifying analogs of the test compound;(d) determining if the analogs of the test compound interact with the riboswitch.
  • 11. The method of claim 10, wherein the compound is hypoxanthine.
  • 12. A method of killing bacteria, comprising contacting the bacteria with an analog identified by the method of claim 10.
  • 13. A method of killing bacteria, comprising contacting the bacteria with a compound identified by the method of claim 4.
  • 14. The method of claim 4, wherein a gel-based assay is used to determine if the test compound interacts with the riboswitch.
  • 15. The method of claim 4, wherein a chip-based assay is used to determine if the test compound interacts with the riboswitch.
  • 16. The method of claim 4, wherein the test compound interacts via van der Waals interactions, hydrogen bonds, electrostatic interactions, hydrophobic interactions, or a combination.
  • 17. The method of claim 4, wherein the riboswitch comprises an RNA cleaving ribozyme.
  • 18. The method of claim 4, wherein a fluorescent signal is generated when a nucleic acid comprising a quenching moiety is cleaved.
  • 19. The method of claim 4, wherein molecular beacon technology is employed to generate the fluorescent signal.
  • 20. The method of claim 4, wherein the method is carried out using a high throughput screen.
  • 21. A method of identifying compounds that interact with a riboswitch comprising contacting the riboswitch with a test compound, wherein a fluorescent signal is generated upon interaction of the riboswitch with the test compound.
  • 22. A method of identifying a compound that interacts with a riboswitch comprising: (a) identifying the crystal structure of the riboswitch;(b) modeling the riboswitch with a test compound; and(c) determining if the test compound interacts with the riboswitch.
  • 23. The method of claim 22, wherein the riboswitch is a guanine riboswitch.
  • 24. The method of claim 23, wherein the guanine riboswitch is a riboswitch in Table 5.
  • 25. A regulatable gene expression construct comprising a nucleic acid molecule encoding an RNA comprising a riboswitch operably linked to a coding region, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch regulates expression of the RNA, wherein the riboswitch and coding region are heterologous.
  • 26. The gene expression construct of claim 25, wherein the riboswitch is activated by a trigger molecule, wherein the riboswitch produces a signal when activated by the trigger molecule.
  • 27. A method of detecting a compound of interest, the method comprising bringing into contact a sample and a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch is activated by the compound of interest, wherein the riboswitch produces a signal when activated by the compound of interest, wherein the riboswitch produces a signal when the sample contains the compound of interest.
  • 28. The method of claim 27, wherein the riboswitch changes conformation when activated by the compound of interest, wherein the change in conformation produces a signal via a conformation dependent label.
  • 29. The method of claim 27, wherein the riboswitch changes conformation when activated by the compound of interest, wherein the change in conformation causes a change in expression of an RNA linked to the riboswitch, wherein the change in expression produces a signal.
  • 30. The method of claim 29, wherein the signal is produced by a reporter protein expressed from the RNA linked to the riboswitch.
  • 31. A method of inhibiting gene expression, the method comprising (a) bringing into contact a compound and a cell,(b) wherein the compound has the structure of Formula I
  • 32. The method of claim 31, wherein R3 is a hydrogen bond acceptor.
  • 33. The method of claim 31, wherein, independently, R1, R2, R9 or a combination are —NR11,—CHR11—, ═CR11—, and —C(═NR11)—, where R11 is —H, —NH2, —OH, —SH, —CO2H, substituted or unsubstituted alkyl, alkoxy, aryl, aryloxy, or benzyloxy, —NHalkyl, —NHalkoxy, —NHC(O)alkyl, —NHCO2alkyl, —NHC(O)NH2, —NH—NH2, —NH—NHalkyl, —NH—NHalkoxy, —NH—SO2alkyl, —NH—SO2—R12, —NHCO2CH2—R12, —NH—OR12, —N+H2—R12, —NH—NH—R12, and —NH—NH—CH2—R12, wherein R12 is:
  • 34. The method of claim 31, wherein R8 and R7 taken together can be represented as —CH═N—, —CH2—O—, —CH2—S—, or —CH2SO2—.
  • 35. The method of claim 31, wherein R10 is —OH, —SH, —NH2, —CO2H, -alkoxy, -aryloxy, -benzyloxy, -halide, —NHalkyl, —NHalkoxy, —NHC(O)alkyl, —NHCO2alkyl, —NHCO2CH2—R12, —NHC(O)NH2, —NH—NH2, —NH—NHalkyl, —NH—NHalkoxy, —SO2alkyl, —SO2aryl, —NH—SO2alkyl, —NH—SO2—R12, —NH—OR12, —NH—R12, —NH—NH—R12, —NH—NH—CH2—R12, or —NH—CH2—R12, wherein R12 is:
  • 36. The method of claim 31, wherein R10 is NR14, wherein R14 is —H, —NH2, —OH, —SH, —CO2H, —CO2alkyl, —CO2aryl, —C(O)NH2, substituted or unsubstituted alkyl, alkoxy, alkoxy, aryloxy, or benzyloxy, —NHalkyl, —NHalkoxy, —NHC(O)alkyl, —NHCO2alkyl, —NHC(O)NH2, —SO2alkyl, —SO2aryl, —NH—SO2alkyl, —NH—SO2—R12, —NH—OR12, —NH—R12, or —NH—CH2—R12wherein R12 is:
  • 37. The method of claim 31, wherein the compound has the structure of Formula II:
  • 38. The method of claim 31, wherein the compound has the structure of Formula III:
  • 39. A method comprising (a) testing a compound for inhibition of gene expression of a gene encoding an RNA comprising a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the inhibition is via the riboswitch,(b) inhibiting gene expression by bringing into contact a cell and a compound that inhibited gene expression in step (a),wherein the cell comprises a gene encoding an RNA comprising the riboswitch, wherein the compound inhibits expression of the gene by binding to the riboswitch.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims benefit of U.S. Provisional Application No. 60/625,864, filed Nov. 8, 2004. U.S. Provisional Application No. 60/625,864, filed Nov. 8, 2004, is hereby incorporated herein by reference in its entirety.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH

This invention was made with government support under Grant NIH GM068819-2 awarded by the National Institutes of Health, and Grant DARPA W911NF-04-1-0416 awarded by the Defense Advanced Research Projects Agency. The government has certain rights in the invention.

PCT Information
Filing Document Filing Date Country Kind 371c Date
PCT/US05/40487 11/8/2005 WO 00 5/12/2008
Provisional Applications (1)
Number Date Country
60625864 Nov 2004 US