[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2006055351A2 - Structure-based compound design involving riboswitches - Google Patents

Structure-based compound design involving riboswitches Download PDF

Info

Publication number
WO2006055351A2
WO2006055351A2 PCT/US2005/040487 US2005040487W WO2006055351A2 WO 2006055351 A2 WO2006055351 A2 WO 2006055351A2 US 2005040487 W US2005040487 W US 2005040487W WO 2006055351 A2 WO2006055351 A2 WO 2006055351A2
Authority
WO
WIPO (PCT)
Prior art keywords
riboswitch
compound
riboswitches
rna
alkyl
Prior art date
Application number
PCT/US2005/040487
Other languages
French (fr)
Other versions
WO2006055351A3 (en
Inventor
Ronald R. Breaker
Jinsoo Lim
Robert Batey
Kenneth F. Blount
Izabela Puskarz
Original Assignee
Yale University
The Regents Of The University Of Colorado
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yale University, The Regents Of The University Of Colorado filed Critical Yale University
Priority to US11/667,153 priority Critical patent/US20080269258A1/en
Priority to AU2005306801A priority patent/AU2005306801B2/en
Priority to MX2007005491A priority patent/MX2007005491A/en
Priority to CA002586998A priority patent/CA2586998A1/en
Priority to JP2007540181A priority patent/JP2008518630A/en
Priority to EP05851443A priority patent/EP1828953B1/en
Publication of WO2006055351A2 publication Critical patent/WO2006055351A2/en
Publication of WO2006055351A3 publication Critical patent/WO2006055351A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6897Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids involving reporter genes operably linked to promoters
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/04Antibacterial agents
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07HSUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
    • C07H21/00Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
    • C07H21/04Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids with deoxyribosyl as saccharide radical
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/115Aptamers, i.e. nucleic acids binding a target molecule specifically and with high affinity without hybridising therewith ; Nucleic acids binding to non-nucleic acids, e.g. aptamers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2310/00Structure or type of the nucleic acid
    • C12N2310/10Type of nucleic acid
    • C12N2310/16Aptamers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2320/00Applications; Uses
    • C12N2320/10Applications; Uses in screening processes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2565/00Nucleic acid analysis characterised by mode or means of detection
    • C12Q2565/10Detection mode being characterised by the assay principle
    • C12Q2565/101Interaction between at least two labels
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/136Screening for pharmacological compounds
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Definitions

  • the disclosed invention is generally in the field of gene expression and specifically in the area of regulation of gene expression.
  • Precision genetic control is an essential feature of living systems, as cells must respond to a multitude of biochemical signals and environmental cues by varying genetic expression patterns. Most known mechanisms of genetic control involve the use of protein factors that sense chemical or physical stimuli and then modulate gene expression by selectively interacting with the relevant DNA or messenger RNA sequence. Proteins can adopt complex shapes and carry out a variety of functions that permit living systems to sense accurately their chemical and physical environments. Protein factors that respond to metabolites typically act by binding DNA to modulate transcription initiation (e.g. the lac repressor protein; Matthews, K.S., and Nichols, J.C., 1998, Prog. Nucleic Acids Res. MoI. Biol. 58, 127-164) or by binding RNA to control either transcription termination (e.g.
  • RNA can take an active role in genetic regulation. Recent studies have begun to reveal the substantial role that small non-coding RNAs play in selectively targeting mRNAs for destruction, which results in down-regulation of gene expression (e.g. see Hannon, GJ. 2002, Nature 418, 244-251 and references therein). This process of RNA interference takes advantage of the ability of short RNAs to recognize the intended mRNA target selectively via Watson-Crick base complementation, after which the bound mRNAs are destroyed by the action of proteins. RNAs are ideal agents for molecular recognition in this system because it is far easier to generate new target- specific RNA factors through evolutionary processes than it would be to generate protein factors with novel but highly specific RNA binding sites.
  • RNA Although proteins fulfill most requirements that biology has for enzyme, receptor and structural functions, RNA also can serve in these capacities. For example, RNA has sufficient structural plasticity to form numerous ribozyme domains (Cech & Golden, Building a catalytic active site using only RNA. Ih: The RNA World R. F. Gesteland, T. R. Cech, J. F. Atkins, eds., ⁇ p.321-350 (1998); Breaker, In vitro selection of catalytic polynucleotides. Chem. Rev. 97, 371-390 (1997)) and receptor domains (Osborne & Ellington, Nucleic acid selection and the challenge of combinatorial chemistry. Chem. Rev.
  • TPP thiamine pyrophosphate
  • the lysC gene encodes aspartokinase II, which catalyzes the first step in the metabolic pathway that converts L-aspartic acid into L-lysine (Belitsky, B.R. 2002. Biosynthesis of amino acids of the glutamate and aspartate families, alanine, and polyamines. In: Bacillus subtilis and its Closest Relatives: from Genes to Cells. A.L. Sonenshein, J.A. Hoch, and R. Losick, eds., ASM Press, Washington, D.C.).
  • riboswitches allows actual pieces of the natural switches to be used to construct new non-immunogenic genetic control elements, for example the aptamer (molecular recognition) domain can be swapped with other non-natural aptamers (or otherwise modified) such that the new recognition domain causes genetic modulation with user-defined effector compounds.
  • the changed switches become part of a therapy regimen-turning on, or off, or regulating protein synthesis.
  • Newly constructed genetic regulation networks can be applied in such areas as living biosensors, metabolic engineering of organisms, and in advanced forms of gene therapy treatments.
  • riboswitches Disclosed are isolated and recombinant riboswitches, recombinant constructs containing such riboswitches, heterologous sequences operably linked to such riboswitches, and cells and transgenic organisms harboring such riboswitches, riboswitch recombinant constructs, and riboswitches operably linked to heterologous sequences.
  • the heterologous sequences can be, for example, sequences encoding proteins or peptides of interest, including reporter proteins or peptides.
  • Preferred riboswitches are, or are derived from, naturally occurring riboswitches.
  • the crystalline atomic structures of riboswitches are useful in modeling and assessing the interaction of a riboswitch with a binding ligand. They are also useful in methods of identifying compounds that interact with the riboswitch.
  • chimeric riboswitches containing heterologous aptamer domains and expression platform domains. That is, chimeric riboswitches are made up an aptamer domain from one source and an expression platform domain from another source.
  • the heterologous sources can be from, for example, different specific riboswitches or different classes of riboswitches.
  • the heterologous aptamers can also come from non-riboswitch aptamers.
  • the heterologous expression platform domains can also come from non-riboswitch sources.
  • compositions and methods for selecting and identifying compounds that can activate, deactivate or block a riboswitch are also disclosed.
  • Activation of a riboswitch refers to the change in state of the riboswitch upon binding of a trigger molecule.
  • a riboswitch can be activated by compounds other than the trigger molecule and in ways other than binding of a trigger molecule.
  • the term trigger molecule is used herein to refer to molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch.
  • Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques).
  • Non- natural trigger molecules can be referred to as non-natural trigger molecules.
  • Deactivation of a riboswitch refers to the change in state of the riboswitch when the trigger molecule is not bound.
  • a riboswitch can be deactivated by binding of compounds other than the trigger molecule and in ways other than removal of the trigger molecule.
  • Blocking of a riboswitch refers to a condition or state of the riboswitch where the presence of the trigger molecule does not activate the riboswitch. Also disclosed are methods of identifying a compound that interacts with a riboswitch comprising modeling the atomic structure of the riboswitch with a test compound and determining if the test compound interacts with the riboswitch. This can be done by determining the atomic contacts of the riboswitch and test compound. Furthermore, analogs of a compound known to interact with a riboswitch can be generated by analyzing the atomic contacts, then optimizing the atomic structure of the analog to maximize interaction. These methods can be used with a high throughput screen.
  • compositions containing such compounds that can activate, deactivate or block a riboswitch.
  • compositions and methods for activating, deactivating or blocking a riboswitch are also disclosed.
  • Riboswitches function to control gene expression through the binding or removal of a trigger molecule.
  • Compounds can be used to activate, deactivate or block a riboswitch.
  • the trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch.
  • Compounds other than the trigger molecule generally can be used to deactivate or block a riboswitch.
  • Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch.
  • a riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch.
  • Riboswitches function to control gene expression through the binding or removal of a trigger molecule.
  • subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA.
  • Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA. Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.
  • compositions and methods for regulating expression of an RNA molecule, or of a gene encoding an RNA molecule, by operably linking a riboswitch to the RNA molecule by operably linking a riboswitch to the RNA molecule.
  • a riboswitch can be operably linked to an RNA molecule in any suitable manner, including, for example, by physically joining the riboswitch to the RNA molecule or by engineering nucleic acid encoding the RNA molecule to include and encode the riboswitch such that the RNA produced from the engineered nucleic acid has the riboswitch operably linked to the RNA molecule.
  • Subjecting a riboswitch operably linked to an RNA molecule of interest to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA.
  • compositions and methods for regulating expression of a naturally occurring gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch by activating, deactivating or blocking the riboswitch. If the gene is essential for survival of a cell or organism that harbors it, activating, deactivating or blocking the riboswitch can result in death, stasis or debilitation of the cell or organism. For example, activating a naturally occurring riboswitch in a naturally occurring gene that is essential to survival of a microorganism can result in death of the microorganism (if activation of the riboswitch turns off or represses expression). This is one basis for the use of the disclosed compounds and methods for antimicrobial and antibiotic effects.
  • compositions and methods for regulating expression of an isolated, engineered or recombinant gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch can be engineered or can be recombinant in any manner.
  • the riboswitch and coding region of the RNA can be heterologous, the riboswitch can be recombinant or chimeric, or both. If the gene encodes a desired expression product, activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product.
  • riboswitch As the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules. Also disclosed are compositions and methods for altering the regulation of a riboswitch by operably linking an aptamer domain to the expression platform domain of the riboswitch (which is a chimeric riboswitch).
  • the aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain.
  • Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain.
  • any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch.
  • compositions and methods for inactivating a riboswitch by covalently altering the riboswitch by, for example, crosslinking parts of the riboswitch or coupling a compound to the riboswitch.
  • Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule.
  • compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner.
  • the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound.
  • the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch.
  • a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
  • assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.
  • Identification of compounds that block a riboswitch can be accomplished in any suitable manner. For example, an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation of the riboswitch.
  • Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA.
  • biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch.
  • a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
  • methods of detecting compounds using biosensor riboswitches can include bringing into contact a test sample and a biosensor riboswitch and assessing the activation of the biosensor riboswitch. Activation of the biosensor riboswitch indicates the presence of the trigger molecule for the biosensor riboswitch in the test sample.
  • compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.
  • Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch. Also disclosed are methods for selecting, designing or deriving new riboswitches and/or new aptamers that recognize new trigger molecules.
  • Such methods can involve production of a set of aptamer variants in a riboswitch, assessing the activation of the variant riboswitches in the presence of a compound of interest, selecting variant riboswitches that were activated (or, for example, the riboswitches that were the most highly or the most selectively activated), and repeating these steps until a variant riboswitch of a desired activity, specificity, combination of activity and specificity, or other combination of properties results. Also disclosed are riboswitches and aptamer domains produced by these methods.
  • the disclosed riboswitches generally can be from any source, including naturally occurring riboswitches and riboswitches designed de novo. Any such riboswitches can be used in or with the disclosed methods. However, different types of riboswitches can be defined and some such sub-types can be useful in or with particular methods (generally as described elsewhere herein). Types of riboswitches include, for example, naturally occurring riboswitches, derivatives and modified forms of naturally occurring riboswitches, chimeric riboswitches, and recombinant riboswitches.
  • a naturally occurring riboswitch is a riboswitch having the sequence of a riboswitch as found in nature.
  • Such a naturally occurring riboswitch can be an isolated or recombinant form of the naturally occurring riboswitch as it occurs in nature. That is, the riboswitch has the same primary structure but has been isolated or engineered in a new genetic or nucleic acid context.
  • Chimeric riboswitches can be made up of, for example, part of a riboswitch of any or of a particular class or type of riboswitch and part of a different riboswitch of the same or of any different class or type of riboswitch; part of a riboswitch of any or of a particular class or type of riboswitch and any non-riboswitch sequence or component.
  • Recombinant riboswitches are riboswitches that have been isolated or engineered in a new genetic or nucleic acid context.
  • riboswitches that have the same or similar trigger molecules or riboswitches that have the same or similar overall structure (predicted, determined, or a combination). Riboswitches of the same class generally, but need not, have both the same or similar trigger molecules and the same or similar overall structure.
  • Riboswitch classes include glycine-responsive riboswitches, guanine- responsive riboswitches, adenine-responsive riboswitches, lysine-responsive riboswitches, thiamine pyrophosphate-responsive riboswitch, adenosylcobalamin- responsive riboswitches, flavin mononucleotide-responsive riboswitches, and a S- adenosyknethionine-responsive riboswitches.
  • Figures IA, IB and 1C show the G box RNA of the xpt-pbuXmKNA in B. subtilis responds allosterically to guanine.
  • Figure IA shows the consensus sequence and secondary model for the G box RNA domain that resides in the 5 ' UTR of genes that are largely involved in purine metabolism (SEQ ID NO: 1). Phylogenetic analysis is consistent with the formation of a three-stem (Pl through P3) junction. Nucleotides depicted shown as lower case letters and capitals are present in greater than 90% and 80% of the representatives examined, respectively. Encircled nucleotides exhibit base complementation, which might indicate the formation of a pseudoknot.
  • Figure IB shows sequence and ligand-induced structural alterations of the 5'-UTR of the xpt-pbuX transcriptional unit (SEQ ID NO:2).
  • the putative anti-terminator interaction is represented by the boxes. Nucleotides that undergo structural alteration as determined by in-line probing (from C) are identified with squares. The 93 xpt fragment (boxed) of the 201 xpt RNA retains guanine-binding function. Asterisks denote alterations to the RNA sequence that facilitate in vitro transcription (5 ' terminus) or that generate a restriction site (3' terminus). Nucleotide numbers begin at the first nucleotide of the natural transcription start site. The translation start codon begins at position 186.
  • Figure 1C shows guanine and related purines selectively induce structural modulation of the 93 xpt mRNA fragment.
  • Precursor RNAs Pre; 5 ' 32 P-labeled
  • Precursor RNAs were subjected to in-line probing by incubation for 40 hr in the absence (-) or presence of guanine, hypoxanthine, xanthine and adenine as indicated by G, H, X and A, respectively.
  • Lanes designated NR, Tl and " OH contain RNA that was not reacted, subjected to partial digestion with RNase Tl (G-specific cleavage), or subjected to partial alkaline digestion, respectively. Selected bands corresponding to G-specific cleavage are identified. Regions 1 through 4 identify major sites of ligand-induced modulation of spontaneous RNA cleavage.
  • Figures 2A, 2B and 2C show a molecular discrimination by the guanine-binding aptamer of the xpt-pbuXmKNA.
  • Figure 2 A shows the chemical structures and apparent Kj) values for guanine, hypoxanthine and xanthine (active natural regulators of xpt-pbuX genetic expression in B. subtilis) versus that of adenine (inactive). Differences in chemical structure relative to guanine are encircled. KQ values were established as shown in Figure 2 with the 201 xpt RNA. Numbers on guanine represent the positions of the ring nitrogen atoms.
  • Figure 2B shows chemical structures and K ⁇ values for various analogs of guanine reveal that all alterations of this purine cause a loss of binding affinity. Open circles identify K D values that most likely are significantly higher than indicated, as concentrations of analog above 500 ⁇ M were not examined in this analysis. The apparent K D values of G, H, X and A as indicated are plotted as triangles for comparison.
  • Figure 2C shows a schematic representation of the molecular recognition features of the guanine aptamer in 201 xpt.
  • FIG. 3 A shows sequence and structural features of the two guanine-specific (purE and xpt) and three adenine-specific aptamer domains that are examined in this study BS2-purE, BS3-xpt, BS5-ydhL, CP4-add, VVl-add, which are represented by SEQ ID NOS:3-7, respectively.
  • Pl through P3 identify the three base-paired stems comprising the secondary structure of the aptamer domain.
  • Lowercase nucleotides identify positions whose base identity is conserved in greater than 90% of representatives in the phylogeny.
  • the arrow identifies a nucleotide within the conserved core of the aptamer that is a determinant of ligand specificity.
  • FIG. 3B shows sequence and secondary structure of the xpt andydhL aptamers (SEQ ID NO:8). Encircled nucleotides identify positions within the ydhL aptamer that differ from those in the xpt aptamer.
  • the sequence disclosed in Figure 3C is SEQ ID NO:9. Nucleotides in xpt are numbered as described in Example 6 of U.S. Application Publication No. 2005-0053951. Other notations are as described in A.
  • Figures 4 A and 4B show the specificity of molecular recognition by the adenine aptamer from.
  • FIG. 4a Top: Chemical structures of adenine, guanine and other purine analogs that exhibit measurable binding to the SO ydhL RNA. Chemical changes relative to 2,6-DAP, which is the tightest-binding compound, are encircled. Bottom left: Plot of the apparent KQ values for various purines. Bottom right: Model for the chemical features on adenine that serve as molecular recognition contacts for ydhL. Note that the importance of N7 and N9 has not been determined. Encircled arrow indicated that a contact could exist if a hydrogen bond donor is appended to C2.
  • Figure 4b shows chemical structures of various purines that are not bound by the 80 ydhL RNA (K B values poorer than 300 ⁇ M).
  • Figures 5A, 5B, 5C, and 5D show secondary and tertiary structures of the guanine riboswitch-hypoxanthine complex.
  • Figure 5A shows left, secondary structure of the Jcpt-p&MXguanine-binding domain of the guanine riboswitch of B. subtilis (SEQ ID NO:213). Nucleotides conserved in more than 90% of known guanine riboswitches are shown in red; the numbering is consistent with that of the full-length mRNA. Colored boxes correspond to structural features shown in Figures 6 and 7. Right, tertiary architecture of the hypoxanthine-bound form.
  • FIG. 5B shows gene repression by the guanine riboswitch in the 5' untranslated region of mRNA (SEQ ID NO:214).
  • Initial transcription generates a binding domain that is primed to bind guanine (G) rapidly if it is at a sufficiently high concentration.
  • Hypoxanthine FIX, top right
  • Figure 5 C shows ribbon representation of the three-dimensional structure of the RNA-hypoxanthine complex.
  • the hypoxanthine ligand is shown in red, with its surface represented by dots.
  • Figure 5D shows the top view of the complex, emphasizing the close packing of the P2 and P3 helices.
  • Figures 6A, 6B, and 6C show recognition of hypoxanthine (HX) by the guanine- binding domain.
  • Figure 6A Stereo view of the hypoxanthine-binding pocket in the three-way junction.
  • Figure 6B Hydrogen-bonding interactions (grey broken lines) between hypoxanthine and the RNA.
  • the final model (shown in stick representation) is superimposed on a simulated annealing 2Fo-Fc omit map (orange cage), in which the atoms shown were excluded from the map calculation.
  • Figure 6C Molecular surface representation of the binding pocket of the guanine riboswitch bound to hypoxanthine (left), compared with the theophylline-binding aptamer bound to theophylline (centre) and the E. coli purR repressor bound to hypoxanthine (right).
  • Figures 7A, 7B, and 7C show stabilization of the tertiary architecture.
  • FIG. 7A One of two base quartets that form the core of the loop-loop contact.
  • the carbon atoms are colored as in Figure 5.
  • Figure 7B Side view of the loop— loop interaction, emphasizing the arrangement of base pairs and quartets.
  • the bases of the quartet shown in A are colored blue, with the hydrogen bonding between A65 and U34 shown for orientation; the bases of the other quartet are colored green.
  • the A35A64 pair is shown in yellow, with hydrogen bonds emphasizing its interactions with the 2'-hydroxyl group of U34.
  • the capping G62U63 pair is shown in red.
  • Figures 8 A and 8B show an estimation of the affinity of the riboswitch for hypoxanthine.
  • Figures 9 A and 9B show schematic representations of two types of riboswitch binding assays that can be used with high throughput screens.
  • Figure 9A a ribozyme- based assay can be used that exploits inherent action of a self-cleaving ribozyme.
  • X represents the compound being tested, GlcN6P is glucosamine-6-phosphate.
  • F and Q represent fluorophore and quencher moieties, respectfully (e.g., TAMRA and CY3).
  • Figure 9B A molecular beacon assay can be used for non-ribozyme riboswitches such as the guanine-binding RNA. Notations are described in A.
  • Figure 10 shows cobalt hexammine ions bound to the guanine riboswitch.
  • the RNA (grey) on the left is shown in the same perspective as in Figure 5B, with bound hypoxanthine in red and Co(NH 3 ) 6 3+ shown in green and blue.
  • the RNA on the right is rotated 180° with respect to the left view.
  • Figure 11 shows secondary structure of RNA GR-minimal (SEQ ID NO:215).
  • This RNA has been designed to test the effect of the tertiary interaction formed by the loops L2 and L3 upon ligand binding. In this construct the L2 and L3 loops have been ablated along with part of P2 and P3, and replaced by extremely stable UUCG tetraloops.
  • Figures 12A and 12B show structure based design of anti-riboswitch compounds.
  • Figure 12 A Atomic-resolution model of the guanine riboswitch bound to the guanine analog hypoxanthine (HX).
  • Guanine binding can include two added contacts made between the exocyclic amine position 2 of the purine ring and the oxygen atoms of C60 and U37. Blue shading identifies the channels that are present, which can allow modification of the guanine ring. Along these channels, and in the vicinity of their termini, are opportunities for new contacts to be made between the aptamer and specific guanine derivatives.
  • Figure 12B Representatives of guanine analogs (total of 26; named G-001 though G-26) were synthesized to exploit the channels in the guanine riboswitch structure. Most compounds in this collection bind to the riboswitch with dissociation constants of lower than 1 nanomolar (see Figure 13).
  • Figure 13 shows representative polyacrylamde gel electrophoresis analysis of in ⁇ line probing studies that reveal allosteric modulation of the guanine aptamer by various analogs.
  • NR, Tl and OH identify no reaction, nuclease Tl partial digestion, and alkaline partial digestion respectively.
  • In-line probing assays were conducted for ⁇ 1 day, and a dampening of spontaneous RNA cleavage product bands in the Jl/2, J2/3 and J3/1 regions are indicative of ligand binding. This dampening occurs even when 1 nM of compound is added, indicating that the dissociation constant is equal to or better than this value.
  • Figure 14 shows a GFP reporter assay for guanine riboswitch function in vivo.
  • a full-length mRNA is transcribed and GFP is expressed.
  • a transcriptional terminator is formed, repressing the expression of GFP.
  • a similar system has been created wherein the jS-galactosidase gene is under riboswitch control.
  • Figure 15 shows time-kill assay for G-014.
  • G-014 reduces the number of viable B. subtilis cells over time at a rate equal to carbenicillm.
  • Certain natural mRNAs serve as metabolite-sensitive genetic switches wherein the RNA directly binds a small organic molecule. This binding process changes the conformation of the mRNA, which causes a change in gene expression by a variety of different mechanisms. Modified versions of these natural "riboswitch.es" (created by using various nucleic acid engineering strategies) can be employed as designer genetic switches that are controlled by specific effector compounds (referred to herein as trigger molecules). The natural switches are targets for antibiotics and other small molecule therapies.
  • riboswitches allows actual pieces of the natural switches to be used to construct new non-immunogenic genetic control elements, for example the aptamer (molecular recognition) domain can be swapped with other non- natural aptamers (or otherwise modified) such that the new recognition domain causes genetic modulation with user-defined effector compounds.
  • the changed switches become part of a therapy regimen - turning on, or off, or regulating protein synthesis.
  • Newly constructed genetic regulation networks can be applied in such areas as living biosensors, metabolic engineering of organisms, and in advanced forms of gene therapy treatments.
  • RNAs are typically thought of as passive carriers of genetic information that are acted upon by protein- or small RNA-regulatory factors and by ribosomes during the process of translation. It was discovered that certain mRNAs carry natural aptamer domains and that binding of specific metabolites directly to these RNA domains leads to modulation of gene expression. Natural riboswitches exhibit two surprising functions that are not typically associated with natural RNAs. First, the mRNA element can adopt distinct structural states wherein one structure serves as a precise binding pocket for its target metabolite. Second, the metabolite-induced allosteric interconversion between structural states causes a change in the level of gene expression by one of several distinct mechanisms.
  • Riboswitches typically can be dissected into two separate domains: one that selectively binds the target (aptamer domain) and another that influences genetic control (expression platform). It is the dynamic interplay between these two domains that results in metabolite-dependent allosteric control of gene expression.
  • riboswitches Distinct classes of riboswitches have been identified and are shown to selectively recognize activating compounds (referred to herein as trigger molecules). For example, coenzyme B 12 , glycine, thiamine pyrophosphate (TPP), and flavin mononucleotide (FMN) activate riboswitches present in genes encoding key enzymes in metabolic or transport pathways of these compounds.
  • the aptamer domain of each riboswitch class conforms to a highly conserved consensus sequence and structure. Thus, sequence homology searches can be used to identify related riboswitch domains. Riboswitch domains have been discovered in various organisms from bacteria, archaea, and eukarya.
  • RNAs that carry the consensus sequence and structural features of guanine riboswitches are located in the 5'- untranslated region (UTR) of numerous genes of prokaryotes, where they control expression of proteins involved in purine salvage and biosynthesis.
  • UTR 5'- untranslated region
  • Three representatives of this phylogenetic collection bind adenine with values for apparent dissociation constant (apparent KQ) that are several orders of magnitude better than for guanine.
  • adenine is due to a single nucleotide substitution in the core of the riboswitch, wherein each representative most likely recognizes its corresponding ligand by forming a Watson/Crick base pair.
  • the adenine-specific riboswitch associated with the ydhL gene of Bacillus subtilis functions as a genetic 'ON' switch, wherein adenine binding causes a structural rearrangement that precludes formation of an intrinsic transcription terminator stem.
  • Guanine-sensing riboswitches are a class of RNA genetic control elements that modulate gene expression in response to changing concentrations of this compound.
  • the 5 '-untranslated sequence of the Escherichia coli btuB mRNA assumes a more proactive role in metabolic monitoring and genetic control.
  • the mRNA serves as a metabolite-sensing genetic switch by selectively binding coenzyme B 12 without the need for proteins. This binding event establishes a distinct RNA structure that is likely to be responsible for inhibition of ribosome binding and consequent reduction in synthesis of the cobalamin transport protein BtuB.
  • TMs discovery along with related observations described herein, supports the hypothesis that metabolic monitoring through RNA- metabolite interactions is a widespread mechanism of genetic control.
  • RNA structure probing data indicate that the thiamine pyrophosphate (TPP) riboswitch operates as an allosteric sensor of its target compound, wherein binding of TPP by the aptamer domain stabilizes a conformational state within the aptamer and within the neighboring expression platform that precludes translation.
  • the diversity of expression platforms appears to be expansive.
  • the tAiMRNA uses a Shine-Dalgarno (SD)-blocking mechanism to control translation, m contrast, the thiC RNA controls gene expression both at transcription and translation, and therefore might make use of a somewhat more complex expression platform that converts the TPP binding event into a transcription termination event and into inhibition of translation of completed mRNAs.
  • SD Shine-Dalgarno
  • riboswitches Numerous other riboswitches are known that can be used together or as part of a chimeric riboswitch along with glycine-sensing riboswitches and their components. Examples of such riboswitches and their use are described in U.S. Application Publication No. 2005-0053951, which is hereby incorporated by reference in its entirety and in particular for its description of the structure and operation of particular riboswitches.
  • Bacterial riboswitch RNAs are genetic control elements that are located primarily within the 5 '-untranslated region (5'-UTR) of the main coding region of a particular mRNA. Structural probing studies (discussed further below) reveal that riboswitch elements are generally composed of two domains: a natural aptamer (T. Hermann, D. J. Patel, Science 2000, 287, 820; L. Gold, et al., Annual Review of Biochemistry 1995, 64, 763) that serves as the ligand-binding domain, and an 'expression platform' that interfaces with RNA elements that are involved in gene expression ⁇ e.g. Shine-Dalgarno (SD) elements; transcription terminator stems).
  • SD Shine-Dalgarno
  • aptamer domains are highly conserved amongst various organisms (and even between kingdoms as is observed for the TPP riboswitch), (N.
  • Aptamer domains for riboswitch RNAs typically range from ⁇ 70 to 170 nt in length ( Figure 11 of U.S. Application Publication No. 2005-0053951). This observation was somewhat unexpected given that in vitro evolution experiments identified a wide variety of small molecule-binding aptamers, which are considerably shorter in length and structural intricacy (T. Hermann, D. J. Patel, Science 2000, 287, 820; L. Gold, et al., Annual Review of Biochemistry 1995, 64, 763; M. Famulok, Current Opinion in Structural Biology 1999, 9, 324).
  • RNA receptors that function with high affinity and selectivity.
  • Apparent KQ values for the ligand-riboswitch complexes range from low nanomolar to low micromolar. It is also worth noting that some aptamer domains, when isolated from the appended expression platform, exhibit improved affinity for the target ligand over that of the intact riboswitch. (-10 to 100-fold) (see Example 2 of U.S. Application Publication No. 2005-0053951).
  • RNA elements are composed of a GC-rich stem-loop followed by a stretch of 6-9 uridyl residues. Intrinsic terminators are widespread throughout bacterial genomes (F.
  • RNA polymerase responds to a termination signal within the 5 '-UTR in a regulated fashion (T. M. Henkin, Current Opinion in Microbiology 2000, 3, 149). During certain conditions the RNA polymerase complex is directed by external signals either to perceive or to ignore the termination signal.
  • transcription initiation might occur without regulation, control over mRNA synthesis (and of gene expression) is ultimately dictated by regulation of the intrinsic terminator.
  • one of at least two mutually exclusive mRNA conformations results in the formation or disruption of the RNA structure that signals transcription termination.
  • a trans-acting factor which in some instances is a RNA (F. J. Grundy, et al. 5 Proceedings of the National Academy of Sciences of the United States of America 2002, 99, 11121; T. M. Henkin, C. Yanofsky, Bioessays 2002, 24, 700) and in others is a protein (J. Stulke, Archives of Microbiology 2002, 177, 433), is generally required for receiving a particular intracellular signal and subsequently stabilizing one of the RNA conformations.
  • Riboswitches offer a direct link between RNA structure modulation and the metabolite signals that are interpreted by the genetic control machinery. A brief overview of the FMN riboswitch from a B. subtilis mRNA is provided below to illustrate this mechanism.
  • the xpt-pbuX opGvon (Christiansen, L.C., et al., 1997, J. Bacteriol. 179, 2540- 2550) is controlled by a riboswitch that exhibits high affinity and high selectivity for guanine.
  • This class of riboswitches is present in the 5 '-untranslated region (5 '-UTR) of five transcriptional units in B. subtilis, including that of the 12 ⁇ gene/w operon. Direct binding of guanine by mRNAs serves as a critical determinant of metabolic homeostasis for purine metabolism in certain bacteria.
  • the discovered classes of riboswitches which respond to seven distinct target molecules, control at least 68 genes in Bacillus subtilis that are of fundamental importance to central metabolic pathways.
  • guanine riboswitches that have been identified in B. subtilis.
  • This structure reveals a complex RNA fold involving several phylogenetically conserved nucleotides that create a binding pocket that almost completely envelops the ligand.
  • Hypoxanthine functions to stabilize this structure and to promote the formation of a downstream transcriptional terminator element, thereby providing a mechanism for directly repressing gene expression in response to an increase in intracellular concentrations of metabolite.
  • riboswitch a genetic "riboswitch" (referred to herein as a riboswitch) whose origin might predate the evolutionary emergence of proteins.
  • riboswitch class was discovered in bacteria that is selectively triggered by glycine.
  • a representative of these glycine-sensing RNAs from Bacillus subtilis operates as a rare genetic on switch for the gc ⁇ Toperon, which codes for proteins that form the glycine cleavage system.
  • Most glycine riboswitches integrate two ligand-binding domains that function cooperatively to more closely approximate a two-state genetic switch. This advanced form of riboswitch may have evolved to ensure that excess glycine is efficiently used to provide carbon flux through the citric acid cycle and maintain adequate amounts of the amino acid for protein synthesis.
  • riboswitches perform key regulatory roles and exhibit complex performance characteristics that previously had been observed only with protein factors.
  • each of the combinations A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are specifically contemplated and should be considered disclosed from disclosure of A, B 5 and C; D, E, and F; and the example combination A-D.
  • any subset or combination of these is also specifically contemplated and disclosed.
  • the sub-group of A- E, B-F, and C-E are specifically contemplated and should be considered disclosed from disclosure of A, B, and C; D, E, and F; and the example combination A-D.
  • Riboswitches are expression control elements that are part of an RNA molecule to be expressed and that change state when bound by a trigger molecule. Riboswitches typically can be dissected into two separate domains: one that selectively binds the target (aptamer domain) and another that influences genetic control (expression platform domain). It is the dynamic interplay between these two domains that results in metabolite-dependent allosteric control of gene expression.
  • riboswitches Disclosed are isolated and recombinant riboswitches, recombinant constructs containing such riboswitches, heterologous sequences operably linked to such riboswitches, and cells and transgenic organisms harboring such riboswitches, riboswitch recombinant constructs, and riboswitches operably linked to heterologous sequences.
  • the heterologous sequences can be, for example, sequences encoding proteins or peptides of interest, including reporter proteins or peptides.
  • Preferred riboswitches are, or are derived from, naturally occurring riboswitches.
  • the disclosed riboswitches generally can be from any source, including naturally occurring riboswitches and riboswitches designed de novo. Any such riboswitches can be used in or with the disclosed methods. However, different types of riboswitches can be defined and some such sub-types can be useful in or with particular methods (generally as described elsewhere herein). Types of riboswitches include, for example, naturally occurring riboswitches, derivatives and modified forms of naturally occurring riboswitches, chimeric riboswitches, and recombinant riboswitches.
  • a naturally occurring riboswitch is a riboswitch having the sequence of a riboswitch as found in nature.
  • Such a naturally occurring riboswitch can be an isolated or recombinant form of the naturally occurring riboswitch as it occurs in nature. That is, the riboswitch has the same primary structure but has been isolated or engineered in a new genetic or nucleic acid context.
  • Chimeric riboswitches can be made up of, for example, part of a riboswitch of any or of a particular class or type of riboswitch and part of a different riboswitch of the same or of any different class or type of riboswitch; part of a riboswitch of any or of a particular class or type of riboswitch and any non-riboswitch sequence or component.
  • Recombinant riboswitches are riboswitches that have been isolated or engineered in a new genetic or nucleic acid context.
  • Riboswitches can have single or multiple aptamer domains. Aptamer domains in riboswitches having multiple aptamer domains can exhibit cooperative binding of trigger molecules or can not exhibit cooperative binding of trigger molecules (that is, the aptamers need not exhibit cooperative binding). In the latter case, the aptamer domains can be said to be independent binders. Riboswitches having multiple aptamers can have one or multiple expression platform domains. For example, a riboswitch having two aptamer domains that exhibit cooperative binding of their trigger molecules can be linked to a single expression platform domain that is regulated by both aptamer domains. Riboswitches having multiple aptamers can have one or more of the aptamers joined via a linker. Where such aptamers exhibit cooperative binding of trigger molecules, the linker can be a cooperative linker.
  • Aptamer domains can be said to exhibit cooperative binding if they have a Hill coefficient n between x and x-1 , where x is the number of aptamer domains (or the number of binding sites on the aptamer domains) that are being analyzed for cooperative binding.
  • a riboswitch having two aptamer domains can be said to exhibit cooperative binding if the riboswitch has Hill coefficient between 2 and 1. It should be understood that the value of x used depends on the number of aptamer domains being analyzed for cooperative binding, not necessarily the number of aptamer domains present in the riboswitch. This makes sense because a riboswitch can have multiple aptamer domains where only some exhibit cooperative binding.
  • riboswitches that have the same or similar trigger molecules or riboswitches that have the same or similar overall structure (predicted, determined, or a combination). Riboswitches of the same class generally, but need not, have both the same or similar trigger molecules and the same or similar overall structure.
  • Riboswitch classes include glycine-responsive riboswitches, guanine- responsive riboswitches, adenine-responsive riboswitches, lysine-responsive riboswitches, thiamine pyrophosphate-responsive riboswitch, adenosylcobalar ⁇ in- responsive riboswitches, flavin mononucleotide-responsive riboswitches, and a S- adenosyhnethionine-responsive riboswitches. Also disclosed are chimeric riboswitches containing heterologous aptamer domains and expression platform domains.
  • chimeric riboswitches are made up an aptamer domain from one source and an expression platform domain from another source.
  • the heterologous sources can be from, for example, different specific riboswitches, different types of riboswitches, or different classes of riboswitches.
  • the heterologous aptamers can also come from non-riboswitch aptarners.
  • the heterologous expression platform domains can also come from non-riboswitch sources.
  • Riboswitches can be modified from other known, developed or naturally- occurring riboswitches.
  • switch domain portions can be modified by changing one or more nucleotides while preserving the known or predicted secondary, tertiary, or both secondary and tertiary structure of the riboswitch.
  • both nucleotides in a base pair can be changed to nucleotides that can also base pair. Changes that allow retention of base pairing are referred to herein as base pair conservative changes.
  • Modified or derivative riboswitches can also be produced using in vitro selection and evolution techniques, ha general, in vitro evolution techniques as applied to riboswitches involve producing a set of variant riboswitches where part(s) of the riboswitch sequence is varied while other parts of the riboswitch are held constant. Activation, deactivation or blocking (or other functional or structural criteria) of the set of variant riboswitches can then be assessed and those variant riboswitches meeting the criteria of interest are selected for use or further rounds of evolution.
  • Useful base riboswitches for generation of variants are the specific and consensus riboswitches disclosed herein.
  • Consensus riboswitches can be used to inform which part(s) of a riboswitch to vary for in vitro selection and evolution. Also disclosed are modified riboswitches with altered regulation.
  • the regulation of a riboswitch can be altered by operably linking an aptamer domain to the expression platform domain of the riboswitch (which is a chimeric riboswitch).
  • the aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain.
  • Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain.
  • any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch.
  • Riboswitches can be inactivated by covalently altering the riboswitch (by, for example, crosslinking parts of the riboswitch or coupling a compound to the riboswitch). Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule. Also disclosed are biosensor riboswitches.
  • Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA.
  • biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch.
  • a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
  • Biosensor riboswitches can be used in various situations and platforms. For example, biosensor riboswitches can be used with solid supports, such as plates, chips, strips and wells.
  • New riboswitches and/or new aptamers that recognize new trigger molecules can be selected for, designed or derived from known riboswitches. This can be accomplished by, for example, producing a set of aptamer variants in a riboswitch, assessing the activation of the variant riboswitches in the presence of a compound of interest, selecting variant riboswitches that were activated (or, for example, the riboswitches that were the most highly or the most selectively activated), and repeating these steps until a variant riboswitch of a desired activity, specificity, combination of activity and specificity, or other combination of properties results.
  • Particularly useful aptamer domains can form a stem structure referred to herein as the Pl stem structure (or simply Pl).
  • the Pl stems of a variety of riboswitclies are shown in Figure 11 of U.S. Application Publication No. 2005-0053951.
  • the hybridizing strands in the Pl stem structure are referred to as the aptamer strand (also referred to as the PIa strand) and the control strand (also referred to as the PIb strand).
  • the control strand can form a stem structure with both the aptamer strand and a sequence in a linked expression platform that is referred to as the regulated strand (also referred to as the PIc strand).
  • control strand can form alternative stem structures with the aptamer strand (PIa) and the regulated strand (PIc).
  • PIb aptamer strand
  • PIc regulated strand
  • Activation and deactivation of a riboswitch results in a shift from one of the stem structures to the other (from Pla/Plb to Plb/Plc or vice versa).
  • the formation of the Plb/Plc stem structure affects expression of the RNA molecule containing the riboswitch.
  • Riboswitches that operate via this control mechanism are referred to herein as alternative stem structure riboswitches (or as alternative stem riboswitches).
  • Some glycine-responsive riboswitches having two aptamers utilize this mechanism using a P 1 stem in the second aptamer.
  • any aptamer domain can be adapted for use with any expression platform domain by designing or adapting a regulated strand in the expression platform domain to be complementary to the control strand of the aptamer domain.
  • the sequence of the aptamer and control strands of an aptamer domain can be adapted so that the control strand is complementary to a functionally significant sequence in an expression platform.
  • the control strand can be adapted to be complementary to the Shine-Dalgarno sequence of an RNA such that, upon formation of a stem structure between the control strand and the SD sequence, the SD sequence becomes inaccessible to ribosomes, thus reducing or preventing translation initiation.
  • the aptamer strand would have corresponding changes in sequence to allow formation of a Pl stem in the aptamer domain.
  • one the Pl stem of the activating aptamer (the aptamer that interacts with the expression platform domain) need be designed to form a stem structure with the SD sequence.
  • a transcription terminator can be added to an RNA molecule
  • RNA (most conveniently in an untranslated region of the RNA) where part of the sequence of the transcription terminator is complementary to the control strand of an aptamer domain
  • the sequence will be the regulated strand. This will allow the control sequence of the aptamer domain to form alternative stem structures with the aptamer strand and the regulated strand, thus either forming or disrupting a transcription terminator stem upon activation or deactivation of the riboswitch.
  • Any other expression element can be brought under the control of a riboswitch by similar design of alternative stem structures.
  • the speed of transcription and spacing of the riboswitch and expression platform elements can be important for proper control.
  • Transcription speed can be adjusted by, for example, including polymerase pausing elements (e.g., a series of uridine residues) to pause transcription and allow the riboswitch to form and sense trigger molecules.
  • polymerase pausing elements e.g., a series of uridine residues
  • the antiterminator sequence is sequestered and is unavailable for formation of an antiterminator structure ( Figure 12 of U.S. Application Publication No. 2005-0053951).
  • the antiterminator can form once its nucleotides emerge from the polymerase.
  • RNAP then breaks free of the pause site only to reach another U- stretch and pause again.
  • the transcriptional terminator then forms only if the terminator nucleotides are not tied up by the antiterminator.
  • regulatable gene expression constructs comprising a nucleic acid molecule encoding an RNA comprising a riboswitch operably linked to a coding region, wherein the riboswitch regulates expression of the RNA, wherein the riboswitch and coding region are heterologous.
  • the riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain and the expression platform domain are heterologous.
  • the riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain comprises a Pl stem, wherein the Pl stem comprises an aptamer strand and a control strand, wherein the expression platform domain comprises a regulated strand, wherein the regulated strand, the control strand, or both have been designed to form a stem structure.
  • the riboswitch can comprise two or more aptamer domains and an expression platform domain, wherein at least one of the aptamer domains and the expression platform domain are heterologous.
  • the riboswitch can comprise two or more aptamer domains and an expression platform domain, wherein at least one of the aptamer domains comprises a Pl stem, wherein the Pl stem comprises an aptamer strand and a control strand, wherein the expression platform domain comprises a regulated strand, wherein the regulated strand, the control strand, or both have been designed to form a stem structure.
  • riboswitches wherein the riboswitch is a non-natural derivative of a naturally-occurring riboswitch.
  • the riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain and the expression platform domain are heterologous.
  • the riboswitch can be derived from a naturally-occurring guanine-responsive riboswitch, adenine-responsive riboswitch, lysine-responsive riboswitch, thiamine pyrophosphate-responsive riboswitch, adenosylcobalamin- responsive riboswitch, flavin mononucleotide-responsive riboswitch, glycine-responsive riboswitch, or a S-adenosylmethionine-responsive riboswitch.
  • the riboswitch can be activated by a trigger molecule, wherein the riboswitch produces a signal when activated by the trigger molecule.
  • Table 5 discloses examples of various guanine riboswitches.
  • the alignment of sequences in Table 5 is in Sweden format as output by the cmalign program of the INFERNAL software package.
  • the last two entries in Table 5 reflect the consensus sequence and structure for the motif as reflected by a computer algorithm described by Eddy, S. R. (2003). INFERNAL. Version 0.55. Distributed by the author. Dept. of
  • riboswitches and riboswitch constructs are described and referred to herein. It is specifically contemplated that any specific riboswitch or riboswitch construct or group of riboswitches or riboswitch constructs can be excluded from some aspects of the invention disclosed herein. For example, fusion of the xpt-pbuX riboswitch with a reporter gene could be excluded from a set of riboswitches fused to reporter genes. As another exmple any combination of the riboswitches listed in Table 5 can be specifically included or specifically excluded form any aspect of the disclosed methods and compositions. 1. Aptamer Domains
  • Aptamers are nucleic acid segments and structures that can bind selectively to particular compounds and classes of compounds.
  • Riboswitches have aptamer domains that, upon binding of a trigger molecule result in a change in the state or structure of the riboswitch. In functional riboswitches, the state or structure of the expression platform domain linked to the aptamer domain changes when the trigger molecule binds to the aptamer domain.
  • Aptamer domains of riboswitches can be derived from any source, including, for example, natural aptamer domains of riboswitches, artificial aptamers, engineered, selected, evolved or derived aptamers or aptamer domains.
  • Aptamers in riboswitches generally have at least one portion that can interact, such as by forming a stem structure, with a portion of the linked expression platform domain. This stem structure will either form or be disrupted upon binding of the trigger molecule.
  • Consensus aptamer domains of a variety of natural riboswitches are shown in Figure 11 of U.S. Application Publication No. 2005-0053951 and elsewhere herein. These aptamer domains (including all of the direct variants embodied therein) can be used in riboswitches.
  • the consensus sequences and structures indicate variations in sequence and structure. Aptamer domains that are within the indicated variations are referred to herein as direct variants.
  • These aptamer domains can be modified to produce modified or variant aptamer domains. Conservative modifications include any change in base paired nucleotides such that the nucleotides in the pair remain complementary.
  • Moderate modifications include changes in the length of stems or of loops (for which a length or length range is indicated) of less than or equal to 20% of the length range indicated. Loop and stem lengths are considered to be "indicated” where the consensus structure shows a stem or loop of a particular length or where a range of lengths is listed or depicted. Moderate modifications include changes in the length of stems or of loops (for which a length or length range is not indicated) of less than or equal to 40% of the length range indicated. Moderate modifications also include and functional variants of unspecified portions of the aptamer domain. Unspecified portions of the aptamer domains are indicated by solid lines in Figure 11 of U.S. Application Publication No. 2005-0053951.
  • the Pl stem and its constituent strands can be modified in adapting aptamer domains for use with expression platforms and RNA molecules. Such modifications, which can be extensive, are referred to herein as Pl modifications.
  • Pl modifications include changes to the sequence and/or length of the Pl stem of an aptamer domain.
  • aptamer domains shown in Figure 11 of U.S. Application Publication No. 2005-0053951 are particularly useful as initial sequences for producing derived aptamer domains via in vitro selection or in vitro evolution techniques.
  • Aptamer domains of the disclosed riboswitches can also be used for any other purpose, and in any other context, as aptamers.
  • aptamers can be used to control ribozymes, other molecular switches, and any RNA molecule where a change in structure can affect function of the RNA.
  • Expression platform domains are a part of riboswitches that affect expression of the RNA molecule that contains the riboswitch.
  • Expression platform domains generally have at least one portion that can interact, such as by forming a stem structure, with a portion of the linked aptamer domain. This stem structure will either form or be disrupted upon binding of the trigger molecule.
  • the stem structure generally either is, or prevents formation of, an expression regulatory structure.
  • An expression regulatory structure is a structure that allows, prevents, enhances or inhibits expression of an RNA molecule containing the structure. Examples include Shine-Dalgarno sequences, initiation codons, transcription terminators, and stability and processing signals.
  • Trigger molecules are molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch. Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques).
  • Riboswitches function to control gene expression through the binding or removal of a trigger molecule.
  • Compounds can be used to activate, deactivate or block a riboswitch.
  • the trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch.
  • Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch.
  • a riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch.
  • RNA molecules for altering expression of an RNA molecule, or of a gene encoding an RNA molecule, where the RNA molecule includes a riboswitch.
  • Riboswitches function to control gene expression through the binding or removal of a trigger molecule.
  • subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA.
  • Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA. Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.
  • riboswitch activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product. If the gene encodes an inducer or repressor of gene expression or of another cellular process, activation, deactivation or blocking of the riboswitch can result in induction, repression, or de ⁇ repression of other, regulated genes or cellular processes. Many such secondary regulatory effects are known and can be adapted for use with riboswitches. An advantage of riboswitches as the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules.
  • compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner.
  • the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound.
  • the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch.
  • a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
  • assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.
  • Identification of compounds that block a riboswitch can be accomplished in any suitable manner. For example, an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation , of the riboswitch.
  • Compounds can also be identified using the atomic crystalline structure of a riboswitch.
  • An example of such a crystalline atomic structure of a natural guanine- responsive riboswitch can be found in Figure 5.
  • the atomic coordinates of the atomic structure are listed in Table 6.
  • the riboswitch is shown bound to hypoxanthine.
  • the crystal structure at 1.95 A resolution of the purine- binding domain of the guanine riboswitch from the xpt-pbuX operon of B. subtilis bound to hypoxanthine, a prevalent metabolite in the bacterial purine salvage pathway is shown.
  • This structure reveals a complex RNA fold involving several phylogenetically conserved nucleotides that create a binding pocket that almost completely envelops the ligand. Hypoxanthine functions to stabilize this structure and to promote the formation of a downstream transcriptional terminator element, thereby providing a mechanism for directly repressing gene expression in response to an increase in intracellular concentrations of metabolite.
  • Compounds can be identified using the crystalline structure of a riboswitch by, for example, modeling the atomic structure of the riboswitch with a test compound; and determining if the test compound interacts with the riboswitch. This can be done by using a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch.
  • Compounds can also be identified by, for example, asessing the fit between the riboswtich and a compound known to bind the riboswitch (such as the trigger molecule), identify sites where the compound can be changed with little or no obvious adverse effects on binding of the compound, and incorporating one or more such alterations to produce a new compound.
  • the method of identifying compounds that interact with a riboswitch can also involve production of the compounds so identified. Typically the method first utilizes a 3-dimensional structure of the riboswitch with a compound, also referred to as a "known compound” or "known target". Any of the trigger molecules and compounds disclosed herein can be used as such a known compound.
  • the structure of the riboswitch can be determined using any known means, such as crystallography or solution NMR spectroscopy. That structure can also be obtained through computer molecular modeling simulation programs, such as AutoDock.
  • the methods can involve determining the amount of binding, such as determining the binding energy, between a riboswitch, and a potential compound for that riboswitch.
  • An active compound is a compound that has some activity against a riboswitch, such as inhibiting the riboswitch's activity or enhancing the riboswitch's activity, hi addition, the potential compound can be an analog, which has some structural relationship to a known compound for the molecule. Any of the trigger molecules, known compounds, and compounds disclosed herein can be used as the basis of or to derive a potential compound.
  • the identity or relationship of the structure, properties, interaction or binding parameters, and the like of the known compound and potential compound can be viewed in number of ways. For example, any of the measures or interaction parameters that can be measured or assessed using the structural model, and such measures and parameters obtained for a known compound and a potential compound can be compared. One can look at the identity between the entire known compound and the potential compound. One can also look at the identity between the potential compound, such as an analog, and the know compound only in the domain where the potential compound interacts with the riboswitch.
  • Another sub-domain is a sub-domain of moieties or atoms which actually contact the riboswitch.
  • the identity can be, for example, greater than 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or higher.
  • the potential compounds exist in a family of potential compounds, i.e. a set of analogs, all of which have some structural relationship to the known compound for the riboswitch.
  • a family consisting of any number of members can be screened. The maximum number of members in the family is only limited by the amount of computer power available to screen each member in a desired amount of time.
  • the methods can involve at least one template structure of the riboswitch and a target, often this would be with a known target. It is not required that this structure be existent, as it can be generated, in some cases during the disclosed methods, using standard structure determination techniques. It is preferred that a real structure exist at the time the methods are employed.
  • the methods can also involve modeling the structure of the potential compound, using information from the structure of the known compound. This modeling can be performed in any way, and as described herein.
  • the conformation and position of the potential compound can be held fixed during the calculations; that is, it can be assumed that the riboswitch binds in exactly the same orientation to the potential compound as it does to a known compound. Then, a binding energy (or other property or parameter) can be determined between the riboswitch and the potential compound, and if the binding energy (or other property or parameter) meets certain criteria, then the potential compound can be designated as an actual compound, i.e. one that is likely to interact with the riboswitch.
  • binding energy it should be understood that any property or parameter involving the interaction or modeling of a compound and a riboswitch can be used.
  • the criterion can be that the computed binding energy of the riboswitch with the potential compound is similar to, or more favorable than, the computed binding energy of the same riboswitch with a known compound.
  • an actual compound can be a compound where the computed binding energy as discussed herein is, for example, at least 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 100%, 101%, 102%, 103%, 104%, 105%, 106%, 107%, 108%, 109%, 110%, 120%, 130%, 140%, 150%, 200%, 250%, 300%, 350%, 400%, 450%, 500%, 600%, 700%, 800%, 900%, 1000%, or greater than that of the known compound binding energy.
  • An actual compound can also be a compound which after ordering all potential compounds in terms of the strength of their binding energies, are the compounds which are in the top 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1% of computed binding strengths, of for example, a set of potential compounds where the set is at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
  • a potential compound is identified, as disclosed herein, traditional testing and analysis can be performed, such as performing a biological assay using the riboswitch and the actual compound to further define the ability of the actual compound to interact with and/or modulate the riboswitch.
  • the disclosed methods can include the step of assaying the activity of the riboswitch and compound, as well as performing, for example, combinatorial chemistry studies using libraries based on the riboswitch, for example.
  • Energy calculations can be based on, for example, molecular or quantum mechanics.
  • Molecular mechanics approximates the energy of a system by summing a series of empirical functions representing components of the total energy like bond stretching, van der Waals forces, or electrostatic interactions.
  • Quantum mechanics methods use various degrees of approximation to solve the Schr ⁇ dinger equation. These methods deal with electronic structure, allowing for the characterization of chemical reactions.
  • Potential compounds of the riboswitch can be identified. This can be accomplished by selecting potential compounds with a given similarity to the known compound. For example, compounds in the same family as the known compound can be selected.
  • atoms can be built in that were unresolved or absent from the crystal structures of the potential compound. This can be done, for example, using the PRODRG webserver http ⁇ /www.davapcl.bioch.dundee.ac.uk./programs/prodrg, or standard molecular modeling programs such as Insightll, Quanta (both at www.accekys.com), CNS (Brunger et al., Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D 54, 905-921 (1998)), or any other molecular modeling system capable of preparing the riboswitch structure.
  • the binding energy (or other property or parameter) of the potential compound and riboswitch can then be calculated.
  • the sampling of sidechain positions and the computation of the binding thermodynamics can be accomplished using an empirical function that models the energy of the potential compound-molecule as a sum of electrostatic and van der Waals interactions between all pairs of atoms within the model.
  • Any other computational method for scoring the binding energy of the potential compound with the riboswitch can be used (H. Gohlke, & G. Klebe. Approaches to the description and prediction of the binding affinity of small-molecule ligands to macromolecular receptors. Angew. Chem. Int. Ed. 41, 2644-4676 (2002)).
  • scoring methods include, but are not limited to, those implemented in programs such as AutoDock (G. M. Morris et al. Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J. Comput. Chem. 19, 1639-1662 (1998)), Gold (G. Jones et al. Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. J. MoI. Biol. 245, 43-53 (1995)), Chem-Score (M. D. Eldridge et al. J. Comput-Aided MoI. Des. 11, 425-445 (1997)) and Drug-Score (H. Gohlke et al. Knowledge-based scoring function to predict protein-ligand interactions. J MoI. Biol. 295, 337-356 (2000)).
  • AutoDock G. M. Morris et al. Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J. Comput. Chem. 19, 1639-1662 (1998)),
  • Rotamer libraries are known to those of skill in the art and can be obtained from a variety of sources, including the internet. Rotamers are low energy side-chain conformations.
  • the use of a library of rotamers allows for the modeling of a structure to try the most likely side-chain conformations, saving time and producing a structure that is more likely to be correct.
  • the use of a library of rotamers can be restricted to those residues that are within a given region of the potential compound, for example, at the binding site, or within a specified distance of the compound. The latter distance can be set at any desired length, for example, the potential compound can be 2, 3, 4, 5, 6, 7, 8, or 9 A from any atom of the molecule.
  • Electrostatic interactions between every pair of atoms can be calculated, for example, using a Coulombic model with the formula:
  • Partial atomic charges can be taken from existing parameter sets that have been developed to describe charge distributions in molecules.
  • Example parameter sets include, but are not limited to, PARSE (D. A. Sitkoff et al. Accurate calculation of hydration free-energies using macroscopic solvent models. J. Phys. Chem. 98, 1978-1988 (1994)), CHARMM (MacKerell et al. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J. Phys. Chem. B 102, 3586-3616, 1998) and AMBER (W. D. Cornell et al. A 2 nd generation force-field for the simulation of proteins, nucleic-acids, and organic-molecules. J. Am. Chem. Soc.
  • Partial charges for atoms can be assigned either by analogy with those of similar functional groups, or by empirical assignment methods such as that implemented in the PRODRG server (D. M. F. van Aalten et al. PRODRG, a program for generating molecular topologies and unique molecular descriptors from coordinates of small molecules. J. Comput.-Aided MoI. Design 10, 255-262 (1996)), or by the use of standard quantum mechanical calculation methods (for example, C. I. Bayly et al. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges - the RESP model. J. Phys. Chem.
  • the electrostatic interaction can also be calculated by more elaborate methodologies that incorporate electrostatic desolvation effects. These can include explicit solvent and implicit solvent models: in the former, water molecules are directly included in the calculations, whereas in the latter, the effects of water are described by a dielectric continuum approach. Specific examples of implicit solvent methods for calculating electrostatic interactions include but are not limited to: Poisson-Boltzmann based methods and Generalized Born methods (M. Feig & C. L. Brooks. Recent advances in the development and application of implicit solvent models in biomolecule simulations. Curr. Opin. Struct. Biol. 14, 217-224 (2004)). van der Waals and hydrophobic interactions between pairs of atoms (where both atoms are either sulfur or carbon) can be calculated using a simple Lennard- Jones formalism with the following equation:
  • E vdw G ⁇ 7 rep 12 /r 12 ⁇ .
  • G is an energy
  • r is the distance between the two atoms
  • ⁇ rep determines the distance at which the repulsive interaction is equal to G.
  • Hydrophobic interactions between atoms can also be calculated using a variety of other methods known to those skilled in the art.
  • the energetic contribution can be calculated as being proportional to the amount of solvent accessible surface area of the ligand and receptor that is buried when the complex is formed.
  • Such contributions can be expressed in terms of interactions between pairs of atoms, such as in the method proposed by Street & Mayo (A. G. Street & S. L. Mayo. Pairwise calculation of protein solvent-accessible surface areas. Folding & Design 3, 253-258 (1998)). Any other implementation of a formalism for describing hydrophobic or van der Waals or other energetic contributions can be included in the calculations.
  • Binding energies can be calculated for each potential compound-riboswitch interaction. For example, Monte Carlo sampling can be conducted in the presence and absence of the riboswitch, and the average energy in each simulation calculated. A binding energy for the riboswitch with the potential compound can then be calculated as the difference between the two calculated average energies. The computed binding energy of a potential compound with the riboswitch can be compared with the computed binding energy of a known compound with the riboswitch to determine if the potential compound is likely to be an actual compound. These results can then be confirmed using experimental data, wherein the actual interaction between the riboswitch and compound can be measured.
  • Examples of methods that can be used to determine an actual interaction between the riboswitch and the compound include but are not limited to: equilibrium dialysis measurements (wherein binding of a radioactive form of the compound to the riboswitch is detected), enzyme inhibition assays (wherein the activity of the riboswitch can be monitored in the presence and absence of the compound), and chemical shift perturbation measurements (wherein binding of the riboswitch to the potential compound is monitored by observing changes in NMR chemical shifts of atoms).
  • the riboswitch can be a guanine riboswitch, for example.
  • This riboswitch can be selected, for example, from riboswitches in Table 5.
  • further testing can be carried out to determine the actual interaction between the riboswitch and the compound.
  • multiple different approaches can be used to detect binding RNAs, including allosteric ribozyme assays using gel- based and chip-based detection methods, and in-line probing assays. High throughput testing can also be accomplished by using, for example, fluorescent detection methods.
  • RNA-cleaving ribozyme For example, the natural catalytic activity of a glucosarnine-6-phosphate sensing riboswitch that controls gene expression by activating RNA-cleaving ribozyme can be used.
  • This ribozyme can be reconfigured to cleave separate substrate molecules with multiple turnover kinetics. Therefore, a fluorescent group held in proximity to a quenching group can be uncoupled (and therefore become more fluorescent) if a compound triggers ribozyme function.
  • molecular beacon technology can be employed. This creates a system that suppresses fluorescence if a compound prevents the beacon from docking to the riboswitch RNA.
  • High-throughput screening can also be used to reveal entirely new chemical scaffolds that also bind to riboswitch RNAs either with standard or non- standard modes of molecular recognition. Since riboswitches are the first major form of natural metabolite-binding RNAs to be discovered, there has been little effort made previously to create binding assays that can be adapted for high-throughput screening. Multiple different approaches can be used to detect metabolite binding RNAs, including allosteric ribozyme assays using gel-based and chip-based detection methods, and in-line probing assays. Also disclosed are compounds made by identifying a compound that activates, deactivates or blocks a riboswitch and manufacturing the identified compound.
  • compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.
  • compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.
  • Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch.
  • the term "substituted" is contemplated to include all permissible substituents of organic compounds.
  • the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, and aromatic and nonaromatic substituents of organic compounds.
  • Illustrative substituents include, for example, those described below.
  • the permissible substituents can be one or more and the same or different for appropriate organic compounds.
  • the heteroatoms, such as nitrogen can have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valences of the heteroatoms.
  • substitution or “substituted with” include the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, e.g. , a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc.
  • a 1 ,” “A 2 ,” “A 3 ,” and “A 4 " are used herein as generic symbols to represent various specific substituents. These symbols can be any substituent, not limited to those disclosed herein, and when they are defined to be certain substituents in one instance, they can, in another instance, be defined as some other substituents.
  • alkyl as used herein is a branched or unbranched saturated hydrocarbon group of 1 to 24 carbon atoms, such as methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, t-butyl, pentyl, hexyl, heptyl, octyl, nonyl, decyl, dodecyl, tetradecyl, hexadecyl, eicosyl, tetracosyl, and the like.
  • the alkyl group can also be substituted or unsubstituted.
  • the alkyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below.
  • groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below.
  • lower alkyl is an alkyl group with 6 or fewer carbon atoms, e.g., methyl, ethyl, propyl, isopropyl, butyl, sec-butyl, iso-butyl, tert-butyl, pentyl, hexyl, and the like.
  • alkyl is generally used to refer to both unsubstituted alkyl groups and substituted alkyl groups; however, substituted alkyl groups are also specifically referred to herein by identifying the specific substituent(s) on the alkyl group.
  • halogenated alkyl specifically refers to an alkyl group that is substituted with one or more halide, e.g., fluorine, chlorine, bromine, or iodine.
  • alkoxyalkyl specifically refers to an alkyl group that is substituted with one or more alkoxy groups, as described below.
  • alkylamino specifically refers to an alkyl group that is substituted with one or more amino groups, as described below, and the like.
  • alkyl is used in one instance and a specific term such as “halogenated alkyl” is used in another, it is not meant to imply that the term “alkyl” does not also refer to specific terms such as “halogenated alkyl” and the like.
  • cycloalkyl refers to both unsubstituted and substituted cycloalkyl moieties
  • the substituted moieties can, in addition, be specifically identified herein; for example, a particular substituted cycloalkyl can be referred to as, e.g., an "alkylcycloalkyl.”
  • a substituted alkoxy can be specifically referred to as, e.g., a "halogenated alkoxy”
  • a particular substituted alkenyl can be, e.g., an "alkenylalcohol,” and the like.
  • alkoxy as used herein is an alkyl group bonded through a single, terminal ether linkage; that is, an “alkoxy” group can be defined as — OA 1 where A 2 is alkyl as defined above.
  • alkenyl as used herein is a hydrocarbon group of from 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon double bond.
  • the alkenyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below.
  • alkynyl as used herein is a hydrocarbon group of 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon triple bond.
  • the alkynyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below.
  • aryl as used herein is a group that contains any carbon-based aromatic group including, but not limited to, benzene, naphthalene, phenyl, biphenyl, phenoxybenzene, and the like.
  • aryl also includes "heteroaryl,” which is defined as a group that contains an aromatic group that has at least one heteroatom incorporated within the ring of the aromatic group. Examples of heteroatoms include, but are not limited to, nitrogen, oxygen, sulfur, and phosphorus.
  • non-heteroaryl which is also included in the term “aryl,” defines a group that contains an aromatic group that does not contain a heteroatom. The aryl group can be substituted or unsubstituted.
  • the aryl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein.
  • the term "biaryl” is a specific type of aryl group and is included in the definition of aryl. Biaryl refers to two aryl groups that are bound together via a fused ring structure, as in naphthalene, or are attached via one or more carbon-carbon bonds, as in biphenyl.
  • cycloalkyl as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms.
  • examples of cycloalkyl groups include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, etc.
  • heterocycloalkyl is a cycloalkyl group as defined above where at least one of the carbon atoms of the ring is substituted with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus.
  • the cycloalkyl group and heterocycloalkyl group can be substituted or unsubstituted.
  • the cycloalkyl group and heterocycloalkyl group can be substituted with one or more groups including, but not limited to, alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein.
  • Examples of cycloalkenyl groups include, but are not limited to, cyclopropenyl, cyclobutenyl, cyclopentenyl, cyclopentadienyl, cyclohexenyl, cyclohexadienyl, and the like.
  • heterocycloalkenyl is a type of cycloalkenyl group as defined above, and is included within the meaning of the term “cycloalkenyl,” where at least one of the carbon atoms of the ring is substituted with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus.
  • the cycloalkenyl group and heterocycloalkenyl group can be substituted or unsubstituted.
  • the cycloalkenyl group and heterocycloalkenyl group can be substituted with one or more groups including, but not limited to, alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein.
  • cyclic group is used herein to refer to either aryl groups, non-aryl groups (i.e., cycloalkyl, heterocycloalkyl, cycloalkenyl, and heterocycloalkenyl groups), or both. Cyclic groups have one or more ring systems that can be substituted or unsubstituted. A cyclic group can contain one or more aryl groups, one or more non-aryl groups, or one or more aryl groups and one or more non-aryl groups.
  • amine or “amino” as used herein are represented by the formula NA 1 A 2 A 3 , where A 1 , A 2 , and A 3 can be, independently, hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
  • carboxylic acid as used herein is represented by the formula
  • esters as used herein is represented by the formula — OC(O)A 1 or
  • a 1 can be an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
  • ether as used herein is represented by the formula A 1 OA 2 , where A 1 and A 2 can be, independently, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
  • ketone as used herein is represented by the formula A 1 C(O)A 2 , where A 1 and A 2 can be, independently, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
  • halide refers to the halogens fluorine, chlorine, bromine, and iodine.
  • hydroxy as used herein is represented by the formula — OH.
  • sulfo-oxo is represented by the formulas — S(O)A 1 (i.e., “sulfonyl"), A 1 S(O)A 2 (i.e., “sulfoxide”), -S(O) 2 A 1 , A 1 SO 2 A 2 (i.e., "sulfone"),
  • — OS(O) 2 A 1 or — OS(O) 2 OA 1 , where A 1 and A 2 can be hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
  • a 1 and A 2 can be hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
  • sulfonylamino or "sulfonamide” as used herein is represented by the formula -S(O) 2 NH-.
  • thiol as used herein is represented by the formula — SH. :
  • R n where n is some integer can independently possess one or more of the groups listed above.
  • R 10 contains an aryl group
  • one of the hydrogen atoms of the aryl group can optionally be substituted with a hydroxyl group, an alkoxy group, an amine group, an alkyl group, a halide, and the like.
  • a first group can be incorporated within second group or, alternatively, the first group can be pendant (i.e., attached) to the second group.
  • an alkyl group comprising an amino group the amino group can be incorporated within the backbone of the alkyl group.
  • the amino group can be attached to the backbone of the alkyl group.
  • the nature of the group(s) that is (are) selected will determine if the first group is embedded or attached to the second group.
  • a formula with chemical bonds shown only as solid lines and not as wedges or dashed lines contemplates each possible isomer, e.g., each enantiomer and diastereomer, and a mixture of isomers, such as a racemic or scalemic mixture.
  • Certain materials, compounds, compositions, and components disclosed herein can be obtained commercially or readily synthesized using techniques generally known to those of skill in the art.
  • the starting materials and reagents used in preparing the disclosed compounds and compositions are either available from commercial suppliers such as Aldrich Chemical Co., (Milwaukee, Wis.), Acros Organics (Morris Plains, NJ.), Fisher Scientific (Pittsburgh, Pa.), or Sigma (St.
  • Suitable hydrogen bond donors that can be present in the disclosed compounds, for example as one or more of R 1 , R 2 , and R 9 , are moieties that contain a polar hydrogen bond, such as when a hydrogen atom is bonded to a more electronegative atom like C, N, O, or S.
  • R 13 can be one or more of -H, -NH 2 , -OH, alkoxy, -JV- morpholino, or halide.
  • Suitable hydrogen bond acceptors that can be present in the disclosed compounds, for example as one or more of R 3 , R 7 , and R 10 , are moieties that contain a nonbonded electron pair. Nonbonded electron pairs typically exist on N, O, S, and halogen atoms. Examples of suitable hydrogen bond acceptors for R , R include, but are not limited to, N, O, S, and SO 2 .
  • Still further examples of hydrogen bond acceptors for R 10 include, but are not limited to, -OH, -SH, -NH 2 , -CO 2 H, -alkoxy, -aryloxy, -benzyloxy, - halide, -NHalkyl, -NHalkoxy, -NHC(O)alkyl, -NHCO 2 alkyl, -NHCO 2 CH 2 -R 12 , - NHC(O)NH 2 , -NH-NH 2 , -NH-NHalkyl, -NH-NHalkoxy, -S0 2 alkyl, -S0 2 aryl, -NH- S0 2 alkyl, -NH-SO 2 -R 12 , -NH-OR 12 , -NH-R 12 , -NH-NH-NH-R 12 , -
  • R 14 N R 6 , where R 6 is C.
  • R 14 can be -H, -NH 2 , -OH, -SH, -CO 2 H, - C0 2 alkyl, -C0 2 aryl, -C(O)NH 2 , substituted or unsubstituted alkyl, alkoxy, alkoxy, aryloxy, or benzyloxy, -NHalkyl, -NHalkoxy, -NHC(O)alkyl, -NHCO 2 alkyl, - NHC(O)NH 2 , -SO 2 alkyl, -S0 2 aryl, -NH-SO 2 alkyl, -NH-SO 2 -R 12 , -NH-OR 12 , -NH-R 12 , or -NH-CH 2 -R 12 , where R 12 is as defined above.
  • R 7 is N or CH
  • R 10 is -H, -OH, -SH, -alkoxy, halide, -NH 2 , -NHOH, -NHalkyl, -NHalkoxy, - NHaryl, -NHaryloxy, -NHbenzyl, -NHbenzyoxy, -NHC(O)alkyl, -NHCO 2 alkyl, NHCO 2 benzyl, -NHNH 2 , -NHNHalkyl, -NHNHaryl, or -NHNHbenzyl; and R 2 and R 7 are as defined above.
  • Additional compounds useful with guanine-responsive riboswitches include compounds having the formula
  • the compound can bind a guanine-responsive riboswitch or derivative thereof, where, when the compound is bound to a guanine-responsive riboswitch or derivative, R 7 serves as a hydrogen bond acceptor, R 10 serves as a hydrogen bond donor, R 11 serves as a hydrogen bond acceptor, R 12 serves as a hydrogen bond donor, where R 13 is H, H 2 or is not present, where R 1 , R 2 , R 3 , R 4 , R 5 , R 6 , R 8 , and R 9 are each independently
  • Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not guanine, hypoxanthine, xanthine, or N 2 -methylguanme. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a guanine-responsive riboswitch.
  • contacts and interactions (such as hydrogen bond donation or acceptance) described herein for compounds interacting with riboswitches are preferred but are not essential for interaction of a compound with a riboswitch.
  • compounds can interact with riboswitches with less affinity and/or specificity than compounds having the disclosed contacts and interactions.
  • different or additional functional groups on the compounds can introduce new, different and/or compensating contacts with the riboswitches.
  • large functional groups can be used at, for example, R and R .
  • Such functional groups can have, and can be designed to have, contacts and interactions with other part of the riboswitch.
  • Such contacts and interactions can compensate for contacts and interactions of the trigger molecules and core structure.
  • Compounds useful with adenine-responsive riboswitches include compounds having the formula
  • the compound can bind an adenine-responsive riboswitch or derivative thereof, where, when the compound is bound to an adenine-responsive riboswitch or derivative, R 1 , R 3 and R 7 serve as hydrogen bond acceptors, and R 10 and R 11 serve as hydrogen bond donors, where R 12 is H, H 2 or is not present, where R 1 , R 2 , R 3 , R 4 , R 5 , R 6 ,
  • R 8 , and R 9 are each independently C, N, O, or S, and where each independently represent a single or double bond. Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not adenine, 2,6-diaminopurine, or 2-amino purine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate an adenine-responsive riboswitch.
  • Compounds useful with lysine-responsive riboswitches include compounds having the formula
  • the compound can bind a lysine-responsive riboswitch or derivative thereof, where R 2 and R 3 are each positively charged, where R 1 is negatively charged, where R 4 is C, N, O, or S, and where each independently represent a single or double bond.
  • R 2 and R 3 are each NH 3 + and where R 1 is O " .
  • Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not lysine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a lysine-responsive ribo switch.
  • the compound can bind a TPP-responsive riboswitch or derivative thereof, where R 1 is positively charged, where R 2 and R 3 are each independently C, O, or S, where R 4 is CH 3 , NH 2 , OH, SH, H or not present, where R 5 is CH 3 , NH 2 , OH, SH, or H, where R 6 is C or N, and where each independently represent a single or double bond.
  • R 1 is phosphate, diphosphate or triphosphate.
  • Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not TPP, TP or thiamine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a TPP- responsive riboswitch. D. Constructs, Vectors and Expression Systems
  • the disclosed riboswitches can be used in with any suitable expression system. Recombinant expression is usefully accomplished using a vector, such as a plasmid.
  • the vector can include a promoter operably linked to riboswitch-encoding sequence and
  • RNA to be expression e.g., RNA encoding a protein
  • the vector can also include other elements required for transcription and translation.
  • vector refers to any carrier containing exogenous DNA.
  • vectors are agents that transport the exogenous nucleic acid into a cell without degradation and include a promoter yielding expression of the nucleic acid in the cells into which it is delivered.
  • Vectors include but are not limited to plasmids, viral nucleic acids, viruses, phage nucleic acids, phages, cosmids, and artificial chromosomes.
  • a variety of prokaryotic and eukaryotic expression vectors suitable for carrying riboswitch-regulated constructs can be produced.
  • Such expression vectors include, for example, pET, pET3d, pCR2.1, pBAD, pUC, and yeast vectors.
  • the vectors can be used, for example, in a variety of in vivo and in vitro situation.
  • Viral vectors include adenovirus, adeno-associated virus, herpes virus, vaccinia virus, polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HTV backbone. Also useful are any viral families which share the properties of these viruses which make them suitable for use as vectors. Retroviral vectors, which are described in Verma (1985), include Murine Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of MMLV as a vector.
  • viral vectors typically contain, nonstructural early genes, structural late genes, an RNA polymerase III transcript, inverted terminal repeats necessary for replication and encapsidation, and promoters to control the transcription and replication of the viral genome.
  • viruses When engineered as vectors, viruses typically have one or more of the early genes removed and a gene or gene/promoter cassette is inserted into the viral genome in place of the removed viral DNA.
  • a “promoter” is generally a sequence or sequences of DNA that function when in a relatively fixed location in regard to the transcription start site.
  • a “promoter” contains core elements required for basic interaction of RNA polymerase and transcription factors and can contain upstream elements and response elements.
  • Enhancer generally refers to a sequence of DNA that functions at no fixed distance from the transcription start site and can be either 5' (Laimins, 1981) or 3'
  • enhancers can be within an nitron (Banerji et al., 1983) as well as within the coding sequence itself (Osborne et al., 1984). They are usually between 10 and 300 bp in length, and they function in cis. Enhancers function to increase transcription from nearby promoters. Enhancers, like promoters, also often contain response elements that mediate the regulation of transcription. Enhancers often determine the regulation of expression.
  • Expression vectors used in eukaryotic host cells can also contain sequences necessary for the termination of transcription which can affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor protein. The 3' untranslated regions also include transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation region. One benefit of this region is that it increases the likelihood that the transcribed unit will be processed and transported like mRNA.
  • the identification and use of polyadenylation signals in expression constructs is well established. It is preferred that homologous polyadenylation signals be used in the transgene constructs.
  • the vector can include nucleic acid sequence encoding a marker product.
  • This marker product is used to determine if the gene has been delivered to the cell and once delivered is being expressed.
  • Preferred marker genes are the E. CoIi lacZ gene which encodes /3-galactosidase and green fluorescent protein.
  • the marker can be a selectable marker. When such selectable markers are successfully transferred into a host cell, the transformed host cell can survive if placed under selective pressure. There are two widely used distinct categories of selective regimes. The first category is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to grow independent of a supplemented media.
  • the second category is dominant selection which refers to a selection scheme used in any cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells which have a novel gene would express a protein conveying drug resistance and would survive the selection. Examples of such dominant selection use the drugs neomycin, (Southern and Berg, 1982), mycophenolic acid, (Mulligan and Berg, 1980) or hygromycin (Sugden et al., 1985).
  • Gene transfer can be obtained using direct transfer of genetic material, in but not limited to, plasmids, viral vectors, viral nucleic acids, phage nucleic acids, phages, cosmids, and artificial chromosomes, or via transfer of genetic material in cells or carriers such as cationic liposomes.
  • Transfer vectors can be any nucleotide construction used to deliver genes into cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part of recombinant retrovirus or adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)).
  • Preferred viral vectors are Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HIV backbone. Also preferred are any viral families which share the properties of these viruses which make them suitable for use as vectors.
  • Preferred retroviruses include Murine Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of MMLV as a vector. Retroviral vectors are able to carry a larger genetic payload, i.e., a transgene or marker gene, than other viral vectors, and for this reason are a commonly used vector. However, they are not useful in non-proliferating cells.
  • Adenovirus vectors are relatively stable and easy to work with, have high titers, and can be delivered in aerosol formulation, and can transfect non-dividing cells.
  • Pox viral vectors are large and have several sites for inserting genes, they are thermostable and can be stored at room temperature.
  • a preferred embodiment is a viral vector which has been engineered so as to suppress the immune response of the host organism, elicited by the viral antigens.
  • Preferred vectors of this type will carry coding regions for Ltiterleukin 8 or 10.
  • Viral vectors have higher transaction (ability to introduce genes) abilities than do most chemical or physical methods to introduce genes into cells.
  • viral vectors contain, nonstructural early genes, structural late genes, an RNA polymerase III transcript, inverted terminal repeats necessary for replication and encapsidation, and promoters to control the transcription and replication of the viral genome.
  • viruses When engineered as vectors, viruses typically have one or more of the early genes removed and a gene or gene/promoter cassette is inserted into the viral genome in place of the removed viral DNA. Constructs of this type can carry up to about 8 kb of foreign genetic material.
  • the necessary functions of the removed early genes are typically supplied by cell lines which have been engineered to express the gene products of the early genes in trans. i. Retroviral Vectors
  • a retrovirus is an animal virus belonging to the virus family of Retro viridae, including any types, subfamilies, genus, or tropisms.
  • Retroviral vectors in general, are described by Verma, I.M., Retroviral vectors for gene transfer. In Microbiology- 1985, American Society for Microbiology, pp. 229-232, Washington, (1985), which is incorporated by reference herein. Examples of methods for using retroviral vectors for gene therapy are described in U.S. Patent Nos. 4,868,116 and 4,980,286; PCT applications WO 90/02806 and WO 89/07136; and Mulligan, (Science 260:926-932 (1993)); the teachings of which are incorporated herein by reference.
  • a retrovirus is essentially a package which has packed into it nucleic acid cargo.
  • the nucleic acid cargo carries with it a packaging signal, which ensures that the replicated daughter molecules will be efficiently packaged within the package coat.
  • a packaging signal In addition to the package signal, there are a number of molecules which are needed in cis, for the replication, and packaging of the replicated virus.
  • a retroviral genome contains the gag, pol, and env genes which are involved in the making of the protein coat. It is the gag, pol, and env genes which are typically replaced by the foreign DNA that it is to be transferred to the target cell.
  • Retrovirus vectors typically contain a packaging signal for incorporation into the package coat, a sequence which signals the start of the gag transcription unit, elements necessary for reverse transcription, including a primer binding site to bind the tRNA primer of reverse transcription, terminal repeat sequences that guide the switch of RNA strands during DNA synthesis, a purine rich sequence 5' to the 3' LTR that serve as the priming site for the synthesis of the second strand of DNA synthesis, and specific sequences near the ends of the LTRs that enable the insertion of the DNA state of the retrovirus to insert into the host genome.
  • a packaging signal for incorporation into the package coat a sequence which signals the start of the gag transcription unit, elements necessary for reverse transcription, including a primer binding site to bind the tRNA primer of reverse transcription, terminal repeat sequences that guide the switch of RNA strands during DNA synthesis, a purine rich sequence 5' to the 3' LTR that serve as the priming site for the synthesis of the second strand of DNA synthesis, and specific sequences near the ends of the
  • gag, pol, and env genes allow for about 8 kb of foreign sequence to be inserted into the viral genome, become reverse transcribed , and upon replication be packaged into a new retroviral particle.
  • This amount of nucleic acid is sufficient for the delivery of a one to many genes depending on the size of each transcript. It is preferable to include either positive or negative selectable markers along with other genes in the insert.
  • a packaging cell line is a cell line which has been transfected or transformed with a retrovirus that contains the replication and packaging machinery, but lacks any packaging signal.
  • the vector carrying the DNA of choice is transfected into these cell lines, the vector containing the gene of interest is replicated and packaged into new retroviral particles, by the machinery provided in cis by the helper cell. The genomes for the machinery are not packaged because they lack the necessary signals.
  • viruses have been shown to achieve high efficiency gene transfer after direct, in vivo delivery to airway epithelium, hepatocytes, vascular endothelium, CNS parenchyma and a number of other tissue sites (Morsy, J. Clin. Invest. 92:1580- 1586 (1993);.Kirshenbaum, J. Clin. Invest. 92:381-387 (1993); Roessler, J. Clin. Invest.
  • Recombinant adenoviruses achieve gene transduction by binding to specific cell surface receptors, after which the virus is internalized by receptor-mediated endocytosis, in the same manner as wild type or replication-defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970); Brown and Burlingham, J. Virology 12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449 (1985); Seth, et al., J. Virol. 51:650-655 (1984); Seth, et al., MoI. Cell. Biol. 4:1528-1533 (1984); Varga et al., J. Virology 65:6061-6070 (1991); Wickham et al., Cell 73:309-319 (1993)).
  • a preferred viral vector is one based on an adenovirus which has had the El gene removed and these virons are generated in a cell line such as the human 293 cell line. In another preferred embodiment both the El and E3 genes are removed from the adenovirus genome.
  • Another type of viral vector is based on an adeno-associated virus (AAV). This defective parvovirus is a preferred vector because it can infect many cell types and is nonpathogenic to humans.
  • AAV type vectors can transport about 4 to 5 kb and wild type AAV is known to stably insert into chromosome 19. Vectors which contain this site specific integration property are preferred.
  • An especially preferred embodiment of this type of vector is the P4.1 C vector produced by Avigen, San Francisco, CA, which can contain the herpes simplex virus thymidine kinase gene, HSV-tk, and/or a marker gene, such as the gene encoding the green fluorescent protein, GFP.
  • the inserted genes in viral and retroviral usually contain promoters, and/or enhancers to help control the expression of the desired gene product.
  • a promoter is generally a sequence or sequences of DNA that function when in a relatively fixed location in regard to the transcription start site.
  • a promoter contains core elements required for basic interaction of RNA polymerase and transcription factors, and can contain upstream elements and response elements.
  • Preferred promoters controlling transcription from vectors in mammalian host cells can be obtained from various sources, for example, the genomes of viruses such as: polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably cytomegalovirus, or from heterologous mammalian promoters, e.g. beta actin promoter.
  • the early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction fragment which also contains the SV40 viral origin of replication (Fiers et al., Nature, 273: 113 (1978)).
  • the immediate early promoter of the human cytomegalovirus is conveniently obtained as a HindIII E restriction fragment (Greenway, PJ. et al., Gene 18: 355-360 (1982)).
  • promoters from the host cell or related species also are useful herein.
  • Enhancer generally refers to a sequence of DNA that functions at no fixed distance from the transcription start site and can be either 5' (Laimins, L. et al., Proc. Natl. Acad. Sci. 78: 993 (1981)) or 3' (Lusky, M.L., et al., Mol. Cell Bio. 3: 1108 (1983)) to the transcription unit. Furthermore, enhancers can be within an intron (Banerji, J.L. et al., Cell 33: 729 (1983)) as well as within the coding sequence itself
  • Enhancers function to increase transcription from nearby promoters. Enhancers also often contain response elements that mediate the regulation of transcription. Promoters can also contain response elements that mediate the regulation of transcription. Enhancers often determine the regulation of expression of a gene. While many enhancer sequences are now known from mammalian genes (globin, elastase, albumin, ⁇ -fetoprotein and insulin), typically one will use an enhancer from a eukaryotic cell virus.
  • Preferred examples are the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
  • the promoter and/or enhancer can be specifically activated either by light or specific chemical events which trigger their function.
  • Systems can be regulated by reagents such as tetracycline and dexamethasone.
  • reagents such as tetracycline and dexamethasone.
  • irradiation such as gamma irradiation, or alkylating chemotherapy drugs.
  • promoter and/or enhancer region be active in all eukaryotic cell types.
  • a preferred promoter of this type is the CMV promoter (650 bases).
  • Other preferred promoters are S V40 promoters, cytomegalovirus (full length promoter), and retroviral vector LTF.
  • GFAP glial fibrillary acetic protein
  • Expression vectors used in eukaryotic host cells can also contain sequences necessary for the termination of transcription which can affect rnRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor protein. The 3' untranslated regions also include transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation region. One benefit of this region is that it increases the likelihood that the transcribed unit will be processed and transported like mRNA.
  • the identification and use of polyadenylation signals in expression constructs is well established. It is preferred that homologous polyadenylation signals be used in the transgene constructs.
  • the polyadenylation region is derived from the SV40 early polyadenylation signal and consists of about 400 bases. It is also preferred that the transcribed units contain other standard sequences alone or in combination with the above sequences improve expression from, or stability of, the construct.
  • the vectors can include nucleic acid sequence encoding a marker product. This marker product is used to determine if the gene has been delivered to the cell and once delivered is being expressed.
  • Preferred marker genes are the E. CoIi lacZ gene which encodes ⁇ -galactosidase and green fluorescent protein.
  • the marker can be a selectable marker.
  • suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR), thymidine kinase, neomycin, neomycin analog G418, hydromycin, and puromycin.
  • DHFR dihydrofolate reductase
  • thymidine kinase thymidine kinase
  • neomycin neomycin analog G418, hydromycin
  • puromycin puromycin.
  • the transformed mammalian host cell can survive if placed under selective pressure.
  • the first category is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to grow independent of a supplemented media.
  • Two examples are: CHO DHFR" cells and mouse LTK" cells. These cells lack the ability to grow without the addition of such nutrients as thymidine or hypoxanthine. Because these cells lack certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing nucleotides are provided in a supplemented media.
  • An alternative to supplementing the media is to introduce an intact DHFR or TK gene into cells lacking the respective genes, thus altering their growth requirements.
  • the second category is dominant selection which refers to a selection scheme used in any cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells which would express a protein conveying drug resistance and would survive the selection. Examples of such dominant selection use the drugs neomycin, (Southern P. and Berg, P., J. Molec. Appl. Genet. 1: 327 (1982)), mycophenolic acid, (Mulligan, R.C. and Berg, P. Science 209: 1422 (1980)) or hygromycin, (Sugden, B. et al., MoI. Cell. Biol. 5: 410-413 (1985)).
  • the three examples employ bacterial genes under eukaryotic control to convey resistance to the appropriate drug G418 or neomycin (geneticin), xgpt (mycophenolic acid) or hygromycin, respectively.
  • Others include the neomycin analog G418 and puramycin.
  • Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA.
  • biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch.
  • a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
  • a reporter protein or peptide can be used for assessing activation of a riboswitch, or for biosensor riboswitches.
  • the reporter protein or peptide can be encoded by the RNA the expression of which is regulated by the riboswitch.
  • the examples describe the use of some specific reporter proteins.
  • the use of reporter proteins and peptides is well known and can be adapted easily for use with riboswitches.
  • the reporter proteins can be any protein or peptide that can be detected or that produces a detectable signal.
  • the presence of the protein or peptide can be detected using standard techniques (e.g., radioimmunoassay, radio-labeling, immunoassay, assay for enzymatic activity, absorbance, fluorescence, luminescence, and Western blot). More preferably, the level of the reporter protein is easily quantifiable using standard techniques even at low levels.
  • reporter proteins include luciferases, green fluorescent proteins and their derivatives, such as firefly luciferase (FL) from Photinus pyralis, and Renilla luciferase (RL) from Renilla reniformis.
  • FL firefly luciferase
  • RL Renilla luciferase
  • Conformation dependent labels refer to all labels that produce a change in fluorescence intensity or wavelength based on a change in the form or conformation of the molecule or compound (such as a riboswitch) with which the label is associated.
  • Examples of conformation dependent labels used in the context of probes and primers include molecular beacons, Amplifluors, FRET probes, cleavable FRET probes, TaqMan probes, scorpion primers, fluorescent triplex oligos including but not limited to triplex molecular beacons or triplex FRET probes, fluorescent water-soluble conjugated polymers, PNA probes and QPNA probes.
  • Such labels and, in particular, the principles of their function, can be adapted for use with riboswitches.
  • Several types of conformation dependent labels are reviewed in Schweitzer and Kingsmore, Curr. Opin. Biotech. 12:21-27 (2001).
  • Stem quenched labels are fluorescent labels positioned on a nucleic acid such that when a stem structure forms a quenching moiety is brought into proximity such that fluorescence from the label is quenched.
  • the stem is disrupted (such as when a riboswitch containing the label is activated)
  • the quenching moiety is no longer in proximity to the fluorescent label and fluorescence increases. Examples of this effect can be found in molecular beacons, fluorescent triplex oligos, triplex molecular beacons, triplex FRET probes, and QPNA probes, the operational principles of which can be adapted for use with riboswitches.
  • Stem activated labels are labels or pairs of labels where fluorescence is increased or altered by formation of a stem structure.
  • Stem activated labels can include an acceptor fluorescent label and a donor moiety such that, when the acceptor and donor are in proximity (when the nucleic acid strands containing the labels form a stem structure), fluorescence resonance energy transfer from the donor to the acceptor causes the acceptor to fluoresce.
  • Stem activated, labels are typically pairs of labels positioned on nucleic acid molecules (such as riboswitches) such that the acceptor and donor are brought into proximity when a stem structure is formed in the nucleic acid molecule.
  • the donor moiety of a stem activated label is itself a fluorescent label, it can release energy as fluorescence (typically at a different wavelength than the fluorescence of the acceptor) when not in proximity to an acceptor (that is, when a stem structure is not formed). When the stem structure forms, the overall effect would then be a reduction of donor fluorescence and an increase in acceptor fluorescence.
  • FRET probes are an example of the use of stem activated labels, the operational principles of which can be adapted for use with riboswitches.
  • detection labels can be incorporated into detection probes or detection molecules or directly incorporated into expressed nucleic acids or proteins.
  • a detection label is any molecule that can be associated with nucleic acid or protein, directly or indirectly, and which results in a measurable, detectable signal, either directly or indirectly. Many such labels are known to those of skill in the art. Examples of detection labels suitable for use in the disclosed method are radioactive isotopes, fluorescent molecules, phosphorescent molecules, enzymes, antibodies, and ligands.
  • fluorescent labels include fluorescein isothiocyanate (FITC), 5,6-carboxymethyl fluorescein, Texas red, nitrobenz-2-oxa-l,3-diazol-4-yl (NBD), coumarin, dansyl chloride, rhodamine, amino-methyl coumarin (AMCA), Eosin, Erythrosin, BODIPY ® , Cascade Blue ® , Oregon Green ® , pyrene, lissamine, xanthenes, acridines, oxazines, phycoerythrin, macrocyclic chelates of lanthanide ions such as quantum dyeTM, fluorescent energy transfer dyes, such as thiazole orange-ethidium heterodimer, and the cyanine dyes Cy3, Cy3.5, Cy5, Cy5.5 and Cy7.
  • FITC fluorescein isothiocyanate
  • NBD nitrobenz-2-oxa-l,3-diazol-4-
  • Examples of other specific fluorescent labels include 3-Hydroxypyrene 5,8,10-Tri Sulfonic acid, 5-Hydroxy , Tryptamine (5-HT), Acid Fuchsin, Alizarin Complexon, Alizarin Red, Allophycocyanin, Aminocoumarin, Anthroyl Stearate, Astrazon Brilliant Red 4G, Astrazon Orange R, Astrazon Red 6B, Astrazon Yellow 7 GLL, Atabrine, Auramine, Aurophosphine, Aurophosphine G, BAO 9 (Bisaminophenyloxadiazole), BCECF, Berberine Sulphate, Bisbenzamide, Blancophor FFG Solution, Blancophor SV, Bodipy Fl, Brilliant Sulphoflavin FF, Calcien Blue, Calcium Green, Calcofluor RW Solution, Calcofiuor White, Calcophor White ABT Solution, Calcophor White Standard Solution, Carbostyryl, Cascade Yellow, Catecholamine, Chinacrine, Coriphosphine
  • Genacryl Brilliant Red B Genacryl Brilliant Yellow 10GF, Genacryl Pink 3G, Genacryl Yellow 5GF, Gloxalic Acid, Granular Blue, Haematoporphyrin, Indo-1, Intrawhite Cf Liquid, Leucophor PAF, Leucophor SF, Leucophor WS, Lissamine Rhodamine B200 (RD200), Lucifer Yellow CH, Lucifer Yellow VS, Magdala Red, Marina Blue, Maxilon Brilliant Flavin 10 GFF, Maxilon Brilliant Flavin 8 GFF, MPS (Methyl Green Pyronine Stilbene), Mithramycin, NBD Amine, Nitrobenzoxadidole, Noradrenaline, Nuclear Fast Red, Nuclear Yellow, Nylosan Brilliant Flavin E8G, Oxadiazole, Pacific Blue, Pararosaniline (Feulgen), Phorwite AR Solution, Phorwite BKL, Phorwite Rev, Phorwite RPA, Phosphine 3R, Phthalocyanine, Phycoerythr
  • Useful fluorescent labels are fluorescein (5-carboxyfluorescein-N- hydroxysuccinimide ester), rhodamine (5,6-tetramethyl rhodamine), and the cyanine dyes Cy3, Cy3.5, Cy5, Cy5.5 and Cy7.
  • the absorption and emission maxima, respectively, for these fluors are: FITC (490 nm; 520 nm), Cy3 (554 nm; 568 nni), Cy3.5 (581 nm; 588 nm), Cy5 (652 nm: 672 nm), Cy5.5 (682 nm; 703 nm) and Cy7 (755 nm; 778 nm), thus allowing their simultaneous detection.
  • fluorescein dyes include 6-carboxyfluorescein (6-FAM), 2',4',1,4,-tetrachlorofluorescein (TET), 2 I ,4',5 I ,7',l,4-hexachlorofluorescein (HEX), 2',7'-dimethoxy-4 I , 5'-dichloro-6- carboxyrhodamine (JOE), 2'-chloro-5'-fluoro-7',8 '-fused phenyl- l,4-dichloro-6 ⁇ carboxyfluorescein (NED), and 2'-chloro-7'-phenyl-l,4-dichloro-6-carboxyfluorescein (VIC).
  • Fluorescent labels can be obtained from a variety of commercial sources, including Amersham Pharmacia Biotech, Piscataway, NJ; Molecular Probes, Eugene, OR; and Research Organics, Cleveland, Ohio.
  • Additional labels of interest include those that provide for signal only when the probe with which they are associated is specifically bound to a target molecule, where such labels include: "molecular beacons” as described in Tyagi & Kramer, Nature Biotechnology (1996) 14:303 and EP 0 070 685 Bl.
  • Other labels of interest include those described in U.S. Pat. No. 5,563,037; WO 97/17471 and WO 97/17076.
  • Labeled nucleotides are a useful form of detection label for direct incorporation into expressed nucleic acids during synthesis.
  • detection labels that can be incorporated into nucleic acids include nucleotide analogs such as BrdUrd (5- bromodeoxyuridine, Hoy and Schinike, Mutation Research 290:217-230 (1993)), aminoallyldeoxyuridine (Henegariu et al:, Nature Biotechnology 18:345-348 (2000)), 5- methylcytosine (Sano et al., Biochim. Biophys. Acta 951:157-165 (1988)), bromouridine (Wansick et al, J.
  • Suitable fluorescence-labeled nucleotides are Fluorescein-isothiocyanate-dUTP, Cyanine-3-dUTP and Cyanine-5-dUTP (Yu et al, Nucleic Acids Res., 22:3226-3232 (1994)).
  • a preferred nucleotide analog detection label for DNA is BrdUrd (bromodeoxyuridine, BrdUrd, BrdU, BUdR, Sigma- Aldrich Co).
  • Other useful nucleotide analogs for incorporation of detection label into DNA are AA-dUTP (aminoallyl-deoxyuridine triphosphate, Sigma- Aldrich Co.), and 5-methyl-dCTP (Roche Molecular Biochemicals).
  • a useful nucleotide analog for incorporation of detection label into RNA is biotin- 16-UTP (biotin- 16- uridine-5 '-triphosphate, Roche Molecular Biochemicals). Fluorescein, Cy3, and Cy5 can be linked to dUTP for direct labeling. Cy3.5 and Cy7 are available as avidin or anti- digoxygenin conjugates for secondary detection of biotin- or digoxygenin-labeled probes.
  • Biotin can be detected using streptavidin-alkaline phosphatase conjugate (Tropix, Inc.), which is bound to the biotin and subsequently detected by chemiluminescence of suitable substrates (for example, chemiluminescent substrate CSPD: disodium, 3-(4- methoxyspiro-[l,2,-dioxetane-3-2'-(5'-chloro)tricyclo [3.3. l.l 3 ' 7 ]decane]-4-yl) phenyl phosphate; Tropix, Inc.).
  • suitable substrates for example, chemiluminescent substrate CSPD: disodium, 3-(4- methoxyspiro-[l,2,-dioxetane-3-2'-(5'-chloro)tricyclo [3.3. l.l 3 ' 7 ]decane]-4-yl
  • Labels can also be enzymes, such as alkaline phosphatase, soybean peroxidase, horseradish peroxidase and polymerases, that can be detected, for example, with chemical signal amplification or by using a substrate to the enzyme which produces light (for example, a chemiluminescent 1,2-dioxetane substrate) or fluorescent signal. Molecules that combine two or more of these detection labels are also considered detection labels. Any of the known detection labels can be used with the disclosed probes, tags, molecules and methods to label and detect activated or deactivated riboswitches or nucleic acid or protein produced in the disclosed methods. Methods for detecting and measuring signals generated by detection labels are also known to those of skill in the art.
  • radioactive isotopes can be detected by scintillation counting or direct visualization; fluorescent molecules can be detected with fluorescent spectrophotometers; phosphorescent molecules can be detected with a spectrophotometer or directly visualized with a camera; enzymes can be detected by detection or visualization of the product of a reaction catalyzed by the enzyme; antibodies can be detected by detecting a secondary detection label coupled to the antibody.
  • detection molecules are molecules which interact with a compound or composition to be detected and to which one or more detection labels are coupled.
  • variants of riboswitches, aptamers, expression platforms, genes and proteins herein typically have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89,
  • homology 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to a stated sequence or a native sequence.
  • the homology can be calculated after aligning the two sequences so that the homology is at its highest level.
  • Optimal alignment of sequences for comparison can be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection.
  • nucleic acids can be obtained by for example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. USA 86:7706-7710, 1989, Jaeger et ⁇ . Methods Enzymol. 183:281-306, 1989 which are herein incorporated by reference for at least material related to nucleic acid alignment. It is understood that any of the methods typically can be used and that in certain instances the results of these various methods can differ, but the skilled artisan understands if identity is found with at least one of these methods, the sequences would be said to have the stated identity.
  • a sequence recited as having a particular percent homology to another sequence refers to sequences that have the recited homology as calculated by any one or more of the calculation methods described above.
  • a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using the Zuker calculation method even if the first sequence does not have 80 percent homology to the second sequence as calculated by any of the other calculation methods.
  • a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using both the Zuker calculation method and the Pearson and Lipman calculation method even if the first sequence does not have 80 percent homology to the second sequence as calculated by the Smith and Waterman calculation method, the
  • hybridization typically means a sequence driven interaction between at least two nucleic acid molecules, such as a primer or a probe and a riboswitch or a gene. Sequence driven interaction means an interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives in a nucleotide specific manner.
  • G interacting with C or A interacting with T are sequence driven interactions.
  • sequence driven interactions occur on the Watson-Crick face or Hoogsteen face of the nucleotide.
  • the hybridization of two nucleic acids is affected by a number of conditions and parameters known to those of skill in the art. For example, the salt concentrations, pH, and temperature of the reaction all affect whether two nucleic acid molecules will hybridize. Parameters for selective hybridization between two nucleic acid molecules are well known to those of skill in the art.
  • selective hybridization conditions can be defined as stringent hybridization conditions. For example, stringency of hybridization is controlled by both temperature and salt concentration of either or both of the hybridization and washing steps.
  • the conditions of hybridization to achieve selective hybridization can involve hybridization in high ionic strength solution (6X SSC or 6X SSPE) at a temperature that is about 12- 25 0 C below the Tm (the melting temperature at which half of the molecules dissociate from their hybridization partners) followed by washing at a combination of temperature and salt concentration chosen so that the washing temperature is about 5°C to 20°C below the Tm.
  • the temperature and salt conditions are readily determined empirically in preliminary experiments in which samples of reference DNA immobilized on filters are hybridized to a labeled nucleic acid of interest and then washed under conditions of different stringencies.
  • Hybridization temperatures are typically higher for DNA-RNA and PvNA-RNA hybridizations.
  • the conditions can be used as described above to achieve stringency, or as is known in the art (Sambrook et al., Molecular Cloning: A
  • a preferable stringent hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous solution) in 6X SSC or 6X SSPE followed by washing at 68 0 C.
  • Stringency of hybridization and washing if desired, can be reduced accordingly as the degree of complementarity desired is decreased, and further, depending upon the G-C or A-T richness of any area wherein variability is searched for.
  • stringency of hybridization and washing if desired, can be increased accordingly as homology desired is increased, and further, depending upon the G-C or A-T richness of any area wherein high homology is desired, all as known in the art.
  • selective hybridization is by looking at the amount (percentage) of one of the nucleic acids bound to the other nucleic acid.
  • selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the limiting nucleic acid is bound to the non- limiting nucleic acid.
  • the non-limiting nucleic acid is in for example, 10 or 100 or 1000 fold excess.
  • This type of assay can be performed at under conditions where both the limiting and non-limiting nucleic acids are for example, 10 fold or 100 fold or 1000 fold below their k d , or where only one of the nucleic acid molecules is 10 fold or 100 fold or 1000 fold or where one or both nucleic acid molecules are above their k d .
  • Another way to define selective hybridization is by looking at the percentage of nucleic acid that gets enzymatically manipulated under conditions where hybridization is required to promote the desired enzymatic manipulation.
  • selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the nucleic acid is enzymatically manipulated under conditions which promote the enzymatic manipulation, for example if the enzymatic manipulation is DNA extension, then selective hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the nucleic acid molecules are extended.
  • nucleic acid based including, for example, riboswitches, aptamers, and nucleic acids that encode riboswitches and aptamers.
  • the disclosed nucleic acids can be made up of for example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and other molecules are discussed herein. It is understood that for example, when a vector is expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U.
  • nucleic acid molecule is introduced into a cell or cell environment through for example exogenous delivery, it is advantageous that the nucleic acid molecule be made up of nucleotide analogs that reduce the degradation of the nucleic acid molecule in the cellular environment.
  • riboswitches, aptamers, expression platforms and any other oligonucleotides and nucleic acids can be made up of or include modified nucleotides (nucleotide analogs). Many modified nucleotides are known and can be used in oligonucleotides and nucleic acids.
  • a nucleotide analog is a nucleotide which contains some type of modification to either the base, sugar, or phosphate moieties.
  • Modifications to the base moiety would include natural and synthetic modifications of A, C, G, and T/U as well as different purine or pyrimidine bases, such as uracil-5-yl, hypoxanthin-9-yl (I), and 2-aminoadenin-9-yl.
  • a modified base includes but is not limited to 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-pro ⁇ ynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytos
  • nucleotide analogs such as 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-pro ⁇ ynyluracil and 5-propynylcytosine. 5-methylcytosine can increase the stability of duplex formation.
  • Other modified bases are those that function as universal bases. Universal bases include 3-nitropyrrole and 5- nitroindole. Universal bases substitute for the normal bases but have no bias in base pairing. That is, universal bases can base pair with any other base.
  • Base modifications often can be combined with for example a sugar modification, such as 2'-O- methoxyethyl, to achieve unique properties such as increased duplex stability.
  • a sugar modification such as 2'-O- methoxyethyl
  • There are numerous United States patents such as 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; and 5,681,941, which detail and describe a range of base modifications.
  • Each of these patents is herein incorporated by reference in its entirety, and specifically for their description of base modifications, their synthesis, their use, and their incorporation into oligonucleotides and nucleic acids.
  • Nucleotide analogs can also include modifications of the sugar moiety. Modifications to the sugar moiety would include natural modifications of the ribose and deoxyribose as well as synthetic modifications. Sugar modifications include but are not limited to the following modifications at the 2' position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-0-alkyl, wherein the alkyl, alkenyl and alkynyl can be substituted or unsubstituted Cl to ClO, alkyl or C2 to ClO alkenyl and alkynyl.
  • 2' sugar modifications also include but are not limited to -O[(CH 2 )n O]m CH 3 , - O(CH 2 )n OCH 3 , -O(CH 2 )n NH 2 , -O(CH 2 )n CH 3 , -O(CH 2 )n -ONH 2 , and - O(CH 2 )nON[(CH 2 )n CH 3 )J 2 , where n and m are from 1 to about 10.
  • modifications at the 2' position include but are not limited to: Cl to ClO lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH 3 , OCN, Cl, Br, CN, CF 3 , OCF 3 , SOCH 3 , SO 2 CH 3 , ONO 2 , NO 2 , N 3 , NH 2 , heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties.
  • Similar modifications can also be made at other positions on the sugar, particularly the 3' position of the sugar on the 3 1 terminal nucleotide or in 2'-5' linked oligonucleotides and the 5' position of 5' terminal nucleotide.
  • Modified sugars would also include those that contain modifications at the bridging ring oxygen, such as CH 2 and S.
  • Nucleotide sugar analogs can also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar.
  • Nucleotide analogs can also be modified at the phosphate moiety.
  • Modified phosphate moieties include but are not limited to those that can be modified so that the linkage between two nucleotides contains a phosphorothioate, chiral phosphorothioate, phosphorodithioate, phosphotriester, aminoalkylphosphotriester, methyl and other alkyl phosphonates including 3'-alkylene phosphonate and chiral phosphonates, phosphinates, phosphoramidates including 3 '-amino phosphoramidate and aminoalkylphosphoramidateSj thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates. It is understood that these phosphate or modified phosphate linkages between two nucleotides can be through a phosphorothioate, chiral phosphorothioate, phosphorodithio
  • Various salts, mixed salts and free acid forms are also included.
  • nucleotides containing modified phosphates include but are not limited to, 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; and
  • nucleotide analogs need only contain a single modification, but can also contain multiple modifications within one of the moieties or between different moieties.
  • Nucleotide substitutes are molecules having similar functional properties to nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). Nucleotide substitutes are molecules that will recognize and hybridize to (base pair to) complementary nucleic acids in a Watson-Crick or Hoogsteen manner, but which are linked together through a moiety other than a phosphate moiety. Nucleotide substitutes are able to conform to a double helix type structure when interacting with the appropriate target nucleic acid.
  • PNA peptide nucleic acid
  • Nucleotide substitutes are nucleotides or nucleotide analogs that have had the phosphate moiety and/or sugar moieties replaced. Nucleotide substitutes do not contain a standard phosphorus atom. Substitutes for the phosphate can be for example, short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages.
  • morpholino linkages formed in part from the sugar portion of a nucleoside
  • siloxane backbones sulfide, sulfoxide and sulfone backbones
  • formacetyl and thioformacetyl backbones methylene formacetyl and thioformacetyl backbones
  • alkene containing backbones sulfamate backbones
  • sulfonate and sulfonamide backbones amide backbones; and others having mixed N, O, S and CH2 component parts.
  • phosphate replacements include but are not limited to 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which is herein incorporated by reference its entirety, and specifically for their description of phosphate replacements, their synthesis, their use, and their incorporation into nucleotides, oligonucleotides and nucleic acids.
  • PNA aminoethylglycine
  • United States patents 5,539,082; 5,714,331; and 5,719,262 teach how to make and use PNA molecules, each of which is herein incorporated by reference. (See also Nielsen et ah, Science 254:1497-1500 (1991)).
  • Oligonucleotides and nucleic acids can be comprised of nucleotides and can be made up of different types of nucleotides or the same type of nucleotides.
  • one or more of the nucleotides in an oligonucleotide can be ribonucleotides, 2'-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2'-O-methyl ribonucleotides; about 10% to about 50% of the nucleotides can be ribonucleotides, 2'-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2'-O-methyl ribonucleotides; about 50% or more of the nucleotides can be ribonucleotides, 2'-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2'-O-methyl ribonucleo
  • Solid supports are solid-state substrates or supports with which molecules (such as trigger molecules) and riboswitches (or other components used in, or produced by, the disclosed methods) can be associated.
  • Riboswitches and other molecules can be associated with solid supports directly or indirectly.
  • analytes e.g., trigger molecules, test compounds
  • capture agents e.g., compounds or molecules that bind an analyte
  • riboswitches can be bound to the surface of a solid support or associated with probes immobilized on solid supports.
  • An array is a solid support to which multiple riboswitches, probes or other molecules have been associated in an array, grid, or other organized pattern.
  • Solid-state substrates for use in solid supports can include any solid material with which components can be associated, directly or indirectly. This includes materials such as acrylamide, agarose, cellulose, nitrocellulose, glass, gold, polystyrene, polyethylene vinyl acetate, polypropylene, polymethacrylate, polyethylene, polyethylene oxide, polysilicates, polycarbonates, teflon, fluorocarbons, nylon, silicon rubber, polyanhydrides, polyglycolic acid, polylactic acid, polyorthoesters, functionalized silane, polypropylfumerate, collagen, glycosaminoglycans, and polyamino acids.
  • materials such as acrylamide, agarose, cellulose, nitrocellulose, glass, gold, polystyrene, polyethylene vinyl acetate, polypropylene, polymethacrylate, polyethylene, polyethylene oxide, polysilicates, polycarbonates, teflon, fluorocarbons, nylon, silicon rubber, polyanhydrides
  • Solid-state substrates can have any useful form including thin film, membrane, bottles, dishes, fibers, woven fibers, shaped polymers, particles, beads, microparticles, or a combination.
  • Solid-state substrates and solid supports can be porous or non-porous.
  • a chip is a rectangular or square small piece of material.
  • Preferred forms for solid-state substrates are thin films, beads, or chips.
  • a useful form for a solid-state substrate is a microliter dish. In some embodiments, a multiwell glass slide can be employed.
  • An array can include a plurality of riboswitches, trigger molecules, other molecules, compounds or probes immobilized at identified or predefined locations on the solid support.
  • Each predefined location on the solid support generally has one type of component (that is, all the components at that location are the same). Alternatively, multiple types of components can be immobilized in the same predefined location on a solid support. Each location will have multiple copies of the given components. The spatial separation of different components on the solid support allows separate detection and identification.
  • solid support be a single unit or structure.
  • a set of riboswitches, trigger molecules, other molecules, compounds and/or probes can be distributed over any number of solid supports.
  • each component can be immobilized in a separate reaction tube or container, or on separate beads or microparticles.
  • Oligonucleotides can be coupled to substrates using established coupling methods. For example, suitable attachment methods are described by Pease et al, Proc. Natl. Acad. Sd. USA 91(ll):5022-5026 (1994), and Khrapko et al., MoI Biol (Mosk) (USSR) 25:718-730 (1991).
  • a method for immobilization of 3'-amine oligonucleotides on casein-coated slides is described by Stimpson et al, Proc. Natl. Acad.
  • Each of the different predefined regions can be physically separated from each other of the different regions.
  • the distance between the different predefined regions of the solid support can be either fixed or variable.
  • each of the components can be arranged at fixed distances from each other, while components associated with beads will not be in a fixed spatial relationship.
  • the use of multiple solid support units (for example, multiple beads) will result in variable distances.
  • Components can be associated or immobilized on a solid support at any density. Components can be immobilized to the solid support at a density exceeding 400 different components per cubic centimeter. Arrays of components can have any number of components. For example, an array can have at least 1,000 different components immobilized on the solid support, at least 10,000 different components immobilized on the solid support, at least 100,000 different components immobilized on the solid support, or at least 1,000,000 different components immobilized on the solid support. M. Kits
  • kits for detecting compounds the kit comprising one or more biosensor riboswitches.
  • the kits also can contain reagents and labels for detecting activation of the riboswitches.
  • mixtures formed by performing or preparing to perform the disclosed method For example, disclosed are mixtures comprising riboswitches and trigger molecules.
  • the method involves mixing or bringing into contact compositions or components or reagents
  • performing the method creates a number of different mixtures. For example, if the method includes 3 mixing steps, after each one of these steps a unique mixture is formed if the steps are performed separately. In addition, a mixture is formed at the completion of all of the steps regardless of how the steps were performed.
  • the present disclosure contemplates these mixtures, obtained by the performance of the disclosed methods as well as mixtures containing any disclosed reagent, composition, or component, for example, disclosed herein.
  • Systems useful for performing, or aiding in the performance of, the disclosed method.
  • Systems generally comprise combinations of articles of manufacture such as structures, machines, devices, and the like, and compositions, compounds, materials, and the like. Such combinations that are disclosed or that are apparent from the disclosure are contemplated.
  • systems comprising biosensor riboswitches, a solid support and a signal-reading device.
  • Data structures used in, generated by, or generated from, the disclosed method.
  • Data structures generally are any form of data, information, and/or objects collected, organized, stored, and/or embodied in a composition or medium.
  • the disclosed method, or any part thereof or preparation therefor, can be controlled, managed, or otherwise assisted by computer control.
  • Such computer control can be accomplished by a computer controlled process or method, can use and/or generate data structures, and can use a computer program.
  • Such computer control, computer controlled processes, data structures, and computer programs are contemplated and should be understood to be disclosed herein.
  • riboswitch a compound or trigger molecule that can activate, deactivate or block the riboswitch.
  • Riboswitches function to control gene expression through the binding or removal of a trigger molecule.
  • Compounds can be used to activate, deactivate or block a riboswitch.
  • the trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch.
  • Compounds other than the trigger molecule generally can be used to deactivate or block a riboswitch.
  • Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch.
  • the disclosed method of deactivating a riboswitch can involve, for example, removing a trigger molecule (or other activating compound) from the presence or contact with the riboswitch.
  • a riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch.
  • Riboswitches function to control gene expression through the binding or removal of a trigger molecule.
  • subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA.
  • Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA.
  • Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.
  • RNA molecules or of a gene encoding an RNA molecule
  • methods for regulating expression of an RNA molecule, or of a gene encoding an RNA molecule by operably linking a riboswitch to the RNA molecule.
  • a riboswitch can be operably linked to an RNA molecule in any suitable manner, including, for example, by physically joining the riboswitch to the RNA molecule or by engineering nucleic acid encoding the RNA molecule to include and encode the riboswitch such that the RNA produced from the engineered nucleic acid has the riboswitch operably linked to the RNA molecule.
  • Subjecting a riboswitch operably linked to an RNA molecule of interest to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA. Also disclosed are methods for regulating expression of a naturally occurring gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene is essential for survival of a cell or organism that harbors it, activating, deactivating or blocking the riboswitch can result in death, stasis or debilitation of the cell or organism.
  • activating a naturally occurring riboswitch in a naturally occurring gene that is essential to survival of a microorganism can result in death of the microorganism (if activation of the riboswitch turns off or represses expression). This is one basis for the use of the disclosed compounds and methods for antimicrobial and antibiotic effects.
  • the gene or RNA can be engineered or can be recombinant in any manner.
  • the riboswitch and coding region of the RNA can be heterologous, the riboswitch can be recombinant or chimeric, or both. If the gene encodes a desired expression product, activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product.
  • riboswitch If the gene encodes an inducer or repressor of gene expression or of another cellular process, activation, deactivation or blocking of the riboswitch can result in induction, repression, or de-repression of other, regulated genes or cellular processes. Many such secondary regulatory effects are known and can be adapted for use with riboswitches.
  • An advantage of riboswitches as the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules.
  • the aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain.
  • Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain.
  • any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch.
  • Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule.
  • Activation of a riboswitch refers to the change in state of the riboswitch upon binding of a trigger molecule.
  • a riboswitch can be activated by compounds other than the trigger molecule and in ways other than binding of a trigger molecule.
  • the term trigger molecule is used herein to refer to molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch.
  • Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques).
  • Non-natural trigger molecules can be referred to as non-natural trigger molecules.
  • Also disclosed herein is a method of identifying a compound that interacts with a riboswitch comprising: modeling the atomic structure the riboswitch with a test compound; and determining if the test compound interacts with the riboswitch.
  • Determining if the test compound interacts with the riboswitch can be accomplished by, for example, determining a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch, as described elsewhere herein. Determining if the test compound interacts with the riboswitch can be accomplished by, for example, determining one or more predicted bonds, one or more predicted interactions, or a combination, of the test compound with the model of the riboswitch. The predicted interactions can be selected from the group consisting of, for example, van der Waals interactions, hydrogen bonds, electrostatic interactions, hydrophobic interactions, or a combination, as described above. In one example, the riboswitch is a guanine riboswitch.
  • Atomic contacts can be determined when interaction with the riboswitch is determined, thereby determining the interaction of the test compound with the riboswitch.
  • Analogs of the test compound can be identified, and it can be determined if the analogs of the test compound interact with the riboswitch.
  • a method of identifying a compound that interacts with a riboswitch comprising: identifying the crystal structure of the riboswitch, modeling the riboswitch with a test compound, and determining if the test compound interacts with the riboswitch.
  • Deactivation of a riboswitch refers to the change in state of the riboswitch when the trigger molecule is not bound.
  • a riboswitch can be deactivated by binding of compounds other than the trigger molecule and in ways other than removal of the trigger molecule.
  • Blocking of a riboswitch refers to a condition or state of the riboswitch where the presence of the trigger molecule does not activate the riboswitch.
  • compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner.
  • the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound.
  • the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch.
  • a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
  • assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.
  • identification of compounds that block a riboswitch can be accomplished in any suitable manner.
  • an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation of the riboswitch.
  • Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro.
  • biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA.
  • An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch.
  • Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
  • Biosensor riboswitches can be used to monitor changing conditions because riboswitch activation is reversible when the concentration of the trigger molecule falls and so the signal can vary as concentration of the trigger molecule varies.
  • the range of concentration of trigger molecules that can be detected can be varied by engineering riboswitches having different dissociation constants for the trigger molecule. This can easily be accomplished by, for example, "degrading" the sensitivity of a riboswitch having high affinity for the trigger molecule.
  • a range of concentrations can be monitored by using multiple biosensor riboswitches of different sensitivities in the same sensor or assay.
  • compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.
  • Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch.
  • a method of detecting a compound of interest comprising bringing into contact a sample and a riboswitch, wherein the riboswitch is activated by the compound of interest, wherein the riboswitch produces a signal when activated by the compound of interest, wherein the riboswitch produces a signal when the sample contains the compound of interest.
  • the riboswitch can change conformation when activated by the compound of interest, wherein the change in conformation produces a signal via a conformation dependent label.
  • the riboswitch can change conformation when activated by the compound of interest, wherein the change in conformation causes a change in expression of an RNA linked to the riboswitch, wherein the change in expression produces a signal.
  • the signal can be produced by a reporter protein expressed from the RNA linked to the riboswitch.
  • a method comprising (a) testing a compound for inhibition of gene expression of a gene encoding an RNA comprising a riboswitch, wherein the inhibition is via the riboswitch, and (b) inhibiting gene expression by bringing into contact a cell and a compound that inhibited gene expression in step (a), wherein the cell comprises a gene encoding an RNA comprising a riboswitch, wherein the compound inhibits expression of the gene by binding to the riboswitch.
  • Also disclosed is a method of identifying riboswitches comprising assessing in-line spontaneous cleavage of an RNA molecule in the presence and absence of a compound, wherein the RNA molecule is encoded by a gene regulated by the compound, wherein a change in the pattern of in-line spontaneous cleavage of the RNA molecule indicates a riboswitch.
  • Riboswitches are a new class of structured RNAs that have evolved for the purpose of binding small organic molecules.
  • the natural binding pocket of riboswitches can be targeted with metabolite analogs or by compounds that mimic the shape-space of the natural metabolite.
  • Riboswitches are: (1) found in numerous Gram-positive and Gram-negative bacteria including Bacillus anthracis, (2) fundamental regulators of gene expression in these bacteria, (3) present in multiple copies that would be unlikely to evolve simultaneous resistance, and (4) not yet proven to exist in humans. This combination of features make riboswitches attractive targets for new antimicrobial compounds. Further, the small molecule ligands of riboswitches provide useful sites for derivitization to produce drug candidates. Distribution of some riboswitches is shown in Table 1 of U.S. Application Publication No. 2005-0053951.
  • candidate molecules can be identified.
  • Anti-riboswitch drugs represent a mode of anti-bacterial action that is of considerable interest for the following reasons. Riboswitches control the expression of genes that are critical for fundamental metabolic processes. Therefore manipulation of these gene control elements with drugs yields new antibiotics. Riboswitches also carry RNA structures that have evolved to selectively bind metabolites, and therefore these RNA receptors make good drug targets as do protein enzymes and receptors. Furthermore, it has been shown that two antimicrobial compounds (discussed above) kill bacteria by deactivating the antibiotics resistance to emerge through mutation of the RNA target. There are at least 11 classes of well- conserved riboswitches in many bacteria, providing numerous drug targets.
  • riboswitches Many organisms from both Gram-positive and Gram-negative lineages have numerous classes of riboswitches (see U.S. Application Publication No. 2005-0053951). Major disease-causing organisms such as Staphylococcus and major bioterror-related organisms such as Bacillus anthracis are both rich with riboswitches.
  • the atomic-resolution structure model for a guanine riboswitch has been elucidated, which enables the use of structure-based design methods for creating riboswitch-binding compounds. Specifically, the model for the binding site of the guanine riboswitch shows that two channels are present that would permit ligand modification (Figure 12A).
  • guanine analogs have been generated with chemical modifications at either the N2 or the 06 positions of guanine, and nearly all tested so far bind to the riboswitch with sub-nanomolar dissociation constants.
  • Figure 12B depicts the structures of guanine analogs synthesized with modified N2 and 06 positions. Most of these compounds take advantage of the molecular recognition "blind spots" in the binding site model or the aptamer domain form a B. subtilis guanine riboswitch.
  • the successful compounds can be used as a scaffold upon which further chemical variation can be introduced to create non-toxic, bioavailable, high affinity, anti-riboswitch compounds.
  • SAM analogs that substitute the reactive methyl and sulfonium ion center with stable sulfur-based linkages are recognized with adequate affinity (low to mid-nanomolar range) by the riboswitch to serve as a platform for synthesis of additional SAM analogs.
  • YBD-2 and YBD3 stable sulfur-based linkages
  • N- and C-based linkages can be synthesized and tested to provide the optimal platform upon which to make amino acid and nucleoside derivations.
  • Sulfoxide and sulfone derivatives of SAM can be used to generate analogs.
  • SAM analog lead compounds must enter bacterial cells and bind riboswitches while remaining metabolically inert.
  • useful SAM analogs must be bound tightly by the riboswitch, but must also fail to compete for SAM in the active sites of protein enzymes, or there is a risk of generating an undesirable toxic effect in the patient's cells.
  • compounds can be tested for their ability to disrupt B. subtilis growth, but fail to affect E. coli cultures (which use SAM but lack SAM riboswitches).
  • parallel bacterial cultures can be grown as follows: ⁇ . B. subtilis can be cultured in glucose minimal media in the absence of exogenously supplied SAM analogs.
  • B. subtilis can be cultured in glucose minimal media in the presence of exogenously supplied SAM analogs (high doses can be selected, to be followed by repeated experiments designed to test a concentration range of the putative drug compound).
  • E. coli can be cultured in glucose minimal media in the presence of exogenously supplied SAM analogs (high doses will be selected, to be followed by repeated experiments designed to test a concentration range of the putative drug compound). Fitness of the various cultures can be compared by measurement of cellular doubling times. A range of concentrations for the drug compounds can be tested using cultures grown in microtiter plates and analyzed using a microplate reader from another laboratory. Culture 1 is expected to grow well. Drugs that inhibit culture 2 may or may not inhibit growth of culture 3. Drugs that similarly inhibit both culture 2 and culture 3 upon exposure to a wide range of drug concentrations can reflect general toxicity induced by the exogenous compound ⁇ i.e., inhibition of many different cellular processes, in addition or in place of riboswitch inhibition).
  • a high-throughput screen can be created by one or two different methods using nucleic acid engineering principles. Adaptation of both fluorescent sensor designs outlined below to formats that are compatible with high-throughput screening assays can be accommodated by using immobilization methods or solution-based methods.
  • One way to create a reporter is to add a third function to the riboswitch by adding a domain that catalyzes the release of a fluorescent tag upon SAM binding to the riboswitch domain.
  • this catalytic domain can be linked to the yitJ SAM riboswitch through a communication module that relays the ligand binding event by allowing the correct folding of the catalytic domain for generating the fluorescent signal. This can be accomplished as outlined below.
  • SAM RiboReporter Pool Design A DNA template for in vitro transcription to
  • RNA was constructed by PCR amplification using the appropriate DNA template and primer sequences.
  • stem II of the hammerhead (stem Pl of the SAM aptamer) has been randomized to present more than 250 million possible sequence combinations, wherein some inevitably will permit function of the ribozyme only when the aptamer is occupied by SAM or a related high-affinity analog.
  • stem II of the hammerhead (stem Pl of the SAM aptamer) has been randomized to present more than 250 million possible sequence combinations, wherein some inevitably will permit function of the ribozyme only when the aptamer is occupied by SAM or a related high-affinity analog.
  • Each molecule in the population of constructs is identical in sequence except at the random domain where multiple copies of every possible combination of sequence will be represented in the population.
  • the in vitro selection protocol can be a repetitive iteration of the following steps:
  • RNA in vitro by standard methods. Include [ ⁇ - 32 P] UTP to incorporate radioactivity throughout the RNA.
  • RNA Reverse transcribe RNA to DNA.
  • PCR amplify DNA with primers that reintroduced cleaved portion of RNA.
  • the concentration of SAM in step 4 can be 100 ⁇ M initially and can be reduced as the selection proceeds.
  • the progress of recovering successful communication modules can be assessed by the amount of cleavage observed on the purification gel in step 6.
  • the selection endpoint can be either when the population approaches 100% cleavage in 10 nM SAM (conditions for maximal activity of the parental ribozyme and riboswitch) or when the population approaches a plateau in activity that does not improve over multiple rounds.
  • the end population can then be sequenced. Individual communication module clones can be assayed for generation of a fluorescent signal in the screening construct in the presence of SAM.
  • a fluorescent signal can also be generated by riboswitch-mediated triggering of a molecular beacon.
  • riboswitch conformational changes cause a folded molecular beacon tagged with both a fluor and a quencher to unfold and force the fluor away from the quencher by forming a helix with the riboswitch.
  • This mechanism is easy to adapt to existing riboswitches, as this method can take advantage of the ligand- mediated formation of terminator and anti-terminator stems that are involved in transcription control.
  • the appropriate construct must be determined empirically.
  • the optimum length and nucleotide composition of the molecular beacon and its binding site on the riboswitch can be tested systematically to result in the highest signal-to-noise ratio.
  • the validity of the assay can be determined by comparing apparent relative binding affinities of different SAM analogs to a molecular beacon-coupled riboswitch (determined by rate of fluorescent signal generation) to the binding constants determined by standard in-line probing.
  • atomic structure of a natural guanine-responsive riboswitch comprising an atomic structure as depicted in Figure 6.
  • the atomic coordinates of the atomic structure can comprise the atomic coordinates listed in Table 6 for atoms depicted in Figure 6.
  • the atomic coordinates of the atomic structure can comprise the atomic coordinates listed in Table 6.
  • Also disclosed is a method of identifying a compound that interacts with a riboswitch comprising: (a) modeling the atomic structure of claim 1 with a test compound; and (b) determining if the test compound interacts with the riboswitch. Also disclosed is a method of identifying compounds that interact with a riboswitch comprising contacting the riboswitch with a test compound, wherein a fluorescent signal is generated upon interaction of the riboswitch with the test compound.
  • Also disclosed is a method of killing bacteria comprising contacting the bacteria with an analog of a compound that interacts with the riboswitch.
  • Also disclosed is a method of identifying a compound that interacts with a riboswitch comprising: (a) identifying the crystal structure of the riboswitch; (b) modeling the riboswitch with a test compound; and (c) determining if the test compound interacts with the riboswitch.
  • a regulatable gene expression construct comprising a nucleic acid molecule encoding an RNA comprising a riboswitch operably linked to a coding region, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch regulates expression of the RNA, wherein the riboswitch and coding region are heterologous.
  • Also disclosed is a method of detecting a compound of interest comprising bringing into contact a sample and a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch is activated by the compound of interest, wherein the riboswitch produces a signal when activated by the compound of interest, wherein the riboswitch produces a signal when the sample contains the compound of interest.
  • Also disclosed is a method comprising (a) testing a compound for inhibition of gene expression of a gene encoding an RNA comprising a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the inhibition is via the riboswitch, and (b) inhibiting gene expression by bringing into contact a cell and a compound that inhibited gene expression in step (a), wherein the cell comprises a gene encoding an RNA comprising the riboswitch, wherein the compound inhibits expression of the gene by binding to the riboswitch.
  • Determining if the test compound interacts with the riboswitch can comprise determining a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch. Determining if the test compound interacts with the riboswitch can comprise determining one or more predicted bonds, one or more predicted interactions, or a combination, of the test compound with the model of the riboswitch.
  • the riboswitch can be a guanine riboswitch.
  • the guanine riboswitch can be a riboswitch in Table 5. Atomic contacts can be determined by modeling the riboswitch with a test compound, thereby determining the interaction of the test compound with the riboswitch.
  • the method can further comprise the steps of: (c) identifying analogs of the test compound; (d) determining if the analogs of the test compound interact with the riboswitch.
  • the compound can be hypoxanthine.
  • a gel-based assay can be used to determine if the test compound interacts with the riboswitch.
  • a chip-based assay can be used to determine if the test compound interacts with the riboswitch.
  • the test compound can interact via van der Waals interactions, hydrogen bonds, electrostatic interactions, hydrophobic interactions, or a combination.
  • the riboswitch can comprise an RNA cleaving ribozyme.
  • a fluorescent signal can be generated when a nucleic acid comprising a quenching moiety is cleaved.
  • Molecular beacon technology can be employed to generate the fluorescent signal. The method can be carried out using a high throughput screen.
  • the riboswitch can be a guanine riboswitch.
  • the guanine riboswitch can be a riboswitch in Table 5.
  • the riboswitch can be activated by a trigger molecule, wherein the riboswitch produces a signal when activated by the trigger molecule.
  • the riboswitch can change conformation when activated by a compound of interest, wherein the change in conformation produces a signal via a conformation dependent label.
  • the riboswitch can change conformation when activated by a compound of interest, wherein the change in conformation causes a change in expression of an RNA linked to the riboswitch, wherein the change in expression produces a signal.
  • the signal can be produced by a reporter protein expressed from the RNA linked to the riboswitch.
  • Example 1 Structure of a Natural Guanine-Responsive Riboswitch Complexed with the Metabolite Hypoxanthine
  • the structure of the guanine- binding domain bound to hypoxanthine has been solved by X-ray crystallography (Table 2 and Figure 12A), a biologically relevant ligand of the guanine-responsive riboswitch.
  • the RNA adopts a three-dimensional fold in which the terminal loops (L2 and L3) form a series of interconnecting hydrogen bonds (see pairing scheme in Figure 5) to bring the P2 and P3 helices parallel to each other ( Figure 5C).
  • the purine-binding pocket is created by conserved nucleotides in and around the three-way junction element. These nucleotides help to define the purine-binding pocket through the formation of two sets of base triples above and below ( Figure 5A).
  • the 3' side of the pocket is flanked by a water-mediated U22— A52A73 base triple and an A23G46-C53 triple; in both cases, the Watson-Crick face of the adenosine interacts with the minor groove of a Watson-Crick pair ( Figure 6A).
  • the other side is created by sequential base triples between conserved Watson— Crick pairs at the top of helix Pl (U20-A76 and A21-U75) and the Watson-Crick faces of U49 and C50, respectively, which fasten the J2/3 loop to the Pl helix.
  • This extensive use of base triples to create a ligand-binding site is very similar to in vitro selected RNA aptamers that recognize planar ring systems, as exemplified by the structures of theophylline (Zimmermann, G. R., Jenison, R. D., Wick, C. L., Sirnmorre, J.-P. & Pardi, A.
  • RNAs Interlocking structural motifs mediate molecular discrimination by a theophylline-binding RNA. Nature Struct. Biol. 4, 644-649 (1997)), FMN (Fan, P., Suri, A. K., Fiala, R., Live, D. & Patel, D. J. Molecular recognition in the FMN-RNA aptamer complex. J. MoI. Biol. 258, 480-500 (1996)) and malachite green (Baugh, C, Grate, D. & Wilson, C. 2.8 A crystal structure of the malachite green aptamer. J. MoI. Biol. 301, 117-128 (2000)) binders.
  • artificially selected RNAs use some of the same principles for creating binding sites for small-molecule ligands as their naturally occurring counterpart.
  • hypoxanthine is bound through an extensive series of hydrogen bonds with nucleotides U22, U47, U51 and C74 (Figure 6B), forming a base quadruple that stacks directly on the Pl helix.
  • the structure clearly shows that the mRNA contacts all of the functional groups in the ligand, thereby explaining the specificity for hypoxanthine observed in biochemical studies (Mandal, M. & Breaker, R. R. Gene regulation by riboswitches. Nature Rev. MoI. Cell. Biol. 5, 451-463 (2004)).
  • guanine binding can be readily rationalized, because there is room in the structure to accommodate an exocyclic amine at the 2-position of the bound purine.
  • This additional functional group can form hydrogen bonds with the carbonyl oxygens at the 2-position of C74 and U51, consistent with the tenfold higher affinity of this riboswitch for guanine over hypoxanthine.
  • One of the most marked features is how the ligand is almost completely enveloped by the RNA (Figure 6C): 97.8% of the surface of hypoxanthine is inaccessible to bulk solvent in the complex.
  • the almost complete use of a ligand for recognition by an RNA is unprecedented among structurally characterized aptamers, although selection strategies that do not involve immobilization of the ligand on a solid support (Koizumi, M., Soukup, G. A., Kerr, J. N. & Breaker, R. R.
  • the tertiary architecture is stabilized through a unique loop— loop interaction capping helices P2 and P3 that is defined by two previously unobserved types of base quadruple.
  • Each quadruple comprises a Watson-Crick pair with a noncanonical pair docked into its minor groove (G38-C60 and G37-C61 interacting with the A33A66 and U34A65 pairs, respectively; Figure 7A).
  • This arrangement bears a strong similarity to how adenosines pack into an A-form helix in the commonly found type I/II A-minor triple motif (Doherty, E. A., Batey, R. T., Masquida, B. & Doudna, J. A.
  • the guanine riboswitch In high Mg2+ ion concentrations (20 mM MgC12), the guanine riboswitch has a very high affinity for guanine and hypoxanthine (an observed dissociation constant (Kd) of 5 nM and 50 nM, respectively). In Escherichia coli, however, repression of transcription of the xpt-pbuX ⁇ QXG ⁇ by the purR repressor occurs in response to 1-10 ⁇ M concentrations of purine. To determine whether the riboswitch responds to similar concentrations of purine, which probably reflect physiological levels, its affinity for hypoxanthine was determined by isothermal titration calorimetry at varying ionic conditions (Table 1).
  • RNA- and protein- based regulatory mechanisms seem to be tuned to respond to similar concentrations of intracellular metabolite.
  • TWs reaction contained 2 mM Na ⁇ -EDTA.
  • RNA directly transduces intracellular metabolite concentration into changes in gene expression through a proposed Rho- independent transcriptional regulation mechanism (Mandal, M., Boese, B., Barrick, J. E., Winkler, W. C. & Breaker, R. R. Riboswitches control fundamental biochemical pathways in Bacillus subtilis and other bacteria. Cell 113, 577-586 (2003); Johansen, L. E., Nygaard, P., Lassen, C, Agerso, Y. & Saxild, H. H.
  • the nucleobase binds the pocket, stabilizing the short Pl helix through stacking interactions and base triples with J2/3 and preventing incorporation of Pl nucleotides into an antiterminator element.
  • the mRNA can then form a classic Rho-independent terminator stem-loop, and transcription stops.
  • the 3' side of the isolated Pl helix is readily conscripted to form a stable antiterminator element, facilitating continued transcription.
  • RNA was synthesized and purified by a native affinity-tag purification method (Kieft, J. S. & Batey, R. T. A general method for rapid and nondenaturing purification of RNAs. RNA 10, 988-995 (2004)) and exchanged it into a buffer containing 10 mM K+-HEPES (pH 7.5) and 1 mM hypoxanthine. Crystals were grown by mixing this solution in a 1:1 ratio with mother liquor (containing 25% PEG 3,000 (w/v), 200 mM ammonium acetate and 10 mM cobalt hexamine) and incubating it for 2-3 weeks at room temperature.
  • RNA synthesis and purification A 68 nucleotide construct containing the sequence for the guanine riboswitch of the pbuX-xpt operon of B. subtilis was constructed using overlapping DNA oligonucleotides (Integrated DNA Technologies) and standard PCR methods. The resulting DNA fragment, which contained EcoRI and NgoMDS restriction sites at the 5' and 3' ends, respectively, was ligated into pRAV12 (Kieft, J. S. & Batey, R. T. A general method for rapid and nondenaturing purification of RNAs. RNA 10, 988-995 (2004)), a plasmid vector designed for the native purification of RNA; the resulting vector was sequence verified.
  • DNA template for in vitro transcription was generated by PCR from the resulting vector using primers directed against the T7 RNA polymerase promoter at the 5' end and the 3' side of the purification affinity tag (5', GCGCGCGAATTCTAATACGACTCACTATAG (S ⁇ Q ID NO: 10; 3', GGATCCTGCCCAGGGCTG; S ⁇ Q ID NO: 11).
  • RNA was transcribed in a 12.5 mL reaction containing 30 niM Tris-HCl (pH 8.0), 10 mM DTT, 0.1 % Triton X-100, 0.1 rnM spermidine-HCl, 8 mM each NTP, 40 mM MgCl 2 , 50 ⁇ g/mL T7 RNA polymerase, and 1 mL of ⁇ 0.5 ⁇ M template (Doudna, J. A. Preparation of homogeneous ribozyme RNA for crystallization. Methods MoI Biol 74, 365-70 (1997)), supplemented with 1 unit/mL inorganic pyrophosphatase to suppress formation of insoluble magnesium pyrophosphate.
  • RNA crystallization Crystals of the guanine riboswitch were obtained using the hanging drop method in which the RNA solution was mixed in a 1 : 1 ratio with a reservoir solution containing 10 mM cobalt hexammine, 200 mM ammonium acetate and 25 % PEG 2K. The crystallization trays were incubated at room temperature (23 0 C), with crystals obtaining their maximum size (0.05 x 0.05 x 0.2 mm) in 7-14 days. Cryoprotection of the crystals was performed by adding 30 ⁇ L of a solution comprising the mother liquor plus 25 % (v/v) 2-methyl-2,4 pentanediol (MPD) for five minutes and flash-frozen in liquid nitrogen. Diffraction data was collected on a home X-ray source (Rigaku MSC) using CuKa radiation; collection of anomalous data was achieved by an inverse-beam experiment. The data was indexed, integrated and scaled with
  • RNA for isothermal titration calorimetry (Leavitt, S. & Freire, E. Direct measurement of protein binding energetics by isothermal titration calorimetry. Curr Opin Struct Biol 11, 560-6 (2001); Pierce, M., Raman, C. S. & Nail, B. T. Isothermal titration calorimetry of protein-protein interactions.
  • Methods 19, 213-21 (1999)) was transcribed and purified as described above and exhaustively dialyzed against buffer containing 10 mM K + -HEPES, pH 7.5, 100 mM KCl and varying concentrations OfMgCl 2 at 4 °C for 24-48 hours. Following dialysis, the buffer was used to prepare a solution of hypoxanthine at a concentration that was approximately 10-fold higher than the RNA (typically about 120 DM and 12 DM, respectively). All experiments were performed with a Microcal MCS ITC instrument at 30 0 C.
  • RNA and hypoxanthine solutions Following degassing of the RNA and hypoxanthine solutions, a titration of 29 injections of 10 ⁇ L of hypoxanthine into the RNA sample was performed, such that a final molar ratio of between 2:1 and 3:1 hypoxanthine:RNA was achieved (Recht, M. I. & Williamson, J. R. Central domain assembly: thermodynamics and kinetics of S6 and Sl 8 binding to an S15-RNA complex. J MoI Biol 313, 35-48 (2001)). Titration data was analyzed using Origin ITC software (Microcal Software Inc.) and fit to a single-site binding model.
  • Atomic-resolution models were generated for both a guanine riboswitch aptamer and a related aptamer for adenine (Serganov A, Yuan YR, Pikovskaya O, Polonskaia A, Malinina L, Phan AT, Hobartner C, Micura R, Breaker RR, Patel DJ. (2004) Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mRNAs. Chem. Biol. 11:1729-1741). These models are essentially identical to that proposed by Batey et al. (Batey, R.T., Gilbert, S.D., Montange, R.K.
  • a guanine riboswitch has been fused to Green Fluorescent Protein (GFP), such that high concentrations of guanine or a riboswitch-binding guanine analog inhibit the expression of GFP ( Figure 14).
  • GFP Green Fluorescent Protein
  • Figure 14 a decreased level of GFP fluorescence indicates binding to and modulation of the guanine riboswitch inside bacteria.
  • a similar system has been constructed with a guanine riboswitch upstream of /3-galactosidase. Again, a decreased level of /3-galactosidase activity indicates that a compound represses the guanine riboswitch in vivo.
  • Table 3 summarizes the result of these assays for each of the guanine analogs.
  • a correlation between compounds that kill cells and those that control gene expression has also been established. Similar systems can be used involving any suitable reporter gene, such as genes that encode a suitable reporter protein or a suitable reporter RNA (such as a ribozyme).
  • G-014 is bactericidal against Bacillus subtilis. In a direct comparison, G-014 reduces the number of viable colony forming units ("CFU") at a rate equal to carbenicillin ( Figure 15).
  • G-013 ⁇ 1 >100 ND may repress 1.0
  • G-016 ⁇ 1 inhibits ND Represses 0.65
  • G-022 ND >100 ND ND ND
  • G-023 ND inhibits ND ND ND
  • G-025 ND inhibits ND ND 0.25
  • SA Staphylococcus aureus
  • MRSA methicillin-resistant Staphylococcus aureus
  • EF Enterococcus faecalis
  • SP Streptococcus pneumoniae
  • HI Haemophilus influenza
  • MS Mycobacterium smegmatis. 2
  • mesenteroides ATCC 8293 # GS NC_006055.1/397027-397123 organism Mesoplasma florum Ll
  • Bacillus; cereus group; Bacillus; cereus # GS NC 003997.3/1497571-1497673 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_005957.1/1521880-1521982 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; thuringiensis # GS NC_006274.1/1532482-1532584 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus,-
  • Bacillus; cereus group; Bacillus; cereus # GS NC_007530.2/1497694-1497796 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAAC02000001.1/1985987-1986089 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEK01000008.1/96466-96364 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NZ-AAENO1000011.1/67143-67245 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEO01000025.1/66855-66957 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEP01000035.1/69365-69467 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEQ01000029.1/70848-70950 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAER01000023.1/58182-58284 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAES01000034.1/69121-69223 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_004722.1/1515239-1515341 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NC_000964.2/697711-697813 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; subtilis
  • Clostridiaceae Clostridium; acetobutylicum
  • Bacillus; cereus group; Bacillus; cereus # GS NC_003997.3/262601-262703 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC 004722.1/261558-261660 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_005957.1/268537-268639 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; thuringiensis # GS NC_006274.1/267835-267937 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group,- Bacillus; cereus # GS NC_007530.2/262601-262703 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAAC02000001.1/796036-796138 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEK01000017.1/88397-88499 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NZ_AAEN01000023.1/16980-17082 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS WZ_AAEO01000030.1/17090-17192 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEP01000043.1/12962-13064 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEQ01000040.1/4962-4860 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAER01000030.1/4374-4272 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAES01000040.1/14760-14862 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_005945.1/260654-260756 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_007530.2/260641-260743 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEK01000017.1/86437-86539 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NZ_AAEN01000023.1/15020-15122 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEO01000030.1/15130-15232 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEP01000043.1/11002-11104 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEQ01000040.1/6922-6820 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAER01000030.1/6334-6232 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAES01000040.1/12800-12902 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NC_003997.3/342356-342254 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_005945.1/342369-342267 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_005957.1/356354-356252 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; thuringiensis # GS NC_006274.1/357462-357360 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NC_007530.2/342356-342254 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAAC02000001.1/859268-859166 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEK01000051.1/8150-8252 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NZ_AAEN01000023.1/83441-83339 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEO01000030.1/96886-96784 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEP01000046.1/48810-48708 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEQ01000034.1/49483-49381 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAER01000042.1/191339-191441 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAES01000043.1/48479-48377 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAAC02000001.1/794076-794178 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_003030.1/1002176-1002275 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
  • Bacillus; cereus group; Bacillus; cereus # GS NC 003997.3/295331-295433 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NC_005957.1/309524-309626 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; thuringiensis # GS NC_006274.1/309094-309196 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NC_007530.2/295331-295433 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAAC02000001.1/812243-812345 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEK01000064.1/23153-23051 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NZ_AAEN01000023.1/36414-36516 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEO01000030.1/49824-49926 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEP01000046.1/1785-1887 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAEQ01000034.1/2457-2559 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAER01000042.1/238364-238262 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; anthracis # GS NZ_AAES01000043.1/1489-1591 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
  • Bacillus; cereus group; Bacillus; cereus # GS NZ_AAAW03000042.1/15038-14936 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
  • Thermoanaerobacteriaceae; Moorella group; Moorella,- thermoacetica # GS NC_002570.2/648442-648544 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; halodurans
  • Vibrionaceae; Vibrio; parahaemolyticus # GS NZ_AAAW03000004.1/66862-66959 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
  • thermoacetica # GS NC_004668.1/2288426-2288328 taxonomy Bacteria; Firmicutes; Lactobacillales; Enterococcaceae;
  • thermophilus # GS NZ_AAGS01000026.1/9921-10016 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
  • thermophilus # GS NZ_AAGQ01000089.1/170-269 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
  • thermophilus # GS HC_004567.1/2410478-2410577 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
  • Bdellovibrionales; Bdellovibrionaceae; Bdellovibrio; bacteriovorus # GS NZ_AADT03000002.1/98293-98192 taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales;
  • AU.AUGGCUCGGGA.GUCUCUACCGAACAACC..GUAAAUUGUUC.G.ACUAUGAGUGAAAGU.GUACCUAGGG NC_002570.2/1593074-1592972 SEQ ID NO:13 AUUUACAUUAAAAAA.AG.CACUCGUAUAAUCGCGGGA.
  • AU.AGGGCCUGCGA.GUUUCUACCAAGCUACC..GUAAAUAGCUU.G.ACUACGAAAAUAAUG.GGUUUUUAC NC_006270.2/2294931-2294831 SEQ ID NO:19 AAUUUGAUACAUUAU.AU.CACUCAUAUAAUCGCGUGG, AU.AUGGCACGCAA.GUUUCUACCGGGCA-CC..GUAAA-UGUCC.G.ACUAUGAGUGGGCGA.UAAGAAAACG NC_006322.1/2295788-2295688 SEQ ID NO:20 AAUUUGAUACAUUAU.AU.CACUCAUAUAAUCGCGUGG.
  • G.ACUACGAGGCGUUUU,UAUAAAGGUG NZ_AAAC02000001.1/796036-796138 SEQ ID NO:48 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG, AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU.
  • G.ACUACGAGGCGUUUU UAUAAAGGUG NZ_AAEK01000017.1/88397-88499 SEQ ID NO:49
  • GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU G.ACUACGAGGCGUUU,UAUAAAGGUG NZ_AAEN01000023.1/16980-17082
  • SEQ ID NO:50 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.
  • GAAUAGAUUU NC_006322.1/4024324-4024426 SEQ ID NO:100 AAAUAAUAGAAGCCC.AC.UUCUUGUAUAUAAUCAGUA. AU.AGGGUCUGAUU.GUUUCUACCUGGCAACC..GUAAAUCGCCA.G.ACUACAAGGAAGUUU GAAUAGAU ⁇ NC_004193.1/786767-786868 SEQ ID NO:101
  • NZ_AAEN01000023.1/83441-83339 SEQ ID NO:112 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA. GUUUCUACCCGGCAACC.
  • NZ_AAEP01000046.1/1785-1887 SEQ ID NO:131 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA. GUCUCUACCUAACGACC. .GUUAUUCGUUA, G.ACUAUGAGGGAAAGU, CACUCGGUAU NZ_AAEQ01000034.1/2457-2559 SEQ ID NO:132 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA. G.ACUAUGAGGGAAAGU.
  • CACUCGGUAU N2_AAER01000042.1/238364-238262 SEQ ID NO.-133 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA. G.ACUAUGAGGGAAAGU, CACUCGGUAU N2_AAES01000043.1/1489-1591 SEQ ID NO:134 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA. G.ACUAUGAGGGAAAGU. CACUCGGUAU NC_004193.1/769686-769787 SEQ ID NO:135
  • GAUGUUCGUC NC_006274.1/3685852-3685750 SEQ ID NO:143 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA. G.ACUAUGAGGGAAAAA.
  • NC_005945.1/3605993-3605891 SEQ ID NO:150 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA. GUUUCUACCUGGCAACC. .AUGAAUUGCUA.G, ACUAUGAGGGGAAAA.GUGUGUAACA NC_005957.1/3626969-3626867 SEQ ID NO:151 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.
  • NZ_AAGY01000085.1/5640-5736 SEQ ID NO:169 AAAAUUGAAUAUCGU.UU.UACUUGUUUAU-GUCGUGA. AU. -UGGCACGAC-. GUUUCUACAAGGUG-CC..GG-AA-CACCU. A.ACAAUAAGUAAGUCA. GCAGUGAGAU NZ-AAEKOlO00001.1/183902-184006 SEQ ID NO:170 AUAAUUUUACACAUU.AU.CACUCGUAUAUACUCGGUA.
  • AU.AUGGUCCGAGC.GUUUCUACCUAGUUCCCaaUGAAAGAACUG.G.ACUACGGGUUAAAGU.AUUCGGUCGC NC_003454.1/1645820-1645721 SEQ ID NO:171 UAAAUAAUUUUAAUA.AA.AAUUCGUAUAA-GCCUAAU.
  • NC_005363.1/3414604-3414703 SEQ ID NO:188 AAAUAACUUCAUAGUgUU.UCCCCGUAUAU-GUUGCGA.AU.AGGGCGCAGC-. GUUUCUACCAGGCA-CC UCAAA-UGCCU.G. ACUAUGGAGGUUCUU. UGUGAAGUUG NZ_AADT03000021.1/9410-9513 SEQ ID NO:189 AAAAUAAAUAUGGCA.AU.GGCCUGUAUAAUUGGGGGA.AU.AGGGCUCCCAA.GUUUCUACCGGGCAACC.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Wood Science & Technology (AREA)
  • Biochemistry (AREA)
  • Biotechnology (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Plant Pathology (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Medicinal Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Animal Behavior & Ethology (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • General Chemical & Material Sciences (AREA)
  • Oncology (AREA)
  • Communicable Diseases (AREA)
  • Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

Riboswitches and modified versions of riboswitches can be employed as designer genetic switches that are controlled by specific effector compounds. Such effector compounds that activate a riboswitch are referred to herein as trigger molecules. The natural switches are targets for antibiotics and other small molecule therapies. In addition, the architecture of riboswitches allows actual pieces of the natural switches to be used to construct new non-immunogenic genetic control elements, for example the aptamer (molecular recognition) domain can be swapped with other non-natural aptamers (or otherwise modified) such that the new recognition domain causes genetic modulation with user-defined effector compounds. The changed switches become part of a therapy regimen-turning on, or off, or regulating protein synthesis. Newly constructed genetic regulation networks can be applied in such areas as living biosensors, metabolic engineering of organisms, and in advanced forms of gene therapy treatments. Compounds can be used to stimulate, active, inhibit and/or inactivate riboswitches. Atomic structures of riboswitches can be used to design new compounds to stimulate, active, inhibit and/or inactivate riboswitches.

Description

RIBOSWITCHES, STRUCTURE-BASED COMPOUND DESIGN WITH RIBOSWITCHES, AND METHODS AND COMPOSITIONS FOR USE OF AND
WITH RIBOSWITCHES CROSS-REFERENCE TO RELATED APPLICATIONS This application claims benefit of U.S. Provisional Application No. 60/625,864, filed November 8, 2004. U.S. Provisional Application No. 60/625,864, filed November 8, 2004, is hereby incorporated herein by reference in its entirety.
STATEMENTREGARDINGFEDERALLYSPONSOREDRESEARCH
This invention was made with government support under Grant NIH GM068819- 2 awarded by the National institutes of Health, and Grant DARPA W911NF-04-1-0416 awarded by the Defense Advanced Research Projects Agency. The government has certain rights in the invention.
FIELD OF THE INVENTION
The disclosed invention is generally in the field of gene expression and specifically in the area of regulation of gene expression.
BACKGROUND OF THE INVENTION
Precision genetic control is an essential feature of living systems, as cells must respond to a multitude of biochemical signals and environmental cues by varying genetic expression patterns. Most known mechanisms of genetic control involve the use of protein factors that sense chemical or physical stimuli and then modulate gene expression by selectively interacting with the relevant DNA or messenger RNA sequence. Proteins can adopt complex shapes and carry out a variety of functions that permit living systems to sense accurately their chemical and physical environments. Protein factors that respond to metabolites typically act by binding DNA to modulate transcription initiation (e.g. the lac repressor protein; Matthews, K.S., and Nichols, J.C., 1998, Prog. Nucleic Acids Res. MoI. Biol. 58, 127-164) or by binding RNA to control either transcription termination (e.g. the PyrR protein; Switzer, R.L., et al., 1999, Prog. Nucleic Acids Res. MoI. Biol. 62, 329-367) or translation (e.g. the TRAP protein; Babitzke, P., and Gollnick, P., 2001, J. Bacteriol. 183, 5795-5802). Protein factors respond to environmental stimuli by various mechanisms such as allosteric modulation or post-translational modification, and are adept at exploiting these mechanisms to serve as highly responsive genetic switches (e.g. see Ptashne, M., and Gann, A. (2002). Genes and Signals. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY). In addition to the widespread participation of protein factors in genetic control, it is also known that RNA can take an active role in genetic regulation. Recent studies have begun to reveal the substantial role that small non-coding RNAs play in selectively targeting mRNAs for destruction, which results in down-regulation of gene expression (e.g. see Hannon, GJ. 2002, Nature 418, 244-251 and references therein). This process of RNA interference takes advantage of the ability of short RNAs to recognize the intended mRNA target selectively via Watson-Crick base complementation, after which the bound mRNAs are destroyed by the action of proteins. RNAs are ideal agents for molecular recognition in this system because it is far easier to generate new target- specific RNA factors through evolutionary processes than it would be to generate protein factors with novel but highly specific RNA binding sites.
Although proteins fulfill most requirements that biology has for enzyme, receptor and structural functions, RNA also can serve in these capacities. For example, RNA has sufficient structural plasticity to form numerous ribozyme domains (Cech & Golden, Building a catalytic active site using only RNA. Ih: The RNA World R. F. Gesteland, T. R. Cech, J. F. Atkins, eds., ρp.321-350 (1998); Breaker, In vitro selection of catalytic polynucleotides. Chem. Rev. 97, 371-390 (1997)) and receptor domains (Osborne & Ellington, Nucleic acid selection and the challenge of combinatorial chemistry. Chem. Rev. 97, 349-370 (1997); Hermann & Patel, Adaptive recognition by nucleic acid aptamers. Science 287, 820-825 (2000)) that exhibit considerable enzymatic power and precise molecular recognition. Furthermore, these activities can be combined to create allosteric ribozymes (Soukup & Breaker, Engineering precision RNA molecular switches. Proc. Natl. Acad. Sd. USA 96, 3584-3589 (1999); Seetharaman et al., Immobilized riboswitches for the analysis of complex chemical and biological mixtures. Nature Biotechnol. 19, 336-341 (2001)) that are selectively modulated by effector molecules.
These properties of RNA are consistent with speculation (Gold et al., From oligonucleotide shapes to genomic SELEX: novel biological regulatory loops. Proc. Natl. Acad. Sci. USA 94, 59-64 (1997); Gold et al., SELEX and the evolution of genomes. Curr. Opin. Gen. Dev. 7, 848-851 (1997); Nou & Kadner, Adenosylcobalamin inhibits ribosome binding to UuB RNA. Proc. Natl. Acad. Sd. USA 97, 7190-7195 (2000); Gelfand et al., A conserved RNA structure element involved in the regulation of bacterial riboflavin synthesis genes. Trends Gen. 15, 439-442 (1999); Miranda-Rios et al., A conserved RNA structure (thi box) is involved in regulation of thiamin biosynthetic gene expression in bacteria. Proc. Natl. Acad. ScL USA 98, 9736-9741 (2001); Stormo & Ji, Do rnRNAs act as direct sensors of small molecules to control their expression? Proc. Natl. Acad. ScL USA 98, 9465-9467 (2001)) that certain mRNAs might employ allosteric mechanisms to provide genetic regulatory responses to the presence of specific metabolites. Although a thiamine pyrophosphate (TPP)-dependent sensor/regulatory protein had been proposed to participate in the control of thiamine biosynthetic genes (Webb & Downs, Characterization oϊthiL, encoding thiamin- monophosphate kinase, in Salmonella typhimurium. J. Biol. Chem. 272, 15702-15707 (1997)), no such protein factor has been shown to exist.
Transcription of the lysC gene of B. subtilis is repressed by high concentrations of lysine (Kochhar, S., and Paulus, H. 1996, Microbiol. 142:1635-1639; Mader, U., et al., 2002, J. Bacteriol. 184:4288-4295; Patte, J.C. 1996. Biosynthesis of lysine and threonine. In: Escherichia coli and Salmonella: Cellular and Molecular Biology, F.C. Neidhardt, et al., eds., Vol. 1, pp. 528-541. ASM Press, Washington, DC; Patte, S.-C, et al., 1998, FEMS Microbiol. Lett. 169:165-170), but no protein factor had been identified that served as the genetic regulator (Liao, H.-H., and Hseu, T.-H. 1998, FEMS Microbiol. Lett. 168:31-36). The lysC gene encodes aspartokinase II, which catalyzes the first step in the metabolic pathway that converts L-aspartic acid into L-lysine (Belitsky, B.R. 2002. Biosynthesis of amino acids of the glutamate and aspartate families, alanine, and polyamines. In: Bacillus subtilis and its Closest Relatives: from Genes to Cells. A.L. Sonenshein, J.A. Hoch, and R. Losick, eds., ASM Press, Washington, D.C.).
BRIEF SUMMARY OF THE INVENTION It has been discovered that certain natural mRNAs serve as metabolite-sensitive genetic switches wherein the RNA directly binds a small organic molecule. This binding process changes the conformation of the mRNA, which causes a change in gene expression by a variety of different mechanisms. Modified versions of these natural "riboswitches" (created by using various nucleic acid engineering strategies) can be employed as designer genetic switches that are controlled by specific effector compounds. Such effector compounds that activate a riboswitch are referred to herein as trigger molecules. The natural switches are targets for antibiotics and other small molecule therapies. In addition, the architecture of riboswitches allows actual pieces of the natural switches to be used to construct new non-immunogenic genetic control elements, for example the aptamer (molecular recognition) domain can be swapped with other non-natural aptamers (or otherwise modified) such that the new recognition domain causes genetic modulation with user-defined effector compounds. The changed switches become part of a therapy regimen-turning on, or off, or regulating protein synthesis. Newly constructed genetic regulation networks can be applied in such areas as living biosensors, metabolic engineering of organisms, and in advanced forms of gene therapy treatments.
Disclosed are isolated and recombinant riboswitches, recombinant constructs containing such riboswitches, heterologous sequences operably linked to such riboswitches, and cells and transgenic organisms harboring such riboswitches, riboswitch recombinant constructs, and riboswitches operably linked to heterologous sequences. The heterologous sequences can be, for example, sequences encoding proteins or peptides of interest, including reporter proteins or peptides. Preferred riboswitches are, or are derived from, naturally occurring riboswitches. Also disclosed are the crystalline atomic structures of riboswitches. These structures are useful in modeling and assessing the interaction of a riboswitch with a binding ligand. They are also useful in methods of identifying compounds that interact with the riboswitch.
Also disclosed are chimeric riboswitches containing heterologous aptamer domains and expression platform domains. That is, chimeric riboswitches are made up an aptamer domain from one source and an expression platform domain from another source. The heterologous sources can be from, for example, different specific riboswitches or different classes of riboswitches. The heterologous aptamers can also come from non-riboswitch aptamers. The heterologous expression platform domains can also come from non-riboswitch sources.
Also disclosed are compositions and methods for selecting and identifying compounds that can activate, deactivate or block a riboswitch. Activation of a riboswitch refers to the change in state of the riboswitch upon binding of a trigger molecule. A riboswitch can be activated by compounds other than the trigger molecule and in ways other than binding of a trigger molecule. The term trigger molecule is used herein to refer to molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch. Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques). Non- natural trigger molecules can be referred to as non-natural trigger molecules. Deactivation of a riboswitch refers to the change in state of the riboswitch when the trigger molecule is not bound. A riboswitch can be deactivated by binding of compounds other than the trigger molecule and in ways other than removal of the trigger molecule. Blocking of a riboswitch refers to a condition or state of the riboswitch where the presence of the trigger molecule does not activate the riboswitch. Also disclosed are methods of identifying a compound that interacts with a riboswitch comprising modeling the atomic structure of the riboswitch with a test compound and determining if the test compound interacts with the riboswitch. This can be done by determining the atomic contacts of the riboswitch and test compound. Furthermore, analogs of a compound known to interact with a riboswitch can be generated by analyzing the atomic contacts, then optimizing the atomic structure of the analog to maximize interaction. These methods can be used with a high throughput screen.
Also disclosed are compounds, and compositions containing such compounds, that can activate, deactivate or block a riboswitch. Also disclosed are compositions and methods for activating, deactivating or blocking a riboswitch. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Compounds can be used to activate, deactivate or block a riboswitch. The trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch. Compounds other than the trigger molecule generally can be used to deactivate or block a riboswitch. Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch. A riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch.
Also disclosed are compositions and methods for altering expression of an RNA molecule, or of a gene encoding an RNA molecule, where the RNA molecule includes a riboswitch, by bringing a compound into contact with the RNA molecule. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Thus, subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA. Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA. Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.
Also disclosed are compositions and methods for regulating expression of an RNA molecule, or of a gene encoding an RNA molecule, by operably linking a riboswitch to the RNA molecule. A riboswitch can be operably linked to an RNA molecule in any suitable manner, including, for example, by physically joining the riboswitch to the RNA molecule or by engineering nucleic acid encoding the RNA molecule to include and encode the riboswitch such that the RNA produced from the engineered nucleic acid has the riboswitch operably linked to the RNA molecule. Subjecting a riboswitch operably linked to an RNA molecule of interest to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA.
Also disclosed are compositions and methods for regulating expression of a naturally occurring gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene is essential for survival of a cell or organism that harbors it, activating, deactivating or blocking the riboswitch can result in death, stasis or debilitation of the cell or organism. For example, activating a naturally occurring riboswitch in a naturally occurring gene that is essential to survival of a microorganism can result in death of the microorganism (if activation of the riboswitch turns off or represses expression). This is one basis for the use of the disclosed compounds and methods for antimicrobial and antibiotic effects. Also disclosed are compositions and methods for regulating expression of an isolated, engineered or recombinant gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. The gene or RNA can be engineered or can be recombinant in any manner. For example, the riboswitch and coding region of the RNA can be heterologous, the riboswitch can be recombinant or chimeric, or both. If the gene encodes a desired expression product, activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product. If the gene encodes an inducer or repressor of gene expression or of another cellular process, activation, deactivation or blocking of the riboswitch can result in induction, repression, or de-repression of other, regulated genes or cellular processes. Many such secondary regulatory effects are known and can be adapted for use with riboswitches. An advantage of riboswitches as the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules. Also disclosed are compositions and methods for altering the regulation of a riboswitch by operably linking an aptamer domain to the expression platform domain of the riboswitch (which is a chimeric riboswitch). The aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain. Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain. Generally, any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch. Also disclosed are compositions and methods for inactivating a riboswitch by covalently altering the riboswitch (by, for example, crosslinking parts of the riboswitch or coupling a compound to the riboswitch). Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule.
Also disclosed are methods of identifying compounds that activate, deactivate or block a riboswitch. For examples, compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner. For example, the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound. As another example, the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. As can be seen, assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.
Identification of compounds that block a riboswitch can be accomplished in any suitable manner. For example, an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation of the riboswitch.
Also disclosed are biosensor riboswitches. Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA. An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. Also disclosed are methods of detecting compounds using biosensor riboswitches. The method can include bringing into contact a test sample and a biosensor riboswitch and assessing the activation of the biosensor riboswitch. Activation of the biosensor riboswitch indicates the presence of the trigger molecule for the biosensor riboswitch in the test sample.
Also disclosed are compounds made by identifying a compound that activates, deactivates or blocks a riboswitch and manufacturing the identified compound. This can be accomplished by, for example, combining compound identification methods as disclosed elsewhere herein with methods for manufacturing the identified compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.
Also disclosed are compounds made by checking activation, deactivation or blocking of a riboswitch by a compound and manufacturing the checked compound. This can be accomplished by, for example, combining compound activation, deactivation or blocking assessment methods as disclosed elsewhere herein with methods for manufacturing the checked compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound. Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch. Also disclosed are methods for selecting, designing or deriving new riboswitches and/or new aptamers that recognize new trigger molecules. Such methods can involve production of a set of aptamer variants in a riboswitch, assessing the activation of the variant riboswitches in the presence of a compound of interest, selecting variant riboswitches that were activated (or, for example, the riboswitches that were the most highly or the most selectively activated), and repeating these steps until a variant riboswitch of a desired activity, specificity, combination of activity and specificity, or other combination of properties results. Also disclosed are riboswitches and aptamer domains produced by these methods.
The disclosed riboswitches, including the derivatives and recombinant forms thereof, generally can be from any source, including naturally occurring riboswitches and riboswitches designed de novo. Any such riboswitches can be used in or with the disclosed methods. However, different types of riboswitches can be defined and some such sub-types can be useful in or with particular methods (generally as described elsewhere herein). Types of riboswitches include, for example, naturally occurring riboswitches, derivatives and modified forms of naturally occurring riboswitches, chimeric riboswitches, and recombinant riboswitches. A naturally occurring riboswitch is a riboswitch having the sequence of a riboswitch as found in nature. Such a naturally occurring riboswitch can be an isolated or recombinant form of the naturally occurring riboswitch as it occurs in nature. That is, the riboswitch has the same primary structure but has been isolated or engineered in a new genetic or nucleic acid context. Chimeric riboswitches can be made up of, for example, part of a riboswitch of any or of a particular class or type of riboswitch and part of a different riboswitch of the same or of any different class or type of riboswitch; part of a riboswitch of any or of a particular class or type of riboswitch and any non-riboswitch sequence or component. Recombinant riboswitches are riboswitches that have been isolated or engineered in a new genetic or nucleic acid context.
Different classes of riboswitches refer to riboswitches that have the same or similar trigger molecules or riboswitches that have the same or similar overall structure (predicted, determined, or a combination). Riboswitches of the same class generally, but need not, have both the same or similar trigger molecules and the same or similar overall structure. Riboswitch classes include glycine-responsive riboswitches, guanine- responsive riboswitches, adenine-responsive riboswitches, lysine-responsive riboswitches, thiamine pyrophosphate-responsive riboswitch, adenosylcobalamin- responsive riboswitches, flavin mononucleotide-responsive riboswitches, and a S- adenosyknethionine-responsive riboswitches.
Additional advantages of the disclosed method and compositions will be set forth in part in the description which follows, and in part will be understood from the description, or can be learned by practice of the disclosed method and compositions. The advantages of the disclosed method and compositions will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention as claimed.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate several embodiments of the disclosed method and compositions and together with the description, serve to explain the principles of the disclosed method and compositions.
Figures IA, IB and 1C show the G box RNA of the xpt-pbuXmKNA in B. subtilis responds allosterically to guanine. Figure IA shows the consensus sequence and secondary model for the G box RNA domain that resides in the 5 ' UTR of genes that are largely involved in purine metabolism (SEQ ID NO: 1). Phylogenetic analysis is consistent with the formation of a three-stem (Pl through P3) junction. Nucleotides depicted shown as lower case letters and capitals are present in greater than 90% and 80% of the representatives examined, respectively. Encircled nucleotides exhibit base complementation, which might indicate the formation of a pseudoknot. Figure IB shows sequence and ligand-induced structural alterations of the 5'-UTR of the xpt-pbuX transcriptional unit (SEQ ID NO:2). The putative anti-terminator interaction is represented by the boxes. Nucleotides that undergo structural alteration as determined by in-line probing (from C) are identified with squares. The 93 xpt fragment (boxed) of the 201 xpt RNA retains guanine-binding function. Asterisks denote alterations to the RNA sequence that facilitate in vitro transcription (5 ' terminus) or that generate a restriction site (3' terminus). Nucleotide numbers begin at the first nucleotide of the natural transcription start site. The translation start codon begins at position 186. Figure 1C shows guanine and related purines selectively induce structural modulation of the 93 xpt mRNA fragment. Precursor RNAs (Pre; 5 ' 32P-labeled) were subjected to in-line probing by incubation for 40 hr in the absence (-) or presence of guanine, hypoxanthine, xanthine and adenine as indicated by G, H, X and A, respectively. Lanes designated NR, Tl and "OH contain RNA that was not reacted, subjected to partial digestion with RNase Tl (G-specific cleavage), or subjected to partial alkaline digestion, respectively. Selected bands corresponding to G-specific cleavage are identified. Regions 1 through 4 identify major sites of ligand-induced modulation of spontaneous RNA cleavage.
Figures 2A, 2B and 2C show a molecular discrimination by the guanine-binding aptamer of the xpt-pbuXmKNA. Figure 2 A shows the chemical structures and apparent Kj) values for guanine, hypoxanthine and xanthine (active natural regulators of xpt-pbuX genetic expression in B. subtilis) versus that of adenine (inactive). Differences in chemical structure relative to guanine are encircled. KQ values were established as shown in Figure 2 with the 201 xpt RNA. Numbers on guanine represent the positions of the ring nitrogen atoms. Figure 2B shows chemical structures and K^ values for various analogs of guanine reveal that all alterations of this purine cause a loss of binding affinity. Open circles identify KD values that most likely are significantly higher than indicated, as concentrations of analog above 500 μM were not examined in this analysis. The apparent KD values of G, H, X and A as indicated are plotted as triangles for comparison. Figure 2C shows a schematic representation of the molecular recognition features of the guanine aptamer in 201 xpt. Hydrogen bond formation at position 9 of guanine is expected because guanosine (KB > 100 μM) and inosine (KB > 100 μM), which are 9-ribosyl derivatives of guanine and hypoxanthine, respectively, do not exhibit measurable binding. Figures 3A, 3B and 3C show guanine- and adenine-specific riboswitches. Figure
3 A shows sequence and structural features of the two guanine-specific (purE and xpt) and three adenine-specific aptamer domains that are examined in this study BS2-purE, BS3-xpt, BS5-ydhL, CP4-add, VVl-add, which are represented by SEQ ID NOS:3-7, respectively. Pl through P3 identify the three base-paired stems comprising the secondary structure of the aptamer domain. Lowercase nucleotides identify positions whose base identity is conserved in greater than 90% of representatives in the phylogeny. The arrow identifies a nucleotide within the conserved core of the aptamer that is a determinant of ligand specificity. BS, CP and W designate B. subtilis, Clostridium perfringens and Vibrio vulnificus, respectively. Figure 3B shows sequence and secondary structure of the xpt andydhL aptamers (SEQ ID NO:8). Encircled nucleotides identify positions within the ydhL aptamer that differ from those in the xpt aptamer. The sequence disclosed in Figure 3C is SEQ ID NO:9. Nucleotides in xpt are numbered as described in Example 6 of U.S. Application Publication No. 2005-0053951. Other notations are as described in A. Figures 4 A and 4B show the specificity of molecular recognition by the adenine aptamer from. ydhL. Figure 4a Top: Chemical structures of adenine, guanine and other purine analogs that exhibit measurable binding to the SO ydhL RNA. Chemical changes relative to 2,6-DAP, which is the tightest-binding compound, are encircled. Bottom left: Plot of the apparent KQ values for various purines. Bottom right: Model for the chemical features on adenine that serve as molecular recognition contacts for ydhL. Note that the importance of N7 and N9 has not been determined. Encircled arrow indicated that a contact could exist if a hydrogen bond donor is appended to C2. Figure 4b shows chemical structures of various purines that are not bound by the 80 ydhL RNA (KB values poorer than 300 μM). Figures 5A, 5B, 5C, and 5D show secondary and tertiary structures of the guanine riboswitch-hypoxanthine complex. Figure 5A shows left, secondary structure of the Jcpt-p&MXguanine-binding domain of the guanine riboswitch of B. subtilis (SEQ ID NO:213). Nucleotides conserved in more than 90% of known guanine riboswitches are shown in red; the numbering is consistent with that of the full-length mRNA. Colored boxes correspond to structural features shown in Figures 6 and 7. Right, tertiary architecture of the hypoxanthine-bound form. Key tertiary interactions between the loops are shown as thick broken lines; a water-mediated triple is indicated by the circled V. Figure 5B shows gene repression by the guanine riboswitch in the 5' untranslated region of mRNA (SEQ ID NO:214). Initial transcription generates a binding domain that is primed to bind guanine (G) rapidly if it is at a sufficiently high concentration. Hypoxanthine (FIX, top right) stabilizes the guanine-binding domain and particularly the Pl helix, forcing the mRNA to form a terminator element that halts transcription, m the absence of ligand (bottom right), an antiterminator forms, facilitating continued transcription. Figure 5 C shows ribbon representation of the three-dimensional structure of the RNA-hypoxanthine complex. The hypoxanthine ligand is shown in red, with its surface represented by dots. Figure 5D shows the top view of the complex, emphasizing the close packing of the P2 and P3 helices.
Figures 6A, 6B, and 6C show recognition of hypoxanthine (HX) by the guanine- binding domain. Figure 6A: Stereo view of the hypoxanthine-binding pocket in the three-way junction. Figure 6B: Hydrogen-bonding interactions (grey broken lines) between hypoxanthine and the RNA. The final model (shown in stick representation) is superimposed on a simulated annealing 2Fo-Fc omit map (orange cage), in which the atoms shown were excluded from the map calculation. Figure 6C: Molecular surface representation of the binding pocket of the guanine riboswitch bound to hypoxanthine (left), compared with the theophylline-binding aptamer bound to theophylline (centre) and the E. coli purR repressor bound to hypoxanthine (right). Figures 7A, 7B, and 7C show stabilization of the tertiary architecture. Figure
7A: One of two base quartets that form the core of the loop-loop contact. The carbon atoms are colored as in Figure 5. Figure 7B: Side view of the loop— loop interaction, emphasizing the arrangement of base pairs and quartets. The bases of the quartet shown in A are colored blue, with the hydrogen bonding between A65 and U34 shown for orientation; the bases of the other quartet are colored green. The A35A64 pair is shown in yellow, with hydrogen bonds emphasizing its interactions with the 2'-hydroxyl group of U34. The capping G62U63 pair is shown in red. Figures 8 A and 8B show an estimation of the affinity of the riboswitch for hypoxanthine. Shown are the isothermal titration calorimetry curves for the wild-type guanine-binding domain (Figure 8A) and for the guanine-binding domain lacking the tertiary interaction (Figure 8B) with hypoxanthine at 30 °C in buffer containing 10 mM K+-HEPES (pH 7.5), 100 mM KCl and 5 mM MgCl2.
Figures 9 A and 9B show schematic representations of two types of riboswitch binding assays that can be used with high throughput screens. Figure 9A: a ribozyme- based assay can be used that exploits inherent action of a self-cleaving ribozyme. X represents the compound being tested, GlcN6P is glucosamine-6-phosphate. F and Q represent fluorophore and quencher moieties, respectfully (e.g., TAMRA and CY3).
Figure 9B: A molecular beacon assay can be used for non-ribozyme riboswitches such as the guanine-binding RNA. Notations are described in A.
Figure 10 shows cobalt hexammine ions bound to the guanine riboswitch. The RNA (grey) on the left is shown in the same perspective as in Figure 5B, with bound hypoxanthine in red and Co(NH3)6 3+ shown in green and blue. The RNA on the right is rotated 180° with respect to the left view.
Figure 11 shows secondary structure of RNA GR-minimal (SEQ ID NO:215). This RNA has been designed to test the effect of the tertiary interaction formed by the loops L2 and L3 upon ligand binding. In this construct the L2 and L3 loops have been ablated along with part of P2 and P3, and replaced by extremely stable UUCG tetraloops. Figures 12A and 12B show structure based design of anti-riboswitch compounds. Figure 12 A: Atomic-resolution model of the guanine riboswitch bound to the guanine analog hypoxanthine (HX). Guanine binding can include two added contacts made between the exocyclic amine position 2 of the purine ring and the oxygen atoms of C60 and U37. Blue shading identifies the channels that are present, which can allow modification of the guanine ring. Along these channels, and in the vicinity of their termini, are opportunities for new contacts to be made between the aptamer and specific guanine derivatives. Figure 12B: Representatives of guanine analogs (total of 26; named G-001 though G-26) were synthesized to exploit the channels in the guanine riboswitch structure. Most compounds in this collection bind to the riboswitch with dissociation constants of lower than 1 nanomolar (see Figure 13). These compounds can be further modified to acquire new contacts between the compounds and functional groups near the channels. Figure 13 shows representative polyacrylamde gel electrophoresis analysis of in¬ line probing studies that reveal allosteric modulation of the guanine aptamer by various analogs. NR, Tl and OH identify no reaction, nuclease Tl partial digestion, and alkaline partial digestion respectively. In-line probing assays were conducted for ~1 day, and a dampening of spontaneous RNA cleavage product bands in the Jl/2, J2/3 and J3/1 regions are indicative of ligand binding. This dampening occurs even when 1 nM of compound is added, indicating that the dissociation constant is equal to or better than this value.
Figure 14 shows a GFP reporter assay for guanine riboswitch function in vivo. In the absence of guanine, a full-length mRNA is transcribed and GFP is expressed. At high concentrations of guanine, or an active guanine analog, a transcriptional terminator is formed, repressing the expression of GFP. A similar system has been created wherein the jS-galactosidase gene is under riboswitch control.
Figure 15 shows time-kill assay for G-014. G-014 reduces the number of viable B. subtilis cells over time at a rate equal to carbenicillm.
DETAILED DESCRIPTION OF THE INVENTION The disclosed methods and compositions can be understood more readily by reference to the following detailed description of particular embodiments and the Examples included therein and to the Figures and their previous and following description.
Certain natural mRNAs serve as metabolite-sensitive genetic switches wherein the RNA directly binds a small organic molecule. This binding process changes the conformation of the mRNA, which causes a change in gene expression by a variety of different mechanisms. Modified versions of these natural "riboswitch.es" (created by using various nucleic acid engineering strategies) can be employed as designer genetic switches that are controlled by specific effector compounds (referred to herein as trigger molecules). The natural switches are targets for antibiotics and other small molecule therapies. In addition, the architecture of riboswitches allows actual pieces of the natural switches to be used to construct new non-immunogenic genetic control elements, for example the aptamer (molecular recognition) domain can be swapped with other non- natural aptamers (or otherwise modified) such that the new recognition domain causes genetic modulation with user-defined effector compounds. The changed switches become part of a therapy regimen - turning on, or off, or regulating protein synthesis. Newly constructed genetic regulation networks can be applied in such areas as living biosensors, metabolic engineering of organisms, and in advanced forms of gene therapy treatments.
Messenger RNAs are typically thought of as passive carriers of genetic information that are acted upon by protein- or small RNA-regulatory factors and by ribosomes during the process of translation. It was discovered that certain mRNAs carry natural aptamer domains and that binding of specific metabolites directly to these RNA domains leads to modulation of gene expression. Natural riboswitches exhibit two surprising functions that are not typically associated with natural RNAs. First, the mRNA element can adopt distinct structural states wherein one structure serves as a precise binding pocket for its target metabolite. Second, the metabolite-induced allosteric interconversion between structural states causes a change in the level of gene expression by one of several distinct mechanisms. Riboswitches typically can be dissected into two separate domains: one that selectively binds the target (aptamer domain) and another that influences genetic control (expression platform). It is the dynamic interplay between these two domains that results in metabolite-dependent allosteric control of gene expression.
Distinct classes of riboswitches have been identified and are shown to selectively recognize activating compounds (referred to herein as trigger molecules). For example, coenzyme B12, glycine, thiamine pyrophosphate (TPP), and flavin mononucleotide (FMN) activate riboswitches present in genes encoding key enzymes in metabolic or transport pathways of these compounds. The aptamer domain of each riboswitch class conforms to a highly conserved consensus sequence and structure. Thus, sequence homology searches can be used to identify related riboswitch domains. Riboswitch domains have been discovered in various organisms from bacteria, archaea, and eukarya. One class of riboswitches that recognizes guanine and discriminates against most other purine analogs has been discovered. Representative RNAs that carry the consensus sequence and structural features of guanine riboswitches are located in the 5'- untranslated region (UTR) of numerous genes of prokaryotes, where they control expression of proteins involved in purine salvage and biosynthesis. Three representatives of this phylogenetic collection bind adenine with values for apparent dissociation constant (apparent KQ) that are several orders of magnitude better than for guanine. The preference for adenine is due to a single nucleotide substitution in the core of the riboswitch, wherein each representative most likely recognizes its corresponding ligand by forming a Watson/Crick base pair. In addition, the adenine-specific riboswitch associated with the ydhL gene of Bacillus subtilis functions as a genetic 'ON' switch, wherein adenine binding causes a structural rearrangement that precludes formation of an intrinsic transcription terminator stem. Guanine-sensing riboswitches are a class of RNA genetic control elements that modulate gene expression in response to changing concentrations of this compound.
The 5 '-untranslated sequence of the Escherichia coli btuB mRNA assumes a more proactive role in metabolic monitoring and genetic control. The mRNA serves as a metabolite-sensing genetic switch by selectively binding coenzyme B12 without the need for proteins. This binding event establishes a distinct RNA structure that is likely to be responsible for inhibition of ribosome binding and consequent reduction in synthesis of the cobalamin transport protein BtuB. TMs discovery, along with related observations described herein, supports the hypothesis that metabolic monitoring through RNA- metabolite interactions is a widespread mechanism of genetic control.
RNA structure probing data indicate that the thiamine pyrophosphate (TPP) riboswitch operates as an allosteric sensor of its target compound, wherein binding of TPP by the aptamer domain stabilizes a conformational state within the aptamer and within the neighboring expression platform that precludes translation. The diversity of expression platforms appears to be expansive. The tAiMRNA uses a Shine-Dalgarno (SD)-blocking mechanism to control translation, m contrast, the thiC RNA controls gene expression both at transcription and translation, and therefore might make use of a somewhat more complex expression platform that converts the TPP binding event into a transcription termination event and into inhibition of translation of completed mRNAs. Numerous other riboswitches are known that can be used together or as part of a chimeric riboswitch along with glycine-sensing riboswitches and their components. Examples of such riboswitches and their use are described in U.S. Application Publication No. 2005-0053951, which is hereby incorporated by reference in its entirety and in particular for its description of the structure and operation of particular riboswitches.
A. General Organization of Riboswitch RNAs
Bacterial riboswitch RNAs are genetic control elements that are located primarily within the 5 '-untranslated region (5'-UTR) of the main coding region of a particular mRNA. Structural probing studies (discussed further below) reveal that riboswitch elements are generally composed of two domains: a natural aptamer (T. Hermann, D. J. Patel, Science 2000, 287, 820; L. Gold, et al., Annual Review of Biochemistry 1995, 64, 763) that serves as the ligand-binding domain, and an 'expression platform' that interfaces with RNA elements that are involved in gene expression {e.g. Shine-Dalgarno (SD) elements; transcription terminator stems). These conclusions are drawn from the observation that aptamer domains synthesized in vitro bind the appropriate ligand in the absence of the expression platform (see Examples 2, 3 and 6 of U.S. Application Publication No. 2005-0053951). Moreover, structural probing investigations suggest that the aptamer domain of most riboswitches adopts a particular secondary- and tertiary- structure fold when examined independently, that is essentially identical to the aptamer structure when examined in the context of the entire 5' leader RNA. This implies that, in many cases, the aptamer domain is a modular unit that folds independently of the expression platform (see Examples 2, 3 and 6 of U.S. Application Publication No. 2005- 0053951).
Ultimately, the ligand-bound or unbound status of the aptamer domain is interpreted through the expression platform, which is responsible for exerting an influence upon gene expression. The view of a riboswitch as a modular element is further supported by the fact that aptamer domains are highly conserved amongst various organisms (and even between kingdoms as is observed for the TPP riboswitch), (N.
Sudarsan, et al., RNA 2003, 9, 644) whereas the expression platform varies in sequence, structure, and in the mechanism by which expression of the appended open reading frame is controlled. For example, ligand binding to the TPP riboswitch of the tenA mRNA of B. subtilis causes transcription termination (A. S. Mironov, et al., Cell 2002, 111, 747). This expression platform is distinct in sequence and structure compared to the expression platform of the TPP riboswitch in the thiMmKNA from E. coli, wherein TPP binding causes inhibition of translation by a SD blocking mechanism (see Example 2 of U.S. Application Publication No. 2005-0053951). The TPP aptamer domain is easily recognizable and of near identical functional character between these two transcriptional units, but the genetic control mechanisms and the expression platforms that carry them out are very different.
Aptamer domains for riboswitch RNAs typically range from ~70 to 170 nt in length (Figure 11 of U.S. Application Publication No. 2005-0053951). This observation was somewhat unexpected given that in vitro evolution experiments identified a wide variety of small molecule-binding aptamers, which are considerably shorter in length and structural intricacy (T. Hermann, D. J. Patel, Science 2000, 287, 820; L. Gold, et al., Annual Review of Biochemistry 1995, 64, 763; M. Famulok, Current Opinion in Structural Biology 1999, 9, 324). Although the reasons for the substantial increase in complexity and information content of the natural aptamer sequences relative to artificial aptamers remains to be proven, this complexity is believed required to form RNA receptors that function with high affinity and selectivity. Apparent KQ values for the ligand-riboswitch complexes range from low nanomolar to low micromolar. It is also worth noting that some aptamer domains, when isolated from the appended expression platform, exhibit improved affinity for the target ligand over that of the intact riboswitch. (-10 to 100-fold) (see Example 2 of U.S. Application Publication No. 2005-0053951). Presumably, there is an energetic cost in sampling the multiple distinct RNA conformations required by a fully intact riboswitch RNA, which is reflected by a loss in ligand affinity. Since the aptamer domain must serve as a molecular switch, this might also add to the functional demands on natural aptamers that might help rationalize their more sophisticated structures. B. Riboswitch Regulation of Transcription Termination in Bacteria
Bacteria primarily make use of two methods for termination of transcription. Certain genes incorporate a termination signal that is dependent upon the Rho protein, (J. P. Richardson, Biochimica et Biophysica Acta 2002, 1577, 251). while others make use of Rho-independent terminators (intrinsic terminators) to destabilize the transcription elongation complex (I. Gusarov, E. Nudler, Molecular Cell 1999, 3, 495; E. Nudler, M. E. Gottesman, Genes to Cells 2002, 7, 755). The latter RNA elements are composed of a GC-rich stem-loop followed by a stretch of 6-9 uridyl residues. Intrinsic terminators are widespread throughout bacterial genomes (F. Lillo, et al., 2002, 18, 971), and are typically located at the 3 '-termini of genes or operons. Interestingly, an increasing number of examples are being observed for intrinsic terminators located within 5'-UTRs. Amongst the wide variety of genetic regulatory strategies employed by bacteria there is a growing class of examples wherein RNA polymerase responds to a termination signal within the 5 '-UTR in a regulated fashion (T. M. Henkin, Current Opinion in Microbiology 2000, 3, 149). During certain conditions the RNA polymerase complex is directed by external signals either to perceive or to ignore the termination signal. Although transcription initiation might occur without regulation, control over mRNA synthesis (and of gene expression) is ultimately dictated by regulation of the intrinsic terminator. Presumably, one of at least two mutually exclusive mRNA conformations results in the formation or disruption of the RNA structure that signals transcription termination. A trans-acting factor, which in some instances is a RNA (F. J. Grundy, et al.5 Proceedings of the National Academy of Sciences of the United States of America 2002, 99, 11121; T. M. Henkin, C. Yanofsky, Bioessays 2002, 24, 700) and in others is a protein (J. Stulke, Archives of Microbiology 2002, 177, 433), is generally required for receiving a particular intracellular signal and subsequently stabilizing one of the RNA conformations. Riboswitches offer a direct link between RNA structure modulation and the metabolite signals that are interpreted by the genetic control machinery. A brief overview of the FMN riboswitch from a B. subtilis mRNA is provided below to illustrate this mechanism.
The xpt-pbuX opGvon (Christiansen, L.C., et al., 1997, J. Bacteriol. 179, 2540- 2550) is controlled by a riboswitch that exhibits high affinity and high selectivity for guanine. This class of riboswitches is present in the 5 '-untranslated region (5 '-UTR) of five transcriptional units in B. subtilis, including that of the 12~gene/w operon. Direct binding of guanine by mRNAs serves as a critical determinant of metabolic homeostasis for purine metabolism in certain bacteria. Furthermore, the discovered classes of riboswitches, which respond to seven distinct target molecules, control at least 68 genes in Bacillus subtilis that are of fundamental importance to central metabolic pathways.
Also disclosed are guanine riboswitches that have been identified in B. subtilis. The crystal structure at 1.95 A resolution of the purine-binding domain of the guanine riboswitch from the xpt-pbuX operon of B. subtilis bound to hypoxanthine, a prevalent metabolite in the bacterial purine salvage pathway, has been elucidated. This structure reveals a complex RNA fold involving several phylogenetically conserved nucleotides that create a binding pocket that almost completely envelops the ligand. Hypoxanthine functions to stabilize this structure and to promote the formation of a downstream transcriptional terminator element, thereby providing a mechanism for directly repressing gene expression in response to an increase in intracellular concentrations of metabolite.
Certain mRNAs involved in thiamine biosynthesis bind to thiamine (vitamin B1) or its bioactive pyrophosphate derivative (TPP) without the participation of protein factors. The tnJRNA-effector complex adopts a distinct structure that sequesters the ribosome-binding site and leads to a reduction in gene expression. This metabolite- sensing mRNA system provides an example of a genetic "riboswitch" (referred to herein as a riboswitch) whose origin might predate the evolutionary emergence of proteins. It has been discovered that the mRNA leader sequence of the btuB gene of Escherichia coli can bind coenzyme B12 selectively, and that this binding event brings about a structural change in the RNA that is important for genetic control (see Example 1 of U.S. Application Publication No. 2005-0053951). It was also discovered that mRNAs that encode thiamine biosynthetic proteins also employ a riboswitch mechanism (see Example 2 of U.S. Application Publication No. 2005-0053951).
A riboswitch class was discovered in bacteria that is selectively triggered by glycine. A representative of these glycine-sensing RNAs from Bacillus subtilis operates as a rare genetic on switch for the gcυToperon, which codes for proteins that form the glycine cleavage system. Most glycine riboswitches integrate two ligand-binding domains that function cooperatively to more closely approximate a two-state genetic switch. This advanced form of riboswitch may have evolved to ensure that excess glycine is efficiently used to provide carbon flux through the citric acid cycle and maintain adequate amounts of the amino acid for protein synthesis. Thus, riboswitches perform key regulatory roles and exhibit complex performance characteristics that previously had been observed only with protein factors.
Although the specific natural riboswitches disclosed herein are the first examples of mRNA elements that control genetic expression by metabolite binding, it is expected that this genetic control strategy is widespread in biology. It has been suggested (White III, Coenzymes as fossils of an earlier metabolic state. J. MoL Evol. 7, 101-104 (1976); White III, In: The Pyridine Nucleotide Coenzymes. Acad. Press, NY pp. 1-17 (1982); Benner et al., Modern metabolism as a palimpsest of the RNA world. Proc. Natl. Acad. ScL USA 86, 7054-7058 (1989)) that TPP, coenzyme B12 and FMN emerged as biological cofactors during the RNA world (Joyce, The antiquity of RNA-based evolution. Nature 418, 214-221 (2002)). If these metabolites were being biosynthesized and used before the advent of proteins, then certain riboswitches might be modern examples of the most ancient form of genetic control. A search of genomic sequence databases has revealed that sequences corresponding to the TPP aptamer exist in organisms from bacteria, archaea and eukarya-largely without major alteration. Although new metabolite-binding mRNAs are likely to emerge as evolution progresses, it is possible that the known riboswitches are molecular fossils from the RNA world.
It is to be understood that the disclosed method and compositions are not limited to specific synthetic methods, specific analytical techniques, or to particular reagents unless otherwise specified, and, as such, can vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting.
Materials Disclosed are materials, compositions, and components that can be used for, can be used in conjunction with, can be used in preparation for, or are products of the disclosed methods and compositions. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that while specific reference to each of various individual and collective combinations and permutation of these compounds can not be explicitly disclosed, each is specifically contemplated and described herein. For example, if a riboswitch or ap tamer domain is disclosed and discussed and a number of modifications that can be made to a number of molecules including the riboswitch or aptamer domain are discussed, each and every combination and permutation of riboswitch or aptamer domain and the modifications that are possible are specifically contemplated unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules D, E, and F and an example of a combination molecule, A-D is disclosed, then even if each is not individually recited, each is individually and collectively contemplated. Thus, in this example, each of the combinations A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are specifically contemplated and should be considered disclosed from disclosure of A, B5 and C; D, E, and F; and the example combination A-D. Likewise, any subset or combination of these is also specifically contemplated and disclosed. Thus, for example, the sub-group of A- E, B-F, and C-E are specifically contemplated and should be considered disclosed from disclosure of A, B, and C; D, E, and F; and the example combination A-D. This concept applies to all aspects of this application including, but not limited to, steps in methods of making and using the disclosed compositions. Thus, if there are a variety of additional steps that can be performed it is understood that each of these additional steps can be performed with any specific embodiment or combination of embodiments of the disclosed methods, and that each such combination is specifically contemplated and should be considered disclosed. A. Riboswitches
Riboswitches are expression control elements that are part of an RNA molecule to be expressed and that change state when bound by a trigger molecule. Riboswitches typically can be dissected into two separate domains: one that selectively binds the target (aptamer domain) and another that influences genetic control (expression platform domain). It is the dynamic interplay between these two domains that results in metabolite-dependent allosteric control of gene expression. Disclosed are isolated and recombinant riboswitches, recombinant constructs containing such riboswitches, heterologous sequences operably linked to such riboswitches, and cells and transgenic organisms harboring such riboswitches, riboswitch recombinant constructs, and riboswitches operably linked to heterologous sequences. The heterologous sequences can be, for example, sequences encoding proteins or peptides of interest, including reporter proteins or peptides. Preferred riboswitches are, or are derived from, naturally occurring riboswitches.
The disclosed riboswitches, including the derivatives and recombinant forms thereof, generally can be from any source, including naturally occurring riboswitches and riboswitches designed de novo. Any such riboswitches can be used in or with the disclosed methods. However, different types of riboswitches can be defined and some such sub-types can be useful in or with particular methods (generally as described elsewhere herein). Types of riboswitches include, for example, naturally occurring riboswitches, derivatives and modified forms of naturally occurring riboswitches, chimeric riboswitches, and recombinant riboswitches. A naturally occurring riboswitch is a riboswitch having the sequence of a riboswitch as found in nature. Such a naturally occurring riboswitch can be an isolated or recombinant form of the naturally occurring riboswitch as it occurs in nature. That is, the riboswitch has the same primary structure but has been isolated or engineered in a new genetic or nucleic acid context. Chimeric riboswitches can be made up of, for example, part of a riboswitch of any or of a particular class or type of riboswitch and part of a different riboswitch of the same or of any different class or type of riboswitch; part of a riboswitch of any or of a particular class or type of riboswitch and any non-riboswitch sequence or component. Recombinant riboswitches are riboswitches that have been isolated or engineered in a new genetic or nucleic acid context.
Riboswitches can have single or multiple aptamer domains. Aptamer domains in riboswitches having multiple aptamer domains can exhibit cooperative binding of trigger molecules or can not exhibit cooperative binding of trigger molecules (that is, the aptamers need not exhibit cooperative binding). In the latter case, the aptamer domains can be said to be independent binders. Riboswitches having multiple aptamers can have one or multiple expression platform domains. For example, a riboswitch having two aptamer domains that exhibit cooperative binding of their trigger molecules can be linked to a single expression platform domain that is regulated by both aptamer domains. Riboswitches having multiple aptamers can have one or more of the aptamers joined via a linker. Where such aptamers exhibit cooperative binding of trigger molecules, the linker can be a cooperative linker.
Aptamer domains can be said to exhibit cooperative binding if they have a Hill coefficient n between x and x-1 , where x is the number of aptamer domains (or the number of binding sites on the aptamer domains) that are being analyzed for cooperative binding. Thus, for example, a riboswitch having two aptamer domains (such as glycine- responsive riboswitches) can be said to exhibit cooperative binding if the riboswitch has Hill coefficient between 2 and 1. It should be understood that the value of x used depends on the number of aptamer domains being analyzed for cooperative binding, not necessarily the number of aptamer domains present in the riboswitch. This makes sense because a riboswitch can have multiple aptamer domains where only some exhibit cooperative binding.
Different classes of riboswitches refer to riboswitches that have the same or similar trigger molecules or riboswitches that have the same or similar overall structure (predicted, determined, or a combination). Riboswitches of the same class generally, but need not, have both the same or similar trigger molecules and the same or similar overall structure. Riboswitch classes include glycine-responsive riboswitches, guanine- responsive riboswitches, adenine-responsive riboswitches, lysine-responsive riboswitches, thiamine pyrophosphate-responsive riboswitch, adenosylcobalarøin- responsive riboswitches, flavin mononucleotide-responsive riboswitches, and a S- adenosyhnethionine-responsive riboswitches. Also disclosed are chimeric riboswitches containing heterologous aptamer domains and expression platform domains. That is, chimeric riboswitches are made up an aptamer domain from one source and an expression platform domain from another source. The heterologous sources can be from, for example, different specific riboswitches, different types of riboswitches, or different classes of riboswitches. The heterologous aptamers can also come from non-riboswitch aptarners. The heterologous expression platform domains can also come from non-riboswitch sources.
Riboswitches can be modified from other known, developed or naturally- occurring riboswitches. For example, switch domain portions can be modified by changing one or more nucleotides while preserving the known or predicted secondary, tertiary, or both secondary and tertiary structure of the riboswitch. For example, both nucleotides in a base pair can be changed to nucleotides that can also base pair. Changes that allow retention of base pairing are referred to herein as base pair conservative changes. Modified or derivative riboswitches can also be produced using in vitro selection and evolution techniques, ha general, in vitro evolution techniques as applied to riboswitches involve producing a set of variant riboswitches where part(s) of the riboswitch sequence is varied while other parts of the riboswitch are held constant. Activation, deactivation or blocking (or other functional or structural criteria) of the set of variant riboswitches can then be assessed and those variant riboswitches meeting the criteria of interest are selected for use or further rounds of evolution. Useful base riboswitches for generation of variants are the specific and consensus riboswitches disclosed herein. Consensus riboswitches can be used to inform which part(s) of a riboswitch to vary for in vitro selection and evolution. Also disclosed are modified riboswitches with altered regulation. The regulation of a riboswitch can be altered by operably linking an aptamer domain to the expression platform domain of the riboswitch (which is a chimeric riboswitch). The aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain. Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain. Generally, any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch.
Also disclosed are inactivated riboswitcb.es. Riboswitches can be inactivated by covalently altering the riboswitch (by, for example, crosslinking parts of the riboswitch or coupling a compound to the riboswitch). Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule. Also disclosed are biosensor riboswitches. Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA. An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. Biosensor riboswitches can be used in various situations and platforms. For example, biosensor riboswitches can be used with solid supports, such as plates, chips, strips and wells.
Also disclosed are modified or derivative riboswitches that recognize new trigger molecules. New riboswitches and/or new aptamers that recognize new trigger molecules can be selected for, designed or derived from known riboswitches. This can be accomplished by, for example, producing a set of aptamer variants in a riboswitch, assessing the activation of the variant riboswitches in the presence of a compound of interest, selecting variant riboswitches that were activated (or, for example, the riboswitches that were the most highly or the most selectively activated), and repeating these steps until a variant riboswitch of a desired activity, specificity, combination of activity and specificity, or other combination of properties results. Particularly useful aptamer domains can form a stem structure referred to herein as the Pl stem structure (or simply Pl). The Pl stems of a variety of riboswitclies are shown in Figure 11 of U.S. Application Publication No. 2005-0053951. The hybridizing strands in the Pl stem structure are referred to as the aptamer strand (also referred to as the PIa strand) and the control strand (also referred to as the PIb strand). The control strand can form a stem structure with both the aptamer strand and a sequence in a linked expression platform that is referred to as the regulated strand (also referred to as the PIc strand). Thus, the control strand (PIb) can form alternative stem structures with the aptamer strand (PIa) and the regulated strand (PIc). Activation and deactivation of a riboswitch results in a shift from one of the stem structures to the other (from Pla/Plb to Plb/Plc or vice versa). The formation of the Plb/Plc stem structure affects expression of the RNA molecule containing the riboswitch. Riboswitches that operate via this control mechanism are referred to herein as alternative stem structure riboswitches (or as alternative stem riboswitches). Some glycine-responsive riboswitches having two aptamers utilize this mechanism using a P 1 stem in the second aptamer.
In general, any aptamer domain can be adapted for use with any expression platform domain by designing or adapting a regulated strand in the expression platform domain to be complementary to the control strand of the aptamer domain. Alternatively, the sequence of the aptamer and control strands of an aptamer domain can be adapted so that the control strand is complementary to a functionally significant sequence in an expression platform. For example, the control strand can be adapted to be complementary to the Shine-Dalgarno sequence of an RNA such that, upon formation of a stem structure between the control strand and the SD sequence, the SD sequence becomes inaccessible to ribosomes, thus reducing or preventing translation initiation. Note that the aptamer strand would have corresponding changes in sequence to allow formation of a Pl stem in the aptamer domain. In the case of riboswitches having multiple aptamers exhibiting cooperative binding, one the Pl stem of the activating aptamer (the aptamer that interacts with the expression platform domain) need be designed to form a stem structure with the SD sequence. As another example, a transcription terminator can be added to an RNA molecule
(most conveniently in an untranslated region of the RNA) where part of the sequence of the transcription terminator is complementary to the control strand of an aptamer domain
(the sequence will be the regulated strand). This will allow the control sequence of the aptamer domain to form alternative stem structures with the aptamer strand and the regulated strand, thus either forming or disrupting a transcription terminator stem upon activation or deactivation of the riboswitch. Any other expression element can be brought under the control of a riboswitch by similar design of alternative stem structures. For transcription terminators controlled by riboswitches, the speed of transcription and spacing of the riboswitch and expression platform elements can be important for proper control. Transcription speed can be adjusted by, for example, including polymerase pausing elements (e.g., a series of uridine residues) to pause transcription and allow the riboswitch to form and sense trigger molecules. For example, with the FMN riboswitch, if FMN is bound to its aptamer domain, then the antiterminator sequence is sequestered and is unavailable for formation of an antiterminator structure (Figure 12 of U.S. Application Publication No. 2005-0053951). However, if FMN is absent, the antiterminator can form once its nucleotides emerge from the polymerase. RNAP then breaks free of the pause site only to reach another U- stretch and pause again. The transcriptional terminator then forms only if the terminator nucleotides are not tied up by the antiterminator.
Disclosed are regulatable gene expression constructs comprising a nucleic acid molecule encoding an RNA comprising a riboswitch operably linked to a coding region, wherein the riboswitch regulates expression of the RNA, wherein the riboswitch and coding region are heterologous. The riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain and the expression platform domain are heterologous. The riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain comprises a Pl stem, wherein the Pl stem comprises an aptamer strand and a control strand, wherein the expression platform domain comprises a regulated strand, wherein the regulated strand, the control strand, or both have been designed to form a stem structure. The riboswitch can comprise two or more aptamer domains and an expression platform domain, wherein at least one of the aptamer domains and the expression platform domain are heterologous. The riboswitch can comprise two or more aptamer domains and an expression platform domain, wherein at least one of the aptamer domains comprises a Pl stem, wherein the Pl stem comprises an aptamer strand and a control strand, wherein the expression platform domain comprises a regulated strand, wherein the regulated strand, the control strand, or both have been designed to form a stem structure. Disclosed are riboswitches, wherein the riboswitch is a non-natural derivative of a naturally-occurring riboswitch. The riboswitch can comprise an aptamer domain and an expression platform domain, wherein the aptamer domain and the expression platform domain are heterologous. The riboswitch can be derived from a naturally-occurring guanine-responsive riboswitch, adenine-responsive riboswitch, lysine-responsive riboswitch, thiamine pyrophosphate-responsive riboswitch, adenosylcobalamin- responsive riboswitch, flavin mononucleotide-responsive riboswitch, glycine-responsive riboswitch, or a S-adenosylmethionine-responsive riboswitch. The riboswitch can be activated by a trigger molecule, wherein the riboswitch produces a signal when activated by the trigger molecule.
Table 5 discloses examples of various guanine riboswitches. The alignment of sequences in Table 5 is in Stockholm format as output by the cmalign program of the INFERNAL software package. The last two entries in Table 5 reflect the consensus sequence and structure for the motif as reflected by a computer algorithm described by Eddy, S. R. (2003). INFERNAL. Version 0.55. Distributed by the author. Dept. of
Genetics, Washington University School of Medicine. St. Louis, Missouri. Figure 41 of U.S. Application Publication No. 2005-0053951 describes consensus riboswitch sequences, natural riboswitch sequences, and riboswitch sequence alignment. Thermodynamic parameters of various nucleobases bound to adenine riboswitch RNA are shown in Table 7. Thermodynamic parameters of various nucleobases bound to guanine riboswitch RNA are shown in Table 8.
Numerous riboswitches and riboswitch constructs are described and referred to herein. It is specifically contemplated that any specific riboswitch or riboswitch construct or group of riboswitches or riboswitch constructs can be excluded from some aspects of the invention disclosed herein. For example, fusion of the xpt-pbuX riboswitch with a reporter gene could be excluded from a set of riboswitches fused to reporter genes. As another exmple any combination of the riboswitches listed in Table 5 can be specifically included or specifically excluded form any aspect of the disclosed methods and compositions. 1. Aptamer Domains
Aptamers are nucleic acid segments and structures that can bind selectively to particular compounds and classes of compounds. Riboswitches have aptamer domains that, upon binding of a trigger molecule result in a change in the state or structure of the riboswitch. In functional riboswitches, the state or structure of the expression platform domain linked to the aptamer domain changes when the trigger molecule binds to the aptamer domain. Aptamer domains of riboswitches can be derived from any source, including, for example, natural aptamer domains of riboswitches, artificial aptamers, engineered, selected, evolved or derived aptamers or aptamer domains. Aptamers in riboswitches generally have at least one portion that can interact, such as by forming a stem structure, with a portion of the linked expression platform domain. This stem structure will either form or be disrupted upon binding of the trigger molecule.
Consensus aptamer domains of a variety of natural riboswitches are shown in Figure 11 of U.S. Application Publication No. 2005-0053951 and elsewhere herein. These aptamer domains (including all of the direct variants embodied therein) can be used in riboswitches. The consensus sequences and structures indicate variations in sequence and structure. Aptamer domains that are within the indicated variations are referred to herein as direct variants. These aptamer domains can be modified to produce modified or variant aptamer domains. Conservative modifications include any change in base paired nucleotides such that the nucleotides in the pair remain complementary. Moderate modifications include changes in the length of stems or of loops (for which a length or length range is indicated) of less than or equal to 20% of the length range indicated. Loop and stem lengths are considered to be "indicated" where the consensus structure shows a stem or loop of a particular length or where a range of lengths is listed or depicted. Moderate modifications include changes in the length of stems or of loops (for which a length or length range is not indicated) of less than or equal to 40% of the length range indicated. Moderate modifications also include and functional variants of unspecified portions of the aptamer domain. Unspecified portions of the aptamer domains are indicated by solid lines in Figure 11 of U.S. Application Publication No. 2005-0053951.
The Pl stem and its constituent strands can be modified in adapting aptamer domains for use with expression platforms and RNA molecules. Such modifications, which can be extensive, are referred to herein as Pl modifications. Pl modifications include changes to the sequence and/or length of the Pl stem of an aptamer domain.
The aptamer domains shown in Figure 11 of U.S. Application Publication No. 2005-0053951 (including any direct variants) are particularly useful as initial sequences for producing derived aptamer domains via in vitro selection or in vitro evolution techniques.
Aptamer domains of the disclosed riboswitches can also be used for any other purpose, and in any other context, as aptamers. For example, aptamers can be used to control ribozymes, other molecular switches, and any RNA molecule where a change in structure can affect function of the RNA. 2. Expression Platform Domains
Expression platform domains are a part of riboswitches that affect expression of the RNA molecule that contains the riboswitch. Expression platform domains generally have at least one portion that can interact, such as by forming a stem structure, with a portion of the linked aptamer domain. This stem structure will either form or be disrupted upon binding of the trigger molecule. The stem structure generally either is, or prevents formation of, an expression regulatory structure. An expression regulatory structure is a structure that allows, prevents, enhances or inhibits expression of an RNA molecule containing the structure. Examples include Shine-Dalgarno sequences, initiation codons, transcription terminators, and stability and processing signals. B. Trigger Molecules
Trigger molecules are molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch. Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques). C. Compounds
Also disclosed are compounds, and compositions containing such compounds, that can activate, deactivate or block a riboswitch. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Compounds can be used to activate, deactivate or block a riboswitch. The trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch.
Compounds other than the trigger molecule generally can be used to deactivate or block a riboswitch. Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch. A riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch.
Also disclosed are compounds for altering expression of an RNA molecule, or of a gene encoding an RNA molecule, where the RNA molecule includes a riboswitch. This can be accomplished by bringing a compound into contact with the RNA molecule. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Thus, subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA. Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA. Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.
Also disclosed are compounds for regulating expression of an RNA molecule, or of a gene encoding an RNA molecule. Also disclosed are compounds for regulating expression of a naturally occurring gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene is essential for survival of a cell or organism that harbors it, activating, deactivating or blocking the riboswitch can in death, stasis or debilitation of the cell or organism. Also disclosed are compounds for regulating expression of an isolated, engineered or recombinant gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene encodes a desired expression product, activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product. If the gene encodes an inducer or repressor of gene expression or of another cellular process, activation, deactivation or blocking of the riboswitch can result in induction, repression, or de¬ repression of other, regulated genes or cellular processes. Many such secondary regulatory effects are known and can be adapted for use with riboswitches. An advantage of riboswitches as the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules.
Also disclosed are methods of identifying compounds that activate, deactivate or block a riboswitch. For example, compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner. For example, the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound. As another example, the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. As can be seen, assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.
Identification of compounds that block a riboswitch can be accomplished in any suitable manner. For example, an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation , of the riboswitch.
Compounds can also be identified using the atomic crystalline structure of a riboswitch. An example of such a crystalline atomic structure of a natural guanine- responsive riboswitch can be found in Figure 5. The atomic coordinates of the atomic structure are listed in Table 6. In Figure 5, the riboswitch is shown bound to hypoxanthine. In this example, the crystal structure at 1.95 A resolution of the purine- binding domain of the guanine riboswitch from the xpt-pbuX operon of B. subtilis bound to hypoxanthine, a prevalent metabolite in the bacterial purine salvage pathway is shown. This structure reveals a complex RNA fold involving several phylogenetically conserved nucleotides that create a binding pocket that almost completely envelops the ligand. Hypoxanthine functions to stabilize this structure and to promote the formation of a downstream transcriptional terminator element, thereby providing a mechanism for directly repressing gene expression in response to an increase in intracellular concentrations of metabolite.
Compounds can be identified using the crystalline structure of a riboswitch by, for example, modeling the atomic structure of the riboswitch with a test compound; and determining if the test compound interacts with the riboswitch. This can be done by using a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch. Compounds can also be identified by, for example, asessing the fit between the riboswtich and a compound known to bind the riboswitch (such as the trigger molecule), identify sites where the compound can be changed with little or no obvious adverse effects on binding of the compound, and incorporating one or more such alterations to produce a new compound. The method of identifying compounds that interact with a riboswitch can also involve production of the compounds so identified. Typically the method first utilizes a 3-dimensional structure of the riboswitch with a compound, also referred to as a "known compound" or "known target". Any of the trigger molecules and compounds disclosed herein can be used as such a known compound. The structure of the riboswitch can be determined using any known means, such as crystallography or solution NMR spectroscopy. That structure can also be obtained through computer molecular modeling simulation programs, such as AutoDock. The methods can involve determining the amount of binding, such as determining the binding energy, between a riboswitch, and a potential compound for that riboswitch. An active compound is a compound that has some activity against a riboswitch, such as inhibiting the riboswitch's activity or enhancing the riboswitch's activity, hi addition, the potential compound can be an analog, which has some structural relationship to a known compound for the molecule. Any of the trigger molecules, known compounds, and compounds disclosed herein can be used as the basis of or to derive a potential compound.
The identity or relationship of the structure, properties, interaction or binding parameters, and the like of the known compound and potential compound can be viewed in number of ways. For example, any of the measures or interaction parameters that can be measured or assessed using the structural model, and such measures and parameters obtained for a known compound and a potential compound can be compared. One can look at the identity between the entire known compound and the potential compound. One can also look at the identity between the potential compound, such as an analog, and the know compound only in the domain where the potential compound interacts with the riboswitch. One can also look at the identity between the potential compound and the known compound at the level of a sub-domain, such as only those moieties or atoms in the potential compound which are within 7 A, 6A, 5A5 4A , 3 A, or 2A of a moiety or atom which is in contact with the riboswitch in the known compound. Generally, the more specific the sub-domain the higher the identity will be between the moieties of the potential compound and the known compound. For example, there can be 30% or greater, 35% or greater, 40% or greater, 45% or greater, 50% or greater, 55% or greater, 60% or greater, 65% or greater, 70% or greater, 75% or greater, 80% or greater, 85% or greater, 90% or greater, 95% or greater identity between the known compound and potential compound as a whole, 50% or greater, 55% or greater, 60% or greater, 65% or greater, 70% or greater, 75% or greater, 80% or greater, 85% or greater, 90% or greater, 95% or greater identity between the binding domain of the known compound and the potential compound, and 70% or greater, 75% or greater, 80% or greater, 85% or greater, 90% or greater, 95% or greater identity between the moieties or atoms of the potential compound that correspond to the moieties or atoms of the known compound which are within 5 A of a moiety or atom which interacts with the riboswitch. Another sub-domain is a sub-domain of moieties or atoms which actually contact the riboswitch. In this case the identity can be, for example, greater than 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or higher.
Typically, the potential compounds exist in a family of potential compounds, i.e. a set of analogs, all of which have some structural relationship to the known compound for the riboswitch. A family consisting of any number of members can be screened. The maximum number of members in the family is only limited by the amount of computer power available to screen each member in a desired amount of time. The methods can involve at least one template structure of the riboswitch and a target, often this would be with a known target. It is not required that this structure be existent, as it can be generated, in some cases during the disclosed methods, using standard structure determination techniques. It is preferred that a real structure exist at the time the methods are employed.
The methods can also involve modeling the structure of the potential compound, using information from the structure of the known compound. This modeling can be performed in any way, and as described herein.
The conformation and position of the potential compound can be held fixed during the calculations; that is, it can be assumed that the riboswitch binds in exactly the same orientation to the potential compound as it does to a known compound. Then, a binding energy (or other property or parameter) can be determined between the riboswitch and the potential compound, and if the binding energy (or other property or parameter) meets certain criteria, then the potential compound can be designated as an actual compound, i.e. one that is likely to interact with the riboswitch. Although the following refers to the use of binding energy, it should be understood that any property or parameter involving the interaction or modeling of a compound and a riboswitch can be used. The criterion can be that the computed binding energy of the riboswitch with the potential compound is similar to, or more favorable than, the computed binding energy of the same riboswitch with a known compound. For example, an actual compound can be a compound where the computed binding energy as discussed herein is, for example, at least 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 100%, 101%, 102%, 103%, 104%, 105%, 106%, 107%, 108%, 109%, 110%, 120%, 130%, 140%, 150%, 200%, 250%, 300%, 350%, 400%, 450%, 500%, 600%, 700%, 800%, 900%, 1000%, or greater than that of the known compound binding energy. An actual compound can also be a compound which after ordering all potential compounds in terms of the strength of their binding energies, are the compounds which are in the top 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1% of computed binding strengths, of for example, a set of potential compounds where the set is at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 125, 150, 175, 200, 225, 250, 275, 300, 350, 400, 500, 700, or a 1000 potential compounds.
It is also understood that once a potential compound is identified, as disclosed herein, traditional testing and analysis can be performed, such as performing a biological assay using the riboswitch and the actual compound to further define the ability of the actual compound to interact with and/or modulate the riboswitch. The disclosed methods can include the step of assaying the activity of the riboswitch and compound, as well as performing, for example, combinatorial chemistry studies using libraries based on the riboswitch, for example.
Energy calculations can be based on, for example, molecular or quantum mechanics. Molecular mechanics approximates the energy of a system by summing a series of empirical functions representing components of the total energy like bond stretching, van der Waals forces, or electrostatic interactions. Quantum mechanics methods use various degrees of approximation to solve the Schrόdinger equation. These methods deal with electronic structure, allowing for the characterization of chemical reactions. Potential compounds of the riboswitch can be identified. This can be accomplished by selecting potential compounds with a given similarity to the known compound. For example, compounds in the same family as the known compound can be selected.
To prepare each riboswitch for calculation, atoms can be built in that were unresolved or absent from the crystal structures of the potential compound. This can be done, for example, using the PRODRG webserver http^/www.davapcl.bioch.dundee.ac.uk./programs/prodrg, or standard molecular modeling programs such as Insightll, Quanta (both at www.accekys.com), CNS (Brunger et al., Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D 54, 905-921 (1998)), or any other molecular modeling system capable of preparing the riboswitch structure.
The binding energy (or other property or parameter) of the potential compound and riboswitch can then be calculated. There are numerous means for carrying this out. For example, the sampling of sidechain positions and the computation of the binding thermodynamics can be accomplished using an empirical function that models the energy of the potential compound-molecule as a sum of electrostatic and van der Waals interactions between all pairs of atoms within the model. Any other computational method for scoring the binding energy of the potential compound with the riboswitch can be used (H. Gohlke, & G. Klebe. Approaches to the description and prediction of the binding affinity of small-molecule ligands to macromolecular receptors. Angew. Chem. Int. Ed. 41, 2644-4676 (2002)). Examples of such scoring methods include, but are not limited to, those implemented in programs such as AutoDock (G. M. Morris et al. Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J. Comput. Chem. 19, 1639-1662 (1998)), Gold (G. Jones et al. Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. J. MoI. Biol. 245, 43-53 (1995)), Chem-Score (M. D. Eldridge et al. J. Comput-Aided MoI. Des. 11, 425-445 (1997)) and Drug-Score (H. Gohlke et al. Knowledge-based scoring function to predict protein-ligand interactions. J MoI. Biol. 295, 337-356 (2000)).
Rotamer libraries are known to those of skill in the art and can be obtained from a variety of sources, including the internet. Rotamers are low energy side-chain conformations. The use of a library of rotamers allows for the modeling of a structure to try the most likely side-chain conformations, saving time and producing a structure that is more likely to be correct. The use of a library of rotamers can be restricted to those residues that are within a given region of the potential compound, for example, at the binding site, or within a specified distance of the compound. The latter distance can be set at any desired length, for example, the potential compound can be 2, 3, 4, 5, 6, 7, 8, or 9 A from any atom of the molecule.
Electrostatic interactions between every pair of atoms can be calculated, for example, using a Coulombic model with the formula:
Eeiec =332.08 qϊ q2 /er. where q] and q2 are partial atomic charges, r is the distance between them, and ε is the dielectric constant.
Partial atomic charges can be taken from existing parameter sets that have been developed to describe charge distributions in molecules. Example parameter sets include, but are not limited to, PARSE (D. A. Sitkoff et al. Accurate calculation of hydration free-energies using macroscopic solvent models. J. Phys. Chem. 98, 1978-1988 (1994)), CHARMM (MacKerell et al. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J. Phys. Chem. B 102, 3586-3616, 1998) and AMBER (W. D. Cornell et al. A 2nd generation force-field for the simulation of proteins, nucleic-acids, and organic-molecules. J. Am. Chem. Soc. 117. 5179-5195 (1995)). Partial charges for atoms can be assigned either by analogy with those of similar functional groups, or by empirical assignment methods such as that implemented in the PRODRG server (D. M. F. van Aalten et al. PRODRG, a program for generating molecular topologies and unique molecular descriptors from coordinates of small molecules. J. Comput.-Aided MoI. Design 10, 255-262 (1996)), or by the use of standard quantum mechanical calculation methods (for example, C. I. Bayly et al. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges - the RESP model. J. Phys. Chem. 91, 10269-10280, (1993)). The electrostatic interaction can also be calculated by more elaborate methodologies that incorporate electrostatic desolvation effects. These can include explicit solvent and implicit solvent models: in the former, water molecules are directly included in the calculations, whereas in the latter, the effects of water are described by a dielectric continuum approach. Specific examples of implicit solvent methods for calculating electrostatic interactions include but are not limited to: Poisson-Boltzmann based methods and Generalized Born methods (M. Feig & C. L. Brooks. Recent advances in the development and application of implicit solvent models in biomolecule simulations. Curr. Opin. Struct. Biol. 14, 217-224 (2004)). van der Waals and hydrophobic interactions between pairs of atoms (where both atoms are either sulfur or carbon) can be calculated using a simple Lennard- Jones formalism with the following equation:
Evdw = G{σatt 12/r12 - σatt 6/r6}. where G is an energy, r is the distance between the two atoms and σatt is the distance at which the energy of interaction is zero. van der Waals interactions between pairs of atoms (where one or both atoms are neither sulfur nor carbon) can be calculated using a simple repulsive energy term:
Evdw = G{<7rep 12/r12}. where G is an energy, r is the distance between the two atoms and σrep determines the distance at which the repulsive interaction is equal to G.
Hydrophobic interactions between atoms can also be calculated using a variety of other methods known to those skilled in the art. For example, the energetic contribution can be calculated as being proportional to the amount of solvent accessible surface area of the ligand and receptor that is buried when the complex is formed. Such contributions can be expressed in terms of interactions between pairs of atoms, such as in the method proposed by Street & Mayo (A. G. Street & S. L. Mayo. Pairwise calculation of protein solvent-accessible surface areas. Folding & Design 3, 253-258 (1998)). Any other implementation of a formalism for describing hydrophobic or van der Waals or other energetic contributions can be included in the calculations.
Binding energies can be calculated for each potential compound-riboswitch interaction. For example, Monte Carlo sampling can be conducted in the presence and absence of the riboswitch, and the average energy in each simulation calculated. A binding energy for the riboswitch with the potential compound can then be calculated as the difference between the two calculated average energies. The computed binding energy of a potential compound with the riboswitch can be compared with the computed binding energy of a known compound with the riboswitch to determine if the potential compound is likely to be an actual compound. These results can then be confirmed using experimental data, wherein the actual interaction between the riboswitch and compound can be measured. Examples of methods that can be used to determine an actual interaction between the riboswitch and the compound include but are not limited to: equilibrium dialysis measurements (wherein binding of a radioactive form of the compound to the riboswitch is detected), enzyme inhibition assays (wherein the activity of the riboswitch can be monitored in the presence and absence of the compound), and chemical shift perturbation measurements (wherein binding of the riboswitch to the potential compound is monitored by observing changes in NMR chemical shifts of atoms).
In the methods disclosed above, the riboswitch can be a guanine riboswitch, for example. This riboswitch can be selected, for example, from riboswitches in Table 5. After the atomic crystalline structure of the riboswitch has been modeled with a potential compound, further testing can be carried out to determine the actual interaction between the riboswitch and the compound. For example, multiple different approaches can be used to detect binding RNAs, including allosteric ribozyme assays using gel- based and chip-based detection methods, and in-line probing assays. High throughput testing can also be accomplished by using, for example, fluorescent detection methods. For example, the natural catalytic activity of a glucosarnine-6-phosphate sensing riboswitch that controls gene expression by activating RNA-cleaving ribozyme can be used. This ribozyme can be reconfigured to cleave separate substrate molecules with multiple turnover kinetics. Therefore, a fluorescent group held in proximity to a quenching group can be uncoupled (and therefore become more fluorescent) if a compound triggers ribozyme function. Second, molecular beacon technology can be employed. This creates a system that suppresses fluorescence if a compound prevents the beacon from docking to the riboswitch RNA. Either approach can be applied to any of the riboswitch classes by using RNA engineering strategies described herein. Also disclosed herein are analogs that interact with the guanine riboswitch disclosed herein. Examples of such analogs can be found in Figure 12B. Many of the 26 compounds synthesized and tested bind the guanine riboswitch with constants that are equal to or better than that of guanine (-5 nM). The fact that appendages with highly variable chemical composition exhibit function shows that numerous variations of these chemical scaffolds can be generated and tested for function in vitro and inside cells. Specifically, further modified versions of these compounds can have improved binding to the guanine riboswitch by making new contacts to other functional groups in the RNA structure. Furthermore, modulation of bioavailability, toxicity, and synthetic ease (among other characteristics) can be tunable by making modifications in these two regions of the scaffold, as the structural model for the riboswitch shows many modifications are possible at these sites.
High-throughput screening can also be used to reveal entirely new chemical scaffolds that also bind to riboswitch RNAs either with standard or non- standard modes of molecular recognition. Since riboswitches are the first major form of natural metabolite-binding RNAs to be discovered, there has been little effort made previously to create binding assays that can be adapted for high-throughput screening. Multiple different approaches can be used to detect metabolite binding RNAs, including allosteric ribozyme assays using gel-based and chip-based detection methods, and in-line probing assays. Also disclosed are compounds made by identifying a compound that activates, deactivates or blocks a riboswitch and manufacturing the identified compound. This can be accomplished by, for example, combining compound identification methods as disclosed elsewhere herein with methods for manufacturing the identified compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.
Also disclosed are compounds made by checking activation, deactivation or blocking of a riboswitch by a compound and manufacturing the checked compound.
This can be accomplished by, for example, combining compound activation, deactivation or blocking assessment methods as disclosed elsewhere herein with methods for manufacturing the checked compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound. Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch.
As used herein, the term "substituted" is contemplated to include all permissible substituents of organic compounds. In a broad aspect, the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, and aromatic and nonaromatic substituents of organic compounds. Illustrative substituents include, for example, those described below. The permissible substituents can be one or more and the same or different for appropriate organic compounds. For the purposes of this disclosure, the heteroatoms, such as nitrogen, can have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valences of the heteroatoms. This disclosure is not intended to be limited in any manner by the permissible substituents of organic compounds. Also, the terms "substitution" or "substituted with" include the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, e.g. , a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc.
"A1," "A2," "A3," and "A4" are used herein as generic symbols to represent various specific substituents. These symbols can be any substituent, not limited to those disclosed herein, and when they are defined to be certain substituents in one instance, they can, in another instance, be defined as some other substituents.
The term "alkyl" as used herein is a branched or unbranched saturated hydrocarbon group of 1 to 24 carbon atoms, such as methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, t-butyl, pentyl, hexyl, heptyl, octyl, nonyl, decyl, dodecyl, tetradecyl, hexadecyl, eicosyl, tetracosyl, and the like. The alkyl group can also be substituted or unsubstituted. The alkyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below. The term "lower alkyl" is an alkyl group with 6 or fewer carbon atoms, e.g., methyl, ethyl, propyl, isopropyl, butyl, sec-butyl, iso-butyl, tert-butyl, pentyl, hexyl, and the like.
Throughout the specification "alkyl" is generally used to refer to both unsubstituted alkyl groups and substituted alkyl groups; however, substituted alkyl groups are also specifically referred to herein by identifying the specific substituent(s) on the alkyl group. For example, the term "halogenated alkyl" specifically refers to an alkyl group that is substituted with one or more halide, e.g., fluorine, chlorine, bromine, or iodine. The term "alkoxyalkyl" specifically refers to an alkyl group that is substituted with one or more alkoxy groups, as described below. The term "alkylamino" specifically refers to an alkyl group that is substituted with one or more amino groups, as described below, and the like. When "alkyl" is used in one instance and a specific term such as "halogenated alkyl" is used in another, it is not meant to imply that the term "alkyl" does not also refer to specific terms such as "halogenated alkyl" and the like.
This practice is also used for other groups described herein. That is, while a term such as "cycloalkyl" refers to both unsubstituted and substituted cycloalkyl moieties, the substituted moieties can, in addition, be specifically identified herein; for example, a particular substituted cycloalkyl can be referred to as, e.g., an "alkylcycloalkyl." Similarly, a substituted alkoxy can be specifically referred to as, e.g., a "halogenated alkoxy," a particular substituted alkenyl can be, e.g., an "alkenylalcohol," and the like. Again, the practice of using a general term, such as "cycloalkyl," and a specific term, such as "alkylcycloalkyl," is not meant to imply that the general term does not also include the specific term.
The term "alkoxy" as used herein is an alkyl group bonded through a single, terminal ether linkage; that is, an "alkoxy" group can be defined as — OA1 where A2 is alkyl as defined above.
The term "alkenyl" as used herein is a hydrocarbon group of from 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon double bond. Asymmetric structures such as (A1A^C=C(A3A4) are intended to include both the E and Z isomers. This can be presumed in structural formulae herein wherein an asymmetric alkene is present, or it can be explicitly indicated by the bond symbol C=C. The alkenyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below. The term "alkynyl" as used herein is a hydrocarbon group of 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon triple bond. The alkynyl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol, as described below.
The term "aryl" as used herein is a group that contains any carbon-based aromatic group including, but not limited to, benzene, naphthalene, phenyl, biphenyl, phenoxybenzene, and the like. The term "aryl" also includes "heteroaryl," which is defined as a group that contains an aromatic group that has at least one heteroatom incorporated within the ring of the aromatic group. Examples of heteroatoms include, but are not limited to, nitrogen, oxygen, sulfur, and phosphorus. Likewise, the term "non-heteroaryl," which is also included in the term "aryl," defines a group that contains an aromatic group that does not contain a heteroatom. The aryl group can be substituted or unsubstituted. The aryl group can be substituted with one or more groups including, but not limited to, alkyl, halogenated alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein. The term "biaryl" is a specific type of aryl group and is included in the definition of aryl. Biaryl refers to two aryl groups that are bound together via a fused ring structure, as in naphthalene, or are attached via one or more carbon-carbon bonds, as in biphenyl.
The term "cycloalkyl" as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms. Examples of cycloalkyl groups include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, etc. The term
"heterocycloalkyl" is a cycloalkyl group as defined above where at least one of the carbon atoms of the ring is substituted with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus. The cycloalkyl group and heterocycloalkyl group can be substituted or unsubstituted. The cycloalkyl group and heterocycloalkyl group can be substituted with one or more groups including, but not limited to, alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein.
The term "cycloalkenyl" as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms and containing at least one double bound, i.e., C=C. Examples of cycloalkenyl groups include, but are not limited to, cyclopropenyl, cyclobutenyl, cyclopentenyl, cyclopentadienyl, cyclohexenyl, cyclohexadienyl, and the like. The term "heterocycloalkenyl" is a type of cycloalkenyl group as defined above, and is included within the meaning of the term "cycloalkenyl," where at least one of the carbon atoms of the ring is substituted with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus. The cycloalkenyl group and heterocycloalkenyl group can be substituted or unsubstituted. The cycloalkenyl group and heterocycloalkenyl group can be substituted with one or more groups including, but not limited to, alkyl, alkoxy, alkenyl, alkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, sulfo-oxo, sulfonyl, sulfone, sulfoxide, or thiol as described herein.
The term "cyclic group" is used herein to refer to either aryl groups, non-aryl groups (i.e., cycloalkyl, heterocycloalkyl, cycloalkenyl, and heterocycloalkenyl groups), or both. Cyclic groups have one or more ring systems that can be substituted or unsubstituted. A cyclic group can contain one or more aryl groups, one or more non-aryl groups, or one or more aryl groups and one or more non-aryl groups.
The term "aldehyde" as used herein is represented by the formula — C(O)H. Throughout this specification "C(O)" is a short hand notation for C=O.
The terms "amine" or "amino" as used herein are represented by the formula NA1A2A3, where A1, A2, and A3 can be, independently, hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above. The term "carboxylic acid" as used herein is represented by the formula
— C(O)OH. A "carboxylate" as used herein is represented by the formula — C(O)O".
The term "ester" as used herein is represented by the formula — OC(O)A1 or
— C(O)OA1, where A1 can be an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
The term "ether" as used herein is represented by the formula A1OA2, where A1 and A2 can be, independently, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
The term "ketone" as used herein is represented by the formula A1C(O)A2, where A1 and A2 can be, independently, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above.
The term "halide" as used herein refers to the halogens fluorine, chlorine, bromine, and iodine. The term "hydroxy." as used herein is represented by the formula — OH.
The term "sulfo-oxo" as used herein is represented by the formulas — S(O)A1 (i.e., "sulfonyl"), A1S(O)A2 (i.e., "sulfoxide"), -S(O)2A1, A1SO2A2 (i.e., "sulfone"),
— OS(O)2A1, or — OS(O)2OA1, where A1 and A2 can be hydrogen, an alkyl, halogenated alkyl, alkenyl, alkynyl, aryl, heteroaryl, cycloalkyl, cycloalkenyl, heterocycloalkyl, or heterocycloalkenyl group described above. Throughout this specification "S(O)" is a short hand notation for S=O.
The term "sulfonylamino" or "sulfonamide" as used herein is represented by the formula -S(O)2NH-.
The term "thiol" as used herein is represented by the formula — SH. : As used herein, "Rn" where n is some integer can independently possess one or more of the groups listed above. For example, if R10 contains an aryl group, one of the hydrogen atoms of the aryl group can optionally be substituted with a hydroxyl group, an alkoxy group, an amine group, an alkyl group, a halide, and the like. Depending upon the groups that are selected, a first group can be incorporated within second group or, alternatively, the first group can be pendant (i.e., attached) to the second group. For example, with the phrase "an alkyl group comprising an amino group," the amino group can be incorporated within the backbone of the alkyl group. Alternatively, the amino group can be attached to the backbone of the alkyl group. The nature of the group(s) that is (are) selected will determine if the first group is embedded or attached to the second group.
Unless stated to the contrary, a formula with chemical bonds shown only as solid lines and not as wedges or dashed lines contemplates each possible isomer, e.g., each enantiomer and diastereomer, and a mixture of isomers, such as a racemic or scalemic mixture. Certain materials, compounds, compositions, and components disclosed herein can be obtained commercially or readily synthesized using techniques generally known to those of skill in the art. For example, the starting materials and reagents used in preparing the disclosed compounds and compositions are either available from commercial suppliers such as Aldrich Chemical Co., (Milwaukee, Wis.), Acros Organics (Morris Plains, NJ.), Fisher Scientific (Pittsburgh, Pa.), or Sigma (St. Louis, Mo.) or are prepared by methods known to those skilled in the art following procedures set forth in references such as Fieser and Fieser's Reagents for Organic Synthesis, Volumes 1-17 (John Wiley and Sons, 1991); Rodd's Chemistry of Carbon Compounds, Volumes 1-5 and Supplementals (Elsevier Science Publishers, 1989); Organic Reactions, Volumes 1- 40 (John Wiley and Sons, 1991); March's Advanced Organic Chemistry, (John Wiley and Sons, 4th Edition); and Larock's Comprehensive Organic Transformations (VCH Publishers Inc., 1989). Compounds useful with guanine-responsive ribos witches (and riboswitches derived from guanine-responsive riboswitches) include compounds represented by Formula I
Figure imgf000048_0001
I where the compound can bind a guanine-responsive riboswitch or derivative thereof, where, when the compound is bound to a guanine-responsive riboswitch or derivative, R1 and R2 serve as a hydrogen bond donor, R7 serves as a hydrogen bond acceptor, R9 serves as a hydrogen bond donor, R10 serves as a hydrogen bond acceptor, and where ===z — each independently represent a single or double bond, hi some other examples, R3 can be a hydrogen bond acceptor. It is to be understood that while a particular moiety or group can be referred to herein as a hydrogen bond donor or acceptor, this terminology is used to merely categorize the various substituents for ease of reference. Such language should not be interpreted to mean that a particular moiety actually participates in hydrogen bonding with the riboswitch or some other compound. It is possible that, for example, a moiety referred to herein as a hydrogen bond acceptor (or donor) could solely or additionally be involved in hydrophobic, ionic, van de Waals, or other type of interaction with the riboswitch or other compound. It is also understood that certain groups disclosed herein can be referred to herein as both a hydrogen bond acceptor and a hydrogen bond donor. For example, -OH can be a hydrogen bond donor by donating the hydrogen atom; -OH can also be a hydrogen bond acceptor through one or more of the nonbonded electron pairs on the oxygen atom. Thus, throughout the specification various moieties can be a hydrogen bond donor and acceptor and can be referred to as such.
Suitable hydrogen bond donors that can be present in the disclosed compounds, for example as one or more of R1, R2, and R9, are moieties that contain a polar hydrogen bond, such as when a hydrogen atom is bonded to a more electronegative atom like C, N, O, or S. Examples of suitable hydrogen bond donors for R1, R2, and R9 include, but are not limited to, the following moieties: -NR11-, -CHR11-, =CRπ-, and -CC=NR11V, where R11 is -H, -NH2, -OH, -SH, -CO2H, substituted or unsubstituted alkyl, alkoxy, aryl, aryloxy, or benzyloxy, -NHalkyl, -NHalkoxy, -NHC(O)alkyl, -NHCO2alkyl, - NHC(O)NH2, -NH-NH2, -NH-NHalkyl, -NH-NHalkoxy, -NH-SO2alkyl, -NH-SO2-R12,- NHCO2CH2-R12, -NH-OR12, -N+H2-R12, -NH-NH-R12, and -NH-NH-CH2-R12, where R12 can be:
Figure imgf000049_0001
where n is from 1 to 5, and R13 can be one or more of -H, -NH2, -OH, alkoxy, -JV- morpholino, or halide. Suitable hydrogen bond acceptors that can be present in the disclosed compounds, for example as one or more of R3, R7, and R10, are moieties that contain a nonbonded electron pair. Nonbonded electron pairs typically exist on N, O, S, and halogen atoms. Examples of suitable hydrogen bond acceptors for R , R include, but are not limited to, N, O, S, and SO2. For example, the groups R8 and R7 taken together can be represented as -CH=N-, -CH2-O-, -CH2-S-, or -CH2SO2-. R3 can be N, O, or S, which when referring to Formula I results in the moieties -N=R2, -0-R2, and -S-R2, respectively, where R is as defined above.
For the moiety R10, a suitable hydrogen bond acceptor can be O or S, as in O=R6 or S=R6, where R6 is C. Still further examples of hydrogen bond acceptors for R10 include, but are not limited to, -OH, -SH, -NH2, -CO2H, -alkoxy, -aryloxy, -benzyloxy, - halide, -NHalkyl, -NHalkoxy, -NHC(O)alkyl, -NHCO2alkyl, -NHCO2CH2-R12, - NHC(O)NH2, -NH-NH2, -NH-NHalkyl, -NH-NHalkoxy, -S02alkyl, -S02aryl, -NH- S02alkyl, -NH-SO2-R12, -NH-OR12, -NH-R12, -NH-NH-R12, -NH-NH-CH2-R12, or -NH- CH2-R12, where R12 is as defined above. Yet another example of a suitable hydrogen bond acceptor for R10 is NR1 , as in
R14N=R6, where R6 is C. hi these examples, R14 can be -H, -NH2, -OH, -SH, -CO2H, - C02alkyl, -C02aryl, -C(O)NH2, substituted or unsubstituted alkyl, alkoxy, alkoxy, aryloxy, or benzyloxy, -NHalkyl, -NHalkoxy, -NHC(O)alkyl, -NHCO2alkyl, - NHC(O)NH2, -SO2alkyl, -S02aryl, -NH-SO2alkyl, -NH-SO2-R12, -NH-OR12, -NH-R12, or -NH-CH2-R12, where R12 is as defined above.
Some specific compounds disclosed herein can be represented by Formula II:
Figure imgf000050_0001
II where R7 is N or CH;
R10 is =0, =S, =NH, =N0H, =Nalkyl, =Nalkoxyl, =N-aryl, =Naryloxy, =N- benzyl, -Nbenzyloxy, =N-NH2, =N-NH0H, =N-NHalkyl, =N-NHalkoxy, =N-NHaryl, =N-NHaryloxy, =N-NHbenzyl, =N-NHbenzyloxy, =N-NH-(p-amino-phenyl), =N-NH- (p-methoxyphenyl), =N-NH-(p-N-morpholino-phenyl); and
R2 is =CR15-, where R15 is -NH2, -NHNH2, -NHOH, -NHalkyl, -NHalkoxy, - NHaryl, -NHaryloxy, -NHbenzyl, -NHbenzyloxy, -N+H2BTyI, -N4H2-(p-iV-moφholino- phenyl),
Figure imgf000050_0002
-N+H2-(p-methoxyρhenyl), -NHCO2alkyl, - NHCO2benzyl, -NHNHalkyl, -NHNHaryl, -NHNHbenzyl, or -NHC(O)alkyl. Additional compounds can be represented by Formula III:
Figure imgf000051_0001
III where R10 is -H, -OH, -SH, -alkoxy, halide, -NH2, -NHOH, -NHalkyl, -NHalkoxy, - NHaryl, -NHaryloxy, -NHbenzyl, -NHbenzyoxy, -NHC(O)alkyl, -NHCO2alkyl, NHCO2benzyl, -NHNH2, -NHNHalkyl, -NHNHaryl, or -NHNHbenzyl; and R2 and R7 are as defined above.
Further specific examples of compounds are illustrated in Figure 12B.
Additional compounds useful with guanine-responsive riboswitches (and riboswitches derived from guanine-responsive riboswitches) include compounds having the formula
Figure imgf000051_0002
where the compound can bind a guanine-responsive riboswitch or derivative thereof, where, when the compound is bound to a guanine-responsive riboswitch or derivative, R7 serves as a hydrogen bond acceptor, R10 serves as a hydrogen bond donor, R11 serves as a hydrogen bond acceptor, R12 serves as a hydrogen bond donor, where R13 is H, H2 or is not present, where R1, R2, R3, R4, R5, R6, R8, and R9 are each independently
C, N, O, or S5 and where each independently represent a single or double bond.
Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not guanine, hypoxanthine, xanthine, or N2-methylguanme. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a guanine-responsive riboswitch. It should be understood that particular contacts and interactions (such as hydrogen bond donation or acceptance) described herein for compounds interacting with riboswitches are preferred but are not essential for interaction of a compound with a riboswitch. For example, compounds can interact with riboswitches with less affinity and/or specificity than compounds having the disclosed contacts and interactions. Further, different or additional functional groups on the compounds can introduce new, different and/or compensating contacts with the riboswitches. For example, for guanine riboswitches, large functional groups can be used at, for example, R and R . Such functional groups can have, and can be designed to have, contacts and interactions with other part of the riboswitch. Such contacts and interactions can compensate for contacts and interactions of the trigger molecules and core structure.
Compounds useful with adenine-responsive riboswitches (and riboswitches derived from adenine-responsive riboswitches) include compounds having the formula
Figure imgf000052_0001
where the compound can bind an adenine-responsive riboswitch or derivative thereof, where, when the compound is bound to an adenine-responsive riboswitch or derivative, R1, R3 and R7 serve as hydrogen bond acceptors, and R10 and R11 serve as hydrogen bond donors, where R12 is H, H2 or is not present, where R1, R2, R3, R4, R5, R6,
R8, and R9 are each independently C, N, O, or S, and where each independently represent a single or double bond. Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not adenine, 2,6-diaminopurine, or 2-amino purine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate an adenine-responsive riboswitch.
Compounds useful with lysine-responsive riboswitches (and riboswitches derived from lysine-responsive riboswitches) include compounds having the formula
Figure imgf000053_0001
where the compound can bind a lysine-responsive riboswitch or derivative thereof, where R2 and R3 are each positively charged, where R1 is negatively charged, where R4 is C, N, O, or S, and where each independently represent a single or double bond.
Also contemplated are compounds as defined above where R2 and R3 are each NH3 + and where R1 is O".
Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not lysine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a lysine-responsive ribo switch.
Compounds useful with TPP-responsive riboswitches (and riboswitches derived from lysine-responsive riboswitches) include compounds having the formula
Figure imgf000054_0001
where the compound can bind a TPP-responsive riboswitch or derivative thereof, where R1 is positively charged, where R2 and R3 are each independently C, O, or S, where R4 is CH3, NH2, OH, SH, H or not present, where R5 is CH3, NH2, OH, SH, or H, where R6 is C or N, and where each independently represent a single or double bond. Also contemplated are compounds as defined above where R1 is phosphate, diphosphate or triphosphate.
Every compound within the above definition is intended to be and should be considered to be specifically disclosed herein. Further, every subgroup that can be identified within the above definition is intended to be and should be considered to be specifically disclosed herein. As a result, it is specifically contemplated that any compound, or subgroup of compounds can be either specifically included for or excluded from use or included in or excluded from a list of compounds. For example, as one option, a group of compounds is contemplated where each compound is as defined above but is not TPP, TP or thiamine. As another example, a group of compounds is contemplated where each compound is as defined above and is able to activate a TPP- responsive riboswitch. D. Constructs, Vectors and Expression Systems
The disclosed riboswitches can be used in with any suitable expression system. Recombinant expression is usefully accomplished using a vector, such as a plasmid. The vector can include a promoter operably linked to riboswitch-encoding sequence and
RNA to be expression (e.g., RNA encoding a protein). The vector can also include other elements required for transcription and translation. As used herein, vector refers to any carrier containing exogenous DNA. Thus, vectors are agents that transport the exogenous nucleic acid into a cell without degradation and include a promoter yielding expression of the nucleic acid in the cells into which it is delivered. Vectors include but are not limited to plasmids, viral nucleic acids, viruses, phage nucleic acids, phages, cosmids, and artificial chromosomes. A variety of prokaryotic and eukaryotic expression vectors suitable for carrying riboswitch-regulated constructs can be produced. Such expression vectors include, for example, pET, pET3d, pCR2.1, pBAD, pUC, and yeast vectors. The vectors can be used, for example, in a variety of in vivo and in vitro situation.
Viral vectors include adenovirus, adeno-associated virus, herpes virus, vaccinia virus, polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HTV backbone. Also useful are any viral families which share the properties of these viruses which make them suitable for use as vectors. Retroviral vectors, which are described in Verma (1985), include Murine Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of MMLV as a vector. Typically, viral vectors contain, nonstructural early genes, structural late genes, an RNA polymerase III transcript, inverted terminal repeats necessary for replication and encapsidation, and promoters to control the transcription and replication of the viral genome. When engineered as vectors, viruses typically have one or more of the early genes removed and a gene or gene/promoter cassette is inserted into the viral genome in place of the removed viral DNA. A "promoter" is generally a sequence or sequences of DNA that function when in a relatively fixed location in regard to the transcription start site. A "promoter" contains core elements required for basic interaction of RNA polymerase and transcription factors and can contain upstream elements and response elements.
"Enhancer" generally refers to a sequence of DNA that functions at no fixed distance from the transcription start site and can be either 5' (Laimins, 1981) or 3'
(Lusky et al., 1983) to the transcription unit. Furthermore, enhancers can be within an nitron (Banerji et al., 1983) as well as within the coding sequence itself (Osborne et al., 1984). They are usually between 10 and 300 bp in length, and they function in cis. Enhancers function to increase transcription from nearby promoters. Enhancers, like promoters, also often contain response elements that mediate the regulation of transcription. Enhancers often determine the regulation of expression.
Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human or nucleated cells) can also contain sequences necessary for the termination of transcription which can affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor protein. The 3' untranslated regions also include transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation region. One benefit of this region is that it increases the likelihood that the transcribed unit will be processed and transported like mRNA. The identification and use of polyadenylation signals in expression constructs is well established. It is preferred that homologous polyadenylation signals be used in the transgene constructs. The vector can include nucleic acid sequence encoding a marker product. This marker product is used to determine if the gene has been delivered to the cell and once delivered is being expressed. Preferred marker genes are the E. CoIi lacZ gene which encodes /3-galactosidase and green fluorescent protein. hi some embodiments the marker can be a selectable marker. When such selectable markers are successfully transferred into a host cell, the transformed host cell can survive if placed under selective pressure. There are two widely used distinct categories of selective regimes. The first category is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to grow independent of a supplemented media. The second category is dominant selection which refers to a selection scheme used in any cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells which have a novel gene would express a protein conveying drug resistance and would survive the selection. Examples of such dominant selection use the drugs neomycin, (Southern and Berg, 1982), mycophenolic acid, (Mulligan and Berg, 1980) or hygromycin (Sugden et al., 1985). Gene transfer can be obtained using direct transfer of genetic material, in but not limited to, plasmids, viral vectors, viral nucleic acids, phage nucleic acids, phages, cosmids, and artificial chromosomes, or via transfer of genetic material in cells or carriers such as cationic liposomes. Such methods are well known in the art and readily adaptable for use in the method described herein. Transfer vectors can be any nucleotide construction used to deliver genes into cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part of recombinant retrovirus or adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)). Appropriate means for transfection, including viral vectors, chemical transfectants, or physico-mechanical methods such as electroporation and direct diffusion of DNA, are described by, for example, Wolff, J. A., et al., Science, 247, 1465-1468, (1990); and Wolff, J. A. Nature, 352, 815-818, (1991). 1. Viral Vectors
Preferred viral vectors are Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HIV backbone. Also preferred are any viral families which share the properties of these viruses which make them suitable for use as vectors. Preferred retroviruses include Murine Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of MMLV as a vector. Retroviral vectors are able to carry a larger genetic payload, i.e., a transgene or marker gene, than other viral vectors, and for this reason are a commonly used vector. However, they are not useful in non-proliferating cells. Adenovirus vectors are relatively stable and easy to work with, have high titers, and can be delivered in aerosol formulation, and can transfect non-dividing cells. Pox viral vectors are large and have several sites for inserting genes, they are thermostable and can be stored at room temperature. A preferred embodiment is a viral vector which has been engineered so as to suppress the immune response of the host organism, elicited by the viral antigens. Preferred vectors of this type will carry coding regions for Ltiterleukin 8 or 10.
Viral vectors have higher transaction (ability to introduce genes) abilities than do most chemical or physical methods to introduce genes into cells. Typically, viral vectors contain, nonstructural early genes, structural late genes, an RNA polymerase III transcript, inverted terminal repeats necessary for replication and encapsidation, and promoters to control the transcription and replication of the viral genome. When engineered as vectors, viruses typically have one or more of the early genes removed and a gene or gene/promoter cassette is inserted into the viral genome in place of the removed viral DNA. Constructs of this type can carry up to about 8 kb of foreign genetic material. The necessary functions of the removed early genes are typically supplied by cell lines which have been engineered to express the gene products of the early genes in trans. i. Retroviral Vectors
A retrovirus is an animal virus belonging to the virus family of Retro viridae, including any types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are described by Verma, I.M., Retroviral vectors for gene transfer. In Microbiology- 1985, American Society for Microbiology, pp. 229-232, Washington, (1985), which is incorporated by reference herein. Examples of methods for using retroviral vectors for gene therapy are described in U.S. Patent Nos. 4,868,116 and 4,980,286; PCT applications WO 90/02806 and WO 89/07136; and Mulligan, (Science 260:926-932 (1993)); the teachings of which are incorporated herein by reference.
A retrovirus is essentially a package which has packed into it nucleic acid cargo. The nucleic acid cargo carries with it a packaging signal, which ensures that the replicated daughter molecules will be efficiently packaged within the package coat. In addition to the package signal, there are a number of molecules which are needed in cis, for the replication, and packaging of the replicated virus. Typically a retroviral genome, contains the gag, pol, and env genes which are involved in the making of the protein coat. It is the gag, pol, and env genes which are typically replaced by the foreign DNA that it is to be transferred to the target cell. Retrovirus vectors typically contain a packaging signal for incorporation into the package coat, a sequence which signals the start of the gag transcription unit, elements necessary for reverse transcription, including a primer binding site to bind the tRNA primer of reverse transcription, terminal repeat sequences that guide the switch of RNA strands during DNA synthesis, a purine rich sequence 5' to the 3' LTR that serve as the priming site for the synthesis of the second strand of DNA synthesis, and specific sequences near the ends of the LTRs that enable the insertion of the DNA state of the retrovirus to insert into the host genome. The removal of the gag, pol, and env genes allows for about 8 kb of foreign sequence to be inserted into the viral genome, become reverse transcribed , and upon replication be packaged into a new retroviral particle. This amount of nucleic acid is sufficient for the delivery of a one to many genes depending on the size of each transcript. It is preferable to include either positive or negative selectable markers along with other genes in the insert.
Since the replication machinery and packaging proteins in most retroviral vectors have been removed (gag, pol, and env), the vectors are typically generated by placing them into a packaging cell line. A packaging cell line is a cell line which has been transfected or transformed with a retrovirus that contains the replication and packaging machinery, but lacks any packaging signal. When the vector carrying the DNA of choice is transfected into these cell lines, the vector containing the gene of interest is replicated and packaged into new retroviral particles, by the machinery provided in cis by the helper cell. The genomes for the machinery are not packaged because they lack the necessary signals. ii. Adenoviral Vectors
The construction of replication-defective adenoviruses has been described (Berkner et al., J. Virology 61:1213-1220 (1987); Massie et al., MoI. Cell. Biol. 6:2872- 2883 (1986); Haj-Ahmad et al., J. Virology 51:261-21 A (1986); Davidson et al., J. Virology 61:1226-1239 (1987); Zhang "Generation and identification of recombinant adenovirus by liposome-mediated transfection and PCR analysis" BioTechniques 15:868-872 (1993)). The benefit of the use of these viruses as vectors is that they are limited in the extent to which they can spread to other cell types, since they can replicate within an initial infected cell, but are unable to form new infectious viral particles. Recombinant adenoviruses have been shown to achieve high efficiency gene transfer after direct, in vivo delivery to airway epithelium, hepatocytes, vascular endothelium, CNS parenchyma and a number of other tissue sites (Morsy, J. Clin. Invest. 92:1580- 1586 (1993);.Kirshenbaum, J. Clin. Invest. 92:381-387 (1993); Roessler, J. Clin. Invest. 92:1085-1092 (1993); Moullier, Nature Genetics 4:154-159 (1993); La Salle, Science 259:988-990 (1993); Gomez-Foix, J. Biol. Chem. 267:25129-25134 (1992); Rich, Human Gene Therapy 4:461-476 (1993); Zabner, Nature Genetics 6:75-83 (1994); Guzman, Circulation Research 73:1201-1207 (1993); Bout, Human Gene Therapy 5:3- 10 (1994); Zabner, Cell 75:207-216 (1993); Caillaud, Eur. J. Neuroscience 5:1287-1291 (1993); and Ragot, J. Gen. Virology 74:501-507 (1993)). Recombinant adenoviruses achieve gene transduction by binding to specific cell surface receptors, after which the virus is internalized by receptor-mediated endocytosis, in the same manner as wild type or replication-defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970); Brown and Burlingham, J. Virology 12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449 (1985); Seth, et al., J. Virol. 51:650-655 (1984); Seth, et al., MoI. Cell. Biol. 4:1528-1533 (1984); Varga et al., J. Virology 65:6061-6070 (1991); Wickham et al., Cell 73:309-319 (1993)).
A preferred viral vector is one based on an adenovirus which has had the El gene removed and these virons are generated in a cell line such as the human 293 cell line. In another preferred embodiment both the El and E3 genes are removed from the adenovirus genome. Another type of viral vector is based on an adeno-associated virus (AAV). This defective parvovirus is a preferred vector because it can infect many cell types and is nonpathogenic to humans. AAV type vectors can transport about 4 to 5 kb and wild type AAV is known to stably insert into chromosome 19. Vectors which contain this site specific integration property are preferred. An especially preferred embodiment of this type of vector is the P4.1 C vector produced by Avigen, San Francisco, CA, which can contain the herpes simplex virus thymidine kinase gene, HSV-tk, and/or a marker gene, such as the gene encoding the green fluorescent protein, GFP.
The inserted genes in viral and retroviral usually contain promoters, and/or enhancers to help control the expression of the desired gene product. A promoter is generally a sequence or sequences of DNA that function when in a relatively fixed location in regard to the transcription start site. A promoter contains core elements required for basic interaction of RNA polymerase and transcription factors, and can contain upstream elements and response elements. 2. Viral Promoters and Enhancers
Preferred promoters controlling transcription from vectors in mammalian host cells can be obtained from various sources, for example, the genomes of viruses such as: polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most preferably cytomegalovirus, or from heterologous mammalian promoters, e.g. beta actin promoter. The early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction fragment which also contains the SV40 viral origin of replication (Fiers et al., Nature, 273: 113 (1978)). The immediate early promoter of the human cytomegalovirus is conveniently obtained as a HindIII E restriction fragment (Greenway, PJ. et al., Gene 18: 355-360 (1982)). Of course, promoters from the host cell or related species also are useful herein.
Enhancer generally refers to a sequence of DNA that functions at no fixed distance from the transcription start site and can be either 5' (Laimins, L. et al., Proc. Natl. Acad. Sci. 78: 993 (1981)) or 3' (Lusky, M.L., et al., Mol. Cell Bio. 3: 1108 (1983)) to the transcription unit. Furthermore, enhancers can be within an intron (Banerji, J.L. et al., Cell 33: 729 (1983)) as well as within the coding sequence itself
(Osborne, T.F., et al., MoI. Cell Bio. 4: 1293 (1984)). They are usually between 10 and 300 bp in length, and they function in cis. Enhancers function to increase transcription from nearby promoters. Enhancers also often contain response elements that mediate the regulation of transcription. Promoters can also contain response elements that mediate the regulation of transcription. Enhancers often determine the regulation of expression of a gene. While many enhancer sequences are now known from mammalian genes (globin, elastase, albumin, α-fetoprotein and insulin), typically one will use an enhancer from a eukaryotic cell virus. Preferred examples are the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
The promoter and/or enhancer can be specifically activated either by light or specific chemical events which trigger their function. Systems can be regulated by reagents such as tetracycline and dexamethasone. There are also ways to enhance viral vector gene expression by exposure to irradiation, such as gamma irradiation, or alkylating chemotherapy drugs.
It is preferred that the promoter and/or enhancer region be active in all eukaryotic cell types. A preferred promoter of this type is the CMV promoter (650 bases). Other preferred promoters are S V40 promoters, cytomegalovirus (full length promoter), and retroviral vector LTF.
It has been shown that all specific regulatory elements can be cloned and used to construct expression vectors that are selectively expressed in specific cell types such as melanoma cells. The glial fibrillary acetic protein (GFAP) promoter has been used to selectively express genes in cells of glial origin.
Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human or nucleated cells) can also contain sequences necessary for the termination of transcription which can affect rnRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor protein. The 3' untranslated regions also include transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation region. One benefit of this region is that it increases the likelihood that the transcribed unit will be processed and transported like mRNA. The identification and use of polyadenylation signals in expression constructs is well established. It is preferred that homologous polyadenylation signals be used in the transgene constructs. In a preferred embodiment of the transcription unit, the polyadenylation region is derived from the SV40 early polyadenylation signal and consists of about 400 bases. It is also preferred that the transcribed units contain other standard sequences alone or in combination with the above sequences improve expression from, or stability of, the construct.
3. Markers The vectors can include nucleic acid sequence encoding a marker product. This marker product is used to determine if the gene has been delivered to the cell and once delivered is being expressed. Preferred marker genes are the E. CoIi lacZ gene which encodes β-galactosidase and green fluorescent protein.
In some embodiments the marker can be a selectable marker. Examples of suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR), thymidine kinase, neomycin, neomycin analog G418, hydromycin, and puromycin.
When such selectable markers are successfully transferred into a mammalian host cell, the transformed mammalian host cell can survive if placed under selective pressure.
There are two widely used distinct categories of selective regimes. The first category is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to grow independent of a supplemented media. Two examples are: CHO DHFR" cells and mouse LTK" cells. These cells lack the ability to grow without the addition of such nutrients as thymidine or hypoxanthine. Because these cells lack certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing nucleotides are provided in a supplemented media. An alternative to supplementing the media is to introduce an intact DHFR or TK gene into cells lacking the respective genes, thus altering their growth requirements. Individual cells which were not transformed with the DHFR or TK gene will not be capable of survival in non-supplemented media. The second category is dominant selection which refers to a selection scheme used in any cell type and does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host cell. Those cells which would express a protein conveying drug resistance and would survive the selection. Examples of such dominant selection use the drugs neomycin, (Southern P. and Berg, P., J. Molec. Appl. Genet. 1: 327 (1982)), mycophenolic acid, (Mulligan, R.C. and Berg, P. Science 209: 1422 (1980)) or hygromycin, (Sugden, B. et al., MoI. Cell. Biol. 5: 410-413 (1985)).
The three examples employ bacterial genes under eukaryotic control to convey resistance to the appropriate drug G418 or neomycin (geneticin), xgpt (mycophenolic acid) or hygromycin, respectively. Others include the neomycin analog G418 and puramycin.
E. Biosensor Riboswitches
Also disclosed are biosensor riboswitches. Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA. An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
F. Reporter Proteins and Peptides
For assessing activation of a riboswitch, or for biosensor riboswitches, a reporter protein or peptide can be used. The reporter protein or peptide can be encoded by the RNA the expression of which is regulated by the riboswitch. The examples describe the use of some specific reporter proteins. The use of reporter proteins and peptides is well known and can be adapted easily for use with riboswitches. The reporter proteins can be any protein or peptide that can be detected or that produces a detectable signal. Preferably, the presence of the protein or peptide can be detected using standard techniques (e.g., radioimmunoassay, radio-labeling, immunoassay, assay for enzymatic activity, absorbance, fluorescence, luminescence, and Western blot). More preferably, the level of the reporter protein is easily quantifiable using standard techniques even at low levels. Useful reporter proteins include luciferases, green fluorescent proteins and their derivatives, such as firefly luciferase (FL) from Photinus pyralis, and Renilla luciferase (RL) from Renilla reniformis. G. Conformation Dependent Labels
Conformation dependent labels refer to all labels that produce a change in fluorescence intensity or wavelength based on a change in the form or conformation of the molecule or compound (such as a riboswitch) with which the label is associated. Examples of conformation dependent labels used in the context of probes and primers include molecular beacons, Amplifluors, FRET probes, cleavable FRET probes, TaqMan probes, scorpion primers, fluorescent triplex oligos including but not limited to triplex molecular beacons or triplex FRET probes, fluorescent water-soluble conjugated polymers, PNA probes and QPNA probes. Such labels, and, in particular, the principles of their function, can be adapted for use with riboswitches. Several types of conformation dependent labels are reviewed in Schweitzer and Kingsmore, Curr. Opin. Biotech. 12:21-27 (2001).
Stem quenched labels, a form of conformation dependent labels, are fluorescent labels positioned on a nucleic acid such that when a stem structure forms a quenching moiety is brought into proximity such that fluorescence from the label is quenched. When the stem is disrupted (such as when a riboswitch containing the label is activated), the quenching moiety is no longer in proximity to the fluorescent label and fluorescence increases. Examples of this effect can be found in molecular beacons, fluorescent triplex oligos, triplex molecular beacons, triplex FRET probes, and QPNA probes, the operational principles of which can be adapted for use with riboswitches.
Stem activated labels, a form of conformation dependent labels, are labels or pairs of labels where fluorescence is increased or altered by formation of a stem structure. Stem activated labels can include an acceptor fluorescent label and a donor moiety such that, when the acceptor and donor are in proximity (when the nucleic acid strands containing the labels form a stem structure), fluorescence resonance energy transfer from the donor to the acceptor causes the acceptor to fluoresce. Stem activated, labels are typically pairs of labels positioned on nucleic acid molecules (such as riboswitches) such that the acceptor and donor are brought into proximity when a stem structure is formed in the nucleic acid molecule. If the donor moiety of a stem activated label is itself a fluorescent label, it can release energy as fluorescence (typically at a different wavelength than the fluorescence of the acceptor) when not in proximity to an acceptor (that is, when a stem structure is not formed). When the stem structure forms, the overall effect would then be a reduction of donor fluorescence and an increase in acceptor fluorescence. FRET probes are an example of the use of stem activated labels, the operational principles of which can be adapted for use with riboswitches. H. Detection Labels
To aid in detection and quantitation of riboswitch activation, deactivation or blocking, or expression of nucleic acids or protein produced upon activation, deactivation or blocking of riboswitches, detection labels can be incorporated into detection probes or detection molecules or directly incorporated into expressed nucleic acids or proteins. As used herein, a detection label is any molecule that can be associated with nucleic acid or protein, directly or indirectly, and which results in a measurable, detectable signal, either directly or indirectly. Many such labels are known to those of skill in the art. Examples of detection labels suitable for use in the disclosed method are radioactive isotopes, fluorescent molecules, phosphorescent molecules, enzymes, antibodies, and ligands.
Examples of suitable fluorescent labels include fluorescein isothiocyanate (FITC), 5,6-carboxymethyl fluorescein, Texas red, nitrobenz-2-oxa-l,3-diazol-4-yl (NBD), coumarin, dansyl chloride, rhodamine, amino-methyl coumarin (AMCA), Eosin, Erythrosin, BODIPY®, Cascade Blue®, Oregon Green®, pyrene, lissamine, xanthenes, acridines, oxazines, phycoerythrin, macrocyclic chelates of lanthanide ions such as quantum dye™, fluorescent energy transfer dyes, such as thiazole orange-ethidium heterodimer, and the cyanine dyes Cy3, Cy3.5, Cy5, Cy5.5 and Cy7. Examples of other specific fluorescent labels include 3-Hydroxypyrene 5,8,10-Tri Sulfonic acid, 5-Hydroxy , Tryptamine (5-HT), Acid Fuchsin, Alizarin Complexon, Alizarin Red, Allophycocyanin, Aminocoumarin, Anthroyl Stearate, Astrazon Brilliant Red 4G, Astrazon Orange R, Astrazon Red 6B, Astrazon Yellow 7 GLL, Atabrine, Auramine, Aurophosphine, Aurophosphine G, BAO 9 (Bisaminophenyloxadiazole), BCECF, Berberine Sulphate, Bisbenzamide, Blancophor FFG Solution, Blancophor SV, Bodipy Fl, Brilliant Sulphoflavin FF, Calcien Blue, Calcium Green, Calcofluor RW Solution, Calcofiuor White, Calcophor White ABT Solution, Calcophor White Standard Solution, Carbostyryl, Cascade Yellow, Catecholamine, Chinacrine, Coriphosphine O, Coumarin- Phalloidin, CY3.1 8, CY5.1 8, CY7, Dans (1-Dimethyl Amino Naphaline 5 Sulphonic Acid), Dansa (Diamino Naphtyl Sulphonic Acid), Dansyl NH-CH3, Diamino Phenyl Oxydiazole (DAO), Dimethylamino-5-Sulphonic acid, Dipyrrometheneboron Difluoride, Diphenyl Brilliant Flavine 7GFF, Dopamine, Erythrosin ITC, Euchrysin, FIF (Formaldehyde Induced Fluorescence), Flazo Orange, Fluo 3, Fluorescamine, Fura-2,
Genacryl Brilliant Red B, Genacryl Brilliant Yellow 10GF, Genacryl Pink 3G, Genacryl Yellow 5GF, Gloxalic Acid, Granular Blue, Haematoporphyrin, Indo-1, Intrawhite Cf Liquid, Leucophor PAF, Leucophor SF, Leucophor WS, Lissamine Rhodamine B200 (RD200), Lucifer Yellow CH, Lucifer Yellow VS, Magdala Red, Marina Blue, Maxilon Brilliant Flavin 10 GFF, Maxilon Brilliant Flavin 8 GFF, MPS (Methyl Green Pyronine Stilbene), Mithramycin, NBD Amine, Nitrobenzoxadidole, Noradrenaline, Nuclear Fast Red, Nuclear Yellow, Nylosan Brilliant Flavin E8G, Oxadiazole, Pacific Blue, Pararosaniline (Feulgen), Phorwite AR Solution, Phorwite BKL, Phorwite Rev, Phorwite RPA, Phosphine 3R, Phthalocyanine, Phycoerythrin R, Polyazaindacene Pontochrome Blue Black, Porphyrin, Primuline, Procion Yellow, Pyronine, Pyronine B, Pyrozal Brilliant Flavin 7GF, Quinacrine Mustard, Rhodamine 123, Rhodamine 5 GLD,
Rhodamine 6G, Rhodamine B, Rhodamine B 200, Rhodamine B Extra, Rhodamine BB, Rhodamine BG, Rhodamine WT, Serotonin, Sevron Brilliant Red 2B, Sevron Brilliant Red 4G, Sevron Brilliant Red B, Sevron Orange, Sevron Yellow L, SITS (Primuline), SITS (Stilbene Isothiosulphonic acid), Stilbene, Snarf 1, sulpho Rhodamine B Can C, Sulpho Rhodamine G Extra, Tetracycline, Thiazine Red R, Thioflavin S, Thioflavin TCN, Thioflavin 5, Thiolyte, Thiozol Orange, Tinopol CBS, True Blue, Ultralite, Uranine B, Uvitex SFC, Xylene Orange, and XRITC.
Useful fluorescent labels are fluorescein (5-carboxyfluorescein-N- hydroxysuccinimide ester), rhodamine (5,6-tetramethyl rhodamine), and the cyanine dyes Cy3, Cy3.5, Cy5, Cy5.5 and Cy7. The absorption and emission maxima, respectively, for these fluors are: FITC (490 nm; 520 nm), Cy3 (554 nm; 568 nni), Cy3.5 (581 nm; 588 nm), Cy5 (652 nm: 672 nm), Cy5.5 (682 nm; 703 nm) and Cy7 (755 nm; 778 nm), thus allowing their simultaneous detection. Other examples of fluorescein dyes include 6-carboxyfluorescein (6-FAM), 2',4',1,4,-tetrachlorofluorescein (TET), 2I,4',5I,7',l,4-hexachlorofluorescein (HEX), 2',7'-dimethoxy-4I, 5'-dichloro-6- carboxyrhodamine (JOE), 2'-chloro-5'-fluoro-7',8 '-fused phenyl- l,4-dichloro-6~ carboxyfluorescein (NED), and 2'-chloro-7'-phenyl-l,4-dichloro-6-carboxyfluorescein (VIC). Fluorescent labels can be obtained from a variety of commercial sources, including Amersham Pharmacia Biotech, Piscataway, NJ; Molecular Probes, Eugene, OR; and Research Organics, Cleveland, Ohio.
Additional labels of interest include those that provide for signal only when the probe with which they are associated is specifically bound to a target molecule, where such labels include: "molecular beacons" as described in Tyagi & Kramer, Nature Biotechnology (1996) 14:303 and EP 0 070 685 Bl. Other labels of interest include those described in U.S. Pat. No. 5,563,037; WO 97/17471 and WO 97/17076.
Labeled nucleotides are a useful form of detection label for direct incorporation into expressed nucleic acids during synthesis. Examples of detection labels that can be incorporated into nucleic acids include nucleotide analogs such as BrdUrd (5- bromodeoxyuridine, Hoy and Schinike, Mutation Research 290:217-230 (1993)), aminoallyldeoxyuridine (Henegariu et al:, Nature Biotechnology 18:345-348 (2000)), 5- methylcytosine (Sano et al., Biochim. Biophys. Acta 951:157-165 (1988)), bromouridine (Wansick et al, J. Cell Biology 122:283-293 (1993)) and nucleotides modified with biotin (Langer et al, Proc. Natl. Acad. Sd. USA 78:6633 (1981)) or with suitable haptens such as digoxygenin (Kerkhof, Anal. Biochem. 205:359-364 (1992)). Suitable fluorescence-labeled nucleotides are Fluorescein-isothiocyanate-dUTP, Cyanine-3-dUTP and Cyanine-5-dUTP (Yu et al, Nucleic Acids Res., 22:3226-3232 (1994)). A preferred nucleotide analog detection label for DNA is BrdUrd (bromodeoxyuridine, BrdUrd, BrdU, BUdR, Sigma- Aldrich Co). Other useful nucleotide analogs for incorporation of detection label into DNA are AA-dUTP (aminoallyl-deoxyuridine triphosphate, Sigma- Aldrich Co.), and 5-methyl-dCTP (Roche Molecular Biochemicals). A useful nucleotide analog for incorporation of detection label into RNA is biotin- 16-UTP (biotin- 16- uridine-5 '-triphosphate, Roche Molecular Biochemicals). Fluorescein, Cy3, and Cy5 can be linked to dUTP for direct labeling. Cy3.5 and Cy7 are available as avidin or anti- digoxygenin conjugates for secondary detection of biotin- or digoxygenin-labeled probes.
Detection labels that are incorporated into nucleic acid, such as biotin, can be subsequently detected using sensitive methods well-known in the art. For example, biotin can be detected using streptavidin-alkaline phosphatase conjugate (Tropix, Inc.), which is bound to the biotin and subsequently detected by chemiluminescence of suitable substrates (for example, chemiluminescent substrate CSPD: disodium, 3-(4- methoxyspiro-[l,2,-dioxetane-3-2'-(5'-chloro)tricyclo [3.3. l.l3'7]decane]-4-yl) phenyl phosphate; Tropix, Inc.). Labels can also be enzymes, such as alkaline phosphatase, soybean peroxidase, horseradish peroxidase and polymerases, that can be detected, for example, with chemical signal amplification or by using a substrate to the enzyme which produces light (for example, a chemiluminescent 1,2-dioxetane substrate) or fluorescent signal. Molecules that combine two or more of these detection labels are also considered detection labels. Any of the known detection labels can be used with the disclosed probes, tags, molecules and methods to label and detect activated or deactivated riboswitches or nucleic acid or protein produced in the disclosed methods. Methods for detecting and measuring signals generated by detection labels are also known to those of skill in the art. For example, radioactive isotopes can be detected by scintillation counting or direct visualization; fluorescent molecules can be detected with fluorescent spectrophotometers; phosphorescent molecules can be detected with a spectrophotometer or directly visualized with a camera; enzymes can be detected by detection or visualization of the product of a reaction catalyzed by the enzyme; antibodies can be detected by detecting a secondary detection label coupled to the antibody. As used herein, detection molecules are molecules which interact with a compound or composition to be detected and to which one or more detection labels are coupled. I. Sequence Similarities It is understood that as discussed herein the use of the terms homology and identity mean the same thing as similarity. Thus, for example, if the use of the word homology is used between two sequences (non-natural sequences, for example) it is understood that this is not necessarily indicating an evolutionary relationship between these two sequences, but rather is looking at the similarity or relatedness between their nucleic acid sequences. Many of the methods for determining homology between two evolutionarily related molecules are routinely applied to any two or more nucleic acids or proteins for the purpose of measuring sequence similarity regardless of whether they are evolutionarily related or not.
In general, it is understood that one way to define any known variants and derivatives or those that might arise, of the disclosed riboswitches, aptamers, expression platforms, genes and proteins herein, is through defining the variants and derivatives in terms of homology to specific known sequences. This identity of particular sequences disclosed herein is also discussed elsewhere herein. In general, variants of riboswitches, aptamers, expression platforms, genes and proteins herein disclosed typically have at least, about 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89,
90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent homology to a stated sequence or a native sequence. Those of skill in the art readily understand how to determine the homology of two proteins or nucleic acids, such as genes. For example, the homology can be calculated after aligning the two sequences so that the homology is at its highest level.
Another way of calculating homology can be performed by published algorithms. Optimal alignment of sequences for comparison can be conducted by the local homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection.
The same types of homology can be obtained for nucleic acids by for example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad. Sci. USA 86:7706-7710, 1989, Jaeger et Ά\. Methods Enzymol. 183:281-306, 1989 which are herein incorporated by reference for at least material related to nucleic acid alignment. It is understood that any of the methods typically can be used and that in certain instances the results of these various methods can differ, but the skilled artisan understands if identity is found with at least one of these methods, the sequences would be said to have the stated identity.
For example, as used herein, a sequence recited as having a particular percent homology to another sequence refers to sequences that have the recited homology as calculated by any one or more of the calculation methods described above. For example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using the Zuker calculation method even if the first sequence does not have 80 percent homology to the second sequence as calculated by any of the other calculation methods. As another example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using both the Zuker calculation method and the Pearson and Lipman calculation method even if the first sequence does not have 80 percent homology to the second sequence as calculated by the Smith and Waterman calculation method, the
Needleman and Wunsch calculation method, the Jaeger calculation methods, or any of the other calculation methods. As yet another example, a first sequence has 80 percent homology, as defined herein, to a second sequence if the first sequence is calculated to have 80 percent homology to the second sequence using each of calculation methods (although, in practice, the different calculation methods will often result in different calculated homology percentages). J. Hybridization and Selective Hybridization The term hybridization typically means a sequence driven interaction between at least two nucleic acid molecules, such as a primer or a probe and a riboswitch or a gene. Sequence driven interaction means an interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives in a nucleotide specific manner. For example, G interacting with C or A interacting with T are sequence driven interactions. Typically sequence driven interactions occur on the Watson-Crick face or Hoogsteen face of the nucleotide. The hybridization of two nucleic acids is affected by a number of conditions and parameters known to those of skill in the art. For example, the salt concentrations, pH, and temperature of the reaction all affect whether two nucleic acid molecules will hybridize. Parameters for selective hybridization between two nucleic acid molecules are well known to those of skill in the art. For example, in some embodiments selective hybridization conditions can be defined as stringent hybridization conditions. For example, stringency of hybridization is controlled by both temperature and salt concentration of either or both of the hybridization and washing steps. For example, the conditions of hybridization to achieve selective hybridization can involve hybridization in high ionic strength solution (6X SSC or 6X SSPE) at a temperature that is about 12- 250C below the Tm (the melting temperature at which half of the molecules dissociate from their hybridization partners) followed by washing at a combination of temperature and salt concentration chosen so that the washing temperature is about 5°C to 20°C below the Tm. The temperature and salt conditions are readily determined empirically in preliminary experiments in which samples of reference DNA immobilized on filters are hybridized to a labeled nucleic acid of interest and then washed under conditions of different stringencies. Hybridization temperatures are typically higher for DNA-RNA and PvNA-RNA hybridizations. The conditions can be used as described above to achieve stringency, or as is known in the art (Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989; Kunkel et al. Methods Enzymol. 1987:154:367, 1987 which is herein incorporated by reference for material at least related to hybridization of nucleic acids). A preferable stringent hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous solution) in 6X SSC or 6X SSPE followed by washing at 680C. Stringency of hybridization and washing, if desired, can be reduced accordingly as the degree of complementarity desired is decreased, and further, depending upon the G-C or A-T richness of any area wherein variability is searched for. Likewise, stringency of hybridization and washing, if desired, can be increased accordingly as homology desired is increased, and further, depending upon the G-C or A-T richness of any area wherein high homology is desired, all as known in the art.
Another way to define selective hybridization is by looking at the amount (percentage) of one of the nucleic acids bound to the other nucleic acid. For example, in some embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the limiting nucleic acid is bound to the non- limiting nucleic acid. Typically, the non-limiting nucleic acid is in for example, 10 or 100 or 1000 fold excess. This type of assay can be performed at under conditions where both the limiting and non-limiting nucleic acids are for example, 10 fold or 100 fold or 1000 fold below their kd, or where only one of the nucleic acid molecules is 10 fold or 100 fold or 1000 fold or where one or both nucleic acid molecules are above their kd. Another way to define selective hybridization is by looking at the percentage of nucleic acid that gets enzymatically manipulated under conditions where hybridization is required to promote the desired enzymatic manipulation. For example, in some embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the nucleic acid is enzymatically manipulated under conditions which promote the enzymatic manipulation, for example if the enzymatic manipulation is DNA extension, then selective hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of the nucleic acid molecules are extended. Preferred conditions also include those suggested by the manufacturer or indicated in the art as being appropriate for the enzyme performing the manipulation.
Just as with homology, it is understood that there are a variety of methods herein disclosed for determining the level of hybridization between two nucleic acid molecules.
It is understood that these methods and conditions can provide different percentages of hybridization between two nucleic acid molecules, but unless otherwise indicated meeting the parameters of any of the methods would be sufficient. For example if 80% hybridization was required and as long as hybridization occurs within the required parameters in any one of these methods it is considered disclosed herein. It is understood that those of skill in the art understand that if a composition or method meets any one of these criteria for determining hybridization either collectively or singly it is a composition or method that is disclosed herein. K. Nucleic Acids
There are a variety of molecules disclosed herein that are nucleic acid based, including, for example, riboswitches, aptamers, and nucleic acids that encode riboswitches and aptamers. The disclosed nucleic acids can be made up of for example, nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and other molecules are discussed herein. It is understood that for example, when a vector is expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U. Likewise, it is understood that if a nucleic acid molecule is introduced into a cell or cell environment through for example exogenous delivery, it is advantageous that the nucleic acid molecule be made up of nucleotide analogs that reduce the degradation of the nucleic acid molecule in the cellular environment.
So long as their relevant function is maintained, riboswitches, aptamers, expression platforms and any other oligonucleotides and nucleic acids can be made up of or include modified nucleotides (nucleotide analogs). Many modified nucleotides are known and can be used in oligonucleotides and nucleic acids. A nucleotide analog is a nucleotide which contains some type of modification to either the base, sugar, or phosphate moieties. Modifications to the base moiety would include natural and synthetic modifications of A, C, G, and T/U as well as different purine or pyrimidine bases, such as uracil-5-yl, hypoxanthin-9-yl (I), and 2-aminoadenin-9-yl. A modified base includes but is not limited to 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-proρynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Additional base modifications can be found for example in U.S. Pat. No. 3,687,808, Englisch et al., Angewandte Chemie, International Edition, 1991, 30, 613, and Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages
289-302, Crooke, S. T. and Lebleu, B. ed., CRC Press, 1993. Certain nucleotide analogs, such as 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-proρynyluracil and 5-propynylcytosine. 5-methylcytosine can increase the stability of duplex formation. Other modified bases are those that function as universal bases. Universal bases include 3-nitropyrrole and 5- nitroindole. Universal bases substitute for the normal bases but have no bias in base pairing. That is, universal bases can base pair with any other base. Base modifications often can be combined with for example a sugar modification, such as 2'-O- methoxyethyl, to achieve unique properties such as increased duplex stability. There are numerous United States patents such as 4,845,205; 5,130,302; 5,134,066; 5,175,273; 5,367,066; 5,432,272; 5,457,187; 5,459,255; 5,484,908; 5,502,177; 5,525,711; 5,552,540; 5,587,469; 5,594,121, 5,596,091; 5,614,617; and 5,681,941, which detail and describe a range of base modifications. Each of these patents is herein incorporated by reference in its entirety, and specifically for their description of base modifications, their synthesis, their use, and their incorporation into oligonucleotides and nucleic acids.
Nucleotide analogs can also include modifications of the sugar moiety. Modifications to the sugar moiety would include natural modifications of the ribose and deoxyribose as well as synthetic modifications. Sugar modifications include but are not limited to the following modifications at the 2' position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-0-alkyl, wherein the alkyl, alkenyl and alkynyl can be substituted or unsubstituted Cl to ClO, alkyl or C2 to ClO alkenyl and alkynyl. 2' sugar modifications also include but are not limited to -O[(CH2)n O]m CH3, - O(CH2)n OCH3, -O(CH2)n NH2, -O(CH2)n CH3, -O(CH2)n -ONH2, and - O(CH2)nON[(CH2)n CH3)J2, where n and m are from 1 to about 10. Other modifications at the 2' position include but are not limited to: Cl to ClO lower alkyl, substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH3, OCN, Cl, Br, CN, CF3, OCF3, SOCH3, SO2 CH3, ONO2, NO2, N3, NH2, heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. Similar modifications can also be made at other positions on the sugar, particularly the 3' position of the sugar on the 31 terminal nucleotide or in 2'-5' linked oligonucleotides and the 5' position of 5' terminal nucleotide. Modified sugars would also include those that contain modifications at the bridging ring oxygen, such as CH2 and S. Nucleotide sugar analogs can also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. There are numerous United States patents that teach the preparation of such modified sugar structures such as 4,981 ,957; 5,118,800; 5,319,080; 5,359,044; 5,393,878; 5,446,137; 5,466,786; 5,514,785; 5,519,134; 5,567,811; 5,576,427; 5,591,722; 5,597,909; 5,610,300; 5,627,053; 5,639,873; 5,646,265; 5,658,873; 5,670,633; and 5,700,920, each of which is herein incorporated by reference in its entirety, and specifically for their description of modified sugar structures, their synthesis, their use, and their incorporation into nucleotides, oligonucleotides and nucleic acids.
Nucleotide analogs can also be modified at the phosphate moiety. Modified phosphate moieties include but are not limited to those that can be modified so that the linkage between two nucleotides contains a phosphorothioate, chiral phosphorothioate, phosphorodithioate, phosphotriester, aminoalkylphosphotriester, methyl and other alkyl phosphonates including 3'-alkylene phosphonate and chiral phosphonates, phosphinates, phosphoramidates including 3 '-amino phosphoramidate and aminoalkylphosphoramidateSj thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates. It is understood that these phosphate or modified phosphate linkages between two nucleotides can be through a
3 '-5' linkage or a 2'-5' linkage, and the linkage can contain inverted polarity such as 3 '-5' to 5'-3' or 2'-5' to 5'-2'. Various salts, mixed salts and free acid forms are also included. Numerous United States patents teach how to make and use nucleotides containing modified phosphates and include but are not limited to, 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; and
5,625,050, each of which is herein incorporated by reference its entirety, and specifically for their description of modified phosphates, their synthesis, their use, and their incorporation into nucleotides, oligonucleotides and nucleic acids.
It is understood that nucleotide analogs need only contain a single modification, but can also contain multiple modifications within one of the moieties or between different moieties.
Nucleotide substitutes are molecules having similar functional properties to nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA). Nucleotide substitutes are molecules that will recognize and hybridize to (base pair to) complementary nucleic acids in a Watson-Crick or Hoogsteen manner, but which are linked together through a moiety other than a phosphate moiety. Nucleotide substitutes are able to conform to a double helix type structure when interacting with the appropriate target nucleic acid.
Nucleotide substitutes are nucleotides or nucleotide analogs that have had the phosphate moiety and/or sugar moieties replaced. Nucleotide substitutes do not contain a standard phosphorus atom. Substitutes for the phosphate can be for example, short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts. Numerous United States patents disclose how to make and use these types of phosphate replacements and include but are not limited to 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which is herein incorporated by reference its entirety, and specifically for their description of phosphate replacements, their synthesis, their use, and their incorporation into nucleotides, oligonucleotides and nucleic acids. It is also understood in a nucleotide substitute that both the sugar and the phosphate moieties of the nucleotide can be replaced, by for example an amide type linkage (aminoethylglycine) (PNA). United States patents 5,539,082; 5,714,331; and 5,719,262 teach how to make and use PNA molecules, each of which is herein incorporated by reference. (See also Nielsen et ah, Science 254:1497-1500 (1991)).
Oligonucleotides and nucleic acids can be comprised of nucleotides and can be made up of different types of nucleotides or the same type of nucleotides. For example, one or more of the nucleotides in an oligonucleotide can be ribonucleotides, 2'-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2'-O-methyl ribonucleotides; about 10% to about 50% of the nucleotides can be ribonucleotides, 2'-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2'-O-methyl ribonucleotides; about 50% or more of the nucleotides can be ribonucleotides, 2'-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2'-O-methyl ribonucleotides; or all of the nucleotides are ribonucleotides, 2'-O-methyl ribonucleotides, or a mixture of ribonucleotides and 2'-O- methyl ribonucleotides. Such oligonucleotides and nucleic acids can be referred to as chimeric oligonucleotides and chimeric nucleic acids. L. Solid Supports
Solid supports are solid-state substrates or supports with which molecules (such as trigger molecules) and riboswitches (or other components used in, or produced by, the disclosed methods) can be associated. Riboswitches and other molecules can be associated with solid supports directly or indirectly. For example, analytes (e.g., trigger molecules, test compounds) can be bound to the surface of a solid support or associated with capture agents (e.g., compounds or molecules that bind an analyte) immobilized on solid supports. As another example, riboswitches can be bound to the surface of a solid support or associated with probes immobilized on solid supports. An array is a solid support to which multiple riboswitches, probes or other molecules have been associated in an array, grid, or other organized pattern.
Solid-state substrates for use in solid supports can include any solid material with which components can be associated, directly or indirectly. This includes materials such as acrylamide, agarose, cellulose, nitrocellulose, glass, gold, polystyrene, polyethylene vinyl acetate, polypropylene, polymethacrylate, polyethylene, polyethylene oxide, polysilicates, polycarbonates, teflon, fluorocarbons, nylon, silicon rubber, polyanhydrides, polyglycolic acid, polylactic acid, polyorthoesters, functionalized silane, polypropylfumerate, collagen, glycosaminoglycans, and polyamino acids. Solid-state substrates can have any useful form including thin film, membrane, bottles, dishes, fibers, woven fibers, shaped polymers, particles, beads, microparticles, or a combination. Solid-state substrates and solid supports can be porous or non-porous. A chip is a rectangular or square small piece of material. Preferred forms for solid-state substrates are thin films, beads, or chips. A useful form for a solid-state substrate is a microliter dish. In some embodiments, a multiwell glass slide can be employed.
An array can include a plurality of riboswitches, trigger molecules, other molecules, compounds or probes immobilized at identified or predefined locations on the solid support. Each predefined location on the solid support generally has one type of component (that is, all the components at that location are the same). Alternatively, multiple types of components can be immobilized in the same predefined location on a solid support. Each location will have multiple copies of the given components. The spatial separation of different components on the solid support allows separate detection and identification.
Although useful, it is not required that the solid support be a single unit or structure. A set of riboswitches, trigger molecules, other molecules, compounds and/or probes can be distributed over any number of solid supports. For example, at one extreme, each component can be immobilized in a separate reaction tube or container, or on separate beads or microparticles.
Methods for immobilization of oligonucleotides to solid-state substrates are well established. Oligonucleotides, including address probes and detection probes, can be coupled to substrates using established coupling methods. For example, suitable attachment methods are described by Pease et al, Proc. Natl. Acad. Sd. USA 91(ll):5022-5026 (1994), and Khrapko et al., MoI Biol (Mosk) (USSR) 25:718-730 (1991). A method for immobilization of 3'-amine oligonucleotides on casein-coated slides is described by Stimpson et al, Proc. Natl. Acad. ScL USA 92:6379-6383 (1995). A useful method of attaching oligonucleotides to solid-state substrates is described by Guo et al, Nucleic Acids Res. 22:5456-5465 (1994). Each of the components (for example, riboswitches, trigger molecules, or other molecules) immobilized on the solid support can be located in a different predefined region of the solid support. The different locations can be different reaction chambers.
Each of the different predefined regions can be physically separated from each other of the different regions. The distance between the different predefined regions of the solid support can be either fixed or variable. For example, in an array, each of the components can be arranged at fixed distances from each other, while components associated with beads will not be in a fixed spatial relationship. In particular, the use of multiple solid support units (for example, multiple beads) will result in variable distances.
Components can be associated or immobilized on a solid support at any density. Components can be immobilized to the solid support at a density exceeding 400 different components per cubic centimeter. Arrays of components can have any number of components. For example, an array can have at least 1,000 different components immobilized on the solid support, at least 10,000 different components immobilized on the solid support, at least 100,000 different components immobilized on the solid support, or at least 1,000,000 different components immobilized on the solid support. M. Kits
The materials described above as well as other materials can be packaged together in any suitable combination as a kit useful for performing, or aiding in the performance of, the disclosed method. It is useful if the kit components in a given kit are designed and adapted for use together in the disclosed method. For example disclosed are kits for detecting compounds, the kit comprising one or more biosensor riboswitches. The kits also can contain reagents and labels for detecting activation of the riboswitches. N. Mixtures
Disclosed are mixtures formed by performing or preparing to perform the disclosed method. For example, disclosed are mixtures comprising riboswitches and trigger molecules.
Whenever the method involves mixing or bringing into contact compositions or components or reagents, performing the method creates a number of different mixtures. For example, if the method includes 3 mixing steps, after each one of these steps a unique mixture is formed if the steps are performed separately. In addition, a mixture is formed at the completion of all of the steps regardless of how the steps were performed. The present disclosure contemplates these mixtures, obtained by the performance of the disclosed methods as well as mixtures containing any disclosed reagent, composition, or component, for example, disclosed herein. O. Systems
Disclosed are systems useful for performing, or aiding in the performance of, the disclosed method. Systems generally comprise combinations of articles of manufacture such as structures, machines, devices, and the like, and compositions, compounds, materials, and the like. Such combinations that are disclosed or that are apparent from the disclosure are contemplated. For example, disclosed and contemplated are systems comprising biosensor riboswitches, a solid support and a signal-reading device. P. Data Structures and Computer Control
Disclosed are data structures used in, generated by, or generated from, the disclosed method. Data structures generally are any form of data, information, and/or objects collected, organized, stored, and/or embodied in a composition or medium. Riboswitch structures and activation measurements stored in electronic form, such as in RAM or on a storage disk, is a type of data structure.
The disclosed method, or any part thereof or preparation therefor, can be controlled, managed, or otherwise assisted by computer control. Such computer control can be accomplished by a computer controlled process or method, can use and/or generate data structures, and can use a computer program. Such computer control, computer controlled processes, data structures, and computer programs are contemplated and should be understood to be disclosed herein. Methods
Disclosed are methods for activating, deactivating or blocking a riboswitch. Such methods can involve, for example, bringing into contact a riboswitch and a compound or trigger molecule that can activate, deactivate or block the riboswitch. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Compounds can be used to activate, deactivate or block a riboswitch. The trigger molecule for a riboswitch (as well as other activating compounds) can be used to activate a riboswitch. Compounds other than the trigger molecule generally can be used to deactivate or block a riboswitch. Riboswitches can also be deactivated by, for example, removing trigger molecules from the presence of the riboswitch. Thus, the disclosed method of deactivating a riboswitch can involve, for example, removing a trigger molecule (or other activating compound) from the presence or contact with the riboswitch. A riboswitch can be blocked by, for example, binding of an analog of the trigger molecule that does not activate the riboswitch. Also disclosed are methods for altering expression of an RNA molecule, or of a gene encoding an RNA molecule, where the RNA molecule includes a riboswitch, by bringing a compound into contact with the RNA molecule. Riboswitches function to control gene expression through the binding or removal of a trigger molecule. Thus, subjecting an RNA molecule of interest that includes a riboswitch to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA. Expression can be altered as a result of, for example, termination of transcription or blocking of ribosome binding to the RNA. Binding of a trigger molecule can, depending on the nature of the riboswitch, reduce or prevent expression of the RNA molecule or promote or increase expression of the RNA molecule.
Also disclosed are methods for regulating expression of an RNA molecule, or of a gene encoding an RNA molecule, by operably linking a riboswitch to the RNA molecule. A riboswitch can be operably linked to an RNA molecule in any suitable manner, including, for example, by physically joining the riboswitch to the RNA molecule or by engineering nucleic acid encoding the RNA molecule to include and encode the riboswitch such that the RNA produced from the engineered nucleic acid has the riboswitch operably linked to the RNA molecule. Subjecting a riboswitch operably linked to an RNA molecule of interest to conditions that activate, deactivate or block the riboswitch can be used to alter expression of the RNA. Also disclosed are methods for regulating expression of a naturally occurring gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. If the gene is essential for survival of a cell or organism that harbors it, activating, deactivating or blocking the riboswitch can result in death, stasis or debilitation of the cell or organism. For example, activating a naturally occurring riboswitch in a naturally occurring gene that is essential to survival of a microorganism can result in death of the microorganism (if activation of the riboswitch turns off or represses expression). This is one basis for the use of the disclosed compounds and methods for antimicrobial and antibiotic effects.
Also disclosed are methods for regulating expression of an isolated, engineered or recombinant gene or RNA that contains a riboswitch by activating, deactivating or blocking the riboswitch. The gene or RNA can be engineered or can be recombinant in any manner. For example, the riboswitch and coding region of the RNA can be heterologous, the riboswitch can be recombinant or chimeric, or both. If the gene encodes a desired expression product, activating or deactivating the riboswitch can be used to induce expression of the gene and thus result in production of the expression product. If the gene encodes an inducer or repressor of gene expression or of another cellular process, activation, deactivation or blocking of the riboswitch can result in induction, repression, or de-repression of other, regulated genes or cellular processes. Many such secondary regulatory effects are known and can be adapted for use with riboswitches. An advantage of riboswitches as the primary control for such regulation is that riboswitch trigger molecules can be small, non-antigenic molecules.
Also disclosed are methods for altering the regulation of a riboswitch by operably linking an aptamer domain to the expression platform domain of the riboswitch (which is a chimeric riboswitch). The aptamer domain can then mediate regulation of the riboswitch through the action of, for example, a trigger molecule for the aptamer domain. Aptamer domains can be operably linked to expression platform domains of riboswitches in any suitable manner, including, for example, by replacing the normal or natural aptamer domain of the riboswitch with the new aptamer domain. Generally, any compound or condition that can activate, deactivate or block the riboswitch from which the aptamer domain is derived can be used to activate, deactivate or block the chimeric riboswitch.
Also disclosed are methods for inactivating a riboswitch by covalently altering the riboswitch (by, for example, crosslinking parts of the riboswitch or coupling a compound to the riboswitch). Inactivation of a riboswitch in this manner can result from, for example, an alteration that prevents the trigger molecule for the riboswitch from binding, that prevents the change in state of the riboswitch upon binding of the trigger molecule, or that prevents the expression platform domain of the riboswitch from affecting expression upon binding of the trigger molecule.
Also disclosed are methods for selecting, designing or deriving new riboswitches and/or new aptamers that recognize new trigger molecules. Such methods can involve production of a set of aptamer variants in a riboswitch, assessing the activation of the variant riboswitches in the presence of a compound of interest, selecting variant riboswitches that were activated (or, for example, the riboswitches that were the most highly or the most selectively activated), and repeating these steps until a variant riboswitch of a desired activity, specificity, combination of activity and specificity, or other combination of properties results. Also disclosed are riboswitches and aptamer domains produced by these methods.
Techniques for in vitro selection and in vitro evolution of functional nucleic acid molecules are known and can be adapted for use with riboswitches and their components. Useful techniques are described by, for example, A. Roth and R. R. Breaker (2003) Selection in vitro of allosteric ribozymes. In: Methods in Molecular Biology Series - Catalytic Nucleic Acid Protocols (Sioud, M., ed.), Humana, Totowa, NJ; R. R. Breaker (2002) Engineered Allosteric Ribozymes as Biosensor Components. Curr. Opin. Biotechnol. 13:31-39; G. M. Emilsson and R. R. Breaker (2002) Deoxyribozymes: New Activities and New Applications. Cell. MoI. Life Sci. 59:596- 607; Y. Li, R. R. Breaker (2001) In vitro Selection of Kinase and Ligase Deoxyribozymes. Methods 23:179-190; G. A. Soukup, R. R. Breaker (2000) Allosteric Ribozymes. In: Ribozymes: Biology and Biotechnology. R. K. Gaur and G. Krupp eds. Eaton Publishing; G. A. Soukup, R. R. Breaker (2000) Allosteric Nucleic Acid Catalysts. Curr. Opin. Struct. Biol. 10:318-325; G. A. Soukup, R. R. Breaker (1999) Nucleic Acid Molecular Switches. Trends Biotechnol. 17:469-476; R. R. Breaker (1999) In vitro Selection of Self-cleaving Ribozymes and Deoxyribozymes. In: Intracellular Ribozyme Applications: Principles and Protocols. L. Couture, J. Rossi eds. Horizon Scientific Press, Norfolk, England; R. R. Breaker (1997) In vitro Selection of Catalytic Polynucleotides. Chem. Rev. 97:371-390; and references cited therein; each of these publications being specifically incorporated herein by reference for their description of in vitro selections and evolution techniques.
Also disclosed are methods for selecting and identifying compounds that can activate, deactivate or block a riboswitch. Activation of a riboswitch refers to the change in state of the riboswitch upon binding of a trigger molecule. A riboswitch can be activated by compounds other than the trigger molecule and in ways other than binding of a trigger molecule. The term trigger molecule is used herein to refer to molecules and compounds that can activate a riboswitch. This includes the natural or normal trigger molecule for the riboswitch and other compounds that can activate the riboswitch. Natural or normal trigger molecules are the trigger molecule for a given riboswitch in nature or, in the case of some non-natural riboswitches, the trigger molecule for which the riboswitch was designed or with which the riboswitch was selected (as in, for example, in vitro selection or in vitro evolution techniques). Non-natural trigger molecules can be referred to as non-natural trigger molecules.
Also disclosed herein is a method of identifying a compound that interacts with a riboswitch comprising: modeling the atomic structure the riboswitch with a test compound; and determining if the test compound interacts with the riboswitch.
Determining if the test compound interacts with the riboswitch can be accomplished by, for example, determining a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch, as described elsewhere herein. Determining if the test compound interacts with the riboswitch can be accomplished by, for example, determining one or more predicted bonds, one or more predicted interactions, or a combination, of the test compound with the model of the riboswitch. The predicted interactions can be selected from the group consisting of, for example, van der Waals interactions, hydrogen bonds, electrostatic interactions, hydrophobic interactions, or a combination, as described above. In one example, the riboswitch is a guanine riboswitch.
Atomic contacts can be determined when interaction with the riboswitch is determined, thereby determining the interaction of the test compound with the riboswitch. Analogs of the test compound can be identified, and it can be determined if the analogs of the test compound interact with the riboswitch.
Also disclosed are methods of killing or inhibiting bacteria, comprising contacting the bacteria with a compound disclosed herein or identified by the methods disclosed herein.
Also disclosed is a method of identifying a compound that interacts with a riboswitch comprising: identifying the crystal structure of the riboswitch, modeling the riboswitch with a test compound, and determining if the test compound interacts with the riboswitch. Deactivation of a riboswitch refers to the change in state of the riboswitch when the trigger molecule is not bound. A riboswitch can be deactivated by binding of compounds other than the trigger molecule and in ways other than removal of the trigger molecule. Blocking of a riboswitch refers to a condition or state of the riboswitch where the presence of the trigger molecule does not activate the riboswitch.
Also disclosed are methods of identifying compounds that activate, deactivate or block a riboswitch. For examples, compounds that activate a riboswitch can be identified by bringing into contact a test compound and a riboswitch and assessing activation of the riboswitch. If the riboswitch is activated, the test compound is identified as a compound that activates the riboswitch. Activation of a riboswitch can be assessed in any suitable manner. For example, the riboswitch can be linked to a reporter RNA and expression, expression level, or change in expression level of the reporter RNA can be measured in the presence and absence of the test compound. As another example, the riboswitch can include a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch. As can be seen, assessment of activation of a riboswitch can be performed with the use of a control assay or measurement or without the use of a control assay or measurement. Methods for identifying compounds that deactivate a riboswitch can be performed in analogous ways.
In addition to the methods disclosed elsewhere herein, identification of compounds that block a riboswitch can be accomplished in any suitable manner. For example, an assay can be performed for assessing activation or deactivation of a riboswitch in the presence of a compound known to activate or deactivate the riboswitch and in the presence of a test compound. If activation or deactivation is not observed as would be observed in the absence of the test compound, then the test compound is identified as a compound that blocks activation or deactivation of the riboswitch.
Also disclosed are methods of detecting compounds using biosensor riboswitches. The method can include bringing into contact a test sample and a biosensor riboswitch and assessing the activation of the biosensor riboswitch. Activation of the biosensor riboswitch indicates the presence of the trigger molecule for the biosensor riboswitch in the test sample. Biosensor riboswitches are engineered riboswitches that produce a detectable signal in the presence of their cognate trigger molecule. Useful biosensor riboswitches can be triggered at or above threshold levels of the trigger molecules. Biosensor riboswitches can be designed for use in vivo or in vitro. For example, biosensor riboswitches operably linked to a reporter RNA that encodes a protein that serves as or is involved in producing a signal can be used in vivo by engineering a cell or organism to harbor a nucleic acid construct encoding the riboswitch/reporter RNA. An example of a biosensor riboswitch for use in vitro is a riboswitch that includes a conformation dependent label, the signal from which changes depending on the activation state of the riboswitch. Such a biosensor riboswitch preferably uses an aptamer domain from or derived from a naturally occurring riboswitch.
Biosensor riboswitches can be used to monitor changing conditions because riboswitch activation is reversible when the concentration of the trigger molecule falls and so the signal can vary as concentration of the trigger molecule varies. The range of concentration of trigger molecules that can be detected can be varied by engineering riboswitches having different dissociation constants for the trigger molecule. This can easily be accomplished by, for example, "degrading" the sensitivity of a riboswitch having high affinity for the trigger molecule. A range of concentrations can be monitored by using multiple biosensor riboswitches of different sensitivities in the same sensor or assay.
Also disclosed are compounds made by identifying a compound that activates, deactivates or blocks a riboswitch and manufacturing the identified compound. This can be accomplished by, for example, combining compound identification methods as disclosed elsewhere herein with methods for manufacturing the identified compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound.
Also disclosed are compounds made by checking activation, deactivation or blocking of a riboswitch by a compound and manufacturing the checked compound. This can be accomplished by, for example, combining compound activation, deactivation or blocking assessment methods as disclosed elsewhere herein with methods for manufacturing the checked compounds. For example, compounds can be made by bringing into contact a test compound and a riboswitch, assessing activation of the riboswitch, and, if the riboswitch is activated by the test compound, manufacturing the test compound that activates the riboswitch as the compound. Checking compounds for their ability to activate, deactivate or block a riboswitch refers to both identification of compounds previously unknown to activate, deactivate or block a riboswitch and to assessing the ability of a compound to activate, deactivate or block a riboswitch where the compound was already known to activate, deactivate or block the riboswitch. Disclosed is a method of detecting a compound of interest, the method comprising bringing into contact a sample and a riboswitch, wherein the riboswitch is activated by the compound of interest, wherein the riboswitch produces a signal when activated by the compound of interest, wherein the riboswitch produces a signal when the sample contains the compound of interest. The riboswitch can change conformation when activated by the compound of interest, wherein the change in conformation produces a signal via a conformation dependent label. The riboswitch can change conformation when activated by the compound of interest, wherein the change in conformation causes a change in expression of an RNA linked to the riboswitch, wherein the change in expression produces a signal. The signal can be produced by a reporter protein expressed from the RNA linked to the riboswitch.
Disclosed is a method comprising (a) testing a compound for inhibition of gene expression of a gene encoding an RNA comprising a riboswitch, wherein the inhibition is via the riboswitch, and (b) inhibiting gene expression by bringing into contact a cell and a compound that inhibited gene expression in step (a), wherein the cell comprises a gene encoding an RNA comprising a riboswitch, wherein the compound inhibits expression of the gene by binding to the riboswitch.
Also disclosed is a method of identifying riboswitches, the method comprising assessing in-line spontaneous cleavage of an RNA molecule in the presence and absence of a compound, wherein the RNA molecule is encoded by a gene regulated by the compound, wherein a change in the pattern of in-line spontaneous cleavage of the RNA molecule indicates a riboswitch.
A. Identification of Antimicrobial Compounds
Riboswitches are a new class of structured RNAs that have evolved for the purpose of binding small organic molecules. The natural binding pocket of riboswitches can be targeted with metabolite analogs or by compounds that mimic the shape-space of the natural metabolite. Riboswitches are: (1) found in numerous Gram-positive and Gram-negative bacteria including Bacillus anthracis, (2) fundamental regulators of gene expression in these bacteria, (3) present in multiple copies that would be unlikely to evolve simultaneous resistance, and (4) not yet proven to exist in humans. This combination of features make riboswitches attractive targets for new antimicrobial compounds. Further, the small molecule ligands of riboswitches provide useful sites for derivitization to produce drug candidates. Distribution of some riboswitches is shown in Table 1 of U.S. Application Publication No. 2005-0053951.
Once a class of riboswitch has been identified and its potential as a drug target assessed (by, for example, determining how many genes in a target organism are regulated by that class of riboswitch), candidate molecules can be identified.
It has been determined that two compounds that have been known for many years to kill bacteria function by binding to riboswitches. Specifically, the compound aminoethylcysteine (an analog of lysine) binds to lysine riboswitches of bacteria and prevents expression of genes required for lysine biosynthesis. Similarly, the compound pyrithiamine (an analog of thiamine) is di-phosphorylated by bacterial cells to yield pyrithiamine pyrophosphate. It his latter compound mimics thiamine pyrophosphate (TPP) and is toxic to bacteria because it binds to TPP riboswitches. Therefore, targeting riboswitches via application of modem drug discovery strategies can lead to the discovery of new classes of antibiotic compounds. Structure-based drug design methods can be used as disclosed herein to generate metabolite analogs that bind to riboswitches and trigger their allosteric action.
The emergence of drug-resistant stains of bacteria highlights the need for the identification of new classes of antibiotics. Anti-riboswitch drugs represent a mode of anti-bacterial action that is of considerable interest for the following reasons. Riboswitches control the expression of genes that are critical for fundamental metabolic processes. Therefore manipulation of these gene control elements with drugs yields new antibiotics. Riboswitches also carry RNA structures that have evolved to selectively bind metabolites, and therefore these RNA receptors make good drug targets as do protein enzymes and receptors. Furthermore, it has been shown that two antimicrobial compounds (discussed above) kill bacteria by deactivating the antibiotics resistance to emerge through mutation of the RNA target. There are at least 11 classes of well- conserved riboswitches in many bacteria, providing numerous drug targets.
Many organisms from both Gram-positive and Gram-negative lineages have numerous classes of riboswitches (see U.S. Application Publication No. 2005-0053951). Major disease-causing organisms such as Staphylococcus and major bioterror-related organisms such as Bacillus anthracis are both rich with riboswitches. As disclosed herein, the atomic-resolution structure model for a guanine riboswitch has been elucidated, which enables the use of structure-based design methods for creating riboswitch-binding compounds. Specifically, the model for the binding site of the guanine riboswitch shows that two channels are present that would permit ligand modification (Figure 12A). 26 guanine analogs have been generated with chemical modifications at either the N2 or the 06 positions of guanine, and nearly all tested so far bind to the riboswitch with sub-nanomolar dissociation constants. Figure 12B depicts the structures of guanine analogs synthesized with modified N2 and 06 positions. Most of these compounds take advantage of the molecular recognition "blind spots" in the binding site model or the aptamer domain form a B. subtilis guanine riboswitch. The successful compounds can be used as a scaffold upon which further chemical variation can be introduced to create non-toxic, bioavailable, high affinity, anti-riboswitch compounds.
The following provides an illustration of this using the SAM riboswitch (see Example 7 of U.S. Application Publication No. 2005-0053951).SAM analogs that substitute the reactive methyl and sulfonium ion center with stable sulfur-based linkages (YBD-2 and YBD3) are recognized with adequate affinity (low to mid-nanomolar range) by the riboswitch to serve as a platform for synthesis of additional SAM analogs. In addition, a wider range of linkage analogs (N- and C-based linkages) can be synthesized and tested to provide the optimal platform upon which to make amino acid and nucleoside derivations. Sulfoxide and sulfone derivatives of SAM can be used to generate analogs.
Established synthetic protocols described in Ronald T. Borchardt and Yih Shiong Wu, Potential inhibitor of S-adenosylmethionine-dependent methyltransferase. 1. Modification of the amino acid portion of S-adenosylhomocysteine. J. Med. Chem. 17, 862-868, 1974, can be used, for example. These and other analogs can be synthesized and assayed for binding sequentially or in small groups. Additional SAM analogs can be designed during the progression of compound identification based on the recognition determinants that are established in each round. Simple binding assays can be conducted on B. subtilis and B. anthracis riboswitch RNAs as described elsewhere herein. More advanced assays can also be used. The most promising SAM analog lead compounds must enter bacterial cells and bind riboswitches while remaining metabolically inert. In addition, useful SAM analogs must be bound tightly by the riboswitch, but must also fail to compete for SAM in the active sites of protein enzymes, or there is a risk of generating an undesirable toxic effect in the patient's cells. As a preliminary assessment of these issues, compounds can be tested for their ability to disrupt B. subtilis growth, but fail to affect E. coli cultures (which use SAM but lack SAM riboswitches). To screen for lead compound candidates, parallel bacterial cultures can be grown as follows: \. B. subtilis can be cultured in glucose minimal media in the absence of exogenously supplied SAM analogs.
2. B. subtilis can be cultured in glucose minimal media in the presence of exogenously supplied SAM analogs (high doses can be selected, to be followed by repeated experiments designed to test a concentration range of the putative drug compound).
3. E. coli can be cultured in glucose minimal media in the presence of exogenously supplied SAM analogs (high doses will be selected, to be followed by repeated experiments designed to test a concentration range of the putative drug compound). Fitness of the various cultures can be compared by measurement of cellular doubling times. A range of concentrations for the drug compounds can be tested using cultures grown in microtiter plates and analyzed using a microplate reader from another laboratory. Culture 1 is expected to grow well. Drugs that inhibit culture 2 may or may not inhibit growth of culture 3. Drugs that similarly inhibit both culture 2 and culture 3 upon exposure to a wide range of drug concentrations can reflect general toxicity induced by the exogenous compound {i.e., inhibition of many different cellular processes, in addition or in place of riboswitch inhibition). Successful drug candidates identified in this screen will inhibit E. coli only at very high doses, if at all, and will inhibit B. subtilis at much (>10-fold) lower concentrations. As derivization points on SAM are identified, efficient identification of lead drug compounds will require larger-scale screening of appropriate SAM analogs or generic chemical libraries. A high-throughput screen can be created by one or two different methods using nucleic acid engineering principles. Adaptation of both fluorescent sensor designs outlined below to formats that are compatible with high-throughput screening assays can be accommodated by using immobilization methods or solution-based methods.
One way to create a reporter is to add a third function to the riboswitch by adding a domain that catalyzes the release of a fluorescent tag upon SAM binding to the riboswitch domain. In the final reporter construct, this catalytic domain can be linked to the yitJ SAM riboswitch through a communication module that relays the ligand binding event by allowing the correct folding of the catalytic domain for generating the fluorescent signal. This can be accomplished as outlined below. SAM RiboReporter Pool Design: A DNA template for in vitro transcription to
RNA was constructed by PCR amplification using the appropriate DNA template and primer sequences. In this construct, stem II of the hammerhead (stem Pl of the SAM aptamer) has been randomized to present more than 250 million possible sequence combinations, wherein some inevitably will permit function of the ribozyme only when the aptamer is occupied by SAM or a related high-affinity analog. Each molecule in the population of constructs is identical in sequence except at the random domain where multiple copies of every possible combination of sequence will be represented in the population.
SAM RiboReporter Selection: The in vitro selection protocol can be a repetitive iteration of the following steps:
1. Transcribe RNA in vitro by standard methods. Include [α-32P] UTP to incorporate radioactivity throughout the RNA.
2. Purify full length RNA on denaturing PAGE by standard methods.
3. Incubate full length RNA (~100 pmoles) in negative selection buffer containing sufficient magnesium for catalytic activity (20 rnM) but no SAM. Incubate 4 h at room temperature (~23°C), with thermocycling or alkaline denaturation as needed to preclude the emergence of selfish molecules.
4. Purify full length RNA on denaturing PAGE and discard RNAs that react in the absence of SAM. 5. Incubate in positive selection buffer containing 20 mM Mg2+ and SAM (pH
7.5 at 230C). Incubate 20 min at room temperature.
6. Purify cleaved RNA on denaturing PAGE to recover switches that bound SAM and allowed self-cleavage of the RNA.
7. Reverse transcribe RNA to DNA. 8. PCR amplify DNA with primers that reintroduced cleaved portion of RNA.
The concentration of SAM in step 4 can be 100 μM initially and can be reduced as the selection proceeds. The progress of recovering successful communication modules can be assessed by the amount of cleavage observed on the purification gel in step 6. The selection endpoint can be either when the population approaches 100% cleavage in 10 nM SAM (conditions for maximal activity of the parental ribozyme and riboswitch) or when the population approaches a plateau in activity that does not improve over multiple rounds. The end population can then be sequenced. Individual communication module clones can be assayed for generation of a fluorescent signal in the screening construct in the presence of SAM.
A fluorescent signal can also be generated by riboswitch-mediated triggering of a molecular beacon. In this design, riboswitch conformational changes cause a folded molecular beacon tagged with both a fluor and a quencher to unfold and force the fluor away from the quencher by forming a helix with the riboswitch. This mechanism is easy to adapt to existing riboswitches, as this method can take advantage of the ligand- mediated formation of terminator and anti-terminator stems that are involved in transcription control. To use riboswitches to report ligand binding by binding a molecular beacon, the appropriate construct must be determined empirically. The optimum length and nucleotide composition of the molecular beacon and its binding site on the riboswitch can be tested systematically to result in the highest signal-to-noise ratio. The validity of the assay can be determined by comparing apparent relative binding affinities of different SAM analogs to a molecular beacon-coupled riboswitch (determined by rate of fluorescent signal generation) to the binding constants determined by standard in-line probing.
Specific Embodiments Disclosed is the atomic structure of a natural guanine-responsive riboswitch comprising an atomic structure as depicted in Figure 6. The atomic coordinates of the atomic structure can comprise the atomic coordinates listed in Table 6 for atoms depicted in Figure 6. The atomic coordinates of the atomic structure can comprise the atomic coordinates listed in Table 6.
Also disclosed is a method of identifying a compound that interacts with a riboswitch comprising: (a) modeling the atomic structure of claim 1 with a test compound; and (b) determining if the test compound interacts with the riboswitch. Also disclosed is a method of identifying compounds that interact with a riboswitch comprising contacting the riboswitch with a test compound, wherein a fluorescent signal is generated upon interaction of the riboswitch with the test compound.
Also disclosed is a method of killing bacteria, comprising contacting the bacteria with an analog of a compound that interacts with the riboswitch.
Also disclosed is a method of killing bacteria, comprising contacting the bacteria with a compound that interacts with the riboswitch.
Also disclosed is a method of identifying a compound that interacts with a riboswitch comprising: (a) identifying the crystal structure of the riboswitch; (b) modeling the riboswitch with a test compound; and (c) determining if the test compound interacts with the riboswitch.
Also disclosed is a regulatable gene expression construct comprising a nucleic acid molecule encoding an RNA comprising a riboswitch operably linked to a coding region, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch regulates expression of the RNA, wherein the riboswitch and coding region are heterologous.
Also disclosed is a method of detecting a compound of interest, the method comprising bringing into contact a sample and a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch is activated by the compound of interest, wherein the riboswitch produces a signal when activated by the compound of interest, wherein the riboswitch produces a signal when the sample contains the compound of interest.
Also disclosed is a method comprising (a) testing a compound for inhibition of gene expression of a gene encoding an RNA comprising a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the inhibition is via the riboswitch, and (b) inhibiting gene expression by bringing into contact a cell and a compound that inhibited gene expression in step (a), wherein the cell comprises a gene encoding an RNA comprising the riboswitch, wherein the compound inhibits expression of the gene by binding to the riboswitch. Determining if the test compound interacts with the riboswitch can comprise determining a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch. Determining if the test compound interacts with the riboswitch can comprise determining one or more predicted bonds, one or more predicted interactions, or a combination, of the test compound with the model of the riboswitch. The riboswitch can be a guanine riboswitch. The guanine riboswitch can be a riboswitch in Table 5. Atomic contacts can be determined by modeling the riboswitch with a test compound, thereby determining the interaction of the test compound with the riboswitch.
The method can further comprise the steps of: (c) identifying analogs of the test compound; (d) determining if the analogs of the test compound interact with the riboswitch. The compound can be hypoxanthine. A gel-based assay can be used to determine if the test compound interacts with the riboswitch. A chip-based assay can be used to determine if the test compound interacts with the riboswitch.
The test compound can interact via van der Waals interactions, hydrogen bonds, electrostatic interactions, hydrophobic interactions, or a combination. The riboswitch can comprise an RNA cleaving ribozyme. A fluorescent signal can be generated when a nucleic acid comprising a quenching moiety is cleaved. Molecular beacon technology can be employed to generate the fluorescent signal. The method can be carried out using a high throughput screen.
The riboswitch can be a guanine riboswitch. The guanine riboswitch can be a riboswitch in Table 5. The riboswitch can be activated by a trigger molecule, wherein the riboswitch produces a signal when activated by the trigger molecule. The riboswitch can change conformation when activated by a compound of interest, wherein the change in conformation produces a signal via a conformation dependent label. The riboswitch can change conformation when activated by a compound of interest, wherein the change in conformation causes a change in expression of an RNA linked to the riboswitch, wherein the change in expression produces a signal. The signal can be produced by a reporter protein expressed from the RNA linked to the riboswitch.
Examples
A. Example 1: Structure of a Natural Guanine-Responsive Riboswitch Complexed with the Metabolite Hypoxanthine
To understand how the natural biosensor functions, the structure of the guanine- binding domain bound to hypoxanthine has been solved by X-ray crystallography (Table 2 and Figure 12A), a biologically relevant ligand of the guanine-responsive riboswitch. In the hypoxanthine-bound state, the RNA adopts a three-dimensional fold in which the terminal loops (L2 and L3) form a series of interconnecting hydrogen bonds (see pairing scheme in Figure 5) to bring the P2 and P3 helices parallel to each other (Figure 5C). Unfavorable electrostatic interactions, a result of the juxtaposition of regions of the ribose-phosphate backbone, are neutralized through the binding of several cations between the two backbones (Figure 10). Anchoring the global helical arrangement of the RNA are numerous tertiary contacts around the three-way junction, dominated by the J2/3 loop (Figure 5C) interacting with bound hypoxanthine, the Pl helix, and the Jl/2 and J3/1 loops.
The purine-binding pocket is created by conserved nucleotides in and around the three-way junction element. These nucleotides help to define the purine-binding pocket through the formation of two sets of base triples above and below (Figure 5A). The 3' side of the pocket is flanked by a water-mediated U22— A52A73 base triple and an A23G46-C53 triple; in both cases, the Watson-Crick face of the adenosine interacts with the minor groove of a Watson-Crick pair (Figure 6A). The other side is created by sequential base triples between conserved Watson— Crick pairs at the top of helix Pl (U20-A76 and A21-U75) and the Watson-Crick faces of U49 and C50, respectively, which fasten the J2/3 loop to the Pl helix. This extensive use of base triples to create a ligand-binding site is very similar to in vitro selected RNA aptamers that recognize planar ring systems, as exemplified by the structures of the theophylline (Zimmermann, G. R., Jenison, R. D., Wick, C. L., Sirnmorre, J.-P. & Pardi, A. Interlocking structural motifs mediate molecular discrimination by a theophylline-binding RNA. Nature Struct. Biol. 4, 644-649 (1997)), FMN (Fan, P., Suri, A. K., Fiala, R., Live, D. & Patel, D. J. Molecular recognition in the FMN-RNA aptamer complex. J. MoI. Biol. 258, 480-500 (1996)) and malachite green (Baugh, C, Grate, D. & Wilson, C. 2.8 A crystal structure of the malachite green aptamer. J. MoI. Biol. 301, 117-128 (2000)) binders. Thus, artificially selected RNAs use some of the same principles for creating binding sites for small-molecule ligands as their naturally occurring counterpart.
Hypoxanthine is bound through an extensive series of hydrogen bonds with nucleotides U22, U47, U51 and C74 (Figure 6B), forming a base quadruple that stacks directly on the Pl helix. The structure clearly shows that the mRNA contacts all of the functional groups in the ligand, thereby explaining the specificity for hypoxanthine observed in biochemical studies (Mandal, M. & Breaker, R. R. Gene regulation by riboswitches. Nature Rev. MoI. Cell. Biol. 5, 451-463 (2004)). In addition, guanine binding can be readily rationalized, because there is room in the structure to accommodate an exocyclic amine at the 2-position of the bound purine. This additional functional group can form hydrogen bonds with the carbonyl oxygens at the 2-position of C74 and U51, consistent with the tenfold higher affinity of this riboswitch for guanine over hypoxanthine. One of the most marked features is how the ligand is almost completely enveloped by the RNA (Figure 6C): 97.8% of the surface of hypoxanthine is inaccessible to bulk solvent in the complex. The almost complete use of a ligand for recognition by an RNA is unprecedented among structurally characterized aptamers, although selection strategies that do not involve immobilization of the ligand on a solid support (Koizumi, M., Soukup, G. A., Kerr, J. N. & Breaker, R. R. Allosteric selection of ribozymes that respond to the second messengers cGMP and cAMP. Nature Struct. Biol. 6, 1062-1071 (1999)) hold promise for the development of RNAs that are capable of a similar degree of ligand burial. This finding also implies that the local binding site must undergo a substantial conformational change upon ligand binding, because it is not possible for hypoxanthine to gain access to a preformed binding site, which is a common feature of many RNA-ligand interactions (Leulliot, N. & Varani, G. Current topics in RNA- protein recognition: control of specificity and biological function through induced fit and conformational capture. Biochemistry 40, 7947-7956 (2001); Williamson, J. R. Induced fit in RNA-protein recognition. Nature Struct. Biol. 7, 834-837 (2000)). Finally, this mode of purine recognition also easily explains its ability to change specificity from guanine to adenine through a single point mutation at nucleotide 74 from cytosine to uracil (Mandal, M. & Breaker, R. R. Adenine riboswitches and gene activation by disruption of a transcription terminator. Nature Struct. MoI. Biol. 11, 29-35 (2004)). Although the Watson-Crick pairing preference changes from guanine to adenine, the other interactions between the purine and the RNA are unchanged.
The tertiary architecture is stabilized through a unique loop— loop interaction capping helices P2 and P3 that is defined by two previously unobserved types of base quadruple. Each quadruple comprises a Watson-Crick pair with a noncanonical pair docked into its minor groove (G38-C60 and G37-C61 interacting with the A33A66 and U34A65 pairs, respectively; Figure 7A). This arrangement bears a strong similarity to how adenosines pack into an A-form helix in the commonly found type I/II A-minor triple motif (Doherty, E. A., Batey, R. T., Masquida, B. & Doudna, J. A. A universal mode of helix packing in RNA. Nature Struct. Biol. 8, 339-343 (2001); Nissen, P., Ippolito, J. A., Ban, N., Moore, P. B. & Steitz, T. A. RNA tertiary interactions in the large ribosomal subunit: the A-minor motif. Proc. Natl Acad. ScL USA 98, 4899-4903 (2001)). Mutation of any one of these eight nucleotides would disrupt the intricate hydrogen-bonding network that cements the core of the loop-loop interaction, explaining their strict phylogenetic conservation. Stacked on these quadruples are two noncanonical pairs, including a side-by-side interaction between G62 and U63 akin to the A-platform motif (Gate, J. H. et al. RNA tertiary structure mediation by adenosine platforms. Science 273, 1696-1699 (1996) ) and the bulged-G motif (Correll, C. C, Beneken, J., Plantinga, M. J., Lubbers, M. & Chan, Y. L. The common and the distinctive features of the bulged-G motif based on a 1.04 A resolution RNA structure. Nucleic Acids Res. 31 , 6806-6818 (2003)) (Figure 7B). The G62U63 pair is further stabilized through hydrogen bonding to the backbone of the opposite loop.
The interaction between the two terminal loops is essential for ligand binding by the guanine riboswitch. Differences in the stability of the RNA in the absence and presence of guanine, as determined by in-line probing experiments, indicate that this element of the RNA tertiary structure forms independently of guanine or hypoxanthine. Replacement of the wild-type loops with stable UUCG tetraloops (Molinaro, M. & Tinoco, I. Jr Use of ultra stable UNCG tetraloop hairpins to fold RNA structures: thermodynamic and spectroscopic applications. Nucleic Acids Res. 23, 3056-3063 (1995); Figure 11), which eliminates the tertiary interaction, abolishes the ability of the riboswitch to recognize hypoxanthine; this shows that it is crucial for promoting a high- affinity interaction (Figure 8). Thus, although this tertiary interaction does not contact the ligand directly, it is significant in globally organizing the riboswitch for purine recognition. Similarly, natural sequences of various hammerhead ribozymes contain loop-loop interactions that significantly accelerate their rate of cleavage under physiological conditions (De Ia Pena, M., Gago, S. & Flores, R. Peripheral regions of natural hammerhead ribozymes greatly increase their self-cleavage activity. EMBO J. 22, 5561-5570 (2003); Khvorova, A., Lescoute, A., Westhof, E. & Jayasena, S. D. Sequence elements outside the hammerhead ribozyme catalytic core enable intracellular activity. Nature Struct. Biol. 10, 708-712 (2003)). In each, the tertiary interaction constrains the RNA in a way that allows the three-way junction containing the functional site to respond to physiological concentrations of ligand or Mg2+ ions. In high Mg2+ ion concentrations (20 mM MgC12), the guanine riboswitch has a very high affinity for guanine and hypoxanthine (an observed dissociation constant (Kd) of 5 nM and 50 nM, respectively). In Escherichia coli, however, repression of transcription of the xpt-pbuX Ό^QXGΆ by the purR repressor occurs in response to 1-10 μM concentrations of purine. To determine whether the riboswitch responds to similar concentrations of purine, which probably reflect physiological levels, its affinity for hypoxanthine was determined by isothermal titration calorimetry at varying ionic conditions (Table 1). At a more physiological ionic strength (0.25-1 mM Mg2+), the RNA showed an affinity for hypoxanthine (observed Xd = 3-4 μM) similar to that of the purR repressor protein (observed Ka = 9 μM for the E. coli variant) (Choi, K. Y. & Zalkin, H. Structural characterization and corepressor binding of the Escherichia coli purine repressor. J. Bacteriol. 174, 6207-6214 (1992)). Thus, both RNA- and protein- based regulatory mechanisms seem to be tuned to respond to similar concentrations of intracellular metabolite.
Table 1 Thermodynamic parameters for hypoxarrthitiθ binding at 300C MgCI2CmM) fC<j (μM)* ΔHebS {kG&lmαr1f ASOb8 (OQ-ImOr1 K"1)
20 0.732 ± 0.034 -33.5 ± 0.43 -82
6.0 1.34 ± 0.094 -28.4 ± 0.4S -67
1.0 2.99 ± 0.19 -«.4 ± 093 -91
0.25 4.00 ± 0.19 -41.9 + 0.39 -113
Of HB ND ND
*7he reported errors represent the s,e.m. of the nonlinear least squares fit to the data, MD, no detectable binding. f "TWs reaction contained 2 mM Na≥-EDTA.
The structure further indicates how RNA directly transduces intracellular metabolite concentration into changes in gene expression through a proposed Rho- independent transcriptional regulation mechanism (Mandal, M., Boese, B., Barrick, J. E., Winkler, W. C. & Breaker, R. R. Riboswitches control fundamental biochemical pathways in Bacillus subtilis and other bacteria. Cell 113, 577-586 (2003); Johansen, L. E., Nygaard, P., Lassen, C, Agerso, Y. & Saxild, H. H. Definition of a second Bacillus subtilis pur regulon comprising the pur and xpt-pbuX operons plus pbuG, nupG (yxj A), and pbuE (ydhL). J. Bacteriol. 185, 5200-5209 (2003)). Transcription of the initial 90 nucleotides results in the formation of the guanine-binding domain, with the L2 and L3 loops interacting to begin to form the tertiary structure of the RNA (Figure 5B). This partially organizes the three-way junction motif for efficient ligand binding, although the junction must be unstructured to some degree to allow access of the purine to the binding pocket. At sufficiently high concentrations of guanine or hypoxanthine, the nucleobase binds the pocket, stabilizing the short Pl helix through stacking interactions and base triples with J2/3 and preventing incorporation of Pl nucleotides into an antiterminator element. The mRNA can then form a classic Rho-independent terminator stem-loop, and transcription stops. By contrast, in low intracellular concentrations of guanine or hypoxanthine, the 3' side of the isolated Pl helix is readily conscripted to form a stable antiterminator element, facilitating continued transcription. Thus, hypoxanthine is the keystone for the riboswitch, enabling the riboswitch to direct mRNA folding along two different pathways through its ability to stabilize one conformation over another, resulting in an effective biosensor of intracellular guanine, hypoxanthine and xanthine concentrations. 1. Methods Crystals: RNA was synthesized and purified by a native affinity-tag purification method (Kieft, J. S. & Batey, R. T. A general method for rapid and nondenaturing purification of RNAs. RNA 10, 988-995 (2004)) and exchanged it into a buffer containing 10 mM K+-HEPES (pH 7.5) and 1 mM hypoxanthine. Crystals were grown by mixing this solution in a 1:1 ratio with mother liquor (containing 25% PEG 3,000 (w/v), 200 mM ammonium acetate and 10 mM cobalt hexamine) and incubating it for 2-3 weeks at room temperature.
Data collection and processing: A single wavelength anomalous diffraction experiment was carried out at the CuKo; wavelength on crystals cryoprotected in mother liquor plus 25% 2-methyl-2,4-pentanediol; clear diffraction was observed to at least 1.8 A resolution. The data was indexed, integrated, and scaled using D*TREK
(Pflugrath, J. W. The finer things in X-ray diffraction data collection. Acta Crystallogr. D 55, 1718-1725 (1999)), identified heavy atom sites with SOLVE (Terwilliger, T. SOLVE and RESOLVE: automated structure solution, density modification and model building. J. Synchrotron Radiat. 11, 49-52 (2004)), and carried out refinement with CNS (Brunger, A. T. et al. Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D 54, 905-921 (1998)) to obtain a model with final Rxtal and i?free values of 17.8% and 22.8%, respectively. The model contains all RNA atoms except A82, for which electron density was not observed, along with the hypoxanthine ligand.
RNA synthesis and purification: A 68 nucleotide construct containing the sequence for the guanine riboswitch of the pbuX-xpt operon of B. subtilis was constructed using overlapping DNA oligonucleotides (Integrated DNA Technologies) and standard PCR methods. The resulting DNA fragment, which contained EcoRI and NgoMDS restriction sites at the 5' and 3' ends, respectively, was ligated into pRAV12 (Kieft, J. S. & Batey, R. T. A general method for rapid and nondenaturing purification of RNAs. RNA 10, 988-995 (2004)), a plasmid vector designed for the native purification of RNA; the resulting vector was sequence verified. DNA template for in vitro transcription was generated by PCR from the resulting vector using primers directed against the T7 RNA polymerase promoter at the 5' end and the 3' side of the purification affinity tag (5', GCGCGCGAATTCTAATACGACTCACTATAG (SΕQ ID NO: 10; 3', GGATCCTGCCCAGGGCTG; SΕQ ID NO: 11). RNA was transcribed in a 12.5 mL reaction containing 30 niM Tris-HCl (pH 8.0), 10 mM DTT, 0.1 % Triton X-100, 0.1 rnM spermidine-HCl, 8 mM each NTP, 40 mM MgCl2, 50 μg/mL T7 RNA polymerase, and 1 mL of ~0.5 μM template (Doudna, J. A. Preparation of homogeneous ribozyme RNA for crystallization. Methods MoI Biol 74, 365-70 (1997)), supplemented with 1 unit/mL inorganic pyrophosphatase to suppress formation of insoluble magnesium pyrophosphate. The reaction was incubated for 1.5 hr at 37 °C. Native purification of the RNA was performed as described. Following elution of the RNA from the affinity column, it was immediately concentrated using a 10,000 MWCO centrifugal filter device (Amicon, Ultra- 15). After all of the RNA had been concentrated to ~500 μL, it was exchanged against three 15 mL aliquots of a buffer containing 10 mM K+-HEPES, pH 7.5 and 1 mM hypoxanthine using the centrifugal concentrator. The final RNA was concentrated to 450 μM, as judged by it absorbance at 260 nm and a calculated extinction coefficient based upon its base composition.
RNA crystallization: Crystals of the guanine riboswitch were obtained using the hanging drop method in which the RNA solution was mixed in a 1 : 1 ratio with a reservoir solution containing 10 mM cobalt hexammine, 200 mM ammonium acetate and 25 % PEG 2K. The crystallization trays were incubated at room temperature (23 0C), with crystals obtaining their maximum size (0.05 x 0.05 x 0.2 mm) in 7-14 days. Cryoprotection of the crystals was performed by adding 30 μL of a solution comprising the mother liquor plus 25 % (v/v) 2-methyl-2,4 pentanediol (MPD) for five minutes and flash-frozen in liquid nitrogen. Diffraction data was collected on a home X-ray source (Rigaku MSC) using CuKa radiation; collection of anomalous data was achieved by an inverse-beam experiment. The data was indexed, integrated and scaled with
CrystalClear (Rigaku MSC) and D*TREK (Pflugrath, J. W. The finer things in X-ray diffraction data collection. Acta Crystallogr D Biol Crystallogr 55, 1718-1725 (1999)). The crystals belong to the C2 spacegroup (a = 132.30 A, b = 35.25 A, c = 42.23 A, d= 90.95°) and contain one molecule per asymmetric unit (refer to Table 2 for all crystallographic statistics). All data used in subsequent phasing and refinement was collected from one individual crystal.
Phasing and structure determination. Phases were determined using a single wavelength anomalous diffraction (SAD) experiment (Dauter, Z., Dauter, M. & Dodson, E. Jolly SAD. Acta Crystallogr D Biol Crystallogr 58, 494-506 (2002); Rice, L. M., Earnest, T. N. & Brunger, A. T. Single-wavelength anomalous diffraction phasing revisited. Acta Crystallogr D Biol Crystallogr 56 ( Pt 11), 1413-20 (2000)) and diffraction data extending to 1.95 A resolution. In this experiment, cobalt was treated as the heavy atom derivative, which has weak anomalous signal at CuKa wavelength (f = - 2.464, f" = 3.608). With SOLVE (Terwilliger, T. SOLVE and RESOLVE: automated structure solution, density modification and model building. J Synchrotron Radiat 11, 49-52 (2004)), a 10 heavy atom solution was found, with a figure of merit of 0.38 and a score of 40.9. Phases were determined using this heavy atom model and its mirror image using CNS (Brunger, A. T. et al. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr D Biol Crystallogr 54 ( Pt 5), 905-21 (1998)) and improved with density modification. Only one of the heavy atom models yielded an electron density map in which there was clear backbone connectivity and base stacking; this map was sufficiently clear to be able to trace the majority of the RNA. Iterative rounds of model building in O (Jones, T. A., Zou, J. Y., Cowan, S. W. & Kjeldgaard. Improved methods for building protein models in electron density maps and the location of errors in these models. Acta Crystallogr A 47 ( Pt 2), 110-9 (1991)) and refinement in CNS were performed while monitoring Rfree to ensure that it improved after each round. Following building of all nucleotides in the RNA except for the 3'- terminal A82, two rounds of water picking were performed with a total of 332 water molecules placed into the model, using the restrictions that each solvent must be in hydrogen bonding distance to another atom and have a B-factor of less than 80. During this phase of model building, 12 cobalt hexammine ions were identified on the basis of having inner-sphere atoms with clear octahedral coordination geometry and their positions verified by an anomalous difference map; a single spermidine and an acetate ion were also placed within the model at this point. Sugar puckers were constrained to be C3'-endo, except for residues 22, 34, 35, 47, 49 and 62 which were restrained as C2'- endo.
Isothermal Titration Calorimetry. RNA for isothermal titration calorimetry (ITC) (Leavitt, S. & Freire, E. Direct measurement of protein binding energetics by isothermal titration calorimetry. Curr Opin Struct Biol 11, 560-6 (2001); Pierce, M., Raman, C. S. & Nail, B. T. Isothermal titration calorimetry of protein-protein interactions. Methods 19, 213-21 (1999)) was transcribed and purified as described above and exhaustively dialyzed against buffer containing 10 mM K+-HEPES, pH 7.5, 100 mM KCl and varying concentrations OfMgCl2 at 4 °C for 24-48 hours. Following dialysis, the buffer was used to prepare a solution of hypoxanthine at a concentration that was approximately 10-fold higher than the RNA (typically about 120 DM and 12 DM, respectively). All experiments were performed with a Microcal MCS ITC instrument at 30 0C. Following degassing of the RNA and hypoxanthine solutions, a titration of 29 injections of 10 μL of hypoxanthine into the RNA sample was performed, such that a final molar ratio of between 2:1 and 3:1 hypoxanthine:RNA was achieved (Recht, M. I. & Williamson, J. R. Central domain assembly: thermodynamics and kinetics of S6 and Sl 8 binding to an S15-RNA complex. J MoI Biol 313, 35-48 (2001)). Titration data was analyzed using Origin ITC software (Microcal Software Inc.) and fit to a single-site binding model.
Table 2: Crystallography statistics
Data Collection
Spacegroup: C2 a, b, c 132.30, 35.25, 42.23 A β 90.95°
Resolution': 20 -1.95 A (2.02 - 1.95 A)
Wavelength: 1.5418
% Completeness: 92.9 % (85.3 %)
Measured reflections: 76,950 Unique reflections: 25,789 Average redundancy: 2.9 (2.45)
I/σ: 21.5 (6.3)
Rsym • 3.7% (13.5 %)
Phasing
Phasing Power0: 1.62 (0.95)
Rcullis • 0.67
Figure of Merit
SOLVE: 0.38
CNS, after dens, mod.: 0.86
Refinement
Resolution: 20 - 1.95 A (2.02 - 1.95 A)
Number of reflections:
Working: 23,356 (84.1%)
Test set: 2,430 (8.8 %)
Rxtai : 17.8 (24.8)
Rfree: 22.8 (31.8) r.m.s.d. bonds: 0.0095 A r.m.s.d. angles: 1.70° cross- validated Luzzati coordinate error: 0.25 A cross-validated Sigma-a coordinate error: 0.23 A
Average B-factor: 22.4 A2
Figure imgf000102_0001
B. Example 2: Use of Riboswitch Analogs as Antimicrobials
Atomic-resolution models were generated for both a guanine riboswitch aptamer and a related aptamer for adenine (Serganov A, Yuan YR, Pikovskaya O, Polonskaia A, Malinina L, Phan AT, Hobartner C, Micura R, Breaker RR, Patel DJ. (2004) Structural basis for discriminative regulation of gene expression by adenine- and guanine-sensing mRNAs. Chem. Biol. 11:1729-1741). These models are essentially identical to that proposed by Batey et al. (Batey, R.T., Gilbert, S.D., Montange, R.K. (2004) Structure of a natural guanine-responsive riboswitch complexed with the metabolite hypoxanthine. 18:432:411-415.), and therefore shows that the structural model used to design the compounds depicted in Figure 12B is accurate. Furthermore, the similarity between the guanine and adenine riboswitch aptamers makes it clear that the same approaches used to design guanine analogs can be used to inhibit the adenine riboswitch by similarly- designed adenine analogs.
The ability of the guanine analogs depicted in Figure 12B to bind the guanine riboswitch in vitro has been tested, as well as their antibacterial activity toward Bacillus subtilis. Of these analogs, six inhibit B. subtilis growth in media lacking guanine (Table 3). One of these, compound G-014, has been tested for inhibitory activity toward B. subtilis in rich media and toward several clinically-relevant pathogens (Table 4). Importantly, G-014 inhibits most of these pathogens, including Bacillus anthracis and Methicillin-resistant Staphylococcus aureus with similar efficacy as vancomycin. To more fully probe the antibacterial mechanism of these guanine analogs, two gene reporter systems were developed. In one system, a guanine riboswitch has been fused to Green Fluorescent Protein (GFP), such that high concentrations of guanine or a riboswitch-binding guanine analog inhibit the expression of GFP (Figure 14). Thus, a decreased level of GFP fluorescence indicates binding to and modulation of the guanine riboswitch inside bacteria. A similar system has been constructed with a guanine riboswitch upstream of /3-galactosidase. Again, a decreased level of /3-galactosidase activity indicates that a compound represses the guanine riboswitch in vivo. Table 3 summarizes the result of these assays for each of the guanine analogs. A correlation between compounds that kill cells and those that control gene expression has also been established. Similar systems can be used involving any suitable reporter gene, such as genes that encode a suitable reporter protein or a suitable reporter RNA (such as a ribozyme).
It has also been determined that G-014 is bactericidal against Bacillus subtilis. In a direct comparison, G-014 reduces the number of viable colony forming units ("CFU") at a rate equal to carbenicillin (Figure 15).
Table 3. The activity of each guanine analog1.
Kd for binding to β-gal reporter the riboswitch in MIC B. subtilis MIC B. subtilis GFP reporter activity
Compound vitro (nM) (μM, minimal) (μM, rich) activity (relative)2
G-001 >100,000 >100 ND ND 1.0
G-002 <1 >100 ND May repress 0.73
G-003 <1 >100 ND No repression 0.49
G-004 <1 500 ND Represses 0.62
G-005 -10 >100 ND No repression 0.99
G-006 ND ND ND ND ND j
G-007 ND ' ND ND ND ND
G-008 <1 >100 ND ND ND
G-009 <1 >100 ND ND ND
G-010 <1 >100 ND ND 0.97
G-011 <1 >100 ND ND 0.78
G-012 <1 >100 ND ND ND
G-013 <1 >100 ND may repress 1.0
G -014 <1 6.8 ± 0.9 62.5 Can't detect Represses
G-015 <1 >100 ND ND ND
G-016 <1 inhibits ND Represses 0.65
G-017 <1 >100 ND ND ND
G-018 <1 >100 ND ND ND
G-019 <1 >100 ND ND ND
G-020 <1 >100 ND ND ND
G-021 <1 >100 ND ND ND
G-022 ND >100 ND ND ND
G-023 ND inhibits ND ND ND
G-024 ND >100 ND ND ND
G-025 ND inhibits ND ND 0.25
G-026 ND >100 ND ND ND
1 ND indicates that this value has not yet been determined. Compounds in bold type inhibit bacterial growth.
2 Relative β-gal reporter activity indicates Miller Units with the compound divided by Miller Units without the compound. 1.0 = no repression, 0 = complete repression. Table 4. The antibacterial activity of G-014 toward clinically relevant pathogens.
Figure imgf000105_0001
SA, Staphylococcus aureus; MRSA, methicillin-resistant Staphylococcus aureus; EF, Enterococcus faecalis; SP, Streptococcus pneumoniae; HI, Haemophilus influenza; MS, Mycobacterium smegmatis. 2 These pathogens do not have a guanine riboswitch.
Table 5. Purine Riboswitches
# STOCKHOLM 1.0
#=GS NC_003869.1/586365-586467 organism Thermoanaerobacter tengcongensis MB4
#=GS NC_002570.2/1593074-1592972 organism Bacillus halodurans C-125
#=GS NC_002570.2/676475-676577 organism Bacillus halodurans C-125
#=GS NC_000964.2/4004541-4004643 organism Bacillus subtilis subsp. subtilis str. 168
#=GS NC_003366.1/513088-512986 organism Clostridium perfringens str. 13
#=GS NC_000964.2/693775-693877 organism Bacillus subtilis subsp. subtilis str. 168
#=GS NC_002570.2/650309-650411 organism Bacillus halodurans C-125
#=GS NC_006270.2/2294931-2294831 organism Bacillus licheniformis ATCC 14580
#=GS NC_006322.1/2295788-2295688 organism Bacillus licheniformis ATCC 14580
#=GS NC_004193.1/1103943-1104045 organism Oceanobacillus iheyensis HTE831
#=GS NC_003909.8/1650407-1650509 organism Bacillus cereus ATCC 10987
#=GS NC_003997.3/1497571-1497673 organism Bacillus anthracis str. Ames
#=GS NC_005945.1/1497645-1497747 organism Bacillus anthracis str. Sterne
#=GS NC_005957.1/1521880-1521982 organism Bacillus thuringiensis serovar konkukian str. 97-27
#=GS NC_006274.1/1532482-1532584 organism Bacillus cereus ZK
#=GS NC_007530.2/1497694-1497796 organism Bacillus anthracis str 'Ames Ancestor1 #=GS NZ_AAAC02000001.1/1985987-1986089 organism Bacillus anthracis str A2012
#=GS NZ_AAEK01000008.1/96466-96364 organism Bacillus cereus G9241
#=GS NZ_AAEN01000011.1/67143-67245 organism Bacillus anthracis str CNEVA-9066
#=GS NZ_AAEO01000025.1/66855-66957 organism Bacillus anthracis str AlO55
#=GS NZ_AAEP01000035.1/69365-69467 organism Bacillus anthracis str VoHum
#=GS NZ_AAEQ01000029.1/70848-70950 organism Bacillus anthracis str Kruger B
#=GS NZ_AAER01000023.1/58182-58284 organism Bacillus anthracis str Western North America USA6153
#=GS NZ_AAES01000034.1/69121-69223 organism Bacillus anthracis str. Australia 94
#=GS NC_004722.1/1515239-1515341 organism Bacillus cereus ATCC 14579
#=GS NC_000964.2/697711-697813 organism Bacillus subtilis subsp. subtilis str 168
#=GS NC_002976.3/54816-54919 organism Staphylococcus epidermidis RP62A
#=GS NC_004461.1/2433047-2432944 organism Staphylococcus epidermidis ATCC 12228
#=GS NC_003030.1/2824953-2824855 organism Clostridium acetobutylicum ATCC 824
#=GS NC_003909.8/296465-296567 organism Bacillus cereus ATCC 10987
#=GS NC_003997.3/262601-262703 organism Bacillus anthracis str. Ames
#=GS NC_004722.1/261558-261660 organism Bacillus cereus ATCC 14579
#=GS NC_005945.1/262614-262716 organism Bacillus anthracis str. Sterne
#=GS NC_005957.1/268537-268639 organism Bacillus thuringiensis serovar konkukian str. 97-27
#=GS NC_006274.1/267835-267937 organism Bacillus cereus ZK
#=GS NC_007530.2/262601-262703 organism Bacillus anthracis str •Ames Ancestor'
#=GS NZ_AAAC02000001.1/796036-796138 organism Bacillus anthracis str A2012
#=GS NZ_AAEK01000017.1/88397-88499 organism Bacillus cereus G9241
#=GS NZ_AAEN01000023.1/16980-17082 organism Bacillus anthracis str CNEVA-9066
#=GS NZ_AAEO01000030.1/17090-17192 organism Bacillus anthracis str A1055
#=GS NZ_AAEP01000043.1/12962-13064 organism Bacillus anthracis str Vollum
#=GS NZ_AAEQ01000040.1/4962-4860 organism Bacillus anthracis str Kruger B
#=GS NZ_AAER01000030.1/4374-4272 organism Bacillus anthracis str Western North America USA6153
#=GS NZ_AAES01000040.1/14760-14862 organism Bacillus anthracis str. Australia 94
#=GS NC_006270.2/693198-693300 organism Bacillus licheniformis ATCC 14580
#=GS NC_006322.1/692981-693083 organism Bacillus licheniformis ATCC 14580
#=GS NC_004722.1/259601-259703 organism Bacillus cereus ATCC 14579
#=GS NC_000964.2/2319410-2319310 organism Bacillus subtilis subsp. subtilis str. 168
#=GS NC_003030.1/2905050-2904948 organism Clostridium acetobutylicum ATCC 824
#=GS NC_003210.1/611119-611017 organism Listeria monocytogenes EGD-e
#=GS NZ_AADQ01000001.1/2431-2533 organism Listeria monocytogenes str. l/2a F6854
#=GS NC_003212.1/610156-610054 organism Listeria innocua Clipll262
#=GS WC_003366.1/422820-422922 organism Clostridium perfringens str. 13
#=GS NC_002973.5/617846-617744 organism Listeria monocytogenes str. 4b F2365
#=GS NZ_AM)R01000111.1/1598-1496 organism Listeria monocytogenes str. 4b H7858
#=GS NC_003909.8/294509-294611 organism Bacillus cereus ATCC 10987
#=GS NC_005957.1/266577-266679 organism Bacillus thuringiensis serovar konkukian str. 97-27
#=GS NC_006274.1/265875-265977 organism Bacillus cereus ZK
#=GS NC_003366.1/2618421-2618323 organism Clostridium perfringens str. 13
#=GS NC_002973.5/1940027-1939926 organism Listeria monocytogenes str. 4b F2365
#=GS NC_003212.1/2013345-2013244 organism Listeria innocua Clipll262
#=GS NZ_AADQ01000095.1/1690-1791 organism Listeria monocytogenes str. l/2a F6854
#=GS NZ_AM)R01000082.1/3415-3516 organism Listeria monocytogenes str. 4b H7858
#=GS NC_003366.1/2871201-2871101 organism Clostridium perfringens str 13
#=GS NC_002745.2/430797-430900 organism Staphylococcus aureus subsp aureus N315
#=GS NC_002758.2/430754-430857 organism Staphylococcus aureus subsp aureus Mu50
#=GS NC_002951.2/460059-460162 organism Staphylococcus aureus subsp aureus COL
#=GS NC_002952.2/441050-441153 organism Staphylococcus aureus subsp aureus MRSA252
#=GS NC_002953.3/409191-409294 organism Staphylococcus aureus subsp aureus MSSA476
#=GS NC_003923.1/410546-410649 organism Staphylococcus aureus subsp. aureus MW2
#=GS NC_006582.1/1115665-1115767 organism Bacillus clausii KSM-K16
#=GS NC_006510.1/274257-274359 organism Geobacillus kaustophilus HTA426
#=GS NC_003210.1/1958922-1958821 organism Listeria monocytogenes EGD-e
#=GS NC__006270.2/697054-697156 organism Bacillus licheniformis ATCC 14580
#=GS NC_006322.1/696838-696940 organism Bacillus licheniformis ATCC 14580
#=GS NC__006510.1/282580-282682 organism Geobacillus kaustophilus HTA426
#=GS NC_003997.3/260641-260743 organism Bacillus anthracis str. Ames
#=GS NC_005945.1/260654-260756 organism Bacillus anthracis str Sterne
#=GS NC_007530.2/260641-260743 organism Bacillus anthracis str 'Ames Ancestor1
#=GS NZ_AAEK01000017.1/86437-86539 organism Bacillus cereus G9241
#=GS NZ_AAEN01000023.1/15020-15122 organism Bacillus anthracis str CNEVA-9066
#=GS NZ_AAEO01000030.1/15130-15232 organism Bacillus anthracis str A1055
#=GS NZ_AAEP01000043.1/11002-11104 organism Bacillus anthracis str Vollum
#=GS NZ_AAEQ01000040.1/6922-6820 organism Bacillus anthracis str Kruger B
#=GS NZ__AAER01000030.1/6334-6232 organism Bacillus anthracis str Western North America USA6153
#=GS NZ_AAES01000040.1/12800-12902 organism Bacillus anthracis str. Australia 94
#=GS NC_004193.1/760473-760575 organism Oceanobacillus iheyensis HTE831
#=GS NC__006270.2/4024210-4024312 organism Bacillus licheniformis ATCC 14580
#=GS NC_006322.1/4024324-4024426 organism Bacillus licheniformis ATCC 14580
#=GS NC_004193.1/786767-786868 organism Oceanobacillus iheyensis HTE831
#=GS NC_002662.1/1159509-1159607 organism Lactococcus lactis subsp. lactis 111403
#=GS NC_000964.2/625993-625893 organism Bacillus subtilis subsp. subtilis str. 168
#=GS NC_003909.8/382630-382528 organism Bacillus cereus ATCC 10987
#=GS NC_003997.3/342356-342254 organism Bacillus anthracis str. Ames
#=GS NC_005945.1/342369-342267 organism Bacillus anthracis str. Sterne
#=GS NC_005957.1/356354-356252 organism Bacillus thuringiensis serovar konkukian str. 97-27
#=GS NC_006274.1/357462-357360 organism Bacillus cereus ZK
#=GS NC_007530.2/342356-342254 organism Bacillus anthracis str 'Ames Ancestor'
#=GS NZ_AAAC02000001.1/859268-859166 organism Bacillus anthracis str A2012
#=GS NZ_AAEK01000051.1/8150-8252 organism Bacillus cereus G9241
#=GS NZ_AAEN01000023.1/83441-83339 organism Bacillus anthracis str CNEVA-9066
#=GS NZ_AAEO01000030.1/96886-96784 organism Bacillus anthracis str A1055
#=GS NZ__AAEP01000046.1/48810-48708 organism Bacillus anthracis str Vollum
#=GS NZ_AAEQ01000034.1/49483-49381 organism Bacillus anthracis str Kruger B
#=GS NZ_AAER01000042.1/191339-191441 organism Bacillus anthracis str Western North America USA6153
#=GS NZ_AAES01000043.1/48479-48377 organism Bacillus anthracis str Australia 94
#=GS NZ_AAAC02000001.1/794076-794178 organism Bacillus anthracis str A2012
#=GS NC_003030.1/1002176-1002275 organism Clostridium acetobutylicum ATCC 824
#=GS NC_006582.1/1554717-1554819 organism Bacillus clausii KSM-K16
#=GS NC_003909.8/336194-336296 organism Bacillus cereus ATCC 10987
#=GS NC_003997.3/295331-295433 organism Bacillus anthracis str. Ames
#=GS NC_005945.1/295344-295446 organism Bacillus anthracis str. Sterne
#=GS NC_005957.1/309524-309626 organism Bacillus thuringiensis serovar konkukian str. 97-27
#=GS NC_006274.1/309094-309196 organism Bacillus cereus ZK
#=GS NC_007530.2/295331-295433 organism Bacillus anthracis str 'Ames Ancestor1
#=GS NZ_AAAC02000001.1/812243-812345 organism Bacillus anthracis str A2012
#=GS NZ_AAEK01000064.1/23153-23051 organism Bacillus cereus G9241
#=GS NZ_AAEN01000023.1/36414-36516 organism Bacillus anthracis str CNEVA-9066
#=GS NZ_AAEO01000030.1/49824-49926 organism Bacillus anthracis str AlO55
#=GS NZ_AAEP01000046.1/1785-1887 organism Bacillus anthracis str Vbllum
#=GS NZ_AAEQ01000034.1/2457-2559 organism Bacillus anthracis str Kruger B
#=GS NZ_AAER01000042.1/238364-238262 organism Bacillus anthracis str Western North America USA6153
#=GS NZ_AAES01000043.1/1489-1591 organism Bacillus anthracis str Australia 94
#=GS NC_004193.1/769686-769787 organism Oceanobacillus iheyensis HTE831
#=GS NC_004722.1/343847-343745 organism Bacillus cereus ATCC 14579
#=GS NZ_AAAW03000042.1/15038-14936 organism Desulfitobacterium hafniense DCB-2
#=GS NZ_AADT03000005.1/34629-34731 organism Moorella thermoacetica ATCC 39073
#=GS NC_002570.2/648442-648544 organism Bacillus halodurans C-125
#=GS NC_004722.1/298774-298876 organism Bacillus cereus ATCC 14579
#=GS NC_006510.1/272473-272575 organism Geobacillus kaustophilus HTA426
#=GS NZ_AADW02000004.1/225764-225662 organism Exiguobacterium sp. 255-15
#=GS NC_006274.1/3685852-3685750 organism Bacillus cereus ZK
#=GS NZ_AAGO01000011.1/2345-2247 organism Lactococcus lactis subsp. cremoris SKIl
#=GS NZ_AAAK03000113.1/5724-5822 organism Enterococcus faecium DO
#=GS NZ_AADW02000005.1/186378-186480 organism Exiguobacterium sp. 255-15
#=GS NC_003909.8/3578068-3577966 organism Bacillus cereus ATCC 10987
#=GS NC_003997.3/3605298-3605196 organism Bacillus anthracis str. Ames
#=GS NC_004722.1/3766254-3766152 organism Bacillus cereus ATCC 14579
#=GS NC_005945.1/3605993-3605891 organism Bacillus anthracis str. Sterne
#=GS NC_005957.1/3626969-3626867 organism Bacillus thuringiensis serovar konkukian str. 97-27
#=GS NC_007530.2/3605425-3605323 organism Bacillus anthracis str. 'Ames Ancestor'
#=GS NZ_AAAC02000001.1/4059572-4059470 organism Bacillus anthracis str. A2012
#=GS NZ_AAEN01000012.1/139994-140096 organism Bacillus anthracis str. CNEVA-9066
#=GS NZ_AAEO01000020.1/51249-51351 organism Bacillus anthracis str. A1055
#=GS NZ_AAEP01000042.1/12489-12387 organism Bacillus anthracis str. Vollum
#=GS NZ_AAEQ01000019.1/51159-51261 organism Bacillus anthracis str. Kruger B
#=GS NZ_AAER01000035.1/118314-118212 organism Bacillus anthracis str. Western North America USA6153
#=GS NZ_AAES01000032.1/51320-51422 organism Bacillus anthracis str. Australia 94
#=GS NC_006371.1/1538896-1538796 organism Photobacterium profundum SS9
#=GS NC_004460.1/504371-504471 organism Vibrio vulnificus CMCP6
#=GS NC_005140.1/1130553-1130653 organism Vibrio vulnificus YJ016
#=GS NC_004116.1/1094305-1094209 organism Streptococcus agalactiae 2603V/R
#=GS NC_004368.1/1163490-1163394 organism Streptococcus agalactiae NEM316
#=GS NZ_AADT03000005.1/42245-42347 organism Moorella thermoacetica ATCC 39073
#=GS NC_004557.1/2551392-2551293 organism Clostridium tetani E88
#=GS NC_003028.1/1754786-1754882 organism Streptococcus pneumoniae TIGR4
#=GS NC_003098.1/1634825-1634921 organism Streptococcus pneumoniae R6
#=GS NZ_AAGY01000085.1/5640-5736 organism Streptococcus pneumoniae TIGR4
#=GS NZ_AAEK01000001.1/183902-184006 organism Bacillus cereus G9241
#=GS NC_003454.1/1645820-1645721 organism Fusobacterium nucleatum subsp. nucleatum ATCC 25586
#=GS NZ_AADW02000027.1/2980-2880 organism Exiguobacterium sp. 255-15
#=GS NZ_AADW02000005.1/179959-180061 organism Exiguobacterium sp. 255-15
#=GS NC_006582.1/1039390-1039492 organism Bacillus clausii KSM-K16
#=GS NC_002737.1/930749-930845 organism Streptococcus pyogenes Ml GAS
#=GS NC_003485.1/910599-910695 organism Streptococcus pyogenes MGAS8232
#=GS NC_004070.1/846557-846653 organism Streptococcus pyogenes MGAS315
#=GS NC_004606.1/977077-977173 organism Streptococcus pyogenes SSI-I
#=GS NC_006086.1/857664-857760 organism Streptococcus pyogenes MGAS10394
#=GS NZ_AAFV01000199.1/507-411 organism Streptococcus pyogenes M49 591
#=GS NC_004605.1/1369721-1369821 organism Vibrio parahaemolyticus RIMD 2210633
#=GS NZ_AAAW03000004.1/66862-66959 organism Desulfitobacterium hafniense DCB-2
#=GS NC_004567.1/2968830-2968731 organism Lactobacillus plantarum WCFSl
#=GS NZ_AAEV01000003.1/13765-13667 organism Pediococcus pentosaceus ATCC 25745
#=GS NC_002570.2/806873-806971 organism Bacillus halodurans C-125
#=GS NZ_AAEK01000052.1/27552-27452 organism Bacillus cereus G9241
#=GS NC_006814.1/237722-237626 organism Lactobacillus acidophilus NCFM
#=GS NC_005363.1/3414604-3414703 organism Bdellovibrio bacteriovorus HDlOO
#=GS NZ_AΔDT03000021.1/9410-9513 organism Moorella thermoacetica ATCC 39073
#=GS NC_004668.1/2288426-2288328 organism Enterococcus faecalis V583
#=GS NC_006449.1/1182949-1183044 organism Streptococcus thermophilus CNRZlO66
#=GS NZ_AAGS01000026.1/9921-10016 organism Streptococcus thermophilus LMD-9
#=GS NZ_AAGQ01000089.1/170-269 organism Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365
#=GS NZ_AABP02000293.1/1-98 organism Fusobacterium nucleatum subsp. vincentii ATCC 49256
#=GS NC_006448.1/1185082-1185177 organism Streptococcus thermophilus LMG 18311
#=GS NC_004567.1/2410478-2410577 organism Lactobacillus plantarum WCFSl
#=GS NZ_AABJ03000010.1/47704-47802 organism Oenococcus oeni PSU-1
#=GS NZ_AADT03000002.1/89184-89083 organism Moorella thermoacetica ATCC 39073 #=GS NC_006814.1/1971978-1972076 organism Lactobacillus acidophilus NCFM #=GS NZ_AAAO02000016.1/1579-1480 organism Lactobacillus gasseri #=GS NC_005362.1/1949387-1949485 organism Lactobacillus johnsonii NCC 533 #=GS NZ_AAAO02000035.1/3299-3201 organism Lactobacillus gasseri #=GS NC_005362.1/263146-263049 organism Lactobacillus johnsonii NCC 533 #=GS NC_005363.1/2004933-2004835 organism Bdellovibrio bacteriovorus HDlOO #=GS NZ_AADT03000002.1/98293-98192 organism Moorella thermoacetica ATCC 39073 #=GS NZ_AABH02000036.1/17403-17500 organism Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293 #=GS NC_006055.1/484139-484044 organism Mesoplasma florum Ll #=GS NZ_AABH02000272.1/211-308 organism Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293 #=GS NZ_AABJ03000003.1/47905-47810 organism Oenococcus oeni PSU-I #=GS NZ_AABH02000038.1/17559-17463 organism Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293 #=GS NC_006055.1/397027-397123 organism Mesoplasma florum Ll
#=GS NC_003869.1/586365-586467 taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales;
Thermoanaerobacteriaceae; Thermoanaerobacter; tengcongensis #=GS NC_002570.2/1593074-1592972 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; halodurans #=GS NC_002570.2/676475-676577 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; halodurans #=GS NC_000964.2/4004541-4004643 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; subtilis #=GS NC_003366.1/513088-512986 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Clostridiaceae; Clostridium; perfringens #=GS NC_000964.2/693775-693877 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; subtilis #=GS NC_002570.2/650309-650411 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; halodurans #=GS NC_006270.2/2294931-2294831 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; licheniformis #=GS NC_006322.1/2295788-2295688 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; licheniformis #=GS NC_004193.1/1103943-1104045 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Oceanobacillus; iheyensis #=GS NC_003909.8/1650407-1650509 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NC 003997.3/1497571-1497673 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis
#=GS NC_005945.1/1497645-1497747 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_005957.1/1521880-1521982 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; thuringiensis #=GS NC_006274.1/1532482-1532584 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus,-
Bacillus; cereus group; Bacillus; cereus #=GS NC_007530.2/1497694-1497796 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAAC02000001.1/1985987-1986089 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEK01000008.1/96466-96364 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NZ-AAENO1000011.1/67143-67245 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEO01000025.1/66855-66957 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEP01000035.1/69365-69467 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEQ01000029.1/70848-70950 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAER01000023.1/58182-58284 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAES01000034.1/69121-69223 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_004722.1/1515239-1515341 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NC_000964.2/697711-697813 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; subtilis
#=GS NC_002976.3/54816-54919 taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus; epidermidis #=GS NC_004461.1/2433047-2432944 taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus; epidermidis #=GS NC_003030.1/2824953-2824855 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Clostridiaceae; Clostridium; acetobutylicum
#=GS NC_003909.8/296465-296567 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NC_003997.3/262601-262703 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC 004722.1/261558-261660 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus
#=GS NC_005945.1/262614-262716 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_005957.1/268537-268639 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; thuringiensis #=GS NC_006274.1/267835-267937 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group,- Bacillus; cereus #=GS NC_007530.2/262601-262703 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAAC02000001.1/796036-796138 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEK01000017.1/88397-88499 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NZ_AAEN01000023.1/16980-17082 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS WZ_AAEO01000030.1/17090-17192 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEP01000043.1/12962-13064 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEQ01000040.1/4962-4860 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAER01000030.1/4374-4272 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAES01000040.1/14760-14862 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_006270.2/693198-693300 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; licheniformis #=GS NC_006322.1/692981-693083 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; licheniformis #=GS NC_004722.1/259601-259703 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NC_000964.2/2319410-2319310 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; subtilis #=GS NC_003030.1/2905050-2904948 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Clostridiaceae; Clostridium; acetobutylicum #=GS NC_003210.1/611119-611017 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria; monocytogenes #=GS NZ AADQ01000001.1/2431-2533 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria;
monocytogenes
#=GS NC _003212.1/610156-610054 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria,- innocua
#=GS NC_003366.1/422820-422922 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Clostridiaceae; Clostridium; perfringens
#=GS NC_002973.5/617846-617744 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria; monocytogenes
#=GS NZ _AADR01000111.1/1598-1496 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria; monocytogenes
#=GS NC_003909.8/294509-294611 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus
#=GS NC_005957.1/266577-266679 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; thuringiensis
#=GS NC_006274.1/265875-265977 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus
#=GS NC _003366.1/2618421-2618323 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Clostridiaceae; Clostridium; perfringens
#=GS NC 002973.5/1940027-1939926 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria; monocytogenes
#=GS NC _003212.1/2013345-2013244 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria; innocua
#=GS NZ _AADQ01000095.1/1690-1791 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria; monocytogenes
#=GS NZ AADR01000082.1/3415-3516 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria; monocytogenes
#=GS NC 003366.1/2871201-2871101 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Clostridiaceae; Clostridium; perfringens
#=GS NC _002745.2/430797-430900 taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus; aureus
#=GS NC 002758.2/430754-430857 taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus; aureus
#=GS NC ~002951.2/460059-460162 taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus; aureus
#=GS NC JD02952.2/441050-441153 taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus; aureus
#=GS NC _002953.3/409191-409294 taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus; aureus
#=GS NC _003923.1/410546-410649 taxonomy Bacteria; Firmicutes; Bacillales; Staphylococcus; aureus
#=GS NC 006582.1/1115665-1115767 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; clausii
#=GS NC 006510.1/274257-274359 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Geobacillus; kaustophilus
#=GS NC 003210.1/1958922-1958821 taxonomy Bacteria; Firmicutes; Bacillales; Listeriaceae; Listeria;
monocytogenes
#=GS NC_006270.2/697054-697156 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; licheniformis #=GS NC_006322.1/696838-696940 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; licheniformis #=GS NC_006510.1/282580-282682 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Geobacillus; kaustophilus #=GS NC_003997.3/260641-260743 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_005945.1/260654-260756 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_007530.2/260641-260743 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEK01000017.1/86437-86539 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NZ_AAEN01000023.1/15020-15122 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEO01000030.1/15130-15232 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEP01000043.1/11002-11104 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEQ01000040.1/6922-6820 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAER01000030.1/6334-6232 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAES01000040.1/12800-12902 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_004193.1/760473-760575 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Oceanobacillus; iheyensis #=GS NC_006270.2/4024210-4024312 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; licheniformis #=GS NC_006322.1/4024324-4024426 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; licheniformis #=GS NC_004193.1/786767-786868 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Oceanobacillus; iheyensis #=GS NC_002662.1/1159509-1159607 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
Lactococcus; lactis #=GS NC 000964.2/625993-625893 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
subtilis
#=GS NC_003909.8/382630-382528 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NC_003997.3/342356-342254 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_005945.1/342369-342267 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_005957.1/356354-356252 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; thuringiensis #=GS NC_006274.1/357462-357360 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NC_007530.2/342356-342254 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAAC02000001.1/859268-859166 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEK01000051.1/8150-8252 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NZ_AAEN01000023.1/83441-83339 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEO01000030.1/96886-96784 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEP01000046.1/48810-48708 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEQ01000034.1/49483-49381 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAER01000042.1/191339-191441 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAES01000043.1/48479-48377 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAAC02000001.1/794076-794178 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_003030.1/1002176-1002275 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Clostridiaceae; Clostridium; acetobutylicum #=GS NC_006582.1/1554717-1554819 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; clausii #=GS NC_003909.8/336194-336296 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NC 003997.3/295331-295433 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis
#=GS NC_005945.1/295344-295446 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_005957.1/309524-309626 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; thuringiensis #=GS NC_006274.1/309094-309196 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NC_007530.2/295331-295433 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAAC02000001.1/812243-812345 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEK01000064.1/23153-23051 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NZ_AAEN01000023.1/36414-36516 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEO01000030.1/49824-49926 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEP01000046.1/1785-1887 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEQ01000034.1/2457-2559 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAER01000042.1/238364-238262 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAES01000043.1/1489-1591 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; anthracis #=GS NC_004193.1/769686-769787 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Oceanobacillus; iheyensis #=GS NC_004722.1/343847-343745 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus #=GS NZ_AAAW03000042.1/15038-14936 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Peptococcaceae; Desulfitobacterium; hafniense #=GS NZ_AADT03000005.1/34629-34731 taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales;
Thermoanaerobacteriaceae; Moorella group; Moorella,- thermoacetica #=GS NC_002570.2/648442-648544 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; halodurans
#=GS NC 004722.1/298774-298876 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group; Bacillus; cereus
#=GS NC_006510.1/272473-272575 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Geobacillus,- kaustophilus #=GS NZ_AADW02000004.1/225764-225662 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Exiguobacterium,- sp. #=GS NC_006274.1/3685852-3685750 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus,- Bacillus; cereus group; Bacillus; cereus #=GS NZ_AAGO0l000011.1/2345-2247 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae,- Lactococcus; lactis #=GS NZ_AAAK03000113.1/5724-5822 taxonomy Bacteria; Firmicutes; Lactobacillales; Enterococcaceae; Enterococcus; faecium #=GS NZ_AADW02000005.1/186378-186480 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Exiguobacterium; sp. #=GS NC_003909.8/3578068-3577966 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; cereus #=GS NC_003997.3/3605298-3605196 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus,- Bacillus; cereus group; Bacillus; anthracis #=GS NC_004722.1/3766254-3766152 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; cereus #=GS NC_005945.1/3605993-3605891 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus,- Bacillus; cereus group; Bacillus; anthracis #=GS NC_005957.1/3626969-3626867 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; thuringiensis #=GS NC_007530.2/3605425-3605323 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus,- Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAAC02000001.1/4059572-4059470 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEN0l000012.1/139994-140096 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEO01000020.1/51249-51351 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AΑEP0l000042.1/12489-12387 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAEQO1OOOO19.1/51159-51261 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; anthracis #=GS NZ_AAER0l000035.1/118314-118212 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; anthracis #=GS NZ AAES01000032.1/51320-51422 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus,- Bacillus; cereus group; Bacillus,- anthracis
#=GS NC_006371.1/1538896-1538796 taxonomy Bacteria; Proteobacteria,- Gammaproteobacteria; Vibrionales; Vibrionaceae; Photobacterium; profundum #=GS NC_004460.1/504371-504471 taxonomy Bacteria; Proteobacteria; Gammaproteobacteria; Vibrionales; Vibrionaceae; Vibrio; vulnificus #=GS NC_005140.1/1130553-1130653 taxonomy Bacteria; Proteobacteria; Gammaproteobacteria; Vibrionales; Vibrionaceae; Vibrio; vulnificus #=GS NC_004116.1/1094305-1094209 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; agalactiae #=GS NC_004368.1/1163490-1163394 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; agalactiae #=GS NZ_AADT03000005.1/42245-42347 taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales; Thermoanaerobacteriaceae; Moorella group,- Moorella; thermoacetica #=GS NC_004557.1/2551392-2551293 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales; Clostridiaceae; Clostridium; tetani #=GS NC_003028.1/1754786-1754882 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; pneumoniae #=GS NC_003098.1/1634825-1634921 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; pneumoniae #=GS NZ_AAGY01000085.1/5640-5736 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; pneumoniae #=GS NZ_AAEK01000001.1/183902-184006 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus; cereus group; Bacillus; cereus #=GS NC_003454.1/1645820-1645721 taxonomy Bacteria; Fusobacteria; Fusobacterales; Fusobacteriaceae; Fusobacterium; nucleatum #=GS NZ_AADW02000027.1/2980-2880 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Exiguobacterium; sp. #=GS NZ_AADW02000005.1/179959-180061 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Exiguobacterium; sp. #=GS NC_006582.1/1039390-1039492 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; clausii #=GS NC_002737.1/930749-930845 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; pyogenes #=GS NC_003485.1/910599-910695 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; pyogenes #=GS NC_004070.1/846557-846653 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; pyogenes #=GS NC 004606.1/977077-977173 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae; Streptococcus; pyogenes
#=GS NC_006086.1/857664-857760 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
Streptococcus; pyogenes #=GS NZ_AAPV01000199.1/507-411 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
Streptococcus; pyogenes #=GS NC_004605.1/1369721-1369821 taxonomy Bacteria; Proteobacteria; Gammaproteobacteria; Vibrionales;
Vibrionaceae; Vibrio; parahaemolyticus #=GS NZ_AAAW03000004.1/66862-66959 taxonomy Bacteria; Firmicutes; Clostridia; Clostridiales;
Peptococcaceae; Desulfitobacterium; hafniense #=GS NC_004567.1/2968830-2968731 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; plantarum #=GS NZ_AAEV01000003.1/13765-13667 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Pediococcus; pentosaceus #=GS NC_002570.2/806873-806971 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; halodurans #=GS NZ_AAEK01000052.1/27552-27452 taxonomy Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus;
Bacillus; cereus group,- Bacillus; cereus
#=GS NC_006814.1/237722-237626 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; acidophilus
#=GS NC_005363.1/3414604-3414703 taxonomy Bacteria; Proteobacteria; Deltaproteobacteria;
Bdellovibrionales; Bdellovibrionaceae; Bdellovibrio,- bacteriovorus
#=GS NZ_AADT03000021.1/9410-9513 taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales;
Thermoanaerobacteriaceae; Moorella group; Moorella; thermoacetica #=GS NC_004668.1/2288426-2288328 taxonomy Bacteria; Firmicutes; Lactobacillales; Enterococcaceae;
Enterococcus; faecalis #=GS NC_006449.1/1182949-1183044 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
Streptococcus; thermophilus #=GS NZ_AAGS01000026.1/9921-10016 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
Streptococcus; thermophilus #=GS NZ_AAGQ01000089.1/170-269 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; delbrueckii #=GS NZ_AABF02000293.1/1-98 taxonomy Bacteria; Fusobacteria; Fusobacteria (class); Fusobacterales;
Fusobacteriaceae; Fusobacterium; nucleatum #=GS NC_006448.1/1185082-1185177 taxonomy Bacteria; Firmicutes; Lactobacillales; Streptococcaceae;
Streptococcus; thermophilus #=GS HC_004567.1/2410478-2410577 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; plantarum
#=GS WZ_AABJ03000010.1/47704-47802 taxonomy Bacteria; Firmicutes; Lactobacillales; Oenococcus; oeni #=GS WZ AADT03000002.1/89184-89083 taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales;
Thermoanaerobacteriaceae; Moorella group; Moorella; thermoacetica
#=GS NC_006814.1/1971978-1972076 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; acidophilus #=GS NZ_AAAO02000016.1/1579-1480 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; gasseri #=GS NC_005362.1/1949387-1949485 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; johnsonii #=GS NZ_AAAO02000035.1/3299-3201 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; gasseri #=GS NC_005362.1/263146-263049 taxonomy Bacteria; Firmicutes; Lactobacillales; Lactobacillaceae;
Lactobacillus; johnsonii #=GS NC_005363.1/2004933-2004835 taxonomy Bacteria; Proteobacteria; Deltaproteobacteria;
Bdellovibrionales; Bdellovibrionaceae; Bdellovibrio; bacteriovorus #=GS NZ_AADT03000002.1/98293-98192 taxonomy Bacteria; Firmicutes; Clostridia; Thermoanaerobacteriales;
Thermoanaerobacteriaceae; Moorella group; Moorella; thermoacetica #=GS NZ_AABH02000036.1/17403-17500 taxonomy Bacteria; Firmicutes; Lactobacillales; Leuconostoc; mesenteroides #=GS NC_006055.1/484139-484044 taxonomy Bacteria; Firmicutes; Mollicutes; Entomoplasmatales;
Entomoplasmataceae; Mesoplasma; florum #=GS NZ_AABH02000272.l/211-308 taxonomy Bacteria; Firmicutes; Lactobacillales; Leuconostoc; mesenteroides
#=GS NZ_AABJ03000003.1/47905-47810 taxonomy Bacteria; Firmicutes; Lactobacillales; Oenococcus; oeni #=GS NZ_AABH02000038.1/17559-17463 taxonomy Bacteria; Firmicutes; Lactobacillales; Leuconostoc; mesenteroides
#=GS NC 006055.1/397027-397123 taxonomy Bacteria; Firmicutes; Mollicutes; Entomoplasmatales;
Entomoplasmataceae; Mesoplasma; florum
NC_003869.1/586365-586467 SEQ ID NO:12
AAAAAUUUAAUAAGA.AG.CACUCAUAUAAUCCCGAGA. AU.AUGGCUCGGGA.GUCUCUACCGAACAACC..GUAAAUUGUUC.G.ACUAUGAGUGAAAGU.GUACCUAGGG NC_002570.2/1593074-1592972 SEQ ID NO:13 AUUUACAUUAAAAAA.AG.CACUCGUAUAAUCGCGGGA. AU.AGGGCCCGCAA.GUUUCUACCAGGCUGCC..GUAAACAGCCU.G.ACUACGAGUGAUACU.UUGACAUAGA NC_002570.2/676475-676577 SEQ ID NO:14
CGUUCUUUAUAUAAA.GU.ACCUCAUAUAAUCUUGGGA. AU.AUGGCCCAAAA.GUUUCUACCUGCUGACC..GUAAAUCGGCG.G.ACUAUGGGGAAAGAU.UUUGGAUCUU NC_000964.2/4004541-4004643 SEQ ID NO:15 CAUCUUAGAAAAAGA.CA.UUCUUGUAUAUGAUCAGUA. AU.AUGGUCUGAUU.GUUUCUACCUAGUAACC..GUAAAAAACUA.G.ACUACAAGAAAGUUU.GAAUAAAUUU NC_003366.1/513088-512986 SEQ ID NO:16
UAAGUGUAUUAAAUU.UU.AACUCGUAUAUAAUCGGUA. AU.AUGGUCCGAAA.GUUUCUACCUGCUAACC..GUAAAAUAGCA.G.ACUACGAGGAGUUGU.ACUAUAAAUU NC_000964.2/693775-693877 SEQ ID NO : 17
AGAAAUCAAAUAAGA.UG.AAUUCGUAUAAUCGCGGGA,AU.AUGGCUCGCAA.GUCUCUACCAAGCUACC..GUAAAUGGCUU.G.ACUACGUAAACAUUU.CUUUCGUUUG NC_002570.2/650309-650411 SEQ ID NO:18 AAUAAAUCGAAAACA.UC.AUUUCGUAUAAUGGCAGGA. AU.AGGGCCUGCGA.GUUUCUACCAAGCUACC..GUAAAUAGCUU.G.ACUACGAAAAUAAUG.GGUUUUUUAC NC_006270.2/2294931-2294831 SEQ ID NO:19 AAUUUGAUACAUUAU.AU.CACUCAUAUAAUCGCGUGG, AU.AUGGCACGCAA.GUUUCUACCGGGCA-CC..GUAAA-UGUCC.G.ACUAUGAGUGGGCGA.UAAGAAAACG NC_006322.1/2295788-2295688 SEQ ID NO:20 AAUUUGAUACAUUAU.AU.CACUCAUAUAAUCGCGUGG. AU.AUGGCACGCAA.GUUUCUACCGGGCA-CC..GUAAA-UGUCC.G.ACUAUGAGUGGGCGA.UAAGAAAACG NC_004193.1/1103943-1104045 SEQ ID NO:21 AAACCUUAUAUAUAG.UU.UUUUCAUAUAAUCGCGGGG. AU.AUGGCCUGCAA. GUϋϋCUACCGGUUUACC..GUAAAUGAACC. G.ACUAUGGAAAAGCGG.AAAAUUCGAU NC_003909.8/1650407-1650509 SEQ ID NO:22 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG. AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NC_003997.3/1497571-1497673 SEQ ID NO:23 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG, AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NC_005945.1/1497645-1497747 SEQ ID NO:24 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG, AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NC_005957.1/1521880-1521982 SEQ ID NO:25 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG. AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NC_006274.1/1532482-1532584 SEQ ID NO:26 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG, AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU, G.ACUAUGAGUGAGGAC.GAAUAUAUUU NC_007530.2/1497694-1497796 SEQ ID NO:27 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG, AU.AUGGCCUGCAA, GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NZ_AAAC02000001.1/1985987-1986089 SEQ ID NO:28 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG. AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NZ_AAEK01000008.1/96466-96364 SEQ ID NO:29 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG. AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.GAAUAUAUUU NZ_AAEN01000011.1/67143-67245 SEQ ID NO-.30 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG, AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NZ_AAEO01000025.1/66855-66957 SEQ ID NO:31 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG AU.AUGGCCUGCAA, GUUUCUACCGAAGUACC..GUAAAUACUUU, G.ACUAUGAGUGAGGAC.GAAUAUAUUU NZ_AAEP01000035.1/69365-69467 SEQ ID NO:32 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG AU.AUGGCCUGCAA, GUUUCUACCGAAGUACC..GUAAAUACUUU, G.ACUAUGAGUGAGGAC.GAAUAUAUUU NZ_AAEQ01000029.1/70848-70950 SEQ ID NO:33 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG. AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NZ_AAER01000023.1/58182-58284 SEQ ID NO:34 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG, AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NZ_AAES01000034.1/69121-69223 SEQ ID NO:35 AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG, AU.AUGGCCUGCAA. GUUUCUACCGAAGUACC..GUAAAUACUUU. G.ACUAUGAGUGAGGAC.GAAUAUAUUU NC_004722.1/1515239-1515341 SEQ ID NO:36
AAAUAAAUAGUUAGC.UA.CACUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUUUCUACCGAAGUACC..GUAAAUACUUU.G.ACUAUGAGUGAGGAC.GAAUAUAAUU NC_000964.2/697711-697813 SEQ ID NO:37
CAUGAAAUCAAAACA.CG.ACCUCAUAUAAUCUUGGGA. AU.AUGGCCCAUAA.GUϋϋCUACCCGGCAACC..GUAAAUUGCCG. G.ACUAUGCAGGAAAGU. GAUCGAUAAA NC_002976.3/54816-54919 SEQ ID NO:38
CAUAAAAUAAUUUAU.AU.GACUCAUAUAAUCUAGAGA. AU.AUGGCUUUAGAaGUUUCUACCGUGUCGCC..AUAAACGACAC. G.ACUAUGAGUAACAAU, CCAAUACAUU NC_004461.1/2433047-2432944 SEQ ID NO:39 CAUAAAAUAAUUUAU.AU.GACUCAUAUAAUCUAGAGA. AU.AUGGCUUUAGAaGUUUCUACCGUGUCGCC..AUAAACGACAC. G.ACUAUGAGUAACAAU. CCAAUACAUU NC_003030.1/2824953-2824855 SEQ ID NO:40 AAUCGUUAAUAUAGU.UU.AACUCAUAUAU-UUCCUGA, AU.AUGGCAGGAU-.GUUUCUACAAGGAA-CC..UUAAA-UUUCU.U.ACUAUGAGUGAUUUG.UUUGUAUGCA NC_003909.8/296465-296567 SEQ ID NO:41
GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG, AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU,UAUAAAGGUG NC_003997.3/262601-262703 SEQ ID NO:42
GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG, AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU,UAUAAΆGGUG NC_004722.1/261558-261660 SEQ ID NO:43
GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG, AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU,UAUAAAGGUG NC_005945.1/262614-262716 SEQ ID NO:44
GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU UAUAAAGGUG NC_005957.1/268537-268639 SEQ ID NO:45
GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU,UAUAAAGGUG NC_006274.1/267835-267937 SEQ ID NO:46
GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG. AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU, G.ACUACGAGGCGUUUU UAUAAAGGUG NC_007530.2/262601-262703 SEQ ID NO:47
GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG. AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU,UAUAAAGGUG NZ_AAAC02000001.1/796036-796138 SEQ ID NO:48 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG, AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU UAUAAAGGUG NZ_AAEK01000017.1/88397-88499 SEQ ID NO:49 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU, G.ACUACGAGGCGUUUU,UAUAAAGGUG NZ_AAEN01000023.1/16980-17082 SEQ ID NO:50 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG. AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU,UAUAAAGGUG NZ_AAEO01000030.1/17090-17192 SEQ ID NO:51 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG. AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU.UAUAAAGGUG NZ_AAEP01000043.1/12962-13064 SEQ ID NO:52 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG, AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU.UAUAAAGGUG NZ_AAEQ01000040.1/4962-4860 SEQ ID NO:53 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG, AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU.UAUAAAGGUG NZ_AAER01000030.1/4374-4272 SEQ ID NO:54 GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG, AU.AUGGCCCGAAA.GUUUCUACCUAGCUACC..GUAAAUGGCUU. G.ACUACGAGGCGUUUU.UAUAAAGGUG NZ AAES01000040.1/14760-14862 SEQ ID NO:55
GAAAAGUGAAUAUUA.UG.CCGUCGUAUAAUAUCGGGG.AU.AUGGCCCGAAA,GUUUCUACCUAGCUACC..GUAAAUGGCUϋ G.ACUACGAGGCGUUUU.UAUAAAGGUG NC_006270.2/693198-693300 SEQ ID NO:56
AGUAAUUUAAAAAAG.AC.UUGUCGUAUAAUCAUGGGG. AU.AUGGCCCAUAA,GUUUCUACCAAGCUACC..GUAAAUAGCUU G.ACUACGCUUGUAUAC.AAUAUUUUAU NC_006322.1/692981-693083 SEQ ID NO:57
AGUAAUUUAAAAAAG.AC.UUGUCGUAUAAUCAUGGGG. AU.AUGGCCCAUAA,GUUUCUACCAAGCUACC..GUAAAUAGCUU G.ACUACGCUUGUAUAC.AAUAUUUUAU NCJD04722.1/259601-259703 SEQ ID NO:58
AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAUG. AU.AUGGUUUGAGA,GUCUCUACCGGGUUACC..GUAAACAACCU G.ACUAUGAAGGCAGUG.UGUCUUAUAU NC_000964.2/2319410-2319310 SEQ ID NO:59 UUACAAUAUAAUAGG.AA.CACUCAUAUAAUCGCGUGG. AU.AUGGCACGCAA,GUUUCUACCGGGCA-CC..GUAAA-UGUCC G.ACUAUGGGUGAGCAA.UGGAACCGCA NC_003030.1/2905050-2904948 SEQ ID NO:60 GAAAAGUAAUAACAU.AU.UACCCGUAUAUGCUUAGAA. AU.AUGGUCUAAGC ,GUCUCUACCGGACUGCC..GUAAAUUGUCU G.ACUAUGGGUGUUUAU.AAGUAUUUUA NC_003210.1/611119-611017 SEQ ID NO-.61
AAUCCGCUACAAUAA.UA.UAGUCGUAUAAGUUCGGUA. AU.AUGGACCGUUC .GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.AAAAUUUAAU NZ_AADQ01000001.1/2431-2533 SEQ ID NO:62 AAUCCGCUACAAUAA.UA.UAGUCGUAUAAGUUCGGUA. AU.AUGGACCGUUC ,GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.AAAAUUUAAU NC_003212.1/610156-610054 SEQ ID NO:63
AAUCGUCUACAAUAA.UA.AAGUCGUAUAAGUUCGGUA. AU.AUGGACCGUUC .GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.AAAAUUUAAU NC_003366.1/422820-422922 SEQ ID NO:64
UAUGUACUUAUAUAA.GU.AUAUCGUAUAUGCUCGACG. AU.AUGGGUUGAGU.GUUUCUACUAGGAGGCC..GUAAACAUCCU,A.ACUACGAAUAUAUAG. GUGAUUUCUA NC_002973.5/617846-617744 SEQ ID NO:65
AAUCCGCUACAAUAA.UA.AAGUCGUAUAAGUUCGGUA. AU.AUGGACCGUUC .GUUUCUACCAGGCAACC..GUAAAAUGCCA,G.GCUACGAGCUAUUGU,AAAAUUUAAU NZ_AADR01000111.1/1598-1496 SEQ ID NO:66 AAUCCGCUACAAUAA.UA.AAGUCGUAUAAGUUCGGUA. AU.AUGGACCGUUC ,GUUUCUACCAGGCAACC..GUAAAAUGCCA.G.GCUACGAGCUAUUGU.AAAAUUUAAU NC_003909.8/294509-294611 SEQ ID NO:67
AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG. AU.AUGGUUUGAGA,GUCUCUACCGGGUUACC..GUAAACAACCU,G.ACUAUGAAGGCAGUG,UGUCUUAUAU NC_005957.1/266577-266679 SEQ ID NO:68
AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG. AU.AUGGUUUGAGA,GUCUCUACCGGGUUACC..GUAAACAACCU,G.ACUAUGAAGGCAGUG.UGUCUUAUAU NC_006274.1/265875-265977 SEQ ID NO:69
AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG. AU.AUGGUUUGAGA GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.UGUCUUAUAU NC_003366.1/2618421-2618323 SEQ ID NO:70 AAAACGGAAUAUAAA.CA.AACUCGUAUAA-GCUUUGA. AU.AAGGCAAGGC- .GUUUCUACCGGAAA-CC..UUAAA-UUUCC ,G.UCUAUGAGUGAAUUU. GAUAUACUAU NC_002973.5/1940027-1939926 SEQ ID NO:71 AUAACUUAAAACCGA.AA.UACUUAUAUAAUAGUUGCG. AU. -UGGGCGACGA.GUUUCUACCUGGUUACC..GUAAAUAACCG ,G.ACUAUGAGUAGUUUG.UAUAAAGAAG NC_003212.1/2013345-2013244 SEQ ID NO:72 AUAACUUAAAACCGA.AA.UACUUAUAUAAUAGUUGCG. AU. -UGGGCGACGA,GUUUCUACCUGGUUACC..GUAAAUAACCG G.ACUAUGAGUAGUUUG.UAUAAAGAAG NZ_AADQ01000095.1/1690-1791 SEQ ID NO:73 AUAACUUAAAACCGA.AA.UACUUAUAUAAUAGUUGCG. AU.-UGGGCGACGA,GUUUCUACCUGGUUACC..GUAAAUAACCG G.ACUAUGAGUAGUUUG.UAUAAAGAAG NZ_AADR01000082.1/3415-3516 SEQ ID NO:74
AUAACUUAAAACCGA.AA.UACUUAUAUAAUAGUUGCG.AU.-UGGGCGACGA.GUUUCUACCUGGUUACC..GUAAAUAACCG.G.ACUAUGAGUAGUUUG.UAUAAAGAAG NC_003366.1/2871201-2871101 SEQ ID NO:75 AUAAAAAAAUAAAUU.UU.GCUUCGUAUAACUCUAAUG. AU.AUGGAUUAGAG.GUCUCUACCAAGAA-CC..GAGAA-UUCUU.G.AUUACGAAGAAAGCU, UAUUUGCUUU NC_002745.2/430797-430900 SEQ ID NO:76
GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA. AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU. G.ACUAUGAGUAACAAC. ACAAUACUAG NC_002758.2/430754-430857 SEQ ID NO:77
GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA. AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU. G.ACUAUGAGUAACAAC ACAAUACUAG NC_002951.2/460059-460162 SEQ ID NO:78
GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA. AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC.ACAAUACUAG NC_002952.2/441050-441153 SEQ ID NO:79
GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA. AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU. G.ACUAUGAGUAACAAC ACAAUACUAG NC_002953.3/409191-409294 SEQ ID NO:80
GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA. AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC.ACAAUACUAG NC_003923.1/410546-410649 SEQ ID NO:81
GUUAAAUAAUUUACA.UA.AACUCAUAUAAUCUAAAGA. AU.AUGGCUUUAGAaGUUUCUACCAUGUUGCC..UUGAACGACAU.G.ACUAUGAGUAACAAC ACAAUACUAG NC_006582.1/1115665-1115767 SEQ ID NO:82 AAGUUAAAAACGAAA.AC.ACCUCAUAUAUACUCGGGA. AU.AUGGCUCGAAC.GUUUCUACCCGGCAACC..GUAAAUUGCCG.G.ACUAUGAGGGGAAGU. CAUUACGCGC NC_006510.1/274257-274359 SEQ ID NO:83
AUGAAUAUUGUUGAA.UU.CCGUCGUAUAAUCCCGGGA. AU.AUGGCUCGGGA.GUUUCUACCAAGCUACC..GUAAAUAGCUU.G.ACUACGAGGGAUGCG, GGAUCGGAGA NC_003210.1/1958922-1958821 SEQ ID NO:84 AUAACUUAAAACCGA.AA.UACUUGUAUAAUAGUUGCG. AU. -UGGGCGACGA.GUUUCUACCUGGUUACC..GUAAAUAACCG.G.ACUAUGAGUAGUUUG. UAUAAAGAAG
NC_006270.2/697054-697156 SEQ ID NO:85
CAUGACAUGAAAACA.CA.UCCUCAUAUAAUCUUGGGA.AU.AUGGCCCAUAA. GUCUCUACCCGAUGACC. .GUAAAUCAUCG. G.ACUAUGCAGGAAAGU. GGACAAUAAA NC_006322.1/696838-696940 SEQ ID NO:86
CAUGACAUGAAAACA.CA.UCCUCAUAUAAUCUUGGGA.AU.AUGGCCCAUAA.GUCUCUACCCGAUGACC..GUAAAUCAUCG.G.ACUAUGCAGGAAAGU. GGACAAUAAA NC_006510.1/282580-282682 SEQ ID NO:87
AUAGUGUAUGAGAAG.AU.CCCUCAUAUAAUUUUGGGA.AU.AUGGCCCAAAA.GUUUCUACCCAAUCACC..GUAAAUGAUUG.G.ACUAUGAGGGAAAGG,AUCGGUUUUG NC_003997.3/260641-260743 SEQ ID NO:88
AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG,UGUCUUAUAU NC_005945.1/260654-260756 SEQ ID NO:89
AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.UGUCUUAUAU NC_007530.2/260641-260743 SEQ ID NO:90
AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.UGUCUUAUAU NZ_AAEK01000017.1/86437-86539 SEQ ID NO:91 AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.UGUCUUAUAU NZ_AAEN01000023.1/15020-15122 SEQ ID NO:92 AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.UGUCUUAUAU
NZ_AAEO01000030.1/15130-15232 SBQ ID NO:93 AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG. AU.AAGGUUUGAGA. GUCUCUACCGGGUUACC. .GUAAACAACCU .G.ACUAUGAAGGCAGUG. UGUCUUAUAU NZ_AAEP01000043.1/11002-11104 SEQ ID NO:94 AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG. AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.UGUCUUAUAU NZ_AAEQ01000040.1/6922-6820 SEQ ID NO:95 AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG. AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG.UGUCUUAUAU NZ_AAER01000030.1/6334-6232 SEQ ID NO:96 AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG. AU.AAGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU,G.ACUAUGAAGGCAGUG, UGUCUUAUAU NZ_AAES01000040.1/12800-12902 SEQ ID NO:97 AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG, AU.AAGGUUUGAGA. GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAAGGCAGUG UGUCUUAUAU NC_004193.1/760473-760575 SEQ ID NO:98
CAAUUUUUAUCCAAU.GC.CϋϋUCGUAUAUCCUCGAUA. AU.AUGGUUCGAAA. GUAUCUACCGGGUCACC..GUAAAUGAUCU.G.ACUAUGAAGGCAGAA GCAGGUUCGG NC_006270.2/4024210-4024312 SEQ ID NO:99 AAAUAAUAGAAGCCC.AC.UUCUUGUAUAUAAUCAGUA. AU.AGGGUCUGAUU.GUUUCUACCUGGCAACC..GUAAAUCGCCA.G.ACUACAAGGAAGUUϋ. GAAUAGAUUU NC_006322.1/4024324-4024426 SEQ ID NO:100 AAAUAAUAGAAGCCC.AC.UUCUUGUAUAUAAUCAGUA. AU.AGGGUCUGAUU.GUUUCUACCUGGCAACC..GUAAAUCGCCA.G.ACUACAAGGAAGUUU GAAUAGAUϋϋ NC_004193.1/786767-786868 SEQ ID NO:101
CCGACAAUUGAAAAU.GA.ACCUCAUAUAAAUUUGAGA. AU.AUGGCUCAGAA.GUUUCUACCCAGC-ACC..GUAAAUGGCUG .G.ACUAUGAGGGAAGAU, GGAUCAUUUC NC_002662.1/1159509-1159607 SEQ ID NO:102 UAGUCUAUAAUAGAA.CA.AUCUUAUUUAU-ACCUAGG. AU.AUGGCUGGGC- .GUUUCUACCUCGUA-CC..GUAAA-UGCGA.G.ACAAUAAGGAAAUUC. GAUUUUUUAG NC_000964.2/625993-625893 SEQ ID WO:103 AAUUAAAUAGCUAUU.AU.CACUUGUAUAACCUCAAUA. AU.AUGGUUUGAGG.GUGUCUACCAGGAA-CC..GUAAA-AUCCU G.AUUACAAAAUUUGUϋ.UAUGACAUUU NC_003909.8/382630-382528 SEQ ID NO:104
UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA. AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG .G.ACUAUGGGGAAAACU.AAUGAAUAUϋ NC_003997.3/342356-342254 SEQ ID NO:105
UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA. AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG,G.ACUAUGGGGAAAACU.AAUGAAUAUU NC_005945.1/342369-342267 SEQ ID NO:106
UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA. AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU,AAUGAAUAUU NC_005957.1/356354-356252 SEQ ID NO:107
UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA. AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG ,G.ACUAUGGGGAAAACU.AAUGAAUAUU NC_006274.1/357462-357360 SEQ ID NO:108
UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA. AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG ,G.ACUAUGGGGAAAACU.AAUGAAUAUU NC_007530.2/342356-342254 SEQ ID NO:109
UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA. AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG G.ACUAUGGGGAAAACU.AAUGAAUAUU NZ_AAAC02000001.1/859268-859166 SEQ ID NO:110 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA. AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG .G.ACUAUGGGGAAAACU.AAUGAAUAUU NZ_AAEK01000051.1/8150-8252 SEQ ID NO:111 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA. AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG ,G.ACUAUGGGGAAAACU.AAUGAAUAUU
NZ_AAEN01000023.1/83441-83339 SEQ ID NO:112 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA. GUUUCUACCCGGCAACC. .UUAAAUUGCUG.G.ACUAUGGGGAAAACU.AAUGAAUAUU NZ_AAEO01000030.1/96886-96784 SEQ ID NO:113 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.AAUGAAUAUU NZ_AAEP01000046.1/48810-48708 SEQ ID NO:114 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.AAUGAAUAUU NZ_AAEQ01000034.1/49483-49381 SEQ ID Nθ:115 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.AAUGAAUAUU NZ_AAER01000042.1/191339-191441 SEQ ID NO:116 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.AAUGAAUAUU NZ_AAES01000043.1/48479-48377 SEQ ID NO:117 UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUCGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG.G.ACUAUGGGGAAAACU.AAUGAAUAUU NZ_AAAC02000001.1/794076-794178 SEQ ID NO:118 AGAUAAUAUAAAACG.AU.CCUUCAUAUAUCCUCAAAG.AU.ARGGUUUGAGA.GUCUCUACCGGGUUACC..GUAAACAACCU.G.ACUAUGAWGGCAGUG.UGUCUUAUAU NC_003030.1/1002176-1002275 SEQ ID NO:119 UAUAUAAAAAACUAA.AU.UUCUCGUAUAC-ACCGGUA.AU.AUGGUCCGGAA.GUUUCUACCUGCUG-CC..AUAAA-UAGCA.G.ACUACGGGGUGUUAU.UGAUAAUAUA NC_006582.1/1554717-1554819 SEQ ID NO:120 UAAACGAACAAAGCA.UC.AGCUCGUAUAAUAGCGGUA.AU.AUGGUCCGCGA.GUCUCUACCAGGCUGCC..GAUAACGGCCU.G.ACUACGAGUGGUCUU.UUUCAGUUGU NC_003909.8/336194-336296 SEQ ID Nθ:121
AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NC_003997.3/295331-295433 SEQ ID Nθ:122
AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NC_005945.1/295344-295446 SEQ ID NO:123
AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA. GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NC_005957.1/309524-309626 SEQ ID NO:124
AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NC_006274.1/309094-309196 SEQ ID NO:125
AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA. GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NC_007530.2/295331-295433 SEQ ID Nθ:126
AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NZ_AAAC02000001.1/812243-812345 SEQ ID Nθ:127 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NZ_AAΞK01000064.1/23153-23051 SEQ ID NO:128 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NZ_AAEN01000023.1/36414-36516 SEQ ID NO:129 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU NZ_AAEO01000030.1/49824-49926 SEQ ID NO:130 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA. GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU.CACUCGGUAU
NZ_AAEP01000046.1/1785-1887 SEQ ID NO:131 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA. GUCUCUACCUAACGACC. .GUUAUUCGUUA, G.ACUAUGAGGGAAAGU, CACUCGGUAU NZ_AAEQ01000034.1/2457-2559 SEQ ID NO:132 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA. G.ACUAUGAGGGAAAGU. CACUCGGUAU N2_AAER01000042.1/238364-238262 SEQ ID NO.-133 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA. G.ACUAUGAGGGAAAGU, CACUCGGUAU N2_AAES01000043.1/1489-1591 SEQ ID NO:134 AAAGAAUAAUAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA. G.ACUAUGAGGGAAAGU. CACUCGGUAU NC_004193.1/769686-769787 SEQ ID NO:135
UGAUGUAAUUGAAUA.GA.AAUGCGUAUAAUUAAGGGG.AU.AUGGCCC-ACA.GUUUCUACCAGACCACC..GUAAAUGGUUU. G.ACUACGCAGUAAUUA.UAUUUGUAUC NC_004722.1/343847-343745 SEQ ID NO:136
UUAAUACGGACGAUG.UU.ACCUCAUAUAUACUUGAUA.AU.AUGGAUCGAGA.GUUUCUACCCGGCAACC..UUAAAUUGCUG. G.ACUAUGGGGAAAACU,AAUGAAUAUU NZ_AAAW03000042.1/15038-14936 SEQ ID NO:137 AAAAAAUUUAAAUAG.GG.CGUUCAUAUAAUCGCGGAG.AU.AGGGUCCGCAA. GUUUCUACCGGGCUGCC..GUAAAUGGCCU, G.ACUAUGAGCGAAACU, GUGCCCAGGG NZ_AADT03000005.1/34629-34731 SEQ ID NO:138 AAAAAUAAUUUAACC.CU.GCUUCGUAUAUUCCCGGAA.AU.GCGGUCCGGGA.GUUUCUACCAGGCAACC..GUAAAUUGCCC.G.GCUACGAAGGUUAUU. CUUCGUCGUC NC_002570.2/648442-648544 SEQ ID NO:139
ACAUGUAGAUAUCAU.CC.CUUUCGUAUAUACUUGGAG.AU.AAGGUCCAGGA.GUUUCUACCAGAUCACC..GUAAAUGAUCU.G.ACUAUGAAGGUGGAA.UGGCUCGAUA NC_004722.1/298774-298876 SEQ ID NO:140
AGAAACAAUAAUAUA.AG.ACCUCAUAUAAUCGCGGGG.AU.AUGGCCUGCAA.GUCUCUACCUAACGACC..GUUAUUCGUUA.G.ACUAUGAGGGAAAGU. CACUCGGUAU NC_006510.1/272473-272575 SEQ ID NO:141
UACGGAUAGACGAAA.GC.CCUUCAUAUAAGCGCAAGA.AU.AUGGCUUGCGC.GUCUCUACCGGGCCGCC..GUAAACGGCCC. G.ACUAUGAAGGCAGAA. GACGCUGCUA NZ_AADW02000004.1/225764-225662 SEQ ID NO:142 GGAAGAUUGAAUAUA.AC.ACCUCGUAUAAUAGCAGGG.AU.AUGGCUUGCAA.GUUUCUACCCGACGACC..CUAAAUCGUUG. G.ACUAUGGGGUAUAUG. GAUGUUCGUC NC_006274.1/3685852-3685750 SEQ ID NO:143 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA. G.ACUAUGAGGGAAAAA. GUGUGUAACA NZ_AAGO01000011.1/2345-2247 SEQ ID NO:144 UUGAGCUGCUAUAAA.CA.AUCUUAUUUAU-ACCUAGA.AU.AUGGCUGGGC-.GUUUCUACCUCGUA-CC..GUAAA-UGCGA, G.ACAAUAAGGAAAUUC, GAUUUUUCAA NZ_AAAK03000113.1/5724-5822 SEQ ID NO:145 UAUUAGUUUUACAUA.CC.UACUUAUAUAU-CGUCAUA.AU.AUGGAUGACA-.GUUUCUAGCCAGUA-CC..GUAAA-UGCUG G.ACUAUAAGUAAAAGA,UUGGCUAUUU NZ_AADW02000005.1/186378-186480 SEQ ID NO:146 CUAAAAAACAAAAAA.UA.AUGCCGUAUAAUUCUGGGG.AU.AUGGCCCGGAA.GUCUCUACAGGAACACC..UUAAAGGUUCC, U.ACUACGGCGUGCACU, GAUUUCCGGU NC_003909.8/3578068-3577966 SEQ ID NO:147 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA. GUUUCUACCUGGCAACC..AUGAAUUGCUA, G.ACUAUGAGGGGAAAA, GUGUGUAACA NC_003997.3/3605298-3605196 SEQ ID NO:148 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA. G.ACUAUGAGGGGAAAA.GUGUGUAACA NC_004722.1/3766254-3766152 SEQ ID NO:149 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA, G.ACUAUGAGGGGAAAA. GUGUGUAACA
NC_005945.1/3605993-3605891 SEQ ID NO:150 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA. GUUUCUACCUGGCAACC. .AUGAAUUGCUA.G, ACUAUGAGGGGAAAA.GUGUGUAACA NC_005957.1/3626969-3626867 SEQ ID NO:151 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA. GUUUCUACCUGGCAACC..AUGAAUUGCUA.G ACUAUGAGGGGAAAA.GUGUGUAACA NC_007530.2/3605425-3605323 SEQ ID NO:152 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.GUGUGUAACA NZ_AAAC02000001.1/4059572-4059470 SEQ ID NO:153 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G,ACUAUGAGGGGAAAA.GUGUGUAACA NZ_AAEN01000012.1/139994-140096 SEQ ID NO:154 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G,ACUAUGAGGGGAAAA.GUGUGUAACA NZ_AAEO01000020.1/51249-51351 SEQ ID NO:155 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.GUGUGUAACA NZ_AAEP01000042.1/12489-12387 SEQ ID NO:156 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.GUGUGUAACA NZ_AAEQ01000019.1/51159-51261 SEQ ID NO:157 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.GUGUGUAACA NZ_AAER01000035.1/118314-118212 SEQ ID NO:158 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA. GUUUCUACCUGGCAACC..AUGAAUUGCUA.G ACUAUGAGGGGAAAA.GUGUGUAACA NZ_AAES01000032.1/51320-51422 SEQ ID NO:159 CUUCGAAAAGAAUCA.CU.GCCUCAUAUAAUCUUGGAG.AU.AAGGUCCAUAA.GUUUCUACCUGGCAACC..AUGAAUUGCUA.G.ACUAUGAGGGGAAAA.GUGUGUAACA NC_006371.1/1538896-1538796 SEQ ID NO:160 UAAUAAUGGGUAAAG.UU.ACUUCGUAUAACCCCACUU.AU.AGGGUGUGGGG.GUCUCUACCAGAAU-CC..GUAAA-AUUCU.G.AUUACGAAGAGUUGA.GUGAUUGAUC NC_004460.1/504371-504471 SEQ ID NO:161
GACUUUCGGCGAUCA.AC.GCUUCAUAUAAUCCUAAUG.AU.AUGGUUUGGGA.GUUUCUACCAAGAG-CC..UUAAA-CUCUU.G.AUUAUGAAGUCUGUC.GCUUUAUCCG NC_005140.1/1130553-1130653 SEQ ID NO:162 GACUUUCGGCGAUCA.AC.GCUUCAUAUAAUCCUAAUG.AU.AUGGUUUGGGA.GUUUCUACCAAGAG-CC..UUAAA-CUCUU.G.AUUAUGAAGUCUGUC.GCUUUAUCCG NC_004116.1/1094305-1094209 SEQ ID NO:163 CAAUUAAAUAUAUGA.UU.UACϋUAUUUAU-GCUGAGG.AU. -UGGCUUAGC-.GUCUCUACAAGACA-CC..GU-AA-UGUCU.A.ACAAUAAGUAAGCUA.AUAAAUAGCU NC_004368.1/1163490-1163394 SEQ ID NO:164 CAAUUAAAUAUAUGA.UU.UACUUAUUUAU-GCUGAGG.AU. -UGGCUUAGC-.GUCUCUACAAGACA-CC..GU-AA-UGUCU.A.ACAAUAAGUAAGCUA.AUAAAUAGCU NZ_AADT03000005.1/42245-42347 SEQ ID NO:165 CAAUAAAGCCAGUCU.UA.ACUUCGUAUAUCCCCGGCA.AU.AGGGACCGGGG.GUUUCUACCAGGCAACC..GCAAAUUGCCC.G. GCUACGAAGGUAUAU.CUUCGUUGCU NC_004557.1/2551392-2551293 SEQ ID NO:166 CUCUAUAAUAAAUUA.UU.GACUCAUAUAU-CCCCUUA.AU.AAGGUAGGGA-.GUAUCUACCAGAAG-CC..UUAAA-CUUCU.G.ACUAUGAGUGAAUAAaGCAUUGUCAU NC_003028.1/1754786-1754882 SEQ ID NO:167 AAAAUUGAAUAUCGU.UU.UACUUGUUUAU-GUCGUGA.AU. -UGGCACGAC- .GUUUCUACAAGGUG-CC..GG-AA-CACCU.A.ACAAUAAGUAAGUCA.GCAGUGAGAU NC_003098.1/1634825-1634921 SEQ ID NO:168 AAAAUUGAAUAUCGU.UU.UACUUGUUUAU-GUCGUGA.AU. -UGGCACGAC-.GUUUCUACAAGGUG-CC..GG-AA-CACCU.A.ACAAUAAGUAAGUCA.GCAGUGAGAU
NZ_AAGY01000085.1/5640-5736 SEQ ID NO:169 AAAAUUGAAUAUCGU.UU.UACUUGUUUAU-GUCGUGA. AU. -UGGCACGAC-. GUUUCUACAAGGUG-CC..GG-AA-CACCU. A.ACAAUAAGUAAGUCA. GCAGUGAGAU NZ-AAEKOlO00001.1/183902-184006 SEQ ID NO:170 AUAAUUUUACACAUU.AU.CACUCGUAUAUACUCGGUA. AU.AUGGUCCGAGC.GUUUCUACCUAGUUCCCaaUGAAAGAACUG.G.ACUACGGGUUAAAGU.AUUCGGUCGC NC_003454.1/1645820-1645721 SEQ ID NO:171 UAAAUAAUUUUAAUA.AA.AAUUCGUAUAA-GCCUAAU. AU.AUGGAAGGGU-.GUCCCUAC-GGUUAACC..AUAAAUUAACC.A.GCUACGAAAAAUGUU.UUACUGUGUU NZ_AADW02000027.1/2980-2880 SEQ ID NO:172 GAAUAAACGUAUAGC.AA.CGCUCGUAUAAUAGUGGGG. AU. -UGGCCCACGA.GUCUCUACCGGAUCGCC..GU-AACGAUCC. G.ACUACGGGUGGUGAG.UUACUGCUCU NZ_AADW02000005.1/179959-180061 SEQ ID NO:173 AAAUCAUACAUGCAU.CU.CCUUUGUAUAUACUCGCGA. AU.AUGGCGUGAGA. GUcucuACCGGGUCACC..UUAΆACGACCU G.ACUAUGAAGGAGCAG ACCCUUCGUA NC_006582.1/1039390-1039492 SEQ ID NO:174 AAUGUCCAAUAGGAA.AA.UACCCGUAUAAUUGCAGGA. AU.AAGGCCUGCAC. GUUUCUACCGAGCCACC..GUAAAUGGCUU. G.ACUACGGCAUGAUAA.AUGGAGCGCA NC_002737.1/930749-930845 SEQ ID NO:175
UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA. AU. -UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.UUAGGCUUGC NC_003485.1/910599-910695 SEQ ID NO:176
UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA. AU. -UGGCGCAGC-. GUCUCUACAAGACA-CC..UU-AA-UGUCU,A.ACAAUAAGUAAGCUU.UUAGGCUUGC NC_004070.1/846557-846653 SEQ ID NO:177
UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA. AU. -UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.UUAGGCUUGC NC_004606.1/977077-977173 SEQ ID NO:178
UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA, AU. -UGGCGCAGC-. GUCUCUACAAGACA-CC..UU-AA-UGUCU,A.ACAAUAAGUAAGCUU UUAGGCUUGC NC_006086.1/857664-857760 SEQ ID NO:179
UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA. AU. -UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU.A.ACAAUAAGUAAGCUU.UUAGGCUUGC NZ_AAFV01000199.1/507-411 SEQ ID NO:180
UGAAUUCAAUAAUGA.CA.UACUUAUUUAU-GCUGUGA. AU. -UGGCGCAGC-.GUCUCUACAAGACA-CC..UU-AA-UGUCU A.ACAAUAAGUAAGCUU.UUAGGCUUGC NC_004605.1/1369721-1369821 SEQ ID NO:181 UAUAAUCGCAAGCGU.UU.GCUUCGUAUAACCCCAAUG. AU.AUGGUUUGGGG.GUCUCUACCAGUUC-CC..GCAAA-GUGCU, G.AUUACGAAGAGUUGA, GAUCACUGUG NZ_AAAW03000004.1/66862-66959 SEQ ID NO:182 UGAACUGAAUAAUAA.AA.UϋϋUUAUAUAA-GUUCAUA. AU. -GGGUUGAAC-. GUCUCUACCAACUA-CC..GUAAA-UAGUU G.AUUAUAAAAAUUUCG AACGGAAUGA NC_004567.1/2968830-2968731 SEQ ID NO:183 UUAUCAAUACAACUA.AU.UGCCUAUAUAAU-GCCAUG. AU.AUGGAUGGCGA.GUUUCUACCCAGUG-CC..GUAAA-CACUG, G.ACUAUAAGCGAAUUG AGUCGACGGG NZ_AAEV01000003.1/13765-13667 SEQ ID NO:184 ACAAUCAAAUAAAAC.UU.AGCCUAUAUAAUGUC-UUA, AU.CUGGU-UGACA. GuuucuACCCAACG-cc..GUAAΆ-UGUUG. G.ACUAUAGGAAAACUA.ACUCUUAUGU NC_002570.2/806873-806971 SEQ ID NO:185
UUAAUCGAGCUCAAC.AC.UCUUCGUAUAUCCUC-UCA. AU.AUGGGAUGAGG. GUCUCUAC-AGGUA-CC..GUAAA-UACCU.A.GCUACGAAAAGAAUG.CAGUUAAUGU NZ_AAEK01000052.1/27552-27452 SEQ ID NO:186 AUUAAUUACAUAUGA.GA.AUCAUGUAUAACUCCAAGA. AU.AUGGCUUGGGG.GUCUCUACCAGGAA-CC..AAUAA-CUCCU. G.ACUACAAAAUGCGUA.UUAUAGCGUU NC_006814.1/237722-237626 SEQ ID NO:187
AUUAAGCCAAUACAA.AU.AUCU-AUAUAU-CGUCGAA. AU.AAGGUCGACA-. GUUUCUACCCACUA-CC..GUAAA-UGGUG. G.ACUAU-AGGUAAACG,AAΆUUAΆGAA
NC_005363.1/3414604-3414703 SEQ ID NO:188 AAAUAACUUCAUAGUgUU.UCCCCGUAUAU-GUUGCGA.AU.AGGGCGCAGC-. GUUUCUACCAGGCA-CC UCAAA-UGCCU.G. ACUAUGGAGGUUCUU. UGUGAAGUUG NZ_AADT03000021.1/9410-9513 SEQ ID NO:189 AAAAUAAAUAUGGCA.AU.GGCCUGUAUAAUUGGGGGA.AU.AGGGCUCCCAA.GUUUCUACCGGGCAACC. , GUAAAUUGCCCUG.GCUACAGCCUGAAAG.UGUACCUCAG NC_004668.1/2288426-2288328 SEQ ID Nθ:190 UCCUAGAGAAACAUA.GA.AGCUUGUAUAA-GGUCAAA.AU.ACGGUUGACC-.GUCUCUACCCAGCA-CC. , GUAAA-UGCUG.G.UCUAUAAGUGAAGAA.GAGCGGAUAG NC_006449.1/1182949-1183044 SEQ ID NO:191 CAAACAAUGAGAACA. -U.UACUUAUUUAU-GUCACGA.AU. -GGGCGUGAC- .GUUUCUACAAGGUG-CC.. GU-AA-CACCU.A.ACAAUAAGUAAGCUA.AUUUAGUCAU NZ_AAGS01000026.1/9921-10016 SEQ ID NO:192 CAAACAAUGAGAACA. -U.UACUUAUUUAU-GUCACGA.AU. -GGGCGUGAC-.GUUUCUACAAGGUG-CC.. GU-AA-CACCU.A.ACAAUAAGUAAGCUA.AUUUAGUCAU NZ_AAGQ01000089.l/l70-2S9 SEQ ID NO:193
UAACUUAACAAAUGU.UUcUGCUUAUAUAU-CGCUGCG.AU.ACGGGUAGCA- .GUCUCUACCCGGAG-CC.. GUAAA-CUCCG.G.ACUAUAGGUAAAGAA.GGGCCGGUAU NZ_AABF02000293.1/1-98 SEQ ID NO:194
UAGAUAUUAAAUAAA. --.AAUUCGUAUAA-GCCUAAU.AU.AUGGAAAGGU-.GUCCCUAC-GGUUAACC.. GUAAAUUAACC.A.GCUACGAAAAAUGUU.UUUGCUGUAU NC_006448.1/1185082-1185177 SEQ ID Nθ:195 CAAACAAUGAGAACA.-U.UACUUAUUUAU-GUCACGA.AU. -GGGCGUGAC-.GUUUCUACAAGGUG-CC.. GU-AA-CUCCU.A.ACAAUAAGUAAGCUA.AUUUAGUCAU NC_004567.1/2410478-2410577 SEQ ID NO:196 UUCAAAUAAGUGGUA.AU.UGCCUAUAUAAU-GUCAUG.AU.AUGGUUGACGA.GUUUCUACCCAACC-CC. , GUAAA-GGUUG.G.ACUAUAAGCAAACGA.GGUCAUCCCG NZ_AABJ03000010.1/47704-47802 SEQ ID NO:197 AUUCAGAAUGUUGAA.AA.AGCUUAUAUAUGGUCGU-A.AU.AAGG-AUGACC.GUUUCUACCCGGAG-CC. ,ACAAA-CUCAG.G.ACUAUAAGCAAUUAA.GUACUUGUGC NZ_AADT03000002.1/89184-89083 SEQ ID NO:198 GAGUCUUCUUUUAGG.UU.UCUUCGUAUAGUCCCGGAG.AU. -UGGUCCGGGG. GUUUCUACCAGGUGACC. GG-AAUCACCUUG. GCUACGAAGGGUUAU.UUCCUUUGUG NC_006814.1/1971978-1972076 SEQ ID NO:199 AUACϋϋAACAAUCAA.GU.UAUCUAUAUAU-CGUCGAA.AU.AAGGUCGACA-.GUAUCUACCCUGAG-CC. ,AUAAA-UUCAG.G.ACUAUAGGUAUCAGA.CGUCAUAAUU NZ_AAAO02000016.1/1579-1480 SEQ ID NO:200 CACUCUAGAUAUCAA.AA.UAUCUAUAUAU-CGUCGUA.AU.AAGGUCGACA-.GUUUCUACCCGGAA-CCa AUUAA-UUCUG.G.ACUAUAGGUAAUCGA.UGUCAUAAGU NC_005362.1/1949387-1949485 SEQ ID Nθ:201 AUACUUAACAAUCAA.GU.UAUCUAUAUAU-CGUCGAA.AU.AAGGUCGACA-. GUAUCUACCCUGAG-CC. AUAAA-UUCAG.G,ACUAUAGGUAUCAGA.CGUCAUAAAU NZ_AAAO02000035.1/3299-3201 SEQ ID NO:202 AUACUUAACAAUCAA.GU.UAUCUAUAUAU-CGUCGAA.AU.AAGGUCGACA-.GUAUCUACCCUGAG-CC. ,AUAAA-UUCAG.G.ACUAUAGGUAUCAGA.CGUCAUAΆAU NC_005362.1/263146-263049 SEQ ID NO:203
CACUCUAGAUAUCAA.AU.AUCU-AUAUAU-CGUCGUA.AU.AAGGUCGACA-.GUUUCUACCCGGAA-CCa.AUUAA-UUCUG.G.ACUAU-AGGUAAUCG.AUGUCAUAAG NC_005363.1/2004933-2004835 SEQ ID Nθ:204 ACUACUAACCGCGGU.UA.CACAAAUAUAA-GUCGGAG.AU.AGGGUCUGAC-.GUUUCUACCUGCCA-CC.. GUAAA-GGGCA.G.UCUAUUUGGAGCGAA.UAUAUGUUGA NZ_AADT03000002.1/98293-98192 SEQ ID Nθ:205 GACGAAUAUUACUAU.UA.GACUCGUAUAACCCCGGCG.AU.-GGGGCCGGGG.GUCUCUACCAGGUGACC.. GG-AAUCACCUCG. GCUACGAGGGUGAGC.GGCAGCUGGU NZ_AABH0200003S.1/17403-17500 SEQ ID NO:206 CAACUAAAGAAGUUA.UU.UGCAGAUAUAU-CGUUGGA.AA.ACGGCCAACA-. GUUUCUACCACGCC-CC. -AAAA-GUCGU.G,ACUAUCCGCAAAUGU.UUUUGA.CGAU
WC_006055.1/484139-484044 SEQ ID NO:207
UAUAA-1-A.UUAΑUAUG.AA.AACUUGUAUAAUCCUUC--.AU.AUCGGGAAGGA. GUCUCUACCUAACA-CC . AA-UGUUA.G.AUUAUGAGUUUUAUG.GUUUUCGCUA NZ_AABH02000272.1/211-308 SEQ ID NO:208
CAACUAAAGAAGUUA.UU.UGCAGAUAUAU-CGUUGGA.AA.ACGGCCAACA-.GUUUCUACCACGCC-CC . -AAAA-GUCGU.G.ACUAUCCGCAAAUGU.UUCUGACGAU NZ_AABJ03000003.1/47905-47810 SEQ ID NO:209 AAGAUAAAUAGCAAC.CA.AGCAGGUAUAU-CGUCGGA.UA.AUGGCUGACA-.GUUUCUACCCAACA-CC . AA-UGUUG.G.ACUAUCUGUGGAUGU.CUUUUUGGCG NZ_AABH02000038.1/17559-17463 SEQ ID NO:210 UAUUCUAUGUGAAAU.UU.AGCUGAUAUAGUAUCGA--.AUaAUGG-UCGAUU.GUUUCUAGCCAGCA-CC . ---CA-UGCUG.GaACUAUCAUAAACAUG.UUAUUUAAUU NC_006055.1/397027-397123 SEQ ID NO:211
AAUAAUUAAAUAUAA.AA.AACUUAUACAU-GACAACAUAU.-UGGGUUGUC-.GAC-CUGCCUCUGGACC . -U--AUCCUUA.G.ACUAUAAGCGUGAGG.UUUUUUUACA #=GC RF SEQ ID NO:212 aAAauuaaAaAAAaA.au.aacUCgUAUAAucucgggA.AU.AUGGcccgaga.GUUUCUACCaggcaaCC .GUAAAuugccu.G.ACUAcGAguaAauuu.uau.uUaUu.uu
#=GC SS cons
((((((((,,, •>.,,))))))))
//
Table 6. Atomic Coordinates of Guanine Riboswitch
HEADER RIBONUCLEIC ACID 05-AUG-04 1U8D
TITLE GUANINE RIBOSWITCH BOUND TO HYPOXANTHINE
COMPKD M0L_ID: 1;
COMPND 2 MOLECULE: XPT-PBUX MRNA;
COMPND 3 CHAIN: A;
COMPND 4 ENGINEERED: YES;
COMPND 5 OTHER_DETAILS: G-BOX MRNA
SOURCE M0L_ID: 1;
SOURCE 2 SYNTHETIC: YES;
SOURCE 3 OTHER_DETAILS: RNA WAS PREPARED BY IN VITRO TRANSCRIPTION.
SOURCE 4 THE SEQUENCE OF THIS RNA CAN BE FOUND NATURALLY IN
SOURCE 5 BACILLUS SUBTILIS (BACTERIA)
KEYWDS RNA-LIGAND COMPLEX, DOUBLE HELIX, BASE TRIPLES, BASE
KEYWDS 2 QUADRUPLES, MRNA
EXPDTA X-RAY DIFFRACTION
AUTHOR R.T.BATEY,S.D.GILBERT,R.K.MONTANGE
REVDAT 23-NOV-04 1U8D O
JRNL AUTH R.T.BATEY,S.D.GILBERT,R.K.MONTANGE
JRNL TITL STRUCTURE OF A NATURAL GUANINE-RESPONSIVE
JRNL TITL 2 RIBOSWITCH COMPLEXED WITH THE METABOLITE
JRNL TITL 3 HYPOXANTHINE
JRNL REF NATURE V. 432 411 2004
JRNL REFN ASTM NATUAS UK ISSN 0028-0836
REMARK
REMARK
REMARK RESOLUTION. 1.95 ANGSTROMS.
REMARK
REMARK REFINEMENT.
REMARK PROGRAM CNS 1.1
REMARK AUTHORS BRUNGER,ADAMS,CLORE,DELANO,GROS,GROSSE-
REMARK KUNSTLEVE,JIANG,KUSZEWSKI,NILGES, PANNU,
REMARK READ,RICE,SIMONSON,WARREN
REMARK
REMARK REFINEMENT TARGET : ENGH & HUBER
REMARK
REMARK DATA USED IN REFINEMENT.
REMARK RESOLUTION RANGE HIGH (ANGSTROMS) 1.95
REMARK RESOLUTION RANGE LOW (ANGSTROMS) 19.41
REMARK DATA CUTOFF (SIGMA(F)) 0.000
REMARK DATA CUTOFF HIGH (ABS (F) ) 1344227.320
REMARK DATA CUTOFF LOW (ABS (F) ) 0.0000
REMARK COMPLETENESS (WORKING+TEST) (%) 92.8
REMARK NUMBER OF REFLECTIONS 25786
REMARK
REMARK FIT TO DATA USED IN REFINEMENT.
REMARK CROSS-VALIDATION METHOD THROUGHOUT
REMARK FREE R VALUE TEST SET SELECTION RANDOM
REMARK R VALUE (WORKING SET) 0.178
REMARK FREE R VALUE 0.228
REMARK FREE R VALUE TEST SET SIZE (%) 9.400
REMARK FREE R VALUE TEST SET COUNT 2430
REMARK ESTIMATED ERROR OF FREE R VALUE 0.005
REMARK
REMARK FIT IN THE HIGHEST RESOLUTION BIN.
REMARK TOTAL NUMBER OF BINS USED : 10
REMARK BIN RESOLUTION RANGE HIGH (A) : 1.95
REMARK BIN RESOLUTION RANGE LOW (A) : 2.02 REMARK 3 BIN COMPLETENESS (WORKING+TEST) (%) 85.20 REMARK 3 REFLECTIONS IN BIN (WORKING SET) 2105 REMARK 3 BIN R VALUE (WORKING SET) 0.2480 REMARK 3 BIN FREE R VALUE 0.3180 REMARK 3 BIN FREE R VALUE TEST SET SIZE (%) 10.40 REMARK 3 BIN FREE R VALUE TEST SET COUNT 245 REMARK 3 ESTIMATED ERROR OF BIN FREE R VALUE 0.020 REMARK 3 REMARK 3 NUMBER OF NON-HYDROGEN ATOMS USED IN REFINEMENT. REMARK 3 PROTEIN ATOMS : 0 REMARK 3 NUCLEIC ACID ATOMS : 1426 REMARK 3 HETEROGEN ATOMS : 108 REMARK 3 SOLVENT ATOMS : 333 REMARK 3 REMARK 3 B VALUES. REMARK 3 FROM WILSON PLOT (A**2) 14.90 REMARK 3 MEAN B VALUE (OVERALL, A**2) 22.60 REMARK 3 OVERALL ANISOTROPIC B VALUE. REMARK 3 BIl (A**2) 0.26000 REMARK 3 B22 (A**2) -0.88000 REMARK 3 B33 (A**2) 0.62000 REMARK 3 B12 (A**2) 0.00000 REMARK 3 B13 (A**2) -0.48000 REMARK 3 B23 (A**2) 0.00000 REMARK 3 REMARK 3 ESTIMATED COORDINATE ERROR, REMARK 3 ESD FROM LUZZATI PLOT (A) 0. 19 REMARK 3 ESD FROM SIGMAA (A) 0.14 REMARK 3 LOW RESOLUTION CUTOFF (A) 5.00 REMARK 3 REMARK 3 CROSS-VALIDATED ESTIMATED COORDINATE ERROR. REMARK 3 ESD FROM C-V LUZZATI PLOT (A) : 0.25 REMARK 3 ESD FROM C-V SIGMAA (A) : 0.23 REMARK 3 REMARK 3 RMS DEVIATIONS FROM IDEAL VALUES. REMARK 3 BOND LENGTHS (A) : 0.009 REMARK 3 BOND ANGLES (DEGREES) : 1.60 REMARK 3 DIHEDRAL ANGLES (DEGREES) : 19.10 REMARK 3 IMPROPER ANGLES (DEGREES) : 1.83 REMARK 3 REMARK 3 ISOTROPIC THERMAL MODEL RESTRAINED REMARK 3 REMARK 3 ISOTROPIC THERMAL FACTOR RESTRA \IINNTTSS.. RMS SIGMA REMARK 3 MAIN-CHAIN BOND (A**2) : 0.810 ; 1.500 REMARK 3 MAIN-CHAIN ANGLE (A**2) : 0.000 ; 2.000 REMARK 3 SIDE-CHAIN BOND (A**2) : 1.520 ; 2.000 REMARK 3 SIDE-CHAIN ANGLE (A**2) : 1.970 ; 2.500 REMARK 3 REMARK 3 BULK SOLVENT MODELING. REMARK 3 METHOD USED : FLAT MODEL REMARK 3 KSOL : 0.36 REMARK 3 BSOL : 48.44 REMARK 3 REMARK 3 NCS MODEL NULL REMARK 3 REMARK 3 NCS RESTRAINTS. RMS SIGMA/WEIGHT REMARK 3 GROUP 1 POSITIONAL (A) : NULL ; NULL REMARK 3 GROUP 1 B-FACTOR (A**2) : NULL ; NULL REMARK 3 REMARK 3 PARAMETER FILE 1 DNA-RNA_REP.PARAM
REMARK 3 PARAMETER FILE 2 ION.PARAM
REMARK 3 PARAMETER FILE 3 HPA4.PARAM
REMARK 3 PARAMETER FILE 4 COHEX-REP.PARAM
REMARK 3 PARAMETER FILE 5 WATER.PARAM
REMARK 3 PARAMETER FILE 6 NULL
REMARK 3 TOPOLOGY FILE 1 DNA-RNA.TOP
REMARK 3 TOPOLOGY FILE 2 ION.TOP
REMARK 3 TOPOLOGY FILE 3 HPA4.TOP
REMARK 3 TOPOLOGY FILE 4 COHEX_REP.TOP
REMARK 3 TOPOLOGY FILE 5 WATER.TOP
REMARK 3 TOPOLOGY FILE 6 NULL
REMARK 3
REMARK 3 OTHER REFINEMENT REMARKS: NULL
REMARK 4
REMARK 4 1U8D COMPLIES WITH FORMAT V. 2.3, 09-JULY-1998
REMARK 100
REMARK 100 THIS ENTRY HAS BEEN PROCESSED BY THE NUCLEIC ACID DATABASE
REMARK 100 ON 13-AUG-2004.
REMARK 100 THE NDB ID CODE IS UR0039.
REMARK 105
REMARK 105 THE PROTEIN DATA BANK HAS ADOPTED THE SACCHARIDE CHEMISTS
REMARK 105 NOMENCLATURE FOR ATOMS OF THE DEOXYRIBOSE/RIBOSE MOIETY
REMARK 105 RATHER THAN THAT OF THE NUCLEOSIDE CHEMISTS. THE RING
REMARK 105 OXYGEN ATOM IS LABELLED 04* INSTEAD OF 01*.
REMARK 200
REMARK 200 EXPERIMENTAL DETAILS
REMARK 200 EXPERIMENT TYPE X-RAY DIFFRACTION
REMARK 200 DATE OF DATA COLLECTION NULL
REMARK 200 TEMPERATURE (KELVIN) 100.0
REMARK 200 PH 7.50
REMARK 200 NUMBER OF CRYSTALS USED 1
REMARK 200
REMARK 200 SYNCHROTRON (Y/N) N
REMARK 200 RADIATION SOURCE ROTATING ANODE
REMARK 200 BEAMLINE NULL
REMARK 200 X-RAY GENERATOR MODEL RIGAKU
REMARK 200 MONOCHROMATIC OR LAUE (M/L) M
REMARK 200 WAVELENGTH OR RANGE (A) 1.5418
REMARK 200 MONOCHROMATOR NI FILTER
REMARK 200 OPTICS NULL
REMARK 200
REMARK 200 DETECTOR TYPE NULL
REMARK 200 DETECTOR MANUFACTURER NULL
REMARK 200 INTENSITY-INTEGRATION SOFTWARE R-AXIS
REMARK 200 DATA SCALING SOFTWARE R-AXIS
REMARK 200
REMARK 200 NUMBER OF UNIQUE REFLECTIONS 28013
REMARK 200 RESOLUTION RANGE HIGH (A) 1.800
REMARK 200 RESOLUTION RANGE LOW (A) 20.000
REMARK 200 REJECTION CRITERIA (SIGMA(I)) 3.000
REMARK 200
REMARK 200 OVERALL.
REMARK 200 COMPLETENESS FOR RANGE (%) NULL
REMARK 200 DATA REDUNDANCY 2.900
REMARK 200 R MERGE (I) 0.05500
REMARK 200 R SYM (I) 0.03700
REMARK 200 <I/SIGMA(I)> FOR THE DATA SET 21.5000
REMARK 200 REMARK 200 IN THE HIGHEST RESOLUTION SHELL. REMARK 200 HIGHEST RESOLUTION SHELL RANGE HIGH (A) : 1 .80 REMARK 200 HIGHEST RESOLUTION SHELL RANGE LOW (A) : 1 .86 REMARK 200 COMPLETENESS FOR SHELL (%) : 100. 0 REMARK 200 DATA REDUNDANCY IN SHELL : NULL REMARK 200 R MERGE FOR SHELL (I) : NULL REMARK 200 R SYM FOR SHELL (I) : NULL REMARK 200 <I/SIGMA(I)> FOR SHELL : NULL REMARK 200 REMARK 200 DIFFRACTION PROTOCOL: SINGLE WAVELENGTH REMARK 200 METHOD USED TO DETERMINE THE STRUCTURE: SAD REMARK 200 SOFTWARE USED: SOLVE REMARK 200 STARTING MODEL: NULL REMARK 200 REMARK 200 REMARK: NULL REMARK 280 REMARK 280 CRYSTAL REMARK 280 SOLVENT CONTENT, VS (%) : 46.00 REMARK 280 MATTHEWS COEFFICIENT, VM (ANGSTROMS**3/DA) : 2.29 REMARK 280 REMARK 280 CRYSTALLIZATION CONDITIONS: 10 MM COBALT HEXAMMINE, 200 MM REMARK 280 AMMONIUM ACETATE, 25% PEG 2K, PH 7.5, VAPOR DIFFUSION, HANGING REMARK 280 DROP, TEMPERATURE 23K REMARK 290 REMARK 290 CRYSTALLOGRAPHIC SYMMETRY REMARK 290 SYMMETRY OPERATORS FOR SPACE GROUP: C 1 2 1 REMARK 290 REMARK 290 SYMOP SYMMETRY REMARK 290 NNNMMM OPERATOR REMARK 290 1555 X,Y,Z REMARK 290 2555 -X,Y, -Z REMARK 290 3555 l/2+X,l/2+Y, Z REMARK 290 4555 l/2-X,l/2+Y, -Z REMARK 290 REMARK 290 WHERE NNN -> OPERATOR NUMBER REMARK 290 MMM -> TRANSLATION VECTOR REMARK 290 REMARK 290 CRYSTALLOGRAPHIC SYMMETRY TRANSFORMATIONS REMARK 290 THE FOLLOWING TRANSFORMATIONS OPERATE ON THE ATOM/HETATM REMARK 290 RECORDS IN THIS ENTRY TO PRODUCE CRYSTALLOGRAPHICALLY REMARK 290 RELATED MOLECULES. REMARK 290 SMTRYl 1 1.0 00000000000 0.000000 0.000000 0.00000 REMARK 290 SMTRY2 1 0.000000000000 1.000000 0.000000 0.00000 REMARK 290 SMTRY3 1 0.000000000000 0.000000 1.000000 0.00000 REMARK 290 SMTRYl 2 -1.000000000000 0.000000 0.000000 0.00000 REMARK 290 SMTRY2 2 0.000000 1.000000 0.000000 0.00000 REMARK 290 SMTRY3 2 0.000000 0.000000 -1.000000 0.00000 REMARK 290 SMTRYl 3 1.000000 0.000000 0.000000 66.14900 REMARK 290 SMTRY2 3 0.000000 000000 0.000000 17.62500 REMARK 290 SMTRY3 3 0.000000 000000 1..000000 0.00000 REMARK 290 SMTRYl 4 -1 000000 0.000000 0..000000 66.14900 REMARK 290 SMTRY2 4 0.000000 1.000000 0.000000 17.62500 REMARK 290 SMTRY3 4 0.000000 0.000000 -1.000000 0.00000 REMARK 290 REMARK 290 REMARK: NULL REMARK 300 REMARK 300 BIOMOLECULE: 1 REMARK 300 THIS ENTRY CONTAINS THE CRYSTALLOGRAPHIC ASYMMETRIC UNIT REMARK 300 WHICH CONSISTS OF 1 CHAIN(S) . SEE REMARK 350 FOR REMARK 300 INFORMATION ON GENERATING THE BIOLOGICAL MOLECULE(S) .
REMARK 350
REMARK 350 GENERATING THE BIOMOLΞCULE
REMARK 350 COORDINATES FOR A COMPLETE MULTIMER REPRESENTING THE KNOWN
REMARK 350 BIOLOGICALLY SIGNIFICANT OLIGOMERIZATION STATE OF THE
REMARK 350 MOLECULE CAN BE GENERATED BY APPLYING BIOMT TRANSFORMATIONS
REMARK 350 GIVEN BELOW, BOTH NON-CRYSTALLOGRAPHIC AND
REMARK 350 CRYSTALLOGRAPHIC OPERATIONS ARE GIVEN.
REMARK 350
REMARK 350 BIOMOLECULE: 1
REMARK 350 APPLY THE FOLLOWING TO CHAINS: A
REMARK 350 BIOMTl 1 1.000000 0.000000 0.000000 0.00000
REMARK 350 BIOMT2 1 0.000000 1.000000 0.000000 0.00000
REMARK 350 BIOMT3 1 0.000000 0.000000 1.000000 0.00000
REMARK 465
REMARK 465 MISSING RESIDUES
REMARK 465 THE FOLLOWING RESIDUES WERE NOT LOCATED IN THE
REMARK 465 EXPERIMENT. (M=MODEL NUMBER; RES=RESIDUE NAME; C=CHAIN
REMARK 465 IDENTIFIER; SSSEQ=SEQUENCE NUMBER; I=INSERTION CODE.)
REMARK 465
REMARK 465 M RES C SSSEQI
REMARK 465 A A 82
REMARK 500
REMARK 500 GEOMETRY AND STEREOCHEMISTRY
REMARK 500 SUBTOPIC: CLOSE CONTACTS
REMARK 500
REMARK 500 THE FOLLOWING ATOMS THAT ARE RELATED BY CRYSTALLOGRAPHIC
REMARK 500 SYMMETRY ARE IN CLOSE CONTACT. AN ATOM LOCATED WITHIN 0.15
REMARK 500 ANGSTROMS OF A SYMMETRY RELATED ATOM IS ASSUMED TO BE ON A
REMARK 500 SPECIAL POSITION AND IS, THEREFORE, LISTED IN REMARK 375
REMARK 500 INSTEAD OF REMARK 500. ATOMS WITH NON-BLANK ALTERNATE
REMARK 500 LOCATION INDICATORS ARE NOT INCLUDED IN THE CALCULATIONS.
REMARK 500
REMARK 500 DISTANCE CUTOFF:
REMARK 500 2.2 ANGSTROMS FOR CONTACTS NOT INVOLVING HYDROGEN ATOMS
REMARK 500 1.6 ANGSTROMS FOR CONTACTS INVOLVING HYDROGEN ATOMS
REMARK 500
REMARK 500 ATML RES C SSEQI ATM2 RES C SSEQI SSYMOP DISTANCE
REMARK 500 OIP A 15 OIP G 15 2556 1.81
SEQRES 1 A 68 G G A C A U A U A A U C G
SEQRES 2 A 68 C G U G G A U A U G G C A
SEQRES 3 A 68 C G C A A G U U U C U A C
SEQRES 4 A 68 C G G G C A C C G U A A A
SEQRES 5 A 68 U G U C C G A C U A U G U
SEQRES 6 A 68 C C A
HET SPD 95 10
HET ACT 96 4
HET NCO 101 7
HET NCO 102 7
HET NCO 103 7
HET NCO 104 7
HET NCO 105 7
HET NCO 106 7
HET NCO 107 7
HET NCO 108 7
HET NCO 109 7
HET NCO 110 7
HET NCO 111 7
HET NCO 112 7 HET HPA 90 10
HETNAM SPD SPERMIDINE
HETNAM ACT ACETATE ION
HETNAM NCO COBALT HEXAMMINE ION
HETNAM HPA HYPOXANTHINE
HETSYN SPD N- (2-AMINO-PROPYL) -1,4-DIAMINOBUTANE; PA(34)
FORMUL 2 SPD C7 H19 N3
FORMUL 3 ACT C2 H3 02 1-
FORMUL 4 NCO 12 (H18 N6 COl 3+)
FORMUL 16 HPA C5 H4 N4 Ol
FORMUL 17 HOH *333 (H2 01)
CRYSTl 132 .298 35.250 42.225 90.00 90.95 90.00 C 1 2 1 4
ORIGXl 1.000000 0.000000 0.000000 0.00000
0RIGX2 0.000000 1.000000 0.000000 0.00000
0RIGX3 0.000000 0.000000 1.000000 0.00000
SCALEl 0.007559 0.000000 0.000125 0.00000
SCALE2 0.000000 0.028369 0.000000 0.00000
SCALE3 0.000000 0.000000 0.023686 0.00000
ATOM 1 O3P G A 15 2.257 5.451 23.138 1.00 79.41 O
ATOM 2 P G A 15 2.002 5.937 21.715 1.00 79.84 P
ATOM 3 O1P G A 15 0.530 5.848 21.317 1.00 79.33 O
ATOM 4 02P G A 15 2.597 7.317 21.458 1.00 79.53 O
ATOM 5 05* G A 15 2.809 4.906 20.739 1.00 77.83 O
ATOM 6 C5* G A 15 4.171 4.541 21.029 1.00 75.69 C
ATOM 7 C4* G A 15 4.238 3.119 21.543 1.00 73.85 C
ATOM δ 04* G A 15 2.948 2.746 22.099 1.00 73.98 O
ATOM 9 C3* G A 15 4.556 2.024 20.538 1.00 72.87 C
ATOM 10 03* G A 15 5.945 2.038 20.158 1.00 70.52 O
ATOM 11 C2* G A 15 3.933 0.791 21.200 1.00 73.00 C
ATOM 12 02* G A 15 4.650 0.231 22.285 1.00 73.25 O
ATOM 13 Cl* G A 15 2.648 1.397 21.766 1.00 72.98 C
ATOM 14 N9 G A 15 1.581 1.433 20.767 1.00 72.54 N
ATOM 15 C8 G A 15 0.950 2.560 20.284 1.00 72.23 C
ATOM 16 N7 G A 15 0.045 2.296 19.382 1.00 71.52 N
ATOM 17 C5 G A 15 0.072 0.912 19.261 1.00 71.53 C
ATOM 18 C6 G A 15 -0.692 0.051 18.433 1.00 71.10 C
ATOM 19 06 G A 15 -1.579 0.355 17.620 1.00 71.14 O
ATOM 20 Nl G A 15 -0.339 -1.284 18.619 1.00 70.53 N
ATOM 21 C2 G A 15 0.626 -1.734 19.493 1.00 70.46 C
ATOM 22 N2 G A 15 0.829 -3.059 19.517 1.00 69.85 N
ATOM 23 N3 G A 15 1.341 -0.941 20.279 1.00 70.84 N
ATOM 24 C4 G A 15 1.015 0.361 20.111 1.00 71.66 C
ATOM 25 P G A 16 6.984 0.956 20.748 1.00 68.91 P
ATOM 26 O1P G A 16 6.845 0.908 22.231 1.00 68.62 O
ATOM 27 O2P G A 16 8.306 1.282 20.155 1.00 68.38 O
ATOM 28 05* G A 16 6.531 -0.448 20.141 1.00 66.18 O
ATOM 29 C5* G A 16 7.070 -1.646 20.684 1.00 62.29 C
ATOM 30 C4* G A 16 6.512 -2.866 19.993 1.00 59.49 C
ATOM 31 04* G A 16 5.058 -2.824 20.037 1.00 58.98 O
ATOM 32 C3* G A 16 6.820 -2.981 18.513 1.00 57.81 C
ATOM 33 03* G A 16 8.106 -3.541 18.297 1.00 56.43 O
ATOM 34 C2* G A 16 5.709 -3.902 18.025 1.00 57.51 C
ATOM 35 02* G A 16 5.944 -5.257 18.332 1.00 56.72 O
ATOM 36 Cl* G A 16 4.530 -3.420 18.863 1.00 57.12 C
ATOM 37 N9 G A 16 3.712 -2.448 18.155 1.00 55.86 N
ATOM 38 C8 G A 16 3.719 -1.080 18.289 1.00 55.10 C
ATOM 39 N7 G A 16 2.851 -0.488 17.514 1.00 55.02 N
ATOM 40 C5 G A 16 2.239 -1.529 16.825 1.00 54.19 C
ATOM 41 C6 G A 16 1.206 -1.512 15.851 1.00 53.76 C ATOM 42 06 G A 16 0.597 -0.540 15.389 1.00 52.60 o
ATOM 43 Nl G A 16 0.894 -2.796 15.416 1.00 53.12 N
ATOM 44 C2 G A 16 1.494 -3.945 15.859 1.00 53.54 C
ATOM 45 N2 G A 16 1.070 -5.089 15.307 1.00 53.43 N
ATOM 46 N3 G A 16 2.446 -3.975 16.775 1.00 53.95 N
ATOM 47 C4 G A 16 2.767 -2.742 17.208 1.00 54.84 C
ATOM 48 P A A 17 8.823 -3.354 16.870 1.00 55.82 P
ATOM 49 OIP A A 17 10.149 -4.017 16.909 1.00 55.72 o
ATOM 50 O2P A A 17 8.728 -1.914 16.506 1.00 55.26 o
ATOM 51 05* A A 17 7.903 -4.185 15.880 1.00 53.03 o
ATOM 52 C5* A A 17 7.817 -5.594 15.991 1.00 51.32 C
ATOM 53 C4* A A 17 7.075 -6.140 14.809 1.00 49.43 C
ATOM 54 04* A A 17 5.666 -5.818 14.928 1.00 48.16 o
ATOM 55 C3* A A 17 7.494 -5.502 13.505 1.00 48.31 C
ATOM 56 03* A A 17 8.659 -6.134 13.012 1.00 48.66 o
ATOM 57 C2* A A 17 6.266 -5.722 12.634 1.00 47.38 C
ATOM 58 02* A A 17 6.231 -7.032 12.106 1.00 45.76 o
ATOM 59 Cl* A A 17 5.138 -5.512 13.653 1.00 46.63 C
ATOM 60 N9 A A 17 4.655 -4.134 13.716 1.00 45.43 N
ATOM 61 C8 A A 17 5.126 -3.118 14.513 1.00 45.03 C
ATOM 62 N7 A A 17 4.471 -1.988 14.375 1.00 45.06 N
ATOM 63 C5 A A 17 3.505 -2.278 13.419 1.00 44.35 C
ATOM 64 C6 A A 17 2.480 -1.507 12.837 1.00 43.36 C
ATOM 65 N6 A A 17 2.232 -0.230 13.146 1.00 43.22 N
ATOM 66 Nl A A 17 1.698 -2.105 11.915 1.00 43.31 N
ATOM 67 C2 A A 17 1.930 -3.390 11.607 1.00 43.50 C
ATOM 68 N3 A A 17 2.852 -4.218 12.085 1.00 44.12 N
ATOM 69 C4 A A 17 3.616 -3.596 12.998 1.00 44.78 C
ATOM 70 P C A 18 9.428 -5.494 11.763 1.00 48.10 P
ATOM 71 OIP C A 18 10.761 -6.124 11.593 1.00 48.55 O
ATOM 72 02P C A 18 9.319 -4.016 11.886 1.00 47.97 O
ATOM 73 05*, C A 18 8.513 -5.918 10.541 1.00 47.08 O
ATOM 74 C5* C A 18 8.419 -5.064 9.438 1.00 44.02 C
ATOM 75 C4* C A 18 7.134 -5.285 8.695 1.00 42.22 C
ATOM 76 04* C A 18 5.989 -5.055 9.551 1.00 41.33 o
ATOM 77 C3* C A 18 6.973 -4.294 7.574 1.00 40.44 C
ATOM 78 03* C A 18 7.752 -4.737 6.489 1.00 38.62 o
ATOM 79 C2* C A 18 5.468 -4.277 7.368 1.00 40.56 C
ATOM 80 O2* C A 18 5.017 -5.381 6.605 1.00 39.99 o
ATOM 81 Cl* C A 18 4.986 -4.366 8.819 1.00 40.15 C
ATOM 82 Nl C A 18 4.886 -3.032 9.427 1.00 39.26 N
ATOM 83 C2 C A 18 3.854 -2.179 9.053 1.00 39.01 C
ATOM 84 02 C A 18 3.040 -2.556 8.196 1.00 39.82 o
ATOM 85 N3 C A 18 3.765 -0.960 9.635 1.00 38.83 N
ATOM 86 C4 C A 18 4.660 -0.585 10.547 1.00 37.75 C
ATOM 87 N4 C A 18 4.524 0.621 11.092 1.00 36.78 N
ATOM 88 C5 C A 18 5.730 -1.431 10.939 1.00 37.85 C
ATOM 89 C6 C A 18 5.804 -2.636 10.360 1.00 39.12 C
ATOM 90 P A A 19 8.667 -3.688 5.725 1.00 36.72 P
ATOM 91 OIP A A 19 9.480 -4.427 4.735 1.00 37.50 o
ATOM 92 02P A A 19 9.331 -2.852 6.746 1.00 36.38 o
ATOM 93 05* A A 19 7.574 -2.817 4.973 1.00 35.32 o
ATOM 94 C5* A A 19 6.619 -3.455 4.146 1.00 32.02 C
ATOM 95 C4* A A 19 5.610 -2.459 3.661 1.00 30.54 C
ATOM 96 04* A A 19 4.737 -2.066 4.755 1.00 30.35 o
ATOM 97 C3* A A 19 6.211 -1.148 3.184 1.00 30.44 C
ATOM 98 03* A A 19 6.700 -1.256 1.853 1.00 28.92 o
ATOM 99 C2* A A 19 5.026 -0.208 3.307 1.00 29.76 C
ATOM 100 02* A A 19 4.113 -0.427 2.254 1.00 30.88 o ATOM 101 Cl* A A 19 4.384 -0.698 4.606 1.00 29.58 C
ATOM 102 N9 A A 19 4.840 0.046 5.783 1.00 27.74 N
ATOM 103 C8 A A 19 5.802 -0.299 6.700 1.00 26.96 C
ATOM 104 N7 A A 19 5.955 0.583 7.666 1.00 26.99 N
ATOM 105 C5 A A 19 5.036 1.577 7.358 1.00 26.41 C
ATOM 106 C6 A A 19 4.684 2.780 8.000 1.00 25.24 C
ATOM 107 N6 A A 19 5.209 3.195 9.153 1.00 25.00 N
ATOM 108 Nl A A 19 3.739 3.547 7.415 1.00 25.67 N
ATOM 109 C2 A A 19 3.176 3.116 6.284 1.00 24.91 C
ATOM 110 N3 A A 19 3.401 1.998 5.602 1.00 25.13 N
ATOM 111 C4 A A 19 4.353 1.265 6.194 1.00 26.86 C
ATOM 112 P U A 20 7.901 -0.307 1.370 1.00 27.68 P
ATOM 113 OIP U A 20 8.299 -0.722 0.006 1.00 29.66 o
ATOM 114 02P U A 20 8.908 -0.246 2.444 1.00 27.22 o
ATOM 115 05* U A 20 7.230 1.122 1.256 1.00 25.69 o
ATOM 116 C5* U A 20 6.245 1.373 0.283 1.00 22.92 C
ATOM 117 C4* U A 20 5.719 2.780 0.423 1.00 21.41 C
ATOM 118 04* U A 20 5.042 2.907 1.710 1.00 20.04 o
ATOM 119 C3* U A 20 6.777 3.874 0.487 1.00 19.50 C
ATOM 120 03* U A 20 7.259 4.265 0.783 1.00 20.05 o
ATOM 121 C2* U A 20 6.027 5.000 1.173 1.00 18.58 C
ATOM 122 02* U A 20 5.109 5.617 0.301 1.00 17.80 o
ATOM 123 Cl* U A 20 5.220 4.231 2.212 1.00 19.20 C
ATOM 124 Nl U A 20 5..900 4.191 3.517 1.00 18.08 N
ATOM 125 C2 U A 20 5..668 5.253 4.365 1.00 18.46 C
ATOM 126 02 U A 20 4.951 6.205 4.059 1.00 15.17 o
ATOM 127 N3 U A 20 6.303 5.174 5.580 1.00 16.68 N
ATOM 128 C4 U A 20 7.127 4.167 6.028 1.00 18.60 C
ATOM 129 04 U A 20 7.615 4.248 7.167 1.00 15.59 O
ATOM 130 C5 U A 20 7.333 3.101 5 087 1.00 17.76 C
ATOM 131 C6 U A 20 6.725 3.154 3.888 1.00 17.60 C
ATOM 132 P A A 21 8.804 4.707 0.950 1.00 21.07 P
ATOM 133 OIP A A 21 8.958 5.041 2.388 1.00 19.56 o
ATOM 134 02P A A 21 9.726 3.742 0.339 1.00 17.87 o
ATOM 135 05* A A 21 8.914 6.057 0.104 1.00 18.44 o
ATOM 136 C5* A A 21 8.311 7.240 0.587 1.00 16.49 C
ATOM 137 C4* A A 21 8.458 8.398 0.384 1.00 15.52 C
ATOM 138 04* A A 21 7.781 8.097 1.635 1.00 16.13 o
ATOM 139 C3* A A 21 9.885 8.660 0.849 1.00 16.12 C
ATOM 140 03* A A 21 10.578 9.415 -0 119 1.00 15.84 o
ATOM 141 C2* A A 21 9.670 9.462 2.123 1.00 14.81 C
ATOM 142 02* A A 21 9.390 10.824 1.853 1.00 16.92 o
ATOM 143 Cl* A A 21 8.420 8.794 2.696 1.00 16.03 C
ATOM 144 N9 A A 21 8.745 7.898 3.805 1.00 14.49 N
ATOM 145 C8 A A 21 9.200 6.611 3.807 1.00 14.35 C
ATOM 146 N7 A A 21 9.500 6.161 5.002 1.00 14.14 N
ATOM 147 C5 A A 21 9.192 7.227 5.846 1.00 13.55 C
ATOM 148 C6 A A 21 9.288 7.400 7.255 1.00 13.46 C
ATOM 149 N6 A A 21 9.760 6.467 8.083 1.00 13.17 N
ATOM 150 Nl A A 21 8.881 8.588 7.778 1.00 14.41 N
ATOM 151 C2 A A 21 8.419 9.532 6.935 1.00 14.69 C
ATOM 152 N3 A A 21 8.294 9.483 5.600 1.00 16.23 N
ATOM 153 C4 A A 21 8.702 8.288 5.122 1.00 13.95 C
ATOM 154 P U A 22 11.833 8.778 -0.860 1.00 17.19 P
ATOM 155 OIP U A 22 12.143 9.766 -1.930 1.00 14.62 o
ATOM 156 02P U A 22 11.572 7.360 1.223 1.00 16.11 o
ATOM 157 05* U A 22 12.955 8.795 0.274 1.00 17.34 o
ATOM 158 C5* U A 22 14.086 7.879 0.260 1.00 15.80 C
ATOM 159 C4* U A 22 14.668 7.761 1.654 1.00 13.76 C ATOM 160 04* U A 22 14.980 9.104 2.130 1.00 13.53 o
ATOM 161 C3* U A 22 13.575 7.229 2.579 1.00 14.34 C
ATOM 162 03* U A 22 14.069 6.400 3.635 1.00 16.58 o
ATOM 163 C2* U A 22 12.950 8.479 3.204 1.00 14.31 C
ATOM 164 02* U A 22 12.419 8.301 4.510 1.00 12.30 o
ATOM 165 Cl* U A 22 14.140 9.444 3.207 1.00 13.55 C
ATOM 166 Nl U A 22 13.832 10.876 3.147 1.00 14.29 N
ATOM 167 C2 U A 22 14.055 11.619 4.286 1.00 13.16 C
ATOM 168 02 U A 22 14.534 11.153 5.327 1.00 15.12 o
ATOM 169 N3 U A 22 13.706 12.925 4.186 1.00 13.49 N
ATOM 170 C4 U A 22 13.172 13.569 3.099 1.00 13.63 C
ATOM 171 04 U A 22 12.789 14.728 3.228 1.00 12.03 O
ATOM 172 C5 ϋ A 22 12.990 12.747 1.948 1.00 12.63 C
ATOM 173 C6 U A 22 13.320 11.453 2.009 1.00 13.42 C
ATOM 174 P A A 23 14.502 4.869 3.397 1.00 16.90 P
ATOM 175 OIP A A 23 13.773 4.291 2.214 1.00 17.04 o
ATOM 176 O2P A A 23 14.351 4.224 4.726 1.00 16.03 o
ATOM 177 05* A A 23 16.057 4.949 3.092 1.00 16.99 o
ATOM 178 C5* A A 23 16.937 5.684 3.940 1.00 16.50 C
ATOM 179 C4* A A 23 18.220 5.970 3.200 1.00 16.03 C
ATOM 180 04* A A 23 17.896 6.559 1.914 1.00 15.15 o
ATOM 181 C3* A A 23 19.123 7.000 3.865 1.00 15.88 C
ATOM 182 03* A A 23 20.007 6.350 4.779 1.00 16.04 o
ATOM 183 C2* A A 23 19.932 7.535 2.689 1.00 14.37 C
ATOM 184 02* A A 23 21.038 6.689 2.437 1.00 14.07 o
ATOM 185 Cl* A A 23 18.924 7.447 1.539 1.00 15.13 C
ATOM 186 N9 A A 23 18.313 8.717 1.157 1.00 14.71 N
ATOM 187 C8 A A 23 18.027 9.134 -0.113 1.00 14.98 C
ATOM 188 N7 A A 23 17.513 10.334 -0.171 1.00 15.52 N
ATOM 189 C5 A A 23 17.440 10.727 1.159 1.00 14.12 C
ATOM 190 C6 A A 23 16.984 11.892 1.762 1.00 13.27 C
ATOM • 191 N6 A A 23 16.517 12.944 1.074 1.00 13.02 N
ATOM 192 Nl A A 23 17.029 11.962 3.109 1.00 13.36 N
ATOM 193 C2 A A 23 17.522 10.921 3.791 1.00 12.32 C
ATOM 194 N3 A A 23 17.996 9.768 3.328 1.00 12.80 N
ATOM 195 C4 A A 23 17.924 9.737 1.988 1.00 13.37 C
ATOM 196 P A A 24 19.792 6.521 6.367 1.00 16÷08 P
ATOM 197 OIP A A 24 20.740 5.538 7.026 1.00 18.75 O
ATOM 198 O2P A A 24 18.370 6.463 6.689 1.00 17.24 O
ATOM 199 05* A A 24 20.280 8.006 6.656 1.00 14.02 O
ATOM 200 C5* A A 24 19.497 9.123 6.288 1.00 12.57 C
ATOM 201 C4* A A 24 19.315 10.041 7.475 1.00 13.24 C
ATOM 202 04* A A 24 18.369 9.426 8.402 1.00 14.03 O
ATOM 203 C3* A A 24 20.554 10.296 8.339 1.00 12.98 C
ATOM 204 03* A A 24 21.369 11.351 7.801 1.00 13.11 O
ATOM 205 C2* A A 24 19.907 10.750 9.647 1.00 13.47 C
ATOM 206 02* A A 24 19.453 12.099 9.622 1.00 11.96 o
ATOM 207 Cl* A A 24 18.667 9.851 9.720 1.00 13.92 C
ATOM 208 N9 A A 24 18.806 8.671 10.584 1.00 14.61 N
ATOM 209 C8 A A 24 18.841 7.342 10.230 1.00 14.46 C
ATOM 210 N7 A A 24 18.885 6.526 11.250 1.00 15.41 N
ATOM 211 C5 A A 24 18.905 7.372 12.348 1.00 13.99 C
ATOM 212 C6 A A 24 18.939 7.129 13.722 1.00 13.84 C
ATOM 213 N6 A A 24 18.928 5.899 14.267 1.00 13.39 N
ATOM 214 Nl A A 24 18.969 8.201 14.543 1.00 11.61 N
ATOM 215 C2 A A 24 18.945 9.429 14.008 1.00 11.99 C
ATOM 216 N3 A A 24 18.904 9.784 12.740 1.00 14.75 N
ATOM 217 C4 A A 24 18.885 8.694 11.949 1.00 12.94 C
ATOM 218 P U A 25 22.931 11.117 7.493 1.00 14.31 P ATOM 219 O1P U A 25 23.384 9.774 7.967 1.00 13.12 O
ATOM 220 O2P U A 25 23.680 12.330 7.933 1.00 15.11 O
ATOM 221 05* U A 25 22.966 11.182 5.904 1.00 12.37 O
ATOM 222 C5* U A 25 22.889 9.987 5.117 1.00 15.75 C
ATOM 223 C4* U A 25 22.394 10.314 3.715 1.00 14.54 C
ATOM 224 04* U A 25 21.019 10.786 3.786 1.00 14.76 O
ATOM 225 C3* U A 25 23.144 11.444 3.031 1.00 15.39 C
ATOM 226 03* U A 25 24.331 10.949 2.403 1.00 15.75 O
ATOM 227 C2* U A 25 22.114 11.959 2.027 1.00 15.07 C
ATOM 228 02* U A 25 22.016 11.123 0.889 1.00 14.04 O
ATOM 229 Cl* U A 25 20.817 11.819 2.831 1.00 14.18 C
ATOM 230 Wl U A 25 20.412 13.042 3.549 1.00 10.89 N
ATOM 231 C2 U A 25 19.972 14.094 2.797 1.00 12.58 C
ATOM 232 02 U A 25 19.917 14.051 1.569 1.00 14.81 O
ATOM 233 N3 U A 25 19.592 15.206 3.514 1.00 12.69 N
ATOM 234 C4 U A 25 19.611 15.359 4.880 1.00 14.17 C
ATOM 235 04 U A 25 19.271 16.439 5..374 1.00 13.94 O
ATOM 236 C5 U A 25 20.084 14.211 5..598 1.00 13.87 C
ATOM 237 C6 U A 25 20.460 13.117 4, .917 1.00 13.05 C
ATOM 238 P C A 26 25.693 11.790 2.524 1.00 17.71 P
ATOM 239 O1P C A 26 26.754 11.081 1.752 1.00 17.41 O
ATOM 240 O2P C A 26 25.922 12.133 3.943 1.00 15.48 O
ATOM 241 05* C A 26 25.352 13.145 1.775 1.00 15.22 O
ATOM 242 C5* C A 26 25.046 13.137 0.391 1.00 14.95 C
ATOM 243 C4* C A 26 24.479 14.477 -0.021 1.00 17.14 C
ATOM 244 04* C A 26 23.155 14.681 0.542 1.00 15.78 O
ATOM 245 C3* C A 26 25.258 15.671 0.494 1.00 18.08 C
ATOM 246 03* C A 26 26.425 15.908 -0.282 1.00 19.26 O
ATOM 247 C2* C A 26 24.233 16.785 0.350 1.00 17.99 C
ATOM 248 02* C A 26 24.097 17.196 -0.990 1.00 17.37 O
ATOM 249 Cl* C A 26 22.960 16.069 0.819 1.00 17.28 C
ATOM 250 Nl C A 26 22.770 16.251 2.269 1.00 17.50 N
ATOM 251 C2 C A 26 22.207 17.449 2.714 1.00 16.12 C
ATOM 252 02 C A 26 21.837 18.289 1.868 1.00 17.31 O
ATOM 253 N3 C A 26 22.088 17.671 4.038 1.00 17.53 N
ATOM 254 C4 C A 26 22.516 16.762 4.915 1.00 18.28 C
ATOM 255 N4 C A 26 22.401 17.049 6.222 1.00 17.84 N
ATOM 256 C5 C A 26 23.082 15.519 4.494 1.00 18.25 C
ATOM 257 C6 C A 26 23.174 15.301 3.169 1.00 17.66 C
ATOM 258 P G A 27 27.781 16.387 0.456 1.00 19.54 P
ATOM 259 O1P G A 27 28.835 15.402 0.093 1.00 19.84 O
ATOM 260 O2P G A 27 27.508 16.662 1.882 1.00 21.05 O
ATOM 261 05* G A 27 28.072 17.770 -0.238 1.00 16.72 O
ATOM 262 C5* G A 27 28.214 17.833 -1.641 1.00 18.44 C
ATOM 263 C4* G A 27 27.858 19.222 -2.122 1.00 17.66 C
ATOM 264 04* G A 27 26.419 19.395 -1.976 1.00 18.44 O
ATOM 265 C3* G A 27 28.429 20.382 -1.312 1.00 16.35 C
ATOM 266 03* G A 27 29.799 20.676 -1.631 1.00 12.94 O
ATOM 267 C2* G A 27 27.483 21.494 -1.746 1.00 16.92 C
ATOM 268 02* G A 27 27.805 21.847 -3.096 1.00 18.43 O
ATOM 269 Cl* G A 27 26.130 20.767 -1..672 1.00 19.53 C
ATOM 270 N9 G A 27 25.521 20.820 -0.331 1.00 17.32 N
ATOM 271 C8 G A 27 25.623 19.857 0.640 1.00 18.25 C
ATOM 272 N7 G A 27 25.058 20.199 1.770 1.00 16.05 N
ATOM 273 C5 G A 27 24.519 21.457 1.529 1.00 15.29 C
ATOM 274 C6 G A 27 23.826 22.345 2.410 1.00 15.83 C
ATOM 275 06 G A 27 23.506 22.160 3.589 1.00 13.24 O
ATOM 276 Nl G A 27 23.515 23.548 1.791 1.00 13.74 N
ATOM 277 C2 G A 27 23.815 23.862 0.493 1.00 15.62 C ATOM 278 N2 G A 27 23.476 25.116 0.104 1.00 13.99 N
ATOM 279 N3 G A 27 24.422 23.032 -0.352 1.00 15.80 N
ATOM 280 C4 G A 27 24.768 21.859 0.238 1.00 17.14 C
ATOM 281 P C A 28 30.774 21.297 -0.490 1.00 15.32 P
ATOM 282 O1P C A 28 32.151 21.289 -1.085 1.00 13.28 O
ATOM 283 02P C A 28 30.549 20.564 0.797 1.00 16.81 O
ATOM 284 05* C A 28 30.216 22.799 -0.308 1.00 14.13 O
ATOM 285 C5* C A 28 29.979 23.621 -1.457 1.00 12.20 C
ATOM 286 C4* C A 28 29.208 24.893 -1.109 1.00 11.20 C
ATOM 287 04* C A 28 27.842 24.579 -0.760 1.00 13.22 O
ATOM 288 C3* C A 28 29.693 25.749 0.056 1.00 11.65 C
ATOM 289 03* C A 28 30.795 26.536 -0.369 1.00 12.61 O
ATOM 290 C2* C A 28 28.443 26.579 0.321 1.00 12.57 C
ATOM 291 02* C A 28 28.217 27.545 -0.701 1.00 11.37 O
ATOM 292 Cl* C A 28 27.367 25.500 0.202 1.00 11.49 C
ATOM 293 Nl C A 28 27.155 24.774 1.465 1.00 11.12 N
ATOM 294 C2 C A 28 26.362 25.365 2.449 1.00 10.90 C
ATOM 295 02 C A 28 25.841 26.462 2.217 1.00 12.61 O
ATOM 296 N3 C A 28 26.165 24.721 3.633 1.00 9.42 N
ATOM 297 C4 C A 28 26.703 23.521 3.841 1.00 9.67 C
ATOM 298 N4 C A 28 26.467 22.940 5.029 1.00 11.69 N
ATOM 299 C5 C A 28 27.503 22.869 2.845 1.00 10.98 C
ATOM 300 C6 C A 28 27.715 23.537 1.681 1.00 9.28 C
ATOM 301 P G A 29 31.848 27.078 0.701 1.00 12.54 P
ATOM 302 O1P G A 29 33.001 27.676 -0.012 1.00 10.32 O
ATOM 303 02P G A 29 32.074 26.030 1.739 1.00 13.92 O
ATOM 304 05* G A 29 31.048 28.236 1.453 1.00 10.81 O
ATOM 305 C5* G A 29 30.651 29.422 0.784 1.00 9.98 C
ATOM 306 C4* G A 29 29.953 30.336 1.761 1.00 8.40 C
ATOM 307 04* G A 29 28.693 29.737 2.169 1.00 11.49 O
ATOM 308 C3* G A 29 30.712 30.532 3.055 1.00 8.64 C
ATOM 309 03* G A 29 31.663 31.571 2.913 1.00 7.56 O
ATOM 310 C2* G A 29 29.598 30.920 4.004 1.00 10.60 C
ATOM 311 02* G A 29 29.216 32.260 3.812 1.00 11.11 O
ATOM 312 Cl* G A 29 28.480 29.976 3.555 1.00 11.07 C
ATOM 313 N9 G A 29 28.524 28.699 4.271 1.00 11.15 N
ATOM 314 C8 G A 29 29.049 27.500 3.837 1.00 12.35 C
ATOM 315 N7 G A 29 28.902 26.525 4.710 1.00 10.34 N
ATOM 316 C5 G A 29 28.251 27.122 5.781 1.00 8.19 C
ATOM 317 C6 G A 29 27.837 26.583 7.032 1.00 7.73 C
ATOM 318 06 G A 29 27.914 25.425 7.442 1.00 6.94 O
ATOM 319 Nl G A 29 27.287 27.558 7.854 1.00 7.67 N
ATOM 320 C2 G A 29 27.125 28.881 7.518 1.00 8.16 C
ATOM 321 N2 G A 29 26.558 29.664 8.459 1.00 7.86 N
ATOM 322 N3 G A 29 27.484 29.396 6.356 1.00 8.66 N
ATOM 323 C4 G A 29 28.038 28.469 5.540 1.00 10.04 C
ATOM 324 P U A 30 33.134 31.407 3.520 1.00 8.25 P
ATOM 325 O1P U A 30 33.970 32.458 2.893 1.00 11.63 O
ATOM 326 O2P U A 30 33.566 29.974 3.446 1.00 11.64 O
ATOM 327 05* U A 30 32.909 31.761 5.054 1.00 8.57 O
ATOM 328 C5* U A 30 32.803 33.103 5.491 1.00 7.65 C
ATOM 329 C4* U A 30 32.354 33.142 6.956 1.00 7.93 C
ATOM 330 04* U A 30 31.051 32.507 7.112 1.00 7.66 O
ATOM 331 C3* U A 30 33.220 32.376 7.932 1.00 7.36 C
ATOM 332 03* U A 30 34.411 33.098 8.229 1.00 8.48 O
ATOM 333 C2* U A 30 32.276 32.271 9.116 1.00 8.63 C
ATOM 334 02* U A 30 32.159 33.539 9.769 1.00 8.93 O
ATOM 335 Cl* U A 30 30.972 31.878 8.399 1.00 9.06 C
ATOM 336 Nl U A 30 30.909 30.406 8.210 1.00 8.03 N ATOM 337 C2 U A 30 30 .479 29 . 640 9 .285 1 . 00 9 .41 C
ATOM 338 02 U A 30 30.140 30.142 10.342 1.00 9.53 o
ATOM 339 N3 U A 30 30.484 28.277 9.076 1.00 8.15 N
ATOM 340 C4 U A 30 30.879 27.615 7.918 1.00 7.87 C
ATOM 341 04 U A 30 30.916 26.387 7.900 1.00 8.06 o
ATOM 342 C5 U A 30 31.293 28.471 6.852 1.00 6.19 C
ATOM 343 C6 U A 30 31.293 29.810 7.035 1.00 9.31 C
ATOM 344 P G A 31 35.776 32.309 8.547 1.00 11.17 P
ATOM 345 OIP G A 31 36.843 33.324 8.585 1.00 11.89 o
ATOM 346 O2P G A 31 35.907 31.144 7.632 1.00 14.22 o
ATOM 347 05* G A 31 35.546 31.762 10.018 1.00 12.60 o
ATOM 348 C5* G A 31 35.387 32.670 11.107 1.00 11.40 C
ATOM 349 C4* G A 31 34.796 31.951 12.296 1.00 11.05 C
ATOM 350 04* G A 31 33.529 31.364 11.898 1.00 11.06 o
ATOM 351 C3* G A 31 35.585 30.763 12.826 1.00 11.89 C
ATOM 352 03* G A 31 36.673 31.190 13.649 1.00 11.52 o
ATOM 353 C2* G A 31 34.499 30.057 13.613 1.00 12.14 C
ATOM 354 02* G A 31 34.173 30.764 14.789 1.00 14.85 o
ATOM 355 Cl* G A 31 33.321 30.161 12.626 1.00 .12.01 C
ATOM 356 N9 G A 31 33.369 29.020 11.714 1.00 9.31 N
ATOM 357 C8 G A 31 33.747 28.982 10.391 1.00 8.70 C
ATOM 358 N7 G A 31 33.733 27.763 9.889 1.00 9.68 N
ATOM 359 C5 G A 31 33.312 26.959 10.944 1.00 7.23 C
ATOM 360 C6 G A 31 33.144 25.544 11.029 1.00 7.98 C
ATOM 361 06 G A 31 33.310 24.683 10.140 1.00 5.42 o
ATOM 362 Nl G A 31 32.739 25.152 12.313 1.00 6.14 N
ATOM 363 C2 G A 31 32.548 25.997 13.371 1.00 10.35 C
ATOM 364 N2 G A 31 32.176 25.420 14.577 1.00 7.45 N
ATOM 365 N3 G A 31 32.699 27.315 13.297 1.00 9.43 N
ATOM 366 C4 G A 31 33.073 27.720 12.071 1.00 9.46 C
ATOM 367 P G A 32 37.923 30.221 13.919 1.00 12.74 P
ATOM 368 OIP G A 32 38.781 31.042 14.790 1.00 14.11 o
ATOM 369 02P G A 32 38.517 29.617 12.692 1.00 12.26 o
ATOM 370 05* G A 32 37.356 28.990 14.770 1.00 12.62 o
ATOM 371 C5* G A 32 36.720 29.166 16.023 1.00 10.29 C
ATOM 372 C4* G A 32 36.261 27.823 16.550 1.00 10.54 C
ATOM 373 04* G A 32 35.214 27.266 15.693 1.00 12.36 o
ATOM 374 C3* G A 32 37.339 26.771 16.499 1.00 12.35 C
ATOM 375 O3* G A 32 39.203 26.383 17.614 1.00 14.68 O
ATOM 376 C2* G A 32 36.538 25.487 16.525 1.00 11.24 C
ATOM 377 02* G A 32 35.994 25.214 17.807 1.00 13.27 o
ATOM 378 Cl* G A 32 35.393 25.857 15.578 1.00 10.96 C
ATOM 379 N9 G A 32 35.730 25.540 14.191 1.00 9.74 N
ATOM 380 C8 G A 32 36.070 26.428 13.200 1.00 9.21 C
ATOM 381 N7 G A 32 36.330 25.848 12.058 1.00 9.05 N
ATOM 382 C5 G A 32 36.132 24.497 12.304 1.00 10.24 C
ATOM 383 C6 G A 32 36.241 23.372 11.427 1.00 10.39 C
ATOM 384 06 G A 32 36.572 23.356 10.238 1.00 11.57 o
ATOM 385 Nl G A 32 35.943 22.178 12.079 1.00 8.19 N
ATOM 386 C2 G A 32 35.627 22.076 13.413 1.00 9.95 C
ATOM 387 N2 G A 32 35.463 20.831 13.865 1.00 8.24 N
ATOM 388 N3 G A 32 35.508 23.119 14.242 1.00 9.53 N
ATOM 389 C4 G A 32 35.776 24.287 13.623 1.00 10.32 C
ATOM 390 P A A 33 39.686 26.342 17.495 1.00 15.61 P
ATOM 391 OIP A A 33 40.371 26.864 18.691 1.00 19.56 o
ATOM 392 02P A A 33 40.255 26.557 16.151 1.00 17.09 o
ATOM 393 05* A A 33 39.561 24.752 17.619 1.00 14.10 o
ATOM 394 C5* A A 33 39.033 24.156 18.788 1.00 14.10 C
ATOM 395 C4* A A 33 38.951 22.650 18.615 1.00 13.63 C ATOM 396 04* A A 33 37.996 22.286 17.569 1.00 13.30 O
ATOM 397 C3* A A 33 40.222 21.955 18.171 1.00 13.78 C
ATOM 398 O3* A A 33 41.120 21.848 19.264 1.00 13.31 o
ATOM 399 C2* A A 33 39.672 20.607 17.712 1.00 12.75 C
ATOM 400 02* A A 33 39.299 19.776 18.802 1.00 12.98 o
ATOM 401 Cl* A A 33 38.400 21.050 16.987 1.00 12.61 C
ATOM 402 N9 A A 33 38.609 21.236 15.550 1.00 12.88 N
ATOM 403 C8 A A 33 38.628 22.381 14.794 1.00 11.71 C
ATOM 404 N7 A A 33 38.816 22.157 13.502 1.00 10.33 N
ATOM 405 C5 A A 33 38.927 20.782 13.415 1.00 9.80 C
ATOM 406 C6 A A 33 39.116 19.908 12.324 1.00 10.29 C
ATOM 407 N6 A A 33 39.234 20.309 11.050 1.00 9.60 N
ATOM 408 Nl A A 33 39.178 18.588 12.588 1.00 9.49 N
ATOM 409 C2 A A 33 39.048 18.179 13.852 1.00 9.84 C
ATOM 410 N3 A A 33 38.857 18.900 14.964 1.00 11.60 N
ATOM 411 C4 A A 33 38.811 20.204 14.669 1.00 11.27 C
ATOM 412 P U A 34 42.687 21.969 19.023 1.00 15.81 P
ATOM 413 OIP U A 34 43.252 22.026 20.387 1.00 17.18 o
ATOM 414 O2P U A 34 42.941 23.110 18.071 1.00 16.71 o
ATOM 415 05* U A 34 43.104 20.594 18.319 1.00 15.71 o
ATOM 416 C5* U A 34 42.712 19.342 18.883 1.00 13.18 C
ATOM 417 C4* U A 34 42.857 18.181 17.894 1.00 13.06 C
ATOM 418 04* U A 34 42.094 18.430 16.686 1.00 12.24 o
ATOM 419 C3* U A 34 44.300 18.001 17.421 1.00 13.49 C
ATOM 420 O3* U A 34 44.642 16.633 17.345 1.00 14.82 o
ATOM 421 C2* U A 34 44.381 18.646 16.051 1.00 13.02 C
ATOM 422 02* ϋ A 34 45.215 17.992 15.121 1.00 13.60 o
ATOM 423 Cl* U A 34 42.953 18.456 15.570 1.00 12.35 C
ATOM 424 Nl U A 34 42.536 19.469 14.602 1.00 12.11 N
ATOM 425 C2 U A 34 42.498 19.079 13.286 1.00 11.31 C
ATOM 426 02 U A 34 42.658 17.918 12.930 1.00 12.26 o
ATOM 427 N3 U A 34 42.267 20.088 12.393 1.00 12.69 N
ATOM 428 C4 U A 34 42.050 21.410 12.687 1.00 11.38 C
ATOM 429 04 U A 34 41.968 22.223 11.766 1.00 12.81 O
ATOM 430 C5 U A 34 42.032 21.717 14.082 1.00 11.68 C
ATOM 431 C6 U A 34 42.270 20.742 14.979 1.00 11.42 C
ATOM 432 P A A 35 45.682 15.997 18.379 1.00 15.65 P
ATOM 433 OIP A A 35 45.216 16.428 19.709 1.00 14.33 o
ATOM 434 02P A A 35 47.048 16.295 17.943 1.00 15.44 o
ATOM 435 05* A A 35 45.445 14.439 18.178 1.00 15.24 o
ATOM 436 C5* A A 35 44.211 13.803 18.550 1.00 13.53 C
ATOM 437 C4* A A 35 44.300 12.292 18.297 1.00 12.70 C
ATOM 438 04* A A 35 44.299 12.032 16.878 1.00 13.93 o
ATOM 439 C3* A A 35 45.610 11.724 18.820 1.00 13.12 C
ATOM 440 03* A A 35 45.420 10.462 19.457 1.00 14.06 o
ATOM 441 C2* A A 35 46.543 11.684 17.607 1.00 13.02 C
ATOM 442 02* A A 35 47.425 10.590 17.583 1.00 12.53 o
ATOM 443 Cl* A A 35 45.544 11.534 16.454 1.00 12.55 C
ATOM 444 N9 A A 35 45.906 12.201 15.209 1.00 12.31 N
ATOM 445 C8 A A 35 45.633 13.489 14.841 1.00 13.01 C
ATOM 446 N7 A A 35 46.117 13.819 13.666 1.00 16.12 N
ATOM 447 C5 A A 35 46.737 12.661 13.222 1.00 15.27 C
ATOM 448 C6 A A 35 47.416 12.343 12.024 1.00 15.95 C
ATOM 449 N6 A A 35 47.607 13.213 11.010 1.00 14.94 N
ATOM 450 Nl A A 35 47.895 11.084 11.900 1.00 16.37 N
ATOM 451 C2 A A 35 47.706 10.216 12.909 1.00 17.57 C
ATOM 452 N3 A A 35 47.086 10.398 14.078 1.00 16.34 N
ATOM 453 C4 A A 35 46.615 11.653 14.167 1.00 15.44 C
ATOM 454 P U A 36 45.278 10.371 21.058 1.00 17.57 P ATOM 455 OIP U A 36 46.459 11.056 21.623 1.00 15.59 o
ATOM 456 02P U A 36 45.073 8.928 21.307 1.00 18.33 o
ATOM 457 05* U A 36 43.954 11.187 21.425 1.00 15.41 o
ATOM 458 C5* U A 36 42.696 10.541 21.773 1.00 14.72 C
ATOM 459 C4* U A 36 41.978 11.395 22.807 1.00 14.61 C
ATOM 460 04* U A 36 42.775 11.453 23.991 1.00 14.11 o
ATOM 461 C3* U A 36 41.774 12.824 22.315 1.00 13.18 C
ATOM 462 03* U A 36 40.517 13.272 22.831 1.00 10.05 o
ATOM 463 C2* U A 36 42.798 13.639 23.103 1.00 13.16 C
ATOM 464 02* U A 36 42.458 14.971 23.369 1.00 11.86 o
ATOM 465 Cl* U A 36 42.962 12.785 24.367 1.00 12.76 C
ATOM 466 Nl U A 36 44.246 12.864 25.044 1.00 12.63 N
ATOM 467 C2 U A 36 44.272 13.420 26.310 1.00 12.49 C
ATOM 468 02 U A 36 43.273 13.855 26.869 1.00 11.09 o
ATOM 469 N3 U A 36 45.512 13.445 26.897 1.00 12.26 N
ATOM 470 C4 U A 36 46.708 12.972 26.346 1.00 12.77 C
ATOM 471 04 U A 36 47.745 13.082 26.975 1.00 13.05 O
ATOM 472 C5 U A 36 46.588 12.412 25.033 1.00 12.67 C
ATOM 473 C6 U A 36 45.390 12.385 24.437 1.00 12.34 C
ATOM 474 P G A 37 39.141 13.060 22.048 1.00 13.09 P
ATOM 475 OIP G A 37 38.083 13.377 23.042 1.00 10.03 o
ATOM 476 02P G A 37 39.096 11.783 21.316 1.00 10.97 o
ATOM 477 05* G A 37 39.187 14.193 20.930 1.00 11.94 o
ATOM 478 C5* G A 37 38.955 15.558 21.225 1.00 10.67 C
ATOM 479 C4* G A 37 38.971 16.337 19.929 1.00 10.54 C
ATOM 480 04* G A 37 40.254 16.171 19.252 1.00 10.54 o
ATOM 481 C3* G A 37 37.973 15.801 18.925 1.00 8.49 C
ATOM 482 03* G A 37 36.724 16.406 19.212 1.00 9.71 o
ATOM 483 C2* G A 37 38.576 16.253 17.605 1.00 9.01 C
ATOM 484 02* G A 37 38.393 17.651 17.433 1.00 8.99 o
ATOM 485 Cl* G A 37 40.060 15.983 17.850 1.00 10.41 C
ATOM 486 N9 G A 37 40.497 14.629 17.473 1.00 9.32 N
ATOM 487 C8 G A 37 40.487 13.516 18.280 1.00 9.31 C
ATOM 488 N7 G A 37 40.939 12.442 17.686 1.00 10.48 N
ATOM 489 C5 G A 37 41.259 12.859 16.408 1.00 10.37 C
ATOM 490 C6 G A 37 41.782 12.126 15.316 1.00 12.23 C
ATOM 491 06 G A 37 42.036 10.920 15.257 1.00 11.04 o
ATOM 492 Nl G A 37 42.008 12.948 14.197 1.00 9.95 N
ATOM 493 C2 G A 37 41.748 14.286 14.145 1.00 9.72 C
ATOM 494 N2 G A 37 42.121 14.931 12.993 1.00 8.78 N
ATOM 495 N3 G A 37 41.191 14.972 15.147 1.00 9.91 N
ATOM 496 C4 G A 37 40.995 14.206 16.248 1.00 10.87 C
ATOM 497 P G A 38 35.372 15.646 18.858 1.00 12.29 P
ATOM 498 OIP G A 38 34.265 16.448 19.436 1.00 9.96 o
ATOM 499 O2P G A 38 35.501 14.191 19.191 1.00 11.57 o
ATOM 500 05* G A 38 35.356 15.699 17.258 1.00 12.26 o
ATOM 501 C5* G A 38 35.114 16.920 16.540 1.00 10.01 C
ATOM 502 C4* G A 38 35.319 16.694 15.053 1.00 8.68 C
ATOM 503 04* G A 38 36.699 16.352 14.806 1.00 7.01 o
ATOM 504 C3* G A 38 34.534 15.539 14.472 1.00 9.46 C
ATOM 505 03* G A 38 33.210 15.990 14.155 1.00 12.58 o
ATOM 506 C2* G A 38 35.376 15.169 13.243 1.00 10.28 C
ATOM 507 02* G A 38 35.157 15.985 12.097 1.00 9.67 o
ATOM 508 Cl* G A 38 36.804 15.398 13.759 1.00 9.91 C
ATOM 509 N9 G A 38 37.412 14.173 14.300 1.00 9.70 N
ATOM 510 C8 G A 38 37.174 13.612 15.533 1.00 11.32 C
ATOM 511 N7 G A 38 37.702 12.417 15.664 1.00 11.54 N
ATOM 512 C5 G A 38 38.362 12.199 14.469 1.00 10.27 C
ATOM 513 C6 G A 38 39.035 11.045 13.994 1.00 11.93 C ATOM 514 06 G A 38 39.201 9.950 14.575 00 8.78 O
ATOM 515 Nl G A 38 39.522 11.236 12.701 00 9.87 N
ATOM 516 C2 G A 38 39.369 12.387 11.958 00 11.31 C
ATOM 517 N2 G A 38 39.901 12.384 10.710 1.00 10.24 N
ATOM 518 N3 G A 38 38.739 13.460 12.390 1, 00 9.09 N
ATOM 519 C4 G A 38 38.251 13.297 13.636 1, 00 11.34 C
ATOM 520 P C A 39 31.929 15.084 14.534 1, 00 12.06 P
ATOM 521 OIP C A 39 32.388 13.880 15.288 1.00 14.53 o
ATOM 522 O2P C A 39 31.165 14.878 13.278 1.00 12.02 o
ATOM 523 05* C A 39 30.903 16.101 15.235 1.00 17.76 o
ATOM 524 C5* C A 39 31.093 16.713 16.478 1.00 18.70 C
ATOM 525 C4* C A 39 30.971 18.261 16.413 1.00 13.42 C
ATOM 526 04* C A 39 32.106 18.778 15.688 1.00 12.71 o
ATOM 527 C3* C A 39 29.833 19.187 15.926 1.00 14.10 C
ATOM 528 03* C A 39 28.693 19.255 16.804 1.00 10.19 o
ATOM 529 C2* C A 39 30.564 20.538 16.055 1.00 11.50 C
ATOM 530 02* C A 39 30.790 20.868 17.412 1.00 10.17 o
ATOM 531 Cl* C A 39 31.968 20.185 15.563 1.00 12.13 C
ATOM 532 Nl C A 39 32.175 20.579 14.172 1.00 12.50 N
ATOM 533 C2 C A 39 32.272 21.952 13.896 1.00 12.48 C
ATOM 534 02 C A 39 32.208 22.763 14.848 1, 00 8.22 o
ATOM 535 N3 C A 39 32.436 22.355 12.625 1, 00 9.69 N
ATOM 536 C4 C A 39 32.522 21.454 11.639 1.00 11.76 C
ATOM 537 N4 C A 39 32.692 21.917 10.379 00 8.85 N
ATOM 538 C5 C A 39 32.443 20.047 11.889 00 9.77 C
ATOM 539 C6 C A 39 32.267 19.658 13.164 1.00 11.59 C
ATOM 540 P A A 40 27.311 20.017 16.359 1.00 12.95 P
ATOM 541 OIP A A 40 26.296 19.624 17.367 1.00 14.20 o
ATOM 542 O2P A A 40 26.956 19.926 14.935 1.00 12.21 o
ATOM 543 05* A A 40 27.609 21.582 16.535 1.00 13.28 o
ATOM 544 C5* A A 40 27.658 22.200 17.801 1.00 12.99 C
ATOM 545 C4* A A 40 27.672 23.716 17.632 1.00 12.56 C
ATOM 546 04* A A 40 28.892 24.107 16.964 1.00 12.75 o
ATOM 547 C3* A A 40 26.579 24.281 16.733 1.00 12.50 C
ATOM 548 03* A A 40 25.367 24.448 17.450 1.00 12.35 o
ATOM 549 C2* A A 40 27.182 25.611 16.312 1.00 13.51 C
ATOM 550 02* A A 40 27.126 26.577 17.360 00 9.74 o
ATOM 551 Cl* A A 40 28.629 25.197 16.084 00 11.10 C
ATOM 552 N9 A A 40 28.846 24.755 14.708 1.00 11.69 N
ATOM 553 C8 A A 40 28.858 23.479 14.199 1, 00 11.33 C
ATOM 554 N7 A A 40 29.111 23.431 12.916 1, 00 9.06 N
ATOM 555 C5 A A 40 29.278 24.761 12.557 1, 00 8.72 C
ATOM 556 C6 A A 40 29.583 25.382 11.335 1.00 10.32 C
ATOM 557 N6 A A 40 29.762 24.716 10.185 1.00 9 9..5544 N
ATOM 558 Nl A A 40 29.688 26.735 11.325 1.00 9 9..7700 N
ATOM 559 C2 A A 40 29.463 27.395 12.451 1.00 9 9..0000 C
ATOM 560 N3 A A 40 29.160 26.925 13.661 1.00 7 7..0011 N
ATOM 561 C4 A A 40 29.092 25.584 13.644 1.00 9 9..4477 C
ATOM 562 P C A 41 23.973 24.307 16.696 1.00 15.74 P
ATOM 563 OIP C A 41 22.929 24.350 17.731 1.00 16.51 o
ATOM 564 O2P C A 41 23.995 23.181 15.724 1.00 13.62 o
ATOM 565 05* C A 41 23.875 25.634 15.801 1.00 13.56 o
ATOM 566 C5* C A 41 23.881 26.916 16.422 1.00 12.13 C
ATOM 567 C4* C A 41 24.095 28.017 15.398 1.00 1 111..3366 C
ATOM 568 04* C A 41 25.434 27.933 14.850 1.00 99..8899 o
ATOM 569 C3* C A 41 23.230 28.007 14.151 1.00 10.98 C
ATOM 570 03* C A 41 21.887 28.429 14.399 00 12.66 o
ATOM 571 C2* C A 41 24.025 28.958 13.272 00 9.59 C
ATOM 572 02* C A 41 23.974 30.285 13.781 1.00 10.24 o ATOM 573 Cl* C A 41 25.435 28.418 13.518 1.00 9.79 C
ATOM 574 Nl C A 41 25.772 27.308 12.587 1.00 7.61 N
ATOM 575 C2 C A 41 26.181 27.650 11.338 1.00 8.93 C
ATOM 576 02 C A 41 26.289 28.873 11.070 1.00 7.63 O
ATOM 577 N3 C A 41 26.456 26.679 10.436 1.00 8.78 N
ATOM 578 C4 C A 41 26.342 25.386 10.777 1.00 9.30 C
ATOM 579 N4 C A 41 26.656 24.451 9.845 1.00 7.23 N
ATOM 580 C5 C A 41 25.917 24.995 12.077 1.00 7.65 C
ATOM 581 C6 C A 41 25.654 25.986 12.948 1.00 9.18 C
ATOM 582 P G A 42 20.679 27.762 13.566 1.00 13.08 P
ATOM 583 OIP G A 42 19.441 28.269 14.161 1.00 16.14 o
ATOM 584 O2P G A 42 20.913 26.306 13.504 1.00 15.00 o
ATOM 585 05* G A 42 20.851 28.371 12.110 1.00 13.10 o
ATOM 586 C5* G A 42 20.747 29.779 11.925 1.00 11.92 C
ATOM 587 C4* G A 42 21.247 30.185 10.564 1.00 10.63 C
ATOM 588 O4* G A 42 22.635 29.810 10.398 1.00 11.41 o
ATOM 589 C3* G A 42 20.564 29.562 9.364 1.00 12.03 C
ATOM 590 03* G A 42 19.320 30.211 9.085 1.00 13.22 o
ATOM 591 C2* G A 42 21.592 29.850 8.281 1.00 11.77 C
ATOM 592 02* G A 42 21.543 31.210 7.918 1.00 12.69 o
ATOM 593 Cl* G A 42 22.898 29.564 9.023 1.00 11.16 C
ATOM 594 N9 G A 42 23.287 28.165 8.856 1.00 10.90 N
ATOM 595 C8 G A 42 23.171 27.140 9.773 1.00 11.82 C
ATOM 596 N7 G A 42 23.564 25.981 9.301 1.00 11.44 N
ATOM 597 C5 G A 42 23.980 26.263 8.002 1.00 10.91 C
ATOM 598 C6 G A 42 24.522 25.409 7.014 1.00 10.75 C
ATOM 599 06 G A 42 24.774 24.198 7.101 1.00 11.56 o
ATOM 600 Nl G A 42 24.797 26.094 5.830 1.00 10.06 N
ATOM 601 C2 G A 42 24.571 27.437 5.625 1.00 11.20 C
ATOM 602 N2 G A 42 24.867 27.908 4.395 1.00 10.66 N
ATOM 603 W3 G A 42 24.079 28.254 6.555 1.00 10.51 N
ATOM 604 C4 G A 42 23.812 27.602 7.710 1.00 11.10 C
ATOM 605 P C A 43 18.199 29.425 8.233 1.00 16.98 P
ATOM 606 OIP C A 43 16.981 30.235 8.305 1.00 18.69 o
ATOM 607 02P C A 43 18.159 28.002 8.664 1.00 16.43 o
ATOM 608 05* C A 43 18.772 29.450 6.752 1.00 18.08 o
ATOM 609 C5* C A 43 19.022 30.687 6.085 1.00 19.11 C
ATOM 610 C4* C A 43 19.566 30.428 4.708 1.00 19.02 C
ATOM 611 04* C A 43 20.885 29.830 4.791 1.00 19.55 o
ATOM 612 C3* C A 43 18.780 29.428 3.877 1.00 21.07 C
ATOM 613 03* C A 43 17.631 30.042 3.313 1.00 24.02 o
ATOM 614 C2* C A 43 19.809 29.042 2.827 1.00 20.29 C
ATOM 615 02* C A 43 20.015 30.061 1.875 1.00 21.85 o
ATOM 616 Cl* C A 43 21.077 28.962 3.680 1.00 19.45 C
ATOM 617 Nl C A 43 21.322 27.589 4.156 1.00 18.04 N
ATOM 618 C2 C A 43 21.977 26.691 3.289 1.00 16.83 C
ATOM 619 02 C A 43 22.310 27.073 2.145 1.00 15.78 o
ATOM 620 N3 C A 43 22.223 25.438 3.712 1.00 15.70 N
ATOM 621 C4 C A 43 21.839 25.054 4.931 1.00 15.08 C
ATOM 622 N4 C A 43 22.109 23.808 5.295 1.00 14.50 N
ATOM 623 C5 C A 43 21.157 25.932 5.821 1.00 14.50 C
ATOM 624 C6 C A 43 20.923 27.181 5.397 1.00 16.57 C
ATOM 625 P A A 44 16.292 29.177 3.069 1.00 26.38 P
ATOM 626 OIP A A 44 15.341 30.100 2.418 1.00 26.88 o
ATOM 627 O2P A A 44 15.887 28.460 4.292 1.00 26.22 o
ATOM 628 05* A A 44 16.749 28.038 2.049 1.00 26.94 o
ATOM 629 C5* A A 44 16.902 28.330 0.686 1.00 26.45 C
ATOM 630 C4* A A 44 17.567 27.192 -0.053 1.00 25.81 C
ATOM 631 04* A A 44 18.835 26.841 0.575 1.00 24.67 o ATOM 632 C3* A A 44 16.920 25.825 0.196 1.00 25.05 C
ATOM 633 O3* A A 44 15.807 25.831 1.095 1.00 24.16 O
ATOM 634 C2* A A 44 18.110 25.083 0.805 1.00 24.43 C
ATOM 635 02* A A 44 18.398 25.556 2.099 1.00 26.31 O
ATOM 636 Cl* A A 44 19.261 25.596 0.065 1.00 23.69 C
ATOM 637 N9 A A 44 19.494 24.674 1.169 1.00 22.64 N
ATOM 638 C8 A A 44 19.153 24.773 2.489 1.00 22.43 C
ATOM 639 N7 A A 44 19.450 23.707 3.189 1.00 22.73 N
ATOM 640 C5 A A 44 20.033 22.855 2.261 1.00 23.13 C
ATOM 641 C6 A A 44 20.536 21.549 2.358 1.00 22.92 C
ATOM 642 N6 A A 44 20.506 20.829 3.484 1.00 25.71 N
ATOM 643 Nl A A 44 21.061 20.993 1.248 1.00 23.09 N
ATOM 644 C2 A A 44 21.058 21.702 0.115 1.00 22.58 C
ATOM 645 N3 A A 44 20.591 22.928 0.108 1.00 23.04 N
ATOM 646 C4 A A 44 20.089 23.452 1.022 1.00 22.28 C
ATOM 647 P A A 45 14.684 24.664 1.004 1.00 27.94 P
ATOM 648 OIP A A 45 13.634 25.021 1.982 1.00 27.49 o
ATOM 649 02P A A 45 14.311 24.380 0.411 1.00 26.25 o
ATOM 650 05* A A 45 15.413 23.346 1.552 1.00 23.99 o
ATOM 651 C5* A A 45 15.896 23.306 2.888 1.00 22.10 C
ATOM 652 C4* A A 45 16.650 22.030 3.146 1.00 20.80 C
ATOM 653 04* A A 45 17.782 21.931 2.238 1.00 18.05 o
ATOM 654 C3* A A 45 15.868 20.747 -2 898 1.00 19.29 C
ATOM 655 03* A A 45 15.065 20.459 4.029 1.00 19.25 o
ATOM 656 C2* A A 45 16.997 19.740 2.727 1.00 18.96 C
ATOM 657 02* A A 45 17.625 19.395 3.953 1.00 20.46 o
ATOM 658 Cl* A A 45 17.993 20.564 1.910 1.00 18.32 C
ATOM 659 N9 A A 45 17.789 20.387 0.475 1.00 17.77 N
ATOM 660 C8 A A 45 17.252 21.249 0.459 1.00 16.87 C
ATOM 661 N7 A A 45 17.270 20.777 1.682 1.00 15.45 N
ATOM 662 C5 A A 45 17.848 19.517 1.545 1.00 15.55 C
ATOM 663 C6 A A 45 18.183 18.528 2.467 1.00 14.82 C
ATOM 664 N6 A A 45 18.021 18.657 3.781 1.00 14.77 N
ATOM 665 Ml A A 45 18.728 17.384 1.995 1.00 14.86 N
ATOM 666 C2 A A 45 18.938 17.265 0.678 1.00 15.61 C
ATOM 667 N3 A A 45 18.684 18.132 0.288 1.00 15.02 N
ATOM 668 C4 A A 45 18.143 19.255 0.218 1.00 16.02 C
ATOM 669 P G A 46 13.590 19.837 3.836 1.00 21.76 P
ATOM 670 OIP G A 46 13.111 19.621 5.232 1.00 20.58 o
ATOM 671 02P G A 46 12.796 20.615 -2, 886 1.00 20.12 o
ATOM 672 05* G A 46 13.859 18.416 -3, 180 1.00 17.63 o
ATOM 673 C5* G A 46 14.553 17.429 -3, 908 1.00 16.53 C
ATOM 674 C4* G A 46 14.840 16.245 -3.025 1.00 16.28 C
ATOM 675 04* G A 46 15.704 16.691 -1.942 1.00 15.33 o
ATOM 676 C3* G A 46 13.638 15.650 -2.304 1.00 14.19 C
ATOM 677 03* G A 46 12.948 14.718 -3.148 1.00 15.95 o
ATOM 678 C2* G A 46 14.333 14.927 -1.152 1.00 15.24 C
ATOM 679 02* G A 46 14.970 13.723 -1.593 1.00 11.22 o
ATOM 680 Cl* G A 46 15.417 15.944 -0.780 1.00 14.15 C
ATOM 681 N9 G A 46 15.039 16.881 0.277 1.00 15.14 N
ATOM 682 C8 G A 46 14.429 18.103 0.129 1.00 13.16 C
ATOM 683 N7 G A 46 14.249 18.721 1.264 1.00 14.91 N
ATOM 684 C5 G A 46 14.764 17.852 2.223 1.00 13.65 C
ATOM 685 C6 G A 46 14.850 17.980 3.646 1.00 14.21 C
ATOM 686 06 G A 46 14.457 IB.922 4.373 1.00 13.60 o
ATOM 687 Nl G A 46 15.451 16.861 4.225 1.00 14.00 N
ATOM 688 C2 G A 46 15.898 15.760 3.534 1.00 13.23 C
ATOM 689 N2 G A 46 16.436 14.773 4.271 1.00 13.07 N
ATOM 690 N3 G A 46 15.821 15.634 2.214 1.00 12.14 N ATOM 691 C4 G A 46 15.250 16.709 1.631 1.00 13.14 C
ATOM 692 P U A 47 11.385 14.951 3.517 1.00 19.08 P
ATOM 693 OIP U A 47 11.197 14.163 4.756 1.00 17.31 O
ATOM 694 O2P U A 47 11.026 16.379 -3 510 1.00 17.84 O
ATOM 695 05* U A 47 10.608 14.251 2.313 1.00 18.59 o
ATOM 696 C5* U A 47 10.648 12.828 2.158 1.00 16.76 C
ATOM 697 C4* U A 47 9.701 12.351 -1 061 1.00 16.21 C
ATOM 698 04* U A 47 10.107 12.861 0.238 1.00 17.46 o
ATOM 699 C3* U A 47 8.276 12.864 1.282 1.00 17.26 C
ATOM 700 03* U A 47 7.372 11.850 0.863 1.00 19.08 o
ATOM 701 C2* U A 47 8.151 14.098 0.386 1.00 16.32 C
ATOM 702 02* U A 47 6.881 14.327 0.178 1.00 17.79 o
ATOM 703 Cl* U A 47 9.107 13.723 0.745 1.00 15.55 C
ATOM 704 Nl U A 47 9.692 14.810 1.530 1.00 16.06 N
ATOM 705 C2 U A 47 9.596 14.735 2.916 1.00 14.37 C
ATOM 706 02 U A 47 9.141 13.791 3.505 1.00 16.51 o
ATOM 707 N3 U A 47 10.080 15.822 3.587 1.00 14.95 N
ATOM 708 C4 U A 47 10.653 16.933 3.042 1.00 13.66 C
ATOM 709 04 U A 47 10.944 17.868 3.111 1.00 15.44 O
ATOM 710 C5 U A 47 10.761 16.919 1.612 1.00 15.07 C
ATOM 711 C6 U A 47 10.287 15.883 0.923 1.00 14.65 C
ATOM 712 P U A 48 6.564 10.960 1.926 1.00 19.85 P
ATOM 713 OIP U A 48 5.831 10.040 1.031 1.00 17.53 o
ATOM 714 O2P U A 48 7.437 10.406 2.967 1.00 19.96 o
ATOM 715 05* U A 48 5.542 12.008 2.571 1.00 18.50 o
ATOM 716 C5* U A 48 4.823 12.913 -1 720 1.00 20.70 C
ATOM 717 C4* U A 48 3.830 13.767 2.497 1.00 19.87 C
ATOM 718 04* U A 48 2.984 12.978 3.363 1.00 21.40 o
ATOM 719 C3* U A 48 2.919 14.504 1.528 1.00 22.14 C
ATOM 720 03* U A 48 3.091 15.908 1.719 1.00 21.98 o
ATOM 721 C2* U A 48 1.483 14.183 -1 964 1.00 21.79 C
ATOM 722 02* U A 48 0.647 15.315 2.023 1.00 26.06 o
ATOM 723 Cl* U A 48 1.688 13.540 3.338 1.00 21.49 C
ATOM 724 Nl U A 48 0.705 12.541 -3 764 1.00 19.91 N
ATOM 725 C2 U A 48 -0.172 12.916 -4.772 1.00 21.91 C
ATOM 726 02 U A 48 -0.152 14.035 -5.305 1.00 19.91 o
ATOM 727 N3 U A 48 -1.064 11.951 -5.142 1.00 19.50 N
ATOM 728 C4 U A 48 -1.172 10.684 -4.623 1.00 21.21 C
ATOM 729 04 U A 48 -2.052 9.936 -5.040 1.00 20.77 o
ATOM 730 C5 U A 48 -0.228 10.375 -3.587 1.00 20.54 C
ATOM 731 C6 U A 48 0.653 11.297 -3.207 1.00 21.02 C
ATOM 732 P U A 49 4.393 16.672 -1.154 1.00 24.27 P
ATOM 733 OIP U A 49 4.112 18.081 -1, .444 1.00 21.67 o
ATOM 734 O2P U A 49 5.651 16.051 -1..658 1.00 22.02 o
ATOM 735 05* U A 49 4.355 16.404 0.420 1.00 21.65 o
ATOM 736 C5* U A 49 3.224 16.736 1.195 1.00 22.80 C
ATOM 737 C4* U A 49 3.533 16.647 2.677 1.00 22.95 C
ATOM 738 04* U A 49 3.474 15.273 3.129 1.00 24.98 o
ATOM 739 C3* U A 49 4.962 17.109 2.990 1.00 23.23 C
ATOM 740 03* U A 49 4.958 17.936 4.127 1.00 22.14 o
ATOM 741 C2* U A 49 5.807 15.844 3.138 1.00 22.97 C
ATOM 742 02* U A 49 6.804 15.917 4.151 1.00 21.66 o
ATOM 743 Cl* U A 49 4.739 14.866 3, .606 1.00 21.87 C
ATOM 744 Nl U A 49 4.895 13.418 3.466 1.00 20.97 N
ATOM 745 C2 U A 49 4.504 12.676 4.556 1.00 18.94 C
ATOM 746 02 U A 49 4.042 13.182 5.570 1.00 19.40 o
ATOM 747 N3 U A 49 4.657 11.327 4.423 .00 19.76 N
ATOM 748 C4 U A 49 5.143 10.659 3.335 .00 19.27 C
ATOM 749 04 U A 49 5.181 9.441 3.367 1.00 19.94 o ATOM 750 C5 U A 49 5.533 11.493 2.231 1.00 19.78 -C
ATOM 751 C6 U A 49 5.398 12.822 2.336 1.00 21.36 C
ATOM 752 P C A 50 6.149 18.965 4.371 1.00 25.95 P
ATOM 753 OIP C A 50 5..898 20.166 3.536 1.00 23.44 o
ATOM 754 02P C A 50 7..459 18.286 4.288 1.00 25.26 o
ATOM 755 05* C A 50 5.932 19.376 5.884 1.00 22.51 o
ATOM 756 C5* C A 50 4.862 20.228 6.238 1.00 24.64 C
ATOM 757 C4* C A 50 4.500 19.987 7..664 1.00 23.18 C
ATOM 758 04* C A 50 3..946 18.653 7..796 1.00 22.19 o
ATOM 759 C3* C A 50 5..714 19.971 8.569 1.00 23.50 C
ATOM 760 03* C A 50 6.031 21.297 8.951 1.00 23.96 o
ATOM 761 C2* C A 50 5.211 19.146 9.739 1.00 22.22 C
ATOM 762 02* C A 50 4.390 19.925 10.581 1.00 23.29 o
ATOM 763 Cl* C A 50 4.379 18.081 9.020 1.00 20.79 C
ATOM 764 Nl C A 50 5.148 16.857 8.718 1.00 18.62 N
ATOM 765 C2 C A 50 5.579 16.061 9.782 1.00 18.06 C
ATOM 766 02 C A 50 5.294 16.403 10.944 1.00 18.10 o
ATOM 767 N3 C A 50 6.286 14.924 9.529 1.00 18.43 N
ATOM 768 C4 C A 50 6.541 14.559 8.278 1.00 16.95 C
ATOM 769 N4 C A 50 7.187 13.390 8.096 1.00 16.78 N
ATOM 770 C5 C A 50 6.127 15.362 7.156 1.00 18.20 C
ATOM 771 C6 C A 50 5.435 16.496 7.425 1.00 18.70 C
ATOM 772 P U A 51 7..545 21.802 8.901 1.00 26.70 P
ATOM 773 OIP U A 51 7..531 23.156 9.484 1.00 26.93 o
ATOM 774 02P U A 51 8.082 21.612 7.532 1.00 25.00 o
ATOM 775 05* U A 51 8.299 20.823 9.912 1.00 25.55 o
ATOM lie C5* U A 51 8.050 20.887 11.311 1.00 24.16 C
ATOM 111 C4* U A 51 8.585 19.653 12.005 1.00 21.45 C
ATOM 778 04* U A 51 7.971 18.463 11.431 1.00 22.38 o
ATOM 779 C3* U A 51 10.072 19.376 11.858 1.00 22.73 C
ATOM 780 03* U A 51 10.845 20.123 12.799 1.00 23.34 o
ATOM 781 C2* U A 51 10.136 17.895 12.196 1.00 19.59 C
ATOM 782 02* U A 51 9.952 17.713 13.569 1.00 19.61 o
ATOM 783 Cl* U A 51 8.878 17.367 11.512 1.00 19.68 C
ATOM 784 Nl U A 51 9.123 16.859 10.157 1.00 18.48 N
ATOM 785 C2 U A 51 9.636 15.574 10.020 1.00 17.36 C
ATOM 786 02 U A 51 9.930 14.862 10.967 1.00 16.91 o
ATOM 787 N3 U A 51 9.794 15.153 8.726 1.00 16.05 N
ATOM 788 C4 U A 51 9.507 15.862 7.571 1.00 16.11 C
ATOM 789 04 U A 51 9.679 15.322 6.462 1.00 15.90 o
ATOM 790 C5 U A 51 8.995 17.179 7.796 1.00 16.34 C
ATOM 791 C6 U A 51 8.832 17.627 9.048 1.00 16.75 C
ATOM 792 P A A 52 12.365 20.517 12.450 1.00 25.19 P
ATOM 793 OIP A A 52 12.846 21.398 13.550 1.00 27.50 o
ATOM 794 O2P A A 52 12.491 20.983 11.045 1.00 24.46 o
ATOM 795 05* A A 52 13.148 19.125 12.569 1.00 22.18 o
ATOM 796 C5* A A 52 13.100 18.336 13.756 1.00 18.74 C
ATOM 797 C4* A A 52 13.709 16.971 13.493 1.00 17.82 C
ATOM 798 04* A A 52 12.882 16.221 12.560 1.00 17.99 o
ATOM 799 C3* A A 52 15.082 17.021 12.845 1.00 17.21 C
ATOM 800 03* A A 52 16.060 17.141 13.881 1.00 17.77 o
ATOM 801 C2* A A 52 15.156 15.672 12.120 1.00 16.59 C
ATOM 802 02* A A 52 15.493 14.600 12.980 1.00 13.63 o
ATOM 803 Cl* A A 52 13.704 15.476 11.671 1.00 16.93 C
ATOM 804 N9 A A 52 13.417 15.925 10.298 1.00 18.02 N
ATOM 805 C8 A A 52 12.870 17.137 9.918 1.00 16.91 C
ATOM 806 N7 A A 52 12.661 17.242 8.623 1.00 15.76 N
ATOM 807 C5 A A 52 13.113 16.029 8.111 1.00 14.73 C
ATOM 808 C6 A A 52 13.146 15.509 6.798 1.00 13.71 C ATOM 809 N6 A A 52 12.694 16.170 5.730 1.00 10.55 N
ATOM 810 Nl A A 52 13.660 14.265 6.626 1.00 13.71 N
ATOM 811 C2 A A 52 14.109 13.603 7.708 1.00 14.81 C
ATOM 812 N3 A A 52 14.126 13.984 8.990 1.00 14.94 N
ATOM 813 C4 A A 52 13.603 15.217 9.127 1.00 15.57 C
ATOM 814 P C A 53 17.368 18.050 13.670 1.00 17.51 P
ATOM 815 O1P C A 53 18.117 18.008 14.931 1.00 19.79 O
ATOM 816 O2P C A 53 17.010 19.365 13.065 1.00 18.18 O
ATOM 817 O5* C A 53 18.238 17.298 12.563 1.00 17.18 O
ATOM 818 C5* C A 53 18.901 16.077 12.842 1.00 15.96 C
ATOM 819 C4* C A 53 19.332 15.446 11.543 1.00 15.86 C
ATOM 820 04* C A 53 18.121 15.192 10.776 1.00 13.93 O
ATOM 821 C3* C A 53 20.160 16.337 10.613 1.00 15.87 C
ATOM 822 03* C A 53 21.559 16.258 10.903 1.00 17.45 O
ATOM 823 C2* C A 53 19.874 15.668 9.269 1.00 14.61 C
ATOM 824 02* C A 53 20.537 14.416 9.169 1.00 13.51 O
ATOM 825 Cl* C A 53 18.388 15.358 9.402 1.00 14.62 C
ATOM 826 Nl C A 53 17.495 16.398 8.857 1.00 15.30 N
ATOM 827 C2 C A 53 16.863 16.150 7.629 1.00 16.59 C
ATOM 828 02 C A 53 17.047 15.036 7.072 1.00 15.08 O
ATOM 829 N3 C A 53 16.064 17.117 7.081 1.00 15.46 N
ATOM 830 C4 C A 53 15.882 18.269 7..728 1.00 15.94 C
ATOM 831 N4 C A 53 15.106 19.193 7.161 1.00 17.55 N
ATOM 832 C5 C A 53 16.499 18.534 8.995 1.00 18.55 C
ATOM 833 C6 C A 53 17.293 17.579 9.514 1.00 17.04 C
ATOM 834 P C A 54 22.305 17.435 11.741 1.00 20.49 P
ATOM 835 O1P C A 54 21.325 18.517 12.085 1.00 24.43 O
ATOM 836 O2P C A 54 23.512 17.794 10.991 1.00 22.51 O
ATOM 837 05* C A 54 22.926 16.655 12.998 1.00 23.22 O
ATOM 838 C5* C A 54 22.304 16.618 14.248 1.00 20.44 C
ATOM 839 C4* C A 54 22.299 15.199 14.853 1.00 16.62 C
ATOM 840 04* C A 54 21.619 14.289 13.954 1.00 13.18 O
ATOM 841 C3* C A 54 23.510 14.352 15.283 1.00 15.61 C
ATOM 842 03* C A 54 24.117 14.814 16.513 1.00 15.53 O
ATOM 843 C2* C A 54 22.766 13.054 15.620 1.00 13.43 C
ATOM 844 02* C A 54 22.037 13.231 16.823 1.00 10.21 O
ATOM 845 Cl* C A 54 21.731 12.976 14.489 1.00 12.97 C
ATOM 846 Nl C A 54 22.153 12.031 13.438 1.00 11.15 N
ATOM 847 C2 C A 54 22.102 10.675 13.741 1.00 11.28 C
ATOM 848 02 C A 54 21.721 10.338 14.877 1.00 10.76 O
ATOM 849 N3 C A 54 22.477 9.762 12.803 1, 00 11.24 N
ATOM 850 C4 C A 54 22.895 10.172 11.606 1, 00 11.60 C
ATOM 851 N4 C A 54 23.244 9.240 10.718 1.00 10.41 N
ATOM 852 C5 C A 54 22.965 11.562 11.266 00 11.08 C
ATOM 853 C6 C A 54 22.582 12.450 12.207 00 9.08 C
ATOM 854 P G A 55 25.594 14.295 16.976 00 13.05 P
ATOM 855 01P G A 55 25.962 15.246 18.039 00 12.26 O
ATOM 856 02P G A 55 26.496 14.069 15.850 00 11.41 O
ATOM 857 05* G A 55 25.399 12.866 17.663 00 10.95 O
ATOM 858 C5* G A 55 24.831 12.746 18.965 00 14.14 C
ATOM 859 C4* G A 55 24.802 11.304 19.405 00 11.91 C
ATOM 860 O4* G A 55 23.968 10.508 18.509 00 13.05 O
ATOM 861 C3* G A 55 26.149 10.594 19.367 00 11.99 C
ATOM 862 03* G A 55 26.929 10.898 20.521 00 13.10 O
ATOM 863 C2* G A 55 25.728 9.140 19.334 00 11.08 C
ATOM 864 02* G A 55 25.292 8.701 20.587 00 10.72 O
ATOM 855 Cl* G A 55 24.527 9.203 18.390 00 11.23 C
ATOM 866 N9 G A 55 24.931 8.998 17.004 00 10.75 N
ATOM 867 C8 G A 55 25.171 9.961 16.045 1.00 7.33 C ATOM 868 N7 G A 55 25.353 9.455 14.847 1.00 9.34 N
ATOM 869 C5 G A 55 25.283 8.079 15.039 1.00 7.65 C
ATOM 870 C6 G A 55 25.397 7.019 14.114 1.00 8.58 C
ATOM 871 06 G A 55 25.565 7.082 12.885 1.00 7.74 O
ATOM 872 Nl G A 55 25.286 5.775 14.741 1.00 7.05 N
ATOM 873 C2 G A 55 25.090 5.584 16.085 1.00 9.83 C
ATOM 874 N2 G A 55 25.007 4.306 16.495 1.00 12.00 N
ATOM 875 N3 G A 55 24.974 6.572 16.964 1.00 9.85 N
ATOM 876 C4 G A 55 25.068 7.781 16.375 1.00 9.88 C
ATOM 877 P G A 56 28.506 11.118 20.370 1.00 13.76 P
ATOM 878 OIP G A 56 28.941 11.806 21.603 1.00 13.53 O
ATOM 879 O2P G A 56 28.773 11.722 19.054 1.00 14.23 O
ATOM 880 05* G A 56 29.116 9.645 20.366 1.00 13.32 o
ATOM 881 C5* G A 56 29.082 8.849 21.551 1.00 13.87 C
ATOM 882 C4* G A 56 29.183 7.374 21.197 1.00 10.21 C
ATOM 883 04* G A 56 28.057 7.023 20.333 1.00 11.28 o
ATOM 884 C3* G A 56 30.397 6.922 20.404 1.00 11.41 C
ATOM 885 03* G A 56 31.527 6.711 21.273 1.00 9.71 o
ATOM 886 C2* G A 56 29.878 5.605 19.841 1.00 8.73 C
ATOM 887 02* G A 56 29.794 4.659 20.902 1.00 8.39 o
ATOM 888 Cl* G A 56 28.462 6.009 19.418 1.00 10.13 C
ATOM 889 N9 G A 56 28.486 6.581 18.071 1.00 10.46 N
ATOM 890 C8 G A 56 28.551 7.917 17.730 1.00 8.60 C
ATOM 891 N7 G A 56 28.622 8.113 16.441 1.00 9.93 N
ATOM 892 C5 G A 56 28.582 6.835 15.897 1.00 8.16 C
ATOM 893 C6 G A 56 28.626 6.411 14.551 1.00 7.93 C
ATOM 894 06 G A 56 28.702 7.101 13.539 1.00 9.17 o
ATOM 895 Nl G A 56 28.583 5.011 14.447 1.00 7.02 N
ATOM 896 C2 G A 56 28.536 4.139 15.508 1.00 9.44 C
ATOM 897 N2 G A 56 28.557 2.786 15.209 1.00 7.72 N
ATOM 898 N3 G A 56 28.484 4.537 16.776 1.00 8.70 N
ATOM 899 C4 G A 56 28.504 5.880 16.892 1.00 8.72 C
ATOM 900 P G A 57 32.995 6.619 20.673 1.00 13.57 P
ATOM 901 OIP G A 57 33.919 6.615 21.837 1.00 11.27 o
ATOM 902 02P G A 57 33.141 7.686 19.611 1.00 15.12 o
ATOM 903 05* G A 57 33.076 5.220 19.904 1.00 11.64 o
ATOM 904 C5* G A 57 32.985 3.994 20.606 1.00 10.61 C
ATOM 905 C4* G A 57 32.946 2.834 19.636 1.00 10.65 C
ATOM 906 04* G A 57 31.764 2.920 18.802 1.00 10.66 o
ATOM 907 C3* G A 57 34.095 2.749 18.637 1.00 11.12 C
ATOM 908 03* G A 57 35.211 2.092 19.227 1.00 12.09 o
ATOM 909 C2* G A 57 33.485 1.856 17.579 1.00 10.15 C
ATOM 910 02* G A 57 33.432 0.527 18.073 1.00 11.67 o
ATOM 911 Cl* G A 57 32.080 2.447 17.495 1.00 10.13 C
ATOM 912 N9 G A 57 32.032 3.577 16.566 1.00 9.94 N
ATOM 913 C8 G A 57 31.948 4.915 16.870 1.00 8.87 C
ATOM 914 N7 G A 57 31.840 5.680 15.806 1.00 9.77 N
ATOM 915 C5 G A 57 31.896 4.788 14.736 1.00 9.04 C
ATOM 916 C6 G A 57 31.841 5.018 13.333 1.00 7.94 C
ATOM 917 06 G A 57 31.697 6.098 12.719 1.00 7.09 O
ATOM 918 Nl G A 57 31.968 3.833 12.617 1.00 7.83 N
ATOM 919 C2 G A 57 32.123 2.586 13.170 1.00 9.14 C
ATOM 920 N2 G A 57 32.295 1.559 12.302 1.00 10.54 N
ATOM 921 N3 G A 57 32.131 2.356 14.477 1.00 8.97 N
ATOM 922 C4 G A 57 32.030 3.490 15.192 1.00 8.38 C
ATOM 923 P C A 58 36.691 2.608 18.919 1.00 16.58 P
ATOM 924 OIP C A 58 37.600 1.734 19.679 1.00 19.45 o
ATOM 925 O2P C A 58 36.781 4.087 19.087 1.00 16.41 o
ATOM 926 05* C A 58 36.862 2.269 17.367 1.00 16.52 o ATOM 927 C5* C A 58 36.965 0.921 16.950 1.00 15.10 C
ATOM 928 C4* C A 58 36.833 0.823 15.451 1.00 15.33 C
ATOM 929 04* C A 58 35.542 1.354 15.026 1.00 13.38 o
ATOM 930 C3* C A 58 37.823 1.632 14.626 1.00 15.47 C
ATOM 931 03* C A 58 39.082 0.999 14.519 1.00 18.74 o
ATOM 932 C2* C A 58 37.111 1.673 13.286 1.00 11.17 C
ATOM 933 02* C A 58 37.125 0.418 12.637 1.00 13.01 o
ATOM 934 Cl* C A 58 35.686 1.967 13.742 1.00 11.98 C
ATOM 935 Nl C A 58 35.440 3.426 13.858 1.00 10.35 N
ATOM 936 C2 C A 58 35.230 4.131 12.682 1.00 10.97 C
ATOM 937 02 C A 58 35.275 3.491 11.594 1.00 7.46 o
ATOM 938 N3 C A 58 34.999 5.473 12.736 1.00 8.91 N
ATOM 939 C4 C A 58 34.990 6.105 13.916 1.00 10.48 C
ATOM 940 N4 C A 58 34.733 7.422 13.921 1.00 7.52 N
ATOM 941 C5 C A 58 35.227 5.405 15.148 1.00 10.36 C
ATOM 942 C6 C A 58 35.431 4.071 15.068 1.00 9.90 C
ATOM 943 P A A 59 40.407 1.880 14.428 1.00 19.54 P
ATOM 944 OIP A A 59 41.476 0.909 14.728 1.00 23.14 o
ATOM 945 02P A A 59 40.304 3.121 15.233 1.00 20.05 o
ATOM 946 05* A A 59 40.518 2.324 12.905 1.00 18.97 o
ATOM 947 C5* A A 59 40.522 1.349 11.874 1.00 17.12 C
ATOM 948 C4* A A 59 40.269 1.997 10.535 1.00 14.95 C
ATOM 949 04* A A 59 38.970 2.626 10.543 1.00 12.74 o
ATOM 950 C3* A A 59 41.236 3.099 10.117 1.00 15.97 C
ATOM 951 03* A A 59 42.370 2.521 9.460 1.00 17.11 o
ATOM 952 C2* A A 59 40.403 3.881 9.110 1.00 13.15 C
ATOM 953 02* A A 59 40.358 3.242 7.841 1.00 15.00 o
ATOM 954 Cl* A A 59 39.025 3.807 9.760 1.00 12.41 C
ATOM 955 N9 A A 59 38.757 4.934 10.638 1.00 9.57 N
ATOM 956 C8 A A 59 38.679 4.950 12.014 1.00 9.34 C
ATOM 957 N7 A A 59 38.272 6.102 12.500 1.00 10.38 N
ATOM 958 C5 A A 59 38.113 6.898 11.371 1.00 9.84 C
ATOM 959 C6 A A 59 37.677 8.220 11.208 1.00 10.49 C
ATOM 960 N6 A A 59 37.293 8.991 12.234 1.00 9.11 N
ATOM 961 Nl A A 59 37.636 8.724 9.946 1.00 8.89 N
ATOM 962 C2 A A 59 37.996 7.927 8.925 1.00 9.88 C
ATOM 963 N3 A A 59 38.421 6.652 8.958 1.00 9.56 N
ATOM 964 C4 A A 59 38.454 6.200 10.225 1.00 11.25 C
ATOM 965 P C A 60 43.798 3.248 9.508 1.00 19.04 P
ATOM 966 OIP C A 60 44.656 2.375 8.681 1.00 20.33 o
ATOM 967 02P C A 60 44.215 3.641 10.866 1.00 19.75 o
ATOM 968 05* C A 60 43.620 4.636 8.731 1.00 18.48 o
ATOM 969 C5* C A 60 43.338 4.688 7.333 1.00 18.85 C
ATOM 970 C4* C A 60 42.905 6.090 6.953 1.00 17.29 C
ATOM 971 04* C A 60 41.667 6.421 7.652 1.00 16.10 o
ATOM 972 C3* C A 60 43.873 7.165 7.423 1.00 17.86 C
ATOM 973 03* C A 60 44.971 7.328 6.520 1.00 15.95 o
ATOM 974 C2* C A 60 42.968 8.392 7.489 1.00 15.75 C
ATOM 975 02* C A 60 42.660 8.899 6.212 1.00 13.38 o
ATOM 976 Cl* C A 60 41.697 7.779 8.067 1.00 14.86 C
ATOM 977 Nl C A 60 41.611 7.830 9.544 1.00 13.02 N
ATOM 978 C2 C A 60 41.114 8.986 10.149 1.00 11.98 C
ATOM 979 02 C A 60 40.879 9.970 9.439 1.00 9.87 o
ATOM 980 N3 C A 60 40.914 8.997 11.493 1.00 10.09 N
ATOM 981 C4 C A 60 41.219 7.909 12.218 1.00 11.18 C
ATOM 982 N4 C A 60 40.938 7.915 13.561 1.00 10.47 N
ATOM 983 C5 C A 60 41.802 6.761 11.628 1.00 12.15 C
ATOM 984 C6 C A 60 41.979 6.759 10.306 1.00 12.05 C
ATOM 985 P C A 61 46.349 7.925 7.060 1.00 17.42 P ATOM 986 OIP C A 61 47.356 7.713 5.977 1.00 20.07 o
ATOM 987 O2P C A 61 46.636 7.464 8.434 1.00 19.05 o
ATOM 988 05* C A 61 46.133 9.494 7.201 1.00 17.65 o
ATOM 989 C5* C A 61 45.764 10.291 6.081 1.00 17.42 C
ATOM 990 C4* C A 61 45.277 11.640 6.565 1.00 16.64 C
ATOM 991 04* C A 61 44.139 11.438 7.426 1.00 14.79 o
ATOM 992 C3* C A 61 46.263 12.391 7.446 1.00 17.63 C
ATOM 993 03* C A 61 47.139 13.091 6.584 1.00 20.87 o
ATOM 994 C2* C A 61 45.331 13.335 8.203 1.00 16.43 C
ATOM 995 02* C A 61 44.852 14.369 7.365 1.00 14.86 o
ATOM 996 Cl* C A 61 44.138 12.417 8.454 1.00 13.94 C
ATOM 997 Nl C A 61 44.153 11.729 9.750 1.00 13.10 N
ATOM 998 C2 C A 61 43.580 12.368 10.847 1.00 12.09 C
ATOM 999 02 C A 61 43.175 13.525 10.710 1.00 13.24 o
ATOM 1000 N3 C A 61 43.489 11.711 12.021 1.00 11.67 N
ATOM 1001 C4 C A 61 43.975 10.468 12.138 1.00 12.40 C
ATOM 1002 N4 C A 61 43.831 9.835 13.322 1.00 12.14 N
ATOM 1003 C5 C A 61 44.626 9.812 11.054 1.00 12.01 C
ATOM 1004 C6 C A 61 44.688 10.474 9.885 1.00 12.81 C
ATOM 1005 P G A 62 48.728 12.846 6.660 1.00 24.16 P
ATOM 1006 OIP G A 62 49.270 13.445 5.410 1.00 23.67 o
ATOM 1007 O2P G A 62 48.943 11.426 6.942 1.00 23.26 o
ATOM 1008 05* G A 62 49.205 13.690 7.911 1.00 26.62 o
ATOM 1009 C5* G A 62 50.589 13.855 8.182 1.00 30.96 C
ATOM 1010 C4* G A 62 50.774 14.353 9.582 1.00 33.50 C
ATOM 1011 04* G A 62 50.528 13.270 10.506 1.00 33.47 o
ATOM 1012 C3* G A 62 52.218 14.771 9.809 1.00 34.85 C
ATOM 1013 03* G A 62 52.203 15.955 10.570 1.00 37.99 o
ATOM 1014 C2* G A 62 52.906 13.588 10.482 1.00 34.23 C
ATOM 1015 02* G A 62 53.808 13.966 11.496 1.00 36.44 o
ATOM 1016 Cl* G A 62 51.721 12.929 11.174 1.00 33.41 C
ATOM 1017 N9 G A 62 51.722 11.510 11.492 1.00 31.26 N
ATOM 1018 C8 G A 62 52.117 10.448 10.718 1.00 31.91 C
ATOM 1019 N7 G A 62 51.933 9.296 11.308 1.00 31.27 N
ATOM 1020 C5 G A 62 51.407 9.628 12.552 1.00 30.56 C
ATOM 1021 C6 G A 62 50.995 8.807 13.633 1.00 31.34 C
ATOM 1022 06 G A 62 51.018 7.569 13.717 1.00 32.16 O
ATOM 1023 Nl G A 62 50.498 9.572 14.697 1.00 30.15 N
ATOM 1024 C2 G A 62 50.401 10.940 14.706 1.00 26.83 C
ATOM 1025 N2 G A 62 49.859 11.501 15.789 1.00 25.94 N
ATOM 1026 N3 G A 62 50.792 11.704 13.717 1.00 28.92 N
ATOM 1027 C4 G A 62 51.277 10.988 12.677 1.00 30.49 C
ATOM 1028 P U A 63 52.954 17.235 10.045 1.00 41.73 P
ATOM 1029 OIP U A 63 53.058 17.081 8.559 1.00 41.63 o
ATOM 1030 O2P U A 63 54.175 17.334 10.869 1.00 43.38 o
ATOM 1031 05* U A 63 51.997 18.439 10.460 1.00 39.87 o
ATOM 1032 C5* U A 63 50.763 18.720 9.780 1.00 36.71 C
ATOM 1033 C4* U A 63 49.836 19.408 10.747 1.00 35.04 C
ATOM 1034 04* U A 63 49.068 18.411 11.462 1.00 32.03 o
ATOM 1035 C3* U A 63 50.675 20.143 11.781 1.00 35.26 C
ATOM 1036 03* U A 63 50.995 21.511 11.420 1.00 38.70 o
ATOM 1037 C2* U A 63 50.130 19.755 13.165 1.00 33.49 C
ATOM 1038 02* U A 63 49.712 20.770 14.049 1.00 37.02 o
ATOM 1039 Cl* U A 63 49.029 18.735 12.834 1.00 30.93 C
ATOM 1040 Nl U A 63 48.986 17.488 13.615 1.00 27.66 N
ATOM 1041 C2 U A 63 48.353 17.544 14.818 1.00 24.66 C
ATOM 1042 02 U A 63 47.901 18.573 15.248 1.00 22.79 o
ATOM 1043 N3 U A 63 48.278 16.352 15.499 1.00 23.74 N
ATOM 1044 C4 U A 63 48.798 15.145 15.102 1.00 23.73 C ATOM 1045 04 U A 63 48.743 14.175 15.859 1.00 24.68 o
ATOM 1046 C5 U A 63 49.463 15.178 13.841 1.00 24.93 C
ATOM 1047 C6 U A 63 49.531 16.320 13.160 1.00 25.63 C
ATOM 1048 P A A 64 49.882 22.684 11.491 1.00 37.56 P
ATOM 1049 OIP A A 64 50.561 23.954 11.785 1.00 39.16 o
ATOM 1050 O2P A A 64 48.748 22.257 12.335 1.00 40.01 o
ATOM 1051 05* A A 64 49.353 22.820 9.997 1.00 34.07 o
ATOM 1052 C5* A A 64 48.031 23.290 9.741 1.00 28.17 C
ATOM 1053 C4* A A 64 47.481 22.620 8.508 1.00 24.51 C
ATOM 1054 04* A A 64 47.732 21.200 8.571 1.00 23.04 o
ATOM 1055 C3* A A 64 45.992 22.796 8.296 1.00 22.46 C
ATOM 1056 03* A A 64 45.849 23.906 7.425 1.00 20.90 o
ATOM 1057 C2* A A 64 45.604 21.505 7.587 1.00 20.89 C
ATOM 1058 02* A A 64 45.901 21.599 6.225 1.00 22.90 o
ATOM 1059 Cl* A A 64 46.571 20.494 8.203 1.00 20.25 C
ATOM 1060 N9 A A 64 46.130 19.781 9.396 1.00 17.85 N
ATOM 1061 C8 A A 64 45.524 20.267 10.530 1.00 16.44 C
ATOM 1062 N7 A A 64 45.377 19.367 11.477 1.00 16.36 N
ATOM 1063 C5 A A 64 45.897 18.211 10.916 1.00 16.10 C
ATOM 1064 C6 A A 64 46.056 16.912 11.416 1.00 17.55 C
ATOM 1065 N6 A A 64 45.703 16.547 12.661 1.00 17.95 N
ATOM 1066 Nl A A 64 46.602 15.982 10.594 1.00 16.51 N
ATOM 1067 C2 A A 64 46.978 16.359 9.358 1.00 17.36 C
ATOM 1068 N3 A A 64 46.889 17.563 8.781 1.00 17.52 N
ATOM 1069 C4 A A 64 46.333 18.447 9.623 1.00 16.65 C
ATOM 1070 P A A 65 44.523 24.788 7.455 1.00 26.00 P
ATOM 1071 OIP A A 65 44.761 25.977 6.601 1.00 22.85 o
ATOM 1072 02P A A 65 44.095 24.965 8.870 1.00 23.33 o
ATOM 1073 05* A A 65 43.412 23.897 6.726 1.00 21.03 o
ATOM 1074 C5* A A 65 43.457 23.677 5.321 1.00 18.24 C
ATOM 1075 C4* A A 65 42.507 22.569 4.928 1.00 13.85 C
ATOM 1076 04* A A 65 42.806 21.347 5.676 1.00 12.93 o
ATOM 1077 C3* A A 65 41.079 22.964 5.312 1.00 13.37 C
ATOM 1078 03* A A 65 40.259 22.339 4.329 1.00 13.50 o
ATOM 1079 C2* A A 65 40.840 22.298 6.671 1.00 12.65 C
ATOM 1080 02* A A 65 39.528 21.867 6.921 1.00 13.66 o
ATOM 1081 Cl* A A 65 41.701 21.055 6.509 1.00 12.78 C
ATOM 1082 N9 A A 65 42.072 20.249 7.661 1.00 12.77 N
ATOM 1083 C8 A A 65 42.096 20.565 8.995 1.00 12.34 C
ATOM 1084 N7 A A 65 42.398 19.546 9.767 1.00 12.41 N
ATOM 1085 C5 A A 65 42.597 18.501 8.875 1.00 12.05 C
ATOM 1086 C6 A A 65 42.910 17.149 9.070 1.00 10.90 C
ATOM 1087 N6 A A 65 43.092 16.609 10.272 1.00 10.50 N
ATOM 1088 Nl A A 65 43.027 16.366 7.977 1.00 11.90 N
ATOM 1089 C2 A A 65 42.832 16.924 6.765 1.00 12.11 C
ATOM 1090 N3 A A 65 42.525 18.188 6.460 1.00 11.24 N
ATOM 1091 C4 A A 65 42.420 18.925 7.577 1.00 11.53 C
ATOM 1092 P A A 66 39.022 23.077 3.643 1.00 14.28 P
ATOM 1093 OIP A A 66 39.543 23.558 2.325 1.00 15.32 O
ATOM 1094 02P A A 66 38.338 24.024 4.542 1.00 14.63 O
ATOM 1095 05* A A 66 38.069 21.846 3.338 1.00 13.61 O
ATOM 1096 C5* A A 66 37.465 21.085 4.393 1.00 11.48 C
ATOM 1097 C4* A A 66 37.723 19.608 4.177 1.00 10.84 C
ATOM 1098 04* A A 66 39.051 19.241 4.669 1.00 10.26 O
ATOM 1099 C3* A A 66 36.836 18.669 4.967 1.00 10.58 C
ATOM 1100 03* A A 66 35.542 18.590 4.389 1.00 8.43 O
ATOM 1101 C2* A A 66 37.616 17.369 4.831 1.00 10.04 C
ATOM 1102 02* A A 66 37.536 16.826 3.529 1.00 11.06 o
ATOM 1103 Cl* A A 66 39.042 17.870 5.094 1.00 10.87 C ATOM 1104 N9 A A 66 39.240 17.798 6.548 1.00 10.17 N
ATOM 1105 C8 A A 66 39.155 18.790 7.496 1.00 10.03 C
ATOM 1106 N7 A A 66 39.261 18.348 8.733 1.00 8.56 N
ATOM 1107 C5 A A 66 39.441 16.982 8.588 1.00 6.54 C
ATOM 1108 C6 A A 66 39.556 15.933 9.533 1.00 10.49 C
ATOM 1109 N6 A A' 66 39.492 16.109 10.872 1.00 8.52 N
ATOM 1110 Nl A A 66 39.716 14.683 9.059 1.00 10.25 N
ATOM 1111 C2 A A 66 39.726 14.497 7.720 1.00 9.28 C
ATOM 1112 N3 A A 66 39.599 15.400 6.739 1.00 6.99 N
ATOM 1113 C4 A A 66 39.470 16.634 7.247 1.00 7.89 C
ATOM 1114 P U A 67 34.254 18.628 5.328 1.00 7.15 P
ATOM 1115 OIP U A 67 33.113 18.369 4.451 1.00 8.19 O
ATOM 1116 O2P U A 67 34.265 19.889 6.140 1.00 10.54 O
ATOM 1117 05* U A 67 34.515 17.404 6.352 1.00 9.66 O
ATOM 1118 C5* U A 67 33.879 16.134 6.178 1.00 9.63 C
ATOM 1119 C4* U A 67 34.826 15.139 5.537 1.00 8.56 C
ATOM 1120 04* U A 67 36.010 14.944 6.353 1.00 11.36 O
ATOM 1121 C3* U A 67 34.231 13.746 5.426 1.00 10.38 C
ATOM 1122 03* U A 67 33.422 13.688 4.257 1.00 9.95 O
ATOM 1123 C2* U A 67 35.473 12.885 5.331 1.00 9.97 C
ATOM 1124 02* U A 67 36.048 13.024 4.062 1.00 9.19 O
ATOM 1125 Cl* U A 67 36.395 13.578 6.337 1.00 11.42 C
ATOM 1126 Nl U A 67 36.318 13.066 7.718 1.00 10.80 N
ATOM 1127 C2 U A 67 36.833 11.817 7.968 1.00 10.50 C
ATOM 1128 02 U A 67 37.312 11.117 7.101 1.00 10.56 O
ATOM 1129 N3 U A 67 36.765 11.408 9.272 1.00 9.20 N
ATOM 1130 C4 U A 67 36.239 12.104 10.335 1.00 13.14 C
ATOM 1131 04 U A 67 36.155 11.549 11.435 1.00 12.19 O
ATOM 1132 C5 U A 67 35.717 13.400 10.002 1.00 12.31 C
ATOM 1133 C6 U A 67 35.767 13.822 8.734 1.00 11.37 C
ATOM 1134 P G A 68 32.112 12.788 4.239 1.00 10.90 P
ATOM 1135 OIP G A 68 31.468 12.906 2.896 1.00 11.43 o
ATOM 1136 O2P G A 68 31.302 13.053 5.442 1.00 13.26 o
ATOM 1137 05* G A 68 32.721 11.308 4.326 1.00 11.48 o
ATOM 1138 C5* G A 68 33.371 10.739 3.203 1.00 11.16 C
ATOM 1139 C4* G A 68 33.998 9.411 3.573 1.00 9.62 C
ATOM 1140 04* G A 68 34.942 9.616 4.645 1.00 11.01 o
ATOM 1141 C3* G A 68 33.037 8.390 4.142 1.00 11.11 C
ATOM 1142 03* G A 68 32.398 7.725 3.054 1.00 10.48 o
ATOM 1143 C2* G A 68 33.998 7.461 4.874 1.00 9.29 C
ATOM 1144 02* G A 68 34.735 6.677 3.982 1.00 10.62 o
ATOM 1145 Cl* G A 68 34.963 8.466 5.487 1.00 8.97 C
ATOM 1146 N9 G A 68 34.546 8.820 6.844 1.00 8.81 N
ATOM 1147 C8 G A 68 33.966 9.976 7.304 1.00 8.53 C
ATOM 1148 N7 G A 68 33.723 9.947 8.594 1.00 8.57 N
ATOM 1149 C5 G A 68 34.195 8.708 9.005 1.00 7.34 C
ATOM 1150 C6 G A 68 34.222 8.109 10.303 1.00 6.86 C
ATOM 1151 06 G A 68 33.892 8.610 11.385 1.00 8.26 o
ATOM 1152 Nl G A 68 34.707 6.806 10.256 1.00 5.47 N
ATOM 1153 C2 G A 68 35.136 6.164 9.113 1.00 7.70 C
ATOM 1154 N2 G A 68 35.520 4.887 9.242 1.00 7.97 N
ATOM 1155 N3 G A 68 35.166 6.729 7.911 1.00 8.25 N
ATOM 1156 C4 G A 68 34.676 7.991 7.934 1.00 8.75 C
ATOM 1157 P U A 69 30.971 7.049 3.252 1.00 13.77 P
ATOM 1158 OIP U A 69 30.555 6.595 1.924 1.00 11.85 o
ATOM 1159 O2P U A 69 30.093 7.968 3.998 1.00 12.81 o
ATOM 1160 05* U A 69 31.231 5.779 4.195 1.00 11.34 o
ATOM 1161 C5* U A 69 32.060 4.700 3.787 1.00 11.10 C
ATOM 1162 C4* U A 69 32.278 3.754 4.958 1.00 9.72 C ATOM 1163 04* U A 69 33.037 4.436 6.006 1.00 8.84 O
ATOM 1164 C3* U A 69 31.010 3.319 5.663 1.00 8.43 C
ATOM 1165 03* U A 69 30.409 2.218 4.976 1.00 10.14 O
ATOM 1166 C2* U A 69 31.556 2.908 7.017 1.00 8.62 C
ATOM 1167 02* U A 69 32.248 1.685 6.931 1.00 7.78 O
ATOM 1168 Cl* U A 69 32.576 4.009 7.273 1.00 8.22 C
ATOM 1169 Nl U A 69 32.004 5.162 7.979 1.00 10.18 N
ATOM 1170 C2 U A 69 31.922 5.065 9.360 1.00 10.69 C
ATOM 1171 02 U A 69 32.209 4.017 9.964 1.00 11.15 O
ATOM 1172 N3 U A 69 31.478 6.210 10.003 1.00 9.29 N
ATOM 1173 C4 U A 69 31.088 7.405 9.411 1.00 9.74 C
ATOM 1174 04 U A 69 30.787 8.391 10.131 1.00 10.59 O
ATOM 1175 C5 ϋ A 69 31.145 7.399 7.972 1.00 8.22 C
ATOM 1176 C6 U A 69 31.587 6.303 7.317 1.00 8.57 C
ATOM 1177 P C A 70 28.841 1.987 5.039 1.00 11.98 P
ATOM 1178 O1P C A 70 28.566 1.076 3.893 1.00 13.06 O
ATOM 1179 O2P C A 70 28.126 3.300 5.133 1.00 13.69 O
ATOM 1180 05* C A 70 28.566 1.234 6.428 1.00 12.93 O
ATOM 1181 C5* C A 70 29.111 -0.064 6.679 1.00 12.54 C
ATOM 1182 C4* C A 70 29.095 -0.367 8.163 1.00 10.94 C
ATOM 1183 04* C A 70 29.877 0.624 8.879 1.00 12.72 O
ATOM 1184 C3* C A 70 27.735 -0.325 8.854 1.00 11.39 C
ATOM 1185 03* C A 70 27.050 -1.575 8.672 1.00 10.71 O
ATOM 1186 C2* C A 70 28.156 -0.164 10.305 1.00 10.22 C
ATOM 1187 02* C A 70 28.682 -1.378 10.815 1.00 10.73 O
ATOM 1188 Cl* C A 70 29.319 0.816 10.169 1.00 10.56 C
ATOM 1189 Nl C A 70 28.918 2.231 10.310 1.00 10.87 N
ATOM 1190 C2 C A 70 28.815 2.754 11.585 1.00 10.52 C
ATOM 1191 02 C A 70 28.888 1.981 12.549 1.00 13.89 O
ATOM 1192 N3 C A 70 28.618 4.069 11.750 1.00 9.90 N
ATOM 1193 C4 C A 70 28.472 4.864 10.678 1.00 11.60 C
ATOM 1194 N4 C A 70 28.339 6.173 10.891 1.00 9.69 N
ATOM 1195 C5 C A 70 28.469 4.342 9.351 1.00 9.75 C
ATOM 1196 C6 C A 70 28.697 3.027 9.213 1.00 10.87 C
ATOM 1197 P C A 71 25.506 -1.596 8.258 1.00 11.56 P
ATOM 1198 O1P C A 71 25.252 -2.969 7.708 1.00 11.38 O
ATOM 1199 O2P C A 71 25.231 -0.392 7.422 1.00 13.37 O
ATOM 1200 05* C A 71 24.680 -1.449 9.614 1.00 10.61 O
ATOM 1201 C5* C A 71 24.617 -2.546 10.513 1.00 10.38 C
ATOM 1202 C4* C A 71 24.552 -2.068 11.939 1.00 10.89 C
ATOM 1203 04* C A 71 25.639 -1.140 12.164 1.00 10.86 O
ATOM 1204 C3* C A 71 23.336 -1.255 12.362 1.00 11.86 C
ATOM 1205 03* C A 71 22.249 -2.098 12.693 1.00 11.40 O
ATOM 1206 C2* C A 71 23.864 -0.579 13.618 1.00 11.64 C
ATOM 1207 02* C A 71 23.975 -1.513 14.691 1.00 9.30 O
ATOM 1208 Cl* C A 71 25.263 -0.193 13.149 1.00 10.57 C
ATOM 1209 Nl C A 71 25.248 1.161 12.548 1.00 9.67 N
ATOM 1210 C2 C A 71 25.243 2.271 13.429 1.00 10.84 C
ATOM 1211 02 C A 71 25.179 2.052 14.671 1.00 11.11 O
ATOM 1212 N3 C A 71 25.283 3.534 12.917 1.00 8.57 N
ATOM 1213 C4 C A 71 25.271 3.710 11.589 1.00 9.43 C
ATOM 1214 N4 C A 71 25.262 4.948 11.132 1.00 6.44 N
ATOM 1215 C5 C A 71 25.247 2.602 10.671 1.00 6.94 C
ATOM 1216 C6 C A 71 25.240 1.353 11.196 1.00 9.46 C
ATOM 1217 P G A 72 20.758 -1.542 12.563 1.00 13.41 P
ATOM 1218 O1P G A 72 19.842 -2.631 12.899 1.00 12.96 O
ATOM 1219 O2P G A 72 20.611 -0.780 11.263 1.00 11.63 O
ATOM 1220 05* G A 72 20.616 -0.447 13.712 1.00 11.41 O
ATOM 1221 C5* G A 72 20.561 -0.840 15.068 1.00 11.53 C ATOM 1222 C4* G A 72 20.555 0.380 15.949 1.00 11.87 C
ATOM 1223 04* G A 72 21.770 1.139 15.712 1.00 14.46 O
ATOM 1224 C3* G A 72 19.437 1.364 15.661 1.00 14.43 C
ATOM 1225 03* G A 72 18.278 0.935 16.376 1.00 16.44 O
ATOM 1226 C2* G A 72 20.016 2.654 16.228 1.00 13.97 C
ATOM 1227 02* G A 72 19.907 2.695 17.643 1.00 15.21 O
ATOM 1228 Cl* G A 72 21.489 2.520 15.826 1.00 12.65 C
ATOM 1229 N9 G A 72 21.759 3.170 14.550 1.00 12.14 N
ATOM 1230 C8 G A 72 21.856 2.616 13.294 1.00 9.16 C
ATOM 1231 N7 G A 72 22.026 3.517 12.353 1.00 11.34 N
ATOM 1232 C5 G A 72 22.061 4.729 13.040 1.00 10.76 C
ATOM 1233 C6 G A 72 22.202 6.078 12.562 1.00 11.86 C
ATOM 1234 06 G A 72 22.341 6.488 11.391 1.00 8.38 O
ATOM 1235 Nl G A 72 22.159 6.993 13.612 1.00 11.84 N
ATOM 1236 C2 G A 72 21.999 6.663 14.947 1.00 13.77 C
ATOM 1237 N2 G A 72 21.950 7.693 15.815 1.00 9.84 N
ATOM 1238 N3 G A 72 21.887 5.422 15.393 1.00 11.48 N
ATOM 1239 C4 G A 72 21.919 4.522 14.396 1.00 11.62 C
ATOM 1240 P A A 73 16.809 1.286 15.834 1.00 19.26 P
ATOM 1241 O1P A A 73 15.885 0.363 16.510 1.00 22.32 O
ATOM 1242 O2P A A 73 16.775 1.381 14.361 1.00 19.69 O
ATOM 1243 05* A A 73 16.569 2.740 16.433 1.00 17.72 O
ATOM 1244 C5* A A 73 15.307 3.388 16.323 1.00 17.97 C
ATOM 1245 C4* A A 73 15.431 4.809 16.822 1.00 17.32 C
ATOM 1246 04* A A 73 16.427 5.494 16.024 1.00 16.54 O
ATOM 1247 C3* A A 73 14.189 5.661 16.679 1.00 17.30 C
ATOM 1248 03* A A 73 13.363 5.484 17.811 1.00 18.59 O
ATOM 1249 C2* A A 73 14.773 7.060 16.630 1.00 17.35 C
ATOM 1250 02* A A 73 15.216 7.464 17.913 1.00 16.54 O
ATOM 1251 Cl* A A 73 16.012 6.819 15.770 1.00 15.89 C
ATOM 1252 N9 A A 73 15.791 6.944 14.334 1.00 13.91 N
ATOM 1253 C8 A A 73 15.673 5.940 13.408 1.00 13.93 C
ATOM 1254 N7 A A 73 15.535 6.370 12.180 1.00 14.00 N
ATOM 1255 C5 A A 73 15.532 7.754 12.312 1.00 13.79 C
ATOM 1256 C6 A A 73 15.406 8.788 11.378 1.00 15.44 C
ATOM 1257 N6 A A 73 15.244 8.581 10.068 1.00 13.32 N
ATOM 1258 Nl A A 73 15.451 10.062 11.840 1.00 14.22 N
ATOM 1259 C2 A A 73 15.592 10.262 13.158 1.00 14.69 C
ATOM 1260 N3 A A 73 15.706 9.369 14.134 1.00 14.25 N
ATOM 1261 C4 A A 73 15.679 8.119 13.634 1.00 14.11 C
ATOM 1262 P C A 74 11.774 5.640 17.658 1.00 19.05 P
ATOM 1263 O1P C A 74 11.240 5.400 19.022 1.00 21.21 O
ATOM 1264 O2P C A 74 11.309 4.816 16.524 1.00 18.62 O
ATOM 1265 05* C A 74 11.567 7.165 17.261 1.00 17.10 O
ATOM 1266 C5* C A 74 11.953 8.213 18.140 1.00 16.41 C
ATOM 1267 C4* C A 74 11.888 9.522 17.409 1.00 18.55 C
ATOM 1268 04* C A 74 12.836 9.487 16.307 1.00 18.29 O
ATOM 1269 C3* C A 74 10.569 9.816 16.709 1.00 19.36 C
ATOM 1270 03* C A 74 9.611 10.304 17.650 1.00 19.83 O
ATOM 1271 C2* C A 74 11.008 10.851 15.678 1.00 18.77 C
ATOM 1272 02* C A 74 11.287 12.128 16.243 1.00 19.03 O
ATOM 1273 Cl* C A 74 12.342 10.257 15.224 1.00 15.75 C
ATOM 1274 Nl C A 74 12.235 9.391 14.042 1.00 14.65 N
ATOM 1275 C2 C A 74 12.221 9.987 12.789 1.00 15.39 C
ATOM 1276 02 C A 74 12.225 11.231 12.722 1.00 14.36 O
ATOM 1277 N3 C A 74 12.192 9.204 11.677 1.00 14.56 N
ATOM 1278 C4 C A 74 12.165 7.877 11.801 1.00 13.56 C
ATOM 1279 N4 C A 74 12.187 7.143 10.683 1.00 13.38 N
ATOM 1280 C5 C A 74 12.130 7.241 13.078 1.00 15.04 C ATOM 1281 C6 C A 74 12.167 8.030 14.162 1.00 12.74 C
ATOM 1282 P U A 75 8.046 10.090 17.379 1.00 20.97 P
ATOM 1283 OIP U A 75 7.326 10.728 18.515 1.00 20.97 o
ATOM 1284 O2P U A 75 7.770 8.678 17.051 1.00 19.40 o
ATOM 1285 05* U A 75 7.759 10.948 16.067 1.00 19.78 o
ATOM 1286 C5* U A 75 7.940 12.353 16.067 1.00 19.76 C
ATOM 1287 C4* U A 75 7.692 12.920 14.689 1.00 19.28 C
ATOM 1288 04* U A 75 8.781 12.515 13.804 1.00 19.26 o
ATOM 1289 C3* U A 75 6.437 12.422 13.980 1.00 19.85 C
ATOM 1290 03* U A 75 5.251 13.115 14.372 1.00 21.73 o
ATOM 1291 C2* U A 75 6.784 12.675 12.521 1.00 18.07 C
ATOM 1292 02* U A 75 6.648 14.044 12.198 1.00 18.55 o
ATOM 1293 Cl* U A 75 8.269 12.294 12.503 1.00 18.72 C
ATOM 1294 Nl U A 75 8.444 10.876 12.155 1.00 16.03 N
ATOM 1295 C2 U A 75 8.528 10.591 10.821 1.00 14.55 C
ATOM 1296 02 U A 75 8.474 11.463 9.960 1.00 16.34 o
ATOM 1297 N3 U A 75 8.689 9.269 10.515 1.00 14.24 N
ATOM 1298 C4 U A 75 8.798 8.233 11.389 1.00 14.08 C
ATOM 1299 04 U A 75 9.094 7.130 10.954 1.00 15.54 O
ATOM 1300 C5 U A 75 8.693 8.599 12.769 1.00 14.83 C
ATOM 1301 C6 U A 75 8.516 9.890 13.096 1.00 15.06 C
ATOM 1302 P A A 76 3.836 12.353 14.321 1.00 22.14 P
ATOM 1303 OIP A A 76 2.880 13.223 15.033 1.00 23.41 o
ATOM 1304 O2P A A 76 4.009 10.938 14.753 1.00 21.70 o
ATOM 1305 05* A A 76 3.415 12.341 12.788 1.00 21.20 o
ATOM 1306 C5* A A 76 3.300 13.552 12.057 1.00 20.43 C
ATOM 1307 C4* A A 76 2.982 13.261 10.610 1.00 21.53 C
ATOM 1308 04* A A 76 4.176 12.824 9.881 1.00 20.76 o
ATOM 1309 C3* A A 76 1.979 12.147 10.385 1.00 21.60 C
ATOM 1310 03* A A 76 0.644 12.615 10.600 1.00 24.70 o
ATOM 1311 C2* A A 76 2.288 11.751 8.942 1.00 20.85 C
ATOM 1312 02* A A 76 1.813 12.741 8.049 1.00 18.38 o
ATOM 1313 Cl* A A 76 3.818 11.834 8.924 1.00 18.33 C
ATOM 1314 N9 A A 76 4.448 10.557 9.291 1.00 18.89 N
ATOM 1315 C8 A A 76 4.743 10.092 10.543 1.00 18.84 C
ATOM 1316 N7 A A 76 5.292 8.890 10.552 1.00 19.05 N
ATOM 1317 C5 A A 76 5.370 8.549 9.213 1.00 17.84 C
ATOM 1318 C6 A A 76 5.873 7.410 8.545 1.00 18.46 C
ATOM 1319 N6 A A 76 6.430 6.353 9.151 1.00 16.69 N
ATOM 1320 Nl A A 76 5.789 7.395 7.198 1.00 18.58 N
ATOM 1321 C2 A A 76 5.262 8.451 6.577 1.00 18.00 C
ATOM 1322 N3 A A 76 4.779 9.581 7.089 1.00 17.69 N
ATOM 1323 C4 A A 76 4.855 9.566 8.424 1.00 18.36 C
ATOM 1324 P U A 77 -0.490 11.589 11.092 1.00 27.25 P
ATOM 1325 OIP U A 77 -1.718 12.394 11.304 1.00 28.56 O
ATOM 1326 O2P U A 77 0.014 10.721 12.181 1.00 27.58 O
ATOM 1327 05* U A 77 -0.735 10.633 9.847 1.00 26.15 O
ATOM 1328 C5* U A 77 -1.346 11.123 8.674 1.00 27.17 C
ATOM 1329 C4* U A 77 -1.337 10.063 7.612 1.00 28.07 C
ATOM 1330 04* U A 77 0.040 9.755 7.267 1.00 27.42 O
ATOM 1331 C3* U A 77 -1.901 8.720 8.045 1.00 28.00 C
ATOM 1332 03* U A 77 -3.318 8.669 7.947 1.00 29.45 o
ATOM 1333 C2* U A 77 -1.240 7.775 7.055 1.00 28.20 C
ATOM 1334 02* U A 77 -1.847 7.832 5.782 1.00 26.90 o
ATOM 1335 Cl* U A 77 0.157 8.381 6.958 1.00 27.65 C
ATOM 1336 Nl U A 77 1.088 7.745 7.899 1.00 27.42 N
ATOM 1337 C2 U A 77 1.698 6.593 7.466 1.00 27.52 C
ATOM 1338 02 U A 77 1.493 6.121 6.373 1.00 28.51 o
ATOM 1339 N3 U A 77 2.553 6.005 8.360 1.00 28.43 N ATOM 1340 C4 U A 77 2.855 6.437 9.623 1.00 27.99 C
ATOM 1341 04 U A 77 3.638 5.774 10.307 1.00 28.56 o
ATOM 1342 C5 U A 77 2.183 7.651 10.018 1.00 29.06 C
ATOM 1343 C6 U A 77 1.340 8.252 9.153 1.00 27.47 C
ATOM 1344 P G A 78 4.128 7.571 8.789 1.00 29.69 P
ATOM 1345 OIP G A 78 5.574 7, .922 8.682 1.00 29.99 o
ATOM 1346 02P G A 78 3.493 7..464 10.125 1.00 29.83 o
ATOM 1347 05* G A 78 -3 864 6.206 8.025 1.00 28.85 o
ATOM 1348 C5* G A 78 4.288 6.057 6.687 1.00 31.49 C
ATOM 1349 C4* G A 78 3.746 4.786 6.103 1.00 33.11 C
ATOM 1350 04* G A 78 2.290 4.821 6.141 1.00 32.92 o
ATOM 1351 C3* G A 78 4.069 3.537 6.904 1.00 34.56 C
ATOM 1352 03* G A 78 5.382 3.042 6.685 1.00 36.42 o
ATOM 1353 C2* G A 78 -2.993 2.572 6.437 1.00 33.46 C
ATOM 1354 02* G A 78 -3.300 2.058 5.155 1.00 33.17 o
ATOM 1355 Cl* G A 78 -1.788 3.506 6.335 1.00 32.77 C
ATOM 1356 N9 G A 78 0.943 3.484 7.528 1.00 31.48 N
ATOM 1357 C8 G A 78 0.885 4.406 8.552 1.00 30.74 C
ATOM 1358 N7 G A 78 0.017 4.111 9.453 1.00 30.41 N
ATOM 1359 C5 G A 78 0.574 2.921 9.000 1.00 30.92 C
ATOM 1360 CS G A 78 1.605 2.117 9.547 1.00 30.88 C
ATOM 1361 06 G A 78 2.260 2.301 10.569 1.00 31.93 o
ATOM 1362 Nl G A 78 1.852 0.992 8.763 1.00 31.31 N
ATOM 1363 C2 G A 78 1.200 0.683 7.596 1.00 31.42 C
ATOM 1364 N2 G A 78 1.576 -0.452 6.973 1.00 31.76 N
ATOM 1365 N3 G A 78 0.244 1.429 7.073 1.00 31.87 N
ATOM 1366 C4 G A 78 0.017 2.521 7.822 1.00 30.78 C
ATOM 1367 P U A 79 6.043 2.064 7..771 1.00 35.59 P
ATOM 1368 OIP U A 79 -7.437 1..843 7..318 1.00 38.84 o
ATOM 1369 02P U A 79 -5.793 2.598 9.125 1.00 34.83 o
ATOM 1370 05* U A 79 -5.233 0.710 7.607 1.00 37.13 o
ATOM 1371 C5* U A 79 -5.230 0.014 6.370 1.00 40.35 C
ATOM 1372 C4* U A 79 -4.500 -1.284 6.529 1.00 42.63 C
ATOM 1373 04* U A 79 -3 087 -1.022 6.688 1.00 41.91 o
ATOM 1374 C3* U A 79 4.878 -2.028 7.794 1.00 45.01 C
ATOM 1375 03* U A 79 6.059 -2.771 7.564 1.00 49.49 o
ATOM 1376 C2* U A 79 3.652 -2.897 8.039 1.00 42.79 C
ATOM 1377 02* U A 79 3.632 -4.039 7.212 1.00 43.51 o
ATOM 1378 Cl* U A 79 2.530 -1.946 7.614 1.00 41.31 C
ATOM 1379 Nl U A 79 -1 962 -1.177 8.732 1.00 38.39 N
ATOM 1380 C2 U A 79 0.932 -1.753 9.457 1.00 38.66 C
ATOM 1381 02 U A 79 0.492 -2.866 9.215 1.00 38.18 o
ATOM 1382 N3 U A 79 0.438 -0.978 10.475 1.00 36.92 N
ATOM 1383 C4 U A 79 -0 855 0.280 10.830 1.00 36.93 C
ATOM 1384 04 U A 79 0.255 0.884 11.715 1.00 36.60 o
ATOM 1385 C5 U A 79 1.931 0.800 10.037 1.00 36.98 C
ATOM 1386 C6 U A 79 2.432 0.070 9.043 1.00 37.15 C
ATOM 1387 P C A 80 6.888 -3.348 8.809 1.00 52.93 P
ATOM 1388 OIP C A 80 8.178 -3.837 8.240 1.00 53.20 o
ATOM 1389 O2P C A 80 6.901 -2.351 9.924 1.00 51.64 o
ATOM 1390 05* C A 80 6.006 -4.597 9.252 1.00 53.85 o
ATOM 1391 C5* C A 80 6.151 -5.164 10.536 1.00 56.78 C
ATOM 1392 C4* C A 80 4.996 -6.079 10.826 1.00 58.70 C
ATOM 1393 04* C A 80 3.764 -5.451 10.400 1.00 58.63 o
ATOM 1394 C3* C A 80 4.785 -6.404 12.289 1.00 60.66 C
ATOM 1395 03* C A 80 -5 653 -7.443 12.718 1.00 64.54 o
ATOM 1396 C2* C A 80 3.302 -6.748 12.328 1.00 59.49 C
ATOM 1397 02* C A 80 3.012 -8.056 11.868 1.00 59.82 o
ATOM 1398 Cl* C A 80 2.746 -5.710 11.351 1.00 57.90 C ATOM 1399 Nl C A 80 -2.483 -4.432 12.019 1.00 56.16 N
ATOM 1400 C2 C A 80 -1.428 -4.330 12.923 1.00 55.40 C
ATOM 1401 02 C A 80 -0.739 -5.332 13.160 1.00 55.60 O
ATOM 1402 N3 C A 80 -1.184 -3.140 13.516 1.00 54.79 N
ATOM 1403 C4 C A 80 -1.948 -2.085 13.236 1.00 54.67 C
ATOM 1404 N4 C A 80 -1.659 -0.931 13.830 1.00 54.49 N
ATOM 1405 C5 C A 80 -3.038 -2.167 12.329 1.00 55.04 C
ATOM 1406 C6 C A 80 -3.268 -3.348 11.748 1.00 55.31 C
ATOM 1407 P C A 81 -6.452 -7.275 14.101 1.00 67.04 P
ATOM 1408 OIP C A 81 -7.534 -8.288 14.135 1.00 67.56 O
ATOM 1409 O2P C A 81 -6.793 -5.832 14.228 1.00 66.62 O
ATOM 1410 05* C A 81 -5.356 -7.637 15.200 1.00 67.52 O
ATOM 1411 C5* C A 81 -4.468 -8.729 14.996 1.00 68.78 C
ATOM 1412 C4* C A 81 -3.176 -8.504 15.741 1.00 70.00 C
ATOM 1413 04* C A 81 -2.545 -7.295 15.239 1.00 70.26 O
ATOM 1414 C3* C A 81 -3.315 -8.246 17.236 1.00" 70.40 C
ATOM 1415 03* C A 81 -3.607 -9.405 18.044 1.00 71.08 O
ATOM 1416 C2* C A 81 -2.027 -7.493 17.557 1.00 70.66 C
ATOM 1417 02* C A 81 -0.891 -8.331 17.691 1.00 70.46 O
ATOM 1418 Cl* C A 81 -1.866 -6.637 16.299 1.00 70.42 C
ATOM 1419 Nl C A 81 -2.395 -5.267 16.429 1.00 70.24 N
ATOM 1420 C2 C A 81 -1.749 -4.383 17.301 1.00 69.97 C
ATOM 1421 02 C A 81 -0.778 -4.792 17.959 1.00 69.96 O
ATOM 1422 N3 C A 81 -2.199 -3.115 17.406 1.00 69.86 N
ATOM 1423 C4 C A 81 -3.253 -2.719 16.687 1.00 69.98 C
ATOM 1424 N4 C A 81 -3.655 -1.451 16.812 1.00 70.37 N
ATOM 1425 C5 C A 81 -3.940 -3.603 15.805 1.00 69.65 C
ATOM 1426 C6 C A 81 -3.483 -4.857 15.708 1.00 70.02 C
TER 1427 C A 81
HETATM 1428 Nl SPD 95 30.635 15.348 6.786 1.00 32.95 N
HETATM 1429 C2 SPD 95 29.601 15.137 7.638 1.00 34.77 C
HETATM 1430 C3 SPD 95 28.160 14.794 7.150 1.00 32.97 C
HETATM 1431 C4 SPD 95 27.154 15.912 6.737 1.00 35.47 C
HETATM 1432 C5 SPD 95 27.896 17.206 6.216 1.00 34.05 C
HETATM 1433 N6 SPD 95 28.356 18.280 7.154 1.00 35.93 N
HETATM 1434 C7 SPD 95 28.129 18.216 8.696 1.00 37.54 C
HETATM 1435 C8 SPD 95 26.699 18.358 9.285 1.00 36.82 C
HETATM 1436 C9 SPD 95 25.563 18.963 8.358 1.00 39.41 C
HETATM 1437 NlO SPD 95 24.963 18.146 7.432 1.00 38.63 N
HETATM 1438 C ACT 96 35.752 18.872 10.859 1.00 11.75 C
HETATM 1439 O ACT 96 35.666 18.447 12.039 1.00 12.77 O
HETATM 1440 OXT ACT 96 35.855 20.114 10.438 1.00 13.73 O
HETATM 1441 CH3 ACT 96 35.725 17.822 9.762 1.00 10.36 C
HETATM 1442 CO NCO 101 33 . 779 17 . 811 0 . 520 1 . 00 12 . 16
CO
HETATM 1443 Nl NCO 101 35 . 628 18 . 020 1 . 035 1 . 00 11 .26 N
HETATM 1444 N2 NCO 101 31.912 17.605 0.016 1.00 11.81 N
HETATM 1445 N3 NCO 101 34.248 16.189 -0.528 1.00 12.00 N
HETATM 1446 N4 NCO 101 33.306 19.417 1.565 1.00 12.05 N
HETATM 1447 N5 NCO 101 33.488 16.682 2.134 1.00 11.77 N
HETATM 1448 N6 NCO 101 34.079 18.927 -1.063 1.00 10.05 N
HETATM 1449 CO NCO 102 30.972 15.538 21.040 1.00 18.89
CO
HETATM 1450 Nl NCO 102 32.077 13.978 20.540 1.00 18.01 N
HETATM 1451 N2 NCO 102 29.861 17.079 21.546 1.00 20.34 N
HETATM 1452 N3 NCO 102 29.408 14.674 20.281 1.00 18.96 N
HETATM 1453 N4 NCO 102 32.540 16.407 21.793 1.00 19.16 N
HETATM 1454 N5 NCO 102 30.622 14.700 22.802 1.00 21.31 N
HETATM 1455 N6 NCO 102 31.348 16.357 19.288 1.00 18.20 N HETATM 1456 CO NCO 103 30 . 079 10 . 969 13 . 083 1. 00 9.34
CO
HETATM 1457 Nl NCO 103 31.143 9.318 12.937 1.00 10.62 N
HETATM 1458 N2 NCO 103 29.016 12.607 13.231 1.00 10.47 N
HETATM 1459 N3 NCO 103 28.629 9.958 13.953 1.00 9.90 N
HETATM 1460 N4 NCO 103 31.530 11.994 12.228 1.00 11.47 N
HETATM 1461 N5 NCO 103 30.887 11.312 14.848 1.00 10.93 N
HETATM 1462 N6 NCO 103 29.293 10.591 11.335 1.00 8.96 N
HETATM 1463 CO NCO 104 35.738 25.339 6.924 1.00 10.52
CO
HETATM 1464 Nl NCO 104 37.192 23.991 6.966 1.00 9.12 N
HETATM 1465 N2 NCO 104 34.292 26.667 6.917 1.00 9.97 N
HETATM 1466 N3 NCO 104 35.290 24.803 5.079 1.00 10.32 N
HETATM 1467 N4 NCO 104 36.200 25.850 8.782 1.00 10.04 N
HETATM 1468 N5 NCO 104 34.462 23.982 7.610 1.00 10.02 N
HETATM 1469 N6 NCO 104 36.989 26.673 6.255 1.00 9.93 N
HETATM 1470 CO NCO 105 38.319 9.181 18.354 1.00 17.15
CO
HETATM 1471 Nl NCO 105 38.565 7.212 18.263 1.00 16.21 N
HETATM 1472 N2 NCO 105 38.063 11.131 18.455 1.00 16.52 N
HETATM 1473 N3 NCO 105 37.166 9.118 16.756 1.00 15.37 N
HETATM 1474 N4 NCO 105 39.472 9.231 19.943 1.00 15.14 N
HETATM 1475 N5 NCO 105 36.747 8.927 19.488 1.00 13.92 N
HETATM 1476 N6 NCO 105 39.894 9.428 17.221 1.00 16.03 N
HETATM 1477 CO NCO 106 29.797 11.611 -0.683 1.00 29.10
CO
HETATM 1478 Nl NCO 106 30.133 9.996 0.412 1.00 26.22 N
HETATM 1479 N2 NCO 106 29.459 13.224 -1.786 1.00 27.55 N
HETATM 1480 N3 NCO 106 27.947 10.996 -0.979 1.00 27.31 N
HETATM 1481 N4 NCO 106 31.647 12.200 -0.388 1.00 26.94 N
HETATM 1482 N5 NCO 106 29.213 12.598 0.926 1.00 26.78 N
HETATM 1483 N6 NCO 106 30 . 382 10 . 638 -2 . 307 1 . 00 27 . 86 N
HETATM 1484 CO NCO 107 21 . 064 21 . 824 13 . 859 1. 00 66 . 74
CO
HETATM 1485 Nl NCO 107 19 . 817 20 . 309 13 . 966 1 . 00 65 . 15 N
HETATM 1486 N2 NCO 107 22.316 23.350 13.744 1.00 64.75 N
HETATM 1487 N3 NCO 107 20.426 22.219 12.039 1.00 64.82 N
HETATM 1488 N4 NCO 107 21.696 21.423 15.684 1.00 65.00 N
HETATM 1489 N5 NCO 107 19.682 23.014 14.604 1.00 65.37 N
HETATM 1490 N6 NCO 107 22.448 20.636 13.116 1.00 64.84 N
HETATM 1491 CO NCO 108 11.892 21.884 3.085 1.00 43.83
CO
HETATM 1492 Nl NCO 108 12.540 20.757 4.557 1.00 41.58 N
HETATM 1493 N2 NCO 108 11.237 23.007 1.611 1.00 41.85 N
HETATM 1494 N3 NCO 108 10.087 21.837 3.873 1.00 39.63 N
HETATM 1495 N4 NCO 108 13.697 21.930 2.306 1.00 42.07 N
HETATM 1496 N5 NCO 108 12.302 23.497 4.132 1.00 42.42 N
HETATM 1497 N6 NCO 108 11.489 20.271 2.035 1.00 42.59 N
HETATM 1498 CO NCO 109 22.085 1.990 6.248 1.00 22.54
CO
HETATM 1499 Nl NCO 109 23.660 1.327 5.263 1.00 21.69 N
HETATM 1500 N2 NCO 109 20.512 2.641 7.231 1.00 23.54 N
HETATM 1501 N3 NCO 109 21.995 0.289 7.247 1.00 23.66 N
HETATM 1502 N4 NCO 109 22.190 3.682 5.266 1.00 24.99 N
HETATM 1503 N5 NCO 109 23.250 2.740 7.635 1.00 24.19 N
HETATM 1504 N6 NCO 109 20.916 1.236 4.862 1.00 24.71 N
HETATM 1505 CO NCO 110 11.907 4.743 -4.202 1.00 79.10
CO
HETATM 1506 Nl NCO 110 13.089 3.173 -4.157 1.00 78.25 N HETATM 1507 N2 NCO 110 10.722 6.317 -4.246 1.00 77.68 N
HETATM 1508 N3 NCO 110 10.338 3.559 -4.135 1.00 78.03 N
HETATM 1509 N4 NCO 110 13.479 5.919 -4.271 1.00 78.27 N
HETATM 1510 N5 NCO 110 11.939 4.818 -2.231 1.00 77.98 N
HETATM 1511 N6 NCO 110 11.872 4.671 -6.166 1.00 77.97 N
HETATM 1512 CO NCO 111 41.881 27.688 12.825 1.00 41.44
CO
HETATM 1513 Wl NCO 111 42.431 25.789 12.719 1.00 39.32 N
HETATM 1514 N2 NCO 111 41.334 29.578 12.935 1.00 39.83 N
HETATM 1515 N3 NCO 111 40.110 27.135 13.502 1.00 38.72 N
HETATM 1516 N4 NCO 111 43.650 28.247 12.141 1.00 39.72 N
HETATM 1517 N5 NCO 111 42.556 27.780 14.672 1.00 39.67 N
HETATM 1518 N6 NCO 111 41.206 27.586 10.976 1.00 39.64 N
HETATM 1519 CO NCO 112 15.544 -2.470 12.503 0.64 33.78
CO
HETATM 1520 Nl NCO 112 15.053 -4.150 13.387 0.64 33.08 N
HETATM 1521 N2 NCO 112 16.035 -0.799 11.617 0.64 32.88 N
HETATM 1522 N3 NCO 112 13.989 -1.592 13.306 0.64 33.14 N
HETATM 1523 N4 NCO 112 17.106 -3.351 11.708 0.64 31.89 N
HETATM 1524 N5 NCO 112 16.630 -1.966 14.059 0.64 31.87 N
HETATM 1525 N6 NCO 112 14.456 -2.963 10.949 0.64 33.59 N
HETATM 1526 Nl HPA 90 11.850 10.680 9.276 1..00 14.10 N
HETATM 1527 C2 HPA 90 11.445 12.017 9.429 1..00 15.38 C
HETATM 1528 N3 HPA 90 11.023 12.794 8.399 1..00 15.35 N
HETATM 1529 C4 HPA 90 11.032 12.144 7.215 1..00 13.23 C
HETATM 1530 C5 HPA 90 11.464 10.697 6.984 1.00 14.15 C
HETATM 1531 C6 HPA 90 11.894 9.936 8.082 1. 00 14.68 C
HETATM 1532 06 HPA 90 12.274 8.774 8.128 1.00 17.66 O
HETATM 1533 N7 HPA 90 11.319 10.452 5.646 1.00 14.57 N
HETATM 1534 C8 HPA 90 10.837 11.601 5.068 1.00 10.65 C
HETATM 1535 N9 HPA 90 10.654 12.637 5.980 1.00 13.29 N
HETATM 1536 O HOH 301 27.846 -2.938 13.825 1.00 9.01 O
HETATM 1537 O HOH 302 25.859 31.056 15.513 1.00 15.15 O
HETATM 1538 O HOH 304 31.973 24.578 6.159 1.00 8.16 O
HETATM 1539 O HOH 305 26.612 9.170 11.460 1.00 9.38 O
HETATM 1540 O HOH 306 27.263 29.190 17.094 1.00 12.73 O
HETATM 1541 O HOH 307 27.839 31.120 11.251 1.00 8.67 O
HETATM 1542 O HOH 308 15.358 10.206 7.797 1.00 12.64 O
HETATM 1543 O HOH 309 36.106 21.180 7.822 1.00 7.98 O
HETATM 1544 O HOH 310 38.999 22.794 9.431 1.00 11.94 O
HETATM 1545 O HOH 311 27.651 7.462 8.332 1.00 11.92 O
HETATM 1546 O HOH 312 32.982 0.834 9.462 1.00 8.08 O
HETATM 1547 O HOH 313 32.773 15.822 11.031 1.00 11.55 O
HETATM 1548 O HOH 314 28.650 -0.366 14.062 1.00 14.04 O
HETATM 1549 O HOH 315 28.295 12.130 16.348 1.00 14.09 O
HETATM 1550 O HOH 316 35.130 20.473 16.933 1.00 11.57 O
HETATM 1551 O HOH 317 31.081 33.340 12.189 1.00 12.95 O
HETATM 1552 O HOH 318 34.682 19.005 19.405 1.00 15.48 O
HETATM 1553 O HOH 319 33.063 10.330 19.697 1.00 15.28 O
HETATM 1554 O HOH 320 33.163 29.459 16.840 1.00 13.11 O
HETATM 1555 O HOH 321 25.367 21.927 13.760 1.00 16.06 O
HETATM 1556 O HOH 322 37.325 28.769 8.362 1.00 10.37 O
HETATM 1557 O HOH 323 23.452 -4.001 6.017 1.00 18.50 O
HETATM 1558 O HOH 324 9.471 20.115 5.676 1.00 20.63 O
HETATM 1559 O HOH 325 31.413 13.389 17.912 1.00 14.48 O
HETATM 1560 O HOH 326 30.447 21.096 -4.264 1.00 19.06 O
HETATM 1561 O HOH 327 29.636 28.818 15.501 1.00 7.36 O
HETATM 1562 O HOH 328 39.664 24.428 12.189 1.00 15.97 O
HETATM 1563 O HOH 329 31.754 11.671 9.425 1.00 10.07 O HETATM 1564 O HOH 330 31.792 8.455 15.807 00 6.64 O
HETATM 1565 O HOH 331 31.505 23.268 17.911 00 11.89 O
HETATM 1566 O HOH 332 18.453 12.462 12.598 00 17.54 O
HETATM 1567 O HOH 333 18.443 26.704 16.319 00 25.32 O
HETATM 1568 O HOH 334 32.785 19.986 8.347 00 15.97 O
HETATM 1569 O HOH 335 41.786 5.546 14.867 00 15.87 O
HETATM 1570 O HOH 336 7.409 12.505 5.270 00 17.89 O
HETATM 1571 O HOH 337 21.882 4.512 9.287 00 19.12 O
HETATM 1572 O HOH 338 31.187 10.298 17.815 00 16.86 O
HETATM 1573 O HOH 339 26.840 32.305 5.286 00 19.04 O
HETATM 1574 O HOH 340 15.679 12.782 11.026 00 14.61 O
HETATM 1575 O HOH 341 25.408 5.517 8.271 00 14.76 O
HETATM 1576 O HOH 342 30.043 11.328 7.160 00 8.77 O
HETATM 1577 O HOH 343 23.127 14.905 8.144 00 14.88 O
HETATM 1578 O HOH 344 38.365 6.804 15.177 00 18.36 O
HETATM 1579 O HOH 345 35.022 29.298 5.821 00 14.35 O
HETATM 1580 O HOH 346 30.511 24.456 3.612 00 14.15 O
HETATM 1581 O HOH 347 30.481 34.557 3.130 00 11.78 O
HETATM 1582 O HOH 348 26.554 9.913 5.375 00 18.81 O
HETATM 1583 O HOH 349 31.209 21.766 3.243 00 18.82 O
HETATM 1584 O HOH 350 1.741 13.549 1.745 00 20.27 O
HETATM 1585 O HOH 351 29.321 2.066 1.345 00 21.19 O
HETATM 1586 O HOH 352 4.877 8.231 0.914 00 21.48 O
HETATM 1587 O HOH 353 21.713 1.577 10.203 00 20.65 O
HETATM 1588 O HOH 354 16.006 7.690 6.859 00 14.39 o
HETATM 1589 O HOH 355 32.374 14.264 8.770 00 13.50 o
HETATM 1590 O HOH 356 44.956 7.126 13.453 00 18.02 o
HETATM 1591 O HOH 357 34.420 0.993 5.499 00 15.80 o
HETATM 1592 O HOH 358 34.107 10.687 13.188 00 21.21 o
HETATM 1593 O HOH 359 43.241 16.647 21.460 00 20.68 o
HETATM 1594 O HOH 360 29.177 23.123 7.231 00 19.78 o
HETATM 1595 O HOH 361 20.913 10.719 -2.240 00 38.20 o
HETATM 1596 O HOH 362 17.211 12.442 7.676 00 12.64 o
HETATM 1597 O HOH 363 23.238 7.190 7.238 00 16.38 o
HETATM 1598 O HOH 364 23.098 23.481 10.458 00 16.85 o
HETATM 1599 O HOH 365 19.273 4.003 10.474 00 14.55 o
HETATM 1600 O HOH 366 29.119 2.067 20.617 00 16.11 o
HETATM 1601 O HOH 367 13.755 11.592 -2.619 00 19.43 o
HETATM 1602 O HOH 368 27.008 15.371 13.742 00 16.03 o
HETATM 1603 O HOH 370 40.117 15.541 24.855 00 17.75 o
HETATM 1604 O HOH 371 42.204 24.722 15.562 00 20.80 o
HETATM 1605 O HOH 372 33.334 27.064 4.009 00 14.27 o
HETATM 1606 O HOH 373 34.369 22.764 -0.160 00 16.16 o
HETATM 1607 O HOH 374 26.150 9.913 8.539 00 17.96 o
HETATM 1608 O HOH 375 40.783 11.103 6.916 00 14.78 o
HETATM 1609 O HOH 376 35.928 21.012 1.466 00 26.24 o
HETATM 1610 O HOH 377 16.646 10.599 -2.661 00 19.58 o
HETATM 1611 O HOH 378 25.562 13.304 6.313 00 18.69 o
HETATM 1612 O HOH 379 20.343 -1.589 8.728 00 18.93 o
HETATM 1613 O HOH 380 17.478 4.521 8.063 00 17.20 o
HETATM 1614 O HOH 381 37.683 27.819 10.864 00 19.86 o
HETATM 1615 O HOH 382 50.069 8.354 17.181 00 25.65 o
HETATM 1616 O HOH 383 22.215 19.277 -1.414 00 21.60 o
HETATM 1617 O HOH 384 26.899 12.793 23.127 00 21.66 o
HETATM 1618 O HOH 385 27.310 20.284 5..282 00 23.43 o
HETATM 1619 O HOH 386 35.477 14.521 1. ,832 00 16.09 o
HETATM 1620 O HOH 387 36.421 5.374 5.731 00 18.50 o
HETATM 1621 O HOH 388 35.657 13.089 21.646 00 27.43 o
HETATM 1622 O HOH 389 31.262 -0.028 15.535 1.00 26.71 o HETATM 1623 O HOH 390 12.258 13.376 14.333 1.00 23.39 O
HETATM 1624 O HOH 391 41.547 24.260 9.781 1.00 18.53 O
HETATM 1625 O HOH 392 45.074 23.301 12.058 1.00 34.28 o
HETATM 1626 O HOH 393 26.074 11.518 13.046 1.00 13.39 o
HETATM 1627 O HOH 395 5.587 8.611 15.039 1.00 30.05 o
HETATM 1628 O HOH 396 16.781 -2.191 17.148 1.00 29.65 o
HETATM 1629 O HOH 398 39.000 -1.725 12.841 1.00 27.22 o
HETATM 1630 O HOH 399 45.596 20.807 13.713 1.00 22.64 o
HETATM 1631 O HOH 400 39.024 5.351 6.718 1.00 27.45 o
HETATM 1632 O HOH 401 19.128 6.103 17.172 1.00 23.23 o
HETATM 1633 O HOH 402 -2.414 15.769 -5.977 1.00 28.56 o
HETATM 1634 O HOH 404 35.863 1.066 10.034 1.00 21.22 o
HETATM 1635 O HOH 405 45.781 6.088 10.831 1.00 32.51 o
HETATM 1636 O HOH 406 20.680 25.716 18.205 1.00 23.14 o
HETATM 1637 O HOH 407 26.449 23.755 -4.077 1.00 19.44 o
HETATM 1638 O HOH 408 13.404 6.712 6.444 1.00 20.71 o
HETATM 1639 O HOH 409 33.912 22.171 4.658 1.00 18.92 o
HETATM 1640 O HOH 410 39.629 15.715 13.211 1.00 93.76 o
HETATM 1641 O HOH 411 35.297 11.656 18.352 1.00 21.88 o
HETATM 1642 O HOH 412 10.954 4.182 2.008 1.00 23.12 o
HETATM 1643 O HOH 413 28.947 21.113 11.642 1.00 22.81 o
HETATM 1644 O HOH 414 5.170 7.273 12.633 1.00 27.36 o
HETATM 1645 O HOH 415 28.471 16.583 18.095 1.00 22.56 o
HETATM 1646 O HOH 416 4.654 4.945 23.880 1.00 52.57 o
HETATM 1647 O HOH 419 38.860 4.458 16.740 1.00 33.74 o
HETATM 1648 O HOH 420 -2.549 7.590 -3.638 1.00 24.47 o
HΞTATM 1649 O HOH 421 45.977 14.324 21.673 1.00 21.22 o
HETATM 1650 O HOH 422 21.196 23.126 8.094 1.00 24.30 o
HETATM 1651 O HOH 423 24.720 30.789 3.238 1.00 17.59 o
HETATM 1652 O HOH 424 25.564 3.138 6.622 1.00 21.50 o
HETATM 1653 O HOH 425 34.235 23.186 16.786 1.00 20.56 o
HETATM 1654 O HOH 426 26.555 15.848 20.797 1.00 34.81 o
HETATM 1655 O HOH 427 16.137 10.314 16.975 1.00 35.39 o
HETATM 1656 O HOH 428 15.237 5.341 9.720 1.00 26.74 o
HETATM 1657 O HOH 429 5.843 16.504 13.479 1.00 20.66 o
HETATM 1658 O HOH 430 34.267 35.258 2.809 1.00 18.39 o
HETATM 1659 O HOH 431 34.478 8.946 16.706 1.00 37.42 o
HETATM 1660 O HOH 432 23.052 6.563 4.267 1.00 23.73 o
HETATM 1661 O HOH 433 24.461 21.831 8.001 1.00 19.72 o
HETATM 1662 O HOH 434 16.811 5.911 -0.991 1.00 39.78 o
HETATM 1663 O HOH 435 18.936 8.756 17.266 1.00 16.77 o
HETATM 1664 O HOH 436 17.061 32.451 9.214 1.00 32.73 o
HETATM 1665 O HOH 437 25.185 15.727 10.155 1.00 23.94 o
HETATM 1666 O HOH 438 36.819 34.970 6.044 1.00 23.13 o
HETATM 1667 O HOH 439 36.371 2.956 7.585 1.00 22.07 o
HETATM 1668 O HOH 440 7.686 18.108 -0.709 1.00 31.93 o
HETATM 1669 O HOH 442 28.154 5.986 5.626 1.00 34.66 o
HETATM 1670 O HOH 443 43.765 7.991 3.710 1.00 29.75 o
HΞTATM 1671 O HOH 444 34.138 11.979 15.697 1.00 19.26 o
HETATM 1672 O HOH 445 16.154 2.876 6.317 1.00 24.75 o
HETATM 1673 O HOH 446 3.198 15.740 15.476 1.00 47.56 o
HETATM 1674 O HOH 447 16.359 21.320 4.320 1.00 27.58 o
HΞTATM 1675 O HOH 448 43.253 8.885 16.567 1.00 16.92 o
HETATM 1676 O HOH 449 35.697 6.317 18.291 1.00 34.21 o
HETATM 1677 O HOH 450 26.118 7.952 1.424 1.00 47.42 o
HETATM 1678 O HOH 451 11.782 19.566 7.278 1.00 31.23 o
HΞTATM 1679 O HOH 452 13.180 12.084 -5.228 1.00 27.14 o
HΞTATM 1680 O HOH 454 37.285 10.836 3.417 1.00 29.24 o
HΞTATM 1681 O HOH- 456 49.054 14.720 18.870 1.00 34.45 o HETATM 1682 O HOH 457 29.111 15.812 11.618 1.00 40.55 o
HETATM 1683 O HOH 458 31.663 19.001 19.522 1.00 17.15 o
HETATM 1684 O HOH 459 39.304 19.991 21.460 1.00 29.51 o
HETATM 1685 O HOH 460 6.990 6.202 -4.001 1.00 40.02 o
HETATM 1686 O HOH 461 8.794 6.163 15.456 1.00 31.20 o
HETATM 1687 O HOH 462 31.882 26.863 16.881 1.00 25.16 o
HETATM 1688 O HOH 463 26.477 -0.242 3.289 1.00 24.32 o
HETATM 1689 O HOH 464 -3.607 9.559 4.991 1.00 25.17 o
HETATM 1690 O HOH 465 3.250 20.726 2.705 1.00 41.31 o
HETATM 1691 O HOH 466 22.033 33.298 4.411 1.00 41.90 o
HETATM 1692 O HOH 467 46.878 8.195 18.105 1.00 39.35 o
HETATM 1693 O HOH 468 39.497 14.637 4.088 1.00 29.50 o
HETATM 1694 O HOH 469 9.001 4.669 12.208 1.00 33.99 o
HETATM 1695 O HOH 470 -0.445 6.689 3.926 1.00 27.30 o
HETATM 1696 O HOH 471 17.189 28.967 12.756 1.00 24.27 o
HETATM 1697 O HOH 472 -5.836 -10.333 16.980 1.00 45.19 o
HETATM 1698 O HOH 473 45.688 15.274 4.933 1.00 26.44 o
HETATM 1699 O HOH 474 24.616 0.817 16.978 1.00 26.14 o
HETATM 1700 O HOH 475 25.860 28.821 0.421 1.00 30.41 o
HETATM 1701 O HOH 476 18.624 3.432 13.266 1.00 19.31 o
HETATM 1702 O HOH 478 17.089 26.985 10.853 1.00 34.08 o
HETATM 1703 O HOH 479 15.210 30.506 6.340 1.00 38.53 o
HETATM 1704 O HOH 480 36.478 33.125 3.386 1.00 43.68 o
HETATM 1705 O HOH 481 9.899 3.051 8.224 1.00 36.06 o
HETATM 1706 O HOH 482 18.088 0.524 11.711 1.00 38.00 o
HETATM 1707 O HOH 483 4.856 11.207 18.945 1.00 35.35 o
HETATM 1708 O HOH 484 31.875 14.345 0.745 1.00 27.67 o
HETATM 1709 O HOH 485 42.954 25.586 19.172 1.00 34.55 o
HETATM 1710 O HOH 486 19.707 24.093 -3.819 1.00 36.45 o
HETATM 1711 O HOH 487 21.776 5.844 18.169 1.00 31.67 o
HETATM 1712 O HOH 488 26.072 21.572 10.831 1.00 26.77 o
HETATM 1713 O HOH 489 0.375 5.438 11.888 1.00 36.91 o
HETATM 1714 O HOH 490 28.879 15.319 3.965 1.00 63.69 o
HETATM 1715 O HOH 491 39.794 17.388 1.508 1.00 40.63 o
HETATM 1716 O HOH 492 40.065 0.717 19.459 1.00 39.86 o
HETATM 1717 O HOH 493 49.237 16.037 4.776 1.00 50.04 o
HETATM 1718 O HOH 494 6.602 16.058 -4.236 1.00 32.86 o
HETATM 1719 O HOH 495 14.502 4.269 -0.179 1.00 37.62 o
HETATM 1720 O HOH 496 15.468 20.981 14.363 1.00 39.44 o
HETATM 1721 O HOH 497 24.964 16.434 -3.116 1.00 23.19 o
HETATM 1722 O HOH 498 40.994 29.185 18.880 1.00 37.92 o
HETATM 1723 O HOH 499 26.306 17.999 12.975 1.00 43.45 o
HETATM 1724 O HOH 500 -1.452 -5.851 6.853 1.00 35.82 o
HETATM 1725 O HOH 501 10.897 22.806 6.598 1.00 56.17 o
HETATM 1726 O HOH 502 35.628 12.681 13.339 1.00 92.79 o
HETATM 1727 O HOH 503 56.988 18.311 10.058 1.00 60.09 o
HETATM 1728 O HOH 504 42.131 21.097 23.339 1.00 45.34 o
HETATM 1729 O HOH 505 47.093 7.981 15.216 1.00 42.30 o
HETATM 1730 O HOH 506 10.635 18.180 -1.367 1.00 28.88 o
HETATM 1731 O HOH 507 25.312 18.277 3.464 1.00 36.67 o
HETATM 1732 O HOH 508 53.997 11.193 14.846 1.00 48.50 o
HETATM 1733 O HOH 509 17.232 19.642 -6.737 1.00 39.43 o
HETATM 1734 O HOH 511 48.389 18.219 19.003 1.00 37.53 o
HETATM 1735 O HOH 512 36.441 5.846 21.503 1.00 17.36 o
HETATM 1736 O HOH 513 27.771 19.877 -5.622 1.00 28.79 o
HETATM 1737 O HOH 514 49.143 9.733 9.698 1.00 44.63 o
HETATM 1738 O HOH 515 22.020 29.095 0.023 1.00 38.04 o
HETATM 1739 O HOH 516 10.150 18.223 -5.419 1.00 45.55 o
HETATM 1740 O HOH 518 42.433 18.492 3.496 1.00 20.34 o HETATM 1741 O HOH 519 12.974 -6.995 9.968 1.00 55.96 o
HETATM 1742 O HOH 520 41.002 30.190 16.057 1.00 46.34 o
HETATM 1743 O HOH 521 28.471 5.080 1.598 1.00 38.47 o
HETATM 1744 O HOH 522 1.319 16.118 8.887 1.00 40.75 o
HETATM 1745 O HOH 524 47.980 24.698 12.703 1.00 59.46 o
HETATM 1746 O HOH 525 2.015 17.058 12.074 1.00 47.46 o
HETATM 1747 O HOH 526 34.792 8.932 23.014 1.00 17.72 o
HETATM 1748 O HOH 527 -4.125 3.139 11.120 1.00 35.85 o
HETATM 1749 O HOH 528 14.446 21.653 8.904 1.00 29.07 o
HETATM 1750 O HOH 529 19.539 24.993 11.728 1.00 32.07 o
HETATM 1751 O HOH 530 15.285 2.359 12.400 1.00 45.20 o
HETATM 1752 O HOH 531 35.293 32.877 16.427 1.00 41.71 o
HETATM 1753 O HOH 532 31.338 11.703 22.624 1.00 30.38 o
HETATM 1754 O HOH 533 10.807 3.899 5.533 1.00 31.89 o
HETATM 1755 O HOH 534 23.897 23.200 -2.832 1.00 31.97 o
HETATM 1756 O HOH 535 4.746 5.239 -2.472 1.00 30.20 o
HETATM 1757 O HOH 536 24.434 26.832 -2.017 1.00 31.20 o
HETATM 1758 O HOH 537 43.950 0.544 13.447 1.00 41.33 o
HETATM 1759 O HOH 538 23.116 8.555 0.668 1.00 35.22 o
HETATM 1760 O HOH 539 9.978 15.068 14.401 1.00 30.79 o
HETATM 1761 O HOH 540 2.001 9.284 12.960 1.00 43.03 o
HETATM 1762 O HOH 541 3.144 3.592 12.702 1.00 57.96 o
HETATM 1763 O HOH 542 9.889 9.611 -3.909 1.00 31.84 o
HETATM 1764 O HOH 543 47.304 17.648 6.312 1.00 33.07 o
HETATM 1765 O HOH 544 4.895 -8.536 10.635 1.00 53.30 o
HETATM 1766 O HOH 546 39.433 25.709 8.086 1.00 18.28 o
HETATM 1767 O HOH 547 30.374 -4.058 14.167 1.00 15.55 o
HETATM 1768 O HOH 548 46.029 15.729 24.092 1.00 25.93 o
HETATM 1769 O HOH 549 40.466 -1.095 17.221 1.00 30.08 o
HETATM 1770 O HOH 550 24.133 0.727 19.719 1.00 26.39 o
HETATM 1771 O HOH 551 3.446 9.427 -2.135 1.00 31.76 o
HETATM 1772 O HOH 552 19.319 13.000 -2.575 1.00 41.76 o
HETATM 1773 O HOH 553 31.891 31.580 18.610 1.00 24.77 o
HETATM 1774 O HOH 554 18.491 2.334 4.660 1.00 22.10 o
HETATM 1775 O HOH 555 1.314 11.279 0.038 1.00 26.52 o
HETATM 1776 O HOH 556 28.146 13.256 10.339 1.00 21.95 o
HETATM 1777 O HOH 557 30.223 20.804 5.827 1.00 36.82 o
HETATM 1778 O HOH 558 17.905 13.328 15.220 1.00 20.71 o
HETATM 1779 O HOH 559 29.909 -0.095 18.799 1.00 21.94 o
HETATM 1780 O HOH 560 31.631 17.664 9.008 1.00 19.67 o
HETATM 1781 O HOH 562 24.042 20.954 19.484 1.00 31.11 o
HETATM 1782 O HOH 563 37.392 16.228 -0.497 1.00 26.68 o
HETATM 1783 O HOH 564 -1.946 8.648 -0.841 1.00 39.16 o
HETATM 1784 O HOH 565 47.915 9.279 22.936 1.00 35.60 o
HETATM 1785 O HOH 566 25.748 7.266 4.529 1.00 28.82 o
HETATM 1786 O HOH 567 21.116 14.888 -2.109 1.00 36.75 o
HETATM 1787 O HOH 568 23.715 20.126 5.645 1.00 40.35 o
HETATM 1788 O HOH 569 33.531 -0.771 13.511 1.00 45.31 o
HETATM 1789 O HOH 570 20.908 18.749 15.697 1.00 34.23 o
HETATM 1790 O HOH 571 39.015 32.529 4 . 501 1.00 54.73 o
HETATM 1791 O HOH 572 41.051 0.455 7 . 130 1.00 54.62 o
HETATM 1792 O HOH 573 30.698 17.740 5 . 106 1.00 35.84 o
HETATM 1793 O HOH 574 6.859 5.063 17 . 712 1.00 37.55 o
HETATM 1794 O HOH 575 39.059 8.657 3.005 1.00 59.67 o
HETATM 1795 O HOH 576 38.160 32.276 16.708 1.00 37.88 o
HETATM 1796 O HOH 577 20.383 21.318 -4.691 1.00 39.17 o
HETATM 1797 O HOH 578 41.157 6.777 17.217 1.00 35.75 o
HETATM 1798 O HOH 579 27.327 21.257 8.150 1.00 42.39 o
HETATM 1799 O HOH 580 34.592 24.586 1.886 1.00 30.17 o HETATM 1800 O HOH 581 28.618 8.077 -0.712 1.00 54.91 o
HETATM 1801 O HOH 582 30.108 20.873 8.937 1.00 56.50 o
HETATM 1802 O HOH 583 29.146 8.970 6.121 1.00 38.73 o
HETATM 1803 O HOH 584 51.666 17.680 15.607 1.00 47.50 o
HETATM 1804 O HOH 585 14.071 22.825 6.240 1.00 47.08 o
HETATM 1805 O HOH 586 43.840 6.576 20.454 1.00 36.44 o
HETATM 1806 O HOH 587 42.165 8.711 19.272 1.00 37.83 o
HETATM 1807 O HOH 588 38.236 13.980 1.049 1.00 39.71 o
HETATM 1808 O HOH 589 45.548 22.632 16.215 1.00 37.00 o
HETATM 1809 O HOH 590 30.906 16.821 2.292 1.00 76.55 o
HETATM 1810 O HOH 591 11.602 22.880 -3.030 1.00 36.77 o
HETATM 1811 O HOH 592 12.805 8.355 -4.098 1.00 49.06 o
HETATM 1812 O HOH 593 -2.315 6.894 1.774 1.00 37.69 o
HETATM 1813 O HOH 594 23.656 18.274 20.007 1.00 26.98 o
HETATM 1814 O HOH 595 10.845 12.955 18.898 1.00 29.83 o
HETATM 1815 O HOH 596 18.262 4.377 19.245 1.00 39.17 o
HETATM 1816 O HOH 597 49.235 6.180 11.692 1.00 70.41 o
HETATM 1817 O HOH 598 14.400 -3.218 15.489 1.00 41.28 o
HETATM 1818 O HOH 599 24.049 25.032 20.146 1.00 33.82 o
HETATM 1819 O HOH 600 23.974 20.498 -3.811 1.00 44.21 o
HETATM 1820 O HOH 601 39.360 6.106 20.658 1.00 34.40 o
HETATM 1821 O HOH 602 31.294 32.672 16.263 1.00 25.66 o
HETATM 1822 O HOH 603 44.234 4.286 13.888 1.00 41.49 o
HETATM 1823 O HOH 604 21.106 8.490 19.650 1.00 66.45 o
HETATM 1824 O HOH 605 28.263 15.750 23.071 1.00 67.72 o
HETATM 1825 O HOH 606 17.345 24.004 15.476 1.00 73.54 o
HETATM 1826 O HOH 607 28.369 11.841 -4.260 1.00 38.80 o
HETATM 1827 O HOH 608 18.542 3.496 0.954 1.00 48.61 o
HETATM 1828 O HOH 609 -6.831 -8.495 19.160 1.00 46.57 o
HETATM 1829 O HOH 610 22.075 20.486 9.491 1.00 58.37 o
HETATM 1830 O HOH 611 19.748 7.055 -2.211 1.00 48.25 o
HETATM 1831 O HOH 612 44.625 20.192 21.807 1.00 80.14 o
HETATM 1832 O HOH 613 38.559 9.253 5.778 1.00 32.81 o
HETATM 1833 O HOH 614 18.465 0.369 7.409 1.00 51.65 o
HETATM 1834 O HOH 615 37.639 9.337 22.161 1.00 48.87 o
HETATM 1835 O HOH 616 16.809 1.980 9.496 1.00 72.40 o
HETATM 1836 O HOH 617 38.304 20.108 24.081 1.00 25.44 o
HETATM 1837 O HOH 618 19.482 11.517 17.092 1.00 70.05 o
HETATM 1838 O HOH 619 15.255 -5.143 10.342 1.00 36.51 o
HETATM 1839 O HOH 620 29.264 10.712 2.893 1.00 32.87 o
HETATM 1840 O HOH 621 15.822 7.785 -3.226 1.00 43.27 o
HETATM 1841 O HOH 622 23.813 11.730 -2.619 1.00 36.87 o
HETATM 1842 O HOH 623 -3.594 5.850 12.136 1.00 66.99 o
HETATM 1843 O HOH 624 18.735 22.790 5.719 1.00 40.97 O
HETATM 1844 O HOH 625 55.575 16.141 8.287 1.00 49.74 o
HETATM 1845 O HOH 626 53.255 10.141 17.681 1.00 68.52 o
HETATM 1846 O HOH 627 -1.530 -0.489 4.060 1.00 49.32 o
HETATM 1847 O HOH 628 47.166 12.933 3.227 1.00 42.68 o
HETATM 1848 O HOH 629 51.745 12.846 4.265 1.00 32.94 o
HETATM 1849 O HOH 630 8.335 13.886 -5.452 1.00 42.98 o
HETATM 1850 O HOH 631 1.415 18.892 4.450 1.00 52.27 o
HETATM 1851 O HOH 632 8.813 4.607 -6.147 1.00 61.80 o
HETATM 1852 O HOH 633 14.358 18.909 -7.627 1.00 59.54 o
HETATM 1853 O HOH 634 42.560 12.882 4.673 1.00 46.06 o
HETATM 1854 O HOH 635 13.134 -0.048 6.444 1.00 49.92 o
HETATM 1855 O HOH 636 8.623 23.116 -0.236 1.00 51.89 o
HETATM 1856 O HOH 637 19.455 18.632 6.820 1.00 29.08 o
HETATM 1857 O HOH 638 0.328 4.658 14.591 1.00 62.40 o
HETATM 1858 O HOH 639 49.562 9.193 6.008 1.00 39.48 o HETATM 1859 0 HOH 640 4.000 -2.106 -0.040 1.00 57.30 0
HETATM 1860 O HOH 641 16 .678 23 .799 11 .363 1. 00 65 .19 0
HETATM 1861 0 HOH 642 8 .358 0 .710 9 .304 1. 00 43 .70 0
HETATM 1862 O HOH 643 41 .615 2 .908 17 .876 1. 00 52 .92 0
HETATM 1863 0 HOH 644 29 .021 19 .152 2 .611 1. 00 26 .91 0
HETATM 1864 0 HOH 645 44 .197 15 .580 2 .167 1. 00 76 .22 0
HETATM 1865 0 HOH 646 22 .571 6 .260 -1 .309 1. 00 50 .65 0
HETATM 1866 O HOH 647 1 .473 1 .753 3 .469 1. 00 40 .93 0
HETATM 1867 0 HOH 648 46 .959 7 .034 20 .908 1. 00 37 .66 0
HETATM 1868 0 HOH 649 24 .330 17 .915 17 .109 1. 00 53 .90 0
CONECT 1428 1429
CONECT 1429 1428 1430
CONECT 1430 1429 1431
CONECT 1431 1430 1432
CONECT 1432 1431 1433
CONECT 1433 1432 1434
CONECT 1434 1433 1435
CONECT 1435 1434 1436
CONECT 1436 1435 1437
CONECT 1437 1436
CONECT 1438 1439 1440 1441
CONECT 1439 1438
CONECT 1440 1438
CONECT 1441 1438
CONECT 1442 1443 1444 1445 1446
CONECT 1442 1447 1448
CONECT 1443 1442
CONECT 1444 1442
CONECT 1445 1442
CONECT 1446 1442
CONECT 1447 1442
CONECT 1448 1442
CONECT 1449 1450 1451 1452 1453
CONECT 1449 1454 1455
CONECT 1450 1449
CONECT 1451 1449
CONECT 1452 1449
CONECT 1453 1449
CONECT 1454 1449
CONECT 1455 1449
CONECT 1456 1457 1458 1459 1460
CONECT 1456 1461 1462
CONECT 1457 1456
CONECT 1458 1456
CONECT 1459 1456
CONECT 1460 1456
CONECT 1461 1456
CONECT 1462 1456
CONECT 1463 1464 1465 1466 1467
CONECT 1463 1468 1469
CONECT 1464 1463
CONECT 1465 1463
CONECT 1466 1463
CONECT 1467 1463
CONECT 1468 1463
CONECT 1469 1463
CONECT 1470 1471 1472 1473 1474
CONECT 1470 1475 1476
CONECT 1471 1470 CONECT 1472 1470
CONECT 1473 1470
CONECT 1474 1470
CONECT 1475 1470
CONECT 1476 1470
CONECT 1477 1478 1479 1480 1481
CONECT 1477 1482 1483
CONECT 1478 1477
CONECT 1479 1477
CONECT 1480 1477
CONECT 1481 1477
CONECT 1482 1477
CONECT 1483 1477
CONECT 1484 1485 1486 1487 1488
CONECT 1484 1489 1490
CONECT 1485 1484
CONECT 1486 1484
CONECT 1487 1484
CONECT 1488 1484
CONECT 1489 1484
CONECT 1490 1484
CONECT 1491 1492 1493 1494 1495
CONECT 1491 1496 1497
CONECT 1492 1491
CONECT 1493 1491
CONECT 1494 1491
CONECT 1495 1491
CONECT 1496 1491
CONECT 1497 1491
CONECT 1498 1499 1500 1501 1502
CONECT 1498 1503 1504
CONECT 1499 1498
CONECT 1500 1498
CONECT 1501 1498
CONECT 1502 1498
CONECT 1503 1498
CONECT 1504 1498
CONECT 1505 1506 1507 1508 1509
CONECT 1505 1510 1511
CONECT 1506 1505
CONECT 1507 1505
CONECT 1508 1505
CONECT 1509 1505
CONECT 1510 1505
CONECT 1511 1505
CONECT 1512 1513 1514 1515 1516
CONECT 1512 1517 1518
CONECT 1513 1512
CONECT 1514 1512
CONECT 1515 1512
CONECT 1516 1512
CONECT 1517 1512
CONECT 1518 1512
CONECT 1519 1520 1521 1522 1523
CONECT 1519 1524 1525
CONECT 1520 1519
CONECT 1521 1519
CONECT 1522 1519
CONECT 1523 1519 CONECT 1524 1519
CONECT 1525 1519
CONECT 1526 1527 1531
CONECT 1527 1526 1528
CONECT 1528 1527 1529
CONECT 1529 1528 1530 1535
CONECT 1530 1529 1531 1533
CONECT 1531 1526 1530 1532
CONECT 1532 1531
CONECT 1533 1530 1534
CONECT 1534 1533 1535
CONECT 1535 1529 1534
MASTER 249 0 15 6 1867 1 120
END
Table 7. Thermodynamic parameters associated with various nucleobases bound to the adenine riboswitch KNA (AR)
Figure imgf000173_0001
All experiments were done in 5OmM K HEPES, pH 7.5, 10OmM KCl, 1OmM MgCl2 at 300C. n.d. designates reactions where no detectable binding was indicated. * The heat capacity (ΔCp) for this reaction is -1.10 kcal mol"1 K"1, as measured between 298 and 323 K. Table 8. Thermodynamic parameters associated with various nucleobases bound to the guanine riboswitch RNA (GR)
Figure imgf000174_0001
All experiments were done in 5OmM K HEPES, 10OmM KCI, 1OmM MgCl2 at 3O0C. n.d. designates reactions where no detectable binding was indicated.
* Software was not able to fit data and report parameters unless n was held at 1.
It is understood that the disclosed method and compositions are not limited to the particular methodology, protocols, and reagents described as these may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims.
It must be noted that as used herein and in the appended claims, the singular forms "a ", "an", and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a riboswitch" includes a plurality of such riboswitches, reference to "the riboswitch" is a reference to one or more riboswitches and equivalents thereof known to those skilled in the art, and so forth.
"Optional" or "optionally" means that the subsequently described event, circumstance, or material may or may not occur or be present, and that the description includes instances where the event, circumstance, or material occurs or is present and instances where it does not occur or is not present.
Ranges may be expressed herein as from "about" one particular value, and/or to "about" another particular value. When such a range is expressed, also specifically contemplated and considered disclosed is the range from the one particular value and/or to the other particular value unless the context specifically indicates otherwise. Similarly, when values are expressed as approximations, by use of the antecedent "about," it will be understood that the particular value forms another, specifically contemplated embodiment that should be considered disclosed unless the context specifically indicates otherwise. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint unless the context specifically indicates otherwise. Finally, it should be understood that all of the individual values and sub-ranges of values contained within an explicitly disclosed range are also specifically contemplated and should be considered disclosed unless the context specifically indicates otherwise. The foregoing applies regardless of whether in particular cases some or all of these embodiments are explicitly disclosed.
Unless defined otherwise, all technical and scientific terms used herein have the same meanings as commonly understood by one of skill in the art to which the disclosed method and compositions belong. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present method and compositions, the particularly useful methods, devices, and materials are as described. Publications cited herein and the material for which they are cited are hereby specifically incorporated by reference. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such disclosure by virtue of prior invention. No admission is made that any reference constitutes prior art. The discussion of references states what their authors assert, and applicants reserve the right to challenge the accuracy and pertinency of the cited documents. It will be clearly understood that, although a number of publications are referred to herein, such reference does not constitute an admission that any of these documents forms part of the common general knowledge in the art.
Throughout the description and claims of this specification, the word "comprise" and variations of the word, such as "comprising" and "comprises," means "including but not limited to," and is not intended to exclude, for example, other additives, components, integers or steps.
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the method and compositions described herein. Such equivalents are intended to be encompassed by the following claims.

Claims

CLAIMSWe claim:
1. The atomic structure of a natural guanine-responsive riboswitch comprising an atomic structure as depicted in Figure 6.
2. The atomic structure of claim 1, wherein the atomic coordinates of the atomic structure comprise the atomic coordinates listed in Table 6 for atoms depicted in Figure 6.
3. The atomic structure of claim 1, wherein the atomic coordinates of the atomic structure comprise the atomic coordinates listed in Table 6.
4. A method of identifying a compound that interacts with a riboswitch comprising:
(a) modeling the atomic structure of claim 1 with a test compound; and
(b) determining if the test compound interacts with the riboswitch.
5. The method of claim 4, wherein determining if the test compound interacts with the riboswitch comprises determining a predicted minimum interaction energy, a predicted bind constant, a predicted dissociation constant, or a combination, for the test compound in the model of the riboswitch.
6. The method of claim 4, wherein determining if the test compound interacts with the riboswitch comprises determining one or more predicted bonds, one or more predicted interactions, or a combination, of the test compound with the model of the riboswitch.
7. The method of claim 4, wherein the riboswitch is a guanine riboswitch.
8. The method of claim 7, wherein the guanine riboswitch is a riboswitch in Table 5.
9. The method of claim 4, wherein atomic contacts are determined in step (b), thereby determining the interaction of the test compound with the riboswitch.
10. The method of claim 9, further comprising the steps of:
(c) identifying analogs of the test compound;
(d) determining if the analogs of the test compound interact with the riboswitch.
11. The method of claim 10, wherein the compound is hypoxanthine.
12. A method of killing bacteria, comprising contacting the bacteria with an analog identified by the method of claim 10.
13. A method of killing bacteria, comprising contacting the bacteria with a compound identified by the method of claim 4.
14. The method of claim 4, wherein a gel-based assay is used to determine if the test compound interacts with the riboswitch.
15. The method of claim 4, wherein a chip-based assay is used to determine if the test compound interacts with the riboswitch.
16. The method of claim 4, wherein the test compound interacts via van der Waals interactions, hydrogen bonds, electrostatic interactions, hydrophobic interactions, or a combination.
17. The method of claim 4, wherein the riboswitch comprises an RNA cleaving ribozyme.
18. The method of claim 4, wherein a fluorescent signal is generated when a nucleic acid comprising a quenching moiety is cleaved.
19. The method of claim 4, wherein molecular beacon technology is employed to generate the fluorescent signal.
20. The method of claim 4, wherein the method is carried out using a high throughput screen.
21. A method of identifying compounds that interact with a riboswitch comprising contacting the riboswitch with a test compound, wherein a fluorescent signal is generated upon interaction of the riboswitch with the test compound.
22. A method of identifying a compound that interacts with a riboswitch comprising:
(a) identifying the crystal structure of the riboswitch;
(b) modeling the riboswitch with a test compound; and
(c) determining if the test compound interacts with the riboswitch.
23. The method of claim 22, wherein the riboswitch is a guanine riboswitch.
24. The method of claim 23, wherein the guanine riboswitch is a riboswitch in Table 5.
25. A regulatable gene expression construct comprising a nucleic acid molecule encoding an RNA comprising a riboswitch operably linked to a coding region, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch regulates expression of the RNA, wherein the riboswitch and coding region are heterologous.
26. The gene expression construct of claim 25, wherein the riboswitch is activated by a trigger molecule, wherein the riboswitch produces a signal when activated by the trigger molecule.
27. A method of detecting a compound of interest, the method comprising bringing into contact a sample and a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the riboswitch is activated by the compound of interest, wherein the riboswitch produces a signal when activated by the compound of interest, wherein the riboswitch produces a signal when the sample contains the compound of interest.
28. The method of claim 27, wherein the riboswitch changes conformation when activated by the compound of interest, wherein the change in conformation produces a signal via a conformation dependent label.
29. The method of claim 27, wherein the riboswitch changes conformation when activated by the compound of interest, wherein the change in conformation causes a change in expression of an RNA linked to the riboswitch, wherein the change in expression produces a signal.
30. The method of claim 29, wherein the signal is produced by a reporter protein expressed from the RNA linked to the riboswitch.
31. A method of inhibiting gene expression, the method comprising
(a) bringing into contact a compound and a cell,
(b) wherein the compound has the structure of Formula I
Figure imgf000179_0001
I wherein, when the compound is bound to a guanine-responsive riboswitch, R1 and R2 serve as a hydrogen bond donor, R7 serves as a hydrogen bond acceptor, R9 serves as a hydrogen bond donor, R10 serves as a hydrogen bond acceptor, wherein each independently represent a single or double bond, wherein the compound is not guanine, hypoxanthine, or xanthine, wherein the cell comprises a gene encoding an RNA comprising a guanine-responsive riboswitch, wherein the compound inhibits expression of the gene by binding to the guanine- responsive riboswitch..
32. The method of claim 31, wherein R3 is a hydrogen bond acceptor.
33. The method of claim 31, wherein, independently, R1, R2, R9 or a combination are - NR11-, -CHR11-, =CRπ-, and-C(=NRπ)-, where R11 is -H, -NH2, -OH, -SH, -CO2H, substituted or unsubstituted alkyl, alkoxy, aryl, aryloxy, or benzyloxy, -NHalkyl, -NHalkoxy, - NHC(O)alkyl, -NHCO2alkyl, -NHC(O)NH2, -NH-NH2, -NH-NHalkyl, -NH-NHalkoxy, -NH- S02alkyl, -NH-SO2-R12,-NHCO2CH2-R12, -NH-OR12, -N+H2-R12, -NH-NH-R12, and -NH-NH- CH2-R12, wherein R12 is:
Figure imgf000180_0001
wherein n is from 1 to 5, and R13 is one or more of -H, -NH2, -OH, alkoxy, -N- morpholino, or halide.
34. The method of claim 31, wherein R8 and R7 taken together can be represented as - CH=N-, -CH2-O-, -CH2-S-, or -CH2SO2-.
35. The method of claim 31, wherein R10 is -OH, -SH, -NH2, -CO2H, -alkoxy, - aryloxy, -benzyloxy, -halide, -NHalkyl, -NHalkoxy, -NHC(O)alkyl, -NHCO2alkyl, - NHCO2CH2-R12, -NHC(O)NH2, -NH-NH2, -NH-NHalkyl, -NH-NHalkoxy, -S02alkyl, - S02aryl, -NH-SO2alkyl, -NH-SO2-R12, -NH-OR12, -NH-R12, -NH-NH-R12, -NH-NH-CH2-R12, or -NH-CH2-R12, wherein R12 is:
Figure imgf000180_0002
wherein n is from 1 to 5, and R13 is one or more of -H, -NH2, -OH, alkoxy, -N- morpholino, or halide.
36. The method of claim 31, wherein R10 is NR14, wherein R14 is -H, -NH2, -OH, -SH, -CO2H, ~C02alkyl, -C02aryl, -C(O)NH2, substituted or unsubstituted alkyl, alkoxy, alkoxy, aryloxy, or benzyloxy, -NHalkyl, -NHalkoxy, -NHC(O)alkyl, -NHCO2alkyl, -NHC(O)NH2, - S02alkyl, -S02aryl, -NH-SO2alkyl, -NH-SO2-R12, -NH-OR12, -NH-R12, or -NH-CH2-R12, wherein R12 is:
Figure imgf000180_0003
wherein n is from 1 to 5, and R13 is one or more of -H, -NH2, -OH, alkoxy, -N- morpholino, or halide.R12 is as defined above.
37. The method of claim 31, wherein the compound has the structure of Formula II:
Figure imgf000181_0001
II wherein R7 is N or CH; wherein R10 is =0, =S, =NH, =N0H, =Nalkyl, =Nalkoxyl, =N-aryl, =Naryloxy, =N- benzyl, =Nbenzyloxy, =N-NH2, =N-NH0H, =N-NHalkyl, =N-NHalkoxy, =N-NHaryl, =N- NHaryloxy, =N-NHbenzyl, =N-NHbenzyloxy, =N-NH-(p-amino-phenyi), ==N-NH-(p- methoxyphenyl), =N-NH-(p-N-morpholino-phenyl); and wherein R2 is =CR15-, where R15 is -NH2, -NHNH2, -NHOH, -NHalkyl, -NHalkoxy, - NHaryl, -NHaryloxy, -NHbenzyl, -NHbenzyloxy, -N^^aryl, -N^Ha-Cp-N-niorpholino-phenyl), -^^-(p-aminophenyl), -N1H2-(p-methoxyphenyl), -NHCO2alkyl, -NHCO2benzyl, - NHNHalkyl, -NHNHaryl, -NHNHbenzyl, or -NHC(O)alkyl.
38. The method of claim 31, wherein the compound has the structure of Formula III:
Figure imgf000181_0002
III wherein R7 is N or CH; wherein R10 is -H, -OH, -SH, -alkoxy, halide, -NH2, -NHOH, -NHalkyl, -NHalkoxy, - NHaryl, -NHaryloxy, -NHbenzyl, -NHbenzyoxy, -NHC(O)alkyl, -NHCO2alkyl, NHCO2benzyl, -NHNH2, -NHNHalkyl, -NHNHaryl, or -NHNHbenzyl; and wherein R2 is =CR15-, where R15 is -NH2, -NHNH2, -NHOH, -NHalkyl, -NHalkoxy, - NHaryl, -NHaryloxy, -NHbenzyl, -NHbenzyloxy, -N+H2aryl, -N+H2-(p-N-moφholino-phenyl), -N+H2-(p-aminophenyl), -N+H2-(p-methoxyphenyl), -NHCO2alkyl, -NHCO2benzyl, - NHNHalkyl, -NHNHaryl, -NHNHbenzyl, or -NHC(O)alkyl.
39. A method comprising (a) testing a compound for inhibition of gene expression of a gene encoding an RNA comprising a riboswitch, wherein the riboswitch is a riboswitch in Table 5, wherein the inhibition is via the riboswitch,
(b) inhibiting gene expression by bringing into contact a cell and a compound that inhibited gene expression in step (a), wherein the cell comprises a gene encoding an RNA comprising the riboswitch, wherein the compound inhibits expression of the gene by binding to the riboswitch.
PCT/US2005/040487 2004-11-08 2005-11-08 Structure-based compound design involving riboswitches WO2006055351A2 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
US11/667,153 US20080269258A1 (en) 2004-11-08 2005-11-08 Riboswitches, Structure-Based Compound Design with Riboswitches, and Methods and Compositions for Use of and with Riboswitches
AU2005306801A AU2005306801B2 (en) 2004-11-08 2005-11-08 Structure-based compound design involving riboswitches
MX2007005491A MX2007005491A (en) 2004-11-08 2005-11-08 Structure-based compound design involving riboswitches.
CA002586998A CA2586998A1 (en) 2004-11-08 2005-11-08 Structure-based compound design involving riboswitches
JP2007540181A JP2008518630A (en) 2004-11-08 2005-11-08 Riboswitch, structure-based compound design by riboswitch, and method of using riboswitch and composition with riboswitch
EP05851443A EP1828953B1 (en) 2004-11-08 2005-11-08 Crystalline xpt guanine riboswitch from Bacillus subtilis and structure-based compound identification employing said riboswitch

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US62586404P 2004-11-08 2004-11-08
US60/625,864 2004-11-08

Publications (2)

Publication Number Publication Date
WO2006055351A2 true WO2006055351A2 (en) 2006-05-26
WO2006055351A3 WO2006055351A3 (en) 2007-03-22

Family

ID=36407624

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/040487 WO2006055351A2 (en) 2004-11-08 2005-11-08 Structure-based compound design involving riboswitches

Country Status (10)

Country Link
US (1) US20080269258A1 (en)
EP (1) EP1828953B1 (en)
JP (2) JP2008518630A (en)
KR (1) KR20070111448A (en)
CN (1) CN101111847A (en)
AU (1) AU2005306801B2 (en)
CA (1) CA2586998A1 (en)
MX (1) MX2007005491A (en)
SG (1) SG160326A1 (en)
WO (1) WO2006055351A2 (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009120347A2 (en) * 2008-03-27 2009-10-01 Memorial Sloan Kettering Cancer Center Modulators of rna riboswitches
WO2009134322A2 (en) * 2008-04-14 2009-11-05 Regents Of The University Of Colorado Lysine riboswitch and compositions and uses thereof
WO2010033777A2 (en) * 2008-09-19 2010-03-25 University Of Pittsburgh-Of The Commonwealth System Of Higher Education Discovery of t -homology in a set of sequences and production of lists of t-homologous sequences with predefined properties
WO2010090764A1 (en) 2009-02-09 2010-08-12 Supergen, Inc. Pyrrolopyrimidinyl axl kinase inhibitors
WO2012153142A3 (en) * 2011-05-10 2013-01-03 The University Of Manchester Riboswitches
WO2017165245A2 (en) 2016-03-19 2017-09-28 F1 Oncology, Inc. Methods and compositions for transducing lymphocytes and regulated expansion thereof
WO2018009923A1 (en) 2016-07-08 2018-01-11 F1 Oncology, Inc. Methods and compositions for transducing lymphocytes and regulating the activity thereof
US9993491B2 (en) 2009-06-12 2018-06-12 Socpra—Sciences Et Génie S.E.C. Guanine riboswitch binding compounds and their use as antibiotics
WO2018161064A1 (en) 2017-03-03 2018-09-07 F1 Oncology, Inc. Methods and compositions for transducing and expanding lymphocytes and regulating the activity thereof
WO2019222403A2 (en) 2018-05-15 2019-11-21 Flagship Pioneering Innovations V, Inc. Fusosome compositions and uses thereof
WO2020014209A1 (en) 2018-07-09 2020-01-16 Flagship Pioneering Innovations V, Inc. Fusosome compositions and uses thereof
WO2020102503A2 (en) 2018-11-14 2020-05-22 Flagship Pioneering Innovations V, Inc. Fusosome compositions for t cell delivery
WO2020102499A2 (en) 2018-11-14 2020-05-22 Flagship Pioneering Innovations V, Inc. Fusosome compositions for cns delivery
WO2020102485A1 (en) 2018-11-14 2020-05-22 Flagship Pioneering Innovations V, Inc. Fusosome compositions for hematopoietic stem cell delivery
US20200216878A1 (en) * 2013-12-30 2020-07-09 Curevac Real Estate Gmbh Methods for rna analysis
US11111505B2 (en) 2016-03-19 2021-09-07 Exuma Biotech, Corp. Methods and compositions for transducing lymphocytes and regulating the activity thereof
US11325948B2 (en) 2016-03-19 2022-05-10 Exuma Biotech Corp. Methods and compositions for genetically modifying lymphocytes to express polypeptides comprising the intracellular domain of MPL
US11371077B2 (en) 2016-08-03 2022-06-28 Meiragtx Uk Ii Limited High throughput cell-based screening for aptamers

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5223237B2 (en) * 2007-05-14 2013-06-26 ソニー株式会社 Method for detecting binding or interaction between nucleic acid strand for detection and substance
JP5463621B2 (en) * 2008-02-27 2014-04-09 ソニー株式会社 Target substance quantification method
US20100305197A1 (en) * 2009-02-05 2010-12-02 Massachusetts Institute Of Technology Conditionally Active Ribozymes And Uses Thereof
US20100248272A1 (en) * 2009-03-31 2010-09-30 Cornell University Method for identifying Smk box riboswitch modulating compounds
EP2524042A2 (en) * 2010-01-12 2012-11-21 Yale University Structured rna motifs and compounds and methods for their use
ES2623127T3 (en) 2010-02-01 2017-07-10 Mikrobex Bacteriotherapy for Clostridium difficile colitis
EP2531223A4 (en) * 2010-02-04 2013-06-26 Biorelix Inc Novel methods and utilities
US9782445B2 (en) 2013-06-05 2017-10-10 Rebiotix, Inc. Microbiota restoration therapy (MRT), compositions and methods of manufacture
US9694039B2 (en) 2013-06-05 2017-07-04 Rebiotix, Inc. Microbiota restoration therapy (MRT), compositions and methods of manufacture
US9511099B2 (en) 2013-06-05 2016-12-06 Rebiotix, Inc. Microbiota restoration therapy (MRT), compositions and methods of manufacture
US10383901B2 (en) 2013-06-05 2019-08-20 Rebiotix, Inc. Microbiota restoration therapy (MRT), compositions and methods of manufacture
US9511100B2 (en) 2013-06-05 2016-12-06 Rebiotix, Inc. Microbiota restoration therapy (MRT), compositions and methods of manufacture
CA2914636C (en) 2013-06-05 2021-04-06 Rebiotix, Inc. Microbiota restoration therapy (mrt), compositions and methods of manufacture
US10799539B2 (en) 2015-06-09 2020-10-13 Rebiotix, Inc. Microbiota restoration therapy (MRT) compositions and methods of manufacture
US10828340B2 (en) 2015-06-09 2020-11-10 Rebiotix, Inc. Microbiota restoration therapy (MRT) compositions and methods of manufacture
EP3939601A1 (en) 2015-06-09 2022-01-19 Rebiotix, Inc. Microbiota restoration therapy (mrt) compositions and methods of manufacture
US10905726B2 (en) 2015-06-09 2021-02-02 Rebiotix, Inc. Microbiota restoration therapy (MRT) compositions and methods of manufacture
WO2018034843A1 (en) * 2016-08-17 2018-02-22 Maumita Mandal Materials and methods for controlling gene expression
MX2019006825A (en) * 2016-12-12 2019-10-30 Univ Colorado Regents Use of biological rna scaffolds with in vitro selection to generate robust small molecule binding aptamers for genetically encodable biosensors.
CN108624516B (en) * 2017-03-20 2022-08-26 华东理工大学 Method for improving metabolite amount in fermentation cells and preparing IDMS standard substance
KR20230003477A (en) 2020-03-24 2023-01-06 제너레이션 바이오 컴퍼니 Non-viral DNA vectors and their use for expressing Factor IX therapeutics
JP2023520763A (en) 2020-03-24 2023-05-19 ジェネレーション バイオ カンパニー Non-viral DNA vectors for expressing Gaucher therapeutics and their use
CN113564155B (en) * 2021-07-22 2023-10-31 华侨大学 Aptamer screening method and application thereof
WO2024182701A1 (en) * 2023-03-01 2024-09-06 Northwestern University Membrane encapsulated cell-free systems with biosensing capabilities

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ATE357516T1 (en) * 1998-05-12 2007-04-15 Isis Pharmaceuticals Inc MODULATION OF MOLECULAR INTERACTION POSITIONS IN RNA AND OTHER BIOMOLECULES
US7125660B2 (en) * 2000-09-13 2006-10-24 Archemix Corp. Nucleic acid sensor molecules and methods of using same
WO2003005013A1 (en) * 2001-07-03 2003-01-16 Georgia Tech Research Corporation Filtration-based microarray chip
WO2004027035A2 (en) * 2002-09-20 2004-04-01 Yale University Riboswitches, methods for their use, and compositions for use with riboswitches.

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of EP1828953A4 *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009120347A2 (en) * 2008-03-27 2009-10-01 Memorial Sloan Kettering Cancer Center Modulators of rna riboswitches
WO2009120347A3 (en) * 2008-03-27 2009-12-17 Memorial Sloan Kettering Cancer Center Modulators of rna riboswitches
WO2009134322A2 (en) * 2008-04-14 2009-11-05 Regents Of The University Of Colorado Lysine riboswitch and compositions and uses thereof
WO2009134322A3 (en) * 2008-04-14 2010-03-18 Regents Of The University Of Colorado Lysine riboswitch and compositions and uses thereof
WO2010033777A2 (en) * 2008-09-19 2010-03-25 University Of Pittsburgh-Of The Commonwealth System Of Higher Education Discovery of t -homology in a set of sequences and production of lists of t-homologous sequences with predefined properties
WO2010033777A3 (en) * 2008-09-19 2010-09-10 University Of Pittsburgh-Of The Commonwealth System Of Higher Education Discovery of t -homology in a set of sequences and production of lists of t-homologous sequences with predefined properties
WO2010090764A1 (en) 2009-02-09 2010-08-12 Supergen, Inc. Pyrrolopyrimidinyl axl kinase inhibitors
US9993491B2 (en) 2009-06-12 2018-06-12 Socpra—Sciences Et Génie S.E.C. Guanine riboswitch binding compounds and their use as antibiotics
WO2012153142A3 (en) * 2011-05-10 2013-01-03 The University Of Manchester Riboswitches
US20200216878A1 (en) * 2013-12-30 2020-07-09 Curevac Real Estate Gmbh Methods for rna analysis
EP3430136A2 (en) * 2016-03-19 2019-01-23 F1 Oncology, Inc. Methods and compositions for transducing lymphocytes and regulated expansion thereof
WO2017165245A2 (en) 2016-03-19 2017-09-28 F1 Oncology, Inc. Methods and compositions for transducing lymphocytes and regulated expansion thereof
US10596274B2 (en) 2016-03-19 2020-03-24 Exuma Biotech Corp. Methods and compositions for transducing lymphocytes and regulated expansion thereof
US11325948B2 (en) 2016-03-19 2022-05-10 Exuma Biotech Corp. Methods and compositions for genetically modifying lymphocytes to express polypeptides comprising the intracellular domain of MPL
US11111505B2 (en) 2016-03-19 2021-09-07 Exuma Biotech, Corp. Methods and compositions for transducing lymphocytes and regulating the activity thereof
WO2018009923A1 (en) 2016-07-08 2018-01-11 F1 Oncology, Inc. Methods and compositions for transducing lymphocytes and regulating the activity thereof
US11371077B2 (en) 2016-08-03 2022-06-28 Meiragtx Uk Ii Limited High throughput cell-based screening for aptamers
WO2018161064A1 (en) 2017-03-03 2018-09-07 F1 Oncology, Inc. Methods and compositions for transducing and expanding lymphocytes and regulating the activity thereof
WO2019222403A2 (en) 2018-05-15 2019-11-21 Flagship Pioneering Innovations V, Inc. Fusosome compositions and uses thereof
WO2020014209A1 (en) 2018-07-09 2020-01-16 Flagship Pioneering Innovations V, Inc. Fusosome compositions and uses thereof
WO2020102485A1 (en) 2018-11-14 2020-05-22 Flagship Pioneering Innovations V, Inc. Fusosome compositions for hematopoietic stem cell delivery
WO2020102499A2 (en) 2018-11-14 2020-05-22 Flagship Pioneering Innovations V, Inc. Fusosome compositions for cns delivery
WO2020102503A2 (en) 2018-11-14 2020-05-22 Flagship Pioneering Innovations V, Inc. Fusosome compositions for t cell delivery

Also Published As

Publication number Publication date
AU2005306801A2 (en) 2006-05-26
EP1828953A4 (en) 2010-05-26
SG160326A1 (en) 2010-04-29
WO2006055351A3 (en) 2007-03-22
EP1828953A2 (en) 2007-09-05
JP2012065661A (en) 2012-04-05
MX2007005491A (en) 2007-09-27
JP2008518630A (en) 2008-06-05
EP1828953B1 (en) 2013-03-06
CN101111847A (en) 2008-01-23
AU2005306801B2 (en) 2012-05-17
KR20070111448A (en) 2007-11-21
AU2005306801A1 (en) 2006-05-26
US20080269258A1 (en) 2008-10-30
CA2586998A1 (en) 2006-05-26

Similar Documents

Publication Publication Date Title
EP1828953B1 (en) Crystalline xpt guanine riboswitch from Bacillus subtilis and structure-based compound identification employing said riboswitch
US8440810B2 (en) Riboswitches, methods for their use, and compositions for use with riboswitches
US20130029342A1 (en) Glycine riboswitches, methods for their use, and compositions for use with glycine riboswitches
US20130143955A1 (en) Cyclic di-GMP-II Riboswitches, Motifs, and Compounds, and Methods for Their Use
US20130012527A1 (en) Preq1 riboswitches and methods and compositions for use of and with preq1 riboswitches
US20120107331A1 (en) Gemm riboswitches, structure-based compound design with gemm riboswitches, and methods and compositions for use of and with gemm riboswitches
US20100137440A1 (en) Lysine riboswitches, structure-based compound design with lysine riboswitches, and methods and compositions for use of and with lysine riboswitches
US20100324123A1 (en) Glms riboswitches, structure-based compound design with glms riboswitches, and methods and compositions for use of and with glms riboswitches

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KN KP KR KZ LC LK LR LS LT LU LV LY MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: MX/a/2007/005491

Country of ref document: MX

Ref document number: 2007540181

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2586998

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 2005306801

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 1020077012055

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 2005851443

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2005306801

Country of ref document: AU

Date of ref document: 20051108

Kind code of ref document: A

WWP Wipo information: published in national office

Ref document number: 2005306801

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 200580046277.7

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 2005851443

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 11667153

Country of ref document: US