Nucleic acid probes that recognize and report the presence of a specific nucleic acid sequence have been used to detect specific nucleic acids primarily in in vitro reactions. See, for example, U.S. Pat. No. 5,925,517, incorporated herein by reference. One type of probe is designed to have a hairpin-shaped structure, with a central stretch of nucleotides complementary to the target sequence, and termini comprising short mutually complementary sequences. See, for example, Tyagi and Kramer, Nature Biotechnology, 14, 303-308 (1996), incorporated herein by reference. One terminus of the stem-loop shaped probe is covalently bound to a fluorophore and the other to a quenching moiety. When in their native state with hybridized termini, the proximity of the fluorophore and the quencher is such that relatively little or essentially no fluorescence is produced. The stem-loop probe undergoes a conformational change when hybridized to its target nucleic acid that results in the detectable change in the production of fluorescence from the fluorophore. Researchers have used the hairpin-shaped probe to perform in-situ visualization of messenger RNA (Matsuo, 1998, Biochim. Biophys. Acta 1379:178-184) in living cells.
The present invention provides methods and compositions comprising novel signaling probes for analyzing or isolating cells or generating cell lines expressing one or more RNA. The method is based on the detection of signal produced by the probes upon their hybridization with target sequence. The RNA may be introduced into the cells via a DNA construct, or the cells may be suspected of expressing the RNA endogenously. The DNA construct may further encode a tag sequence, and the signaling probe is complementary to the tag sequence. In one embodiment, the isolated cells, or generated cell lines are functionally null for expression or have reduced expression of one or more preselected proteins or RNAs. The invention also provides a method of generating transgenic animals using cells that are isolated according to the methods described.
The invention provides signaling probes used in a method for quantifying the expression level of one or more RNA transcripts. In addition, the signaling probes are used in a method for identifying a compound or RNA sequence that modulates transcription of at least one preselected RNA. In another embodiment, the signaling probes are used in a method for identifying genetic recombinational events in living cells. The signaling probe comprises one or more strands of nucleotides, wherein the signaling probe comprises nucleotides that are complementary to a target nucleic acid (e.g., RNA) of interest and wherein the signaling probe further comprises an interacting pair comprising two moieties. The structure of the signaling probe is such that when the signaling probe is not hybridized to the target sequence, the two moieties of the interacting pair are physically located such that no or background signal is produced. When the signaling probe is hybridized to the target sequence, the two moieties are such that a signal is produced. Alternatively, the moieties of the signaling probe may be such that when the probe is not hybridized to the target, there is particular signal produced and a different signal is produced upon hybridization of the probe to the target sequence. The two moieties of the interacting pair may be attached to one or more terminus of one or more strands of the signaling probe. Alternatively, the moieties may be internally incorporated into one or more strands of the signaling probe. The nucleotides of the signaling probe may also be modified.
The present invention also provides protease probes. In one embodiment, the signaling or protease probe comprises two separate strands of nucleic acid or modified nucleic acid one or more portions of which anneal to each other, and at least one terminus of one strand is adjacent to a terminus of the other strand. The nucleic acid may be DNA or RNA. For the signaling probe with two separate strands, in one embodiment, one strand has at least a quencher moiety on one terminus, and the other strand has at least a fluorophore on the adjacent terminus For the protease probe, in one embodiment, one strand has at least a proteolytic enzyme on one terminus, and the other strand has at least an inhibitor of the proteolytic enzyme on the adjacent terminus.
In another embodiment, the signaling or protease probe is designed to comprise at least a mutually complementary region and at least a non-complementary region. In one embodiment, at least one non-complementary region may be designed to form a loop region. In one embodiment, the probe is designed to form at least a stem-loop structure. In another embodiment, the probe forms a dumbbell structure or a three-arm junction structure. In one embodiment, the signaling probe has at least a fluorophore and at least a quencher moiety at each terminus of the strand. In one embodiment, the protease probe has a proteolytic enzyme and an inhibitor of the proteolytic enzyme at each terminus of the strand.
In another embodiment, the signaling or protease probe is chemically modified. One or more of the sugar-phosphodiester type backbone, 2′OH and purine or pyrimidine base is modified. In one embodiment, the deoxyribose backbone is replaced by peptide nucleic acid
In one embodiment, the tag sequence is a structural RNA, i.e., the RNA has secondary structure, preferably a three-arm junction structure. In one embodiment, the tag sequence comprises the structure or sequence according to
In other embodiments is provided:
This invention provides a method of isolating cells expressing an RNA comprising the steps of providing cells expressing the RNA, exposing the cells to a signaling probe that produces a detectable signal upon hybridization with the RNA, and isolating the cells that produce the signal. In one embodiment, the RNA is an endogenous RNA. In another embodiment, the endogenous RNA is expressed as a result of DNA that is introduced into the cell, e.g., the DNA comprises enhancer or promoter or other sequences that induce expression of an endogenous RNA. For example, the DNA may encode a protein, e.g., a transcription factor that induces expression of an RNA. In yet another embodiment, the RNA is encoded by a nucleic acid that is introduced into the cells. The DNA that is introduced into the cells may additionally encode a tag sequence where the signaling probe may optionally be targeted to the tag and/or RNA sequence. The tag sequence may be in-frame or out of frame with the open reading frame of the RNA. In one embodiment, the method includes detecting the expression of both an exogenous or heterologous RNA and an endogenous RNA. These methods may be practiced to detect multiple RNAs and/or tags at the same time. The RNAs may be the same or different RNAs. In those embodiments where the method is used to detect more than one RNA using more than one signaling probe, the signal produced by the different signaling probes may be the same or may be different from each other.
This invention also provides a method of isolating cells that comprise more than one copy of an exogenous or heterologous DNA. Such method include the steps of introducing a DNA encoding an RNA and a tag sequence and introducing a DNA encoding the same RNA and a different tag sequence; exposing the cells to signaling probes that produce a detectable signal to both tags; and isolating the cells that produce both signals.
In any of the methods of the invention where there is more than one exposing step, one or more of the exposing steps (i.e., the step where the cells are exposed to signaling probe) may be performed simultaneously or sequentially.
In any of the methods of the invention, the RNAs or proteins may be those in the same or related biological pathway, act upstream or downstream from each other, have modulating, activating, or repressing function with respect to each other, dependent on each other for function or activity, are components of the same complex, members of the same protein family, etc.
This invention provides a method of isolating cells comprising a DNA construct encoding a first RNA that is under the control of a conditional promoter, comprising the steps of introducing into cells a DNA construct encoding an RNA under the control of a constitutive promoter, wherein the DNA construct further encodes a second RNA under the control of a conditional promoter, under conditions where the second RNA is not expressed or expressed at a low level; exposing the cells to a signaling probe that produces a detectable signal upon hybridization with the first RNA; and isolating the cells that produce the signal. The DNA construct may further encode a test RNA, where, for example, the test RNA is variable, e.g., derived from an expression library. These cells may be used to obtain or identify RNA or compounds that are capable of activating the conditional promoter that drives expression of the second RNA.
Also provided by this invention are methods of isolating a plurality of cells wherein a subset of the cells express an RNA that is not expressed by another subset of the cells, comprising the steps of introducing into cells a plurality of DNA encoding a plurality of RNA, wherein at least a subset of the plurality of RNA are different from each other, exposing the cells to a plurality of different signaling probes, wherein the signaling probes produce a detectable signal upon hybridization to one or more RNAs encoded by the plurality of DNA, and isolating the cells that produce the signal. The subset of cells may be one cell or more than one cell. In one embodiment, the DNA does not encode RNA but results in the expression of endogenous RNA, e.g., the DNA comprises enhancer or promoter sequences, e.g., the DNA comprises sequences that induce expression of the endogenous RNA. For example, the DNA may encode a protein, e.g., a transcription factor that induces expression of an RNA. The DNA may additionally encode a tag sequence and the signaling probe may target the tag sequence and/or the RNA sequence. In one embodiment, the plurality of RNA form an expression library. In another embodiment, at least a subset of the DNA encode the same tag sequence.
This invention also provides a method of isolating two or more RNA libraries of cells comprising the steps of introducing into cells DNA encoding a first RNA expression library, wherein each DNA further encodes a first tag sequence, introducing into cells DNA encoding a second RNA expression library, wherein each DNA further encodes a second tag sequence, exposing the cells to a first signaling probe that produces a detectable signal upon hybridization to the first tag and a second signaling probe that produces a detectable signal upon hybridization to the second tag sequence, and isolating the cells that produce both signals. This method may be carried out with DNA encoding additional RNA expression libraries where one uses a third tag, etc.
Any of the methods of this invention may be used to identify a compound that modulates the expression of an RNA or plurality of RNAs by adding the compound to the cells and assaying for a change (increase or decrease) in signal produced by the signaling probe(s).
This invention provides methods of reducing expression of a protein comprising the steps of introducing into cells a DNA encoding an antisense RNA or an shRNA that reduces expression of the protein, exposing the cells to a first signaling probe that produces a detectable signal upon hybridization to the antisense RNA or shRNA, and isolating the cells that produce the signal. This method may further comprise the step of exposing the cells to a second signaling probe that produces a detectable signal upon hybridization to the RNA encoding the protein, wherein lack of signal from the second signaling probe indicates that the expression of the protein is reduced. One may also assay for reduced expression of the protein using other methods, e.g., using an antibody that specifically binds the protein, using a functional test, e.g. assaying for a known biological activity of the protein, etc. In one embodiment of this method, the step of exposing the cells to the first signaling probe is omitted. Any of the methods using siRNA may also be carried out using shRNA.
In any of the methods of this invention, the DNA that is introduced into the cells may be operably linked to a conditional promoter. In one embodiment, the RNA encoded by the DNA is lethal or damaging to the cell. The DNA may additionally comprise a selectable marker that can be used to select cells comprising the DNA.
This invention also provides a method of quantifying the expression level of an RNA in a biological sample comprising the steps of exposing the biological sample to a first signaling probe that produces a detectable signal upon hybridization with the RNA, quantifying the level of the signal in the biological sample, and correlating the level of signal with the expression level of the RNA.
This invention further provides a method of identifying a compound that modulates expression of an RNA comprising adding a test compound to cells expressing the RNA, exposing the cells to a signaling probe that produces a detectable signal upon hybridization with the RNA, and comparing the signal produced by cells exposed to the test compound to the signal produced by cells not exposed to the test compound, wherein an increase or decrease in signal produced by the former cells as compared to the signal produced by the latter cells indicates that the compound modulates expression of the RNA. In one embodiment, the RNA is encoded by DNA that is introduced into the cells. In one embodiment, the compound is an RNA or protein.
This invention provides a method of identifying a genetic recombinatorial event in living cells comprising the steps of exposing a cell to a signaling probe that produces a detectable signal upon hybridization with an RNA transcribed from a recombined sequence, wherein detection of a cell producing the signal indicates that the cell comprises the genetic recombinatorial event.
This invention also provides cells produced by any of the methods. These cells may be cultured and may be used to generate cell lines or a plurality of cell lines. The cells may be used for a variety of purposes, e.g., in a cell-based assay or where the cell is implanted in an animal, non-human animal, or mammal The cell may be an embryonic stem cell, a primary, germ , or stem cell. The cell may also be an immortalized cell. The cells may be endothelial, epidermal, mesenchymal, neural, renal, hepatic, hematopoietic, or immune cells. The cells may be eukaryotic, prokaryotic, mammalian, yeast, plant, human, primate, bovine, porcine, feline, rodent, marsupial, murine or other cells.
The tag sequences may comprise multiple target sequences, wherein one signaling probe hybridizes to each target sequence. The tag sequences may be an RNA having secondary structure. The structure may be a three-arm junction structure. The DNA may comprise multiple tag sequences. The tag sequence may be transcribed as the same RNA as the RNA encoded by the DNA or the tag sequence may be transcribed as a separate RNA. Also provided is a DNA construct comprising a DNA sequence that encodes an RNA and a tag sequence. The tag sequence may be any one of those described herein. Cells and vectors comprising the DNA construct are also provided.
This invention also provides libraries of mammalian cell lines comprising at least 1,000, at least 800, at least 600, at least 500, at least 400, at least 200, at least 100 or at least 50 cell lines, wherein each cell line comprises a stably integrated expressed sequence. Also provided are libraries of mammalian cell lines comprising at least 500, at least 400, at least 300, at least 200, at least 200, at least 100, at least 50 cell lines, wherein each cell line comprises at least two stably integrated sequences. Also provided are libraries of mammalian cell lines comprising at least 100, at least 50, at least 25, at least 10 cell lines, wherein each cell line comprises at least three stably integrated sequences. Also provided are libraries of mammalian cell lines comprising at least 50, at least 25, at least 20, at least 10 cell lines, wherein each cell line comprises at least four stably integrated sequences. The stably integrated sequences in these cell lines may additionally lack a selection marker, e.g., a drug resistance gene. The stably integrated sequences may be of known or unknown sequence identity. These sequences may have shared sequence homology, functional significance, or related origin. These libraries may be used for a variety of purposes, e.g., in a cell-based screening assay.
This invention also provides a method of identifying a compound that enhances the detection of targets in cells using signaling probes comprising the steps of introducing a signaling probe into cells comprising a target sequence, wherein the signaling probe produces a detectable signal upon hybridization with the target sequence, exposing the cells to a test compound, and detecting the signal produced by the cells, wherein an increase in the signal produced by cells exposed to the test compound as compared to the signal produced by cells not exposed to the test compound indicates that the test compound is a compound that enhances the detection of targets in cells using signaling probes.
This invention also provides a method of identifying a compound that mediates or improves the introduction of signaling probes into cells comprising the steps of exposing cells to a signaling probe in the presence of a test compound, wherein the cells comprise a target sequence and wherein the signaling probe produces a signal upon hybridization with the target sequence; and detecting the signal produced by the cells, wherein an increase in signal produced by the cells exposed to the test compound as compared to cells not exposed to the test compound indicates that the test compound is a compound that mediates or improves the introduction of signaling probes into cells.
These and other aspects of the invention will be appreciated from the following detailed description.
In all Panels, the compositions of the tubes are as indicated below:
This figure shows that each of the FPs tested were specifically reporting the presence of target sequences by emitting a greater signal in tubes containing oligos having targeted sequence as compared to control tubes containing oligos having non-targeted sequence. Tubes containing FPs in the presence of oligos comprising target sequence are indicated by asterisk.
The FPs tested in each Panel are listed below, alongside the oligo which includes sequence that is designed to be recognized by the FP:
This figure shows that each of the FPs tested were specifically reporting the presence of target sequences by emitting a greater signal in tubes containing oligo having targeted sequence as compared to control tubes containing no oligo or oligo having non-targeted sequence. Tubes containing FPs in the presence of oligos comprising target sequence are indicated by asterisk.
Sequences from 5′ to 3′ direction for TO-M1, TO-M2 and TO-M3 are listed below:
The protocol for panel A, B, C was according to the protocol for
Panel B shows the same cell one hour later, after it attached to the surface of the well. Panel C shows the same cell the following day, after it had undergone cell division.
The three panels each show that cells isolated according to the methods remain viable despite the previously unknown effects of the reagents used to expose the cells to the probes. These effects include compromising the plasma membrane of the cell and possibly further subjecting the cells to high pressures during FACS. Panel A shows that the cell membrane is not found to be compromised, and Panels B and C further demonstrate that the cell is viable since it can attach to the surface of the culture dish and divide, both of which are properties of viable cells.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. All publications and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control.
The term “adjacent” as used in the context of probes refers to a condition of proximity to allow an interacting pair to functionally interact with each other. For example, the condition of proximity allows a fluorophore to be quenched or partially quenched by a quencher moiety or a protease inhibitor to inhibit or partially inhibit a protease. The distance required for currently known fluorophore and quencher to interact is about 20-100 Å.
The term “basepair” refers to Watson-Crick basepairs.
The term “bulge region” refers to a single-stranded region of one nucleotide or modified nucleotide that is not basepaired. The bulged nucleotide can be between mutually complementary regions (for example,
The term “dumbbell structure” refers to a strand of nucleic acid or modified nucleic acid having the conformation of two stem-loop structures linked via the end of an arm from each of the stem regions (for example,
The term “interacting pair” refers to two chemical groups that functionally interact when adjacent to each other, and when not adjacent to each other, produce a detectable signal compared to the absence of signal or background signal produced by the interacting chemical groups, or produce a different signal than the signal produced by the interacting chemical groups. An interacting pair includes but is not limited to a fluorophore and a quencher, a chemiluminescent label and a quencher or adduct, a dye dimer and FRET donor and acceptor, or a combination thereof. A signaling probe can comprise more than one interacting pair. For example, a wavelength-shifting signaling probe has a first fluorophore and a second fluorophore that both interact with the quencher, and the two fluorophores are FRET donor and acceptor pairs.
The term “loop region” refers to a single-stranded region of more than one nucleotide or modified nucleotide that is not base-paired (for example,
The term “signaling probe” refers to a probe comprising a sequence complementary to a target nucleic acid sequence and at least a mutually complementary region, and further comprising at least an interacting pair. When the signaling probe is not bound to its target sequence, the moieties of the interacting pair are adjacent to each other such that no or little or different signal is produced. When the signaling probe is bound to the target sequence, the moieties of the interacting pair are no longer adjacent to each other and a detectable signal or a different signal than the signal produced by the probe in its unbound state is produced. In one embodiment, the signaling probe is a fluorogenic or fluorescent probe that comprises a fluorophore and a quencher moiety, and a change in fluorescence is produced upon hybridization to the target sequence. The moieties of the interacting pair may be attached to the termini of the signaling probe or may be attached within the nucleic acid sequence. Examples of moieties that may be incorporated internally into the sequence of the signaling probe include the quenchers: dabcyl dT, BHQ2 dT, and BHQ1 dT, and the fluorophores: fluorescein dT, Alexa dT, and Tamra dT.
The term “protease probe” refers to a probe comprising a sequence complementary to a target sequence and at least a mutually complementary region, and further comprising at least a proteolytic enzyme and at least an inhibitor of the proteolytic enzyme or another molecule capable of reversibly inactivating the enzyme. When the probe is not hybridized to a target sequence, the proximity of the proteolytic enzyme and the inhibitor of the proteolytic enzyme allows them to interact, inhibiting proteolytic activity. Upon hybridization of the probe to the target sequence, the proteolytic enzyme and its inhibitor are separated, activating the proteolytic enzyme. The proteolytic enzyme and inhibitor can be covalently or non-covalently attached to the probe.
The term “mismatch region” refers to a double-stranded region in a nucleic acid molecule or modified nucleic acid molecule, wherein the bases or modified bases do not form Watson-Crick base-pairing (for example,
The term “mutually complementary region” refers to a region in a nucleic acid molecule or modified nucleic acid molecule that is Watson-Crick base paired.
The term “non-complementary region” refers to a region in a nucleic acid molecule or modified nucleic acid molecule that is not Watson-Crick base paired. For example, the non-complementary region can be designed to have bulged nucleotides, a single-stranded loop, overhang nucleotides at the 5′ or 3′ ends, or mismatch regions.
The term “stem region” refers to a region in a nucleic acid molecule or modified nucleic acid molecule that has at least two Watson-Crick basepairs. For example, the stem region can be designed to have more than one mutually complementary region linked by non-complementary regions, or form a continuous mutually complementary region.
The term “stem-loop structure” refers to a nucleic acid molecule or modified nucleic acid molecule with a single-stranded loop sequence flanked by a pair of 5′ and 3′ oligonucleotide or modified oligonucleotide arms (for example,
The term “three-arm junction structure” refers to a strand of nucleic acid or modified nucleic acid that has a conformation of a stem region, a first stem-loop region, and a second stem-loop region linked together via arms of the stem regions (for example,
The signaling probe may have more than one interacting pair, or have different interacting pairs. In one embodiment, the signaling probe is a fluorogenic probe. In one embodiment, the fluorogenic probe does not emit or emits a background level of fluorescence in its unhybridized state, but fluoresces upon or fluoresces above the background level upon binding to its target. Multiple fluorophores can be used to increase signal or provide fluorescence at different color ranges. Multiple quenchers can be used to decrease or eliminate signal in the absence of target sequence. Examples of quenchers include but are not limited to DABCYL, EDAC, Cesium, p-xylene-bis-pyridinium bromide, Thallium and Gold nanoparticles. Examples of fluorophores include but are not limited to sulforhodamine 101, acridine, 5-(2′-aminoethyl)aminoaphthaline-1-sulfonic acid (EDANS), Texas Red, Eosine, and Bodipy and Alexa Fluor 350, Alexa Fluor 405, Alexa Fluor 430, Alexa Fluor 488, Alexa Fluor 500, Alexa Fluor 514, Alexa Fluor 532, Alexa Fluor 546, Alexa Fluor 555, Alexa Fluor 568, Alexa Fluor 594, Alexa Fluor 610, Alexa Fluor 633, Alexa Fluor 635, Alexa Fluor 647, Alexa Fluor 660, Alexa Fluor 680, Alexa Fluor 700, Alexa Fluor 750, Allophycocyanin, Aminocoumarin, Bodipy-FL, Cy2, Cy3, Cy3.5, CyS, Cy5.5, carboxyfluorescein (FAM), Cascade Blue, APC-CyS, APC-Cy5.5, APC-Cy7, Coumarin, ECD (Red613), Fluorescein (FITC), Hexachlorfluoroscein (HEX), Hydroxycoumarin, Lissamine Rhodamine B, Lucifer yellow, Methoxycoumarin, Oregon Green 488, Oregon Green 514, Pacific Blue, PE-Cy7 conjugates , PerC, PerCP-Cy5.5, R-Phycoerythrin (PE), Rhodamine, Rhodamine Green, Rodamine Red-X, Tetratchlorofluoroscein (TET), TRITC, Tetramethylrhodamine, Texas Red-X, TRITC, XRITC, and Quantum dots. See, for example, Tyagi et al. Nature Biotechnology 16:49-53, (1998) and Dubertret et al., Nature Biotechnology, 19:365-370 (2001), incorporated herein by reference.
The invention also provides signaling probes that are wavelength-shifting. In one embodiment, one terminus of the probe has at least a harvester fluorophore and an emitter fluorophore, an adjacent terminus of the probe has at least a quencher moiety. See, for example, Tyagi et al., Nature Biotechnology, 18, 1191-1196 (2000), incorporated herein by reference. In one embodiment, the harvester fluorophore and the emitter fluorophore are at the same terminus, wherein the emitter fluorophore is at the distal end, and a quencher moiety is at an opposite terminus to the harvester fluorophore. The emitter fluorophore may be separated from the harvester fluorophore by a spacer arm of a few nucleotides. The harvester fluorophore absorbs strongly in the wavelength range of the monochromatic light source. In the absence of target sequence, both fluorophores are quenched. In the presence of targets, the probe fluoresces in the emission range of the emitter fluorophore. The shift in emission spectrum is due to the transfer of absorbed energy from the harvester fluorophore to the emitter fluorophore by fluorescence resonance energy transfer. These types of signaling probes may provide a stronger signal than signaling probes containing a fluorophore that cannot efficiently absorb energy from the monochromatic light sources. In one embodiment, the harvester fluorophore is fluorescein and the emitter fluorophore is 6-carboxyrhodamine 6G, tetramethylrhodamine or Texas red.
In another embodiment, one terminus of the probe has at least a fluorophore F1, and another adjacent terminus has at least another fluorophore F2. The two fluorophores are chosen so that fluorescence resonance energy transfer (FRET) will occur when they are in close proximity. When the probe is not bound to its target sequence, upon excitation at the absorption band of F1, the fluorescence of F1 is quenched by F2, and the fluorescence of F2 is observed. When the probe is bound to its target sequence, FRET is reduced or eliminated and the fluorescence of F1 will rise while that of F2 will diminish or disappear. This difference in fluorescence intensities can be monitored and a ratio between the fluorescence of F1 and F2 can be calculated. As residual fluorescence is sometimes observed in fluorophore-quencher systems, this system may be more advantageous in the quantitative detection of target sequence. See, Zhang et al., Angrew. Chem. Int. Ed., 40, 2, pp. 402-405 (2001), incorporated herein by reference. Examples of FRET donor-acceptor pairs include but are not limited to the coumarin group and 6-carboxyfluorescein group, respectively.
In one embodiment, the signaling probe comprises a luminescent label and adduct pair. The interaction of the adduct with the luminescent label diminishes signal produced from the label. See Becker and Nelson, U.S. Pat. No. 5,731,148, incorporated herein by reference.
In another embodiment, the signaling probe comprises at least a dye dimer. When the probe is bound to the target sequence, the signal from the dyes are different from the signal of the dye in dimer conformation.
In yet another embodiment, the interacting pair may be an enzyme and an inhibitor of that enzyme, e.g., a nuclease and a nuclease inhibitor, a kinase and an inhibitor of the kinase, a protease and an inhibitor of the protease, a phosphatase and an inhibitor of the phosphatase, a caspase and an inhibitor of the caspase, or a ribozyme and an inhibitor of the ribozyme, or an antigen and an antibody that specifically binds to the antigen such that the detected target of the probe may be shuttled to a specific cellular localization of the antigen, e.g., to the synapse of a neuron, etc.
Conformation of Signaling Probes and Protease Probes or other Probes
The present invention provides signaling or protease probes or other probe comprising at least two separate strands of nucleic acid that are designed to anneal to each other or form at least a mutually complementary region. At least one terminus of one strand is adjacent to a terminus of the other strand (
The two separate strands may be designed to be fully complementary or comprise complementary regions and non-complementary regions. In one embodiment, the two separate strands are designed to be fully complementary to each other. In one embodiment, the two strands form a mutually complementary region of 4 to 9, 5 to 6, 2 to 10, 10 to 40, or 40 to 400 continuous basepairs at each end (see, e.g.,
The region that hybridizes to the target sequence may be in the complementary regions, non-complementary regions of one or both strands or a combination thereof. More than one target nucleic acid sequence may be targeted by the same signaling probe. The one or more targets may be on the same or different sequences, and they may be exactly complementary to the portion of the probe designed to bind target or at least complementary enough. In one embodiment, the two strands form a mutually complementary region at each end and the target complement sequence resides in the regions other than the mutually complementary regions at the ends (
In one embodiment, the signaling probe with at least two separate strands is a fluorogenic probe. In one embodiment, one strand has at least a quencher moiety on one terminus, and a fluorophore on an adjacent terminus of the other strand (
For the protease probe, in one embodiment, one strand has at least a proteolytic enzyme on one terminus, and an inhibitor of the proteolytic enzyme on an adjacent terminus of the other strand. In one embodiment, each of the 5′ and 3′ terminus of one strand has a proteolytic enzyme, and each of the 5′ and 3′ terminus of the other strand has an inhibitor of the proteolytic enzyme. In one embodiment, the 5′ terminus of one strand has a proteolytic enzyme and the 3′ terminus has an inhibitor of the proteolytic enzyme, and the 3′ terminus of the other strand has an inhibitor of the proteolytic enzyme and the 5′ terminus has a proteolytic enzyme.
In another embodiment, the signaling or protease probe or other probe is a strand of nucleic acid or modified nucleic acid that comprises at least a mutually complementary region and at least a non-complementary region. In one embodiment, the probe forms a stem-loop structure. The stem region can be mutually complementary, or comprise mutually complementary regions and non-complementary regions (
When the stem region is fully complementary, the stem region can include 3-4, 5-6, 7-8, 9-10, 2-6, 7-10, or 11-30 base-pairs (see, e.g.,
In one embodiment, the stem-loop structure comprises at least an interactive pair comprising two chemical groups, and one chemical group is at each terminus of the strand. In one embodiment, the signaling probe has at least a fluorophore and a quencher moiety at each terminus of the strand (
In one embodiment, the stem region comprises two mutually complementary regions connected via a non-complementary region, the mutually complementary region adjacent to the interactive pair forms 5 to 9 basepairs, and the mutually complementary region adjacent to the loop region forms 4 to 5 basepairs (
In the stem-loop structure, the region that is complementary to the target sequence may be in one or more stem regions or loop regions, or both. The region in the stem that hybridizes to the target may be in the mutually complementary regions, non-complementary regions or both. In one embodiment, the target complement sequence is in the single-stranded loop region. In one embodiment, the regions other than the stem region adjacent to the interactive pair is the target complement sequence (
The increase in stem length may increase the stability of the signaling probes in their closed conformation, and thus, may increase the signal to noise ratio of detectable signal. Exposure of these signaling probes to cells can be carried out at slightly elevated temperatures which are still safe for the cell followed by a return to normal temperatures. At the higher temperatures, the signaling probes would open and bind to their target if present. Once cooled, the signaling probes not bound to target would revert to their closed states, which is assisted by the increased stability of the stem. Similarly, other forces may be used to achieve the same outcome, for instance DMSO which is thought to relax base-pairing.
In another embodiment, the signaling or protease probe or other probe is a strand of nucleic acid that forms a three-arm junction structure (
In one embodiment, the three-arm junction structure comprises at least an interactive pair comprising two chemical groups, and one chemical group is at each terminus of the strand. In one embodiment, the probe has at least a fluorophore and a quencher moiety at each terminus of the strand. The protease probe has at least a proteolytic enzyme and an inhibitor of the proteolytic enzyme at each terminus of the strand.
In one embodiment, the stem region adjacent to the interactive pair forms 3 to 4, or 3 to 6 continuous basepairs, the stem region of the first stem-loop structure forms 4 to 5 continuous basepairs, and the stem region of the second stem-loop structure forms 2 to 3 continuous basepairs (
The region in the stem that hybridizes to the target may be in the mutually complementary regions, non-complementary regions or both. In one embodiment, the target complement sequence is in the single-stranded loop region. In one embodiment, the regions other than the stem region adjacent to the interactive pair is the target complement sequence (
In another embodiment, the signaling or protease probe or other probe is a strand of nucleic acid that forms a dumbbell-shaped structure (
In one embodiment, the signaling probe has at least a fluorophore and a quencher moiety at each terminus of the strand. The protease probe has at least a proteolytic enzyme and an inhibitor of the proteolytic enzyme at each terminus of the strand.
The region in the stem that hybridizes to the target may be in the mutually complementary regions, non-complementary regions or a combination thereof. In one embodiment, the target complement sequence is in the single-stranded loop region. In one embodiment, the target complement sequence is the region other than the two stem regions. More than one target nucleic acid sequence may be targeted by the same probe. The one or more targets may be on the same or different sequences, and they may be exactly complementary to the portion of the probe designed to bind target or at least complementary enough.
DNA or RNA folding programs are available in the art to predict the conformation of a given nucleic acid or modified nucleic acid. Such folding programs include but are not limited to the programs described in Nucleic Acids Res. 31: 3429-3431 (2003) and the oligoanalyzer 3.0 software available at the Integrated DNA Technologies SciTools website; hereby incorporated by reference. Such folding programs often predict a number of energetically more favorable structures. In other embodiments, the invention encompasses the energetically more favorable structures of probes FP1-18 (
The present invention also provides signaling or protease probes or other probes which are chemically modified. One or more of the sugar-phosphodiester type backbone, 2′OH, base can be modified. The substitution of the phosphodiester linkage includes but is not limited to —OP(OH)(O)O—, —OP(O−M+)(O)O—, —OP(SH)(O)O—, —OP(S−M+)(O)O—, —NHP(O)2O—, —OC(O)2O—, —OCH2C(O)2 NH—, —OCH2C(O)2O—, —OP(CH3)(O)O—, OP(CH2C6H5)(O)O—, —P(S)(O)O— and —OC(O)2NH—. M+ is an inorganic or organic cation. The backbone can also be peptide nucleic acid, where the deoxyribose phosphate backbone is replaced by a pseudo peptide backbone. Peptide nucleic acid is described by Hyrup and Nielsen, Bioorganic & Medicinal Chemistry 4:5-23, 1996, and Hydig-Hielsen and Godskesen, WO 95/32305, each of which is hereby incorporated by reference herein.
The 2′ position of the sugar includes but is not limited to H, OH, C1-C4 alkoxy, OCH2—CH═CH2, OCH2—CH═CH—CH3, OCH2—CH═CH—(CH2)nCH3 (n=0,1 . . . 30), halogen (F, Cl, Br, I), C1-C6 alkyl and OCH3. C1-C4 alkoxy and C1-C6 alkyl may be or may include groups which are straight-chain, branched, or cyclic.
The bases of the nucleotide can be any one of adenine, guanine, cytosine, thymine, uracil, inosine, or the forgoing with modifications. Modified bases include but are not limited to N4-methyl deoxyguanosine, deaza or aza purines and pyrimidines. Ring nitrogens such as the N1 of adenine, N7 of guanine, N3 of cytosine can be alkylated. The pyrimidine bases can be substituted at position 5 or 6, and the purine bases can be substituted at position 2, 6 or 8. See, for example, Cook, WO 93/13121; Sanger, Principles of Nucleic Acid Structure, Springer-Verlag, New York (1984), incorporated herein by reference.
Derivatives of the conventional nucleotide are well known in the art and include, for example, molecules having a different type of sugar. The O4′ position of the sugar can be substituted with S or CH2 For example, a nucleotide base recognition sequence can have cyclobutyl moieties connected by linking moieties, where the cyclobutyl moieties have hetereocyclic bases attached thereto. See, e.g., Cook et al., International Publication WO 94/19023 (hereby incorporated by reference herein).
Other chemical modifications of probes useful in facilitating the delivery of the probes into cells include, but are not limited to, cholesterol, transduction peptides (e.g., TAT, penetratin, etc.).
The methods of this invention are based upon the ability of signaling probes to produce a detectable signal upon hybridization to target RNA sequences in living cells. The signal produced should be detectably higher than that produced in control cells (e.g., background fluorescence). Thus, it is not necessary that the control cells produce no fluorescence at all. In one embodiment, the method is for detecting or quantitating RNA. One method is to isolate cells or generate cell lines that express at least an RNA. In any of the methods of the invention that involve isolating cells, the cells may be cultured and may also be cultured to generate cell lines. A DNA construct encoding an RNA or an RNA and a tag sequence is introduced into cells. The DNA construct may be integrated at different locations in the genome of the cell. Integration at one or more specific loci may also be accomplished. Then, the transfected cells are exposed to the signaling probe, which generates a detectable signal upon binding to the target RNA or tag sequence. The cells that produce the detectable signal are isolated. Cells can be isolated and cultured by any method in the art, e.g., cells can be isolated and plated individually or in batch. Cell lines can be generated by growing the isolated cells.
Any of the method of the invention may be carried out using a selection marker. Although drug selection (or selection using any other suitable selection marker) is not a required step, it may be used to enrich the transfected cell population for stably transfected cells, provided that the transfected constructs are designed to confer drug resistance. If selection using signaling probes is performed too soon following transfection, some positive cells may only be transiently and not stably transfected. However, this can be minimized given sufficient cell passage allowing for dilution or loss of transfected plasmid from non-stably transfected cells. Some stably integrated plasmids may not generate any RNA corresponding to cloned cDNA inserts. Others may generate RNAs which may not be or may be inefficiently detected by the signaling probes.
The RNAs can have one or more of the following different roles: messenger RNAs that encode proteins, fusion proteins, peptides fused to proteins, export signals, import signals, intracellular localization signals or other signals, which may be fused to proteins or peptides; antisense RNA, siRNA, structural RNAs, cellular RNAs including but not limited to such as ribosomal RNAs, tRNAs, hnRNA, snRNA; random RNAs, RNAs corresponding to cDNAs or ESTs; RNAs from diverse species, RNAs corresponding to oligonucleotides, RNAs corresponding to whole cell, tissue, or organism cDNA preparations; RNAs that have some binding activity to other nucleic acids, proteins, other cell components or drug molecules; RNAs that may be incorporated into various macromolecular complexes; RNAs that may affect some cellular function; or RNAs that do not have the aforementioned function or activity but which may be expressed by cells nevertheless; RNAs corresponding to viral or foreign RNAs, linker RNA, or sequence that links one or more RNAs; or RNAs that serve as tags or a combination or recombination of unmodified mutagenized, randomized, or shuffled sequences of any one or more of the above. RNAs may be under the control of constitutive or conditional promoters including but not limited to inducible, repressible, tissue-specific, heat-shock, developmental, cell lineage specific, or temporal promoters or a combination or recombination of unmodified or mutagenized, randomized, shuffled sequences of any one or more of the above.
In one embodiment, the signaling probes are fluorogenic probes. Fluorescence cell sorter or related technology can be used with fluorogenic probes to identify and/or separate cells exhibiting a certain level or levels of fluorescence at one or more wavelengths. Being able to detect reliably and efficiently mRNAs as well as other RNAs in living cells enables their use to identify and, if desired, separate cells based on their desired characteristics, for instance by using a Fluorescence Activated Cell Sorter (FACS). FACS technology currently allows sorting at up to 70,000 cells per second. 5,000,000 cells can be sorted in less than 2 minutes.
1. Generating Protein-Expressing Cell Lines
Some of the most tedious steps involved in generating cell lines are eliminated by the application of signaling probes as described herein. In one embodiment, following transfection with a DNA construct encoding a desired gene, one introduces into these cells fluorogenic probes designed to recognize the message of the gene of interest. This step can be performed following selection using a selection marker, e.g., drug selection provided that the transfected DNA construct also encodes drug resistance. Those cells transcribing the gene will fluoresce. Subsequent FACS analysis results in the isolation of the fluorescent cells which may then be grown to give rise to cell lines expressing the gene of choice.
In one embodiment, the signaling probes are designed to be complementary to either a portion of the RNA encoding the protein of interest or to portions of their 5′ or 3′ untranslated regions. If the signaling probe designed to recognize a messenger RNA of interest is able to detect endogenously existing target sequences, the proportion of these in comparison to the proportion of the target sequence produced by transfected cells is such that the sorter is able to discriminate the two cell types. The gene of interest may be tagged with a tag sequence and the signaling probe may be designed so that it recognizes the tag sequence. The tag sequence can either be in frame with the protein-coding portion of the message of the gene or out of frame with it, depending on whether one wishes to tag the protein produced.
Additionally, the level of expression of the gene of interest in any given cell may vary. This can be due to a variety of factors that can influence the level of RNA expression including but not limited to the quantity or copy number of DNA that was transfected into a cell, the site of any resulting genomic integration of the DNA and the integrity of the DNA and resulting expression from it following genomic integration. One may apply FACS to evaluate expression levels and differentially select individual cells expressing the same gene.
2. Generating Cell Lines that Down Regulate Genes
There are several studies describing the generation of cell lines which express not RNA that encodes a protein, but rather one that is the antisense of a gene or portion of a gene. Such methods aim to reduce the amount of a specific RNA or protein in a given cell. The steps described above for the generation of protein-expressing cell lines are equally applicable here and virtually identical except that here the signaling probe is designed to detect an RNA which is an antisense RNA.
Not all attempts at making stably transfected antisense-expressing cell lines result in cell lines where the expression of the targeted protein is affected sufficiently. This difficulty has made it less worthwhile to pursue the production of such cell lines. Given the ease of the procedure described here, one easily assays the effectiveness of numerous different genetic sequences for their ability to yield active, i.e., effective antisense expressing cell lines. One can then analyze these to determine which exhibit appropriate expression profiles where the down regulation of targeted genes can be analyzed.
RNA interference is an alternate approach which also aims to decrease transcription levels of specific genes. RNA interference may be induced transiently using chemically synthesized siRNA or DNA constructs encoding short siRNAs, or it may be stably induced if stable cells appropriately expressing the short siRNAs are generated. The methods described here can also be used to analyze or isolate cells or cell lines based on their expression of such siRNAs.
The application of the methods described here to obtain cells expressing RNAs that induce RNA interference is additionally important because it will help overcome some complexities currently encountered in such cells. For instance, although RNA interference is performed in order to reduce the transcriptional levels of targeted RNAs and may cause downstream effects on the transcriptional levels of other RNAs, it has also been shown to reduce the transcriptional levels of RNAs that are not intended targets. The identity of these unintended targets varies depending on the sequence of the RNA that is used to induce RNA interference. This is a complicating feature of using RNA interference in cells as results may be compromised by such unintended consequences. Because the methods described here enable the efficient generation of multiple stable cells each expressing for instance, a different RNA sequence used to induce RNA interference for the same gene, each of these sequences can be assayed in the cell for its effects on the transcriptional levels of other genes. Analysis of this information can be used to distinguish RNAs that have decreased transcriptional levels, of which the decrease is not due to a decreased expression level of the targeted RNA. RNA interference has rapidly gained popularity as it has been successfully used to overcome difficulties associated with achieving specific downregulation using antisense RNA. Our methods may be helpful in determining the most effective sequences for more specific RNA interference having reduced non-specific activity. Because the methods also provide a method for selection of the most effective and specific antisense, they also represent a method of identifying effective antisense RNAs.
3. Differentiating Between Cells Based on Cell Surface-Localized Antigens
Immunologists and others have long used the FACS to sort cells. Generally, this method is based on labeling cell-surface localized proteins with differentially labeled probes, usually fluorophore-labeled antibody probes. For instance, cells positive for expression of cell surface localized proteins may be carried out. This method is most commonly used under conditions designed to preserve the integrity of the cell and maintain its viability.
In accordance with the present invention, to detect the presence of cell surface localized protein, a signaling probe is made to target the mRNA encoding the protein of interest. The signaling probe is introduced into cells by transfection without abrogating cell viability. Then, the cell sorter is used to isolate positive-scoring cells. Additionally, if a combination of signaling probes is used, each targeted to the mRNA of one of the proteins of interest, each can be labeled differently. If cells have a greater number of targets than what can be detected in a single application of FACS, multiple rounds of sorting are performed to sort cells.
The methods described here also enable the analysis or sorting of cells for other cellular RNAs, for instance mRNAs that code for proteins that are internally localized or secreted from the cell, or RNAs that do not code for protein. As a result, one or more of the cellular RNAs described previously may be detected as targets, including RNAs encoding proteins which are inaccessible to the commonly used antibody probes or for which probes have not yet been developed. These may include membrane-associated, membrane-spanning, membrane-anchored, cytoplasmic, or nucleoplasmic proteins.
4. Assaying Cells for the Expression of Specific RNAs and Quantifying the Level of RNA Expression in Cells
If the target RNA of a fluorogenic probe that is introduced into a cell is present, the cell will fluoresce. This information can be qualitatively assessed by use of Fluorescence Microscopy (including confocal, laser-scanning or other types of microscopy) or FACS, and it is also quantifiable by either of these. For instance, instead of performing in situ reverse-transcription polymerase chain reaction (RT-PCR) on slices of tissues to determine a pattern of expression for a particular RNA, a signaling probe is used to carry out the same experiment. Moreover, using a combination of differently fluorescent fluorogenic probes, each targeting a specific RNA, one assays for the presence or quantity of several RNAs of interest in one step. Detection of RNAs (in fixed samples) can be performed at temperatures empirically determined to limit non-target specific signal generation. It is common practice to establish optimal temperature conditions for target detection when using nucleic acid probes. (Localization of Antigens in Combination with Detection of RNAs in Cells and Tissues)
Using fixed cells or tissue slices, one uses immunocytochemistry to describe the localization of the protein antigens recognized, and using signaling probe targeting specific RNAs, one co-localizes in the same samples the RNAs of interest. It has been shown that fluorogenic probes targeted to RNAs function in fixed cells.
5. Generating Cell Lines Expressing Multiple RNAs or Proteins
Using the methods of the present invention, one very quickly generates stably transfected cell lines expressing any number of RNAs or proteins, even without the need to maintain these cells in the presence of a mixture of numerous selective drugs (or using other selective agents). Following gene transfection and optionally, drug-selection, a combination of signaling probes, one to the message for each protein, is introduced into the cells. By designing the target complementary sequence of each fluorogenic probe to hybridize to the mRNA of only one of the genes or to the tag sequences with which the messages may be associated, each signaling probe is designed to recognize the mRNA encoded by only one of the genes. In one embodiment, the cells are then sorted by FACS. By selecting for one or more signals, a variety of cell lines is generated in a single application.
One may have a need to produce a cell line expressing a number of RNAs of interest that is above the number that may be identified in a single application of FACS. For instance, it would be highly informative to have a cell line in which are over-expressed all of the proteins and RNA sequences thought to be involved in the formation of a particular complex or involved in a biological pathway. For example, RNAs or proteins in the same or related biological pathway, RNAs or proteins that act upstream or downstream of each other, RNAs or proteins that have a modulating, activating or repressing function to each other, RNAs or proteins that are dependent on each other for function or activity, RNAs or proteins that form a complex or bind to each other, or RNAs or proteins that share homology (e.g., sequence, structural, or functional homology). If the number of RNAs required is greater than can be analyzed in one application of FACS, then to achieve this, the steps described above are repeated using cells already expressing a combination of some of the RNAs as the host cells into which would be transfected additional constructs encoding additional RNAs. Multiple rounds of the methods described may be used to obtain cells expressing all or a subset of the RNAs that are required.
If multiple RNAs to be expressed are all cloned into constructs conferring upon cells resistance to the same drug, in one embodiment, FACS can be used to isolate cells expressing all of the desired RNAs. In the case where the sequences are stably integrated into the genome, it is desired that the cells not lose expression of any of the sequences. However, it is possible that one or more of the sequences could be lost. If this is the case, one increases the concentration of the selective drug or selection agent in the media in which these cells are grown, making this possibility less likely. Alternatively, one uses constructs each of which confers resistance to a different drug, and maintains cells in a mix of appropriate drugs. Also, a subset of the constructs to be stably transfected into cells may be chosen so as to encode a resistance gene for one drug, and another subset to encode a resistance gene for another drug.
Moreover, if some cells of a cell line lose expression of an RNA of interest, then as one resort, the first experiment to isolate the cell line as described above is repeated and new cells obtained. Alternatively, the mixture of cells described are analyzed by FACS, with the aim of re-isolating cells expressing all of the desired RNAs. This is a very useful procedure as it again yields cells which give rise to a cell line with the same genetic make-up of the original cell line selected.
The approaches described above yield an unlimited supply of cells expressing any combination of proteins and RNA sequences, amenable to virtually unlimited methods of analysis. Yet it is possible that a protein that is overexpressed may be toxic to the cell, and as will be discussed later, this possibility can be readily addressed.
The ease with which it is possible to re-isolate cells expressing all of the desired RNAs from cells which no longer express all of the RNAs makes it possible to maintain cell lines in the presence of no drug or minimal concentrations of drug. The methods described here also enable the re-application of signaling probes to cells or cell lines generated previously. For example, to determine if and to what extent the cells are still positive for any one or more of the RNAs for which they were originally isolated.
6. Generating Cell Lines Dramatically Over-Expressing One or More RNAs or Proteins
For each gene that is to be highly over-expressed, for example, two or more sequences for the same gene are first cloned into DNA constructs optionally also conferring drug resistance or other selectable marker. Each of the multiple sequences for each gene is designed to include the sequence encoding a different tag sequence. In one embodiment, following transfection of the DNA constructs into cells and subsequent selection, fluorogenic probes, each of which is targeted to only one tag sequence and differentially fluorescently labeled, are introduced into the cells and the cell sorter is used to isolate cells positive for their signals. Such cells have integrated into their genomes at least one copy of each of the differentially tagged sequences, and thus the expression of the sequence of interest occurs from an increased number of copies of essentially the same sequence of interest. The sequence of interest may be integrated at different locations of the genome in the cell. This method is used in conjunction with the use of the FACS to pick out those cells scoring most intensely for the signal of each fluorophore. A portion or all of the different tags may be identical such that a common signaling probe directed against this common sequence may be used to detect all of the various tags in cells.
7. Generating Cell Lines Expressing Multiple Antisense RNAs
Stably transfected cell lines producing multiple antisense messages are created as follows. Such antisense messages target either mRNAs or other RNAs. One selects cells which express at different levels any one of the antisense sequences transfected. Through repeated rounds of stable transfections, one readily selects cells that would give rise to stably transfected cell lines which express the antisense message of an unlimited number of RNAs.
Of course, cells expressing other RNAs other than antisense RNAs can be prepared by the methods described herein. Such RNAs include but are not limited to one or more of mRNA, rRNA, siRNA, shRNA, other structural RNAs such as hnRNA, tRNA, or snRNA, RNAs that have RNA interference activity, RNAs that serve as tags, etc. 8. Generating Libraries of Cell lines
A plurality of cells are transfected with DNA constructs that form an expression library. Expression libraries of DNA sequences can include any of the kinds that are known in the art or a mixture of these, including but not limited to cDNA or EST libraries generated from, e.g., whole organisms, tissues, cells or cell lines, and synthetic libraries including but not limited to oligonucleotides or sequences coding for peptides. Similarly, libraries of DNA constructs not specifically designated as expression libraries may be used. For instance, a DNA construct library may include sequences of DNA that may have regulatory functions such as promoter, repressor, or enhancer elements, and these may be constitutive, inducible or repressible. Likewise, DNA construct libraries may comprise large segments of DNA such as entire or partial genetic loci or genomic DNA, from which transcription may occur. Any of these expression libraries of DNA constructs may each be wholly or partially mutagenized, randomized, recombined, shuffled, altered or treated in any combination thereof Additionally, any of these types of libraries may further comprise at least one tag that is expressed and that may be used as a target for the signaling probes.
Expression libraries of DNA sequences for specific classes of proteins can be made and used to generate cell line libraries. For instance, cDNAs for protein kinases can be cloned into expression constructs comprising a tag sequence. Cells stably expressing different kinases can be obtained and used to generate a cell line library limited to cell lines expressing kinases. The class of sequences can be chosen to meet further applications such as drug screening.
Libraries of cell lines can also express classes of proteins having sequence homology, belonging to the same protein family or within a functional family, and also proteins defined by their role in a given protein pathway or complex or system. For instance, in a drug screen for compounds which may bind to various HIV proteins, a cell line library where each cell line expresses a different HIV protein can be made and used for screening drug compounds.
Libraries of cell lines may be used to assay secreted peptides or proteins having a desired activity. First, a cell line library is made using an expression library (for peptides, proteins, ESTs, cDNAs, etc.) additionally encoding an export signal translated in frame with the peptides/proteins. Expressed peptides/proteins are secreted into the growth medium. The effect of these peptides on a particular activity can then be determined given an appropriately designed assay. Tissue culture supernatant from the cell lines can be collected and applied to test cells to assay for the activity.
In addition, a library of cell lines may be generated for instance by transfecting an expression library into cells, and optionally, first selecting cells transfected with the expression library (for instance, using a signaling probe directed against a tag that is included in the expression library) and then exposing the cells to one or more signaling probes directed against one or more RNAs of interest. This would enable one to express various RNAs in cells and determine which of these RNAs results in the downstream transcriptional upregulation or downregulation of one or more RNAs of interest.
9. Generating Cell Lines Which are Functional Knock-Outs for One or More Proteins
The methods of the present invention provide the means to prepare functional knock-outs in cultured cells. One generates cell lines which are functional knock-outs of any one protein of interest by generating cells expressing from multiple loci virtually the same antisense RNA or siRNA having RNA interference activity to a unique RNA sequence. Any of the methods of the invention using siRNA may also be carried out using shRNA. For instance, one transfects into cells multiple constructs each of which would encode either the antisense RNA for a particular gene or siRNA having RNA interference activity for the gene, or both. Here each antisense RNA sequence differs only in that each would be tagged with the nucleotide sequence of a unique tag sequence. One selects those cells expressing one or more or all of the differentially-tagged antisense RNAs. Similarly, the presence of each siRNA or shRNA with RNA interference activity would be determined by detection of a tag with which each siRNA or shRNA is associated. Because the FACS is used to quantify fluorescence as previously described, this feature enables one to select for those cells most strongly expressing any one or more of the antisense sequences. One could isolate cells exhibiting the desired expression levels of the targeted RNA, or little or no expression of it due to expression of any number or combination of expressed RNAs, which act to decrease the expression level of the targeted RNA.
Importantly, one or more of different antisense sequences and siRNAs having RNA interference activity targeting the same gene may be used in this approach. For instance, some of the antisense RNAs, the expression of which is selected for by using signaling probes and the FACS, is designed so as to target a particular region of the messenger RNA for the gene, whereas others are designed such that they target an alternate portion of the same messenger. In order to generate cell lines which are functional knock-outs of a protein of interest, one stably transfects into cells as many genetic sequences encoding similar or different antisense RNAs or siRNAs having RNA interference activity to the same gene of interest as is necessary for the production of a cell line which exhibits no detectable levels of expression of the protein of interest, or alternatively, acceptably low levels of expression.
Moreover, one generates cell lines in which multiple proteins are functionally knocked-out or have reduced expression levels by repeating the procedure described above while targeting any number of sequences to be knocked-out functionally by antisense or siRNA. For instance, to study the function of a complex of proteins, one knock-outs or reduces the expression levels of one, all, or any combination of the proteins making up the complex.
10. Generating Cell Lines Which are Functional Knock-Outs of Only Selected Alternatively Spliced Forms of One or More Genes
Differentially spliced versions of a single gene are often translated into proteins with differing functions. Using the methods of the present invention, one generates cell lines in which only selected alternatively spliced forms of one or more proteins are functional knock-outs or are reduced in expression levels. For example, by designing antisense or siRNAs that would target only those alternatively spliced versions of the messenger RNA of the gene that one would like to eliminate from the cell, one functionally knock-outs all of the alternatively spliced RNAs of the gene of interest, or sufficiently reduces their expression levels to desired levels except for those alternatively spliced messages which are of interest.
11. Generating Cell Lines Expressing One or More RNAs or Proteins While Functionally Knocked-Out for One or More Other Proteins
For instance, for a given group of proteins that is thought to interact with each other, one can study their interactions by generating stably transfected cell lines in which one or more of the proteins of interest are functionally knocked-out or have reduced expression levels by the cell's expression of antisense or siRNAs (or shRNAs). The function of the remaining proteins of interest in the cell can then be studied, but perhaps more interestingly, such a cell could be further altered by further manipulating it such that it will now over-express one or more of the remaining proteins of interest. In addition, one can over-express or eliminate or reduce the expression of additional proteins in cells. Again, it is possible that overexpression of certain proteins or a functional knock out of certain proteins may be lethal to cells. This is a problem that will be addressed below.
Analogous methods can be used to randomly up or down-regulate genes by introduction of DNAs with direct or indirect roles in transcriptional regulation. For instance, DNA sequences including but not limited to one or a combination of a promoter, enhancer, or repressor sequences, or a sequence which has some other binding or functional activity that results in the modulation of transcriptional levels for one or more RNAs can be transfected into cells. The activity of these elements may be constitutive, inducible or repressible. The signaling probes to one or more specific RNAs can be used to identify or isolate cells where the expression levels of these RNAs have been increased or decreased. Stable integration of some DNA sequences using the methods described may randomly sufficiently turn on or shut off transcription from genetic loci. These cell line libraries can be screened for cells which are effectively overexpressing or knocked-out for specific genes. Cells with desired levels of one or more RNAs can be selected in this way. Multiple rounds of this procedure may be performed, if necessary, to isolate cells having the desired expression profiles for multiple RNAs of interest.
12. Generation of Transgenic Mice
For some purposes, the study of cells in culture is not sufficient. The methodology described above, however, also lends itself towards the manipulation of embryonic stem cells. Embryonic stem cells may be obtained that could either express multiple RNAs or proteins or act as functional knock-outs of multiple proteins or a subset of the alternatively spliced forms of multiple proteins, etc., following the above procedures. Such embryonic stem cells are then used as the basis for the generation of transgenic animals.
Cells isolated according to the methods described may be implanted into organisms directly, or their nuclei may be transferred into other recipient cells and these may then be implanted. One use could be to generate transgenic animals and other uses may include but are not limited to introducing cells which synthesize or secrete cellular products into the organism and cells which are engineered to carry out desired roles in the organism.
13. Generating Inducible Stably Transfected Cell Lines
The over-expression or the lack of expression of certain proteins or RNAs in cells may be lethal or damaging. Yet it may be of critical importance to study a cell over-expressing a toxic protein or RNA, or one which is a functional knock-out of a protein or RNA, without which the cell is unable to survive. To this end, one generates stably transfected cells where selected RNAs having such deleterious effects on the cell are under the control of inducible or conditional promoters. To isolate such cell lines, in one embodiment, the transfected and optionally drug-selected cells are first minimally induced to affect transcription of the inducible genes, and the cells are then subjected to FACS analysis following the transfection into them of signaling probes designed to recognize the appropriate RNAs. The cells obtained are maintained such that the toxic RNAs are induced and transcribed only when necessary.
Inducible systems may be advantageous for applications other than the expression of toxic RNAs. For instance, one induces the expression of genetic sequences stably transfected into cells at a certain point during the cell cycle of a synchronized cell line. Alternatively, if the expressed products of a set of one or more stably transfected genetic sequences is thought to act on the expressed products of another set, then it is of interest to clone the genetic sequences of the first set under the control of one inducible promoter, and those of the second set under the control of a second inducible promoter. By varied inductions, one studies the expressed products encoded by either set of genetic sequences in either the absence or the presence of the expressed products of the other set. In general, the DNA sequences which are incorporated into cells as described in the methods above can each be placed under the control of a promoter or other regulator of transcription with desired activity. For instance inducible, tissue-specific, time-specific or temporal promoters, enhancers or repressors may be used, as well as regulatory elements that are modulated, activated or repressed due to cellular or extracellular signals, including but not limited to one or more of compounds or chemicals, other cells, proteins, peptides, hormones, signaling molecules, factors secreted from cells, whole or fractionated extracts from organisms, tissues or cells, or environmental samples. In these cases cells are first exposed to appropriate levels of the agents regulating transcription prior to their exposure to the signaling probe.
14. Detecting Genetic Recombinational Events in Living Cells and the Subsequent Isolation of Non-Recombined or Differentially Recombined Cells
Parallel to the use of signaling probes to detect cells having undergone the recombinational events involved in the creation of stable cell lines, is the use of signaling probes to detect and isolate from a mixture of living cells those cells which have undergone other specific recombinational events. The same principle can be used to assay for VDJ recombination, translocation, and viral genome integration, for instance.
In cellular recombinational events, for instance, one sequence of genomic DNA is swapped for another. If a DNA sequence encoding a region where a recombinational event occurred is transcribed into RNA, then the presence of such an event is detected by a signaling probe designed to recognize either the RNA transcribed from the unrecombined DNA sequence, or that which is transcribed from the recombined sequence. Such an assay can also be carried out using the Fluorescent Microscope (or other equipment which can quantify the resulting signal). If one would like to separate cells which have recombined from those which have not, then one subjects the cells to FACS and sorts them. In addition, FACS can be used to sort out cells based on the presence or absence in them of numerous recombinational events.
15. Sorting Cells on the Basis of Expressed RNAs
The use of signaling probes as described herein allows cells to be sorted based on their expression of RNAs encoding internally localized, cell-surface localized, or secreted proteins as well as for other RNAs that may be present in the cell. For instance, starting from a mixed population of cells, one isolates those cells which express internally localized proteins of interest by designing signaling probes which recognize the mRNAs which give rise to these proteins. These signaling probes are transfected into the mixture of cells and FACS can be used to sort them as appropriate. Multiple rounds of sorting may be carried out.
Additionally, a researcher may be interested, for instance, in isolating cells which express the mRNA of one or more specific protein or RNAs that are transcribed in response to a given added factor, or in determining which added factor induces or represses the transcription of one or more proteins or RNAs. The added factors which may be tested in this way may include but are not limited to one or more of nucleic acids, proteins, peptides, hormones, signaling molecules, chemical compounds, inorganic or organic chemicals, cells, whole or fractionated extracts from or derived from organisms, tissues or cells, products purified or isolated from cells or organisms, samples from the environment or other sources.
To isolate cells that are induced to express one or more specific RNAs in response to a cytokine, for instance, a mixture of cells is first induced by the cytokine, then transfected with signaling probes, each of which is designed to recognize the mRNA that would give rise to one of the proteins of interest. In one embodiment, the FACS is then used to isolate those cells which score positive for the mRNA of interest. In an alternative embodiment, one also assays cells infected with a virus, for instance, for their expression of a particular gene. Alternatively, a set or library of compounds such as a chemical compound library, or an expression library of RNAs can be applied to cells to determine which if any compounds or mixture of compounds, or RNAs induces, represses or modulates the transcription of one or more specific proteins or RNAs.
It is possible with the methodology described hereinabove to detect cells positive for the presence of RNAs with one or more of the following different roles: messenger RNAs that encode proteins, fusion proteins, peptides fused to proteins, export signals, import signals, intracellular localization signals or other signals, which may be fused to proteins or peptides; antisense RNA, siRNA, short RNAs which form hairpin structures that have an activity similar to siRNA; structural RNAs, cellular RNAs including but not limited to such as ribosomal RNAs, tRNAs, hnRNA, snRNA; random RNAs, RNAs corresponding to cDNAs or ESTs; RNAs from diverse species, RNAs corresponding to oligonucleotides, RNAs corresponding to whole cell, tissue, or organism cDNA preparations; RNAs that have some binding activity to other nucleic acids, proteins, other cell components or drug molecules; RNAs that may be incorporated into various macromolecular complexes; RNAs that may affect some cellular function; or RNAs that do not have the aforementioned function or activity but which may be expressed by cells nevertheless; RNAs corresponding to viral or foreign RNAs linker RNA, or sequence that links one or more RNAs; or RNAs that serve as tags or a combination or recombination of unmodified mutagenized, randomized, or shuffled sequences of any one or more of the above. RNAs may be under the control of constitutive or conditional promoters including but not limited to inducible, repressible, tissue-specific, or temporal promoters or a combination or recombination of unmodified or mutagenized, randomized, shuffled sequences of any one or more of the above. Expression of the RNAs described above may result from the introduction into cells of DNA constructs, vectors or other delivery methods that deliver nucleic acids that result in their expression.
16. In Vivo Detection of Nucleic Acids and Subsequent Selection of Cells using Protease Probes
The present invention is also directed to a novel form of protease probe which, in contrast to the signaling probe, exhibits proteolytic activity upon binding to their target nucleic acids. Such proteolytic activity may be used for detection purposes, but also to degrade particular protein sequences in a cell should the target nucleic acid be present in the cell. For example, a protease which specifically cleaves a viral protein may be activated when transcription of a viral sequence is activated, such as in a latent infection and where, e.g., the protease probe is directed against the viral message).
In another aspect, the present invention is directed to a proteolytic activity-generating unitary hybridization probe, herein referred to as a protease probe. Such protease probes also comprise nucleotides or modified nucleotides complementary to a target RNA and nucleotides or modified nucleotides mutually complementary. These protease probes operate in a similar fashion to the aforementioned probes, but instead of a production of or change in fluorescent signal upon interaction of the signaling probe with its target nucleic acid sequence, the protease probe becomes proteolytic in the presence of the target.
One can substitute protease probes in place of signaling probes in the above methods, yielding new possibilities. Upon transfection of protease probes into cells expressing the RNA that is recognized by a protease probe, the protease probe hybridizes to its target. This causes activation of the protease as in its hybridized state, the protease is no longer in the vicinity of its protease inhibitor. A cell in which the target of such a protease probe is present and recognized, is damaged and is thus selected against. Cells not expressing this mRNA are more or less unaffected. Conversely, the protease probe can be designed to catalyze a proteolytic reaction which stimulates or otherwise imparts a beneficiary effect on cell growth or viability, or imparts a growth advantage to cells where its target is present and recognized.
Additionally, protease probes are useful in various other applications. The activity of the protease can be readily measured, and furthermore, the active protease in the presence of a particular nucleic acid target sequence may be employed not only for detection purposes but also for therapeutic purposes, in which, for example, a cell in which the protease probe is delivered is proteolyzed and rendered nonviable if a particular gene is transcribed, for example, one related to cellular transformation, oncogenesis, dysproliferation, and the like. For example, given a mixture of cells in which some of the cells are infected by a particular virus, one introduces into the cells a protease probes that targets a specifically viral mRNA. Cells which carry such an mRNA activate the proteolytic activity of the protease probe they contain, and this destroys these cells.
Preferably, the proteolytic enzyme inhibitor is a peptide or small chemical, although other molecules including but not limited to metals and metal chelators are also useful, to provide reversible inhibition of the enzyme upon interaction with the inhibitor. Examples of useful pairs of proteolytic enzymes and inhibitors of the proteolytic enzyme include but are not limited to aminopeptidase and amastatin, trypsin-like cysteine proteases and antipain, aminopeptidase and bestatin, chymotrypsin like cysteine proteases and chymostatin, aminopeptidase and diprotin A or B, carboxypeptidase A and EDTA, elastase-like serine proteases and elastinal, and thermolysin or aminopeptidase M and 1,10-phenanthroline.
In addition, probes incorporating other interacting pairs can be used where one member of the interacting pair has a desired activity and the second acts to inhibit or diminish this activity when the probes are unbound to the target. Upon binding to their targets, the activity of the probe is exhibited as the inhibitory member of the interacting pair is no longer in the vicinity of the member having the desired activity.
Based on the foregoing description, the following methods may be carried out.
A method for isolating cells expressing at least one RNA, comprising the steps of:
This method may further comprise the step of growing the isolated cells to generate a cell line expressing the RNA. A plurality of cell lines may be generated if the DNA construct is integrated at different locations in the genome of the transfected cells. Unless genomic integration of a transfected construct is directed to a particular location with the genome, integration is thought to occur randomly, so each positive cell may be different from another, and there would be multiple different cell lines all positive for the RNAs for which they selected. A plurality of cell lines may also be generated if the DNA construct is introduced into a mixed population of cells, for example, immortalized, primary, stem and germ cells or cell lines. The cells may also be from any established cell line, including but not limited to HeLa, HEK 293T, Vero, Caco, Caco-2, MDCK, COS-1, COS-7, K562, Jurkat, CHO-K1, Huvec, CV-1, HuH-7, NIH3T3, HEK293, 293, A549, HepG2, IMR-90, MCF-7, U-2 OS or CHO. Optionally, the DNA construct may further encode at least a drug resistance marker or other selectable marker, and the method may further comprise the step of selecting cells using the selection marker after step a). Isolated cells may be grown separately or pooled. Whenever cells are isolated, whether following transfection with one or more constructs or one or more expression libraries, the isolated cells may be grown separated from each other, or pooled.
The isolated cells may be further prepared to express a second RNA. In either a simultaneous or sequential fashion, additional steps include transfecting the cells or cell line with a second DNA construct encoding a second RNA; exposing said cells to a second signaling probe which produces a detectable signal upon hybridization to said second RNA; and isolating cells that exhibit the signal of at least one or both of said RNA and second RNA. The first signaling probe may produce the same or a different signal from the second signaling probe, for example, they may have the same or different fluorophores. Cells or cell lines expressing more than two RNAs may be provided by repeating the steps simultaneously or sequentially. The second DNA construct may also contain the same or different drug resistance or other selectable marker. If the first and second drug resistance markers are the same, simultaneous selection may be achieved by increasing the level of the drug. A plurality of cell lines may be generated by repeating the above steps in a simultaneous or sequential fashion using DNA constructs that form an expression library, wherein at least a portion of the cells express different RNA.
A related approach is disclosed in which a tag sequence associated with the transfected gene is used as the target for the signaling probe, of which one application is to allow the selection of cells whose RNA may be difficult to identify over background, for example, if the signaling probe detects a closely related RNA species. Accordingly, a method for isolating cells expressing at least one RNA is provided comprising the steps of:
This method is essentially the same as that described previously, except that the signaling probe used is designed to recognize the tag sequence rather than said RNA. A benefit of this procedure over the previous one is that only a small number of signaling probes, corresponding to the number of different tag sequences, is needed to prepare a large number of different cell lines expressing one or more RNAs. Optionally, the DNA construct may further encode at least a drug resistance or other selectable marker, and the method may further comprise the step of selecting cells resistant to at least one drug or other selective agent to which said marker confers resistance after step a). Isolated cells may be grown separately or pooled. Whenever cells are isolated, whether following transfection with one or more constructs or one or more expression libraries, the isolated cells may be grown separated from each other, or pooled.
Tag sequences refers to a nucleic acid sequence that is expressed as part of an RNA that is to be detected by a signaling probe. Signaling probes may be directed against the tag by designing the probes to include a portion that is complementary to the sequence of the tag. Examples of tag sequences which may be used in the invention, and to which signaling probes may be prepared include but are not limited to the RNA transcript of epitope tags which include but are not limited to HA (influenza hemagglutinin protein), myc, his, protein C, VSV-G, FLAG, or FLU. These and other tag sequences are known to one of skill in the art and typically correspond to amino acid sequences which may be incorporated into expressed protein products and often selected based on the availability of robust antibodies or protein detection reagents which may be used to report their presence. The tag sequences described herein are not meant to refer solely to sequences which may be used to modify at the amino acid level protein products encoded by the RNAs that are tagged, or to aid in the subsequent detection of any such modified protein products through use of the corresponding antibody or protein detection reagents. As used herein, the tag sequence provides at least a unique nucleic acid sequence for recognition by a signaling probe. The signaling probes have been described for use in detecting a variety of RNAs. Any of these RNAs may be used as tags. The DNA portion of the construct encoding the tag sequence may be in frame or out of frame with the portion of the DNA construct encoding the protein-coding portion of the at least one RNA. Thus, the tag sequence does not need to be translated for detection by the signaling probe.
A tag sequence may comprise a multiple repeated sequence which is designed to act as target sites for a signaling probe. Such a tag sequence would provide multiple signaling probe target sites. As a result, a greater number of signaling probes can bind. This would increase the total signal that can be generated from any one nucleic acid molecule that is to be detected by a signaling probe, and thus increase signal to noise ratios.
In addition to target sequences for probes, tags may comprise one or more additional sequences (referred to as “helper” or “helper sequence”) designed, identified or selected to improve the detection of cells expressing the tags. For instance, helpers may have a number of effects including but not limited to effects on the folding, localization, or secondary, tertiary or quaternary structure of the tags where any or a combination of these effects acts to improve or increase the detection of the target sequence in cells. Helpers may influence tag folding or structure so that the target sequence is presented such that it is more accessible for probe binding. This could result from altered base-pairings or it could be due to binding interactions between the helpers and proteins or other cellular or introduced components. Helpers may work to stabilize or to make more dynamic the folding or structure of sequences comprising them. Also, helpers may act to stabilize with respect to degradation the sequences comprising them either before, during or following probe binding, and such effects could result directly from changes in folding or structure of sequences or as a consequence of the binding of proteins or cellular or introduced components to the helper sequences. Also, helpers may act to increase the transcription of sequences comprising them, for instance by enhancing the efficiency of transcriptional initiation or processing, decreasing premature termination of transcription or increasing the efficiency of post-transcriptional processing.
Functional approaches can be used to identify helpers regardless of how they exert their effect, and there may be different helper sequences for any given tag and corresponding probe. Helper sequences that work for multiple sets of a tag and corresponding probe may be identified functionally, also.
Variable sequences can be tested to identify which ones act as helpers for instance by constructing an expression library comprising a gene sequence and a tag sequence where variable sequences are inserted in between the gene and tag sequences. If the gene has a stop codon, the variable sequence may be inserted downstream of the stop codon. Additional variable sequences may be inserted at different sites. Next, the expression library is introduced into cells, and the cells are subsequently assayed by introduction of signaling probe directed against the target sequence. Cells that exhibit increased signal above control are detected (where control signal is the signal exhibited by control cells or cells into which a control expression construct, for instance one comprising the gene and the tag but no additional variable sequence, has been introduced). Such cells may be isolated for instance by FACS and the variable sequences represented by them may be isolated and further characterized, if desired. This approach can be used to detect or identify sequences that act as helper sequences for the specific combinations of a gene, tag and corresponding signaling probe used. Essentially the same approach can be used to find helper sequences for any sequence that comprises at least a target sequence for a corresponding signaling probe (i.e., an expression library comprising either a variable sequence and a target sequence but for instance no gene or an expression library comprising a gene, itself comprising the target sequence, and a variable sequence but no tag can each be used to detect suitable helpers). Cells that are isolated may be grown and cell lines may be generated.
The benefit of helper sequences that are detected or identified in this way may be specific for the sequence context which was used (i.e., for the specific tag, gene, target sequence or corresponding probe). More versatile helper sequences that are beneficial for more broadly in various sequence contexts can be determined experimentally for instance by following the methods described above. For instance, iterative rounds of selection of variable sequences can be performed where variable sequences acting as helper sequences can be isolated in each round and then tested in a subsequent rounds. In this case for instance, each subsequent round of testing would be performed by creating expression vectors where the sequences isolated from the previous round would be used to create an expression construct comprising a gene or tag or target sequence that is different from that used in the expression construct of the first round. The methods described can also be used to confirm the versatility of helper sequences given diverse sequence contexts. Identifying versatile or universal helper sequences can be helpful to aid in the detection of diverse sequences in cells.
One source of variable sequences could be genomic sequence. Genomic sequence can be digested with restriction enzymes to yield fragments of various sizes which can be obtained for cloning to create the expression libraries.
Tag sequences can either be chosen or designed to exhibit a certain amount of predicted or experimentally determined secondary structure, with the goal of optimally presenting the tag sequence for signaling probe binding. As such, tag sequences comprise at least a sequence against which at least one signaling probe is directed and tag sequences may in addition comprise additional sequence which is not chosen or designed to directly interact with the signaling probe. Nucleic acid folding prediction algorithms may be used to design potential tag sequences according to their structural adaptations. See for example, Nucleic Acids Res. 31: 3429-3431 (2003), hereby incorporated by reference. The nucleic acid folding prediction algorithms often predict a number of energetically most favorable structures of a given sequence. Alternatively, libraries of sequences representing variable nucleic acid sequences, for instance including but not limited to digested genomic DNA, can be assayed to determine or identify which sequences act to aid the detection of the tag sequences by signaling probes. This represents a functional approach to identification or isolation of tag sequences. This can be accomplished by creating expression libraries comprising at least a common sequence chosen or designed to be recognized by signaling probe, and at least a variable sequence. Such an expression library can be transfected into cells, the cells can then be exposed to signaling probe, and the most highly positive cells can be isolated by FACS. The variable sequences represented by these cells may then be isolated. For instance, the sequences may be amplified directly from the isolated cells by PCR techniques followed by cloning of amplified products. Alternatively, the isolated cells may be lysed to result in the release of the DNA constructs corresponding to the variable sequences expressed in the isolated cells, and these constructs or vectors may be isolated and propagated from the resulting material. For instance, in the case where the constructs are plasmid vectors, the lysed material may be used to transform competent bacterial cells, followed by isolation and amplification of the plasmid using bacterial hosts. For tag sequences incorporating multiple repeated sequence units, each of these units may not necessarily adopt the same structure due to potential interactions between the repeated units and/or other sequence present in the molecule incorporating the tag sequence. The structure of any given tag sequence could be influenced by its sequence context.
In one embodiment, the tag sequence is derived from reverse-vav RNA. In one embodiment, the tag sequence forms a three-arm junction structure (
In one embodiment, the stem region comprises 8-9 basepairs, the first stem-loop region comprises 4-6 basepairs and the second stem-loop region comprises 13-17 basepairs (
The present invention also provides a DNA construct comprising at least one DNA encoding at least one RNA of interest and a tag sequence as described above. The invention also provides vectors and cells comprising the DNA construct.
In a further embodiment, the cell line may be made to express at least a second RNA; the steps further including transfecting the cell line with a second DNA construct encoding the second RNA and a second tag sequence, and optionally, a second selectable marker, e.g., a drug resistance marker; optionally, selecting for cells transcribing the second marker; exposing the cells to a second signaling probe that produces a detectable signal upon hybridization with the second tag sequence, and isolating the cells that exhibit the signal of both or at least one of the first and the second tag sequence. In the case of two RNAs, the portion of the DNA sequence encoding the second tag sequence may also be in frame or out of frame with the portion of the DNA sequence encoding the protein-coding portion of the second RNA. The second RNA may be transfected either simultaneously or sequentially with the first. Should the method be performed simultaneously, and the same selectable marker, e.g., a drug resistance marker is used for both constructs, a higher level of drug or appropriate selective agent may be used to select for cells expressing both constructs. Furthermore, more than two RNAs may be provided in the cell line by repeating the aforementioned steps.
A plurality of cell lines may be generated by repeating the above steps in a simultaneous or sequential fashion using DNA constructs that form an expression library. In one embodiment, the expression library uses a single tag sequence, and cells are exposed to the same signaling probe which is complementary to the tag sequence. In another embodiment, different expression libraries can use different tag sequences. In one embodiment, each cell line expresses one RNA. In another embodiment, each cell line expresses more than one RNA. This can be achieved by transfecting the cell with multiple DNA constructs sequentially or simultaneously. The likelihood of obtaining a cell stably transfected with multiple DNA constructs may be increased by introducing a higher concentration of DNA constructs used for transfection. Alternatively, the starting cells may have already been transfected with a first DNA construct to obtain a cell with multiple DNA constructs. Also, multiple different expression libraries may be used to transfect cells. Each library may incorporate a distinct tag sequence that is detected using corresponding signaling probes. Each expression library may comprise a selectable marker, e.g., a drug resistance gene.
Isolated cells may be grown individually or pooled. Individually isolated or pooled cells may be grown to give rise to populations of cells. Cell lines may be generated by growing individually isolated cells. Individual or multiple cell lines may be grown separately or pooled. If a pool of cell lines is producing a desired activity, it can be further fractionated until the cell line or set of cell lines having this effect is identified. This may make it easier to maintain large numbers of cell lines without the requirements for maintaining each separately.
Yet another method is provided for generating a cell line that overexpresses an RNA comprising the steps of:
This method may further comprise the step of growing the isolated cells to generate a cell line expressing or overexpressing the RNA. A plurality of cell lines may be generated if the DNA construct is integrated at different locations in the genome of the transfected cell. Unless genomic integration of a transfected construct is directed to a particular location with the genome, integration is through to occur randomly, so each positive cell may be different from another, and there would be multiple different cell lines all positive for the RNAs for which they are selected. Optionally, the DNA construct may further encode at least a selectable marker, e.g., a drug resistance marker, and the method may further comprise the step of selecting cells using the selectable marker, e.g., selecting for resistance to at least one drug to which said marker confers resistance after step a). Whenever cells are isolated, whether following transfection with one or more constructs or one or more expression libraries, the isolated cells may be grown separated from each other, or pooled.
In one embodiment, the cells express an antisense or siRNA or shRNA or protein. In addition, cells made to express a particular protein or proteins may be used as the starting point for creating cells expressing proteins and antisense RNA molecules. Of course, the cells expressing the antisense or siRNA or shRNA molecules may be used as the starting point for adding additional RNAs encoding additional proteins, using the methods herein. Simultaneous transfection of RNAs encoding proteins and antisense or siRNA or shRNA molecules, with corresponding signaling probes and, if desired, tag sequences, may also be performed. The various combinations of the aforementioned procedures is embraced herein.
Likewise, methods are provided for isolating cells expressing at least one RNA comprising the steps of:
The method may also be used to identify cells also expressing a second RNA, using a second signaling probe which produces a detectable signal upon hybridization to its target RNA, cells having fluorescence of both or each of the first and second signaling probes are isolated. Simultaneous expression of more than two RNAs is also achievable. Whenever cells are isolated, whether following transfection with one or more constructs or one or more expression libraries, the isolated cells may be grown separated from each other, or pooled.
Another method is provided for isolating cells expressing at least one exogenous RNA and one endogenous RNA, comprising the steps of:
The above two methods may further comprise the step of generating a cell line or a plurality of cell lines expressing said at least one exogenous RNA and at least one endogenous RNA by growing said isolated cells.
These methods are useful for an RNA that expresses a protein, e.g., a cell surface-localized protein, intracellular protein, secreted protein or other protein. These methods do not require the use of probes for the proteins themselves, which may be more difficult or will affect the cell such that further experiments cannot be performed. More than one RNA encoding a protein can be identified using a plurality of signaling probes, up to the number simultaneously detectable by the technology used for isolation. Optionally, the DNA construct may further encode at least a selectable marker, e.g., a drug resistance marker, and the method may further comprise the step of selecting cells using the selectable marker, e.g., by selecting cells that are resistant to at least one drug to which said marker confers resistance after step a). Isolated cells may be grown separately or pooled. Whenever cells are isolated, whether following transfection with one or more constructs or one or more expression libraries, the isolated cells may be grown separated from each other, or pooled.
For the above methods, in one embodiment, the cells are implantable in an animal. In one embodiment, the signaling probe is a fluorogenic probe, and the cells that express said RNA fluoresce. Isolating said cells that fluoresce may be carried out using fluorescence activated cell sorter technology or any technology that can be used to isolate cells based on fluorescence. In one embodiment, two signaling probes are used to target the RNA or tag sequence. The fluorophore of the first probe may be the same or different from that of the second probe. In the case of two different fluorophores, they may have similar or different emission wavelengths. At present, FACS technology can allow the detection of up to seven different fluorophores during a sorting procedure. The above methods may be repeated simultaneously to obtain the expression of up to seven different proteins. If desired, a cell line expressing seven different proteins may then be used as the starting point for the introduction of more proteins following the procedure. As FACS technology advances, it will be able to resolve a greater number of signals, and one would be able to select for cells having a greater number of RNAs in one application of FACS.
Naturally, the aforementioned procedures may be used to quantify the level of at least one RNA transcript expression in a biological sample comprising the steps of:
The biological sample may be a cellular sample, a tissue sample or preparations derived thereof; these may be frozen and/or fixed, for example, with formaldehyde, glutaraldehyde, or any number of known cellular fixatives which do not interfere with the detection of RNA using signaling probes. Preparations of cellular samples include but are not limited to subcellular organelles or compartments, mitochondria, organelles of cells, subcellular fractions, plasma membrane of cells intracellular or extracellular material, membrane preparations, preparations of nucleic acid from any one or more of these, preparations of any virus or other virus or organism present in any one or more of these, or any combination of one or more of these.
For the embodiment of fluorogenic probes, the fluorescence may be quantitated by fluorescence microscopy or fluorescence-activated cell sorter technology. Additional RNA species may be quantitated (i.e., quantified) simultaneously using a second signaling probe which fluoresces upon hybridization to a second RNA transcript. The above method may be used simultaneously with assays that utilize a fluorogenic reporter for the detection of intracellular events, states or compositions. Such fluorescent assays include but are not limited to TUNEL, Apoptosis, necrosis, Ca2+/Ion flux, pH flux, immunofluorescence, organelle labeling, cell adhesion, cell cycle, DNA content, and assays used to detect interactions between: protein-protein, protein-DNA, protein-RNA. Reagents which may be fluorescently labeled for use in these assays include but are not limited to Proteins (labeled with fluorescent molecules or autofluorecent proteins); fluorescent metabolic indicators (C12 resazurin, CFSE for cell divisions); fluorescent substrates or by-products; fluorescently-labeled lectins; fluorescent chemicals; caged fluorescent compounds; fluorescent nucleic acid dyes; fluorescent polymers, lipids, amino acid residues and nucleotide/side analogues.
Introduction of antisense or siRNA molecules in cells is useful for functionally eliminating or reducing the levels of one or more proteins or RNAs from the cell. Following the above methods, a method is provided for isolating cells or generating cells functionally null or reduced for expression of at least one preselected protein or RNA comprising the steps of providing in said cells a plurality of antisense or siRNA to said preselected protein or RNA, each provided in accordance with the aforementioned methods, wherein said plurality of antisense or siRNA binds essentially all or a sufficient level of mRNA transcripts of said at least one preselected protein or RNA. The preselected protein may be an alternatively spliced form of a gene product.
Following similar lines, a method is provided for generating a transgenic animal that is a functionally null-expressing mutant of at least one preselected protein or RNA, or that expresses said at least one preselected protein or RNA at reduced levels, comprising carrying out the steps described hereinabove utilizing embryonic stem cells, and using said viable embryonic stem cells to produce said transgenic animal.
Likewise, a method is provided for isolating cells or generating a cell line which is functionally null or reduced for expressing at least one protein or RNA and overexpresses at least one other protein or RNA, comprising carrying out the methods herein on the same cells. In similar fashion, a method is provided for generating a cell line expressing a lethal antisense or siRNA under control of a inducible promoter, or a sequence which has some other binding or functional activity that results in the modulation of transcription levels. This can be achieved by carrying out the method herein, wherein the transfection step is performed in the presence of a minimal amount of an inducer or compound.
Therefore, the present invention provides a method for isolating cells that overexpress at least a first protein and which are functionally null expressing or reduced in expression for at least a second protein, comprising the steps of:
17. Identification of Agents that Modulate Expression Levels, Structure, Gene Rearrangements, or Subcellular Localization of RNAs of Interest
In yet another embodiment, the present invention provides a method of identifying a compound that modulates transcription of at least one preselected RNA, comprising the steps of:
In one embodiment, said preselected RNA is encoded by the genome of the cell. In one embodiment, said preselected RNA is encoded by a DNA construct that is transfected into the cells prior to step a). In one embodiment, the transfected cells are exposed to a signaling probe designed to recognize said RNA and the cells express said RNA. In one embodiment, the DNA construct comprises a promoter or operator and encodes a repressor, enhancer, or a sequence that modulates transcription. In one embodiment, the preselected RNA is linked to a tag sequence, and the signaling probe produces a detectable signal upon hybridization with the tag sequence.
In another embodiment, the present invention provides a method of identifying an RNA sequence that modulates transcription of at least one preselected RNA, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent on the level of an RNA of interest within a cell, comprising the steps of:
In another embodiment, the present invention provides method of detecting the effect of an agent on the level of an alternative splice form of an RNA of interest, comprising the steps of:
In some embodiments, a signaling probe can be used to detect a specific three-dimensional RNA structure. As the structural conformation of the RNA changes, different bases may be available for hybridization, thus allowing the development of signaling probes that hybridize specifically to the parts of the RNA made available by the structural conformation or by a change in the structural conformation. The signaling probe may hybridize, for example, to a linear RNA sequence made available by the structural conformation, or to a non-linear RNA sequence in which different sections of the RNA molecule are brought next to each other by the structural conformation.
In one embodiment, the present invention provides a method of detecting the effect of an agent on the level of a selected structure of an RNA of interest, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent in changing the structure of an RNA of interest from a first structure to a second structure, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent in changing the structure of an RNA of interest from a first structure to a second structure, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent in changing the accessibility of an RNA of interest for hybridization with a probe, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent in rearranging an RNA of interest, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent on a gene recombination event in an RNA of interest, comprising the steps of:
Structure prediction algorithms exist for the prediction of structures of nucleic acid sequences, including DNA and RNA (see, e.g., Hofacker, Nucleic Acids Research 31(13):3429-3431 (2003), and Zuker, Nucleic Acids Research 31(13):3406-3415 (2003)). Relatively minor differences in sequence may result in different predicted structures. Such structure prediction algorithms can be used to predict, e.g., the structures of alternative splice forms of an RNA. Probes for the detection of one or a subset of the differentially spliced RNAs can be designed and tested based on structures that are produced by operating the algorithm at different settings (e.g., different temperatures or ionic strength settings). Preferred settings may be, e.g., those used during the hybridization steps of the protocols used to expose the probes to the samples tested.
For differentially accessible sequences, different splice forms, or different RNA structures, the structure prediction algorithm may also be used to predict structural motifs of the sequences that are more stable or less stable depending on the inputted parameters. Probes directed to these sequences may be designed and tested for the selective detection of one or a subset of differentially accessible sequences, different splice forms, or different RNA structures. In some cases, the probes can be tested for their differential detection of one or a subset of specific RNAs by introducing them into cells comprising these RNA forms and comparing the resulting signal to that in control cells that do not comprise the RNA forms, or by introducing them into solutions comprising these RNA forms and comparing the resulting signal to that in control solutions that do not comprise the RNA forms. In some embodiments, the RNA forms to be detected may be tested in cells or solutions that additionally comprise proteins, chemicals, reagents or binding factors that bind or alter the structure or accessibility of the forms compared to control cells or solutions that do not.
In some embodiments, the present invention provides methods of analyzing the effects of an agent on one or more cellular pathways. The methods described herein can be used, e.g., to understand which portions of a pathway are affected by a given agent and how these portions interact; to measure the effects of an agent on targets in a single pathway or in multiple pathways; and/or to measure the effects of an agent on a given pathway, and concurrently, on a control target RNA or pathway. Such methods can be used, e.g., to identify agents that affect a certain pathway, affect multiple pathways, affect specific portions of a certain pathway or pathways, or that differentially affect some pathways while failing to affect others. Dissecting cellular pathways using the claimed methods may provide a better understanding of the mechanism of action of a given agent found to affect an RNA of interest.
In one embodiment, the present invention provides a method of detecting the effect of an agent on the level of expression of one or more RNAs of interest in a biological pathway, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent on the level of expression of two or more RNAs of interest, wherein at least a first RNA of interest is in a different pathway from a second RNA of interest, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent on the level of expression of two or more RNAs of interest, wherein said RNAs of interest function in the same pathway, comprising the steps of:
In embodiments involving two or more RNAs of interest, according to the method, one can detect, for example, an increase in all RNAs of interest, a decrease in all RNAs of interest, or an increase as to some RNAs of interest and a decrease as to other RNAs of interest.
In another embodiment, the present invention provides a method of detecting the effect of an agent in changing the subcellular location of an RNA of interest, comprising the steps of:
In another embodiment, the present invention provides a method of detecting the effect of an agent on the introduction of signaling probes into cells, comprising the steps of:
In any of the embodiments described herein, the signal produced in the absence of the agent may be, e.g., the signal produced in the cells prior to exposure to the agent, or the signal produced in control cells not exposed to the agent.
In any of the embodiments described herein, the agent may be or comprise a compound. For example, the compound may be selected from the group consisting of nucleic acids, proteins, peptides, hormones, signaling molecules, chemical compounds, inorganic or organic chemicals, whole or fractionated extracts from or derived from organisms, tissues or cells, products purified or isolated from cells or organisms, or samples from the environment. In some embodiments, the agent may be or comprise an oligonucleotide. In certain embodiments, the oligonucleotide may cause overexpression of an RNA of interest. For example, the oligonucleotide may bind to and inactivate a negative regulatory element, e.g., a suppresser element, and/or may bind to and activate a positive regulatory element, controlling expression of the RNA of interest. In certain embodiments, the oligonucleotide may knock down expression of an RNA of interest, e.g., by RNA interference, or by binding to and inactivating a positive regulatory element, decreasing or blocking its activity in upregulating the expression of the RNA of interest, or binding to and activating a negative regulatory element that controls expression of the RNA of interest. Regulatory elements include, e.g., proteins and nucleic acid sequences.
In any of the embodiments described herein, the agent may be or comprise, e.g., a mechanical force, an electromagnetic force, a magnetic force, or a change in temperature, pH, reduction potential, or counterion or ionic state.
Any of the embodiments described herein may be used to identify an agent that is or is not able to produce a given effect detected in cells after exposure to the agent as compared to in the absence of the agent. It would be possible, for example, to perform a high-throughput assay testing a multitude of agents (e.g., a library of agents) for the ability to produce the effect in cells, and selecting agents that are or are not able to produce the effect. In one embodiment, for example, probes are designed to target the most stable or consistent sequences within a set of related RNAs. High-throughput screening can then be performed to identify agents that downregulate the class of related RNAs.
In any of the embodiments described herein, the signals produced by the signaling probes may be detected in individual cells, in portions of individual cells, in sub-populations of cells, or in whole populations of cells. The signals can be detected using, for example, fluorescence microscopy, flow cytometry analysis, fluorescence activated cell sorting technology, and whole animal fluorescence microscopy. In certain embodiments, high throughput methods can be used to detect the probe signal, including but not limited to high throughput microscopy, high content imaging, and the use of whole well plate reader detectors (e.g., FLIPR, FDSS, and fluorescence plate readers).
In any of the embodiments described herein in which two or more probes are used, the use of three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more, or, indeed, any number of probes is also contemplated.
In any of the embodiments described herein, the signal(s) produced by the signaling probes may be detected in any one or more subcellular compartments within a cell. Subcellular compartments may include, without limitation, the cytoplasm, nucleus, endoplasmic reticulum (ER), mitochondria, chloroplasts, and Golgi apparatus (GA). For example, in the case of the ER, membrane proteins and secreted proteins may be synthesized in part on ribosomes associated with the ER; in some embodiments. Detecting RNAs encoding these proteins in the ER, or detecting the relative abundance of signal for these RNAs in the ER as compared to in the nucleus, cytoplasm, or both, may provide data regarding the translation of the proteins. In certain embodiments, signaling probes are used to detect the amount of an RNA of interest in each of two or more subcellular compartments, or to detect and/or monitor a change in localization of an RNA of interest from any subcellular compartment to any other subcellular compartment.
In any of the embodiments described herein, the cells may be living, or may be fixed. Fixation methods are well known in the art, and may include the use of, e.g., formaldehyde, glutaraldehyde, alcohols, acetone, or any number of known cellular fixatives which do not interfere with the detection of RNA using signaling probes, including any combinations of fixatives. In some embodiments, fixation may take place, e.g., at room temperature or at −20° C., or at any temperature in between. The cells may be fixed before or after exposure to the signaling probes. In embodiments in which the cells are fixed before exposure to the signaling probes, the signal to noise ratio can be increased, e.g., by adjusting the temperature of the hybridization reaction (for example, to temperatures higher than those used for living cells), the number of washes to decrease unhybridized probe, and the chemical stringency of the hybridization reaction.
In any of the embodiments described herein, the cells may be permeabilized before exposure to the signaling probes. In certain embodiments, the permeabilized cells are fixed. Cells may be permeabilized with reagents including, but not limited to, certain antibiotics, detergents (e.g., digitonin, Triton-X, SDS, etc.), ionophores, toxins, pore forming proteins, or a combination of reagents or solvents, including any combination of the above. At low concentration, digitonin permeabilizes only the plasma membrane, whereas at higher concentrations, it permeabilizes both the plasma membrane and the nuclear membrane. Accordingly, in some embodiments, digitonin may be used to permeabilize cells, and the concentration can be adjusted depending on whether a cytoplasmic signal or a whole-cell signal is desired.
In any of the embodiments described herein, the signals produced by the signaling probes can be amplified or the noise reduced by manipulating one or more of:
In certain embodiments, a replica plate may be made from living cells, where the cells in one plate are subjected to probe hybridization conditions that are lethal to the cells (e.g., fixing, crosslinking, high temperatures, denaturing, etc.), while the living cells in the other plate are maintained and may be identified and used for downstream applications. Such a method may be used for detecting DNA or RNA target sequences. Detection of a sequence at the DNA level may be advantageous, e.g., in situations where the sequence is not transcribed.
In some embodiments, the methods described herein are performed on cells that are in plates, e.g., microwell plates. For example, methods of measuring the effects of an agent can be performed in parallel on cells in plates. The use of plates may help to facilitate, e.g., high throughput assays to detect and/or quantitate the effects of an agent, or to identify an agent with a desired effect.
In some embodiments, the methods described herein are performed on cellular samples, tissue samples, or preparations derived therefrom. The samples may include clinical samples, biopsies, and tissue slices (e.g., brain, tumor, and normal). The samples may be preserved (e.g., samples embedded in paraffin, wax, or various hardening polymers).
In any of the embodiments described herein, the RNA of interest may be or comprise at least one RNA selected from the group consisting of: RNAs transcribed from either coding or non-coding portions of cellular DNA, including lncRNA, lincRNA, piRNA, miRNA, shRNA, and various other RNAs of intermediate length, messenger RNAs that encode proteins, fusion proteins, peptides fused to proteins, export signals, import signals, intracellular localization signals or other signals, which may be fused to proteins or peptides; antisense RNA, siRNA, short RNAs which form hairpin structures that have an activity similar to siRNA; structural RNAs, cellular RNAs including but not limited to such as ribosomal RNAs, tRNAs, hnRNA, snRNA; random RNAs, RNAs corresponding to cDNAs or ESTs; RNAs from diverse species, RNAs corresponding to oligonucleotides, RNAs corresponding to whole cell, tissue, or organism cDNA preparations; RNAs that have some binding activity to other nucleic acids, proteins, other cell components or drug molecules; RNAs that may be incorporated into various macromolecular complexes; RNAs that may affect some cellular function; or RNAs that do not have the aforementioned function or activity but which may be expressed by cells nevertheless; RNAs that mediate RNA interference; RNAs corresponding to viral or foreign RNAs; linker RNA, or sequences that link one or more RNAs; or RNAs that serve as tags or a combination or recombination of unmodified mutagenized, randomized, or shuffled sequences of any one or more of the above. The RNAs of interest may be endogenous or exogenous to the cells in which they are expressed. In particular embodiments, the RNAs of interest may be coding or non-coding RNAs of any length but particularly linear RNAs over 20 base pairs in length and non linear RNAs (e.g., RNAs with significant 3D structure or and/or with internal hybridizations so that they are kinked) over 50 base pairs in length.
In any of the embodiments described herein, the DNA encoding the RNA may be under the control of a constitutive or conditional promoter including but not limited to inducible, repressible, tissue-specific, heat-shock, developmental, cell lineage specific, or temporal promoters or a combination or recombination of unmodified or mutagenized, randomized, shuffled sequences of any one or more of the above. The promoter may exogenous or endogenous, and may be operably linked to an exogenous RNA or an endogenous RNA.
In any of the embodiments described herein, the DNA encoding the RNA of interest may further encode a selective marker, and any of the described methods may then further comprise the step of selecting cells resistant to at least one drug or other selective agent to which said marker confers resistance.
In any of the embodiments described herein, the signals from the signaling probes may be detected at a static point, or may be monitored over a period of time (e.g., at selected timepoints). The length of time will depend on the needs of the method, and can readily be assessed by the skilled worker.
In some embodiments, the signaling probes described herein are used in fluorescence in situ hybridization (FISH) assays to study chromosome structure and function. The described probes may be used, e.g., in ACM-FISH, armFISH, CARD-FISH, catfish, CB-FISH, CO-FISH, COBRA-FISH, COD-FISH, COMBO-FISH, Comet-FISH, Cryo-FISH, D-FISH, DBD-FISH, e-FISH, Fiber-FISH, Flow-FISH, Fusion-Signal FISH, Halo-FISH, Harlequin-FISH, Immuno-FISH, LNA-FISH, M-FISH, Multilocus or ML-FISH, PCC-FISH, PNA-FISH, Q-FISH, QD-FISH, Rainbow-FISH, Raman-FISH, ReD-FISH, Reverse-FISH, RING-FISH, RNA-FISH, RxFISH, Split-SignalFISH, T-FISH, 3-D FISH, or Zoo-FISH techniques, or any combination thereof (see, e.g., Volpi and Bridger, Biotechniques 45(4):385-409 (2008)). In certain embodiments, FISH is used to determine the effect of an agent on chromosomal architecture and/or function.
In any of the embodiments described herein, the cells can be isolated and grown to give rise to cell lines. They may be grown separately or pooled. The modulation of transcription can be downstream up or down regulation of the preselected RNA. In one embodiment, said preselected RNA is encoded by the genome. Said preselected RNA is encoded by a DNA construct that is transfected into the cells prior to step a). In one embodiment, the transfected cells are exposed to a signaling probe designed to recognize said RNA and the cells express said RNA. In one embodiment, the preselected RNA is linked to a tag sequence, and the signaling probe produces a detectable signal upon hybridization with the tag sequence. In one embodiment, an expression library of RNA sequences is used to identify the RNA sequences that modulate transcription. In one embodiment, the test RNA sequence is linked to a tag sequence. Step e) is facilitated by exposing the cells following step a) to at least a signaling probe that produces a detectable signal upon hybridization to said RNA sequences or tag sequences, followed by step b). The signals produced by the signaling probes directed to the test RNA sequence or tag sequence thereof may be different from the signaling probe directed to the preselected RNA, and therefore the cells may be exposed to different signaling probes simultaneously.
A method is also provided herein for identifying genetic recombinational events in living cells comprising the steps of:
The detecting and/or sorting of the cells may be performed by FACS or fluorescence microscope.
18. Identification of Reagents or Compounds to Introduce or Improve the Introduction of Signaling Probes into Cells or to Enhance the Detection of Targets
Reagents that can be used to introduce signaling probes into cells can be identified by testing various chemicals including but not limited to proteins, lipids, polymers, extracts or compounds or mixtures of these. The chemicals may be in gas, liquid or solid form. This can be done by mixing the chemicals with signaling probe in various ratios ranging from 1:1,000,000,000 to 1,000,000,000:1 in various solvents including but not limited to organic and aqueous solvents, buffered solutions or media or any combination of these, where the mixture is incubated for various time periods ranging from 1 minute to 48 hours and where the incubation is performed under various temperature conditions ranging from below 0 to above 100 degrees Celsius and with various degrees of agitation or mixing from none to gently shaking or rocking to vortexing or constant pipetting and where the incubation is carried out in light or dark or under various other environmental conditions. Next, the mixtures would be applied to cells and probe delivery would be assayed using fluorescence microscopy, fluorescence plate reader technology or by FACS or other fluorescence detection method. This may be done for instance in 96, 384 or 1536 well plates or on beads compatible with subsequent high throughput analysis. Note that when using beads, the beads may be analyzed or isolated using FACS and if they are additionally coded isolated beads can be used to determine the identity of the mixtures with which they were treated.
The cells used may be living or fixed and may be presented attached to a solid surface including but not limited to tissue culture plates or beads or they may be in suspension. The cells may be washed using a variety of buffers or solutions prior to addition of the mixtures. Once the reagents have been applied to the cells the reaction may be incubated for various periods of time with variable degrees of agitation of mixing in light or dark or under various other environmental conditions and at various temperatures, all as described above, and where these parameters may be limited for instance if cell viability is to be maintained. Following the incubation, the cells may be analyzed directly or washed as described above prior to analysis.
The above methods may be carried out using a signaling probe or a constitutively active probe (for instance in the case of fluorogenic probes, a probe lacking a quencher may be used). Also, each of the mixtures to be tested may be tested using more than one cell sample. For instance, a mixture may be tested using both cells known to comprise the target of the signaling probe used as well as control cells. The multiple cell types may be presented mixed or separately. Preferred mixtures would be those that result in increased signal to noise, for instance those that result in increased signal in cells comprising the target compared to control cells.
Similarly, essentially the same methods could be performed except where the cells are added to the mixtures. For instance, the mixtures to be tested could first be applied to test chambers for instance to wells of 96, 384 or 1536 well plates or beads, and the cells could be applied next. Cells may be added directly to the plated mixtures or they may added once the mixtures have been processed, for instance by drying, heating or evaporation.
In a related manner, compounds can be tested for their ability to enhance the detection of targets in cells using signaling probes. In this case, signaling probes would first be introduced into cells where the cells could include both cells that are known to comprise the target and control cells. Next, chemicals as described above and using variable conditions and parameters as described above could be assayed for their ability to improve signal. Note that the chemicals used in this case would not be mixed with signaling probes. Preferred chemicals would be those detected or identified to increase signal to noise ratios, for instance those that result in increased signal in cells comprising target compared to control cells. Chemicals identified using this approach may act by a number of mechanisms including but not limited to increasing the delivery of signaling probe into the cell cytoplasm, influencing the folding or structure of the target such that its detection by probe is improved (for instance by causing improved accessibility of the target) or by reducing non-specific background signal (for instance by reducing the amount of signaling probe attached to the outside surface of cells).
Chemicals that act to increase signal to noise ratios that can be used to introduce signaling probes into cells or that can be used to enhance the detection of target may also be determined by using test and control signaling probes each applied to cells known to comprise target and analyzing the chemicals for increased signal when using test signaling probe compared to control signaling probe.
The present invention may be better understood by reference to the following non-limiting Examples, which are provided as exemplary of the invention. The following examples are presented in order to more fully illustrate the preferred embodiments of the invention. They should in no way be construed, however, as limiting the broad scope of the invention. Methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present invention.
General Protocol. Starting Material: Signaling probes may be introduced into cells which are not expressing any RNAs from the DNA construct (i.e., the signaling probes may detect endogenous RNAs), or they may be used to detect RNA messages encoded from the DNA construct. The method of introduction of the signaling probes into either of these two types of cells is identical.
The result is that a pool of cells remain and all of these would be resistant to selective agents, but only a small fraction of which express the RNAs of interest. To select for the cells expressing the desired RNAs, the protocol below may be followed.
Transfection may be carried out by a vast variety of methods, similar to the transfection of the DNA constructs into cells. The method employed should be chosen based on the cell type being used as some cells respond better to some transfection methods over other methods. Transfection should be performed according to the instructions of the manufacturer of the transfection reagent used and may need to be optimized. Optimization may include treatment with chemicals to enhance delivery of transfected material into the cell or cell cytoplasm.
Transfection of signaling probes into cells may be carried out either on cells in suspension or on cells growing on solid surfaces, depending on the cells and transfection reagent used.
Following FACS selection, the positive-scoring cells can be maintained in appropriate medium as described in more detail in the description of the invention. These cells would give rise to cell lines expressing the RNAs of interest.
Concentration of the signaling probe: The concentration of signaling probe to be used depends on several factors. For instance, one must consider the abundance within cells of the RNA to be detected and the accessibility of this RNA to the signaling probe. For instance, if the RNA to be detected is present in very low amounts or if it is found in a portion of the RNA which is not readily accessible based on the three-dimensional folding of the RNA or due to protein binding to the RNA, then more signaling probe should be used here then in cases where the RNA to be detected is in high abundance and where the site recognized by the signaling probe is fully accessible. The exact amount of signaling probe to be used will have to be determined empirically for each application.
This can be accomplished by introducing different amounts of signaling probes into different groups of cells and selecting the condition where background fluorescence is low and where signal is high (the condition where not all but some of the cells score positive for the signaling probe).
Cells attached, partially attached to, or settled on surfaces, or in solution may be exposed to FPs using a variety of methods to introduce molecules into cells, including but not limited to methods known in the art such as microinjection, mechanical shearing forces such as vortexing or mixing, passing through needles, or cell loading techniques including scraping, permeabilization using reagents such as certain antibiotics or detergents or a combination of reagents or solvents, or through use of a variety of transfection reagents of varying chemical properties (for instance liposomal based, chemical, or protein based), by growing the cells on a surface coated with the probes such that the probes are absorbed into the cells, or through a combination of any one or more of these methods.
A more detailed sample protocol describing the use of lipid based transfection reagents is outlined below:
Cells were plated into tissue culture wells either prior to (cell plating method 1) or on the same day as their exposure to (cell plating method 2) FPs, or cells were transferred to microcentrifuge tubes on the same day as their exposure to FPs (cell plating method 3).
All three preparative methods were used successfully. Cell plating method 1 allows sufficient time for the cells to attach to the surface of the culture well, while cell plating method 2 allows cells settled on the plate or attached to varying degrees to be processed depending on how much time is allowed to pass before cells are further processed, and cell plating method 3 allows for cells to be processed directly after they have been transferred to the tubes without allowing any time for them to settle or attach to any surface, although processing may also be carried out after a given amount of time has allowed to pass.
The cells were rinsed once or more with buffer such as serum-free media or PBS, although other buffers may be used. Generally, buffers included MgCl2 at varying millimolar concentrations. The rinsing step may be omitted depending on the method of exposure.
FPs were prepared for addition to cell preparations using commercially available transfection reagents and following protocols as described by the manufacturers. Manufacturers' protocols instruct that multiple parameters need to be empirically determined for multiple variables including which cell types are used, at what confluency or concentration cells should be processed, which specific molecules are to be introduced, in which proportion and absolute amounts various reagents are to be combined, and for what durations various steps are to be carried out or incubated, which steps may be omitted, and other variables. In the manufacturer's protocol, while certain ranges are provided, the protocols also suggest that these may have to be exceeded or other parameters may have to be optimized for successful use of their reagent. In general, the exposure of cells to FPs was carried out using parameters for these conditions which were within the ranges suggested as preliminary ranges by the manufacturers' protocols.
In general, FPs would be added to a tube containing serum-free media and the transfection reagent would be added to a second tube also containing serum free media using volumes and concentrations suggested by the manufacturer. The contents of each tube would be mixed, combined, and incubated for a length of time all as indicated by the manufacturers' protocols. Next, the FP would be applied to the preparations of cells, and the cells would be assayed following various incubation periods.
The protocol for exposing the FPs to cells in
Cells were plated a day before exposure to FPs, at approximately 1×105 cells/ml and at 0.5 ml per well of a 24-well plate. Cells used were cells transfected with and drug selected for an expression construct encoding r-vav.
FP reagents were prepared by incubating 0.625 to 2.5 ul of a 20 uM stock of FP in 50 ul to 200 ul of serum-free media containing from 1 to 4 mM MgCl2 in one tube, and 0.625 to 2.5 ul of TfX50 (Promega) in an equal volume of serum-free media having the same concentration of MgCl2 in a second tube. The contents of each tube would be mixed and then combined and incubated for 15 to 45 minutes at room temperature.
Cells were rinsed one or more times with serum-free media supplemented to the same concentration of MgCl2 used above. The preparation of FP would be applied to the cells and extra serum-free media plus MgCl2 may optionally be added depending on the volume of FP preparation added such that the cells are covered.
Cells were incubated for 3 to 5 hours in a tissue culture incubator and then assayed, and optionally DMSO may be added prior to observation. DMSO may aid in the delivery of the signaling probe into cells or in the folding or presentation of the target to be detected. Other solvents or chemicals may also be used with the aim of increasing delivery of the signaling probe into the cell or cell cytoplasm, or with the aim of improving detection of target by the signaling probe. Solvents and chemicals may be tested for their desirability or suitability by determining whether they result in increased delivery efficiency or increased signal to noise for the detection of target in cells expressing target compared to cells not expressing target, where both cell types are exposed to signaling probe.
The protocol for
The protocol for exposing the FPs to cells in FIGS. 33(A,B,C) are as follows:
Approximately 50 to 100 ul of cells plated at approximately 2.5×105 cell/ml in PBS supplemented with 4 mM MgCl2 (PBS+4) was plated in wells of a 96-well plate, and the cells were exposed to FPs after sufficient time had passed for them to settle on the plate surface but before they had spread.
FP reagents were prepared by mixing 2.5 ul of a 20 uM stock of FP in 100 ul of PBS+4 in one tube and 7 ul of Lipofectamine (InVitrogen) was added to 100 ul of PBS+4 in a second tube. The contents of each tube were mixed and the solutions were incubated for 30 minutes at room temperature before they were combined, mixed and incubated for an additional 15 minutes at room temperature. 50 ul of the FP preparation was added to each well. A rinse of the cells with serum-free media or other buffer is optional. Cells were assayed following incubation in a tissue culture incubator for 2 to 3 hours and then assayed.
The protocol for exposing the FPs to cells in FIGS. 33(D,E) were essentially the same as for
Cells were exposed to FPs as described above and then processed to detach them from surfaces if they were attached and separate them from each other (for instance by using trypsin, although other enzymatic or non-enzymatic procedures may be used) and then they would be applied to FACS.
Following exposure to FPs, the FP containing solution would be removed from the cells and trypsin would be applied directly to the cells. Rinsing of the cells prior to this step using a variety of buffers or reagents may be used, including reagents designed to remove any FP reagent or other reagent which may have associated with cell surfaces during the exposure of the cells to FPs. The cells would be resuspended in buffer (for instance media containing serum and magnesium), and the cells would be further dispersed for instance by pipeting.
The cells were then analyzed by FACS according to standard FACS methods. For analysis of the cells, by comparing the fluorescence intensities of control cells and cells potentially expressing target sequences for the presence of these sequences using FPs, one can determine the background fluorescence of cells having undergone this procedure to determine if the cells potentially expressing targeted sequences show any changed fluorescence. Cells can be isolated either individually or in batch based on this information. For instance, existing technology enables the direct isolation of desired cells into unique wells of 96-well plates, or multiple desired cells may be obtained.
By increasing or decreasing the threshold of fluorescence intensity in FACS, cells exhibiting different levels of fluorescence signal may be isolated. One may wish to either set the threshold very high to have the high assurance that the cells are indeed positive or one may set a lower threshold if the purity of positive cells in the population that is obtained is not critical, for instance in the case where one would simply like to enrich the isolated cells for positive cells.
One can monitor the expression levels of one or more genes in cells over time. For instance, given a cell line derived from cells isolated as positive for expression of a sequence using a FP designed to recognize the sequence, one can expose the cells and control cells to the FP and determine the ratio of fluorescence intensities of the two cell types for the signal emitted by the FP. This procedure can be repeated over time and changes in the ratio will reflect changes in relative levels of expression for the sequence being detected by the FP. This procedure may be carried out using multiple FPs.
The sequence for tagl was used to generate two additional different sequences such that each of the sequences could be recognized by a unique signaling probe, with the intention of generating sequences having the same or similar predicted structure particularly with respect to the regions of the structure most directly involved in binding the signaling probe such that the target sequence of the signaling probe would be similarly presented for binding. For each of the two different tag sequences (tag2 and 3) that were generated, the targeted sequence within the original sequence for tagl was first changed, and compensatory changes were made to some additional bases predicted to interact with the changed sequences in an effort to preserve the interactions at these same positions (
FP17 is predicted to form a 7 base pair mutually complementary region adjacent to the interactive pair. Elevated temperatures was required for FP 17 to give a stronger fluorescence signal when it was incubated in the presence of target oligo compared to control oligo. For instance, by briefly placing them in hot water of 90 to 95° C. The tubes used here contained 16 ul total consisting of 5 ul of a 20 uM FP stock, 1.5 ul 25 mM MgCl2, 8 ul 20 uM oligo, and 1.5 ul of water, having a final magnesium concentration of approximately 2.34 mM. The target oligo used was TO-FP1 and the control oligo used was TO-FP18 as described above.
Fifteen different chemically modified probes based on the FP1 probe sequence were synthesized. All of these probes have the same sequence and are directed against the same target sequence. The probes were introduced into 293T cells. These cells express a tag sequence that includes the target sequence of FP1. FACS was used to analyze the fluorescence from these cells. The fifteen different probes are described below (SEQ ID NOS: 6-8, 2 and 10-20 are disclosed respectively in order of appearance). The results of the FACS analysis are shown in
The instant application contains a Sequence Listing which has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Aug. 7, 2013, is named “0022980035101.TXT” and is 27,639 bytes in size.
This application is a continuation-in-part of U.S. patent application Ser. No. 10/589,052, for which the 371(c) date is Sep. 8, 2008, which is a national stage entry of International Patent Application No. PCT/US05/05080, filed Feb. 17, 2005, which claims priority from U.S. Provisional Application No. 60/546,075, filed Feb. 18, 2004. The disclosures, including the claims, of all the aforementioned priority applications are incorporated by reference in their entirety herein.
Number | Date | Country | |
---|---|---|---|
60546075 | Feb 2004 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10589052 | Sep 2008 | US |
Child | 13839685 | US |