The present invention relates to a method for the detection of a specific nucleic acid in a sample or into a cell. Specifically, the invention provides a method for detecting the presence of a specific nucleic acid using engineered DNA-binding domains such as Transcription Activator Like-Effector (TALE) domain. The method of the invention is particularly useful for in vitro diagnostic application.
Transcription activator-like effectors (TALEs), a group of bacterial plant pathogen proteins, have emerged as new engineerable scaffolds for production of tailored DNA binding domains with chosen specificities. Interest in these systems comes from the apparent simple cipher governing DNA recognition by their DNA binding domain (Boch, Scholze et al. 2009; Moscou and Bogdanove 2009). The TALE DNA-binding domain is composed of multiple TALE repeats that individually recognize one DNA base pair through specific amino acid di-residues (repeat variable di-residues or RVDs). The remarkably high specificity of TALE repeats and the apparent absence of context dependence effects among repeats in an array, allow modular assembly of TALE DNA binding domains able to recognize almost any DNA sequence of interest. Within the past two years, engineered TALE DNA binding domains have been fused to transcription activator (dTALEs) (Morbitzer, Elsaesser et al. 2011), repressor (Cong, Zhou et al. 2012) or nuclease domains (TALENs) (Christian, Cermak et al. 2010) and used to specifically regulate or modify genes of interest (Bogdanove and Voytas 2011). Recently, TALE DNA binding domains have been also used to specifically target a native section of a chromosome for purification and proteomic analysis (Byrum, Taverna et al. 2013). More recently, further modular base per base nucleic acid binding domain referred as MBBBD have been discovered with a similar ability to bind 10 to 20 base pairs nucleic acid target sequence upon expression into cells (PCT/US2013/051783).
The detection of specific nucleic acid sequence has been largely used for a wide range of applications including as non limiting examples in vitro diagnostic, indication of the presence of an organism, research in the field of molecular biology. In particular, detection of a specific nucleic acid sequence in a physiological sample constitutes, at the present time, the major line of development of diagnostic methods, e.g. for identifying the probability of bacteria of developing antibiotic resistance, genetic abnormalities, the risks of cancer associated with genetic modifications and viral infections.
A number of procedures are presently available for the detection of specific nucleic acid molecules. Several techniques have been developed over the years, all relying on the detection of a hybrid molecule between the target nucleic acid molecule and a specific labeled probe such as in situ hybridization which allows the detection and localization in a cell of a specific DNA sequence using labeled probes.
Today, the most widely used methods to detect nucleic acids are based on the polymerase chain reaction (PCR). Real-time PCR for example is used to amplify and simultaneously quantify a targeted DNA molecule. As well as PCR, a variety of other techniques have been developed for detection and amplification of specific sequences. For direct detection, the target nucleic acid is most commonly separated on the basis of size by gel electrophoresis and transferred to a solid support prior to hybridization with a probe complementary to the target sequence (Southern and Northern blotting). Another detection method is based on engineered zinc finger proteins wherein the signal transducer is based on the binding dependent reassembly of a reporter protein, such that no signal should be present unless DNA binding occurs, but the sensitivity of this method need to be improved in order to be used to detect unique sequences in cells (Ghosh, Stains et al, molecular biosystems 2006; Kim, Segal et al. 2011, NAR).
However, all the methods developed so far suffer from serious drawbacks. In particular, amplification of the target sequence is time consuming, increases the probability of errors, and is highly prone to contamination. Moreover, they all make use of labeled nucleotides, thus contributing to seriously increasing the overall costs. To overcome these drawbacks, the inventors sought to develop a rapid test using engineered DNA binding domain such as TALE to detect a specific nucleic acid sequence. They develop a low-cost method, simple to operate and which is adapted for use in low resource settings.
The present invention relates to a new rapid diagnostic tool based on DNA binding molecule, preferably TALE-protein. The inventors develop a method of detecting a nucleic acid sequence of interest comprising: providing nucleic acids which can comprise a nucleic acid sequence of interest; providing at least one DNA binding domain capable of binding said specific nucleic acid sequence; contacting nucleic acids with said DNA binding domain; removing unbound DNA-binding domain and nucleic acid and detecting the presence or the absence of said nucleic acid sequence of interest. Said nucleic acid sequence of interest can be a double strand nucleic acid or a single strand nucleic acid. The single strand nucleic acid is detected once hybridized with a complementary sequence. Said nucleic acid can be a DNA, RNA or DNA/RNA heteroduplex. One or more DNA binding domains which bind each different part of the nucleic acid sequence of interest can be used in this method. This method can be realized in solution with or without the help of a solid support. Different detection techniques can be used, included as non limiting example direct protein detection (e.g. immunohistochemistry, immunoassay), detection of reporter molecule fused to DNA binding domain or labeled nucleic acid sequence (e.g. fluorescent protein, tag, chemiluminescent proteins, enzyme), nucleic acid detection (e.g. PCR, hybridization) or indirect detection by comparing the result with standard sample comprising known amounts of a purified nucleic acid or protein (e.g. bovine serum albumin (BSA)), by a competitive protein binding assay.
The present invention relates to a new nucleic acid diagnostic tool based on DNA binding molecule, preferably TALE-protein. In particular, the present invention relates to a method of detecting a nucleic acid sequence of interest comprising:
Examples of this method are represented in
The nucleic acid sequence of interest can be a genetic sequence present within a cell, such as, for example a gene or a non-coding sequence within or adjacent to a gene. The nucleic acid sequence of interest can be present in a chromosome, an episome, an organellar genome such as mitochondrial or chloroplast genome or genetic material that can exist independently to the main body of genetic material such as an infecting viral genome, plasmids, episomes, transposons for example. A sequence of interest can be within the coding sequence of a gene, within transcribed non-coding sequence such as, for example, leader sequences, trailer sequence or introns. Said nucleic acid sequence of interest can be a double or a single strand nucleic acid sequence. As TALE DNA binding domains have the ability to bind DNA-RNA heteroduplex with a similar affinity than double strand DNA, in the present invention nucleic acid can be DNA, RNA or DNA-RNA heteroduplex.
In another particular embodiment, said nucleic acid sequence of interest can be a single strand nucleic acid sequence. In particular, to detect a single strand nucleic acid sequence of interest (pray sequence), a nucleic acid sequence (bait sequence) complementary to at least to a part of the sequence of interest is provided. Among nucleic acids, only the specific pray sequence hybridized to the bait sequence. To detect the pray:bait sequence, at least one specific DNA binding domain capable of binding said hybridized sequence is provided. As described above, after contacting said DNA binding domain with nucleic acids and removing unbound DNA binding domain and nucleic acids, the presence or absence of the nucleic acid sequence of interest is detected. As TALE DNA binding domains display a much lower affinity for single stranded nucleic acid than for their double stranded counterparts, the low affinity binding of TALE domain with single strand DNA (ssDNA) avoids the formation of TALE:ssDNA complex rendering this method particularly sensitive.
In other words, the method of detecting a single-strand nucleic acid sequence of interest, named “pray” comprises:
Examples of this method are represented in
According to the invention, the nucleic acid sequence named bait comprises a single-strand complementary sequence of at least a part of the pray such that the two single-strand oligonucleotides can hybridize together. By hybridization sequence is meant the sequence part of the oligonucleotide that can hybridize to one of the other oligonucleotides under standard low stringent conditions. Such conditions can be for instance at room temperature for 2 hours by using a buffer containing 25% formamide, 4×SSC, 50 mM NaH2PO4/Na2HPO4 buffer; pH 7.0, 5×Denhardt's, 1 mM EDTA, 1 mg/ml DNA+20 to 200 ng/ml probe to be tested (approx. 20-200 ng/ml). This can be also predicted by standard calculation of hybridization using the number of complementary bases within the sequence and the content in G-C at room temperature as provided in the literature.
Preferentially, the hybridization sequences are complementary to each other pursuant to the complementarity between two nucleic acid strands relying on Watson-Crick base pairing between the strands, i.e. the inherent base pairing between adenine and thymine (A-T) nucleotides and guanine and cytosine (G-C) nucleotides. Accurate base pairing equates with Watson-Crick base pairing includes base pairing between standard and modified nucleosides and base pairing between modified nucleosides, where the modified nucleosides are capable of substituting for the appropriate standard nucleosides according to the Watson-Crick pairing. The complementary sequence of the single-strand oligonucleotide can be any length that supports specific and stable hybridization between the two single-strand oligonucleotides under the reaction conditions.
In one embodiment, said pray can be a single strand nucleic acid sequence which comprises a complementary sequence to a part of the pray sequence. In another embodiment, said bait can be a nucleic acid sequence comprising a double-stranded sequence and a single strand sequence which comprises a complementary sequence to a part of the pray sequence (see
Sample:
The nucleic acid sequence of interest in the present invention can be detected in situ within a live cell. The DNA-binding domain, preferably polynucleotides encoding said DNA binding domain can be introduced into the live cells by transformation methods well-known in the art including as non limiting examples electroporation, microinjection, sonoporation, particle bombardment, liposome transfection and delivery via viral vector. The specific nucleic acid sequence in the present method can be also detected in situ within a cell or tissue, preferably fixed on microscope slides.
In particular embodiment, the nucleic acid sequence of interest of the present invention can be present in a sample. The sample of the invention is any type of sample which may contain the desired nucleic acid species, such as e.g. a reaction mixture. It may also be, for example, a biological sample. A “biological sample” may be any sample which may contain a biological organism, such as, for example, bacteria, viruses, plants, yeasts etc. A “biological sample” according to the invention also refers to a sample which may be obtained from a biological organism, such as a cellular extract obtained from bacteria, viruses, plants, yeasts etc. in particular, a biological sample may be any sample taken from a subject, such as a serum sample, a plasma sample, a urine sample, a blood sample, a lymph sample, or a biopsy. Preferred biological samples for the detection of a specific nucleic acid sequence among genomic sequences include samples such as a blood sample, a plasma sample, a lymph sample, or a biopsy. Preferably, the biological sample is a blood sample. A blood sample can be used from a cancer patient and, for example, used to characterize a liquid tumor by the method of the invention. A “biological sample” as used herein also includes a solid cancer sample of the patient to be tested, when the cancer is a solid cancer.
The sample may thus be any material containing nucleic acid within such cells, including for example foods and allied products, clinical and environmental samples. Thus, the sample may be a biological sample, which may contain any viral or cellular material, including all prokaryotic or eukaryotic cells, viruses, bacteriophages, mycoplasmas, protoplasts and organelles. Such biological material may thus comprise all types of mammalian and non-mammalian animal cells, plant cells, algae including blue-green algae, fungi, bacteria, protozoa etc. Representative samples thus include whole blood and blood-derived products such as plasma or buffy coat, urine, faeces, cerebrospinal fluid or any other body fluids, tissues, cell cultures, cell suspensions etc., and also environmental samples such as soil, water, or food samples. The sample may also include relatively pure or partially purified starting materials, such as semi-pure preparations obtained by other cell separation process
By “DNA binding domain”, it is meant a protein domain capable of binding a nucleic acid sequence of interest, preferably a double strand nucleic acid molecule. The DNA binding domain recognizes and binds nucleic acid at specific polynucleotide sequences, further referred to as “nucleic acid target sequence”. In a preferred embodiment, said DNA binding domain is derived a Transcription Activator like Effector (TALE) engineered to bind a specific nucleic acid target sequence.
TALEs are proteins from the bacterial species Xanthomonas which comprise a plurality of repeat sequences, each repeat comprising di-residues in position 12 and 13 (RVD) that are specific to each nucleotide base of the nucleic acid targeted sequence. TALE binding domain is composed by a variable number of 33-35 amino acid repeat modules. These repeat modules are nearly identical to each other except for two variable amino acids located at positions 12 and 13 (i.e. Repeat Variable Di residues, RVD). The nature of residues 12 and 13 determines base preferences of individual repeat module. Preferably, RVDs associated with recognition of the different nucleotides are HD for recognizing C, NG for recognizing T, NI for recognizing A, NN for recognizing G or A, NS for recognizing A, C, G or T, HG for recognizing T, IG for recognizing T, NK for recognizing G, HA for recognizing C, ND for recognizing C, HI for recognizing C, HN for recognizing G, NA for recognizing G, SN for recognizing G or A and YG for recognizing T, TL for recognizing A, VT for recognizing A or G and SW for recognizing A. In another embodiment, critical amino acids 12 and 13 can be mutated towards other amino acid residues in order to modulate their specificity towards nucleotides A, T, C and G and in particular to enhance this specificity.
Binding domains with similar base-per-base nucleic acid binding properties (modular base-per-base specific nucleic acid binding domains (MBBBD) can also be derived from new modular proteins recently discovered by the applicant in a different bacterial species (PCT/US2013/051783). Said MBBBD can be engineered, for instance, from the newly identified proteins, namely EAV36_BURRH, E5AW43_BURRH, E5AW45_BURRH and E5AW46_BURRH proteins from the recently sequenced genome of the endosymbiont fungi Burkholderia Rhizoxinica (Lackner, Moebius et al. 2011; Lackner, Moebius et al. 2011).
In another embodiment, the method according to the present invention relates to a multiplex detection method (
In a particular embodiment, the bound of the nucleic acid sequence of interest with the DNA binding domain (optionally via the bait) can be realized in solution or on a solid support.
In a particular embodiment, said DNA-binding domain and/or said nucleic acid sequence is fixed on a solid support (see for examples
Immobilisation may be achieved in any number of ways, known in the art, described herein, and/or as can be developed. For example, immobilisation may involve any technique resulting in direct and/or indirect association of an analyte (and its corresponding antagonist) with the substrate, including any means that at least temporarily prevents or hinders its release into a surrounding solution or other medium. The means can be by covalent bonding, non-covalent bonding, ionic bonding, electrostatic interactions, Hydrogen bonding, van der Waals forces, hydrophobic bonding, or a combination thereof. For example, immobilisation can be mediated by chemical reaction where the substrate contains an active chemical group that forms a covalent bond with the second binding partner. For example, an aldehyde-modified support surface can react with amino groups in protein receptors; or amino-based support surface can react with oxidization-activated carbohydrate moieties in glycoprotein receptors; a support surface containing hydroxyl groups can react with bifunctional chemical reagents, such as N,N dissuccinimidyl carbonate (DSC), or N-hydroxysuccinimidyl chloroformate, to activate the hydroxyl groups and react with amino-containing receptors. In some embodiments, support surface of the substrate may comprise animated or carboxylated polystyrenes; polyacrlyamides; polyamines; polyvinylchlorides, and the like. In still some embodiments, immobilization may utilize one or more binding-pairs to bind or otherwise attach a receptor to a substrate, including, but not limited to, an antigen-antibody binding pair, hapten/anti-hapten systems, a avidin-biotin binding pair; a streptavidin-biotin binding pair, a folic acid/folate binding pair; photoactivated coupling molecules, and/or double stranded oligonucleotides that selectively bind to proteins, e.g., transcriptional factors.
By “detecting a nucleic acid sequence of interest”, it is herein meant all the activities leading directly or indirectly to the obtaining of some information on the presence or absence of a nucleic acid sequence of interest, or eventually the quantity of a nucleic acid of interest including but not limited to, the detection and/or quantification of the DNA binding domain, the detection and/or quantification of at least a part of the nucleic acid molecule and/or the detection and/or the quantification of the DNA binding domain bound to the specific nucleic acid molecule. The detection/and or quantification of a specific nucleic acid sequence can be realized by techniques that allow the specific separation of the nucleic acid sequence bound to DNA-binding domain from nucleic acid which do not have bound DNA binding domain thereon.
The detection and/or quantification of the specific nucleic acid sequence can be realized by techniques well known in the art that allow the detection and/or quantification of the DNA binding domain included as non limiting examples, immunoassay, protein immunoprecipitation, mass spectrometry, colorimetry methods (e. g Biuret assay, Lowry assay, bis-cinchinonic acid assay, Bradford assay), amino acid analysis, and/or the detection and/or quantification of the specific nucleic acid sequence included as non limiting examples PCR, real time PCR, sequencing, electrophoresis.
In particular embodiment, the detection and/or quantification of a specific nucleic acid sequence may include the detection of a reporter molecule fused to the DNA binding domain and/or labeled bait nucleic acid of the present invention (as illustrated in
Said nucleic acid can be labeled with several fluorescent protein or enzymes that generate colored products. The nucleic acid used in the present invention can be labeled with more than one distinguishable fluorescent or pigment color. Nucleic acids may be labeled with Texas red, rhodamine and its derivatives, fluorescein and its derivatives, dansyl, umbelliferone and the like or with horse radish peroxidase, alkaline phosphatase, or the like.
Nucleic acid can be labeled directly or indirectly. The common indirect labeling schemes covalently bind a ligand to the nucleotide and prepare labeled nucleic acid by incorporating this using random priming or nick translation. The ligand then binds an anti-ligand which is covalently bound to a fluorescent label. Ligands and anti-ligands vary widely. When a ligand has an anti-ligand, e.g., biotin, thyroxine, or cortisol, the ligand may be used in conjunction with the labeled naturally-occurring anti-ligand. Alternatively, a hapten or antigen may be used in combination with an antibody.
In another embodiment, the detection and/or quantification of the specific nucleic acid sequence can also be based on protein complementation assays wherein a reporter molecule is dissected into two non-functional fragments. Functionality is restored when the fragments are reassembled by attached protein-protein interaction domains. Reporter molecules used in the protein complementation assays can be a β-galactosidase, dihydrofolate reductase, fluorescent protein (e.g. green fluorescent protein, GFP), luciferase, β-lactamase. Detection of the signal from reporter molecule may be done by standard methods known in the art for diagnostic and other detection methods such as fluorescence or colorimetric detection systems. One example of methods according to the present invention using protein complementation assay is illustrated in
FRET (Fluorescence resonance energy transfer) occurs when the emission spectrum of a fluorescent molecule, an energy donor, overlaps with the absorption spectrum of another molecule, an energy acceptor. When the donor is excited, FRET results in reduction of the intensity of donor emission, as energy from the donor in its excited state is transferred to the acceptor. The acceptor can be fluorescent or non-fluorescent. If the acceptor is fluorescent, the intensity of acceptor emission is increased as a result of FRET. A second critical criterion of energy transfer in FRET is close proximity between energy donor and acceptor. In a particular embodiment, the acceptor can be fused to the DNA binding domain and the acceptor can be fused to the nucleic acid or inversely the acceptor can be fused to the nucleic acid and the donor can be fused to the nucleic acid such that when the DNA binding domain specifically binds target nucleic acid the intensity of the acceptor emission is increased. Examples of methods using FRET techniques are illustrated in
The present invention also relates to a DNA binding domain fused to at least one electron transfer peptide. In a more particularly embodiment, the present invention relates to a nanowire which comprises several DNA binding domains fused to an electron transfer peptide which are adjacent on nucleic acid target sequence, such that each electron transporter are regularly disposed along the nucleic acid molecule to allow the transfer of electrons. In a more preferred embodiment said electron transfer peptide is derived from an Iron-Sulfur protein (Fe—S protein). By “a peptide derived from”, it is meant a peptide having an amino acid sequence which comprises at least a part of amino acid sequence of a Fe—S protein, or a variant thereof. Iron-sulfur proteins are proteins characterized by the presence of iron-sulfur clusters containing sulfide-linked di-, tri-, and tetrairon centers in variable oxidation states (WO2012/120013). As non limiting examples, Fe—S protein can be rubredoxin (i. e. NCBI Reference Sequence: WP_013180622.1), ferredoxin, hydrogenase or variants thereof. In another embodiment, the present invention relates to a method of producing said DNA binding domain fused to an electron transfer peptide comprising introducing into a cell at least one polynucleotide encoding said DNA binding domain fusion protein, and expressing DNA binding domain fusion protein into the cell. The present invention also relates to a polynucleotide, an expression vector encoding said DNA binding domain fused to an electron transfer peptide, or a host cell comprising said polynucleotide or DNA binding domain fusion protein.
In the context of the reporter molecule, the DNA binding domain fused to an electron transfer peptide can be used to detect the presence or absence of the DNA binding domain, for example by detecting the production of electrons. One example of methods according to the present invention using TALE DNA binding domain fused to an electron transfer peptide is represented in
“Detecting nucleic acid sequence” can also refer to the quantification of the specific nucleic acid sequence. The quantity of nucleic acid sequence and/or DNA binding domain can be determined by comparing the result with standard sample comprising known amounts of a purified protein (e.g. bovine serum albumin (BSA)) or of nucleic acid. In another embodiment, said nucleic acid can be quantified by a competitive binding assay based for example on the competitive nucleic acid binding between labeled DNA binding domain and unlabelled DNA binding domain, or competitive DNA binding domain binding between labeled nucleic acid and unlabeled nucleic acid.
The present invention can thus be used every time there is a need to detect nucleic acid sequence of interest. Applications include the fields of in vitro diagnostics, including clinical diagnostics, research in the fields of molecular biology, high throughput drug screening, veterinary diagnostics, agricultural-genetics testing, environmental testing, food testing, industrial process monitoring, biosecurity, forensics, and insurance testing. In vitro diagnostics and clinical diagnostics is related to the analysis of nucleic acid samples drawn from the body to detect the existence of a disease or condition, its stage of development and/or severity, and the patient's response to treatment. In high throughput drug screening and development nucleic acids are used similarly to other agents, such as, antigens, antibodies, receptors, etc., to analyze the response of biological systems upon exposure to libraries of compounds in a high sample number setting to identify drug leads. Veterinary diagnostics and agricultural genetics testing involve samples from a non-human animal or a plant species similar to in vitro diagnostics and to provide means of quality control for agricultural genetic products and processes. In environmental testing, organisms and their toxins that indicate the pollution of an environmental medium, e.g. soil, water, air, etc., are analyzed. Food testing includes the quantitation of organisms, e.g. bacteria, fungi, etc., as a means of quality control. In industrial process monitoring, nucleic acids are detected to indicate proper control of a production process and/or to generate a signal if such processes are out of control. In insurance testing organisms and/or their toxins are identified in screening tests to determine the risk category of a client or to help approve candidates. There are various other applications of the detection of nucleic acids and new applications are being developed constantly.
In particular embodiment, the method according to the present invention can be used to detect the presence or absence of a nucleic acid sequence of interest such as a genetic abnormality or a single nucleotide polymorphism (SNP). This system can be used to detect genomic rearrangement in DNA and for identification of highly repetitive sequence. The method can also be used to detect specific nucleic acid, that serve as a unique marker for a disease or type of cancer or to detect the presence or absence of a nucleic acid sequence of an infectious agent such as HIV, influenza, or E. coli. The method according to the present invention can also be used for DNA profiling studies.
The present invention also relates to a kit comprising the DNA binding domain according to the present invention specific to a nucleic acid sequence and preferably fused to a reporter molecule.
The kit according to the present invention can be used for all above-mentioned applications. Particularly, the kit is particularly useful to detect the presence or absence of a specific nucleic acid sequence such as a genetic abnormality or a single nucleotide polymorphism (SNP). The kit can also be used to detect genomic rearrangement in DNA, for identification of highly repetitive sequence, to detect specific nucleic acid that serves as a unique marker for a disease or type of cancer, and to detect the presence or absence of nucleic acid sequence of interest of an infectious agent. Additionally, the kit of the present invention preferably comprises the reagents for extraction of the biological sample to be tested, optionally a resuspension solution and/or a substrate for assaying the presence of the binding event.
Nucleotides are designated as follows: one-letter code is used for designating the base of a nucleoside: a is adenine, t is thymine, c is cytosine, and g is guanine. For the degenerated nucleotides, r represents g or a (purine nucleotides), k represents g or t, s represents g or c, w represents a or t, m represents a or c, y represents t or c (pyrimidine nucleotides), d represents g, a or t, v represents g, a or c, b represents g, t or c, h represents a, t or c, and n represents g, a, t or c.
The term “nucleic acid” refers to deoxyribonucleotide or ribonucleotide polymer, in linear or circular conformation, and in either single- or double-stranded form. The terms can encompass known modified nucleotides.
As used in the context of the present invention, the term “modified nucleotide” refers to a nucleotide that differs in structure from the standard or “unmodified” nucleotides 2 ‘-deoxy-adenosine, 2’-deoxy-thymidine, 2′-deoxy-cytidine and 2′-deoxy-guanosine, and that is capable of pairing with an unmodified nucleotide or a modified nucleotide. Non limiting examples of modifications on the sugar or base moieties of a nucleotide include the addition (or removal) of acetyl groups, amino groups, carboxyl groups, carboxymethyl groups, hydroxyl groups, methyl groups, phosphoryl groups and thiol groups, as well as the substitution of the carbon and nitrogen atoms of the bases with other atoms (e.g., 7-deaza purines). Modified nucleotide also include dideoxy nucleotides, 2′-O-methyl nucleotides, locked nucleic acids (LNA), peptides nucleic acids (PNA), and morpholinos. The nucleotides of the single-stranded nucleic acids may be linked by phosphodiester, phosphorodiamate bonds, or combinations thereof. Such modified nucleotides are useful for instance to introduce markers, tag, epitopes or reactive groups into the genome. The linear single-strand oligonucleotide can also comprise one or more phosphorothioatephosphodiester bonds between terminal base pairs to protect the linear donor polynucleotide from exonucleolytic degradation. These bonds may be in two or more positions at the 5′ and/or 3′ ends of the molecule and may be added during isolation or synthesis using standard methodology. See, e.g. (Ciafre, Rinaldi et al. 1995). Said single-strand oligonucleotide may comprise phosphodiester linkages, phosphorothiorate linkages, phosphoramidite
By “gene” is meant the basic unit of heredity, consisting of a segment of DNA arranged in a linear manner along a chromosome, which codes for a specific protein or segment of protein. A gene typically includes a promoter, a 5′ untranslated region, one or more coding sequences (exons), optionally introns, a 3′ untranslated region. The gene may further comprise a terminator, enhancers and/or silencers
By “delivery vector” or “delivery vectors” is intended any delivery vector which can be used in the present invention to put into cell contact or deliver inside cells or subcellular compartments agents/chemicals and molecules (proteins or nucleic acids) needed in the present invention. It includes, but is not limited to, transducing vectors, liposomal delivery vectors, viral delivery vectors, drug delivery vectors, chemical carriers, polymeric carriers, lipoplexes, polyplexes, dendrimers, microbubbles (ultrasound contrast agents), nanoparticles, emulsions or other appropriate transfer vectors. These delivery vectors allow delivery of molecules, chemicals, macromolecules (genes, proteins), or other vectors such as plasmids. These delivery vectors are molecule carriers.
By “variant(s)”, is meant a protein or a polynucleotide encoding thereof that are obtained by genetic engineering or by random mutagenesis. A variant according to the invention can for example be obtained by deletion, addition or substitution with a different amino acid of at least one residue in the amino acid sequence of their wild-type sequences. Substitution(s), additions(s) and deletion(s) can for example be introduced by directed mutagenesis and/or by random mutagenesis. In the frame aspects of the present invention, variants have the same function and ability than the parent protein or polynucleotide.
By “DNA binding domain”, it is meant a protein domain capable of binding a target nucleic acid sequence, preferably a DNA molecule. The DNA binding domain recognizes and binds nucleic acid at specific polynucleotide sequences, further referred to as “nucleic acid target sequence”. In a preferred embodiment, said DNA binding domain is derived from a Transcription Activator like Effector (TALE).
Binding domains with similar base-per-base nucleic acid binding properties (modular base-per-base specific nucleic acid binding domains (MBBBD) (PCT/US2013/051783) can also be derived from new modular proteins recently discovered by the applicant in a different bacterial species. Said MBBBD can be engineered, for instance, from the newly identified proteins, namely EAV36_BURRH, E5AW43_BURRH, E5AW45_BURRH and E5AW46_BURRH proteins from the recently sequenced genome of the endosymbiont fungi Burkholderia Rhizoxinica.
The DNA binding domain according to the present invention can also be derived from the CRISPR/Cas9 system. Recently, a new genome engineering tool has been developed based on the RNA-guided Cas9 nuclease (Gasiunas, Barrangou et al. 2012; Jinek, Chylinski et al. 2012; Cong, Ran et al. 2013; Mali, Yang et al. 2013) from the type II prokaryotic CRISPR (Clustered Regularly Interspaced Short palindromic Repeats) adaptive immune system (see for review (Sorek, Lawrence et al. 2013)). The CRISPR Associated (Cas) system was first discovered in bacteria and functions as a defense against foreign DNA, either viral or plasmid. CRISPR-mediated genome engineering first proceeds by the selection of target sequence which can be flanked by a short sequence motif, referred as the proto-spacer adjacent motif (PAM). Following target sequence selection, a specific crRNA, complementary to this target sequence is engineered. Trans-activating crRNA (tracrRNA) required in the CRISPR type II systems paired to the crRNA and bound to the provided Cas9 protein. Cas9 acts as a molecular anchor facilitating the base pairing of tracRNA with cRNA (Deltcheva, Chylinski et al. 2011). In this ternary complex, the dual tracrRNA:crRNA structure acts as guide RNA (gRNA) that directs the endonuclease Cas9 to the cognate target sequence. Target recognition by the Cas9-tracrRNA:crRNA complex is initiated by scanning the target sequence for homology between the target sequence and the crRNA. In addition to the target sequence-crRNA complementarity, DNA targeting requires the presence of a short motif adjacent to the protospacer (protospacer adjacent motif—PAM). Recently, it has been demonstrated that Cas9 HNH domain is responsible for nicking of one strand of the target double-stranded DNA and the Cas9 RuvC-like RNaseH fold domain is involved in cleavage of the other strand of the double-stranded DNA target (Jinek, Chylinski et al. 2012). Together, these two domains each nick a strand of the target DNA within the proto-spacer in the immediate vicinity of the PAM, which results in blunt cleavage of the invasive DNA (Jinek, Chylinski et al. 2012). In the present invention the catalytically inactive form of the Cas9 has been used to specifically bind target genes. In particular, Cas9 of the present invention comprises the two catalytic domains HNH and RuvC inactivated and lacks endonucleolytic activity. The resulting Cas9 co-expressed with gRNA yields to specific targeting of the gene of interest.
In addition to the preceding features, the invention comprises further features which will emerge from the following examples illustrating the method of engineering allogeneic and resistant T-cells for immunotherapy, as well as to the appended drawings.
The detection method consists in using a TALE DNA binding domain fused to a Renila Luciferase to detect a single strand DNA (ssDNA) sequence of interest via luminescence emission. It relies on strepavidin coated magnetic beads, a biotinylated ssDNA “bait”, complementary to the sequence “pray” to detect and a TALE-Luciferase detector protein (TALE-LUC) specific for the double strand DNA (dsDNA) pray:bait complex (
The first step of this method consists in anchoring the ssDNA bait onto the streptavidin coated magnetic beads (
This method is expected to avoid false positive signals due to the formation of TALE-LUC:non specific ssDNA or TALE-LUC:mismatched dsDNA complexes. Indeed, on one hand, false positive complex consisting in TALE-LUC bound to ssDNA bait are unlikely because of two reasons. First, as TALE are known to interact with ssDNA with poor affinity, TALE-LUC:non specific ssDNA complex is expected to be disrupted by subsequent washing steps. Second, TALE-LUC are designed to interact with the sense strand (DNA pray) of the dsDNApray:bait complex. Thus, the TALE-LUC:ssDNA bait is not expected to be formed. On the other hand, false positive complex consisting in TALE-LUC bound to mismatched dsDNA are likely to be avoided for two reasons. First, mismatched dsDNA are unlikely to be formed by annealing under optimized conditions and second, they will poorly bind to TALE-LUC because of the intrinsic specificity of TALE DNA binding domain.
In summary, the ssDNA pray to detect is selected among random ssDNA molecules by two types of captures. The first capture is based on highly specific DNA/DNA interaction while the second one is rather based on TALE/DNA interaction. Both captures are expected to prevent non specific interaction between random ssDNA and TALE-LUC and to promote efficient and specific detection of ssDNA pray. Finally, as TALE DNA binding domains have the ability to bind DNA/RNA heteroduplex, the method 1 could be adapted to detect RNA pray.
We chose to assemble a TALE-LUC consisting in the TALE DNA binding domain targeting RAGT2 target (SEQ ID 25) harboring the conventional deltaN152 terminal domains and a C terminal domain consisting in a C11 truncation fused to a Renila luciferase (pCLS 24992, SEQ ID NO 1). Such system will be called TALE RAGT2-Renila detector system. This construction allowed us to express the corresponding protein in E coli and purify it via Nickel affinity column, taking advantage of the His tag present at the Nterm of the construction. TALE RAGT2-Renila detector system was overexpressed and purified to 80% homogeneity as illustrated in
We first sought to assess the TALE RAGT2-Renila detector system sensitivity. To do so, increasing amounts of TALE RAGT2-Renila (from 0 to 10 pmol) were incubated in 96 well plate. Renila reaction mix was then added to each well, the luminescence was measured and plotted as a function of TALE RAGT2-Renila quantity (
We then evaluated the amount of TALE RAGT2-Renila able to proficiently bind dsDNA immobilized onto streptavidine coated plate (96 well plate format). To do so we immobilized wild type biotinylated dsDNA target (SEQ ID NO 2 and 3), incubated it with increasing amounts of TALE RAGT2-Renila and determined the amount of TALE RAGT2-Renila-DNA complex by luminescence. Briefly, about 13 pmol of biotinylated dsDNA wild type (SEQ ID NO 2 and 3) were immobilized on the streptavidine coated plate. Wells were then washed with high and low salt containing TBST buffers (50 mM Tris buffer, 150 mM NaCL, 0.05% TWEEN 20 and 50 mM Tris buffer, 1 M NaCL, 0.05% TWEEN 20 respectively) and increasing amounts of TALE RAGT2-Renila (from 0 to 750 pmol total) were added to each well. Unbound TALE RAGT2-Renila molecules were then removed by two low salt washing steps and the amount of RAGT2-Renila-DNA complex was determined by luminescence. The luminescence obtained in the presence of increasing amounts of TALE RAGT2-Renila is illustrated on
Our result showed that 200 pmol of TALE RAGT2-Renila needed to be added to fully saturate about 13 pmol of immobilized dsDNA. Thus our protein preparation contained about 7% of TALE RAGT2-Renila able to proficiently bind dsDNA. In the experiment described to assess the specificity of the TALE RAGT2-Renila detector system, 300 pmol of TALE RAGT2-Renila will be used systematically (see the next example) to make sure to fully bind the DNA to detect.
After determining the amount of “competent” TALE RAGT2-Renila present in our protein preparation, we investigated its detection sensitivity toward the ssDNA pray. Briefly, decreasing amounts of ssDNA pray wild type (SEQ ID NO 5, from 10 to 0 pmol) were incubated with the ssDNA biotinylated bait (SEQ ID NO 4, 10 pmol) already immobilized on the streptavidine coated plate. The mixture was heated and allowed to cool down to RT. After washing the well with high and low salt containing TBST buffers, an excess of TALE RAGT2-Renila was added to the well (˜300 pmol total), the unbound was then removed by two low salt TBST buffer washing steps and the reaction mixture was added to determine luminescence. The luminescence obtained for each quantity of ssDNA pray used is illustrated in
To assess the influence of exogenous non specific DNA molecule on the ability of TALE RAGT2-Renila detector system to detect its ssDNA pray (SEQ ID NO 5), the same experiment was performed in the presence of 10 pmol of fragmented DNA plasmid (temperature fragmentation resulting in random DNA fragments of 150 bp length). Increasing ssDNA pray quantity in the well, resulted in a proportional increase of luminescence that was highly similar to the one obtained in the absence of exogenous non specific DNA. Such high similarity was constant whatever the amount of ssDNA pray added in the well as demonstrated by the steadiness of the ratio of luminescence obtained in the absence of exogenous DNA versus the one obtained in the presence of exogenous DNA plasmid (
Different ssDNA targets (SEQ ID NO 6-20) harboring unique or multiple mutations with respect to the ssDNA wild type pray (SEQ ID NO 5, 10 pmol) were synthesized and used to assess the specificity of TALE RAGT2-Renila detector system. We first studied the effect of increasing amount of mutation within the ssDNA target on the luminescence of TALE RAGT2-Renila according to the detection protocol described earlier. Briefly, 10 pmol of ssDNA wild type or mutated were incubated with the ssDNA biotinylated pray already immobilized on the streptavidine coated plate. The mixture was heated and allow to cool down to RT. After washing the well with high and low salt containing TBST buffers, an excess of TALE RAGT2-Renila was added to the well (˜300 pmol total), the unbound was then removed by two low salt buffer washing steps and the reaction mixture was added to determine luminescence. The luminescence obtained for each of the ssDNA prays used is illustrated in
Our results showed that one mutation at the TO is sufficient to reduce the luminescence signal by more than 60%. Additional mutations reduced the signal even further indicating that that detector system is sensitive to mono and by extension, multiple mutations present within the ssRNA pray to detect. We then investigated the effect of one single mutation T→C on the luminescence signal using different ssRNA prays (SEQ ID NO 5 and 11-16). Such mutation was intentionally positioned at different locations of the ssRNA pray to study the influence of its position on the luminescence signal alteration (
The principle of ssRNA pray detection is based on the method delineated above to detect ssRNA pray. It includes all the steps (1-4) described above. However, instead of using a biotinylated DNA bait corresponding to the reversed and complementary sequence of TALE RAGT2 target (SEQ ID NO 4), the biotinylated DNA bait used in the method described herein, corresponds to the actual TALE RAGT2 target (SEQ ID NO 21,
According to the protocol described for the detection of ssRNA pray, ssRNA pray (SEQ ID NO 23, 10 pmol) was incubated with ssRNA biotinylated bait (SEQ ID NO 21) already immobilized on a streptavidin coated 96 well plate. The mixture was heated and allowed to cool down to RT. After washing the well with high and low salt containing buffers, an excess of TALE RAGT2-Renila was added to the well (˜300 pmol total), the unbound was then removed by two low salt washing steps and the reaction mixture was added to determine luminescence. The luminescence obtained for each experiment (wild type ssRNA pray, wild type ssRNA and random ssRNA pray, SEQ ID NO 22, 23 and 24 respectively) are illustrated in
Our results showed that the TALE RAGT2-Renila detector system was able to detect 10 pmol of DNA/RNA heteroduplex to the same extent than a DNA/DNA homoduplex. Such detection was specific to the ssRNA wild type pray (SEQ ID NO 23). Indeed the luminescence signal obtained in the presence of random ssRNA (SEQ ID NO 24) was similar to the one obtained in the negative control performed with the biotinylated bait alone (SEQ ID NO 21). Altogether, our data demonstrate that the TALE RAGT2 detector system is able to detect DNA/RNA heteroduplex in a specific and quantitative manner.
Number | Date | Country | Kind |
---|---|---|---|
PA 2013 70578 | Oct 2013 | DK | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2014/071812 | 10/10/2014 | WO | 00 |