The priority of Republic of Korea Patent Application 10-2023-0122320 filed Sep. 14, 2023 is hereby claimed, under the provisions of under 35 USC § 119, and the disclosure of such application is hereby incorporated herein by reference, in its entirety, for all purposes.
This application includes an electronically submitted sequence listing in .xml format. The .xml file contains a sequence listing entitled “744_SeqListing.xml” created on Sep. 10, 2024 and is 11,486 bytes in size. The sequence listing contained in this .xml file is part of the specification and is hereby incorporated by reference herein in its entirety.
The present invention relates to a method for purifying a nucleic acid and a method for analyzing the presence of a target nucleic acid in a sample, and relates to a method for purifying a target nucleic acid from a nucleic acid library to high purity or a method for analyzing the presence of a target nucleic acid in a sample. Specifically, the present invention relates to a method for purifying and analyzing a nucleic acid for selecting a target nucleic acid based on whether a nucleic acid complementary to the target nucleic acid is synthesized.
Nucleic acids are becoming increasingly important. DNA or RNA has nucleic acids as carriers of genetic information, and the discovery of microRNAs, long noncoding RNAs, and chemically modified RNAs, for example, has reshaped our understanding of the importance of RNAs in biological processes and disease, and has led to a growing number of medical uses of RNAs (Cell, 169 (2017), pp. 1187-1200).
Most studies can be determined by high-quality nucleic acids that are free of contaminants. For this reason, research on purification continues. Purification of all or partial nucleic acids, e.g., DNA, RNA (e.g., mRNA, small RNA), or synthetic RNA isolated from a biological sample is typically performed by gel electrophoresis (RNA, 10 (2004), pp. 889-893), liquid chromatography (J. Chromatogr. A, 1216 (2009), pp. 1377-1382), organic extraction (Cold Spring Harb. Protoc, 2010 (2010)), or purification via spin column (PLoS One, 13 (2018), Article e0203011).
In particular, purified nucleic acid sequences are needed to study nucleic acid modifications within specific sequences or to study biomolecular interactions of nucleic acids by single-molecule methods.
Nevertheless, the purification rate of nucleic acids, especially RNA, by current universal purification methods is less than 80%. In particular, purification of RNA with one-base error is currently not possible.
To make RNA useful without side effects, research is needed to separate normal RNA from RNA with one-base error that occurs during synthesis, and to purify or analyze it.
Under this technical background, the inventors of the present invention have developed a technique to identify target RNAs, specifically RNAs with one-base error, and have completed the present invention.
It is an object of the present invention to provide a method for purifying a target nucleic acid from a library comprising the target nucleic acid.
It is an object of the present invention to provide a method for analyzing the presence of a target nucleic acid in a sample.
To accomplish the above objectives, the present invention provides a method for purifying a target nucleic acid, comprising:
The present invention also provides a method for analyzing the presence of a target nucleic acid in a sample, comprising:
According to the present invention, a method for purifying or analyzing biologically useful nucleic acids to high purity can be provided.
Unless otherwise defined, all technical and scientific terms used herein shall have the same meaning as commonly understood by those skilled in the art. In general, the nomenclature used herein is well known and in common use in the art. In general, the nomenclature used herein is well known and commonly used in the art.
The present invention relates to a method of purifying a target nucleic acid, in one aspect, comprising:
The present invention also relates to a method for analyzing the presence of a target nucleic acid in a sample, comprising:
A “target nucleic acid” is a target aiming nucleic acid to be detected, which may include, but is not limited to, RNA or DNA, provided that the nucleic acid comprises a nucleotide as its basic building block. According to the present invention, it is possible to specifically distinguish RNA with a one-base error.
In the case of DNA, it may be synthesized from deoxyribonucleotides, and in a sample containing DNA, it may be DNA that is substantially free of errors or damage, and may be a gene for a desired function in an organism.
In the case of RNA, in a sample containing RNA that is synthesized from ribonucleotides, but in a modified form due to some of the nucleosides introduced during the synthesis process, it may be RNA that is substantially free of errors or damage, and may be a gene for a desired function in an organism. The target nucleic acid may be described interchangeably with the target aiming nucleic acid.
“Purification” may refer to the process of removing erroneous or damaged nucleic acids contained in a library other than the intended target nucleic acid to obtain the target nucleic acid.
“Analysis” may refer to the process of removing erroneous or damaged nucleic acids other than the target nucleic acid from a sample to determine the presence of the target nucleic acid.
A substantially error- or damage-free nucleic acid is a nucleic acid that does not contain errors, which may mean an original nucleic acid having the intended sequence. The target nucleic acid is a substantially error- or damage-free nucleic acid selected from a library comprising the target nucleic acid, which may be a nucleic acid isolated from a nucleic acid containing an error in the library, such as a nucleic acid containing an one-base error.
The library means a set of nucleic acid products and refers to nucleic acids present in the original sample. The library may comprise a nucleic acid comprising an error and a target nucleic acid, and only the target nucleic acid may be selected from the library.
“DNA” means a polynucleotide comprising a deoxyribose sugar and consisting of a purine or pyrimidine base of adenine, thymine, cytosine, or guanine.
“RNA” refers to a polynucleotide comprising a ribose sugar, and usually uracil rather than thymine as one of the pyrimidine bases. RNA can include a single-stranded molecule that is transcribed from DNA and have a linear sequence of nucleotide bases complementary to the transcribed DNA strand.
Specifically, the nucleic acid may comprise a guide RNA. It can contain a single and/or double guide RNA sequence. “dgRNA” refers to a dual guide RNA sequence comprising crRNA (a spacer sequence comprising a seed region complementary to the target sequence) and a separate tracrRNA (transcriptional activation sequence) complementary to the RNA; and “sgRNA” refers to a single guide RNA sequence containing the fused crRNA and tracrRNA.
Specifically, the nucleic acid may comprise pegRNA. pegRNA has an extension arm attached to a traditional guide RNA. In this case, the extension arm includes the extension area. The extension area includes an editing template that is a template for the intended editing, for inserting the intended editing into the target region. “Nuceotide” means a glycoside comprising a sugar moiety, a base moiety, and a covalently linked linker, such as a linker between phosphate or phosphorothioate nucleotides. It can include both naturally occurring nucleotides, such as DNA or RNA, and non-naturally occurring nucleotides, which in some cases include modified sugar and/or base moieties.
The term “ribonucleotide” or “deoxyribonucleotide” can include both natural and synthetic, unmodified and modified ribonucleotides or deoxyribonucleotides. Modifications can include changes to sugar moieties, base moieties, and/or linkages between ribonucleotides or deoxyribonucleotides within an oligonucleotide.
“Polynucleotide” means a nucleotide in polymeric form that includes nucleotides of any length, including deoxyribonucleotides and/or ribonucleotides, or analogs thereof. Polynucleotide can have any three-dimensional structure and can perform any known or unknown function. The structure of polynucleotide can be considered with reference to its 5′ or 3′ end, which indicates the orientation of the polynucleotide. In a strand of polynucleotide, neighboring nucleotides are typically linked by a phosphodiester bond between the 3′ and 5′ carbons. However, other intra-nucleotide bonds can also be used, such as methylene, phosphoramidate bonds, and the like. This means that each of the 5′ and 3′ carbons can be exposed at both ends of the polynucleotide, which can be referred to as the 5′ and 3′ ends or termini. The 5′ and 3′ ends can also be referred to as phosphoryl (PO4) and hydroxyl (OH) ends, respectively, due to the chemical groups attached to those ends.
The polynucleotide may comprise both double-stranded and single-stranded molecules. Examples of polynucleotides include, but are not limited to: a gene or gene fragment (e.g., a probe, primer, EST, or SAGE tag), genomic DNA, genomic DNA fragment, exon, intron, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozyme, cDNA, recombinant polynucleotide, branched polynucleotide, plasmid, vector, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probe, primer, or amplified copy of any of the foregoing. Polynucleotides can include modified nucleotides, such as methylated nucleotides and nucleotide analogs (including nucleotides with non-natural bases, nucleotides with modified natural bases, such as aza- or deza-purines, and the like).
The polynucleotide may be further modified after polymerization by conjugation with a labeling component. The nucleotide sequence of polynucleotide can be interrupted by non-nucleotide components. One or more ends of polynucleotide can interact in a specific way with other polynucleotides (e.g: covalent bond formation) or otherwise modified.
The polynucleotide may comprise a specific sequence of four nucleotide bases: adenine (A), cytosine (C), guanine (G), and thymine (T). Uracil (U) can exist as a natural substitute for thymine, for example, when the polynucleotide is RNA. Uracil can also be used in DNA. Therefore, the term “sequence” refers to the alphabetical representation of a polynucleotide or any nucleic acid molecule, including natural and non-natural bases.
“Primer” means a naturally occurring or synthetically produced single-stranded oligonucleotide that, when placed under conditions that catalyze the synthesis of a nucleic acid strand and a complementary primer extension product or amplicon, can serve as a synthetic initiation point along the complementary strand.
These conditions can include the presence of four different deoxyribonucleoside triphosphates and a polymerization inducer, such as thermostable nucleotide polymerase, DNA polymerase, or reverse transcriptase, in an appropriate buffer and at an appropriate temperature. Primer is preferably single-stranded to maximize amplification efficiency.
The primer binds complementarily to the target RNA. “Complementary” means sufficiently complementary that the primers selectively hybridizes to the target sequence under the prescribed annealing conditions, and includes both substantially complementary and perfectly complementary.
The term “substantially complementary sequence” as used in connection with the primer includes not only fully matched sequences, but also sequences that are partially mismatched to the sequence to be compared, to the extent that they can be annealed to a specific sequence to serve as primers.
“Annealing” or “priming” means the apposition of an oligodeoxynucleotide or nucleic acid to a template nucleic acid, wherein the apposition causes polymerization of the nucleotide to form a nucleic acid molecule complementary to the template nucleic acid or a portion thereof.
The primer should be long enough to prime the synthesis of the extension product. The appropriate length of a primer is determined by a number of factors, such as temperature, application, and the source of the primer, but is typically 15-30 nucleotides. Short primer molecules generally require lower temperatures to form sufficiently stable hybridization complexes with the template.
The design of the primer is facilitated by reference to the nucleotide sequence and can be performed, for example, using commercially available primer design programs.
“Hybridization” refers to the process by which complementary nucleotide sequences anneal to each other; specifically, the single strand of a nucleic acid sequence forms a double-helix fraction through hydrogen bonding between complementary bases.
It refers to the formation of double-stranded nucleic acids from complementary single-stranded nucleic acids. Hybridization can occur when there is a perfect match of complementarity between the two nucleic acid strands, or it can occur even when some mismatched bases are present.
The degree of complementarity required for hybridization can vary depending on the hybridization conditions, which can be controlled by temperature, in particular.
In a hybridization reaction, the conditions used to achieve a strict specific level vary depending on the nature of the hybridized nucleic acids. For example, the length of the nucleic acid sites to be hybridized, the degree of homology, the nucleotide sequence composition (e.g., GC/AT ratio), and the type of nucleic acid (e.g., RNA, DNA) are considered in selecting hybridization conditions.
Examples of very stringent conditions include 2×SSC/0.1% SDS at room temperature (hybridization condition); 0.2×SSC/0.1% SDS at room temperature (low stringency condition); 0.2×SSC/0.1% SDS at 42° C. (moderate stringency condition); 0.1×SSC at 68° C. (high stringency condition). The washing process may be performed using any one of these conditions, e.g., high stringency condition, or each of the above conditions, and these conditions in whole or in part may be repeated in the order described above for 10 to 15 minutes each. However, as described above, optimal conditions vary depending on the particular hybridization reaction involved therein and can be determined by experimentation.
“NTP” refers to nucleoside triphosphates, including not only natural (deoxy) triphosphates but also NTPs that can have additional modifications to the triphosphate side chain (e.g., methyl phosphonate, phosphothioate). It can also include a modified sugar or sugar analog (e.g., a 2′-O-alkyl derivative 3′ and/or 5′ aminoribose, locked ribose, hexitol, altitol, cyclohexene, cyclopentane) or a modified base, for example, 5-methyl-C or a base analog such as 7-deazapurine. The base, sugar, or triphosphate side chains of these NTPs can be linked via linkers to detectable units or reactive groups. Suitable NTPs can include both naturally occurring and synthetic nucleotide triphosphates, and can be, for example, ATP, CTP, GTP, TTP, or ITP, but are not limited thereto.
An “oligonucleotide” may include not only (deoxy)oligoribonucleotide but also oligonucleotide analogs that contain one or more nucleotide analogs modified in the phosphate backbone (e.g., methylphosphonates, phosphorothioates), or modified bases such as 7-deazapurines (e.g., 2′-O-alkyl derivatives, 3′ and/or 5′ amino ribose, LNA, HNA, TCA), or sugars (e.g., 2′-O-alkyl derivatives, 3′ and/or 5′ amino ribose).
“Nucleoside” means a compound consisting of purine, deazapurine, or pyrimidine nucleoside base, for example, adenine, guanine, cytosine, uracil, thymine, deazadenine, deazaguanosine, or the like, linked to pentose at the 1′ position, including the 2′-deoxy and 2′-hydroxyl forms.
With respect to adenine, guanine, cytosine, uracil, and thymine, the symbols listed are as follows:
“Preparation” is an object to be reacted with a hybridization reactant and may include, but is not limited to, any substance that can be a component of the reaction. The preparation comprises a nucleotide triphosphate (NTP) comprising an irreversible blocker.
The step (b) may further comprise at least one blocker-free NTP selected from the group consisting of blocker-free ATP, CTP, GTP, and TTP that binds to the target nucleic acid, or an NTP comprising a cleavable blocker.
Step (b) may comprise sequentially alternating NTPs comprising an irreversible blocker and blocker-free NTPs; or NTPs comprising an irreversible blocker and NTPs comprising a cleavable blocker for each reaction cycle.
With respect to any one of the blocker-free NTPs selected from the group consisting of blocker-free ATP, CTP, GTP, and TTP that binds to the target nucleic acid, the blocker-free NTP may be, for example, an unmodified dNTP. The blocker-free NTP can comprise four deoxyribonucleotide triphosphates: dATP (deoxyadenosine triphosphate), dCTP (deoxycytidine triphosphate), dGTP (deoxyguanosine triphosphate), and dTTP (deoxythymidine triphosphate). The blocker-free NTPs can be used as building blocks in DNA synthesis in general.
The NTP comprising the irreversible blocker may be a nucleotide analog capable of terminating DNA synthesis, for example, a modified dNTP or a dideoxynucleotide. Specifically, NTPs comprising the irreversible blocker can be, but are not limited to, dideoxy nucleoside triphosphate (ddNTP) or pyrrolidinyl nucleoside triphosphate (prNTP).
The ddNTP is dideoxynucleotide, which may include, for example, ddATP, ddCTP, ddGTP, or ddTTP.
The prNTP (pyrrolidinyl nucleoside triphosphate) is a derivative of nucleoside triphosphate, wherein the nucleoside triphosphate is substituted with a 4-6 membered cycloheteroalkyl, for example, azetidinyl, pyrrolidinyl, piperidinyl, morpholinyl, thiamorpholinyl, imidazolidinyl, tetrahydrofuranyl, tetrahydropyranyl, tetrahydrothiophenyl, or piperazinyl. The prNTP may be a derivative in which the nucleoside triphosphate is substituted with a pyrrolidinyl group.
The prNTP may be, for example, prATP, prTTP, prGTP, or prCTP.
In the NTP comprising the irreversible blocker, the remaining NTPs except for one selected from the group consisting of the blocker-free ATP, CTP, GTP, and TTP may comprise an irreversible blocker.
Specifically, when one of ATP, CTP, GTP and TTP is selected as a blocker-free NTP that binds complementarily to a predetermined target nucleic acid, the NTPs other than the selected blocker-free NTP may comprise an irreversible blocker.
For example, when any one is selected from dNTPs that complementarily binds to a predetermined target nucleic acid, for example, deoxyadenosine triphosphate (dATP), deoxycytidine triphosphate (dCTP), deoxyguanosine triphosphate (dGTP), and deoxythymidine triphosphate (dTTP), the NTPs other than the selected dATP, dCTP, dGTP or dTTP may have a ddNTP form.
If the preparation that complementarily binds to a predetermined target nucleic acid comprises dATP, it may comprise an NTP other than dATP, for example, ddCTP, ddGTP, or ddTTP.
For an NTP comprising the cleavable blocker, the remaining NTPs except for one selected from the group consisting of ATP, CTP, GTP, and TTP comprising the cleavable blocker may comprise an irreversible blocker.
The NTP comprising the cleavable blocker may comprise an R (reversible moiety), which may be capped with a 3′-OH group in dNTP.
“Cleavable” means chemically cleavable or photodegradable. For example, ultraviolet light can be used to photochemically cleave a photochemically cleavable moiety.
The moiety cleavable to the 3′-oxygen of deoxyribose in the dNTP may be a substituted or unsubstituted azido, allyl, or 2-nitrobenzyl moiety.
The azido can be represented as N3. The azido can be substituted or unsubstituted. If the azido group is substituted, it may be substituted with hydrogen, alkyl, or cycloalkyl.
“Alkyl” is the monovalent part obtained by removing a hydrogen atom from a carbon atom of an aliphatic or alicyclic, saturated or unsaturated (unsaturated, completely unsaturated) hydrocarbon compound, wherein a saturated alkyl is, for example, methyl, ethyl, propyl, butyl, pentyl, hexyl, heptyl, and the like, a saturated straight-chain alkyl may include, for example, methyl, ethyl, n-propyl, n-butyl, n-pentyl (amyl), n-hexyl, n-heptyl, and the like, a saturated branched-chain alkyl may include, for example, isopropyl, isobutyl, sec-butyl, tert-butyl, isopentyl, neopentyl, and the like.
“Cycloalkyl” indicates the monovalent portion of a cyclic hydrocarbon compound obtained by removing a hydrogen atom from the alicyclic ring atom. Examples of cycloalkyl groups may include saturated single-ring hydrocarbon compounds such as cyclopropane, cyclobutane, cyclopentane, cyclohexane, cycloheptane, methylcyclopropane, dimethylcyclopropane, methylcyclobutane, dimethylcyclobutane, methylcyclopentane, dimethylcyclopentane, or methylcyclohexane; or unsaturated single-ring hydrocarbon compounds such as cyclopropene, cyclobutene, cyclopentene, cyclohexene, methylcyclopropene, dimethylcyclopropene, methylcyclobutene, dimethylcyclobutene, methylcyclopentene, dimethylcyclopentene, or methylcyclohexene.
The NTP comprising the cleavable blocker can include, for example, one or more selected from the group consisting of 3′-O-azidomethyl-dNTP, 3′-ONH2-dNTP, 3′-O-allyl-dNTP, and 3′-O-2-nitrobenzyl-dNTP, but is not limited thereto.
The 3′-O-azidomethyl-dNTP can be 3′-O-azidomethyl-dATP, 3′-O-azidomethyl-dCTP, 3′-O-azidomethyl-dGTP, or 3′-O-azidomethyl-dTTP.
The 3′-ONH2-dNTP may comprise 3′-ONH2-dATP, 3′-ONH2-dCTP, 3′-ONH2-dGTP, or 3′-ONH2-dTTP.
The 3′-O-allyl-dNTP can comprise 3′-O-allyl-dATP, 3′-O-allyl-dCTP, 3′-O-allyl-dGTP, or 3′-O-allyl-dTTP.
The 3′-O-2-nitrobenzyl-dNTP can include 3′-O-2-nitrobenzyl-dGTP, 3′-O-2-nitrobenzyl-dCTP, 3′-O-2-nitrobenzyl-dATP, or 3′-O-2-nitrobenzyl-dTTP.
(i) The nucleotide triphosphate (NTP) comprising an irreversible blocker and (ii) a blocker-free NTP; or (i) an NTP comprising an irreversible blocker and (iii) an NTP comprising a cleavable blocker may be introduced into an extending strand of DNA.
The blocker can be removed by chemical or photocleavage reactions to allow for the next step of DNA introduction. Among the various 3′-OH modifications, considering the introduction efficiency, ease of cleavage with agents that do not damage DNA, and specificity, the 2-nitrobenzyl group can be cleaved by exposure to UV light, the allylic group can be cleaved via a Pd-catalyzed reaction, and the azidomethyl group can be cleaved by treatment with tris-(2-carboxyethyl)phosphine (TCEP).
The present invention provides a method for purifying a target RNA from a library comprising the target RNA, wherein the method can be applied to RNA by multiplex oligonucleotide library purification by synthesis and selection (MOPSS). The present invention includes the steps of (a) hybridizing a library comprising a target nucleic acid with a primer that binds complementarily to the target nucleic acid.
The library may comprise a nucleic acid comprising an error and a target nucleic acid. By hybridizing a primer that complementarily binds to the target nucleic acid, the primer having a sequence complementary to the target nucleic acid may bind.
The primer complementarily binds to the target nucleic acid and may be further linked with a dye to confirm binding. The dye may be ligated to the 5′ end of the primer.
The dye is not limited as long as the binding to the target RNA can be identified via fluorescence or light, but includes, for example, FITC, Alexa Fluor 488, GFP, CFSE, CFDA-SE, DyLight 488, PE, PI, PerCP, PerCP-Cys 5.5, PE-Alexa Fluor 700, PE-Cy5 (TRI-COLOR), PE-Cy5.5, PE-Alexa Fluor 750, PE-Cy7, APC, APC-Cy7, APC-eFluor 780, Alexa Fluor 700, Cys5, Draq-5, Pacific Orange, Amine Aqua, Pacific Blue, DAPI, Alexa Fluor 405, eFluor 450, eFluor 605 Nanocrystals, eFluor 625 Nanocrystals, or eFluor 650 Nanocrystals.
In specific examples according to the present invention, Cy5 may be linked to the 5′ end of the primer so that the resulting product can be identified by gel electrophoresis in a separate wavelength band.
The present invention comprises the step of reacting the hybridization reactant of step (a) with a preparation comprising (i) a nucleotide triphosphate (NTP) comprising an irreversible blocker and/or (ii) a blocker-free NTP selected from the group consisting of blocker-free ATP, CTP, GTP, and TTP that binds to the target nucleic acid.
In the step, the hybridization reactant generated by hybridizing a library comprising a target nucleic acid with a primer that binds complementarily to the target nucleic acid in step (a) is reacted with a preparation comprising (i) a nucleotide triphosphate (NTP) comprising an irreversible blocker and/or (ii) a blocker-free NTP selected from the group consisting of blocker-free ATP, CTP, GTP, and TTP that binds to the target nucleic acid.
In step (b), in the NTP comprising the irreversible blocker, the remaining NTPs except for one selected from the group consisting of the blocker-free ATP, CTP, GTP, and TTP may comprise an irreversible blocker.
Step (b) may comprise alternating NTPs comprising an irreversible blocker and blocker-free NTPs; or NTPs comprising an irreversible blocker and NTPs comprising a cleavable blocker for each reaction cycle.
Reacting a preparation comprising the blocker-free NTP is an elongation reaction, wherein a blocker-free NTP selected from the group consisting of complementary blocker-free ATP, CTP, GTP, and TTP, e.g., dNTP, is coupled to a target nucleic acid of n bp to a desired length, thereby increasing the length of the nucleic acid.
For example, when the sequence of the target nucleic acid is 3′-TGTGGCGGCAGATGATGACGcgggctgctctgatggccaagctataccctaagctgataACCCCAGTCCC CTGCTGTCA-5′, to purify the target nucleic acid, the elongation reaction can be initiated after hybridizing a portion of the target nucleic acid with a complementary sequence 5′-ACACCGCCGTCTACTACTGC-3′.
The complementary sequence 5′-ACACCGCCGTCTACTACTGC-3′ hybridizes to the 3′-5′ site, and if only one of the complementary blocker-free NTPs from among the blocker-free NTPs, in the order of the target nucleic acid sequence, is subsequently bound, binding will not occur in molecules with a different sequence from the target nucleic acid sequence, resulting in a different length of the complementary strand.
For example, hybridization of the complementary sequence and binding of only dGTP will result in binding to the target nucleic acid sequence but not to the sequence that does not contain c, resulting in a difference in the length of the complementary sequence.
If it is desired to use the same mechanism to bind dCTP after dGTP, the target nucleic acid will bind three dCTPs, but if there is an error in ggg, less than three dCTPs will bind.
In this regard, even if using only blocker-free NTP, if the order of blocker-free NTP to bind is determined based on the complementary sequence of the target nucleic acid, the molecule containing the synthesis error will not bind, leading to a difference in the length of the complementary strands, and this difference in length can be used to purify (screen) the target nucleic acid.
This introduces a blocker-free NTP that binds complementarily to the nucleic acid template to which the primer has hybridized, allowing the nucleic acid to be extended to the desired length.
The nucleic acid may be synthesized on a solid support, e.g., glass, beads, particles, or other suitable forms. The material composition of the solid support can be any suitable material, such as a polymeric or silica-based support. For example, it can include plastic, nylon, glass, silica, metal, metal alloy, polyacrylamide, polyacrylate, polystyrene, cross-linked dextran, and combinations thereof.
The reaction is an amplification reaction, meaning a reaction to obtain a replication product of nucleic acid. It can be generated by any of a variety of amplification methods using the target nucleic acid as a template.
This method involves using one of the blocker-free NTPs selected from the group consisting of blocker-free ATP, CTP, GTP, and TTP, to generate a nucleic acid that binds complementarily to a base at a specific position from the 5′ end of the target nucleic acid in the n bp, wherein the remaining NTPs, excluding the NTP selected from the group consisting of blocker-free ATP, CTP, GTP, and TTP, include an irreversible blocker, thereby allowing the elongation of the nucleic acid to be either permanently or temporarily halted at any position from 5′ to 3′.
An NTP comprising a cleavable blocker is included, and the remaining NTPs except for one selected from the group consisting of ATP, CTP, GTP, and TTP comprise an irreversible blocker, thereby allowing the elongation of the nucleic acid to be either permanently or temporarily halted at any position from 5′ to 3′.
The reaction of the above step may result in the binding of a nucleic acid complementary to the target nucleic acid and mismatch to the non-target nucleic acid, such that the reaction is stopped by an NTP comprising an irreversible blocker.
NTPs comprising the irreversible blocker can be, for example, but not limited to, dideoxy nucleoside triphosphates (ddNTPs) or pyrrolidinyl nucleoside triphosphates (prNTPs).
The ddNTP may include, for example, but is not limited to, ddATP, ddCTP, ddGTP, or ddTTP.
The prNTP (pyrrolidinyl nucleoside triphosphate) may be, for example, prATP, prTTP, prGTP, or prCTP, but are not limited thereto.
The NTP comprising the irreversible blocker can be included at a concentration of, for example, 20 to 200 μM. For various concentrations, it has been shown that the irreversible blocker, such as ddATP, ddCTP, ddGTP, or ddTTP, can be introduced with high efficiency, but ddTTP is required to have concentration of 200 μM or more.
The blocker-free NTP may be, for example, but not limited to, dNTP. The blocker-free NTP may be, for example, an unmodified dNTP. The blocker-free NTP can comprise four deoxyribonucleotide triphosphates: dATP (deoxyadenosine triphosphate), dCTP (deoxycytidine triphosphate), dGTP (deoxyguanosine triphosphate), and dTTP (deoxythymidine triphosphate).
In step (b), NTP comprising the cleavable blocker can be included alone.
The NTP comprising the cleavable blocker can include, for example, one or more selected from the group consisting of 3′-O-azidomethyl-dNTP, 3′-ONH2-dNTP, 3′-O-allyl-dNTP, and 3′-O-2-nitrobenzyl-dNTP, but is not limited thereto.
The 3′-O-azidomethyl-dNTP can be 3′-O-azidomethyl-dATP, 3′-O-azidomethyl-dCTP, 3′-O-azidomethyl-dGTP, or 3′-O-azidomethyl-dTTP.
The 3′-ONH2-dNTP may comprise 3′-ONH2-dATP, 3′-ONH2-dCTP, 3′-ONH2-dGTP, or 3′-ONH2-dTTP.
The 3′-O-allyl-dNTP can comprise 3′-O-allyl-dATP, 3′-O-allyl-dCTP, 3′-O-allyl-dGTP, or 3′-O-allyl-dTTP.
The 3′-O-2-nitrobenzyl-dNTP can include 3′-O-2-nitrobenzyl-dGTP, 3′-O-2-nitrobenzyl-dCTP, 3′-O-2-nitrobenzyl-dATP, or 3′-O-2-nitrobenzyl-dTTP.
The preparation of step (b) may further comprise, for example, an enzyme, a buffer, and a salt.
The enzyme is a polymerase, which includes an enzyme that produces complementary copies of a nucleic acid molecule utilizing the nucleic acid as a template strand. DNA polymerase binds to the template strand and then sequentially moves down the template strand, adding nucleotides to the free hydroxyl groups at the 3′ end of the growing strand of the nucleic acid. Polymerase can utilize short strands, called primers, to initiate strand growth. Some polymerases can displace strands upstream of the site where they add a base to the chain. These polymerases are called strand displacers, meaning that they have the activity of removing the complementary strand from the template strand that is read by the polymerase.
The polymerase may be, for example, but not limited to, a Taq DNA polymerase, a Tth DNA polymerase, a Pfu DNA polymerase, a Bst DNA polymerase, a Tli DNA polymerase, a KOD DNA polymerase, an nTha DNA polymerase, or a Tha DNA polymerase.
The buffer may be any buffer suitable for the reaction, and may include, but is not limited to, for example, a pH-stabilizing solution such as Tris, CAPS (N-cyclohexyl-3-aminopropanesulfonic acid), HEPES (4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid), MOPS (3-morpholinopropane-1-sulfonic acid), CAPSO (3-(Cyclohexylamino)-2-hydroxy-1-propanesulfonic acid), EPPS (4-(2-Hydroxyethyl)-1-piperazinepropanesulfonic acid), MES (2-(N-Morpholino)ethanesulfonic acid), PIPES (1,4-Piperazinediethanesulfonic acid), TAPS (N-[Tris(hydroxymethyl)methyl]-3-aminopropanesulfonic acid), TAPSO (2-Hydroxy-3-[tris(hydroxymethyl)methylamino]-1-propanesulfonic acid), TES (2-[(2-Hydroxy1,1-bis(hydroxymethyl)ethyl)amino]ethanesulfonic acid), or PBS (Phosphate Buffered Saline).
The salt may comprise, for example, MnCl2 or MnSO4. In this case, the salt may be included at a concentration of 1 mM or more. The salt may be included at a concentration of, for example, 10 mM or less.
Because RNA has a different chemical structure than DNA, there are differences in the conditions under which reactions can occur with RNA as a template and the resulting reactivity. The introduction of MnCl2 may not be necessary to synthesize complementary sequences to DNA. However, for complementary DNA synthesis to RNA, the introduction of MnCl2 may be necessary when reacting a preparation containing (i) a nucleotide triphosphate (NTP) comprising an irreversible blocker and/or (ii) a blocker-free NTP.
In specific examples according to the present invention, it has been found that when the concentration of MnCl2 in the preparation is greater than or equal to 1 mM, the incorporation yield of NTP comprising irreversible blocker and/or cleavable blocker can approach 100%.
Step (b) is not limited as long as the conditions are sufficient to allow the reaction of the hybridization reactants of step (a) with the preparation to occur, but may be, for example, at a temperature of 60° C. for 15 minutes.
The present invention includes the step (c) of isolating the target nucleic acid to which the complementary nucleic acid is bound from the reactants of step (b).
The elongation reaction of step (b) can be used to isolate the target nucleic acid to which the complementary nucleic acid is bound, excluding products that are mismatched and in which the reaction is stopped by the introduction of an irreversible blocker.
To perform the step (c), the target nucleic acid to which the complementary nucleic acid is bound may be separated from the product in which the reaction has stopped, including the step of ligating the additional tag.
The tag is a capture tag for isolating a target nucleic acid, and it may be intended to separate a target nucleic acid to which a complementary nucleic acid is bound from a product to which the reaction has stopped due to the introduction of an irreversible blocker that has not reacted. The tag can be biotinylated, for example, by contacting a biotinylated dideoxynucleotide to an NTP comprising an irreversible blocker that is non-complementary to the target nucleic acid to form a biotin bond only to the target nucleic acid, and selectively extracting only the target nucleic acid via particles that form a bond with biotin under appropriate binding conditions.
Specifically, biotin-dATP can be introduced using a terminal transferase used for nucleotide triphosphate (NTP) tailing that contains an irreversible blocker, and screened using streptavidin-coated magnetic particles with oligonucleotides that can form a bond with it.
The magnetic particles are available from commercial sources. The magnetic particle is a particle that is not magnetic but responds to a magnetic field by forming a magnetic dipole when exposed to a magnetic field. The magnetic particles include, for example, materials that can be magnetized in the magnetic field but are not themselves magnetic in the absence of a magnetic field. The magnetic particles may be, for example, magnetic beads. The magnetic material includes, but is not limited to, any paramagnetic or superparamagnetic material.
The method may further comprise the step (d) of isolating only the target nucleic acid from the nucleic acid to which the complementary nucleic acid is bound.
The target nucleic acid to which the complementary nucleic acid is bound may be treated with an enzyme to isolate only the target nucleic acid from the target nucleic acid to which the complementary nucleic acid is bound.
The present invention will now be described in more detail with reference to the following examples. These examples are for the purpose of illustrating the invention only, and it will be apparent to one of ordinary skill in the art that the scope of the invention is not to be construed as limited by these examples.
Despite the growing medical use of RNA materials, the purification rate of RNA by universal purification methods is 80% or less. In particular, purification of RNA with one-base error is currently not possible. In order to use RNA medicinally without side effects, research is needed to separate and purify normal RNA from RNA with one-base error that occurs during synthesis.
Therefore, a new chemistry is provided along with a technique to distinguish one-base error. RNA has different physical and chemical properties than DNA, so new chemistry is applied to the reaction.
The detailed process of RNA MOPSS is shown in
The specific of MOPSS is as follows:
Only DNA extended to a certain length was collected based on the length difference, so that blocked RNA remained in solution and was not collected.
1-2. RNA Screening with MOPSS
(SEQ ID NO: 1)
Target bases were highlighted (double underlined), primer binding sites were highlighted (single underlined), and the sequence was zoned according to the elongation method.
The DNA primer was labeled with Cy5 dye, which allowed for a separate wavelength band from conventional nucleic acids staining when viewing the results by gel electrophoresis.
Double underlines were introduced for base differences between A and B types, assuming errors in the RNA. The dash underlined nucleotides were added to artificially introduce a difference in length. The restriction site, I (Inosine), was labeled in bold italics.
To optimize Azido-dNTP introduction efficiency, RNA type A was hybridized onto beads with DNA primer and introduction was performed under various conditions.
The dNTP was extended after introduction of one nucleotide (
If Azido-dNTPs are introduced, short DNA lengths should be seen as the DNA primer does not extend (
Using 9°N DNA polymerase, it was found that Azido-dNTP was introduced under various conditions, but the efficiency was lower than 100%. The incorporation yield was found to be close to 100% when the concentration of MnCl2 in solution was 1 mM or more (
(3) Introduction of ddNTP
To optimize the introduction efficiency of ddNTP, RNA type A was hybridized onto beads with DNA primer, on which dATP, dCTP, dGTP, dTTP, and azido dNTPs were used to move to specific sites (double underlined in the table above) for ddNTP introduction (
After introduction of the ddNTP, the dNTP was elongated and the resulting product was subjected to restriction enzyme (endonuclease V) treatment. If ddNTP is introduced, short DNA lengths should be seen as the DNA primer does not extend (
Bst 3.0 is a polymerase known to be capable of ddNTP or dNTP introduction on both RNA and DNA Polymerase was used, and MnCl2 was added to a final concentration of 1 mM.
For various concentrations, it has been shown that ddATP, ddTTP, ddCTP, and ddGTP were introduced with high efficiency, but ddTTP was required to have a concentration of 200 μM or more (
Based on the conditions of Examples 1-2, an experiment to select specific RNAs among type A and type B RNAs was conducted.
Type A and B RNAs can be screened by blocking them with ddNTPs because they differ in specific bases. The two types of RNA were hybridized simultaneously on beads and introduction was performed under various conditions.
To screen the positions, the first region “UUCGGGUGUUUUUU” shown in Table 2 was extended with d(C,G,A)NTP, and the subsequent region “CCGA” was extended with azido-dNTP for 4 cycles. Blocking with Azido-dNTP was removed by treatment with 100 mM of Tris(2-Carboxyethyl)phosphine (TCEP) at pH9.0 at 65° C. for 15 min.
Subsequent ddATP treatment, dNTP introduction, and restriction enzyme (endonuclease V) treatment confirmed that only DNA corresponding to the specific RNA was selected and extended (
Terminal transferase, which is used for DNA tailing, was then used to introduce biotin-dATP, resulting in enrichment of only RNA A using streptavidin beads.
The sequence of the target RNA, the sequence of the primer for MOPSS (primer 1 below), and the sequence of the oligo used for ligation (primer 2) are shown below.
Primer 2 is phosphorylated at the 5′ end for ligation, and in
In addition, each sample was loaded so that the ratio of P1:RNA:P2 was 5:1:10.
In
Table 3 shows the results of how the samples in each lane reacted.
Lane 2 is the result of MOPSS and it can be seen that the darker band at the top and the fainter band at the bottom in the yellow circle, where the top is the DNA that has been elongated to the desired position and the bottom is the byproduct.
Lane 4 is the result of performing a ligation reaction on the sample of lane 2, and compared to the result from lane 2, it can be seen that the DNA has ligated to the desired position and lengthened (yellow circle at the top of lane 4).
It was also found that when checking the yellow circle at the bottom of lane 4, the amount of DNA that had previously extended to the desired location was significantly reduced compared to that in lane 2.
On the other hand, other products, RNA, etc. did not change in length and quantity, indicating that ligation did not occur.
Overall, from the results in
The results from lane 4 also confirmed that there was no length change in other products or RNA after the ligation reaction.
Lanes 6 and 8 demonstrated by treating DNase and RNase, respectively, that the product in the yellow circle above lane 4 is DNA that extends to the desired location and that the other products are RNA, primers, or DNA that does not extend to the desired location.
Using the CRISPR (clustered regularly interspaced short palindromic repeats) system, therapeutic RNAs that can correct DNA errors at the gene level were purified through MOPSS and the purification effect was confirmed (using the actual RNA sequence).
The therapeutic RNAs purified by MOPSS are single-guide RNAs (sgRNAs) and prime editing guide RNAs (pegRNAs).
The structure simulation results of sgRNA and pegRNA at room temperature (20° C.) using MOPSS are shown in
G
AG CAC ACG UCU GAA CUC CAG UCA (SEQ ID NO: 9)
GGG CAG GGC UGG GAG GGU UUU UUU (SEQ ID NO: 10)
For purification using MOPSS for the corresponding RNA, DNA primer binding, hybridization, blocking, elongation, restriction, and capture on the six magnetic beads described above were performed in a stepwise manner.
DNA primer complementary to a portion of the sequence in the 3′ direction of the RNA to undergo MOPSS was chemically bound to the magnetic bead. The DNA primer on the bead and the corresponding RNA were combined via hybridization, immobilizing the RNA on the bead. Afterward, ddNTP blocking and dNTP elongation were conducted in an alternating manner, and MOPSS was performed sequentially according to the type of base corresponding to the sgRNA sequence.
Blocking was performed with a buffer containing 1 mM Mn2+ and a concentration of 40 uM ddNTP. Elongation was performed without Mn2+ and at a concentration of 20 uM dNTP.
The order is ddNTP, then dNTP. The ddNTP proceeds with a combination of three ddNTPs that do not contain the sequence of the subsequent dNTP. The dNTP proceeded one at a time through the known sequences.
One MOPSS cycle is a set of blocking and dNTP elongation using a combination of three different ddNTPs corresponding to one base on the RNA sequence. It was confirmed that dNTPs can elongate sequentially in sgRNA (
In
The gel shows the sequential lengthening of DNA through dNTP elongation up to cycle 9th.
To proceed with the blocking, it was verified that the ddNTP selectively binds only to the corresponding base, preventing the progression of dNTP elongation (
It was shown that blocking works perfectly for all four types of bases A, G, C, and U in the sgRNA.
Without blocking, a red DNA band that progressed to the 12th cycle, which is the control (the DNA band increased to the white line in
Finally, the DNA that has undergone MOPSS to reach the target position is finally linked with biotin-dNTP at the 3′ end of the DNA so that it can be selectively extracted by streptavidin magnetic beads during the capture step.
Below are gel images of DNA and RNA collected on streptavidin beads after MOPSS of 2, 4, 6, and 8 cycles with biotin-dNTPs attached to the ends (
The results of the captured DNA and RNA are shown as a set for cycles 2, 4, 6, and 8, in which biotin-dNTPs are attached after each MOPSS cycle.
The results of the MOPSS cycle and collection of only DNA bands bound by biotin-dNTPs show that the pure MOPSS product can be separated and extracted by the second streptavidin magnetic bead.
To analyze the improvement of MOPSS, Next-Generation Sequencing (NGS) on RNA was performed before and after MOPSS.
The effect of decreasing the error in bases on the sgRNA after 8 cycles of MOPSS is shown in
Results from NGS on raw RNA before MOPSS are labeled as control. The results of MOPSS are labeled as MOPSS.
The graphs in
The results show that MOPSS results in an overall reduction in total errors per base compared to the control (
When analyzed by the types of errors that make up the total error rate (insertion, deletion, and substitution), the overall effect was to reduce the error rate (
To analyze the effect of progressing through MOPSS, the number of windows was increased one by one (e.g., C→CT→CTT→ . . . ) and the data was analyzed accordingly. As the window increases, the number of error-containing bases increases, showing a gradual increase in the number of error-containing molecules and a corresponding decrease in the number of error-free molecules.
It was analyzed whether the number of errors within molecules that contain errors is also reduced, as a smaller percentage of errors increase the likelihood that they will not cause significant errors in the final function (
As the window increased, the number of errors also tended to increase, and within the 8 cycles of MOPSS, the error improvement was seen by confirming how many the errors in that range decreased relative to the control as the window increased. The improvement was seen across all regions regardless of the number of errors, and when using the entire range up to 8th cycle as the window, an improvement effect of approximately 40% was observed in all error counts.
To analyze the error reduction effect mentioned above, the error reduction was calculated and plotted compared to the control using the formula (ErrorMOPSS,type−ErrorControl,type)/ErrorControl,type (
As above, it showed that MOPSS is also possible for pegRNA (
The same MOPSS was performed, but with longer MOPSS cycles (8 cycles for sgRNA and 20 cycles for pegRNA).
The simulation structure of the pegRNA at the temperature of the MOPSS run (
Due to the structural complexity of pegRNA and the longer cycles of MOPSS than sgRNA, fewer end products than sgRNA were observed, but it was shown that the MOSS products were selectively collected by finally proceeding to capture, demonstrating that pegRNA can also be purified via MOPSS (lane 6 of
While the foregoing has described in detail certain aspects of the present invention, it will be apparent to one of ordinary skill in the art that these specific descriptions are merely preferred examples and are not intended to limit the scope of the invention. Accordingly, the substantial scope of the present invention will be defined by the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
10-2023-0122320 | Sep 2023 | KR | national |