The present invention relates to a method of evaluating a DNA-encoded library.
A compound library is a group of compound derivatives in which compounds having a possibility to have a specific activity, such as a drug candidate compound, are systematically collected. The compound library is, in many cases, synthesized based on synthetic techniques and methodologies of combinatorial chemistry. The combinatorial chemistry is a field of experimental methods for efficiently synthesizing a series of compound libraries enumerated and designed based on the combinatorics with a wide variety of compounds by a systematic synthetic route and research relating thereto.
DNA-encoded library is one kind of compound library based on the combinatorial chemistry. Hereinafter, the DNA-encoded library is appropriately abbreviated to as DEL. In DEL, a DNA tag is added to each compound in the library. The sequence of the DNA tag is designed so that each structure in each compound can be identified and functions as a label for the compound (Patent Documents 1 to 3).
A plurality of compounds having potential for pharmaceutical development have been found by screening using DEL. The screening using DEL is carried out, for example, as follows (Non-Patent Documents 1 to 3).
However, the screening method as described above has several problems. First, since it is necessary to immobilize the target protein, depending on the protein, the steric structure will change after the immobilization. In such a case, the compound obtained by screening does not bind to a desired non-immobilized target protein, and a target adaptation range of the DEL is limited (Patent Documents 4 and 5, and Non-Patent Documents 4 and 5). In addition, in the above screening method, a binder having high affinity can be recovered, but a binder having medium affinity (for example, a Kd value is on the order of M) has low kinetic stability of a complex with a target protein, and is difficult to recover after washing and removal. As is well-known to those skilled in the art, such a binder having medium affinity is also useful as a hit compound serving as a starting point for drug discovery, and is also useful as structure-activity relationship information (Non-Patent Document 6).
As an approach to solve the above problems, a plurality of screening methods using a cross linker-modified DEL have recently been reported. As will be shown below, several different methods have been reported, all of which are common in that targets in proximity to an affinity library molecule are reacted with a cross linker to form a covalent bond. The method is useful because it allows screening for DEL without immobilizing the target protein (or before immobilizing the target protein) (Patent Documents 4 and 5, and Non-Patent Documents 4, 5, and 7 to 9).
Here, a DNA strand structure of the DEL will be described. The DNA strand structure of the conventionally known DEL is represented by two kinds, i.e. a hairpin strand and a double strand.
Hereinafter, the outline of a double strand DEL and a hairpin strand DEL and the merit and the demerit thereof are described.
A DEL using a hairpin-stranded DNA is a single-stranded structure in which two complementary DNA strands are linked, and synthesized by using a hairpin type DNA having functional groups for introducing various building blocks as a starting material (head piece) (Patent Document 3, and Non-Patent Document 1 and 2).
In many cases, this method uses a relatively short double-stranded DNA tag with about 9 to 13-mer having a sticky end with 2-mer, and the double-stranded DNA tag is introduced by a ligation reaction by a DNA ligase. Using such a short DNA tag becomes possible because the hairpin-stranded DNA strongly forms a duplex in the molecule and the DNA site other than the sticky end does not interfere with the DNA tag. Using a short double-stranded DNA tag has some merits in the DEL synthesis. Cost for synthesizing the DNA tag is low is mentioned as one merit. Also, as another merit, using a shorter DNA tag suppresses the overall length of the DEL short when the same number of reaction cycles are encoded. That is, if a larger number of cycles is encoded, the overall length of the DEL can be suppressed to a range in which the DNA sequence can be efficiently read by the next-generation sequencer. In fact, in Non-Patent Document 3, construction of the DEL using a hairpin-stranded DNA encoding 6 cycle reactions has been achieved by using a hairpin-stranded DNA.
Different from the double strand, in the hairpin strand, even when a duplex structure is melted during the reaction under heating, the duplex in the original molecule is reformed without generating strand exchange under the subsequent reannealing conditions. Accordingly, DEL using a hairpin-stranded DNA has a merit that it can be used under a wider range of chemical conditions (Non-Patent Document 2). Also, in general, as for nucleic acid strands, if the chain length is the same, the hairpin strand forms a stronger duplex than the double strand (Tm value is high). Accordingly, under various chemical conditions at the time of introducing the building blocks, each chemical structure of the hairpin-stranded DNA, particularly the structure of the base portion, should resist the structural conversion as compared with the double strand.
The hairpin-stranded DNA forms a duplex in the molecule, and it is difficult to newly form a double strand with another oligonucleotide chain. Therefore, it is difficult to convert it into a cross linker-modified double-stranded DEL by adding a new cross linker-modified oligonucleotide.
A DEL using a double-stranded DNA is synthesized using a single-stranded DNA (single-stranded DNA that is not a hairpin strand) or double-stranded DNA having a functional group(s) for introducing various building blocks as a starting material (head piece).
Contrary to the DEL using the hairpin-stranded DNA, in many cases, relatively long single-stranded or double-stranded DNA tags with about 20 to 30-mer having 4 to 10-mer sticky end have been used (Patent Document 2 and Non-Patent Document 10) and a DEL encoding about 3 cycles reaction is common.
The double-stranded DNA can be converted into a single-stranded DNA by denaturation, or a strand exchange reaction can be carried out for it, which is advantageous in that it can be converted into a DNA structure suitable for various purposes. Therefore, conversion to a cross linker-modified double-stranded DEL can be carried out by adding a new cross linker-modified single-stranded oligonucleotide (Non-Patent Documents 7, 8, and 11).
Like this, although the hairpin-stranded DNA and the double-stranded DNA each have merits at the time of synthesis of DEL and evaluation, no technique that can achieve both merits has been known.
As an example of a report on screening using a cross linker-modified DEL, the following is known.
In Non-Patent Documents 7 and 8, a single-stranded DEL having a library molecule at the 5′ end is synthesized, a double strand with a DNA having a photoreactive cross linker at the 3′ end are formed, and screening is carried out to obtain a binder having medium affinity. On the other hand, in this method, since the DNA linked to the photoreactive cross linker does not contain a coding sequence, when the DNA is exposed to strong separation or elution conditions for removaling a non-specific binder, etc., there is a possibility that the double strand is separated and a sequence encoding a desired structure cannot be obtained. In addition, since the single-stranded DEL is used, the merits of the hairpin-stranded DEL cannot be utilized in synthesis.
Xiaoyu Li et al. report a screening method using a cross linker-modified DEL (Patent Document 5, and Non-Patent Documents 4, 5, and 12). In Non-Patent Document 5, a single-stranded DEL having a library molecule at the 3′ end is synthesized, and a double strand are formed with a short-stranded DNA having a photoreactive cross linker at the 5′ end and to make a photoreactive cross linker-modified DEL. Merits of this method that the coding sequence is covalently linked to the target by an elongation reaction with DNA polymerase since a photoreactive cross linker is present at the 5′ end are included. Therefore, strong separation and elution conditions can be included in the screening. However, a method for simply synthesizing a single-stranded DEL having a library molecule at the 3′ end has not been reported. In addition, similarly, the method does not take the merits of the hairpin-stranded DEL in synthesis.
Patent Document 4 describes a hairpin-stranded DEL having a linking site with a cross linker. Although this method can solve the problems as in Non-Patent Document 5 described above, it has another problem that library synthesis needs to be carried out without impairing the functional group for cross linker linking and that usable reactions and/or library molecular structures can be limited.
Patent Document 6 describes synthesis of a double-stranded DEL crosslinked by a reversible covalent bond (which is considered to have properties equivalent to those of a hairpin-stranded DEL from the viewpoint of duplex forming ability, etc.) and conversion to a cross linker-modified DEL. As a reversible covalent bond, a covalent bond by [2+2] photocyclization between a special base such as cyanovinylcarbazole and a pyrimidine base is disclosed. However, it is known that the photocyclized pyrimidine base has lost aromaticity, and such a pyrimidine base having lost aromaticity is chemically unstable and decomposed under basic conditions (Non-Patent Document 13). Therefore, in this method, usable reactions are limited during DEL synthesis, and the library molecular structure that can be constructed is also limited.
As described above, there is no known technique which has merits in terms of synthesis at the same level as the conventional hairpin-stranded DEL and is easily adaptable to screening with a cross linker-modified DEL.
The present invention provides a method of inducing a DEL containing a cleavable site in a DNA strand to a cross linker-modified double-stranded DEL and evaluating the DEL.
As one of chemistry of a nucleic acid such as DNA, there is a technology to cleavage the nucleic acid. For example, when deoxyuridine is introduced into a DNA strand, it can be selectively cleaved by USER® enzyme.
The present inventors have found that, as a result of earnest studies, both the merits of hairpin-stranded DNA and double-stranded DNA can be obtained, for example, by introducing a cleavable site such as deoxyuridine to the DNA strand, and have solved the above problems by easily inducing to a cross linker-modified DEL.
Accordingly, the present invention is as follows.
An-Sp-C-Bn (III)
The present invention provides a method of inducing a DEL containing a cleavable site in a DNA strand to a cross linker-modified double-stranded DEL and evaluating the DEL. That is, it provides a technique for screening a compound including both “simple DEL synthesis method” and “expansion and improvement of the DEL evaluation method” as compared with the conventional one. Therefore, according to the present invention, an opportunity to acquire a hit compound useful in the development of drugs, agrochemicals, and medical materials is expanded.
As mentioned above and a concept well known to those skilled in the art, in the present invention, a compound library means a group of compound derivatives in which compounds possibly having a specific activity such as a drug candidate compound are systematically collected. The compound library is, in many cases, synthesized based on the synthetic techniques and methodologies of combinatorial chemistry. The combinatorial chemistry is a field of experimental methods for efficiently synthesizing a series of compound libraries enumerated and designed based on the combinatorics with a wide variety of compounds by a systematic synthetic route and research relating thereto.
As mentioned above and well known to those skilled in the art, there is a DNA-encoded library as one kind of compound library based on the combinatorial chemistry. The DNA-encoded library is appropriately abbreviated as DEL. Also, DEL is essentially synonymous with a DNA-encoded compound library.
In the present invention, the DNA-encoded library means a library in which a tag of DNA is added to each compound in the library. In the tag of DNA, a sequence is so designed that each structure in each compound can be identified and functions as a label of the compound.
Nucleotides are, in general, understood as substances in which a phosphate group is bound to a nucleoside. Whereas nucleotides and nucleosides are terms well known to those skilled in the art, nucleosides are, as one general embodiment, understood as materials in which a nucleic acid base, such as a purine base or a pyrimidine base, is attached to the 1-position of a sugar such as a pentose via a glycoside bonding. Nucleosides and nucleotides are also units that constitute nucleic acids such as DNA and RNA.
Also, a nucleic acid is a well-known concept for those skilled in the art, and as a general embodiment, it is understood as a polymer of nucleotides.
As one embodiment, the nucleic acid according to the present invention is a polymer composed of nucleotides and nucleic acid analogues mentioned later.
Also, in the present specification, in addition to a nucleic acid polymer composed of nucleotide and nucleic acid analogues, a nucleic acid monomer, such as nucleotides and nucleic acid analogues, may be also simply referred to as a nucleic acid. The latter usage is also a usage according to the common technical knowledge and can be understood by those skilled in the art according to the context as appropriate.
Nucleotides in a broad sense include, in addition to natural nucleotides (original nucleotides), artificial nucleotides (various kinds of nucleic acid analogues). Nucleotides in a broad sense in the present invention include the following embodiments.
In the present invention, when it is described as a nucleotide without any particular limitation, it means a natural nucleotide. The natural nucleotide is a term well known to those skilled in the art and is not particularly limited as long as it is essentially naturally existing nucleotide. As one embodiment, the natural nucleotide in the present invention is the nucleotide described in the above (A).
Nucleic acid analogue is a term well known to those skilled in the art and the structure of the nucleic acid analogue in the present invention is not limited as long as it has the effect of the present invention
As one embodiment, the nucleic acid analogue is a compound of the embodiments of the above (B) to (H).
As one embodiment, the nucleic acid analogue in the present invention is a compound having a phosphoric acid-corresponding site and a hydroxyl group-corresponding site in the nucleic acid monomer. The nucleic acid analogue is more preferably a compound having a phosphoric acid site and a hydroxyl group.
As one embodiment, the nucleic acid analogue in the present invention is a compound that can be utilized as a monomer in a nucleic acid synthesizer. Whereas it is well known for those skilled in the art, in the nucleic acid synthesizer, a nucleic acid oligomer can be synthesized by converting phosphoric acid (or a corresponding site thereto) of the nucleic acid analogue into a phosphoramidite and utilizing it as a monomer in which a hydroxyl group (or a corresponding site thereto) is protected by a protective group.
The partial structure other than phosphoric acid site (or a corresponding site thereto) and a hydroxyl group (or a corresponding site thereto) in the nucleic acid analogue can be said to be a nucleic acid analogue residue. The structure of the nucleic acid analogue residue is not limited as long as it has the effect of the present invention, but when the characteristics of the respective structures of the natural nucleic acids (deoxyadenosine, thymidine, deoxycytidine, and deoxyguanosine) are confirmed as a reference, the characteristics include that the molecular weight is from about 322 (thymidine monophosphate) to about 347 (deoxyguanosine monophosphate) and the number of the atoms from oxygen atom of the hydroxyl group at the 3′ position and the phosphorus atom at the 5′ position constituting the nucleic acid strand (including the oxygen atom and the phosphorus atom; hereinafter also referred to as the number of atoms between the residues) is 6. As the nucleic acid analogue that can be used for a nucleic acid synthesizer, the following are known.
As a reference, the structure of each nucleic acid analogue is described below.
Accordingly, as one embodiment, the nucleic acid analogue is a compound (B1) characterized by the following:
As one embodiment, the nucleic acid analogue is a compound (B2) characterized by the following:
As one embodiment, the nucleic acid analogue is a compound (B3) characterized by the following:
As one embodiment, the nucleic acid analogue is a following compound (B41), (B42), (B43), (B44), (B5), (B51), or (B52).
In the present invention, an oligonucleotide and oligonucleotide chain mean a polymer of a nucleotide having one or more nucleotides at the 5′ end and the 3′ end and at the internal positions between the 5′ end and the 3′ end.
Mutually complementary base sequence means a sequence of nucleotides which can form the so-called complementary base pairs that form a fixed pair of adenine and thymine (or uracil), or guanine and cytosine between two oligonucleotides of nucleic acids and are linked by hydrogen bonds. Formation of the complementary base pairs is also called hybridization.
The complementary base pairs are a concept generally called “Watson-Crick type base pairs” and “natural type base pairs”. However, the base pairs may be Watson-Crick type, Hoogsteen type base pairs, or base pairs by other hydrogen bond motif (for example, diaminopurine and T, 5-methyl C and G, 2-thiothymine and A, 6-hydroxypurine and C, pseudoisocytosine and G) formation, etc. As long as two oligonucleotides are sequences that can form a double strand and can be used for the purpose of the present invention, there is no limitation on the sequence of “mutually complementary base sequence” and there is no limitation on the homology between the two sequences. The homology is preferably, in a more preferable order, 99% or more, 98% or more, 95% or more, 90% or more, 85% or more, 80% or more, 70% or more, 60% or more, or 50% or more.
Whereas it is repeated again, to hybridize in the present invention means an act to form a double strand by oligonucleotides or oligonucleotide chains containing mutually complementary base sequences and a phenomenon to form a duplex by oligonucleotides or oligonucleotide chains containing complementary sequences.
The duplex in the present invention means a state that two nucleic acid strands form complementary base pairs (are hybridized). The two nucleic acid strands may be derived from two nucleic acid strands or may be derived from two nucleic acid sequences in one nucleic acid strand molecule.
In the present invention, the double-stranded oligonucleotide and the double-stranded oligonucleotide chain mean a secondary structure formed by two or more different oligonucleotide chains being hybridized. The chain lengths of the two oligonucleotides may be different and may have regions that are not hybridized.
The region in which the two chains are hybridized is a duplex.
In the present invention, the double-stranded DNA means a secondary structure formed by two different DNA strands being hybridized. The chain lengths of the respective DNA strands may be different and may have regions that are not hybridized. The DNA strands are not limited to naturally existing deoxyribonucleotides and mean all oligonucleotide chains that can be amplified by DNA polymerase.
In the present invention, “forming a duplex” may form a duplex under standard conditions for handling oligonucleotides, for example, at a temperature of 4 to 40° C., an aqueous solvent and a pH of 4 to 10. For example, even if there is a case where no duplex is formed by a specific solvent and conditions, the nucleic acid is a nucleic acid that forms a duplex if the nucleic acid forms a duplex under standard conditions.
In the present invention, the Tm value refers to a temperature at which half of the DNA molecules are annealed with the complementary strand.
In the present invention, the blunt end means that both terminals of the double-stranded oligonucleotide are paired without protruding.
In the present invention, the protruding end means that, among the terminals of the double-stranded oligonucleotide, one of the strands has a protruded portion. The protruded portion of the protruding end can be of any length, and the length is preferably 1 to 50 bases, more preferably 1 to 30 bases, further preferably 1 to 15 bases, and most preferably 2 to 6 bases. In a specific embodiment, it is possible that the protruded portion can be used as a hybridizing region for carrying out ligation of the sticky end.
PCR means a polymerase chain reaction. PCR is means for amplifying the oligonucleotide chains and is a technique well known to those skilled in the art. When the outline of the process of PCR is explained, in PCR, (1) the double-stranded oligonucleotide chain to be amplified is dissociated into two single strands by heat treatment, etc., and (2) after adjusting the temperature suitable for the enzymatic reaction, strands complementary to the respective single strands are synthesized by an enzyme (DNA polymerase, etc.) existing in the reaction system. That is, one double-stranded oligonucleotide can be amplified to be two. In PCR, oligonucleotide chains can be amplified with high efficiency by repeating the processes (1) and (2) by adjusting the temperature.
In the present invention, the primer means an oligonucleotide that is annealed to an oligonucleotide chain which becomes a template and can be elongated by a polymerase in a template-dependent manner.
In the present invention, the primer sequence for PCR means a sequence of a portion in the oligonucleotide chain to which the primer is annealed and is preferably a sequence suitable for PCR as known in the art and is preferably present at the terminal of the oligonucleotide chain.
In the present invention, the nick means a portion in the double-stranded oligonucleotide chain in which a linkage between the nucleotides is lacking and the oligonucleotide chain is broken. The 5′ side of this lacking portion may have a phosphoric acid group or may not have a phosphoric acid group.
In the present invention, the gap means a portion in the double-stranded oligonucleotide chain in which one or more consecutive nucleotides are deleted and the oligonucleotide chains are separated. The 5′ side of the deleted portion may have a phosphoric acid group or may not have a phosphoric acid group.
In the present invention, the hairpin strand is a single-stranded structure in which two complementary nucleic acid strands are linked and the characteristics of the hairpin strand and the hairpin strand DEL are as described above. The terms “hairpin site”, “hairpin structure”, and “hairpin type” used in the present invention are understood as terms derived from the hairpin having the same concept as the above-mentioned “hairpin strand”.
In the present invention, the nucleic acid ligation reaction and ligation mean a reaction in which the terminals of nucleic acids are linked to each other.
The nucleic acid ligation reaction by an enzyme and enzymatic ligation means a reaction in which the terminals of nucleic acids are linked to each other using an enzyme.
An enzyme that can be used in the nucleic acid ligation reaction is, for example, DNA ligase, RNA ligase, DNA polymerase, RNA polymerase, or topoisomerase.
As one embodiment, DNA ligase is an enzyme that ligates the terminals of DNA strands with a phosphoric acid diester bond. As one embodiment, DNA ligase is understood as a ligase belonging to EC number: 6.5.1.1 or 6.5.1.2. DNA ligase is also called polydeoxyribonucleotide synthase or polynucleotide ligase, etc. Examples of DNA ligase include DNA ligases I, II, III, and IV, and T4 DNA ligase, etc.
As one embodiment, RNA ligase is an enzyme that ligates the terminals of RNA strands with a phosphoric acid diester bond. As one embodiment, RNA ligase is understood as a ligase belonging to EC number: 6.5.1.3. Also, as one embodiment, RNA ligase belongs to the lineage of poly(ribonucleotide): poly(ribonucleotide) ligase. RNA ligase is also called as polyribonucleotide synthase or polyribonucleotide ligase.
In the present invention, the chemical ligation means a reaction in which the terminals of the nucleic acids are bound to each other without using an enzyme.
In the chemical ligation, a ligating portion is formed by reacting the terminals of the nucleic acids having a functional group which becomes a pair of the chemical reaction. The functional group which becomes a pair of the chemical reaction is, for example, a pair of an alkynyl group which may be substituted and an azide group which may be substituted, a pair of a diene which may be substituted having a 471 electron system (for example, a 1,3-unsaturated compound which may be substituted, for example, it includes 1,3-butadiene, 1-methoxy-3-trimethylsilyloxy-1,3-butadiene, cyclopentadiene, cyclohexadiene, or furan, each of which may be substituted) and a dienophile which may be substituted or a heterodienophile which may be substituted (for example, it includes an alkenyl group which may be substituted or an alkynyl group which may be substituted) having a 2π electron system, a pair of an amino group which may be substituted and a carboxylic acid group, a pair of a phosphorothioate group and an iodo group (for example, it includes a phosphorothioate group at the 3′ end and an iodo group at the 5′ end), or a pair of a phosphoric acid group and a hydroxy group (for example, it includes a pair of a phosphoric acid group at the 5′ end and a hydroxy group at the 3′ end, or a pair of a hydroxy group at the 5′ end and a phosphoric acid group at the 3′ end).
The chemical ligation is a concept well known to those skilled in the art and those skilled in the art can appropriately achieve chemical ligation based on the common general technical knowledge. In addition to the above, it can be also referred to Artificial DNA; PNA & XNA, 2014, vol. 5, e27896; Current Opinion in Chemical Biology, 2015, vol. 26, pp. 80-88, etc.
In the present invention, “selectively cleavable” means that, in a certain compound, only a specific site can be selectively cleaved under predetermined conditions without changing the other molecular structures of the compound.
In the present invention, a “selectively cleavable site” means, in a certain compound, a site that can be selectively cleaved under predetermined conditions.
As one embodiment, a preferred structure of the “selectively cleavable site” in the present invention is a “selectively cleavable nucleic acid”. The site may be a site that is comprised of a plurality of nucleic acids and can be successfully cleaved by a specific sequence, or may be a site comprised of a single nucleic acid. When the cleavable site is a nucleic acid, it is preferable in the viewpoints that (1) established producing methods, such as a nucleic acid synthesizer, can be utilized so that the production efficiency is good, (2) in the reaction conditions for constructing the building blocks of DEL, it is essential that the nucleic acid at the DNA tag portion is not decomposed, so that when the cleavable site is nucleic acid, it does not decompose as well, etc.
More preferred structure of the above-mentioned “selectively cleavable nucleic acid” is nucleic acid containing a nucleotide which is not comprised in the sequence of the DNA tag of DEL. If the cleavable site is a nucleotide which is not comprised in the sequence of the DNA tag, it is possible to utilize it without limiting the sequence of the DNA tag to avoid the cleavage of the DNA tag portion.
As the nucleic acid used for the sequence of the DNA tag, deoxyadenosine, deoxyguanosine, thymidine, and deoxycytidine are preferable. Therefore, a preferred structure of the selectively cleavable site is a nucleic acid that is neither deoxyadenosine, deoxyguanosine, thymidine, nor deoxycytidine.
Examples of the “selectively cleavable site” include a “nucleotide having a cleavable base”. For example, in the “nucleotide having a cleavable base” in DEL, the N-glycoside bond between the base portion and the sugar portion is cleaved by the action of DNA glycosylase to leave an abasic site. The phosphodiester bond adjacent to the abasic site is cleaved by chemical condition change (for example, temperature rise, basic hydrolysis, etc.) or an enzyme having depurine/depyrimidine (AP) endonuclease activity or AP lyase activity (for example, endonuclease III, endonuclease IV, endonuclease V, endonuclease VI, endonuclease VII, endonuclease VIII, APEl (human-derived AP endonuclease), Fpg (formamide pyridine-DNA glycosylase), etc.) to form a gap with one base unit or a nick.
Examples of the “nucleotide having the cleavable base” include deoxyuridine, bromodeoxyuridine, deoxyinosine, 8-hydroxydeoxyguanosine, 3-methyl-2′-deoxyadenosine, N6-etheno-2′-deoxyadenosine, 7-methyl-2′-deoxyguanosine, 2′-deoxyxanthosine, 5,6-dihydroxydeoxythymidine, etc. Nucleotides having other cleavable bases are obvious to those of skill in the art. By incorporating these “nucleotides having the cleavable base” into DEL and using a DNA glycosylase that specifically recognizes the structure, the DEL is selectively debased.
In the present invention, the DNA glycosylase refers to an enzyme which is an optional enzyme having glycosylase activity, recognizes an optional nucleic acid base portion in the oligonucleotide, cleaves the N-glycoside bond between the base portion and the sugar portion, and creates an abasic site. For example, Examples include uracil DNA glycosylase (recognizing deoxyuridine), alkyladenine DNA glycosylase (recognizing 3-methyl-2′-deoxyadenosine, 7-methyl-2′-deoxyguanosine, and deoxyinosine), Fpg (recognizing 8-hydroxydeoxyguanosine), endonuclease VIII (recognizing decomposed pyrimidine base such as 5,6-dihydroxydeoxythymidine or uracil glycol), SUMG1 (abbreviation of single-strand selective uracil DNA glycosylase, which recognizes deoxyuridine), etc.
In the present invention, more preferable examples of the “selectively cleavable site include deoxyinosine and deoxyuridine.
In the present invention, particularly preferable examples of the “selectively cleavable site” include deoxyuridine.
As one embodiment, the “selectively cleavable site” in the present invention is preferably cleaved using an enzyme. The enzyme generally has high substrate specificity and does not recognize the DNA tag portion of DEL and the compound portion constructed by a plurality of building blocks as a substrate and recognizes only the “selectively cleavable site” and acts so that it is preferable. Also, the cleavage using the above-mentioned enzyme may be achieved by changing the structure of the “selectively cleavable site” by the enzyme and then changing the chemical conditions. Examples of the enzyme include glycosylase and nuclease.
In the present invention, the glycosylase is an enzyme having a function of hydrolyzing a glycoside bond (a covalent bond formed by dehydration condensation of a sugar molecule and another organic compound). Among them, the DNA glycosylase is an enzyme that recognizes the nucleic acid base portion in the oligonucleotide, as mentioned above, and hydrolyzes the glycoside bond.
In the present invention, the nuclease is an enzyme having a function of hydrolyzing a phosphodiester bond between the sugar and the phosphoric acid of the nucleic acid. The nuclease includes AP endonuclease, nicking endonuclease, and ribonuclease, for example.
The AP endonuclease cleaves a phosphodiester bond adjacent to the abasic site formed by the action of an optional DNA glycosylase as mentioned above. Therefore, in the present invention, it is preferable to use DNA glycosylase and AP endonuclease in combination.
The nicking endonuclease (for example, Nb.BbvCI, Nb.BsmI, Nb.BsrDI, etc.) recognizes a specific DNA sequence and generates a nick in which a phosphodiester bond is cleaved only one strand among the double strand. Also, the endonuclease V can generate a nick in which the second phosphodiester bond in the 3′ direction from the deoxyinosine is cleaved, which is useful for carrying out the present invention.
The ribonuclease is an enzyme that decomposes RNA. In the present invention, the ribonucleoside is used as the “selectively cleavable site” and it can be utilized by allowing ribonuclease to act on it. RNaseHII, which is a kind of the ribonuclease, can generate a nick in which the phosphodiester bond on the 5′ side of the ribonucleotide incorporated into the DNA sequence is cleaved, which is useful for carrying out the present invention.
In the present invention, USER® means “Uracil-Specific Excision Reagent” Enzyme. The USER is an endonuclease cocktail that removes uracil including uracil DNA glycosylase (UDG) and endonuclease VIII. The USER removes uracil in the double-stranded DNA to generate a one base gap to cleave the DNA strand. In the process of the USER, UDG firstly removes an uracil base to produce an abasic site. Subsequently, the endonuclease decomposes a phosphodiester bond to liberate a deoxyribose having no base to produce a one base gap.
In the explanation of the present specification, USER® enzyme and USER® Enzyme are USER® in the above-mentioned definition.
In the present invention, the exonuclease is an enzyme having a function of sequentially hydrolyzing phosphodiester bonds from the 5′ end or 3′ end of nucleic acids. The exonuclease includes lambda exonuclease, exonuclease III, and T7 exonuclease, for example.
The lambda exonuclease is an enzyme that decomposes DNA whose 5′ end in the double-stranded DNA is phosphorylated. In the present invention, it is useful in the scene where a double-stranded DEL is converted into a single-stranded DEL.
In the present invention, the building block is a portion that has a functional group and can constitute a part of a compound, which may be in the form of a compound.
In the present invention, the base sequence which can identify the respective building blocks means a specific base sequence designed to correspond to the structures of the respective building blocks. To design a sequence means, for example, to assign the nucleic acid base sequence to each structure such as to assign the nucleic acid base sequence AAA to the building block structure A, the nucleic acid base sequence TTT to the structure B, and the nucleic acid base sequence CGC to the structure C. The sequence can be freely designed as long as the object of the present invention is achieved. For example, an optional number of base sequences can be assigned to one building block.
In the present invention, the oligonucleotide tag is a partial structure comprising an oligonucleotide which comprises a base sequence capable of identifying the structure of the partial structure constructed by the building blocks. In the present invention, the oligonucleotide tag may be an oligonucleotide corresponding to each building block, or may be a longer chain oligonucleotide containing an oligonucleotide corresponding to a plurality of building blocks.
The nucleotide constituting the oligonucleotide tag of the present invention is not limited as long as it can accomplish the effect of the present invention and, in the viewpoint of easiness of amplification by PCR and analysis by a sequencer, it is desirably a nucleotide suitable for these operations. Examples of such a preferable nucleotide include a nucleotide having the above-mentioned natural nucleic acid base as the base portion and having the above-mentioned ribose or 2′-deoxyribose as the sugar portion and more preferable examples include deoxyadenosine, thymidine, deoxycytidine, and deoxyguanosine.
In the present invention, the head piece means a starting compound for producing a compound library such as DEL. The structure of the head piece according to the present invention is not limited as long as it can accomplish the object of the present invention, but in the most typical embodiment, it has at least one site to which the building block can be linked and at least one site to which the oligonucleotide tag can be ligated and further contains at least one selectively cleavable site in the structure.
As described later, the DNA tag is preferably a double-stranded oligonucleotide chain and the site to which the oligonucleotide tag can be ligated is preferably two.
As one embodiment, the head piece is a compound shown in the following schematic drawing.
As one embodiment, the head piece is desirably to be chemically stable.
In addition, as one embodiment, the head piece preferably has a structure in which the DNA tag and the building block can be arranged in an appropriate space.
As one embodiment, it is preferable that the head piece has appropriate flexibility.
Here, more appropriate spatial arrangement and flexibility (structural characteristics of the head piece) will be explained. Here, the structural characteristics of the head piece to be explained may be achieved by the head piece alone or may be achieved by coupling the head piece with a bifunctional spacer.
As one embodiment, the preferred structural characteristics of the head piece are structural characteristics that the head piece or the DNA tag does not inhibit the forming reaction of the building block and conversely the head piece or the building block does not inhibit the elongation reaction of the DNA tag.
As one embodiment, the preferred structural characteristics of the head piece are structural characteristics that the head piece or the DNA tag portion does not affect the interaction between the building block compound (library compound) and the target (target protein, etc.).
As one embodiment, the preferred structural characteristics of the head piece are structural characteristics that the DNA tag and the building block site are oriented on opposite sides (for example, 90 degrees or more on the opposite side).
As one embodiment, the preferred structural characteristics of the head piece are structural characteristics that the loop site and the building block of the head piece are separated from several atoms to a dozen atoms in terms of the skeleton of the organic compound.
As one embodiment, the head piece preferably has an appropriate affinity with the DNA tag portion and the building block portion. The appropriate affinity means, for example, chemical reactivity and stability so that a bond can be formed, maintained, and cleaved under desired conditions for carrying out the present invention.
In the present invention, the bifunctional spacer means a spacer portion having at least two reactive groups that enables binding between the building block site and the head piece.
In the explanation of the present invention, the terms “head piece”, “head piece compound”, and “compound for the head piece” are terms referring to the same conceptual compound.
In the explanation of the present invention, a “compound used as a head piece” can be understood essentially the same as “use of a compound as a head piece” from the viewpoint of use and can be understood essentially the same as the “method of using the compound as a head piece” from the viewpoint of a method. The same applies to the compound library.
Hereinafter, a structure of the preferred head piece is explained but the structure of the head piece is not limited as long as the effects of the present invention are achieved.
As one embodiment, the head piece is comprised of
As one embodiment, the head piece is a compound represented by the following formula (I).
The compound represented by
wherein E and F are each independently an oligomer composed of nucleotides or nucleic acid analogues,
In the present invention, among the loop sites, the partial structure of the site that binds to the linker may be sometimes referred to as a linking site or (LS).
Also, in the present invention, E-LP-F may be sometimes collectively referred to as a hairpin site.
Hereinafter, preferred embodiments of the first oligonucleotide chain (E) and the second oligonucleotide chain (F) will be explained.
It is preferable that the first oligonucleotide chain (E) and the second oligonucleotide chain (F) form a duplex in the molecule via the loop site (LP) and the head piece form a hairpin structure. The chain length preferable for the formation of the duplex in the molecule is 3 bases or more, more preferably 4 bases or more, and further preferably 6 bases or more.
The chain length of E and F is, as one embodiment, each 3 to 40, respectively.
The chain length of E and F is, as one embodiment, each 4 to 40, respectively.
The chain length of E and F is, as one embodiment, each 6 to 25, respectively.
The site to which the oligonucleotide tag is linked preferably has a structure suitable for enzymatic ligation or chemical ligation. As one embodiment, the ligation between the head piece and the oligonucleotide tag is carried out by double-stranded ligation using an enzyme. In that case, it is preferable that the first and the second oligonucleotide chains form a protruding end for ligation. The above-mentioned chain length of the protruding end is preferably 2 bases or more, more preferably 2 to 10 bases, and further preferably 2 to 5 bases. Accordingly, it is preferable that one of the first and the second oligonucleotide chains is longer than the other chain by the chain length of the protruding end. Also, for ligation with the DNA ligase, among the first and the second oligonucleotide chains, it is preferable that the 5′ end of the chain having the 5′ end of the head piece is phosphorylated.
In addition, the first and the second oligonucleotide chains may contain a part or whole of the primer binding sequence for PCR. The appropriate chain length for the primer binding sequence is 17 to 25 bases.
Hereinafter, preferred embodiments of the linker (L) will be explained.
The linker is, as mentioned above, a site that elongates from the reactive functional group and binds to the linking site. Typically, the linker is a divalent group (-L-) derived from the following embodiments.
As one embodiment, the linker is the following embodiment (L1).
As other embodiments, L is the following embodiment (L2), (L3), (L4), or (L5).
Hereinafter, preferred embodiments of (D) the reactive functional group will be explained.
As described above, the reactive functional group has at least one site that can be linked directly or linked indirectly via the bifunctional spacer to the building block and is a site that binds to the linker group. Typically, the reactive functional group becomes a monovalent group (D-) in the head piece and it becomes a “divalent group derived from the reactive functional group” (-D-) based on the above-mentioned (D-) in the DEL.
For example, when D is an amino group, the specific structure of (D-) is (R—HN—) (R is a substituent explained below). For example, it reacts with an activated carboxy group, a reactive sulfonyl group, or an isocyanate group to form an amide bond, a sulfonamide bond, or urea bond, respectively. At that time, the specific structure of (-D-) is (—NR—).
R is not limited as long as the effects of the present invention are accomplished but in the following embodiments of (D1) to (D5), R is preferably (1) a hydrogen atom, or (2) a C1 to 6 alkyl group which is unsubstituted or substituted with 1 to 3 substituents selected solely or different from a substituent group consisting of a C1 to 6 alkoxy group, a fluorine atom, and a chlorine atom.
R is more preferably a hydrogen atom or a C1 to 3 alkyl group, and further preferably a hydrogen atom.
Also, for example, when (D-) is a methylene group having a leaving group (X—), the specific structure of (D-) is (X—CH2—) and, for example, it reacts with a nucleophilic reagent such as an amino group, a hydroxy group, or a thiol group to form a carbon-nitrogen bond, a carbon-oxygen bond or a carbon-sulfur bond. At that time, the specific structure of (-D-) is (—CH2—). Also, for example, when (D-) is an aldehyde group, the specific structure of (-D-) is (HOC—). The aldehyde group forms a carbon-nitrogen bond, for example, by the reductive amination reaction with an amino group, and at that time, (-D-) is —CH2—; and forms a carbon-carbon double bond, for example, by the reaction with a phosphorus-iride group, and at that time, (-D-) is —CH═; and forms a carbon-carbon triple bond, for example, by the reaction with an α-diazophosphonate group, and at that time, (-D-) is —C—.
As one embodiment, the site (D-) is the following embodiment (D1).
As other embodiments, (D-) is the following embodiment (D2), (D3), (D4), or (D5).
In this case, (-D-) can be —(C1 hydrocarbon)-, —NR—, —O—, —(C═O)—, —S—, —CH2—, —CH═, —C≡, and the like.
In this case, (-D-) can be —(C1 hydrocarbon)-, —NR—, —O—, —(C═O)—, —S—, —CH2—, —CH=, —C≡, and the like
In this case, (-D-) can be —CH2—, —NR—, —O—, or —(C═O)—, respectively.
In this case, (-D-) is —NH—.
Hereinafter, preferred embodiments of the loop site (LP) will be explained.
The loop site (LP) is preferably so designed that the first oligonucleotide chain (E) and the second oligonucleotide chain (F) form a duplex in the molecule and the head piece can form a hairpin structure. That is, the loop site (LP) preferably has a chain length that makes the loop structure thermodynamically stable and flexibility of bonding.
Accordingly, as one embodiment, the loop site (LP) is as follows.
Further preferred embodiments of the loop site are as explained above.
Hereinafter, the structure of the loop site will be further supplemented.
Here, the nucleotide is the natural nucleotide of the above-mentioned explanation and the nucleic acid analogues is as the above-mentioned explanation.
Here, LP1 is each a partial structure independently or differently selected with the number of p from the compound group described in the following (1) and (2), and LP2 is each partial structure selected solely or differently from the compound groups described in the following (1) and (2) with the number of q.
Selected solely or differently with the number of q is that, for example, when p is 4, LP1 can be selected solely or differently from the compound group described in (1) and (2), like AATG, ATCG, TC (d-Spacer) G or A (d-Spacer) (d-Spacer) C. The same applies to LP2.
Also, the loop site may contain a part or whole of the primer bond sequence for PCR.
As one embodiment, LS is (A) a nucleotide or (B) a nucleic acid analogue.
When LS is (A) a nucleotide or (B) a nucleic acid analogue, the loop site is a nucleic acid oligomer. The nucleic acid oligomer according to the present invention refers to an oligomer in which the nucleotide or the nucleic acid analogues is linked as a monomer. The oligomer can be also said to be a chain state compound.
Accordingly, the nucleic acid oligomer according to the present invention is any of an oligonucleotide chain, a nucleic acid analogue chain, or a mixed chain of a nucleotide and a nucleic acid analogue.
When LS is (A) a nucleotide or (B) a nucleic acid analogue, the loop site is a nucleic acid oligomer. In this case, the head piece can be produced by a nucleic acid synthesizer, which is markedly preferable in practice.
When LS is (A) a nucleotide or (B) a nucleic acid analogue, in the production of the head piece, as one embodiment, a monomer for nucleic acid synthesis in which the linker site (L) and the reactive functional group site (D) are bound to LS is prepared and then a nucleic acid oligomer can be synthesized.
Examples of such a monomer for nucleic acid synthesis include the above-mentioned Amino C6 dT, mdC (TEG-Amino), Uni-Link® Amino Modifier, and the like.
In the case of this embodiment, for example, among the structures of mdC (TEG-Amino), which is the monomer, the nucleotide portion corresponds to the linking site (LS), and the side chain portion elongating from the base corresponds to the linker site (L) and the reactive functional group site (D).
In the preparation, the reactive functional group (D) may be protected by a protective group.
In such a case, as one embodiment, the nucleic acid analogue is the following compound (B6).
As one embodiment, the nucleic acid analogue is the following compound (B61), (B62), (B63), (B64), or (B65).
When LS is (A) a nucleotide or (B) a nucleic acid analogue, in the production of the head piece, as one embodiment, a nucleic acid oligomer is firstly synthesized, and then the above-mentioned linker site (L) and the reactive functional group site (D) can be bound to it.
In such a case, it is preferable to put the “specific nucleic acid analogue” to which the linker site binds into the hairpin site (nucleic acid analogues oligomer) as the linking site (LS). Examples of the “specific nucleic acid analogue” include the above-mentioned Amino C6 dT, mdC (TEG-Amino), and Uni-Link® Amino Modifier.
In the case of this embodiment, for example, mdC (TEG-Amino) itself corresponds to a linking site (LS), and additional sites that further bind from the chain on the base side correspond to the linker site (L) and the reactive functional group site (D).
As mentioned above, it is preferable that the chain length of the above-mentioned loop site is the one so that the first oligonucleotide chain (E) and the second oligonucleotide chain (F) form a duplex in the molecule and the head piece forms the hairpin structure.
As one embodiment, the total number of p and q is 1 to 40.
As one embodiment, the total number of p and q is 2 to 20.
As one embodiment, the total number of p and q is 2 to 10.
As one embodiment, the total number of p and q is 2 to 7.
As one embodiment, the loop site of the present invention is comprised of
As one embodiment, LS is preferably B42, B43, or B44.
Also, as one embodiment, LP1 and LP2 are preferably A, B41, or B52.
As one embodiment, the loop site is a nucleic acid oligomer according to the sequences described in the following (X1) to (X9).
In the above-mentioned head piece, the number of the cleavable sites is preferably within 5 and more preferably 1 to 2.
In the above-mentioned head piece, when the cleavable sites are two or more, it is preferable that at least one cleavable site is in the first oligonucleotide chain or between the first oligonucleotide chain and the linker binding site and at least one cleavable site is in the second oligonucleotide chain or between the second oligonucleotide chain and the linker binding site.
As one embodiment, in the above-mentioned head piece, the position of the cleavable site is preferably within 20 bases, more preferably within 10 bases, and further preferably within 3 bases, starting from the binding portion between the loop site and the first oligonucleotide chain or the second oligonucleotide chain.
As one aspect, the present invention provides appropriate conditions in a method of inducing a DEL containing a cleavable site in a DNA strand to a cross linker-modified double-stranded DEL and evaluating it.
As one embodiment, in the above-mentioned head piece, the position of the cleavable site is present in the 3′ direction and is preferably within 20 bases, more preferably within 10 bases, further preferably within 3 bases, and most preferably within 1 base, starting from the binding portion between the loop site and the first oligonucleotide chain or the second oligonucleotide chain.
Although this is the explanation just in case, the preferred embodiment of the “selectively cleavable site” and, for example, the preferred embodiment of E, F or LP, etc., is different concept, respectively. That is, even if the position of the “selectively cleavable site” is included in E, the preferred embodiment of E does not necessarily apply to the “selectively cleavable site”.
As one embodiment, the compound constituting the DEL of the present invention is a compound represented by the following formula (II).
wherein
As one embodiment, preferred embodiments of E, F, LP, L, and D in the above-mentioned compound represented by the formula (II) are the same as the preferred embodiments of E, F, LP, L, and D explained with respect to the above-mentioned formula (I).
Preferred embodiments of X, Y, Sp, and An will be explained separately.
As described above, the bifunctional spacer is a spacer portion having at least two reactive groups that enables binding between the partial structure An of the compound library and the head piece. As one embodiment, the bifunctional spacer is SpD-SpL-SpX.
SpX is a reactive group that forms a covalent bond with the reactive functional group of the head piece.
SpD is a reactive group that forms a covalent bond with the partial structure An of the compound library.
SpL is a chemically inactive spacing portion
Similar to the reactive functional group (D), the reactive group (SpX) is a monovalent group (-SpX) in the bifunctional spacer substance itself (in the state of the reagent before binding to the head piece) and is a “divalent group derived from a reactive group” (-SpX-) based on the above-mentioned (-SpX) in the DEL (in the state of being bound to the head piece).
Also, similarly, the reactive group (SpD) is a monovalent group (SpD-) in the state before binding to An and is a “divalent group derived from a reactive group” (-SpD-) based on the above-mentioned (SpD-) in the DEL (in the state of being bound to An).
A preferred embodiment of SpX is a reactive group that forms an amino, carbonyl, amide, ester, urea, or sulfonamide bond. As one embodiment, SpX is a structure of the following (SpX1), (SpX2), or (SpX3), which is a reactive group suitable when the reactive functional group of the head piece is an amino group.
A preferred embodiment of SpD is the same as the above-mentioned D.
As one embodiment, SpD is the above-mentioned (D1), (D2), (D3), (D4), or (D5).
A preferred embodiment of SpL is the following embodiments.
As one embodiment, SpL is the above-mentioned (L1), (L2), (L3), (L4), or (L5).
As one embodiment, SpL is the following (SpL1), (SpL2), or (SpL3).
As one embodiment, the bifunctional spacer is as follows.
As one embodiment, (Sp-D-L) portion of the compound constituting the DEL is so constituted as (SpDL1), (SpDL2), (SpDL3) (SpDL4), (SpDL5), (SpDL6), (SpDL7), (SpDL8), (SpDL9), or (SpDL10).
In carrying out the present invention, it is advantageous if the head piece can be synthesized by a nucleic acid synthesizer. In this practice, as mentioned above, as one embodiment, a monomer for nucleic acid synthesis in which the linker site (L) and the reactive functional group site (D) are bound to LS can be prepared, and then a nucleic acid oligomer can be synthesized. Examples of such a monomer for nucleic acid synthesis include the above-mentioned Amino C6 dT, mdC (TEG-Amino), Uni-Link® Amino Modifier, and the like.
On the other hand, when the above-mentioned commercially available monomer for nucleic acid synthesis or a nucleic acid analogue that can be used in a nucleic acid synthesizer is used, there is a possibility that the length of the linker site is limited. In such a case, as one embodiment, by introducing an appropriate bifunctional spacer, it becomes possible to adjust the distance between the head piece and An, which is advantageous in carrying out the invention.
In the explanation of the present invention, “of C1 to C6” or “C1 to 6” in the terms such as a “C1 to C6 alkyl group” or a “C1 to 6 alkyl group” means that the number of carbon atom(s) is 1 to 6. Similarly, when m and n are integers and there are expression “of Cm to Cn” or “Cm to n”, the expression means that the number of carbon atom(s) is m to n. Accordingly, a “C1 to C6 alkyl group” or a “C1 to 6 alkyl group” means an alkyl group of which the number of carbon atom(s) is 1 to 6, and a “C1 to C6 alkylene” or a “C1 to 6 alkylene” means an alkylene of which the number of carbon atom(s) is 1 to 6.
In the present invention, the “C1 to 6 alkyl” means a linear or branched alkyl group of which the number of carbon atom(s) is 1 to 6. Specific examples include methyl, ethyl, propyl, isopropyl, butyl, isobutyl, sec-butyl, tert-butyl, pentyl, hexyl, and the like.
In the present invention, the “C1 to 3 alkyl” means a linear or branched alkyl group of which the number of carbon atom(s) is 1 to 3. Specific examples are methyl, ethyl, propyl, and isopropyl.
In the present invention, the “C1 to 6 alkoxy” means a linear or branched alkoxy of which the number of carbon atom(s) is 1 to 6. Specific examples include methoxy, ethoxy, propoxy, isopropoxy, butoxy, isobutoxy, sec-butoxy, tert-butoxy, pentyloxy, hexyloxy, and the like.
In the present invention, the “C1 to 3 alkoxy” means a linear or branched alkoxy of which the number of carbon atom(s) is 1 to 3. Specific examples are methoxy, ethoxy, propoxy, and isopropoxy.
In the present invention, the “hydrocarbon” means a linear, branched, or cyclic saturated or unsaturated compound comprised of carbon atom and hydrogen atom only.
In the present invention, the “aliphatic hydrocarbon” means a non-aromatic material among the hydrocarbons. The “aliphatic hydrocarbon” may be linear, branched, or cyclic, or may be saturated or unsaturated. Specific examples of the structure include alkyl, alkenyl, alkynyl, cycloalkyl, or cycloalkenyl, or a structure by a combination thereof.
In the present invention, the “C1 to 20 aliphatic hydrocarbon” means an aliphatic hydrocarbon of which the number of carbon atom(s) is 1 to 20.
In the present invention, the “C1 to 10 aliphatic hydrocarbon” means an aliphatic hydrocarbon of which the number of carbon atom(s) is 1 to 10.
In the present invention, the “C1 to 6 aliphatic hydrocarbon” means an aliphatic hydrocarbon of which the number of carbon atom(s) is 1 to 6.
In the present invention, the “aromatic hydrocarbon” means an aromatic one among the hydrocarbons.
In the present invention, the “C6 to 14 aromatic hydrocarbon” means an aromatic hydrocarbon of which the number of carbon atoms is 6 to 14. Specific examples include benzene, naphthalene, and anthracene.
In the present invention, the “C6 to 10 aromatic hydrocarbon” means an aromatic hydrocarbon of which the number of carbon atoms is 6 to 10. Specific examples are benzene or naphthalene.
The aromatic heterocyclic ring in the present invention is an aromatic heterocyclic ring having an element(s) selected solely or differently from the group consisting of nitrogen, oxygen, and sulfur as a hetero atom(s) in the cyclic structure.
As one embodiment, the aromatic heterocyclic ring is a “C1 to 9 aromatic heterocyclic ring” of which the number of carbon atom(s) is 1 to 9, and as one embodiment, the “C1 to 9 aromatic heterocyclic ring” is a “5 to 10-membered aromatic heterocyclic ring”.
As one embodiment, the aromatic heterocyclic ring is a “C1 to 5 aromatic heterocyclic ring” of which the number of carbon atom(s) is 1 to 5, and as one embodiment, the “C1 to 5 aromatic heterocyclic ring” is a “5 to 10-membered aromatic heterocyclic ring”.
As one embodiment, the aromatic heterocyclic ring is a “C2 to 9 aromatic heterocyclic ring” of which the number of carbon atoms is 2 to 9, and as one embodiment, the “C2 to 9 aromatic heterocyclic ring” is a “5 to 10-membered aromatic heterocyclic ring”.
As one embodiment, the aromatic heterocyclic ring is a “C2 to 5 aromatic heterocyclic ring” of which the number of carbon atoms is 2 to 5, and as one embodiment, the “C2 to 5 aromatic heterocyclic ring” is a “5 to 6-membered aromatic heterocyclic ring”.
The nitrogen-containing aromatic heterocyclic ring in the present invention is an aromatic heterocyclic ring having nitrogen in the cyclic structure as a hetero atom.
As one embodiment, the nitrogen-containing aromatic heterocyclic ring is a “C1 to 5 nitrogen-containing aromatic heterocyclic ring” of which the number of carbon atom(s) is 1 to 5, and as one embodiment, the “C1 to 5 nitrogen-containing aromatic heterocyclic ring” is a “5 to 6-membered aromatic heterocyclic ring”.
As one embodiment, the nitrogen-containing aromatic heterocyclic ring is a “C2 to 5 nitrogen-containing aromatic heterocyclic ring” of which the number of carbon atoms is 2 to 5, and as one embodiment, the “C2 to 5 nitrogen-containing aromatic heterocyclic ring” is a “5 to 6-membered aromatic heterocyclic ring”.
The non-aromatic heterocyclic ring in the present invention is a non-aromatic heterocyclic ring having an element(s) selected solely or differently from the group consisting of nitrogen, oxygen, and sulfur as a hetero atom(s) in the cyclic structure.
The non-aromatic heterocyclic ring may contain a partially-unsaturated bond.
As one embodiment, the non-aromatic heterocyclic ring is a “C2 to 9 non-aromatic heterocyclic ring” of which the number of carbon atoms is 2 to 9, and as one embodiment, the “C2 to 9 non-aromatic heterocyclic ring” is a “5 to 10-membered non-aromatic heterocyclic ring”.
In the present invention, the “trivalent group of C1 to 14” means a trivalent group derived from a compound of which the number of carbon atom(s) is 1 to 14. As long as the effect of the present invention is achieved, the structure is not limited.
In the present invention, when there is a description that “may be replaced with a hetero atom(s)”, the hetero atom means an atom other than carbon and hydrogen.
The hetero atom is preferably an oxygen atom, a nitrogen atom, a silicon atom, a phosphorus atom, or a sulfur atom, and more preferably an oxygen atom, a nitrogen atom, or a sulfur atom.
Accordingly, for example, when propyl (—CH2—CH2—CH3) is mentioned as an example of the hydrocarbon, the “propyl which may be replaced with a hetero atom(s)” is a concept including structures such as an ether ((—CH2—O—CH3) or (—O—CH2—CH3)) in which methylene (—CH2—) in the alkyl is replaced with oxygen, and an amine ((—CH2—NH—CH3) or (—NH—CH2—CH3)) in which it is replaced with nitrogen.
In the present invention, when there is a description that “may have a substituent(s)”, the substituent is not limited as long as it achieves the object of the present invention.
The substituent is preferably a C1 to 6 alkyl group, a C1 to 6 alkoxy group, an amino group, a hydroxy group, a nitro group, a cyano group, an oxo group, or a halogen atom.
The substituent is more preferably a C1 to 6 alkyl group, a C1 to 6 alkoxy group, a fluorine atom, or a chlorine atom.
In the present invention, the polypeptide and the peptide mean a compound or a partial structure formed by connecting amino acids. The amino acid is a general term for organic compounds having both functional groups of an amino group and a carboxy group. The amino acid that constitutes the polypeptide and peptide according to the present invention is not particularly limited and includes a modified amino acid, etc. In accordance with general usage in the field of life science, in the present invention, proline (classified as imino acid) is also included in the amino acids. The amino acid that constitutes the polypeptide and the peptide according to the present invention is preferably a amino acid and more preferably an “amino acid that “constitutes a protein”.
The halogen atom in the present invention includes a fluorine atom, a chlorine atom, a bromine atom, and an iodine atom.
A C—C, amino, ether, carbonyl, amide, ester, urea, sulfide, disulfide, sulfoxide, sulfonamide, and sulfonyl bond are chemical bonds having chemical structures understood by their respective names. Those skilled in the art understand that, for example, an ether bond is a bond that can generally be represented by “—O—” and a carbonyl bond is a bond that can generally be represented by “—C(═O)—”. An amino, amide, and urea bond have a hydrogen atom or other substituent(s) on the nitrogen atom, but the structure on the nitrogen atom is not limited as long as it has the effect of the present invention. The above-mentioned substituent(s) on the nitrogen atom is/are preferably a C1 to 6 alkyl group(s) or a hydrogen atom(s), and more preferably a hydrogen atom(s). Also, it may be needless to say, the C—C bond means a carbon-carbon bond. The C—C bond includes a single bond, a double bond, and a triple bond. As one embodiment, in the steps a and/or c in the production method according to the present invention, a bond appropriately selected from the above-mentioned 11 kinds is constructed. These 11 kinds of bonds are particularly basic bonding modes in organic chemistry and the reactions for constructing them are also well known to those skilled in the art. Accordingly, in designing and constructing the partial structure An of the compound library according to the present invention, those skilled in the art can combine these 11 kinds of bonds appropriately and use them.
The organic compound composed of an element selected alone or differently from the element group consisting of H, B, C, N, O, Si, P, S, F, Cl, Br, and I is an organic compound constructed by the bond of the above-mentioned 12 kinds of elements.
As one embodiment, the partial structure An of the compound library according to the present invention is constructed by the above-mentioned 12 kinds of elements. These 12 kinds of elements are particularly basic elements in organic compounds and the reactions for constructing them are also well known to those skilled in the art. Accordingly, in designing and constructing the partial structure An of the compound library according to the present invention, those skilled in the art can combine these 12 kinds of elements appropriately and used them.
A low molecular weight organic compound having a substituent(s) selected alone or differently from a substituent group consisting of an aryl group, a non-aromatic cyclyl group, a heteroaryl group, and a non-aromatic heterocyclyl group is a low molecular weight organic compound having a chemical structure understood by each name. The low molecular weight compound is a concept well known to those skilled in the art and examples of the preferred molecular weight of the low molecular weight compound in the present invention will be mentioned separately.
The aryl group in the present invention is preferably a C6 to 10 aryl group and more preferably a phenyl group.
The non-aromatic cyclyl group in the present invention is preferably a 5-membered to 8-membered non-aromatic cyclyl group and more preferably a 5-membered or 6-membered non-aromatic cyclyl group. The non-aromatic cyclyl group may contain a partially-unsaturated bond.
The heteroaryl group and the non-aromatic heterocyclyl group in the present invention are groups having an element selected alone or differently from the group consisting of nitrogen, oxygen, and sulfur as a hetero atom(s) in the cyclic structure. The heteroaryl group and the non-aromatic heterocyclyl group in the present invention is preferably a 5-membered to 8-membered group and more preferably a 5-membered or 6-membered group, and the non-aromatic heterocyclyl group may contain a partially-unsaturated bond.
As one embodiment, the partial structure An of the compound library according to the present invention has the above-mentioned 4 kinds of groups. These 4 kinds of groups are particularly basic partial structures in organic compounds and reactions for constructing them in the compounds are also well known to those skilled in the art. Accordingly, in designing and constructing the partial structure An of the compound library according to the present invention, those skilled in the art can appropriately combine and use these 4 kinds of groups.
The above-mentioned preferred embodiments, that is, a compound library constructed by the 11 kinds of bonds, the 12 kinds of elements and/or the 4 kinds of groups, have particular core value. Accordingly, those skilled in the art would understand that compound libraries constructed without these preferred embodiments would generally limit their use, and in many cases, would limit their commercial value.
The synthesis history of An means a general record of operations carried out until An is synthesized, and in particular, it means the structure of the building blocks the order thereof used until An is synthesized. For example, when reaction is carried out using respective different building block and/or different reaction conditions in two or more separate reaction vessels, an oligonucleotide chain having a previously determined sequence is ligated to the products in the respective reaction vessels before and after the reaction, whereby the synthesis history is imparted as sequence information of the oligonucleotide. By repeating such an operation until An is constructed, an oligonucleotide of Bn having the synthesis history of An is constructed
The split and pool synthesis is a synthetic method developed by Geisen et al., as a combinatorial chemical constructing method of a peptide library utilizing a solid-phase synthetic method in the early days of combinatorial chemistry. The split and pool synthesis is also called a split-mix method, etc.
In accordance with the above, when the synthesis of a peptide library utilizing a solid-phase synthetic method is explained as an example. In the split and pool synthesis, each step of increasing the terminal of the peptide, without cutting out the sample from the solid-phase carrier to which amino acids are peptide-bonded, N kinds of carriers are once mixed and homogenized, and then they are divided into equal parts and the terminal by the next N kinds of amino acids is increased.
That is, one kind of peptide chain is formed for each carrier, and when all 20 kinds of natural amino acids are applied at each stage, a peptide library combinable with all peptides having specific lengths is to be constructed.
If this peptide library is to be screened by antigen presentation or receptor binding, an assay can be carried out by utilizing a peptide on a solid-phase carrier using an ELISA method, etc. That is, it is not necessary to cut out the peptide of the sample from the carrier, and the carrier particles that have reacted in the assay are picked up (for example, the carrier particles of about 0.1 mm that are fluorescently labeled are picked up by an optical microscope). Then, the objective peptide sequence can be determined by the peptide of the particles using an instrument analyzer (peptide analyzer, etc.), or the peptide sequence that is indirectly becomes a candidate for screening can be determined by other combinatorial chemical identification method (for example, tag method), etc.
Further, in the production method according to the present invention, the following case will be explained where v kinds of structures when m is 2 and w kinds when m is 3 are synthesized by the split and pool synthesis. In the explanation, the steps are repeated in the order of (c) and (d).
(m=2)
In the step of m=2, to A1-Sp-C-B1, α2 is added in the step (c) and β2 in the step (d), respectively, to produce A2-Sp-C-B2.
Here, α2 (α2 (a-v)) with v kinds of structures and v kinds of β2 (β2 (a-v)) corresponding thereto are prepared and the steps (c) and (d) are each carried out for each structure, and then v kinds of A2-Sp-C-B2 (A2(a)-Sp-C-B2(a), A2(b)-Sp-C-B2(b) . . . A2(v)-Sp-C-B2(v): that is, A2(a-v)-Sp-C-B2(a-v)) can be obtained. In the split and pool synthesis, v kinds of A2-Sp-C-B2 are mixed and then divided into the number of w. Division means, most specifically, that it is subdivided into reaction vessels with the number of w.
(m=3)
In the step of m=3, to A2-Sp-C-B2, α3 is added in the step (c) and β3 in the step (d), respectively, to produce A3-Sp-C-B3.
Here, α3 (α3 (a-w)) with w kinds of structures and w kinds of β3 (β2 (a-w)) corresponding thereto are prepared, and to (A2 (a-v)-Sp-C-B2 (a-v) mixture) with the number of w, the steps (c) and (d) are each carried out. Then, through the steps of n=2 and 3, (v×w) kinds of A3-Sp-C-B3 is to be efficiently synthesized by (v+w) times of syntheses.
When the obtained products with the number of w are mixed, a mixture of (v×w) kinds of A3-Sp-C-B3 compound library is obtained. For example, if a binding test of a drug receptor is carried out to this mixture, screening of (v×w) kinds of compounds can be carried out at one time. By washing away the compounds that did not bind to the drug receptor, only the bound compounds can be isolated. In a DEL like the present invention, the DNA of the isolated A3-Sp-C-B3 compound is amplified to an amount that can be sequenced and the structure of A3 can be grasped from the sequence information.
A compound library, building blocks, split and pool, etc., are terms well known to those skilled in the art in fields such as combinatorial chemistry and can be carried out in a timely manner with reference to the following Literature, etc.
A DNA-encoded library (or DEL) is a compound library comprising a group of compounds labeled with DNA or oligonucleotides having substantially the same function as DNA (DNA-encoded compound). By the split and pool synthesis as mentioned above, the structure or synthesis history of each compound is imparted to the labeled DNA as sequence information. From such characteristics, the DNA-encoded library is screened in the form of a mixture of 102 to 1020 kinds of compounds and the DNA sequences contained in the obtained compounds are identified by techniques known in the art (for example, use of next-generation sequencers and/or use of microarrays), whereby it is possible to identify the structure of the compound. As one embodiment of the above-mentioned screening method, a method can be selected where a target such as a protein is contacted with a DNA-encoded library and a compound bound to the target is selected.
A “biological target” is a term well known to those skilled in the art, and as one embodiment, in the present invention, the “biological target” is a biological substance group that can be a target in the development of a drug, etc., represented by medical and agrochemical drugs, including, for example, an enzyme (for example, kinase, phosphatase, methylase, demethylase, protease, and DNA repair enzyme), a protein involved in protein-protein interaction (for example, a ligand for receptor), receptor target (for example, GPCR), ion channel, cell, bacteria, virus, parasite DNA, RNA, prion, and sugar.
“Biological activity evaluation” is a term well known to those skilled in the art, and as one embodiment, in the present invention, the “biological activity evaluation” is to evaluate the presence or absence, or strength of the biological activity possessed by a compound (for example, an ability to bind to a biological target, inhibitory function of enzyme activity, promotion function of enzyme activity, etc.). As specific examples of the biological activity evaluation, the above-mentioned Patent Documents 2 and 3, Non-Patent Documents 1 to 6, etc., can be also referred to.
“Functionality evaluation” is a term well known to those skilled in the art, and as one embodiment, in the present invention, the “functionality evaluation” is to evaluate the presence or absence, or strength of a specific function possessed by a compound (for example, binding ability, biological activity, luminescence property, etc.).
The present invention provides a plurality of methods having several advantages with respect to DEL and a method for producing DEL by using a DNA strand having a cleavable site. Forms 1 to 7 will be described in detail below.
The present invention provides a DEL using the above-mentioned “hairpin type head piece having a cleavable site”.
As exemplified in
As exemplified in
As exemplified in
As exemplified in
As exemplified in
For example, deoxyuridine may be used as the cleavable site in the first oligonucleotide chain (E) and deoxyinosine may be used as the cleavable site in the second oligonucleotide chain (F).
In this case, when the USER enzyme is used, deoxyuridine in the first oligonucleotide chain (E) can be selectively cleaved.
On the other hand, when alkyladenine DNA glycosylase and endonuclease VIII are used, in the second oligonucleotide chain (F), the cleavage site originating from deoxyinosine can be selectively cleaved.
Like this, by selecting the cleavage site as desired, a wider range of modifications of the DEL becomes possible and a wider means can be applied to the evaluation thereafter.
can be expected.
As exemplified in
The protruding end is utilized as a sticky end and a desired nucleic acid sequence, for example, UMIs (a specific molecule identification sequence), etc., can be ligated.
After the biological evaluation, to the selected DEL compound, the UMIs region is imparted as mentioned above, and by subjecting to DNA sequencing, it is possible to carry out the analysis in which amplification bias by PCR is reduced
Like this, in the present invention, by having selectively cleavable site(s) in the nucleic acid sequence, it is possible to impart unconventional properties in the aspect of production and use of the DEL compound.
Here, UMIs (a specific molecule identification sequence) is a molecular identifier that gives individual DNA sequence to each DNA molecule by imparting it to the DNA comprised in a certain sample (see Nature Method, 2012, vol. 9, pp. 72-74). By providing such a molecular identifier before PCR amplification, when the number of DNA molecules having a specific sequence in the sample is quantified, it is possible to identify PCR duplication (sequences derived from the same molecule) and quantification in reducing PCR amplification bias is possible.
As exemplified in
In accordance with
Here, the functional molecule is a molecule having a specific chemical or biological function (for example, solubility, photoreactivity, substrate-specific reactivity, target protein degradation-inducing property), and by imparting it to a DEL, it is possible to carry out evaluation or purification of the DEL depending on the function.
Here, biotin means all biotins that bind to avidin and includes not only vitamin B7 but also, for example, desthiobiotin.
As one aspect, the present invention provides appropriate conditions in a method of inducing a DEL containing a cleavable site in a DNA strand to a cross linker-modified double-stranded DEL and evaluating it.
As another method of preparing a DEL in which a hairpin-stranded DNA is converted into a single-stranded DNA, a method using the following exonuclease can also be mentioned.
The single-stranded DEL obtained above is preferably a single-stranded DEL having a library molecule in the 3′ direction of the oligonucleotide chain. The single-stranded DEL can be subjected to a primer elongation reaction using a cross linker-modified primer having a cross linker at the 5′ end. By this method, it is possible to easily synthesize a “cross linker-modified double-stranded DEL” in which a cross linker is linked to an oligonucleotide having a coding sequence via a covalent bond.
In the case of acquiring the single-stranded DEL having the library molecules in the 3′ direction, the “selectively cleavable site” of the hairpin type DEL as a starting material is present in the 3′ direction from the site to which the library molecule is bound.
As exemplified in
Furthermore, when the cross linker-modified primer is used, an optionally added primer may be elongated to induce to a cross linker-modified double-stranded DEL compound. Such a cross linker-modified double-stranded DEL compound is useful in the present invention because a covalent bond is formed between the biological target and the coding sequence after screening.
As exemplified in
In accordance with
In the scene of DEL evaluation, when the building block compound (library low molecular weight compound) binds to the target protein, the cross linker-modified double-stranded DEL compound can further bind the cross linker to the target protein, whereby detection sensitivity can be markedly improved (refer to Non-Patent Documents 7 and 11, etc.). In practice of the DEL technique for evaluating a large number of library compounds, it is extremely useful to enhance the affinity of the library compounds and to improve the detection sensitivity.
The present invention is to provide a novel and highly efficient method of producing a cross linker-modified double-stranded DEL compound, which is extremely useful.
As one aspect, the present invention provides appropriate conditions in a method of inducing a DEL containing a cleavable site in a DNA strand to a cross linker-modified double-stranded DEL and evaluating it.
As one embodiment, in the cross linker-modified double-stranded DEL compound, the cross linker is preferably linked to an oligonucleotide having a coding sequence via a covalent bond. Such a “cross linker-modified double-stranded DEL compound” is significantly useful because a covalent bond is formed between a target and a coding sequence after screening, and the compound is resistant to separation and elution conditions stronger than the conventional ones for removal of a non-specific binder, etc.
As one embodiment, in the present invention, the “cross linker” means a reactive group having reactivity capable of forming a covalent bond by a reaction with a biological target such as a protein or a nucleic acid molecule. For example, cross linkers as described in Thermo Scientific Crosslinking Technical Handbook are known.
The cross linker used in the present invention is preferably a reactive group comprising at least one of an azide group, a diazirine group, a sulfonyl fluoride group, a diazo group, a cinnamoyl group, or an acrylate group, and more preferably a reactive group comprising at least one of an azide group, a diazirine group, or a sulfonyl fluoride group.
As one embodiment, in the present invention, “having a cross linker” and “cross linker modification” mean having a partial structure comprising a cross linker as a substituent.
As one preferred embodiment, in the “cross linker-modified double-stranded DEL”, the “cross linker-modified DNA”, and the “cross linker-modified primer” bind directly or via a bifunctional spacer to the 5′ ends of the “double-stranded DEL”, the “DNA”, and the “primer”, respectively.
At that time, the cross linker is preferably a structure of any of the following formulae (AA) to (AE), or (BA) or (BB):
wherein * means a binding site with the 5′ end of the “double-stranded DEL”, “DNA”, or “primer”, or the bifunctional spacer side binding to the 5′ end.
As one preferred embodiment, the cross linker used in the present invention is a photoreactive cross linker. In the present invention, the photoreactive cross linker means a reactive group that is changed to a reactive group having a high reaction activity (for example, nitrene and carbene) by light irradiation and forms a covalent bond with a nearby biological target. For example, an azide group and a diazirine group are known, and the structures of the formulae (AA) to (AE) are known.
In addition, as one preferred embodiment, the cross linker used in the present invention is a reactive group comprising at least one sulfonyl fluoride group. For example, the sulfonyl fluoride group reacts with residues such as serine, threonine, tyrosine, lysine, cysteine, and histidine in a biological target protein to form a covalent bond. For example, the structures of the formulae (BA) and (BB) are known.
In the present invention, the crosslinking reaction between the cross linker and the biological target is preferably carried out in a temperature range in which the desired high-dimensional structure of the biological target does not change significantly. A preferred temperature is, for example, in a range of 4 to 40° C.
As described above, the bifunctional spacer is a spacer portion having at least two reactive groups that enables binding between the partial structure An of the compound library and the head piece.
Furthermore, in the present invention, the bifunctional spacer is a spacer moiety having two reactive groups that enable the bonding between the cross linker and the “double-stranded DEL,” “DNA”, or “primer”. The bifunctional spacer of the embodiment may be referred to as a “bifunctional spacer of a cross linker”. In contrast, the bifunctional spacer bonded to the compound library described above may be referred to as a “bifunctional spacer of a compound library”.
As one embodiment, the preferred embodiment of the “bifunctional spacer of the cross linker” is the same as the preferred embodiment of the “bifunctional spacer of the compound library” described above.
In addition, as one embodiment, the “bifunctional spacer of the cross linker” preferably has a molecular chain length suitable for reacting the cross linker with a biological target when the compound library binds to the biological target at the time of screening, and preferably has a molecular chain length equivalent to the “bifunctional spacer of the compound library”.
In the present invention, the “coding sequence” is a sequence moiety of an oligonucleotide having a sequence capable of identifying a structure of a library molecule among sequences of oligonucleotides comprised in DEL.
The “reactive group for cross linker modification” is not particularly limited as long as it is a reactive group capable of reacting with a cross linker unit described below.
As one embodiment, the “reactive group for cross linker modification” is a reactive group having reaction selectivity with a cross linker. By having reaction selectivity with the cross linker, reaction conditions under which the cross linker cannot be applied, such as the cross linker reacting first to undergoing structural transformation, can be applied to the present invention. That is, a unit having a reactive group for cross linker modification is introduced into the process according to the present invention, reaction conditions under which the cross linker cannot be applied are used in the process according to the present invention, and then the unit is reacted with the cross linker, whereby the cross linker required for the present invention can be introduced into the cross linker-modified DEL according to the present invention.
As one embodiment, the “reactive group for cross linker modification” and the “reactive group paired with the reactive group for cross linker modification” are a pair of reactive groups having high affinity in a binding reaction. When two compounds each having the pair are bonded, even if there are various other functional groups in the compound, the pair preferentially reacts with high selectivity to form a bond.
Examples of the above pairs include pairs of functional groups in the click reaction.
The “click reaction” is a concept well known to those skilled in the art. (See H. C. Kolb, M. G. Finn & K. B. Sharpless: Angew. Chem. Int. Ed., 40, 2004 (2001), etc.)
The “click reaction” is understood as one embodiment as follows.
The “click reaction” refers to a reaction that has at least the following characteristics: (1) exhibiting orthogonality of a functional group (that is, a functional moiety only reacts with a reactive site complementary to the functional moiety without reacting with other reactive sites); and (2) irreversibility of the obtained bond (that is, when the reactant reacts to form a product, decomposition of the product into the reactant becomes difficult), or in some cases, possible reversibility of the obtained bond (that is, under appropriate conditions, returns to the reactant). Optionally, the “click” chemistry can further have one or more of the following characteristics: (1) stereospecificity; (2) reaction conditions without strict purification, atmosphere control, etc.; (3) readily available starting materials and reagents; (4) harmless solvents available or no solvents available at all; (5) isolation of products by crystallization or distillation; (6) physiological stability; (7) high thermodynamic driving power (for example, 10 to 20 kcal/mol); (8) single reaction products; and (9) high chemical yields (for example, more than 50%).
As one embodiment, the “reactive group for cross linker modification” and the “reactive group paired with the reactive group for cross linker modification” are preferably reactive groups for the click reaction, more preferably an alkynyl group, an alkenyl group, an azide group, or a tetrazinyl group, and still more preferably any of the formulae (CA) to (CL).
Here, examples of the pair of “a reactive group for cross linker modification” and “a reactive group that is paired with a reactive group for cross linker modification” preferably include an azide group for an alkynyl group and a tetrazinyl group for an alkenyl group. These pairs are so-called bolt and nut relationships and are interchangeable. For example, when an alkynyl group is used as the “reactive group for cross linker modification”, an azide group can be used as the “reactive group paired with the reactive group for cross linker modification”. Their selection is well known to those skilled in the art.
Preferred examples of the “reactive group for cross linker modification” and the “reactive group paired with a reactive group for cross linker modification” include (CA) and (CH), (CB) and (CH), (CC) and (CH), (CE) and (CI), (CE) and (CJ), (CE) and (CK), (CE) and (CL), (CF) and (CI), (CF) and (CJ), (CF) and (CK), (CF) and (CL), (CG) and (CI), (CG) and (CJ), (CG) and (CK), (CG) and (CL), etc.
The “cross linker unit” is not particularly limited as long as it is a unit having the “reactive group paired with the reactive group for cross linker modification” as described above and a cross linker.
As one embodiment, the “cross linker unit” is composed of a “reactive group paired with a reactive group for cross linker modification”, a “bifunctional spacer”, and a “cross linker”. The embodiment of the “bifunctional spacer” is as described above.
The “DNA having a reactive group for cross linker modification” is not particularly limited as long as it is a compound having the “reactive group for cross linker modification” described above described above.
As one embodiment, the “DNA having a reactive group for cross linker modification” is composed of a “reactive group for cross linker modification,” a “bifunctional spacer,” and “DNA.” The embodiment of the “bifunctional spacer” is as described above.
For an embodiment of a “modified primer having a reactive group for cross linker modification”, the “cross linker-modified primer” described above is referred to. However, the term “cross linker” is replaced to refer to as a “reactive group for cross linker modification”.
Hereinafter, Examples are shown and the present invention will be described in more detail, but the present invention is not limited to these Examples. Various kinds of nucleic acids having sequences in Examples can be prepared, for example, according to a conventional method by an automated polynucleotide synthesizer. Examples of the automated polynucleotide synthesizer include nS-811 (manufactured by GeneDesign, Inc.), etc. In addition, for the preparation of the nucleic acid, consignment synthesis, contract labs, etc., can also be used. The contract labs well known to those skilled in the art include GeneDesign, Inc., LGC Biosearch Technologies, etc. In general, these contract labs prepare nucleic acids having the sequence specified by the consignor and deliver them to the consignor under a confidentiality agreement.
The compound having the sequence shown in Table 1 was prepared using the automated polynucleotide synthesizer nS-811 (manufactured by GeneDesign, Inc.). In the sequence notations in Table 1, as is obvious to those skilled in the art, each sequence unit is bound by a phosphodiester bond, and “A” means deoxyadenosine, “T” means thymidine, “G” means deoxyguanosine, “C” means deoxycytidine, “(dU)” means deoxyuridine, “(p)” means phosphoric acid, and “(amino-C6-dT)” means the modified nucleic acid represented by the following formula (1)
“(amino-NC6-dT)” means the modified nucleic acid represented by the following formula (2)
“(dSpacer)” means the group represented by the following formula (3)
and “(aminoC7)” means the group represented by the following formula (4)
In addition, amino-NC6-dT was introduced using the nucleic acid synthetic reagent of the following formula (5)
synthesized according to the method described in Journal of the American Chemical Society, 1993, vol. 115, pp. 7128-7134.
In Table 1, “No.” in the left column represents SEQ ID NO and “Seq.” in the right column represents a sequence. The left side of the sequence represents the 5′ side and the right side represents 3′ side. Also, the names of the compounds corresponding to each SEQ ID NO (No.) are as follows.
A 0.1 mM aqueous solution of each of the compounds having the sequences shown in Table 1 was prepared and investigation of the cleavage reaction by USER® enzyme was carried out by the following procedures.
To a PCR tube were added 1 μL of 0.1 mM aqueous solution of the compound having the sequence shown in Table 1; 10 μL of CutSmart® Buffer (available from New England BioLabs, Catalog number: B7204S); and 79 μL of deionized water. To the solution was added 10 μL of USER® enzyme (available from New England BioLabs, Catalog number: M5505S) and incubation of the obtained solution was started at 37° C.
Each reaction solution was sampled with each 20 μL after starting the incubation, 1 hour and 3 hours lapsed, respectively. U-DEL1-sh, U-DEL5-HP, U-DEL6-HP, U-DEL7-HP, U-DEL8-HP, U-DEL9-HP, and U-DEL10-HP were sampled with each 20 μL after 20 hours lapsed. U-DEL8-HP and U-DEL9-HP were further incubated at 90° C. for 1 hour and then sampled with each 20 μL, respectively.
Among the sampled solutions, U-DEL1-sh, U-DEL2-sh, U-DEL3-sh, and U-DEL4-sh were analyzed by Analytical condition 1 shown below, and U-DEL5-HP, U-DEL6-HP, U-DEL7-HP, U-DEL8-HP, U-DEL9-HP, and U-DEL10-HP were analyzed by Analytical condition 2 shown below.
The sequences and the expected molecular weights of the products (abasic product of deoxyuridine portion and cleaved fragments) assumed in each reaction solution and the molecular weight observed in each reaction solution are shown in Tables 2 and 3. In Tables 2 and 3, the notation of each column is as follows.
It indicates the experimental number and the substrates corresponding to each experimental number (Entry) are as follows.
“No.” (Second from the Left):
It indicates the sequence number. Among the respective SEQ ID NOs (Nos.), Nos. 1, 2, 3, 4, 5, 6, 7, 8, 9, and 10 are substrates of each reaction solution, Nos. 11, 14, 17, 20, 22, 25, 29, 31, 33, and 35 are dibasic products of the deoxyuridine portion of each substrate, and the remaining SEQ ID NOs are fragments of each substrate cleaved.
“Seq.” (Third from Left):
It represents the sequence, the left side represents the 5′ side, and the right side represents the 3′ side.
In the sequence notation, “(B)” means the group (debasic site) represented by the following formula (6)
and other notations are the same as in Table 1.
“Expected MW.” (fourth from left):
It represents the numerical values of the expected molecular weight (Da) of each sequence.
It represents the numerical value of the observed molecular weight (Da) identified as each sequence. “-” notation indicates that it has not been detected.
From the area ratio of the peak corresponding to each sequence detected, the conversion rate of the debasic reaction and of the cleavage reaction were calculated. In the debasic reaction, 9900 or more was converted in all the substrate at the stage of 1 hour at 37° C. (the peak of the substrate was less than 100 and the remaining peak was of the debasic product and the cleaved fragment only).
A graph showing the conversion rate of the cleavage reaction is also shown in
From the above results, at the partial structure of the hairpin type DEL containing various kinds of deoxyuridines, it was shown that, at deoxyuridine site, a debasic reaction by USER® enzyme and subsequently a cleavage reaction proceeded.
As in the schematic diagram shown in
and other notations are the same as in Table 1.
The names of the compounds corresponding to each SEQ ID NO (No.) are as follows.
The names of the compounds of the starting material head piece for synthesizing each hairpin DEL are each as follows.
Further, SEQ ID NO “No.” and the sequence “Seq” of U-DEL1-HP, U-DEL2-HP, U-DEL4-HP, and H-DEL-HP are as shown in Table 5.
The starting material head piece shown in Table 5 were prepared using the automated polynucleotide synthesizer nS-811 (manufactured by GeneDesign, Inc.) in the same manner as in Example 1.
To a PCR tube were added 2.0 μL of 1 mM aqueous solution of various kinds of the starting material head piece; 2.4 μL of 1 mM aqueous solution of Pr_TAG (it was prepared by annealing Pr_TAG_a and Pr_TAG_b synthesized in the same manner as in Example 1; the sequences are shown in Table 6); 0.8 μL of 10× ligase buffer (500 mM Tris hydrochloride, pH 7.5; 500 mM sodium chloride; 100 mM magnesium chloride; 100 mM dithiothreitol; and 20 mM adenosine triphosphate); and 2.0 μL of deionized water. To the solution was added 0.8 μL of a 10-fold diluted aqueous solution of T4DNA ligase (available from Thermo Fisher, Catalog number: EL0013) and the obtained solution was incubated at 16° C. for 24 hours. The sequence notation in Table 6 is the same as in Table 1. Also, the names of the compounds corresponding to each SEQ ID NO (No.) are as follows.
The reaction solution was treated with 0.8 μL of 5 M aqueous sodium chloride solution and 17.6 μL of cooled (−20° C.) ethanol and allowed to stand at −78° C. for 2 hours. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. To each pellet was added 2.0 μL of deionized water to prepare a solution.
To the obtained each solution were added 2.4 μL of 1 mM aqueous solution of CP (it was prepared by annealing CP_a and CP_b synthesized in the same manner as in Example 1; the sequences are shown in Table 7); 0.8 μL of 10× ligase buffer (500 mM Tris hydrochloride, pH 7.5; 500 mM sodium chloride; 100 mM magnesium chloride; 100 mM dithiothreitol; and 20 mM adenosine triphosphate), and 2.0 μL of deionized water. To the solution was added 0.8 μL of a 10-fold diluted aqueous solution of T4DNA ligase (available from Thermo Fisher, Catalog number: EL0013) and the obtained solution was incubated at 16° C. for 24 hours. The sequence notations in Table 7 are the same as in Table 1. Also, the names of the compounds corresponding to each SEQ ID NO (No.) are as follows.
The reaction solution was treated with 0.8 μL of 5 M aqueous sodium chloride solution and 17.6 μL of cooled (−20° C.) ethanol and allowed to stand at −78° C. for 2 hours. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. To the pellets was added 10 μL of deionized water to prepare a solution.
From the obtained solution, 1.0 μL was sampled, and after diluting with deionized water, mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 2 in Example 1 to identify the target product (the expected molecular weight and the observed molecular weight of each sequence are shown in Table 4). After lyophilizing the rest of the solution, deionized water was added to prepare a 20 M solution.
Among the eight kinds of the hairpin type DEL obtained as mentioned above, H-DEL is a conventional type hairpin DEL and the remaining seven kinds are cleavable hairpin DELs containing deoxyuridine. Real-time PCR analysis was carried out to compare the PCR efficiency of various kinds of hairpin type DELs before treatment with USER® enzyme and the PCR efficiency after the treatment. Also, as the double-stranded DEL to be compared, DS-DEL (it was prepared by annealing the compounds having the sequences No. 47 and No. 48) shown in Table 7 was used. In the sequence notations in Table 8, “(amino-C6-L)” means the group represented by the following formula (8)
and other notations are the same as in Table 1.
<Treatment Step with USER® Enzyme>
Treatment of eight kinds of hairpin DELs and double-stranded DEL (DS-DEL) with USER® enzyme was carried out by the following procedures.
To a PCR tube were added 1 μL of various kinds of 20 M DEL aqueous solution; 1 μL of CutSmart® Buffer (available from New England BioLabs, Catalog number: B7204S); and 7 μL of deionized water. To the solution was added 1 μL of USER® enzyme (available from New England BioLabs, Catalog number: M5505S) and the obtained solution was incubated at 37° C. for 1 hour.
Samples of various kinds of DELs before the treatment with USER® enzyme and the reaction solutions after the treatment were each diluted with deionized water to prepare DEL samples with 0.05 pM, 0.5 pM, and 5 pM.
The Ct value of various kinds of the DEL samples obtained as mentioned above was measured by real-time PCR and the PCR efficiencies were compared. The conditions are as mentioned below and the results are shown in
The sequence notations in Table 9 are the same as in Table 1.
As shown in
This result shows that the DEL cleaved by the USER® enzyme has improved PCR efficiency than that before the cleavage and that the cleavable hairpin DEL containing deoxyuridine was cleaved by USER® enzyme with high efficiency and high selectively.
The compounds having the sequences shown in Table 10 (hairpin DEL) were synthesized by the following procedures. In the sequence notations in Table 10, “[mdC (TEG-Amino)]” means the group represented by the following formula (9)
and other notations are the same as in Table 4.
The names of the compounds corresponding to each SEQ ID NO (No.) are as follows.
The names of the compounds of the starting material head piece for synthesizing each hairpin DEL are each as follows.
Further, SEQ ID NO “No.” and the sequences “Seq” of U-DEL11-HP, U-DEL12-HP, and U-DEL13-HP are as mentioned in the following Table 11. The notations in Table 11 are the same as in Table 10.
Among the starting material head piece shown in Table 11, U-DEL12-HP and U-DEL13-HP were prepared using the automated polynucleotide synthesizer nS-811 (manufactured by GeneDesign, Inc.) in the same manner as in Example 1. U-DEL11-HP was also prepared according to a conventional method.
Similar to Example 2, using various kinds of starting material head pieces, two-step double-stranded ligation with the double-stranded oligonucleotide Pr_TAG and CP was carried out.
A part of the obtained solution was sampled, and after diluting with deionized water, mass spectrometry by ESI-MS was carried out under Analytical condition 3 shown below to identify the target product (the expected molecular weight and the observed molecular weight of each sequence are shown in Table 10). After lyophilizing the rest of the solution, deionized water was added to prepare a 20 M solution.
Verification of the cleavage reaction of the hairpin DEL (U-DEL5, U-DEL7, U-DEL9, U-DEL11, U-DEL12, and U-DEL13) containing 6 kinds of deoxyuridines by USER® enzyme was carried out by the following procedures.
To a PCR tube were added 2 μL of various kinds of 20 M hairpin DEL aqueous solution; 2 μL of CutSmart® Buffer (available from New England BioLabs, Catalog number: B7204S); and 14 μL of deionized water. To the solution was added 2 μL of USER® enzyme (available from New England BioLabs, Catalog number: M5505S), and the obtained solution was incubated at 37° C. for 16 hours and further incubated at 90° C. for 1 hour.
<Confirmation of Product after Cleavage by LC-MS Measurement>
From the obtained reaction solutions, 5.0 μL was sampled, and after diluting with deionized water, mass spectrometry by ESI-MS was carried out under Analytical condition 3. The sequences and the expected molecular weights of the products after the cleavage assumed in each reaction solution and the molecular weight observed in each reaction solution are shown in Table 12. The substrates corresponding to each experimental numbers (Entry) are as follows and other notations are the same as in Table 10.
In any of the samples, no MS of the substrate was detected, and the MS of the product after the cleavage was observed as the main peak.
Also, among the obtained reaction solutions, a part thereof was sampled and analyzed by modified polyacrylamide gel electrophoresis under the conditions shown below. From the results shown in
From the above results, in the hairpin type DEL containing various kinds of deoxyuridines, it was shown that the cleavage reaction by USER® enzyme proceeded at the deoxyuridine site.
The compounds having the sequence shown in Table 13 (hairpin DEL) were synthesized by the following procedures. In the sequence notations in Table 13, “I” means deoxyinosine and other notations are the same as in Table 2.
The names of the compounds corresponding to each SEQ ID NO (No.) are as follows.
The names of the compounds of the starting material head piece for synthesizing each hairpin DEL are each as follows.
Further, SEQ ID NO “No.” and the sequence “Seq” of I-DEL1-HIP, I-DEL2-HP, I-DEL3-HP, and I-DEL4-HP are as shown in Table 14. The notations in Table 14 are the same as in Table 13.
The starting material head pieces shown in Table 14 were prepared according to a conventional method.
Similar to Example 2, using various kinds of starting material head pieces, two-step double-stranded ligation with the double-stranded oligonucleotide Pr_TAG and CP was carried out.
A part of the obtained solution was sampled, and after diluting with deionized water, mass spectrometry by ESI-MS was carried out under Analytical condition 3 to identify the target product (the expected molecular weight and the observed molecular weight of each sequence are shown in Table 13). After lyophilizing the rest of the solution, deionized water was added to prepare a 20 M solution.
Verification of the cleavage reaction of the 4 kinds of hairpin DELs (I-DEL1, I-DEL2, I-DEL3, and I-DEL4) containing deoxyinosines by endonuclease V was carried out by the following procedures.
To a PCR tube were added 1 μL of various kinds of 20 M hairpin DEL aqueous solution; 2 μL of NEBuffer® 4 (available from New England BioLabs, Catalog number: B7004); and 15 μL of deionized water. To the solution was added 2 μL of Endonuclease V (available from New England BioLabs, Catalog number: M0305S), and the obtained solution was incubated at 37° C. for 24 hours.
<Confirmation of Product after Cleavage Using LC-MS Measurement>
Among the obtained reaction solutions, 8.0 μL was sampled, and after diluting with deionized water, mass spectrometry by ESI-MS was carried out under Analytical condition 3. The sequences and the expected molecular weights of the products after the cleavage assumed in each reaction solution and the molecular weight observed in each reaction solution are shown in Table 15. The substrates corresponding to each experimental numbers (Entry) are as follows and other notations are the same as in Table 13.
In any of the samples, no MS of the substrate was detected, and the MS of the product after the cleavage was observed as the main peak.
Also, among the obtained reaction solutions, a part was sampled and analyzed by modified polyacrylamide gel electrophoresis under the same conditions as in Example 3. From the results shown in
From the above results, in the various kinds of hairpin type DEL containing deoxyinosines, it was shown that the second phosphodiester bond in the 3′ direction from the deoxyinosine was cleaved by endonuclease V.
The compound having the sequence shown in Table 16 (hairpin DEL) was synthesized by the following procedures. In the sequence notations in Table 16, “u” means uridine and other notations are the same as in Table 2.
The name of the compound corresponding to SEQ ID NO (No.) is as follows.
The name of the compound of the starting material head piece for synthesizing each hairpin DEL is as follows.
Further, SEQ ID NO “No.” of R-DEL1-HP and the sequence “Seq” are as shown in Table 17. The notations in Table 17 are the same as in Table 16.
The starting material head pieces shown in Table 17 were prepared according to a conventional method.
Similar to Example 2, using starting material head pieces, two-step double-stranded ligation with the double-stranded oligonucleotide Pr_TAG and CP was carried out.
A part of the obtained solutions was sampled, and after diluting with deionized water, mass spectrometry by ESI-MS was carried out under Analytical condition 3 to identify the target product (the expected molecular weight and the observed molecular weight of each sequence are shown in Table 16). After lyophilizing the rest of the solution, deionized water was each added to prepare a 200 pM solution.
Verification of the cleavage reaction of the hairpin DEL (R-DEL1) containing ribonucleoside by RNaseHII was carried out by the following procedures.
To a PCR tube were added 0.5 μL of 200 M hairpin DEL aqueous solution; 4.9 μL of ThermoPol® Reaction Buffer Pack (available from New England BioLabs, Catalog number: B9004); and 43.6 μL of deionized water. To the solution was added 1 μL of RNase HII (available from New England BioLabs, Catalog number: M0288S), and the obtained solution was incubated at 37° C. for 8 hours.
<Confirmation of Product after Cleavage Using LC-MS Measurement>
Among the obtained reaction solutions, 10 μL was sampled and mass spectrometry by ESI-MS was carried out under Analytical condition 3. The sequences and the expected molecular weights of the products after the cleavage assumed in each reaction solution and the molecular weight observed in each reaction solution are shown in Table 18. The substrates corresponding to each experimental numbers (Entry) are as follows and other notations are the same as in Table 16.
In any of the samples, no MS of the substrate was detected, and the MS of the product after the cleavage was observed as the main peak.
Also, among the obtained reaction solutions, a part was sampled and analyzed by modified polyacrylamide gel electrophoresis under the same conditions as in Example 3. From the results shown in
From the above results, in the hairpin type DEL containing ribonucleoside, it was shown that the phosphodiester bond at the 5′ side of the ribonucleotide was cleaved by RNaseHII.
As in the schematic diagram shown in
In Table 19, “Tag No.” (leftmost) represents a tag number, “No.” (second from the left) represents SEQ ID NO, and “Seq.” (third from the left) represents a sequence. The sequence notations are the same as in Table 1.
Each double-stranded oligonucleotide tag was prepared by annealing 2 kinds of oligonucleotides having a SEQ ID NO corresponding to each tag number, as shown in Table 19.
The compound “AOP-U-DEL9-HP” having the sequence shown in Table 20 was synthesized by the following procedures. In the sequence notations in Table 20, “(AOP-AminoC7)” means the group represented by the following formula (10)
and other notations are the same as in Table 2.
To four violamo centrifuge tubes was added a solution (2.5 mL, 1 mM) of U-DEL9-HP in a sodium borate buffer (150 mM, pH 9.4) cooled to 10° C. To the respective tubes were added 40 equivalent of N-Fmoc-15-amino-4,7,10,13-tetraoxaoctadecanoic acid (250 μL, 0.4M N,N-dimethylacetamide solution), subsequently 40 equivalents of 4-(4,6-dimethoxy[1.3.5]triazin-2-yl)-4-methylmorpholinium chloride hydrate (DMTMM) (200 μL, 0.5 M aqueous solution), and the obtained solution was shaken at 10° C. for 5 hours.
The above-mentioned solutions were each treated by 295 μL of a 5 M aqueous sodium chloride solution and 9.7 mL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. overnight. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. To the pellets were each added 2.75 mL of deionized water to dissolve them, 306 μL of piperidine was added at 0° C., and the mixture was shaken at 10° C. for 3 hours. After the mixture was centrifuged, the precipitates were removed by filtration and washed with 1.47 mL of deionized water twice. The obtained filtrates were each treated with 600 μL of a 5 M aqueous sodium chloride solution and 19.8 mL of cooled (−20° C.) ethanol and allowed to stand at −78° C. overnight. After centrifugation, the supernatant was removed and the obtained pellets were air-dried.
To the obtained pellets was added 10 mL of deionized water to make it a solution. Among the obtained solutions, a part was sampled and diluted with deionized water. After then, mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 2 in Example 1 to identify the target product (the expected molecular weight and the observed molecular weight of the compound are shown in Table 20). After lyophilizing the rest of the solution, deionized water was each added to prepare a 5 mM solution.
The compound “AOP-U-DEL9-HP-Pr” having the sequence shown in Table 21 was synthesized by ligating the compound “AOP-U-DEL9-HP” and the double-stranded oligonucleotide tag “Pr” according to the following procedures. The sequence notations in Table 21 are the same as in Table 20.
To a violamo centrifuge tube were added 40 μL of 5 mM aqueous solution of the compound “AOP-U-DEL9-HP”; 160 μL of 100 mM aqueous sodium hydrogen carbonate solution water; 240 μL of 1 mM aqueous solution of the double-stranded oligonucleotide tag “Pr”; 80 μL of 10× ligase buffer (500 mM Tris hydrochloride, pH 7.5; 500 mM sodium chloride; 100 mM magnesium chloride; 100 mM dithiothreitol; and 20 mM adenosine triphosphate); and 272 μL of deionized water. To the solution was added 8.0 μL of T4DNA ligase (available from Thermo Fisher, Catalog number: EL0013), and the obtained solution was incubated at 16° C. for 24 hours.
The reaction solution was treated with 80 μL of 5 M aqueous sodium chloride solution and 2640 μL of cooled (−20° C.) ethanol and allowed to stand at −78° C. for 2 hours. After centrifugation, the supernatant was removed, and 400 μL of deionized water was added to the obtained pellets. The obtained solution was concentrated by Amicon® Ultra Centrifugal filter (30 kD cutoff). A part of the obtained solution was sampled, and mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 2 to identify the target product (the expected molecular weight and the observed molecular weight of the compound are shown in Table 21). According to the above procedures, 133 nmol of the compound “AOP-U-DEL9-HP-Pr” with a purity of 84.5% was obtained. To the obtained compound “AOP-U-DEL9-HP-Pr” was added a 100 mM aqueous sodium hydrogen carbonate solution to prepare a 1 mM solution.
To each of three PCR tubes were added 20 μL of 1 mM solution of the compound “AOP-U-DEL9-HP-Pr” obtained as mentioned above; 30 μL of a 1 mM aqueous solution of one of the double-stranded oligonucleotide tags A1 to A3; 8.0 μL of 10× ligase buffer (500 mM Tris hydrochloride, pH 7.5; 500 mM sodium chloride; 100 mM magnesium chloride; 100 mM dithiothreitol; and 20 mM adenosine triphosphate); and 21.6 μL of deionized water. To the solution was added 0.4 μL of T4DNA ligase (available from Thermo Fisher, Catalog number: EL0013), and the obtained solution was incubated at 16° C. for 18 hours.
The reaction solutions were each treated with 8.0 μL of a 5 M aqueous sodium chloride solution and 264 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed and the obtained pellets were each dissolved in 20 μL of 150 mM sodium borate buffer (pH 9.4).
To each tube were added 40 equivalents of one of the building blocks BB1 to BB3 (4.0 μL, 200 mM N,N-dimethylacetamide solution), subsequently 40 equivalents of 4-(4,6-dimethoxy[1.3.5]triazin-2-yl)-4-methylmorpholinium chloride hydrate (DMTMM) (4.0 μL, 200 mM aqueous solution), and the mixture was shaken at 10° C. for 2 hours. Further, to each tube were added 20 equivalents of building blocks (2.0 μL, 200 mM N,N-dimethylacetamide solution), subsequently 20 equivalents of DMTMM (2.0 μL, 200 mM aqueous solution), and the mixture was shaken at 10° C. for 30 minutes.
The reaction solutions were each treated with 3.2 μL of a 5 M aqueous sodium chloride solution and 106 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, and to the obtained pellets was added each 18 μL of deionized water, and 3 kinds of the solutions were mixed in one PCR tube.
To the mixed solution was added 6.0 μL of piperidine at 0° C., and the mixture was shaken at room temperature for 1 hour. The reaction solution was treated with 6.0 μL of a 5 M aqueous sodium chloride solution and 198 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 18 hours. After centrifugation, the supernatant was removed, and 400 μL of deionized water was added to the obtained pellets. The obtained solution was concentrated by Amicon® Ultra Centrifugal filter (30 kD cutoff), a 100 mM aqueous sodium hydrogen carbonate solution was added to prepare a 1 mM solution, and used in the next step as a starting material.
To each of three PCR tubes were added 13.7 μL of 1 mM solution obtained in Cycle A as a starting material; 20.6 μL of 1 mM aqueous solution of one of the three double-stranded oligonucleotide tags B1 to B3; 5.5 μL of 10× ligase buffer (500 mM Tris hydrochloride, pH 7.5; 500 mM sodium chloride; 100 mM magnesium chloride; 100 mM dithiothreitol; and 20 mM adenosine triphosphate); and 14.8 μL of deionized water. To the solution was added 0.3 μL of T4DNA ligase (available from Thermo Fisher, Catalog number: EL0013), and the obtained solution was incubated at 16° C. for 16 hours.
The reaction solutions were each treated with 5.5 μL of a 5 M aqueous sodium chloride solution and 181 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed and the obtained pellets were each dissolved in 13.7 μL of 150 mM sodium borate buffer (pH 9.4).
To each tube were added 80 equivalents of one of the building blocks BB1 to BB3 (5.5 μL, 200 mM N,N-dimethylacetamide solution), subsequently 80 equivalents of DMTMM (5.5 μL, 200 mM aqueous solution), and the mixture was shaken at 10° C. for 1 hour. Further, to each tube were added 40 equivalents of the building block (2.3 μL, 200 mM N,N-dimethylacetamide solution), subsequently 40 equivalents of DMTMM (2.3 μL, 200 mM aqueous solution), and the mixture was shaken at 10° C. for 2 hours.
The reaction solutions were each treated with 2.5 μL of a 5 M aqueous sodium chloride solution and 81.4 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, and to the obtained pellets was added each 12.3 μL of deionized water, and 3 kinds of the solutions were mixed in one PCR tube.
To the mixed solution was added 4.1 μL of piperidine at 0° C., and the mixture was shaken at room temperature for 3 hours. The reaction solution was treated with 4.1 μL of a 5 M aqueous sodium chloride solution and 136 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 3 hours. After centrifugation, the supernatant was removed and 400 μL of deionized water was added to the obtained pellets. The obtained solution was concentrated by Amicon® Ultra Centrifugal filter (30 kD cutoff), a 100 mM aqueous sodium hydrogen carbonate solution was added to prepare a 0.48 mM solution, and used in the next step as a starting material.
To each of three PCR tubes were added 14.5 μL of 0.48 mM solution of the starting material obtained in Cycle B; 10.5 μL of 1 mM aqueous solution of one of the three double-stranded oligonucleotide tags C1 to C3; and 2.8 μL of 10× ligase buffer (500 mM Tris hydrochloride, pH 7.5; 500 mM sodium chloride; 100 mM magnesium chloride; 100 mM dithiothreitol; and 20 mM adenosine triphosphate). To the solution was added 0.14 μL of T4DNA ligase (available from Thermo Fisher, Catalog number: EL0013), and the obtained solution was incubated at 16° C. for 16 hours.
The reaction solutions were each treated with 2.8 μL of a 5 M aqueous sodium chloride solution and 92 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, and the obtained pellets were each dissolved in 7.0 μL of 150 mM sodium borate buffer (pH 9.4).
To each tube were added 80 equivalents of one of the building blocks BB1 to BB3 (2.8 μL, 200 mM N,N-dimethylacetamide solution), subsequently 80 equivalents of DMTMM (2.8 μL, 200 mM aqueous solution), and the mixture was shaken at 10° C. for 1 hour. Further, to each tube were added 40 equivalents of the building block (1.4 μL, 200 mM N,N-dimethylacetamide solution), subsequently 40 equivalents of DMTMM (1.4 μL, 200 mM aqueous solution), and the mixture was shaken at 10° C. for 2 hours.
The reaction solutions were each treated with 1.3 μL of a 5 M aqueous sodium chloride solution and 41.4 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, and each 6.3 μL of deionized water was added to the obtained pellets, and then 3 kinds of the solutions were mixed in one PCR tube.
To the mixed solution was added 2.1 μL of piperidine at 0° C., and the mixture was shaken at room temperature for 2 hours. The reaction solution was treated with 2.1 μL of a 5 M aqueous sodium chloride solution and 69 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 3 hours. After centrifugation, the supernatant was removed, and 400 μL of deionized water was added to the obtained pellets. The obtained solution was concentrated by Amicon® Ultra Centrifugal filter (30 kD cutoff), a 100 mM aqueous sodium hydrogen carbonate solution was added to prepare a 0.41 mM solution, and used in the next step as a starting material.
To a PCR tube were added 12.2 μL of 0.41 mM solution of the starting material obtained in Cycle C; 6.0 μL of 1 mM aqueous solution of CP (the same as used in Example 2); 2.1 μL of 10× ligase buffer (500 mM Tris hydrochloride, pH 7.5; 500 mM sodium chloride; 100 mM magnesium chloride; 100 mM dithiothreitol; and 20 mM adenosine triphosphate), and 0.7 μL of deionized water. To the solution was added 0.1 μL of T4DNA ligase (available from Thermo Fisher, Catalog number: EL0013), and the obtained solution was incubated at 16° C. for 16 hours.
The reaction solution was treated with 2.1 μL of a 5 M aqueous sodium chloride solution and 69.6 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, and 400 μL of deionized water was added to the obtained pellets. The obtained solution was concentrated by Amicon® Ultra Centrifugal filter (30 kD cutoff), and deionized water was added to prepare a 20 pM solution.
The samples after ligation of the double-stranded oligonucleotide tag in each cycle were analyzed by electrophoresis using a 2.2% agarose gel (manufactured by Lonza, FlashGel® cassette, Catalog number: 57031). From the results shown in FIG. 17, in each cycle, it was confirmed that coding with the double-stranded oligonucleotide tag was achieved with high efficiency. The samples in each lane in
The samples after the completion of Cycle C were analyzed under Analytical condition 3.
According to the above, by the above-mentioned synthesis procedures, the synthesis of the model library containing the 3×3×3 (27) compound species using U-DEL9-HP as a starting material was achieved.
The cleavage reaction of the model library obtained above by USER® enzyme was carried out by the following procedures.
To a PCR tube were added 2.0 μL of 20 M aqueous solution of the model library; 2 μL of CutSmart® Buffer (available from New England BioLabs, Catalog number: B7204S), and 14 μL of deionized water. To the solution was added 2 μL of USER® enzyme (available from New England BioLabs, Catalog number: M5505S), and the obtained solution was incubated at 37° C. for 16 hours and further incubated at 90° C. for 1 hour.
Among the obtained reaction solutions, a part thereof was sampled, and analysis was carried out by modified polyacrylamide gel electrophoresis under the same conditions as in Example 3. From the results shown in
[Conversion of DEL Compound from Hairpin DNA to Single-Stranded DNA and Addition of New Function]
AAZ-DEL-HP having a sequence shown in Table 22 was synthesized by the following procedures. In the sequence notation in Table 22, “(AAZ-AOP-AminoC7)” means the group represented by the following formula (11)
and other notations are the same as in Table 2.
To each of eight PCR tubes were added dimethyl sulfoxide (599 L), 4-oxo-4-[(5-sulfamoyl-1,3,4-thiadiazol-2-yl)amino)butanoic acid (37.5 μL, 0.2 M dimethyl sulfoxide solution), sodium 1-hydroxy-2,5-dioxopyrrolidine-3-sulfonate (60 μL, 0.33 M dimethyl sulfoxide/deionized water (2:1, v/v) solution), subsequently 1-(3-dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride (72 μL, 0.1 M dimethyl sulfoxide solution), and the obtained solution was shaken at 30° C. for 30 minutes. Then, to each solution were added 150 μL of triethylamine hydrochloride buffer (500 mM, pH 10), subsequently an aqueous solution of AOP-U-DEL9-HP (synthesized in Example 6) (75 μL, 0.67 mM), and the mixture was shaken at 37° C. for 6 hours.
The reaction solutions was combined into one violamo centrifuge tube, treated with 800 μL of 5 M aqueous sodium chloride solution and 26.3 mL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. The pellets were dissolved in deionized water, and purified by reverse phase HPLC using Phenomenex Gemini C18 column. Using a dual mobile phase gradient profile, the target product was eluted using 50 mM triethyl ammonium acetate buffer (pH7.5) and acetonitrile/water (100:1, v/v). Fractions containing the target product were collected, mixed and concentrated, respectively. The obtained solution was desalted by Amicon® Ultra Centrifugal filter (3 kD cutoff), and ethanol precipitation was carried out, and then deionized water was added to the pellets to prepare a 1 mM solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 2 in Example 2 to identify the target product AAZ-DEL-HP (the expected molecular weight and the observed molecular weight of the compound are shown in Table 22).
<Synthesis of the Starting Material Head Pieces for 3 Kinds of DEL Compounds (SABA-DEL-HP, ClSABA-DEL-HP and mSABA-DEL-HP)>
The compound having the sequence shown in Table 23 was synthesized by the following procedures. In the sequence notation in Table 23, “(SABA-AOP-AminoC7)” means the group represented by the following formula (12)
“(ClSABA-AOP-AminoC7)” means the group represented by the following formula
“(mSABA-AOP-AminoC7)” means the group represented by the following formula (14)
and other notations are the same as in Table 2.
The names of the compounds corresponding to each SEQ ID NO: (No.) are as follows.
The starting material carboxylic acids for synthesizing each compound are as follows.
To each PCR tube was added a starting material carboxylic acid (50 μL, 0.2 M N,N-dimethylacetamide solution). To each tube were added 3-hydroxytriazolo[4,5-b]pyridine (16.7 μL, 0.6 M N,N-dimethylacetamide solution), N,N′-diisopropylcarbodiimide (16.7 μL, 0.6 M N,N-dimethylacetamide solution), subsequently N,N-diisopropylethylamine (16.7 μL, 0.6 M N,N-dimethylacetamide solution), and the obtained solution was shaken at 25° C. for 30 minutes. Then, to each solution was added a solution (100 μL, 1 mM) of AOP-U-DEL9-HP (synthesized in Example 6) in a sodium borate buffer (250 mM, pH 9.4), and the mixture was shaken at 25° C. for 90 minutes.
The respective solutions were treated with 20 μL of 5 M aqueous sodium chloride solution and 660 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. The respective pellets were dissolved in 50 mM triethyl ammonium acetate buffer (pH 7.5), and purified by reverse phase HPLC using Phenomenex Gemini C18 column. Using a dual mobile phase gradient profile, the target product was eluted using 50 mM triethyl ammonium acetate buffer (pH7.5) and acetonitrile/water (100:1, v/v). Fractions containing the target product were collected, mixed, and concentrated, respectively. Each of the obtained solutions was desalted by the Amicon® Ultra Centrifugal filter (3 kD cutoff), and ethanol precipitation was carried out, and then 25 μL of deionized water was added to the pellets to prepare a 1 mM aqueous solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 2 in Example 2 to identify each target product (the expected molecular weight and the observed molecular weight of each compound are shown in Table 23).
<Synthesis of 5 Kinds of DEL Compounds Having Biotin at 3′ End (“AAZ-BIO-DEL”, “SABA-BIO-DEL”, “ClSABA-BIO-DEL”, “mSABA-BIO-DEL”, and “Amino-BIO-DEL”)>
The DEL compounds having the sequences shown in Table 24 were synthesized by the following procedures. In the sequence notation in Table 24, “(BIO)” means the group represented by the following formula (15)
and other notations are the same as in Tables 2, 20, 22, and 23.
The names of the compounds corresponding to each SEQ ID NO: (No.) are as follows.
The names of the compounds of the starting material head pieces for synthesizing each DEL compound are each as follows.
To a PCR tube were added 10 μL of 1 mM aqueous solution of various kinds of the starting material head pieces; 12 μL of 1 mM aqueous solution of Pr_TAG2_CP-BIO (it was prepared by annealing Pr_TAG2_CP_a and Pr_TAG2_CP-BIO_b synthesized in the same manner as in Example 1, the sequence is shown in Table 25); 4 μL of 10× ligase buffer (500 mM Tris hydrochloride, pH 7.5; 500 mM sodium chloride; 10 mM magnesium chloride; 100 mM dithiothreitol; and 20 mM adenosine triphosphate), and 10 μL of deionized water. To the solution, 4 μL of a 10-fold diluted aqueous solution of T4DNA ligase (available from Thermo Fisher, Catalog number: EL0013) was added, and the obtained solution was incubated at 16° C. overnight. The sequence notation in Table 25 is the same as in Table 24. Also, the names of the compounds corresponding to each SEQ ID NO: (No.) are as follows.
The reaction solution was treated with 4 μL of 5 M aqueous sodium chloride solution and 132 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, the obtained pellets were air-dried, and the pellets were dissolved in deionized water. The obtained solution was desalted by the Amicon® Ultra Centrifugal filter (3 kD cutoff).
Of the obtained supernatant, a part thereof was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify each target product (the expected molecular weight and the observed molecular weight of each compound are shown in Table 24).
<Cleavage by USER® Enzyme of 5 Kinds of DEL Compounds Having Biotin at 3′ End (AAZ-BIO-DEL, SABA-BIO-DEL, ClSABA-BIO-DEL, mSABA-BIO-DEL, and Amino-BIO-DEL)>
The 5 kinds of DEL compounds “AAZ-BIO-DEL”, “SABA-BIO-DEL”, “ClSABA-BIO-DEL”, “mSABA-BIO-DEL”, and “Amino-BIO-DEL” obtained above were cleaved by USER® enzyme in the following procedures to convert into DEL compounds “DS-AAZ-BIO-DEL”, “DS-SABA-BIO-DEL”, “DS-ClSABA-BIO-DEL”, “DS-mSABA-BIO-DEL”, and “DS-Amino-BIO-DEL”, each having the double-stranded nucleic acid having the sequence shown in Table 26. The sequence notation in Table 26 is the same as in Table 24, and it means that the 5 kinds of compounds are formed by double strands of the oligonucleotide chains of SEQ ID NOs: 124 and 125, SEQ ID NOs: 126 and 127, SEQ ID NOs: 128 and 129, SEQ ID NOs: 130 and 131, and SEQ ID NOs: 132 and 133, respectively.
To a PCR tube were added 10 μL of 100 μM aqueous solution of various kinds of DEL compounds; 100 μL of CutSmart® Buffer (available from New England BioLabs, Catalog number: 72405), and 860 μL of deionized water. To the solutions was each added 30 μL of USER® enzyme (available from New England BioLabs, Catalog number: 5505S), and the obtained solution was incubated at 37° C. for 1 hour.
The obtained reaction solution was desalted and concentrated by the Amicon® Ultra Centrifugal filter (3 kD cutoff), and ethanol precipitation was carried out, and then deionized water was added to each obtained pellet to prepare an aqueous solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under Analytical condition 3 in Example 3 to identify the DEL compounds “DS-AAZ-BIO-DEL”, “DS-SABA-BIO-DEL”, “DS-ClSABA-BIO-DEL”, “DS-mSABA-BIO-DEL”, and “DS-Amino-BIO-DEL” having the double-stranded nucleic acid of interest (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 26).
Also, a part of the obtained reaction solution was sampled, and analysis by modified polyacrylamide gel electrophoresis was carried out under the same conditions as in Example 3. From the results shown in
The DEL compounds having the double-stranded nucleic acids obtained above “DS-AAZ-BIO-DEL”, “DS-SABA-BIO-DEL”, “DS-CISABA-BIO-DEL”, “DS-mSABA-BIO-DEL”, and “DS-Amino-BIO-DEL” were each treated with streptavidin beads, and the DEL compounds having a single-stranded DNA “SS-AAZ-DEL”, “SS-SABA-DEL”, “SS-ClSABA-DEL”, “SS-mSABA-DEL”, and “SS-Amino-DEL” were prepared by the following procedures. The 5 kinds of compounds are oligonucleotide chains of SEQ ID NOs: 125, 127, 129, 131, and 133 in Table 26, respectively.
To five PCR tubes was each added 450 μL of Magnosphere (trademark) MS160/Streptavidin (JSR Life Sciences, Catalog number: J-MS-S160S), the supernatant was removed by magnetic separation, and then 900 μL of 1× binding buffer (10 mM Tris hydrochloride, pH 7.5; 0.5 mM ethylenediamine tetraacetic acid; 1 M sodium chloride; and 0.05% v/v Tween 20) was added and the supernatant was removed by magnetic separation. An aqueous solution (700 μmol, 450 μL, each) of “DS-AAZ-BIO-DEL”, “DS-SABA-BIO-DEL”, “DS-ClSABA-BIO-DEL”, “DS-mSABA-BIO-DEL”, or “DS-Amino-BIO-DEL” and 450 μL of 2× binding buffer (20 mM Tris hydrochloric acid, pH 7.5; 1 mM ethylenediaminetetraacetic acid; 2 M sodium chloride; and 0.1% v/v Tween 20) were added to the obtained particles and mixed, and the mixture was shaken at room temperature for 20 minutes.
The supernatant was removed from each mixture by magnetic separation, and washing of particles using 900 μL of 1× binding buffer (10 mM Tris hydrochloride, pH 7.5; 0.5 mM ethylenediamine tetraacetic acid; 1 M sodium chloride; and 0.05% v/v Tween 20) and removal of the supernatant by magnetic separation were each repeated twice. Then, each 900 μL of a denaturing solution (0.1 M sodium hydroxide; and 0.1 M sodium chloride) was added, and the supernatant was recovered by magnetic separation.
900 μL of 3-(N-morpholino)propanesulfonic acid buffer (1.0 M, pH 7.0) was each added to the obtained supernatant and desalted by the Amicon® Ultra Centrifugal filter (3 kD cutoff). Each of the obtained supernatants was subjected to ethanol precipitation, and then deionized water was added to the pellets to prepare a solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3. The molecular weights of 24255.3, 24170.5, 24208.8, 24176.7, and 23984.8 were observed and the object DEL compounds having the single-stranded DNA of interest “SS-AAZ-DEL”, “SS-SABA-DEL”, “SS-ClSABA-DEL”, “SS-mSABA-DEL”, and “SS-Amino-DEL” were identified.
The photoreactive cross linker-modified primer having the sequence shown in Table 27 “PXL-Pr” was synthesized by the following procedures. In the sequence notation in Table 27, “X” means the group represented by the following formula (16)
and other notations are the same as in Table 2.
To a PCR tube was added a solution (200 TL, 1 mM) of L-Pr (synthesized in the same manner as in Example 1; the sequence is shown in Table 28) in a sodium borate buffer (150 mM, pH 9.4) cooled to 10° C. To the tube were added 40 equivalents of N-Fmoc-15-amino-4,7,10,13-tetraoxaoctadecanoic acid (20 μL, 0.4 M N,N-dimethylacetamide solution), subsequently 40 equivalents of 4-(4,6-dimethoxy[1.3.5]triazin-2-yl)-4-methylmorpholinium chloride hydrate (DMTMM) (16 μL, 0.5 M aqueous solution), and the resulting mixture was shaken at 10° C. for 4 hours. The sequence notation in Table 28 is the same as in Table 8.
The reacted solution was treated with 23.6 μL of 5 M aqueous sodium chloride solution and 778.8 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. overnight. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. To the pellets was added 180 μL of deionized water to prepare a solution, and then 20 μL of piperidine was added and the mixture was shaken at 10° C. for 3 hours.
The resulting solution was treated with 20 μL of 5 M aqueous sodium chloride solution and 660 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, and to the obtained pellets was added 200 μL of deionized water to prepare a 1 mM solution.
To 100 μL of the solution obtained above were added 75 μL of triethylamine hydrochloride buffer (500 mM, pH 10), subsequently 50 equivalents of sodium 1-((3-(3-methyl-3H-diazirin-3-yl)propanoyl)oxy)-2,5-dioxopyrrolidine-3-sulfonate (Sulfo-SDA) (25 μL, 200 mM aqueous solution), and the mixture was shaken at 37° C. for 2 hours.
The resulting solution was treated with 20 μL of 5 M aqueous sodium chloride solution and 660 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, and to the obtained pellets were added 100 μL of deionized water, subsequently 75 μL of triethylamine hydrochloride buffer (500 mM, pH 10) and 50 equivalents of Sulfo-SDA (25 μL, 200 mM aqueous solution), and the mixture was shaken at 37° C. for 1 hour and 20 minutes. 50 equivalents of Sulfo-SDA (25 μL, 200 mM aqueous solution) was further added and the mixture was shaken at 37° C. for 40 minutes.
The resulting solution was treated with 22.5 μL of 5 M aqueous sodium chloride solution and 743 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. overnight. After centrifugation, the supernatant was removed, and to the obtained pellets were added 100 μL of deionized water, subsequently 75 μL of triethylamine hydrochloride buffer (500 mM, pH 10), subsequently 50 equivalents of Sulfo-SDA (25 μL, 200 mM aqueous solution), and the mixture was shaken at 37° C. for 3 hours.
The resulting solution was treated with 20 μL of 5 M aqueous sodium chloride solution and 660 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. overnight. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. The pellets were dissolved in 50 mM triethyl ammonium acetate buffer (pH 7.5) and purified by reverse phase HPLC using Phenomenex Gemini C18 column. Using a dual mobile phase gradient profile, the target product was eluted using 50 mM triethylammonium acetate buffer (pH7.5) and acetonitrile/water (100:1, v/v). Fractions containing the target product were collected, mixed, and concentrated. The resulting solution was desalted by the Amicon® Ultra Centrifugal filter (3 kD cutoff), ethanol precipitation was carried out, and then 100 μL of deionized water was added to the pellets to prepare a solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the target photoreactive cross linker-modified primer “PXL-Pr” (the expected molecular weight and the observed molecular weight of the compound are shown in Table 27).
Using the DEL compounds having the single-stranded DNA obtained above (“SS-AAZ-DEL”, “SS-SABA-DEL”, “SS-ClSABA-DEL”, “SS-mSABA-DEL”, and “SS-Amino-DEL”) as template DNAs, a primer elongation reaction using “PXL-Pr” was carried out in the following procedures to synthesize the photoreactive cross linker-modified double-stranded DEL compounds having the sequences shown in Table 29 (“PXL-DS-AAZ-DEL”, “PXL-DS-SABA-DEL”, “PXL-DS-CISABA-DEL”, “PXL-DS-mSABA-DEL”, and “PXL-DS-Amino-DEL”). The sequence notation in Table 29 is the same as in Tables 26 and 27, and it means that the 5 kinds of the compounds are formed by double strands of the oligonucleotide chains of SEQ ID NOs: 136 and 125, SEQ ID NOs: 136 and 127, SEQ ID NOs: 136 and 129, SEQ ID NOs: 136 and 131, and SEQ ID NOs: 136 and 133, respectively.
To a PCR tube were added 30 μL of 10 μM aqueous solution of DEL compounds having various single-stranded DNAs; 0.505 μL of 594 μM “PXL-Pr” aqueous solution; 60 μL of 10× NEBluffer® 2 (available from New England BioLabs, 5 Catalog number: B7002S), and 476 μL of deionized water. To the solution were added 6 μL of DNA Polymerase I, Large (Klenow) Fragment (available from New England BioLabs, Catalog number: M0210), and 12 μL of Deoxynucleotide (dNTP) Solution Mix (available from New England BioLabs, Catalog number: N0447), and the resulting solution was incubated at 25° C. for 90 minutes.
The resulting solution was desalted by the Amicon® Ultra Centrifugal filter (3 kD cutoff). To the obtained supernatant was added deionized water to obtain a 60 μL solution. Then, the solution was treated with 6 μL of 5 M aqueous sodium chloride solution and 198 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. To the pellets was added 30 μL of deionized water to prepare a solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the target photoreactive cross linker-modified double-stranded DELs “PXL-DS-AAZ-DEL”, “PXL-DS-SABA-DEL”, “PXL-DS-ClSABA-DEL”, “PXL-DS-mSABA-DEL”, and “PXL-DS-Amino-DEL” (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 29).
In addition, a part of the obtained reaction solution was sampled and analysis by polyacrylamide gel electrophoresis was carried out under the conditions mentioned below. From the results shown in
Each of the 5 kinds of the photoreactive cross linker-modified double-stranded DELs obtained above was diluted with deionized water to prepare a 50 nM DEL sample.
The reaction solution after the UV radiation was mixed with Dyanebeads his-tag pulldown and incubated at room temperature for 30 minutes. The mixture was fixed to a magnet stand and left to stand for 2 minutes, and then the supernatant was removed and 200 μL of Wash buffer was added to suspend Dynabeads. This washing operation was further repeated five times. 80 μL of D-PBS(−) was added to the Dybabeads after the washing and the mixture was reacted at 95° C. for 10 minutes. After the reaction, Dynabeads were placed on a magnet stand and the supernatant was collected after 2 minutes (solution E).
<Preparation of Sample without Photocrosslinking Reaction>
The above series of operations were carried out without UV radiation, and the solution S and the solution E were prepared as samples, respectively.
The Ct value of various kinds of DEL samples obtained above was measured by real-time PCR and the PCR efficiencies were compared. The conditions are as mentioned below and the results are shown in
The strength of the affinity between the CA9 protein and each compound as a binder is expected to be in the following order (Non-Patent Documents 6 and 7). “PXL-DS-AAZ-DEL”>“PXL-DS-SABA-DEL”>“PXL-DS-CISABA-DEL”>“PXL-DS-mSABA-DEL”>“PXL-DS-Amino-DEL (negative control)”
As shown in the graph of
On the other hand, in the solution E with UV radiation, the Ct values of any compounds were significantly reduced as compared to the negative control. That is, the result suggests that in the case of carrying out the photocrosslinking reaction in the DEL screening, acquisition of a binder having medium affinity can be expected as well.
The result means that a photoreactive cross linker-modified double-stranded DEL derived from a hairpin type DEL having “selectively cleavable site(s)” is useful in DEL screening utilizing a photocrosslinking reaction.
Conversion of the DEL compound having a double-stranded nucleic acid “DS-Amino-BIO-DEL” to the DEL compound having a single-stranded DNA “SS-Amino-DEL” by Lambda Exonuclease treatment was carried out by the following procedures.
To a PCR tube were added an aqueous solution of DS-Amino-BIO-DEL (500 μmol); 5 μL of 10× Lambda Exonuclease Reaction Buffer (available from New England BioLabs, Catalog number: B0262); and 1 μL of Lambda Exonuclease (available from New England BioLabs, Catalog number: M0262), and then a solution was prepared with deionized water so that a total solution amount was 50 μL. The resulting solution was incubated at 37° C. for 30 minutes.
10 μL of the obtained reaction solution was sampled and mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3, whereby the molecular weight of 24024.9 was observed and the DEL compound having the single-stranded DNA of interest “SS-Amino-DEL” was identified.
MS of one oligonucleotide chain (SEQ ID NO: 132) comprised in DS-Amino-BIO-DEL was not detected and MS of the product after becoming the single-strand was observed as a main peak, whereby it was confirmed that becoming the single-strand was carried out with high yield.
In the same manner as in Example 7, 5 kinds of DEL compounds having a single-stranded DNA (“SS-AAZ-DEL”, “SS-SABA-DEL”, “SS-ClSABA-DEL”, “SS-mSABA-DEL”, and “SS-Amino-DEL”) were prepared. However, in the step of <Synthesis of 5 kinds of DEL compounds having biotin at 3′ end> described in Example 7, Pr_TAG2_CP (Pr_TAG2_CP_a and Pr_TAG2_CP_b synthesized in the same manner as in Example 1 were annealed, and the sequences are shown in Table 31) was used instead of Pr_TAG2_CP-BIO, and the preparation of DEL having a single-stranded DNA was carried out in the same manner as in Example 9 instead of the step of <Preparation of DEL having a single-stranded DNA using streptavidin beads>. The sequence notation in Table 31 is the same as in Table 25. Also, the names of the compounds corresponding to each SEQ ID NO: (No.) are as follows.
The photoreactive cross linker-modified primer “PXL-Pr2” having the sequence shown in 32 was synthesized by the following procedures. In the sequence notation in Table 32, “(X2)” means the group represented by the following formula (17)
and other notations are the same as in Table 2.
To a PCR tube was added 3-(3-methyl-3H-diazilin-3-yl)propanoic acid (5 μL, 0.2 M N,N-dimethylacetamide solution). To the tube were added 1-[bis(dimethylamino)methylene]-1H-1,2,3-triazolo[4,5-b]pyridinium 3-oxide hexafluorophosphate (2.5 μL, 0.4 M N,N-dimethylacetamide solution), subsequently N,N-diisopropylethylamine (2.5 μL, 0.4 M N,N-dimethylacetamide solution), and the resulting solution was shaken at 4° C. for 10 minutes. To the resulting solution was added a solution (100 μL, 1 mM) of L-Pr (the sequence is shown in Table 28) in a sodium borate buffer (250 mM, pH 9.5) and the mixture was shaken at 10° C. for 30 minutes.
The solution was treated with 11 μL of 5 M aqueous sodium chloride solution and 363 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. overnight. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. The obtained pellets were dissolved in deionized water and the solution was desalted by the Amicon® Ultra Centrifugal filter (3 kD cutoff)
A part of the obtained supernatant was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the photoreactive cross linker-modified primer of interest “PXL-Pr2” (the expected molecular weight and the observed molecular weight of each compound are shown in Table 32).
Using the DEL compounds having the single-stranded DNA (“SS-AAZ-DEL”, “SS-SABA-DEL”, “SS-ClSABA-DEL”, “SS-mSABA-DEL”, and “SS-Amino-DEL”) obtained above as template DNAs, a primer elongation reaction using “PXL-Pr2” was carried out in the same procedures as in Example 7 to synthesize photoreactive cross linker-modified double-stranded DEL compounds having the sequences shown in Table 33 (“PXL-DS-AAZ-DEL2”, “PXL-DS-SABA-DEL2”, “PXL-DS-CISABA-DEL2”, “PXL-DS-mSABA-DEL2”, and “PXL-DS-Amino-DEL2”). The sequence notation in Table 33 is the same as in Tables 26 and 32, and it means that the 5 kinds of the compounds are formed by double strands of the oligonucleotide chains of SEQ ID NOs: 142 and 125, SEQ ID NOs: 142 and 127, SEQ ID NOs: 142 and 129, SEQ ID NOs: 142 and 131, and SEQ ID NOs: 142 and 133, respectively.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the photoreactive cross linker-modified double-stranded DELs of interest “PXL-DS-AAZ-DEL2”, “PXL-DS-SABA-DEL2”, “PXL-DS-ClSABA-DEL2”, “PXL-DS-mSABA-DEL2”, and “PXL-DS-Amino-DEL2” (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 33).
Also, a part of the obtained reaction solution was sampled and analysis by polyacrylamide gel electrophoresis was carried out under the conditions shown in Example 7. From the results shown in
A DEL compound having a single-stranded DNA “SS-Amino-DEL3” (the sequence is shown in Table 34) was prepared using an automated polynucleotide synthesizer nS-811 (manufactured by GeneDesign, Inc.) in the same manner as in Example 1. “SS-Amino-DEL3” has the same structure as the oligonucleotide derived by the same procedures as <Preparation of 5 kinds of DELs having a single-stranded DNA> using “U-DEL12-HP” (the sequence is shown in Table 11) as a starting material head piece. In addition, the sequence notation in Table 34 is the same as in Table 10.
A DEL compound having a single-stranded DNA having the sequence shown in Table 35 “SS-AAZ-DEL3” was synthesized by the following procedures. In the sequence notation in Table 35, “[AAZ-mdC (TEG-Amino)]” means the group represented by the following formula (18)
and other notations are the same as in Table 10.
In the same manner as in Example 7<Synthesis of starting material head piece (AAZ-DEL-HIP) of DEL compound>, a condensation reaction with 4-oxo-4-[(5-sulfamoyl-1,3,4-thiadiazol-2-yl)amino)butanoic acid was carried out using “SS-Amino-DEL3” as a starting material.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the DEL compound having the single-stranded DNA of interest “SS-AAZ-DEL3” (the expected molecular weight and the observed molecular weight of the compound are shown in Table 35)
3 kinds of DEL compounds having a single-stranded DNA having the sequences shown in Table 36 (“SS-SABA-DEL3”, “SS-ClSABA-DEL3”, and “SS-mSABA-DEL3”) were synthesized using the following procedures. In the sequence notation in Table 36, “[SABA-mdC (TEG-Amino)]” means the group represented by the following formula (19)
“[ClSABA-mdC (TEG-Amino)]” means the group represented by the following formula (20)
“[mSABA-mdC (TEG-Amino)]” means the group represented by the following formula (21)
and other notations are the same as in Table 10. The names of the compounds corresponding to each SEQ ID NO: (No.) are as follows.
The starting material carboxylic acids for synthesizing each compound are as follows.
A starting material carboxylic acid (4 μL, 0.2M N,N-dimethylacetamide solution) was added to a PCR tube. To the tube were added 1-[bis(dimethylamino)methylene]-1H-1,2,3-triazolo[4,5-b]pyridinium 3-oxide hexafluorophosphate (2 μL, 0.4 M N,N-dimethylacetamide solution), subsequently N,N-diisopropylethylamine (2 μL, 0.4 M N,N-dimethylacetamide solution), and the resulting solution was shaken at 10° C. for 30 minutes. To the resulting solution was added a solution (50 μL, 1 mM) of SS-Amino-DEL3 in a sodium borate buffer (250 mM, pH 9.5) and the mixture was shaken at 10° C. for 2 hours.
The above solution was treated with 5.8 μL of 5 M aqueous sodium chloride solution and 192 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed and the obtained pellets were air-dried.
The obtained pellets were dissolved in 50 mM triethylammonium acetate buffer (pH 7.5) and purified by reverse phase HPLC using Phenomenex Gemini C18 column. Using a dual mobile phase gradient profile, the target product was eluted using 50 mM triethylammonium acetate buffer (pH 7.5) and acetonitrile/500 mM triethylammonium acetate buffer (9:1, v/v). Fractions containing the target product were collected, mixed, and concentrated. The resulting solution was desalted by the Amicon® Ultra Centrifugal filter (3 kD cutoff), ethanol precipitation was carried out, and then deionized water was added to the pellets to prepare a solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify “SS-SABA-DEL3”, “SS-ClSABA-DEL3”, and “SS-mSABA-DEL3” of interest (the expected molecular weight and the observed molecular weight of each compound are shown in Table 36).
<Synthesis of photoreactive cross linker-modified primer “PXL-Pr3”>
A photoreactive cross linker-modified primer having the sequence shown in Table 37 “PXL-Pr3” was synthesized by the same procedures as in the above <Synthesis of photoreactive cross linker-modified primer “PXL-Pr2”>. However, as a starting material, “L-Pr3” (synthesized in the same manner as in Example 1, and the sequence is shown in Table 38) was used instead of “L-Pr”. The sequence notation in Table 37 is the same as in Table 32. In addition, the sequence notation in Table 38 is the same as in Table 8.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the photoreactive cross linker-modified primer “PXL-Pr3” of interest (the expected molecular weight and the observed molecular weight of each compound are shown in Table 37).
Using the DEL compounds having the single-stranded DNA obtained above (“SS-AAZ-DEL3”, “SS-SABA-DEL3”, “SS-ClSABA-DEL3”, “SS-mSABA-DEL3”, and “SS-Amino-DEL3”) as template DNAs, a primer elongation reaction using “PXL-Pr3” was carried out by the same procedures as in Example 7 to synthesize 5 kinds of the photoreactive cross linker-modified double-stranded DEL compounds having the sequences shown in Table 39 (“PXL-DS-AAZ-DEL3”, “PXL-DS-SABA-DEL3”, “PXL-DS-ClSABA-DEL3”, “PXL-DS-mSABA-DEL3”, and “PXL-DS-Amino-DEL3”). The sequence notation in Table 39 is the same as in Tables 34 to 37, and it means that the 5 kinds of the compounds are formed by double strands of the oligonucleotide chains of SEQ ID NOs: 150 and 144, SEQ ID NOs: 150 and 145, SEQ ID NOs: 150 and 146, SEQ ID NOs: 150 and 147, and SEQ ID NOs: 150 and 143, respectively.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify each photoreactive cross linker-modified double-stranded DEL compound “PXL-DS-AAZ-DEL3”, “PXL-DS-SABA-DEL3”, “PXL-DS-CISABA-DEL3”, “PXL-DS-mSABA-DEL3”, and “PXL-DS-Amino-DEL3” of interest (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 39).
Also, a part of the obtained reaction solution was sampled and analysis by polyacrylamide gel electrophoresis was carried out under the conditions shown in Example 7. From the results shown in
Lane 1: 20 bp DNA Ladder (manufactured by Lonza, Lonza 20 bp DNA Ladder, Catalog number: 50330)
Each of the 4 kinds of the photoreactive cross linker-modified double-stranded DELs obtained in Example 10 (PXL-DS-mSABA-DEL2, PXL-DS-Amino-DEL2, PXL-DS-mSABA-DEL3, and PXL-DS-Amino-DEL3) was diluted with deionized water to prepare a 50 nM DEL sample.
D-PBS(−) was added to the reaction solution after the UV radiation. Thereafter, the remaining solution was mixed with Dyanebeads his-tag pulldown and incubated at room temperature for 30 minutes. The mixture was fixed to a magnet stand and left to stand for 2 minutes, and then the supernatant was removed, 200 μL of Wash buffer was added to suspend Dynabeads. This operation was further repeated five times. 100 μL of Elution buffer was added to Dynabeads after the washing and the mixture was allowed to stand at room temperature for 10 minutes. After the reaction, Dynabeads were placed on a magnetic stand, and after 2 minutes, the supernatant was collected as a sample.
<Preparation of Sample without Photocrosslinking Reaction>
The above series of the operations was carried out without UV radiation, and each sample was collected.
The Ct values of the various DEL samples obtained above were measured by real-time PCR in the same manner as in Example 8. The results of comparing the ΔCt values (difference from the Ct value of the negative control) are shown in
As described in Example 8, the intensity of the affinities between the CA9 protein and “PXL-DS-mSABA-DEL2” and “PXL-DS-mSABA-DEL3” is expected to be medium (Non-Patent Documents 6 and 7).
As shown in the graph of
On the other hand, in the samples with UV radiation, every ΔCt values increased as compared to the samples without UV radiation, which suggests that acquisition of a binder having medium affinity can be expected also in the photoreactive cross linker-modified double-stranded DELs (all of them are different in the linker structure from the photoreactive cross linker-modified double-stranded DELs used in Example 8) used in this example.
The result means that a photoreactive cross linker-modified double-stranded DEL derived from a hairpin type DEL having “selectively cleavable site(s)” and having various linker structures is useful in DEL screening utilizing a photocrosslinking reaction.
[Comparison of Binder Recovery Efficiencies (DNA Detection Sensitivity) Between “Photoreactive Cross Linker-Modified Double-Stranded DEL Having Covalent Bond Between Cross Linker and Coding Sequence” and “Photoreactive Cross Linker-Modified Double-Stranded DEL without Covalent Bond Between Cross-Linker and Coding Sequence” ]
<Synthesis of Photoreactive Cross Linker-Modified Double-Stranded DEL without Covalent Bond Between Cross Linker and Coding Sequence>
Using the DEL compounds having the single-stranded DNA obtained in Example 10 (“SS-SABA-DEL3”, “SS-ClSABA-DEL3”, “SS-mSABA-DEL3”, and “SS-Amino-DEL3”), annealing using “PXL-Pr3” was carried out by the following procedures to synthesize 4 kinds of the photoreactive cross linker-modified double-stranded DEL compounds having the sequences shown in Table 40 (“PXL-DS-SABA-DEL4”, “PXL-DS-ClSABA-DEL4”, “PXL-DS-mSABA-DEL4”, and “PXL-DS-Amino-DEL4”). The sequence notation in Table 40 is the same as in Tables 34, 36, and 37, and it means that the 4 kinds of the compounds are formed by double strands of the oligonucleotide chains of SEQ ID NOs: 148 and 145, SEQ ID NOs: 148 and 146, SEQ ID NOs: 148 and 147, and SEQ ID NOs: 148 and 143, respectively.
To a PCR tube were added 30 μL of 10 μM aqueous solution of a DEL compound having various single-stranded DNAs; and 3.77 μL of 159 μM aqueous solution of “PXL-Pr3”. Deionized water was added to the resulting aqueous solution to have a total solution amount 60 μL. Thereafter, the mixture was incubated at 90° C. for 2 minutes and then cooled to room temperature over 30 minutes.
4 kinds of the “photoreactive cross linker-modified double-stranded DELs without a covalent bond between a cross linker and a coding sequence” obtained above (“PXL-DS-SABA-DEL4”, “PXL-DS-ClSABA-DEL4”, “PXL-DS-mSABA-DEL4”, and “PXL-DS-Amino-DEL4”) and 4 kinds of the “photoreactive cross linker-modified double-stranded DELs with a covalent bond between a cross linker and a coding sequence” obtained in Example 10 (“PXL-DS-SABA-DEL3”, “PXL-DS-ClSABA-DEL3”, “PXL-DS-mSABA-DEL3”, and “PXL-DS-Amino-DEL3”) were each diluted with deionized water to prepare 50 nM DEL samples.
The reaction solution after the UV radiation was mixed with Dynabeads his-tag pulldown and incubated at room temperature for 30 minutes. The mixture was fixed to a magnet stand and left to stand for 2 minutes, and then the supernatant was removed, 200 μL of Wash buffer was added to suspend Dynabeads. This operation was further repeated five times. 100 μL of Elution buffer was added to t Dynabeads after the washing and the mixture was reacted at room temperature for 10 minutes. After the reaction, Dynabeads were placed on a magnetic stand, and after 2 minutes, the supernatant was collected as a sample.
<Preparation of Sample without Photocrosslinking Reaction>
The above series of the operations were carried out without UV radiation, and each sample was collected.
The Ct values of the various DEL samples obtained above were measured by real-time PCR in the same manner as in Example 8. The results of comparing the ΔCt values (difference from the Ct value of the negative control) are shown in
Similar to Example 8, the strength of affinity between the CA9 protein and each compound as a binder is expected to be in the following order (Non-Patent Documents 6 and 7).
“PXL-DS-SABA-DEL3”, “PXL-DS-SABA-DEL4”>“PXL-DS-C1SABA-DEL3”, “PXL-DS-ClSABA-DEL4”>“PXL-DS-mSABA-DEL3”, “PXL-DS-mSABA-DEL4”>“PXL-DS-Amino-DEL3 (negative control)”, “PXL-DS-Amino-DEL4 (negative control)”
As shown in the graph of
The result means that the “photoreactive cross linker-modified double-stranded DEL with a covalent bond between a cross linker and a coding sequence” derived from hairpin type DEL having “selectively cleavable site(s)” is more useful than the “photoreactive cross linker-modified double-stranded DEL without a covalent bond between a cross linker and a coding sequence” in DEL screening using a photocrosslinking reaction.
[Verification of Binder Recovery Efficiency in Photocrosslinking Reaction of Photoreactive Cross Linker-Modified Double-Stranded DEL to which Strong Separation and Elution Conditions are Applied]
Similar to Example 12, 4 kinds of the “photoreactive cross linker-modified double-stranded DEL without a covalent bond between a cross linker and a coding sequence” (“PXL-DS-SABA-DEL4”, “PXL-DS-ClSABA-DEL4”, “PXL-DS-mSABA-DEL4”, and “PXL-DS-Amino-DEL4”) and 4 kinds of the “photoreactive cross linker-modified double-stranded DEL with a covalent bond between a cross linker and a coding sequence” (“PXL-DS-SABA-DEL3”, “PXL-DS-ClSABA-DEL3”, “PXL-DS-mSABA-DEL3”, and “PXL-DS-Amino-DEL3”) were each diluted with deionized water to prepare 50 nM DEL samples.
The reaction solution after the UV radiation was mixed with Dynabeads his-tag pulldown and incubated at room temperature for 30 minutes. The mixture was fixed to a magnet stand and left to stand for 2 minutes, and then the supernatant was removed and 200 μL of Wash buffer was added to suspend Dynabeads and the mixture was reacted at 90° C. for 10 minutes. This operation was further repeated three times. 30 μL of 200 mM imidazole solution was added to Dynabeads after the washing and the mixture was reacted at room temperature for 10 minutes. After the reaction, Dynabeads were placed on a magnetic stand, and after 2 minutes, the supernatant was collected as a sample.
<Preparation of Sample without Photocrosslinking Reaction>
The above series of the operations were carried out without UV radiation, and each sample was collected.
As described in the above <Recovery of DEL crosslinked with protein>, the heating conditions were applied when the DEL crosslinked with protein was recovered, and therefore strong separation and elution conditions were applied.
The Ct values of the various DEL samples obtained above were measured by real-time PCR in the same manner as in Example 8. The results of comparing the ΔCt values (difference from the Ct value of the negative control) are shown in
As shown in the graph of
The result means that the “photoreactive cross linker-modified double-stranded DEL with a covalent bond between a cross linker and a coding sequence” derived from a hairpin type DEL having “selectively cleavable site(s)” is more useful than the “photoreactive cross linker-modified double-stranded DEL without a covalent bond between a cross linker and a coding sequence” in DEL screening using a photocrosslinking reaction.
In addition, the present results suggest that the “photoreactive cross linker-modified double-stranded DEL with a covalent bond between a cross linker and a coding sequence” derived from a hairpin type DEL having “selectively cleavable site(s)” can also be adapted to DEL screening under strong separation conditions or elution conditions for the purpose of removing non-specific binders, etc.
<Synthesis of Starting Material Head Piece (“mSABA-DEL-HP5”) of DEL Compound>
“mSABA-DEL-HP5” having the sequence shown in Table 41 was synthesized using “U-DEL13-HP” as a starting material and by the same procedures as in <Synthesis of 3 kinds of DEL compounds having single-stranded DNA (“SS-SABA-DEL3”, “SS-ClSABA-DEL3”, and “SS-mSABA-DEL3”)> of Example 10. The notations in Table 41 are the same as in Table 36.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the “mSABA-DEL-HP5” of interest (the expected molecular weight and the observed molecular weight of each compound are shown in Table 41).
<Synthesis of Hairpin DEL Compound (“mSABA-DEL5”)>
The hairpin DEL compound having the sequence shown in Table 42 (“mSABA-DEL5”) was synthesized in the same manner as in Example 10 by double strand ligation of the starting material head piece “mSABA-DEL-HP5” with Pr_TAG2_CP. The sequence notation in Table 42 is the same as in Table 36.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the “mSABA-DEL5” of interest (the expected molecular weight and the observed molecular weight of each compound are shown in Table 42).
<Cleavage of Hairpin DEL Compound (“mSABA-DEL5”) by USER® Enzyme>
The hairpin DEL compound obtained above “mSABA-DEL5” was cleaved by USER® enzyme in the same procedures as in <Cleavage of 5 kinds of DEL compounds having biotin at 3′ end (AAZ-BIO-DEL, SABA-BIO-DEL, ClSABA-BIO-DEL, mSABA-BIO-DEL, and Amino-BIO-DEL) by USER® enzyme> of Example 7 to convert it into the DEL compound having the double-stranded nucleic acid having the sequence of shown in Table 43 “DS-mSABA-DEL5”. The sequence notation in Table 43 is the same as in Table 36, and it means that the compound is formed by a double strand of the oligonucleotide chains of SEQ ID NOs: 153 and 154.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the DEL compound having the double-stranded nucleic acid of interest “DS-mSABA-DEL5” (the expected molecular weight and the observed molecular weight of each compound are shown in Table 43).
Also, a part of the obtained reaction solution was sampled and analysis by modified polyacrylamide gel electrophoresis was carried out under the same conditions as in Example 3. From the results shown in
The DEL compound having the double-stranded nucleic acid obtained above “DS-mSABA-DEL5” was treated by Lambda Exonuclease in the same manner as in Example 9 to prepare the DEL compound having the single-stranded DNA “SS-mSABA-DEL5”. “SS-mSABA-DEL5” is the oligonucleotide chain of SEQ ID NO: 154 in Table 43.
A part of the obtained supernatant was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3, whereby the molecular weight of 23825.8 was observed and the DEL compound having the single-stranded DNA of interest “SS-mSABA-DEL5” was identified.
The photoreactive cross linker-modified primer having the sequence shown in Table 44 “PXL-Pr5” was synthesized in the same procedures as in <Synthesis of photoreactive cross linker-modified primer “PXL-Pr2”> of Example 10. However, as a starting material, “L-Pr5” (synthesized in the same manner as in Example 1, and the sequence is shown in Table 45) was used instead of “L-Pr”. The sequence notation in Table 44 is the same as in Table 32. In addition, the sequence notation in Table 45 is the same as in Table 8.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the photoreactive cross linker-modified primer of interest “PXL-Pr5” (the expected molecular weight and the observed molecular weight of each compound are shown in Table 44).
The DEL compound having the single-stranded DNA obtained above “SS-mSABA-DEL5” was used as a template DNA, and a primer elongation reaction using “PXL-Pr5” was carried out in the same procedures as in Example 7 to synthesize the photoreactive cross linker-modified double-stranded DEL compound having the sequence shown in Table 46 “PXL-DS-mSABA-DEL5”. The sequence notations in Table 46 are the same as in Tables 36 and 44, and it means that the compound is formed by a double strand of the oligonucleotide chains of SEQ ID NOs: 157 and 154.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the photoreactive cross linker-modified double-stranded DEL of interest “PXL-DS-mSABA-DEL5” (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 46).
Also, a part of the obtained reaction solution was sampled and analysis by polyacrylamide gel electrophoresis was carried out under the conditions shown in Example 7. From the results shown in
Various cross linker-modified primers having the sequences shown in Table 47 were synthesized by the following procedures. In the sequence notation in Table 47, “(PA)” means the group represented by the following formula (22)
“(TPD)” means the group represented by the following formula (23)
“(ACA)” means the group represented by the following formula (24)
and other notations are the same as in Table 2.
The starting material active esters for synthesizing each compound are as follows.
To a PCR tube was added a solution (1 mM) of L-Pr (described in Example 7) in a sodium borate buffer (250 mM, pH 9.4) cooled to 10° C. To the tube was added 50 equivalents of the starting material active ester (25 μL, 0.2 M dimethylsulfoxide solution) and the formed mixture was shaken at 10° C. for 30 minutes.
The reaction solution was treated with 12 μL of 5 M aqueous sodium chloride solution and 396 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed and the obtained pellets were air-dried.
The obtained pellets were dissolved in 50 mM triethylammonium acetate buffer (pH 7.5) and purified by reverse phase HPLC using Phenomenex Gemini C18 column. Using a dual mobile phase gradient profile, the target product was eluted using 50 mM triethylammonium acetate buffer (pH 7.5) and acetonitrile/500 mM triethylammonium acetate buffer (9:1, v/v). Fractions containing the target product were collected, mixed, and concentrated. The resulting solution was desalted with the Amicon® Ultra Centrifugal filter (3 kD cutoff), ethanol precipitation was carried out, and then deionized water was added to the pellets to prepare a solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the target product respectively (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 47).
The compound having the sequence shown in Table 48 “AOP-L-Pr” was synthesized by the following procedures. In the sequence notation in Table 48, “(AOP-aminoC6-L)” means the group represented by the following formula (25)
and other notations are the same as in Table 2.
To a PCR tube was added a solution (200 μL, 1 mM) of L-Pr (synthesized in the same manner as in Example 1, and the sequence is shown in Table 28) in a sodium borate buffer (150 mM, pH 9.4) cooled to 10° C. To the tube were added 40 equivalents of N-Fmoc-15-amino-4,7,10,13-tetraoxaoctadecanoic acid (20 μL, 0.4 M N,N-dimethyl-acetamide solution), subsequently 40 equivalents of 4-(4,6-dimethoxy[1.3.5]triazin-2-yl)-4-methylmorpholinium chloride hydrate (DMTMM) (16 μL, 0.5 M aqueous solution), and the formed mixture was shaken at 10° C. for 4 hours.
The reaction solution was treated with 23.6 μL of 5 M aqueous sodium chloride solution and 778.8 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. overnight. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. To the pellets was added 180 μL of deionized water to prepare a solution, and then 20 μL of piperidine was added and the mixture was shaken at 10° C. for 3 hours.
The resulting solution was treated with 20 μL of 5 M aqueous sodium chloride solution and 660 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed, and to the obtained pellets was added 200 μL of deionized water to prepare a 1 mM solution.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 2 in Example 1 to identify the target product (the expected molecular weight and the observed molecular weight of each sequence are shown in Table 48).
The cross linker-modified primer having the sequence shown in Table 49 (BMP-Pr) was synthesized by the following procedures. In the sequence notation in Table 49, “(BMP)” means the group represented by the following formula (26)
and other notations are the same as in Table 2.
To a PCR tube was added a solution (40 μL, 0.5 mM) of AOP-L-Pr in sodium phosphate buffer (125 mM, pH 9.4) cooled to 10° C. To the tube was added 50 equivalents of N-succinimidyl 3-maleimidopropionate (5 μL, 0.2 M dimethylsulfoxide solution) and the formed mixture was shaken at 10° C. for 40 minutes. Thereafter, 20 μL of dimethyl sulfoxide was added and the formed mixture was further shaken at 10° C. for 25 minutes.
The reaction solution was treated with 5 μL of 5 M aqueous sodium chloride solution and 215 μL of cooled (−20° C.) ethanol, and allowed to stand at −78° C. for 30 minutes. After centrifugation, the supernatant was removed and the obtained pellets were air-dried. The obtained pellets were dissolved in deionized water and the solution was desalted by the Amicon® Ultra Centrifugal filter (3 kD cutoff)
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the cross linker-modified primer of interest “BMP-Pr” (the expected molecular weight and the observed molecular weight of each compound are shown in Table 49).
Using the DEL compound having the single-stranded DNA obtained in Example 10 “SS-mSABA-DEL” as a template DNA, a primer elongation reaction using the cross linker-modified primer “TPD-Pr” obtained in the above <Synthesis of 3 kinds of cross linker-modified primers> was carried out in the following procedures to synthesize the cross linker-modified double-stranded DEL compound having the sequence shown in Table 50 “TPD-DS-mSABA-DEL”. The sequence notations in Table 50 are the same as in Tables 26 and 47, and it show that “TPD-DS-mSABA-DEL”is formed by a double strand of the oligonucleotide chains of SEQ ID NOs: 163 and 131.
A part of the obtained solution was sampled and diluted with deionized water and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the cross linker-modified double-stranded DEL of interest “TPD-DS-mSABA-DEL” (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 50).
Using the DEL compound having the single-stranded DNA obtained in Example 10 “SS-ClSABA-DEL” as a template DNA, a primer elongation reaction using the cross linker-modified primer “ACA-Pr” obtained above was carried out in the same procedures as in Example 7 to synthesize the cross linker-modified double-stranded DEL compound having the sequence shown in Table 51 “ACA-DS-ClSABA-DEL”. The sequence notations in Table 51 are the same as in Tables 26 and 47, and it shows that “ACA-DS-ClSABA-DEL” is formed by a double strand of the oligonucleotide chains of SEQ ID NO: 164 and SEQ ID NO: 129.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the cross linker-modified double-stranded DEL of interest “ACA-DS-ClSABA-DEL” (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 51).
A part of the solutions obtained in the above <Synthesis of cross linker-modified double-stranded DEL (“TPD-DS-mSABA-DEL”)> and <Synthesis of cross linker-modified double-stranded DEL (“ACA-DS-ClSABA-DEL”)> was sampled, and analysis by polyacrylamide gel electrophoresis was carried out under the conditions shown in Example 7. From the results shown in
The primer having the reactive group for cross linker modification of the sequence shown in Table 52 (BCN-Pr) was synthesized by the following procedures. In the sequence notation in Table 52, “(BCN)” means the group represented by the following formula (27)
and other notations are the same as in Table 2.
Synthesis of “BCN-Pr” was carried out in the same manner as in Example 15, using “AOP-L-Pr” as a starting material and (1R,8S,9s)-bicyclo[6.1.0]non-4-yn-9-ylmethyl succinimidyl carbonate as a starting material active ester.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the “BCN-Pr” of interest (the expected molecular weight and the observed molecular weight of the compound are shown in Table 52).
Using “SS-mSABA-DEL” obtained in Example 10 as a template DNA, a primer elongation reaction using “BCN-Pr” was carried out in the same procedures as in Example 7 to synthesize the double-stranded DEL compound having the reactive group for cross linker modification having the sequence shown in Table 53 “BCN-DS-mSABA-DEL”. The sequence notations in Table 53 are the same as in Tables 26 and 52, and it shows that “BCN-DS-mSABA-DEL” is formed by a double strand of the oligonucleotide chains of SEQ ID NOs: 166 and 131.
A part of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the double-stranded DEL having the reactive group for cross linker modification of interest “BCN-DS-mSABA-DEL” (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 53).
In addition, a part of the obtained reaction solution was sampled and analysis by polyacrylamide gel electrophoresis was carried out under the same conditions as in Example 7. From the results shown in
A cross linker was introduced to “BCN-DS-mSABA-DEL” obtained above by the click reaction to synthesize the cross linker-modified double-stranded DEL compound having the sequence shown in Table 54 “PSF-DS-mSABA-DEL”. The synthesis procedure is shown below. In the sequence notation in Table 54, “(PSF-t)” means a group represented by the following formula (28)
and other notations are the same as in Table 26, and it shows that “PSF-DS-mSABA-DEL” is formed by a double strand of the oligonucleotide chains of SEQ ID NOs: 131 and 167.
To a PCR tube was added N-succinimidyl 2-azidoacetate (125 μL, 0.2 M dimethylsulfoxide solution). To the tube were added N,N-diisopropylethylamine (5.2 μL), subsequently 4-(2-aminoethyl)benzenesulfonyl fluoride hydrochloride (30 mg), and the resulting mixture was shaken at 10° C. for 1 hour.
Dimethyl sulfoxide was added to the resulting mixture to be 10-fold diluted (diluted solution).
To a PCR tube were added a solution (2 μL, 0.1 mM) of BCN-DS-mSABA-DEL in a sodium phosphate buffer (250 mM, pH 7.0), subsequently 1.8 μL of dimethyl sulfoxide. Thereafter, 0.2 μL of the diluted solution obtained above was added and the obtained solution was shaken at 25° C. for 2 hours. [0457]1 μL of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the cross linker-modified double-stranded DEL of interest “PSF-DS-mSABA-DEL” (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 54).
<Synthesis of Cross Linker-Modified Double-Stranded DEL (“BMP-DS-mSABA-DEL”) by Click Reaction with Double-Stranded DEL Having Reactive Group for Cross Linker Modification>
A cross linker was introduced to “BCN-DS-mSABA-DEL” obtained above by the click reaction to synthesize the cross linker-modified double-stranded DEL compound having the sequence shown in Table 55 “BMP-DS-mSABA-DEL”. The synthesis procedure is shown below. In the sequence notation in Table 55, “(BMP-t)” means a group represented by the following formula (29)
and other notations are the same as in Table 26, and it shows that “BMP-DS-mSABA-DEL” is formed by a double strand of the oligonucleotide chains of SEQ ID NOs: 131 and 168.
To a PCR tube was added 3-azidopropylamine (4.8 mg, 0.2 M dimethylsulfoxide solution). To the tube was added N-succinimidyl 3-maleimidopropionate (30 mg) and the resulting mixture was shaken at 10° C. for 1 hour.
Dimethyl sulfoxide was added to the resulting mixture to be 10-fold diluted (diluted solution).
To a PCR tube were added a solution (2 μL, 0.1 mM) of BCN-DS-mSABA-DEL in a sodium phosphate buffer (250 mM, pH 7.0), subsequently 1.8 μL of dimethyl sulfoxide. Thereafter, 0.2 μL of the diluted solution obtained above was added and the resulting solution was shaken at 25° C. for 2 hours.
1 μL of the obtained solution was sampled and diluted with deionized water, and then mass spectrometry by ESI-MS was carried out under the conditions of Analytical condition 3 in Example 3 to identify the cross linker-modified double-stranded DEL of interest “BMP-DS-mSABA-DEL” (the expected molecular weights and the observed molecular weights of the compounds are shown in Table 55).
[Conversion from Model Library Using U-DEL9-HP as Starting Material to Single-Stranded DNA and Addition of New Function]
Conversion to a DEL compound having a single-stranded DNA in the model library was carried out in the same procedures as in Example 9 using the sample of the model library (synthesized in Example 6) after carrying out a cleavage reaction by USER® enzyme. Then, the obtained solution was used as a starting material for the next step.
Using the model library converted to the DEL having the single-stranded DNA obtained above as a template DNA, a primer elongation reaction using “PXL-Pr” (synthesized in Example 7) and “BCN-Pr” (synthesized in Example 16) was carried out in the same procedures as in Example 7.
A part of the obtained two kinds of the reaction solutions obtained by the primer elongation reaction was sampled and analysis by polyacrylamide gel electrophoresis was carried out under the same conditions as in Example 7. From the results shown in
The present invention provides a method using a nucleic acid compound having selectively cleavable site(s). Further, the present invention provides a method of inducing a DEL containing a cleavable site in a DNA strand to a cross linker-modified double-stranded DEL and evaluating it, which makes it possible to screen compounds having both a “simple DEL synthesis method” and “expansion and improvement of the DEL evaluation method” as compared to conventional methods.
Number | Date | Country | Kind |
---|---|---|---|
2021-190592 | Nov 2021 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2022/043405 | 11/24/2022 | WO |