1. Field of the Invention
The present invention relates to a DNA computer.
2. Description of the Related Art
A DNA computer is known as a unique attempt to utilize the characteristics of biomolecules. Calculation in a DNA computer involves artificially incorporating input values and programs into DNA sequences and appropriately combining the resulting DNA with various reactions, such as enzyme reactions (e.g., with DNA modification enzymes and restriction enzymes) and hybridization reactions with other DNAs.
The history of DNA computers dates back to the demonstration by Adleman that an experimental system with DNA can be used to solve a mathematical problem (Adleman LM, Molecular computation of solutions to combinatorial problems., “Science”, (USA), 1994; 266(5187), p. 1021-4). In this study, he solved a mathematical problem, the directed Hamiltonian path problem, using an experimental system with DNA molecules. In the following year, Lipton reported a solution to the satisfiability problem with a DNA computer (Lipton R J, DNA solution of hard computational problems., “Science”, USA, 1995; 268(5210), p. 542-5). Many kinds of computational algorithms for a DNA computer have been proposed, including the technique based on an elongation reaction in a single DNA molecule (Sakamoto K, Gouzu H, Komiya K, Kiga D, Yokoyama S, Yokomori T, Hagiya M, Molecular computation by DNA hairpin formation., “Science”, USA, 2000; 288(5469), p. 1223-6, Sakamoto K, Kiga D, Komiya K, Gouzu H, Yokoyama S, Ikeda S, Sugiyama H, Hagiya M, State transitions by molecules., “Biosystems”, 1999; 52(1-3), p. 81-91), the approach using a hairpin structure in single stranded DNA (Sakamoto K, Kiga D, Komiya K, Gouzu H, Yokoyama S, Ikeda S, Sugiyama H, Hagiya M, State transitions by molecules., “Biosystems”, 1999; 52(1-3), p. 81-91), the technique of identifying the appropriate solution on a solid phase using DNA as memory (Liu Q, Wang L, Frutos A G, Condon A E, Corn R M, Smith L M, DNA computing on surfaces., “Nature”, UK, 2000; 403(6766), p. 175-9, Wang L, Hall J G, Lu M, Liu Q, Smith L M, A DNA computing readout operation based on structure-specific cleavage., “Nat Biotechnol”, UK, 2001; 19(11), p. 1053-9), and the method involving insertion of double stranded DNA into plasmids and cleavage of the double stranded DNA.
Furthermore, DNA computation is expanding its scope into further areas, with reports such as molecular computation based on RNA instead of DNA (Faulhammer D, Cukras A R, Lipton R J, Landweber L F, Molecular computation: RNA solutions to chess problems., “Proc Natl Acad Sci”, USA, 2000; 97(4), p. 1385-9) and a technique based on nanostructures formed by self-assembly of DNA (Mao C, LaBean T H, Reif J H, Seeman N C, Logical computation using algorithmic self-assembly of DNA triple-crossover molecules., “Nature”, UK, 2000; 407(6803), p. 493-6).
In almost all conventional DNA computation, including Adleman's studies, DNA molecules having specific sequences are used as input data, and programs are defined by protocols of subsequent biochemical operation steps. Recently, some scientists have been studying large scale calculation using robotic technologies to automate the various reactions (Japanese patent publication (Tokkai) 2002-318992, (Tokkai) 2002-181813, Morimoto N, Kiyohara H, Sugimura N, Karaki S, Nakajima T, Makino T, Nishida N, Suyama A, Automated processing system for gene expression profiling based on DNA computing technologies., “Eighth International Meeting on DNA Based Computers”, Japan, 2002; Hokkaido University, Suyama A, Programmable DNA computer with application to mathematical and biological problems., “Eighth International Meeting on DNA Based Computers”, Japan, 2002; Hokkaido University). From a different viewpoint, some scientists are also working on autonomously working molecular computers. This type of computer can execute programs without extraneous handling of the reaction solution to initiate reactions: once input data and calculation programs are added to the reaction solution as DNA molecules, it works autonomously and outputs calculation results under certain conditions. One such computer technology, developed using Turing machines as a model, has been published (Benenson Y, Paz-Elizur T, Adar R, Keinan E, Livneh Z, Shapiro E, Programmable and autonomous computing machine made of biomolecules., “Nature”, UK, 2001; 414(6862), p. 430-4). Autonomously running molecular computers are attracting attention because of their potential to calculate in environments where conventional computers could never work, such as the interior of living cells.
The main purpose of such studies of DNA computers is to achieve large scale parallel computation. This is based on the idea that a large number of DNA molecules can co-exist in a test tube, and that, when an initial value for calculation or a computation program itself is assigned to each of these DNA molecules, the chemical reactions corresponding to the calculation processes proceed concurrently, so that computation over a very wide range of initial values or computation programs can be carried out all at once in parallel. As described above, studies have been made to develop systems that execute mathematical calculations, such as parallel computation, using the parallelizable reactions that characterize DNA computing.
While studies applying bioreactions to mathematical purposes have attracted a lot of attention due to their unique ideas and potential, studies of practical applied technologies have not progressed, and their capability is still unclear at the present stage. On the other hand, conventional computers, in particular those using electronic signals, improve in processing capacity year by year, which suggests that molecular computers have little potential to exceed conventional ones in processing capacity and correctness. There is a need to find a field suitable for molecular computers, different from the fields to which conventional computers are applied, in which they are most effective. In the meantime, some scientists are starting studies to apply DNA computers to gene expression analysis and SNP analysis (Nishida N, Wakui M, Tokunaga K, Suyama A, Highly specific and quantitative gene expression profiling based on DNA computing., “Genome Informatics”, 2001; (12), p. 259-260, Mills A P Jr, Gene expression profiling diagnosis through DNA molecular computation., “Trends Biotechnol”, 2002; 20(4), p. 137-40). These may be promising fields of application suited to the unique property of molecular computers that biomolecules can be used directly as input data. However, the molecular computers conventionally proposed for bioanalysis cannot work autonomously, and thus their application is restricted.
Accessing information contained in a nucleic acid involves hybridization reactions between nucleic acids, which form a stable hybrid between the nucleic acids at that site and block further access to the information unless some treatment is applied. However, it is desirable to construct molecular computers utilizing nucleic acid information in which the information can be accessed repeatedly, like a chain reaction. To solve this problem, some process is needed to return the inaccessible information in double stranded nucleic acid molecules to an accessible state. In conventional DNA computers, this process often involves denaturing nucleic acids by heating. However, this procedure is incompatible with an autonomously running molecular computer because extraneous temperature control is needed. The key to realizing an autonomously running molecular computer is to return information enclosed in a double stranded nucleic acid to an available state by using molecular reactions, for example enzyme reactions. One example of such a molecular computer was achieved by Shapiro et al., who succeeded in realizing an autonomously running molecular computer by digesting double stranded DNA with restriction enzymes to expose single stranded DNA at the digested site (Y. Benenson et al, DNA molecule provides a computing machine with both data and fuel, “Proc. Natl. Acad. Sci.”, 2003; 100, p. 2191-6).
In consideration of the situation above, the present invention is directed to provide an information processing method using autonomously workable nucleic acids, and a molecular computer to carry out operations with the method.
Procedures to Solve the Problems
The objects above can be achieved by, for example, the means below. The present invention provides an information processing method for carrying out operations with functions that receive an argument and return a return value through chemical reactions of molecules, comprising:
(a) inputting a first encoded nucleic acid defined in correspondence to a first degradable single stranded nucleic acid as an argument;
(b) carrying out an operation with functions defined in correspondence to chemical reactions of operator nucleic acids based on the argument; and
(c) obtaining a second encoded nucleic acid defined in correspondence to a second single stranded nucleic acid as a return value.
The inventors made studies of solutions to this problem and, as a result, accomplished the present invention based on the following idea.
It is known that a retrovirus, one of the viruses having an RNA genome, replicates within host cells (
Retrovirus genome replication comprises 4 characteristic reactions. The first is a reverse transcription reaction by RNA dependent DNA polymerase activity. The second is formation of double stranded DNA by DNA dependent DNA polymerase activity. The third is a transcription reaction by DNA dependent RNA polymerase activity. Furthermore, in replication of the full-length genome, RNaseH activity is also important for removing the RNA strand of the DNA-RNA hybrid during reverse transcription and formation of double stranded DNA. Genome amplification is achieved by the combination of these 4 reactions. Viewing this series of systems as a kind of computer, a retrovirus may be regarded as executing a program that receives its own genome RNA as “an input” and returns replicated RNA having a sequence identical to the input, using the above 4 reaction activities, in a host cell serving as “hardware”.
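By way of non-limiting illustration, the view of retroviral replication as a program can be sketched in software by treating each of the 4 activities as an operation on character strings. The following Python sketch is a toy model under stated assumptions and is not part of the reaction system itself: all function names are hypothetical, sequences are written from the 5′-end to the 3′-end, and primers, promoters and reaction kinetics are ignored.

```python
# Toy string model of the 4 activities (all names are hypothetical).
DNA_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_transcribe(rna):
    """RNA dependent DNA polymerase: RNA -> first strand cDNA (5'->3')."""
    dna_letters = rna.replace("U", "T")
    return "".join(DNA_COMPLEMENT[base] for base in reversed(dna_letters))


def rnase_h(rna, cdna):
    """RNaseH: remove the RNA strand of the DNA-RNA hybrid, keep the DNA."""
    return cdna


def second_strand(cdna):
    """DNA dependent DNA polymerase: strand complementary to the cDNA."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(cdna))


def transcribe(sense_dna):
    """DNA dependent RNA polymerase: RNA with the sense strand sequence."""
    return sense_dna.replace("T", "U")


def replicate(genome_rna):
    """The self-replicating 'program': input RNA in, identical RNA out."""
    cdna = reverse_transcribe(genome_rna)
    cdna = rnase_h(genome_rna, cdna)
    return transcribe(second_strand(cdna))
```

In this toy model, replicate("AUGGCUUAA") returns the input sequence unchanged, which mirrors the description of replication as a program whose return value has a sequence identical to its input.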
An appropriate combination of the above 4 reactions may also allow such a system to execute a program different from the self-genome-replicating program of a retrovirus. Therefore, the inventors attempted to design a molecular computer comprising these 4 reactions. The molecular computer designed herein uses, as hardware, a reaction solution in which RNA dependent DNA polymerase, DNA dependent DNA polymerase, DNA dependent RNA polymerase and RNaseH activities are active concurrently. RNA samples are provided to this hardware as input data to carry out operations with “functions” that use RNA molecules as arguments and return values. In this invention, some examples are defined as underlying functions working on this hardware. Furthermore, combining these functions appropriately makes it possible to construct programs, which are also applicable to gene expression analysis and the like. Such a molecular computer may exert different effects depending on the programs introduced. Therefore, it may be a programmable general-purpose molecular computer.
Among the reverse transcription activity, double-stranded-DNA formation activity, transcription activity and RNaseH activity, all of which are comprised in the retrovirus genome amplification system, the transcription activity and the RNaseH activity may be listed as the most characteristic reactions when this mechanism is applied to an autonomously running molecular computer. In a retroviral-type molecular computer, the key factors that allow the computer to run autonomously are the transcription activity, which separates single stranded RNA from double stranded DNA molecules, and the RNaseH activity, which removes only the RNA strand from a DNA-RNA hybrid to leave single stranded DNA.
Based on the ideas above, the present inventors have developed an information processing method for carrying out operations with functions that receive arguments and return return values, based on the realization of autonomous reactions involving molecular chemical reactions with enzymes having polymerase activities, such as DNA dependent DNA polymerase, RNA dependent DNA polymerase and DNA dependent RNA polymerase activities, RNaseH activity and the like.
As used herein, an “autonomous” reaction means that a reaction product can be obtained without extraneous handling, such as separation and isolation of nucleic acids, in the course of the molecular chemical reactions. In other words, an operation with a function that outputs a return value for an input argument can be carried out without extraneous handling.
As used herein, a “nucleic acid” includes all kinds of DNA and RNA, including cDNA, genomic DNA, synthetic DNA, mRNA, total RNA, hnRNA and synthetic RNA, as well as artificial nucleic acids, such as peptide nucleic acids, morpholino nucleic acids, methylphosphonate nucleic acids and S-oligo nucleic acids. In the specification, “nucleic acid”, “nucleic acid molecule” and “molecule” are used synonymously.
As used herein, the terms “base sequence” and “sequence” both refer to the arrangement of bases composing a specific nucleic acid.
Hereinafter, preferred embodiments of the present invention will be described with reference to the drawings.
According to preferred embodiments of the invention, an information processing method using nucleic acids is provided.
The invention discloses an autonomously executable method for data processing and gene analysis that involves carrying out calculation with nucleic acids. An autonomous process of reactions is achieved by describing data and programs with nucleic acid molecules and replacing the operations defined in the program with molecular reactions.
First, in the information processing method of the invention, the details of the operations to be performed are converted into a data format executable by molecular chemical reactions. Specifically, before an operation is executed with molecular reactions, information is converted into encoded nucleic acids, which are generated by pre-association of a molecule with a specific code. Data such as parameters and constants for operations are replaced with encoded nucleic acids according to conversion rules. Then, arithmetic processing with these encoded nucleic acids is conducted to obtain outputs as encoded nucleic acids. The operations are completed by converting the resulting encoded nucleic acids into the information pre-associated with them.
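The conversion described above (information to encoded nucleic acid, molecular operation, encoded nucleic acid back to information) can be pictured with a short Python sketch. It is offered only as an illustration under assumptions: the conversion rules, the code sequences and the function names are hypothetical placeholders, and the molecular operation is replaced by an ordinary Python function.

```python
# Hypothetical conversion rules: each piece of information is pre-associated
# with one code sequence (the sequences are arbitrary placeholders).
CONVERSION_RULES = {
    "input_1": "GACUUCAGGA",
    "input_2": "CUAGGAUCCA",
    "result_yes": "UUGCCAGAGC",
}
DECODING_RULES = {seq: info for info, seq in CONVERSION_RULES.items()}


def encode(info):
    """Replace a piece of information with its encoded nucleic acid."""
    return CONVERSION_RULES[info]


def decode(sequence):
    """Convert an encoded nucleic acid back to its pre-associated information."""
    return DECODING_RULES[sequence]


def process(info, molecular_operation):
    """Encode, operate on the encoded nucleic acid, then decode the output."""
    encoded_output = molecular_operation(encode(info))
    return decode(encoded_output)


def example_operation(encoded_input):
    """Stand-in for a molecular reaction: maps one code to another."""
    if encoded_input == CONVERSION_RULES["input_1"]:
        return CONVERSION_RULES["result_yes"]
    return encoded_input
```

With these placeholder rules, process("input_1", example_operation) yields "result_yes", while an input that the stand-in operation does not recognize is returned unchanged.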
The first embodiment of the invention will be described according to processing flows in
(S1) is a step for inputting argument 11. Specifically, an encoded nucleic acid defined in correspondence to degradable single stranded nucleic acid 21 is input as an argument.
(S2) is a step for carrying out an operation with function 12. Specifically, an operation is carried out, based on argument 11, using function 12 defined in correspondence to chemical reaction 22 with operator nucleic acids. “Operator nucleic acids” are various nucleic acids designed to react with input single stranded nucleic acid 21 and the like to produce specific reaction products through given reactions. In other words, they are nucleic acids having the sequences required to initiate the chemical reactions corresponding to functions; for example, they act as primers and promoters. Plural operator nucleic acids may be available, and they may be used together to carry out a single function.
(S3) is a step for obtaining return value 13 of the function. Specifically, encoded nucleic acid 13 defined in correspondence to single stranded nucleic acid 23 is obtained in this step.
Herein, “defined in correspondence to” describes the correspondence of a manipulation in information processing to a manipulation in a chemical reaction of nucleic acids. It means that encoded nucleic acid (argument) 11, the operation with function 12 and return value 13 in information processing correspond, respectively, to degradable single stranded nucleic acid 21 used in a chemical reaction, chemical reaction 22 between the operator nucleic acids and the degradable single stranded nucleic acid etc., and single stranded nucleic acid 23, which is a reaction product of the chemical reaction.
In step (S1), the input argument is not required to be an encoded nucleic acid defined in correspondence to a degradable single stranded nucleic acid; a degradable single stranded nucleic acid itself can be input directly as an argument. In this case, arithmetic processing is carried out with the degradable single stranded nucleic acid itself to obtain an output as encoded nucleic acids. In addition, not only encoded nucleic acid 13 defined in correspondence to second single stranded nucleic acid 23 but also the second single stranded nucleic acid itself may be obtained directly as the return value of the function in (S3). However, when an operation with functions is carried out as an information processing method, either the argument or the return value should be an encoded nucleic acid in which a molecule is pre-associated with a specific code.
An example of chemical reactions used in the invention is shown in
A method of the invention provides an “input” as a degradable single stranded nucleic acid (e.g., an RNA molecule). “Input of an argument” in information processing corresponds to adding a degradable single stranded nucleic acid to a reaction solution. Hereinafter, a method of the invention will be described taking, as an example, the case where RNA is used as the degradable single stranded nucleic acid.
The presence of an operator nucleic acid corresponding to an input RNA molecule (the primer shown in
In conventional information processing methods based on DNA molecules, input DNAs were not digested and remained in the reaction system after being read. Thus, when the DNAs were undesired in subsequent reactions, complicated handling was required to remove them. Such separation treatments are accompanied by a series of extraneous handling steps, making it difficult to realize autonomous running. For example, robotic manipulation was required to automate the separation treatment. In contrast, the use of degradable nucleic acids, for example RNA molecules, in which case only the input RNA can be easily removed by RNaseH activity, allows autonomous initiation of reactions in a reaction system. Although RNA molecules are used as the degradable single stranded nucleic acids in the invention, other degradable nucleic acids may also be used.
Herein, “degradable” means that only the “degradable single stranded nucleic acids” are degraded while other nucleic acids are not. In particular, it means that, under conditions in which the operator nucleic acids are not degraded, only the “degradable single stranded nucleic acids” are degraded selectively. For example, when DNA is used as an “operator nucleic acid”, RNA is “degradable” because RNA is selectively degraded by RNaseH. Furthermore, when RNA is used as an “operator nucleic acid” and a pure deoxyribonuclease is added, DNA can be degraded selectively, and thus DNA is a “degradable” nucleic acid under such a condition. Therefore, “degradable” is a relative concept.
Other examples of “degradable” nucleic acids include, but are not limited to, uracil containing DNA used when an operator molecule is DNA (A RACHITT for our toolbox, Nature Biotechnology, April 2001 Volume 19 Number 4 pp 314-315, DNA shuffling method for generating highly recombined genes and evolved enzymes, Nature Biotechnology, April 2001 Volume 19 Number 4 pp 354-359), and DNA and RNA when an operator molecule is Peptide Nucleic Acid.
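Because “degradable” is a relative concept, it can be helpful to picture it as a lookup keyed on the pair of operator nucleic acid and candidate nucleic acid. The Python sketch below merely tabulates the combinations mentioned above; the table and function names are hypothetical, and a value of None marks combinations for which the degrading condition is not named in the text.

```python
# (operator nucleic acid, candidate nucleic acid) -> degrading condition.
# None means the text lists the pair but does not name the condition.
SELECTIVE_DEGRADATION = {
    ("DNA", "RNA"): "RNaseH activity",
    ("RNA", "DNA"): "added deoxyribonuclease",
    ("DNA", "uracil-containing DNA"): None,
    ("PNA", "DNA"): None,
    ("PNA", "RNA"): None,
}


def is_degradable(operator, candidate):
    """True if the candidate is listed as selectively degradable
    while the given operator nucleic acid is left intact."""
    return (operator, candidate) in SELECTIVE_DEGRADATION
```

For example, is_degradable("DNA", "RNA") is true, whereas is_degradable("DNA", "DNA") is false, since ordinary DNA is not listed as selectively degradable alongside a DNA operator.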
In a method of the invention, a single stranded nucleic acid is input as an argument. An information processing method with nucleic acids involves hybridization reactions to access the information in a nucleic acid sequence. Thus, in the case of the conventional technique using double stranded DNA as an input, reactions that return the double stranded DNA to single stranded DNA are required to allow it to hybridize with an operator nucleic acid. However, such reactions require a series of extraneous handling steps to control the reaction temperature. Therefore, such reactions, like the separation treatments above, made it difficult to allow a series of reactions to run autonomously.
When nucleic acids are not degradable, the hybridized double stranded nucleic acids remain, and temperature control is required to unwind them into single strands.
In contrast, degradable nucleic acids are used in a method of the invention, so that the nucleic acids are degraded after hybridization. Thus, autonomous operations are achieved. In other words, the method allows the chemical reactions of operator nucleic acids to proceed even at constant temperature, because the degrading reaction occurs autonomously. For example, as discussed in the examples below, autonomous reactions may be achieved at a constant temperature of 50° C.
On the other hand, information input as RNA is removed by degradation with RNaseH and, at the same time, reverse transcribed into a more stable nucleic acid (e.g., DNA molecules), allowing it to be stored more stably. In addition, the remaining single stranded DNA acts as a primer for yet another RNA and thus may serve repeatedly as an operator nucleic acid. Thus, it would be possible to induce a further elongation reaction of the DNA (
It is known that a promoter region has to be double stranded DNA to induce the transcription activity of transcriptases such as T7 RNA polymerase (Milligan et al. Oligoribonucleotide synthesis using T7 RNA polymerase and synthetic DNA templates. Nucleic Acids Res November 1987 11;15(21):8783-98). In the present invention, outputs are controlled based on this characteristic (
In an information processing method of the invention, the whole series of systems makes up one component, which receives inputs as RNA and returns outputs resulting from the various reactions that occur. As mentioned above, such a component is designated as a “function” receiving an “argument” and returning a “return value” (
Because a return value is obtained as a single stranded nucleic acid in an information processing method of the invention, it is easy to access this return value again. In addition, the arguments and return values of each function are the same kind of molecule (both are RNA, a degradable nucleic acid), which allows the return value of one function to be an argument of another. Thus, a return value from one function may be used as an argument of a further function to obtain a further return value. In addition, plural arguments can be used, without limitation to a single argument per function. In this case, functions can also be defined that use the return values obtained from plural functions as arguments to obtain further return values. By combining such functions, it may be possible to obtain certain return values. In other words, operations with plural functions can also be carried out following a program described as a combination of functions, arguments and return values, and the calculation results can be extracted as return values.
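The chaining of functions described above, in which a return value of one function serves as an argument of another, may be pictured with the following Python sketch. It is an abstraction under assumptions rather than a model of the chemistry: RNA species are represented by plain labels, the names make_function and run_program are hypothetical, and, as a simplification, input species stay in the pool although input RNA would be degraded in the reaction solution.

```python
def make_function(arguments, return_value):
    """Build a function that returns `return_value` when every
    argument species is present in the pool of RNA species."""
    required = frozenset(arguments)

    def function(pool):
        return {return_value} if required <= pool else set()

    return function


def run_program(functions, inputs, max_rounds=10):
    """Apply all functions concurrently until no new species appears."""
    pool = set(inputs)
    for _ in range(max_rounds):
        produced = set()
        for function in functions:
            produced |= function(pool)
        if produced <= pool:
            break  # nothing new was returned in this round
        pool |= produced
    return pool
```

As a usage example, with f1 = make_function({"a"}, "b") and f2 = make_function({"b", "c"}, "x"), calling run_program([f1, f2], {"a", "c"}) yields a pool containing "x", because the return value "b" of the first function becomes one of the two arguments of the second.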
Assuming that the whole series of systems above is a molecular computer, a reaction solution composed of operator nucleic acids for carrying out operations with desired functions, a suitable reaction solution and suitable enzymes would correspond to the “hardware” of a computer that executes operations with these functions. A “program” would be defined by operator nucleic acids such as DNA (or RNA) primers and the like, which determine which reactions will occur (
Design of Various Underlying Functions
Specific examples of operations with functions above are given as follows.
Functions carried out in an information processing method of the invention are defined by operator nucleic acids. Preferably, operator nucleic acids are primers having one or more sequences selected from, for example, sequences acting as a primer for a single stranded nucleic acid, promoter sequences and sequences acting as a primer for any nucleic acid. When the arguments are RNA molecules as degradable nucleic acids, two kinds of operator nucleic acids are required to carry out an operation with functions according to the invention: the first primer (P1), which hybridizes with the single stranded RNA and initiates the elongation reaction of DNA to form the first strand cDNA, and the second primer (P2), which hybridizes with the first strand cDNA. When a promoter sequence is incorporated at any site of these primers, hybridization of the primers induces transcription activity. As a result, specific RNAs are output. As examples of the above functions, the following 4 types of functions are considered depending on the location and direction of the incorporated promoter sequence (
Hereinafter, these 4 functions, together with a fifth function that requires no argument, will be described in detail.
Underlying function A: Path (a→b)=>X
The function returns RNA of specified sequence X in the presence of a path in RNA starting at sequence a through sequence b.
P1 is a primer having a promoter sequence in 5′-end direction, a reverse complementary sequence of X at downstream of the promoter sequence and a complementary strand sequence of a at its 3′-end, and primer P2 has the base sequence b (
Then, the presence of a sequence complementary to sequence b in the single stranded DNA generated by reverse transcription from P1, to which primer P2 binds, induces initiation of synthesis of a second strand DNA. This reaction makes the promoter sequence in P1 double stranded and induces transcription, which outputs RNA molecules of sequence X located downstream of the promoter. Therefore, the reaction is a function returning RNA of sequence X when sequence b exists along the path in RNA starting at sequence a.
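The logic of underlying function A can be sketched at the level of sequence strings as follows. This Python sketch is an illustration only and rests on an assumption inferred from the primer design: because the 3′-end of P1 is complementary to sequence a, reverse transcription starts at a and proceeds toward the 5′-end of the RNA, so the “path” is read from a toward the 5′-end and b must lie on the 5′ side of a. The function name path_through is hypothetical, RNA is written from 5′ to 3′, and overlaps between a and b are not modeled.

```python
def path_through(rna, a, b, x):
    """Return x if b occurs on the path that starts at a and runs
    toward the 5'-end of the RNA; otherwise return None."""
    start = rna.rfind(a)  # 3'-most occurrence of a gives the longest path
    if start == -1:
        return None  # P1 cannot prime: no path starts at a
    upstream = rna[:start]  # region copied into cDNA beyond a
    return x if b in upstream else None
```

For example, path_through("UUGGACCCAUGCUU", "AUGC", "GGAC", "X") returns "X", while an RNA in which the two sequences appear in the opposite order returns None.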
Underlying function B: Path (a−# b [; b′; b″ . . . ])=>X
This function returns RNA of specified sequence X in the presence of a path in RNA starting at sequence a and ending at sequence b. The terminating condition may be extended in a paratactic manner as “b or b′, b″ . . . ”.
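A string-level sketch of underlying function B is given below for illustration. It assumes, as an interpretation not stated explicitly in the text, that a path “ending at” b means that reverse transcription started at a runs off the 5′ terminus of the RNA immediately after copying b, so that b forms the 5′ terminus of the RNA. The function name and the representation of the alternatives b, b′, b″ as a Python list are hypothetical.

```python
def path_ending_at(rna, a, terminators, x):
    """Return x if the RNA contains a and its 5' terminus is one of the
    terminating sequences (b, b', b'', ...); otherwise return None."""
    for b in terminators:
        # b must be the 5' terminus, and a must lie 3' of it
        if rna.startswith(b) and rna.find(a, len(b)) != -1:
            return x
    return None
```

Under this reading, path_ending_at("GGACCCAUGCUU", "AUGC", ["GGAC"], "X") returns "X", whereas the same call on "UUGGACCCAUGCUU" returns None because GGAC is internal rather than terminal.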
P1 is a primer having complementary strand sequence to a, and P2 is a primer having a promoter sequence in 5′-end direction, sequence X at downstream of the promoter sequence and sequence b at 3′-end of X (
Underlying function C: Amplify (a−# b [--add5 P] [--add3 Q])
When there is a path in RNA starting at sequence a and ending at sequence b, this function amplifies RNA of that sequence. In addition, it can also amplify the RNA with optional sequence P or Q added at the 5′- or 3′-end, respectively, of the amplified sequence.
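For illustration, one pass of underlying function C may be sketched on sequence strings as below. The sketch makes several assumptions: “ending at” b is interpreted as b forming the 5′ terminus of the RNA, the amplified sequence is taken to be the stretch from b through a, the options add5 and add3 are taken to attach sequences to the 5′- and 3′-ends of the output, and repeated rounds of amplification are not modeled. The function name amplify is hypothetical.

```python
def amplify(rna, a, b, add5="", add3=""):
    """Return the RNA stretch from terminal b through a, optionally
    extended at its 5'- and 3'-ends; return None if there is no path."""
    if not rna.startswith(b):
        return None  # the path does not end at b
    start = rna.find(a, len(b))
    if start == -1:
        return None  # no path starts at a
    path = rna[: start + len(a)]
    return add5 + path + add3
```

For instance, amplify("GGACCCAUGCUU", "AUGC", "GGAC") returns "GGACCCAUGC", and supplying add5="AA" and add3="CC" returns "AAGGACCCAUGCCC".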
P1 is a primer having complementary strand sequence to a, and P2 consists of a promoter sequence in 3′-end direction and sequence b at its 3′-end (
Underlying function D: RevAmplify (a→b [--add5 P] [--add3 Q])
When there is a path in RNA starting at sequence a through sequence b, this function amplifies RNA of its reverse complementary strand sequence. In addition, an optional sequence, P or Q, may be added to the 5′- or 3′-end, respectively, of the amplified sequence.
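A corresponding string-level sketch of underlying function D follows. It is illustrative only and assumes that the path runs from a toward the 5′-end of the RNA, that the occurrence of b nearest to a delimits the path (an arbitrary choice made for this sketch), and that add5 and add3 attach to the 5′- and 3′-ends of the output. The function names are hypothetical.

```python
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement_rna(rna):
    """Reverse complementary RNA sequence, written 5'->3'."""
    return rna.translate(RNA_COMPLEMENT)[::-1]


def rev_amplify(rna, a, b, add5="", add3=""):
    """Return the reverse complement of the stretch from b through a,
    optionally extended at both ends; return None if there is no path."""
    a_start = rna.rfind(a)
    if a_start == -1:
        return None  # no path starts at a
    b_start = rna.rfind(b, 0, a_start)  # b must lie on the 5' side of a
    if b_start == -1:
        return None  # the path from a never passes through b
    path = rna[b_start : a_start + len(a)]
    return add5 + reverse_complement_rna(path) + add3
```

As an example, rev_amplify("UUGGACCCAUGCUU", "AUGC", "GGAC") returns "GCAUGGGUCC", the reverse complement of the stretch GGACCCAUGC.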
P1 consists of a promoter sequence in 3′-end direction and a complementary strand to sequence a at its 3′-end, and P2 has sequence b (
Underlying function E: Output ( ) RNA X
This function always outputs RNA of sequence X without requiring an argument.
Underlying function E is designed to always transcribe RNA of sequence X without requiring an argument. This function is achieved with double stranded DNA consisting of a promoter sequence and sequence X downstream of it.
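Underlying function E can be illustrated with a minimal Python sketch in which a double stranded template is represented by its sense strand. The promoter string used here is an assumption chosen for illustration (a commonly cited T7 core promoter sequence); the function names are hypothetical, and transcription is simplified to copying everything downstream of the promoter.

```python
# Illustrative promoter string (assumed; commonly cited T7 core sequence).
PROMOTER = "TAATACGACTCACTATA"


def transcribe_template(sense_strand):
    """Return the RNA transcribed from the region downstream of the
    promoter, or None if the template carries no promoter."""
    start = sense_strand.find(PROMOTER)
    if start == -1:
        return None
    downstream = sense_strand[start + len(PROMOTER):]
    return downstream.replace("T", "U")


def output_function(x_dna):
    """Function E: always output RNA of sequence X, with no argument."""
    return transcribe_template(PROMOTER + x_dna)
```

Because the promoter is already double stranded in the template, output_function("GGACTTCA") returns "GGACUUCA" regardless of what other RNA is present.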
Program construction with combination of the underlying functions
Combining the underlying functions above makes it possible to construct a higher-order function. In addition, programs may also be constructed by combining the above functions, arguments and return values. However, when the program shown in
Alternatively, if a promoter sequence becomes double stranded due to dimer formation between primers, a wrong return value may be returned. Furthermore, when multiple functions are executed concurrently in a single reaction solution, the more types of functions are used, the more combinations of primers can interact with each other, which increases the chance of side reactions. To implement programs that effectively execute the targeted function reactions without the effect of side reactions, it is particularly important to use combinations of functions with less chance of side reactions and to program and design carefully, in particular, the sequences of the primers used in the reactions.
For example, nucleic acids including orthonormalized sequences may be used as operator nucleic acids. The term “normalize” in “orthonormalized sequence” refers to keeping the thermal properties uniform among multiple sequences, in other words, making them have a uniform melting temperature within a certain range. When the thermal properties are kept uniform, reactions using many sequences can advantageously be executed as a whole. The term “ortho” in “orthonormalized sequence” refers to giving orthogonality to the sequences, whereby every sequence included in one group of orthogonalized sequences reacts independently; thus, the sequences in one group hardly or never react with one another, except in desired combinations, or within their own sequence. In other words, a sequence included in one group of orthonormalized sequences has little or no chance of causing cross-hybridization with another sequence or undesired hybridization within its own sequence.
The above orthonormalized sequences are described in detail in H. Yoshida and A. Suyama, “Solution to 3-SAT by breadth first search”, DIMACS Vol. 54 9-20(2000) and Japanese patent No. 2003-108126. Orthonormalized sequences can be designed using the methods described in these references. Briefly, they can be produced by a method comprising: generating multiple base sequences in advance in a random manner; calculating the average of their melting temperatures; selecting candidate sequences based on a threshold defined by the average ±t° C.; and obtaining a group of orthonormalized sequences from the selected candidate sequences, using as an indication whether or not the sequences react independently.
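The four steps summarized above can be sketched in Python as follows. This is a simplified illustration under assumptions and does not reproduce the method of the cited references: the melting temperature is estimated with the crude Wallace rule, independence of reaction is approximated by limiting the longest stretch that a sequence shares with another sequence or with a reverse complement, and all function and parameter names are hypothetical.

```python
import random


def melting_temp(seq):
    """Crude Wallace-rule estimate (assumed): 2*(A+T) + 4*(G+C) deg C."""
    gc = seq.count("G") + seq.count("C")
    return 2 * (len(seq) - gc) + 4 * gc


def revcomp(seq):
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def longest_common_run(s, t):
    """Length of the longest substring shared by s and t."""
    best, prev = 0, [0] * (len(t) + 1)
    for i in range(1, len(s) + 1):
        cur = [0] * (len(t) + 1)
        for j in range(1, len(t) + 1):
            if s[i - 1] == t[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def reacts_independently(candidate, chosen, max_run):
    """Proxy test: no long self-complementary or shared stretch."""
    if longest_common_run(candidate, revcomp(candidate)) > max_run:
        return False
    for other in chosen:
        if longest_common_run(candidate, revcomp(other)) > max_run:
            return False
        if longest_common_run(candidate, other) > max_run:
            return False
    return True


def design_orthonormalized(n_random, length, t, max_run, seed=0):
    rng = random.Random(seed)
    # Step 1: generate multiple base sequences at random.
    pool = ["".join(rng.choice("ACGT") for _ in range(length))
            for _ in range(n_random)]
    # Step 2: average melting temperature of the random pool.
    mean_tm = sum(melting_temp(s) for s in pool) / len(pool)
    # Step 3: keep candidates within the average +/- t deg C.
    candidates = [s for s in pool if abs(melting_temp(s) - mean_tm) <= t]
    # Step 4: greedily keep candidates that react independently.
    chosen = []
    for s in candidates:
        if reacts_independently(s, chosen, max_run):
            chosen.append(s)
    return chosen
```

A call such as design_orthonormalized(n_random=100, length=20, t=2.0, max_run=6) returns a group of 20-mers whose estimated melting temperatures lie within a 4° C. window. In practice, a thermodynamically grounded melting temperature model and a hybridization-based independence test would be expected in place of the proxies used here.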
The base sequences or nucleic acids included in a group of orthonormalized sequences share similar melting temperatures, have little chance of cross-hybridizing with each other and hardly form stable secondary structures. The orthonormalized sequences may also be used as nucleic acids of coding sequences in the following examples.
In addition, preferably, the encoded nucleic acids of the invention also have the orthonormalized sequences above. On the other hand, for example, total RNA purified from cells may also be used directly as a first encoded nucleic acid. In other words, without converting pre-associated information to encoded nucleic acids, the obtained nucleic acid itself (for example, a non-encoded degradable nucleic acid such as total RNA) may be regarded as information and used directly as an encoded nucleic acid. One example is the case of using a method of the invention for gene expression analysis, described below. Furthermore, applying further operations to a second encoded nucleic acid obtained from a former operation also makes it possible to obtain a non-encoded single stranded nucleic acid directly as a return value. Such nucleic acids may be mRNA or aptamer nucleic acids binding to proteins. In addition, they may be antisense RNA hybridizing to the mRNA of a specific gene. One example is the case of using a method of the invention for intracellular molecular computing, described below.
In such cases, preferably, the RNA used as input is allowed to react further after being converted into encoded nucleic acids having orthonormalized sequences, for example, as described below.
(Gene Expression Analysis Program)
Hereinafter, application to gene expression analysis will be illustrated as an example of a program that combines the underlying functions above.
(Gene Encoding)
For gene analysis with DNA microarrays etc., encoding techniques that convert specific genes to corresponding zip codes or internal codes have been developed to control hybridization appropriately (Gerry et al. Universal DNA microarray method for multiplex detection of low abundance point mutations. J Mol Biol 1999 Sep 17;292(2):251-62, Nishida et al. Highly specific and quantitative gene expression profiling based on DNA computing. Genome Informatics 2001 (12) 259-260, Wharam et al. Specific detection of DNA and RNA targets using a novel isothermal nucleic acid amplification assay based on the formation of a three-way junction structure. Nucleic Acids Res 2001 Jun 1;29(11):E54-4).
The program uses the underlying function A(path (a→b)=>X) (
Using the program, gene encoding can be achieved under autonomous conditions. For example, it can also be applied to gene detection with DNA microarrays and the like. In addition, for example, a coding sequence RNA can be used as an input for an operation program with other functions to construct a gene expression analysis program.
(Conversion of each Gene to a Path Element and Gene Expression Analysis with Logic Operation)
Here, a method of gene expression analysis in which each gene is encoded as a path element is described. An example program that returns gene X in the presence of genes A and B is shown in
The program consists of functions that convert the RNA of genes A and B to coding sequences and a function that recognizes a path and returns gene X. In other words, gene RNA is encoded, and the operation is carried out with the resulting encoded sequences. First, consider an encoding function that uses the underlying function A to return, in the presence of gene A, the coding sequence Code[2,1], which consists of the coding sequences Code[2] and Code[1] aligned in the direction from the 5′-end to the 3′-end. In the same way, consider a function that returns Code[3,2], which consists of Code[3] and Code[2] aligned, in the presence of gene B. Code[1], [2] and [3] may be any sequences. Preferably, they are sequences that hardly cause mis-priming etc. and have similar priming efficiency under the conditions of the reaction solution. For this reason, the orthonormalized sequences mentioned above are preferable.
Combining the above functions, path element Code[1]→Code[2] is formed only in the presence of gene A, and path element Code[2]→Code[3] is formed only in the presence of gene B. Therefore, a path in RNA starting at Code[1] and ending at Code[3] is formed only in the presence of both genes A and B (
The key property of the method is that gene analysis is executed by converting each gene to a path element (1→2 and 2→3), each a constituent of a virtual path consisting of coding sequences (in this case, path 1→2→3), and detecting the presence of the path. Extending the scale of a path and using more types of associated genes would make it possible to carry out more complicated operations (
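The logic of the path-based analysis can be illustrated in software. The following is a minimal sketch that models only the abstract logic (gene present, path element formed, path detected), not the underlying chemistry; the table `ENCODING` and the function names are hypothetical.

```python
from collections import deque

# Hypothetical encoding table: each gene contributes one path element.
ENCODING = {
    "A": ("Code[1]", "Code[2]"),  # gene A forms path element 1 -> 2
    "B": ("Code[2]", "Code[3]"),  # gene B forms path element 2 -> 3
}


def path_exists(present_genes, start, end, encoding=ENCODING):
    """Breadth-first search over the path elements formed by the genes
    that are present."""
    edges = {}
    for gene in present_genes:
        if gene in encoding:
            src, dst = encoding[gene]
            edges.setdefault(src, []).append(dst)
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == end:
            return True
        for nxt in edges.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return False


def program(present_genes):
    """Return "X" only when the path Code[1] -> Code[3] is complete."""
    if path_exists(present_genes, "Code[1]", "Code[3]"):
        return "X"
    return None
```

In this model, `program({"A", "B"})` returns `"X"`, while `program({"A"})` and `program({"B"})` return `None`, which corresponds to a logical AND of genes A and B. Adding entries to the table extends the path in the same way that additional genes extend the operation.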
(Gene Expression Analysis using Neural Networks)
In gene expression analysis with logic operation, the gene expression patterns have to be known in advance. In addition, it essentially analyzes only the presence of genes and cannot estimate concentration information. A neural network constructed using an information processing method of the invention will be illustrated as an example of a method that also enables estimation of the concentration of genes whose expression patterns are unknown.
Some scientists have proposed applying a neural network constructed with a DNA computer to gene expression analysis (Mills Gene expression profiling diagnosis through DNA molecular computation. Trends Biotechnol April 2002; 20(4):137-40). However, it was difficult to carry out complicated analysis with the conventional ideas because they used a single-layered simple perceptron model without intermediate layers. In addition, they required manipulation in multiple steps. In contrast, using an information processing method of the invention, a multilayered perceptron capable of executing complicated analysis can be achieved in an autonomously working reaction system (
First, genes are encoded to carry out gene analysis. The encoding function is made to output Code[a1,ST] in the presence of RNA A. This may be associated with path ST→a1. Similar functions are also configured for RNA B, C and D to convert them into paths ST→a2, ST→a3 and ST→a4 respectively. These encoding functions provide input to the neural network depending on the presence of each gene RNA. All path units a1→b1, a1→b2, a1→b3, . . . , b4→c4 and c1→X, c1→Y, c2→X, . . . , c4→Y, which connect the intermediate layers of the perceptron, can be generated as the corresponding RNA output using the underlying function E: Output( ). In addition, using the underlying function B, a program is constructed by introducing a function that returns x depending on the existence of path ST→X (path (ST→X)=>x) and a function that returns y depending on the existence of path ST→Y (path (ST→Y)=>y). As a result, a neural network is formed that changes the proportion of output x to y depending on the input RNA (
Using the method shown in
Utilizing the gene expression analysis technique involving the neural network of this molecular computer may provide a novel gene diagnosis technique (
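The layered path structure of the network described above (ST to layer a, a to b, b to c, and c to X or Y) can be illustrated with a toy calculation. The sketch below is an assumption made for illustration only: it treats the relative amount of each path unit as a weight and the output as the weighted sum over complete paths, with no activation function and no model of reaction kinetics. All names and weight values are hypothetical.

```python
def propagate(signal, weights):
    """Push a signal vector through one layer of path units.
    weights[i][j] is the hypothetical relative amount of path unit i -> j."""
    n_out = len(weights[0])
    return [sum(signal[i] * weights[i][j] for i in range(len(signal)))
            for j in range(n_out)]


def output_ratio(inputs, w_ab, w_bc, w_cout):
    """Return the proportions of outputs x and y for the given inputs.
    inputs[k] is the relative amount of path ST -> a(k+1), that is, of the
    corresponding gene RNA (A, B, C, D)."""
    b = propagate(list(inputs), w_ab)
    c = propagate(b, w_bc)
    x, y = propagate(c, w_cout)
    total = x + y
    if total == 0:
        return 0.0, 0.0
    return x / total, y / total


# Hypothetical weights: identity connections between layers; c1 and c2
# lead to X, c3 and c4 lead to Y.
IDENTITY = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
TO_OUTPUT = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
```

With these example weights, `output_ratio([1, 1, 0, 0], IDENTITY, IDENTITY, TO_OUTPUT)` gives all of the output as x, and `output_ratio([1, 0, 1, 0], IDENTITY, IDENTITY, TO_OUTPUT)` splits the output equally between x and y.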
(Extension of Functions)
Functions usable for the invention are not limited to the above 5 functions. It is possible to define various functions using various operator nucleic acids.
For example, all of the above underlying functions are constructed so that the reverse transcription reaction is initiated with P1 and P2 hybridizes with the cDNA generated from the reverse transcription; P2 may also be used as a primer for RNA. In this case, the 3′-end of P2 would be changed through an elongation reaction. Such a change of the 3′-end sequence may be considered to correspond to a change in the details of a function. The use of such a change makes it possible to extend the concept of functions. In addition, by achieving the chemical reactions exemplified below in the hardware reaction solution, it would be possible to extend the definitions of functions available for programs beyond the 5 underlying functions.
In order to return the result of a program, as a computer, it is necessary to detect the output resulting from the series of reactions corresponding to functions. A program consisting only of the above underlying functions, all of which return RNA as return values, also gives RNA molecules as final outputs. These output RNAs can be purified with molecular biology procedures. Techniques such as RT-PCR, northern blotting and DNA microarrays also make it possible to detect the output RNA. To take advantage of the autonomously working molecular computer of the invention, it would be more effective to carry out the series of steps leading up to the detection of results in a single reaction solution. Therefore, it is preferable to detect output RNA molecules directly in the computing reaction solution. For example, Fluorescence Resonance Energy Transfer (FRET) technology can be applied to detect RNA molecules directly. FRET is very useful because the fluorescence can be detected externally to read out information. FRET technology has been applied to real-time PCR with fluorescence labeled DNA probes (Didenko, DNA probes using fluorescence resonance energy transfer (FRET): designs and applications. Biotechniques November 2001; 31(5):1106-16, 1118, 1120-1). For example, the use of FRET probes shown in
Using these fluorescence outputting functions, it is possible to design a gene diagnosis program in which the entire course leading up to detection of the output is carried out in a single step. For example, in a gene expression analysis using a neural network shown in
For further reactions, other types of primers may be used, for example, primers based on the 3-way junction (3WJ) structure published by Wharam et al. in 2001 (
Furthermore, in terms of extension of functions, for example, RNA output from a certain function may also be used for an operation with functions. For example, the RNA molecules output from each function, which may act as primers, may be allowed to act as operator nucleic acids in an operation with functions.
Furthermore, ribozymes have been studied for use as elements of molecular computers (Wickiser et al. Oligonucleotide Sensitive Hammerhead Ribozymes As Logic Gates. Eighth International Meeting on DNA Based Computers, June 10-13, 2002; Hokkaido University, Japan). Ribozymes are known as RNA molecules having enzyme activity. When such ribozymes are used, the RNA molecules generated as outputs of functions may themselves act as ribozymes, so that an output RNA directly fulfills a new role as a function. Such ribozymes may be used as functions in an information processing method of the invention.
Executing reactions other than the above exemplified underlying functions in hardware of the molecular computer would provide further function enhancement of the computer.
As described above, the combination of 4 types of reaction activity, namely RNA dependent DNA polymerase activity, DNA dependent DNA polymerase activity, DNA dependent RNA polymerase activity and RNaseH activity, which are critical for retrovirus genome amplification, provides an autonomously running programmable molecular computer.
Specifically, a computer characterized by comprising containers that contain operator nucleic acids for carrying out operations with desired functions, a suitable reaction solution and suitable enzymes is provided as a molecular computer for carrying out operations with the information processing method described above. Although 5 types of underlying functions are defined for convenience as the functions constituting a program in a molecular computer, more generally, the following 3 kinds of oligo nucleic acids are added to the hardware of a molecular computer as programs: a nucleic acid containing a promoter placed toward the 5′-end, a nucleic acid containing a promoter placed toward the 3′-end and a nucleic acid without a promoter sequence. In other words, it can be described as a system in which an elongation reaction is initiated appropriately when RNA is given as an input to a reaction system containing these oligo nucleic acids, and in which RNA of the downstream sequence is returned when a promoter sequence is made double stranded at any site.
On the other hand, containers usable for a molecular computer include, for example, sample tubes, test tubes and micro channels conventionally used for nucleic acid reactions. In addition, a single container is sufficient for the molecular computer, but plural containers may be used.
If cells or tissues are used as containers, transcription of desired genes can also be controlled depending on the results of autonomous detection of gene expression levels and patterns in the living cells. Therefore, output of RNA can be controlled in living cells, which will provide a new mechanism for controlling cells. For example, specific genes can be expressed only in cells in which genes are expressed in a specific pattern, and genes that normalize cells can also be expressed only in targeted cells, such as cancer cells. Such techniques may be applied to techniques such as gene therapy.
To carry out information processing with an information processing method of the invention, necessary operator nucleic acids may also be provided as a kit. The kit contains operator nucleic acids for carrying out operations with desired functions. Preferably, the kit contains an operator nucleic acid comprising one or more sequences selected from sequences acting as a primer for a first single stranded nucleic acid, promoter sequences and sequences acting as a primer for any nucleic acid.
In addition, the kit may contain not only an operator nucleic acid but also a suitable reaction solution and suitable enzymes. Suitable reaction solutions include, for example, buffers suitable for a synthesis reaction, an amplification reaction, a reverse transcription reaction, a transcription reaction and a degrading reaction, and suitable enzymes include, for example, enzymes having DNA dependent DNA polymerase activity, those having RNA dependent DNA polymerase activity, those having DNA dependent RNA polymerase activity and RNaseH.
When the kit described above is a kit for gene expression analysis, for example, as described in the above section “Gene Expression Analysis Program”, it would contain operator nucleic acids necessary for encoding, enzymes having DNA dependent DNA polymerase activity, those having RNA dependent DNA polymerase activity, those having DNA dependent RNA polymerase activity and RNaseH, as well as a suitable reaction solution such as 40 mM Tris-HCl (pH 8.0), 50 mM NaCl, 8 mM MgCl2, 5 mM DTT. The above enzymes may be pre-added to the reaction solution. For example, the kit may be used as follows: an RNA sample is added to a buffer solution containing all of the enzymes at 50° C. and mixed well, and the reaction mixture is then incubated at 50° C. For example, 3 μl of enzyme buffer is added per tube in a total volume of 25 μl, which is allowed to react for 30 min.
The reactions required for execution of programs are substantially the same as the reactions actually caused by retroviruses and retrotransposons in living cells, suggesting the possibility of achieving a molecular computer with this system in living cells. When such intracellular molecular computing is realized, for example, as a gene expression analysis program in living cells combined with fluorescence outputting functions, it can also be applied to technology for non-destructive external monitoring of the gene expression pattern in living cells.
Alternatively, outputting the RNA of a gene that controls cellular activity also provides a program that controls cellular activity depending on gene expression patterns. For example, gene therapy may also be achieved in which introduced specific genes are expressed only in defective cells, using marker genes for a disease such as cancer as input.
(Advantageous Effect of the Invention)
A programmable, autonomously running molecular computer can be generated by using an information processing method of the invention. Such a computer has the versatility to execute different programs on a single piece of hardware. In particular, it can be applied to uses such as research and development regarding function analysis of genes, gene diagnosis and the like, for which needs may grow in the future.
With gene expression analysis programs based on logic operation or a neural network combined with fluorescence outputting functions, all of the measurement and analysis of genes and the output of the results may be carried out autonomously. Furthermore, using the method involving the above neural network, it would be possible in principle to analyze gene expression even if the relationship between gene expression patterns and phenotypes is not clear. In addition, it is also possible to estimate information about the concentration of expressed genes.
(Materials and Methods)
(Equipment and Reagents)
Double-stranded DNA molecules were detected with Agilent 2100 bioanalyzer (Agilent Technologies) after electrophoresis. Reagents used in practice of the method are DNA 500 LabChip® kits or DNA 7500 LabChip® kits. Real-time PCR was carried out using LightCycler™ Quick System 330 (Roche Diagnostics Co.). Reagents used for the PCR were LightCycler™ FastStart DNA Master SYBR® Green I, purchased from said company. Preparation of reagents and operation of instruments were carried out according to manufacturer's manuals.
(Design of Gene Specific Sequences)
Primers recognizing TGTP gene and Vitronectin gene respectively were designed using the specific primer design program developed by Takashi Mishima et al. (“Study for a probe and primer sequence design method for measurement of gene expression in large scale”, Graduate School of Science, The University of Tokyo, master's thesis 2001), Primer3 (Rozen and Skaletsky, Primer3 on the WWW for general users and for biologist programmers Methods Mol Biol 2000; 132:365-86), which is available to the public as primer design software, and the like, and suitable primers were selected from the generated primers.
The sequences specific to TGTP gene and Vitronectin gene that were used are summarized below (numbers in parentheses denote the location of a primer in an RNA molecule of either TGTP or Vitronectin. “S” refers to a sense strand sequence and “A” refers to an antisense strand sequence.)
If required, a promoter sequence or a coding sequence was added to the 5′-end of the above sequences for use in the study. Primer DNAs of less than 30 bases were basically custom-synthesized by Oligo Japan Co. as Easy oligos®. Longer primers, of 30 bases or more, were custom-synthesized by Sawady Technology Co. Ltd.
(Design of Coding Sequences)
Oligo DNAs containing an artificially generated “coding sequence” were used in this example. A “coding sequence”, as used herein, refers to a member of a set of sequences that have the same base length and are designed so that the melting temperature of the double stranded DNA, calculated using the nearest-neighbor method (SantaLucia A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. Proc Natl Acad Sci USA 1998 Feb 17;95(4):1460-5), is equalized and so that there is little chance of forming stable secondary structure or of mis-hybridization (Yoshida et al. “Solution to 3-SAT by breadth first search”. DIMACS Series in Discrete Mathematics and Theoretical Computer Science, 2000 54: 9-22, American Mathematical Society). In the study, the following 5 sequences, each 25 bases long, were used.
(Primer Sequences Used for the Computing Reaction)
Gene specific sequences, coding sequences, the T7 promoter sequence etc. were combined to make specialized oligo DNAs for use in the computing reaction. Their names, structures and sequences are listed below. (In the item “Structure”, [ ] refers to the sequence name of a gene specific sequence, < > refers to the sequence name of a coding sequence and {T7} refers to the T7 promoter sequence. Tg is a 6-base sequence, 5′-GGGAGA-3′, and Tc is a 9-base sequence, 5′-ATAGGGAGA-3′. Sequences headed with a′ denote reverse complementary strand sequences. S denotes other sequences.)
Comment: Complementary sequence to 5′-end of
Comment: Complementary sequence to 5′-end of Vitronectin-T32 primer
Comment: A primer used for in vitro synthesis of TGTP gene. The sequence of first 20 bases at the 3′-end is identical with the sequence of 5′-end of synthesized TGTP RNA molecule.
Comment: A primer used for in vitro synthesis of Vitronectin gene. The sequence of first 20 bases at the 3′-end is identical with the sequence of 5′-end of synthesized Vitronectin RNA molecule.
Comment: This is the first 20 bases at 5′-end of Code[1] sequence, and its Tm value is approximately 48° C. in the computing reaction solution.
Comment: Complementary sequence to 5′-end of aC3-T45.
Comment: Complementary sequence to 5′-end of aC3-T465.
(Preparation of RNA Samples)
TGTP and Vitronectin RNA molecules, as well as Code[2,1] and Code[3,2] RNA molecules used for the computing reaction were prepared with an in vitro transcription method.
TGTP gene and Vitronectin gene were prepared with the following procedures. The graft versus host reaction (GVHR) was induced in BALB/c mice by implantation of spleen cells derived from C57/BL10 mice. The C57/BL10 mouse-derived spleen cells were provided by Prof. Katsushi Tokunaga, Faculty of Medicine, The University of Tokyo. Total RNA was then prepared from a liver taken from the mice 2 days after the implantation. An equivalent of this sample has been confirmed to contain RNA of TGTP gene and Vitronectin gene by a semiquantitative real-time PCR method (Wakui et al. 2001). Reverse transcription was then performed using the total RNA as a template to generate cDNAs of TGTP gene and Vitronectin gene. TGTP-PE and Vitronectin-PE were used as primers in the reverse transcription for TGTP gene and Vitronectin gene respectively. The reaction solution for the reverse transcription contained 50 mM Tris-HCl (pH 8.3), 4 mM DTT, 10 mM MgCl2, 100 mM KCl, 0.5 mM dNTPs, 800 nM of each primer and 0.3 Units/μl AMV Reverse Transcriptase XL (Takara Bio Inc.), to which total RNA was added when the reaction was performed. The reaction was performed by the hot-start method. Specifically, 9.5 μl of the reaction solution without the enzyme was incubated for 5 minutes at 65° C., and then 3 μl of a solution containing the enzyme was added. After the enzyme-containing solution was incubated for 60 min at 50° C., 0.5 μl of Ribonuclease H (2 U/μl; Invitrogen) was added, and the mixture was reacted for 20 min at 37° C. PCR was then performed separately using the resulting cDNA as a template.
In these reactions, the pair of TGTP-PE and TGTP-PT and the pair of Vitronectin-PE and Vitronectin-PT were used as primer pairs for TGTP and Vitronectin respectively, wherein the TGTP-PT primer and the Vitronectin-PT primer are oligo DNAs in which a clamp sequence of 6 bases (5′-GATGCA-3′ (SEQ ID NO:26)) and a T7 promoter sequence of 23 bases (5′-TAATACGACTCACTATAGGGAGA-3′ (SEQ ID NO:27)) are added at the 5′-end of the gene specific sequences TGTP-PS and Vitronectin-PS respectively. TaKaRa Ex Taq™ (Takara Bio Inc.) was used in the PCR, which was performed following the attached protocol (Cool start method). Briefly, the solution, prepared by adding 0.8 μl of each primer DNA, each of 0.2 mM dNTPs, 40 U/ml enzyme and 1 μl of cDNA sample to 25 μl of the reaction buffer, was subjected to 31 cycles of 94° C.-30 sec, 60° C.-90 sec and 72° C.-60 sec, followed by 72° C.-10 min. Detection of the resulting PCR products by electrophoresis revealed that this reaction provided single bands having the expected base lengths, which were 831-base and 846-base double stranded DNA for TGTP and Vitronectin respectively (data not shown).
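The structure of the PT primers described above (a 6-base 5′ sequence, then the 23-base T7 promoter, then the gene specific sequence) can be written out as a simple string composition. This is a minimal sketch: the two fixed sequences are those given in the text, while the function name and the gene specific sequence in the usage note are hypothetical placeholders, since the actual gene specific sequences are not reproduced here.

```python
CLAMP = "GATGCA"                          # 6-base 5' sequence (SEQ ID NO:26)
T7_PROMOTER = "TAATACGACTCACTATAGGGAGA"   # 23-base T7 promoter (SEQ ID NO:27)


def build_pt_primer(gene_specific):
    """Compose a PT-type primer, written 5' to 3':
    clamp + T7 promoter + gene specific sequence."""
    seq = gene_specific.upper()
    invalid = set(seq) - set("ACGT")
    if not seq or invalid:
        raise ValueError("gene specific sequence must be a non-empty ACGT string")
    return CLAMP + T7_PROMOTER + seq
```

For a hypothetical 20-base gene specific sequence such as `"ACGTACGTACGTACGTACGT"`, `build_pt_primer` returns a 49-base oligo DNA (6 + 23 + 20 bases).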
In vitro transcription was performed separately using the T7 promoter-containing double stranded DNA of either TGTP or Vitronectin gene, generated by the above PCR, to produce an RNA molecule for each gene. For each gene, this reaction was carried out with 100 μl of the reaction solution aliquoted into 4 tubes, in each of which 500 U/ml T7 RNA Polymerase (Invitrogen) and 1 μl of double stranded template were added to the reaction buffer, comprising 40 mM Tris-HCl (pH 8.0), 8 mM MgCl2, 2 mM Spermidine-(HCl)3, 25 mM NaCl, 5 mM DTT and 0.4 mM NTPs. After incubation for 1 hr at 37° C., 2.5 μl of 1 U/μl Deoxyribonuclease I (Amplification Grade; Invitrogen) was added to the mixture, which was incubated for a further 15 min at 37° C. The resulting reaction products were purified by ethanol precipitation. The ethanol precipitation was performed with Pellet Paint® Co-Precipitant (Novagen) following the attached protocol. The resulting precipitates were dissolved in DEPC water and stored at −20° C. before use.
Code[2,1] and Code[3,2] RNA molecules were synthesized in vitro using the custom oligo DNAs TGTP-T21 and Vitronectin-T32. For each reaction, a 20-base primer complementary to the 3′-end of the oligo DNA was mixed with the PCR reaction solution and incubated for 5 minutes at 94° C.; buffer containing the enzyme was then added at 80° C., followed by incubation for 5 minutes at 60° C. and then for 60 minutes at 72° C. The resulting double stranded DNA containing the T7 promoter sequence was used for in vitro transcription to produce coding RNA. The transcription reaction, Deoxyribonuclease I treatment and ethanol precipitation were performed as in the case of TGTP gene and Vitronectin gene.
(Computing Reaction)
The computing reaction, which executes the various function reactions with DNA primers, is accomplished by the coexistence of an enzyme with RNA dependent DNA polymerase activity, an enzyme with DNA dependent DNA polymerase activity and an enzyme with DNA dependent RNA polymerase activity in a single buffer in which the enzymes can be active. The reaction solution comprises 40 mM Tris-HCl (pH 8.0), 50 mM NaCl, 8 mM MgCl2, 5 mM DTT, 0.3 U/μl AMV Reverse Transcriptase XL (Takara Bio Inc.), 0.04 U/μl Ex Taq™ (Takara Bio Inc.) and 3.2 U/μl Thermo T7 RNA Polymerase (TOYOBO). DNA primers and an RNA template are added to this reaction solution as required. Unless otherwise provided, DNA primers are added to a final concentration of 1 nM. The reaction was carried out with the hot start method, wherein the reaction solution without enzymes was incubated at 65° C. for 5 minutes. Then, the buffered solution containing all enzymes was added at 50° C. and mixed well, followed by incubation at 50° C. Unless otherwise provided, 3 μl of enzyme buffer was added per tube in a total reaction volume of 25 μl and allowed to react for 30 min. Immediately after completion of the reaction, the reaction mixture was incubated at 85° C. for 10 minutes to deactivate the transcription enzyme.
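As a worked illustration of the volumes above, the enzyme concentrations needed in the 3 μl enzyme buffer to reach the stated final concentrations in 25 μl follow from C1·V1 = C2·V2. The sketch below assumes that all three enzymes are supplied only through the 3 μl enzyme buffer; the function and constant names are hypothetical.

```python
TOTAL_VOLUME_UL = 25.0    # total reaction volume per tube
ENZYME_BUFFER_UL = 3.0    # volume of enzyme buffer added per tube

# Final concentrations in the reaction solution (U/ul), as stated in the text.
FINAL_U_PER_UL = {
    "AMV Reverse Transcriptase XL": 0.3,
    "Ex Taq": 0.04,
    "Thermo T7 RNA Polymerase": 3.2,
}


def stock_concentration(final_u_per_ul,
                        total_ul=TOTAL_VOLUME_UL,
                        added_ul=ENZYME_BUFFER_UL):
    """Concentration required in the enzyme buffer (C1 = C2 * V2 / V1)."""
    return final_u_per_ul * total_ul / added_ul


def units_per_tube(final_u_per_ul, total_ul=TOTAL_VOLUME_UL):
    """Total enzyme units present in one tube."""
    return final_u_per_ul * total_ul


if __name__ == "__main__":
    for name, final in FINAL_U_PER_UL.items():
        print(f"{name}: {stock_concentration(final):.2f} U/ul in enzyme buffer, "
              f"{units_per_tube(final):.2f} U per tube")
```

Under this assumption, the enzyme buffer would need to contain about 2.5 U/μl AMV Reverse Transcriptase XL, about 0.33 U/μl Ex Taq and about 26.7 U/μl Thermo T7 RNA Polymerase.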
(Detection of RNA Products After the Computing Reaction)
The RNA products resulting from the computing reaction were detected by reverse transcription PCR after enzymatic degradation of DNA.
Before enzymatic degradation of DNA, column purification was performed to remove enzymes, such as Taq polymerase, that potentially bind to DNA and inhibit the enzymatic degradation. A sample solution was prepared by adding DEPC treated water to the computing reaction solution to a total volume of 50 μl, applied to a MICROCON YM-100 column (Millipore, molecular weight cut off=100,000) and centrifuged at 4° C., 12,000 rcf for 10 minutes. The collected flow-through solution was then applied to a MICROCON YM-10 column (Millipore, molecular weight cut off=10,000) and centrifuged at 4° C., 12,000 rcf for 50 minutes, followed by centrifugation of the column placed upside down in a new tube at 4° C., 12,000 rcf for 10 minutes to collect the concentrated solution remaining on the upper side of the column.
DNA degrading reaction was performed at room temperature for 15 minutes in 10 μl of the reaction solution which was prepared by addition of 1 μl of each sample collected from above to 20 mM Tris-HCl (pH 8.4), 2 mM MgCl2, 50 mM KCl and 0.1 U/μl Deoxyribonuclease I (Amplification Grade; Invitrogen). After the reaction, to the reaction solution was added 1 μl of 25 mM EDTA and then incubated at 65° C. for 10 minutes.
The reverse transcription reaction was performed in 12.5 μl of a reaction solution per tube, which was prepared by adding primer DNAs to a final concentration of 600 nM and 1 μl of the DNase I reaction product obtained above to 50 mM Tris-HCl (pH 8.3), 4 mM DTT, 10 mM MgCl2, 100 mM KCl, 0.5 mM dNTPs and 0.3 Units/μl AMV Reverse Transcriptase XL (Takara Bio Inc.). This reaction was carried out with the hot start method, wherein the solution comprising all components except for the enzyme was incubated at 65° C. for 5 minutes, and then 3 μl of the buffered solution containing the enzyme was added at 50° C. The mixture was then allowed to react at 50° C. for 1 hr, followed by 94° C. for 10 minutes.
The resulting cDNA was quantitatively analyzed by real-time PCR. To 20 μl of reaction solution, prepared following the manufacturer's manual, was added 1 μl of the reverse transcription product, and the mixture was incubated at 94° C. for 10 minutes before PCR was performed. The PCR was carried out for 40 cycles of 94° C.-3 sec, 60° C.-10 sec and 72° C.-5 sec to amplify coding sequences and gene sequences of less than 300 bases, and for 40 cycles of 94° C.-25 sec, 60° C.-10 sec and 72° C.-25 sec to amplify gene sequences of 300 bases or more. Quantitative concentration analysis was performed by comparing the PCR amplification curves obtained above with those from simultaneous PCR reactions with single stranded DNA at final concentrations of 0.1 nM, 0.03 nM and 0.01 nM, using the software supplied with the instrument. In addition, the PCR was stopped at an appropriate time point to take partially amplified samples, which were detected and analyzed by gel electrophoresis using the Agilent 2100 bioanalyzer (Agilent Technologies).
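The comparison with standards of known concentration can be illustrated with a conventional standard-curve calculation. The sketch below is an assumption about how such a comparison is commonly done (a linear fit of threshold cycle against log10 concentration); it is not a description of the instrument software, and the threshold cycle values in the usage note are invented for illustration.

```python
import math


def fit_standard_curve(concs_nM, cts):
    """Least-squares fit of Ct = intercept + slope * log10(concentration)."""
    xs = [math.log10(c) for c in concs_nM]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(cts) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_concentration(ct, slope, intercept):
    """Invert the standard curve to estimate a concentration in nM."""
    return 10 ** ((ct - intercept) / slope)


if __name__ == "__main__":
    standards_nM = [0.1, 0.03, 0.01]   # standards named in the text
    example_cts = [18.3, 20.0, 21.6]   # hypothetical threshold cycles
    slope, intercept = fit_standard_curve(standards_nM, example_cts)
    print(estimate_concentration(19.0, slope, intercept))
```

A sample whose threshold cycle falls between those of two standards is assigned a concentration between them. Values outside the range of the standards are extrapolations and should be treated with caution.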
(Detection of Intermediate DNA Products in a Computing Reaction)
Intermediate products comprising single stranded and double stranded DNA generated in the reaction solution were detected to confirm the progress of the computing reaction. Single stranded or double stranded DNAs generated by the reverse transcription reaction were purified and then detected by amplifying them by PCR. DEPC treated water was added to the computing reaction solution to adjust the sample volume to 50 μl, which was pipetted into a MICROCON YM-100 column (Millipore) (MW cutoff value is 100,000) and centrifuged at 4° C., 12000 rcf for 10 minutes. The flow-through solution from the column was collected and pipetted into MICROCON YM-100 (Millipore) (MW cutoff value is 100,000), which was centrifuged at 4° C., 12000 rcf for 50 minutes, followed by further centrifugation of the column placed upside down in a new tube at 4° C., 12000 rcf for 10 minutes to collect the concentrated solution remaining on the upper side of the column. The resulting solution was used for PCR to amplify single stranded DNAs in a reaction solution containing buffer to which appropriate primers and Ex Taq® (Takara Bio Inc.) had been added. The amplification was carried out with the cool start method, for 31 cycles of 94° C.-30 sec, 60° C.-60 sec and 72° C.-60 sec, followed by incubation at 72° C. for 10 minutes. The resulting amplified products were detected by gel electrophoresis.
Double stranded DNA generated from the DNA double-strand formation reaction was detected by gel electrophoresis using Agilent 2100 bioanalyzer (Agilent Technologies). Base length and concentration of double stranded DNA were determined following the protocols of the instrument.
Results
(Development of Hardware)
To achieve a molecular computer simulating retrovirus genome amplification reactions, at first, the condition of the reaction solution was considered to generate all chemical reactions required for a molecular computer. This reaction solution is critical because it acts as hardware constructing the molecular computer.
For the hardware used here, it is necessary to allow all enzymes, which have DNA dependent DNA polymerase activity, RNA dependent DNA polymerase activity, DNA dependent RNA polymerase activity and RNaseH activity respectively, to be active simultaneously in single tube maintained at the certain temperature. We performed the experiment following the condition used in 3SR amplification technique (Guatelli et al. Isothermal, in vitro amplification of nucleic acids by a multienzyme reaction modeled after retroviral replication. Proc Natl Acad Sci USA October 1990; 87(19):7797), in which the similar reaction solution has been achieved. However, when the experiment was carried out following the conditions for 3SR, wherein, as well as in the similar technique, the reaction temperature is lower, 37° C. to 42° C., than annealing temperature in PCR reaction, it showed the difficulty to allow primer DNAs used for reverse transcription and DNA double-strand formation reaction to act specifically, resulting in causing more frequent dimmer formation of having no targets particularly, as well as inhibition of expected reactions by non-specific reactions (data not shown).
Such properties are not suitable for the hardware of the molecular computer, in which DNAs are used for input of programs, thus we considered setting the reaction temperature higher to achieve highly specific priming. AMV reverse transcriptase, T7 RNA polymerase and RNaseH were used in the 3SR, however two latter enzymes would be inactivated at higher reaction temperature. Therefore, Thermo T7 RNA Polymerase (TT7; TOYOBO) and Thermus thermophilus RibonucleaseH (Tth RNaseH; TOYOBO) were examined for the use as an enzyme showing DNA dependent RNA polymerase activity and one showing RNaseH activity respectively at higher reaction temperature. AMV reverse transcriptase, TT7 and Tth RNase have been confirmed to be active below 65° C., 50° C. and 90° C. respectively, and at as low as approximately 37° C. Preferably, this experiment was performed as high temperature as possible, thus, the reaction was examined at 50° C. or higher.
The assay performed for each reaction activity under conditions using the heat-resistant enzymes at from 50° C. to 62° C. showed that DNA-dependent RNA polymerase activity becomes dramatically higher as the temperature rises beyond 50° C. (
Based on the considerations above, we decided to perform the computing reaction at a constant temperature of 50° C., using as hardware a reaction solution comprising AMV reverse transcriptase, TT7 RNA polymerase and Taq DNA polymerase.
(Assessment of the Specificity of Primer Elongation Reaction)
In the computing reaction, data input and operations are carried out by primer elongation reactions using RNA and DNA as templates, so it is very important to ensure the specificity of priming. Here, we assessed the activity and specificity of primer elongation in the reverse transcription reaction with specifically designed primers, using as targets in vitro-synthesized gene fragments of the TGTP/Mg21 gene (hereinafter called the TGTP gene) and the Vitronectin gene, both of which are known to be highly expressed in graft versus host reaction (GVHR).
TGTP-P1 is a primer having the sequence TGTP-P1, which is specific to the TGTP gene, at its 3′-end. To assess the elongation activity and specificity of this primer, the computing reaction was carried out with a mixture of this primer and the TGTP gene for 15, 30 and 45 min, and the resulting primer elongation product was subjected to PCR amplification (
In a similar experiment using Vitronectin-P1, a primer specific for Vitronectin gene RNA, a peak was observed at the expected size, 792 bp, only in the presence of the Vitronectin gene (lanes 7 to 12). This result confirms that this primer also primes specifically at the target region only. However, the smear signal observed suggests that non-specific reactions also occur to a slight extent.
The above results confirmed that the TGTP-P1 and Vitronectin-P1 primers can be used as specific primers in the hardware. Furthermore, a similar experiment performed at 37° C. gave strong primer-dimer-like bands and a strong non-specific smear, demonstrating again that the 50° C. reaction condition developed here is appropriate for the computing reaction (data not shown).
(Execution of Encoding Functions)
In the presence of a specific RNA, an encoding function generates the corresponding coded RNA. Executing the encoding function is the first step toward realizing the gene expression analysis program. Here, we designed encoding functions for TGTP gene RNA and Vitronectin gene RNA and performed experiments using them.
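Viewed as a program, an encoding function maps the presence of a target RNA to a coded RNA. The sketch below is a purely logical model and says nothing about reaction kinetics. It assumes that a tube can be represented as a set of RNA names; `make_encoder` and this tube representation are hypothetical, while the output labels Code[2,1] (TGTP gene) and Code[3,2] (Vitronectin gene) are the ones used in this description.

```python
def make_encoder(target_rna, coded_rna):
    """Build an encoding function: emit coded_rna only if target_rna is present."""
    def encode(tube):
        # Assumption: a tube is modeled as a set of RNA species names.
        return {coded_rna} if target_rna in tube else set()
    return encode


encode_tgtp = make_encoder("TGTP", "Code[2,1]")
encode_vitronectin = make_encoder("Vitronectin", "Code[3,2]")

sample = {"TGTP"}
print(encode_tgtp(sample))         # {'Code[2,1]'}
print(encode_vitronectin(sample))  # set()
```

A tube containing neither target corresponds to the negative control: both encoders return an empty set.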
The structure of the TGTP encoding function is shown in
The TGTP gene encoding function illustrated here was executed in the computing reaction solution using a hybrid of TGTP-T21 and aT21 oligo DNA as P1 and TGTP-S2 as P2, and the RNA of the output coding sequence, Code[2,1], was quantified (
The reaction specificity of the encoding functions was assessed experimentally. The computing reaction was performed for 30 min with addition of TGTP gene RNA and Vitronectin gene RNA as inputs to the encoding functions, or of the same amount of water (N.C.) as a negative control, and the concentration of the resulting coded sequence was measured (
(Reverse Transcription Reaction and Operation Reaction with the Path Across Multiple RNA Molecules)
To perform theoretical operations and a gene expression analysis program with a neural network on the molecular computer, a reaction is required that reverse transcribes a path comprising multiple RNA molecules. Reverse transcription of such a path proceeds as follows: reverse transcription is initiated by priming of a primer on the first RNA molecule, and the 3′-end of the resulting cDNA then primes on the second RNA molecule. RNaseH activity is important in this process because it removes the first RNA molecule.
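The stepwise nature of this process can be sketched abstractly. In the model below, each RNA molecule is reduced to a pair of site labels: the site where a 3′-end can prime, and the site found at the 3′-end of the cDNA after elongation on that RNA. The site names, the dictionary representation and the function `reverse_transcribe_path` are hypothetical simplifications introduced for illustration; the sketch does not model sequences, kinetics or the actual code notation.

```python
def reverse_transcribe_path(primer_site, templates, rnase_h=True):
    """Follow a path of priming sites across several RNA templates.

    templates maps the site where a 3' end can prime to the site found at
    the 3' end of the cDNA after elongation on that RNA (abstract model).
    """
    path = [primer_site]
    current = primer_site
    visited = set()
    while current in templates and current not in visited:
        visited.add(current)
        current = templates[current]  # elongation to the end of this RNA
        path.append(current)
        if not rnase_h:
            # Without RNaseH the cDNA stays hybridized to the first RNA,
            # so its 3' end cannot prime on the next molecule.
            break
    return path


rnas = {"site_A": "site_B", "site_B": "site_C"}  # two RNA molecules
print(reverse_transcribe_path("site_A", rnas))                 # ['site_A', 'site_B', 'site_C']
print(reverse_transcribe_path("site_A", rnas, rnase_h=False))  # ['site_A', 'site_B']
```

The second call illustrates, within this simplified model, why removal of the first RNA molecule is needed before the path can continue onto the second.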
An experiment was performed to assess the reaction that reverse transcribes the path across two RNA molecules, Code[1]→Code[2]→Code[3], using Code[2,1] and Code[3,2] RNA molecules synthesized in vitro as described
With the feasibility of reverse transcription along multiple RNA molecules demonstrated, a function taking the path as an argument, “Path (Code[1]-# Code[3]),=>Code[4,5]”, was constructed based on the underlying function B ( Vitronectin
code[4,5]”, which returns Code[4,5] only when both the TGTP gene and the Vitronectin gene co-exist (See
(Execution of Sense Strand RNA Amplifying Function)
Based on the underlying function C: Amplify (a−# b [--add5 P] [--add3 Q]), an amplifying function for the TGTP gene sequence was designed and the reaction was examined experimentally. TGTP-PT is the primer used for in vitro synthesis of TGTP gene RNA; the sequence TGTP-PS, located at the 3′-end of this primer, is identical to the 5′-end of TGTP gene RNA, and the primer further has a T7 promoter sequence at its 5′-end. The TGTP-AR primer comprises the reverse complement of a 26-base region starting at base 538 of the in vitro-synthesized TGTP gene RNA.
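Deriving a reverse primer of this kind from a template can be sketched as follows. This is a minimal illustration, assuming the template is written in the DNA alphabet (the cDNA-equivalent of the RNA) and using 1-based positions as in the text; the toy sequence and the function names are placeholders, since the actual TGTP sequence is not reproduced in this section.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return dna.translate(COMPLEMENT)[::-1]


def reverse_primer(template_dna, start, length):
    """Reverse complement of `length` bases starting at 1-based position `start`."""
    region = template_dna[start - 1:start - 1 + length]
    if len(region) != length:
        raise ValueError("region extends past the end of the template")
    return reverse_complement(region)


# Placeholder sequence for illustration only; not the TGTP gene sequence.
toy = "ATGGCCATTGTAATGGGCCGC"
print(reverse_primer(toy, start=4, length=6))  # AATGGC
```

For a primer defined as in the text, the call would use `start=538` and `length=26` on the full template sequence.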
The combination of the TGTP-PT and TGTP-AR primers provides the gene amplifying function “Amplify(a TGTP-AR-# TGTP-PS”, which takes the TGTP gene as its argument; passing TGTP gene RNA to it was expected to lead to amplification of the sense strand RNA sequence lying between the sequences TGTP-PS and TGTP-AR (
Using this TGTP gene sense strand RNA amplifying function, the computing reaction was performed for 0, 15 and 30 min with addition of in vitro-synthesized TGTP gene or Vitronectin gene, or of the same volume of water (N.C.), and the RNA products were detected. Detection was performed as follows: the computing reaction product was treated with DNaseI to remove primer DNAs and intermediate DNAs, subjected to reverse transcription using the TGTP-AR primer, PCR-amplified using both TGTP-AR and TGTP-PT, and detected by gel electrophoresis (
Discussion
We carried out experiments to implement, in an in vitro reaction system, a molecular computer that simulates the retrovirus genome amplification reaction. The hardware for this molecular computer requires DNA-dependent DNA polymerase, RNA-dependent DNA polymerase, DNA-dependent RNA polymerase and RNaseH activities to co-exist. In addition, each reaction must be sufficiently specific for the computing reaction to execute correctly. In this study, a reaction condition meeting these requirements was newly developed and used as the hardware in the experiments.
Executing the gene encoding reaction with this hardware confirmed that the coding sequence RNA is generated at a constant temperature within a reaction time as short as 30 to 40 min. This may be applied effectively to gene expression measurement as an easy-to-use gene encoding technique, and may also serve as the input for higher-level programs of the molecular computer, such as gene expression analysis. Problems with the target specificity of priming are quite unlikely, because the gene-specific sequence regions of the first primers (P1) used in the encoding functions for the TGTP gene and the Vitronectin gene, respectively, have already been confirmed to be specific in primer elongation in
This study showed that the reactions required to implement the retroviral-type molecular computer can be achieved in vitro. Furthermore, the TGTP gene and the Vitronectin gene targeted in this study are expected to be applicable as marker genes for gene diagnosis of graft versus host reaction (GVHR) after transplant surgery. In this study, the gene expression analysis program was designed to consist of encoding functions that take these genes as arguments and functions that receive their output and execute the operations, and part of this program was shown experimentally to be executable. This molecular computer system is expected to establish a technology in which the molecules within a test tube autonomously analyze the expression patterns of multiple genes and output the results, requiring only that the reactions be run in a single tube at a constant temperature; this may in turn be applied to simple and accurate gene diagnosis. In the future, the work is also expected to develop into studies that execute a similar molecular computer system in living cells, so the findings may indicate a new direction for molecular computer research.
(Multilayering of Multiple Functions)
In the molecular computer, the return value of one function may be used as an argument for another function. An experiment was conducted, in a single computing reaction solution, to determine the semantics of a program comprising an encoding function that outputs the Code[3,2] RNA sequence in the presence of the Vitronectin gene and another function that outputs the Code[4,6,5] RNA sequence in the presence of the return value of the former function. The reaction is summarized in
The above results demonstrated that the program comprising 2 types of functions illustrated
Number | Date | Country | Kind |
---|---|---|---|
2003-155988 | May 2003 | JP | national |
This is a Continuation Application of PCT Application No. PCT/JP2004/000952, filed Jan. 30, 2004, which was published under PCT Article 21(2) in Japanese. This application is based upon and claims the benefit of priority from prior Japanese Patent Application No. 2003-155988, filed May 30, 2003, the entire contents of which are incorporated herein by reference.
Relation | Number | Date | Country |
---|---|---|---|
Parent | PCT/JP04/00952 | Jan 2004 | US |
Child | 11290259 | Nov 2005 | US |