The present invention relates to designing a single nucleotide sequence that binds to biomarkers for detection of multiple nucleic acids, and a multiple-nucleic-acid probe comprising the same.
Genetic testing can confirm or rule out a suspected genetic disease in patients having symptoms of the disease, and can also predict diseases that increase the likelihood and risk of developing diseases through testing for genetic mutations that cause diseases. It has been reported that such genetic testing can reduce disease morbidity and mortality by helping to prevent, diagnose early, treat and manage diseases.
Genetic testing can be performed on any tissue in the human body on the premise that all cells in the human body are genetically identical, but genetic testing can be most easily performed using nucleic acids extracted from blood.
The field of nucleic acid diagnosis has been used for determination of single nucleotide polymorphism (SNP), detection and identification of pathogenic bacteria or viruses, diagnosis of genetic diseases, and the like. Accordingly, a number of methods for quickly and accurately detecting specific nucleic acids have been proposed, and many studies related thereto are still underway (W. Shen et al., 2013, Biosen. and Bioele., 42:165-172.; M. L. Ermini et al., 2014, Biosen. and Bioele., 61:28-37.; K. Chang et al., 2015, Biosen. and Bioele., 66:297-307.). The most common methods that are used to detect specific nucleic acids comprise: methods that use polymerase chain reaction (PCR); multiplex polymerase chain reaction (multiplex PCR) methods; and SNPlex, GoldenGate assay, and molecular inversion probes (MIPs), which are techniques enabling high-throughput analysis by simultaneously amplifying multiple nucleic acids using common primers without using multiple polymerase chain reaction.
In recent years, a nucleic acid probe composed of a nucleotide sequence capable of complementarily binding to a target nucleic acid to be detected has been developed. The nucleic acid probe is a method of measuring biomarkers that hybridized with a biological sample, and a method of simultaneously detecting multiple biomarkers is essentially used in order to increase the significance of disease diagnosis and prognosis prediction. A conventional method is a method of performing multiple diagnoses using multiple nucleotide sequences capable of binding complementarily to single nucleic acid biomarkers.
However, the cost for multiple diagnoses is very high, and hence this conventional method has limitations in that it is difficult to use at an early stage and continuous monitoring and prognostic observation are difficult. For example, in the case of BRCA1 and BRCA2, which are representative breast cancer biomarkers, the probability of finding mutations in the actual patient group is low at about 13.9% (Journal of the Korean Society for Radiation Oncology), and in the case of commercially available diagnostic kits that are used for accurate diagnosis, it is possible to simultaneously diagnose an average of about 10 different genetic factors, but the cost per diagnostic test is $3,000 or more, and thus preventive diagnosis and continuous monitoring are severely limited.
Meanwhile, the k-means clustering algorithm belongs to the partitioning method among clustering methods. Partitioning is a method of dividing a given data set into several groups. For example, when it is assumed that n data objects are input, the partitioning method divides the input data into k groups, where k is smaller than or equal to the number (n) of the data objects n. Each of the divided groups forms clusters. That is, the data is divided into k groups, each consisting of one or more data objects. The process of dividing into groups is performed in a manner that minimizes a cost function such as dissimilarity between groups based on distance, and in this process, the similarity between data objects in the same group increases, and the similarity with the data objects in other groups decreases. The k-means algorithm determines the sum of squares of the distance between the centroid of each group and the data objects in the group as a cost function, and performs clustering by updating the group belonging to each data object in the direction that minimizes this function value.
The present inventors have made efforts to overcome the above-described disadvantages and to develop a method capable of quickly and inexpensively diagnosing the overall pattern of detectable nucleic acid biomarkers in a patient's biological sample, and as a result, have found that it is possible to develop a method of modeling an artificial nucleotide sequence having optimal selectivity for multiple nucleic acid biomarkers by using thermodynamic principles, comprising k-means clustering algorithms, and social network analysis, and it is possible to quickly and accurately detect multiple nucleic acids by the method, thereby completing the present invention.
An object of the present invention is to provide a method for designing an artificial nucleotide sequence for binding to multiple nucleic acid biomarkers.
Another object of the present invention is to provide a multiple-nucleic-acid probe using the artificial nucleotide sequence.
Yet another object of the present invention is to provide a method of detecting multiple nucleic acids using the artificial nucleotide sequence.
The present invention provides a method for designing an artificial nucleotide sequence for binding to multiple nucleic acid biomarkers, the method comprising steps of: a) preparing each random analog sequence (standard sequence) set having similarity to target nucleic acids;
b) selecting two analog sequences having the highest hybridization profile similarity among the analog sequence set by using a nearest-neighbor algorithm;
c) setting multiple equilibrium reactions of a triple-stranded sequence consisting of the two selected analog sequences and an arbitrary nucleic acid sequence, and selecting a common complement, which indicates the sum of the highest equilibrium constants (K), by using the following Equation 1:
[Analog A]+[Complement]=[ComplexA] . . . ΔGA=−RT ln KA
[Analog B]+[Complement]=[ComplexB] . . . ΔGB=−RT ln KB
K
A=[ComplexA]/[Analog A][Complement]=xA(4−xA−xB)/(1−xA)(2−xA−xB)
K
B=[ComplexB]/[Analog B][Complement]=xB(4−xA−xB)/(1−xB)(2−xA−xB) [Equation 1]
wherein [Analog] represents the concentration of the analog, [Complex] represents the concentration of a complex of the analog and the complement, the concentration of the sequence is assigned equal to the hybridization reactivity of the two analog sequences, AG represents the Gibbs free energy in the standard state, R represents the gas constant, T represents absolute temperature, and x represents concentration;
d) selecting, as a strand complementary to the common complement, a representative sequence which is representative of the two analog sequences; and
e) repeating steps a) to e) until a single strand remains.
The present invention also provides a probe for detecting multiple nucleic acid biomarkers, the probe comprising the artificial nucleotide sequence and a fluorescent substance.
An artificial nucleotide sequence designed by the method for designing an artificial nucleotide sequence according to the present invention exhibits better hybridization reactivity with all analogs, and a multiple-nucleic-acid probe comprising the artificial nucleotide sequence is designed such that a single diagnostic probe is capable of simultaneously binding to multiple types of nucleic acids having significance. Thus, the use of the single diagnostic probe may achieve ultra-multiplex diagnosis of nucleic acid biomarkers by diagnosing the overall expression pattern in a sample, whereby diagnostic ability may be improved and the cost of examination may be drastically reduced.
The present invention relates to a multiple-nucleic-acid probe using a single nucleotide sequence that binds to biomarkers for detection of multiple nucleic acids, and a method for detecting multiple nucleic acids using the same.
Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those skilled in the art may easily implement the present invention. However, the present invention may be embodied in various different forms and is not limited to the embodiments described herein. In addition, in the drawings, parts not related to the description are omitted in order to clearly describe the present invention.
Throughout the present specification, it is to be understood that, when any part is referred to as “comprising” any component, it does not exclude other components, but may further comprise other components, unless otherwise specified.
As used throughout the present specification, terms of degree, such as “about” and “substantially”, are used in the sense of “at, or nearly at, when given the manufacturing and material tolerances inherent in the stated circumstances”, and are used to prevent any unconscientious violator from unduly taking advantage of the disclosure in which exact or absolute numerical values are given so as to help understand the invention. As used throughout the present specification, the term “step of (doing) . . . ” or “step of . . . ” does not mean “a step for . . . ”.
First, the present invention provides a method for designing an artificial nucleotide sequence for binding to multiple nucleic acid biomarkers, the method comprising steps of:
a) preparing each random analog sequence set having similarity to target nucleic acids;
b) selecting two analog sequences having the highest hybridization profile similarity among the analog sequence set by using a nearest-neighbor algorithm;
c) setting multiple equilibrium reactions of a triple-stranded sequence consisting of the two selected analog sequences and an arbitrary nucleic acid sequence, and selecting a common complement, which indicates the sum of the highest equilibrium constants (K), by using the following Equation 1:
[Analog A]+[Complement]=[ComplexA] . . . ΔGA=−RT ln KA
[Analog B]+[Complement]=[ComplexB] . . . ΔGB=−RT ln KB
K
A=[ComplexA]/[Analog A][Complement]=xA(4−xA−xB)/(1−xA)(2−xA−xB)
K
B=[ComplexB]/[Analog B][Complement]=xB(4−xA−xB)/(1−xB)(2−xA−xB) [Equation 1]
wherein [Analog] represents the concentration of the analog, [Complex] represents the concentration of a complex of the analog and the complement, the concentration of the sequence is assigned equal to the hybridization reactivity of the two analog sequences, AG represents the Gibbs free energy in the standard state, R represents gas constant, T represents absolute temperature, and x represents concentration;
d) selecting, as a strand complementary to the common complement, a representative sequence which is representative of the two analog sequences; and
e) repeating steps a) to e) until a single strand remains.
If the analog sequences have three or more multiple strands, the method may further comprise:
f) selecting a common complement, which indicates the sum of the highest equilibrium constants (K), through the following Equation 2 using the representative sequence of step d) and the concentration thereof:
[Representative]+[Common]=[ComplexR] . . . ΔGR=−RT ln KR
K
R=[ComplexR]/[Common][Representative]=xR(2x−xR)/(x−xR)2 [Equation 2]
wherein [Representative] represents the concentration of a strand complementary to the common complement, [Common] represents the concentration of the common complement, [Complex] represents the concentration of the common complement and the strand complementary thereto, AG represents the Gibbs free energy in the standard state, R represents the gas constant, T represents absolute temperature, and x represents concentration;
h) selecting, as a strand complementary to the common complement, a representative sequence which is representative of the two analog sequences; and
i) repeating steps a) to h) until a single strand remains.
The multiple strands preferably have 3 to 5 strands.
In the “step of preparing each random analog sequence set having similarity to target nucleic acids” may be a step of automatically producing a set of analog sequences complementary to target sequences by using a program.
The “target nucleic acid” refers to any nucleic acid of interest that may be present in a biological sample.
The “biomarker” is used to refer to a target molecule that indicates or is a sign of normal or abnormal process in an individual or a disease or other condition in an individual. More specifically, the “biomarker” is an anatomical, physiological, biochemical or molecular parameter associated with the presence of a specific physiological state or process, whether normal or abnormal, and if abnormal, whether chronic or acute.
When a biomarker indicates or is a sign of an abnormal process or a disease or other condition in an individual, that biomarker is generally described as being either over-expressed or under-expressed as compared to an expression level or value of the biomarker that indicates or is a sign of a normal process or an absence of a disease or other condition in an individual. “Up-regulation”, “up-regulated”, “over-expression”, “over-expressed”, and any variations thereof are used interchangeably to refer to a value or level of a biomarker in a biological sample that is greater than a value or level (or range of values or levels) of the biomarker that is typically detected in similar biological samples from healthy or normal individuals.
For verification of the representative sequence, Pearson's correlation coefficients of the Gibbs free energy values may be used, and preferably, the average Pearson's correlation coefficient of the representative sequence is higher than those of all the analogs constituting the analog sequence set of step a).
Through the verification, it can be confirmed that the representative sequence represents the sequences and hybridization profile of the two analogs.
The artificial nucleotide sequence obtained in the present invention may exhibit higher hybridization reactivity than all analogs constituting the analog sequence set in step a).
In step a), each analog sequence may consist of 8 to 10 bases.
In step c), two analog sequences with the highest closeness may be calculated first after the number of shared complementary sequences is expressed as closeness.
The nearest-neighbor algorithm that is used in step c) calculates a sequence having the highest similarity to the target sequences. Specifically, the nearest-neighbor algorithm calculates all pairwise “distances” between analogs in one individual and analogs in each of all individuals. For the algorithm, reference may be made to Bremner D, et al., (2005). “Output-sensitive algorithms for computing nearest-neighbor decision boundaries”. Discrete and Computational Geometry 33 (4): 593604), which is known literature.
Although a variety of methods may be used to identify the network of arbitrary elements, the most intuitive method is a method of adjusting elements in a multidimensional space and then measuring the Euclidean distance therebetween. When this method is applied to nucleotides, it can be seen that, if dots representing two nucleotides are located close to each other, the nucleotides show high sequence similarity, and on the contrary, if the sequences of the nucleotides significantly differ from each other, the distance between the two dots will be longer. Therefore, adjusting nucleotides by spatial arrangement may be ideally used for network analysis.
However, because the sequences of nucleotides are diverse and the interactions thereof are complex, it is almost impossible to project the same onto a single coordinate system. Thus, basic social networking methodologies may help determine nucleotide similarity. In the present invention, similarity was determined by visualizing a sociogram.
The present invention also provides a probe for detecting multiple nucleic acid biomarkers, the probe comprising: a nucleotide sequence designed by the method; and a fluorescent substance.
The fluorescent substance is preferably one or more selected from the group consisting of cyanine fluorescent molecules, rhodamine fluorescent molecules, Alexa fluorescent molecules, FITC (fluorescein isothiocyanate) fluorescent molecules, FAM (5-carboxy fluorescein) fluorescent molecules, Texas Red fluorescent molecules, and fluorescein. More preferably, the fluorescent substance may be cyanine.
The present invention also provides a method for detecting multiple nucleic acids, the method comprising steps of:
a) collecting a sample containing target nucleic acids;
b) mixing the sample, a primer set having a nucleotide sequence complementary to the sample, a cleavage reagent and a probe produced according to claim 8, and then amplifying target nucleic acid-probe complexes by an extension reaction; and
d) measuring the amount of probe fragments isolated from the complexes amplified in step c).
The “sample” is used interchangeably herein to refer to any material, biological fluid, tissue, or cell obtained or otherwise derived from an individual. This comprises blood (comprising whole blood, leukocytes, peripheral blood mononuclear cells, buffy coat, plasma, and serum), sputum, tears, mucus, nasal washes, nasal aspirate, breath, urine, semen, saliva, peritoneal washing, cystic fluid, amniotic fluid, glandular fluid, lymph fluid, cytologic fluid, ascites, pleural fluid, nipple aspirate, bronchial aspirate, bronchial brushing, synovial fluid, joint aspirate, organ secretions, cells, a cell extract, and cerebrospinal fluid. This also comprises experimentally separated fractions of all of the preceding. For example, a blood sample may be fractionated into serum, plasma, or into fractions containing particular types of blood cells, such as red blood cells or white blood cells. If desired, a sample may be a combination of samples from an individual, such as a combination of a tissue and fluid sample. The term “sample” also comprises materials containing homogenized solid materials, such as from a stool sample, a tissue sample, or a tissue biopsy, for example. The term “sample” also comprises materials derived from a tissue culture or a cell culture. Any suitable methods for obtaining a biological sample can be employed; exemplary methods comprise, e.g., phlebotomy, swab (e.g., buccal swab), and a fine needle aspirate biopsy procedure. Exemplary tissues susceptible to fine needle aspiration comprise lymph node, lung, lung washes, BAL (bronchoalveolar lavage), thyroid, breast, pancreas, and liver. Samples may also be collected, e.g., by microdissection (e.g., laser capture microdissection (LCM) or laser microdissection (LMD)), bladder wash, smear (e.g., a PAP smear), or ductal lavage. A “biological sample” obtained or derived from an individual comprises a sample that has been processed in any suitable manner after being obtained from the individual.
Furthermore, a biological sample may be derived by taking biological samples from a number of individuals and pooling the same, or pooling an aliquot of each individual's biological sample. The pooled sample may be treated as a sample from a single individual.
Hereinafter, preferred examples are presented to aid in understanding the present invention. However, the following examples are provided for easier understanding of the present invention, and the contents of the present invention are not limited by the examples.
All of the nucleotides used in the experiments were purchased from Integrated DNA Technologies, Inc. (Coralville, Iowa, USA). Initially, lyophilized nucleotides were dissolved in TE buffer (10 mM Tris, pH 8.0, 0.1 mM EDTA) to a concentration of 100 μM. The nucleotides were mixed at a ratio corresponding to each experiment with 100 mM NaCl for the hybridization process. The annealing process was performed in a Mastercycler Pro thermocycler from Eppendorf (Westbury, N.Y., USA). After heating at 95° C. for 5 minutes, temperature was decreased gradually from 95° C. to 25° C. at a rate of 0.5° C. per minute. The fluorescence intensities were measured using a SpectraMax M5 provided by Molecular Devices, Inc. (Sunnyvale, Calif., USA).
All calculations were conducted using Python v2.7, comprising the Gibbs free energy, Pearson's correlation coefficient, multiple equilibrium constants, and closeness. NumPy v1.8.0rc1 and SciPy v0.13.0b1 were used for algorithmic efficiency. Details of the calculation methods and formulae are presented in supplementary information.
Cytoscape v3.6.0 was used to visualize the sociogram. The standard sequence, the analogs, and the complementary candidates were used as nodes. The complementary candidates were connected to their relevant standard sequence or analogs. The position of nodes was determined through a prefuse force-directed layout.
1-1. Mapping Nucleotides According to Sequence Similarity
First, a model nucleotide sequence consisting of 10 random bases (standard sequence) was obtained from CANADA 2.0, analog sequences (mutants) were synthesized by changing the base of the standard sequence in a cumulative manner. Mut-1 was generated by a single base random mutation of the standard sequence, and the mutated base was transferred to the next mutant Mut-2. Thus, Mut-2 possessed two mismatched sequences from the standard sequence, one of Mut-1 and one of its own. In this way, a total of 10 mutants was generated. As the mutation number becomes higher (accumulation of mutations), the sequence difference between the standard sequence and the mutant increases.
The sequence information on the standard sequence and mutants is shown in
Since the sequence consisted of 10 bases, a total of 410 complementary sequences was present. Among all possible complementary strands, 100 sequences with the highest Gibbs free energy were selected (complementary candidates), and connected to each standard sequence or mutant to draw a nondirectional sociogram as shown in
As shown in
The number of shared complementary candidates was expressed as closeness, and the value of closeness was used to indicate similar Euclidean distances between nucleotides. The closeness profiles of the standard sequence and the mutants with the top 1000 rated complementary sequences are shown in
Cells contributing to the same sequence are indicated in yellow, and cells with a higher number of shared sequences are indicated in red. As shown in
1-2. Generation of Representative Sequence from Two Analogs
In the initial stage, two analogs were used as a model to prove the presence of the representative sequence. The calculation process is shown in
1. The Gibbs free energy of all complementary strands for two analogs is calculated, and the complementary strands (complementary candidates) with the highest sum of Gibbs free energy are selected.
2. The equilibrium of multiple reactions of the complementary candidates and Gibbs free energy values is calculated, and the best complementary strand (common complement) with the highest reaction equilibrium sum is selected.
3. A representative sequence with a perfect antisense match to the common complement is generated, and the concentration of the representative sequence with an equal sum of the reaction equilibrium of the two analogs and the common complement is calculated.
First, in the procedure for the Gibbs free energy calculation, the nearest-neighbor model was used with some modifications. In general usage, the nearest-neighbor parameter of the nucleic acid duplex and the terminal base pair parameters should be comprised to calculate the enthalpy and entropy of hybridization. However, the present inventors considered only the nearest-neighbor parameter in the complementary base pairing. The nearest-neighbor parameters were referenced from a previous study. After calculation, the Gibbs free energy values of each complementary strand against two analogs were added, and 1000 strands with the highest Gibbs free energy were selected as the complementary candidates for the next equilibrium calculation step.
Since the Gibbs free energy values between the analog and the complementary candidates were calculated for a single reaction condition, the Gibbs free energy values should be converted to the reaction constants in multiple reactions, which contain both the analogs and the complementary candidate. The sum of the reaction constants of the two analogs indicated the involvement of the complementary candidate in hybridization. Then, the common complement with the highest sum of reaction constants was selected from the complementary candidates. The basic formula and calculation process are shown in Equation 1 below.
[Analog A]+[Complement]=[ComplexA] . . . ΔGA=−RT ln KA
[Analog B]+[Complement]=[ComplexB] . . . ΔGB=−RT ln KB
K
A=[ComplexA]/[Analog A][Complement]=xA(4−xA−xB)/(1−xA)(2−xA−xB)
K
B=[ComplexB]/[Analog B][Complement]=xB(4−xA−xB)/(1−xB)(2−xA−xB) [Equation 1]
As shown in
Two different sequences of nucleotides with two base mismatches were randomly selected for the analogs. The most favorable common complement and representative sequence were determined through calculations. Sequence information and hybridization Gibbs free energy of the analogs used to calculate the representative sequence are shown in
As shown in
In addition, the multiple reaction equilibriums of the complement and the anti-analogs are shown in
As shown in
In addition,
As shown in
In addition,
As shown in
From the above results, it was confirmed that the representative sequence can represent the sequences and hybridization profile of the two analogs.
1-3. Generation of Representative Sequence from Multiple Sequences
In the coordinate system, the K-means clustering algorithm can generate intuitive and rational centroids for clustering. For clustering with the K-means algorithm, the sum of distances from the centroid to the data objects is measured, and the coordinates of centroids are updated to minimize the sum. The centroid itself has a coordinate just like other data objects, although it is not real. Thus, it can be said that the centroid represents the properties of the data objects in the cluster. This is quite similar to the calculation of the representative sequence from the analog sequences. The present inventors tried to apply the K-means clustering algorithm to the generation of the representative sequence with multiple sequences.
In the existing k-means clustering algorithm, the distance from the centroid is readjusted to the centroid of the data objects in clustering to minimize the sum of distances. However, it is almost impossible to calculate the centroids of nucleotides with a large number of bases. Several remarkable approaches have been developed to calculate multiple nucleotide equilibrium states. For instance, Robert Dirt and his colleagues developed methodologies for calculating multi-strand interactions and the formation of secondary structures by the combination of graph theory and a partition function. However, tremendous resources and calculation times are needed to obtain reasonable results for thousands of reactions simultaneously. Therefore, the present inventors applied the k-means clustering algorithm in a step-by-step manner.
The difference arising in the stepwise calculation of the k-means clustering algorithm can be overcome by assigning mass to the data. For example, as indicated below, the centroid coordinates (x, y) of three points (a, b, c) in a two-dimensional space are the average of the coordinates.
(x,y)=((xa+xb+xc)/3,(ya+yb+yc)/3)
(x,y)ab=((xa+xb)/2,(ya+yb)/2)
(x,y)=([{(xa+xb)/2}+xc]/2,[{(ya+yb)/2}+yc]/2)
Meanwhile, when certain masses (α, β, γ) are comprised, the centroid coordinates are expressed as follows.
(x,y)=((αxa+βxb+γxc)/(α+β+γ),(αya+βyb+γyc)/(α+β+γ))
In this case, stepwise calculation of the centroid coordinates is as follows, and the coordinate centroid end will be the same as that of a single calculation.
To experimentally demonstrate multiple reactions and hybridization between the analogs and the common complement, two common complements (code numbers: 294346 and 281802) were selected, and the efficiencies of hybridization of the common complements to the analogs are shown in
As shown in
This phenomenon was also observed when the concentration of the analogs increased. Before the actual experiment, the hybridization efficiency of the Anti-analogs and the common complements in various concentrations of analogs was calculated, and the results of the calculation are shown in
As shown in
In addition,
As shown in
To make this result more reliable, 100 random analog sets were used to generate the representative sequences. As a result, even though there were differences in the hybridization efficiency values, the results showed solid evidence of the same process in the representative sequences. The Gibbs free energy profile of the analogs and the representative sequences against all possible complementary sequences were compared using Pearson's correlation coefficient. Between two-base mismatched nucleotides, the Pearson's correlation coefficient was 0.636±0.049. In contrast, the average Pearson's correlation coefficient between the representative sequence and the analogs was 0.805±0.042. This increment of the coefficient indicated that the representative sequence can delegate the hybridization profile of the analog sequences.
In addition, it was possible to calculate the optimized representative sequences from various concentrations of analogs, and the results of the calculation are shown in
As shown in
In addition, Pearson's correlation coefficients of the optimized representative sequences for the analogs were calculated, and the results of the calculation are shown in
As shown in
In order to verify the representative sequence, the sequence information of several analogs, Gibbs energy and Pearson's correlation coefficient between the sequences were calculated, and the results are shown in
As shown in
In contrast, the Pearson's correlation coefficient of Analog 2/3 against Analog 1 did not increase, and there was no increase in the Pearson's correlation coefficient to the standard sequence. This indicates that the calculated representative sequence was specific to the target analogs. Meanwhile, the representative sequence calculated from Analog 1 and Analog 2/3 showed an increment of the Pearson's correlation coefficient against the standard sequence and modest Gibbs free energy values. The coefficient value was 0.664, which was higher than the coefficient value of any other analog. The average coefficient was increased within all the analogs, and the average Pearson's correlation coefficient of the initial analogs against the representative sequence was higher than that against the standard sequence. This result not only demonstrates that it was possible to make a representative sequence, but also that its performance might be better than that of the standard sequence.
The potential of the representative sequence was also revealed in the equilibrium constant calculation, and the results of the calculation are shown in
As shown in
In addition, for analysis of multiple nucleotides, the representative sequences were obtained by the same procedure with five analogs and three mismatched bases. The average Pearson's correlations of the standard sequence and the representative sequence against the initial analogs were calculated, and the results of the calculation are shown in
As shown in
In addition,
As shown in
Number | Date | Country | Kind |
---|---|---|---|
10-2018-0055315 | May 2018 | KR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/KR2019/005809 | 5/15/2019 | WO | 00 |