The present invention relates to a method of preparing a DNA library derived from formalin-fixed, paraffin-embedded (FFPE) tissue using an endonuclease.
Recently, molecular biology techniques have been widely used in the medical field. In particular, in terms of molecular pathology, these techniques can diagnose pathologically from the morphological level to the genetic level, and also reveal the genetic characteristics of the specific disease from the morphological level and gene level. For this diagnostic procedure, the hospital collects the tissue in the form of FFPE, and from the FFPE tissue, genetic information related to various clinical conditions can be analyzed and diagnosed.
For next generation sequencing (NGS) analysis, a DNA library should be prepared by extracting genomic DNA (gDNA) from a sample, fragmenting the extracted genomic DNA in a size of about 150 to 250 bp, selecting and extracting DNA of the corresponding size, and subjecting to end-repair using various enzymes. When using gDNA of normal quality, a DNA library can be prepared through the above process, however, in case of old and much degraded DNA, DNA library preparation often fails. In the case of DNA fragmentation, a sonication method e.g. using Covaris apparatus is generally used. It controls the size of the final fragmented DNA by adjusting the output and reaction time and at this time, since the location of fragmentation on DNA occurs randomly, already severely degraded gDNA extracted from old samples is fragmented into finer pieces, thereby resulting in failure of DNA library preparation.
Endonuclease is a generic term for enzymes that produce oligoribonucleotides by cleaving 3′,5′-phosphodiester bonds inside the polynucleotide chain, without distinguishing between DNA and RNA. S1 nuclease, a kind of endonucleases, is an enzyme that binds only one strand of double-stranded DNA and targets the region, and it produces two pieces of fragmented DNA having double helix structure or removes by cutting the DNA of single strand structure. In molecular biology, S1 nucleases are commonly used to remove single-stranded ends from DNA molecules or to remove open hairpin loops during the synthesis of double-stranded cDNA, in order to create a blunt end.
The majority of common clinical samples are FFPE tissues and in the case of old FFPE tissues, NGS analysis is difficult because of the above problems and therefore, there is an urgent need to overcome this problem. Therefore, the present inventors confirmed that almost all the FFPEs have a nick in the dsDNA all over, and this part can be a target of S1 nucleases, and the preparation efficiency of the NGS library can be greatly increased by using the S1 nucleases.
It is an object of the present invention to provide a composition for preparing a DNA library of FFPE tissue-derived DNA comprising an endonuclease as an active ingredient, and a method of preparing a DNA library of FFPE tissue-derived DNA by using the same.
Also, another object of the present invention is to provide a composition for eluting FFPE tissue-derived DNA comprising an endonuclease as an active ingredient, and a method of eluting FFPE tissue-derived DNA by using the same.
In order to achieve the above object, the present invention provides a composition for preparing a DNA library of FFPE tissue-derived DNA comprising endonuclease as an active ingredient.
Also, the present invention provides a kit for preparing a DNA library of FFPE tissue-derived DNA comprising the composition.
In addition, the present invention provides a composition for eluting FFPE tissue-derived DNA comprising endonuclease as an active ingredient.
Furthermore, the present invention provides a kit for eluting FFPE tissue-derived DNA comprising the composition.
In addition, the present invention provides a method of preparing a DNA library having improved complexity comprising: in a method of preparing a DNA library derived from FFPE tissue, reacting by treating the FFPE tissue slice lysate with an endonuclease; and eluting the DNA from a reaction product.
Further, the present invention provides a method of eluting FFPE tissue-derived DNA comprising: reacting by treating the FFPE tissue slice lysate with an endonuclease; and eluting the DNA from a reaction product.
The present invention also provides uses of endonuclease for the preparation of DNA library of FFPE tissue-derived DNA.
Also, the present invention provides the use of endonuclease for eluting FFPE tissue-derived DNA.
The present invention relates to a method of preparing a DNA library derived from FFPE tissue using endonucleases. The majority of general clinical samples are FFPE tissues and in case of old FFPE tissues, DNA library preparation often fails and NGS analysis becomes difficult and therefore there is a desperate need to overcome these problems. S1 nucleases are enzymes which specifically cleave only single-stranded DNA and in almost all FFPEs, there is a nick in the dsDNA everywhere, which can be the target of S1 nucleases. In the process of extracting gDNA from FFPE tissues, since when S1 nucleases are treated and eluted under appropriate conditions during elution, a DNA fragment can be obtained with DNA having a size suitable for DNA library preparation together with the DNA extraction, the Covaris fragmentation process can be omitted and cost and time can be reduced. In addition, it is possible to perform end-repair without additional fragmentation, so it can be inhibited from not being used as ligation insert owing too small a piece and can increase the complexity of the prepared library.
The present invention provides a composition for preparing a DNA library of FFPE tissue-derived DNA comprising endonuclease as an active ingredient. Specifically, the endonuclease may cleave a nick which is single-stranded DNA derived from FFPE tissue as a target.
Preferably, the DNA library is for next generation sequencing (NGS) analysis, but it is not limited thereto.
The term “formalin-fixed, paraffin-embedded (FFPE)” in the present invention is a method for keeping or storing tissue isolated from an animal and it means that the tissue is fixed and the fixed tissue is embedded in the wax. The specific process for preparing the FFPE tissue, though it is not limited thereto, may be as follows:
First, tissue is removed from animal specimens by incision or biopsy. The tissue is then cut to prepare slices and fixed in order to be prevented from corruption or degradation and to be inspected clearly under microscope for histologic, pathological or cytological studies. Fixation is the process of immobilizing, killing and preserving tissue for the purpose of staining and viewing under a microscope. Post-processing allows the tissue to penetrate the dyeing reagent and crosslink the macromolecules, stabilizing and trapping them on the spot. Many fixative solutions such as Bouine solution, formalin or liquid nitrogen are used for this purpose. The fixed tissue is then embedded in wax, cut into thin sections and allowed to stain by hematoxylin and eosin staining. Subsequently, microtoming is performed by microscopic sectioning so as to study the staining with antibody under a microscope.
On the other hand, as for FFPE tissues to be used for fragmentation of FFPE tissue-derived DNA of the present invention, not only recently produced but also FFPE tissues that have been stored for at least 10 years can be applied.
The term “endonuclease” in the present invention refers to a generic term for enzymes that produce oligoribonucleotides by cleaving 3′,5′-phosphodiester bonds inside the polynucleotide chain, without distinguishing between DNA and RNA.
On the other hand, in the present invention, all endonuclease can be applied. Preferably, S1 nuclease, P1 nuclease, mung bean nuclease or BAL-31 nuclease can be applied, but it is not limited thereto.
The term “S1 nuclease” in the present invention is an enzyme which specifically cleaves only single-stranded DNA and in almost all FFPEs, there is a nick in the dsDNA everywhere, which can be the target of S1 nucleases.
In addition, the present invention provides a kit for preparing a DNA library of FFPE tissue-derived DNA comprising the composition.
On the other hand, the kit may include other instruments, solutions and the like that are generally used for preparing a DNA library of FFPE tissue-derived DNA, but it is not limited thereto.
Also, the present invention provides a composition for eluting FFPE tissue-derived DNA comprising endonuclease as an active ingredient.
In addition, the present invention provides a kit for eluting FFPE tissue-derived DNA comprising the composition.
On the other hand, the kit may additionally include other instruments, solutions and the like which are generally used for eluting FFPE tissue-derived DNA, but it is not limited thereto.
Further, the present invention provides a method of preparing a DNA library having improved complexity comprising: in a method of preparing a DNA library derived from FFPE tissue, reacting by treating the FFPE tissue slice lysate with an endonuclease; and eluting the DNA from a reaction product.
In detail, the FFPE tissue slice lysate may be obtained by decomposing a protein from the FFPE tissue slice and removing it and the step of eluting DNA from the reaction product may further comprise subjecting to end-repair of both ends of the DNA and ligating the end-repaired DNA.
Preferably, the step of reacting by treating the FFPE tissue slice lysate with an endonuclease may comprise treating the endonuclease with 1 to 100 units and may comprise treating the endonuclease at 37° C. for 1 to 60 minutes, but it is not limited thereto.
Preferably, the DNA library is for NGS analysis, but it is not limited thereto.
Further, the present invention provides a method of eluting FFPE tissue-derived DNA comprising: reacting by treating the FFPE tissue slice lysate with an endonuclease; and eluting the DNA from a reaction product.
Particularly, the FFPE tissue slice lysate may be obtained by decomposing a protein from the FFPE tissue slice and removing it.
Preferably, the step of reacting by treating the FFPE tissue slice lysate with an endonuclease may comprise treating the endonuclease with 1 to 100 units and may comprise treating the endonuclease at 37° C. for 1 to 60 minutes, but it is not limited thereto.
On the other hand, in a method for preparing a DNA library or a method of eluting DNA of the present invention, all endonucleases can be applied, and preferably, S1 nuclease, P1 nuclease, mung bean nuclease or BAL-31 nuclease can be applied, but it is not limited thereto.
The present invention also provides uses of endonuclease for the preparation of DNA library of FFPE tissue-derived DNA.
Also, the present invention provides the use of endonuclease for eluting FFPE tissue-derived DNA.
Hereinafter, the present invention will be described in detail with reference to the following examples. It should be noted, however, that the following examples are illustrative of the present invention and are not intended to limit the scope of the present invention. The examples of the present invention are provided to more fully describe the present invention to those skilled in the art.
A Covaris method generally used for DNA fragmentation and a DNA fragmentation process using S1 nucleases of the present invention were compared using DNAs in good, intermediate and bad conditions (
The Covaris method consists of the following steps: After extracting gDNA from a sample and fragmenting the extracted gDNA to a size of about 150 to 250 bp, DNA of the corresponding size is selected and subjected to end repair using various enzymes. Then, A-tailing at the 3′ end and ligation using adapter sequences are performed.
When using gDNA of normal quality, a DNA library can be prepared through the above process, however, in case of old and much degraded DNA, DNA library preparation often fails. Already severely degraded gDNA extracted from old samples is fragmented into finer pieces, thereby resulting in failure of DNA library preparation.
Particularly, in the case of FFPE tissues which have been a long time since it was made, DNA fragmentation is already proceeding much, and when random fragmentation using Covaris is repeated again, a region that is cleaved into small pieces forms. This part is removed during the process of size selection after preparing the library (only the library size is 150 bp or more selected), resulting in the loss of the entire gDNA information, thereby decreasing the complexity of the prepared library.
In contrast, in the case of gDNA extracted using the S1 nucleases of the present invention, much more sites are conserved and the complexity of the library is increased.
In order to confirm how the size of DNA changes after S1 nuclease treatment actually, DNA was extracted by using FFPE tissue for 5 patients and the size of extracted DNA was measured by using BioAnalyzer.
When DNA was extracted by the usual method using Covaris, gDNA of a smearing pattern of 150 bp˜10 kb was confirmed as expected.
On the other hand, it was confirmed that, as expected, the fragmentation of a specific size mainly occurred without wide distribution of DNA size when S1 nuclease was treated in the elution process (20 units, 10 min). In addition, it was confirmed that the size thereof ranges within 150 to 250 bp required for DNA library preparation. It was confirmed that appropriate DNA of 150-250 bp size can be extracted by treating S1 nuclease during extraction of gDNA from FFPE tissue.
Specifically, in the case of the #192 sample, the DNA library was well prepared by both methods, and a slightly larger DNA library was prepared in the Covaris method (
In the case of the #27 sample, the amount of the DNA selected and extracted after fragmentation by the Covaris method was too small to proceed to the next step, and the DNA library was not prepared, however, the DNA library was successfully prepared by the S1 nuclease treatment method (
In the case of the #51 and #62 samples, both methods succeeded in preparing DNA libraries (
In the case of the #217 sample, there was a problem in the form of the DNA library prepared by the Covaris method, and it was impossible to use it in the NGS analysis, but the DNA library was successfully prepared by the S1 nuclease treatment method (
The optimal condition was established by variously comparing the treatment conditions of S1 nucleases such as the used amount and the reaction time of S1 nucleases. The treatment conditions for S1 nucleases are shown in Table 1.
As a result, it was confirmed that a library showing some differences according to the conditions was produced.
The library size tended to be smaller as the concentration of S1 nucleases was higher or the reaction time was longer.
The produced amount of library tended to increase as the concentration of S1 nucleases was higher or the reaction time was longer.
The active site of the S1 nucleases was present in the FFPE DNA and it is confirmed to be fragmented due to this.
The increased amount of library prepared means that the number of moles of insert, DNA fragmented and generated by S1, which can be ligated with the adapter, has increased, i.e. the complexity of the library is increased. Therefore, protocol B (20 unit, 30 min) was considered to be the optimum condition so as to increase the complexity of the library. However, this is optimal among the compared conditions, and protocol B is not limited to the final optimal condition, and more efficient optimal conditions can be changed through additional comparison analysis.
Whether there was no problem in analyzing the mutation of the library prepared by S1 nuclease treatment method using targeted NGS or not was actually confirmed.
1. Capture panel: using Tier 2 panel (cancer panel targeting genes of 200 or more)
2. Library pooling
(1) Capture by pooling eight libraries barcoded at each different index.
(2) The eight libraries for 5 DNA used in the comparative experiments of libraries according to said each different S1 nuclease reaction conditions were as follows (Table 2).
i) Five libraries prepared by protocol B
ii) Three libraries prepared by protocol A
(3) Pooling 56˜80 ng depending on the size of each library.
After sequencing the libraries with the above conditions, the results of the analysis between the libraries prepared by the protocol A and the protocol B were compared using the values of the QC metrics, and the library results of the small size were confirmed (Table 3).
In the library prepared using protocol B, the depth was much higher than the protocol A library despite the smaller size, and the same results were obtained in all three patients compared.
As described in Example 3, the complexity of the library prepared by protocol B was high, which proved that the duplication rate was lower in the library of the protocol B method than in the library of the protocol A method.
In addition, because the size is smaller than protocol A and even though the same amount is used, the mole number will be higher, the number of total reads is proved to be higher in the library of the protocol B method than that in the library of the protocol A method.
On the other hand, even a small-sized library prepared by S1 nucleases was able to obtain values at problem-free level for sequence analysis.
In the smallest libraries of 236 bp (16R234_0009_20611846) and 238 bp (16R219_0004_28042413), the following QC metrics values could be obtained.
Of course, it was lower value than the larger library (library prepared from DNA in better-condition: 16R214_0002_2733598, 16R229_0008_37176699).
However, considering that the covaris method failed to prepare a library and could not perform the sequence analysis itself, and the level at which there is no significant problem in genetic variation analysis is at least mean coverage 100×, it was confirmed that the small size library prepared by the S1 nuclease treatment method can be sufficiently utilized for NGS analysis.
Its value can be analyzed on the above QC metrics, but it is confirmed if the genetic variation can be detected with the actual corresponding analysis result.
The agreement degree of somatic variants with five samples used in the experiment was compared.
The OncoMap_V4 panel is a method to amplify a short PCR product of about 100 bp and to confirm a specific hot spot mutation using the MassArray system. The mutation that can be confirmed by the OncoMap_V4 panel is identified as 41 genes of 473 hot spot mutations. Among 5 samples, hot spot mutations of the EGFR and MLH1 genes were detected in 3 samples and were not detected in 2 samples (Table 4).
As a result, it was confirmed that the genetic mutation exactly coincided with the mutation result confirmed by OncoMap_v4 in 5 samples.
Using the same sample of the FFPE tissue of the patient sample that already had the results, the library prepared by the Covaris method and the library prepared by the S1 nuclease treatment method (protocol B method) were pooled, captured simultaneously and NGS analyzed and the value of the results were compared directly. For four samples, each library was prepared, pooled, and captured using a Tier 2 panel, followed by sequencing using the MiSeq v3 kit.
From the value of the result by comparing QC Metrics, it was confirmed that the results obtained from the S1 nuclease treatment method were superior to the Covaris fragmentation method (Table 5).
In addition, it was confirmed if mutation results obtained using the library prepared by respective method were consistent (
For mutations detected simultaneously in both methods, both depth (total depth and variant allele depth) were all at least 30×.
The variant allele fraction was high enough as 30% or higher. Higher depths were established in the mutants obtained from the libraries prepared by the S1 nuclease treatment method. This implies an increase in the reliability of the result value.
In the case of the mutations detected in only each method, as for depth, S1 was higher in total depth and was similar at 2-4× level in variant allele depth. The variant allele fraction was similar at 6-13% level. In the case of the mutect algorithm used for single nucleotide variation analysis using NGS data, it has high possibility to be a false positive if the variant allele depth is less than 5×.
Based on the variant allele depth 5×, it is confirmed if how many variants were detected in the covaris method and the S1 nuclease treatment method, respectively. If the number of variant allele is at least depth 5×, it was determined to be true positive and two variants were further detected in the S1 nuclease treatment method. If the number of variant allele is less than depth 5×, it was determine to be false positive and three variants were further detected in the Covaris method (Table 7).
In other words, in case of using the library prepared by the S1 nuclease treatment method, it is considered that a higher reliability can be obtained.
The library was constructed by S1 nuclease method using 89 samples which failed to prepare library by Covaris method and as a result, 66.3% of the samples were successfully prepared library of which the level is suitable for NGS sequencing (Table 8).
Fresh FFPE tissue was also subjected to S1 nuclease treatment to confirm whether fragmented DNA for NGS could be obtained. In the case of old FFPE tissue, there is nick which is a target that the S1 nucleases can act to cause fragmentation, but in the case of fresh FFPE tissue recently made with FFPE, it is not known whether such nick exists and therefore for confirmation, DNA libraries were prepared using two FFPEs prepared three days before and using the conventional gDNA extraction method without S1 nuclease and the gDNA extraction method using S1 nuclease, respectively (Table 9).
Sizes of extracted gDNA and DNA libraries were identified using BioAnalyzer. As expected, gDNA extracted by the conventional method had a size of 10 kb or more. In contrast, it was confirmed that the gDNA extracted by the S1 nuclease treatment method was cleaved to about 200 bp (
The library was constructed using the corresponding gDNA and the size of the library was confirmed, and as a result, it was confirmed that a library of a suitable size of 270-280 bp was produced in sufficient quantity. DNA extraction for constructing NGS library was successful by the S1 nuclease treatment method even though fresh FFPE tissue which has been 3 days since its production is used. In other words, this means that DNA fragmented in appropriate size for preparing a library for NGS can be extracted by S1 nuclease treatment method in all FFPE tissues.
Since it was confirmed that there is also nick which is an active site of the S1 nuclease in the fresh FFPE tissue, it was found that the nick was formed during fixation with formaldehyde. To confirm this, DNA was extracted from fresh, unfixed cells by the S1 nuclease treatment method and a library was prepared using the extracted DNA (
This is a completely different result from the gDNA extracted by the S1 nuclease treatment method in fresh FFPE. Namely, there is nicks, which are active sites in which S1 nuclease can function in fresh FFPE DNA but three is not in cell line DNA. It was confirmed that nicks were also present at intervals of about average 200 bp on the fresh FFPE which has been 3 days since its production.
In conclusion, S1 nuclease treatment during DNA extraction in FFPE tissues could obtain gDNA fragments of appropriate size without any fragmentation essentially required for NGS library preparation. This can skip the first step in preparing an NGS library, thereby having a significant reduction effect in time and cost. When the library was prepared using the fragmented DNA thus obtained, the success rate of library preparation was much higher than that of the conventional covaris method for fragmentation.
In addition, compared with the covaris method, a much higher depth could be obtained when using the library prepared by the S1 nuclease treatment method. It is also possible to prepare a library for NGS with a very high success rate even in very old FFPE tissues and to prepare library in the same way in very fresh FFPE tissues, and it proved that the S1 method can be a standard for NGS fragments of all FFPE tissues.
The nicks present on the DNA of the FFPE tissues occur during the formaldehyde fixation process for the preparation of FFPE tissue and the S1 nucleases react with the nick as the target.
Indeed, in order to confirm whether the nick of the FFPE tissue DNA occurs in the formaldehyde fixation process,
1. FFPE cell tissue was prepared using the cultured cells (A549),
2. Fixation time of formaldehyde was varied to 3/6/12/24/48 hours, and then embedded in paraffin to prepare a cell block,
3. Each DNA library was prepared by Covaris and S1 nuclease treatment methods, and the amount and size of the prepared library were measured,
4. NGS sequencing was performed to compare the quality of the produced data.
As a result, the longer the formaldehyde fixation time was, the smaller the size of fragmented DNA during S1 nuclease treatment was. This means that the longer the formaldehyde fixing time is, the more nicks which are active sites of S1 nucleases are generated (
In addition, four cell block DNAs extracted by the Covaris method and the S1 nuclease treatment method were subject to library preparation, captured simultaneously using the AMCv2 panel in the same amount, subjected to NGS sequencing and the results were compared (Table 10).
On the other hand, as a result of the QC metrics comparison, the library prepared by Covaris method using DNA with long formaldehyde fixation time had lower peak size and total amount of libraries, lowered mean depth and increased duplication rate. In the case of the S1 method, the tendency of the amount of the library to be lowered is the same, but it was produced in amount of much more than the Covaris method. The peak size of the library was unchanged. Rather, the mean depth was increased which is caused by a small number of nicks that are the active site of S1 in the case of DNA with a short fixation time, and the duplication rate also increases. However, its degree is very lower than that of covaris.
The above results are summarized as follows.
1. The longer the formaldehyde fixation time is, the smaller the size of the fragmented DNA during S1 enzyme treatment is.
2. In the case of the Covaris method, as the fixation time increased, the quantity and quality of the prepared library became significantly worse, thereby reducing the amount of sequencing data.
3. In the S1 nuclease method, as the fixation time increased, the amount of prepared library was rather increased, there was no significant difference in quality and there was no significant difference in the amount of sequencing data.
Number | Date | Country | Kind |
---|---|---|---|
10-2016-0146642 | Nov 2016 | KR | national |
10-2017-0145331 | Nov 2017 | KR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/KR2017/012421 | 11/3/2017 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2018/084640 | 5/11/2018 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20110104693 | Seligmann | May 2011 | A1 |
20130316913 | Makarov et al. | Nov 2013 | A1 |
20150031556 | Barrett et al. | Jan 2015 | A1 |
20160265027 | Sanches-Kuiper | Sep 2016 | A1 |
Number | Date | Country |
---|---|---|
2013-509863 | Mar 2013 | JP |
Entry |
---|
Laskowski et al. (Methods in Enzymology, 1980, 65:263-276) (Year: 1980). |
International Search Report for PCT/KR2017/012421 dated Feb. 21, 2018 from Korean Intellectual Property Office. |
Parkinson, Nicholas J. et al., “Preparation of high-quality next-generation sequencing libraries from picogram quantities of target DNA”, Genome Research, 2012, vol. 22, No. 1, pp. 125-133. |
Hedegaard, Jakob et al., “Next-Generation Sequencing of RNA and DNA Isolated from Paired Fresh-Frozen and Formalin-Fixed Paraffin-Embedded Samples of Human Cancer and Normal Tissue”, PloS One, 2014, vol. 9, No. 5, e98187, inner pp. 1-16. |