The present invention relates to an electrophoresis analyzing apparatus, an electrophoresis analysis method, and a program.
An electrophoresis apparatus is used to analyze a specimen such as a small amount of protein, deoxyribonucleic acid (DNA), or the like (refer to Patent Literature 1). Moreover, there exists a technique for determining the quantity of a specimen, based on actual waveform data of an electropherogram acquired through electrophoresis. For example, in Patent Literature 2, the area of a peak waveform appearing in actual waveform data is calculated to thereby determine the quantity of a specimen.
Japanese Patent Kokai Publication No. JP2002-310989A
Japanese Patent Kokai Publication No. JP2016-33492A
Note that the disclosures in the above-mentioned CITATION LIST are incorporated herein by reference. The following analysis has been made by the inventor of the present invention.
The technique disclosed in Patent Literature 2 described above has a problem that it is not possible to determine the quantity of a specimen when actual waveform data includes at least two peak waveforms partially including a superimposed portion. Specifically, actual waveform data expresses a waveform of a superimposed portion as a total value of first and second peak waveforms, and hence it is not possible to calculate the area of each of the first and second peak waveforms.
The present invention has a primary object to provide an electrophoresis analyzing apparatus, an electrophoresis analysis method, and a program for contributing to improving accuracy of electropherogram analysis.
According to a first aspect of the present invention or disclosure, provided is an electrophoresis analyzing apparatus including: an acquisition part configured to acquire actual waveform data of electrophoresis, the actual waveform data including at least two peak waveforms partially including a superimposed portion; an estimation part configured to estimate, from an already-appeared peak waveform, a residual portion of the already-appeared peak waveform in the superimposed portion, the already-appeared peak waveform having appeared, in the actual waveform data, before an analysis-target peak waveform to be subjected to waveform analysis; and a correction part configured to subtract the residual portion from the superimposed portion to correct the analysis-target peak waveform and obtain a true analysis-target waveform.
According to a second aspect of the present invention or disclosure, provided is an electrophoresis analysis method including: acquiring actual waveform data of electrophoresis, the actual waveform data including at least two peak waveforms partially including a superimposed portion; estimating, from an already-appeared peak waveform, a residual portion of the already-appeared peak waveform in the superimposed portion, the already-appeared peak waveform having appeared, in the actual waveform data, before an analysis-target peak waveform to be subjected to waveform analysis; and subtracting the residual portion from the superimposed portion to correct the analysis-target peak waveform to obtain a true analysis-target waveform.
According to a third aspect of the present invention or disclosure, provided is a program causing a computer to execute: processing of acquiring actual waveform data of electrophoresis, the actual waveform data including at least two peak waveforms and partially including a superimposed portion; processing of estimating, from an already-appeared peak waveform, a residual portion of the already-appeared peak waveform in the superimposed portion, the already-appeared peak waveform having appeared, in the actual waveform data, before an analysis-target peak waveform to be subjected to waveform analysis; and processing of subtracting the residual portion from the superimposed portion and correcting the analysis-target peak waveform to obtain a true analysis-target waveform.
Note that this program may be recoded on a computer-readable storage medium. The storage medium may be a non-transient medium, such as a semiconductor memory, a hard disk, a magnetic recording medium, or an optical recording medium. The present invention may be implemented as a computer program product.
According to the aspects of the present invention or disclosure, an electrophoresis analyzing apparatus, an electrophoresis analysis method, and a program for contributing to improving accuracy of electropherogram analysis are provided.
First of all, an outline of one example embodiment is described. Note that the reference signs in the drawings added in this outline are given, as an example, to elements for convenience for the sake of better understanding, and the description of this outline is not intended to provide any particular limitation.
As illustrated in
The acquisition part 101 acquires actual waveform data on electrophoresis as one illustrated in
Concrete example embodiments are described below in further detail with reference to drawings. Note that the same constituent components are denoted by the same reference signs, and descriptions thereof are omitted, in the example embodiments. Connecting lines between the blocks in each diagram include both bidirectional and unidirectional connecting lines. Each one-direction arrow is to schematically indicate a main flow of a signal (data) and is not intended to exclude bidirectional properties. In addition, an input port and an output port exist respectively at an input end and an output end of each connecting line although explicit illustrations thereof are omitted in circuit diagrams, block diagrams, inner configuration diagrams, connection diagrams, and the like illustrated in the disclosure of the present application. The same applies to an input/output interface.
A first example embodiment is described in more detail by using drawings.
In the first example embodiment, an electrophoresis apparatus that migrates fluorescence-labeled DNA chains is described.
In the disclosure of the present application, DNA chains to be subjected to electrophoresis are referred to as follows. The order in which DNA groups arrive, after electrophoresis is started, at a detection window is expressed using ordinal numbers. For example, assume that there exist two DNA groups provided with the same fluorescence label and having different sequence lengths (molecular weights). In this case, the DNA group arriving first at the detection window is referred to as a first DNA group, and the DNA group arriving later is referred to as a second DNA group.
A sample including fluorescence-labeled DNA chains is injected into the capillary 10. Electrodes 23-1 and 23-2 are inserted into the electrode tanks 202-1 and 202-2, respectively.
The electrophoresis system also includes an electrophoresis apparatus 20 and an electrophoresis analyzing apparatus 30.
The electrophoresis apparatus 20 is an apparatus that performs electrophoresis by using the capillary 10. The electrophoresis apparatus 20 is formed by including an electrophoresis detection part 21 and a power supply part 22.
The electrophoresis detection unit 21 is a mechanism for detecting a fluorescence label. The electrophoresis detection part 21 includes, as a fluorescence label detection mechanism, an excitation device, such as an argon-ion laser, and a detection device, such as a filter or a camera.
The power supply part 22 is a means that applies an electrophoresis voltage to the capillary 10. More specifically, the power supply part 22 is connected to the electrodes 23-1 and 23-2 inserted into the respective electrode tanks 202-1 and 202-2. The power supply part 22 applies a direct voltage to the electrodes. Note that, upon starting of electrophoresis, the electrophoresis apparatus 20 notifies the electrophoresis analyzing apparatus 30 that electrophoresis is started.
When the direct voltage is applied to the electrodes 23 via the power supply part 22 and capillary electrophoresis is started, fluorescence-labeled DNA chains move from the electrode tank 202-1 in the direction toward the electrode tank 202-2. Upon starting of the electrophoresis, the electrophoresis detection part 21 monitors the capillary via the detection window to create actual waveform data indicating chronological changes in fluorescence brightness. The electrophoresis detection part 21 then outputs the created actual waveform data to the electrophoresis analyzing apparatus 30.
Specifically, the electrophoresis detection part 21 emits laser beams toward the capillary 10 via the detection window, and a fluorescent light at the detection window is received by an image sensor or the like. As illustrated in
The electrophoresis detection part 30 analyzes the actual waveform data.
The waveform data acquisition part 301 is a means that acquires actual waveform data from the electrophoresis apparatus 20. Specifically, the waveform data acquisition part 301 analyzes the actual waveform data acquired from the electrophoresis apparatus 20 to detect a peak waveform(s).
Conceptually, the waveform data acquisition part 301 acquires an actual waveform pattern as one illustrated in
A reason why the above superimposed portion occurs is described below.
With reference to
Here, the fluorescence waveform is ideally assumed to have a Gaussian distribution shape. Specifically, in the example in
In view of this, a reason of the occurrence of the offset described above is considered.
Assume that electrophoresis is performed using a flow path (capillary) as one illustrated in
However, while DNA having the same sequence length are migrated at the same speed in theory, DNA are not migrated uniformly due to a diffusion phenomenon, such as Brownian motion, even having the same sequence length. In addition, as illustrated in
The residual DNAs result in arriving at the detection window (position of X=5) later than the other DNA chains. The DNA chains arriving later are also detected at the detection window, and consequently, a fluorescence waveform as one illustrated in
Return the description to
Return the description to
The residual amount estimation part 302 estimates the quantity of the delayed DNA chains in the first DNA group forming the first peak waveform, as a residual portion of the first peak waveform. The residual portion of the first peak waveform corresponds to the length L of the offset from the baseline 402 illustrated in
In a conceptual description, the residual amount estimation part 302 pays attention to the waveform at time T01 to time T03 in
The Gaussian waveform illustrated in
In Equation (1), Xc denotes a center position of the Gaussian distribution, W denotes half-width at half-maximum (HWHM) of the Gaussian distribution, and H denotes the height of the Gaussian distribution (refer to
The saturation waveform illustrated in
On the assumption that variation in moving speed follows the Gaussian distribution, the saturation waveform is a similar figure to an “integral of the Gaussian function”. Note that, however, since not all of the DNAs (first DNA group) injected into the capillary 10 are delayed DNA chains, the integral of the Gaussian function is multiplied by a predetermined coefficient to approximate the waveform of signal strength brought about by delayed DNA chains (refer to
The waveform illustrated in
Note that α denotes the predetermined coefficient by which the above-mentioned “integral of the Gaussian function” is multiplied. Moreover, erf denotes an error function, and sqrt is a function for obtaining a square root.
In this way, the first peak waveform illustrated in
f(x)=f1(x)+f2(x) (3)
According to Equation (3), it is understood that the waveform illustrated in
The residual amount estimation part 302 estimates the residual portion of the first peak waveform, based on the above viewpoints. Specifically, the residual amount estimation part 302 detects a peak waveform from the actual waveform data acquired by the waveform data acquisition part 301. In the example in
The residual amount estimation part 302 then acquires data (fluorescence brightness) in a predetermined range having the detected peak as a center. For example, in the example in
The residual amount estimation part 302 then identifies the four parameters (Xc, W, H, and a) defining the fluorescence waveform in the predetermined range, based on the data in the predetermined range having the detected peak as a center. Specifically, the residual amount estimation part 302 compares the detected peak waveform and a waveform obtained according to Equation (3) modeling the detected peak waveform, to calculate four parameters constituting Equation (3). For example, the residual amount estimation part 302 determines four parameters so that the difference between waveforms obtained by changing the four parameters and the corresponding actual waveform (waveform from time T01 to time T03 in
Upon determination of the four parameters, Equation (3) is determined. Moreover, Equation (2) is determined by using the four parameters. Equation (2) indicates the fluorescence brightness of the residual portion of the first peak waveform as illustrated in
In this way, the residual amount estimation part 302 models, by using Equation (3), waveform data as that illustrated from time T01 to time T03 in
The waveform correction part 303 is a means that subtracts a residual portion from actual waveform data to correct an analysis-target peak waveform to obtain a true analysis-target waveform. Specifically, the waveform correction part 303 subtracts the fluorescence brightness obtained based on the residual portion of the first peak waveform from the fluorescence intensity of the actual waveform data.
For example, in the example in
The waveform analysis part 304 is a means that analyzes a true analysis-target waveform. For example, the waveform analysis part 304 calculates the area of a peak region included in a true analysis-target waveform to estimate a DNA amount. For example, with reference to
The summary of the operations of the electrophoresis analyzing apparatus 30 is as illustrated in
In Step S01, the waveform data acquisition part 301 acquires a signal through electrophoresis.
In Step S02, the residual amount estimation part 302 estimates the residual amount of the first DNA group.
In Step S03, the waveform correction part 303 corrects an actual waveform pattern by using the estimated residual amount. Through the correction of the actual waveform pattern, a true analysis-target waveform is obtained.
In Step S04, the waveform analysis part 304 performs an analysis of the actual waveform pattern resulting from the correction.
A hardware configuration of the electrophoresis analyzing apparatus 30 according to the first example embodiment is described.
Note that, however, the configuration illustrated in
The memory 32 is a random access memory (RAM), a read only memory (ROM), or an auxiliary storage (such as a hard disk).
The input/output interface 33 is an interface with an unillustrated display apparatus and/or input apparatus. The display apparatus is a liquid crystal display or the like, for example. The input apparatus is, for example, an apparatus that receives a user operation, such as a keyboard or a mouse, or an apparatus that inputs information from an external storage, such as a universal serial bus (USB) memory. A user inputs necessary information to the electrophoresis analyzing apparatus 30 by using a keyboard, a mouse, or the like. The input/output interface 33 also includes an interface (e.g., a USB interface) for connecting to the electrophoresis apparatus 20.
Functions of the electrophoresis analyzing apparatus 30 are implemented by the above-described processing modules. The processing modules are implemented, for example, by the CPU 31 executing a program stored in the memory 32. The program may be updated by downloading via a network or by using a storage medium having a program stored therein. Alternatively, the processing modules may be implemented with a semiconductor chip. In other words, the functions performed by the processing modules may be implemented using a kind of hardware and/or software. Moreover, a computer in which the above-described computer program is installed in a storage part thereof may be caused to function as the electrophoresis analyzing apparatus 30. Furthermore, by causing a computer to run the above-described program, an electrophoresis analysis method (a residual amount estimation method, a waveform correction method, a waveform analysis method, and the like) can be performed by the computer.
As described above, the electrophoresis analyzing apparatus 30 according to the first example embodiment estimates a residual amount of the first DNA group through analysis of an actual waveform pattern. By subtracting the residual amount estimated from an analysis-target actual waveform pattern, a more accurate analysis-target pattern can be obtained. Since residues of the first DNA group forming a peak first are eliminated from the analysis target thus obtained, more accurate analysis is possible.
The system configurations and operations described in the above example embodiments are examples, and various modifications are possible to be made. For example, the electrophoresis apparatus 20 and the electrophoresis analyzing apparatus 30 illustrated in
In the above-described example embodiments, the operations of the electrophoresis analyzing apparatus 30 are described by using the waveform obtained based on the first and second DNA groups (waveform as that illustrated in
A part or the whole of the above-described example embodiments can be described as, but is not limited to, the following modes.
An electrophoresis analyzing apparatus according to the above-described first aspect.
The electrophoresis analyzing apparatus according to Mode 1, in which
the estimation part compares the already-appeared peak waveform and a waveform according to a predetermined equation for modeling the already-appeared peak waveform and calculates a parameter(s) constituting the predetermined equation to thereby estimate the residual portion of the already-appeared peak waveform.
The electrophoresis analyzing apparatus according to Mode 2, in which
the predetermine equation for modeling the already-appeared peak waveform is
where Xc denotes a center position of a Gaussian distribution, W denotes a half-width at half-maximum of the Gaussian distribution, H denotes a height of the Gaussian distribution, and α denotes a predetermined coefficient.
The electrophoresis analyzing apparatus according to Mode 3, in which
the estimation part determines a value calculated according to a following expression to be an estimation value of the residual portion.
The electrophoresis analyzing apparatus according to any one of Modes 1 to 4, further including a waveform analysis part configured to calculate an area of a peak region included in the true analysis-target waveform.
The electrophoresis analyzing apparatus according to any one of Modes 1 to 5, in which the actual waveform data is data obtained through DNA capillary electrophoresis.
The electrophoresis analyzing apparatus according to Mode 6, in which
the actual waveform data is DNA capillary electrophoresis by sample injection using a cross-injection method.
An electrophoresis analysis method according to the above-described second aspect.
A program according to the above-described third aspect.
Note that Mode 8 and Mode 9, as Mode 1, can be developed as in Modes 2 to 7.
Note that the disclosures in the above-mentioned cited patent literatures and the like are incorporated herein by reference. Making a change and adjustment of the example embodiments and examples is allowed within the framework of the entire disclosure (including the scope of the claims) of the present invention, and also based on a basic technical concept of the present invention. Further, various combinations or selections of various disclosed elements (including each element of each claim, each element of each example embodiment and each example, each element of each drawing, and the like) are allowed within the framework of the entire disclosure of the present invention. Specifically, as a matter of course, the present invention encompasses various modifications and amendments that may be achieved by a person skilled in the art based on the entire disclosure including the scope of the claims and the technical concept. Regarding a numerical range described herein, in particular, it should be interpreted that any numerical value or any smaller range included within the range is specifically described even without particular description.
Number | Date | Country | Kind |
---|---|---|---|
2017-066161 | Mar 2017 | JP | national |
This application is a National Stage of International Application No. PCT/JP2018/012657 filed Mar. 28, 2018, claims priority based on Japanese Patent Application No. 2017-066161 (filed on Mar. 29, 2017), the contents of which application are incorporated herein in its entirety by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2018/012657 | 3/28/2018 | WO | 00 |