This application claims the benefit under 35 U.S.C. 119(a)-(d) to Chinese Application No. 201611129604.1, filed Dec. 9, 2016, and Chinese Application No. 201710607586.1, filed Jul. 24, 2017, which applications are hereby incorporated by reference in their entireties.
The present disclosure relates to a field of gene sequencing technology, and more particularly relates to a method and a device for identifying a single molecule, a method and a device for counting a single molecule and a system for processing a single molecule.
In the related art, the third generation sequencing technology is a single molecule sequencing technology based on imaging optical detections. Moreover, the single molecule sequencing technology is a technology for identifying base groups according to optical signals and electrical signals. Specifically, a base group is identified by fluorescence which is a light intensity emitted from an excited state to a ground state at a specific power laser irradiation. However, due to different fluorophores illumine for different time durations and have different light intensities, as well as the presence of background noise, it is possible to cause errors for identification of the single molecules. At the same time, an effective number of single molecules may be reduced by uneven distribution of DNA chain and agglomeration of base groups.
The existing methods rely mainly on the human eye to identify and count single molecules on a collected fluorescence image, but these methods cost a lot of labor and time. With reference to phonetic identification, if a method according to HMM (Hidden Markov Model) and machine study is used, a lot of training is required and operating efficiency is not high.
Embodiments of the present disclosure seek to solve at least one of the problems existing in the related art to at least some extent. Accordingly, embodiments of the present disclosure provide a method and a device for identifying a single molecule and a method and a device for counting a single molecule.
A method for identifying a single molecule is provided according to embodiments of the present disclosure. The method includes inputting time sequence of an intensity of an spot; forming a line chart of time and intensity of the spot according to the time sequence, in which the line chart consists of a plurality of line segments; dividing the line chart into a plurality of grids in an array, counting the number of the line segments and/or ends of the line segments in each grid; subjecting the divided line chart to a line corrosion according to numbers corresponding to the grids to transform the divided line chart to a simplified image; subjecting the simplified image to run-length coding to indicate connected domains; calculating an area of each connected domain and determining that the connected domain corresponds to a single molecule if the area of the connected domain is greater than a first predetermined threshold. According to the method for identifying the single molecule, the line chart of the time sequence of the intensity of the spot is processed to transform into the image, which results in a quick identification for the single molecule and a high accuracy of the identification.
A method for counting a single molecule is provided according to embodiments of the present disclosure. The method includes: inputting time sequence of an intensity of an spot; forming a line chart of time and intensity of the spot according to the time sequence, in which the line chart consists of a plurality of line segments; dividing the line chart into a plurality of grids in an array, counting the number of the line segments and/or ends of the line segments in each grid; subjecting the divided line chart to a line corrosion according to numbers corresponding to the grids to transform the divided line chart to a simplified image; subjecting the simplified image to run-length coding to indicate connected domains; calculating an area of each connected domain, and determining that the connected domain corresponds to a single molecule if the area of the connected domain is greater than a first predetermined threshold; obtaining a first number of the single molecules by calculation. According to the method for counting the single molecule, the line chart of the time sequence of the intensity of the spot is processed to transform into the image, which results in a quick count for the single molecule and a high accuracy of the count.
A further method for counting a single molecule is provided according to embodiments of the present disclosure. The method includes: inputting time sequence of an intensity of an spot; forming a line chart of time and intensity of the spot according to the time sequence, in which the line chart consists of a plurality of line segments; dividing the line chart into a plurality of grids in an array, counting the number of the line segments and/or ends of the line segments in each grid; subjecting the divided line chart to a line corrosion according to numbers corresponding to the grids to transform the divided line chart to a simplified image; subjecting the simplified image to run-length coding to indicate connected domains; calculating an area of each connected domain, and determining that 1 is added to the number of the single molecules if the area of the connected domain is greater than a first predetermined threshold. According to the method for counting the single molecule, the line chart of the time sequence of the intensity of the spot is processed to transform into the image, which results in a quick count for the single molecule and a high accuracy of the count.
A device for identifying a single molecule is provided according to embodiments of the present disclosure. The device includes: an inputting unit configured to input time sequence of an intensity of an spot; a transforming unit configured to form a line chart of time and intensity of the spot according to the time sequence of the inputting unit, in which the line chart consists of a plurality of line segments; a grid counting unit configured to divide the line chart of the transforming unit into a plurality of grids in an array, and count the number of the line segments and/or ends of the line segments in each grid; a simplifying unit configured to subject the divided line chart to a line corrosion according to numbers corresponding to the grids to transform the divided line chart to a simplified image; an indicating unit configured to subject the simplified image to run-length coding to indicate connected domains; a determining unit configured to calculate an area of each connected domain and determine that the connected domain corresponds to a single molecule if the area of the connected domain is greater than a first predetermined threshold. According to the device for identifying the single molecule, the line chart of the time sequence of the intensity of the spot is processed to transform into the image, which results in a quick identification for the single molecule and a high accuracy of the identification.
A device for counting a single molecule is provided according to embodiments of the present disclosure. The device includes: an inputting unit configured to input time sequence of an intensity of an spot; a transforming unit configured to form a line chart of time and intensity of the spot according to the time sequence of the inputting unit, in which the line chart consists of a plurality of line segments; a grid counting unit configured to divide the line chart of the transforming unit into a plurality of grids in an array, and count the number of the line segments and/or ends of the line segments in each grid; a simplifying unit configured to subject the divided line chart to a line corrosion according to numbers corresponding to the grids to transform the divided line chart to a simplified image; an indicating unit configured to subject the simplified image to run-length coding to indicate connected domains; a determining unit configured to calculate an area of each connected domain and determine that the connected domain corresponds to a single molecule if the area of the connected domain is greater than a first predetermined threshold; a calculating unit configured to obtain a first number of the single molecules by calculation. According to the device for counting the single molecule, the line chart of the time sequence of the intensity of the spot is processed to transform into the image, which results in a quick count for the single molecule and a high accuracy of the count.
A device for counting a single molecule is provided according to embodiments of the present disclosure. The device includes: an inputting unit configured to input time sequence of an intensity of an spot; a transforming unit configured to form a line chart of time and intensity of the spot according to the time sequence of the inputting unit, in which the line chart consists of a plurality of line segments; a grid counting unit configured to divide the line chart of the transforming unit into a plurality of grids in an array, and count the number of the line segments and/or ends of the line segments in each grid; a simplifying unit configured to subject the divided line chart to a line corrosion according to numbers corresponding to the grids to transform the divided line chart to a simplified image; an indicating unit configured to subject the simplified image to run-length coding to indicate connected domains; a determining unit configured to calculate an area of each connected domain and determine that 1 is added to the number of the single molecule if the area of the connected domain is greater than a first predetermined threshold. According to the device for counting the single molecule, the line chart of the time sequence of the intensity of the spot is processed to transform into the image, which results in a quick count for the single molecule and a high accuracy of the count.
A system for processing a single molecule is provided according to embodiments of the present disclosure. The system includes: a data inputting device configured to input data; a data outputting device configured to output data; a memory device configured to store data including a computer executable program; and a processor, configured to perform the computer executable program for performing a method for identifying a single molecule or a method for counting a single molecule described above. Thus, the identification and count of the single molecule may be achieved by the system for processing the single molecule.
A computer readable memory medium is also provided according to embodiments of the present disclosure. The computer readable memory medium is configured to store a program that, when executed by a computer, causes the computer to perform any method described above. The computer readable memory medium may be a read-only memory, a random access memory, a magnetic disc, an optical disc, etc.
Additional aspects and advantages of embodiments of present disclosure will be given in part in the following descriptions, become apparent in part from the following descriptions, or be learned from the practice of the embodiments of the present disclosure.
These and other aspects and advantages of embodiments of the present disclosure will become apparent and more readily appreciated from the following descriptions made with reference to the accompanying drawings, in which:
Reference will be made in detail to embodiments of the present disclosure, where the same or similar elements and the elements having same or similar functions are denoted by like reference numerals throughout the descriptions. The embodiments described herein with reference to drawings are explanatory, illustrative, and used to generally understand the present disclosure. The embodiments shall not be construed to limit the present disclosure. Instead, the embodiments of the present disclosure include all the variants, modifications and their equivalents within the spirit and scope of the present disclosure as defined by the claims.
In the description of the disclosure, it should be understood that terms such as “first” and “second” are used herein for purposes of description and are not intended to indicate or imply relative importance or significance. Thus, the feature defined with “first” and “second” may include one or more this feature. In the description of the present disclosure, “a plurality of” means two or more than two, unless specified otherwise.
In the present disclosure, unless specified or limited otherwise, the terms “mounted,” “connected,” “coupled,” “fixed” and the like are used broadly, and may be, for example, fixed connections, detachable connections, or integral connections; may also be mechanical or electrical connections; may also be direct connections or indirect connections via intervening structures; may also be inner communications of two elements, which can be understood by those skilled in the art according to specific situations.
Exemplary embodiments and examples are provided as follows to show different variations of the present disclosure. In order to keep the present disclosure brief, only certain embodiments, components and settings are described. For a purpose of simplicity and clear, reference numbers and/or reference letters may be repeatedly used in different examples of the present disclosure, while the reference numbers and the reference letters are not used to instruct the relationship between the various embodiments and/or configurations.
A method for identifying a single molecule and a method for counting a single molecule according to embodiments of the present disclosure may be used in gene sequencing. The term “gene sequencing” in embodiments of the present disclosure is the same as nucleic acid sequencing, including DNA sequencing and/or RNA sequencing, and including long fragment sequencing and/or short fragment sequencing.
As shown in
Specifically, in the step S01, when the spot is formed, the test sample is irradiated with a laser beam of a specific wavelength, and the test sample is excited to emit fluorescence, and then the fluorescence is collected by a camera to form an image containing the spot corresponding to the fluorescence emitted by the test sample, i.e., corresponding to a nucleic acid molecule. The term “spot” refers to a luminous point on the image, in which one luminous point occupies at least one pixel point, and the term “pixel point” refers to a pixel.
In an embodiment of the present disclosure, the image may be acquired from a single molecule sequencing platform from companies such as Helicos, Pacific Biosciences (PacBio). The original data after input is a parameter of the pixel point of the image, and the detection for the “spot” is detection for an optical signal of a single molecule.
As shown in
With the method for processing an image, the to-be-processed image is processed in the image pre-processing step, such that computation in the spot detecting step may be decreased. In addition, by judging whether the candidate spot is the spot according to the spot determining threshold, the accuracy of determining the spot is improved.
Specifically, in an example, an inputted to-be-processed image may be a 16-bit TIFF image of size 512*512 or 2048*2048, and the TIFF image may be a grayscale image. Thus, the processing procedure of this method may be simplified.
Referring to
Referring to
Referring to
Referring to
Referring to
Referring to
In some implementations of the present disclosure, performing a background subtraction on the to-be-processed image includes: determining a background of the to-be-processed image by an opening operation; and performing the background subtraction on the to-be-processed image according to the background. In this way, the opening operation is for eliminating small objects, splitting objects at tenuous points and smoothing boundaries of big objects without changing the area of the image, such that the background subtracted image may be acquired precisely.
Specifically, in implementations of the present disclosure, the to-be-processed image (such as a grayscale image) is shifted by an a*a window (such as a 15*15 window), and the background of the to-be-processed image may be estimated using the opening operation (corroding first and expanding thereafter), as shown in formula 1 and formula 2:
g(x,y)=erode[f(x,y),B]=min{f(x+x′,y+y′)−B(x′,y′)|(x′,y′)∈Db} formula 1
where g(x,y) represents a corroded grayscale image, f(x,y) represents the original grayscale image, and B represents a structural element;
g(x,y)=dilate[f(x,y),B]=max{f(x−x′,y−y′)−B(x′,y′)|(x′,y′)∈Db} formula 2
where g(x,y) represents an expanded grayscale image, f(x,y) represents the original grayscale image, and B represents the structural element.
Therefore, the background noise may be acquired according to:
g=imopen(f(x,y),B)=dilate[erode(f(x,y),B)] formula 3
The background subtraction is performed on the original image according to formula 4.
f=f−g={f(x,y)−g(x,y)|(x,y)∈D} formula 4
It should be noted that the specific methods of performing a background subtraction on the to-be-processed image in the present implementations may be applied to the steps of performing background subtraction on the image mentioned in any of the above-described implementations.
In some implementations of the present disclosure, performing the filtering is performing a Mexican Hat filtering. It is easy to perform the Mexican Hat filtering on an image and the cost of the method for identifying and/or counting single molecules may be reduced. Additionally, a contrast ratio of the foreground and the background may be improved by performing the Mexican Hat filtering, i.e., making the foreground brighter and the background darker.
The Mexican Hat filtering may be performed as follows: performing a Gaussian filtering on the to-be-processed image before the filtering using an m*m window, and performing a two-dimensional Laplacian sharpening on the to-be-processed image after the Gaussian filtering. m is a natural number and particularly, an odd number greater than 1. In this way, the Mexican Hat filtering may be accomplished by two steps.
Specifically, referring to
where x and y represent coordinates of the pixel point.
Firstly, the Gaussian filtering is performed on the to-be-processed image using the m*m window, as shown in formula 6.
where t1 and t2 represent positions of the filter window, and wt1,t2 represents a weight of Gaussian filtering.
Secondly, the two-dimensional Laplacian sharpening is performed on the to-be-processed image, as shown in formula 7.
∂2f=[Kf(x,y)−kf(x−1,y−1)−kf(x−1,y+1)−kf(x+1,y−1)−kf(x+1,y+1)] formula 7
where K and k are Laplacian operators with regard to a sharpen effect. K and k are modified if the sharpening needs to be strengthened or weakened.
In an example, m=3 and thus m*m=3*3, the formula 6 may written as
when performing the Gaussian filtering.
It should be noted that the specific methods of performing a Mexican Hat filtering on the to-be-processed image in the present implementations may be applied to the steps of performing Mexican Hat filtering on the image mentioned in any of the above-described implementations.
In some implementations of the present disclosure, the simplified image is a binary image. The binary image is easy to process and may be applied widely.
Specifically, in an example, a binary image may include 0 and 1 for indicating different pixel points with different attributes respectively, the binary image may be represented as
In some implementations of the present disclosure, the simplifying step includes: acquiring a signal-to-noise ratio (SNR) matrix according to the to-be-processed image before the simplifying, and simplifying the to-be-processed image before the simplifying according to the SNR matrix to obtain the first image.
In an example, a background subtraction is performed on the to-be-processed image to obtain a background subtracted image, and then a signal-to-noise ratio (SNR) matrix is acquired according to the background subtracted image. In this way, the image with less noise may be acquired and the accuracy of the method for processing an image may be improved. Specifically, in an example, the SNR matrix may be represented as
formula 8, where x and y represent coordinates of the pixel point, h represents a height of the image and w represents a width of the image, i∈w, j∈h.
Specifically, in an example, the simplified image is a binary image. The binary image may be acquired according to the SNR matrix, in which the binary image is represented as formula 9.
When computing the SNR matrix, the background subtraction and/or the filtering may be performed on the to-be-processed image, for example, the subtractive background processing step and/or the filtering processing step as described above. The background subtracted image is obtained according formula 4, and then a matrix of a ratio between the background subtracted image and the background may be acquired according to:
R=f/g={f(x,y)/g(x,y)|(x,y)∈D} formula 10
where D represents a dimension (height*width) of the image f.
Therefore, the SNR matrix may be acquired according to:
In some implementation of the present disclosure, analyzing the first image so as to compute a spot determining threshold includes: processing the first image by an Otsu method so as to compute the spot determining threshold. In this way, the spot determining threshold may be found by a sophisticated and simple method, such that the accuracy of the method may be improved and the cost of the method may be reduced. In addition, by computing the spot determining threshold based on the first image, the efficiency and accuracy of the method may be improved.
Specifically, the Otsu method is also called as a maximum class square error method in which the image is split according to the maximum class square error, which means that a probability of wrong split is the minimum and the accuracy is high. It is assumed that a splitting threshold between the foreground and the background of the to-be-processed image is T, a number of the pixel points belonging to the foreground accounts for ω0 of the whole image, and the average gray level of the pixel points belonging to the foreground is μ0; a number of the pixel points belonging to the background accounts for Φ1 of the whole image, and the average gray level of the pixel points belonging to the background is μ1. The average gray level of the to-be-processed image is μ, and the class square error is var. Therefore, following formulas may be acquired.
μ=ω0*μ0+ω1*μ1 formula 11
var=ω0(μ0−μ)2+ω1(μ1−μ)2 formula 12
Formula 13 is acquired by substituting the formula 11 into the formula 12.
var=ω0ω1(μ1−μ0)2 formula 13
The splitting threshold T achieving the maximum class square error may be acquired by traversing, which may be configured as the spot determining threshold T.
In some implementations of the present disclosure, referring to
Specifically, Imax may be understood as the maximum intensity of the center of the candidate spot. In an example, h=3. Referring to
The first image can be the image after the simplifying, such as the binary image. The set value in the binary image may be the value corresponding to the pixel specifying a set condition. In another example, the binary image may include 0 and 1 for indicating different pixel points with different attributes respectively. If the set value is 1, ABI is a proportion of pixels in the binary image being 1 in the h*h window. For example, BI=1 when SNR<=mean (SNR).
In some implementations of the present disclosure, the value of p can be set as the value of m which is used in the Mexican Hat filtering, i.e. h=m. In some embodiments of the present disclosure, during the collection for forming above described image, the camera collects the fluorescence in a plurality of fields of view (FOV) in sequence according to the time sequence. Therefore, when the image data is obtained, the intensity of the spot included in the image data corresponds to the time sequence collected by the camera.
In the step S02, after the spots are obtained, points of intensities of the spots of adjacent collection times are connected by a line, thus forming a line chart of time and intensity of the spot as shown in
In an embodiment of the present disclosure, the vertical axis corresponds to values of the intensities of the spots, and the value of the intensity of the spot is a value of the pixel. For a 16-bit TIFF image, the value of the spot pixel is in the range of 0 to 65535. For an 8-bit grayscale image, the value of the spot pixel is in the range of 0 to 255. The 16-bit image is used in the present disclosure.
In the step S03, waveform of the line chart is processed, such that the run-length coding may be performed subsequently. During processing, the line chart is divided into a plurality of grids.
In some embodiments of the present disclosure, the line chart is divided according to a time frame of the intensity when collected and a magnitude of the intensity. In this case, the line chart may be divided into grids easily to reduce the cost of the method for identifying the single molecule. Specifically, the line chart is divided into M*N grids, in which M is determined by dividing according to the time frame and N is determined by dividing according to the magnitude of the intensity. The time frame of the collected intensity is the time interval of adjacent collections of the fluorescence. In an embodiment of the present disclosure, a direction along the horizontal axis refers to a length direction of a grid and a direction along the vertical axis refers to a height direction of a grid. The length of the grid may be set as times of the time frame, such as 1 times, 2 times and 2.5 times. The height of the grid may be set flexibly. For example, for the 16-bit TIFF image, the value of the vertical axis is in the range of 0 to 65535. When grids are divided, the values of the vertical axis are normalized and then divided into 50 parts, such that the height of the grid may be set as 0.02, i.e., N=50.
In an embodiment of the present disclosure, the time interval between two adjacent collections of fluorescence is 20 ms, and the length of one grid is equal to one time interval, i.e., the height=0.02. As shown in
In an embodiment of the present disclosure, as shown in
In an embodiment of the present disclosure, as shown in
In an embodiment of the present disclosure, as shown in
It should be noted that W is greater than or equal to the length of one grid. Optionally, W is an integer multiple of the length of one grid. In an embodiment of the present disclosure, W≥L1. Optionally, W is an integer multiple of L1.
In some embodiments of the present disclosure, the threshold T is in a range [6, 8] dependent on undulation of the waveform of the line chart. Specifically, the smaller the undulation is, the greater the threshold T is.
For ease of understanding, when run-length coding is described, the following description is made by taking 1 and 0 in the binary image as an example. It should be understood that modifications may be made by one skilled in the art, for example, to use another type of the simplified image and values except for the first and second values.
In run-time coding, an eight connection manner may be used. According to the eight connection principle, grids are connected to form the connected domain by a recursive algorithm and then the connected domain is indicated by using run-length coding. Specifically, according to an eight connection, for example, a 3*3 window used as shown in
In
In the above embodiment, a recursive algorithm is used. In other examples, a traversal algorithm may also be used to define the connected domain.
If the area of the connected domain is greater than a first determined threshold P, the connected domain corresponds to a single molecule. The value of P is related to the decay time of single molecule fluorescence. In one example, the first determined threshold P is in the range [5, 10].
In some embodiments of the present disclosure, with reference to
As described above, the method for identifying the single molecule may thus be applied more widely. The method for identifying the single molecule based on the histogram can identify single molecule accurately according to data of the intensity and time sequence of the spot, and particularly suitable to the spot containing more than 3 single molecules. In such an embodiment, the method based on the histogram and the method based on run-length coding may be combined to identify single molecules in the line chart (time sequence of the intensity of the spot) having different waveforms accurately.
In some embodiments of the present disclosure, S51 further includes: dividing the intensity in N groups according to the magnitude of the intensity and calculating frequencies of the number in the N groups:
in which ni represents a sum of the frequencies of the numbers of an ist line of the grids, j represents a time frame, gij represents a frequency of the number of grid (i, j), and M represents the number of the time frames.
Specifically, in an embodiment, a horizontal axis of the histogram represents the number of a group, and a vertical axis of the histogram represents a frequency of the number in each group. It should be noted that the value of N is equal to the value of N in M*N grids formed above, and the value of M is equal to the value of M in M*N grids formed above.
In some embodiments, the step of dividing the intensity in groups according to the magnitude of the intensity to subject the numbers to frequency calculating to obtain the histogram includes: equalizing the histogram with a L-shaped window:
in which np represents equalization of ni, ni′ represents a sum of results of the equalization of ni, and p represents an integer associated with the size of the L-shaped window and the ist line. Thus, the histogram is distributed more uniformly to be identified easily. L-shaped window is used for equalization of the histogram, and the value of L is related to the size of the L-shaped window and the ist line. In general, if the single molecule has a fast fluorescence quenching, the value of L should not be too large. The accuracy of the histogram is affected by the size of the L-shaped window, and the value of L can be set flexibly to select appropriate histogram accuracy. In one embodiment, the value of L is in a range [5, 15].
With reference to
In the step 52, all the maximum values of the histogram may be calculated by derivation. The second determined threshold Q and the third determined threshold H are related to the shape of the peak of the line chart. Specifically, the sharper the peak is, the greater the second determined threshold Q is, and the smaller the third determined threshold H is. The gentler the peak is, the smaller the second determined threshold Q is, the greater the third determined threshold H is. In an embodiment, the second determined threshold Q is in a range [2, 6], and the third determined threshold H is in the range [4, 10].
In an embodiment of the present disclosure, the maximum value is a top value of the peak (i.e., an inflexion point). In other words, if the maximum value in the peak meets certain conditions, the peak corresponds to a single molecule.
In some embodiments of the present disclosure, before the line chart is divided, the method for identifying the single molecule further includes: performing a filtering to the line chart, such that it is possible to eliminate the abrupt error caused by the intensity flicker and the camera sampling, and the waveform of the line graph is smoother. In particular, the modification of the waveform may be performed by using a median value of a window of a size of L2 for filtering: R=medium(Zi). In an embodiment, with reference to
With reference to
For example, in an embodiment of the present disclosure, before S83, the method for counting the single molecule may further include filtering the line chart. In another embodiment, as shown in
As shown in
According to the method for counting the single molecule, the line chart of the time sequence of the intensity of the spot is processed to transform into the image, thus obtaining the connected domains by run-length coding, which results in a quick count for the single molecule and a high accuracy of the count.
It should be noted that the description of the technical features and advantages of methods for identifying and/or counting the single molecule in any one of the above embodiments and examples, including explanations and descriptions of steps, parameter settings, and image pre-processing spot detection, is also applicable to the method for counting the single molecule in embodiments of the present embodiment. In order to avoid redundancy, it is not further described in detail here.
For example, in some embodiments, before S63, the method for counting the single molecule may further include filtering the line chart. In another embodiment, as shown in
The method for counting the single molecule based on the histogram calculation is particularly suitable to accurately find the spot containing more than 3 single molecules. The method for counting the single molecules based on the run-length coding is particularly suitable to accurately find the spot which contains ≤3 single molecules. In an embodiment, above two methods may be combined, and the single molecule of the line chart with different waveforms may be found and counted accurately. For example, the number of the single molecules based on the run-length coding is the first number and the number of the single molecules based on the histogram is the second number. The first number is compared to the second number and the smaller one of these two numbers is taken as the final number of the single molecules.
As shown in
In an embodiment, the simplified image is a binary image.
It should be noted that the description of the technical features and advantages of the method for identifying the single molecule in any one of the above embodiments and examples is also applicable to the device 200 for identifying the single molecule in embodiments of the present embodiment. In order to avoid redundancy, it is not further described in detail here. For example, in some embodiments, as shown in
In some embodiments of the present disclosure, in the first grid counting unit 206, the line chart is divided according to a time frame of the intensity when collected and a magnitude of the intensity.
Referring to
In some embodiments of the present disclosure, in first histogram obtaining unit 214, dividing the intensity in groups according to a magnitude of the intensity to subject the numbers to frequency calculating to obtain a histogram includes: dividing the intensity in N groups according to the magnitude of the intensity and calculating frequencies of the number in the N groups:
in which ni represents a sum of the frequencies of the numbers of an ist line of the grids, j represents a time frame, gij represents a frequency of the number of grid (i, j), and M represents the number of the time frames.
In some embodiments, in first histogram obtaining unit 214, dividing the intensity in groups according to a magnitude of the intensity to subject the numbers to frequency calculating to obtain a histogram includes: equalizing the histogram with a L-shaped window:
in which np represents equalization of ni, ni′ represents a sum of results of the equalization of ni, and p represents an integer associated with the size of the L-shaped window and the ist line.
In some embodiments, as shown in
In some implementations, referring to
In some implementations, referring to
In some implementations, referring to
In some implementations, referring to
In some implementations, referring to
In some implementations, referring to
In some implementations, the background subtraction module 226 is configured to determine a background of the to-be-processed image by an opening operation; and to perform the background subtraction on the to-be-processed image according to the background.
In some implementations, the image filtering module 224 is configured to perform a Mexican Hat filtering on the to-be-processed image.
In some implementations, the simplifying module 222 is configured to perform a binary process.
In some implementations, when using the simplifying module 222 to simplify the image, a signal-to-noise ratio matrix is acquired according to the to-be-processed image before the simplifying, and to simplify the to-be-processed image before the simplifying according to the signal-to-noise ratio matrix so as to acquire the first image. In some implementations, the spot detecting module 220 is configured to process the first image by an Otsu method so as to compute the spot determining threshold. In some implementations, the spot detecting module 220 is configured to: search in the first image for a pixel point with connectivity greater than (h*h−1) as a center of the candidate spot; and to judge whether the center of the candidate spot satisfies a condition of Imax*ABI*ceofguass>T, where Imax is a maximum intensity of a center of the h*h window, ABI is a proportion of pixels in the first image being a set value in the h*h window, ceofguass is a correlation coefficient between a pixel in the h*h window and a two-dimensional Gaussion distribution, and T is the spot determining threshold, in which h is a natural number and particularly, an odd number greater than 1; to determine a spot corresponding to the center of the candidate spot as the spot when the condition is satisfied; and to discard the candidate spot when the condition fails to be satisfied. As shown in
It should be noted that the description of the technical features and advantages of the method for counting the single molecule in any one of the above embodiments and examples is also applicable to the device 400 for counting the single molecule in embodiments of the present embodiment. In order to avoid redundancy, it is not further described in detail here.
For example, in some embodiments, as shown in
In some embodiments, as shown in
As shown in
According to the device 600 for counting the single molecule, the line chart of the time sequence of the intensity of the spot is processed to transform into the image, thus obtaining the connected domains by run-length coding, which results in a quick count for the single molecule and a high accuracy of the count.
It should be noted that the description of the technical features and advantages of the method for counting the single molecule in any one of the above embodiments and examples is also applicable to the device 600 for counting the single molecule in embodiments of the present embodiment. In order to avoid redundancy, it is not further described in detail here.
For example, in some embodiments, as shown in
In some embodiments, as shown in
As shown in
A computer readable memory medium according to embodiments of the present disclosure is configured to store a program that, when executed by a computer, causes the computer to perform any method described above. The computer readable memory medium may be a read-only memory, a random access memory, a magnetic disc, an optical disc, etc.
Reference throughout this specification to “one embodiment”, “some embodiments,” “an embodiment,” “a specific example,” or “some examples,” means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the present disclosure. Thus, the appearances of the phrases in various places throughout this specification are not necessarily referring to the same embodiment or example of the present disclosure. Furthermore, the particular features, structures, materials, or characteristics may be combined in any suitable manner in one or more embodiments or examples.
Although explanatory embodiments have been shown and described, it would be appreciated that the above embodiments are explanatory and cannot be construed to limit the present disclosure, and changes, alternatives, and modifications can be made in the embodiments without departing from scope of the present disclosure by those skilled in the art.
Number | Date | Country | Kind |
---|---|---|---|
201611129604.1 | Dec 2016 | CN | national |
201710607586.1 | Jul 2017 | CN | national |