The invention relates to a computer-implemented method of analysing image texture. In particular, it provides a method of applying selected mappings to an image to generate a surface that may be analysed to characterise the texture of the image.
Image texture is a useful characteristic for object differentiation. However, image texture is a difficult characteristic to describe without using some kind of comparison to already familiar objects. Thus, human analysis of image texture is highly subjective. A computer based texture analysis method has the potential to provide a quantitative, and therefore more useful, measure of texture.
Once a textured object is in digital form the problem of analysis becomes a mathematical problem rather than the verbal problem of description. The mathematical problem has proven to be no less intractable.
The problem of image analysis has traditionally been approached as an analysis of visual images or pictures of macroscopic objects, such as crops or forests, or microscopic objects such as stained tissues or cells. However, the principles of texture analysis are applicable to any image be it visual, acoustic or otherwise. Image texture can be qualitatively evaluated as having one or more of the properties of fineness, coarseness, smoothness, granulation, randomness, lineation or being mottled, irregular or hummocky. These are relative terms, a useful analysis of the characteristics of an image must include an absolute measurement of image properties.
In general terms, the approaches to texture analysis can be divided into three primary techniques, these are: statistical, structural and spectral. Statistical techniques include first order approaches such as autocorrelation functions and higher order approaches such as gray-tone spatial-dependence matrices (see Haralick, Shanmugan & Dinstein; “Textural features for image classification”, IEEE Transactions on Systems, Man and Cybernetics SMC-3(6), 610-621; 1973). Structural techniques include the use of primitives and spatial relationships (see Haralick; “Statistical and structural approaches to texture”, Proceedings of the IEEE 67(5), 786-804; 1979). The main spectral technique is Fourier analysis.
There are five steps in image processing. These steps are image acquisition, preprocessing, segmentation, representation and description, and recognition and interpretation. The invention falls into the step of representation and description. Persons skilled in the art will be aware of existing techniques for performing the remaining four steps of image processing.
A general explanation of the prior art relevant to texture analysis can be found in Chapter 9 of “Vision in Man and Machine” by Levine, M; McGraw-Hill, Kew York (1985). Despite research in the area of texture analysis no single method has been developed that completely describes texture either verbally or mathematically.
Abdel-Mottaleb makes mention of texture analysis in U.S. Pat. No. 5,768,333, as a technique for second stage discrimination of objects. While, Abdel-Mottaleb makes use of a large number of gray level thresholds, image texture is then characterised by computing many features of each connected component in the resulting threshold separately, no 2-dimensional surface is formed.
It is an object of the present invention to provide a method of image texture analysis to usefully describe observed image texture.
Further objects will be evident from the following description.
In one form, although it need not be the only or indeed the broadest form, the invention resides in a method of analysing the texture of an image, including the steps of:
mapping the image into a first set of binary representations of the image, wherein each representation in the set corresponds to a transformation by a first monotonically varying operator;
mapping each binary representation of the first set of binary representations into a further set of binary image representations using a second monotonically varying operator, thereby forming a matrix of binary image representations;
transforming the matrix by allocating a scalar value to each binary image representation;
arranging the scalar values in a two dimensional space against parameters of the first monotonically varying operator and the second monotonically varying operator to form a first array of scalar values; and
identifying defined texture characteristics from the resulting first array of scalar values.
By monotonic operator is meant an operator acting on an image, the result of which only decreases (never increases), with an increase in its parameter, or with a decrease in its argument.
In preference, the first monotonically varying operator is a threshold operator wherein the threshold operator applies a threshold value that varies monotonically from a minimum value to a maximum value to produce the first set of binary representations.
In preference, the second monotonically varying operator is a spatial operator wherein the spatial operator applies an opening that varies monotonically from a minimum value to a maximum value to produce the further set of binary representations. Suitably, the second monotonically varying operator is a morphological opening that removes all image objects below a defined size.
The method may include the further step of calculating a second array of scalar values derived from the first array of scalar values by taking at each point the minimum of the first difference in a first direction and the first difference in a second direction.
The step of identifying defined texture characteristics may include the further step of comparing the first or second array of scalar values to a knowledge base of first or second array values known to indicate defined texture characteristics.
The method may include the further step of calculating one or several texture feature values by computing linear sums of the values in the first or second arrays of scalar values. In preference, the coefficients of these linear sums are determined by a principal components analysis on the data.
In a further form, the invention resides in an image analysis apparatus comprising:
storage means for storing a digital representation of an image;
processing means for:
display means for displaying the first array of scalar values and identified texture characteristics.
To assist in understanding the invention preferred embodiments will now be described with reference to the following figures in which:
a shows a 50×50 pixel gray level image;
b is a binary image produced by thresholding the image of
c is the image produced by opening the image of
d is the image produced by opening the image of
e is a binary image produced by thresholding the image of
f is the image produced by opening the image of
g is the image produced by opening the image of
a shows the granold surface derived from the example of
b shows the granold spectrum derived from the granold surface of
a shows a gray level image in a first example of the application of the method to the characterisation of ammonium sulphate pellets;
b is a plot of a granold surface produced by thresholding the image of
c is a granold spectrum corresponding to the granold surface of
d is a threshold/radius location of the maximum value of the granold surface of
a shows a gray level image in a second example of the application of the method to the characterisation of ammonium sulphate pellets;
b is a plot of a granold surface produced by thresholding the image of
c is a granold spectrum corresponding to the granold surface of
d is a threshold/radius location of the maximum value of the granold surface of
a shows a gray level image in a third example of the application of the method to the characterisation of ammonium sulphate pellets;
b is a plot of a granold surface produced by thresholding the image of
c is a granold spectrum corresponding to the granold surface of
d is a threshold/radius location of the maximum value of the granold surface of
In the drawings, like reference numerals refer to like parts. Referring to
The flow diagram is directed to the situation in which the monotonic mappings are gray level thresholding and spatial opening. The example is two-dimensional but the method is applicable to multi-dimensional applications. A mathematical description is presented later which generalises the specific example to a wider range of situations.
In the flow diagram, the gray level image to be analysed is designated as X. A threshold operator Φi, is applied at a first value of I=imin. Typically, an 8-bit gray level image might first be thresholded at gray level 0. A binary image Φi(X) is obtained by this thresholding operation where pixels having a gray level above the threshold will have a value of one and all other pixels have a value of zero. Applying spatial operator Ψj then morphologically opens the binary image.
In the example described here, a set of disc shaped structuring elements of increasing radius is used as the morphological opening. Thus, the smallest value of j=jmin corresponds to the first member in the set, which has the smallest radius. All pixel clusters having a radius less than j are set to zero and the rest are retained. The remaining area of the resulting binary image ΨjΦi(X) is then calculated (for example, by counting all pixels having a value of one) to give a value δΨjΦi(X). This value is stored in a two-dimensional array Gx in a position corresponding to the ith threshold operator versus the jth spatial operator.
The value of j is iterated and the next spatial operator Ψj+1 is applied to Φi(X) and the value δΨj+1Φi(X) is calculated and stored in the array at location Gx(i,j+1). The iteration and application of the spatial operator continues until j=jmax. The value of jmax may be arbitrarily chosen or may be the value of j such that δΨjmaxΦi(X)=0. This process of continually opening a binary image with a suitable set of structuring elements and measuring the remaining area produces a morphological granulometry of the binary image.
Once the granulometry of the first binary image has been completed the original gray level image is thresholded for the next value of I. The morphological granulometry for the I+1th binary image is then formed and entered into the granold surface array. The process is repeated, as shown in
The resultant array is represented in FIG. 2 and is referred to as the topographical granold surface. The array has (imax−imin)×(jmax−jmin) elements that can be plotted as a surface in the manner shown in FIG. 3. Corners appear on the surface that indicate a relationship between size and gray level of granules in the original image. An idealised corner is shown in the surface Gx(i,j) of
The complete granold surface Gx(i,j) for the image of
Characterisation of matter by pellet size and colour intensity may be applicable to a variety of industries, such as confectionery, pharmaceuticals and fertilisers, for quality control. FIG's 6-8 provide an example of the application of the invention to the characterisation of ammonium sulphate pellets.
In each of FIG's 6-8, image (a) is a gray level image produced by a video camera and frame grabber as ammonium sulphate pellets fall from a drum granulator in a fertiliser plant. The pellets of FIG. 6 and
Image (b) of each figure is a plot of the granold surface produced by applying a threshold operator that applied gray level thresholds from 0 to 255 and a spatial operator that applied a set of 16 disc shaped structuring elements. Image (c) is the corresponding granold spectrum.
Further processing of the granold spectrum involved identifying the threshold/radius location of the maximum value of the surface. The value indicates the gray level and size of the majority of the pellets in the image. It will be appreciated that statistical analysis of the granold spectrum can characterise the pellets by the mean and standard deviation of the size and colour of the pellets. Such an analysis provides a useful standard for quality control.
It will be appreciated that the foregoing examples are limited to two-dimensional space but can be generalised to multi-dimensional images to produce a similar granold surface and granold spectrum. Furthermore, the operators that produce the mappings are not limited to threshold, opening size and area. Other suitable operators may include contrast dynamic, attribute opening, etc.
The following generalised description is presented mathematically in a manner that will be understood by those skilled in the relevant art.
MATHEMATICAL DESCRIPTION
In order to describe the method it is necessary to define certain notations.
In the following, “R+” denotes the non-negative real numbers (including zero), “Z” the integers { . . . , −1,0,1, . . . } and “N” the natural numbers {0,1,2, . . . }.
We define
{F}:Zn→Zn (1)
Here {F} represents the general class of m-integer valued functions on n-dimensional space. We would typically use such a function to represent a gray scale 2-D image, in which case we would write, for example, “let XεF be a 2-D gray scale image . . . ”. This statement implies that n=2 and m=1. We keep the dimensions of the image representation general (at n and m) so as not to unnecessarily limit the scope of the method to 2-D images. The domain of the image XεF will be denoted by Dom(X). The domain is the set of al points in Zn for which X is defined.
Next we define,
BεP(Zn) (2)
Where P is the powerset operator, so P(Zn) is the set of all subsets of Zn. B represents the general class of binary images on n-dimensional integer space. We would typically use such a notation to represent a 2-D binary image where we would write, “let be a 2-D binary image . . . ”. which statement implies that n+2. Again, we keep the dimensions of the image general at n so as not to unnecessarily limit the scope of the method to 2-D images. The domain of the binary image BεB will be denoted Dom(B).
Based on the image spaces defined above we now define the mappings or operators used to form the granold surface. Firstly, we define,
Φ:F→B (3)
This function to set mapping takes a gray scale image and returns a binary image. As an example let XεF be a 2-D gray scale image, so we could write, B=Φ(X). The resulting image BεB is a 2-D binary image.
The thresholding of regions based on region minima or maxima values, the thresholding of regions based on contrast dynamic values, and the gray level threshold operation, well known in image processing, are all examples of this type of mapping.
Secondly we define,
Ψ:B→B (4)
This set to set mapping, which we will call an “operator”, takes a binary image and returns a binary image. As an example, let BεB be a 2-D binary image so we could write, A=Ψ(B). The resulting image AεB is also a 2-D binary image.
The binary AND operation, the attribute opening operation, and the morphological binary opening operation, also well known in image processing, are all examples of this type of mapping.
It is the intention that these mappings preserve the domain or support of the images in all cases, that is,
Dom(Φ(X))=Dom(X) (5)
and
Dom(Ψ(B))=Dom(B) (6)
and with the composition of these mappings'
Dom(ΨΦ(X))=Dom(X) (7)
Lastly, we define the set measure,
δ:B→R+ (8)
This set to scalar mapping, which we will call a “measure”, takes a binary image and returns a non-negative scalar value. The area (number of object pixels) of a binary image is an example of this type of mapping, so we could write, for example, area=δ(B).
The next step is to parameterise the mappings Φ and Ψ to give families of mappings and operators indexed on non-negative integer parameters iεN. Therefore we have families,
{Φi}:F→B (9)
and,
{Ψi}:B→B (10)
It is important for the method that the above mappings posses additional mathematical properties as follows:
Property 1—Monotone Decreasing for Parameter
The families of mappings Φi and Ψi must be “monotone decreasing” with respect to their parameter I. Also, to ensure that (in the limit) the mappings extract all the information from the images, there should exist values M and N (which may depend on the images) such that for i>M and i>N respectively, the appropriate mappings produce the empty-set, Ø. In contrast, we should also ensure that the index zero mappings have the least possible effect on the image and respectively produce the domain of the image or reproduces the image itself.
In mathematical terms, for the mappings Φi and for any XεF,
Likewise, for the operators Ψi and for any AεB,
Property 2—Monotone Increasing for Argument
The families of operators (set to set mappings) Ψi need to have some additional properties to ensure the granold surface is well behaved. Firstly, Ψi needs to be “anti-extensive” and to remove detail from the binary image BεB, that is, for all iεN
Ψi(B)=B (17)
Secondly, the operators Ψi must be “monotone increasing” with respect to their arguments, the binary images, that is for the binary images A, BεB for a fixed integer iεN
A⊂BΨi(A)⊂Ψi(B) (18)
Property 3—Measurement
Finally, the measure δ, to represent the usual sense of measuring something, must be: (I) zero if the object to be measured is empty,
δ(Ø)=0 (19)
and (ii) increasing, that is,
A⊂Bδ(A)≦δ(B) (20)
Now we consider the composition of these mappings as follows'
(δΨjΦi):F→R+ (21)
This function to scalar mapping, which we have previously termed a “granold” (granulometry of threshold), takes a gray scale image and returns a non-negative scalar which depends on the values of the parameters I and j (and of course the image itself).
To show the dependence on I and j explicitly, vie choose to denote the granold Gx(i,j):N2→R+ of an image XεF as a 2-D function,
Gx(i,j)def=(δΨjΦi)(X) (22)
If G is represented as a function on two dimensions or a surface then regions I and j both have a large effect on the value of δ will appear as “corners” in the surface. The location and “sharpness” of these corners will reveal the particular values of I and j that have a large effect on the value of δ.
The following first differences are useful for analysing G. These differences approximate the directional derivatives of the granold and will be large wherever there is a rapid fall in the granold in the corresponding directions.
The quantity,
is useful to determine where the granold surface has simultaneous large first differences in both the Φi and Ψi directions, and produces the granold spectrum.
To classify textures we may compute the texture features,
where the coefficients, αkij and βkij and would be found by, for example, principal components analysis of the data.
A suitable apparatus for putting the invention into effect is shown in FIG. 9. An image is acquired with acquisition means 1. This could be a digital camera or digitizer that converts an analogue signal. Pre-processing may occur at 2. This may be on-board processing in the acquisition means 1, or may be, for example, post-acquisition filtering.
Signals are passed to a processor 3 that is programmed to work the method described above. Required parameters may be stored in ROM 4 and temporary storage is provided by RAM 5. The result of the image texture analysis is displayed on monitor 6. Other data storage means such as disk or tape (not shown) may also be connected to the processor 3. The data storage may store a knowledge database of previous image texture analyses that can be used to identify texture characteristics by comparison.
Persons skilled in the relevant art will appreciate that the method of image texture analysis can be applied to a diverse range of situations. The inventors have demonstrated the application of the method to fertiliser pellets. Preliminary considerations indicate that it is also applicable to analysis of cervical cells on a Pap smear slide on a microscopic level and skin screening at a macroscopic level. Throughout the specification the aim has been to describe the preferred embodiments of the invention without limiting the invention to any one embodiment or specific collection of features.
Number | Date | Country | Kind |
---|---|---|---|
PP8984 | Mar 1999 | AU | national |
Number | Name | Date | Kind |
---|---|---|---|
5426684 | Gaborski et al. | Jun 1995 | A |
5680475 | Zwierski et al. | Oct 1997 | A |
5832118 | Kim | Nov 1998 | A |
5848198 | Penn | Dec 1998 | A |
6038342 | Bernzott et al. | Mar 2000 | A |
6052126 | Sakuraba et al. | Apr 2000 | A |
6285373 | Baldwin et al. | Sep 2001 | B1 |
6333747 | Murata et al. | Dec 2001 | B1 |
6636635 | Matsugu | Oct 2003 | B1 |
Number | Date | Country |
---|---|---|
A-3839597 | Sep 1998 | AU |
Number | Date | Country | |
---|---|---|---|
20020051571 A1 | May 2002 | US |