The present invention relates to remote sensing imagery processing, and more particularly to a method for target detection and identification with hyperspectral image, wherein the information extracted from the proximity pixels is used to aid the target detection and identification.
The unifying trait of all hyperspectral data is the existence of a gross quantity of specific and minuscule spectral bands located within the optical wavelength region. The exact quantity of bands relating to any hyperspectral image varies widely. A single band range located within the visible wavelength region might vary between a single nanometer to hundreds of nanometers. Band ranges located within the infrared and thermal wavelength regions might exceed those for the visible wavelength region. Of course, hyperspectral data is exceedingly desirable, due to the ease with which one may recognize observed entities based on very specific characteristic features, corresponding to those entities, which are associated with very narrow spectral bands. This type of detection and recognition is simply not possible by using traditional methods.
The disadvantage associated with hyperspectral data is the necessary capability to process extraordinary amounts of information. Specific elements, entities or objects, or components thereof, possess specific spectral signatures. One's ability to ascertain a specific spectral signature results in one's ability to ascertain the corresponding element. Previous methods for dealing with hyperspectral data included pattern matching techniques. These techniques rely upon models and least squares algorithms in order to recognize and isolate elements within hyperspectral data. These pattern matching techniques are limited by their lack of robustness. Their results degrade significantly across spatial and temporal variations. They are inadequate at recognizing elemental components from a combined spectral signature. They require a tremendous amount of computation. Their results degrade significantly across sensor and atmospheric variations. They do not deal with nonlinearity well. Also, these techniques do not respond well to increasing databases of elements for which to detect within the hyperspectral data.
It is also noted that the image data represent a very complex set of materials that are not easily classified without accurate ground truth data. The traditional method for target detection and identification with hyperspectral image data set is to use a large number of material signatures as references. The disadvantage of this method is huge computational load and low accuracy.
A main object of the present invention is to provide an efficient proximity-based approach to perform fast and accurate pixel unmixing for target detection and identification.
A further object of the present invention is to provide an efficient proximity-based approach to perform fast and accurate pixel unmixing for target detection and identification by using evolutionary algorithm.
Another object of the present invention is to provide an efficient proximity-based approach to perform fast and accurate pixel unmixing for target detection and identification by using evolutionary algorithm and using the neighboring pixel signatures as references.
FIGS. 1(a) to 1(d) intuitively illustrate the concept of the proximity-based target detection and identification approach.
The present invention provides an efficient proximity-based approach to unmix spectral pixels for target detection and identification. This proximity-based approach uses the neighboring pixel signatures as references to detect material of interest which does not present in the neighboring pixels. By using the neighboring pixel signatures and the signature of material of interest as references there are only nine endmembers are involved. The computational load is dramatically reduced. The accuracy of target detection is also enhanced due to the introduction of the material information presented in the neighboring pixels.
The possible application of the proximity-based approach of the present invention for target detection assumes that the physical environment changes gradually over a small range of terrain (terrain covered by 3×3 pixels). In image science, it is commonly assumed that any pixel property is strongly dependent on the surrounding pixels. If this pixel is partitioned into sub-pixels, the properties of these sub-pixels are determined using a numerical interpolation scheme or by fitting a “true” surface to the set of properties of the surrounding pixels. Pixels further away from a given pixel are not expected to have a considerable contribution to the sub-pixels' properties obtained from a partition of this pixel. That is, subpixel properties of a given pixel can be considered independent of the properties of pixels far away.
A section from band 52 of the Jasper Ridge hyperspectral image is taken to graphically illustrate this effect. A piece of this band comprising a subset 3×3 pixel image, is selected to demonstrate the effect of neighboring pixels on the test pixel in a heterogeneous background.
If this image represents a range of terrain, then it is expected that the discontinuities between consecutive pixels do not, in general, physically exist, but instead, are limitations due to the resolution of the image. FIG. 1(b) shows a 3-D image of the pixel gray values of the image displayed in FIG. 1(a). In an exact representation of the terrain, smoother variations are expected in the pixel gray values.
FIG. 1(c) shows the same terrain under the assumption that correlation among adjacent pixels exists. Under this assumption, the image in FIG. 1(a) was re-sampled (partitioned). Each pixel in FIG. 1(a) was divided into 49 sub-pixels. Under the assumption of correlation among neighboring pixels (often borne out in practice), then FIG. 1(c) is a more realistic representation of the terrain characterized by the image given in FIG. 1(a).
FIG. 1(d) shows a 3-D representation of the pixel gray values for the same image after a bi-cubic interpolation. Continuous changes in the pixel gray values are noticeable. This is a more realistic representation of terrain variations.
This example graphically illustrates the application of the present invention for target detection which is based on the rationale of neighboring pixel spectral signature similarities. If the image shows discontinuity between consecutive pixels, it is assumed that different materials present in some pixels.
The proximity-based target detection approach of the present invention uses the signatures extracted from proximity pixels as the reference to find the inflection point which implies a new material occurring in this pixel, as shown in FIG. 2. Referring to
The proximity-based target detection approach comprises of the following steps:
(1) Read into the hyperspectral image data, wherein the hyperspectral image data is a hyperspectral cube, i.e. to receive the hyperspectral image cube which represents a scene in terms of wavelength and spatial position.
(2) Select the trial pixel, wherein the trial pixel can be presented by its location (x,y).
(3) Select target/material of interest from a target database, wherein the material of interest represents a target for target detection and identification.
(4) Build a reference spectra library, where endmembers of the spectra library are the signatures collected from the eight neighboring pixels and the signature of the material of interest.
(5) Apply an abundance estimator to unmix the trial pixel, wherein the abundance estimate of the material of interest implies the presence of the target.
Referring to
The hyperspectral image input module 10 reads image data from the hyperspectral image cube which is a 3-dimensional data set. The horizontal location (x,y) presents two dimensions and the third dimension is the wavelength.
The trial pixel selection module 30 defines a trial pixel for analysis. The trial pixel is represented by its location, i.e. (x,y), which can be selected interactively if a graphic user interface (GUI) is available. It also can be selected by input the numbers of x and y.
The reference spectra building module 40 collects the signatures of the neighboring pixels around the trial pixel and the signature of the material of interest. The neighboring pixels are (x−1, y+1), (x, y+1), (x+1, y+1), (x−1, y), (x+1, y), (x, y−1) and (x+1, y−1).
The abundance estimator 50 performs the unmixing of the trail pixel by using the reference signatures from the reference spectra building module 40. The abundance estimator 50 can be a maximum likelihood estimator, a least square estimator, or an evolutionary algorithm.
The preferred implementation of the evolutionary algorithm is shown in
The initial population generation module 52 creates p initial parent strings of abundance (a1, a2, . . . , ap). A random number generator can be utilized to produce uniform numbers between 0 and 1, which guarantees that the values of the elements of the abundance vector are between 0 and 1. In order to make the total abundance of each parent equal to 1.0, each element of the abundances vector of each parent is normalized by the sum, i.e.,
The generated p parents are sent to the selection and coupling module 53.
The cost function module 51 plays a role to evaluate the population of the abundance. The cost function module 51 can be mean square error (MSE). The selection and coupling module 53 receives the population of abundance and selects two best parents based on the cost function module 51, i.e. a minimum value of the MSE. The value of the MSE for any parent in the current population is calculated by
where l is the total band number, m is the total number of endmembers, s is the endmember signature, a is the abundance, and k represents the kth parent. The selected two best parents are sent to the crossover module 54 to perform crossover operation.
In the crossover module 54, first, a split point is chosen for both of the two best parents. If ai1 (i=1,2, . . . m) represents the first best parent and ai2 (i=1,2, . . . m) is the second best parent, after crossover, the new string will be
where msp represents the location of the split point. As an example, let the two best parents have the following abundance values:
If the split point is located between the second and the third elements, after the crossover, two new strings will be created. They are:
After crossover, the new strings should be normalized to make the sum of the abundances equal to 1. If the new strings are better than any parent in the old population, the new strings will replace the old ones and enter the new population. Otherwise, the parent strings will be inherited.
After crossover operation, the survived strings are sent to the mutation module 55, where each parent string mutates into a child by generating −10% to 10% random numbers to add to the elements for each parent. The string is normalized and then sent to the fitness evaluation module 56. The fitness evaluation module 56 calculates the MSE for each new string. If the value of MSE of the new string is better than the parent's, the new string will go into the new population as the child string. Otherwise, the parent will be kept as part of the new population.
For a specified problem, a different cost function (fitness) will be determined to evaluate the population. In many problems, the objective is more naturally stated as the minimization of some cost function J(x) rather than the maximization of some utility or profit function u(x). Even if the problem is naturally stated in a maximization of form, this doesn't guarantee that the utility function will be nonnegative for all values of x as it is required in the fitness function. Therefore, it is necessary to change the cost function into a fitness function. The duality of cost minimization and profit maximization is well known. In normal research work, the simple way to transform a minimization problem to a maximization problem is to multiply the cost function by a minus one. However, in evolutionary algorithm work, this operation is insufficient because the measure thus obtained is not guaranteed to be nonnegative in all instances. So with evolutionary algorithm the following cost-to-fitness transformation is used
There are a variety of ways to choose the coefficient Cmax, Cmax may be taken as an input coefficient. For a control problem, the performance measures (cost function J(x)) usually are
All of these performance indicators can be used to decide when the optimization should be terminated. This operation is performed by Discriminator 57, as shown in FIG. 4.
Referring to
The parent input module 541 takes the two best parents from the selection and coupling module 53. These two parents will exchange their string parts after determination of a crossover point. The crossover point determination 542 randomly picks up a number between 1 and m−1 as the crossover point, where m represents the total number of endmembers.
The first child generation 543 combines the first string part of the first parent and the second string part of the second parent to form the first child. The first child is normalized in normalization 544 that the sum of the abundance equal to 1. The normalized first child is put into the child pool 547.
The second child generation 545 combines the first string part of the second parent and the second string part of the first parent to form the second child. The second child is normalized in normalization 546 that the sum of the abundance equal to 1. The normalized second child is put into the child pool 547.
Referring to
The parent input module 552 takes the survived strings from crossover module 54. The random number generation 551 generates random numbers that, in child generation 553, are added to the elements of each parent from parent input module 552. The mutated child from child generation 553 is normalized in normalization 554.
The MSE calculation 555 computes the mean square error corresponding to each string, and the survived child is finally chosen in survival selection 556.
This is a regular application of a provisional application, application No. 60/211,343, filed Jun. 13, 2000.
Number | Name | Date | Kind |
---|---|---|---|
6008492 | Slater et al. | Dec 1999 | A |
6075891 | Burman | Jun 2000 | A |
6282301 | Haskett | Aug 2001 | B1 |
6353673 | Shnitser et al. | Mar 2002 | B1 |
6665438 | Lin | Dec 2003 | B1 |
6678395 | Yonover et al. | Jan 2004 | B2 |
6694064 | Benkelman | Feb 2004 | B1 |
6724940 | Qian et al. | Apr 2004 | B1 |
6741744 | Hsu | May 2004 | B1 |
6750964 | Levenson et al. | Jun 2004 | B2 |
6778702 | Sweet | Aug 2004 | B1 |
6792136 | Niesen | Sep 2004 | B1 |
Number | Date | Country | |
---|---|---|---|
20020012451 A1 | Jan 2002 | US |
Number | Date | Country | |
---|---|---|---|
60211343 | Jun 2000 | US |