1. Field of the Invention
The present invention relates to image, and data processing systems and methods. More specifically, the present invention relates to automatic target recognition systems.
2. Description of the Related Art
Automatic target recognition systems are well known in the art. Automatic target recognizers (ATRs) use computer processing to detect and recognize target signatures typically from synthetic aperture radar (SAR) images.
Unfortunately, ATRs often detect targets that are not represented in the ATR's database. That is, these systems typically have knowledge of only a subset of targets that they encounter. Hence, in field operations, current ATRs often place unknown objects (e.g. a bulldozer) into one of a number of known target classes. This results in a high false alarm rate.
Thus, there is a need in the art for a system or method for detecting unknown targets in high resolution ATR SAR imagery. Moreover, there is a need in the art for a system and technique for detecting unknown targets in SAR ATRs that is insensitive to the nature of unknown objects.
The need in the art is addressed by the system and method of the present invention. In the illustrative embodiment the invention is an automatic target recognition system and includes an arrangement for providing a plurality of images; an arrangement for processing the images to identify one or more features thereof; an arrangement for counting each occurrence of each feature in an image as a vote; and an arrangement for using the vote to recognize a presence of an object of a particular class in the image.
In the illustrative application, the class is “unknown” and the system provides an indication of a recognition of an object of unknown classification. In the illustrative embodiment, the feature is a base signature and the arrangement for processing the images includes an arrangement for performing a random projection with respect thereto. The arrangement for counting each base signature as a vote includes an arrangement for executing a shortest path QR algorithm and the arrangement for using the vote to recognize the object class includes an arrangement for placing each vote into a histograms. The invention further includes an arrangements for performing statistical preprocessing and an arrangement for performing statistical preprocessing and rotation.
a is an illustrative SAR image chip of an m109 tank.
b is an illustrative SAR image chip of a bulldozer.
a) is a diagram showing matrix operations in an illustrative implementation of a QR voting scheme in accordance with the present teachings.
b) illustrates a QR voting histogram in accordance with the present teachings.
a is a diagram illustrating a search for directions enclosing data points in the two-dimensional case in accordance with the present teachings.
b is a diagram illustrating a half-plane search for directions enclosing data points in the two-dimensional case in accordance with the present teachings.
Illustrative embodiments and exemplary applications will now be described with reference to the accompanying drawings to disclose the advantageous teachings of the present invention.
While the present invention is described herein with reference to illustrative embodiments for particular applications, it should be understood that the invention is not limited thereto. Those having ordinary skill in the art and access to the teachings provided herein will recognize additional modifications, applications, and embodiments within the scope thereof and additional fields in which the present invention would be of significant utility.
Automatic target recognition currently requires a training of the ATR prior to operation. When the number of training objects is greater than the dimensionality of an object, a dimensionality reduction method may be used. Many dimensionality reduction methods are known in the art, such as Principal Component Analysis (PCA), Singular Value Decomposition (SVD) and Discrete Cosine Transformation (DCT). See “Random Projection in Dimensionality Reduction: Applications to Image and Text Data,” published by E. Bingham and H. Mannila, in Knowledge Discovery and Data Mining, pages 245-250 (2001).
If the number of training objects is less than the dimensionality, a technique called Random Projection (RP) can be use to extract object features for the one target class case through reduction of dimensions. See “Database-friendly Random Projections,” published by Dimitris Achlioptas in Symposium on Principles of Database Systems, pages 274-281 (2001). Random projection has been a powerful tool to reduce the dimensionality of an object while preserving class separation. Here a class refers to a cluster of objects, which share similar features. The separation between clusters allows a given class to be separated from the other classes. The separation may be a c-separated Gaussians mixture. See “Experiments with Random Projection,” published by Sanjoy Dasgupta in Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence, p. 143-151, Jun. 30, 2000. Consider an object image chip having N rows and M columns and N×M dimensions. Each training object image becomes a data point in an N×M dimensional space. Typically, the number of training objects is much less than N×M, i.e., the image size. Relatively, speaking, data points may be sparsely located in a very highly dimensional, space and the base signatures of an object's class may not be easily extracted. Therefore, it may be desirable to transform data points into a feature space through reduction of dimensions.
However, to preserve the class separation distance up to a certain threshold, RP requires maintaining a minimum number of dimensions. This causes problems due to the fact that the distribution of data points in highly dimensional space is typically very sparse. To improve on this limitation on RP, the present invention extends the RP technique by introducing a novel QR voting scheme to extract features.
Hence, a system implemented in accordance with the present teachings should offer the following advantages over existing techniques: 1) an additional means to reduce RP dimensions for the sake of ATR, 2) efficacy in an, environment, i.e., unmanned, where human supervision is no longer available, and 3) an insensitivity to the nature of the objects detected.
In accordance with conventional teachings, random projection maintains a minimum number of dimensions in order to preserve a class separation distance for a given threshold T. In accordance with the present teachings, the same separation distance is preserved using fewer RP dimensions. In addition, the reduced set of RP dimensions consists of occurring RP vectors for the purpose of classifying the Known/Unknown objects. Non-occurring RP vectors are eliminated in order to reduce the false alarm rate. (An ‘occurring’ RP vector is defined as a nonzero projection of a given object onto to a given RP vector. A ‘non-occurring’ RP vector is one, which has a zero projection.) In accordance with conventional teachings, non-occurring RP features may occur in the unknown objects and thereby increase the false alarm rate of the ATR. However, the inventive QR voting scheme is designed to eliminate these non-occurring RP vectors and reduce the false alarm rate.
In short, the present invention provides a novel QR enclosing voting scheme that allows the extraction of base signatures of objects encountered by Automatic Target Recognizers. The occurrence of each base signature obtained from random projection (RP) is counted as a vote by a shortest path QR algorithm. Then, the vote is placed into a histogram for the purpose of recognizing/rejecting class of objects under consideration.
a is an illustrative SAR image chip of an m109 tank.
b is an illustrative SAR image chip of a bulldozer. Note the obvious visual similarity between the images in
The CFAR 16 receives CFAR parameters from a Target/Background Threshold (TBT) 18. The CFAR 16 provides detected image pixels to the SLD 20. The SLD 20 receives image size and resolution parameters from SAR sensor system 22 and outputs region of interest locations and features to the JFD 24. The JFD 24 then uses these features to discriminate target from the background clutter. The automatic target cuer 14 outputs target clutter discrimination and region of interest data to a model-based recognizer or ‘Fast Matcher’ 30. The recognizer 30 receives imaging geometry (e.g. squint angle, depression angle, etc.) from SAR sensor system 32 and stored reference signatures from a data base consists of known targets 34 and outputs a target ID score to a Target Arbitor 38 and a QR confuser target rejector 40 implemented in accordance with the present teachings.
The operation of the target rejector is described more fully below. The target rejector outputs an ‘unknown target recognized’ signal to the Target Arbitor 38 so that minimum false alarms will be made. In general, the goal is to reduce the false alarm rate, which is the declaration that a target of a specified type is present when the declaration is false. In the best mode, the target rejector 40 is implemented in software. However, recognizer 30 and the rejector 40 may be integrated without departing from the scope of the present teachings.
First, the preprocessor 42 uses a statistical method, commonly used with SAR data, and performs a multi-resolution decomposition to reduce processing requirements.
Next, random projection is performed by the RP element 44, providing an over-completed base, and a feature selection is performed using a novel shortest path QR voting scheme. In the preferred embodiment, the shortest path algorithm is implemented in accordance with the teachings of U.S. patent application Ser. No. 10/421,167, entitled SYSTEM AND METHOD FOR SEPARATING SIGNALS RECEIVED BY AN OVERLOADED ANTENNA ARRAY, filed Apr. 22, 2003, by Shu et al. the teachings of which are hereby incorporated by reference herein.
The training phase of the feature selection is shown and discussed more fully below with respect to
Finally, the Gaussian Mixture Model classifier 46 trains itself using these QR selected RP features from the SAR data in the training set. This results in a prototypical feature, which is the mean of the known target cluster. The standard deviation of the cluster is also computed for later use as a rejection threshold. A slightly translated and rotated version of the training set may be included during training to make the system more robust.
In accordance with the present teachings, feature selection via QR voting is executed at step 58 to provide a random QR projection matrix (RQP). This matrix is used at step 60 to compute RQP features on the transformed image. At step 62, the system trains Gaussian Mixture Mode (GMM). Finally, at step 66, class statistics are computed and a rejection threshold is provided. This rejection threshold is utilized in the operational mode as discussed more fully below.
Random Projection (RP)
A system implemented in accordance with the present teachings should achieve feature selection in the RP space with over-completed bases (i.e., the number of RP bases is greater than number of training objects) and may use the following approach. Assume a training set of n images, with each image chip containing p dimensions with R rows and C columns. (i.e., p=RC) Let xi be the preprocessed ith image and data point xi=[xi1, . . . xip] in Rp, i=1:n, p>>n. Moreover, xi is the ith row of X, the preprocessed training set. The random projection of n data points from Rp to Rq requires a p×q projection matrix, where ‘q’ is the minimum number of dimensions required to preserve a given class separation distance. The resulting set of compressed vectors A is exhibited in
Theorem 1: Given n points in RP (in form of an n×p matrix X), choose ε, β>0 and q>=[(4+2*β)/(ε2/2−ε3/3)] ln(n) , and let A=(1/q1/2)XJRP, for projection matrix JRP. Then, mapping from X to A preserves distances up to factor 1±ε for all rows in X with probability (1−n−β). Projection matrix JRP, p×q, can be constructed in one of the following ways:
rij=±1 with probability 0.5 each [1]
rij=±31/2*(1 with probability ⅙ each, or 0 with probability ⅔) [2]
Using the first of the methods suggested by Achlioptas and since we are only concerned with preserving separation between points, we do not scale our projection by (1/q1/2).
The resulting set of compressed vectors A together with X, are used by the shortest path QR voting scheme of the present invention to select the set of “occurring” RP feature bases as shown in
a) is a diagram showing matrix operations in an illustrative implementation of a QR voting scheme in accordance with the present teachings. As discussed more fully below, the QR scheme selects n RP bases of q bases.
Let X=AV where a given column of V represents a ballot containing q candidates, each candidate corresponds to a given RP basis. Each component within a V column contains the voting weight toward a given candidate. There are a total of p ballots, one corresponds to each column of X.
Let xt be a given column vector of X corresponding to the tth dimension of the preprocessed training set, where t=1, 2, . . . , p. A given xt represents a voter that will cast a ballot according to the QR voting scheme. The votes cast are gathered into a histogram as shown in the right hand side of
b) illustrates a QR voting histogram in accordance with the present teachings. As shown in
In accordance with the best mode, the voting is formulated by solving the shortest path problem:
subject to xt=AVt, for jε{1,2, . . . q}.
In particular, by exploiting the near Laplacian distribution of base signatures in the random projection domain, the component weight of vjt can be assumed to achieve sparsity where the term “sparse” refers to the fact that under such a distribution the l1 norm Σjt|vjt| of the components is minimized, therefore maximizing the number of voting components which are zero. Thus, sparsity here means that only a small number of components in the voting domain differ significantly from zero due to projections are random, thus most projections are “non-occurring” RP feature bases. There are at most n components different from zero because of n training images.
Shortest Path QR Voting Scheme
After successfully mapping from X to A by random projection, we are ready to VOTE as follows. Since the system in equation [1] is under-determined (q>>n), its solution is not unique when given the compressed A matrix. The sparse approach to the under-determined case consists in finding the solution that minimizes the l1 norm, as in equation [1], yielding the optimal sparse decomposition. From the point of view of RP space, the l1 norm formulation has a geometrical meaning as shown in
a is a diagram illustrating a search for directions enclosing data points in the two-dimensional case (n=2) in accordance with the present teachings. In
We search for the shortest path using both a clockwise and a counter-clockwise search method and equation [3]. As shown in
Now, letting Wr=[a1 a2]−1 be the reduced N×N inverse matrix (N=2), and vrt be the reduced decomposition along directions a1 and a2.
The components of the votes are then obtained as:
vrt=WrXt, [4]
vjt=0, for j≠1, 2. [5]
Before we generalize the 2-D case to higher dimensions, we introduce a half-plane enclosing search algorithm, rather than employing a clockwise search as before. This kind of 2-D half-plane search can easily be extended to a half-space search suitable for the higher dimensions (3-D and above).
b is a diagram illustrating a half-plane search for directions enclosing data points in the two-dimensional case (n=2) in accordance with the present teachings. As shown in
(xt−⊥xt)′*(a1−⊥a1)>0 [6]
where ⊥xt denotes xt projection onto a2 and ⊥a1 a denotes a1 projection onto a2.
To extend this enclosing condition to the higher dimension n, at each step of finding the next closest, compressed direction, these projections need to be modified. Instead of a projection onto one direction a2, these projections would be done onto the subspace spanned by the set of closest compressed directions identified by all previous steps. This set, called “W,” forms a hyper-plane that partitions an n-dimensional space into two half-spaces. It is this W matrix that provides the foundation for developing a novel QR enclosing method in accordance with the present teachings to decompose a higher dimensional data point xt.
a3 and xt
are projections onto w subspace formed by previously calculated enclosing mixing directions.
The QR enclosing algorithm starts by finding the closest compressed direction a2 to Xt. Letting w1 be the a2, where w1 is the first column of matrix W which is used by the QR decomposition, then, at the second step, we search for the next closest direction w2=a1 from Xt such that (xt−⊥xt)′*(a1−⊥a1)>0. As a result, w=[w1 w2] forms the hyper-plane that partitions a n-Dimensional space into two half-spaces.
In this particular illustration, the hyper-plane made of a1 and a2 is coplanar with that of X2 and X3, and the same front half-space contains both a3 and Xt. Thus w3=a3 is qualified as the third closest direction to Xt. To verify that a3 and Xt reside on the same half-space, the following enclosing condition has to be satisfied:
(xt−⊥xt)′*(a3−⊥a3)>0 [7]
where ⊥xt denotes xt projection onto w and ⊥a3 denotes a3 projection onto w.
To extend this enclosing condition to step k, where k≦n, QR factorization of w is employed to compute xt projection onto w as follows. Let
w=[w1 . . . wk−1]=QR, [8]
then
⊥xt=Q*(Q′*xt); [9]
⊥aj=Q*(Q′*aj); [10]
where ⊥aj denotes aj projection onto w, and wk=aj is the kth closest direction from Xt such that:
(xt−⊥xt)′*(aj−⊥aj)>0 [11]
Finally, when QR has been used to constrain the search and find all the minimum enclosing compressed directions in n-space, we can then estimate the voting components through the shortest path. Let Wr=[w1 . . . wn]−1 be the reduced n×n inverse matrix and let vrt be the reduced decomposition along directions w1 . . . wn. The components of the votes are then obtained as:
vrt=Wrxt, [12]
vjt=0, for j>n. [13]
Resulting magnitudes of components of the votes are then compared with a threshold and the passing votes are placed into a histogram for the purpose of selecting bases for recognizing/rejecting class of objects under consideration. The lower right corner of
As mentioned previously, the training phase of the feature selection is diagrammed in
Finally,
Therefore, the present teachings provide a method that uses real training data (i.e, not synthetic, but real SAR data gathered by an aircraft) to provide a very low false alarm rate for unknown targets in the four class ATR application with low computation and memory requirements.
Thus, the present invention has been described herein with reference to a particular embodiment for a particular application. Those having ordinary skill in the art and access to the present teachings will recognize additional modifications applications and embodiments within the scope thereof. For example, in the best mode, the present teachings are implemented in software stored on a medium and executed by a processor. However, those skilled in the art will appreciate that the present teachings may be implemented in hardware without departing from the scope of the invention.
It is therefore intended by the appended claims to cover any and all such applications, modifications and embodiments within the scope of the present invention.
Accordingly,
Number | Name | Date | Kind |
---|---|---|---|
5319779 | Chang et al. | Jun 1994 | A |
5351310 | Califano et al. | Sep 1994 | A |
5392050 | Guerci et al. | Feb 1995 | A |
5479255 | Denny et al. | Dec 1995 | A |
5497158 | Schmid et al. | Mar 1996 | A |
5828769 | Burns | Oct 1998 | A |
5835682 | Broomhead et al. | Nov 1998 | A |
5864779 | Fujimoto | Jan 1999 | A |
6108437 | Lin | Aug 2000 | A |
6259396 | Pham et al. | Jul 2001 | B1 |
6337654 | Richardson et al. | Jan 2002 | B1 |
6807286 | Krumm et al. | Oct 2004 | B1 |
6897802 | Daniell et al. | May 2005 | B1 |
6968073 | O'Boyle et al. | Nov 2005 | B1 |
7015855 | Medl et al. | Mar 2006 | B1 |
7089009 | Fauconnier | Aug 2006 | B1 |
7116265 | Shu et al. | Oct 2006 | B2 |
20010038713 | Kitagawa et al. | Nov 2001 | A1 |
20020181780 | Simon et al. | Dec 2002 | A1 |
20030128876 | Yamaguchi | Jul 2003 | A1 |
20030185436 | Smith | Oct 2003 | A1 |
20040054499 | Starzyk et al. | Mar 2004 | A1 |
20050143640 | Hoctor et al. | Jun 2005 | A1 |
20050265605 | Nakamoto et al. | Dec 2005 | A1 |
20060176209 | Shu et al. | Aug 2006 | A1 |
20070058836 | Boregowda et al. | Mar 2007 | A1 |
Number | Date | Country | |
---|---|---|---|
20070139251 A1 | Jun 2007 | US |