This application relates to pattern recognition, and more particularly to the detection of threats using x-ray imaging.
The discrimination of suspicious content in luggage through x-ray imaging is quite challenging since luggage, by its very nature, tends to be crammed with diverse objects that produce considerable clutter in the resulting x-ray images. In addition, mass transportation at airports requires considerable throughput such as the clearing of at least 450 bags per hour for each x-ray imaging system. An alternative to automated analysis of the x-ray images is manual analysis. But human recognition of threats such as guns and knives in x-ray images of luggage has proven to be error-prone, particularly when the threat presents itself in the image in a non-standard profile or orientation and when superimposed against clutter. Moreover, human beings are prone to fatigue and can thus miss threats even when they have been highly trained.
To improve the accuracy and speed for automated x-ray imaging and detection of threats in luggage, the use of a convolutional neural network (CNN) has been studied. But these conventional CNN efforts to detect threats in luggage have not solved the detection problem posed by the clutter and non-standard profile or orientation of a weapon that will commonly occur in the packing of such a weapon into luggage with assorted clothing and other items. Accordingly, there is a need in the art for improved x-ray analysis and detection of weapons or threats in luggage.
A weapon detection station is provided for the detection of weapons or threats in luggage. The weapon detection station includes an x-ray system for capturing x-ray images of luggage. The captured image data may be streamed to the cloud or processed in the weapon detection station itself by a suitable processor such as a graphics processing unit (GPU). The GPU incorporates a pre-trained commercial off-the-shelf (COTS) convolutional neural network (CNN) that is further trained on images of weapons. To address the non-standard orientation problem that will commonly arise with the taking of a two-dimensional (2D) x-ray image of a weapon within luggage, the CNN is also further trained on partial images of weapons.
Each beam within the x-ray system corresponds to the collection of a 2D image. To increase accuracy, a dual-beam (which may also be designated as a dual-energy) x-ray system is used in some embodiments. Regardless of how many beams are employed, the trained CNN does not analyze the original 2D x-ray image corresponding to a particular beam. Instead, to address the clutter that is endemic to x-ray imaging of luggage, the weapon detection station filters the original 2D x-ray image according to a threshold so as to distinguish relatively dense items from background clutter. Following the filtering, a minimum size requirement is applied to filter out dense objects that are too small to constitute a weapon. The resulting disconnected dense objects are then placed in separate image bins for processing by the trained CNN. To further increase accuracy, the prediction results from the separate beams in dual-beam embodiments are combined.
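By way of illustration only, the thresholding, minimum-size filtering, and binning steps described above may be sketched as follows. The sketch assumes that the image is a 2D grid of density values in which larger values denote denser material, and that disconnected objects are separated by 4-connected flood fill; the function name extract_dense_objects and the parameters threshold and min_pixels are hypothetical and do not appear in the disclosure.

```python
from collections import deque


def extract_dense_objects(image, threshold, min_pixels):
    """Return one binary mask ("image bin") per connected dense object.

    image: 2D list of density values (assumed: higher means denser).
    threshold: pixels at or above this value are treated as dense.
    min_pixels: objects with fewer dense pixels are discarded as too small.
    """
    rows, cols = len(image), len(image[0])
    seen = [[False] * cols for _ in range(rows)]
    bins = []
    for r0 in range(rows):
        for c0 in range(cols):
            if seen[r0][c0] or image[r0][c0] < threshold:
                continue
            # Breadth-first flood fill over 4-connected dense neighbours.
            component = []
            queue = deque([(r0, c0)])
            seen[r0][c0] = True
            while queue:
                r, c = queue.popleft()
                component.append((r, c))
                for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                    if (0 <= nr < rows and 0 <= nc < cols
                            and not seen[nr][nc]
                            and image[nr][nc] >= threshold):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            # Minimum-size requirement: drop objects too small to be a weapon.
            if len(component) >= min_pixels:
                mask = [[0] * cols for _ in range(rows)]
                for r, c in component:
                    mask[r][c] = 1
                bins.append(mask)
    return bins
```

Each returned mask isolates a single dense object, so that a classifier sees one candidate object at a time rather than the full cluttered image.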
These and other advantageous features may be better appreciated through the following detailed description.
Embodiments of the present disclosure and their advantages are best understood by referring to the detailed description that follows. It should be appreciated that like reference numerals are used to identify like elements illustrated in one or more of the figures.
To quickly and accurately detect weapons in luggage, a commercial off-the-shelf (COTS) pre-trained convolutional neural network (CNN) is exploited by further training the CNN on images unrelated to those used for its pre-training. In particular, images of weapons such as handguns and knives are used to provide this further training. As a control to guard against false detection, the CNN is also trained on images of innocuous items that are shaped similarly to a knife, such as a shoe having stiletto heels or a wrench. In addition, the CNN is also trained on partial images of knives and handguns. Such partial image training is quite advantageous in that the clutter that is endemic to the hand-packing of luggage with various and sundry items can often obscure the outline of a weapon. Yet certain features such as the trigger of a handgun or the sharpened point of a knife are quite distinctive even when viewed in isolation. The partial image training thus enables the resulting trained CNN to spot a weapon even when the weapon is partially obscured by clutter. The partial image training is also quite advantageous with regard to identifying a weapon when viewed in a non-standard orientation. As compared to traditional manual inspection of x-ray images to detect weapons, the resulting detection has much greater accuracy yet requires relatively little computing complexity. For example, accuracies of 97% or greater were achieved in distinguishing various types of handguns and knives from non-weapons such as wrenches or other types of household items.
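The disclosure does not specify how the partial images are produced. As one hedged illustration, partial views may be generated by cropping a full training image at each of its four corners. The function name partial_views and the fraction parameter are hypothetical, and corner cropping is an assumption rather than the method of the disclosure.

```python
def partial_views(image, fraction=0.5):
    """Return four crops of a 2D image, one anchored at each corner.

    image: 2D list of pixel values.
    fraction: share of each dimension kept by a crop (assumed value).
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    rows, cols = len(image), len(image[0])
    h = max(1, int(rows * fraction))
    w = max(1, int(cols * fraction))
    # Top-left, top-right, bottom-left, bottom-right anchor points.
    corners = [(0, 0), (0, cols - w), (rows - h, 0), (rows - h, cols - w)]
    return [[row[c:c + w] for row in image[r:r + h]] for r, c in corners]
```

Each crop would carry the same label as its source image, so that a distinctive feature such as a trigger or a blade tip is presented to the CNN in isolation.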
An example weapon detection station 100 is shown in
To enhance accuracy and automate weapon detection, x-ray imaging system 105 is integrated with a CNN system 140 that includes a suitable processor such as a GPU 145 for performing a CNN algorithm on the x-ray images. Examples of suitable GPUs include the NVIDIA K80/40 GPU and related models. Should a weapon or other form of security threat be detected, it is then highlighted by CNN system 140 on a second display (Display2). In alternative embodiments, the first and second displays may comprise a single integrated display. Advantageously, it has been discovered that remarkable accuracy may be achieved through the use of COTS convolutional neural networks such as the MATLAB-based “AlexNet” CNN. Such COTS CNNs are typically pre-trained on the “ImageNet” database, which has 1,000 object categories over approximately 1.2 million images.
The pre-trained CNN as programmed into GPU 145 is further trained as shown in
To further increase accuracy of the resulting CNN image processing, the images within the desired categories for additional training of the pre-trained CNN may be refined by the removal of those images that were not correctly identified. Should some unseen images be incorrectly identified, they are removed from the training image database, whereupon the pre-trained CNN is re-trained on the refined database of images. Conversely, the training database of images is enhanced by the inclusion of unseen images that are correctly identified. Such recursive correction of the training database may be performed manually or may be automated.
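A single pass of the database refinement described above may be sketched as follows, under stated assumptions: the databases are lists of (image, label) pairs, and predict is a hypothetical callable standing in for the trained CNN. The function name refine_training_set is likewise illustrative only.

```python
def refine_training_set(training_set, unseen, predict):
    """Perform one refinement pass over a labelled image database.

    training_set, unseen: lists of (image, label) pairs.
    predict: callable mapping an image to a predicted label
             (hypothetical stand-in for the trained CNN).
    """
    # Remove training images that the network does not identify correctly.
    kept = [(img, lab) for img, lab in training_set if predict(img) == lab]
    # Add previously unseen images that the network identifies correctly.
    added = [(img, lab) for img, lab in unseen if predict(img) == lab]
    return kept + added
```

The pre-trained CNN would then be re-trained on the returned database, and the pass may be repeated with a fresh set of unseen images.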
With the pre-trained CNN trained on the training set of images, weapon detection station 100 is then ready to detect weapons in the resulting x-ray images of luggage. Note that it is conventional for an x-ray imaging system such as system 105 to layer the x-ray images into different color layers for enhanced manual inspection of the layered x-ray images, such as by TSA personnel. For example, the x-ray image is separated into layers based upon the density of the objects within the x-ray images. A dense layer (such as given the color blue) is then filtered according to threshold process 300 as shown in
The CNN analysis following the thresholding is shown in
It will be appreciated that many modifications, substitutions and variations can be made in and to the materials, apparatus, configurations and methods of use of the devices of the present disclosure without departing from the scope thereof. For example, the thresholding and CNN analysis may be done locally at the weapon detection station or may be performed in the cloud. In light of this, the scope of the present disclosure should not be limited to that of the particular embodiments illustrated and described herein, as they are merely by way of some examples thereof, but rather, should be fully commensurate with that of the claims appended hereafter and their functional equivalents.
This application claims the benefit of U.S. Provisional Application No. 62/415,331, filed Oct. 31, 2016.
Number | Date | Country
---|---|---
62415331 | Oct 2016 | US