The present application claims priority to Taiwan application No. 110141070, filed on Nov. 04, 2021, the content of which is hereby incorporated by reference in its entirety.
The present application relates generally to an assessment system and an assessment method, and more particularly to a system and a method to assess abnormality.
With the development of technology, the applications of artificial intelligence (AI) become more and more diversified. Performing the image detection via an image recognition model is an example. The pre-training procedure of the image recognition model is closely related to its performance. There are a lot of model training methods in the technical field of the artificial intelligence, wherein “supervised learning” is a mainstream method. The fundamental of the supervised learning is to collect a large number of image samples and manually apply a respective feature label to each image sample. The feature label is the objective for the image recognition model to recognize. The image recognition model is trained according to the large number of the image samples and their feature labels.
It is to be understood that the performance of the image recognition model is limited to the contents of the image samples and their feature labels. Namely, the image recognition model trained no more than the supervised learning fails to recognize an objective excluded from said feature labels. For example, in the training by the supervised learning, known abnormalities of the image samples are labelled as feature labels, such that the image recognition model can learn no more than the known abnormalities. When the image recognition model runs at the worksite practically, the image recognition model may receive a product image from a camera of a production line. Although the image recognition model can recognize the known abnormalities, the image recognition model fails to recognize unknown abnormalities.
Another training method is “unsupervised learning”. In the training by the unsupervised learning, the image samples do not need to be labelled for the above-mentioned feature labels. The image recognition model just learns to recognize the features in the image samples. As a result, when the image recognition model runs at the worksite practically, although the image recognition model trained no more than the unsupervised learning can recognize multiple features in the product image, the image recognition model fails to recognize whether any feature recognized in the product image is abnormal or not.
In conclusion, the image recognition model trained no more than the supervised learning, or no more than the unsupervised learning, has the shortcoming as mentioned above, thereby limiting the application of the image recognition model running at the worksite practically, and should be further improved.
An objective of the present invention is to provide a system and a method to assess abnormality to overcome the shortcoming that an image recognition model trained no more than the supervised learning fails to recognize unknown abnormalities, and overcome another shortcoming that an image recognition model trained no more than the unsupervised learning fails to recognize whether any feature is abnormal or not.
The system to assess abnormality of the present invention is adapted to be connected to an image capturing device and comprises multiple classification models and a processing module. Each one of the classification models is alternately trained by supervised learning and unsupervised learning. Parameters of the classification models are not identical. The processing module is connected to the classification models, receives a test image from the image capturing device, and outputs the test image to the classification models to respectively obtain multiple feature vectors of test images from the classification models and to generate an abnormality assessment information.
The method to assess abnormality of the present invention is performed by a processing module and comprises: receiving a test image from an image capturing device, and outputting the test image to multiple classification models to respectively obtain multiple feature vectors of test images from the classification models, wherein each one of the classification models is alternately trained by supervised learning and unsupervised learning, and parameters of the classification models are not identical; and generating an abnormality assessment information based on the feature vectors of test images.
According to the system and the method of the present invention to assess abnormality, each one of the classification models is alternately trained by the supervised learning and the unsupervised learning, so as to have the characteristics of both the supervised learning and the unsupervised learning. The abnormality assessment information generated by the present invention can indicate not only known abnormalities, but also unknown abnormalities, thereby overcoming the shortcoming that an image recognition model trained no more than the supervised learning fails to recognize unknown abnormalities, and overcoming another shortcoming that an image recognition model trained no more than the unsupervised learning fails to recognize whether any feature is abnormal or not.
With reference to
The present invention may be applied to a tile production line as an example. The application of the present invention should not be limited to the tile production line. With reference to
In the present invention, the classification models 11 are artificial intelligence models, such as convolutional neural networks (CNN) models. Program codes/data of the classification models 11 may be stored in a computer-readable medium, such as a traditional hard disk drive (HDD), a solid-state drive (SSD), or a cloud-storage drive. The processing module 12 has function of data processing. For example, the processing module 12 may be implemented by a central processing unit (CPU) or a graphics processing unit (GPU). Parameters of the classification models 11 are not identical (i.e.: a part of their parameters could be the same and another part of their parameters could be different, or the parameters of the classification models are entirely different from each other). For example, the parameters may include learning rates, weights, loss functions, activation functions, optimizers, and so on. Besides, training samples for the classification models 11 are not identical (i.e.: a part of their training samples could be the same and another part of their training samples could be different, or the training samples of the classification models are entirely different from each other). As a result, the classification models 11 respectively have different classifying specialties. The processing module 12 is connected to the classification models 11 to collaborate with the classification models 11. Namely, the processing module 12 and the classification models 11 form an abnormality decision configuration of multi-model ensemble classification.
Therefore, the processing module 12 receives a test image 21 from the image capturing device 20, and outputs the test image 21 to the classification models 11 to respectively obtain multiple feature vectors of test images V from the classification models 11, and to generate an abnormality assessment information 121 according to the feature vectors of test images V. The abnormality assessment information 121 may indicate a condition, such as high risk, low risk, or non-risk (normal). In an embodiment of the present invention, the abnormality assessment information 121 may be a value quantized from a risk level. For example, the abnormality assessment information 121 may be numbers for respectively corresponding to different risk levels. Number 1 to number 5 respectively indicate a lower risk level to a higher risk level.
The training principle to the classification models 11 is described as follows. Each one of the classification models 11 is a model alternately and repeatedly trained by supervised learning and unsupervised learning. The computer-readable medium stores multiple training samples. The training samples include multiple normal-image samples as a data source for the unsupervised learning. Besides, the training samples include multiple abnormal-image samples with feature labels as a data source for the supervised learning, wherein the abnormal-image samples and the feature labels may correspond to different abnormal risk levels. The training samples for any two of the classification models 11 are not identical. Namely, in any two of the classification models 11, the normal-image samples for training one classification model 11 are not identical to the normal-image samples for training the other classification model 11, and the abnormal-image samples for training one classification model 11 are not identical to the abnormal-image samples for training the other classification model 11.
The abnormal-image samples may comprise at least one of real abnormal image data, open-source image data, and composite image data, but are not limited to the real abnormal image data, the open-source image data, and the composite image data. The real image data may be the original image files captured by the image capturing device 20, wherein the original image files have abnormal parts. The open-source image data may be image files obtained from open-source databases, and such image files are provided to aid machine learning for image features. The open-source databases may be food-101, Birdsnap, and so on. The composite image data may be image files processed by an image editing software. For example, the user can operate the image editing software to create an abnormal part for recognition in an image sample, or to superimpose an object of a foreign matter on the image sample. By doing so, the contents of the abnormal-image samples could be customized and diversified.
During a training procedure of the classification models 11, the processing module 12 sets file reading paths for the classification models 11 by program commands. For example, each one of the classification models 11 reads a part of the training samples stored in the computer-readable medium for training, wherein the part of the training samples can be randomly selected. Or, a particular part of the training samples can be selected for training one of the classification models 11. In other words, such part of the training samples is equivalent to a subset. Due to above-mentioned random selection for the training samples, during the training procedure of each one of the classification models 11, the classification model 11 may alternately and repeatedly read the normal-image samples and the abnormal-image samples with their feature labels. In addition, the training samples for training any two of the classification models 11 are not identical. The purpose that each one of the classification models 11 is alternately and repeatedly trained by the supervised learning and the unsupervised learning is implemented.
In addition, by “classification” as a technique to extract features from data, when a normal-image sample is inputted into the classification model 11, the output data of the classification model 11 is a feature vector of training. The feature vector of training reflects a feature of the normal-image sample recognized by the classification model 11. Similarly, when an abnormal-image sample with its feature label is inputted into the classification model 11, the output data of the classification model 11 is another feature vector of training. Said another feature vector of training reflects a feature, such as an abnormal feature, of the abnormal-image sample recognized by the classification model 11. Hence, when the classification models 11 complete the training, the classification models 11 respectively generate multiple feature vectors of training. With reference to
The supervised learning and the unsupervised learning are alternately and repeatedly adopted for training in the present invention, such as in sequence of the supervised learning, the unsupervised learning, the supervised learning, the unsupervised learning, and so on. To facilitate understanding, after the training, the feature vectors of training Vt generated by the classification models 11 could be referred to the low-dimensional space distribution as shown in
In order to define the regularity of the feature vectors of training Vt, the feature vectors of training Vt should be processed by vector quantization to be values, and then the regularity is determined. In the embodiment of the present invention, as shown in
As mentioned above, the classification models 11 respectively have different classifying specialties. The processing module12 defines weight values to the classification models 11 respectively, such that each one of the classification models 11 has a corresponding weight value for indicating the importance of the classification model 11. When the processing module 12 receives the test image 21 from the image capturing device 20, the processing module 12 outputs the test image 21 to the classification models 11. Each one of the classification models 11 outputs one feature vector of test images V according to the test image 21. As a result, the processing module 12 may receive multiple feature vectors of test images V from the classification models 11 respectively. The processing module 12 generates multiple abnormality levels of the feature vectors of test images V by the discrimination mechanism M, wherein the information of one abnormality level is generated via the discrimination mechanism M from one feature vectors of test images V. Based on the classification models 11 respectively having different classifying specialties, it is to be understood that the abnormality level of the feature vectors of test images V generated by a part of the classification models 11 could be high risk, and the abnormality level of the feature vectors of test images V generated by another part of the classification models 11 could be low risk. Hence, the processing module 12 generates the abnormality assessment information 121 according to the weight values of the classification models 11 and the abnormality levels of the classification models 11.
As mentioned above, the abnormality assessment information 121 may be a value quantized from a risk level. For example, number 1 to number 5 respectively indicate a lower risk level to a higher risk level. The processing module 12 defines level “1” as low risk, and defines level “5” as high risk. The abnormality assessment information 121 generated by the processing module 12 may be “1” when the results determined by most of the classification models 11 or the classification models 11 with higher weight values is low risk. The rest may be deduced by analogy. The abnormality assessment information 121 generated by the processing module 12 may be “5” when the results determined by most of the classification models 11 or the classification models 11 with higher weight values are high risk.
With reference to
In the above description, the processing module 12 may transmit the test image 21 to a convolutional neural network to compute, and receives a feature map from the convolutional neural network via a class activation mapping (CAM). The feature map is to be the risk indicating information 122 or the visualized segmentations 123. Said class activation mapping (CAM) can be GradCAM, GradCAM++, or Score-CAM that are conventional arts and are not described in detail herein.
In summary,
In one embodiment of the present invention, the processing module 12 reads the feature vectors of training Vt from the data module 13. The feature vectors of training Vt are data generated by the classification models 11 during the training procedure. The processing module 12 performs a space clustering based on the feature vectors of training Vt to form multiple feature clusters 50, so as to quantize the feature vectors of training Vt as multiple score values and generate a discrimination mechanism M via the linear regression based on the score values to generate the abnormality assessment information 121.
In one embodiment of the present invention, the processing module 12 defines weight values to the classification models 11 respectively. The processing module 12 generates multiple abnormality levels of the feature vectors of test images Vt by the discrimination mechanism M according to the weight values of the classification models 11 and the discrimination mechanism M. The processing module 12 generates the abnormality assessment information 121 according to the weight values of the classification models 11 and the abnormality levels.
In one embodiment of the present invention, the processing module 12 sets the risk indicating information 122 according to the abnormality assessment information 121, and superimposing the risk indicating information 122 on the test image 21 to be transmitted to the display device 60 for displaying.
In one embodiment of the present invention, in the step of setting the risk indicating information 122, the processing module 12 transmits the test image 21 to a convolutional neural network, and receives a feature map from the convolutional neural network via a class activation mapping (CAM) to be the risk indicating information 122.
In one embodiment of the present invention, in the step of superimposing the risk indicating information 122 on the test image 21 to be transmitted to the display device 60 for displaying, the visualized segmentation 123 is performed on the abnormality part in the test image 21 by the processing module 12, and the risk indicating information 122 is displayed at the position of the visualized segmentation 123.
In one embodiment of the present invention, each one of the classification models 11 is alternately and repeatedly trained by the supervised learning and the unsupervised learning. During the training procedure, the normal-image samples are adopted by the supervised learning to train each one of the classification models 11, and multiple abnormal-image samples are adopted by the unsupervised learning to train each one of the classification models 11. The normal-image samples for training one of the classification models 11 are not identical to the normal-image samples for training another one of the classification models 11. The abnormal-image samples for training one of the classification models 11 are not identical to the abnormal-image samples for training another one of the classification models 11.
In one embodiment of the present invention, each one of the classification models 11 is an artificial intelligence model. The abnormal-image samples comprise at least one of real abnormal image data, open-source image data, and composite image data.
In conclusion, each one of the classification models 11 is alternately trained by the supervised learning and the unsupervised learning, so as to have the characteristics of both the supervised learning and the unsupervised learning. The classification models 11 respectively have different classifying specialties. The abnormality assessment information 121 generated by the present invention may indicate known abnormalities and unknown abnormalities. Especially, the abnormality part in the test image 21 is visualized in a much better way, and the risk is marked accordingly. The practicability of the present invention is significantly enhanced.
The above details only a few embodiments of the present invention, rather than imposing any forms of limitation to the present invention. Any professionals in related fields of expertise relating to the present invention, within the limitations of what is claimed, are free to make equivalent adjustments regarding the embodiments mentioned above. However, any simple adjustments and equivalent changes made without deviating from the present invention would be encompassed by what is claimed for the present invention.
Number | Date | Country | Kind |
---|---|---|---|
110141070 | Nov 2021 | TW | national |