This patent application claims the benefit and priority of Chinese Patent Application No. 202210506022.X, filed with the China National Intellectual Property Administration on May 9, 2022, the disclosure of which is incorporated by reference herein in its entirety as part of the present application.
The present disclosure relates to image processing in transmission line inspection, and in particular, to a transmission line defect identification method based on a saliency map and a semantic-embedded feature pyramid.
In industrial practice, a common method for identifying a defect of a transmission line quickly and economically is to use an image classification model to classify an image output after target detection performed on the transmission line, so as to determine whether a component is faulty. However, in practice, most of target images of the transmission line are non-defective images, and cannot be used to train a supervised learning model for defect identification and analysis. Such an imbalance between defective and non-defective samples is referred to as a long tail effect. In order to resolve this problem, data levels can be augmented, and resampling can be performed to generate a new sample for a category with insufficient samples. This method will lead to oversampling of a few samples, thereby causing model overfitting and directly lowering performance of a feature extraction model.
To resolve a defect identification problem of the transmission line, the present disclosure provides a transmission line defect identification method based on a saliency map and a semantic-embedded feature pyramid. Innovation and technological contributions of this method are mainly reflected in the following aspects:
Experimental results show that the model proposed in the present disclosure has high accuracy, strong robustness, and a high defect recall ratio, and can better resolve the defect identification problem of the transmission line.
The present disclosure is intended to provide a transmission line defect identification method based on a saliency map and a semantic-embedded feature pyramid. The method generates a super-resolution image for a small target of a transmission line by using an Electric Line-Enhanced Super-Resolution Generative Adversarial Network (EL-ESRGAN) model, performs image saliency detection on a defect dataset based on a saliency map by constructing a nested U-shaped network, performs data augmentation on the defect dataset based on the saliency map by using GridMask and random cutout algorithms, and performs defect identification on a target image of the transmission line based on a ResNet34 classification algorithm by constructing a DSE-based feature pyramid classification network.
The present disclosure provides a transmission line defect identification method based on a saliency map and a semantic-embedded feature pyramid, including the following steps:
As an optional implementation solution of the present solution, the performing image super-resolution expansion by using an EL-ESRGAN algorithm in the step 2) specifically includes:
where LGRa represents the GAN loss function of the generator, LDRa represents the GAN loss function of the discriminator, DRa(xr,xf) represents a probability that an authenticated image is more real than a false image, DRa(xf,xr) represents a probability that the authenticated image is falser than a real image, Ex
As an optional implementation solution of the present disclosure, the step 3) specifically includes:
As an optional implementation solution of the present disclosure, the DSE-based feature pyramid classification network used in the step 4) includes:
As an optional implementation solution of the present disclosure, the dataset used in the step 1) is an insulator self-explosion dataset of the transmission line.
According to a specific implementation solution of the present disclosure, the DSE-based feature pyramid classification network is obtained through training by using an insulator image training set, of the transmission line, constructed in the step 2), an insulator image test set, of the transmission line, constructed in the step 2) is used to test the classification network, and network classification accuracy and an F2-Score are used as evaluation indicators of a classification effect.
In an embodiment of the present disclosure, the step 5) specifically includes:
Compared with the prior art, the present disclosure has the following beneficial effects:
In order to describe the technical solutions in the embodiments of the present disclosure more clearly, the accompanying drawings required for describing the embodiments are briefly described below. Obviously, the accompanying drawings in the following description show merely some embodiments of the present disclosure, and a person of ordinary skill in the art can further derive other accompanying drawings from these accompanying drawings without creative efforts.
The technical solutions of the embodiments of the present disclosure are clearly and completely described below with reference to the accompanying drawings. Apparently, the described embodiments are merely a part rather than all of the embodiments of the present disclosure. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments of the present disclosure without creative efforts shall fall within the protection scope of the present disclosure.
As shown in
In the step 2), a main structure of a generator G in an EL-ESRGAN is shown in
As shown in
In the step 3), the U2-Net is used to generate the saliency map of the image. A structure of the U2-Net is shown in
A structure of the RSU network is shown in
In
In the step 3), the saliency map of the U2-Net is used to guide image augmentation. A corresponding algorithm is implemented according to the following steps:
In the step 4), the DSE-based enhanced feature pyramid classification network is shown in
In the DFF module, feature processing of a high-level feature map is completed by two residual blocks and a bypass connection. A configuration of the residual block is shown in Table 1. After convolution, each layer is connected to one batch normalization layer and one ReLU activation layer of nonlinear transformation.
Feature processing of a low-level feature map has a similar structure to that of the high-level feature map, except that an atrous convolution residual block instead of the original residual block is used. A configuration of the atrous convolution residual block is shown in Table 2. After convolution, each layer is connected to one batch normalization layer and one ReLU activation layer of nonlinear transformation.
The ResNet34 is taken as a benchmark to carry out a defect elimination experiment for each module in the present disclosure. Experimental results are shown in
In defect identification of the transmission line, it is necessary to improve a recall rate on a premise of ensuring accuracy, so as to find faults as much as possible and reduce potential risks to transmission safety. Therefore, an F-Score is introduced as an evaluation indicator of measuring the accuracy and the recall rate, and is defined as follows:
In the present disclosure, accuracy, a recall rate, and an F-Score of each model are shown in Table 3:
In order to check all defects of the transmission line as much as possible and avoid a potential power failure risk, the F2-Score, which has a higher recall rate and more tends to check all potential risks as much as possible, is taken as the evaluation indicator. The DSE-based enhanced feature pyramid classification network proposed in the present disclosure can better find more potential risks.
Experimental results of data augmentation and defect elimination in the step 2) and the step 3) of the present disclosure are shown in Table 4:
It can be seen from Table 4 that the data augmentation method can improve the accuracy of the defect set more effectively, because the data augmentation method decouples more background factors from the identified target, and improves a defect set with low classification accuracy.
The foregoing embodiments are only used to explain the technical solutions of the present disclosure, and are not intended to limit the same. Although the present disclosure is described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still modify the technical solutions described in the foregoing embodiments, or make equivalent substitutions on some technical features therein. These modifications or substitutions do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present disclosure. The present disclosure is not limited to the above-mentioned optional implementations, and anyone can derive other products in various forms under the enlightenment of the present disclosure. The above-mentioned specific implementations should not be construed as limiting the protection scope of the present disclosure, and the protection scope of the present disclosure should be defined by the claims. Moreover, the description can be used to interpret the claims.
The preferred embodiments of the present disclosure disclosed above are only used to help illustrate the present disclosure. The preferred embodiments neither describe all the details in detail, nor limit the present disclosure to the specific implementations described. Obviously, many modifications and changes may be made based on the content of the present specification. In the present specification, these embodiments are selected and specifically described to better explain the principle and practical application of the present disclosure, so that a person skilled in the art can well understand and use the present disclosure. The present disclosure is only limited by the claims and a full scope and equivalents thereof.
Number | Name | Date | Kind |
---|---|---|---|
20190385057 | Litichever | Dec 2019 | A1 |
Number | Date | Country |
---|---|---|
108876788 | Nov 2018 | CN |
109145905 | Jan 2019 | CN |
109215020 | Jan 2019 | CN |
109559310 | Apr 2019 | CN |
111524135 | Aug 2020 | CN |
114170535 | Mar 2022 | CN |
Number | Date | Country | |
---|---|---|---|
20230360390 A1 | Nov 2023 | US |