This application is based upon and claims priority to Chinese Patent Application No. 202210401847.5, filed on Apr. 18, 2022, the entire contents of which are incorporated herein by reference.
The present disclosure relates to the technical field of machine learning, and in particular to a method for constructing a support vector machine of a nonparallel structure.
A support vector machine (SVM) has been proposed by Vapnik, etc., and is suitable for pattern recognition and other fields. The SVM is characterized by considering both an empirical risk and a structural risk, that is, supervised learning is realized by finding a hyperplane that can ensure classification accuracy and maximize an interval between two types of data. With some desirable characteristics, such as kernel tricks, sparsity and global solutions, the SVM is widely used in remote sensing image classification because of its solid theoretical basis and desirable generalization. The premise of a support vector machine classification model is assuming that boundaries of positive and negative classes are parallel. However, for actual remote sensing data, the assumption is difficult to establish, which affects generalization ability of the model. Jayadeva, etc. have proposed a twin support vector machine (TWSVM) to solve the problem. The TWSVM aims to find a pair of nonparallel hyperplanes (a parallel state can be regarded as a special nonparallel state). Data points of each class are close to one of the two nonparallel hyperplanes and far away from the other. Classes of samples are determined by comparing distances between the samples and the two hyperplanes. The TWSVM is especially successful but still has obvious shortcomings: a TWSVM model only considers an empirical risk without a structural risk, and its generalization performance is affected, so that its classification effect is not as good as that of a traditional support vector machine in many cases. Therefore, the TWSVM is not effective in hyperspectral image classification directly. A new nonparallel vector machine algorithm is proposed herein, so as to further improve classification accuracy of hyperspectral images on the basis of the algorithm itself.
In view of the above situation, based on a traditional parallel support vector machine, a nonparallel support vector machine model is constructed herein, that is, an additional empirical risk minimization nonparallel support vector machine (AERM-NSVM), by adding a least square term of samples and an additional empirical risk minimization term, which is referred to as the patent method hereinafter.
An objective of the present disclosure is to provide a method for constructing a support vector machine of a nonparallel structure, and a new nonparallel vector machine algorithm is proposed to further improve classification accuracy of hyperspectral images on the basis of the algorithm itself, so as to obtain better classification performance.
To achieve the objective, the present disclosure provides the method for constructing a support vector machine of a nonparallel structure. The method includes:
Preferably, the preprocessing data in S1 specifically includes:
Preferably, the solving a Lagrange multiplier of a positive-class hyperplane in S2 specifically includes:
If P+ is obtained by Equation (7).
At this time, obtaining vectors α = (α1, ..., αm) and λ = (λ1, ..., λm+) of the Lagrange multiplier by means of a formula (8);
Preferably, the solving a Lagrange multiplier of a negative-class hyperplane in S3 specifically includes:
If P_ is obtained by Equation (15).
At this time, obtaining vectors θ = (θ1, ..., θm) and γ = (γ1, ..., γm_) of the Lagrange multiplier by means of a formula (16);
Preferably, the solving parameters of positive-class and negative-class hyperplanes in S4 specifically includes:
If the Lagrange multipliers are obtained by Equation (7), the solving parameters of positive-class and negative-class hyperplanes in S4 specifically includes:
Preferably, the determining a class of a new data point in S5 specifically includes:
Preferably, construction methods are conducted in a linear manner, and under the condition that the methods are used in a nonlinear case, expansion modes of the methods are consistent with that of a parallel support vector machine (SVM); and
according to description of a case of two classes, under the condition that the methods are used in a multi-class case, the expansion modes of the methods are consistent with that of the parallel SVM.
Therefore, through the method for constructing a support vector machine of a nonparallel structure of the present disclosure, a new nonparallel vector machine algorithm is proposed to further improve classification accuracy of hyperspectral images on the basis of the algorithm itself, so as to obtain better classification performance.
The technical solution of the present disclosure will be further described in detail below with reference to the accompanying drawings and the embodiments.
The technical solution of the present disclosure will be further described below with reference to the accompanying drawings and the embodiments.
The present disclosure provides a method for constructing a support vector machine of a nonparallel structure. The method includes:
The present disclosure provides a method for constructing a support vector machine of a nonparallel structure. The method includes:
In the present disclosure, construction methods are conducted in a linear manner, and under the condition that the methods are used in a nonlinear case, expansion modes of the methods are consistent with that of a parallel support vector machine (SVM); and according to description of a case of two classes, under the condition that the methods are used in a multi-class case, the expansion modes of the methods are consistent with that of the parallel SVM.
As shown in
To illustrate effectiveness of the present disclosure, the following experimental demonstration is conducted.
The Pavia Center data set is acquired by Reflective optics system imaging spectrometer (ROSIS) sensor in Pavia, northern Italy. The number of spectral bands in a center of Pavia is 102. The center of Pavia is a 1096× 1096 pixel image, which contains 9 classes. A sample division case of a training set test set is shown in Table 1:
Classification results are shown in
The Pavia Center data set has a large amount of data, and an equal amount of samples are taken for training. It may be seen from Table 2 that a twin support vector machine (TWSVM) in the Pavia Center data set has a classification result still slightly lower than that of SVM, and has a Kappa coefficient also lower than that of SVM. The method of the present disclosure has classification accuracy exceeding that of a standard SVM, and has a Kappa coefficient higher than that of SVM.
The Pavia University data set is acquired by ROSIS sensor in Pavia, northern Italy. The number of spectral bands in the Pavia University is 103. The Pavia University has a 610×610 pixel, which contains 9 classes. A sample division case of a training set test set is shown in Table 3:
Classification results are shown in
It may be seen from Table 4 that TWSVM in the Pavia University data set has classification accuracy higher than that of SVM, and has a Kappa coefficient similar to that of SVM, which is more suitable for a case of a nonparallel classification plane. Compared with the standard SVM and TWSVM, the method of the present disclosure has more excellent classification accuracy. The method of the present disclosure has accuracy that is 1.05% higher than that of the standard SVM, and has a Kappa coefficient that is 1.29% higher than that of the SVM. The method of the present disclosure has accuracy that is 0.95% higher than that of the standard SVM, and has a Kappa coefficient that is 1.16% higher than that of the SVM. It is indicated that the method of the present disclosure with structural risk minimization may achieve better results than TWSVM with only empirical risk minimization.
Therefore, through the method for constructing a support vector machine of a nonparallel structure of the present disclosure, a new nonparallel vector machine algorithm is proposed to further improve classification accuracy of hyperspectral images on the basis of the algorithm itself, so as to obtain better classification performance.
Finally, it should be noted that the above embodiments are merely used to describe the technical solution of the present disclosure, rather than limiting the same. Although the present disclosure has been described in detail with reference to the preferred embodiments, those of ordinary skill in the art should understand that the technical solution of the present disclosure may still be modified or equivalently replaced. However, these modifications or equivalent replacement cannot make the modified technical solution deviate from the spirit and scope of the technical solution of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202210401847.5 | Apr 2022 | CN | national |