The present disclosure relates to the technical field of machine learning, and in particular to a later-fusion multiple kernel clustering machine learning method and system based on proxy graph improvement.
Aiming at dividing unlabeled data into several unrelated classes, clustering plays an important role in machine learning and data analysis. In the era of big data, data may come from multiple sources, known as multi-view data. Methods for clustering multi-view data are called multi-view clustering algorithms. Multi-kernel clustering algorithms are an important branch of multi-view clustering, with the aim of making full use of a series of predefined basis kernels to improve clustering performance.
Existing multi-kernel clustering algorithms can be roughly divided into two categories, i.e. early fusion and later fusion, according to the timing of fusion. Early fusion refers to a fusion of several kernel matrices before performing a kernel k-means algorithm. Specifically, a method of regularization term induced by matrix (X. Liu, Y Dou, J. Yin, et al., Multiple Kernel K-means Clustering with Matrix-induced Regularization in AAAI 2016, pp. 1888-1894) can be used to adaptively adjust a kernel coefficient according to the similarity of kernel matrices, such that the redundancy of similar information is avoided, and the quality of the optimal kernel matrix is accordingly improved. A method for maintaining the local structure of a kernel (M. Gonen and A. A. Margolin, Localized Data Fusion for Kernel K-means Clustering with Application to Cancer Biology in Neur IPS 2014, pp. 1305-1313) can also improve the effects of algorithms.
For later fusion of multi-kernel clustering first, the kernel k-means algorithm is first performed for basis kernel matrices to obtain basic divisions, and then these basic divisions are fused. A later fusion algorithm based on maximum alignment (S. Wang, X. Liu, E. Zhu, et al., Multi-view Clustering via Late Fusion Alignment Maximization in IJCAI 2019, pp. 3778-3784) makes use of permutation matrices to achieve an alignment effect of the basic divisions, and then combines the basic divisions. A later fusion method proposed by Liu et al. (X. Liu, M. Li, C. Tang, et al., Efficient and Effective Regularized Incomplete Multi-view Clustering in T-PAMI 2020) is capable of handling incomplete view data and obtaining a good clustering effect.
Compared with the early fusion, the later fusion features very low computational and storage complexities, and relatively desirable clustering performance. However, existing later fusion clustering algorithms still have the following deficiencies: first, a clustering process of the basis kernel is separated from a later fusion process of the basic divisions, in which case, the quality of the basic divisions has a great impact on the performance of the final clustering, and outliers and noises therein, if any, will result in an unsatisfactory clustering effect. Second, existing methods simply take a consistency division as a linear transformation of the basic divisions, making it difficult to be applied to the field of multi-kernel data in reality.
In order to overcome deficiencies in the prior art, the present disclosure provides a later-fusion multiple kernel clustering machine learning method and system based on proxy graph improvement.
In order to achieve the above objective, the present disclosure adopts the following technical solution:
Further, the objective function of kernel k-means clustering constructed in the step S3 is expressed as:
Assuming <ϕ(xi),ϕ(xj)>=Kij, where Kij represents elements of a kernel matrix K, then Equation (1) is expressed as:
Assuming
then Equation (2) is expressed as:
Further, the objective function constructed in the step S3 is expressed as:
Further, the objective function constructed in the step S3 is cyclically solved in the step S4, specifically:
S41. fixing S and optimizing {Hi}i=1m, being expressed as:
Equation (8) is solved through the steps S421 and S422:
Using a derivative 0 to obtain a closed-form solution
Obtaining a closed-form solution:
where Sj,: represents a jth column of a matrix S, αj represents an intermediate variable used for solution; Ŝj,: represents a jth column of Ŝ; and Ŝj,:T represents a transpose of Ŝj,:.
Further, the objective function constructed in the step S3 is cyclically solved, with a cycle termination condition being expressed as:
Correspondingly, the present disclosure further provides a later-fusion multiple kernel clustering machine learning system based on proxy graph improvement, including:
Further, the objective function of kernel k-means clustering in the construction module is expressed as:
assuming <ϕ(xi),ϕ(xj)>=Kij, where Kij represents elements of a kernel matrix K, then Equation (1) is expressed as:
then Equation (2) is expressed as:
Further, the objective function constructed in the construction module is expressed as:
Further, the constructed objective function is cyclically solved in the solution module, specifically:
Solving Equation (8):
Obtaining a closed-form solution:
Further, the constructed objective function is cyclically solved, with a cycle termination condition being expressed as:
Compared with the prior art, the present disclosure provides a novel later-fusion multiple kernel clustering machine learning method based on proxy graph improvement, and the method includes a basic division acquisition module, a proxy graph construction module, a basic division improvement module through the proxy graph, a spectral clustering module through the proxy graph, and the like. By optimizing the basic division, an optimized basic division not only has information of a single kernel, but can also obtain global information by means of a proxy graph, which is more beneficial to fusing views, such that a learned proxy graph can better fuse information of each kernel matrix, thereby realizing an aim of improving a clustering effect. Results of experiments on six multi-kernel data sets prove that the performance of the present disclosure is better than those of existing methods.
The implementation of the present disclosure will be illustrated below in conjunction with specific embodiments. Those skilled in the art can easily understand other advantages and effects of the present disclosure from the content disclosed in the Specification. The present disclosure can also be implemented or applied through other different specific implementations, and various modifications or variations can be made to details in the specification based on different viewpoints and applications without departing from the spirit of the present disclosure. It should be noted that the embodiments below and features in the embodiments can be combined with each other, so long as they are not in conflict with each other.
In order to overcome defects of the prior art, the present disclosure provides a later-fusion multiple kernel clustering machine learning method and system based on proxy graph improvement.
This embodiment provides a later-fusion multiple kernel clustering machine learning method based on proxy graph improvement, as shown in
In the step S3, k-means clustering and graph improvement are run on each view corresponding to the acquisition of the clustering task and the target data sample, and an objective function is constructed by combining kernel k-means clustering and graph improvement methods.
A kernel k-means clustering objective function is expressed as follows: {xi}i=1n⊆ represents a data set consisting of n samples, assuming that a kernel function is κ(⋅, ⋅), and κ(x,x′)=<ϕ(x),ϕ(x′)> according to the nature of a reproducing kernel, where ϕ:x∈→ represents feature mapping that a sample x is projected to a reproducing kernel Hilbert space . ϕ(x) is substituted into an objective function of k-means clustering to obtain an objective function of the kernel k-means clustering, which is expressed as:
The kernel trick is used, assuming <ϕ(xi),ϕ(xj)>=Kij, where Kij represents elements of a kernel matrix K, then Equation (1) is expressed as:
An optimization about B in Equation (2) has been proved to be an NP-hardness problem, therefore, discrete constraints of B are transformed into real-valued orthogonal constraints, and assuming
Equation (2) is then expressed as:
In this embodiment, eigendecomposition can be performed on the kernel matrix K, and the optimal H is an eigenvector corresponding to the first k largest eigenvalues of the K.
The function improvement of the graph improvement part is specifically as follows: assuming that the basic division obtained by an ith running kernel k-means clustering is Hi, in order to obtain the global information from the basic division, the basic division can be adjusted by minimizing ∥Hi−SHi∥F2, where S is a graph matrix shared by all basis kernels, satisfying S≥0, S1=1, and elements on a diagonal are 0.
The “constructing an objective function by combining kernel k-means clustering and graph improvement methods” is expressed as:
Since Equation (4) can make use of S for adjusting the Hi, the algorithm is named “later-fusion multiple kernel clustering based on proxy graph improvement”.
In the step S4, the objective function constructed in the step S3 is cyclically solved so as to obtain a graph matrix, which is fused with basic kernel information.
The objective function can be solved using the following two-step iterative method, specifically:
Using a derivative 0 to obtain a closed-form solution
Obtaining a closed-form solution:
A termination condition of the above two-step (the steps S41 and S42) alternating method is expressed as:
In the step S5, spectral clustering is performed on the obtained graph matrix, so as to obtain a final clustering result.
A standard spectral clustering algorithm is performed on the outputted graph matrix S to obtain the final clustering result.
This embodiment provides a novel later-fusion multiple kernel clustering machine learning method based on proxy graph improvement, and the method includes a basic division acquisition module, a proxy graph construction module, a basic division improvement module through the proxy graph, a spectral clustering module through the proxy graph, and the like. By optimizing the basic division, an optimized basic division not only has information of a single kernel, but can also obtain global information by means of a proxy graph, which is more beneficial to fusing views, such that a learned proxy graph can better fuse information of each kernel matrix, thereby realizing an aim of improving a clustering effect.
A later-fusion multiple kernel clustering machine learning method based on proxy graph improvement provided in this embodiment is distinguished from that in Embodiment 1 by:
In this embodiment, the clustering performance of the method of the present disclosure is tested on six MKL standard data sets.
The six MKL standard data sets are AR10P, YALE, Protein fold prediction, Oxford Flower17, Nonplant and Oxford Flower102. Information of the data sets is illustrated in Table 1.
For ProteinFold, this embodiment generates 12 benchmark kernel matrices, in which the first 10 feature sets adopt second-order polynomial kernels, and the last two adopt cosine inner product kernels. Kernel matrices of other data sets can be publicly downloaded from the Internet.
This embodiment adopts such algorithms as a best single-view kernel k-means clustering (BSKM), a multiple kernel k-means clustering (MKKM), a co-regularized spectral clustering (CRSC), a robust multiple kernel k-means clustering (RMKKM), a robust multi-view spectral clustering (RMSC), a multiple kernel k-means with matrix-induced regularization (MKMR), a multiple kernel clustering with local kernel alignment maximization (MKAM), a multi-view clustering via later fusion alignment maximization (MLFA) and a flexible multi-view representation learning for subspace clustering (FMR). In all experiments, all basis kernels are first centered and regularized. For all datasets, the number of classes is assumed to be known and set to be the number of cluster classes. Parameters of the compared algorithms adopted in this experiment are all set according to the corresponding literature. Parameters λ and β of the method are chosen from [2−2, 2−1, . . . , 22] through grid search.
The embodiment adopts widely used clustering accuracy (ACC), normalized mutual information (NMI) and purity to evaluate the clustering performance of each algorithm. For all algorithms, each experiment is repeated for 50 times with random initialization, and the best result is reported to reduce the effect of randomness caused by k-means.
Table 2 shows clustering effects of the above method and the compared algorithms on the six data sets of different algorithms. It can be concluded from the above table that: 1. the proposed algorithm is superior to all compared algorithms under the three evaluation criteria; and 2. the proposed algorithm outperforms the second-best compared algorithm by 4.92%, 1.21%, 2.16%, 2.12%, 6.85% and 4.05% on ACC of the six data sets, respectively.
This embodiment also presents changes in the objective function of each iteration, as shown in
The experimental results of this embodiment on six multi-kernel data sets prove that the performance of the present disclosure is better than those of existing methods.
This embodiment provides a later-fusion multiple kernel clustering machine learning system based on proxy graph improvement, including:
Further, the objective function of kernel k-means clustering in the construction module is expressed as:
Assuming <ϕ(xi),ϕ(xj)>=Kij, where Kij represents elements of a kernel matrix K, then Equation (1) is expressed as:
then Equation (2) is expressed as:
Further, the objective function constructed in the construction module is expressed as:
Further, the constructed objective function is cyclically solved in the solution module, specifically:
Solving Equation (8):
Using a derivative 0 to obtain a closed-form solution
Obtaining a closed-form solution:
Further, the constructed objective function is cyclically solved, with a cycle termination condition being expressed as:
It should be noted that the later-fusion multiple kernel clustering machine learning system based on proxy graph improvement provided in this embodiment is similar to that in Embodiment 1, and will not be described in detail here.
The system provided in the present disclosure includes a basic division acquisition module, a proxy graph construction module, a basic division improvement module through the proxy graph, a spectral clustering module through the proxy graph, and the like. By optimizing the basic division, an optimized basic division not only has information of a single kernel, but can also obtain global information by means of a proxy graph, which is more beneficial to fusing views, such that a learned proxy graph can better fuse information of each kernel matrix, thereby realizing an aim of improving a clustering effect.
It should be noted that what is described above is merely illustrative of the preferred embodiments of the present disclosure and the technical principles applied. Those skilled in the art will understand that the present disclosure is not limited to the particular embodiments described herein, and various obvious changes, readjustments and substitutions may be made by those skilled in the art without departing from the scope of protection of the present disclosure. Therefore, although the present disclosure has been described in greater detail by way of the above embodiments, the present disclosure is not limited to the above embodiments and may include many other equivalent embodiments without departing from the concept of the present disclosure, the scope of the present disclosure is determined by the scope of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
202110607669.7 | Jun 2021 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2022/095836 | 5/30/2022 | WO |