The present application relates to the technical field of machine learning, and in particular to a late fusion multi-view clustering method and system based on local maximum alignment.
With the development of multi-source information collection technology, the collected data can be represented in various ways, for example, a video can have image data and sound data from different angles. Such data, in the field of machine learning, is referred to as multi-view data. The full and reasonable application of such data has always been an important topic in theoretical research and scientific practice. The clustering algorithm plays an important role in the field of unsupervised learning in machine learning, and aims to perform disjoint partition on unlabeled data. Clustering with multiple views can extract sample information from different angles, so that the clustering effect is better than that of a single view.
Multi-view clustering can be roughly classified into the following three types: i) Co-training multi-view clustering (A. Blum and T. Mitchell, “Combining labeled and unlabeled data with co-training”, in COLT 1998, pp. 92-100). This method, in addition to extracting information from each view, simultaneously seeks consistent clustering results across views. ii) Subspace clustering (X. Cao, C. Zhang, H. Fu, S. Liu, and H. Zhang, “Diversity-induced multi-view subspace clustering”, in CVPR 2015, pp. 586-594). This method aims to construct a consistent subspace through representation of different views to achieve the purpose of view fusion. iii) Multi-kernel clustering (M. Gönen and A. A. Margolin, “Localized data fusion for kernel kmeans clustering with application to cancer biology”, in NeurIPS 2014, pp. 1305-1313). The principle of this algorithm is to find the optimal combination coefficient of the base kernel by means of optimization, so as to achieve the purpose of improving the clustering effect.
The multi-kernel clustering algorithm in the above method has attracted much attention because of its strong interpretability and good effect. However, in the actual applications, the multi-kernel clustering algorithm has the following two disadvantages: first, the computational complexity and storage complexity is relatively high. Because several kernel matrices need to be stored and calculated, the space complexity of this type of algorithm is O(n{circumflex over ( )}2); the eigendecomposition of the kernel matrix is also required, resulting in a time complexity of O(n{circumflex over ( )}3). Secondly, a more complex optimization process increases the risk of getting trapped in a poor local optimum.
In order to overcome the above defects, the purposes of reducing complexity and simplifying optimization process are achieved. The late fusion multi-view clustering no longer uses the kernel matrix for fusion, but fuses more lightweight basic partitions. The late fusion multi-view clustering based on maximum alignment (S. Wang, X. Liu, E. Zhu, et al., “Multi-view clustering via late fusion alignment maximization”, in IJCAI 2019, pp. 3778-3784) not only reduces the computational complexity from O(n{circumflex over ( )}3) to O(n), but also further improves the clustering effect. The efficient and effective regularized incomplete multi-view clustering algorithm (Liu X, Li M, Tang C, et al., “Efficient and Effective Regularized Incomplete Multi-view Clustering”, in TPAMI, 2020, preprint) uses the late fusion method to process the incomplete multi-view clustering problem, so that the clustering effect exceeds the same type of algorithm, and lower computational complexity is achieved. However, this method does not take into account the local structure of the data. At present, there is no method that can integrate the two advantages of fast operation speed and local data structure of late fusion.
For the defects of the prior art, an objective of the present application is to provide a late fusion multi-view clustering method and system based on local maximum alignment.
In order to achieve the above objective, the present application uses the following technical solutions.
A late fusion multi-view clustering method based on local maximum alignment includes the following steps:
Further, the kernel k-means clustering in the step S2 is represented as:
where H∈Rn×k represents a partition matrix solved according to the kernel matrix K; Im represents an identity matrix with a dimension of m(∈N+); HT represents the permutation of H; and Ik represents a k-dimensional identity matrix.
Further, the calculating basic partition of each view in the step S3 specifically includes: constructing different kernel matrices {Kp}p=1m for different views, and operating kernel k-means clustering to obtain the basic partition {Hp}p=1m of each view.
Further, the establishing a late fusion multi-view clustering objective function based on maximum alignment in the step S3 is represented as:
where F represents an optimized optimal partition; β represents a vector formed by the combination coefficients of each view, βp represents a coefficient of the pth view, and {Wp}p=1m represents a permutation matrix of each view; m represents average partition obtained by performing kernel k-means clustering on the average kernel; FT represents a permutation of F; WT represents a permutation of W; Hp represents the basic partition of each view obtained by kernel k mean clustering; and m represents the number of views.
Further, the establishing a late fusion multi-view clustering objective function based on local maximum alignment in the step S4 is represented as:
where Ap(i) represents an indicator matrix of τ neighbors in sample i in the pth view, that is, a neighbor matrix of each view; n represents the number of samples; {tilde over (H)}p(i) represents a basic partition matrix with the ith sample local information in the pth view; {Wp}p=1m represents a permutation matrix of each view; λ represents a regularization parameter; {tilde over (M)}i represents an average partition matrix with the ith sample local information; and (Ap(i))T represents a permutation of Ap(i).
Further, the solving the established late fusion multi-view clustering objective function based on local maximum alignment in a cyclic manner in the step S5 specifically includes:
Further, in the step S5, the established late fusion multi-view clustering objective function based on local maximum alignment is solved in a cyclic manner, a termination condition of the circulation is represented as:
(obj(t-1)−obj(t)/obj(t)≤ε
where obj(t-1) and obj(t) represent values of the objective function for the tth iteration and t−1th iteration; and ε represents the set precision.
Correspondingly, further provided is a late fusion multi-view clustering system based on local maximum alignment, which includes:
Further, the establishing a late fusion multi-view clustering objective function based on maximum alignment in the first establishment module is represented as:
where F represents an optimized optimal partition; β represents a vector formed by the combination coefficients of each view, βp represents a coefficient of the pth view, and {Wp}p=1m represents a permutation matrix of each view; m represents average partition obtained by performing kernel k-means clustering on the average kernel; FT represents a permutation of F; WT represents a permutation of W; Hp represents the basic partition of each view obtained by kernel k mean clustering; and m represents the number of views.
Further, the establishing a late fusion multi-view clustering objective function based on local maximum alignment in the second establishment module is represented as:
where Ap(i) represents an indicator matrix of τ neighbors in sample i in the pth view, that is, a neighbor matrix of each view; n represents the number of samples; {tilde over (H)}p(i) represents a basic partition matrix with the ith sample local information in the pth view; {Wp}p=1m represents a permutation matrix of each view; λ represents a regularization parameter; {tilde over (M)}i represents an average partition matrix with the ith sample local information; and (Ap(i))T represents a permutation of Ap(i).
Compared with the prior art, the present application provides a novel late fusion multi-view clustering machine learning method based on local maximum alignment, and the method includes acquiring a neighbor matrix and basic partition of each view, and constructing an objective function by using local information of each view. Then, an optimal partition matrix with a local structure is learned through optimization, and therefore the purpose of improving the clustering effect is achieved. Meanwhile, the present application can also solve the clustering problem on large-scale data. Experimental results on 8 multi-kernel datasets (including 6 benchmark datasets and 2 large-scale datasets) demonstrated superior performance of the present application over existing methods.
The following describes the embodiments of the present application by specific examples, and other advantages and effects of the present application will be readily apparent to those skilled in the art from the disclosure of the present application. The present application can also be implemented or applied through other different specific embodiments, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present application. It should be noted that the following embodiments and features in the embodiments can be combined with each other without conflict.
For the defects of the prior art, an objective of the present application is to provide a late fusion multi-view clustering method and system based on local maximum alignment.
This embodiment provides a late fusion multi-view clustering method based on local maximum alignment, as shown in
According to the late fusion multi-view clustering method based on local maximum alignment, the basic partition matrix has local clustering structure information, so that the optimal partition obtained through learning has a better clustering structure.
In the step S2, a permutation matrix of each view and a combination coefficient of each view are initialized, and average partition of kernel k-means clustering is performed on an average kernel to obtain a neighbor matrix of each view.
The permutation matrix of each matrix is set as {Wp}p=1m, the combination coefficient of each view is set as β, the average partition of kernel k-means clustering performed on an average kernel is set as M, a neighbor matrix of each view is set as Ap(i), and the above data is initialized.
In this embodiment, the basic partition is first obtained by kernel k-means clustering. Assuming that a sample set is X={x1, . . . , xn}⊆χ, where χ is the sample space. A kernel function is set as κ:χ×χ→R, a corresponding kernel matrix K∈Rn×n is obtained, and the element in this matrix Kij=κ(xi, xj). The objective formula of kernel k-means clustering is as follows:
where H∈Rn×k represents a partition matrix solved according to the kernel matrix K; Im represents an identity matrix with a dimension of m(∈N+); HT represents the permutation of H; and Ik represents a k-dimensional identity matrix. The above formula can be solved by performing eigendecomposition on K, and the solution is the eigenvector corresponding to K maximum eigenvalues before K.
In the step S3, the basic partition of each view is calculated, and a late fusion multi-view clustering objective function based on maximum alignment is established.
In this embodiment, different kernel matrices {Kp}p=1m can be constructed for different views, and kernel k-means clustering is performed to obtain the basic partition {Hp}p=1m of each view. The late fusion multi-view clustering objective function based on maximum alignment is as follows:
where F represents an optimized optimal partition; β represents a vector formed by the combination coefficients of each view, βp represents a coefficient of the pth view, and {Wp}p=1m represents a permutation matrix of each view; m represents average partition obtained by performing kernel k-means clustering on the average kernel; FT represents a permutation of F; WT represents a permutation of W; Hp represents the basic partition of each view obtained by kernel k mean clustering; and m represents the number of views.
The optimization of F can be obtained by performing economic singular value decomposition on X+λM and taking the product of left and right singular value vectors; the optimization of β can be obtained by using the condition that the equal sign of the Cauchy-Bunyakovsky-Schwarz inequality is established; and the optimization the Wp can be obtained by performing singular value decomposition on the FTHp and taking the product of the left and the right singular value vectors.
In the step S4, basic partition having local information is obtained, and a late fusion multi-view clustering objective function based on local maximum alignment is established by combining the neighbor matrix of each view and the step S3.
The basic partition used in the method in the step S3 only has the global clustering structure of each view, and ignores the local clustering structure. This embodiment has matrix Ap(i)∈{0,1}n×n representing an indicator matrix of whether the pth view is τ neighbor in sample i. Accordingly, a basic partition matrix {tilde over (H)}p(i)=(Ap(i))THp having the ith sample local information in the pth view and an average partition matrix {tilde over (M)}i=(Ap(i))TM with the ith sample local information can be defined, where M is the average partition obtained by performing kernel k-means clustering on the average kernel.
The late fusion multi-view clustering objective function based on local maximum alignment is as follows:
where Ap(i) represents an indicator matrix of τ neighbors in sample i in the pth view, that is, a neighbor matrix of each view; n represents the number of samples; {tilde over (H)}p(i) represents a basic partition matrix with the ith sample local information in the pth view; {Wp}p=1m represents a permutation matrix of each view; λ represents a regularization parameter; {tilde over (M)}i represents an average partition matrix with the ith sample local information; and (Ap(i))T represents a permutation of Ap(i).
In the step S5, the established late fusion multi-view clustering objective function based on local maximum alignment is solved in a cyclic manner to obtain optimal partition after fusing each basic partition.
In this embodiment, a three-step alternating optimization method is used to solve the objective function in the step S4, which specifically includes:
The termination condition of the alternating method of steps A1-A3 is represented as:
(obj(t-1)−obj(t)/obj(t)≤ε
where obj(t-1) and obj(t) represent values of the objective function for the tth iteration and t−1th iteration; and ε represents the set precision.
In the step S6, k-means clustering is performed on the optimal partition to obtain a clustering result. The obtained partition is a variable F in the objective function in the step S4, and each row of F is regarded as a sample, and k-means clustering is performed on the sample to obtain a final clustering result.
This embodiment includes acquiring a neighbor matrix and basic partition of each view, constructing an objective function by using local information of each view, and then learning an optimal partition matrix with a local structure through optimization; therefore the purpose of improving the clustering effect is achieved.
The late fusion multi-view clustering method based on local maximum alignment provided by this embodiment is different from Embodiment 1 in that:
The image datasets include a face image dataset, a plant image dataset, a handwritten Arabic numeral image dataset, a medical image dataset, an object behavior and action posture, business order data, mass order grouping, order wave order combination, order data mining and analysis, inventory allocation, goods shelf adjustment, supply chain optimization, intelligent replenishment, and the like.
This embodiment takes a face as an example for explanation.
The clustering performance of this method is tested on 6 multi-kernel standard datasets (including 5 benchmark datasets and 1 large-scale dataset).
The 6 multi-kernel standard datasets include AR10P, YALE, Plant, Caltech102-30 (Cal102-30 for short), Flower17, and Mnist. AR10P is a database of face images, where each person has photos taken in different situations such as facial expressions, lighting, or disguise. YALE faces contain 165 pictures from 15 people, each person's photos are taken in different facial expressions, postures, or lighting conditions. Plant and Flower17 are datasets of plant images. Caltech102 is a dataset composed of 102 different types of item photos. 30 samples are selected from each category as a training set that is denoted as Caltech102-30. Mnist is a large-scale dataset that contains 60000 handwritten Arabic numeral images to validate the performance of the algorithm on large-scale datasets. Table 1 shows relevant information on the dataset. The kernel matrices of all datasets can be downloaded from the internet.
In this experiment, an average multi-kernel k-means clustering algorithm (AMKKM), an optimal single-view kernel k-means clustering algorithm (SB-KKM), a multi-kernel k-means clustering (MKKM), a collaborative regularization spectral clustering (CRSC), a robust multi-kernel clustering (RMKKM), a robust multi-view spectral clustering (RMSC), a local multi-kernel k-means clustering (LMKKM), a multi-kernel k-means clustering with a matrix induction regularization term (MKKM-MR), and a multi-kernel clustering based on local kernel maximum alignment (LKAM) are used. In all experiments, all benchmark kernels are first centered and regularized. For all datasets, assuming that the number of categories is known and set as the number of clustering categories. The contrastive algorithm used in this experiment all set parameters according to the corresponding literature. The parameter λ of this method is determined by the range of grid search [2−5, 2−4, . . . , 25], and the parameter τ is determined by the range of grid search [0.1, 0.2, . . . , 1].
This experiment used common clustering accuracy (ACC) and normalized mutual information (NMI) to show the clustering performance of each method. All methods were randomly initialized and repeated 50 times and showed the optimal results to reduce the randomness caused by k-means.
Table 2 shows the clustering effect of this method (Proposed) and the contrastive algorithm on five benchmark datasets, and the notation “-” represents memory overflow, and the algorithm cannot run. It can be seen from this table that: 1. this method is superior to all contrastive algorithms under two evaluation criteria. 2. The performance of this method on six datasets ACC is respectively 12.31%, 2.58%, 4.58%, 3.86%, and 3.53% higher than that of the suboptimal contrastive algorithm. Table 3 shows the performance of this method on large scale datasets. It can be seen from Table 3 that, when many contrastive algorithms cannot run due to memory overflow, this method can not only run smoothly, but also obtain the significant effect. This demonstrates the effectiveness of this method on large-scale datasets.
This example also shows the variation of the objective function at each iteration, as shown in
This embodiment can solve the clustering problem on large-scale data. Experimental results on 7 multi-kernel image datasets (including 5 benchmark datasets and 1 large-scale dataset) demonstrated superior performance of this method over existing methods.
This embodiment further provides a late fusion multi-view clustering system based on local maximum alignment, which includes:
Further, the establishing a late fusion multi-view clustering objective function based on maximum alignment in the first establishment module is represented as:
where F represents an optimized optimal partition; β represents a vector formed by the combination coefficients of each view, βp represents a coefficient of the pth view, and {Wp}p=1m represents a permutation matrix of each view; m represents average partition obtained by performing kernel k-means clustering on the average kernel; FT represents a permutation of F; WT represents a permutation of W; Hp represents the basic partition of each view obtained by kernel k mean clustering; and m represents the number of views.
Further, the establishing a late fusion multi-view clustering objective function based on local maximum alignment in the second establishment module is represented as:
where Ap(i) represents an indicator matrix of τ neighbors in sample i in the pth view, that is, a neighbor matrix of each view; n represents the number of samples; {tilde over (H)}p(i) represents a basic partition matrix with the ith sample local information in the pth view; {Wp}p=1m represents a permutation matrix of each view; λ represents a regularization parameter; {tilde over (M)}i represents an average partition matrix with the ith sample local information; and (Ap(i))T represents a permutation of Ap(i).
It should be noted that the late fusion multi-view clustering system based on local maximum alignment provided in this embodiment is similar to Embodiment 1. Details are not described herein again.
This embodiment includes acquiring a neighbor matrix and basic partition of each view, constructing an objective function by using local information of each view, and then learning an optimal partition matrix with a local structure through optimization; therefore the purpose of improving the clustering effect is achieved.
It should be noted that the foregoing are merely some embodiments of the present application and applied technical principles. Those skilled in the art may understand that the present application is not limited to specific embodiments described herein, and those skilled in the art may make various significant changes, readjustments, and replacements without departing from the protection scope of the present application. Therefore, although the present application is described in detail by using the foregoing embodiments, the present application is not limited to the foregoing embodiments, and may further include more other equivalent embodiments without departing from the concept of the present application. The scope of the present application is determined by the scope of the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
202110706944.0 | Jun 2021 | CN | national |
202111326425.8 | Nov 2021 | CN | national |
This application is the national phase entry of International Application No. PCT/CN2022/098950, filed on Jun. 15, 2022, which is based upon and claims priority to Chinese Patent Application No. 202110706944.0, filed on Jun. 24, 2021; and Chinese Patent Application No. 202111326425.8, filed on Nov. 10, 2021, the entire contents of which are incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2022/098950 | 6/15/2022 | WO |