The present application claims foreign priority of Chinese Patent Application No. 202010175032.0, filed on Mar. 13, 2020 in the State Intellectual Property Office of China, the disclosures of all of which are hereby incorporated by reference.
The disclosure relates to the field of smart manufacturing, and more particularly, to an inhomogeneous sample equalization method and system for a product assembly process.
With the rise of smart manufacturing, more and more manufacturing companies apply a machine learning algorithm to actual production to improve production efficiency and reduce labor cost consumption, such as establishing a model through machine learning in a product assembly process to predict production costs of products, required time, and the like. Initial sample data is an important part of the machine learning. Accuracy and homogeneity of the initial data may affect accuracy of a final result and generalization ability of the machine learning algorithm.
However, samples are inhomogeneous in an assembly process of same products with different styles. Since the same products with different styles may have subtle differences in function, these subtle differences may lead to different product assembly processes. One product has a main function and multiple additional functions, and the additional functions have different improvement effects on a sales volume of the product. For the additional functions capable of improving the sales volume of the product, manufacturers may design more styles for the additional functions with great improvement effects on the sales volume, so that products with more styles have the additional functions. For the additional functions with insignificant improvement effects on the sales volume, products with the additional functions have fewer styles, so that a model tends to be more in line with a characteristic of an assembly process of a product with large output, and a generalization ability of the model is insufficient. Moreover, for the assembly process of the same products with different styles, a strong relevance of internal data of the samples causes difficult homogenization of the samples.
The disclosure aims to provide an inhomogeneous sample equalization method and system for a product assembly process to solve the above problems.
In order to achieve the objectives, the disclosure employs the following technical solutions.
An inhomogeneous sample equalization method for a product assembly process takes an assembly process topological structure of a product as a sample, and takes assembly process topological structures of same products with different styles as different samples, wherein the method comprises the following steps of:
step A: calculating a similarity among different samples;
step B: constructing a fuzzy compatibility matrix S for representing the similarity among all the samples, constructing a fuzzy compatibility space X with different granule layers through the fuzzy compatibility matrix S, and clustering all samples through the fuzzy compatibility space X, wherein the fuzzy compatibility space X is divided into a plurality of different granule layers according to the similarity among the samples;
step C: based on a granular calculating mode, screening out a granule layer with a maximum comprehensive value of an information increment and the similarity among the samples from the fuzzy compatible space X to serve as an optimal granule layer; and
step D: carrying out equalization processing on a sample of the optimal granule layer.
An inhomogeneous sample equalization system for a product assembly process takes an assembly process topological structure of a product as a sample, and takes assembly process topological structures of same products with different styles as different samples, wherein the system includes:
a similarity generation module configured to calculate a similarity among different samples;
a fuzzy compatibility space construction module configured to construct a fuzzy compatibility matrix S for representing the similarity among all the samples, construct a fuzzy compatibility space X with different granule layers through the fuzzy compatibility matrix S, and cluster all samples through the fuzzy compatibility space X, wherein the fuzzy compatibility space X is divided into a plurality of different granule layers according to the similarity among the samples;
an optimal granule layer generation module configured to, based on a granular calculating mode, screen out a granule layer with a maximum comprehensive value of an information increment and the similarity among the samples from the fuzzy compatible space X to serve as an optimal granule layer; and
an equalization module configured to carry out equalization processing on a sample of the optimal granule layer.
According to the inhomogeneous sample equalization method for the product assembly process, equalization of inhomogeneous samples of the product assembly process is solved, and an accuracy of a final prediction result and a generalization ability of a model are improved; the similarity of the assembly process topological structures of the same products with different styles is also considered, and the samples are clustered from this point of view, so that a problem that the samples are not easy to be homogenized due to a strong relevance of internal data of the samples is solved, thus being more in line with a characteristic of such sample of the product assembly process, and making a final equalized result more scientific. By constructing the fuzzy compatibility space X, the samples may be clustered into different granule layers, and the samples may be observed and analyzed from multiple granule layers, so as to obtain the optimal granule layer C(λo), so that more representative and accurate sample granule may be obtained, and the number of the samples in each sample granule is homogenized from the optimal granule layer C(λo), so that an equalized effect is more representative.
The drawings further illustrate the disclosure, but the contents in the drawings do not constitute any limitation on the disclosure.
The technical solutions of the disclosure will be further described hereinafter with reference to the accompanying drawings and the specific implementations.
An inhomogeneous sample equalization method for a product assembly process according to the embodiment takes an assembly process topological structure of a product as a sample, and takes assembly process topological structures of same products with different styles as different samples. As shown in
step A: calculating a similarity among different samples;
step B: constructing a fuzzy compatibility matrix S for representing the similarity among all the samples, constructing a fuzzy compatibility space X with different granule layers through the fuzzy compatibility matrix S, and clustering all samples through the fuzzy compatibility space X, wherein the fuzzy compatibility space X is divided into a plurality of different granule layers according to the similarity among the samples;
step C: based on a granular calculating mode, screening out a granule layer with a maximum comprehensive value of an information increment and the similarity among the samples from the fuzzy compatible space X to serve as an optimal granule layer; and
step D: carrying out equalization processing on a sample of the optimal granule layer.
According to the inhomogeneous sample equalization method for the product assembly process, equalization of inhomogeneous samples of the product assembly process is solved, and an accuracy of a final prediction result and a generalization ability of a model are improved; the similarity of the assembly process topological structures of the same products with different styles is also considered, and the samples are clustered from this point of view, so that a problem that the samples are not easy to be homogenized due to a strong relevance of internal data of the samples is solved, thus being more in line with a characteristic of such sample of the product assembly process, and making a final equalized result more scientific. The same products with different styles may be mobile phones with different styles, such as a senior citizen mobile phone, a bezel-less display mobile phone, a curved display mobile phone, a three-camera mobile phone, a single-camera mobile phone, and the like.
According to the inhomogeneous sample equalization method for the product assembly process, by constructing the fuzzy compatibility space X, the samples may be clustered into different granule layers, and the samples may be observed and analyzed from multiple granule layers, so as to obtain the optimal granule layer C(λo), so that more representative and accurate sample granule may be obtained, and the number of the samples in each sample granule is homogenized from the optimal granule layer C(λo), so that an equalized effect is more representative.
Preferably, the step A specifically includes:
step A1: calculating a node similarity among different samples:
wherein i represents a same product of an ith type, j represents a same product of a jth type, and Snode(vi, vj) represents a node similarity between an assembly process topological structure vi the same product of the ith type and an assembly process topological structure vj of the same product of the jth type; mi, j represents a number of nodes matched in the assembly process topological structure vi and the assembly process topological structure vj; ei represents a sum of a number of all nodes in the assembly process topological structure vi; and ej represents a sum of a number of all nodes in the assembly process topological structure vj;
step A2: calculating a topological relation similarity among different samples:
wherein Srel(vi, vj) represents a topological relation similarity between the assembly process topological structure vi the same product of the ith type and the assembly process topological structure vj of the same product of the jth type; Mi, j represents a number of relation edges matched in the assembly process topological structure vi and the assembly process topological structure vi; Ei represents a sum of a number of all relation edges in the assembly process topological structure vi; and Ej represents a sum of a number of all relation edges in the assembly process topological structure vj; and
step A3: calculating a topological structure similarity among different samples:
S(i,j)=Snode(vi,vj)×Wnode+Srel(vi,vj)×Wrel,
wherein S(i, j) represents the topological structure similarity between the assembly process topological structure vi of the same product of the ith type and the assembly process topological structure vj of the same product of the jth type, Wnode is a preset node weight parameter, and Wrel is a preset relation edge weight parameter. Wnode represents a degree of influence of the nodes on a whole topological structure, and Wrel represents a degree of influence of the relation edges on the whole topological structure. Wnode and Wrel may be given artificially by a designer according to an importance of the process and an importance of the assembly process.
When the assembly process topological structure of each type of product is expressed, the assembly processes of the products with different styles are basically similar, the assembly of the products with different styles may have more processes, and more differences lie in different parameters of the products with different styles in the same assembly process. Therefore, a difference in the assembly process topological structures of the products with different styles mainly lies in the nodes. In the step A, a difference of the whole topological structure is calculated by comprehensively considering differences of the nodes and the topological relations in the assembly process topological structure of the product.
Preferably, the step B specifically includes:
step B1: constructing the fuzzy compatibility matrix S representing a similarity among all topological structures of a set V={v1, v2, v3, . . . , vn} of all the samples;
step B2: taking the fuzzy compatibility matrix S as an input and constructing the fuzzy compatibility space X with different granule layers by the following methods, and clustering all the samples through the fuzzy compatibility space X:
step B21: setting a threshold λ, wherein 1=λ1>λ2>λ3> . . . >λn=0, and when values of the threshold λ are respectively λ1, λ2, λ3, . . . , λn, calculating the similarity S(i, j) between the sample vi and the other sample vj in V={v1, v2, v3, . . . , vn}, wherein i∈(1,n); and
step B22: according to
obtaining all the samples meeting Sλ(i,j)=1 when λ=λi to construct sample granules Gi, wherein i=1, 2, 3, . . . , n, then constructing a corresponding granule layer C(λi) through all the sample granules Gi, and finally constructing the fuzzy compatible space X according to all the granule layers C(λi).
The larger the value of λi is, the smaller the number of samples in the sample granule Gi is, and the finer the granularity of the granule layer C(λi) is. The smaller the value of λi is, the larger the number of sample granules Gi in the granule layer C(λi) is, and the coarser the granularity of the granule layer C(λi) is.
Preferably, the step C specifically includes:
step C1: calculating a granularity of the granule layer C(λi), wherein i=1, 2, 3, . . . , n:
wherein Gi, k is a kth sample granule in the granule layer C(λi); |Gi, k| is a number of samples contained in the kth sample granule; log2(|Gi, k|) represents an amount of information needed to completely distinguish all granules in the sample granule Gi, k; and g represents a number of sample granules in the granule layer C(λi);
step C2: calculating an information increment IG[C(λi)] of the granule layer C(λi), wherein 1=1, 2, 3, . . . , n:
IG[C(λi)]=E[C(λi)]−E[C(λi-1)];
step C3: calculating an information increment and a comprehensive value Di of the similarity among the samples of the granule layer C(λi), wherein i=1, 2, 3, . . . , n:
Di=IG[C(λi)]*Wig+λiWλ;
wherein Wig is a weight of the information increment of the granule layer C(λi), and Wλ is a weight of a sample similarity threshold of the granule layer C(λi); and
step C4: screening out a granule layer with a maximum comprehensive value Di as an optimal granule layer C(λo).
The granularity of the granule layer C(λi) is regarded as an average amount of information needed to completely distinguish all sample granules in the granule layer. When the coarse granule layer C(λi) is converted to the fine granule layer C(λi-1), information gain may occur. The larger the information increment is, the more meaningful the conversion is during granule layer conversion. Meanwhile, a degree of similarity of the samples in the sample granule may also affect an effectiveness of final equalization. The larger the threshold λ is, the higher the similarity of the samples is, and the worse the equalization effect is. Therefore, it is necessary to comprehensively consider the information increment of the granule layer C(λi) and the similarity of the samples to comprehensively determine the optimal granule layer C(λo), which means that the optimal granule layer C(λo) should have a large information increment and a minimum similarity at the same time.
The weights Wig and Wλ are randomly selected from previous sample data, then determined samples of the optimal granule layer C(λo) are calculated, and the samples determined for the first time are trained with the following machine learning algorithm model, so as to obtain a result. Then, the weighs Wig and Wλ are adjusted according to the result, and iterated for many times to make a final result optimal, so as to obtain a final weight value.
Preferably, the step D specifically includes:
step D1: calculating an average number
step D2: increasing and decreasing a number of samples of each sample granule Gi in the optimal granule layer C(λo) by a random sampling method, so that the number of the samples in each sample granule Gi is the same to complete the equalization processing:
if |Gi, k|>
if |Gi, k|<
An inhomogeneous sample equalization system for a product assembly process according to the embodiment takes an assembly process topological structure of a product as a sample, and takes assembly process topological structures of same products with different styles as different samples, wherein the system includes:
a similarity generation module configured to calculate a similarity among different samples;
a fuzzy compatibility space construction module configured to construct a fuzzy compatibility matrix S for representing the similarity among all the samples, construct a fuzzy compatibility space X with different granule layers through the fuzzy compatibility matrix S, and cluster all samples through the fuzzy compatibility space X, wherein the fuzzy compatibility space X is divided into a plurality of different granule layers according to the similarity among the samples;
an optimal granule layer generation module configured to, based on a granular calculating mode, screen out a granule layer with a maximum comprehensive value of an information increment and the similarity among the samples from the fuzzy compatible space X to serve as an optimal granule layer; and
an equalization module configured to carry out equalization processing on a sample of the optimal granule layer.
According to the inhomogeneous sample equalization system for the product assembly process, equalization of inhomogeneous samples of the product assembly process is solved, and an accuracy of a final prediction result and a generalization ability of a model are improved; the similarity of the assembly process topological structures of the same products with different styles is also considered, and the samples are clustered from this point of view, so that a problem that the samples are not easy to be homogenized due to a strong relevance of internal data of the samples is solved, thus being more in line with a characteristic of such sample of the product assembly process, and making a final equalized result more scientific. The same products with different styles may be mobile phones with different styles, such as a senior citizen mobile phone, a bezel-less display mobile phone, a curved display mobile phone, a three-camera mobile phone, a single-camera mobile phone, and the like.
According to the inhomogeneous sample equalization system for the product assembly process, by constructing the fuzzy compatibility space X, the samples may be clustered into different granule layers, and the samples may be observed and analyzed from multiple granule layers, so as to obtain the optimal granule layer C(λo), so that more representative and accurate sample granule may be obtained, and the number of the samples in each sample granule is homogenized from the optimal granule layer C(λo), so that an equalized effect is more representative.
Preferably, the similarity generation module includes:
a node similarity generation sub-module configured to calculate a node similarity among different samples:
wherein i represents a same product of an ith type, j represents a same product of a jth type, and Snode(vi, vj) represents a node similarity between an assembly process topological structure vi of the same product of the ith type and an assembly process topological structure vj of the same product of the jth type; mi, j represents a number of nodes matched in the assembly process topological structure vi and the assembly process topological structure vj; ei represents a sum of a number of all nodes in the assembly process topological structure vi; and ej represents a sum of a number of all nodes in the assembly process topological structure vj;
a topological relation similarity generation sub-module configured to calculate a topological relation similarity among different samples:
wherein Srel(vi, vj) represents a topological relation similarity between the assembly process topological structure vi of the same product of the ith type and the assembly process topological structure vj of the same product of the jth type; Mi, j represents a number of relation edges matched in the assembly process topological structure vi and the assembly process topological structure vi; Ei represents a sum of a number of all relation edges in the assembly process topological structure vi; and Ej represents a sum of a number of all relation edges in the assembly process topological structure vj; and
a topological structure similarity generation sub-module configured to calculate a topological structure similarity among different samples:
S(i,j)=Snode(vi,vj)×Wnode+Srel(vi,vj)×Wrel,
wherein S(i, j) represents the topological structure similarity between the assembly process topological structure vi of the same product of the ith type and the assembly process topological structure vj of the same product of the jth type, Wnode is a preset node weight parameter, and Wrel is a preset relation edge weight parameter.
Wnode represents a degree of influence of the nodes on a whole topological structure, and Wrel represents a degree of influence of the relation edges on the whole topological structure. Wnode and Wrel may be given artificially by a designer according to an importance of a technology and an assembly process.
When the assembly process topological structure of each type of product is expressed, the assembly processes of the products with different styles are basically similar, the assembly of the products with different styles may have more processes, and more differences lie in different parameters of the products with different styles in the same assembly process. Therefore, a difference in the assembly process topological structures of the products with different styles mainly lies in the nodes. According to the similarity generation module, a difference of the whole topological structure is calculated by comprehensively considering differences of the nodes and the topological relations in the assembly process topological structure of the product.
Preferably, the fuzzy compatibility space construction module includes:
a fuzzy compatibility matrix generation sub-module configured to construct the fuzzy compatibility matrix S representing a similarity among all topological structures of a set V={v1, v2, v3, . . . , vn} of all the samples;
a granule layer generation sub-module configured to take the fuzzy compatibility matrix S as an input and constructing the fuzzy compatibility space X with different granule layers by the following methods, and cluster all the samples through the fuzzy compatibility space X:
a first unit configured to set a threshold λ, wherein 1=λ1>λ2>λ3> . . . >λn=0, and when values of the threshold λ are respectively λ1, λ2, λ3, . . . , λn, calculating the similarity S(i, j) between the sample vi and the other sample vj in V={v1, v2, v3, . . . , vn}, wherein i∈(1,n); and
a second unit configured to, according to
obtain all the samples meeting Sλ(i,j)=1 when λ=λi to construct sample granules Gi, wherein i=1, 2, 3, . . . , n, then construct a corresponding granule layer C(λi) through all the sample granules Gi, and finally construct the fuzzy compatible space X according to all the granule layers C(λi).
The larger the value of λi is, the smaller the number of samples in the sample granule Gi is, and the finer the granularity of the granule layer C(λi) is. The smaller the value of λi is, the larger the number of sample granules Gi in the granule layer C(λi) is, and the coarser the granularity of the granule layer C(λi) is.
Preferably, the optimal granule layer generation module includes:
a granularity calculation sub-module configured to calculate a granularity of the granule layer C(λi), wherein i=1, 2, 3, . . . , n:
wherein Gi, k is a kth sample granule in the granule layer C(λi); |Gi, k| is a number of samples contained in the kth sample granule; log2(|Gi, k|) represents an amount of information needed to completely distinguish all granules in the sample granule Gi, k; and g represents a number of sample granules in the granule layer C(λi);
an information increment calculation sub-module configured to calculate an information increment IG[C(λi)] of the granule layer C(λi), wherein i=1, 2, 3, . . . , n:
IG[C(λi)]=E[C(λi)]−E[C(λi-1)];
a comprehensive value calculation sub-module configured to calculate an information increment and a comprehensive value Di of the similarity among the samples of the granule layer C(λi), wherein i=1, 2, 3, . . . , n:
Di=IG[C(λi)]*Wig+λiWλ,
wherein Wig is a weight of the information increment of the granule layer C(λi), and Wλ is a weight of a sample similarity threshold of the granule layer C(λi); and
a screening sub-module configured to screen out a granule layer with a maximum comprehensive values Di as an optimal granule layer C(λo).
The granularity of the granule layer C(λi) is regarded as an average amount of information needed to completely distinguish all sample granules in the granule layer. When the coarse granule layer C(λi) is converted to the fine granule layer C(λi-1), information gain may occur. The larger the information increment is, the more meaningful the conversion is during the granule layer conversion. Meanwhile, a degree of similarity of the samples in sample granule may also affect an effectiveness of final equalization. The larger the threshold λ is, the higher the similarity of the samples is, and the worse the equalization effect is. Therefore, it is necessary to comprehensively consider the information increment of the granule layer C(λi) and the similarity of the samples to comprehensively determine the optimal granule layer C(λo), which means that the optimal granule layer C(λo) should have a large information increment and a minimum similarity at the same time.
The weights Wig and Wλ are randomly selected from previous sample data, then determined samples of the optimal granule layer C(λo) are calculated, and the samples determined for the first time are trained with the following machine learning algorithm model, so as to obtain a result. Then, the weights Wig and Wig are adjusted according to the result, and iterated for many times to make a final result optimal, so as to obtain a final weight value.
Preferably, the equalization module includes:
an average sample number calculation sub-module configured to calculate an average number
and
a sample increasing and decreasing sub-module configured to increase and decrease a number of samples of each sample granule Gi in the optimal granule layer C(λo) by a random sampling method, so that the number of the samples in each sample granule Gi is the same to complete the equalization processing
if |Gi, k|>
if |Gi, k<
The technical principles of the disclosure are described above with reference to the specific embodiments. These descriptions are only for the purpose of explaining the principles of the disclosure, but cannot be interpreted as a limitation on the scope of protection of the disclosure in any form. Based on the explanation herein, those skilled in the art may think of other specific implementations of the disclosure without going through any creative work, and these implementations shall all fall within the scope of protection of the disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202010175032.0 | Mar 2020 | CN | national |
Number | Name | Date | Kind |
---|---|---|---|
20060285755 | Hager et al. | Dec 2006 | A1 |
20180012355 | Sarkar | Jan 2018 | A1 |
Number | Date | Country |
---|---|---|
104657418 | May 2015 | CN |
105868791 | Aug 2016 | CN |
110766055 | Feb 2020 | CN |
111291818 | Jun 2020 | CN |
111832664 | Oct 2020 | CN |
Entry |
---|
Iam-On, Comparative study of matrix refinement approaches for ensemble clustering, 2012, Springer (Year: 2012). |
Hu, Incremental fuzzy cluster ensemble learning based on rough set theory, Jun. 2017, Elsevier (Year: 2017). |
Number | Date | Country | |
---|---|---|---|
20210286326 A1 | Sep 2021 | US |