The present invention relates to green knowledge recommendation methods, and more particularly to a green knowledge recommendation method based on characteristic similarity and user demands, an electronic device and a computer readable storage medium thereof.
In a green knowledge base, the traditional way for users to search for desired knowledge is inaccurate and slow, because the queries users issue during the search process are often very broad rather than precise. The traditional response to a user's search is simply to return a sufficiently large set of results, rather than trying to reduce the uncertainty in the user's broad query. Traditional methods only give a broad range of results and let users slowly narrow them down by themselves, filtering out the unnecessary knowledge on their own. Such a search method is too slow, and the search results are not accurate enough to meet the needs of users.
The object of the present invention is to provide a green knowledge recommendation method based on characteristic similarity and user demands to solve the problem that search results are not accurate enough to meet the needs of users. The green knowledge recommendation method acts as a template-based method that allows users to quickly find what they need, so as to avoid meaningless searching, improve search efficiency, and reduce wasted time.
The present invention is realized with the following technical scheme.
A green knowledge recommendation method based on characteristic similarity and user demands includes the following steps 1˜4.
Step 1, obtain a current-search text e and a historical-search-texts set Eu both from a user u, Eu={e1,u, e2,u, . . . , en1,u, . . . , eN1,u}. Wherein the en1,u represents the n1 th historical-search text of the user u, and the N1 represents the total number of historical-search texts in the set Eu.
Step 2, construct a topics dictionary and a subtopics dictionary, and decompose the current-search text e and the historical-search-texts set Eu on the basis of semantic decomposition. The step 2 includes steps 2.1˜2.6.
Step 2.1, construct a topics dictionary X of a green knowledge base, X={x1, x2, . . . , xn2, . . . , xN2}. Wherein the xn2 represents the n2 th topic, the N2 represents the total number of topics in the dictionary X.
Construct a subtopics dictionary Y of the green knowledge base, Y={y1, y2, . . . , yn3, . . . , yN3}. Wherein the yn3 represents the n3 th subtopic, the N3 represents the total number of subtopics in the dictionary Y.
Construct a daily-expressions dictionary C of a set of users, C={c1, c2, . . . , cn4, . . . , cN4}. Wherein the cn4 represents the n4 th daily expression, the N4 represents the total number of daily expressions in the dictionary C.
Step 2.2, decompose e and en1,u according to dictionaries X, Y, C to obtain two text-vector sets we and wn1 correspondingly. The we is about the current-search text e, we={w1^e, w2^e, . . . , wi^e, . . . , wIe^e}; the wn1 is about the n1 th historical-search text en1,u, wn1={w1^n1, w2^n1, . . . , win1^n1, . . . , wIn1^n1}.
Wherein the wi^e represents the i th word of the current-search text e, and the Ie represents the total number of words in the we; the win1^n1 represents the in1 th word of the n1 th historical-search text en1,u, and the In1 represents the total number of words in the wn1.
Define ti^e being the label of the wi^e, and define tin1^n1 being the label of the win1^n1. If the win1^n1 belongs to the dictionary X, the tin1^n1 is defined as X; if the win1^n1 belongs to the dictionary Y, the tin1^n1 is defined as Y; if the win1^n1 belongs to the dictionary C, the tin1^n1 is defined as C; otherwise the tin1^n1 is defined to show that the word belongs to none of the three dictionaries. The ti^e is defined in the same way.
Step 2.3, obtain the weight Lin1^n1 of the word win1^n1 by the formula (1).
In the formula, the δ1 represents the first weight, the δ2 represents the second weight, and 0<δ2<δ1<1.
Step 2.4, obtain the weight Li^e of the word wi^e in the same way as step 2.3.
Step 2.5, obtain the similarity between the wi^e and the win1^n1 by the formula (2).
Step 2.6, obtain the similarities between each of the two words respectively from the two text-vector sets we and wn1 in the same way as step 2.5. Collect the words with the highest similarity to be a candidate-words set, in which one candidate word would be selected to be the n1 th word of the we. A valid-text set Vi^e is defined by collecting the candidate words, Vi^e={v1,i^e, v2,i^e, . . . , vp,i^e, . . . , vP,i^e}. Wherein the vp,i^e represents the p th word in the valid-text set Vi^e, and the P represents the total number of words in the Vi^e.
Step 3, according to the weight, pick words in the we and the Vi^e that belong to the dictionary X and the dictionary Y, so as to obtain a large-subject terms set Z and a minor-subject terms set V. The step 3 includes steps 3.1˜3.6.
Step 3.1, pick words in the we that belong to the dictionary X.
When the wi^e belongs to the dictionary X, collect it into a first words set.
Step 3.2, pick words in the Vi^e that belong to the dictionary X.
When the vp,i^e belongs to the dictionary X, collect it into a second words set.
Step 3.3, a large-subject terms set Z is defined by the first words set and the second words set, Z={z1X, z2X, . . . , zn5X, . . . , zN5X}.
Wherein the zn5X represents the n5 th large-subject term, the N5 represents the total number of large-subject terms in the set Z.
Step 3.4, pick words in the we that belong to the dictionary Y. When the wi^e belongs to the dictionary Y, pick it out.
Step 3.5, pick words in the Vi^e that belong to the dictionary Y. When the vp,i^e belongs to the dictionary Y, pick it out.
Step 3.6, a minor-subject terms set V is defined by the words picked from the we and the Vi^e, V={v1Y, v2Y, . . . , vn6Y, . . . , vN6Y}. Wherein the vn6Y represents the n6 th minor-subject term, the N6 represents the total number of minor-subject terms in the set V.
Step 4, find the corresponding knowledge according to user satisfaction. The step 4 includes steps 4.1˜4.6.
Step 4.1, acquire a knowledge a to be identified, and calculate the frequency of each of the words appearing in the knowledge a after semantic decomposition under the dictionary X and the minor-subject terms set V. The sxn2^a represents the frequency of the n2 th topic xn2 appearing in the knowledge a, 0≤sxn2^a≤1; and the svn6^a represents the frequency of the n6 th subtopic vn6Y appearing in the knowledge a, 0≤svn6^a≤1.
Step 4.2, assign a value to each of the words in the minor-subject terms set V, and a weighting function H(vn6Y) of words in the minor-subject terms set V is defined as formula (3).
Step 4.3, a user-demand degree function Q(vn6Y) is defined as formula (4).
In the formula, the K represents the users' satisfaction, K∈(0, 100%).
Step 4.4, get a topic xuser required by the user in the topics dictionary X, and calculate the closing degree d1a between the topic xuser and the knowledge a, d1a=1−sxuser^a, wherein the sxuser^a is the frequency of the topic xuser appearing in the knowledge a.
Step 4.5, get the user's demand for each of the minor-subject terms in the minor-subject terms set V, and calculate the user's closing degree d2a to all of the minor-subject terms.
Step 4.6, calculate the closing degree da between the user's demand and the knowledge a, da=d1a+d2a, obtain the closing degrees of all of the knowledge, and select the knowledge with smaller closing degrees to feed to the user.
The present invention further provides an electronic device, including a memory and a processor. The memory is used to store programs that support the processor to execute, and the programs are programmed according to the green knowledge recommendation method.
The present invention further provides a computer readable storage medium, used to store programs that are programmed according to the green knowledge recommendation method.
Compared with the prior art, the beneficial effects of the present invention are as follows.
1. The present invention first divides the collected text into words and sets weights to improve the usefulness of the similarity calculation. It then divides the text into two parts according to the dependency relationship between the user's demand for large types and small types, so that the user's idea becomes more specific and detailed; in the demand-degree model, the user's demands for different types can be combined so that the search results conform to the user's demand. For each piece of received knowledge, the word frequencies obtained under the dictionaries and the term sets are compared with the demand function to find the knowledge that best meets the needs of the user.
2. The present invention uses a similarity model to quickly obtain usable text. The use of the demand-degree model allows users to combine different types of needs, so that the search results meet the needs of users. The present invention combines the user's demand with the results of previous searches, so that the accuracy of the pushed results is greatly improved.
Referring to the accompanying drawing, the present embodiment provides a green knowledge recommendation method based on characteristic similarity and user demands, which includes the following steps 1˜4.
Step 1, obtain a current-search text e and a historical-search-texts set Eu both from a user u, Eu={e1,u, e2,u, . . . , en1,u, . . . , eN1,u}. Wherein the en1,u represents the n1 th historical-search text of the user u, and the N1 represents the total number of historical-search texts in the set Eu.
Step 2, construct a topics dictionary and a subtopics dictionary, and decompose the current-search text e and the historical-search-texts set Eu on the basis of semantic decomposition. The step 2 includes steps 2.1˜2.6.
Step 2.1, construct a topics dictionary X of a green knowledge base, X={x1, x2, . . . , xn2, . . . , xN2}. Wherein the xn2 represents the n2 th topic, the N2 represents the total number of topics in the dictionary X. The topics can be cars, machine tools, refrigerators, and other broad categories.
Construct a subtopics dictionary Y of the green knowledge base, Y={y1, y2, . . . , yn3, . . . , yN3}. Wherein the yn3 represents the n3 th subtopic, the N3 represents the total number of subtopics in the dictionary Y. The subtopics can be small types under a large type, such as a large car, bus, or truck; components such as a chassis, engine, or shell; or effects such as lightweight, energy-saving, or wear-resistant.
Construct a daily-expressions dictionary C of a set of users, C={c1, c2, . . . , cn4, . . . , cN4}. Wherein the cn4 represents the n4 th daily expression, the N4 represents the total number of daily expressions in the dictionary C. The daily expressions can be everyday words such as I, you, he, whatever, and want.
Step 2.2, decompose e and en1,u according to dictionaries X, Y, C to obtain two text-vector sets we and wn1 correspondingly. The we is about the current-search text e, we={w1^e, w2^e, . . . , wi^e, . . . , wIe^e}; the wn1 is about the n1 th historical-search text en1,u, wn1={w1^n1, w2^n1, . . . , win1^n1, . . . , wIn1^n1}.
Wherein the wi^e represents the i th word of the current-search text e, and the Ie represents the total number of words in the we; the win1^n1 represents the in1 th word of the n1 th historical-search text en1,u, and the In1 represents the total number of words in the wn1.
Define ti^e being the label of the wi^e, and define tin1^n1 being the label of the win1^n1. If the win1^n1 belongs to the dictionary X, the tin1^n1 is defined as X; if the win1^n1 belongs to the dictionary Y, the tin1^n1 is defined as Y; if the win1^n1 belongs to the dictionary C, the tin1^n1 is defined as C; otherwise the tin1^n1 is defined to show that the word belongs to none of the three dictionaries. The ti^e is defined in the same way.
The labels are used to detect the dictionary that each word corresponds to, so as to simplify the identification of the relationship.
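As an illustration of steps 2.1˜2.2, the following Python sketch decomposes a search text against the three dictionaries and attaches a label to each word. The tiny dictionaries, the whitespace tokenizer, and the use of the letters X, Y, C as label values are illustrative assumptions only; the patent does not fix these details.

```python
# Minimal sketch of steps 2.1-2.2: dictionary construction and labeled decomposition.
# The dictionaries, tokenizer, and label values below are illustrative assumptions.

TOPICS = {"car", "machine tool", "refrigerator"}                    # dictionary X (topics)
SUBTOPICS = {"engine", "chassis", "lightweight", "energy-saving"}   # dictionary Y (subtopics)
DAILY = {"i", "you", "he", "want", "a", "an", "the"}                # dictionary C (daily expressions)

def decompose(text):
    """Split a search text into words and label each word with the dictionary
    it belongs to ('X', 'Y', 'C') or None if it matches no dictionary."""
    labeled = []
    for w in text.lower().split():      # assumption: whitespace tokenization
        if w in TOPICS:
            label = "X"
        elif w in SUBTOPICS:
            label = "Y"
        elif w in DAILY:
            label = "C"
        else:
            label = None
        labeled.append((w, label))
    return labeled

# Example: current-search text e and one historical-search text e1,u
w_e = decompose("I want a lightweight car engine")
w_1 = decompose("you want an energy-saving refrigerator")
print(w_e)
# [('i', 'C'), ('want', 'C'), ('a', 'C'), ('lightweight', 'Y'), ('car', 'X'), ('engine', 'Y')]
```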
Step 2.3, obtain the weight Lin1^n1 of the word win1^n1 by the formula (1).
In the formula, the δ1 represents the first weight, the δ2 represents the second weight, and 0<δ2<δ1<1. Weights are thus set for words that fall under the topics, subtopics, and daily-expressions dictionaries.
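The concrete form of formula (1) is not reproduced here. The sketch below shows only one plausible label-based weighting that respects the stated constraint 0<δ2<δ1<1 (topic and subtopic words receive the first weight δ1, daily expressions the second weight δ2, unmatched words zero); the specific assignment and the numeric values are assumptions for illustration, not the patent's formula.

```python
# Hypothetical label-based weighting in the spirit of step 2.3. This is NOT the
# patent's formula (1); it only obeys the stated constraint 0 < delta2 < delta1 < 1.
DELTA1 = 0.8   # first weight (assumed value)
DELTA2 = 0.3   # second weight (assumed value)

def word_weight(label):
    """Assign a weight to a word according to its dictionary label."""
    if label in ("X", "Y"):   # topic or subtopic word
        return DELTA1
    if label == "C":          # daily-expression word
        return DELTA2
    return 0.0                # word outside all three dictionaries

labels = ["C", "C", "C", "Y", "X", "Y"]
print([word_weight(t) for t in labels])
# [0.3, 0.3, 0.3, 0.8, 0.8, 0.8]
```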
Step 2.4, obtain the weight Li^e of the word wi^e in the same way as step 2.3.
Step 2.5, obtain the similarity between the wi^e and the win1^n1 by the formula (2).
The text vector set is converted into a numerical vector during the computation.
Step 2.6, obtain the similarities between each of the two words respectively from the two text-vector sets we and wn1 in the same way as step 2.5.
Collect the words with the highest similarity to be a candidate-words set, in which one candidate word would be selected to be the n1 th word of the we.
A valid-text set Vi^e is defined by collecting the candidate words, Vi^e={v1,i^e, v2,i^e, . . . , vp,i^e, . . . , vP,i^e}.
Wherein the vp,i^e represents the p th word in the valid-text set Vi^e, and the P represents the total number of words in the Vi^e.
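Formula (2) is likewise not reproduced. The sketch below uses a deliberately simple placeholder similarity (exact match first, then agreement of dictionary labels) just to show how, per step 2.6, the most similar historical word for each current word is collected into a valid-text set; the similarity scores and helper names are assumptions, not the patent's formula.

```python
# Placeholder similarity and candidate collection for steps 2.5-2.6. The real
# similarity is given by the patent's formula (2); this stand-in only demonstrates
# the control flow of building the valid-text sets.

def similarity(w_i, w_j):
    """Take two (word, label) pairs and return a score in [0, 1]; purely illustrative."""
    word_a, label_a = w_i
    word_b, label_b = w_j
    if word_a == word_b:
        return 1.0
    if label_a is not None and label_a == label_b:
        return 0.5          # same dictionary, different word (assumed score)
    return 0.0

def valid_text_sets(current, historical_texts):
    """For each word of the current text, take the most similar word from each
    historical text as a candidate (step 2.6) and collect the candidates."""
    V = []
    for w_i in current:
        candidates = []
        for hist in historical_texts:
            best = max(hist, key=lambda w_j: similarity(w_i, w_j))
            candidates.append(best)
        V.append(candidates)
    return V

current = [("lightweight", "Y"), ("car", "X")]
history = [[("energy-saving", "Y"), ("refrigerator", "X")],
           [("car", "X"), ("engine", "Y")]]
print(valid_text_sets(current, history))
# [[('energy-saving', 'Y'), ('engine', 'Y')], [('refrigerator', 'X'), ('car', 'X')]]
```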
Step 3, according to the weight, pick words in the we and the Vi^e that belong to the dictionary X and the dictionary Y, so as to obtain a large-subject terms set Z and a minor-subject terms set V. The step 3 includes steps 3.1˜3.6.
Step 3.1, pick words in the we that belong to the dictionary X.
When the wi^e belongs to the dictionary X, collect it into a first words set.
Step 3.2, pick words in the Vi^e that belong to the dictionary X.
When the vp,i^e belongs to the dictionary X, collect it into a second words set.
Step 3.3, a large-subject terms set Z is defined by the first words set and the second words set, Z={z1X, z2X, . . . , zn5X, . . . , zN5X}. Wherein the zn5X represents the n5 th large-subject term, the N5 represents the total number of large-subject terms in the set Z.
Step 3.4, pick words in the we that belong to the dictionary Y. When the wi^e belongs to the dictionary Y, pick it out.
Step 3.5, pick words in the Vi^e that belong to the dictionary Y. When the vp,i^e belongs to the dictionary Y, pick it out.
Step 3.6, a minor-subject terms set V is defined by the words picked from the we and the Vi^e, V={v1Y, v2Y, . . . , vn6Y, . . . , vN6Y}. Wherein the vn6Y represents the n6 th minor-subject term, the N6 represents the total number of minor-subject terms in the set V.
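A minimal sketch of step 3 follows: words labeled X from the current text and its valid-text sets form the large-subject terms set Z, and words labeled Y form the minor-subject terms set V. Deduplication while preserving order, and the helper name build_term_sets, are assumed details added for illustration.

```python
# Sketch of steps 3.1-3.6: split labeled words into the large-subject terms set Z
# (dictionary X) and the minor-subject terms set V (dictionary Y).

def build_term_sets(current, valid_text_sets):
    """current: list of (word, label); valid_text_sets: list of candidate lists."""
    picked = list(current)
    for candidates in valid_text_sets:
        picked.extend(candidates)

    Z, V = [], []
    for word, label in picked:
        if label == "X" and word not in Z:
            Z.append(word)          # large-subject term
        elif label == "Y" and word not in V:
            V.append(word)          # minor-subject term
    return Z, V

current = [("lightweight", "Y"), ("car", "X")]
valid = [[("energy-saving", "Y"), ("engine", "Y")],
         [("refrigerator", "X"), ("car", "X")]]
Z, V = build_term_sets(current, valid)
print(Z)   # ['car', 'refrigerator']
print(V)   # ['lightweight', 'energy-saving', 'engine']
```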
Step 4, find the corresponding knowledge according to user satisfaction. The step 4 includes steps 4.1˜4.6.
Step 4.1, acquire a knowledge a to be identified, and calculate the frequency of each of the words appearing in the knowledge a after semantic decomposition under the dictionary X and the minor-subject terms set V. The sxn2^a represents the frequency of the n2 th topic xn2 appearing in the knowledge a, 0≤sxn2^a≤1; and the svn6^a represents the frequency of the n6 th subtopic vn6Y appearing in the knowledge a, 0≤svn6^a≤1.
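A small sketch of step 4.1: after decomposing a candidate piece of knowledge, the relative frequency of every topic in X and every minor-subject term in V is counted. Treating the frequency as occurrence count divided by total word count is an assumed normalization that keeps every value in [0, 1]; the function name and example text are hypothetical.

```python
# Sketch of step 4.1: term frequencies of topics (dictionary X) and minor-subject
# terms (set V) inside a knowledge item a. Frequency = count / total words is an
# assumed normalization.
from collections import Counter

def term_frequencies(knowledge_text, topics, minor_terms):
    words = knowledge_text.lower().split()      # assumed tokenization
    counts = Counter(words)
    total = max(len(words), 1)
    s_x = {x: counts[x] / total for x in topics}        # frequencies sxn2^a
    s_v = {v: counts[v] / total for v in minor_terms}   # frequencies svn6^a
    return s_x, s_v

knowledge_a = "lightweight car body design for car energy-saving"
s_x, s_v = term_frequencies(knowledge_a, ["car", "refrigerator"],
                            ["lightweight", "energy-saving", "engine"])
print(s_x)   # {'car': 0.2857..., 'refrigerator': 0.0}
print(s_v)   # {'lightweight': 0.1428..., 'energy-saving': 0.1428..., 'engine': 0.0}
```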
Step 4.2, assign a value to each of the words in the minor-subject terms set V, and a weighting function H(vn6Y) of words in the minor-subject terms set V is defined as formula (3). Here, the word frequency is used to show the proportion of each feature in the knowledge a, which is also the influence of each feature in the knowledge a.
Step 4.3, a user-demand degree function Q(vn6Y) is defined as formula (4).
In the formula, the K represents the users' satisfaction, K∈(0, 100%).
Because some effects are searched more often in the user's overall text, they are obviously what the user wants more.
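Formulas (3) and (4) are not reproduced here. The sketch below only illustrates the roles the two functions play: H(v) weights each minor-subject term by its share of the term frequencies inside the knowledge, and Q(v) weights each term by how often it appears across the user's texts, scaled by the satisfaction K. Both concrete definitions are assumptions for illustration, not the patent's formulas.

```python
# Illustrative stand-ins for the weighting function H(v) (formula (3)) and the
# user-demand degree Q(v) (formula (4)). The real formulas are defined in the
# patent; these placeholders only show the intended roles of the two functions.

def H(s_v):
    """Share of each minor-subject term's frequency within the knowledge a."""
    total = sum(s_v.values())
    if total == 0:
        return {v: 0.0 for v in s_v}
    return {v: f / total for v, f in s_v.items()}

def Q(user_term_counts, K=0.9):
    """Demand degree per term: how often the user searched it, scaled by the
    satisfaction K, K in (0, 100%). The scaling is an assumed choice."""
    total = sum(user_term_counts.values())
    if total == 0:
        return {v: 0.0 for v in user_term_counts}
    return {v: K * c / total for v, c in user_term_counts.items()}

s_v = {"lightweight": 1/7, "energy-saving": 1/7, "engine": 0.0}
print(H(s_v))   # {'lightweight': 0.5, 'energy-saving': 0.5, 'engine': 0.0}
print(Q({"lightweight": 3, "energy-saving": 1, "engine": 0}))
# {'lightweight': 0.675, 'energy-saving': 0.225, 'engine': 0.0}
```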
Step 4.4, get a topic xuser required by the user in the topics dictionary X, and calculate the closing degree d1a between the topic xuser and the knowledge a, d1a=1−sxuser^a, wherein the sxuser^a is the frequency of the topic xuser appearing in the knowledge a.
Step 4.5, get the user's demand for each of the minor-subject terms in the minor-subject terms set V, and calculate the user's closing degree d2a to all of the minor-subject terms.
Because the semantic decomposition uses the minor-subject terms set V, the subscript of vn6Y is n6.
Step 4.6, calculate the closing degree da between the user's demand and the knowledge a, da=d1a+d2a, obtain the closing degrees of all of the knowledge, and select the knowledge with smaller closing degrees to feed to the user.
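Putting steps 4.4˜4.6 together: d1a measures how far the knowledge is from the user's required topic, d2a aggregates the gap between the demand degree Q and the weighting H over the minor-subject terms, and knowledge items are ranked by the total closing degree da=d1a+d2a, smaller being closer. The sum-of-absolute-differences form of d2a and the example data below are assumptions, since the patent's expression for d2a is not reproduced here.

```python
# Sketch of steps 4.4-4.6: closing degree between the user's demand and each
# knowledge item, then ranking. d2a as a sum of |Q - H| gaps is an assumed
# aggregation; the patent defines its own expression.

def closing_degree(s_x, H_v, Q_v, x_user):
    d1a = 1.0 - s_x.get(x_user, 0.0)                      # d1a = 1 - sxuser^a
    d2a = sum(abs(Q_v[v] - H_v.get(v, 0.0)) for v in Q_v) # assumed form of d2a
    return d1a + d2a                                       # da = d1a + d2a

# Two candidate knowledge items with precomputed topic frequencies and H values.
candidates = {
    "a1": ({"car": 0.29}, {"lightweight": 0.5, "energy-saving": 0.5}),
    "a2": ({"car": 0.05}, {"lightweight": 0.1, "energy-saving": 0.0}),
}
Q_v = {"lightweight": 0.675, "energy-saving": 0.225}

ranked = sorted(candidates,
                key=lambda a: closing_degree(candidates[a][0], candidates[a][1],
                                             Q_v, x_user="car"))
print(ranked)   # ['a1', 'a2'] -- a1 is closer to the user's demand
```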
The present embodiment further provides an electronic device, including a memory and a processor. The memory is used to store programs that support the processor to execute, and the programs are programmed according to the green knowledge recommendation method.
The present embodiment further provides a computer readable storage medium, used to store programs that are programmed according to the green knowledge recommendation method.
Number | Date | Country | Kind
---|---|---|---
202310103329.X | Feb 2023 | CN | national

 | Number | Date | Country
---|---|---|---
Parent | PCT/CN2023/118564 | Sep 2023 | WO
Child | 18964413 | | US