This application relates to the field of artificial intelligence, and in particular, to a recommendation method, a recommendation network, and a related device.
Artificial intelligence (AI) is a theory, a method, a technology, and an application system that simulate, extend, and expand human intelligence by using a digital computer or a machine controlled by a digital computer, to perceive an environment, obtain knowledge, and achieve an optimal result by using the knowledge. In other words, artificial intelligence is a branch of computer science, and is intended to understand essence of intelligence and produce a new intelligent machine that can react in a manner similar to human intelligence. Artificial intelligence is to study design principles and implementation methods of various intelligent machines, so that the machines have perception, inference, and decision-making functions.
With development of internet information technologies, especially rise of E-commerce, internet content presents an explosive growth and an era of massive data is gradually entered. Users are at a loss when facing a wide variety of goods, movies, music, videos and other items. This is often mentioned as information overload.
Therefore, how to accurately recommend an appropriate item to a user in a large quantity of items is a technical problem that urgently needs to be resolved.
Embodiments of this application provide a recommendation method, a recommendation network, and a related device. A neural network may be run based on a knowledge graph and information of a user, to obtain a recommendation result. The neural network is used to perform computing based on at least one first direction and at least one second direction in the knowledge graph to obtain the recommendation result. Therefore, the new recommendation method is provided so that a more accurate recommendation result can be obtained, that is, precision and/or a recall of the recommendation result can be improved. The new recommendation method is provided may be applied to a scenario in which an item is recommended to the user, for example, movie recommendation or game recommendation.
According to a first aspect of embodiments of this application, a recommendation method is provided. The method may be performed by a recommendation apparatus, or may be performed by a component (such as a processor, a chip, or a chip system) of the recommendation apparatus. The method includes: obtaining a knowledge graph, where the knowledge graph includes a plurality of first entities indicating items and a plurality of second entities indicating item attributes; and running, based on the knowledge graph and information of a user, a neural network to obtain a recommendation result, where the information of the user indicates an item or an item attribute in a history record of the user. The recommendation result includes a recommended item. The neural network is used to perform computing based on at least one first direction and at least one second direction in the knowledge graph to obtain the recommendation result. The first direction starts from a first entity indicating the recommended item or a to-be-recommended item. The second direction starts from a first entity indicating the item in the history record of the user, or the second direction starts from a second entity indicating the item attribute in the history record of the user. The information of the user may include a user identifier, an item clicked by the user or an item attribute, and an attribute related to the user, for example, an age, an occupation, or a gender of the user. The to-be-recommended item may be understood as an item that is not clicked by the user. The recommendation method is to find the recommended item appropriate for the user from the to-be-recommended item, to improve precision and a recall. It may be understood that, in the knowledge graph, all first entities that do not indicate the item in the history record of the user may be the to-be-recommended items. Certainly, in some implementations, preliminary filtering may be performed to narrow down a range of the to-be-recommended items in the knowledge graph, so that some items indicated by the first entity are selected as the to-be-recommended items. The information of the user may indicate an item or an item attribute in a positive event in the history record of the user. It should be understood that the information of the user may be filtered. The information of the user for recommendation indicates the item or the item attribute in the positive event in the history record of the user. The item or the item attribute in the positive event in the history record of the user may be understood as an item or an item attribute once in the positive event of the user that is indicated by the first entity or the second entity. The positive event is an event in which the user does not clearly express a rejective or negative intention (that is, dislike, unconcern, or being unwilling to follow), or expresses a positive intention such as following, like, or interest. For example, that the user clicks, forwards, follows, comments, incentives (such as puts a coin and likes), favorites, browses, and browses an item or an item attribute for more than a threshold time all are positive events; and that the user clicks “Dislike”, clicks “Do not show me again”, clicks “Not interested”, and selects “Do not recommend this on my homepage” all are negative events, and an item or an item attribute related to such history record should be removed and not entered. In addition, the first direction starts from the first entity indicating the recommended item or the to-be-recommended item. It may be understood that a start point of the first direction is the first entity indicating the recommended item or the to-be-recommended item. An entity to which the first direction points is not limited. The second direction starts from the first entity indicating the item in the history record of the user, or the second direction starts from the second entity indicating the item attribute in the history record of the user. It may be understood that a start point of the second direction is the first entity indicating item in the history record of the user, or a start point of the second direction is the second entity. An entity to which the second direction points is not limited. For example, in a movie recommendation scenario, the start point of the first direction is a movie that is not watched by the user, and the start point of the second direction is a movie that is watched by the user or a corresponding movie genre, or the like.
In this embodiment of this application, the neural network can be run based on the knowledge graph and the information of the user, to obtain the recommendation result. The neural network is used to perform computing based on the at least one first direction and the at least one second direction in the knowledge graph to obtain the recommendation result. Therefore, the new recommendation method is provided, so that the more accurate recommendation result can be obtained, that is, precision and/or a recall of the recommendation result can be improved.
In an embodiment, the recommendation result in the foregoing operation further includes an explanation for the recommended item, and the explanation is associated with one first direction and one second direction.
In such possible implementation, in one aspect, the explanation for the recommended item can be obtained based on the first direction and the second direction. Because the explanation is related to the two directions, reasonableness of the explanation can be improved. In another aspect, in the recommendation method, in addition to the recommended item, the explanation for the recommended item can also be obtained. The explanation can express a reason why the recommended item is recommended.
In an embodiment, the foregoing operation of running, based on the knowledge graph and information of a user, a neural network to obtain a recommendation result includes: extracting, based on the knowledge graph and the information of the user, a plurality of paths from the knowledge graph, where each path indicates an association between entities at two ends of the path, one end of each path corresponds to the to-be-recommended item, and the other end of each path corresponds to the item or the item attribute in the history record of the user; and inputting the plurality of paths into the neural network to score a plurality of to-be-recommended items, so that the neural network obtains the recommendation result, where the neural network is used to perform computing based on a first direction and a second direction of each of the plurality of paths to obtain the recommendation result; and for one path, the first direction is a direction in which one end of the path points to the other end of the path, and the second direction is a direction in which the other end of the path points to the one end of the path.
In this embodiment, the plurality of paths are used for scoring the plurality of to-be-recommended items, and the neural network is used to perform computing based on the first direction and the second direction of each of the plurality of paths to obtain the recommendation result, so that the two directions of the path are considered during scoring. In this way, the recommendation result is more accurate compared with a recommendation result obtained in one direction in the conventional technology. In actual application, each of the plurality of to-be-recommended items is computed based on the first direction and the second direction of each of the plurality of paths, to obtain the plurality of recommended items, and then the final recommendation result is obtained based on ranking of the recommended items. In this way, several recommended items can be preset, so that the more accurate recommended item can be provided.
In an embodiment, the foregoing operation of running, based on the knowledge graph and information of a user, a neural network to obtain a recommendation result includes: inputting the knowledge graph and the information of the user into the neural network; extracting, based on the knowledge graph and the information of the user, a plurality of paths from the knowledge graph, where each path indicates an association between entities at two ends of the path, one end of each path corresponds to the to-be-recommended item, and the other end of each path corresponds to the item or the item attribute in the history record of the user; and scoring a plurality of to-be-recommended items based on a first direction and a second direction of each of the plurality of paths, so that the neural network obtains the recommendation result, where for one path, the first direction is a direction in which one end of the path points to the other end of the path, and the second direction is a direction in which the other end of the path points to the one end of the path.
In this embodiment, the plurality of to-be-recommended items can be scored in the first direction and the second direction of each of the plurality of paths, so that the two directions of the path are considered during scoring. In this way, the recommendation result is more accurate compared with a recommendation result obtained in one direction in the conventional technology. In actual application, the plurality of to-be-recommended items are scored based on the first direction and the second direction of each of the plurality of paths, to obtain the plurality of recommended items, and then the final recommendation result is obtained based on ranking of the recommended items. In this way, several recommended items can be preset, so that the more accurate recommended item can be provided.
In an embodiment, the first direction is consistent with a direction of a first vector; the second direction is consistent with a direction of a second vector; and for one path, the first vector is a vector pointing from the first entity that is in the path and that indicates the to-be-recommended item to an entity that is in the path and that is adjacent to the first entity indicating the to-be-recommended item, and the second vector is a vector pointing from the first entity that is in the path and that indicates the item in the history record of the user to an entity that is in the path and that is adjacent to the first entity indicating the item in the history record of the user; or the second vector is a vector pointing from the second entity that is in the path and that indicates the item attribute in the history record of the user to an entity that is in the path and that is adjacent to the second entity indicating the item attribute in the history record of the user. The scoring, based on a first direction and the second direction of each of the plurality of paths, a plurality of to-be-recommended items corresponding to the plurality of paths, so that the neural network obtains the recommendation result includes: scoring, based on a probability that a first vector and a second vector of each of the plurality of paths are aggregated at one entity in the path, a to-be-recommended item corresponding to the path, to obtain the recommended item.
In this embodiment, an overlap probability of information in left and right flow directions of the path in embedding space is computed, and an overlap probability feature of an aggregation node is computed. The overlap probability feature is used for outputting a semantic-level explanation, that is, a preference degree of the user for an entity node. In actual application, for the plurality of to-be-recommended items, the to-be-recommended item corresponding to each path is scored based on the probability that a first vector and a second vector of the path are aggregated at one entity in the path, to obtain the plurality of recommended items, and then the final recommendation result is obtained based on the ranking of the recommended items. In this way, the recommended item appropriate for the user is arranged in the front, and an interest point of the user can be accurately found.
In an embodiment, the foregoing operation further includes: separately scoring the plurality of paths based on the first direction and the second direction of each of the plurality of paths, to obtain an explanation for the recommendation result, where the explanation is a description of an association between entities at two ends indicated by the one or more paths in the obtained recommendation result.
In this embodiment, the plurality of paths can be scored in the first direction and the second direction of each of the plurality of paths, so that the two directions of the path are considered during scoring. In this way, the explanation of the recommendation result is more comprehensive and accurate compared with a recommendation result obtained in one direction in the conventional technology. In actual application, each of the plurality of paths is scored based on the first direction and the second direction of each of the plurality of paths, to obtain a plurality of explanations, and then the final recommendation result is obtained based on ranking of the explanations. In this way, the several recommended items can be preset, so that the more accurate recommended item can be provided.
In an embodiment, the first direction of the foregoing is consistent with the direction of the first vector; the second direction of the foregoing operation is consistent with the direction of the second vector; and for the one path, the first vector is the vector pointing from the first entity that is in the path and that indicates the to-be-recommended item to the entity that is in the path and that is adjacent to the first entity indicating the to-be-recommended item, and the second vector is the vector pointing from the first entity that is in the path and that indicates the item in the history record of the user to the entity that is in the path and that is adjacent to the first entity indicating the item in the history record of the user; or the second vector is the vector pointing from the second entity that is in the path and that indicates the item attribute in the history record of the user to the entity that is in the path and that is adjacent to the second entity indicating the item attribute in the history record of the user. The scoring the plurality of paths based on the first direction and the second direction of each of the plurality of paths, to obtain an explanation for the recommendation result includes: separately computing the probability that the first vector and the second vector of each path are aggregated at the one entity included in the path, to obtain the explanation for the recommendation result.
In this embodiment, the overlap probability of the information in the left and right flow directions of the path in the embedding space is computed, and the overlap probability feature of the aggregation node is computed. The overlap probability feature is used for outputting the semantic-level explanation, that is, the preference degree of the user for the entity node. That is, the probability that vectors in the two flow directions are aggregated is computed. This helps find an interest point of the user more accurately and give an appropriate recommendation.
In an embodiment, the foregoing operation of separately computing, a probability that the first vector and the second vector of each path are aggregated at the one entity included in the path includes:
where
pm represents the probability; smleft represents the first vector and smright represents the second vector; or smright represents the first vector and smleft represents the second vector; exp represents an exponential function based on e; and M is a positive integer and M represents a quantity of paths between the to-be-recommended item and an item or an item attribute in a history record of a user.
In an embodiment, the neural network in the foregoing operation is obtained by training the neural network by using a training sample as an input for the neural network and a loss function value less than a threshold as an objective. The training sample includes a plurality of training paths, or the training sample includes the knowledge graph for training and the item and/or the item attribute in the history record of the user for training. Each training path indicates the association between entities at the two ends of the training path. The two ends of each training path correspond to the item or the item attribute in the history record of the user. The loss function includes a first loss function. The first loss function indicates a difference between the recommendation result output by the neural network and a target result. The target result is related to the item or the item attribute in the history record of the user.
In this embodiment, the neural network is trained by using the first loss function, so that the recommendation result is more accurate and meets the preference degree of the user.
In an embodiment, the loss function in the foregoing operation further includes a second loss function. The second loss function indicates a difference between embedding vectors of a first training vector and a second training vector in the training path of the knowledge graph for training. Two ends of the training path correspond to the item or the item attribute in the history record of the user.
In this embodiment, a purpose of the second loss function is to minimize a computing error of the vectors in the left and right flow directions of the path that flow to a same node, so that in a relation matrix with a higher dimension in the embedding space, a computed embedding vector in the left is equal to a computed embedding vector in the right as much as possible. That is, the second loss function can be used to minimize an error that the embedding vectors at the two ends of the path flow to the same node, to resolve a problem that the two ends of the path flow to the same node in a complex relation.
In an embodiment, the loss function in the foregoing operation further includes a third loss function. The third loss function indicates a difference between a computed embedding vector of the first entity and a real embedding vector of the first entity, and/or the third loss function indicates a difference between a computed embedding vector of the second entity and a real embedding vector of the second entity.
In this embodiment, the third loss function can minimize an error of a head node vector plus a relation vector and a tail node vector that are of a triplet in a same relational space, to resolve a problem that one-to-many or many-to-one mapping occurs in a single triplet in the complex relation.
According to a second aspect of embodiments of this application, a recommendation method is provided. The method may be performed by a recommendation apparatus, or may be performed by a component (such as a processor, a chip, or a chip system) of the recommendation apparatus. The method includes: obtaining a knowledge graph, a first identifier, and a second identifier, where the knowledge graph includes a plurality of first entities indicating items and a plurality of second entities indicating item attributes, the first identifier indicates a to-be-recommended item, the second identifier indicates an item or an item attribute in a history record of a user, and the knowledge graph includes a first entity corresponding to the first identifier and a first entity or a second entity corresponding to the second identifier; and running a neural network based on the knowledge graph, the first identifier, and the second identifier, to obtain an evaluation result, where the evaluation result includes a probability that the to-be-recommended item is recommended; the neural network is used to perform computing based on at least one first direction and at least one second direction in the knowledge graph, to obtain the evaluation result. The first direction starts from the first entity corresponding to the first identifier. The second direction starts from the first entity or the second entity corresponding to the second identifier.
In this embodiment of this application, the neural network can be run based on the knowledge graph, the first identifier, and the second identifier, to obtain the probability that the to-be-recommended item is recommended. The neural network is used to perform computing based on the at least one first direction and the at least one second direction in the knowledge graph, to obtain the probability that the to-be-recommended item is recommended. Therefore, the new recommendation method is recommended. In actual application, each of the plurality of to-be-recommended items is computed based on the at least one first direction and the at least second direction of the knowledge graph, to obtain a plurality of probabilities, and then a final evaluation result is obtained based on ranking of the probability. In this way, the several recommended items can be preset, so that the more accurate recommended item can be provided.
In an embodiment, the evaluation result in the foregoing operation further includes a probability corresponding to a path in the knowledge graph based on which the to-be-recommended item is recommended, one end of the path is the first entity corresponding to the first identifier, and the other end of the path is the first entity corresponding to the second identifier.
In an embodiment, the foregoing operation of running a neural network based on the knowledge graph, the first identifier, and the second identifier, to obtain an evaluation result includes: extracting a plurality of paths from the knowledge graph based on the knowledge graph, the first identifier, and the second identifier, where each path indicates an association between entities at two ends of the path, one end of each path corresponds to the to-be-recommended item, and the other end of each path corresponds to the item or the item attribute in the history record of the user; and inputting the plurality of paths into the neural network to score a plurality of to-be-recommended items, so that the neural network obtains the evaluation result, where the neural network is used to perform computing based on a first direction and a second direction of each of the plurality of paths to obtain the probability. For one path, the first direction is a direction in which one end of the path points to the other end of the path, and the second direction is a direction in which the other end of the path points to the one end of the path.
In this embodiment, the plurality of to-be-recommended items can be scored by inputting the plurality of paths into the neural network, so that the neural network obtains the evaluation result. In this way, the two directions of the path are considered during scoring. The evaluation result is more accurate compared with an evaluation result obtained in one direction in the conventional technology.
In an embodiment, the foregoing operation of running a neural network based on the knowledge graph, the first identifier, and the second identifier, to obtain an evaluation result includes: inputting the knowledge graph, the first identifier, and the second identifier into the neural network; and extracting, a plurality of paths from the knowledge graph based on the knowledge graph, the first identifier, and the second identifier, where each path indicates an association between entities at two ends of the path, one end of each path corresponds to the to-be-recommended item, and the other end of each path corresponds to the item or the item attribute in the history record of the user; and scoring, based on a first direction and a second direction of each of the plurality of paths, a plurality of to-be-recommended items corresponding to the plurality of paths, so that the neural network obtains the evaluation result. For one path, the first direction is a direction in which one end of the path points to the other end of the path. The second direction is a direction in which the other end of the path points to one end of the path.
In this embodiment, the plurality of to-be-recommended items corresponding to the plurality of paths can be scored in the first direction and the second direction of each of the plurality of paths, so that the two directions of the path are considered during scoring. In this way, the evaluation result is more accurate compared with the evaluation result obtained in one direction in the conventional technology.
In an embodiment, the first direction of the foregoing operation is consistent with a direction of a first vector. The second direction of the foregoing operation is consistent with a direction of a second vector. For one path, the first vector is a vector pointing from the first entity that is in the path and that indicates the to-be-recommended item to an entity that is in the path and that is adjacent to the first entity indicating the to-be-recommended item. The second vector is a vector pointing from the first entity that is in the path and that indicates the item in the history record of the user to an entity that is in the path and that is adjacent to the first entity indicating the item in the history record of the user; or the second vector is a vector pointing from the second entity that is in the path and that indicates the item attribute in the history record of the user to an entity that is in the path and that is adjacent to the second entity indicating the item attribute in the history record of the user. The scoring, based on a first direction and a second direction of each of the plurality of paths, a plurality of to-be-recommended items corresponding to the plurality of paths includes: separately computing a probability that the first vector and the second vector of each path are aggregated at the one entity included in the path.
In this embodiment, the overlap probability of the information in the left and right flow directions of the path in the embedding space is computed, and the overlap probability feature of the aggregation node is computed. The overlap probability feature is used for outputting the semantic-level explanation, that is, the preference degree of the user for the entity node. The interest point of the user can be accurately found.
In an embodiment, the foregoing operation of separately computing, the probability that the first vector and the second vector of each path are aggregated at the one entity included in the path includes:
where
pm represents the probability; smleft represents the first vector and smright represents the second vector; or smright represents the first vector and smleft represents the second vector; exp represents an exponential function based on e; and M is a positive integer and M represents a quantity of paths between the to-be-recommended item and an item or an item attribute in a history record of a user.
In an embodiment, the neural network in the foregoing operation is obtained by training the neural network by using a training sample as an input for the neural network and a loss function value less than a threshold as an objective. The training sample includes a plurality of training paths, or the training sample includes the knowledge graph for training, a first training identifier, and a second training identifier. An item indicated by the first training identifier and an item or an item attribute indicated by the second training identifier are an item or an item attribute in a historical item of the user. The loss function includes a first loss function. The first loss function indicates a difference between the evaluation result output by the neural network and a target result. The target result is related to the item or the item attribute in the history record of the user.
In this embodiment, the neural network is trained by using the first loss function, so that the recommendation result is more accurate and meets the preference degree of the user.
In an embodiment, the loss function in the foregoing operation further includes a second loss function. The second loss function indicates a difference between embedding vectors of a first training vector and a second training vector in the training path of the knowledge graph for training. Two ends of the training path correspond to the item or the item attribute in the history record of the user.
In this embodiment, the purpose of the second loss function is to minimize the computing error of the vectors in the left and right flow directions of the path that flow to a same node, so that in a relation matrix with a higher dimension in the embedding space, a computed embedding vector in the left is equal to a computed embedding vector in the right as much as possible. That is, the second loss function can be used to minimize the error that the two ends of the path flow to the same node, to resolve the problem that the two ends of the path flow to the same node in the complex relation.
In an embodiment, the loss function in the foregoing operation further includes a third loss function. The third loss function indicates a difference between a computed embedding vector of the first entity and a real embedding vector of the first entity, and/or the third loss function indicates a difference between a computed embedding vector of the second entity and a real embedding vector of the second entity.
In this embodiment, the third loss function can minimize an error of a head node vector plus a relation vector and a tail node vector that are of a triplet in a same relational space, to resolve a problem that one-to-many or many-to-one mapping occurs in a single triplet in the complex relation.
According to a third aspect of embodiments of this application, a recommendation apparatus is provided. The recommendation apparatus includes: an obtaining unit, configured to obtain a knowledge graph, where the knowledge graph includes a plurality of first entities indicating items and a plurality of second entities indicating item attributes; and a running unit, configured to run a neural network based on the knowledge graph and information of a user, to obtain a recommendation result, where the information of the user indicates an item or an item attribute in a history record of the user. The recommendation result includes a recommended item. The neural network is used to perform computing based on at least one first direction and at least one second direction in the knowledge graph to obtain the recommendation result. The first direction starts from a first entity indicating the recommended item or a to-be-recommended item. The second direction starts from a first entity indicating the item in the history record of the user, or the second direction starts from a second entity indicating the item attribute in the history record of the user.
In an embodiment, the recommendation result further includes an explanation for the recommended item, and the explanation is associated with one first direction and one second direction.
In an embodiment, the running unit of the recommendation apparatus is specifically configured to extract a plurality of paths from the knowledge graph based on the knowledge graph and the information of the user. Each path indicates an association between entities at two ends of the path. One end of each path corresponds to the to-be-recommended item. The other end of each path corresponds to the item or the item attribute in the history record of the user. The running unit is specifically configured to input the plurality of paths into the neural network to score a plurality of to-be-recommended items, so that the neural network obtains the recommendation result. The neural network is used to perform computing based on a first direction and a second direction of each of the plurality of paths to obtain the recommendation result. For one path, the first direction is a direction in which one end of the path points to the other end of the path, and the second direction is a direction in which the other end of the path points to the one end of the path.
In an embodiment, the running unit of the communication apparatus is specifically configured to input the knowledge graph and the information of the user into the neural network. The running unit is specifically configured to extract, by the neural network, a plurality of paths from the knowledge graph based on the knowledge graph and the information of the user. Each path indicates an association between entities at two ends of the path, one end of each path corresponds to the to-be-recommended item. The other end of each path corresponds to the item or the item attribute in the history record of the user. The running unit is specifically configured to score a plurality of to-be-recommended items based on the first direction and the second direction of each of the plurality of paths, so that the neural network obtains the recommendation result. For one path, the first direction is a direction in which one end of the path points to the other end of the path, and the second direction is a direction in which the other end of the path points to the one end of the path.
In an embodiment, the first direction is consistent with a direction of a first vector. The second direction is consistent with a direction of a second vector. For one path, the first vector is a vector pointing from the first entity that is in the path and that indicates the to-be-recommended item to an entity that is in the path and that is adjacent to the first entity indicating the to-be-recommended item, and the second vector is a vector pointing from the first entity that is in the path and that indicates the item in the history record of the user to an entity that is in the path and that is adjacent to the first entity indicating the item in the history record of the user; or the second vector is a vector pointing from the second entity that is in the path and that indicates the item attribute in the history record of the user to an entity that is in the path and that is adjacent to the second entity indicating the item attribute in the history record of the user. The running unit is specifically configured to score, based on a probability that a first vector and a second vector of each of the plurality of paths are aggregated at one entity in the path, a to-be-recommended item corresponding to the path, to obtain the recommended item.
In an embodiment, the running unit of the communication apparatus is further configured to separately score the plurality of paths based on a first direction and a second direction of each of the plurality of paths, to obtain an explanation for the recommendation result. The explanation is a description of an association between entities at two ends indicated by the one or more paths in the obtained recommendation result.
In an embodiment, the first direction is consistent with the direction of the first vector. The second direction is consistent with the direction of the second vector. For one path, the first vector is the vector pointing from the first entity that is in the path and that indicates the to-be-recommended item to the entity that is in the path and that is adjacent to the first entity indicating the to-be-recommended item, and the second vector is the vector pointing from the first entity that is in the path and that indicates the item in the history record of the user to the entity that is in the path and that is adjacent to the first entity indicating the item in the history record of the user; or the second vector is the vector pointing from the second entity that is in the path and that indicates the item attribute in the history record of the user to the entity that is in the path and that is adjacent to the second entity indicating the item attribute in the history record of the user. The running unit is specifically configured to separately compute the probability that the first vector and the second vector of each path are aggregated at the one entity included in the path, to obtain the explanation for the recommendation result.
In an embodiment, the running unit of the communication apparatus is specifically configured to separately compute, by using the following formula, the probability that the first vector and the second vector of each path are aggregated at the one entity included in the path:
where
pm represents the probability; smleft represents the first vector and smright represents the second vector; or smright represents the first vector and smleft represents the second vector; exp represents an exponential function based on e; and M is a positive integer and M represents a quantity of paths between the to-be-recommended item and an item or an item attribute in a history record of a user.
In an embodiment, the neural network is obtained by training the neural network by using a training sample as an input for the neural network and a loss function value less than a threshold as an objective. The training sample includes a plurality of training paths, or the training sample includes the knowledge graph for training and the item and/or the item attribute in the history record of the user for training. Each training path indicates the association between entities at the two ends of the training path. The two ends of each training path correspond to the item or the item attribute in the history record of the user. The loss function includes a first loss function. The first loss function indicates a difference between the recommendation result output by the neural network and a target result. The target result is related to the item or the item attribute in the history record of the user.
In an embodiment, the loss function further includes a second loss function. The second loss function indicates a difference between embedding vectors of a first training vector and a second training vector in the training path of the knowledge graph for training. Two ends of the training path correspond to the item or the item attribute in the history record of the user.
In an embodiment, the loss function further includes a third loss function. The third loss function indicates a difference between a computed embedding vector of the first entity and a real embedding vector of the first entity, and/or the third loss function indicates a difference between a computed embedding vector of the second entity and a real embedding vector of the second entity.
According to a fourth aspect of embodiments of this application, a recommendation apparatus is provided. The recommendation apparatus includes: an obtaining unit, configured to obtain a knowledge graph, a first identifier, and a second identifier, where the knowledge graph includes a plurality of first entities indicating items and a plurality of second entities indicating item attributes, the first identifier indicates a to-be-recommended item, the second identifier indicates an item or an item attribute in a history record of a user, and the knowledge graph includes a first entity corresponding to the first identifier and a first entity or a second entity corresponding to the second identifier; and a running unit, configured to run a neural network based on the knowledge graph, the first identifier, and the second identifier, to obtain an evaluation result, where the evaluation result includes a probability that the to-be-recommended item is recommended. The neural network is used to perform computing based on at least one first direction and at least one second direction in the knowledge graph, to obtain the evaluation result. The first direction starts from the first entity corresponding to the first identifier. The second direction starts from the first entity or the second entity corresponding to the second identifier.
In an embodiment, the evaluation result further includes a probability corresponding to a path in the knowledge graph based on which the to-be-recommended item is recommended, one end of the path is the first entity corresponding to the first identifier, and the other end of the path is the first entity corresponding to the second identifier.
In an embodiment, the running unit of the communication apparatus is specifically configured to extract a plurality of paths from the knowledge graph based on the knowledge graph, the first identifier, and the second identifier. Each path indicates an association between entities at two ends of the path. One end of each path corresponds to the to-be-recommended item. The other end of each path corresponds to the item or the item attribute in the history record of the user. The running unit is specifically configured to input the plurality of paths into the neural network to score a plurality of to-be-recommended items, so that the neural network obtains the evaluation result. The neural network is used to perform computing based on the first direction and the second direction of each of the plurality of paths to obtain the probability. For one path, the first direction is a direction in which one end of the path points to the other end of the path, and the second direction is a direction in which the other end of the path points to the one end of the path.
In an embodiment, the running unit of the communication apparatus is specifically configured to input the knowledge graph, the first identifier, and the second identifier into the neural network. The running unit is specifically configured to extract, by the neural network, a plurality of paths from the knowledge graph based on the knowledge graph, the first identifier, and the second identifier. Each path indicates an association between entities at two ends of the path. One end of each path corresponds to the to-be-recommended item. The other end of each path corresponds to the item or the item attribute in the history record of the user. The running unit is specifically configured to score, based on a first direction and a second direction of each of the plurality of paths, a plurality of to-be-recommended items corresponding to the plurality of paths, so that the neural network obtains the evaluation result. For one path, the first direction is a direction in which one end of the path points to the other end of the path, and the second direction is a direction in which the other end of the path points to the one end of the path.
In an embodiment, the first direction is consistent with a direction of a first vector. The second direction is consistent with a direction of a second vector. For one path, the first vector is a vector pointing from the first entity that is in the path and that indicates the to-be-recommended item to an entity that is in the path and that is adjacent to the first entity indicating the to-be-recommended item, and the second vector is a vector pointing from the first entity that is in the path and that indicates the item in the history record of the user to an entity that is in the path and that is adjacent to the first entity indicating the item in the history record of the user; or the second vector is a vector pointing from the second entity that is in the path and that indicates the item attribute in the history record of the user to an entity that is in the path and that is adjacent to the second entity indicating the item attribute in the history record of the user. The running unit is specifically configured to separately compute a probability that the first vector and the second vector of each path are aggregated at the one entity included in the path.
In an embodiment, the running unit of the communication apparatus is specifically configured to separately compute, by using the following formula, the probability that the first vector and the second vector of each path are aggregated at the one entity included in the path:
where
pm represents the probability; smleft represents the first vector and smright represents the second vector; or smright represents the first vector and smleft represents the second vector; exp represents an exponential function based on e; and M is a positive integer and M represents a quantity of paths between the to-be-recommended item and an item or an item attribute in a history record of a user.
In an embodiment, the neural network is obtained by training the neural network by using a training sample as an input for the neural network and a loss function value less than a threshold as an objective. The training sample includes a plurality of training paths, or the training sample includes the knowledge graph for training, a first training identifier, and a second training identifier. An item indicated by the first training identifier and an item or an item attribute indicated by the second training identifier are an item or an item attribute in a historical item of the user. The loss function includes a first loss function. The first loss function indicates a difference between the evaluation result output by the neural network and a target result. The target result is related to the item or the item attribute in the history record of the user.
In an embodiment, the loss function further includes a second loss function. The second loss function indicates a difference between embedding vectors of a first training vector and a second training vector in the training path of the knowledge graph for training. Two ends of the training path correspond to the item or the item attribute in the history record of the user.
In an embodiment, the loss function further includes a third loss function. The third loss function indicates a difference between a computed embedding vector of the first entity and a real embedding vector of the first entity, and/or the third loss function indicates a difference between a computed embedding vector of the second entity and a real embedding vector of the second entity.
According to a fifth aspect of embodiments of this application, a recommendation apparatus is provided. The recommendation apparatus or a component (such as a processor, a chip, or a chip system) of the recommendation apparatus performs the method according to the first aspect or any possible implementation of the first aspect.
According to a sixth aspect of embodiments of this application, a recommendation apparatus is provided. The recommendation apparatus or a component (such as a processor, a chip, or a chip system) of the recommendation apparatus performs the method according to the second aspect or any possible implementation of the second aspect.
According to a seventh aspect of embodiments of this application, a computer-readable storage medium is provided. The computer-readable storage medium stores instructions, and when the instructions are executed on a computer, the computer is enabled to perform the method according to the first aspect or any possible implementation of the first aspect, or the second aspect or any possible implementation of the second aspect.
According to an eighth aspect, a computer program product is provided. When the computer program product is run on a computer, the computer is enabled to perform the method according to the first aspect or any possible implementation of the first aspect, or the second aspect or any possible implementation of the second aspect.
According to a ninth aspect of embodiments of this application, a recommendation apparatus is provided, including a processor. The processor is coupled to a memory. The memory is configured to store a program or instructions. When the program or the instructions is/are executed by the processor, the apparatus is enabled to perform the method according to the first aspect or any possible implementation of the first aspect.
According to a tenth aspect of embodiments of this application, a recommendation apparatus is provided, including a processor. The processor is coupled to a memory. The memory is configured to store a program or instructions. When the program or the instructions is/are executed by the processor, the apparatus is enabled to perform the method according to the second aspect or any possible implementation of the second aspect.
For technical effects brought by the third aspect, the fifth aspect, the seventh aspect, the eighth aspect, and the ninth aspect, or any one of the embodiments thereof, refer to the technical effects brought by the first aspect or the different embodiments of the first aspect. Details are not described herein again.
For technical effects brought by the fourth aspect, the sixth aspect, the seventh aspect, the eighth aspect, and the tenth aspect, or any one of the embodiments thereof, refer to the technical effects brought by the second aspect or the different embodiments of the second aspect. Details are not described herein again.
It can be learned from the foregoing technical solutions that embodiments of this application have the following advantages: The knowledge graph is obtained, and the neural network is run based on the knowledge graph and the information of the user, to obtain the recommendation result. Compared with the conventional technology in which only the history record of the user is used as the start point for computing (that is, in the conventional technology, the recommendation result is computed in one direction) to obtain the recommendation result, the present application performs the computing based on the at least one first direction and the at least one second direction in the knowledge graph to obtain the recommendation result. Therefore, the new recommendation method is provided, and the computing is performed in the two directions, so that the more accurate recommendation result is obtained, that is, the precision and/or the recall of the recommendation result can be improved.
To describe technical solutions in embodiments of the present application or in the conventional technology more clearly, the following briefly introduces the accompanying drawings for describing embodiments or the conventional technology. It is clear that the accompanying drawings in the following description show merely some embodiments of the present application, and persons of ordinary skill in the art may still derive other drawings from these accompanying drawings without creative efforts.
Embodiments of this application provide a recommendation method, a recommendation network, and a related device. A neural network may be run based on a knowledge graph and information of a user, to obtain a recommendation result. The neural network is used to perform computing based on at least one first direction and at least one second direction in the knowledge graph to obtain the recommendation result. Therefore, the new recommendation method is provided, so that a more accurate recommendation result can be obtained, that is, precision and/or a recall of the recommendation result can be improved. The new recommendation method may be applied to a scenario in which an item is recommended to a user, for example, movie recommendation or game recommendation.
The following describes technical solutions in embodiments of this application with reference to the accompanying drawings in embodiments of this application. It is clear that the described embodiments are only some rather than all of embodiments of this application.
The recommendation method provided in embodiments of this application can be applied to a scenario in which an item needs to be recommended, such as the movie recommendation, a product recommendation, a music recommendation, a restaurant recommendation, a book recommendation, or a web page recommendation. The following briefly describes a movie recommendation scenario and a game recommendation scenario.
According to the recommendation method provided in embodiments, the neural network performs bidirectional propagation computing (described in detail below) to conclude that, when a user is a fan of both Chen Kaige and Zhang Guorong, if a movie Farewell My Concubine is recommended for the user, satisfaction of the user on a recommendation result can be greatly improved.
In the scenario in which the game is recommended for a user, for example, there are numerous complex relations between a recommended destined game Team Fortress 2 (Team Fortress 2) and a history play record of the user.
Some terms or concepts in embodiments of this application are described, to help persons skilled in the art have a better understanding.
The neural network may include a neuron. The neuron may be an operation unit that uses Xs and an intercept of 1 as an input. An output of the operation unit may be as follows:
h
W,b(x)=f(WTx)=f(Σs=1nEsxs+b).
s=1, 2, . . . , or n, n is a natural number greater than 1, Ws is a weight of Xs, b is a bias of the neuron, and f is an activation function of the neuron, and is used to introduce a nonlinear feature into the neural network, to convert an input signal in the neuron into an output signal. The output signal of the activation function may be used as an input for a next convolutional layer. The activation function may be a sigmoid function. The neural network is a network formed by connecting many single neurons together. To be specific, an output of a neuron may be an input for another neuron. An input for each neuron may be connected to a local receptive field of a previous layer to extract a feature of the local receptive field. The local receptive field may be a region including several neurons.
The deep neural network (DNN), also referred to as a multi-layer neural network, may be understood as a neural network having a plurality of hidden layers. There is no special metric for “a plurality of” herein. The DNN is divided based on locations of different layers, and the neural network in the DNN may be divided into three types: an input layer, a hidden layer, and an output layer. Generally, a first layer is the input layer, a last layer is the output layer, and a middle layer is the hidden layer. Layers are fully connected. To be specific, any neuron at an ith layer is necessarily connected to any neuron at an (i+1)th layer. Certainly, the deep neural network may not include the hidden layer. This is not specifically limited herein.
Work at each layer of the deep neural network may be described by using a mathematical expression {right arrow over (y)}=α(W{right arrow over (x)}+{right arrow over (b)}). From a physical layer, work at each layer of the deep neural network may be understood as completing transformation from input space to output space (namely, from row space to column space of a matrix) by performing five operations on the input space (a set of input vectors). The five operations are as follows: 1. dimension increasing/dimension reduction; 2. scaling up/scaling down; 3. rotation; 4. translation; and 5. “bending”. The operation 1, the operation 2, and the operation 3 are performed by W{right arrow over (x)}, the operation 4 is performed by +{right arrow over (b)}, and the operation 5 is performed by α( ) The word “space” is used herein for expression because a classified object is not a single thing, but a type of things. Space is a collection of all individuals of such type of things. W is a weight vector, and each value in the vector indicates a weight value of one neuron in the neural network at this layer. The vector W determines space transformation from the input space to the output space described above. In other words, a weight W at each layer controls how to transform space. A purpose of training the deep neural network is to finally obtain a weight matrix (a weight matrix formed by vectors W at a plurality of layers) at all layers of the trained neural network. Therefore, a training process of the neural network is essentially a manner of learning and controlling the space transformation, and more specifically, learning the weight matrix.
During training of the neural network, because it is expected that an output of the neural network is as close as possible to a value that is actually expected to be predicted, a current predictive value of the network may be compared with a target value that is actually expected, and then a weight vector at each layer of the neural network is updated based on a difference between the current predictive value and the target value (certainly, there is usually an initialization process before a first update, that is, a parameter is preconfigured for each layer of the deep neural network). For example, if the predictive value of the network is large, the matrix vector is adjusted to lower the predictive value, and the adjustment continues until the neural network can predict the target value that is actually expected. Therefore, “how to obtain, through comparison, a difference between the predictive value and the target value” needs to be predefined. This is the loss function or an objective function. The loss function and the objective function are important equations that measure the difference between the predictive value and the target value. The loss function is used as an example. A higher output value (loss) of the loss function indicates a larger difference. Therefore, training of the deep neural network is a process of minimizing the loss as much as possible.
The embedding vector may also be referred to as “word embodiment”, “vectorization”, “vector mapping”, “embedding”, or the like. In form, the embedding vector is used to represent an object by using a dense vector, for example, a vector is used to represent a user identifier document (ID), an item ID, or the like.
The knowledge graph is a semantic network. The knowledge graph includes a plurality of entities and relations between the entities. The entity is a most basic element in the knowledge graph. The entity may be classified into a first entity and a second entity. The first entity is an entity indicating an item. The second entity is an entity indicating an item attribute. In an implementation, the second entity is directly connected to the first entity, that is, the first entity and the second entity are arranged in a staggered manner.
The item is an object that is distinguishable and exists independently, for example, a city, a commodity, a movie, or a game.
The item attribute can be considered as a description, an explanation, content, and an attribute of an item. Generally, the entity indicating the item attribute is directly connected to the entity indicating an item. For example, in a movie recommendation scenario, a movie genre of Farewell My Concubine is history. Farewell My Concubine is equivalent to the first entity indicating the movie. History is equivalent to the second entity indicating an attribute of the movie. The movie genre is a relation between the first entity and the second entity.
The triplet is a concept in a computer data structure, is a compression mode mainly used for storing a sparse matrix, and is also called a triplet table. For example, the triplet includes an entity 1 (which may be referred to as a node), a relation between the entity 1 and an entity 2 (which may also be referred to as an edge), and the entity 2 (which may be referred to as a node), which form the triplet stored in a computer.
The knowledge representation is mainly to embed entities and relations in the knowledge graph into continuous vector space.
The cross entropy is a common concept in deep learning, and is generally used to compute a difference between a target value and a predictive value. The cross entropy is often used as the loss function of the neural network.
The L2 norm is also referred to as a Euclidean norm, and represents a Euclidean distance from an origin to a point determined by a vector X.
The AUC is a performance metric for measuring merits and demerits of a learner, and may quantify a classification capability of ROC curve expression. The classification capability is closely related to a probability and a threshold. A larger AUC indicates a more reasonable output probability and a more reasonable ranking result.
Recommendation precision=Quantity of pieces of correct recommended information/Quantity of pieces of recommended information.
Recommendation recall=Quantity of pieces of correct recommended information/Quantity of pieces of information in a sample.
The neural network with a recommendation function in embodiments of this application, may also be understood as a recommendation network. To distinguish the neural network from an original neural network, the following replaces the neural network with the recommendation function with the recommendation network for description.
To better understand the recommendation method, the recommendation network, and the related device that are disclosed in embodiments of this application, the following first describes a system architecture that embodiments of the present application are used in. As shown in
The recommendation method provided in this application may be applied to a propagation computing and a loss function module of the recommendation network architecture 100. The recommendation network architecture is a general framework for neural network training oriented toward recommendation systems, and is more widely used in recent years.
To resolve a problem of how to obtain a recommendation result with a high accuracy in a bidirectional propagation manner when data is computed in embedding space, the propagation computing module runs a recommendation network based on a knowledge graph and information of a user, to obtain a recommendation result. The recommendation result includes a recommended item (or an identifier of a recommended item).
The recommendation network provided in this application may be a neural network, and may be specifically a deep neural network.
In an embodiment, to resolve a problem of how to obtain high-semantic explanation information when the data is computed in the embedding space, the propagation computing module uses a path as an input, and computes, in the bidirectional propagation manner from two ends of the path, a high-semantic explanation at nodes that are aggregated.
The loss function module resolves a problem that recommendation accuracy cannot be ensured in a recommendation network training process. In this application, a path loss function and a node loss function are added, so that performance metrics such as recommendation precision and a recall are greatly improved when a model is quickly aggregated, and reliability of the recommendation result and an explanation is improved.
A knowledge graph module is configured to create a knowledge graph or obtain an existing knowledge graph. For a description of the knowledge graph, refer to the foregoing description.
A new user processing module is configured to: when information such as a history record of the user is not newly added, first recommend some items to the user by using some policies (for example, a random method), quickly obtain feedback information (for example, node information such as an item ID or an item attribute clicked by the user), and then input these data into a network for more accurate recommendations.
A data obtaining module is configured to obtain the information of the user. The information of the user indicates an item (which may also be referred to as a historical item) or an item attribute (which may also be referred to as a historical item attribute) in the history record of the user.
In an embodiment, the data obtaining module may be configured to obtain the information of the user, item information, and related external knowledge. The external knowledge includes but is not limited to the attribute information of the item (for example, a director, a cast member, and a genre of a film; a developer, a carrier, and classification of a game; or composition of news, time related, places related, and persons related), or user dimension characteristics (for example, an age or an occupation of the user). These data is used to create the knowledge graph.
A data storage module is configured to store the foregoing related data. This is not specifically limited herein.
As shown in
The target model/rule 301 obtained by the training device 320 through training may be applied to different systems or devices, for example, applied to an execution device 310 in
A preprocessing module 313 is configured to perform preprocessing based on the input data received by the I/O interface 312. In this embodiment of this application, the preprocessing module 313 may be configured to randomly recommend some items to the user, and then determine a historical item (the historical item may be understood as the item in the history record of the user) based on feedback of the user. The preprocessing module 313 can be configured to: extract, based on the knowledge graph and the information of the user, the plurality of paths from the knowledge graph, and send the extracted paths to a computing module 311.
In a related processing process in which the execution device 310 preprocesses the input data or the computing module 311 of the execution device 310 performs computing, or the like, the execution device 310 may invoke data, code, and the like in a data storage system 350 to implement corresponding processing, or may store, in a data storage system 350, data, instructions, and the like obtained through corresponding processing.
Finally, the I/O interface 312 returns a processing result, for example, the foregoing obtained recommendation result, to the customer device 340, so as to provide the processing result to the user.
It should be noted that the training device 320 may generate corresponding target models/rules 301 based on different training data for different targets or different tasks. The corresponding target model/rule 301 may be used to implement the foregoing targets or complete the foregoing tasks, so as to provide a required result for the user.
In a case shown in
It should be noted that
As shown in
The following describes a hardware structure of a chip provided in an embodiment of this application.
The neural network processor 40 may be any processor appropriate for large-scale exclusive OR operation processing, for example, a neural-network processing unit (NPU), a tensor processing unit (TPU), or a graphics processing unit (GPU). For example, the neural network processor 40 is the NPU. The neural network processing unit NPU 40, as a coprocessor, is mounted to a host central processing unit (host CPU). The host CPU assigns a task. A core part of the NPU is an operation circuit 403. The controller 404 controls the operation circuit 403 to extract data in a memory (a weight memory or an input memory) and perform an operation.
In some implementations, the operation circuit 403 internally includes a plurality of processing units (PEs). In some implementations, the operation circuit 403 is a two-dimensional systolic array; or the operation circuit 403 may be a one-dimensional systolic array or another electronic line capable of performing mathematical operations such as multiplication and addition. In some implementations, the operation circuit 403 is a general-purpose matrix processor.
For example, it is assumed that there is an input matrix A, a weight matrix B, and an output matrix C. The operation circuit 403 extracts corresponding data of the matrix B from a weight memory 402, and buffers the data on each PE in the operation circuit 403. The operation circuit extracts data of the matrix A from an input memory 401, perform a matrix operation on the data of the matrix A and the data of the matrix B, and store a partial result or a final result of an obtained matrix in an accumulator 408.
A vector computing unit 407 may perform further processing on an output of the operation circuit, for example, vector multiplication, vector addition, exponential operation, logarithmic operation, or value comparison. For example, the vector computing unit 407 may be configured to perform network computing, such as pooling, batch normalization, or local response normalization, at a non-convolution/non-FC layer in a neural network.
In some implementations, the vector computing unit 407 can store a processed output vector into a unified memory 406. For example, the vector computing unit 407 may apply a non-linear function to the output, for example, a vector of an accumulated value, of the operation circuit 403, to generate an activation value. In some implementations, the vector computing unit 407 generates a normalized value, a combined value, or both. In some implementations, the processed output vector can be used as an activation input to the operation circuit 403, for example, used in a subsequent layer in the neural network.
The unified memory 406 is configured to store input data and output data.
For weight data, a direct memory access controller (DMAC) 405 transfers the input data in an external memory to the input memory 401 and/or the unified memory 406. The weight data from the external memory is stored in the weight memory 402, and data in the unified memory 406 is stored in the external memory.
A bus interface unit (BIU) 410 is configured to implement interaction between the host CPU, the DMAC, and an instruction fetch buffer 409 through a bus.
The instruction fetch buffer 409 connected to the controller 404 is configured to store instructions used by the controller 404.
The controller 404 is configured to invoke the instructions buffered in the instruction fetch buffer 409, to control a working process of an operation accelerator.
Generally, the unified memory 406, the input memory 401, the weight memory 402, and the instruction fetch buffer 409 each are an on-chip memory. The external memory is a memory outside the NPU. The external memory may be a double data rate synchronous dynamic random access memory (DDR SDRAM), a high bandwidth memory (HBM), or another readable and writable memory.
The following describes in detail the training method and the recommendation method of the recommendation network in embodiments of this application with reference to the accompanying drawings.
A training method of a recommendation network in an embodiment of this application is first described in detail with reference to
In an embodiment, the training method may be processed by a CPU, or may be processed by both a CPU and a GPU, or may not be processed by a GPU but another processor appropriate for computing of a neural network is used. This is not limited in this application.
The training method includes operation 501 and operation 502. The following describes the operation 501 and the operation 502 in detail.
Operation 501: Obtain a training sample.
There are a plurality of cases of the training sample in this embodiment of this application, and the cases are separately described below.
1. The training sample includes a plurality of training paths.
Each training path indicates an association between entities at two ends of the training path. The two ends of each training path correspond to an item or an item attribute in a history record of a user.
In this embodiment of this application, an item or an item attribute in a positive event in the history record of the user may be understood as an item or an item attribute once in the positive event of the user that is indicated by a first entity or a second entity. The positive event is an event in which the user does not clearly express a rejective or negative intention (that is, dislike, unconcern, or being unwilling to follow), or expresses a positive intention such as following, like, or interest. For example, that the user clicks, forwards, follows, comments, incentives (such as puts a coin and likes), favorites, browses, and browses an item or an item attribute for more than a threshold time all are positive events; and that the user clicks “Dislike”, clicks “Do not show me again”, clicks “Not interested”, and selects “Do not recommend this on my homepage” all are negative events, and an item or an item attribute related to such history record should be removed and not entered.
2. The training sample includes a knowledge graph for training and an item or an item attribute in a history record of a user for training.
In an embodiment, the knowledge graph for training and a knowledge graph in a specific recommendation process may be the same or different. This is not specifically limited herein.
In an embodiment, the knowledge graph for training includes a plurality of triplets. The plurality of triplets are connected with a same entity.
3. The training sample includes a knowledge graph for training, a first training identifier, and a second training identifier.
An item indicated by the first training identifier and an item or an item attribute indicated by the second training identifier each are an item (which may be referred to as a historical item) or an item attribute (which may also be referred to as a historical item attribute) in a historical item of a user.
Identifiers in this embodiment of this application (a first identifier, a second identifier, the first training identifier, and the second training identifier) each may be an ID, a name, or the like. This is not specifically limited herein.
There are the plurality of cases of the training sample in this embodiment of this application. The foregoing several cases are merely examples. It can be understood that in actual application, adaptive adjustment may be performed on the training sample based on a requirement. This is not specifically limited herein.
An item (a to-be-recommended item, a recommended item, and the item in the history record of the user for training) in this embodiment of this application may be a movie, a game, music, news, or the like. This is not specifically limited herein.
A relation between two entities in this embodiment of this application may be a movie genre, a leading actor, a director, a screenwriter, or the like; or may be a game developer, a game publisher, a game operating platform, a game type, a game release year, or the like; or may be a singer, a composer, a lyricist, or the like. This is not specifically limited herein.
Operation 502: Train the recommendation network by using the training sample as an input for the recommendation network and a loss function value less than a first threshold as an objective, to obtain the trained recommendation network.
The recommendation network provided in this embodiment of this application may be an independent recommendation system with explanation information, or may output explanation information for an existing recommendation system. In other words, the recommendation network provided in this embodiment of this application may output an explanation for the recommended item. The recommended item may be a recommended item output by the existing recommendation system. Certainly, the recommendation network provided in this embodiment of this application may further output a score of the recommended item and an explanation for the recommended item or the to-be-recommended item.
For a structure of the recommendation network in this embodiment of this application, refer to
The bidirectional propagation can be understood as performing computing from at least one first direction and at least one second direction to obtain a recommendation result (for a specific computing process, refer to embodiments shown in
Further, a plurality of to-be-recommended items are scored based on a first direction and a second direction of each of a plurality of paths. TOP K recommended items are selected as recommended items based on a requirement, where K is an integer greater than 1.
For example, if K is 2, to-be-recommended items corresponding to first two scores in the plurality of to-be-recommended items are determined as recommended items.
In an embodiment, the bidirectional propagation can also be understood as performing operation-by-operation computing from left and right endpoints of a path to an intermediate node. The path is extracted from the knowledge graph based on the knowledge graph and information of the user.
In an embodiment, one end of the path corresponds to the to-be-recommended item, and the other end of the path corresponds to the item (which can also be understood as the historical item) or the item attribute in the history record of the user. The intermediate node may be the item or the item attribute. This is not specifically limited herein.
In this embodiment of this application, the information of the user may include user-related information, such as a user ID, an item clicked by the user (which may also be understood as the historical item) or an item attribute of an item, an age of the user, and an occupation of the user.
In an embodiment, through the bidirectional propagation, the plurality of paths are separately scored based on the first direction and the second direction of each of the plurality of paths, to obtain the explanation for the recommendation result.
Further, the plurality of paths are scored based on the first direction and the second direction of each of the plurality of paths. TOP K recommended items are selected as recommended items based on the requirement, where K is an integer greater than 1.
In this embodiment of this application, for one path, the first direction is a direction in which one end of the path points to the other end of the path, and the second direction is a direction in which the other end of the path points to the one end of the path. One end of the path corresponds to the to-be-recommended item, and the other end of the path corresponds to the historical item or the historical item attribute.
For example, if K is 2, paths corresponding to first two scores in the plurality of paths are determined as the explanation for the recommendation result, or first two probabilities in probabilities corresponding to aggregation nodes in paths of both left and right flow directions of the plurality of paths are determined as the explanation for the recommendation result.
Further, an overlap probability of information in left and right flow directions of the path in embedding space is computed, and an overlap probability feature of an aggregation node is computed. The overlap probability feature is used for outputting a semantic-level explanation, that is, a preference degree of the user for an entity node. That is, the probability that vectors in two flow directions are aggregated is computed. The overlap probability feature indicates a probability that a connection between the to-be-recommended item and the history record of the user (the item or the item attribute clicked by the user) is activated (a larger probability indicates a more significant preference degree of the user). An objective of this operation is to find an interest of the user. For example, eight of ten movies watched by a user A are all cast by actor Stephen Chow Sing-Chi. In this case, there is a high probability that the path includes a node “Stephen Chow Sing-Chi”. The overlap probability can be used as the explanation for the recommended item, that is, an entity node with semantics in the path (which may also be understood as the second entity corresponding to the item attribute) can be used to describe potential preferences of the user (for example, a name of a movie director and a book type). An operation (for example, multiplication) is performed on an embedding vector of the attribute node and a probability that a corresponding path is activated, to obtain feedback information of a single path (for example, Feedback information=Probability value×Embedding vector of Stephen Chow Sing-Chi in the embedding space, that is, the feedback information may be v×Probability value p).
In
Each path includes a plurality of entities e and a relation R between the entities e, where e may be a first entity indicating an item, or may be a second entity indicating an item attribute, and m is an integer greater than 1.
The following describes the bidirectional propagation computing by using the pairing block 1 in
The pairing block 1 corresponds to a historical item i1. There are m paths between the pairing block 1 and the to-be-recommended item v. A first path includes v-R1st- e1-R2nd- . . . -e-Rend-i1. A second path includes v-R1st- e2-R2nd- . . . -e-Rend-i1An mth path includes v-R1st- em-R2nd- . . . -e-Rend-i1. R1st represents a relation matrix between the to-be-recommended item v and an entity e1. R2nd represents a relation matrix between the entity e1 and another entity e. Rend represents a relation matrix between another entity e and the historical item i1. Bidirectional propagation computing performed on the first path in the pairing block 1 is equivalent to computing a probability that vectors in left and right flow directions of the first path are aggregated, that is, a probability that the to-be-recommended item v and the historical item i1 in the first path flow to an intermediate entity.
In an embodiment, one path corresponds to one probability. Therefore, the probability may be used as a score of the path. Score selection is performed from high to low based on a requirement of TOP K, to obtain the explanation for the recommended item. A denominator for computing the probability may be a sum of m probabilities corresponding to m paths in one pairing block. Certainly, to avoid a probability of 100% caused by only a single path, and avoid affecting accuracy of subsequent recommendation, the denominator for computing the probability may be a sum of nxm probabilities corresponding to nxm paths in n pairing blocks. This is not specifically limited herein.
The explanation in this embodiment of this application may be the probability, or may be a description of an association between entities at two ends indicated by one or more paths of the obtained recommendation item.
R2nd may also be Rend, and e2 is em, that is, one path includes at least three entities (v, e, i) and two relation matrices (R1st, Rend). One pairing block corresponds to one u. After u corresponding to each pairing block is obtained, u corresponding to the pairing blocks can be directly added or accumulatively weighted summed to obtain a feedback embedding vector u of the user.
In this embodiment of this application, that a score of the to-be-recommended item is a distance (that is, ŷ) between the to-be-recommended item and the feedback embedding vector is used as an example. It can be understood that the score of the to-be-recommended item may alternatively be u corresponding to one pairing block. The score of the to-be-recommended item may alternatively be a feedback embedding vector obtained by adding the plurality of pairing blocks, or the like. This is not specifically limited herein.
In an embodiment, one pairing block corresponds to one u. Therefore, u may be used as the score of the to-be-recommended item. Score selection is performed from high to low based on a requirement of TOP K, to obtain the recommended item. Certainly, the feedback embedding vector obtained by adding u corresponding to the plurality of pairing blocks may alternatively be used as a score to select the recommended item. This is not specifically limited herein.
For example, a path between the to-be-recommended item v and the pairing block 1 is shown in
In an embodiment, the recommendation network further computes an abstract user weight and the score of the to-be-recommended item. The abstract user weight is used to compute the feedback embedding vector u of the user. The score computing is used to compute the distance (namely, the score ŷ of the to-be-recommended item) between the to-be-recommended item and the feedback embedding vector u. Further, the recommendation network may output, based on scores of the to-be-recommended items, to the user in descending order of scores, the to-be-recommended items (namely, recommended items) corresponding to the scores, or may select a score in descending order based on a requirement of the TOP K, to obtain the recommended item. Further, the abstract user weight may be used to update the embedding vector of the to-be-recommended item v, that is, p is a probability corresponding to a path with a maximum probability in the plurality of paths in the pairing block. Certainly, p may alternatively be set to 1, that is, the embedding vector of the to-be-recommended item v is not updated. This is not specifically limited herein.
The recommendation result in this embodiment of this application may include the to-be-recommended item. The recommendation result may alternatively include the to-be- recommended item and the explanation for the to-be-recommended item. Certainly, the recommendation result may alternatively include the to-be-recommended item, the explanation for the to-be-recommended item, and the score corresponding to the to-be-recommended item. This is not specifically limited herein.
In this embodiment of this application, a loss function indicates a difference between the recommendation result of the recommendation network and a target recommendation result. In this case, the recommendation network is trained by using an objective of reducing a value of the loss function, to be specific, continuously reducing the difference between the recommendation result of the recommendation network and the target recommendation result. Such training process may be understood as a recommendation task. The loss function may be understood as a loss function corresponding to the recommendation task.
In an embodiment, if the recommendation result includes the score of the to-be-recommended item, the score is used to describe a preference degree of the user for the to-be-recommended item. A higher score indicates that the user likes the to-be-recommended item more. Then, score selection is performed from high to low based on the requirement of the TOP K, to obtain the recommended item.
In an embodiment, if the recommendation result includes the score of the to-be-recommended item, the loss function includes a first loss function. The first loss function indicates a difference between a computed score of the recommended item output by the recommendation network and an actual score. The actual score is a score corresponding to the historical item selected by the user. The computed score is the score output by the recommendation network.
In an embodiment, the first loss function uses the following formula:
min L1=Σ−(y log (ŷ)+(1−y)log (1−ŷ)).
In a binary classification scenario, y is a positive or negative sample, that is, a value of y is 1 or 0, and ŷ represents a distance between u and v, and may be specifically a distance between uT and v. This is not limited herein.
Further, the loss function further includes a second loss function. The second loss function indicates a difference between embedding vectors of a first training vector and a second training vector in the path of the knowledge graph. For a training path, a first training vector is a vector pointing from a first entity that is in the training path and that indicates the to-be-recommended item to an entity that is in the training path and that is adjacent to the first entity indicating the to-be-recommended item. A second training vector is a vector pointing from a first entity that is in the training path and that indicates the item in the history record of the user to an entity that is in the training path and that is adjacent to the first entity indicating the item in the history record of the user, or the second training vector is a vector pointing from a second entity that is the training path and that indicates the item attribute in the history record of the user to an entity that is in the training path and that is adjacent to the second entity indicating the item attribute in the history record of the user. In the training process, the to-be-recommended item is also an item clicked by the user.
In an embodiment, the first training vector is the vector of the first entity corresponding to the to-be-recommended item in the path pointing to the entity adjacent to the first entity. The second training vector is the vector of the first entity corresponding to the historical item pointing to the entity adjacent to the first entity, or the second training vector is the vector of the second entity corresponding to the historical item attribute pointing to the entity adjacent to the second entity.
In other words, a purpose of the second loss function is to minimize a computing error of the vectors in the left and right flow directions of the path that flow to a same node, so that in a relation matrix with a higher dimension in the embedding space, a computed embedding vector in the left is equal to a computed embedding vector in the right as much as possible.
For example, two triplets in the training knowledge graph are respectively “Farewell My Concubine-leading actor-Zhang Guorong” and “Farewell My Concubine-leading actor-Zhang Fengyi”. In this case, intermediate entities “Zhang Guorong” and “Zhang Fengyi” are not a same entity, that is, the intermediate entities cannot be aligned, and the training of the recommendation network is difficult to converge. Therefore, the second loss function is introduced to minimize the difference between the first training vector and the second training vector (that is, a vector obtained by Farewell My Concubine+“First training relation: leading actor” is equal to a vector obtained by Days of Being Wild+“First training relation: leading actor” as much as possible, so that computing of the path is correct and convergence of the recommendation network is accelerated).
In an embodiment, the second loss function uses the following formula:
min L2=Σ−1nσ(max |εim−εim′|).
σ is an activation function, εim represents a first feature vector that flows from one end of the path to the intermediate entity (or an adjacent entity), εim′ represents a second feature vector that flows from the other end of the path to the intermediate entity (or the adjacent entity), and max represents a number with a maximum difference in a plurality of dimensions.
It can be understood that the foregoing formula may further have other variants. For example, max may be replaced with avg (representing an average of differences in the plurality of dimensions), εim is replaced with g (εim), and g ( ) represents graph computing. A specific formula of the second loss function may further have other variants. This is not specifically limited herein.
Further, the loss function may further include a third loss function. The third loss function indicates a difference between a computed embedding vector of an entity and a real embedding vector of the entity. The entity includes the first entity indicating the item and the second entity indicating the item attribute.
The foregoing example is still used. A purpose of the third loss function is to make a vector obtained by adding a head Farewell My Concubine of the triplet and the relation “leading actor” equal to a vector of a dimension “Zhang Guorong” as much as possible.
In an embodiment, the third loss function uses the following formula:
min L3=Σ−lnσ(sim−Mre)−X2; and
εim=simT*e
σ is an activation function, sim represents a computed embedding vector of the tail node obtained by flowing from a head node in the triplet to a tail node, sim can be understood as the first training vector or the second training vector, Mre is a real embedding vector of the tail node, and X2 is a regularization function, and is used to avoid overfitting. In an embodiment, X2 is mean-square error.
It can be understood that the foregoing formula may further have other variants. For example, sim may be replaced with g (sim ) , Mre is replaced with g(Mre) , and g ( ) represents graph computing. A specific formula of the third loss function may further have other variants. This is not specifically limited herein.
In an embodiment, if the loss function includes the first loss function, a value of the loss function is a value of the first loss function. If the loss function includes the first loss function and the second loss function, a value of the loss function may be a sum, an average, a weighted average or the like of a value of the first loss function and a value of the second loss function. This is not specifically limited herein. Similarly, if the loss function includes the first loss function, the second loss function, and the third loss function, a value of the loss function may be a sum, an average, a weighted average or the like of a value of the first loss function, a value of the second loss function, and a value of the third loss function. A specific manner of determining the value of the loss function is not limited herein.
In an embodiment, the value of the loss function may be computed in the following computing manner:
λ1, λ2, and λ3 are adjustable coefficients, and may be set based on an actual requirement. This is not specifically limited herein.
According to an aspect, the recommendation network provided in this application can score the to-be-recommended item through the bidirectional propagation computing, to obtain the recommended item; and score a path, to obtain the explanation for the to-be-recommended item, that is, to obtain an explanation for semantic information between any two connected entities. According to another aspect, the loss function provided in this embodiment of this application can improve accuracy of the recommendation network in the training process (for details, refer to the following evaluation of the recommendation network and a description corresponding to Table 1). Specifically, the second loss function provided in this embodiment of this application can be used to minimize an error that the two ends of the path flow to a same node, to resolve a problem that the two ends of the path flow to the same node in a complex relation. The third loss function in this embodiment of this application can minimize an error of a head node vector and a tail node vector that are of a triplet mapping to relational space, to resolve a problem that one-to-many or many-to-one mapping occurs in a single triplet in the complex relation.
It should be noted that the training process can alternatively use another training method except the foregoing method. This is not limited herein.
The following describes in detail the recommendation method in embodiments of this application with reference to the accompanying drawings.
In an embodiment, the method may be performed by a CPU, or may be a CPU and a GPU, or may be not performed by a GPU but another processor appropriate for neural network computing is used. This is not specifically limited herein.
The recommendation method in this embodiment of this application may be applied to scenarios such as movie recommendation, game recommendation, music recommendation, or news recommendation. This is not specifically limited herein. The following describes the recommendation method provided in this embodiment of this application by using a movie recommendation scenario as an example.
The method includes:
Operation 801: Obtain a knowledge graph.
The knowledge graph in this embodiment of this application includes a plurality of first entities indicating items and a plurality of second entities indicating item attributes. The knowledge graph may be an existing knowledge graph, or may be a newly created knowledge graph. This is not specifically limited herein.
In addition, for a description of the knowledge graph, refer to the foregoing term description and a description corresponding to
The knowledge graph in this embodiment of this application may have a function associated with a history record of a user (or may be understood that the knowledge graph can distinguish which items in the knowledge graph are not clicked by the user and which items are clicked by the user, that is, a path can be directly selected based on the knowledge graph), or may not have a function associated with a history record of a user (or may be understood that a path can be determined based on an identifier of the user, an identifier of an item, and the knowledge graph). This is not specifically limited herein.
The knowledge graph in this embodiment of this application may be a knowledge graph created for one user, or may be a knowledge graph created for a plurality of users. This is not specifically limited herein.
If the knowledge graph is the newly created knowledge graph, information of a user, item (a to-be-recommended item and a historical item) information, and related external knowledge are first obtained. The external knowledge includes but is not limited to attribute information of the item (for example, a director, a cast member, and a genre of a movie; a developer, a carrier, and classification of a game; or composition of news, time related, places related, and persons related), or user dimension characteristics (for example, an age distribution or an occupation of the user). Certainly, the obtained information may also be filtered, that is, different users are recommended by group. For example, users with similar age distributions are set as a group, to improve recommendation efficiency. The to-be-recommended item is an item that is not clicked by the user (for example, a movie that is not watched by the user). The historical item is an item that is clicked by the user (for example, a movie that is previously watched by the user).
In an embodiment, if the user is a new user, there may be a relatively small amount of item information, and some policies (for example, a random method) may be used to first recommend some items to the user. After feedback information is obtained, the item information and/or item attribute information are/is obtained.
In an embodiment, after the foregoing information is obtained, a triplet may be created first, and then triplets with a same node are connected to create the knowledge graph for recommendation.
For example, one triplet includes: Movie name: Farewell My Concubine-Relation: leading actor-Entity: Zhang Guorong. Another triplet includes: Movie name: Days of Being Wild-Relation: leading actor-Entity: Zhang Guorong. The two triplets have a common entity node (Zhang Guorong), and the two triplets are connected through Zhang Guorong.
Operation 802: Run a recommendation network based on the knowledge graph and information of the user, to obtain a recommendation result.
The recommended network used in this operation may be constructed by using the method in
In this embodiment of this application, there are a plurality of cases in which the recommendation network is run based on the knowledge graph and the information of the user, and the cases are separately described below.
Case 1: The recommendation network is run by using a path as an input.
Based on the knowledge graph and the information of the user, a plurality of paths are extracted from the knowledge graph. Each path indicates an association between entities at two ends of the path.
In an embodiment, the plurality of paths are input into the recommendation network, to score a plurality of to-be-recommended items, so that the recommendation network obtains the recommendation result.
Case 2: The recommendation network is run by using the knowledge graph and the information of the user as an input.
The knowledge graph and the information of the user are input into the recommendation network. The recommendation network extracts a plurality of paths from the knowledge graph based on the knowledge graph and the information of the user. Each path indicates an association between entities at two ends of the path.
In an embodiment, the information of the user may include a first identifier and a second identifier. The first identifier indicates a to-be-recommended item. The second identifier indicates the historical item or a historical item attribute. The knowledge graph includes a first entity corresponding to the first identifier and a first entity or a second entity corresponding to the second identifier.
In an embodiment, based on a first direction and a second direction of each of the plurality of paths, a plurality of to-be-recommended items are scored, so that the recommendation network obtains the recommendation result.
It may be understood that the foregoing several manners are merely examples. In actual application, there may be another input manner. This is not specifically limited herein.
Regardless of the foregoing input manners, an objective of the foregoing plurality of manners is to determine the plurality of paths, and then obtain the recommendation result. The recommendation result may include a recommended item. The recommendation result may alternatively include an explanation for the recommended item (which may also be understood as an evaluation result). The recommendation result may alternatively include the recommended item and an explanation for the recommended item. This is not specifically limited herein.
In an embodiment, for each of the plurality of paths, the to-be-recommended item corresponding to each path is scored based on a probability that a first vector and a second vector of the path are aggregated at one entity in the path, to obtain the recommended item.
In an embodiment, the plurality of paths are separately scored based on the first direction and the second direction of each of the plurality of paths, to obtain the explanation for the recommendation result. The explanation is a description of an association between the entities at the two ends indicated by the one or more paths in the obtained recommendation result.
With reference to
Operation 8021: Compute a first vector and a second vector of a path in the knowledge graph.
A direction of the first vector is consistent with the first direction. A direction of the second vector is consistent with the second direction. For one path, the first vector is a vector pointing from the first entity that is in the path and that indicates the to-be-recommended item to an entity that is in the path and that is adjacent to the first entity indicating the to-be-recommended item. The second vector is a vector pointing from the first entity that is in the path and that indicates the historical item to an entity that is in the path and that is adjacent to the first entity indicating the historical item; or the second vector is a vector pointing from the second entity that is in the path and that indicates the historical item attribute to an entity that is in the path and that is adjacent to the second entity indicating the historical item attribute.
For ease of description, it is defined that the first entity for indicating the item in this embodiment of this application is classified into a first recommendation entity and a first historical entity, that is, the first entity indicating the recommended item or the to-be-recommended item is the first recommendation entity, and the first entity indicating the item in the history record of the user is the first historical entity. The path includes the first recommendation entity, a first associated entity, a first relation between the first recommendation entity and the first associated entity, a second associated entity, the first historical entity, and a second relation between the first historical entity and the second associated entity. The first associated entity and/or the second associated entity may be an entity for indicating the item or an entity for indicating the item attribute. This is not specifically limited herein.
For example, one first recommendation entity and three first historical entities are used as an example. As shown in
A first path: Farewell My Concubine-Genre-Hong Kong movie-Genre-Days of Being Wild.
A second path: Farewell My Concubine-Leading actor-Zhang Guorong-Leading actor-Days of Being Wild.
A third path: Farewell My Concubine-Genre-History-Genre-White Vengeance.
A fourth path: Farewell My Concubine-Director-Chen Kaige-Director-Forever Enthralled.
A fifth path: Farewell My Concubine-Type-Beijing opera-Type-Forever Enthralled.
After the path is obtained, the first vector and the second vector in the path can be determined. The first vector is a vector of one end of the path (for example, the first recommendation entity) pointing to the intermediate entity. The second vector is a vector of the other end of the path (for example, the first historical entity) pointing to the intermediate entity. The intermediate entity can be understood as an entity adjacent to the first recommendation entity or an entity adjacent to the first historical entity. The intermediate entity is the first associated entity and/or the second associated entity. The first vector and the second vector may be multi-dimensional vectors
The foregoing example is still used. For the second path, the first vector may be a vector from Farewell My Concubine to Zhang Guorong. The second vector may be a vector from Days of Being Wild to Zhang Guorong. In this case, the intermediate entity is Zhang Guorong. It can be understood that the first vector may alternatively be a vector from Days of Being Wild to Zhang Guorong. The second vector may alternatively be a vector from Farewell My Concubine to Zhang Guorong. That is, for
In an embodiment, the first vector and the second vector can be computed by using the following formulas:
s
m
left
=M
rτ+rm1st; and
s
m
right
=M
r
i
u
+r
m
end.
smleft represents the first vector and smright represents the second vector; or smleft represents the second vector and smright represents the first vector; M r represents a relation matrix between the to-be-recommended item and the item or the item attribute corresponding to the intermediate entity; τ represents an embedding vector of the to-be-recommended item; rm1st represents an embedding vector of a relation between the to-be-recommended item and the item or the item attribute corresponding to the intermediate entity; iu represents an embedding vector of the historical item; rmend represents an embedding vector of a relation between the historical item and the item or the item attribute corresponding to the intermediate entity; and m is an integer greater than 0, and a value of m is the same as a quantity of paths. An example in which smleft represents the first vector and smright represents the second vector is used below for description.
It can be understood that the first vector and the second vector can alternatively be computed by using another formula. This is not specifically limited herein.
The foregoing example in
Operation 8022: Separately compute a probability that the first vector and the second vector of each path are aggregated at one entity (which may also be referred to as the intermediate entity) included in each path.
After the first vector and the second vector of the path are determined, a probability that the first vector and the second vector coincide in a mapping relation between the first vector and the second vector (that is, the probability that the first vector and the second vector are aggregated) is computed. The probability can be abstracted as an embedding vector of a user dimension, and may reflect a preference degree of the user for the intermediate entity. The probability value can be used to indicate importance of each path.
It can be understood that, if a quantity of nodes of the path is an odd number, the intermediate entity is an intermediate node of the path; or if a quantity of nodes of the path is an even number, the intermediate entity may be any one of two nodes in the middle of the path (generally, a node of two middle nodes that is closer to the to-be-recommended item is determined as the intermediate entity).
For example, eight of ten movies watched by the user are all cast by the actor Zhang Guorong. In this case, there is a high probability that the path includes the node “Zhang Guorong”.
In an embodiment, the probability that the first vector and the second vector of each path are aggregated at the intermediate entity may be computed by using the following formula:
pm represents the probability, smleft represents the first vector, smright represents the second vector, exp represents an exponential function based on e, M is a positive integer and M represents a quantity of paths between the to-be-recommended item and an item or an item attribute in a history record of a user (for example, three Ms in
It can be understood that the probability that the first vector and the second vector are aggregated at the same intermediate entity can alternatively be computed by using another formula. This is not specifically limited herein.
For example, with reference to
Mathematical expressions of a probability p1 that the first vector and the second vector are aggregated at a Hong Kong movie are as follows:
In an embodiment, a denominator is similar to a description corresponding to
A mathematical expression of a probability p2 that the first vector and the second vector are aggregated at Zhang Guorong is as follows:
A mathematical expression of a probability p3 that the first vector and the second vector are aggregated at History is as follows:
A mathematical expression of a probability p4 that the first vector and the second vector are aggregated at Chen Kaige is as follows:
A mathematical expression of a probability p5 that the first vector and the second vector are aggregated at Beijing opera is as follows:
The probability that the first vector and the second vector are aggregated at the same intermediate entity is computed, that is, the explanation for the to-be-recommended item is obtained. For example, the foregoing example is still used. The explanation may include: an explanation 1: a path attribute node (Hong Kong movie) corresponding to p1, and an explanation 2: a path attribute node (Zhang Guorong) corresponding to p2.
Operation 8023: Determine a feedback embedding vector of the user based on the probability and an embedding vector corresponding to the intermediate entity.
In an embodiment, after the probability that the first vector and the second vector are aggregated at the same intermediate entity is computed, the feedback embedding vector of the user can be determined based on the probability and the embedding vector corresponding to the intermediate entity. In other words, feature values including user preferences in each group can be first summarized as the feedback embedding vector of the user.
In an embodiment, the feedback embedding vector of the user can be computed by using the following formulas:
cn represents a feedback embedding vector of the to-be-recommended item and an nth historical item in one pairing block; em represents the embedding vector of the intermediate entity; pm represents a probability that a left vector and a right vector are aggregated at a same intermediate entity; u represents the feedback embedding vector of the user; and N is an integer greater than 1 and N represents a quantity of historical items.
It can be understood that the feedback embedding vector of the user can alternatively be computed by using another formula. This is not specifically limited herein.
For example, the foregoing example in
e1 is an embedding vector corresponding to the Hong Kong movie; e2 is an embedding vector corresponding to Zhang Guorong; e3 is an embedding vector corresponding to History; e4 is an embedding vector corresponding to Chen Kaige; and e5 is an embedding vector corresponding to Beijing opera. For an explanation for another parameter, refer to the explanation corresponding to the foregoing formula. Details are not described herein again.
In other words, an entity node with semantics in the path can be used to describe potential preferences of the user (for example, a name of the leading actor of a movie, a genre of a movie, or a name of a movie director). An operation (for example, multiplication) is performed on embedding vectors of these attribute nodes and a probability corresponding to a path, so that feedback information of a single path can be obtained.
For example, the foregoing example in
Operation 8024: Compute a distance between an embedding vector corresponding to the to-be-recommended item and the feedback embedding vector, to obtain a score of the to-be-recommended item.
In an embodiment, the distance between the embedding vector corresponding to the to-be-recommended item and the feedback embedding vector may alternatively be computed, to obtain the score of the to-be-recommended item.
In this embodiment of this application, the distance between the embedding vector corresponding to the to-be-recommended item and the feedback embedding vector may be computed by using a method such as a vector dot product, a cosine value, or a similarity method. The following uses the vector dot product as an example.
In an embodiment, the distance between the embedding vector corresponding to the to-be-recommended item and the feedback embedding vector can be computed by using the following formula:
ŷ=f(u, τ)=σ(uT·τ).
ŷ represents the score; τ represents the embedding vector of the to-be-recommended item; σ is an activation function; uT represents a transposed matrix corresponding to the feedback embedding vector u of the user; and represents an inner product operation.
It can be understood that the distance between the embedding vector corresponding to the to-be-recommended item and the feedback embedding vector can alternatively be computed by using another formula. This is not specifically limited herein.
In an embodiment, the activation function may be a SIGMOID function, and is used to limit the score within [0, 1].
A smaller score indicates that the to-be-recommended item is closer to the feedback embedding vector of the user, and indicates that the user likes the to-be-recommended item more.
For ease of understanding, the following uses an embedding vector dimension of 10 and a computing manner of inner product of vectors as an example to describe the foregoing score computing process: If a movie set {Farewell My Concubine, Tiny Times, Interstellar} is recommended, and the recommendation network completes computing of a weight of a training embedding vector, the following embedding vectors are computed:
Score computing results are as follows:
Values output after the activation function (that is, recommendation scores):
Operation 8025: Determine the recommended item based on the score of the to-be-recommended item.
After the score of the to-be-recommended item is obtained, the score is selected in descending order based on a requirement of TOP K, to obtain the recommended item.
In an embodiment, after the scores of the to-be-recommended items are obtained, the to-be-recommended items can be ranked based on scores. If Top 5 are selected, to-be-recommended items with scores ranked in the first 5 are selected from the recommended movie set as the recommended items, and are displayed to the user.
For example, the foregoing example is still used. For the user A, the recommended items are ranked as Farewell My Concubine, Interstellar, and Tiny Times. If top 1 is selected, Farewell My Concubine is the recommended item.
Operation 8026: Obtain an explanation for the recommended item or the to-be-recommended item based on the probability. This operation is optional.
After the probability of the path is obtained, the probability may be abstracted into the embedding vector of the user dimension, and the probability is output (for example, the explanation for the to-be-recommended item or the recommended item). This can reflect the preference degree of the user for the intermediate entity. A value of the probability can be used to indicate the importance of each path.
In an embodiment, the explanation for the recommended item or the to-be-recommended item may alternatively be displayed in a graph. The graph shown in
As shown in
In an embodiment, the explanation may be the probability, or may be the description of the associations between the entities at the two ends indicated by the one or more paths in the obtained recommendation result.
For example, the foregoing example is still used. If a distance between an embedding vector of Farewell My Concubine is very close to an embedding vector of Days of Being Wild in the embedding space, the following explanation statement may be generated: “A movie Farewell My Concubine is recommended to the user because the movie Farewell My Concubine is very similar to the movie Days of Being Wild that is watched by the user, and both of them are cast by the star Zhang Guorong”.
Operation 8027: Evaluate accuracy of the recommended item based on the score. This operation is optional.
In an embodiment, after the score of the recommended item is obtained, the accuracy of the recommended item may be evaluated based on the score. Alternatively, it can be understood that accuracy of the recommendation network is evaluated based on the score.
In this embodiment of this application, the accuracy of the recommended item is evaluated based on the score in a plurality of manners, which are separately described below.
1. Precision (namely, the foregoing recommendation precision)
The precision is computed as described above. Recommendation precision=Quantity of pieces of correct recommended information/Quantity of pieces of recommended information.
A specific process is as follows: Historical items watched by the user are divided into a training set and a test set without repetition. The training set is used for the training method in an embodiment shown in
For example, there are 40 historical items watched by the user, where the training set includes 30 historical items, and the test set includes 10 historical items. 100 to-be-recommended items are input to the recommendation network. If four recommended items in Top 5 items that are recommended are historical items watched by the user, the precision=4/5=0.8.
2. Recall (namely, the foregoing recommendation recall)
The recall is computed as described above. Recall=Quantity of pieces of correct recommended information/Quantity of pieces of information in a sample.
For example, the foregoing example of computing the precision is still used. The recall is 4/10=0.4.
3. AUC
A specific description of the AUC is described above, and the AUC is to compute an area under a ROC curve. It mainly evaluates correctness of a single recommended item and whether overfitting occurs, and determines whether the single recommended item is “true positive” or “false positive” by using a binary classifier and a threshold.
For example, the threshold is 0.5. If an output result of the recommended item in the binary classifier is 0.7, that is, greater than the threshold, the recommended item is correct; or if an output result of the recommended item in the binary classifier is 0.3, that is, less than the threshold, the recommended item is incorrect.
It can be understood that the foregoing three manners of evaluating the recommendation network are merely examples. In actual application, the accuracy of the recommended item can be evaluated in another manner (for example, accuracy). This is not specifically limited herein.
The evaluation manner in embodiments of this application may be applied to a training phase of the recommendation network (for example, the training method shown in
To more intuitively see recommendation accuracy of the recommendation network provided in embodiments of this application, an existing recommendation model is compared with TB-Net in embodiments of this application.
The existing recommendation models include a ripple netrecommendation model, a collaborative knowledge base embedding (CKE) recommendation network, a Bayesian personalized ranking matrix factorization (BPRMF) recommendation algorithm, a neural factorization machine (NFM) recommendation algorithm, a deep knowledge-aware network (DKN), and a recommendation model of Wide&Deep.
1. Comparison results in metrics of the precision and the recall
For a comparison result in the precision, refer to
The comparison results are obtained when the TB-Net provided in this application and four mainstream models (the ripple net, the CKE, the BPRMF, and the NFM) are used for training news recommendation dataset: MIND (Microsoft news dataset), to complete a task of recommending news. The metrics of the precision and the recall are used for measuring the accuracy of the recommended item. It can be learned from the comparison results of
2. Comparison results in metrics of AUC and the accuracy.
Table 2 shows the comparison results in the metrics of the AUC and the accuracy.
It can be learned from the comparison results in Table 2 that, in the metrics of the AUC and the recommended accuracy, the recommendation network (that is, the TB-Net) in embodiments of this application is better than that in the conventional technology, and is basically equal to an industry-leading deep learning recommendation model (the ripple net launched by Microsoft and the Wide&Deep launched by Google), and is improved to a specific degree. The recommendation network provided in embodiments of this application features high recommendation accuracy.
Operation 8028: Evaluate accuracy of the explanation based on the probability and the knowledge graph. This operation is optional.
In an embodiment, after the probability corresponding to the path is obtained, ranking may be performed based on a value of the probability; and the accuracy of the explanation for the to-be-recommended item or the recommended item is evaluated based on the ranking and degree ranking of subgraph nodes in the knowledge graph. A degree of a node refers to a quantity of edges (that is, a quantity of connected relations) associated with the node.
For example, if the explanation corresponds to Top 2, two of p2 (that is, a probability corresponding to the second path in which Zhang Guorong is located) and p4 (that is, a probability corresponding to the fourth path in which Chen Kaige is located) with high probabilities in the path are used. Top two nodes ranking in the degree ranking in the knowledge graph are respectively the Zhang Guorong node and the Beijing opera node. In other words, both the probability p2 corresponding to Zhang Guorong and the Zhang Guorong node rank in the top two in the degree ranking in knowledge graph, and the probability p4 corresponding to Chen Kaige and the Chen Kaige node do not rank in the top two in the degree ranking in knowledge graph. It indicates that an explanation for the second path in which Zhang Guorong is located is accurate, and an explanation for the fourth path in which Chen Kaige is located is not accurate compared with the explanation corresponding to Zhang Guorong.
To more intuitively see the accuracy of the explanation output by the recommendation network provided in embodiments of this application, comparison is performed on explanations corresponding to the existing recommendation explanation model and the recommendation network in embodiments of this application in the movie recommendation scenario (MovieLens dataset). The recommendation network provided in embodiments of this application may also be referred to as a tower-bridge net (TB-Net), or referred to as a bidirectional significant explainable recommendation network.
The existing recommendation explanation model includes a ripple net recommendation model, an attentive multitask collaborative filtering (AMCF) model, and an interpretable linear regression model (LR).
Table 3 shows a comparison result of the explanations.
A recall of the explanation is equivalent to TopG@K. K indicates a quantity of explanations output by a recommendation model. G indicates a quantity of explanations that are recognized by the user and that rank in front in a dataset. For example, Top1@A3 indicates a probability that a first explanation in the dataset exists in three explanations output by the recommendation model, which is equivalent to determining, by using user feedback, whether an explanation for the recommended item output by the recommendation model meets a real expectation of the user.
The comparison result proves that, compared with the existing recommendation explanation model (the ripple net, the AMCF, and the LR), the TB-Net provided in embodiments of this application significantly improves accuracy of output explanation information. The recommendation network provided in embodiments of this application features both high interpretability and high recommendation accuracy, that is, providing a semantic explanation while maintaining the high recommendation accuracy.
In this embodiment of this application, a time sequence relation is not limited in each operation shown in
It can be understood that, if the recommendation network outputs only the explanation for the to-be-recommended item, operations shown in
In a possible implementation, operations of obtaining the recommendation result based on the plurality of paths may include the operation 8021 to the operation 8025. In another possible implementation, operations of obtaining the recommendation result based on the plurality of paths may include the operation 8021 to the operation 8026. In another possible implementation, operations of obtaining the recommendation result based on the plurality of paths may include the operation 8021 to the operation 8027. In another possible implementation, operations of obtaining the recommendation result based on the plurality of paths may include the operation 8021 to the operation 8028.
In embodiments of this application, in one aspect, the high-semantic explanation information can be provided for the to-be-recommended item or the recommended item through the bidirectional propagation computing, and can be grafted with the recommendation system in the conventional technology, that is, the high-semantic explanation information can be provided for a recommended item output by the existing recommendation system. In another aspect, the overlap probability of information in the left and right flow directions of the path aggregated at the intermediate entity is computed at the aggregation point (namely, the intermediate entity), to indicate importance of the path. In another aspect, preference feedback information of the user (that is, the feedback embedding vector of the user) can be obtained by computing the overlap probability and the intermediate entity. The feedback embedding vector is used for outputting a trustworthy and reliable semantic explanation. In another aspect, the preference feedback information at two ends of the path can be aggregated to form the feedback embedding vector, and the feedback embedding vector and the to-be-recommended item are computed to output the recommended item, to obtain the more accurate recommendation result, that is, to improve the recommendation recall.
The recommendation method in embodiments of this application is described above. Recommendation apparatuses in embodiments of this application are described below. A neural network is equivalent to the foregoing recommendation network. As shown in
In this embodiment, operations performed by units in the recommendation apparatus are similar to those described in embodiments shown in
In this embodiment, the running unit 1402 can run the neural network based on the knowledge graph and the information of the user, to obtain the recommendation result. The neural network is used to perform computing based on the at least one first direction and the at least one second direction in the knowledge graph to obtain the recommendation result. Therefore, a new recommendation method is provided.
As shown in
In this embodiment, operations performed by units in the recommendation apparatus are similar to those described in embodiments shown in
In this embodiment, the running unit 1502 can run the neural network based on the knowledge graph, the first identifier, and the second identifier, to obtain the evaluation result. The neural network is used to perform computing based on the at least one first direction and the at least one second direction in the knowledge graph, to obtain the probability that the to-be-recommended item is recommended. The probability can be used to output an explanation for the to-be-recommended item, thereby providing an explanatory recommendation method.
As shown in
In addition, the processor 1601 may be a central processing unit, a general-purpose processor, a digital signal processor, an application-specific integrated circuit, a field programmable gate array or other programmable logic devices, a transistor logic device, a hardware component, or any combination thereof. The processor may implement or perform various example logical blocks, modules, and circuits described with reference to content disclosed in this application. The processor may be a combination of processors implementing a computing function, for example, a combination of one or more microprocessors, or a combination of a digital signal processor and a microprocessor. It may be clearly understood by persons skilled in the art that, for the purpose of convenient and brief description, for a detailed working process of the foregoing system, apparatus, and unit, refer to a corresponding process in the foregoing method embodiments, and details are not described herein again.
It should be noted that the recommendation apparatus shown in
An embodiment of this application further provides a computer-readable storage medium that stores one or more computer-executable instructions. When the computer-executable instruction is executed by a processor, the processor performs a method according to embodiments in the foregoing embodiments. The recommendation apparatus may specifically perform operations in method embodiments corresponding to
An embodiment of this application further provides a computer program product. When the computer program product is run by a processor, the processor performs a method in the foregoing embodiments. The recommendation apparatus may specifically perform operations in method embodiments corresponding to
An embodiment of this application provides a chip system. The chip system includes a processor, configured to support a recommendation apparatus in implementing functions in the foregoing embodiments of the recommendation apparatus. In a possible design, the chip system further includes a memory. The memory is configured to store program instructions and data that are necessary for a recommendation apparatus. The chip system may include a chip, or may include a chip and another discrete device. The recommendation apparatus may specifically perform operations in method embodiments corresponding to
In several embodiments provided in this application, it should be understood that the disclosed system, apparatus, and method can be implemented in other manners. For example, the described apparatus embodiment is merely an example. For example, division into the units is merely logical function division and may be other division in actual implementation. For example, a plurality of units or components may be combined or integrated into another system, or some features may be ignored or may not be performed. In addition, the displayed or discussed mutual couplings or direct couplings or communication connections may be indirect couplings or communication connections through some interfaces, apparatuses or units, and may be implemented in electrical, mechanical, or other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on a plurality of network elements. Some or all of the units may be selected based on actual requirements to achieve the objectives of the solutions of embodiments.
In addition, functional units in embodiments of this application may be integrated into one processing unit, or each of the units may exist alone physically, or two or more units are integrated into one unit. The integrated unit may be implemented in a form of hardware, or may be implemented in a form of a software functional unit.
When the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, the integrated unit may be stored in a computer-readable storage medium. Based on such an understanding, the technical solutions of this application essentially, or the part contributing to the conventional technology, or all or some of the technical solutions may be implemented in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for instructing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the operations of the methods described in the embodiments of this application. The foregoing storage medium includes any medium that can store the program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
Number | Date | Country | Kind |
---|---|---|---|
202011640574.7 | Dec 2020 | CN | national |
This application is a continuation of International Application No. PCT/CN2021/141472, filed on Dec. 27, 2021, which claims priority to Chinese Patent Application No. 202011640574.7, filed on Dec. 31, 2020. The disclosures of the aforementioned applications are hereby incorporated by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2021/141472 | Dec 2021 | US |
Child | 18215959 | US |