The present invention relates to artificial intelligence (AI) and machine learning (ML) and in particular to a method, system and computer-readable medium for providing a joint prediction for a group of individual entities.
In an embodiment, the present invention provides a computer-implemented method of consolidating recommendations based on a plurality of individual recommendations. The method is implemented in one or more processors connected to a memory and includes receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities. Each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities. Using a trained prediction learning model, a prediction is determined for each source entity based on the knowledge graph. The trained prediction model was trained using prediction training data including historical data. The prediction for each source includes recommendation data identifying one or multiple target entities. Using a trained consolidation learning model, a consolidated prediction is determined for the plurality of source entities based on the prediction for each source entity. The trained consolidation learning model was trained using consolidation training data including the historical data and the recommendation data. The consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.
Subject matter of the present disclosure will be described in even greater detail below based on the exemplary figures. All features described and/or illustrated herein can be used alone or combined in different combinations. The features and advantages of various embodiments will become apparent by reading the following detailed description with reference to the attached drawings, which illustrate the following:
Often preferences of different individuals contradict each other, but common environments of individuals can require a joint decision. Embodiments of the present invention provide a mechanism where the predictions of a group of individual entities are consolidated into one joint prediction, which respects individual wishes as well as possible rankings of individuals within the group.
AI systems typically make predictions for one particular entity. However, often different entities are linked and the goal can be to arrive at one joint solution. For example, consider a group of people who would like to go together on vacation. Individually, the people might choose a different vacation destination than the one they would choose as a group. This should be respected when the AI system gives vacation recommendations to users. Embodiments of the present invention therefore have a technical aim and serve a specific technical purpose, in particular to improve the technical functionality and flexibility of non-human, computerized and automated or semi-automated AI systems. In particular, the accuracy of joint predictions, and the ability to make joint predictions generally, are improved.
Existing or naive solutions suffer from two major drawbacks: (1) if a group of entities (e.g., individuals) is represented as one entity, then it is not possible to respect the wishes of individual entities, nor is it possible to later leverage learned information if one entity becomes part of another group of entities; and (2) if a majority vote is taken across individual predictions, this does not respect the strength of individual wishes, nor the aspect that the prediction of one entity might have a stronger influence on the final prediction than the prediction of other entities.
In an embodiment, method is provided with which the predictions of a group of individuals can be consolidated to form one joint prediction. In an embodiment, the group is not modeled as a whole (as this would lead to a loss of information about the individuals of the group), which is advantageous and provides improvements over existing or naïve solutions for several reasons: (1) the individual entities are able to later be a part of other groups; (2) the individual entities are able to have varying strengths of preferences with regards to certain prediction that can be taken into account; and (3) the prediction of some individual entities are able to weigh higher than that of others. Another advantageous improvement is that the links between group members can be explicitly encoded in order to learn the relations between the group members.
Embodiments use a knowledge graph which includes triples t=(h, r, t), where h and t are entities and r is a relation. Entities of the knowledge graph are grouped in the following subgroups: (1) source entity; (2) target entity; and (3) attribute entity (see
In a first aspect, the present disclosure provides a computer-implemented method of consolidating recommendations based on a plurality of individual recommendations. The method is implemented in one or more processors connected to a memory and includes receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities. Each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities. Using a trained prediction learning model, a prediction is determined for each source entity based on the knowledge graph. The trained prediction model was trained using prediction training data including historical data. The prediction for each source includes recommendation data identifying one or multiple target entities. Using a trained consolidation learning model, a consolidated prediction is determined for the plurality of source entities based on the prediction for each source entity. The trained consolidation learning model was trained using consolidation training data including the historical data and the recommendation data. The consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.
In a second aspect, the present disclosure provides the method according to the first aspect, wherein one or more of the plurality of source entities are linked to one or more other source entities and/or one or more target entities.
In a third aspect, the present disclosure provides the method according to the first or second aspect, wherein the determining, using a trained prediction learning model, a prediction for each source entity includes learning vector representations of the knowledge graph.
In a fourth aspect, the present disclosure provides the method according to any of the aspects above, wherein the consolidated prediction identifies multiple target entities in a ranked order.
In a fifth aspect, the present disclosure provides the method according to any of the aspects above, further comprising applying source entity constraints of one or more of the plurality of source entities to the ranked order to create a filtered ranked order of the identified multiple target entities.
In a sixth aspect, the present disclosure provides the method according to any of the aspects above, wherein the recommendation data for each prediction includes a prediction explanation, and the consolidated prediction includes a consolidated prediction explanation.
In a seventh aspect, the present disclosure provides the method according to any of the aspects above, wherein the prediction for each source entity includes a weight, and wherein the determining a consolidated prediction is further based on the weights of the predictions for each source entity.
In an eighth aspect, the present disclosure provides the method according to any of the aspects above, further comprising: fusing a new source entity into the knowledge graph by linking the new source entity to one or more of the plurality of attribute entities to produce a fused knowledge graph, updating the step of determining a prediction for each source entity and the new source entity using the fused knowledge graph, and updating the step of determining a consolidated prediction for the plurality of source entities and the new source entity.
In a ninth aspect, the present disclosure provides a system configured for consolidating recommendations based on a plurality of individual recommendations, the system comprising one or more processors, which alone or in combination, are configured to provide for execution of a method comprising: receiving a knowledge graph including a plurality of source entities, a plurality of target entities and a plurality of attribute entities, wherein each source entity is linked to one or more of the plurality of attribute entities, and each target entity is linked to one or more of the plurality of attribute entities; determining, using a trained prediction learning model, a prediction for each source entity based on the knowledge graph, the trained prediction model having been trained using prediction training data including historical data, wherein the prediction for each source includes recommendation data identifying one or multiple target entities; and determining, using a trained consolidation learning model, a consolidated prediction for the plurality of source entities based on the prediction for each source entity, the trained consolidation learning model having been trained using consolidation training data including the historical data and the recommendation data, wherein the consolidated prediction identifies a target entity that maximizes a joint probability of the plurality of source entities.
In a tenth aspect, the present disclosure provides the system according to the ninth aspect, wherein the method further includes: fusing a new source entity into the knowledge graph by linking the new source entity to one or more of the plurality of attribute entities to produce a fused knowledge graph, updating the step of determining a prediction for each source entity and the new source entity using the fused knowledge graph, and updating the step of determining a consolidated prediction for the plurality of source entities and the new source entity.
In an eleventh aspect, the present disclosure provides the system according to the ninth or tenth aspect, wherein one or more of the plurality of source entities are linked to one or more other source entities and/or one or more target entities.
In a twelfth aspect, the present disclosure provides the system according to any of the ninth through eleventh aspects, wherein the consolidated prediction identifies multiple target entities in a ranked order, and wherein the method further includes applying source entity constraints of one or more of the plurality of source entities to the ranked order to create a filtered ranked order of the identified multiple target entities.
In an thirteenth aspect, the present disclosure provides the system according to any of the ninth through twelfth aspects, wherein the recommendation data for each prediction includes a prediction explanation, and the consolidated prediction includes a consolidated prediction explanation.
In a fourteenth aspect, the present disclosure provides the system according to any of the ninth through thirteenth aspects, wherein the prediction for each source entity includes a weight, and wherein the determining a consolidated prediction is further based on the weights of the predictions for each source entity.
In a fifteenth aspect, the present invention provides a tangible, non-transitory computer-readable medium having instructions thereon which, upon being executed by one or more processors, alone or in combination, provide for execution of the method according to any of the first through eighth aspects.
The domain alignment component (input:
The prediction engine component (input:
When a new source entity is added to the knowledge graph by the graph fusion module, the prediction is updated to learn a vector representation for this new source entity. Given a new source entity, the graph fusion module establishes links between the new source entity and existing entities in the graph in the form of triples. These triples are then used to update the prediction engine, such that only the vector of the new source entity is modified during the updates and the other vectors are left untouched. This is to ensure that the new source entity vector fits into the knowledge graph presentation and is only possible due to the definition of how source entities are added to the knowledge graph via the graph fusion module. The prediction engine component can be a knowledge base learner (KBlrn) and the explainer can be an explainable AI (XAI) engine that uses gradient rollback.
Once the prediction engine component has produced an output with an explanation (see
Taking the information collected so far into account (e.g., graph fusion data, recommendation data, preference data (see
In addition, constraints (e.g., if the target destination is at the ocean, then it should not be winter) are fed to the model in the form of rules (any soft or hard constraints, e.g., of the source entities), and are passed along the graph as training data. These constraints are applied as a post-processing step to the output list from the machine learning model.
The output of the consolidation engine component is passed to the next third party system to process the consolidated recommendations and related explanations for the entities of interest (e.g., the source entities).
The present embodiments can be practically applied to improve the technical systems of a smart city for green resource control as follows:
The present embodiments can be practically applied to improve the technical systems of a smart city for maintenance control as follows:
The present embodiments can be practically applied to improve the technical systems of a marketplace (e.g., a supermarket) for dynamic advertisements as follows:
The present embodiments can be practically applied to improve the technical systems of a travel recommendation system for providing group suggestions as follows:
The present embodiments can be practically applied to improve the technical systems of a digital government system for project execution for positive environmental, social and corporate governance (ESG) as follows:
Embodiments may provide the following advantages and improvements over existing technology:
1. Provision of a graph fusion module that combines three subgroups (source, target, and attribute entities) into a single knowledge graph representation to derive, first, individual recommendations (which may be rated) and, subsequently, a consolidated recommendation. This enables, given a new source entity, to update the prediction engine (such that only the vector of the new source entity are modified), to be able to predict for this source entity by learning about the source entity from its connection to attribute, target and other source entities (supplied from the graph fusion module) while holding on to the historic data.
2. Provision of a mechanism that, given the attribute and target entities associated with each source entity of interest and the personal recommendation (from the prediction engine component), links source entities with target entities, and a mechanism that assigns these triples a weight (for the recommendation and personal interests) that indicates the personal importance of the link.
3. Enabling to combine entities, their preferences and personal opinions with domain related knowledge, and personal recommendations into a unified representation to derive a consolidated recommendation for all entities.
An embodiment provides a method for providing a joint recommendation comprising:
1 Providing that the following information is set:
Existing recommender systems typically use matrix factorization where entities are modeled only as source (often called user) and target (often called item) entities. In this setup, preferences and other concepts cannot be taken directly into account. Faisal M. Almutairi, Nicholas D. Sidiropoulos, and Bo Yang, “XPL-CF: Explainable Embeddings for Feature-based Collaborative Filtering,” In Proceedings of the 30th ACM International Conference on Information & Knowledge Management (CIKM '21), Association for Computing Machinery, New York, N.Y., USA, pp. 2847-2851 (2021), which is hereby incorporated by reference herein, use such a system and make it explainable. In this kind of system, no additional relations can be presented, nor can other entities be presented, such as the attribute entities. Further, explanations can only be other users and items. Therefore, such a system is technically less flexible, less expressive and provides less insights for users compared to the present embodiments. Additionally, an existing recommendation system such as this cannot explicitly model different relations between users, which also is less expressive. It is also not possible using the existing recommendation system to provide a recommendation for a group of users.
There are also other existing recommendation systems that do take other relations and entities into account and can only supply a recommendation for an individual (see, e.g., Lawrence, C., Sztyler, T., and Niepert, M., “Explaining Neural Matrix Factorization with Gradient Rollback,” Proceedings of the AAAI Conference on Artificial Intelligence, 35(6), pp. 4987-4995 (2021), and Pezeshkpour, P., Tian, Y., and Singh, S., “Investigating Robustness and Interpretability of Link Prediction via Adversarial Modifications,” Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), pp. 3336-3347 (2019), each of which is incorporated by reference herein). In contrast to embodiments herein, these recommendation systems cannot provide a joint prediction for a group of entities and consolidate the predictions into one suitable joint decision while considering individual preference strengths. These recommendation systems also do not explicitly model the graph as a set of three entity types. In addition, there is no mechanism to obtain preferences and take a preference weighting into account.
In contrast to these existing recommendation systems, embodiments herein advantageously provide the following technical improvements:
A recent survey by Tran, T., Felfernig, A. and Tintarev, N., “Humanized recommender systems: State-of-the-art and research issues,” ACM Transactions on Interactive Intelligent Systems (TiiS), 11(2), pp. 1-41 (2021), which is hereby incorporated by reference herein, highlights that psychological factors such as personality, emotions and social connections can significantly affect the outcome of a decision process. Further, the survey describes that the problem of decision bias in group recommender systems is a common problem as, in most cases, users do not have a clear picture of their preferences in mind before starting a decision process.
Embodiments herein are able to overcome these issues by providing a way to reflect personalities and emotions through attribute entities, where social connections are described by edges in a knowledge graph. In addition, by the system providing as a first step personal recommendations to each source entity, the decision bias problem is mitigated.
An embodiment herein can be applied to any technical system where there are individual predictions for a series of entities that would like to arrive at one joint prediction, to boost existing technology, or to improve activities on ESG or material science. A user interface is provided for interacting with the system and providing the input data, and receiving and viewing the output data (e.g., recommendations and explanations).
It is particular advantageous that, according to an embodiment, not only is the consolidated recommendation shown, but also the individual predictions and an explanation. If only the consolidated recommendation would be provided, then a user would not understand why the recommendation was made, therefore lowering the user's trust by not being able to see how individual predictions are combined into a consolidated prediction.
In contrast to an approach in which source entities are represented as one entity, embodiments herein provide: (1) a better recommendation because source entities' preferences can be taken into account; and (2) a more efficient prediction system that can take advantage of the connection(s) between source entities, leading to an overall improved prediction. Embodiments herein also provide improvements by being able to adapt the prediction engine to a new source entity and by collecting weights for the individual connections to the source entities in order to adapt and take preferences into account and thereby return an improved prediction result.
While subject matter of the present disclosure has been illustrated and described in detail in the drawings and foregoing description, such illustration and description are to be considered illustrative or exemplary and not restrictive. Any statement made herein characterizing the invention is also to be considered illustrative or exemplary and not restrictive as the invention is defined by the claims. It will be understood that changes and modifications may be made, by those of ordinary skill in the art, within the scope of the invention, which may include any combination of features from different embodiments described above.
The present application claims priority to U.S. Provisional Patent Application No. 63/309,016, filed Feb. 11, 2022, which is hereby incorporated by reference in its entirety herein.
Number | Date | Country | |
---|---|---|---|
63309016 | Feb 2022 | US |