INFORMATION RECOMMENDATION METHOD, DEVICE AND STORAGE MEDIUM

Abstract
The embodiments of the present disclosure disclose an information recommendation method, device and storage medium. The method includes: determining, in a case where a user behavior is detected, an object to which the user behavior is directed as an object to be processed; determining similar objects of the object to be processed based on object similarity relationships; and recommending the similar objects; wherein the object similarity relationships are established by: acquiring labels of a plurality of sample objects; clustering the labels to obtain a plurality of label categories; for each of the sample objects, calculating similarities between a label of the sample object and the plurality of label categories to obtain a similarity set corresponding to the sample object; and establishing, according to the similarity set corresponding to each sample object, a similarity relationship between the sample object and any other one sample object of the plurality of sample objects.
Description
CROSS-REFERENCE TO RELATED APPLICATION(S)

This application claims priority to the Chinese Patent Application No. 201911319036.5, filed on Dec. 19, 2019, which is incorporated herein by reference in its entirety.


TECHNICAL FIELD

The present disclosure relates to the field of data processing technology, and more particularly, to an information recommendation method, device and storage medium.


BACKGROUND

With the development of science and technology, people are exposed to more and more data, and identifying the data of interest from such data requires considerable effort. For example, when Internet users purchase items on the Internet, they need to browse and compare various items; as another example, when a user reads articles on the Internet, he/she may only be able to select an article of potential interest based on the title of the article; as a further example, when a user listens to music on the Internet, he/she may only be able to select music of potential interest based on the name of the music.


Currently, in some schemes, information may be recommended to users, but in most of such recommendation schemes the recommendation information is selected randomly, which results in poor recommendation accuracy.


SUMMARY

In a first aspect of the embodiments of the present disclosure, there is provided an information recommendation method, comprising:


determining, in a case where a user behavior is detected, an object to which the user behavior is directed as an object to be processed;


determining similar objects of the object to be processed based on object similarity relationships; and


recommending the similar objects;


wherein the object similarity relationships are established by:


acquiring labels of a plurality of sample objects;


clustering the labels to obtain a plurality of label categories;


for each of the sample objects, calculating similarities between a label of the sample object and the plurality of label categories to obtain a similarity set corresponding to the sample object; and


establishing, according to the similarity set corresponding to each sample object, a similarity relationship between the sample object and any other one sample object of the plurality of sample objects.


In an embodiment, the labels are word vectors; and acquiring labels of a plurality of sample objects comprises: acquiring text data of the plurality of sample objects; performing word segmentation processing on the text data to obtain a plurality of words; and


mapping each of the words to a word vector space to obtain a word vector.


In an embodiment, performing word segmentation processing on the text data to obtain a plurality of words comprises:


determining, based on a pre-generated prefix dictionary, candidate words in the text data, and generating a directed acyclic graph composed of the candidate words;


calculating a probability of each path in the directed acyclic graph based on occurrence frequencies of prefix words in the prefix dictionary; and


determining, based on the probability of each path, the plurality of words obtained by performing word segmentation processing.


In an embodiment, mapping each of the words to a word vector space to obtain a word vector comprises:


inputting each word into a semantic analysis model, to obtain a word vector carrying semantic information output by the semantic analysis model.


In an embodiment, clustering the labels to obtain a plurality of label categories comprises:


traversing each label to determine whether there is a node in a clustering feature tree having a distance from the label less than a preset distance threshold, if so, determining that the label belongs to the node, and if not, establishing a new node in the clustering feature tree based on the label;


traversing each node in the clustering feature tree to determine whether a number of labels contained in the node is greater than a preset number threshold, and if so, dividing the node into two nodes; and


for each node, classifying labels contained in the node into a label category.


In an embodiment, calculating similarities between a label of the sample object and the plurality of label categories comprises:


for each label category, calculating a distance between each label of the sample object and a centroid of the label category as a similarity between the sample object and the label category.


In a second aspect of the embodiments of the present disclosure, there is provided an information recommendation method, comprising:


determining, in a case where a behavior of a first user is detected, an object which is preferred by the first user based on a relationship of preferences of users for objects; and


recommending the object which is preferred by the first user,


wherein the relationship of preferences of users for objects is established by:


acquiring labels of behavior objects corresponding to a plurality of sample users respectively;


clustering the labels to obtain a plurality of label categories;


for each of the sample users, performing statistics on a preference of the sample user for each label category according to a label of a behavior object corresponding to the sample user, and establishing a relationship of the preference of the sample user for the behavior object according to the preference and the acquired label of the behavior object.


In an embodiment, the labels are word vectors; and


acquiring labels of behavior objects corresponding to a plurality of sample users respectively comprises:


acquiring text data of the behavior objects corresponding to the plurality of sample users respectively;


performing word segmentation processing on the text data to obtain a plurality of words; and


mapping each of the words to a word vector space to obtain a word vector.


In an embodiment, performing word segmentation processing on the text data to obtain a plurality of words comprises:


determining, based on a pre-generated prefix dictionary, candidate words in the text data, and generating a directed acyclic graph composed of the candidate words;


calculating a probability of each path in the directed acyclic graph based on occurrence frequencies of prefix words in the prefix dictionary; and


determining, based on the probability of each path, the plurality of words obtained by performing word segmentation processing.


In an embodiment, mapping each of the words to a word vector space to obtain a word vector comprises:


inputting each word into a semantic analysis model, to obtain a word vector carrying semantic information output by the semantic analysis model.


In an embodiment, clustering the labels to obtain a plurality of label categories comprises:


traversing each label to determine whether there is a node in a clustering feature tree having a distance from the label less than a preset distance threshold, if so, determining that the label belongs to the node, and if not, establishing a new node in the clustering feature tree based on the label;


traversing each node in the clustering feature tree to determine whether a number of labels contained in the node is greater than a preset number threshold, and if so, dividing the node into two nodes; and


for each node, classifying labels contained in the node into a label category.


In an embodiment, acquiring labels of behavior objects corresponding to a plurality of sample users respectively comprises:


acquiring user behavior data comprising a correspondence relationship between identifications of the sample users, identifications of the behavior objects, and the labels of the behavior objects; and


performing statistics on a preference of the sample user for each label category according to a label of a behavior object corresponding to the sample user comprises:


classifying the label of the behavior object corresponding to the sample user into a label category to which the label belongs; and


for each label category, counting a number of times the label of the behavior object corresponding to the sample user is classified into the label category; and determining a relationship of the preference of the sample user for the label category according to the number of times.


In an embodiment, the user behavior data comprises a correspondence relationship between the identifications of the sample users, the identifications of the behavior objects, behavior types, and the labels of the behavior objects;


counting a number of times the label of the behavior object corresponding to the sample user is classified into the label category comprises:


counting a number of times a label of a behavior object corresponding to each behavior type of the sample user is classified into the label category, and


determining a relationship of the preference of the sample user for the label category according to the number of times comprises:


weighting the number of times according to a weight corresponding to the behavior type; and


determining the relationship of the preference of the sample user for the label category according to the weighted number of times.


In a third aspect of the embodiments of the present disclosure, there is provided an electronic device comprising a memory and a processor, wherein the memory has stored thereon computer instructions which, when executed by the processor, cause the processor to perform the method described above.


In a fourth aspect of the embodiments of the present disclosure, there is provided a non-transitory computer-readable storage medium having stored thereon computer instructions which, when executed by a computer, cause the computer to perform the method described above.





BRIEF DESCRIPTION OF THE ACCOMPANYING DRAWINGS

In order to more clearly explain the technical solutions according to the embodiments of the present disclosure, the accompanying drawings which need to be used in the description of the embodiments will be described in brief below. Obviously, the accompanying drawings in the following description are only some embodiments of the present disclosure. Other accompanying drawings may be obtained by those of ordinary skill in the art based on these accompanying drawings without any creative work.



FIG. 1 is a schematic flowchart of an information recommendation method according to an embodiment of the present disclosure;



FIG. 2 is a schematic flowchart of establishing object similarity relationships according to an embodiment of the present disclosure;



FIG. 3 is a schematic flowchart of an information recommendation method according to an embodiment of the present disclosure;



FIG. 4 is a schematic flowchart of establishing a relationship of preferences of users for objects according to an embodiment of the present disclosure; and



FIG. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.





DETAILED DESCRIPTION

In order to make the purposes, technical solutions and advantages of the present disclosure more clear, the present disclosure will be described in detail below in conjunction with specific embodiments with reference to the accompanying drawings.


It should be illustrated that unless otherwise defined, the technical terms or scientific terms used in the embodiments of the present disclosure should have the usual meanings understood by those skilled in the art to which the present disclosure belongs. The terms “first”, “second” and similar words used in the present disclosure do not indicate any order, quantity or importance, but are only used to distinguish different components. Similar words such as “comprise” or “include” mean that an element or item appearing before the word covers the elements or items listed after the word and their equivalents, but does not exclude other elements or items. “Connected with” or “connected to” and similar words are not limited to physical or mechanical connections, but may include electrical connections, whether direct or indirect. “Up”, “down”, “left”, “right”, etc. are only used to indicate a relative position relationship, and after an absolute position of an object described changes, the relative position relationship may also change accordingly.


With the embodiments of the present disclosure, object similarity relationships are firstly established by: acquiring labels of a plurality of sample objects; clustering the labels to obtain a plurality of label categories; for each of the sample objects, calculating similarities between a label of the sample object and the plurality of label categories to obtain a similarity set corresponding to the sample object; and establishing, according to the similarity set corresponding to each sample object, a similarity relationship between the sample object and any other one sample object of the plurality of sample objects. Then, in a case where a user behavior is detected, an object to which the user behavior is directed is determined as an object to be processed; similar objects of the object to be processed are determined based on the established object similarity relationships; and the similar objects are recommended. Thus, in this solution, in a case where the user behavior is detected, similar objects of the object to which the behavior is directed are recommended to the user, wherein the object to which the user behavior is directed may be understood as an object in which the user is interested. Compared with randomly recommending objects, this solution recommends to the user the similar objects of the object in which the user is interested, which improves the accuracy of the recommendation.


The embodiments of the present disclosure provide an information recommendation method, device, and storage medium. The method may be applied to various electronic devices such as mobile phones, computers etc., which is not specifically limited. The information recommendation method will be described in detail below.



FIG. 1 is a schematic flowchart of an information recommendation method according to an embodiment of the present disclosure, comprising the following steps.


In S101, in a case where a user behavior is detected, an object to which the user behavior is directed is determined as an object to be processed.


For example, the user behavior may comprise giving a like, making comments, sharing, purchasing, collecting, etc., which is not specifically limited. The object to which the user behavior is directed may be an article, an image, an item, etc., which is not specifically limited. By taking a social website which comprises information such as articles, images etc. as an example, if a user behavior of giving a like to an article is detected, the article may be determined as an object to be processed. By taking a shopping website which comprises information about various items etc. as an example, if a user behavior of purchasing an item is detected, the item may be determined as an object to be processed.


In S102, similar objects of the object to be processed are determined based on object similarity relationships.


In an embodiment of the present disclosure, a process of establishing the object similarity relationships may be as shown in FIG. 2 and comprises the following steps.


In S201, labels of a plurality of sample objects are acquired.


In order to distinguish the description, the objects involved in the process of establishing the object similarity relationships are referred to as sample objects. For example, a label of an object may be some words which describe properties of the object. For example, if the object is an article, the label may be literature, science, entertainment, etc. As another example, if the object is an image, the label may be a landscape, a person, etc. As a further example, if the object is a product, the label may be women's clothing, skirts, etc.


In an implementation, the label may be a word vector, and S201 may comprise: acquiring text data of the plurality of sample objects; performing word segmentation processing on the text data to obtain a plurality of words; and mapping each of the words to a word vector space to obtain a word vector.


For example, a corpus may be acquired, wherein the corpus comprises text data of the plurality of sample objects. In an implementation, the text data may firstly be cleaned to, for example, filter out meaningless text data; de-duplicate repeated text data; split text data containing special separators based on the special separators; perform text conversion, etc., and a specific cleaning process is not limited. The cleaning process is an optional step.
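
As a minimal sketch of such a cleaning step (the filtering rule, the separator, and the function name below are illustrative assumptions, not part of the disclosure), consider:

```python
import re

def clean_corpus(texts, separator="|", min_length=2):
    """Clean raw text data: split entries on a special separator, normalize
    whitespace, filter out meaningless (too short) entries, and de-duplicate."""
    cleaned, seen = [], set()
    for text in texts:
        pieces = text.split(separator) if separator in text else [text]
        for piece in pieces:
            piece = re.sub(r"\s+", " ", piece).strip()    # normalize whitespace
            if len(piece) < min_length or piece in seen:  # filter / de-duplicate
                continue
            seen.add(piece)
            cleaned.append(piece)
    return cleaned

print(clean_corpus(["  Landscape photo | Nature scenery ", "Landscape photo", ""]))
```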


Then, word segmentation processing may be performed on the cleaned text data. There are many word segmentation manners, for example, a word segmentation manner based on string matching, a word segmentation manner based on statistics, etc., which will not be specifically limited.


In an implementation, the word segmentation manner may comprise: determining, based on a pre-generated prefix dictionary, candidate words in the text data, and generating a Directed Acyclic Graph (DAG) composed of the candidate words; calculating a probability of each path in the directed acyclic graph based on occurrence frequencies of prefix words in the prefix dictionary; and determining, based on the probability of each path, the words obtained by performing word segmentation processing.


Generally, the DAG is generated based on a prefix dictionary. Each path in the DAG corresponds to a segmentation form of text data. A path comprises a plurality of words (candidate words), which are obtained by segmenting the text data according to a segmentation form. For each path, a probability of the path is calculated according to occurrence probabilities of respective candidate words of which the path is composed in the prefix dictionary. A dynamic programming algorithm may be used to calculate the probability of the path in a reverse direction from right to left. Words contained in a path having the highest probability may be determined as words obtained by performing word segmentation.
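
The following is a simplified sketch of this DAG-plus-dynamic-programming segmentation, assuming a toy prefix dictionary of word frequencies (a full dictionary, such as the one shipped with a segmenter like jieba, would be used in practice):

```python
import math

# Toy prefix dictionary: word -> occurrence frequency (an assumption for the sketch)
FREQ = {"北": 200, "京": 150, "北京": 5000, "大学": 4000, "大": 300, "学": 250}
TOTAL = sum(FREQ.values())

def build_dag(sentence):
    """For each start index, list the end indices of dictionary words beginning there."""
    dag = {}
    n = len(sentence)
    for i in range(n):
        ends = [j for j in range(i + 1, n + 1) if sentence[i:j] in FREQ]
        dag[i] = ends or [i + 1]  # fall back to a single character
    return dag

def segment(sentence):
    """Pick the highest-probability path through the DAG, computed right to left."""
    dag = build_dag(sentence)
    n = len(sentence)
    route = {n: (0.0, 0)}
    for i in range(n - 1, -1, -1):
        route[i] = max(
            (math.log(FREQ.get(sentence[i:j], 1)) - math.log(TOTAL) + route[j][0], j)
            for j in dag[i]
        )
    words, i = [], 0
    while i < n:
        j = route[i][1]
        words.append(sentence[i:j])
        i = j
    return words

print(segment("北京大学"))  # -> ['北京', '大学']
```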


In an implementation, each word obtained by performing word segmentation may be input to a semantic analysis model to obtain a word vector carrying semantic information output by the semantic analysis model.


For example, the semantic analysis model may be a Bidirectional Encoder Representations from Transformers (Bert) model. The Bert model is a word vector model. A basic integrated unit of the Bert model is an encoder of a Transformer, and the Bert model has a large number of encoder layers, a large feedforward neural network, and a plurality of attention heads. The Bert model may perform word embedding encoding on words. Strings are input into the Bert model, and the input data is passed and calculated between layers of the Bert model. Each layer may use a self attention mechanism and pass processing results through a feedforward neural network to a next encoder. An output of the Bert model is a vector having the same size as that of a hidden layer, i.e., a word vector which carries semantic information.
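
As an illustrative sketch only, word vectors may be obtained from a pretrained BERT checkpoint via the Hugging Face transformers library (the checkpoint name and the averaging over sub-tokens are assumptions of this sketch, not requirements of the disclosure):

```python
import torch
from transformers import BertTokenizer, BertModel

# Assumption for the sketch: a pretrained Chinese BERT checkpoint.
tokenizer = BertTokenizer.from_pretrained("bert-base-chinese")
model = BertModel.from_pretrained("bert-base-chinese")
model.eval()

def word_to_vector(word):
    """Map a word to a vector carrying semantic information by averaging
    BERT's last hidden states over the word's sub-tokens."""
    inputs = tokenizer(word, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # last_hidden_state: (batch, seq_len, hidden_size); drop [CLS]/[SEP] and average.
    hidden = outputs.last_hidden_state[0, 1:-1]
    return hidden.mean(dim=0)  # a hidden_size-dimensional word vector

print(word_to_vector("文学").shape)  # torch.Size([768]) for a base-size model
```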


Alternatively, the semantic analysis model may also be a word to vector (word2vec) model, or may also be another model, which is not specifically limited.


In this implementation, semantic analysis is performed on the words obtained by performing word segmentation through the semantic analysis model to obtain a word vector carrying semantic information, and subsequent recommendations may be performed based on the semantic information, which improves the accuracy of recommendation.


In S202, the labels are clustered to obtain a plurality of label categories.


For example, various clustering algorithms may be used to cluster the labels obtained in S201.


In an implementation, S202 may comprise: traversing each label to determine whether there is a node in a clustering feature tree having a distance from the label less than a preset distance threshold, if so, determining that the label belongs to the node, and if not, establishing a new node in the clustering feature tree based on the label; traversing each node in the clustering feature tree to determine whether a number of labels contained in the node is greater than a preset number threshold, and if so, dividing the node into two nodes; and for each node in the clustering feature tree, classifying labels contained in the node into a label category.


In this implementation, all labels obtained in S201 are traversed, and each time a label is read, a node to which the label belongs is selected according to the preset distance threshold; if there is no node whose distance from the label is less than the preset distance threshold, a new node is created, and the label belongs to the newly created node.


In this implementation, the clustering process may be understood as a process of establishing a clustering feature tree. When a first label is read, the first label may be used as a root node. When a second label is read, it is determined whether a distance between the second label and the root node is less than a preset distance threshold, if so, it is determined that the second label belongs to the root node, and if not, a new root node is created based on the second label. A case where subsequent labels are read is similar, which will not be repeated.


If a number of labels contained in a certain root node is greater than a preset number threshold, the root node is split into two leaf nodes; for example, labels that are far apart may be split into different leaf nodes. If a number of labels contained in a certain leaf node is greater than the preset number threshold, the leaf node continues to be split into two leaf nodes, again, for example, by splitting labels that are far apart into different leaf nodes.


In this way, in the clustering feature tree which is finally formed, the labels contained in each node belong to one label category.
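
This tree-based procedure is close in spirit to BIRCH clustering; a minimal sketch using scikit-learn's Birch follows (the threshold, the branching factor, and the random stand-in data are assumptions for illustration):

```python
import numpy as np
from sklearn.cluster import Birch

# label_vectors: one row per label (word vector); random data stands in for real labels.
rng = np.random.default_rng(0)
label_vectors = rng.normal(size=(200, 16))

# threshold        ~ the preset distance threshold for absorbing a label into a node
# branching_factor ~ the preset number threshold that triggers splitting a node
birch = Birch(threshold=0.5, branching_factor=50, n_clusters=None)
category_ids = birch.fit_predict(label_vectors)

print("number of label categories:", len(set(category_ids)))
print("centroids shape:", birch.subcluster_centers_.shape)
```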


There may be many distinct labels, and clustering them means that labels in the same label category have a high degree of association, while labels in different label categories have a low degree of association. Subsequently, calculating a similarity between objects based on label categories, rather than on individual labels, may improve calculation efficiency.


In S203, for each of the sample objects, similarities between a label of the sample object and the plurality of label categories are calculated to obtain a similarity set corresponding to the sample object.


In an implementation, calculating a similarity between a label of the sample object and the plurality of label categories may comprise: for each label category, calculating a distance between each label of the sample object and a centroid of the label category as a similarity between the sample object and the label category.


For example, if the labels of a sample object P comprise I_1, I_2, . . . , I_n, and the label categories obtained by clustering in S202 comprise C_1, C_2, . . . , C_m, a distance between a label I_i and a label category C_j may be defined as:


$$
d_{I_i, C_j} =
\begin{cases}
0, & I_i \in C_j \\
\Omega_{I_i, c_j}, & I_i \notin C_j
\end{cases}
$$


wherein c_j represents a centroid of the label category C_j, and Ω_{I_i, c_j} represents a distance (for example, a Euclidean distance) between I_i and the centroid c_j. A specific type of the distance is not limited; it may be a Euclidean distance, a Mahalanobis distance, a cosine distance, etc.


The distance between the sample object P and the label category C_j may be defined as:


$$
d_{P, C_j} = \min_{k = 1, \ldots, n} d_{I_k, C_j}
$$


The distance may represent a similarity, and the smaller the distance, the greater the similarity.


By calculating the distance between each sample object and each label category, an m-dimensional object-label category-distance vector may be constructed for each sample object. For example, the m-dimensional vector corresponding to the sample object P is <d_{P,C_1}, d_{P,C_2}, . . . , d_{P,C_m}>, wherein m is a positive integer greater than 1, and this m-dimensional vector may be understood as the similarity set corresponding to the sample object P.
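
A minimal sketch of the computations in S203 follows (the toy label vectors, the representation of a category as its member labels plus a centroid, and the function names are assumptions for illustration):

```python
import numpy as np

def label_to_category_distance(label_vec, category_labels, centroid):
    """d(I_i, C_j): 0 if the label already belongs to the category,
    otherwise the Euclidean distance to the category centroid."""
    if any(np.array_equal(label_vec, member) for member in category_labels):
        return 0.0
    return float(np.linalg.norm(label_vec - centroid))

def object_similarity_set(object_labels, categories):
    """Build the m-dimensional vector <d(P,C_1), ..., d(P,C_m)>, where
    d(P,C_j) = min_k d(I_k, C_j) over the object's labels."""
    sims = []
    for members, centroid in categories:
        sims.append(min(
            label_to_category_distance(l, members, centroid) for l in object_labels
        ))
    return np.array(sims)

# Toy example: two categories, each given as (member label vectors, centroid).
c1_members = [np.array([0.0, 0.0]), np.array([0.2, 0.1])]
c2_members = [np.array([5.0, 5.0])]
categories = [(c1_members, np.mean(c1_members, axis=0)),
              (c2_members, np.mean(c2_members, axis=0))]

labels_of_P = [np.array([0.2, 0.1]), np.array([3.0, 3.0])]
print(object_similarity_set(labels_of_P, categories))
```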


In S204, according to the similarity set corresponding to each sample object, a similarity relationship between the sample object and any other one sample object of the plurality of sample objects is established.


Still taking the above example, for any two sample objects P1 and P2, if the m-dimensional vector corresponding to the sample object P1 is <d_{P1,C_1}, d_{P1,C_2}, . . . , d_{P1,C_m}>, and the m-dimensional vector corresponding to the sample object P2 is <d_{P2,C_1}, d_{P2,C_2}, . . . , d_{P2,C_m}>, a similarity relationship Ω_{P1,P2} between P1 and P2 may be established through the two m-dimensional vectors. The similarity relationship is the distance between the two m-dimensional vectors, which may be, for example, a Euclidean distance, a Mahalanobis distance, a cosine distance, etc., and is not specifically limited.
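
For illustration only, the pairwise relationship may be computed as a cosine distance between the two similarity sets (cosine is one of the options mentioned above, not a requirement):

```python
import numpy as np

def object_similarity(sim_set_p1, sim_set_p2):
    """Omega_{P1,P2}: distance between the two m-dimensional similarity sets.
    Here a cosine distance; a smaller value means more similar objects."""
    cos = np.dot(sim_set_p1, sim_set_p2) / (
        np.linalg.norm(sim_set_p1) * np.linalg.norm(sim_set_p2))
    return 1.0 - cos

print(object_similarity(np.array([0.1, 4.2, 2.0]), np.array([0.2, 4.0, 1.8])))
```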


The similarity relationships are established through S204, so that the similar objects of the object to be processed may be determined.


In S103, the similar objects are recommended.


In one case, the similar objects of the object to be processed may be sorted in an order of similarity from high to low, and top K similar objects may be recommended to the user, wherein a specific value of K is not limited.


Alternatively, in another case, a similarity threshold may also be set, and similar objects having similarity greater than the threshold are recommended to the user.
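
Both selection strategies can be sketched as follows (the candidate list and the parameter values are illustrative; as described above, a smaller stored distance means a higher similarity):

```python
def recommend(candidates, k=5, max_distance=None):
    """candidates: list of (object_id, distance) pairs, where a smaller
    distance means a more similar object. Returns recommended object ids."""
    ranked = sorted(candidates, key=lambda pair: pair[1])  # most similar first
    if max_distance is not None:
        ranked = [(obj, d) for obj, d in ranked if d < max_distance]
    return [obj for obj, _ in ranked[:k]]

candidates = [("article_7", 0.12), ("article_3", 0.40), ("article_9", 0.05)]
print(recommend(candidates, k=2))               # top-K selection
print(recommend(candidates, max_distance=0.2))  # threshold-based selection
```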


For example, if a user gives a like to an article in a social website while browsing the social website, the embodiments of the present disclosure may be applied to recommend to the user other articles having a high similarity to that article. As another example, if a user collects an item in a shopping website while browsing the shopping website, the embodiments of the present disclosure may be applied to recommend to the user other items having a high similarity to that item. In this way, potential preferences of the user may be mined, which improves user activity and stickiness.


With the embodiments of the present disclosure, in a first aspect, in a case where a user behavior is detected, similar objects of an object to which the behavior is directed are recommended to a user, wherein the object to which the user behavior is directed may be understood as an object in which the user is interested. Compared with randomly recommending objects, this solution recommends the similar objects of the object in which the user is interested to the user, which improves the accuracy of the recommendation. In a second aspect, in an implementation, semantic analysis is performed on words obtained by performing word segmentation through a semantic analysis model to obtain a word vector carrying semantic information, and recommendations are performed based on the semantic information, which improves the accuracy of recommendation. In a third aspect, the labels of the objects are clustered and a similarity between the objects is calculated based on the label categories, which may improve the calculation efficiency.


Another information recommendation method will be described in detail below. FIG. 3 is a schematic flowchart of the information recommendation method according to an embodiment of the present disclosure, comprising the following steps.


In S301, in a case where a behavior of a first user is detected, an object which is preferred by the first user is determined based on a relationship of preferences of users for objects.


For example, the user's behavior may comprise giving a like, making comments, sharing, purchasing, collecting, etc., which is not specifically limited.


In the embodiments of the present disclosure, a process of establishing the relationship of preferences of users for objects may be shown in FIG. 4, comprising the following steps.


In S401, labels of behavior objects corresponding to a plurality of sample users are acquired.


In order to distinguish the description, users involved in the process of establishing the relationship of preferences of users for objects are referred to as sample users, and a user to which the recommendation process is directed is referred to as a first user.


As described above, the user behavior may comprise giving a like, making comments, sharing, purchasing, collecting, etc., which is not specifically limited. The behavior object of the user may be an article, an image, an item, etc., which is not specifically limited. For example, a label of an object may be some words which describe properties of the object. For example, if the object is an article, the label may be literature, science, entertainment, etc. As another example, if the object is an image, the label may be a landscape, a person, etc. As a further example, if the object is a product, the label may be women's clothing, skirts, etc.


In an implementation, S401 may comprise: acquiring user behavior data comprising a correspondence relationship between identifications of the sample users, identifications of the behavior objects, and the labels of the behavior objects.


For example, the correspondence relationship between the identifications of the sample users, the identifications of the behavior objects, and text data of the behavior objects may be acquired, and then processing such as segmentation etc. may be performed on the text data, to obtain labels of the behavior objects. In this way, the correspondence relationship between the identifications of the sample users, the identifications of the behavior objects, and the labels of the behavior objects is obtained.


In one case, the label may be a word vector, and in this case, text data of behavior objects corresponding to the plurality of sample users may be acquired; word segmentation processing is performed on the text data to obtain a plurality of words; and each of the words is mapped to a word vector space to obtain a word vector.


For example, a corpus may be acquired, wherein the corpus comprises text data of behavior objects corresponding to the plurality of sample users. For example, each piece of data in the corpus may comprise the identifications of the sample users, the identifications of the behavior objects, and text data of the behavior objects; or in some cases, each piece of data may further comprise information such as behavior types (for example, giving a like, making comments, etc.). For example, one piece of data may comprise an action of a user U1 giving a like to an article A1 and text data of the article A1. As another example, another piece of data may comprise a user U2 purchasing an item O and text data of the item O.


In an implementation, the text data may firstly be cleaned to, for example, filter out meaningless text data; de-duplicate repeated text data; split text data containing special separators based on the special separators; perform text conversion, etc., and a specific cleaning process is not limited. The cleaning process is an optional step.


Then, word segmentation processing may be performed on the cleaned text data. There are many word segmentation manners, for example, a word segmentation manner based on string matching, a word segmentation manner based on statistics, etc., which will not be specifically limited.


In an implementation, the word segmentation manner may comprise: determining, based on a pre-generated prefix dictionary, candidate words in the text data, and generating a Directed Acyclic Graph (DAG) composed of the candidate words; calculating a probability of each path in the directed acyclic graph based on occurrence frequencies of prefix words in the prefix dictionary; and determining, based on the probability of each path, the words obtained by performing word segmentation processing.


Generally, the DAG is generated based on a prefix dictionary. Each path in the DAG corresponds to a segmentation form of text data. A path comprises a plurality of words (candidate words), which are obtained by segmenting the text data according to a segmentation form. For each path, a probability of the path is calculated according to occurrence probabilities of respective candidate words of which the path is composed in the prefix dictionary. A dynamic programming algorithm may be used to calculate the probability of the path in a reverse direction from right to left. Words contained in a path having the highest probability may be determined as words obtained by performing word segmentation.


In an implementation, each word obtained by performing word segmentation may be input to a semantic analysis model to obtain a word vector carrying semantic information output by the semantic analysis model.


For example, the semantic analysis model may be a Bidirectional Encoder Representations from Transformers (Bert) model. The Bert model is a word vector model. A basic integrated unit of the Bert model is an encoder of a Transformer, and the Bert model has a large number of encoder layers, a large feedforward neural network, and a plurality of attention heads. The Bert model may perform word embedding encoding on words. Strings are input into the Bert model, and the input data is passed and calculated between layers of the Bert model. Each layer may use a self attention mechanism and pass processing results through a feedforward neural network to a next encoder. An output of the Bert model is a vector having the same size as that of a hidden layer, i.e., a word vector which carries semantic information.


Alternatively, the semantic analysis model may also be a word to vector (word2vec) model, or may also be another model, which is not specifically limited.


In this implementation, semantic analysis is performed on the words obtained by performing word segmentation through the semantic analysis model to obtain a word vector carrying semantic information, and subsequent recommendations may be performed based on the semantic information, which improves the accuracy of recommendation.


In S402, the labels are clustered to obtain a plurality of label categories.


For example, various clustering algorithms may be used to cluster the labels obtained in S401.


In an implementation, S402 may comprise: traversing each label to determine whether there is a node in a clustering feature tree having a distance from the label less than a preset distance threshold, if so, determining that the label belongs to the node, and if not, establishing a new node in the clustering feature tree based on the label; traversing each node in the clustering feature tree to determine whether a number of labels contained in the node is greater than a preset number threshold, and if so, dividing the node into two nodes; and for each node in the clustering feature tree, classifying labels contained in the node into a label category.


In this implementation, all labels obtained in S401 are traversed, and each time a label is read, a node to which the label belongs is selected according to the preset distance threshold; if there is no node whose distance from the label is less than the preset distance threshold, a new node is created, and the label belongs to the newly created node.


In this implementation, the clustering process may be understood as a process of establishing a clustering feature tree. When a first label is read, the first label may be used as a root node. When a second label is read, it is determined whether a distance between the second label and the root node is less than a preset distance threshold, if so, it is determined that the second label belongs to the root node, and if not, a new root node is created based on the second label. A case where subsequent labels are read is similar, which will not be repeated.


If a number of labels contained in a certain root node is greater than a preset number threshold, the root node is split into two leaf nodes; for example, labels that are far apart may be split into different leaf nodes. If a number of labels contained in a certain leaf node is greater than the preset number threshold, the leaf node continues to be split into two leaf nodes, again, for example, by splitting labels that are far apart into different leaf nodes.


In this way, in the clustering feature tree which is finally formed, the labels contained in each node belong to one label category.


There may be many distinct labels, and clustering them means that labels in the same label category have a high degree of association, while labels in different label categories have a low degree of association. Subsequently, calculating preferences of users for objects based on label categories, rather than on individual labels, may improve calculation efficiency.


In S403, for each sample user, statistics are performed on a preference of the sample user for each label category according to a label of a behavior object corresponding to the sample user; and a relationship of the preference of the sample user for the behavior object is established according to the preference and the acquired label of the behavior object.


Still taking the above example, a corpus may be acquired, and each piece of data in the corpus may comprise the identifications of the sample users, the identifications of the behavior objects, and text data of the behavior objects. Word segmentation processing is performed on the text data, and the words obtained by performing word segmentation processing are mapped to word vectors, which serve as the labels.


In an implementation, performing statistics on a preference of the sample user for each label category according to a label of a behavior object corresponding to the sample user may comprise: classifying the label of the behavior object corresponding to the sample user into a label category to which the label belongs; and for each label category, counting a number of times the label of the behavior object corresponding to the sample user is classified into the label category; and determining a relationship of the preference of the sample user for the label category according to the number of times.


It is assumed that a user U1 has a behavior on an object P1, the labels of P1 comprise l1 and l2, a label category to which l1 belongs is C1, and a label category to which l2 belongs is C2; the user U1 has a behavior on an object P2, the labels of P2 comprise l1 and l3, and a label category to which l3 belongs is C3; and the user U1 has a behavior on an object P3, the labels of P3 comprise l1 and l4, and a label category to which l4 belongs is C4. For the label category C1, a number of times a label of a behavior object corresponding to the user U1 is classified into the label category is 3; for the label category C2, the number of times a label is classified into the label category is 1; for the label category C3, the number of times a label is classified into the label category is 1; and for the label category C4, the number of times a label is classified into the label category is 1. The higher the number of times, the higher the preference of the user for the label category.


For example, an m-dimensional user-label category-preference vector may be constructed as <f_{U,C_1}, f_{U,C_2}, . . . , f_{U,C_m}>, wherein U represents a user, C_1, C_2, . . . , C_m each represent a label category, f_{U,C_1} represents the user U's preference for the label category C_1, f_{U,C_2} represents the user U's preference for the label category C_2, and so on, which will not be repeated, wherein m is a positive integer greater than 1.


According to the m-dimensional vector and the labels of each object, a relationship of preferences of the user for objects may be established as

$$
f_{U,P} = \sum_{i=1}^{n} f_{U,C_i}, \quad l_i \in C_i,
$$

wherein l_1, l_2, . . . , l_n are the labels of the object P, C_i is the label category to which the label l_i belongs, and f_{U,P} represents the relationship of the preference of the user U for the object P.
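
A minimal sketch of this counting and scoring follows (the label names and the mapping from labels to categories reproduce the worked example above; the function names are illustrative):

```python
from collections import Counter

def user_category_preferences(behavior_records, label_to_category):
    """Count, for one user, how many times labels of the user's behavior
    objects fall into each label category (the preference f_{U,C})."""
    counts = Counter()
    for object_labels in behavior_records:  # one entry per behavior object
        for label in object_labels:
            counts[label_to_category[label]] += 1
    return counts

def object_preference(object_labels, preferences, label_to_category):
    """f_{U,P}: the sum of the user's preferences for the label categories
    to which the labels of the object P belong."""
    return sum(preferences[label_to_category[label]] for label in object_labels)

label_to_category = {"l1": "C1", "l2": "C2", "l3": "C3", "l4": "C4"}
behaviors_of_U1 = [["l1", "l2"], ["l1", "l3"], ["l1", "l4"]]
prefs = user_category_preferences(behaviors_of_U1, label_to_category)
print(prefs)                                                      # C1: 3, C2/C3/C4: 1
print(object_preference(["l1", "l2"], prefs, label_to_category))  # f_{U1,P1} = 3 + 1 = 4
```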


In an implementation, the user behavior data comprises a correspondence relationship between identifications of the sample users, identifications of the behavior objects, behavior types, and the labels of the behavior objects. In this implementation, a number of times a label of a behavior object corresponding to each behavior type of the sample user is classified into the label category is counted, then the number of times may be weighted according to a weight corresponding to the behavior type; and a relationship of the preference of the sample user for the label category is determined according to the weighted number of times.


It is assumed that a user U1 purchases an object P1, the labels of P1 comprise l1 and l2, a label category to which l1 belongs is C1, and a label category to which l2 belongs is C2; the user U1 collects an object P2, the labels of P2 comprise l1 and l3, and a label category to which l3 belongs is C3; and the user U1 purchases an object P3, the labels of P3 comprise l1 and l4, and a label category to which l4 belongs is C4.


For the label category C1, regarding the purchase behavior, a number of times the label of the behavior object corresponding to the user U1 is classified into the label category is 2, and regarding the collection behavior, a number of times the label of the behavior object corresponding to the user U1 is classified into the label category is 1.


For the label category C2, regarding the purchase behavior, a number of times the label of the behavior object corresponding to the user U1 is classified into the label category is 1, and regarding the collection behavior, a number of times the label of the behavior object corresponding to the user U1 is classified into the label category is 0.


For the label category C3, regarding the purchase behavior, a number of times the label of the behavior object corresponding to the user U1 is classified into the label category is 0, and regarding the collection behavior, a number of times the label of the behavior object corresponding to the user U1 is classified into the label category is 1.


For the label category C4, regarding the purchase behavior, a number of times the label of the behavior object corresponding to the user U1 is classified into the label category is 1, and regarding the collection behavior, a number of times the label of the behavior object corresponding to the user U1 is classified into the label category is 0.


Weights corresponding to different behavior types may be set according to practical conditions. It is assumed that a weight corresponding to the purchase behavior is 80%, and a weight corresponding to the collection behavior is 20%. For the label category C1, the weighted number of times = 2*80% + 1*20%; for the label category C2, the weighted number of times = 1*80%; for the label category C3, the weighted number of times = 1*20%; and for the label category C4, the weighted number of times = 1*80%. The larger the weighted number of times, the higher the preference of the user for the label category.
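
A minimal sketch of this weighted counting follows (the behavior-type names, the 80%/20% weights, and the records reproduce the example above and are assumptions of the sketch):

```python
from collections import Counter, defaultdict

# Assumed weights per behavior type, as in the example above.
BEHAVIOR_WEIGHTS = {"purchase": 0.8, "collect": 0.2}

def weighted_category_preferences(behavior_records, label_to_category):
    """behavior_records: list of (behavior_type, object_labels) for one user.
    Returns the weighted number of times labels fall into each label category."""
    per_type = defaultdict(Counter)
    for behavior_type, object_labels in behavior_records:
        for label in object_labels:
            per_type[behavior_type][label_to_category[label]] += 1
    weighted = Counter()
    for behavior_type, counts in per_type.items():
        for category, n in counts.items():
            weighted[category] += BEHAVIOR_WEIGHTS.get(behavior_type, 0.0) * n
    return weighted

label_to_category = {"l1": "C1", "l2": "C2", "l3": "C3", "l4": "C4"}
records = [("purchase", ["l1", "l2"]), ("collect", ["l1", "l3"]), ("purchase", ["l1", "l4"])]
print(weighted_category_preferences(records, label_to_category))
# C1: 2*0.8 + 1*0.2 = 1.8, C2: 0.8, C3: 0.2, C4: 0.8
```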


In this implementation, different weights are assigned to different behavior types, which may more accurately reflect the user's degree of interest.


The relationship of preferences of the users for objects is established through S403, so that an object which is preferred by the first user may be determined.


In S302, the object which is preferred by the first user is recommended.


In one case, the objects may be sorted in an order of preferences from high to low, and top K objects may be recommended to the first user, wherein a specific value of K is not limited.


Alternatively, in another case, a preference threshold may also be set, and objects having a preference greater than the threshold are recommended to the first user.


For example, if a user gives a like to an article in a social website while browsing the social website, the embodiments of the present disclosure may be applied to recommend to the user other articles, images, etc. for which the user has a high preference. As another example, if a user collects an item in a shopping website while browsing the shopping website, the embodiments of the present disclosure may be applied to recommend to the user information about other items for which the user has a high preference. In this way, potential preferences of the user may be mined, which improves user activity and stickiness.


With the embodiments of the present disclosure, in a first aspect, in a case where a user behavior is detected, an object which is preferred by a user is recommended to the user. Compared with randomly recommending objects, this solution recommends the object which is preferred by the user to the user, which improves the accuracy of the recommendation. In a second aspect, in an implementation, semantic analysis is performed on words obtained by performing word segmentation through a semantic analysis model to obtain a word vector carrying semantic information, and recommendations are performed based on the semantic information, which improves the accuracy of recommendation. In a third aspect, the labels of the objects are clustered and preferences of users for the objects are calculated based on the label categories, which may improve the calculation efficiency.


In correspondence to the above method embodiments, the embodiments of the present disclosure further provide an electronic device, as shown in FIG. 5, comprising a memory 502 and a processor 501. The memory has stored thereon a computer program which, when executed by the processor 501, causes the processor 501 to perform any of the above information recommendation methods.


The embodiments of the present disclosure further provide a non-transitory computer-readable storage medium having stored thereon computer instructions which, when executed by a computer, cause the computer to perform any of the above information recommendation methods.


It should be understood by those of ordinary skill in the art that the discussion of any of the above embodiments is only exemplary, and is not intended to imply that the scope of the present disclosure (comprising the claims) is limited to these examples; and under the idea of the present disclosure, technical features in the above embodiments or different embodiments may also be combined, the steps may be implemented in any order, and there are many other changes in different aspects of the present disclosure as described above, which are not provided in the details for the sake of brevity.


In addition, in order to simplify the description and discussion, and in order not to make the present disclosure difficult to understand, the well-known power/ground connections to Integrated Circuit (IC) chips and other components may or may not be shown in the provided accompanying drawings. In addition, the apparatuses may be shown in a form of block diagrams in order to avoid making the present disclosure difficult to understand, and this also takes into account the fact that the details about the implementations of these apparatuses in the block diagrams are highly dependent on a platform on which the present disclosure will be implemented (i.e., these details should fully fall within the understanding of those skilled in the art). In a case where specific details (for example, circuits) are described to describe the exemplary embodiments of the present disclosure, it is obvious to those skilled in the art that the present disclosure may be implemented without these specific details or in a case where these specific details are changed. Therefore, these descriptions should be considered being illustrative rather than being restrictive.


Although the present disclosure has been described in conjunction with specific embodiments of the present disclosure, many substitutions, modifications and variations of these embodiments will be obvious to those of ordinary skill in the art based on the foregoing description. For example, the embodiments discussed may be used for other memory architectures (for example, Dynamic RAM (DRAM)).


The embodiments of the present disclosure are intended to cover all such substitutions, modifications and variations which fall within the broad scope of the appended claims. Therefore, any omissions, modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present disclosure should be included in the protection scope of the present disclosure.

Claims
  • 1. An information recommendation method, comprising: determining, in a case where a user behavior is detected, an object to which the user behavior is directed as an object to be processed;determining similar objects of the object to be processed based on object similarity relationships; andrecommending the similar objects;wherein the object similarity relationships are established by:acquiring labels of a plurality of sample objects;clustering the labels to obtain a plurality of label categories;for each of the sample objects, calculating similarities between a label of the sample object and the plurality of label categories to obtain a similarity set corresponding to the sample object; andestablishing, according to the similarity set corresponding to each sample object, a similarity relationship between the sample object and any other one sample object of the plurality of sample objects.
  • 2. The method according to claim 1, wherein the labels are word vectors; and acquiring labels of a plurality of sample objects comprises:acquiring text data of the plurality of sample objects;performing word segmentation processing on the text data to obtain a plurality of words; andmapping each of the words to a word vector space to obtain a word vector.
  • 3. The method according to claim 2, wherein performing word segmentation processing on the text data to obtain a plurality of words comprises: determining, based on a pre-generated prefix dictionary, candidate words in the text data, and generating a directed acyclic graph composed of the candidate words;calculating a probability of each path in the directed acyclic graph based on occurrence frequencies of prefix words in the prefix dictionary; anddetermining, based on the probability of each path, the plurality of words obtained by performing word segmentation processing.
  • 4. The method according to claim 2, wherein mapping each of the words to a word vector space to obtain a word vector comprises: inputting each word into a semantic analysis model, to obtain a word vector carrying semantic information output by the semantic analysis model.
  • 5. The method according to claim 1, wherein clustering the labels to obtain a plurality of label categories comprises: traversing each label to determine whether there is a node in a clustering feature tree having a distance from the label less than a preset distance threshold, if so, determining that the label belongs to the node, and if not, establishing a new node in the clustering feature tree based on the label;traversing each node in the clustering feature tree to determine whether a number of labels contained in the node is greater than a preset number threshold, and if so, dividing the node into two nodes; andfor each node, classifying labels contained in the node into a label category.
  • 6. The method according to claim 1, wherein calculating similarities between a label of the sample object and the plurality of label categories comprises: for each label category, calculating a distance between each label of the sample object and a centroid of the label category as a similarity between the sample object and the label category.
  • 7. An information recommendation method, comprising: determining, in a case where a behavior of a first user is detected, an object which is preferred by the first user based on a relationship of preferences of users for objects; andrecommending the object which is preferred by the first user, wherein the relationship of preferences of users for objects is established by:acquiring labels of behavior objects corresponding to a plurality of sample users respectively;clustering the labels to obtain a plurality of label categories;for each of the sample users, performing statistics on a preference of the sample user for each label category according to a label of a behavior object corresponding to the sample user, and establishing a relationship of the preference of the sample user for the behavior object according to the preference and the acquired label of the behavior object.
  • 8. The method according to claim 7, wherein the labels are word vectors; and acquiring labels of behavior objects corresponding to a plurality of sample users respectively comprises:acquiring text data of the behavior objects corresponding to the plurality of sample users respectively;performing word segmentation processing on the text data to obtain a plurality of words; andmapping each of the words to a word vector space to obtain a word vector.
  • 9. The method according to claim 8, wherein performing word segmentation processing on the text data to obtain a plurality of words comprises: determining, based on a pre-generated prefix dictionary, candidate words in the text data, and generating a directed acyclic graph composed of the candidate words;calculating a probability of each path in the directed acyclic graph based on occurrence frequencies of prefix words in the prefix dictionary; anddetermining, based on the probability of each path, the plurality of words obtained by performing word segmentation processing.
  • 10. The method according to claim 8, wherein mapping each of the words to a word vector space to obtain a word vector comprises: inputting each word into a semantic analysis model, to obtain a word vector carrying semantic information output by the semantic analysis model.
  • 11. The method according to claim 7, wherein clustering the labels to obtain a plurality of label categories comprises: traversing each label to determine whether there is a node in a clustering feature tree having a distance from the label less than a preset distance threshold, if so, determining that the label belongs to the node, and if not, establishing a new node in the clustering feature tree based on the label;traversing each node in the clustering feature tree to determine whether a number of labels contained in the node is greater than a preset number threshold, and if so, dividing the node into two nodes; andfor each node, classifying labels contained in the node into a label category.
  • 12. The method according to claim 7, wherein acquiring labels of behavior objects corresponding to a plurality of sample users respectively comprises: acquiring user behavior data comprising a correspondence relationship between identifications of the sample users, identifications of the behavior objects, and the labels of the behavior objects; andperforming statistics on a preference of the sample user for each label category according to a label of a behavior object corresponding to the sample user comprises:classifying the label of the behavior object corresponding to the sample user into a label category to which the label belongs; andfor each label category, counting a number of times the label of the behavior object corresponding to the sample user is classified into the label category; anddetermining a relationship of the preference of the sample user for the label category according to the number of times.
  • 13. The method according to claim 12, wherein the user behavior data comprises a correspondence relationship between the identifications of the sample users, the identifications of the behavior objects, behavior types, and the labels of the behavior objects; counting a number of times the label of the behavior object corresponding to the sample user is classified into the label category comprises:counting a number of times a label of a behavior object corresponding to each behavior type of the sample user is classified into the label category, and determining a relationship of preference of the sample user for the label category according to the number of times comprises:weighting the number of times according to a weight corresponding to the behavior type; anddetermining the relationship of the preference of the sample user for the label category according to the weighted number of times.
  • 14. An electronic device comprising a memory and a processor, wherein the memory has stored thereon computer instructions which, when executed by the processor, cause the processor to perform the method according to claim 1.
  • 15. An electronic device comprising a memory and a processor, wherein the memory has stored thereon computer instructions which, when executed by the processor, cause the processor to perform the method according to claim 7.
  • 16. A non-transitory computer-readable storage medium having stored thereon computer instructions which, when executed by a computer, cause the computer to perform the method according to claim 1.
  • 17. A non-transitory computer-readable storage medium having stored thereon computer instructions which, when executed by a computer, cause the computer to perform the method according to claim 7.
Priority Claims (1)
Number Date Country Kind
201911319036.5 Dec 2019 CN national