The present disclosure relates to the field of computer technologies, and in particular, to a method for content recommendation, device, electronic equipment, and storage medium.
With the rapid development of Internet technology, users can access pictures, videos, music and other content provided by the Internet through mobile phones, laptops and other electronic equipment.
When users browse the content provided by the Internet, appropriate content is recommended to the corresponding target users based on the user behavior data accumulated previously.
According to a first aspect of the embodiments of the present disclosure, there is provided a method for content recommendation, including: obtaining a plurality of content cover images corresponding to a plurality of contents accessed by a user account; obtaining a cover image feature of the plurality of content cover images, and determining a user account feature of the user account based on the cover image feature of the plurality of content cover images; determining an access probability value of the user account accessing a content to be recommended based on the cover image feature of the content to be recommended and the user account feature; providing the content to be recommended to the user account based on the access probability value.
According to a second aspect of the embodiments of the present disclosure, there is provided a method for video recommendation, including: obtaining a plurality of video cover images corresponding to a plurality of videos accessed by a user account; obtaining a cover image feature of the plurality of video cover images, and determining a user account feature of the user account based on the cover image feature of the plurality of video cover images; determining an access probability value of the user account accessing a video to be recommended based on the cover image feature of the video to be recommended and the user account feature; and providing the video to be recommended to the user account based on the access probability value.
According to a third aspect of the embodiments of the present disclosure, there is provided a device for content recommendation, including: a first obtaining unit, configured to obtain a plurality of content cover images corresponding to the plurality of contents accessed by a user account; a first feature determination unit, configured to obtain a cover image feature of the plurality of content cover images and determining a user account feature of the user account based on the cover image feature of the plurality of content cover images; a first probability value obtaining unit, configured to determine an access probability value of the user account accessing a content to be recommended based on the cover image feature of the content to be recommended and the user account feature; and a first recommendation unit, configured to perform providing the content to be recommended to the user account based on the access probability value.
According to a fourth aspect of the embodiments of the present disclosure, there is provided a device for video recommendation, including: a second obtaining unit, configured to obtain a plurality of video cover images corresponding to a plurality of videos accessed by a user account; a second feature determination unit, configured to obtain a cover image feature of the plurality of video cover images and determining a user account feature of the user account based on the cover image feature of the plurality of video cover images; a second probability value obtaining unit, configured to determining an access probability value of the user account accessing a video to be recommended based on the cover image feature of the video to be recommended and the user account feature; and a second recommendation unit, configured to perform providing the video to be recommended to the user account based on the access probability value.
According to a fifth aspect of the embodiments of the present disclosure, there is provided a server, including a memory and a processor. The memory stores a computer program. When the processor executes the computer program, it implements the method for content recommendation described in the first aspect or any possible implementation of the first aspect, and the method for video recommendation described in the second aspect.
According to a sixth aspect of the embodiments of the present disclosure, there is provided a storage medium on which a computer program is stored. When the computer program is executed by a processor, it implements the method for content recommendation described in the first aspect or any possible implementation of the first aspect, and the method for video recommendation described in the second aspect.
According to a seventh aspect of the embodiments of the present disclosure, there is provided a computer program product including a computer program. The computer program is stored in a readable storage medium, and at least one processor of an equipment reads and executes the computer program from the readable storage medium, so that the equipment executes the method for content recommendation described in the first aspect or any possible implementation of the first aspect, and the method for video recommendation described in the second aspect.
Through the technical solution provided by the embodiments of the present disclosure, the user account feature used to represent the personalized preferences of the user account may be encode accurately based on the image features of the content cover images corresponding to a plurality of contents accessed by the user account, and then the access probability of the user account accessing the content to be recommended may be determined based on the user account features and the cover image features of the content to be recommended. Thus, it can be accurately determined that the content to be recommended is provided to the user account without using the user behavior data accumulated by the content to be recommended.
In order to make those skilled in the art better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.
It should be noted that the terms “first”, “second” and the like in the description and claims of the present disclosure and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used may be interchanged under appropriate circumstances such that the embodiments of the disclosure described herein can be implemented in sequences other than those illustrated or described herein. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. On the contrary, they are only examples of devices and methods consistent with some aspects of the present disclosure as detailed in the appended claims.
The relevant information of user accounts (including social relationship identity information and the like) described in the embodiments of the disclosure are obtained under user permission. On the premise of obtaining user permission and authorization, the method, apparatus, device and storage medium involved in the disclosure can obtain the relevant information of users.
The method for content recommendation provided by the present disclosure can be applied to the application environment shown in
In S210, content cover images corresponding to a plurality of contents accessed by the user account are obtained.
In some embodiments, the contents may refer to multimedia contents used to be provided to the user account, such as short videos, videos, news, articles (e.g. official account tweets), multimedia advertisements, e-books, songs and albums, etc.
In practical applications, the plurality of contents accessed by the user account may also be named as historically accessed contents.
In some embodiments, the content cover image may refer to a cover image corresponding to the content, such as short video cover image, article cover image, etc.
In a specific implementation, the recommendation server obtains content cover images corresponding to the plurality of contents accessed by the user account. For example, the recommendation server obtains short video cover images corresponding to X short videos clicked by a certain user account.
In some embodiments, X is a positive integer greater than 0.
In practical applications, the selection of the length of user's click history X has a great influence on the results. If X is small, the user's click list (content click list) has a large randomness and the user's preferences may not be captured; if X is large and the click list is too long, it will increase the difficulty of training. In practical applications, X may be equal to 20, that is, the content cover images corresponding to the contents of the last 20 clicks in the user's click history may be used, which can effectively express the user's personalized preference.
In S220, cover image features of the plurality of content cover images are obtained and user account feature of the user account is determined according to the cover image features of the plurality of content cover images.
In some embodiments, the cover image feature may refer to an image feature corresponding to the content cover image.
In some embodiments, the user account feature may refer to a corresponding feature used to represent the user account.
In specific implementation, after the recommendation server obtains content cover images corresponding to a plurality of contents, the recommendation server can perform feature extraction on each content cover image through a pre-trained image feature extraction network, and then obtain the cover image feature corresponding to each content cover image. In practical applications, the image feature extraction network may be a CNN visual feature extraction network.
In some embodiments, the CNN visual feature extraction network can adopt convolutional neural networks such as resnet50, VGG, inception, etc., and the last full connection layer is replaced with a dimension reduction layer, et., the full connection of 1024 dimensions to 32 dimensions. The CNN network adopts imagenet for pre-training, otherwise the network is difficult to converge.
Then, the recommendation server determines the user account feature of the user account according to the cover image features of the plurality of content cover images. Specifically, the recommendation server may input the cover image feature of each content cover image into a pre-trained user coding network, and perform fusion processing on the cover image feature of each content cover image through the pre-trained user coding network, and generate a user account feature corresponding to the user account. In practical applications, the recommendation server may adopt a user vector representation (e.g. user_embed) output by the pre-trained user coding network to represent the user account feature corresponding to the user account.
In S230, an access probability value of the user account accessing a content to be recommended is determined, based on the cover image feature of the content to be recommended and the user account feature.
In practical applications, the access probability value may also be named as content click-through rate (ctr).
In some embodiments, the cover image feature of the content to be recommended may refer to an image feature corresponding to the content cover image of the content to be recommended. In practical applications, the specific process of obtaining the image feature corresponding to the content cover image of the content to be recommended is similar to the specific process of obtaining the cover image features of the plurality of content cover images above, and will not be repeated here.
In specific implementation, after the recommendation server obtains the cover image feature of the content to be recommended and the user account feature corresponding to the user account, the recommendation server may input the cover image feature of the content to be recommended and user account feature into a pre-trained click-through rate prediction network; the cover image features of the content to be recommended and the user account feature corresponding to the user account are processed through the pre-trained click-through rate prediction network, and then the output result of the pre-trained click-through rate prediction network is taken as the access probability value of the user account accessing the content to be recommended.
In practical applications, the above pre-trained click-through rate prediction network may also be named as a CTR prediction network.
In some embodiments, the CTR prediction network may adopt a three-layer DNN (Deep-learning Neural Network) network, and train the loss by adopting a cross entropy loss function (SoftMax cross entropy loss). In order to train the network better, an optimal configuration scheme is selected by performing tuning tests in terms of Log sampling method, data enhancement, activation function, loss function, attention method, etc.
In S240, the content to be recommended is provided to the user account according to the access probability value.
In a specific implementation, after obtaining the access probability value of the user account accessing the content to be recommended, the recommendation server may provide the content to be recommended to the user account based on the access probability value. Specifically, the recommendation server may determine whether the access probability value is greater than a preset threshold or not. When the access probability value is greater than the preset threshold, the recommendation server provides the content to be recommended to the user account.
When there is a plurality of contents to be recommended, the recommendation server may sort each of the contents to be recommended based on the access probability values corresponding to each of the contents to be recommended, determine the target content from each of the contents to be recommended after being sorted, and provide the target content to the user account.
In the above method for content recommendation, the user account feature of the user account is determined according to the cover image features of a plurality of content cover images by acquiring content cover images corresponding to a plurality of contents accessed by the user account and acquiring the cover image features of the plurality of content cover images. The access probability value of the user account accessing the content to be recommended is determined based on the cover image feature of the content to be recommended and the user account feature. The content to be recommended is provided to the user account according to the access probability value. Thus, it may be realized to encode the user account features used to represent the personalized preferences of the user account based on the image features of the content cover images corresponding to the plurality of contents accessing by the user account. Then, the access probability of the account to access the content to be recommended may be determined based on the user account features and the cover image features of the content to be recommended, such that it may be accurately determined to provide the content to be recommended to the user account without using the user behavior data accumulated by the content to be recommended.
In an embodiment, determining the user account feature of the user account based on the cover image features of the plurality of content cover images includes: obtaining a fused feature by performing fusion processing on the cover image features of the plurality of content cover images, and determining the user account feature of the user account as the fused feature.
In a specific implementation, in the process that the recommendation server determines the user account features of the user account according to the cover image features of the plurality of content cover images, the following may be specifically included: the recommendation server may input the cover image feature of each of the content cover images into a pre-trained user coding network, and perform a series of fusion processing on the cover image feature of each of the content cover images through the pre-trained user coding network, and generate the user account feature corresponding to the user account. In practical applications, the recommendation server may adopt the user vector representation user_embed output by the pre-trained user coding network to represent the user account feature corresponding to the user account.
In the technical solution of the embodiments in the present disclosure, the fused feature obtained by fusing the cover image features of the plurality of content cover images, is taken as the user account feature of the user account, such that the cover image feature corresponding to each of the content cover images may be fully considered, so as to obtain the user account feature used to characterize the user account accurately.
In an embodiment, obtaining the fused feature by performing fusion processing on the cover image feature of the plurality of content cover images includes: obtaining a weight value corresponding to the cover image feature of each of the content cover images; then, performing weighted processing on the cover image feature of each of the content cover images based on the weight value; and obtaining the fused feature by performing fusion processing on the cover image feature of each of the content cover images. In the process of obtaining a weight value corresponding to the cover image feature of each of the content cover images, the following are included: obtaining a feature similarity between the cover image feature of each of the content cover images and the cover image feature of the content to be recommended respectively; obtaining the weight value corresponding to the cover image feature of each of the content cover images based on each of the feature similarities.
In a specific implementation, in the process that the recommendation server performs fusion processing on cover image features of the plurality of content cover images to obtain the fused features, it may specifically include: the recommendation server may first obtain weight values corresponding to the cover image features of each of the content cover images. Specifically, the recommendation server may input the cover image feature of each of the content cover images into a pre-trained attention layer, obtain the feature similarity between the cover image feature of each of the content cover images and the cover image feature of the content to be recommended through the attention layer, and obtain the weight value corresponding to the cover image feature of each of the content cover images based on each feature similarity.
Then, the recommendation server performs weighted processing on the cover image features of each of the content cover images based on the weight value, and performs fusion processing on the weighted cover image features of each of the content cover images to obtain the fused features, which are taken as the user account feature of the user account. In this way, the user account feature may represent the user's personalized preference accurately.
In the technical solution of this embodiment, in the process of obtaining the fused feature by performing fusion processing on the cover image feature of the plurality of content cover images, the feature similarity between the cover image feature of each of the content cover images and the cover image feature of the content to be recommended is obtained respectively, the weight value corresponding to the cover image feature of each of the content cover images is determined based on each feature similarity, and the cover image feature of each of the content cover images is weighted based on the weight value, such that the impact of noisy cover images in click history of the user account on the user coding results may be reduced effectively.
In an embodiment, said obtaining the cover image feature of the plurality of content cover images includes: obtaining initial cover image feature of the plurality of content cover images by performing feature extraction processing on the plurality of content cover images; determining a content similarity between first content and second content based on the initial cover image features of the plurality of content cover images; adjusting the initial cover image feature of the first content and the initial cover image feature of the second content, such that the feature similarity obtained after adjustment between the cover image feature of the first content and the cover image feature of the second content which is obtained after adjusting is matched with the content similarity.
Where, the first content and the second content are any two contents in the plurality of contents.
In a specific implementation, in the process that the recommendation server obtains the cover image features of the plurality of content cover images, it specifically includes: the recommendation server performs feature extraction processing on the plurality of content cover images to obtain initial cover image features of the plurality of content cover images, and determines the content similarity between the first content and the second content based on initial cover image features of the plurality of content cover images.
Specifically, in a user's click list (content click list), there may be a plurality of points of interest, and there may also be clicks that deviate from interest due to random/accidental events. Learning the correlation between the content cover images in the click list and suppressing the outlier noise data can help to judge the importance of the photo in the current click list, which is very important for feature learning.
In practical applications, the recommendation server can regard the user's click list as a sequence analogy, and regard the feature of each image as a word. In this way, it plays an important role in understanding the meaning of the sequence to learn the dependencies between each word and know which words are important. For example, users like three types of videos, such as movies, basketball, and football, and Multi-Head Attention can obtain this relationship from the click list by learning.
In the field of NLP (Natural Language Processing), there are many methods for processing sequence, similar to the Multi-Head Attention model, which can capture long-distance dependencies better. Specifically, the recommendation server can adopt the manner of self attention and Q=K=V=F (where Q, K, and V are input elements of the Multi-Head Attention model) for a vector F composed of X (click list length) cover image features stack, and calculate the relationship between the two contents, that is, the content similarity between the first content and the second content.
Then, the initial cover image feature of the first content and the initial cover image feature of the second content are adjusted, so that the feature similarity obtained after adjustment between the cover image feature of the first content and the cover image feature of the second content is matched with the content similarity, and the cover image features corresponding to the two historically accessed contents with similar contents are also similar.
In the technical solution of this embodiment, the initial cover image feature of the plurality of content cover images is obtained by performing feature extraction processing on the plurality of content cover images; the content similarity between first content and second content is determined based on the initial cover image features of the plurality of content cover images; finally, the initial cover image feature of the first content and the initial cover image feature of the second content is adjusted, such that the feature similarity obtained after adjustment between the cover image feature of the first content and the cover image feature of the second content is matched with the content similarity. In this way, it is possible to extract the plurality of points of interest in the user's content click list accurately, and identify clicks that deviate from interest due to randomness/accident accurately, and suppress the outlier noise data, and improve the accuracy with which the user account feature expresses the personalized preference of the user account.
In an embodiment, providing the content to be recommended to the user account according to the access probability value includes: sorting the plurality of contents to be recommended according to the access probability value; selecting N contents to be recommended from the sorted plurality of contents to be recommended as target contents, where, the minimum value of the access probability value of the target contents selected is greater than the maximum value of the access probability value of the target contents unselected; and providing the target contents to the user account.
Among them, N is a positive integer greater than or equal to 1.
In the specific implementation, when there is a plurality of contents to be recommended, in the process of providing the content to be recommended to the user account based on the access probability value, the recommendation server may sort the plurality of contents to be recommended according to the access probability value to obtain each of the sorted contents to be recommended. Then the recommendation server may select N contents to be recommended from each of the sorted contents to be recommended as target contents, where, the minimum value of the access probability value of the target contents selected is greater than the maximum value of the access probability value of the target contents unselected. Specifically, the recommendation server may sort the contents to be recommended in descending order of the access probability values corresponding to each of the contents to be recommended; then, the recommendation server selects the top N contents from each of the sorted contents to be recommended as the target contents.
Finally, the recommendation server may provide the target contents to the user account. Specifically, the recommendation server may send the content information of the target contents to the user terminal of the user account, so as to provide the target contents to the user account.
In the technical solution of this embodiment, when there is a plurality of contents to be recommended, N contents to be recommended from each of the sorted contents to be recommended are selected as target contents, provided to the user account, where the minimum value of the access probability value of the target contents selected is greater than the maximum value of the access probability value of the target contents unselected. In this way, the target content that is more in line with the personalized preference of the user account may be selected from the plurality of contents to be recommended, and then the target content is provided to the user account to improve the personalized recommendation accuracy of the content recommendation system.
In an embodiment, said providing the content to be recommended to the user account based on the access probability value includes: obtaining a content recommendation threshold corresponding to the user account; and providing the content to be recommended to the user account, in response to determining that the access probability value is greater than the content recommendation threshold.
In the specific implementation, in the process that the recommendation server provides the content to be recommended to the user account based on the access probability value, it specifically includes that: the recommendation server may obtain the content recommendation threshold corresponding to the user account; then, the recommendation server may determine whether the access probability value is greater than the content recommendation threshold. when the recommendation server determines that the access probability value of the user account accessing the content to be recommended is greater than the content recommendation threshold corresponding to the user account, the recommendation server provides the content to be recommended to the user account. Specifically, the recommendation server may send the content information of the content to be recommended to the user terminal of the user account, so as to provide the content to be recommended to the user account. When the recommendation server determines that the access probability value of the user account accessing the content to be recommended is less than or equal to the content recommendation threshold corresponding to the user account, the recommendation server does not provide the content to be recommended to the user account.
For example, when the content recommendation threshold corresponding to user account A is 0.7, and the access probability value of user account A accessing the content B to be recommended is 0.75, then the access probability value is greater than the content recommendation threshold, and the recommendation server may send the content information (such as access links, etc.) of the content B to be recommended to the user terminal of the user account A.
In the technical solution of this embodiment, when the access probability value is greater than the content recommendation threshold, the content to be recommended is provided to the user account, so that the content that meets the personalized preferences of the user account is provided to the user account, and the personalized recommendation accuracy of the content recommendation system is improved.
The above method for video recommendation can accurately encode the user account features used to represent the personalized preferences of the user account based on the image features of the video cover images corresponding to the plurality of videos accessed by the user account, and then can determine the access probability of the user account accessing the video to be recommended based on the account feature of the user account and the cover image feature of the video to be recommended, so that it may be accurately determined that the video to be recommended are recommended to the user account without using the user behavior data accumulated in the video to be recommended.
It should be understood that although the steps in the flowcharts of
a first obtaining unit 710, configured to perform obtaining a plurality of content cover images corresponding to a plurality of contents accessed by a user account;
a first feature determination unit 720, configured to perform obtaining a plurality of cover image feature of the plurality of content cover images and determining a user account feature of the user account based on the cover image features of the plurality of content cover images;
a first probability value obtaining unit 730, configured to determine an access probability value of the user account accessing a content to be recommended based on the cover image features of the content to be recommended and the user account feature; and
a first recommending unit 740, configured to perform providing the content to be recommended to the user account based on the access probability value.
In an embodiment, the first feature determining unit 720 is specifically configured to perform obtaining a fused feature by performing fusion processing on the cover image feature of the plurality of content cover images; and determining the user account feature of the user account as the fused feature.
In an embodiment, the first feature determination unit 720 is specifically configured to perform obtaining a weight value corresponding to the cover image feature of each of the content cover images; performing weighted processing on the cover image feature of each of the content cover images based on the weight value; and obtaining the fused feature by performing fusion processing on the cover image feature of each of the content cover images.
In an embodiment, the first feature determining unit 720 is specifically configured to execute the method to perform obtaining a feature similarity between the cover image feature of each of the content cover images and the cover image feature of the content to be recommended respectively; and obtaining the weight value corresponding to the cover image feature of each of the content cover images based on each feature similarity.
In an embodiment, the first obtaining unit 710 is specifically configured to perform obtaining initial cover image feature of the plurality of content cover images by performing feature extraction processing on the plurality of content cover images; determining a content similarity between first content and second content based on the initial cover image features of the plurality of content cover images, where the first content and the second content are any two contents in the plurality of contents; and adjusting the initial cover image feature of the first content and the initial cover image feature of the second content, such that the feature similarity obtained after adjustment between the cover image feature of the first content and the cover image feature of the second content is matched with the content similarity.
In an embodiment, there are a plurality of contents to be recommended, and the first recommending unit 740 is specifically configured to perform: sorting the plurality of contents to be recommended according to the access probability value; selecting N contents to be recommended from the sorted plurality of contents to be recommended as target contents, where, the minimum value of the access probability value of the target contents selected is greater than the maximum value of the access probability value of the target contents unselected, and N is a positive integer greater than or equal to 1; and providing the target contents to the user account.
In an embodiment, the first recommending unit 740 is specifically configured to perform obtaining a content recommendation threshold corresponding to the user account; and providing the content to be recommended to the user account, in response to determining that the access probability value is greater than the content recommendation threshold.
a second obtaining unit 810, configured to perform obtaining a plurality of video cover images corresponding to a plurality of videos accessed by a user account;
a second feature determination unit 820, configured to perform obtaining cover image features of the plurality of video cover images and determining a user account feature of the user account based on the cover image features of the plurality of video cover images;
a second probability value obtaining unit 830, configured to determining an access probability value of the user account accessing a video to be recommended based on the cover image features of the video to be recommended and the user account feature;
a second recommendation unit 840, configured to perform providing the video to be recommended to the user account based on the access probability value.
Regarding the apparatus in the above-mentioned embodiment, the specific manner in which each module performs operations has been described in detail in the embodiment of the method, and will not be described in detail here.
Equipment 900 may also include a power component 924 configured to perform power management of equipment 900, a wired or wireless network interface 926 configured to connect equipment 900 to a network, and an input output (I/O) interface 928. Equipment 900 may operate an operating system stored in memory 922, such as Window9erver™, Mac O9 X™, Unix™, Linux™, FreeB9D™ or the like.
In an embodiment, there is also provided a storage medium including instructions, such as a memory 922 including instructions, which are executable by the processor of the device 900 to execute the above method. The storage medium may be a non-transitory computer-readable storage medium such as ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.
All embodiments of the present disclosure can be implemented alone or in combination with other embodiments, which are regarded as the scope of protection required by the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
202011206827.X | Nov 2020 | CN | national |
The present application claims priority to International Application No. PCT/CN2021/117034, filed on Sep. 7, 2021, which is based on and claims the benefit of priority to Chinese Application No. 202011206827.X, filed on Nov. 3, 2020, the disclosures of which are herein incorporated by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2021/117034 | Sep 2021 | US |
Child | 17890421 | US |