Content recommendation apparatus and method using tag cloud

Description

CLAIM OF PRIORITY

This application claims priority under 35 U.S.C. §119(a) to Korean Patent Application No. 10-2007-0117987, filed on Nov. 19, 2007, in the Korean Intellectual Property Office, the entire contents of which are hereby incorporated by reference.

FIELD OF THE INVENTION

The invention relates to content recommendation apparatus and methods, and in particular to content recommendation apparatus and methods that (a) compute similarities between users using tag clouds assigned to content and (b) recommend content to a user based on the computed similarities.

BACKGROUND OF THE INVENTION

As the Internet continues to increase in popularity, the amount of available content continues to increase geometrically. In particular, the number of moving image-type UCCs (User Created Content) is increasing explosively. In this environment, users with limited information and limited time are having difficulty finding content that satisfies their interests. Recommendation systems are thus being used more and more to recommend appropriate content to users based on their inclinations and preferences. Known recommendation systems recommend content by determining a user neighborhood having similar inclinations as a target user and utilizing a relationship between the target user and a user of the user neighborhood.

Conventional recommendation technology, however, has the following limitations. First, in the case of explicit data collection, the actual number of contents that are purchased, used, and/or evaluated by users is often small and, thus, the amount of content that can be recommended is limited.

Also, conventional recommendation technologies often select users similar to a target user based on those users who used the same content as the target user. Thus, the range of similar users may be limited and, consequently, the range of recommendable content also may be limited, possibly resulting in the recommendation of undesired content.

Moreover, in cases where the number of contents is larger than the number of users, the number of users who used the same content as the target user is usually small. Thus, finding users similar to a target user is difficult.

Another limitation of conventional recommendation technologies is that similar users are selected on the basis of content used by a target user in the past. Then, only a content related to the subject in which the target user had an interest in the past is recommended.

And because similar users are selected from those who used the same content as a target user, coverage for content beyond that actually used by the similar and target users is very low, for example, only about 10% to 30%.

SUMMARY OF THE INVENTION

An object of the invention is to provide content recommendation apparatus and methods that increase coverage for contents by using tag clouds assigned to the contents. In those cases where the amount of content is large, the invention increases recommendation performance by effectively finding users similar to a target user.

A content recommendation apparatus according to a first aspect of the invention provides a content recommendation service via a network, and includes a content tag cloud generating module configured to generate a content tag cloud by analyzing a tag assigned to each content and accumulating frequencies per tag of each content. Frequencies per tag is the number of times a particular tag was assigned by users to a given content. The content recommendation apparatus also includes a user tag cloud generating module, a similarity computing module, and a recommending module. The user tag cloud generating module is configured to generate a user tag cloud by accumulating frequencies per tag of contents used by a user. The similarity computing module is configured to compute a similarity between users using the user tag cloud, and the recommending module is configured to recommend a content by computing a probability that a target user will use a specific content based on the computed similarity between users.

A content recommendation method according to a second aspect of the invention provides a content recommendation service via a network, and includes (a) generating a content tag cloud by analyzing a tag assigned to each content and accumulating frequencies per tag of each content; (b) generating a user tag cloud by accumulating frequencies per tag of contents used by a user; (c) computing a similarity between users using the user tag cloud; and (d) recommending a content by computing a probability that a target user will use a specific content based on the computed similarity between users.

BRIEF DESCRIPTION OF THE DRAWINGS

The features and advantages of the invention will become more apparent upon consideration of the detailed description taken in conjunction with the accompanying drawings, in which like reference characters refer to like parts throughout, and in which:

FIG. 1 illustrates a network configuration of a content recommendation system according to an embodiment of the invention;

FIG. 2 illustrates an embodiment of the service management unit of FIG. 1;

FIG. 3 illustrates an embodiment of the content recommendation unit of FIG. 1;

FIG. 4 is a flow chart illustrating a process of constructing a content tag cloud according to an embodiment of the invention;

FIG. 5 is a flow chart illustrating a process of constructing a user tag cloud according to an embodiment of the invention; and

FIG. 6 is a flow chart illustrating a process for recommending content according to an embodiment of the invention.

DETAILED DESCRIPTION OF THE INVENTION

FIG. 1 shows a network configuration of a content recommendation system in accordance with the invention. The content recommendation system includes a plurality of user terminals 110 coupled to a content recommendation server 200 via a network 130. Network 130 is capable of sending and receiving content and may be, for example, the Internet, a mobile communication network, or a wired or wireless network. User terminals 110 preferably have a wired or wireless Internet browser, and can be connected to the content recommendation server 200 using the browser. User terminals 110 are capable of receiving content provided by the content recommendation server 200, and may be, for example, a desktop PC (personal computer), a notebook PC, a PDA (personal digital assistant), or a mobile communication terminal. Content recommendation server 200 recommends content to users at user terminals 110. Content provided by content recommendation server 200 may include a moving image, a still image, a sound source (for example, an MP3 file), or an advertisement.

The content recommendation server 200 provides a Web site including content and a content recommendation service to user terminals 110 connected to server 200 via network 130. As shown in FIG. 1, content recommendation server 200 includes a service management unit 210 and a content recommendation unit 230 in accordance with the invention. The service management unit 210 stores and manages various content to be provided to a plurality of user terminals 110. When a user terminal 110 is connected, service management unit 210 provides a Web site including content to the user terminal 110. The service management unit 210 provides the user terminal 110 with a tag setting interface to enable a user to assign a related word (i.e., a tag) to each content included in the Web site, and stores and manages the tag assigned to each content. A set of tags assigned to each content is stored and managed as a tag cloud of each content. A user can assign, for example, a related word or words as a tag to a specific music video, such as, “sexy,” “wonder girls,” or “dance.” The service management unit 210 then stores and manages “sexy,” “wonder girls” and “dance” as a tag cloud of that music video.

The service management unit 210 also provides content to a plurality of user terminals 110, and generates, stores, and manages a tag cloud for each user. Where any one user uses a plurality of contents, the service management unit 210 synthesizes tag clouds of contents used by the user, generates a tag cloud for the corresponding user, and stores and manages the tag cloud. In generation of a tag cloud for each user, the service management unit 210 manages a generation frequency of each tag of the tag cloud.

The content recommendation unit 230 selects a user neighborhood having similar inclinations using tag clouds of users that are generated, stored and managed by the service management unit 210. The content recommendation unit 230 also recommends content to users in the similar user neighborhood based on a list of contents used by the users in the user neighborhood. A content recommendation algorithm is described in detail below.

FIG. 2 shows a configuration of the service management unit 210 of FIG. 1 according to an embodiment of the invention. Service management unit 210 includes a content usage list managing module 211, a content tag cloud generating module 213, a user tag cloud generating module 215, a usage list database 212, a content tag cloud database 214, and a user tag cloud database 216.

The content usage list managing module 211 registers in the usage list database 212 a list of contents used by a user among various contents provided to a plurality of the user terminals 110 by the content recommendation server 200. The content usage list managing module 211 also manages the content usage list. When a user terminal 110 connects to the content recommendation server 200 and uses a specific content, the content usage list managing module 211 indicates that usage in a user identification information (for example, cookie information, ID, and so on) of the user terminal 110, and stores the content usage information in the usage list database 212. The usage list database 212 maps an identification information of a content used by a user with each user identification information, and stores the mapping information.

The content tag cloud generating module 213 registers in the content tag cloud database 214 tags assigned to various content provided to a plurality of user terminals 110 by the content recommendation server 200. The content tag cloud generating module 213 manages the tags, provides the user terminals 110 with an interface to enable a user to directly assign a tag to a content, and receives the tag assigned to the content from the user.

The content tag cloud generating module 213 receives a user-defined tag as a tag of a content. Alternatively, the content tag cloud generating module 213 provides a user with a plurality of preset tags, and receives any one tag or a plurality of tags as a tag of a content. The content tag cloud generating module 213 stores in the content tag cloud database 214 frequencies of tags assigned to each content by a user. The content tag cloud database 214 maps an identification information of each content with frequencies of tags assigned to a corresponding content, and stores the mapping information. For example, where a user #1 assigns tags of “Park Ji-Sung” and “Manchester United” to a moving image A, and a user #2 assigns tags of “Football” and “Park Ji-Sung” to the moving image A, the following is constructed in the content tag cloud database 214 as a tag cloud of the moving image A: {Park Ji-Sung2. Manchester United1. Football1}.

The content tag cloud constructed by the content tag cloud generating module 213 is represented by Equation 2 below. Specifically, assuming that

U={u₁, u₂, . . . , u_l} is a neighborhood of users,

V={v₁, v₂, . . . , v_m} is a set of contents,

T={t₁, t₂, . . . , t_n} is a set of tags, and

f_ijkis a variable having a value of ‘1’ when a user u_iassigns a tag t_kto a content v_j, or otherwise having a value of ‘0’,

a tag cloud VTC_jof content v_jis represented by the following Equation 2.

$\begin{matrix} {VTC}_{j} = (\sum_{i = 1}^{l} f_{ij 1}, \sum_{i = 1}^{l} f_{ij 2}, \dots, \sum_{i = 1}^{l} f_{ijn}) & Equation 2 \end{matrix}$

That is, VTC_jis an n-dimensional vector indicating the total frequency of each tag t_kthat users have assigned to a specific content v_j.

Whenever a user uses a content, the user tag cloud generating module 215 accumulates tag clouds assigned to the contents used by the user, and stores the tag clouds in the user tag cloud database 216 as a user tag cloud. The user tag cloud generating module 215 accumulates the frequency of each tag constructed for contents used by a user according to tag, and stores the frequencies of the entire tags in the user tag cloud database 216 as a user tag cloud.

The user tag cloud constructed by the user tag cloud generating module 215 is represented by Equation 3 below. Assuming that a tag cloud of a user u_iis UTC_i, when the user u_isigns up, the tag cloud UTC_i(which is an n-dimensional vector) is initialized to 0. Whenever the user u_iuses a content v_j, the tag cloud UTC_iof the user u_iis updated according to Equation 3.

UTC_i←UTC_i+NVTC_j Equation 3

Here, NVTC_jis obtained by normalizing VTC_j, which is defined in Equation 4 below:

$\begin{matrix} \begin{matrix} {NVTC}_{j} = \frac{1}{\sum_{k = 1}^{n} \sum_{i = 1}^{l} f_{ijk}} {VTC}_{j} \\ = (\frac{\sum_{i = 1}^{l} f_{ij 1}}{\sum_{k = 1}^{n} \sum_{i = 1}^{l} f_{ijk}}, \frac{\sum_{i = 1}^{l} f_{ij 2}}{\sum_{k = 1}^{n} \sum_{i = 1}^{l} f_{ijk}}, \dots, \frac{\sum_{i = 1}^{l} f_{ijn}}{\sum_{k = 1}^{n} \sum_{i = 1}^{l} f_{ijk}}) \end{matrix} & Equation 4 \end{matrix}$

Synthesizing Equations 3 and 4. UTC_iis the distribution of interest of user u_iin the subject of each tag. The use of the content v_jby the user u_imeans that the user u_ihas an interest in the subject of the content represented as NVTC_j. Thus, Equation 3 represents a process in which an interest of a user is updated through a content usage behavior.

For example, assume that {Park Ji-Sung2. Manchester United1, Football1} is constructed as a tag cloud of a moving image A and that a new user #1 uses the moving image A. A tag cloud of user #1 is normalized and constructed as {Park Ji-Sung1/2. Manchester United1/4. Football1/4}. And, in the case where user #1 uses a moving image B having a tag cloud of {Lee Young-Pyo2. Tottenham1, Football1}, the tag cloud of the moving image B {Lee Young-Pyo2. Tottenham1, Football1} is normalized and added to the tag cloud of user #1: {Park Ji-Sung1/2, Manchester United1/4. Football1/4}. Finally, {Park Ji-Sung1/2. Manchester United1/4. Lee Young-Pyo1/2. Tottenham1/4. Football1/2} is constructed as a tag cloud of user #1. Here, a frequency of tags is arranged in a tag cloud of an n-dimensional vector according to a process for generating the entire tags.

FIG. 3 shows a configuration of the content recommendation unit 230 of FIG. 1 according to an embodiment of the invention. Content recommendation unit 230 includes a user similarity computing module 231 and a content recommending module 233. User similarity computing module 231 computes a similarity between each user using tag clouds of the users stored in the user tag cloud database 216. The user similarity computing module 231 computes a tag cloud similarity between two arbitrary users u_iand u_i′based on user tag clouds stored in the user tag cloud database 216. The tag cloud similarity is represented by Equation 5 below.

Assuming that a tag cloud similarity between two arbitrary users u_iand u_i′is sim(u_i, u_i′), a tag cloud of a user u_iis UTC_i=(α₁ⁱ, α₂ⁱ, . . . , α_nⁱ) and a tag cloud of a user u_i′ is UTC_i′=(α₁^i′, α₂^i′, . . . , α_n^i′), sim(u_i, u_i′) can be computed as a cosine value of each angle between two vectors as shown in Equation 5.

$\begin{matrix} \begin{matrix} sim (u_{i}, u_{i^{'}}) = \cos ({UTC}_{i}, {UTC}_{i^{'}}) \\ = \frac{{UTC}_{i} \cdot {UTC}_{i^{'}}}{{ {UTC}_{i} }_{2} \times { {UTC}_{i^{'}} }_{2}} \\ = \frac{\sum_{k = 1}^{n} α_{k}^{i} α_{k}^{i^{'}}}{\sqrt{\sum_{k = 1}^{n} {(α_{k}^{i})}^{2}} \sqrt{\sum_{k = 1}^{n} {(α_{k}^{i^{'}})}^{2}}} \end{matrix} & Equation 5 \end{matrix}$

Here, UTC_i·UTC_i′ is a dot product of two vectors UTC_iand UTC_i′.

The content recommending module 233 computes for all contents a probability that a target user will use a specific content using a computed similarity by the user similarity computing module 231 between the target user and an arbitrary user. The content recommending module 233 recommends a content to the target user in a descending order of probability. At this time, the content recommending module 233 excludes content that the target user had already used. And, in computation of a probability that a target user will use a specific content, when a recommendation list should be changed promptly, the content recommending module 233 updates the probability whenever the target user uses a new content, or otherwise, simultaneously updates probabilities of all users in a type of periodic update processing.

A process in which the content recommending module 233 computes a probability that a target user will use a specific content is represented by Equation 6:

$\begin{matrix} π_{i, j} = k \sum_{u_{i^{'}} \in ⋃}^{} sim (u_{i}, u_{i^{'}}) \times p_{i^{'}, j} & Equation 6 \end{matrix}$

Here, U is a user neighborhood similar to a target user u_i, and p_i′,jis a variable indicating whether or not a user u_iused a specific content v_j. Where user u_iused the specific content v_j-p_i′,jis ‘1’, and where user u_idid not use the specific content v_j-p_i′,jis ‘0’. The normalization constant, ‘k,’ is as follows:

$k = 1 / \sum_{u_{i^{'}} \in ⋃}^{} sim (u_{i}, u_{i^{'}}) .$

The content recommending module 233 generally determines as a user neighborhood U similar to a target user, a neighborhood of users S_iwho have used at least two contents among a set of contents used by the target user u_i. However, where the number of contents is overwhelmingly larger than the number of users, the number of users who have used at least two contents is remarkably small, and thus it is difficult to find users similar to the target user. And where the target user has used just one content, it is impossible to determine a user neighborhood U. Therefore, the content recommending module 233 determines a user neighborhood U as follows:

(1) where |S_i|≧N, U=S_i; and

(2) where |S_i|<N, U is a neighborhood of N users most similar to the target user based on the similarity computed in Equation 5.

Here, N is a constant value determined experimentally to maximize the performance of the recommendation system.

FIG. 4 illustrates a process of constructing a content tag cloud in accordance with the invention. Content recommendation server 200 publishes a plurality of contents stored therein on a web page such as a home page (S401). The contents may be self-created contents and/or UCCs (User Created Contents) and may include moving images, photographs, or broadcasts. When the plurality of contents are published on a home page and so on by the content recommendation server 200, users can connect to the home page of the content recommendation server 200 using user terminals 110 and can access/use the plurality of contents.

When a user uses a specific content, the content recommendation server 200 provides the user terminal 110 with a related interface to enable the user to assign a tag that indicates features of the corresponding content. More particularly, the content recommendation server 200 provides an interface to enable the user to directly define a tag of the content, or offers the user a plurality of predefined tags from which to choose. After the tag is defined or chosen by the user, the content recommendation server 200 receives the tag of the content.

When the user uses specific content and assigns a tag to the content through the interface, the content recommendation server 200 stores the assigned tag and its generation frequency as a tag cloud of the corresponding content. The content recommendation server 200 also updates and stores a content usage list (S403). For example, after a user #1 uses a moving image A and assigns a tag {Park Ji-Sung, Football} to the corresponding moving image A, the content recommendation server 200 stores the tag {Park Ji-Sung, Football} and its generation frequency {1, 1} for the moving image A.

Next, when another user uses the same content and assigns a tag to the content through the interface, the content recommendation server 200 updates the stored tag cloud of the corresponding content and the stored content usage list (S405). That is, the content recommendation server 200 updates a tag of the content, its generation frequency, and the content usage list. For example, assume a tag of a stored tag cloud of a moving image A is {Park Ji-Sung, Football} and its generation frequency is {1, 1}. When a new tag {Park Ji-Sung, Manchester United} is assigned to the moving image A, the tag is updated to {Park Ji-Sung, Football, Manchester United} and its generation frequency is updated to {2, 1, 1}.

The above-mentioned process is repeatedly performed on all contents used by all users to construct tag clouds and content usage lists for those contents. This process is represented by Equation 2 above.

FIG. 5 illustrates a process of constructing a user tag cloud in accordance with the invention. This process begins after tag clouds of all contents provided by the content recommendation server 200 are constructed as described above with respect to FIG. 4. When a new user uses a content provided by the content recommendation server 200, a tag cloud of the corresponding new user is then constructed. As shown in FIG. 5, when a new user connects to the content recommendation server 200, the content recommendation server 200 initializes an n-dimensional (equal to the number of tags generated by the content recommendation server 200) tag cloud corresponding to the new user (S501).

Subsequently, when the new user uses any one of the contents provided by the content recommendation server 200, the content recommendation server 200 updates the tag cloud of the new user by adding a tag cloud of a content used by the new user to the initialized tag cloud of the new user (S503). In some implementations, the content recommendation server 200 normalizes a tag generation frequency of a content and adds the normalized tag generation frequency to a tag generation frequency of the new user. For example, assume a new user uses a moving image A having a tag of {Park Ji-Sung, Football, Manchester United} and a tag generation frequency of {2, 1, 1}. The tag of the new user is {Park Ji-Sung, Football, Manchester United} and the associated tag generation frequency is { 2/4, ¼, ¼}={0.5, 0.25, 0.25}.

When the new user uses another content provided by the content recommendation server 200, the content recommendation server 200 accumulates a tag cloud of that other content on the tag cloud of the new user that was generated at S503 to update the tag cloud of the new user (S505). At this time, the content recommendation server 200 preferably normalizes the tag generation frequency of the corresponding content and accumulates the normalized tag generation frequency on the tag generation frequency of the new user.

The above-described process is performed equally for all users to construct tag clouds for those users. This is represented by Equations 3 and 4 above.

FIG. 6 illustrates a process of recommending content in accordance with the invention. When tag clouds of users are constructed as described above with respect to FIG. 5, the content recommendation server 200 recommends content of interest to any one user (hereinafter referred to as a target user). To do so, content recommendation server 200 first computes a “similarity” between the target user and other users based on user tag clouds (S601). The content recommendation server 200 computes as a similarity cosine values of angles between a tag cloud vector of the target user and tag cloud vectors of other users. The computed similarity is represented above by Equation 5.

The content recommendation server 200 then determines a user neighbor having a similar inclination to the target user (S603). Assuming that a user neighbor is S_i, who has used at least two contents among a set of contents used by the target user, a user neighborhood U similar to the target user is determined as follows:

(1) where |S_i|≧N, U=S_i

(2) where |S_i|<N, U is a neighborhood of N users most similar to the target user based on the computed similarity.

Here, N is a constant determined experimentally to maximize the performance of the recommendation system.

After the user neighbor is determined, the content recommendation server 200 computes a probability for all contents not used in the past by the target user that the target user will use a content based on (a) similarity between the target user and users in the similar user neighborhood and (b) content usage lists of the users of the similar user neighborhood (S605). This is represented by Equation 6 above.

Next, the content recommendation server 200 recommends a content to the target user in a descending order of probability based on the computed probabilities for all contents that the target user will use a content (S607). The content recommendation server 200 may recommend to the target user all contents in order or some contents of higher rank.

Although this embodiment shows that the content recommendation server 200 recommends individual content to the target user, the invention is not limited in this regard. After determining content to be recommended to the target user, the content recommendation server 200 may also recommend to a user a package of items having a large amount of content or a web page having a large amount of content in addition to the determined recommended content. Thus, the content recommendation unit 230 of FIG. 1 may include a package and shop recommending module for recommending to a user a package of items or a web page having a large amount of content that may be additionally recommended to the target user by the content recommending module 233.

In sum, the invention (a) generates and updates a tag cloud of a user according to content usage type of the user based on a tag cloud assigned collectively to content by a plurality of users, (b) measures similarities between users, and (c) recommends more suitable and appropriate content to users.

As mentioned above, the content recommendation method according to the invention deduces similarities between two users using a tag cloud, and although the two users did not use the same content, the users can be identified as user neighbors having similar inclinations. Conventionally, similarities between two users was deduced using an average or statistical value of evaluations, feedback, etc. given by users to the same content or purchase information of the same content by the users. However, such conventional methods has low coverage for a large set of contents, while the invention advantageously has high coverage for a large set of contents.

The recommendation apparatus and method according to the invention may be incorporated as computer readable code on a computer readable storage medium. The computer readable storage medium includes all kinds of storage devices for storing data readable by a computer system. For example, the computer readable medium may be a CD-ROM (compact disc read only memory), RAM (random access memory), ROM (read only memory), floppy disc, hard disc, or magneto-optical disc.

While this specification contains many specifics, these should not be construed as limitations on the scope of the invention or of what may be claimed, but rather, they should be construed as descriptions of features that may be specific to particular embodiments of the invention. Certain features that are described herein in the context of separate embodiments may also be implemented in a single embodiment. Conversely, various features that are described in the context of a single embodiment may also be implemented in multiple embodiments separately or in any suitable subcombination. Moreover, although features may have been described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination may in some cases be excised from the combination, and the claimed combination may be directed to a subcombination or a variation of a subcombination.

Similarly, while processes are depicted in the drawings in a particular order, this should be understood as requiring that such processes be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. Note that in certain circumstances, multitasking and parallel processing may be possible and advantageous. Moreover, the separation of various system components in the embodiments described above should not be understood as requiring such separation in all embodiments, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.

Although only a few implementations and examples are described above, other implementations, enhancements, and variations may be made by those of ordinary skill in the art that would still be within the scope of the invention. The invention is therefore only limited by the following claims.

Claims

1. Content recommendation apparatus, the apparatus comprising: a content recommendation server providing a content recommendation service and recommending content via a network; anda plurality of user terminals receiving the content provided by the content recommendation server via the network,wherein the content recommendation server comprisesa content tag cloud generating module including a processor to generate a content tag cloud by analyzing a tag assigned to each content and accumulating frequencies per tag of each content, wherein a plurality of tags as to one content are created by a plurality of users and accumulated as the content tag cloud as to the one content and are associated with words related to the content;a user tag cloud generating module including a processor to generate a user tag cloud by accumulating frequencies per tag of contents used by a user;a similarity computing module including a processor to compute a similarity between users using the user tag cloud; anda recommending module including a processor to recommend a content by computing a probability that a target user will use a specific content based on the computed similarity between users,wherein, in the accumulation of frequencies per tag of contents used by a user, the user tag cloud generating module normalizes a frequency per tag of a content used by the user and accumulates the normalized frequency per tag,wherein the recommending module determines a user neighborhood similar to a target user and recommends a content based on a similarity between the target user and a user in the determined user neighborhood, andwherein the recommending module determines a user neighborhood based on users who have used at least two contents in a set of contents used by the target user and determines N users from the user neighborhood most similar to the target user based on the similarity between users computed by the similarity computing module.
2. The content recommendation apparatus of claim 1 wherein the similarity computing module computes a cosine value of an angle between two vectors of tag clouds of two users.
3. The content recommendation apparatus of claim 1 further comprising a second recommending module configured to recommend a package of items or an online shop having a large amount of content recommended by the recommending module.
4. A content recommendation method of a content recommendation apparatus, the apparatus comprising: a content recommendation server providing a content recommendation service and recommending content via a network; anda plurality of user terminals receiving the content provided by the content recommendation server via the network, the method comprising:analyzing a tag assigned to each content and accumulating frequencies per tag of each content to generate a content tag cloud by a processor included in a content cloud generating module of the content recommendation server, wherein a plurality of tags as to one content are created by a plurality of users and accumulated as the content tag cloud as to the one content and are associated with words related to the content;accumulating frequencies per tag of contents used by a user to generate a user tag cloud by a processor included in a user tag cloud generating module of the content recommendation server;computing a similarity between users using the user tag cloud by a processor included in a similarity computing module of the content recommendation server; andcomputing a probability that a target user will use a specific content based on the computed similarity between users to recommend a content by a processor included in a recommendation module of the content recommendation server,wherein, in accumulation of frequencies per tag of contents used by a user, generating the user tag cloud normalizes a frequency per tag of a content used by the user and accumulates the normalized frequency per tag,wherein the recommending a content comprises: determining a user neighborhood similar to a target user; and recommending a content based on a similarity between the target user and a user in the determined user neighborhood,wherein a user neighborhood is determined based on users who have used at least two contents in a set of contents used by the target user, andwherein N users are determined from the user neighborhood most similar to the target user based on the similarity between users computed by the similarity computing module.
5. The content recommendation method of claim 4 wherein computing the similarity comprises computing a cosine value of an angle between two vectors of tag clouds of two users as similarity.
6. The content recommendation method of claim 4, further comprising: recommending a package of items or an online shop having a large amount of content recommended in the recommending step.
7. A computer readable storage medium having a computer program stored thereon for implementing a content recommendation function using a tag cloud in a computer with a processor, the function comprising: generating a content tag cloud by analyzing a tag assigned to each content and accumulating frequencies per tag of each content to generate a content tag cloud by a processor included in a content cloud generating module of the content recommendation server, wherein a plurality of tags as to one content are created by a plurality of users and accumulated as the content tag cloud as to the one content and are associated with words related to the contentgenerating a user tag cloud by accumulating frequencies per tag of contents used by a user to generate a user tag cloud by a processor included in a user tag cloud generating module of the content recommendation server;computing a similarity between users using the user tag cloud by a processor included in a similarity computing module of the content recommendation server; computing a similarity between users using the user tag cloud by a processor included in a similarity computing module of the content recommendation server;computing a probability that a target user will use a specific content based on the computed similarity between users to recommend a content by a processor included in a recommendation module of the content recommendation server,wherein, in accumulation of frequencies per tag of contents used by a user, generating the user tag cloud normalizes a frequency per tag of a content used by the user and accumulates the normalized frequency per tag,wherein the recommending a content comprises: determining a user neighborhood similar to a target user; and recommending a content based on a similarity between the target user and a user in the determined user neighborhooddetermining a user neighborhood based on users who have used at least two contents in a set of contents used by the target user; anddetermining N users from the user neighborhood most similar to the target user based on the similarity between users computed by the similarity computing module.

Priority Claims (1)

Number	Date	Country	Kind
10-2007-0117987	Nov 2007	KR	national

US Referenced Citations (28)

Number	Name	Date	Kind
5749081	Whiteis	May 1998	A
5790426	Robinson	Aug 1998	A
6138128	Perkowitz et al.	Oct 2000	A
6313745	Suzuki	Nov 2001	B1
6873967	Kalagnanam et al.	Mar 2005	B1
6981040	Konig et al.	Dec 2005	B1
7231419	Gheorghe et al.	Jun 2007	B1
7584159	Chakrabarti et al.	Sep 2009	B1
20020010625	Smith et al.	Jan 2002	A1
20020065721	Lema et al.	May 2002	A1
20030105682	Dicker et al.	Jun 2003	A1
20030149612	Berghofer et al.	Aug 2003	A1
20040076936	Horvitz et al.	Apr 2004	A1
20050071328	Lawrence	Mar 2005	A1
20050125307	Hunt et al.	Jun 2005	A1
20050267818	Kaplan	Dec 2005	A1
20060020662	Robinson	Jan 2006	A1
20060200556	Brave et al.	Sep 2006	A1
20060253590	Nagy et al.	Nov 2006	A1
20060271953	Jacoby et al.	Nov 2006	A1
20070039023	Kataoka	Feb 2007	A1
20070078832	Ott, IV et al.	Apr 2007	A1
20080222105	Matheny	Sep 2008	A1
20080242279	Ramer et al.	Oct 2008	A1
20090024962	Gotz	Jan 2009	A1
20090077081	Sarma et al.	Mar 2009	A1
20100228691	Yang et al.	Sep 2010	A1
20110191311	Polonsky et al.	Aug 2011	A1

Foreign Referenced Citations (10)

Number	Date	Country
2184693	May 2010	EP
2001-142890	May 2001	JP
2005-196415	Jul 2005	JP
2006-302097	Nov 2006	JP
2007-213401	Aug 2007	JP
10-0720762	May 2007	KR
WO 0036544	Jun 2000	WO
WO 0051050	Aug 2000	WO
WO 2007124225	Nov 2007	WO
2008-109980	Sep 2008	WO

Related Publications (1)

	Number	Date	Country
	20090132526 A1	May 2009	US

Content recommendation apparatus and method using tag cloud

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications