1. Field of the Invention
This invention relates in general to methods and data processing system readable media, and more particularly, to methods of profiling a user and data processing system readable media for carrying out those methods.
2. Description of the Related Art
Many different Internet Service Providers (“ISPs”) or sites are interested in creating or using profiles of their users. One way to obtain a profile for a user is to rely on information that the user may voluntarily provide. One problem with this is that the user may lie or give inaccurate, misleading, or incomplete data regarding himself or herself.
Another attempt at establishing a profile for a user may include reviewing the Internet Protocol (“IP”) address of the user and determining the geographic location and time zone for the user. However, that information may be misleading depending upon the ISP and where it is located. For example, if the user is using America Online, the user may be listed as residing in Virginia and being in the Eastern Time zone. In reality, the user may be in a different country and residing within a time zone many hours different from the Eastern Time zone. Additionally, many different users may be using the same computer (same IP address). Clearly, such information is of little use in determining interests or other information regarding an individual. Even if the information regarding state and time zone are correct and only one user is using a specific IP address with a local ISP, this may yield little useful information about the individual user. The ISP may have thousands or millions of users.
A user profile can be generated using a table that tracks a user by category information and temporal information. In one embodiment, the user profile can be generated with relatively minimal resources and relies on the user behavior rather than information provided by the user or just the IP address. In one configuration, a network access provider or network site may track a user's network addresses by timestamp and correlate the network addresses to meta tags for the network address that are obtained for free or using a third party source. A table including a user identifier, corresponding category, and timestamp may provide a basis for forming a user profile.
In one set of embodiments, a method of profiling a user can comprise accessing first data including a first identifier for the user, network addresses accessed by the user, and temporal information related to the user identifier and the network addresses. The method can also comprise accessing second data including at least some the network addresses and corresponding category information for each of those network addresses. The method can further comprise generating a user profile based at least in part on the first identifier, category information, and at least some of the temporal information.
In a specific embodiment, generating the user profile can include creating a table (third data), comparing the data for the user within the table to existing profiles, and associating the user with a particular profile. In another embodiment, the method can further comprise selecting marketing information regarding an item, where the selection is based at least in part on the user profile, and sending the marketing information to the user. In still another embodiment, the user profile can be sold.
In other embodiments, a data processing system readable medium can have code embodied within it. The code can include instructions executable by a data processing system. The instructions may be configured to cause the data processing system to perform the methods described herein.
The foregoing general description and the following detailed description are exemplary and explanatory only are not restrictive of the invention, as claimed.
The present invention is illustrated by way of example and not limitation in the accompanying figures, in which:
Skilled artisans appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions of some of the elements in the figures may be exaggerated relative to other elements to help to improve understanding of embodiments of the present invention.
Reference is now made in detail to the exemplary embodiments of the invention, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers will be used throughout the drawings to refer to the same or like parts (elements).
A method can be used to profile a user using network addresses and temporal information of the network addresses when the user requests or is sent information for those network addresses. The method can comprise accessing first data including a user identifier, network addresses accessed by the user, and temporal information. The method can also comprise accessing second data including at least some of the network addresses and corresponding category information for each of those network addresses. The method can further comprise generating the user profile based at least in part on the user identifier, category information, and at least some of the temporal information.
A few terms are defined or clarified to aid in understanding the descriptions that follow. A network includes an interconnected set of server and client computers over a publicly available medium (e.g., the internet) or over an internal (company-owned) system. A user at a client computer may gain access to the network using a network access provider. An Internet Service Provider (“ISP”) is a common type of network access provider. A network address includes information that can be used by a server computer to locate information, whether internal to that server computer or at a different, remote computer or database. URLs are examples of network addresses. A network site corresponds to a location specified by a network address. A web site is a common type of network site. Note that the examples given within this paragraph are for purposes of illustration and not limitation.
Before discussing details of embodiments of present invention, a hardware architecture for using embodiments is described.
Each of the client computer 12 and the server computer 16 is an example of a data processing system. ROM 122 and 162, RAM 124 and 164, HD 126 and 166, and the database 18 include media that can be read by the CPU 120 or 160. Therefore, each of these types of memories includes a data processing system readable medium. These memories may be internal or external to the computers 12 and 16.
The methods described herein may be implemented in suitable software code that can reside within ROM 122 or 162, RAM 124 or 164, or HD 126 or 166.
In an illustrative embodiment of the invention, the computer-executable instructions may be lines of compiled C++, Java, or other language code. Other architectures may be used. For example, the functions of the client computer 12 may be incorporated into the server computer 16, and vice versa. In alternative embodiments, more server computers can be used, such that the functions of server computer 16 are performed by a combination of server computers.
Communications between the client computer 12 and the server computer 16 can be accomplished using radio frequency, electronic, or optical signals. When a user (human) is at the client computer 12, the client computer 12 may convert the signals to a human understandable form when sending a communication to the user and may convert input from the user to appropriate signals to be used by the client computer 12 or the server computer 16.
Attention is now directed to methods of profiling users as illustrated in
In one embodiment, a user can “surf” the network on client computer 12, while the server computer 16 collects and stores the user identifier, network addresses, and timestamps within the first table in the database 18. The user may be routed to other server computers (not shown) that correspond to the network sites requested by the user at client computer 12. The server computer 16 may access and read the information within the first table within database 18. Other users (not shown) may be using the network 14 and server computer 16.
The method can also comprise accessing data in a second table that includes network addresses and corresponding category information (304). This act may be performed by the server computer 16 in one embodiment. The data for the second table may reside in the storage device 18 or, alternatively, may reside within a database at an external location of a third party (not shown). For example, the second table may be part of the Netscape Open Directory Project or may be provided by LookSmart, Ltd. of San Francisco, Calif. Either of these third-party sources may provide a categorization of each Internet site. The categorization may be in the form of meta tags, which are metadata that correspond to the network addresses. Skilled artisans may realize that meta tags are used by search engines, such as Yahoo!, Lycos, Excite, and the like in performing user specified searches for information. To the inventor's knowledge, the meta tags for the search engines have not been used in generating user profiles. A short text portion of a URL may optionally be used. In any event, a table of network addresses and categories can be created or accessed using a third-party source that is different from the user and the party that significantly owns or controls the server computer 16.
The method can further include creating a third table that includes user identifiers, category information, and temporal information as shown in block 322. The table can be created by matching the network addresses in the first and second tables. Other information may be included within the third table. In one embodiment, the third table can sort by user identifier and time to show categories in chronological or reverse chronological order.
Different method may be used to generate the user profile. The methods described herein are only to illustrate and not limit the present invention. In one embodiment, the information in the third table can be used in generating a user profile by performing click-stream analysis. To generate the user profile, the data within the third table can be compared to existing profiles (block 342). The existing profiles can represent another person or can represent a theoretical individual having attributes similar to a person. For each user, his or her information from the third table can be compared to the existing profiles. The method can also include associating each of the users with a profile that most closely matches that particular user (block 344). Alternatively, data mining rules may be used to determine the characteristics of the user, and the user profile may include a set of discrete, but potentially interrelated, characteristics.
The information gathered may be used in wide variety of ways. The examples described below are meant to illustrate and not limit the present invention. In one embodiment, the information may be useful for providing information or offering an item to the user. As used herein, “item” means a product or a service. The method can include selecting marketing information regarding an item based on the user profiles (block 362). The marketing information may vary widely from embodiment to embodiment. In one embodiment, the marketing information may include banner advertisements that may be displayed near a periphery of a view as seen by the user on I/O 128 of client computer 12. A network access provider may perform the selection of the banner advertisements. Alternatively, network sites may be owned or controlled by a company that sells the item. In this example, the marketing information may comprise an offer to sell the item to the user.
The method can further comprise applying one or more filters as shown in block 364. These filters can be used to remove information that may have been previously seen by the user, to remove information regarding some items having lower profit margins from a collection of items, to remove information that may be offensive or objectionable to be user, or the like. The method can further comprise an optional act of sending the filtering marketing information to the user as shown in block 366.
In another embodiment, a company creating user profiles using data within the tables may sell the user profiles to a marketing company, a mailing service, or other third party. Alternatively, the information can be used in performing internal analytics within the company. After reading this specification, skilled artisans appreciate that the number and variety of uses of the user profiles is nearly limitless.
Embodiments of the present invention may allow creation of a user profile without having to rely on the user for his or her input or having to rely upon information only within a HyperText Transfer Protocol (HTTP) stream, such as an IP address. The click stream may be the only source of user information used for the profile. Solely by a using a user's chronological click stream and network address categorization, a reasonably accurate user profile can be obtained for an individual at a reasonably low cost without complex software code requirements.
In the foregoing specification, the invention has been described with reference to specific embodiments. However, one of ordinary skill in the art appreciates that various modifications and changes can be made without departing from the scope of the present invention as set forth in the claims below. Accordingly, the specification and figures are to be regarded in an illustrative rather than a restrictive sense, and all such modifications are intended to be included within the scope of present invention.
Benefits, other advantages, and solutions to problems have been described above with regard to specific embodiments. However, the benefits, advantages, solutions to problems, and any element(s) that may cause any benefit, advantage, or solution to occur or become more pronounced are not to be construed as a critical, required, or essential feature or element of any or all the claims. As used herein, the terms “comprises,” “comprising,” or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
This application claims priority under 35 U.S.C. § 119(e) to U.S. Patent Application No. 60/208,889 entitled “Method For Click-Stream Analysis Using Web Directory Reverse Categorization” by McCullough filed Jun. 2, 2000, and U.S. Patent Application No. 60/208,890 entitled “Method For Continuous, Frame-Specific Click-Stream Recording” by McCullough Jun. 2, 2000. This is also related to U.S. patent application Ser. No. 09/681,758, now U.S. Pat. No. 7,155,506, entitled “Method For Continuous, Frame-Specific Click-Stream Recording” by McCullough filed of even date. All applications listed in this paragraph and are assigned to the current assignee hereof and are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4754428 | Schultz et al. | Jun 1988 | A |
4811207 | Hikita et al. | Mar 1989 | A |
5210824 | Putz et al. | May 1993 | A |
5212787 | Baker et al. | May 1993 | A |
5226161 | Khoyi et al. | Jul 1993 | A |
5257369 | Skeen et al. | Oct 1993 | A |
5331673 | Elko et al. | Jul 1994 | A |
5339392 | Risberg et al. | Aug 1994 | A |
5421015 | Khoyi et al. | May 1995 | A |
5557717 | Wayner | Sep 1996 | A |
5572643 | Judson | Nov 1996 | A |
5704017 | Heckerman et al. | Dec 1997 | A |
5727129 | Barrett et al. | Mar 1998 | A |
5732218 | Bland et al. | Mar 1998 | A |
5740430 | Rosenberg et al. | Apr 1998 | A |
5761416 | Mandal et al. | Jun 1998 | A |
5774660 | Brendel et al. | Jun 1998 | A |
5790426 | Robinson | Aug 1998 | A |
5790790 | Smith et al. | Aug 1998 | A |
5796952 | Davis et al. | Aug 1998 | A |
5870559 | Leshem et al. | Feb 1999 | A |
5878223 | Becker et al. | Mar 1999 | A |
5884282 | Robinson | Mar 1999 | A |
5918014 | Robinson | Jun 1999 | A |
5958008 | Pogrebisky et al. | Sep 1999 | A |
6012052 | Altschuler et al. | Jan 2000 | A |
6041311 | Chislenko et al. | Mar 2000 | A |
6041335 | Merritt et al. | Mar 2000 | A |
6049777 | Sheena et al. | Apr 2000 | A |
6067565 | Horvitz | May 2000 | A |
6085226 | Horvitz | Jul 2000 | A |
6092049 | Chislenko et al. | Jul 2000 | A |
6094662 | Hawes | Jul 2000 | A |
6112186 | Bergh et al. | Aug 2000 | A |
6112279 | Wang | Aug 2000 | A |
6119103 | Basch et al. | Sep 2000 | A |
6128655 | Fields et al. | Oct 2000 | A |
6128663 | Thomas | Oct 2000 | A |
6138141 | DeSimone et al. | Oct 2000 | A |
6138156 | Fletcher et al. | Oct 2000 | A |
6141737 | Krantz et al. | Oct 2000 | A |
6144962 | Weinberg et al. | Nov 2000 | A |
6178419 | Legh-Smith et al. | Jan 2001 | B1 |
6185586 | Judson | Feb 2001 | B1 |
6185608 | Hon et al. | Feb 2001 | B1 |
6199067 | Geller | Mar 2001 | B1 |
6205472 | Gilmour | Mar 2001 | B1 |
6286043 | Cuomo et al. | Sep 2001 | B1 |
6321206 | Honarvar | Nov 2001 | B1 |
6430539 | Lazarus et al. | Aug 2002 | B1 |
6456305 | Qureshi et al. | Sep 2002 | B1 |
6509898 | Chi et al. | Jan 2003 | B2 |
6559882 | Kerchner | May 2003 | B1 |
6606657 | Zilberstein et al. | Aug 2003 | B1 |
6629136 | Naidoo | Sep 2003 | B1 |
6640215 | Galperin et al. | Oct 2003 | B1 |
6691106 | Sathyanarayan | Feb 2004 | B1 |
6732331 | Alexander | May 2004 | B1 |
6757740 | Parekh et al. | Jun 2004 | B1 |
6839682 | Blume et al. | Jan 2005 | B1 |
20010037321 | Fishman et al. | Nov 2001 | A1 |
20020108121 | Alao et al. | Aug 2002 | A1 |
20050261965 | Eisen et al. | Nov 2005 | A1 |
Number | Date | Country | |
---|---|---|---|
60208889 | Jun 2000 | US | |
60208890 | Jun 2000 | US |