This disclosure relates generally to monitoring media and, more particularly, to methods and apparatus for estimating total unique audiences exposed to media.
Traditionally, audience measurement entities have measured audience engagement levels for media based on registered panel members. That is, an audience measurement entity (AME) enrolls people who consent to being monitored into a panel. The AME then monitors those panel members to determine media (e.g., television programs, radio programs, movies, DVDs, advertisements, streaming media, websites, etc.) presented to those panel members. In this manner, the AME can determine exposure metrics for different media based on the collected media measurement data.
Techniques for monitoring user access to Internet resources, such as webpages, advertisements and/or other Internet-accessible media, have evolved significantly over the years. Internet-accessible media is also known as online media. Some known systems perform such monitoring primarily through server logs. In particular, entities serving media on the Internet can use known techniques to log the number of requests received at their servers for media.
The figures are not to scale. Wherever possible, the same reference numbers will be used throughout the drawing(s) and accompanying written description to refer to the same or like parts
Techniques for monitoring user access to Internet-accessible media, such as websites, advertisements, content and/or other media, have evolved significantly over the years. Internet-accessible media is also known as online media. In the past, such monitoring was done primarily through server logs. In particular, entities serving media on the Internet would log the number of requests received for their media at their servers. Basing Internet usage research on server logs is problematic for several reasons. For example, server logs can be tampered with either directly or via zombie programs, which repeatedly request media from the server to increase the server log counts. Also, media is sometimes retrieved once, cached locally and then repeatedly accessed from the local cache without involving the server. Server logs cannot track such repeat views of cached media. Thus, server logs are susceptible to both over-counting and under-counting errors.
The inventions disclosed in Blumenau, U.S. Pat. No. 6,108,637, which is hereby incorporated herein by reference in its entirety, fundamentally changed the way Internet monitoring is performed and overcame the limitations of the server side log monitoring techniques described above. For example, Blumenau disclosed a technique wherein Internet media to be tracked is tagged with monitoring instructions. In particular, monitoring instructions (also known as a media impression request) are associated with the hypertext markup language (HTML) of the media to be tracked. When a client requests the media, both the media and the impression request are downloaded to the client. The impression requests are, thus, executed whenever the media is accessed, be it from a server or from a cache.
Impression requests cause monitoring data reflecting information about an access to the media to be sent from the client that downloaded the media to a monitoring entity. Sending the monitoring data from the client to the monitoring entity is known as an impression request. Typically, the monitoring entity is an AME that did not provide the media to the client and who is a trusted (e.g., neutral) third party for providing accurate usage statistics (e.g., The Nielsen Company, LLC). Advantageously, because the impression requests are associated with the media and executed by the client browser whenever the media is accessed, the monitoring information is provided to the AME (e.g., via an impression request) irrespective of whether the client corresponds to a panelist of the AME.
There are many database proprietors operating on the Internet. These database proprietors provide services to large numbers of subscribers. In exchange for the provision of services, the subscribers register with the database proprietors. Examples of such database proprietors include social network sites (e.g., Facebook, Twitter, MySpace, etc.), multi-service sites (e.g., Yahoo!, Google, Axiom, Catalina, etc.), online retailer sites (e.g., Amazon.com, Buy.com, etc.), credit reporting sites (e.g., Experian), streaming media sites (e.g., YouTube, etc.), etc. These database proprietors set cookies and/or other device/user identifiers on the client devices of their subscribers to enable the database proprietor to recognize their subscribers when they visit their website.
The protocols of the Internet make cookies inaccessible outside of the domain, Internet domain, domain name, etc.) on which they were set. Thus, a cookie set in, for example, the amazon.com domain is accessible to servers in the amazon.com domain, but not to servers outside that domain. Therefore, although an AME might find it advantageous to access the cookies set by the database proprietors, they are unable to do so.
The inventions disclosed in Mainak et al., U.S. Pat. No. 8,370,489, which is incorporated by reference herein in its entirety, enable an AME to leverage the existing databases of database proprietors to collect more extensive Internet usage by extending the impression request process to encompass partnered database proprietors and by using such partners as interim data collectors. The inventions disclosed in Mainak et al. accomplish this task by structuring the AME to respond to impression requests from clients (Who may not be a member of an audience member panel and, thus, may be unknown to the audience member entity) by redirecting the clients from the AME to a database proprietor, such as a social network site partnered with the audience member entity, using an impression response. Such a redirection initiates a communication session between the client accessing the tagged media and the database proprietor. For example, the impression response received from the AME may cause the client to send a second impression request to the database proprietor. In response to receiving this impression request, the database proprietor (e.g., Facebook) can access any cookie it has set on the client to thereby identify the client based on the internal records of the database proprietor. In the event the client corresponds to a subscriber of the database proprietor, the database proprietor logs/records a database proprietor demographic impression in association with the client/user and subsequently forwards logged database proprietor demographic impressions to the AME.
As used herein, an impression is defined to be an event in which a home or individual accesses and/or is exposed to media (e.g., an advertisement, content, a group of advertisements and/or a collection of content). In Internet advertising, a quantity of impressions or impression count is the total number of times media (e.g., content, an advertisement or advertisement campaign) has been accessed by a web population (e.g., the number of times the media is accessed). In some examples, an impression or media impression is logged by an impression collection entity (e.g., an AME or a database proprietor) in response to a beacon request from a user/client device that requested the media. In some examples, a media impression is not associated with demographics. A panelist demographic impression is a media impression logged by an AME for which the AME has panelist demographics corresponding to a household and/or audience member exposed to media. As used herein, a database proprietor demographic impression is an impression recorded by a database proprietor in association with corresponding demographic information provided by the database proprietor in response to a beacon request from a client device of a registered subscriber of the database proprietor.
In the event the client does not correspond to a subscriber of the database proprietor, the database proprietor may redirect the client to the AME and/or another database proprietor. If the client is redirected to the AME, the AME may respond to the redirection from the first database proprietor by redirecting the client to a second, different database proprietor that is partnered with the AME. That second database proprietor may then attempt to identify the client as explained above. This process of redirecting the client from database proprietor to database proprietor can be performed any number of times until the client is identified and the media exposure logged, or until all database partners have been contacted without a successful identification of the client. In some examples, the redirections occur automatically so the user of the client is not involved in the various communication sessions and may not even know they are occurring.
Periodically or aperiodically, the partnered database proprietors provide their logs and demographic information to the AME, which then compiles the collected data into statistical reports identifying audience members for the media.
Example techniques disclosed herein use database proprietors to identify audience demographics based on impression requests from client devices to track quantities of impressions attributable to users of those client devices. In some examples, the database proprietor demographic impressions collected by a database proprietor (e.g., Facebook, Yahoo, Google, etc.) may be inaccurate and/or incomplete when the database proprietor does not have complete coverage of device/user identifiers (e.g., cookies) at all of the client devices associated with impression requests or, more generally associated with an impression to be logged. As used herein in this context, coverage represents the extent to which a database proprietor has set cookies or, more generally, device/user identifiers in client devices associated with beacon requests. For example, if only 50% of client devices that send an impression request associated with a media impression to the database proprietor have a cookie set therein by the database proprietor, then the database proprietor has 50% coverage of such client devices. A client device may not have a cookie set by the database proprietor in its web browser if, for example, a user does not have an account with the database proprietor or if the user has an account with the database proprietor but has cleared the cookie cache and deleted the database proprietor's cookie before or at the time of a media exposure. In yet other examples, the database proprietor may set a cookie on the client device but the client device does not correspond to a registered user of the database proprietor. In any of such examples, the database proprietor would not be able to identify the user associated with one or more media impressions and, thus, would not report any database proprietor demographic impressions for those impressions.
Examples to estimate a unique audience size for logged media impressions based on logged database proprietor demographic impressions are disclosed herein. In some examples, estimates of the unique audience size are determined from database proprietor demographic impression data collected by database proprietors. In some disclosed examples, an AME estimates a unique audience size using a number of media impressions, a number of recorded (e.g., logged) database proprietor demographic impressions, a frequency distribution of the recorded database proprietor demographic impressions across a partial audience and the number of people associated with the database proprietor demographic impressions (e.g., the partial audience). The number of recorded database proprietor demographic impressions and the partial audience size can be determined from the frequency distribution of the recorded database proprietor demographic impressions. As used herein, a frequency distribution is indicative of (1) a total quantity of unique audience members who have not been exposed to a particular media, (2) a total quantity of unique audience members who have been exposed to the particular media exactly once, (3) a total quantity of unique audience members who have been exposed to the particular media exactly twice, etc.
The people associated with the database proprietor demographic impressions at the database proprietor are referred to as the partial audience. The term partial audience is used because some individuals associated with the media impression requests sent to the AME, may not be registered with the database proprietor. As such, the database proprietor will not record (e.g., log) media impressions for these individuals in response to impression requests redirected by the AME to the database proprietor because these individuals are not registered with the database proprietor.
In some disclosed examples, an AME sends a list of logged impressions for particular online media to one or more database proprietor(s). The database proprietor(s) respond with a number of recorded database proprietor demographic impressions from the partial audience, and the size of the partial audience. In other examples, the database proprietor may receive media impression requests for media directly from client devices (e.g., without being redirected by the AME) that access the media via one or more websites. In some examples, accessing media may include media retrieved from a server through a website in response to a user-request specifically requesting the media. In some examples, the media could be delivered by a server for presentation via a website without a user intentionally requesting the media. For example, some media is presented on a website as a result of the website being programmed to request and present the media as part of the website being rendered. The database proprietor may record a quantity of media impressions (e.g., impressions that are not matched with a user of the database proprietor) and a quantity of database proprietor demographic impressions (e.g., impressions that are matched with a user of the database proprietor). In these other examples, the database proprietor will provide the total quantity of media impressions not matched to a user of the database proprietor and the total quantity of database proprietor demographic impressions (e.g. the partial audience) to the AME.
Using examples disclosed herein, the AME determines an estimate size of an audience based on logged impressions using techniques designed to optimize computer resources (e.g., processor resources and memory resources) based on the number and/or complexity (e.g., number of websites associated with the logged impressions and/or number of logged impressions per user) of the logged impressions. In particular, three different UA/FD processes are disclosed herein to estimate unique audience sizes. The example UA/FD processes have different trade-offs between memory resource usage and processor resource usage under different circumstances. For example, a first unique audience and/or frequency distribution process, herein referred to as “UA/FD process 1,” requires the least amount of processor resources (e.g., is least computationally intensive) and requires the least amount of memory resources. Example UA/FD process 1 is configured to estimate unique audiences based on logged media impressions and database proprietor demographic impressions associated with media accessed via a website. A second unique audience and/or frequency distribution process, herein referred to as “UA/FD process 2,” requires more processor resources and memory resources than UA/FD process 1. Example UA/FD process 2 is configured to estimate unique audiences based on logged impressions and database proprietor demographic impressions associated with media accessed via one or more websites. Although UA/FD process 2 requires more processor resources and memory resources, the estimates from UA/FD process 2 are more accurate than UA/FD process 1 because UA/FD process 2 uses data (e.g., logged impressions) corresponding to media accesses via more than one website to estimate unique audiences. A third unique audience and/or frequency distribution process, herein referred to as “UA/FD process 3,” requires the most processor resources, but less memory resources than UA/FD process 2. UA/FD process 3 is designed to estimate unique audiences based on logged impressions and audience data corresponding to media accesses via one or more websites.
In some examples, although UA-FD process 3 requires the most processor resources, UA/FD process 3 is useable under certain situations in which UA/FD processes 1 and 2 disclosed herein are not capable to determine unique audience. For example, the AME may receive impression requests and database proprietor demographic impressions associated with media accessed via hundreds of websites, where each person may have been exposed to the media hundreds of times. In such an example, UA/FT) process 1 and UA/FD process 2 may not be useable to calculate the unique audience due to the large number of websites and/or impressions. For example, UA/FD process 1 is only used to determine a unique audience based on impressions corresponding to media accessed via one website. In addition, UA/FD process 2 may not be usable because a processor system (e.g., a computer) may not have sufficient available memory resources for UA/FD process 2 to process the large number of impressions from the hundreds of websites through which the media was accessed. In such an example, by adjusting for available processor resources and memory resources, UA/FD process 3 may be the relatively best solution of the three UA/FD processes disclosed herein to estimate a unique audience for the media accessed via the hundreds of websites.
As described above, the example UA/FD process 1 101 requires the least amount of the example processor resources 104 and requires the least amount of the example memory resources 105. The example UA/FD process 1 101 uses the principle of maximum entropy and minimum cross entropy. Given (1) an unspecified univariate distribution (A) with unknown probabilities, qk, (where k can be any non-negative integer), (2) a known expected value E[A]=μ1, and (3) a known initial probability qo=P[A=0], the principle of maximum entropy is used to determine the unspecified univariate distribution (A). To determine the unspecified univariate distribution (A), Equation 1 below is determined.
maximize Q,H=−Σk=0∞ log(qk), Equation 1
subject to q0 given Σk=0∞ qk=1 and Σk=0∞ kqk=μ1
The solution is a zero-modified geometric distribution of the form q0 given qk=Crk where k=1, 2, . . . , ∞,
Once the unspecified univariate distribution (A) is determined, the principle of minimum cross entropy is used to determine a second unspecified univariate distribution (B) with probabilities pk on the same mathematical domain as the first unspecified univariate distribution (A), with constraint E[B]=μ2. Calculating for the initial probability p0=P[B=0] results in a simple and accurate estimation for a unique audience based on logged impressions corresponding to media associated with one website.
The solution becomes a previous distribution to a minimize cross entropy problem (e.g., q is the prior distribution and p is the unknown distribution to be solved) as shown in Equation 2 below.
The solution is a zero-modified geometric distribution:
p0=s0q0,pk=s0s1kCrk, where k=1,2, . . . ,∞.
In the illustrated example, s0 and s1 are solved to satisfy the constraints (e.g., Σk=0∞ pk=1 and Σk=0∞ kpk=μ2). The zero-modified geometric distribution (pk) is a unique solution. The distribution (pk) is plugged into the first constraint (e.g., Σk=0∞ pk=1) and solved for s1 to determine Equation 3 below.
Substituting s1 back into pk and applying the second constraint (e.g., Σk=0∞ kpk=μ2) results in Equation 4 below μ2:
Rearranging (e.g., using known mathematical properties) Equation 4 results in Equation 5 below:
The example UA/FD process 1 101 estimates a unique audience (X) in a universe population (U) that was exposed to the example media 100 (e.g., by sending an impression request associated with the media) using Equation 5 above. Additionally, a total number of logged media impressions (T), a total number of database proprietor demographic impressions (e.g., logged media impressions matched to a user of a database proprietor) (R), and a total identified database proprietor audience exposed to the media 100 (A) are also utilized in Equation 5 above. The example UA/FD process 1 101 equates
to produce a first unique audience sub-process
which is used by UA/FD process 1 101 to determine the total unique audience (X).
In some examples, when the universe is large (e.g., larger than a threshold size) and the number of logged impressions (e.g., R and T) is small (e.g., based on a threshold) relative to the audience sizes (e.g., A and X), the first unique audience sub-process can be simplified to a second unique audience sub-process. The second unique audience sub-process is
In some examples, when the universe is large and the number of logged impressions (e.g., R and T) are large (e.g., based on the threshold) relative to the audience sizes (e.g., A and X), the second unique audience sub-process can be simplified to a third unique audience sub-process. The third unique audience sub-process is
The example UA/FD process 1 101 applies data associated with the impressions of a particular media to one of the first, second, or third unique audience sub-processes based on the thresholds. Additionally, the example UA/FD process 1 101 can determine the number of people in the unique audience associated with exactly one logged impression, exactly two logged impressions, etc. (e.g., a frequency distribution) based on a geometric distribution formula (e.g., P(Z=k)=(1−p)k-1p, where
and kϵ{1, 2, . . . , ∞}).
Since the three equations are simple to compute, processor resources required for the example UA/FD process 1 101 to determine a unique audience are low, Additionally, since only A, R, T, and U are required to be stored in memory, required memory for the example UA/FD process 1 101 is low.
As described above, the example UA/FD process 2 102 requires more processor resources and memory than the example UA/FD process 1 101. However, the example UA/FD process 2 102 can determine a unique audience based on data from one or more websites associated with one or more database proprietors. Additionally, if the estimate is based on two or more websites, the estimate is more accurate. The example UA/FD process 2 102 is derived from the process of maximum entropy and minimum cross entropy. The example UA/FD process 2 102 constructs an audience/impressions model constraint matrix (e.g., CQ) and an audience/impressions total constraint vector (e.g., DQ) to represent the total audience exposed to media based on one or more logged impressions. The example audience/impressions representation constraint (CQ) matrix includes rows on constraints associated with a universe population, a total audience, an expected value of the total media impressions, etc., as further described below in connection with
The unique audiences exposed to media associated with logged impressions can be determined by solving for an impression characteristics column vector (Q). The impressions characteristics column vector (Q) includes probabilities representing a number or people associated with zero impressions corresponding to website A and/or website B, one impression from website A and/or website B, two impressions from website A and/or website B, and/or any combination thereof. The example UA/FD process 2 102 determines impressions characteristic column vector (Q) using Equation 6 below.
maximize Q,H=−Σk=0∞qk log(qk), Equation 6
subject to CQQ=DQ.
To solve the total unique audience for logged impressions associated with the example media 100, the example UA/FD process 2 102 uses impressions characteristics column vector (Q) as the previous distribution for estimating the same distribution of probabilities based on audience (e.g., population) characteristics (e.g., P) using different constraints. The audience characteristics (P) are used to determine the total unique audience, as further described in
Since the example UA/FD process 2 102 involves a system of non-linear equations, the example processor resources 104 required to determine the unique audience is higher than the resources required to determine the unique audience using the example UA/FD process 1 101. Additionally, since the example UA/FD process 2 102 stores the audience/impression model constraint matrix (CQ), the audience/impressions total constraint vector (DQ), the impressions characteristics vector (Q), and the audience characteristics (P), the amount of memory resources 105 required for the example UA/FD process 2 102 is more than the mount of the example memory resources 105 required for the example UA/FD process 1 101. However, as described above, the example UA/FD process 1 101 cannot determine a unique audience based on logged impressions corresponding to media accessed via one or more websites.
The example UA/FD process 3 103 requires the most processor resources, but less memory than the example UA/FD process 2 102. As the number of websites associated with a request to log and/or the number of logged impressions per person per website increases, the amount of the example memory resources 105 associated with the example UA/FD process 2 102 becomes substantially large. In some examples, there is not enough of the example memory resources 105 to store all the values associated with the example UA/FD process 2 102. In such examples, the UA/FD process 3 103 may be used to determine the unique audience and/or frequency distribution, because the UA/FD process 3 103 uses less to determine the unique audience and/or frequency distribution for the large number of websites and/or logged impressions.
The example UA/FD process 3 103 decreases the amount of required memory resources by calculating a combination of a set of the probabilities (e.g., to represent both the audience characteristics and the impression characteristics) associated with the example UA/FD process 2 102, instead of calculating all of the audience characteristics and the impression characteristics individually. For example, media exposures from two websites may have billions of probabilities to represent the impressions and audience characteristics, depending on the number of exposures per person. The example UA/FD process 2 102 calculates and stores the billions of probabilities individually, while the example UA/FD process 3 103 only determines and stores four combinations.
The example UA/FD process 3 103 creates a combination matrix by enumerating all combinations that can occur, with each column being a constraint corresponding to a webpage. In some examples, the constraints may be logged impressions corresponding to a first website, logged impressions corresponding to a second website, a total number of impressions, a total audience, etc. The example UA/FD process 3 103 utilizes properties of the geometric series infinite summations to create a general formula for N websites, as shown in Equations 8 and 9 below.
where zi is representative of the ith constraint.
The example UA/FD process 3 103 creates a column based on a union of logged impressions and the population. The example UA/FD process 3 103 solves for N z values (e.g., exponents of LaGrangian multipliers for each constraint used during optimization) to satisfy the N constraints. The example UA/FD process 3 103 calculates the N z values using a system of non-linear equations. The example UA/FD process 3 103 modifies the z values corresponding to population constraints to solve for the population constraints. Although the example UA/FD process 3 103 is the most computationally intensive UA/FD process requiring the most processor resources, it requires less of the example memory resources 105 than the example UA/FD process 2 102.
As shown in the example situation 110, when the example UA/FD process 1 101 calculates the unique audience for logged impressions associated with the media 100 from the example website 106 with a small number of impressions per viewer, the example processor resources 104 and the example memory resources 105 used for such calculations are low. As shown in the example situation 115, the example UA/FD process 2 102 requires more of the example processor resources 104 than the example UA/FD process 1 101 and less of the example resources 104 than the example UA/FD process 3 103. Additionally, the example memory resources 105 needed for the example UA/FD process 2 102 remain relatively low. As shown in the example situation 120, the example UA/FD process 3 103 uses more of the example processor resources 104 than the UA/FD process 1 101 and the UA/FD process 2 102, but the example memory resources 105 remain relatively low. In the illustrated example of
In the example situation 125, the example UA/FD process 1 101 cannot be used determine a unique audience for each of the example small number of webpages 108 (e.g., a unique audience for website A, a unique audience for website B, etc.). That is, the example UA/FD process 1 101 is configured to determine a unique audience for a single website such as the website 106 of
In the example situation 140, the example UA/FD process 1 101 cannot be used to determine a unique audience for each of the example large number of websites 109 (e.g., a unique audience for website A, a unique audience for website B, etc.). That is, the example UA/FD process 1 101 is configured to determine a unique audience for a single website such as the website 106 of
The example client devices 202 of the illustrated example may be any device capable of accessing media over a network (e.g., the example network 204). For example, the client devices 202 may be an example mobile device 202a, an example computer 202b, 202d, an example tablet 202c, an example smart television 202e, and/or any other Internet-capable device or appliance. Examples disclosed herein may be used to collect impression information for any type of media including content and/or advertisements. Media may include advertising and/or content delivered via websites, streaming video, streaming audio, Internet protocol television (IPTV), movies, television, radio and/or any other vehicle for delivering media. In some examples, media includes user-generated media that is, for example, uploaded to media upload sites, such as YouTube, and subsequently downloaded and/or streamed by one or more other client devices for playback. Media may also include advertisements. Advertisements are typically distributed with content (e.g., programming). Traditionally, content is provided at little or no cost to the audience because it is subsidized by advertisers that pay to have their advertisements distributed with the content. As used herein, “media” refers collectively and/or individually to content and/or advertisement(s).
The example network 204 is a communications network. The example network 204 allows the example impression requests 206 from the example client devices 202 to the example impression collection entities 208. The example network 204 may be a local area network, a wide area network, the Internet, a cloud, or any other type of communications network.
The impression requests 206 of the illustrated example include information about accesses to media at the corresponding client devices 202 generating the impression requests. Such impression requests 206 allow monitoring entities, such as the impression collection entities 208, to collect a number of media impressions for different media accessed via the client devices 202. By collecting media impressions, the impression collection entities 204 can generate media impression quantities for different media (e.g., different content and/or advertisement campaigns).
The impression collection entities 208 of the illustrated example include the example database proprietor 210 and the example AME 212. In the illustrated example, the example database proprietor 210 may be one of many database proprietors that operate on the Internet to provide services to subscribers. Such services may be email services, social networking services, news media services, cloud storage services, streaming music services, streaming video services, online retail shopping services, credit monitoring services, etc. Example database proprietors include social network sites (e.g., Facebook, Twitter, My Space, etc.), multi-service sites (e.g., Yahoo!, Google, Axiom, Catalina, etc.), online retailer sites (e.g., Amazon.com, Buy.com, etc.), credit reporting sites (e.g., Experian), streaming media sites YouTube, etc.), and/or any other site that maintains user registration records.
In some examples, execution of the beacon instructions corresponding to the media 100 causes the client devices 202 to send impression requests 206 to servers 211, 213 (e.g., accessible via an Internet protocol (IP) address or uniform resource locator (URI)) of the impression collection entities 208 in the impression requests 206. In some examples, the beacon instructions cause the client devices 202 to locate device and/or users identifiers and media identifiers in the impression requests 206. The device/users identifier may be any identifier used to associate demographic information with a user or users of the client devices 202. Example device/user identifiers include cookies, hardware identifiers (e.g., an international mobile equipment identity (MEI), a mobile equipment identifier (MEID), a media access control (MAC) address, etc.), an app store identifier (e.g., a Google Android an Apple ID, an Amazon ID, etc.), an open source unique device identifier (OpenUDID), an open device identification number (ODIN), a login identifier (e.g., a username), an email address, user agent data (e.g., application type, operating system, software vendor, software revision, etc.), an Ad ID (e.g., an advertising ID introduced by Apple, Inc. for uniquely identifying mobile devices for purposes of serving advertising to such mobile devices), third-party service identifiers (e.g., advertising service identifiers, device usage analytics service identifiers, demographics collection service identifiers), etc. In some examples, fewer or more device/user identifier(s) 228 may be used. The media identifiers (e.g., embedded identifiers, embedded codes, embedded information, signatures, etc.) enable the impression collection entities 208 can identify to media (e.g., the media 100) objects accessed via the client devices 202. The impression requests 206 of the illustrated example cause the AME 212 and/or the database proprietor 210 to log impressions for the media 100. In the illustrated example, an impression request is a reporting to the AME 202 and/or the database proprietor 210 of an occurrence of the media 100 being presented at the client device 202. The impression requests 206 may be implemented as a hypertext transfer protocol (HTTP) request. However, whereas a transmitted HTTP request identifies a webpage or other resource to be downloaded, the impression requests 206 include audience measurement information (e.g., media identifiers and device/user identifier) as its payload. The server 211, 213 to which the impression requests 206 are directed is programmed to log the audience measurement information of the impression requests 206 as an impression (e.g., a media impression such as advertisement and/or content impressions depending on the nature of the media accessed via the client device 202). In some examples, the server 211, 213 of the database proprietor 201 or the AME 212 may transmit a response based on receiving an impression request 206. However, a response to the impression request 206 is not necessary. It is sufficient for the server 211, 213 to receive the impression request 206 to log an impression request 206.
The example database proprietor 210 maintains user account records corresponding to users registered for services (such as Internet-based services) provided by the database proprietors. That is, in exchange for the provision of services, subscribers register with the database proprietor 210. As part of this registration, the subscribers provide detailed demographic information to the database proprietor 210. Demographic information may include, for example, gender, age, ethnicity, income, home location, education level, occupation, etc. In the illustrated example, the database proprietor 210 sets a device/user identifier on a subscriber's client device 202 that enables the database proprietor 210 to identify the subscriber.
In the illustrated example, the example AME 212 does not provide the media 100 to the client devices 202 and is a trusted (e.g., neutral) third party (e.g., The Nielsen Company, LLC) for providing accurate media access (e.g., exposure) statistics. The example AME 212 includes the example audience/impressions determiner 214. As further disclosed herein, the example audience/impressions determiner 214 provides media access statistics related to the example impression requests 206. In some examples, the audience/impressions determiner 214 calculates a total reach (e.g., a total unique audience) exposed to particular media (e.g., the media 100) based on the example impression requests 206 and data from the example database proprietor 210 (e.g., database proprietor demographic impressions and/or partial audience), Additionally or alternatively, the example audience/impressions determiner 214 calculates a frequency distribution indicative of (1) a total quantity of unique audience members who have not been exposed to a particular media, (2) a total quantity of unique audience members who have been exposed to the particular media, exactly once, (3) a total quantity of unique audience members who have been exposed to the particular media exactly twice, etc. Additionally, the example audience/impressions determiner 214 may calculate any statistic related to the example impression requests 206. As disclosed herein, the example audience/impressions determiner 214 determines an optimal UA/FD process (e.g., one of the UA/FD processes 101, 102, 103 of
In operation, the example client devices 202 employ web browsers and/or applications (e.g., apps) to access media. Some of the web browsers, applications, and/or media include instructions that cause the example client devices 202 to report media monitoring information to one or more of the example impression collection entities 208. That is, when the client device 202 of the illustrated example accesses media, a web browser and/or application of the client device 202 executes instructions in the media, in the web browser, and/or in the application to send the example impression request 206 to one or more of the example impression collection entities 208 via the example network 206. The example impression requests 206 of the illustrated example include information about accesses to the media 100 and/or any other media at the corresponding client devices 202 generating the impression requests 206. Such impression requests allow monitoring entities, such as the example impression collection entities 208, to collect media impressions for different media accessed via the example client devices 202. In this manner, the impression collection entities 208 can generate media impression quantities for different media (e.g., different content and/or advertisement campaigns).
When the example database proprietor 210 receives the example impression request 206 from the example client device 202, the example database proprietor 210 requests the client device 202 to provide a device/user identifier that the da′abase proprietor 210 had previously set for the example client device 202. The example database proprietor 210 uses the device/user identifier corresponding to the example client device 202 to identify the subscriber of the client device 202.
In the illustrated example, three of the client devices 202a, 202b, and 202c have DP IDs (DP device/user IDs) that identify corresponding subscribers of the database proprietor 210. In this manner, when the client devices 202a, 202b, 202c corresponding to subscribers of the example database proprietor 210 send impression requests 206 to the impression collection entities 208, the database proprietor 210 may record database proprietor demographic impressions for the user. In the illustrated example, the client devices 202d, 202e do not have DP IDs. As such, the example database proprietor 210 is unable to identify the client devices 202d, 202e due to those client devices not having DP IDs set by the example database proprietor 210. The client devices 202d, 202e may not have DP IDs set by the database proprietor 210 if, for example, the client devices 202d, 202e do not accept cookies, a user does not have an account with the database proprietor 210 or the user has an account with the database proprietor 210 but has cleared the DP ID (e.g., cleared a cookie cache) and deleted the database proprietor's DP ID before or at the time of a media exposure. In such instances, if the user device 202 is, for example, redirected to contact the database proprietor 210 using the system disclosed in Mainak et al., U.S. Pat. No. 8,370,489, the database proprietor 210 is not able to detect demographics corresponding to the media exposure and, thus, does not report/log any audience or database proprietor demographic impressions for that exposure. In examples disclosed herein, the client devices 202d, 202e are referred to herein as client devices over which the database proprietor 210 has non-coverage because the database proprietor 210 is unable to identify demographics corresponding to those client devices 202d. 202e. As a result of the non-coverage, the database proprietor 210 underestimates the audience size and number of media impressions for corresponding media accessed via the client devices 202 when, for example, operating within the system of Mainak et al., U.S. Pat. No. 8,370,489.
The example AME 212 receives database proprietor demographic impression data from the example database proprietor 210. The database proprietor demographic impression data may include information relating to a total number of the logged database proprietor demographic impressions that correspond with a registered user of the database proprietor 210, a total number of registered users (e.g., a partial audience) that were exposed to media associated with the logged database proprietor demographic impressions, and/or any other information related to the logged database proprietor demographic impressions (e.g., demographics, a total number of registered users exposed to the media 100 more than once, etc.). The example audience/impressions determiner 214 determines a total number of logged media impressions (including but not limited to the number of logged database proprietor demographic impressions from the example database proprietor 210) and a total unique audience based on impression requests 206, survey data, census data, and/or data from a media provider. As further disclosed herein, the example audience/impression determiner 214 determines impression statistics based on the database proprietor demographic impressions associated with the example impression requests 206 and the database proprietor demographic impression data from the example database proprietor 210. The example audience/impression determiner 214 determines a unique audience(s) and/or frequency distribution using at least one of three UA/FD processes (e.g., the example UA/FD process 1 101, the example UA/FD process 1 102, and the example UA/FD process 3 103 of
The example data interface 300 receives the example impression requests 206 and data from the example database proprietor 303 (e.g., demographic database proprietor impression data). Initially, the example data interface 300 receives an impression request 206 to log an impression. The impression request 206 of the illustrated example includes a media identifier (ID) 301 to identify the example media 100. The media ID 301 is used to monitor impressions of media 100 and aggregate database proprietor impression data (e.g., database proprietor demographic impressions and/or a partial audience) associated with the media 100.
The example process selector 302 processes the received database proprietor demographic impression data from the example database proprietor 303 to select an optimal UA/FD process (e.g., one of the example UA/FD processes 101, 102, 103 of
The example audience data calculator 304 calculates a unique audience based on the received database proprietor demographic impression data 303 and the selected UA/FD process. In some examples, the audience data calculator 304 inputs data related to the logged media impressions and/or the aggregate database proprietor impression data into a formula to calculate the unique audience. In some examples, the audience data calculator 304 creates a population model constraint matrix (e.g., the example constraint matrix 825 of
The example impression data calculator 306 calculates a frequency distribution based on the received data 303 and the selected UA/FD process. In some examples, the impression data calculator 306 inputs data related to the logged media impressions and/or the aggregate database proprietor impression data into a formula to calculate the frequency distribution. In some examples, the impression data calculator 306 creates an audience/impressions model constraint matrix (CQ) and an audience/impressions total constraint vector (DQ) to calculate the frequency distribution as described above in connection with Equation 6. In some examples, the impression data calculator 306 solves various non-linear systems of equations (e.g., as described above in connection with Equations 8 and 9) to calculate the frequency distribution. The calculation of the frequency distribution and/or various statistics is further described below in connection with
The example report generator 308 generates reports based on the various statistics calculated by the example audience data calculator 304 and the example impression data calculator 306. In some examples, the report generated by the example report generator 308 includes a unique audience(s) and/or a frequency distribution. In some examples, the report generated by the example report generator 308 includes demographic data (e.g., a unique audience and/or frequency distribution for a particular demographic). In some examples, the report generator 308 includes data for the one or more websites that are associated with an impression associated with the example media 100. In some examples, the report generator 308 includes data in reports that describe how many people within the unique audience were exposed to the media 100 exactly once, twice, etc. In some examples, the report generator 308 combines data from logged impressions of the media 100 and/or other media associated with a particular company. For example, the report generated by the example report generator 308 may combine data indicating a total audience for three distinct advertisements for a particular company. Such reports may include data indicating how many people saw at least one of the three advertisements, how many people saw two of the three advertisements from website A or B, how many people saw the first two advertisements, but missed the last advertisement, etc. In some examples, the report generator 308 credits media associated with logged impressions based on the unique audience and/or frequency distribution.
While example manners of implementing the example audience/impressions determiner 214 of
Flowcharts representative of example machine readable instructions for implementing the example audience/impression determiner 214 of
As mentioned above, the example processes of
The example machine readable instructions 400 illustrated in
Initially, at block 402 of
At block 404 of
If the example processor selector 302 determines that the instructions of blocks 402 are not for measuring media exposure based on media impressions accessed via one website (e.g., the request is based on the logged impressions from the example two or more websites 108, 109), the example selector 302 determines if the number of probabilities associated with the numerous websites and impressions per person is more than a threshold amount (block 406). For example, as described above, as the number of websites and/or impressions per person increases, the amount of possible probabilities to solve and store increases. In some examples, a computer to execute the selected UA/FD process does not have enough of the required example memory resources 105 to store all of the possible probabilities associated with the websites and/or impressions. Since a person can view the media 100 a large number of times, the number of possible probabilities becomes nearly infinite. Thus, the example UA/FD process 2 102 (FIGS. 1A-1C) usually limits the number of impressions per person in order to determine a unique audience based on the limited number of impressions per person. The number of probabilities is equivalent to the number of impressions per person to the power of the number of websites. For example, if the request to log impressions is based on the media 100 from two websites (e.g., the example websites 108,109) where the most a client device can report an impression associated with the media 100 (e.g., be exposed to the media) is five times, then the number of probabilities is twenty-five (e.g., 52=25) due to the capacity of mathematics associated with the UA/FD process modeling.
If the number of probabilities is not more than the threshold amount at block 406, then the example process selector 302 selects the example UA/FD process 2 102 (
Turning now to
At block 502 of
At block 504 of
At block 506 of
An example manner of implementing the example unique audience sub-process 1.1 is described below in connection with
If the universe population is more than the first threshold, unique audience sub-process 1.1 can be simplified. That is, when the universe is sufficiently large the unique audience sub-process 1.1 is simplified by taking the limit of the unique audience sub-process 1.1 as U goes to infinity, as further described below.
If the example audience data calculator 304 determines at block 506 that the universe population is more than the threshold size, control advances to block 510. At block 510, the example audience data calculator 304 determines if there is a difference between the number of logged impressions and audience size less than a second threshold. The second threshold size may be based on a user and/or manufacturer preference. For example, the threshold size may be based on a desired precision of the results (e.g., the higher the threshold the more precise the results). If at block 510 the difference between the number of logged impressions (R and T) and the audience sizes (A and X) is less than the second threshold, the example audience data calculator 304 determines a unique audience (X) based on an example second unique audience sub-process (e.g., herein referred to as unique audience sub-process 1.2) (block 512). An example manner of implementing the example unique audience sub-process 1.2 is described below in connection with
Unique audience sub-process 1.2 is based on a mathematical rearrangement of unique audience sub-process 1.1 since the universe population is large
Using the unique audience sub-process 1.2, the audience data calculator 304 determines the unique audience (X) to be 270
If at block 510 the difference between the number of logged impressions (R and T) and the audience sizes (A and X) is not less than the second threshold, the example audience data calculator 304 determines a unique audience (X) based on a third unique audience sub-process herein referred to as unique audience sub-process 1.3) (block 514). An example manner of implementing the example unique audience sub-process 1.3 is described below in connection with
Unique audience sub-process 1.3 is based on a mathematical rearrangement of unique audience sub-process 1.2 since the difference between the logged impressions and the audience size is large
when R>>A and T>>X). Using unique audience sub-process 1.3, the audience data calculator 304 determines the unique audience (X) to be 258
At block 516 of
where kϵ{1, 2, . . . , ∞}
In Equations 10 and 11 above, k is the number of impressions per person. Using the above example (e.g., X=270 and T=1000), the example impression data calculator 306 determines that 27% of the unique audience (e.g., 27%×270=72.9 people) were exposed to the example media 100 once, Thus, each of those 72.9 users is associated with only one corresponding logged impression, (e.g., ((1−0.27)1-1)*0.27=0.27), and 19.7% of the unique audience (e.g., 19%×270=51.3 people) was exposed to the example media 100 twice. Thus, each of those 51.3 users is associated with two corresponding logged impressions (e.g., ((1−0.27)2-1)*0.27=0.197), etc.
At block 518 of
Turning now to
The example instructions represented by the flowchart of
At block 520 of
Turning now to
The example instructions of
At block 528 of
Turning now to
The example instructions of
At block 536 of
At block 540, the audience data calculator 304 determines a total unique audience (X) based on a square root of the quotient. As shown in Equation 17 below.
At block 602 of
At block 604 of
At block 606 of
At block 608 of
At block 610 of
maximize Q,H=−Σk=0∞qk log(qk), Equation 18
subject to CQQ=DQ.
Where CQ is the constraint matrix, DQ is the constraint vector, and Q is the impression characteristics. As disclosed above, the impressions characteristics (Q) include probabilities representing a number or people associated with zero impressions corresponding to website A and/or website B, one impression from website A and/or website B, two impressions from website A and/or website B, and/or any combination thereof. Example phases are shown in
Returning to
At block 614 of
In Equation 19 above, CP is the population constraint matrix based on the sub-matrices associated with the impression constraints (e.g., the example sub-matrices 804, 812, 814 of
At block 616 of
At block 618 of
At block 702 of
At block 704 of
At block 706 of
At block 708 of
Where variable zi is representative of the ith website.
The following formula is used for the audiences:
For example, the value associated with impression combination I2 in the example combination table is represented by:
Additionally, the value associated with impression combination I3 in the example combination table is represented by:
Since a sum of the impression combinations I2, I3 (e.g., I2+I3) represents all the logged database proprietor demographic impressions accessed via website A, then I2+I3=200/1000 (e.g., the logged database proprietor demographic impressions accessed via website A/the universal population). Additional equations can be determined using a similar manner (e.g., I1+I4=300/1000, etc.) The result is N equations (e.g., one for each constraint) with N unknowns (e.g. the z variables). A third phase of
A fourth phase of
Additionally, the value associated with audience combination A3 in the example combination table shown at time t2 904 is represented by:
Since the sum of the audience combinations A2, A3 (e.g., A2+A3) represents the audience exposed to the media 100 accessed via website A, then A2+A3=300/1000 (e.g., the logged impressions associated with the population audience of website A/the universal population). Additional equations can be determined using a similar manner (e.g., A1+A4=500/1000, etc.) This particular mathematical problem creates three equations with three unknowns (e.g., z1(X), z2(X), ž⋅). The example audience data calculator 304 (
At block 716 of
X1=UA((i)+(ii))=1000(0.08654+0.122699)=209.24 (e.g., unique audience of website A)
X2=UA((iii)+(iv))=1000(0.125720+0.122699)=248.42 (e.g., unique audience of website B)
X3=UA((v)+(vi)+(vii))=1000(0.08654+0.125720+0.122699)=334.96 (e.g., total unique audience)
At block 718 of
The processor platform 1000 of the illustrated example includes a processor 1012. The processor 1012 of the illustrated example is hardware. For example, the processor 1012 can be implemented by integrated circuits, logic circuits, microprocessors or controllers from any desired family or manufacturer.
The processor 1012 of the illustrated example includes the example memory 212 (e.g., a cache). The example processor 1012 of
The processor platform 1000 of the illustrated example also includes an interface circuit 1020. The interface circuit 1020 may be implemented by any type of interface standard, such as an Ethernet interface, a universal serial bus (USB), and/or a PCI express interface.
In the illustrated example, one or more input devices 1022 are connected to the interface circuit 1020. The input device(s) 1022 permit(s) a user to enter data and commands into the processor 1012. The input device(s) can be implemented by, for example, a sensor, a microphone, a camera (still or video), a keyboard, a button, a mouse, a touchscreen, a track-pad, a trackball, isopoint and/or a voice recognition system.
One or more output devices 1024 are also connected to the interface circuit 1020 of the illustrated example. The output devices 1024 can be implemented, for example, by display devices (e.g., a light emitting diode (LED), an organic light emitting diode (OLED), a liquid crystal display, a cathode ray tube display (CRT), a touchscreen, a tactile output device, and/or speakers). The interface circuit 1020 of the illustrated example, thus, typically includes a graphics driver card, a graphics driver chip or a graphics driver processor.
The interface circuit 1020 of the illustrated example also includes a communication device such as a transmitter, a receiver, a transceiver, a modem and/or network interface card to facilitate exchange of data with external machines (e.g., computing devices of any kind) via a network 1026 (e.g., an Ethernet connection, a digital subscriber line (DSL), a telephone line, coaxial cable, a cellular telephone system, etc.).
The processor platform 1000 of the illustrated example also includes one or more mass storage devices 1028 for storing software and/or data. Examples of such mass storage devices 1028 include floppy disk drives, hard drive disks, compact disk drives, Blu-ray disk drives, RAID systems, and digital versatile disk (DVD) drives.
The coded instructions 1032 of
From the foregoing, it will be appreciated that the above disclosed examples may be used to select a UA/FD process that based on both processor resources and memory resources to provide estimations of unique audiences (e.g., a unique audience for a particular website and/or a total unique audience) that is more accurate than prior techniques by using counts of logged impressions and a number of unique people identified by a database proprietor (e.g., a partial audience). Using examples disclosed herein, unique audiences can be determined in a faster and more accurate manner with less required memory resources.
Prior techniques for determining a unique audience for a media presentation include simple scaling or negative binomial distribution. However, simple scaling is inaccurate, negative binomial distribution includes computations requiring a large amount of processor resources, and both simple scaling and negative binomial distribution cannot determine a unique audiences for particular websites and a total unique audience. Examples disclosed herein alleviate such problems by selecting one of three UA/FD processes based on available processor resources and memory resources to determine a unique audience and/or a frequency distribution of impressions for a media presentation. Additionally, two of the three UA/FD processes disclosed herein determine a unique audience based on instructions to measure media exposure accessed via more than one website leading to more accurate estimations.
Example methods are disclosed to determine a unique audience exposed to media. Such example methods include receiving impression requests at a server from a plurality of client devices via a network. Such example methods include, based on the impression requests, logging, with the server, a plurality of impressions corresponding to the media accessed at the client devices; obtaining a number of demographic impressions logged by a database proprietor; obtaining a number of registered users of the database proprietor exposed to the media; multiplying, by executing an instruction with a processor, a number of the plurality of impressions by a square of the number of the registered users to generate a product; dividing, by executing an instruction with the processor, the product by the number of the demographic impressions to generate a quotient; and determining, by executing an instruction with the processor, the unique audience based on a square root of the quotient.
In some example methods, the registered users of the database proprietor exposed to the media correspond to the demographic impressions and to at least some of the plurality of impressions. In some examples, the crediting of the media is associated with the plurality of impressions based on the unique audience. In some examples, a frequency distribution is determined for the plurality of impressions based on the unique audience by: dividing the unique audience by the number of the plurality of impressions to determine a second quotient; and calculating a geometric distribution based on the second quotient.
Example methods are disclosed to determine a unique audience exposed to media. Such example methods include receiving impression requests at a server from a plurality of client devices via a network. Such example methods include, based on the impression requests, logging, with the server, a plurality of impressions corresponding to the media accessed at the client devices; obtaining a first number of first demographic impressions corresponding to the media accessed via a first website and logged by a first database proprietor, the first demographic impressions corresponding to first registered users of the first database proprietor; obtaining a second number of second demographic impressions corresponding to the media accessed via a second website and identified by a second database proprietor, the second demographic impressions corresponding to second registered users of the second database proprietor; obtaining a first number of the first registered users exposed to the media; obtaining a second number of the second registered users exposed to the media; generating, by executing an instruction with a processor, a constraint matrix and a constraint vector based on the first number of the first demographic impressions and the second number of the second demographic impressions, the constraint vector representative of a plurality of ratios of constraints to a number of the plurality of impressions; determining, by executing an instruction with the processor, audience characteristics based on the constraint matrix and the constraint vector; and determining, by executing an instruction with the processor, a first unique audience exposed to the media via the first website, a second unique audience exposed to the media via the second website, and a total unique audience exposed to the media via the first and second websites based on the audience characteristics.
In some example methods, the total unique audience is a count of unique audience members across the first unique audience and the second unique audience. In some example methods, the constraint matrix is a first constraint matrix and the constraint vector is a first constraint vector, and further including determining the first unique audience, the second unique audience, and the total unique audience by: generating the first constraint matrix and the first constraint vector based on the first number of the first demographic impressions, the first number of the first registered users, the second number of the second demographic impressions, and the second number of the second registered users; determining the impression characteristics associated with the first constraint matrix and the first constraint vector, the impression characteristics including maximized values based on the first constraint matrix and the first constraint vector; and generating a second constraint matrix and a second constraint vector based on the impression characteristics, the audience characteristics including maximized values based on the second constraint matrix and the second constraint vector, the first unique audience, the second unique audience, and the total unique audience being based on the audience characteristics.
In some example methods, a frequency distribution is determined based on the impression characteristics. In some example methods, the impression characteristics include probabilities representing likelihoods of different numbers of people exposed to the media via at least one of the first website or the second website. In some example methods, the audience characteristics include probabilities representing likelihoods of different sizes of unique audiences corresponding to at least one of the first website or the second website. In some example methods, the first database proprietor is the second database proprietor. In some example methods, a report is generated indicating at least one of the first unique audience, the second unique audience, or the total unique audience.
In some example methods, the media associated with the plurality of impressions is credited based on at least one of the first unique audience, the second unique audience, or the total unique audience. In some example methods, the constraint matrix includes constraints represented in sub-matrices, the constraints including at least one of a first size of a first audience exposed to the media via the first website, a second size of a second audience exposed to the media via the second website, a third size of a total audience exposed to the media via the first and second websites, a fourth size of a universe audience via any website, a first count of first impressions corresponding to the media accessed via the first website, and a second count of second impressions corresponding to the media accessed via the second website.
Example methods are disclosed to determine a unique audience exposed to media. Such example methods include receiving impression requests at a server from a plurality of client devices via a network. Such examples include, based on the impression requests, logging, with the server, a plurality of impressions corresponding to the media accessed at the client devices; obtaining a first number of first demographic impressions corresponding to the media accessed via a first website and logged by a first database proprietor, the first demographic impressions corresponding to first registered users of the first database proprietor; obtaining a second number of second demographic impressions corresponding to the media accessed via a second website and logged by a second database proprietor, the second demographic impressions corresponding to second registered users of the second database proprietor; obtaining a first number of the first registered users exposed to the media, obtaining a second number of the second registered users exposed to the media, determining, by executing an instruction with a processor, a first enumeration table, the first enumeration table including first values based on a system of non-linear equations associated with the first number of the first demographic impressions, the first number of the first registered users, the second number of the second demographic impressions and the second number of the second registered users; determining, by executing an instruction with the processor, a second enumeration table including second values based on a second system of non-linear equations associated with the first values calculated in the first enumeration table and the plurality of impressions; and determining, by executing an instruction with the processor, a first unique audience of the media accessed via the first website, a second unique audience of the media accessed via the second website, and a total unique audience of the media accessed via the first and second websites using second expressions solved based on the second values in the second enumeration table.
In some example methods, the first enumeration table is an estimated distribution of the plurality of impressions across the first registered users and the second registered users. In some example methods, the second enumeration table is an estimated distribution of the plurality of impressions across the total unique audience. In some example methods, a frequency distribution of impressions for the total unique audience is determined based on the second enumeration table.
In some example methods, the second enumeration table is based on population constraints. In such example methods, the population constraints are representative of: a third number of the first demographic impressions corresponding to the media accessed via the first website, the third number of the first demographic impressions including the first number of the first demographic impressions; and a fourth number of the second demographic impressions corresponding to the media accessed via the second website, the fourth number of the second demographic impressions including the second number of the second demographic impressions.
In some examples, the first database proprietor is the second database proprietor. In some examples, a report is generated indicating at least one of the first unique audience, the second unique audience, or the total unique audience. In some examples, the media associated with the plurality of impressions is credited based on at least one of the first unique audience, the second unique audience, or the total unique audience.
Although certain example methods, apparatus and articles of manufacture have been described herein, the scope of coverage of this patent is not limited thereto. On the contrary, this patent covers all methods, apparatus and articles of manufacture fairly falling within the scope of the claims of this patent.
This Patent arises from a continuation of U.S. patent application Ser. No. 16/741,487 (Now U.S. Pat. No. 10,979,324), filed on Jan. 13, 2020, entitled “METHODS AND APPARATUS FOR ESTIMATING TOTAL UNIQUE AUDIENCES,” which is a continuation of U.S. patent application Ser. No. 16/388,666 (Now U.S. Pat. No. 10,536,358), filed on Apr. 18, 2019, entitled “METHODS AND APPARATUS FOR ESTIMATING TOTAL UNIQUE AUDIENCES,” which is a continuation of U.S. patent application Ser. No. 15/008,220 (Now U.S. Pat. No. 10,270,673), filed on Jan. 27, 2016, entitled “METHODS AND APPARATUS FOR INCREASING THE ROBUSTNESS OF MEDIA SIGNATURES.” Priority to U.S. patent application Ser. No. 16/741,487, U.S. patent application Ser. No. 16/388,666, and U.S. patent application Ser. No. 15/008,220 is claimed. U.S. patent application Ser. No. 16/741,487, U.S. patent application Ser. No. 16/388,666, and U.S. patent application Ser. No. 15/008,220 are incorporated herein by reference in their entireties.
Number | Name | Date | Kind |
---|---|---|---|
4621325 | Naftzger et al. | Nov 1986 | A |
5113518 | Durst, Jr. et al. | May 1992 | A |
5214780 | Ingoglia et al. | May 1993 | A |
5675510 | Coffey et al. | Oct 1997 | A |
5793409 | Tetsumura | Aug 1998 | A |
5796952 | Davis et al. | Aug 1998 | A |
5832520 | Miller | Nov 1998 | A |
5848396 | Gerace | Dec 1998 | A |
5870740 | Rose et al. | Feb 1999 | A |
6035339 | Agraharam et al. | Mar 2000 | A |
6052730 | Felciano et al. | Apr 2000 | A |
6108637 | Blumenau | Aug 2000 | A |
6138155 | Davis et al. | Oct 2000 | A |
6141694 | Gardner | Oct 2000 | A |
6223215 | Hunt et al. | Apr 2001 | B1 |
6279036 | Himmel et al. | Aug 2001 | B1 |
6286140 | Ivanyi | Sep 2001 | B1 |
6415323 | McCanne et al. | Jul 2002 | B1 |
6434614 | Blumenau | Aug 2002 | B1 |
6457010 | Eldering et al. | Sep 2002 | B1 |
6460079 | Blumenau | Oct 2002 | B1 |
6529952 | Blumenau | Mar 2003 | B1 |
6606657 | Zilberstein et al. | Aug 2003 | B1 |
6658410 | Sakamaki et al. | Dec 2003 | B1 |
6704787 | Umbreit | Mar 2004 | B1 |
6714917 | Eldering et al. | Mar 2004 | B1 |
6839680 | Liu et al. | Jan 2005 | B1 |
6877007 | Hentzel et al. | Apr 2005 | B1 |
7031932 | Lipsky et al. | Apr 2006 | B1 |
7039699 | Narin et al. | May 2006 | B1 |
7092926 | Cerrato | Aug 2006 | B2 |
7139723 | Conkwright et al. | Nov 2006 | B2 |
7146329 | Conkwright et al. | Dec 2006 | B2 |
7150030 | Eldering et al. | Dec 2006 | B1 |
7181412 | Fulgoni et al. | Feb 2007 | B1 |
7260837 | Abraham et al. | Aug 2007 | B2 |
7363643 | Drake et al. | Apr 2008 | B2 |
7386473 | Blumenau | Jun 2008 | B2 |
7406516 | Davis et al. | Jul 2008 | B2 |
7526538 | Wilson | Apr 2009 | B2 |
7590568 | Blumenau | Sep 2009 | B2 |
7613635 | Blumenau | Nov 2009 | B2 |
7634786 | Knee et al. | Dec 2009 | B2 |
7644156 | Blumenau | Jan 2010 | B2 |
7647418 | Ash et al. | Jan 2010 | B2 |
7650407 | Blumenau | Jan 2010 | B2 |
7653724 | Blumenau | Jan 2010 | B2 |
7716326 | Blumenau | May 2010 | B2 |
7720963 | Blumenau | May 2010 | B2 |
7720964 | Blumenau | May 2010 | B2 |
7756974 | Blumenau | Jul 2010 | B2 |
7882054 | Levitan | Feb 2011 | B2 |
7890451 | Cancel et al. | Feb 2011 | B2 |
7925694 | Harris | Apr 2011 | B2 |
7941525 | Yavilevich | May 2011 | B1 |
7949565 | Eldering et al. | May 2011 | B1 |
7949639 | Hunt et al. | May 2011 | B2 |
7958234 | Thomas et al. | Jun 2011 | B2 |
7962603 | Morimoto | Jun 2011 | B1 |
8006259 | Drake et al. | Aug 2011 | B2 |
8060601 | Brown et al. | Nov 2011 | B1 |
8087041 | Fu et al. | Dec 2011 | B2 |
8131763 | Tuscano et al. | Mar 2012 | B2 |
8131861 | Butler et al. | Mar 2012 | B2 |
8151194 | Chan et al. | Apr 2012 | B1 |
8180112 | Kurtz et al. | May 2012 | B2 |
8234408 | Jungck | Jul 2012 | B2 |
8235814 | Willis et al. | Aug 2012 | B2 |
8260663 | Ranka | Sep 2012 | B1 |
8280683 | Finkler | Oct 2012 | B2 |
8302120 | Ramaswamy | Oct 2012 | B2 |
8307006 | Hannan et al. | Nov 2012 | B2 |
8370489 | Mazumdar et al. | Feb 2013 | B2 |
8402027 | Dange et al. | Mar 2013 | B1 |
8453173 | Anderson et al. | May 2013 | B1 |
8484511 | Tidwell et al. | Jul 2013 | B2 |
8495198 | Sim et al. | Jul 2013 | B2 |
8504411 | Subasic et al. | Aug 2013 | B1 |
8504507 | Srinivasaiah | Aug 2013 | B1 |
8543454 | Fleischman et al. | Sep 2013 | B2 |
8600796 | Sterne et al. | Dec 2013 | B1 |
8631122 | Kadam et al. | Jan 2014 | B2 |
8688524 | Ramalingam et al. | Apr 2014 | B1 |
8713168 | Heffernan et al. | Apr 2014 | B2 |
8751461 | Abraham et al. | Jun 2014 | B2 |
8775332 | Morris et al. | Jul 2014 | B1 |
8831362 | Steffens | Sep 2014 | B1 |
8839291 | Anderson et al. | Sep 2014 | B1 |
8898689 | Georgakis | Nov 2014 | B2 |
8910195 | Barney et al. | Dec 2014 | B1 |
8930701 | Burbank et al. | Jan 2015 | B2 |
8973023 | Rao et al. | Mar 2015 | B1 |
8984547 | Lambert et al. | Mar 2015 | B2 |
9055122 | Grecco et al. | Jun 2015 | B2 |
9117217 | Wilson et al. | Aug 2015 | B2 |
9215288 | Seth et al. | Dec 2015 | B2 |
10270673 | Sheppard et al. | Apr 2019 | B1 |
10536358 | Sheppard et al. | Jan 2020 | B2 |
20030037131 | Verma | Feb 2003 | A1 |
20030046385 | Vincent | Mar 2003 | A1 |
20030065770 | Davis et al. | Apr 2003 | A1 |
20030105604 | Ash et al. | Jun 2003 | A1 |
20040088212 | Hill | May 2004 | A1 |
20040107125 | Guheen et al. | Jun 2004 | A1 |
20040167928 | Anderson et al. | Aug 2004 | A1 |
20050033657 | Herrington et al. | Feb 2005 | A1 |
20050223093 | Hanson et al. | Oct 2005 | A1 |
20060089754 | Mortenson | Apr 2006 | A1 |
20060294259 | Matefi et al. | Dec 2006 | A1 |
20070106787 | Blumenau | May 2007 | A1 |
20070106792 | Blumenau | May 2007 | A1 |
20070112714 | Fairweather | May 2007 | A1 |
20070156532 | Nyhan et al. | Jul 2007 | A1 |
20070198327 | Yazdani et al. | Aug 2007 | A1 |
20070260603 | Tuscano et al. | Nov 2007 | A1 |
20070271518 | Tischer et al. | Nov 2007 | A1 |
20070271580 | Tischer et al. | Nov 2007 | A1 |
20080004958 | Ralph et al. | Jan 2008 | A1 |
20080010144 | Chatwin | Jan 2008 | A1 |
20080091639 | Davis et al. | Apr 2008 | A1 |
20080126420 | Wright et al. | May 2008 | A1 |
20080201472 | Bistriceanu et al. | Aug 2008 | A1 |
20080222201 | Chen et al. | Sep 2008 | A1 |
20080235243 | Lee et al. | Sep 2008 | A1 |
20080276179 | Borenstein et al. | Nov 2008 | A1 |
20080300965 | Doe | Dec 2008 | A1 |
20090030780 | York et al. | Jan 2009 | A1 |
20090055241 | Chen et al. | Feb 2009 | A1 |
20090070443 | Vanderhook et al. | Mar 2009 | A1 |
20090070797 | Ramaswamy et al. | Mar 2009 | A1 |
20090076899 | Gbodimowo | Mar 2009 | A1 |
20090292587 | Fitzgerald | Nov 2009 | A1 |
20090327026 | Bistriceanu et al. | Dec 2009 | A1 |
20100004977 | Marci et al. | Jan 2010 | A1 |
20100076814 | Manning | Mar 2010 | A1 |
20100088152 | Bennett | Apr 2010 | A1 |
20100088373 | Pinkham | Apr 2010 | A1 |
20100070621 | Urdan et al. | May 2010 | A1 |
20100121676 | Jackson | May 2010 | A1 |
20100153175 | Pearson et al. | Jun 2010 | A1 |
20100161385 | Karypis et al. | Jun 2010 | A1 |
20100205057 | Hook et al. | Aug 2010 | A1 |
20100262498 | Nolet et al. | Oct 2010 | A1 |
20100268540 | Arshi et al. | Oct 2010 | A1 |
20100268573 | Jain et al. | Oct 2010 | A1 |
20100281178 | Sullivan | Nov 2010 | A1 |
20100299604 | Blumenau | Nov 2010 | A1 |
20100312854 | Hyman | Dec 2010 | A1 |
20100313009 | Combet et al. | Dec 2010 | A1 |
20110016482 | Tidwell et al. | Jan 2011 | A1 |
20110087519 | Fordyce, III et al. | Apr 2011 | A1 |
20110131596 | Amsterdam et al. | Jun 2011 | A1 |
20110137733 | Baird et al. | Jun 2011 | A1 |
20110157475 | Wright et al. | Jun 2011 | A1 |
20110191664 | Sheleheda et al. | Aug 2011 | A1 |
20110191831 | Chan et al. | Aug 2011 | A1 |
20110196735 | von Sydow et al. | Aug 2011 | A1 |
20110202500 | Warn et al. | Aug 2011 | A1 |
20110231240 | Schoen et al. | Sep 2011 | A1 |
20110246297 | Buchalter et al. | Oct 2011 | A1 |
20110246641 | Pugh et al. | Oct 2011 | A1 |
20110282730 | Tarmas | Nov 2011 | A1 |
20120005213 | Hannan et al. | Jan 2012 | A1 |
20120030037 | Carriero | Feb 2012 | A1 |
20120072469 | Perez et al. | Mar 2012 | A1 |
20120109709 | Fordyce, III et al. | May 2012 | A1 |
20120109882 | Bouse et al. | May 2012 | A1 |
20120110027 | Falcon | May 2012 | A1 |
20120110071 | Zhou et al. | May 2012 | A1 |
20120151322 | Lindsay et al. | Jun 2012 | A1 |
20120158490 | Neumeyer et al. | Jun 2012 | A1 |
20120158954 | Heffernan et al. | Jun 2012 | A1 |
20120166520 | Lindsay et al. | Jun 2012 | A1 |
20120173701 | Tenbrock | Jul 2012 | A1 |
20120185274 | Hu | Jul 2012 | A1 |
20120206331 | Gandhi | Aug 2012 | A1 |
20120215621 | Heffernan et al. | Aug 2012 | A1 |
20120239809 | Mazumdar et al. | Sep 2012 | A1 |
20120254466 | Jungck | Oct 2012 | A1 |
20120265606 | Patnode | Oct 2012 | A1 |
20120302222 | Williamson et al. | Nov 2012 | A1 |
20130007794 | Besehanic et al. | Jan 2013 | A1 |
20130013308 | Cao et al. | Jan 2013 | A1 |
20130014144 | Bhatia et al. | Jan 2013 | A1 |
20130019262 | Bhatia et al. | Jan 2013 | A1 |
20130046615 | Liyanage | Feb 2013 | A1 |
20130066713 | Umeda | Mar 2013 | A1 |
20130097311 | Mazumdar et al. | Apr 2013 | A1 |
20130097312 | Mazumdar et al. | Apr 2013 | A1 |
20130117103 | Shimizu et al. | May 2013 | A1 |
20130138506 | Zhu et al. | May 2013 | A1 |
20130145022 | Srivastava | Jun 2013 | A1 |
20130204694 | Banister et al. | Aug 2013 | A1 |
20130212188 | Duterque et al. | Aug 2013 | A1 |
20130246609 | Topchy et al. | Sep 2013 | A1 |
20130282898 | Kalus et al. | Oct 2013 | A1 |
20130290070 | Abraham et al. | Oct 2013 | A1 |
20130325588 | Kalyanam et al. | Dec 2013 | A1 |
20130331971 | Bida et al. | Dec 2013 | A1 |
20130332604 | Seth et al. | Dec 2013 | A1 |
20140075004 | Van Dusen et al. | Mar 2014 | A1 |
20140082660 | Zhang et al. | Mar 2014 | A1 |
20140108130 | Vos et al. | Apr 2014 | A1 |
20140122703 | Pugh et al. | May 2014 | A1 |
20140156761 | Heffernan et al. | Jun 2014 | A1 |
20140236705 | Shao | Aug 2014 | A1 |
20140279074 | Chen et al. | Sep 2014 | A1 |
20140298025 | Burbank et al. | Oct 2014 | A1 |
20150186403 | Srivastava et al. | Jul 2015 | A1 |
20150189500 | Bosworth et al. | Jul 2015 | A1 |
20150193816 | Toupet et al. | Jul 2015 | A1 |
20150262207 | Rao et al. | Sep 2015 | A1 |
20200228427 | Sheppard et al. | Jul 2020 | A1 |
Number | Date | Country |
---|---|---|
2013205736 | May 2013 | AU |
101077014 | Nov 2007 | CN |
101222348 | Jul 2008 | CN |
101505247 | Aug 2009 | CN |
104520839 | Apr 2015 | CN |
104541513 | Apr 2015 | CN |
104584564 | Apr 2015 | CN |
2176639 | Dec 1986 | GB |
H07262167 | Oct 1995 | JP |
2001282982 | Oct 2001 | JP |
2001357192 | Dec 2001 | JP |
2002163562 | Jun 2002 | JP |
2002373152 | Dec 2002 | JP |
2004504674 | Feb 2004 | JP |
2006127320 | May 2006 | JP |
2006127321 | May 2006 | JP |
2010039845 | Feb 2010 | JP |
2010257448 | Nov 2010 | JP |
20020037980 | May 2002 | KR |
20090020558 | Feb 2009 | KR |
20100094021 | Aug 2010 | KR |
20110017895 | Feb 2011 | KR |
20110023293 | Mar 2011 | KR |
20120091411 | Aug 2012 | KR |
20120123148 | Nov 2012 | KR |
9617467 | Jun 1996 | WO |
9628904 | Sep 1996 | WO |
9641495 | Dec 1996 | WO |
2000041115 | Jul 2000 | WO |
200207054 | Jan 2002 | WO |
2003027860 | Apr 2003 | WO |
2005013072 | Feb 2005 | WO |
2005024689 | Mar 2005 | WO |
2008150575 | Dec 2008 | WO |
2010088372 | Aug 2010 | WO |
2010104285 | Sep 2010 | WO |
2011097624 | Aug 2011 | WO |
2012019643 | Feb 2012 | WO |
2012040371 | Mar 2012 | WO |
2012087954 | Jun 2012 | WO |
2012128895 | Sep 2012 | WO |
2012170902 | Dec 2012 | WO |
2012177866 | Dec 2012 | WO |
2013122907 | Aug 2013 | WO |
2013188429 | Dec 2013 | WO |
2014059319 | Apr 2014 | WO |
2014176343 | Oct 2014 | WO |
2014179218 | Nov 2014 | WO |
2014182764 | Nov 2014 | WO |
2015005957 | Jan 2015 | WO |
2015023589 | Feb 2015 | WO |
Entry |
---|
United States Patent and Trademark Office, “Non-Final Office Action”, issued in connection with U.S. Appl. No. 15/008,220, dated Jan. 25, 2018, 24 pages. |
United States Patent and Trademark Office, “Non-Final Office Action”, issued in connection with U.S. Appl. No. 15/008,220 dated Jul. 23, 2018, 10 pages. |
United States Patent and Trademark Office, “Notice of Allowance and Fee(s) Due”, issued in connection with U.S. Appl. No. 15/008,220 dated Nov. 29, 2018, 5 pages. |
United States Patent and Trademark Office, “Non-Final Office Action”, issued in connection with U.S. Appl. No. 16/388,666 dated Aug. 7, 2019, 6 pages. |
United States Patent and Trademark Office, “Notice of Allowance and Fee(s) Due”, issued in connection with U.S. Appl. No. 16/388,666 dated Aug. 21, 2019, 7 pages. |
United States Patent and Trademark Office, “Non-Final Office Action”, issued in connection with U.S. Appl. No. 16/741,487, dated Aug. 20, 2020, 4 pages. |
United States Patent and Trademark Office, “Notice of Allowance and Fee(s) Due”, issued in connection with U.S. Appl. No. 16/741,487, dated Dec. 8, 2020, 5 pages. |
Number | Date | Country | |
---|---|---|---|
20210234776 A1 | Jul 2021 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 16741487 | Jan 2020 | US |
Child | 17224842 | US | |
Parent | 16388666 | Apr 2019 | US |
Child | 16741487 | US | |
Parent | 15008220 | Jan 2016 | US |
Child | 16388666 | US |