This disclosure relates generally to media audience measurement, and, more particularly, to methods and apparatus to estimate population reach from different marginal rating unions.
Determining a size and demographics of an audience of a media presentation helps media providers and distributors schedule programming and determine a price for advertising presented during the programming. In addition, accurate estimates of audience demographics enable advertisers to target advertisements to certain types and sizes of audiences. To collect these demographics, an audience measurement entity enlists a group of media consumers (often called panelists) to cooperate in an audience measurement study (often called a panel) for a predefined length of time. In some examples, the audience measurement entity obtains (e.g., directly, or indirectly from a media service provider) return path data (e.g., census data representative of a population of users) from media presentation devices (e.g., set-top boxes) that identifies tuning data from the media presentation device. In such examples, because the return path data may not be associated with a known panelist, the audience measurement entity models and/or assigns viewers to represent the return path data. Additionally, the media consumption habits and demographic data associated with the enlisted media consumers are collected and used to statistically determine the size and demographics of the entire audience of the media presentation. In some examples, this collected data (e.g., data collected via measurement devices) may be supplemented with survey information, for example, recorded manually by the presentation audience members.
In general, the same reference numbers will be used throughout the drawing(s) and accompanying written description to refer to the same or like parts.
Descriptors “first,” “second,” “third,” etc. are used herein when identifying multiple elements or components which may be referred to separately. Unless otherwise specified or understood based on their context of use, such descriptors are not intended to impute any meaning of priority, physical order or arrangement in a list, or ordering in time but are merely used as labels for referring to multiple elements or components separately for ease of understanding the disclosed examples. In some examples, the descriptor “first” may be used to refer to an element in the detailed description, while the same element may be referred to in a claim with a different descriptor such as “second” or “third.” In such instances, it should be understood that such descriptors are used merely for ease of referencing multiple elements or components.
Audience measurement entities seek to understand the composition and size of audiences of media, such as television programming. Such information allows audience measurement entity researchers to, for example, report advertising delivery and/or targeting statistics to advertisers that target their media (e.g., advertisements) to particular audiences. Additionally, such information helps to establish advertising prices commensurate with audience exposure and demographic makeup (referred to herein collectively as “audience configuration”). One way to gather media presentation information is to gather the media presentation information from media output devices (e.g., gathering television presentation data from a set-top box (STB) connected to a television). As used herein, media presentation includes media output by a media device regardless of whether or not an audience member is present (e.g., media output by a media output device at which no audience is present, media exposure to an audience member(s), etc.).
A media presentation device (e.g., STB) provided by a service provider (e.g., a cable television service provider, a satellite television service provider, an over-the-top (OTT) service provider, a music service provider, a movie service provider, a streaming media provider, etc.) or purchased by a consumer may contain processing capabilities to monitor, store, and transmit tuning data (e.g., which television channels are tuned by the media presentation device at a particular time) back to the service provider, which can then aggregate and provide such return path data to an audience measurement entity (e.g., The Nielsen Company (US), LLC.) to analyze media presentation activity. Data transmitted from a media presentation device back to the service provider is referred to herein as return path data which may include census data. Return path data includes tuning data. Tuning data is based on data received from the media presentation device while the media presentation device is on (e.g., powered on, switched on, and/or tuned to a media channel, streaming, etc.). Although return path data includes tuning data, return path data may not include data related to the user viewing the media corresponding to the media presentation device. Accordingly, return path data may not be able to be associated with specific viewers, demographics, locations, etc. However, census data may be derived or extracted from return path data. Census data is indicative of the total percentage of a population of users (e.g., based on the return path data) that was exposed to media at a particular margin. For example, if 20% of a population was exposed to a first margin (e.g., a first 15 minute segment) of a television show, the census data may be indicative the 20% exposure.
To determine aspects of media presentation data (e.g., which household member is currently consuming a particular media and the demographics of that household member), market researchers may perform audience measurement by enlisting a subset of the media consumers as panelists. Panelists or monitored panelists are audience members (e.g., household members, users, panelists, etc.) enlisted to be monitored, who divulge and/or otherwise share their media activity and/or demographic data to facilitate a market research study. An audience measurement entity typically monitors media presentation activity (e.g., viewing, listening, etc.) of the monitored panelists via audience measurement system(s), such as a metering device(s) and/or a local people meter (LPM). Audience measurement typically includes determining the identity of the media being presented on a media output device (e.g., a television, a radio, a computer, etc.), determining data related to the media (e.g., presentation duration data, timestamps, channel data, etc.), determining demographic information of an audience, and/or determining which members of a household are associated with (e.g., have been exposed to) a media presentation. For example, an LPM in communication with an audience measurement entity communicates audience measurement (e.g., metering) data to the audience measurement entity. As used herein, the phrase “in communication,” including variances thereof, encompasses direct communication and/or indirect communication through one or more intermediary components and does not require direct physical (e.g., wired) communication and/or constant communication, but rather additionally includes selective communication at periodic or aperiodic intervals, as well as one-time events.
In some examples, metering data (e.g., including media presentation data) collected by an LPM or other meter is stored in a memory and transmitted via a network, such as the Internet, to a datastore managed by the audience measurement entity. Typically, such metering data is combined with additional metering data collected from a group of LPMs monitoring a group of panelist households. The metering data may include, but are not limited to, a number of minutes a household media presentation device was tuned to a particular channel, a number of minutes a household media presentation device was used (e.g., consumed) by a household panelist member and/or a visitor (e.g., a presentation session), demographics of the audience (which may be statistically projected based on the panelist data), information indicative of when the media presentation device is on or off, and/or information indicative of interactions with the media presentation device (e.g., channel changes, station changes, volume changes, etc.), etc. As used herein, a channel may be a tuned frequency, selected stream, an address for media (e.g., a network address), and/or any other identifier for a source and/or carrier of media.
Examples disclosed herein receive the marginal media exposure data (e.g., different episodes of a television series, different quarter hour time slots of a television program, or a radio program, etc.) for different unions of marginal data and/or smaller unions and estimates a total population reach (e.g., a total number of deduplicated users that were exposed to media) across all of the different unions. As used herein, a margin or marginal is a subpart of media. For example, if the media corresponds to advertisement, the margins may be different websites that include the advertisement. In another example, if the media corresponds to a one-hour program, the margins may be 4 15-minute increments of the one-hour program. As used herein, a union can be made up of smaller unions (e.g., a union of smaller unions of marginals, such as a union of smaller unions of time-periods) and/or individual marginals (e.g., time-periods, such as quarter hours). For example, a first union may include a first quarter hour marginal and a second subsequent quarter hour marginal, a second union may include a third, fourth, and fifth quarter hour marginal, and a third union may include the first union and the second union. As used herein, child unions or children are the marginal time intervals and/or smaller unions that make up a larger union and a parent union or a parent is a larger union that includes the child union(s) and/or children. Using the above example, the children of the first union include the first quarter hour marginal and the second quarter hour marginal and the parent of the first union is the third union.
In some examples, the audience measurement entity processes the collected and/or aggregated metering data from panelist meters and obtains (e.g., from one or more service provider) return path data for devices where a panel is not maintained. Return path data may include, for example, a total number of or a percentage of unique users (e.g., deduplicated users) from a universe of users that was exposed to media within different margins (e.g., 15 minute increments) of time. However, return path data may be missing a total number of or a percentage of unique users from a universe of users that was exposed to the media within two or more margins of time (e.g., unions). Examples disclosed herein leverage panelist data to be able to estimate unique audiences of universe (e.g., herein also referred to as population) and/or reach(es) across different unions of time. As used herein, reach is a cumulative percentage or total of a population that has been counted as a viewer of the media at least once during a specified time interval (e.g., hourly, daily, weekly, monthly, etc.).
Some traditional techniques of determining the total population reach for a union of marginal media ratings data include numerical calculations that enumerate all combinations in which someone can watch a program based on the individual marginal media ratings of making up the union. The number of probabilities to be solved by such traditional techniques is 2n number of probabilities, where n is the number of marginal probabilities provided in the marginal ratings data (e.g., the ratings for the different possible instances of media exposure) of each union. For example, if reach is to be calculated across 4 quarter hours (e.g., for an hour-long media exposure), which corresponds to four possible marginal ratings, the total number of probabilities to be solved using such a traditional technique is 16 (e.g., 24). If the reach is to be calculated across 96 quarter hours (e.g., a day-long media exposure), the total number of probabilities to be solved using such a traditional technique is 8*1028 (e.g., 296), thereby exceeding the memory limit and/or processing power of any existing computer. Examples disclosed herein alleviate such memory/processing resource problems associated with such a traditional technique by calculating the solution using the disclosed analytical process.
To estimate the total population reach of a union (e.g., population union reach), examples disclosed herein develop a tree graph association or tree association for the margins and union(s). The tree graph association corresponds to the structure of the margins and/or unions where each margin and each union corresponds to a node. Examples disclosed herein tag each node as a descendant (e.g., child, grandchild, etc.) and/or ancestor (e.g., parent, grandparent, etc.) depending on the structure of the unions corresponding to stored panel data. For example, if panel data includes a unique audience number or percentage corresponding to three margins (e.g., A, B, and C) and the panel data further includes a unique audience number or percentage corresponding to a first union (e.g., AB) and a second union (e.g., ABC), examples disclosed herein may tag (1) nodes A and B as having parent AB and grandparent ABC, (2) node C as having parent ABC, (3) node AB as having children A and B and parent ABC, and (4) node ABC as having children AB and C and grandchildren A and B. Additionally, examples disclosed herein may tag the margin nodes (e.g., A, B, and C) as leaves and tag ABCD as a root. As used herein, a leaf is a node that does not have children and a root is a node that has no parents.
As described above, the census data (e.g., corresponding to the population or total audience) may include unique audience data related to particular margins, but may not include unique audience data for unions of the margins. Accordingly, once the tree structure association is complete, examples disclosed herein utilize the panel data and corresponding tree structure association to solve for unique audience(s) (e.g., reach(es)) of unions of the population based on the census data. The unique audience of a particular union corresponding to a node of the tree structure association can be determined by taking the difference between the total audience and the total audience of the union that was removed from the tree structure association (e.g., the statistical rule of subtraction), as shown the below-Equation 1. Examples disclosed herein leverage Equation 1 to determine unique audiences for a population union.
AUD(X)=AUD(T)−A(T\X) (Equation 1)
In Equation 1 above, AUD(X) represents the unique audience of union or margin X (e.g., the audience which belonged to at least one of the margins corresponding to union X), AUD(T) represents the total audience, and AUD(T\X) represents the total audience excluding the audience members of union X. In some examples, the total audience AUD(T) may be known (e.g., for panelist data). The total audience AUD(T) corresponds to the below-Equation 2.
AUD(T)=z0zNsN (Equation 2)
In Equation 2 above, z0 corresponds to a normalize constraint, N corresponds to the node of the total audience, zi=eλ
As used herein, node height refers to the level or hierarchy level of the tree structure. For example, all the leaf nodes (e.g., margins) correspond to the lowest height and the root node (e.g., the union corresponding to all leaf nodes of a tree association structure) corresponds to the highest height. As the calculations of si depend on its children, if any, examples disclosed herein utilize parallel commercial solvers to solve for si in parallel (e.g., independently) at each height to be combined at a later height.
As described above AUD(T\X) corresponds to the audience of T when the audience of the union and/or margin X is removed from the tree structure, so the new graph only contains people who visited anything other than X. Because removing a union and/or margin X only affects the ancestors of X, AUD(T\X) can be determined based on the below-Equation 4. Utilizing Equation 4 reduces the number of computations for tree structure associations (e.g., the larger the number of nodes in the tree structure association, the larger the reduction of computations), because updating the ancestors to compute AUD(T\X) (e.g., using the below Equation 4) can be done in parallel, once for each node. Although new tree associations can be calculated for each node removal graph to determine the total audience of Equation 3, Equation 4 utilizes the property that the only si variables which are impacted by node deletion are the ancestors of that node.
AUD(T\X)=z0zNsN(new) (Equation 4)
In Equation 4 above, sN(new) is defined recursively by node height (e.g., margins being nodes at the lowest height, unions of margins being at the next height, etc.) when the X union and/or margin is removed. sN(new) can be determined using the Equation 5, as shown below.
In the preceding equation, kj is the node representing the jth ancestor of X (e.g., k0 is X, k1 is the parent of X, . . . , kJ is node N, the root node). As the calculations of skj depend on its children, if any, examples disclosed herein, when solving for skj can utilize parallel commercial solvers to solve for skj in parallel (e.g., independently) at each height to be combined at a later height.
Equation 1 (e.g., corresponding to Equations 2-5) results in a system of equations with variable Lagrange multipliers (λs). Examples disclosed herein determine the unique audience of one or more unions based on census data corresponding to the margins and panelist data corresponding to the margins and the unions by performing three tasks corresponding to Equation 1. The first task includes solving a system of Equations corresponding to the below Equations 6 and 7 with respect to the panelist data.
AUG(T)−AUD(T\{i})=Ai for all nodes of the tree structure association i={1, . . . ,N} (Equation 6)
z0+AUG(T)=100% (Equation 7)
In Equation 7 above, z0 represents the total audience (e.g., panelist audience) count that was not exposed to the media. Equations 6 and 7 correspond to a system of equations with λs as solutions to the system. Estimates of AUD(T\{i}) (corresponding to the above Equations 4 and 5) can be determined using commercial solvers in parallel. The commercial solvers estimating values for the λs of AUD(T\{i}) and iteratively adjusting such that until all the constraints of Equations 6 and 7 are met. Because AUG(T) is known in panelist data, it follows that λ0=ln(1−AN), where AN is AUD(T). Although it is assumed all quantities of audiences in the above Equations are expressed in percentages of the Universe Estimate (e.g., Equation 7 illustrates that for total population), Equations 1-7 may be used in conjunction with audience totals by slightly adjusting the Equations.
Once the first task is complete, examples disclosed herein perform a second task in which a first subset of the λs (e.g., λM+1-λN) corresponding to unions of the tree structure association are kept (e.g., the λs corresponding to unknowns in the census data) and a second subset of the λs (e.g., λo-λM) corresponding to margins of the tree structure association are discarded (e.g., the λs corresponding to knowns in the census data). However, different subsets of the λs may be kept and/or discarded based on the census data. For example, if the census data includes data related to a first margin, a second margin, a third margin, and a first union and is missing data related to a fourth margin and a second union, the subset of λs that are kept will correspond to the fourth margin and the second union and the subset of λs that are discarded will correspond to the first margin, the second margin, the third margin and the first union. In response to discarding the first set of λs, examples disclosed herein solve the system corresponding to Equations 6 and 7 using the census data, the unknown Lagrange multipliers (e.g., λo-λM), and the known kept Lagrange multipliers of the first task (e.g., λM+1-λN), thereby resulting in a set of second task-based Lagrange multipliers λŝ based on both the panelist data and the census data. As used herein the hat notation ({circumflex over ( )}) corresponds to the solutions to systems of equations based on the second task (e.g., including panelist and census data). Once the second task is complete, examples disclosed herein perform a third task to solve for the missing population unions using the below Equation 8.
AUD(G)−(AUD(G\{j})=Âj where j={M+1, . . . N} (Equation 8)
The example media provider 104 of
When the example media presentation device 106 of
By way of example, the example media presentation device 106 may be tuned to channel 5. In such an example, the media presentation device 106 outputs media (from the example media provider 104) corresponding to the tuned channel 5. The media presentation device 106 may gather tuning data corresponding to which channels, stations, websites, etc., that the example media presentation device 106 was tuned. The example media presentation device 106 generates and transmits the example return path data 100 (e.g., census data corresponding to the total population of users) to the example media provider 104. The example return path data 100 includes the tuning data and/or data corresponding to the example media provider 104. Although the illustrated example of
The example media output device 110 of
In some examples, the example LPM 112 of
The example return path data 100 of
The example return path data audience storage 116 of the example AME 114 of
The example union population reach determiner 120 of
Once the example union population reach determiner 120 of
The example marginal and union data 200 of
For example, as shown in the marginal and union data 200 of
The example tree structure 202 of
The example tree structure conversion 204 of
The example interface(s) 300 of
The example data converter 302 of
The example association controller 304 of
The example commercial solver(s) 308 of
While an example manner of implementing the example population reach determiner 120 of
Flowcharts representative of example hardware logic, machine readable instructions, hardware implemented state machines, and/or any combination thereof for implementing the example population reach determiner 120 of
The machine readable instructions described herein may be stored in one or more of a compressed format, an encrypted format, a fragmented format, a packaged format, etc. Machine readable instructions as described herein may be stored as data (e.g., portions of instructions, code, representations of code, etc.) that may be utilized to create, manufacture, and/or produce machine executable instructions. For example, the machine readable instructions may be fragmented and stored on one or more storage devices and/or computing devices (e.g., servers). The machine readable instructions may require one or more of installation, modification, adaptation, updating, combining, supplementing, configuring, decryption, decompression, unpacking, distribution, reassignment, etc. in order to make them directly readable and/or executable by a computing device and/or other machine. For example, the machine readable instructions may be stored in multiple parts, which are individually compressed, encrypted, and stored on separate computing devices, wherein the parts when decrypted, decompressed, and combined form a set of executable instructions that implement a program such as that described herein. In another example, the machine readable instructions may be stored in a state in which they may be read by a computer, but require addition of a library (e.g., a dynamic link library (DLL)), a software development kit (SDK), an application programming interface (API), etc. in order to execute the instructions on a particular computing device or other device. In another example, the machine readable instructions may need to be configured (e.g., settings stored, data input, network addresses recorded, etc.) before the machine readable instructions and/or the corresponding program(s) can be executed in whole or in part. Thus, the disclosed machine readable instructions and/or corresponding program(s) are intended to encompass such machine readable instructions and/or program(s) regardless of the particular format or state of the machine readable instructions and/or program(s) when stored or otherwise at rest or in transit.
As mentioned above, the example processes of
“Including” and “comprising” (and all forms and tenses thereof) are used herein to be open ended terms. Thus, whenever a claim employs any form of “include” or “comprise” (e.g., comprises, includes, comprising, including, having, etc.) as a preamble or within a claim recitation of any kind, it is to be understood that additional elements, terms, etc. may be present without falling outside the scope of the corresponding claim or recitation. As used herein, when the phrase “at least” is used as the transition term in, for example, a preamble of a claim, it is open-ended in the same manner as the term “comprising” and “including” are open ended. The term “and/or” when used, for example, in a form such as A, B, and/or C refers to any combination or subset of A, B, C such as (1) A alone, (2) B alone, (3) C alone, (4) A with B, (5) A with C, (6) B with C, and (7) A with B and with C. As used herein in the context of describing structures, components, items, objects and/or things, the phrase “at least one of A and B” is intended to refer to implementations including any of (1) at least one A, (2) at least one B, and (3) at least one A and at least one B. Similarly, as used herein in the context of describing structures, components, items, objects and/or things, the phrase “at least one of A or B” is intended to refer to implementations including any of (1) at least one A, (2) at least one B, and (3) at least one A and at least one B. As used herein in the context of describing the performance or execution of processes, instructions, actions, activities and/or steps, the phrase “at least one of A and B” is intended to refer to implementations including any of (1) at least one A, (2) at least one B, and (3) at least one A and at least one B. Similarly, as used herein in the context of describing the performance or execution of processes, instructions, actions, activities and/or steps, the phrase “at least one of A or B” is intended to refer to implementations including any of (1) at least one A, (2) at least one B, and (3) at least one A and at least one B.
At block 401, the example interface(s) 300 (
At block 402, the example data converter 302 (
At block 406, the example commercial solver(s) 308 (
At block 408, the example commercial solver(s) 308 keep(s) values of a first subset of the Lagrange multipliers corresponding to the unknown census data (e.g., λ5=−2.0149, λ6=−1.19392, λ7=0.81093, corresponding to the union nodes 5, 6, and 7 of the tree structure 202 of
At block 412, the example commercial solver(s) 308 determine(s) the variable Lagrange multipliers (e.g., λ0, λ1, λ2, λ3, and λ4) based on census parameters. For example, the commercial solver(s) solve(s) for the Lagrange multipliers (e.g., λ0, λ1, λ2, λ3, and λ4) using the system of equations corresponding to Equations 6 and 7 based on the population parameters (e.g., AUD(A)=0.2, AUD(B)=0.4, AUD(C)=0.6, AUD(D)=0.8) and the first subset of the Lagrange multipliers (e.g., λ5=−2.0149, λ6=−1.19392, λ7=0.81093), thereby resulting in second task-based Lagrange multipliers λŝ (e.g., the values of λ0, λ1, λ2, λ3, and λ4 from block 412 and the values of λ5, λ6, and λ7 from block 406) as further described below in conjunction with
At block 502, the example commercial solver(s) 308 (
At block 506, the example commercial solver(s) 308 set(s) up a system of equations based on Equation 6 corresponding to removal off each node of the tree structure association using the panelist data. For example, the commercial solver(s) 308 set(s) up a system of equations including 0.7−AUD(T\1)=0.1 for node 1 (e.g., margin A), 0.7−AUD(T\2)=0.2 for node 2 (e.g., margin B), . . . , and 0.7−AUD(T\6)=0.55 for node 6 (e.g., union CD).
At block 510, the example commercial solver(s) 308 select(s) all equations corresponding to node(s) at the lowest available height (e.g., margins being the lowest height and the union corresponding to the total representing the highest height). As described above, because removing a node from the tree structure association only affects the ancestors of the node, the commercial solver(s) 308 performs parallel computations with a processor to can calculate the Lagrange multipliers at the same height in parallel without affecting the other nodes. For example, initially the commercial solver(s) select(s) the equations corresponding to the lowest level nodes (e.g., 1, 2, 3, and 4) corresponding to the margins A, B, C, and D (e.g., 0.7−AUD(T\1)=0.1, 0.7−AUD(T\2)=0.2, 0.7−AUD(T\3)=0.3, and 0.7−AUD(T\4)=0.3).
At block 510, the example commercial solver(s) 308 select(s) value(s) for the Lagrange multiplier(s) that satisfy(ies) the equations. As described above, each of the selected equations corresponds to Equation 4, which include variable(s) that can be identified using Equation 5. Because both Equation 4 and Equation 5 include z values and zi=eλ
If the example commercial solver(s) 308 determine(s) that the selected value(s) do(es) not satisfy(ies) the constraint(s) of the selected equation(s) (block 512: NO), the commercial solver(s) 308 adjust(s) the selected value(s) of the Lagrange multipliers (block 514), and the control returns to block 512 until the selected value(s) do(es) satisfy the constraints (e.g., within the threshold amount of error). In this manner, the example commercial solver(s) 308 perform(s) parallel computations with a processor to determine the Lagrange multipliers in an iterative fashion in parallel. If the example commercial solver(s) 308 determine(s) that the selected value(s) do(es) satisfy the constraint(s) of the selected equation(s) (block 512: YES), the example commercial solver(s) 308 determine(s) if there are additional node(s) of the tree structure association at additional height(s) (block 516). For example, once the example commercial solver(s) 308 determine(s) the Lagrange multipliers for the margins A, B, C, D, the commercial solver(s) 308 determine(s) that there are additional nodes (e.g., AB and CD) at the next lowest height. If the commercial solver(s) 308 determine(s) that there are additional node(s) at additional height(s) (block 516: YES), the control returns to block 508 to determine the Lagrange multipliers corresponding to the additional node(s). If the commercial solver(s) 308 determine(s) that there are not additional node(s) at additional height(s) (block 516: NO), the example process of
At block 602, the example commercial solver(s) 308 set(s) up a system of equations based on Equation 6 corresponding to removal of each node of the tree structure association. For example, the commercial solver(s) 308 use(s) the panelist data where the first subset Lagrange multipliers are used for the equations corresponding to union removals. For example, the commercial solver(s) 308 sets of a system of equations including AUD(T)−AUD(T\1)=0.2 for node 1 (e.g., margin A), AUD(T)−AUD(T\2)=0.4 for node 2 (e.g., margin B), . . . , and AUD(T)−AUD(T\6)=? for node 6 (e.g., union CD).
At block 604, the example commercial solver(s) 308 select(s) value(s) for the variable Lagrange multiplier(s) (e.g., λ0, λ1, λ2, λ3, and λ4) that satisfies the equations while using the first subset of the Lagrange multiplier(s) (e.g., λ5=−2.0149, λ6=−1.19392, λ7=0.81093). As described above, each of the selected equations corresponds to Equations 2 and 4, which includes a variable s and a variable s(new) that can be identified using equations 3 and 5, respectively. Because Equations 2-5 include z values and zi=eλ
If the example commercial solver(s) 308 determine(s) that the selected value(s) do(es) not satisfy the constraint(s) of the selected equation(s) (block 606: NO), the commercial solver(s) 308 adjust(s) the selected value(s) of the variable Lagrange multipliers (block 608) and the control returns to block 512 until the selected value(s) do(es) satisfy the constraints (e.g., within the threshold amount of error). In this manner, the example commercial solver(s) 308 performs parallel computations with a processor to determine the variable Lagrange multipliers in an iterative fashion in parallel. If the example commercial solver(s) 308 determine(s) that the selected value(s) do(es) satisfy the constraint(s) of the selected equation(s) (block 606: YES), the example process of
At block 702, the example commercial solver(s) 308 determine(s) the total population audience (e.g., AUD(T)). As described above in conjunction with Equation 2, AUD(T)=z0zNsN, where z0zN correspond to the Lagrange multipliers determined in block 412 and sN can be solved using Equation 3. Accordingly, the commercial solver(s) 308, using Equation 2, determine(s) that since N=7, s1=s2=s3=s4=1, s5=(1+s1{circumflex over (z)}1)(1+s2{circumflex over (z)}2)−1=5.57925, and s6=(1+s3{circumflex over (z)}3)(1+s3{circumflex over (z)}3)−1=17.87222. Accordingly, the example commercial solver(s) 308 determine(s) that s7=(1+s5{circumflex over (z)}5)(1+s6{circumflex over (z)}6)−1=10.1885552. Thus, the example commercial solver(s) 308 determine(s) that AUG(T)=(0.418)(2.25)(10.188552)=0.9582.
At block 704, the example commercial solver(s) 308 select(s) an available unknown population union audience. For example, in the example of
Accordingly, the example commercial solver(s) 308 determine(s) that the
Thus, the commercial solver(s) 308 determine(s) that AUD(T\5)=(0.0417986)(2.25)(5.4158)=0.5093.
At block 708, the example commercial solver(s) 308 subtract(s) the determined total population audience with the removed population union from the total population audience to determine the population union audience percentage (e.g., corresponding to Equation 1). For example, the commercial solver(s) 308 determines the total deduplicated population audience percentage of union AB to be 0.4486 (e.g., 0.4486=0.9582−0.5093). At block 710, the example commercial solver(s) 308 determine(s) the population reach across the selected union by multiplying the population union audience percentage by the universe estimate. For example, if the universe estimate is 100,000 people, then the example commercial solver(s) 308 determine(s) the population reach for union AB to be 44,860 people. At block 712, the example commercial solver(s) 308 determine if there are additional unknown population union audiences to solve for. If the example commercial solvers) 308 determine that there are additional population reach(es) for additional union(s) to solve for (block 712: YES), the control returns to block 704 to determine an additional population reach for an additional union. If the example commercial solvers) 308 determine that there are not additional population reach(es) for additional union(s) to solve for (block 712: NO), the example process of
The processor platform 800 of the illustrated example includes a processor 812. The processor 812 of the illustrated example is hardware. For example, the processor 812 can be implemented by one or more integrated circuits, logic circuits, microprocessors, GPUs, DSPs, or controllers from any desired family or manufacturer. The hardware processor may be a semiconductor based (e.g., silicon based) device. In this example, the processor 812 implements the example data converter 302, the example association controller 304, and the example commercial solver(s) 308.
The processor 812 of the illustrated example includes a local memory 813 (e.g., a cache). In
The processor platform 800 of the illustrated example also includes an interface circuit 820. The interface circuit 820 may be implemented by any type of interface standard, such as an Ethernet interface, a universal serial bus (USB), a Bluetooth® interface, a near field communication (NFC) interface, and/or a PCI express interface. In the example of
In the illustrated example, one or more input devices 822 are connected to the interface circuit 820. The input device(s) 822 permit(s) a user to enter data and/or commands into the processor 812. The input device(s) can be implemented by, for example, an audio sensor, a microphone, a camera (still or video), a keyboard, a button, a mouse, a touchscreen, a track-pad, a trackball, isopoint and/or a voice recognition system.
One or more output devices 824 are also connected to the interface circuit 820 of the illustrated example. The output devices 824 can be implemented, for example, by display devices (e.g., a light emitting diode (LED), an organic light emitting diode (OLED), a liquid crystal display (LCD), a cathode ray tube display (CRT), an in-place switching (IPS) display, a touchscreen, etc.), a tactile output device, a printer and/or speaker. The interface circuit 820 of the illustrated example, thus, typically includes a graphics driver card, a graphics driver chip and/or a graphics driver processor.
The interface circuit 820 of the illustrated example also includes a communication device such as a transmitter, a receiver, a transceiver, a modem, a residential gateway, a wireless access point, and/or a network interface to facilitate exchange of data with external machines (e.g., computing devices of any kind) via a network 826. The communication can be via, for example, an Ethernet connection, a digital subscriber line (DSL) connection, a telephone line connection, a coaxial cable system, a satellite system, a line-of-site wireless system, a cellular telephone system, etc.
The processor platform 800 of the illustrated example also includes one or more mass storage devices 828 for storing software and/or data. Examples of such mass storage devices 828 include floppy disk drives, hard drive disks, compact disk drives, Blu-ray disk drives, redundant array of independent disks (RAID) systems, and digital versatile disk (DVD) drives.
Machine executable instructions 832 represented by
From the foregoing, it should be appreciated that the above disclosed methods, apparatus, and articles of manufacture estimate population reach from marginal ratings. Example disclosed herein determine the reach analytically using the above Equations 1-8. Traditional techniques for determining reach across various unions include determining the reach numerically. However, such traditional techniques are unsolvable for a large number of marginals dues to memory and/or processing constraints. Examples disclosed herein alleviate the problems associated with such traditional techniques by demining the reach analytically (e.g., via solving the disclosed Equations 2-8 using a population reach union estimates) in a manner that facilitates parallel processing. The parallel processing results in a faster more efficient calculation. In this manner, the estimations of reach can be determined in a faster, more efficient manner that requires less memory than traditional numerical techniques. Using examples disclosed herein reach can be determined from a nearly infinite number of instances and/or unions of media exposure based on the marginal media exposure data.
Although certain example methods, apparatus and articles of manufacture have been described herein, the scope of coverage of this patent is not limited thereto. On the contrary, this patent covers all methods, apparatus and articles of manufacture fairly falling within the scope of the appended claims either literally or under the doctrine of equivalents.
Number | Name | Date | Kind |
---|---|---|---|
5956692 | Foley | Sep 1999 | A |
6460025 | Fohn et al. | Oct 2002 | B1 |
6775663 | Kim | Aug 2004 | B1 |
7746272 | Vollath | Jun 2010 | B2 |
7954120 | Roberts et al. | May 2011 | B2 |
8112301 | Harvey et al. | Feb 2012 | B2 |
8149162 | Pauls | Apr 2012 | B1 |
8200693 | Steele et al. | Jun 2012 | B2 |
8619892 | Vetter et al. | Dec 2013 | B2 |
9111186 | Blasinski et al. | Aug 2015 | B2 |
9224094 | Oliver et al. | Dec 2015 | B2 |
9236962 | Hawkins et al. | Jan 2016 | B2 |
9420320 | Doe | Aug 2016 | B2 |
9529836 | Hale | Dec 2016 | B1 |
10602224 | Sullivan et al. | Mar 2020 | B2 |
20020123928 | Eldering et al. | Sep 2002 | A1 |
20040001538 | Garrett | Jan 2004 | A1 |
20060190318 | Downey et al. | Aug 2006 | A1 |
20080228543 | Doe | Sep 2008 | A1 |
20080300965 | Doe | Dec 2008 | A1 |
20080313017 | Totten | Dec 2008 | A1 |
20100161385 | Karypis et al. | Jun 2010 | A1 |
20100191723 | Perez et al. | Jul 2010 | A1 |
20120023522 | Anderson et al. | Jan 2012 | A1 |
20120052930 | McGucken | Mar 2012 | A1 |
20120072940 | Fuhrer | Mar 2012 | A1 |
20120110027 | Falcon | May 2012 | A1 |
20120254911 | Doe | Oct 2012 | A1 |
20130198125 | Oliver et al. | Aug 2013 | A1 |
20130290233 | Ferren et al. | Oct 2013 | A1 |
20130339991 | Ricci | Dec 2013 | A1 |
20140101685 | Kitts et al. | Apr 2014 | A1 |
20140280891 | Doe | Sep 2014 | A1 |
20150180989 | Seth | Jun 2015 | A1 |
20150186403 | Srivastava et al. | Jul 2015 | A1 |
20160012314 | Ramamurthy et al. | Jan 2016 | A1 |
20160086208 | Oliver et al. | Mar 2016 | A1 |
20160165277 | Kirillov et al. | Jun 2016 | A1 |
20160232563 | Perez et al. | Aug 2016 | A1 |
20160249098 | Pecjak et al. | Aug 2016 | A1 |
20160269783 | Mowrer et al. | Sep 2016 | A1 |
20160323616 | Doe | Nov 2016 | A1 |
20160373820 | Meyer et al. | Dec 2016 | A1 |
20160379246 | Sheppard et al. | Dec 2016 | A1 |
20170006342 | Nagaraja Rao et al. | Jan 2017 | A1 |
20170155956 | Nagaraja Rao et al. | Jun 2017 | A1 |
20180073933 | Keskin et al. | Mar 2018 | A1 |
20180225709 | Ferber et al. | Aug 2018 | A1 |
20180249208 | Sheppard et al. | Aug 2018 | A1 |
20180249210 | Sheppard et al. | Aug 2018 | A1 |
20180249211 | Sheppard et al. | Aug 2018 | A1 |
20180249214 | Sullivan et al. | Aug 2018 | A1 |
20180376198 | Sheppard et al. | Dec 2018 | A1 |
Number | Date | Country |
---|---|---|
2015529870 | Oct 2015 | JP |
2014210597 | Dec 2014 | WO |
Entry |
---|
Huckett et al., “Combining Methods to Create Synthetic Microdata: Quantile Regression, Hot Deck, and Rank Swapping,” ResearchGate, Apr. 2008, 11 pages. |
Golub et al., “Linear Least Squares and Quadratic Programming,” Technical Report No. CS 134, Stanford University, Computer Science Department, May 1969, 38 pages. |
Byrne, “Iterative Algorithms in Inverse Problems,” Apr. 25, 2006, 347 pages. |
Byrne, “Applied Iterative Methods,” Jan. 23, 2007, 396 pages. |
Bourguignon et al., “On the Construction of Synthetic Panels,” Oct. 2015, 42 pages. |
Verbeek,“Pseudo-Panels and Repeated Cross-Sections,” The Econometrics of Panel Data, Springer-Verlag Berlin Heidelberg, 2008, 15 pages. |
Teunissen, “Least-Squares Estimation of the Integer GPS Ambiguities,” Delft University of Technology, Department of the Geodetic Engineering, Aug. 1993, 16 pages. |
United States Patent and Trademark Office, “Non-Final Office action” issued in connection with U.S. Appl. No. 15/445,530, dated Apr. 11, 2018, 10 pages. |
United States Patent and Trademark Office, “Notice of Allowance,” issued in connection with U.S. Appl. No. 15/445,530, dated Jan. 2, 2019, 8 pages. |
United States Patent and Trademark Office, “Notice of Allowance,” issued in connection with U.S. Appl. No. 15/445,530, dated May 3, 2019, 5 pages. |
United States Patent and Trademark Office, “Corrected Notice of Allowability,” issued in connection with U.S. Appl. No. 15/445,530, dated May 21, 2019, 2 pages. |
United States Patent and Trademark Office, “Notice of Allowance and Fee(s) Due,” issued in connection with U.S. Appl. No. 15/445,530, dated Sep. 25, 2019, 5 pages. |
Haggin, Patience et al., “Google Nears a Long-Tipped Limit on Tracking “Cookies,” in Blow to Rivals,” The Wall Street Journal, May 6, 2019, obtained from https://www.wsj.com/articles/gcogles-new-privacy-tools-to-make-cookies-crumble-competitors-stumble-11557151913, 3 pages. |
United States Patent and Trademark Office, “Notice of Allowance,” issued in connection with U.S. Appl. No. 15/445,530, dated Jan. 31, 2020, 10 pages. |
International Searching Authority, “International Search Report & Written Opinion,” issued in connection with application No. PCT/US2020/022438 dated Jul. 6, 2020, 10 pages. |
International Searching Authority, “International Search Report & Written Opinion,” issued in connection with application No. PCT/US2020/022436 dated Jul. 6, 2020, 9 pages. |
United States Patent and Trademark Office, “Non-Final Office Action,” issued in connection with U.S. Appl. No. 16/526,747, dated Jul. 23, 2020, 7 pages. |
Number | Date | Country | |
---|---|---|---|
20200296441 A1 | Sep 2020 | US |