This disclosure relates generally to audience measurement and, more particularly, to methods and apparatus to de-duplicate partially-tagged media entities.
In recent years, digital media has been tagged or otherwise embedded with software, scripts, or other programs to report audience measurement information to a media monitoring company. However, not all digital media is tagged. Digital media may be fully-tagged, non-tagged, or partially-tagged (e.g., a combination of tagged and non-tagged).
Example methods and apparatus disclosed herein de-duplicate digital audience measurement of partially-tagged media entities. As used herein, entities are collections of one or more instances of media, such as, for example media platforms (e.g., video, application, text, etc.), channels (e.g., a collection of platforms), sub-brands (e.g., a collection of channels), and/or brands (e.g., a collection of sub-brands) representative of a media provider. In some examples, other categorizations may be included such as, for example, assets (e.g., a program series) and/or episodes (e.g., an instance of a program series). In some examples, entities are tagged (e.g., with software, scripts, etc.) to measure and report audience measurements (e.g., unique audience member views) to a media monitoring company with respect to the media provider. Entities may be fully-tagged, non-tagged, or partially-tagged. In some examples, entities include one or more sub-entities. As used herein, sub-entities may also include sub-entities. Sub-entities may be categorized based on a hierarchy, content type (e.g., video, text. etc.), platform type (e.g., mobile, desktop, etc.), or any combination thereof.
In some examples, a media platform is partially-tagged if at least one media of that media platform is fully-tagged, but not all media are fully-tagged. In some examples, a media platform is fully-tagged if an access monitor (e.g., a software development kit (“SDK”)) is implemented on (e.g., incorporated with, embedded in, etc.) a media player, application, or webpage presenting media. As used herein, an access monitor sends impressions of media exposure via the media player, application, or webpage to a media monitoring company (e.g., The Nielsen Company (US), LLC) in response to user action (e.g., loading/selecting a media player, a webpage, an application, an advertisement, etc.).
An impression refers to a recordation of a presentation of an item of media (e.g., from a media campaign) to an audience member. As used herein, the “audience” of a designated item of media refers to the number of persons who have viewed the designated item of media. An “audience member” of an audience refers to an individual person within the audience. Whereas the calculation of the audience of a media item may, in some examples discussed herein, count a single audience member multiple times, the “unique audience” of a media is an audience of the media item in which each audience member is represented only once. “Reach” refers to the amount of a population that corresponds to the measured audience. For example, if the population of an area is 280,000 and the measured audience is 140,000, the reach for a given media campaign is ½ or 50% of the population. “Frequency” refers to the number of times a unique audience member is presented a same media campaign. “Duration” refers to an amount of time that a media is presented and/or viewed.
Media monitoring companies desire knowledge of how users interact with media devices such as smartphones, tablets, laptops, smart televisions, etc. In particular, media monitoring companies want to monitor media presentations made at the media devices to, among other things, monitor exposure to advertisements, determine advertisement effectiveness, determine user behavior, identify purchasing behavior associated with various demographics, etc.
As used herein, “media platforms” are sub-entities of “channels”, “channels” are sub-entities of “sub-brands”, and “sub-brands” are sub-entities of “brands”. As disclosed herein, the status of an entity is dependent on the status of the sub-entities of the entity. Table 1 describes some example entity statuses based on its sub-entities.
With reference to Table 1, a channel (e.g., entity) is fully-tagged if all platforms (e.g., sub-entities) under the channel are fully-tagged. In some examples, a sub-brand (e.g., entity) is fully-tagged if all channels (e.g., sub-entities) under the sub-brand are fully-tagged and all platforms under the channels are fully-tagged. In some examples, a brand (e.g., entity) is fully-tagged if all sub-brands (e.g., sub-entities) under the brand are fully-tagged, all channels under the sub-brands are fully-tagged, and all platforms under the channels are fully-tagged. If at least one sub-entity is not fully-tagged, then the entity cannot be fully-tagged. In examples wherein an entity is fully-tagged, tagged data is used in calculating audience metrics.
In some examples, a channel (e.g., entity) is non-tagged if all platforms (e.g., sub-entities) under the channel are non-tagged. In some examples, a sub-brand (e.g., entity) is non-tagged if all channels (e.g., sub-entities) under the sub-brand are non-tagged and all platforms under the channels are non-tagged. In some examples, a brand (e.g., entity) is non-tagged if all sub-brands (e.g., sub-entities) under the brand are non-tagged, all channels under the sub-brands are non-tagged, and all platforms under the channels are non-tagged. If at least one sub-entity is tagged, then the entity cannot be non-tagged. In examples wherein an entity is non-tagged, panelist data is used in calculating audience metrics.
However, in examples wherein an entity is partially-tagged, a combination of panelist data and tagged data is used, as further disclosed herein. As used herein, panelists are users registered on panels maintained by a media monitoring company that owns and/or operates the ratings entity subsystem. Traditionally, media monitoring companies determine demographic reach for advertising and media programming based on registered panel members. That is, a media monitoring company enrolls people that consent to being monitored into a panel. During enrollment, the media monitoring company receives demographic information from the enrolling people so that subsequent correlations may be made between advertisement/media exposure to those panelists and different demographic markets. People become panelists via, for example, a user interface presented on the media device (e.g., via a website). People become panelists in additional or alternative manners such as, for example, via a telephone interview, by completing an online survey, etc. Additionally or alternatively, people may be contacted and/or enlisted using any desired methodology (e.g., random selection, statistical selection, phone solicitations, Internet advertisements, surveys, advertisements in shopping malls, product packaging, etc.).
The methods and apparatus disclosed herein identify a tagged audience for a first sub-entity, identify a panel audience for a second sub-entity, determine a panel duplication between the first sub-entity and the second sub-entity, determine a duplicated unique audience based on the tagged audience, the panel audience, and the panel duplication, and determine a de-duplicated unique audience for the partially-tagged entity based on the duplicated unique audience and a total unique audience, the total unique audience including the tagged audience for the first sub-entity and the panel audience for the second sub-entity. While example methods, apparatus, and articles of manufacture are described herein with reference to sub-entities, it will be appreciated that the descriptions are applicable to entities (e.g., entities may be sub-entities with respect to other entities), as further disclosed herein.
An example implementation of a non-tagged brand 108 includes an example sub-brand 110, which includes an example channel 1112 and an example channel 2114. The example channel 1112 includes example browser text 116, example application text 118, example browser video 120. The example channel 2114 includes example browser text 122, example application text 124, and example browser video 126. Similar to the partially-tagged channel 1100, the example methods and apparatus use fusion panelist data for the non-tagged brand 108 because the brand 108 does not include any fully-tagged sub-entities (e.g., the sub-brand 110, the channel 1112 and corresponding browser text 116, application text 118, browser video 120, the channel 2114 and corresponding browser text 122, application text 124, browser video 126, etc.).
In examples without at least one fully-tagged entity, a panel of online and mobile panelists (e.g., a fusion panel) is used to determine duplicate unique audience (UA) members between entities or sub-entities (e.g., the number of panelists that view multiple entities or sub-entities). The methods and apparatus disclosed herein remove the duplicate UA members from a total unique audience to determine a de-duplicated unique audience.
An example implementation of a partially-tagged brand 128 includes a partially-tagged sub-brand 1130 and a fully-tagged sub-brand 2132. The example partially-tagged sub-brand 1130 includes an example fully-tagged channel 1134 and an example non-tagged channel 2136. The example fully-tagged channel 1134 includes fully-tagged browser text 138, application text 140, and browser video 142. The example non-tagged channel 1136 includes non-tagged browser text 144, application text 146, and browser video 148.
The example fully-tagged sub-brand 2132 includes an example fully-tagged channel 3150, which includes fully-tagged browser text 152, application text 154, and browser video 156. In such examples, tagged data is used for the fully-tagged sub-brand 2132 and corresponding channel 3150, browser text 152, application text 154, and browser video 156. However, as disclosed herein, tagged data is not readily available for the partially-tagged sub-brand 1130. In examples wherein at least one entity or sub-entity is fully-tagged (but not all), the example methods and apparatus use a combination of fusion panelist data and tagged data to determine the de-duplicated unique audience.
An example implementation of a partially-tagged channel 1158 includes non-tagged browser text 160, partially-tagged application text 162, and fully-tagged browser video 164. The example channel 1158 differs from the example channel 1100 because the example channel 1158 includes at least one fully-tagged sub-entity (e.g., fully-tagged browser video 164). In some examples, a partially-tagged entity/sub-entity is treated as a non-tagged entity/sub-entity if the entity/sub-entity does not include at least one fully-tagged sub-entity. For example, partially-tagged platforms are treated as non-tagged entities (e.g., partially-tagged platforms do not include at least one fully-tagged entity). In some such examples, partially-tagged channels with no fully-tagged platforms are also treated as non-tagged entities. As disclosed above, the example methods and apparatus use a combination of fusion panelist data and tagged data to determine the de-duplicated unique audience in examples wherein at least one entity or sub-entity, but not all entities or sub-entities, is fully-tagged.
In the illustrated example of
In some examples, unique audience members are duplicated (e.g., double counted) at the entity level. For example, when the browser text 138 media platform and the browser video 142 media platform (e.g., the sub-entities) are presented to the first one of the audience members 204, the media monitoring company 210 associates one unique audience member for the browser text 138 media platform and one unique audience member for the browser video 142 media platform. In such examples, the media monitoring company 210 associates two unique audience members for the channel 1134 (e.g., the entity) based on unique audience members of the sub-entities. However, the first one of the audience members 204 is only one unique audience member for the channel 1134 (e.g., the entity). In some such examples, a count of the unique audience members is de-duplicated.
In some examples, when the browser text 138 media platform (e.g., a sub-entity of channel 1134) and the application text 154 media platform (e.g., a sub-entity of channel 3150) are presented to the second one of the audience members 206, the media monitoring company 210 associates one unique audience member for the browser text 138 media platform and one unique audience member for the application text 154 media platform. In such examples, the media monitoring company 210 associates one unique audience member for the channel 1134 (e.g., a first entity) and associates one unique audience member for the channel 3150 (e.g., a second entity, because at the channel level, the second one of the audience members 206 is a unique audience member of channel 1134 and of channel 3150. In such examples, the channel 1134 and the channel 3150 have a correct number of unique audience members.
However, at a higher entity level, the media monitoring company 210 would associate two unique audience members for the brand 1128 (e.g., the entity) based on the unique audience members of the sub-entities (e.g., one unique audience member for the channel 1134 and one unique audience member for the channel 3150). The second one of the audience members 204 is only one unique audience member for the brand 1128, and the count of the unique audience members is to be de-duplicated. As disclosed herein, the example methods and apparatus de-duplicate the count of unique audience members from sub-entities to entities by determining the number of audience members that view multiple sub-entities of a first entity. The example methods and apparatus roll up an example entity hierarchy (
Tagged media is often double counted due to the media monitoring company 210 not knowing which unique audience members were presented multiple instances of media. In some examples, the media monitoring company 210 knows which panelists are presented multiple instances of media. However, a panel is limited to those panelists who are enrolled, while tagged media may be distributed to the entire population of a country to acquire audience measurements of the entire population. Accordingly, the example methods and apparatus disclosed herein utilize panelist data in combination with tagged data to determine de-duplicated unique audiences. The example ratings manager 202 utilizes panelist data from the example fusion panel database 212 to de-duplicate partially-tagged entities (e.g., platforms, channels, sub-brands, brands, etc.) accessible by one or more audience members.
The example entity manager 300 determines the status of entities and sub-entities such as, for example, the first media platform 222. As disclosed herein, entities may have a status of fully-tagged, partially-tagged, or non-tagged. In some examples, the entity manager 300 detects whether an entity has an access monitor and/or other audience measurement device that sends media impressions to the media monitoring company 210 to determine whether the entity is fully-tagged. In some examples, the entity manager 300 determines whether all sub-entities are fully-tagged and/or whether all sub-entities are non-tagged. In some such examples, if any sub-entity is not fully-tagged, then the entity manager 300 determines the entity is not fully-tagged. In some such examples, if any sub-entity is not non-tagged, then the entity manager 300 determines the entity is not non-tagged. In some such examples, if the sub-entities are a combination of tagged (e.g., fully and/or partially) and non-tagged, then the entity manager 300 determines the entity is partially-tagged. In an illustrated example, the example ratings manager 202 processes two sub-entities of an entity at a time. Although the illustrated example is disclosed in connection with the processing of two sub-entities (or entities), any number of entities may be processed. The two sub-entities may be at the bottom of a hierarchy (e.g.,
The example metrics calculator 302 determines metrics for media campaigns of one or more entities and/or sub-entities. As used herein, metrics refer to audience measurements such as, for example, impressions, audience size, reach, frequency, unique audience, duration, etc. Based on the status of the sub-entities received from the example entity manager 300, the example metrics calculator 302 will use different data. For example, the metrics calculator 302 will use tagged data for fully-tagged and partially-tagged entities and the example metrics calculator 302 will use panelist data for partially-tagged and non-tagged entities. In some examples, the metrics calculator 302 determines metrics for a first data type separately from a second data type. For example, metrics for applications may be separated from metrics for text and/or video. In some examples, metrics for online media may be separated from metrics for mobile media. In some examples, the metrics calculator 302 utilizes audience measurements from tagged media such as, for example, media platform 222 (
In the illustrated example, the first sub-entity SE1 is a tagged entity, and thus, the metrics calculator 302 determines the audience from tagged data. In some examples, the UE is 280,000. Alternatively, the UE may be the population of a country (e.g., the United States population).
In some examples, the metrics calculator 302 utilizes audience measurements from the example fusion panel database 212 (
In the illustrated example, the second sub-entity SE1 is a non-tagged entity, and thus, the metrics calculator 302 determines the audience from panelist data (e.g., from the fusion panel database 212). In some examples, the metrics calculator 302 aggregates the metrics for each sub-entity to determine total metrics.
The example audience manager 304 determines a duplicated unique audience based on a duplication factor determined by the example de-duplicator 306 and the universe estimate according to Equation 3:
Duplicated Unique Audience(DUA)=DF·UE where: 0≤DUA≤min(panel audience,tag audience) Equation 3
The example audience manager 304 determines a de-duplicated unique audience based on a total unique audience between the first sub-entity and the second sub-entity and the duplicated unique audience. For example, the example audience manager 304 subtracts the duplicated unique audience from the total unique audience to determine the de-duplicated unique audience.
The example de-duplicator 306 determines panel duplication and a duplication factor between tagged (e.g., fully-tagged and/or partially-tagged) and non-tagged entities using metrics from entities and/or sub-entities determined by the example metrics calculator 302. As disclosed herein, an entity is only fully-tagged when all sub-entities are fully-tagged. As disclosed herein, an entity is only non-tagged when all sub-entities are non-tagged. As disclosed herein, an entity is partially-tagged when not all sub-entities are fully-tagged and when not all sub-entities are non-tagged. For example, one sub-entity may be non-tagged and one sub-entity may be partially-tagged, one sub-entity may be non-tagged and one sub-entity may be fully-tagged, one sub-entity may be partially-tagged and one sub-entity may be fully-tagged, multiple sub-entities are partially-tagged, etc. In some examples, a partially-tagged entity is treated like a non-tagged entity, as described above, if the entity does not include at least one fully-tagged sub-entity.
In examples wherein all entities/sub-entities are non-tagged, the de-duplicator 306 uses fusion panelist data (e.g., from the fusion panel database 212) to determine a panel duplication factor, as further described in connection with
DDPA=TOTAL AUDIENCE−(PANEL DUPLICATION FACTOR*TOTAL AUDIENCE) Equation 4
In examples involving partially-tagged entities, the example de-duplicator 306 determines a duplication factor (DF) based on an odds ratio approach, as shown in Equation 5:
To solve Equation 5, the example de-duplicator 306 utilizes the reach a1 for the first sub-entity SE1 (e.g., Equation 1), the reach a2 for the second sub-entity SE2 (e.g., Equation 2), and a duplication multiplier K. The example de-duplicator 306 determines the duplication multiplier K according to Equation 6:
To solve Equation 6, the example de-duplicator 306 determines a plurality of variables M, S, F, and D. The example de-duplicator 306 determines a panel duplication reach M according to Equation 7:
The example de-duplicator 306 determines a de-duplicated panel reach F for the first sub-entity SE1 according to Equation 8:
For example, the de-duplicator 306 divides a de-duplicated panel audience for the first sub-entity (e.g., the panel audience of the first sub-entity minus the panel duplication) by the universe estimate.
The example de-duplicator 306 determines a de-duplicated panel reach D for the second sub-entity SE2 according to Equation 9:
For example, the de-duplicator 306 divides a de-duplicated panel audience the second sub-entity (e.g., the panel audience of the second sub-entity minus the panel duplication) by the universe estimate.
The example de-duplicator 306 determines the variable S according to Equation 10:
In operation, the example entity manager 300 determines the status of a first sub-entity SE1 and a second sub-entity SE2. For example, the entity manager 300 determines that the first sub-entity SE1 is a tagged entity and the second sub-entity SE2 is a non-tagged entity. The example entity manager 300 communicates the status of the first sub-entity SE1 and the second sub-entity SE2 to the example metrics calculator 302. The example metrics calculator 302 determines metrics such as, for example, the reach for the first sub-entity SE1 (e.g., using tagged data) and the reach for the second sub-entity SE2 (e.g., using panelist data). The example de-duplicator 306 utilizes the reach for the first sub-entity SE1, the reach for the second sub-entity SE2, and the duplication multiplier determined to determine the duplication factor DF according to Equation 5. The example audience manager 304 determines the duplicated unique audience based on the duplication factor DF and the universe estimate according to Equation 3. The example audience manager 304 removes the duplicated unique audience from the total unique audience to determine the de-duplicated unique audience for an entity. The example ratings manager 202 repeats this operation by treating the entity of sub-entities SE1 and SE2 as a sub-entity in a subsequent iteration (e.g., roll-up). For example, the ratings manager 202 initially processes channels as entities and platforms as sub-entities. After all channels are processed, the ratings manager 202 processes sub-brands as entities and channels as sub-entities. Similarly, after all sub-brands are processed, the ratings manager 202 processes brands as entities and sub-brands as sub-entities.
While an example manner of implementing the example ratings manager 202 of
Flowcharts representative of example machine-readable instructions for implementing the example ratings manager 202 of
As mentioned above, the example processes of
If the example entity manager 300 determines that the entity (e.g., the first channel) does not only include fully-tagged sub-entities (block 402: NO), control proceeds to block 404. At block 404, the example entity manager 300 determines whether the entity (e.g., the first channel) includes only non-tagged sub-entities. In some examples, a partially-tagged entity/sub-entity is treated as a non-tagged entity/sub-entity if the entity/sub-entity does not include at least one fully-tagged sub-entity. For example, partially-tagged platforms are treated as non-tagged entities (e.g., partially-tagged platforms do not include at least one fully-tagged entity). In some such examples, partially-tagged channels with no fully-tagged platforms are also treated as non-tagged entities. In some examples, entities with only non-tagged sub-entities are non-tagged entities and only use panelist data for audience measurement. If the example entity manager 300 determines that the entity (e.g., the first channel) does not only include non-tagged sub-entities (block 404: NO), control proceeds to block 406.
In the illustrated example, the entity (e.g., the first channel) is a partially-tagged entity including a tagged sub-entity SE1 and a non-tagged sub-entity SE2. At block 406, the example metrics calculator 302 calculates panel metrics for the example sub-entity SE1 and the example sub-entity SE2. For example, the metrics calculator 302 determines at least one of impressions, audience, reach, frequency, unique audience, and/or duration for each sub-entity based on panelist data. In the illustrated example, the panel audience for sub-entity SE1 is equal to 60, the panel unique audience for sub-entity SE2 is equal to 30, the panel reach for SE1 is equal to 60/280,000=0.00021486, and the panel reach for SE2 (e.g., a2) is equal to 30/280,000=0.00010714. In some examples, the metrics calculator 302 receives panelist data from the fusion panel database 212 (
At block 408, the example de-duplicator 306 determines the panel duplication across the sub-entities (e.g., SE1 and SE2) from the fusion panel database 212 (e.g., 20). As disclosed herein, the panel duplication is the number of panelists who view all the sub-entities (e.g., both SE1 and SE2). At block, 410, the example metrics calculator 302 calculates tagged metrics for the example sub-entity SE1, because tagged data is available for SE1. For example, the metrics calculator 302 determines at least one of impressions, audience, reach, frequency, unique audience, and/or duration for the sub-entity SE1. In the illustrated example, the tagged unique audience for sub-entity SE1 is equal to 110, and the tag reach of sub-entity SE1 (e.g., a1)=110/280,000=0.00039285.
At block 412, the example audience manager 304 determines the total unique audience based on the tagged metrics for the example sub-entity SE1 (e.g., SE1 tagged unique audience=110) and the panel metrics for the example sub-entity SE2 (e.g., SE2 panel unique audience=30). For example, the example audience manager 304 adds the SE1 tagged unique audience (e.g., 110) to the SE2 panel unique audience (e.g., 30) to determine the total unique audience (e.g., 110+30=140). At block 414, the example de-duplicator 306 determines the duplication factor according to Equation 2 and Equations 4-10, and further described in connection with
At block 420, the example entity manager 300 determines whether there are more sub-entities (e.g., media platforms) under the entity (e.g., the first channel). If the example entity manager 300 determines that there are more sub-entities under the entity (block 420: YES), control proceeds to block 422. At block 422, the example metrics calculator 302 calculates metrics for the combination of the previous sub-entities (e.g., the first platform and the second platform). For example, the metrics calculator 302 calculates at least one of impressions, audience, reach, frequency, and/or duration for the combination of the first platform and the second platform (e.g., SE1+SE2). In some examples, the combination of the previous sub-entities is treated as a sub-entity for processing with an additional sub-entity (e.g., SE1+SE2→SE1). For example, combination of the previous sub-entities is treated as SE1 for the remainder of the computer-readable instructions 400. At block 424, the example metrics calculator 302 calculates panel metrics for an additional sub-entity (e.g., a third platform). For example, the metrics calculator 302 calculates at least one of impressions, audience, reach, frequency, unique audience, and/or duration for the additional sub-entity with panelist data. In some examples, the additional sub-entity is treated as SE2 for the remainder of the computer-readable instructions 400.
At block 426, the example de-duplicator 306 determines the panel duplication between the previously determined de-duplicated unique audience (e.g., the first platform and the second platform) and the additional sub-entity (e.g., the third platform). At block 428, the example entity manager 300 determines whether the additional sub-entity is tagged or non-tagged. If the additional sub-entity is tagged (block 428: YES), control proceeds to block 430. At block 430, the example metrics calculator 302 calculates tagged metrics for the additional sub-entity. For example, the metrics calculator 302 calculates at least one of impressions, audience, reach, frequency, unique audience, and/or duration for the additional sub-entity with tagged data. At block 432, the example audience manager 304 determines the total unique audience based on the previously determined de-duplicated unique audience and the tagged metrics for the example additional sub-entity SE2 (e.g., SE2 tag unique audience). Thereafter, control returns to block 414.
If the additional sub-entity is non-tagged (block 428: NO), control proceeds to block 434. At block 434, the example audience manager 304 determines the total unique audience based on the previously determined de-duplicated unique audience and the panel metrics for the example additional sub-entity SE2 (e.g., SE2 panel unique audience). Thereafter, control returns to block 414.
If the example entity manager 300 determines that there are no more sub-entities under the entity (block 420: NO), if the example entity manager 300 determines that the entity only includes fully-tagged sub-entities (block 402: YES), or if the example entity manager 300 determines that the entity only includes non-tagged sub-entities (block 404: YES), control proceeds to block 436. In some examples, if the example entity manager 300 determines that the entity only includes non-tagged sub-entities (block 404: YES), the entity is processed as discussed in connection with
At block 436, the example entity manager 300 determines whether there are more entities. For example, once the first channel has been processed, other channels under a sub-brand may need to be processed. Therefore, if the example entity manager 300 determines that there are more entities (block 436: YES), control returns to block 402 to process another entity. If the example entity manager 300 determines that there are no more entities (block 436: NO), control proceeds to block 438.
At block 438, the example entity manager 300 rolls up the entities. For example, the example instructions 400 are first executed with channels as entities and platforms as sub-entities. After all channels are processed, the example instructions 400 are re-executed with sub-brands as entities and channels as sub-entities. Similarly, after all sub-brands are processed, the example instructions 400 are re-executed with brands as entities and sub-brands as sub-entities. Once all entities for a media campaign have been processed, the example instructions 400 cease operation.
At block 502, the example de-duplicator 306 divides the panel duplication determined at block 408 (e.g. 20) by the universe estimate (e.g., 280,000) to determine a panel duplication reach M according to Equation 7. In the illustrated example, M=20/280,000=0.000071429.
At block 504, the example de-duplicator divides the difference of the universe estimate minus the panel audience for the second sub-entity and the panel audience of the first sub-entity minus the panel duplication determined at block 408 by the universe estimate. As disclosed herein, the de-duplicator 306 stores this quotient as the variable S according to Equation 9. For example, the de-duplicator 306 subtracts the panel audience for the second sub-entity (e.g., 30) from the universe estimate (e.g., 280,000) to determine a non-panel universe (e.g., 279,970). The example de-duplicator 306 subtracts the panel duplication (e.g., 20) from the panel audience for the first sub-entity (e.g., 60) to determine a de-duplicated panel audience for the first sub-entity (e.g., 40). The example de-duplicator 306 divides the difference between the non-panel universe (e.g., 279,970) and the de-duplicated panel audience for the first sub-entity (40) by the universe estimate (e.g., 280,000). In the illustrated example, S=(279,070−40)/280,000=0.99975.
At block 506, the example de-duplicator 306 divides the difference between the panel audience for the second sub-entity (e.g., 30) and the panel duplication (e.g., 20) by the universe estimate (e.g., 280,000). As disclosed herein, the de-duplicator 306 stores this quotient as the variable D according to Equation 10. In the illustrated example, D=(30−20)/280,000=0.000035714.
At block 508, the example de-duplicator 306 divides the difference between the panel audience for the first sub-entity (e.g., 60) and the panel duplication (e.g., 20) by the universe estimate (e.g., 280,000). As disclosed herein, the de-duplicator 306 stores this quotient as the variable F according to Equation 8. In the illustrated example, F=(60−20)/280,000=0.000142857.
At block 510, the example de-duplicator 306 divides the product of the quotient of block 502 (e.g., M=0.000071429) and the quotient of block 504 (e.g., S=0.99975) by the product of the quotient of block 506 (e.g., D=0.000035714) and the quotient of block 508 (e.g., F=0.000142857) to determine a duplication multiplier (e.g., Equation 6). As disclosed above, the non-panel universe and the de-duplicated panel audience for the first sub-entity are determined at block 504 and the de-duplicated panel audience for the second sub-entity is determined at block 506. Accordingly, the duplication multiplier is based on at least the non-panel universe, the de-duplicated panel audience for the first sub-entity, and the de-duplicated panel audience for the second sub-entity. In the illustrated example, K=(0.000071429*0.99975)/(0.000035714*0.000142857)=13996.5.
The example de-duplicator 306 multiplies the quotient from block 510 (e.g., the duplication multiplier K) by the reach of the first sub-entity (e.g., a1=110/280,000=0.00039285) and the reach of the second sub-entity (e.g., a2=30/280,000=0.00010714) (block 512). As disclosed above, the reach of the first sub-entity is based on the tagged audience (e.g., 110) and the universe estimate (e.g., 280,000), and the reach of the second sub-entity is based on the panel audience (e.g., 30) and the universe estimate (e.g., 280,000). In the illustrated example, Ka1a2=13996.5*0.00039285*0.00010714=0.00589138.
As disclosed herein, the example de-duplicator 306 solves Equation 5 to determine the duplication factor. At block 514, the example de-duplicator 306 determines a duplication factor DF based on the sum from block 500, the quotients from blocks 502, 504, 506, 508 and 510, and the product from block 512. In the illustrated example,
As disclosed herein, the duplication factor DF is used in block 416 of
The example de-duplicator 306 determines a panel duplication factor based on the number of duplicated panelists and the total unique audience (block 606). For example, the panel duplication factor may be a ratio of the number of duplicated panelists and the total unique audience. At block 608, the example de-duplicator 306 determines a duplicated unique audience based on the panel duplication factor and the total unique audience. The example de-duplicator 306 determines a de-duplicated unique audience based on the total unique audience and the duplicated unique audience (block 610). For example, the de-duplicator 306 removes the duplicated unique audience from the total unique audience to determine the de-duplicated unique audience. Thereafter, the example computer-readable instructions 600 cease operation.
The processor platform 700 of the illustrated example includes a processor 712. The processor 712 of the illustrated example is hardware. For example, the processor 712 can be implemented by one or more integrated circuits, logic circuits, microprocessors or controllers from any desired family or manufacturer.
The processor 712 of the illustrated example includes a local memory 713 (e.g., a cache). The processor 712 of the illustrated example is in communication with a main memory including a volatile memory 714 and a non-volatile memory 716 via a bus 718. The volatile memory 714 may be implemented by Synchronous Dynamic Random Access Memory (SDRAM), Dynamic Random Access Memory (DRAM), RAMBUS Dynamic Random Access Memory (RDRAM) and/or any other type of random access memory device. The non-volatile memory 716 may be implemented by flash memory and/or any other desired type of memory device. Access to the main memory 714, 716 is controlled by a memory controller.
The processor platform 700 of the illustrated example also includes an interface circuit 720. The interface circuit 720 may be implemented by any type of interface standard, such as an Ethernet interface, a universal serial bus (USB), and/or a PCI express interface.
In the illustrated example, one or more input devices 722 are connected to the interface circuit 720. The input device(s) 722 permit(s) a user to enter data and commands into the processor 712. The input device(s) 722 can be implemented by, for example, an audio sensor, a microphone, a camera (still or video), a keyboard, a button, a mouse, a touchscreen, a track-pad, a trackball, isopoint and/or a voice recognition system.
One or more output devices 724 are also connected to the interface circuit 720 of the illustrated example. The output devices 624 can be implemented, for example, by display devices (e.g., a light emitting diode (LED), an organic light emitting diode (OLED), a liquid crystal display, a cathode ray tube display (CRT), a touchscreen, a tactile output device, a light emitting diode (LED), a printer and/or speakers). The interface circuit 720 of the illustrated example, thus, typically includes a graphics driver card, a graphics driver chip or a graphics driver processor.
The interface circuit 720 of the illustrated example also includes a communication device such as a transmitter, a receiver, a transceiver, a modem and/or network interface card to facilitate exchange of data with external machines (e.g., computing devices of any kind) via a network 726 (e.g., an Ethernet connection, a digital subscriber line (DSL), a telephone line, coaxial cable, a cellular telephone system, etc.).
The processor platform 700 of the illustrated example also includes one or more mass storage devices 728 for storing software and/or data. Examples of such mass storage devices 728 include floppy disk drives, hard drive disks, compact disk drives, Blu-ray disk drives, RAID systems, and digital versatile disk (DVD) drives.
The coded instructions 732 of
From the foregoing, it will be appreciated that the above-disclosed methods, apparatus and articles of manufacture de-duplicate partially-tagged entities. For example, the above-disclosed methods, apparatus and articles of manufacture determine numerous metrics for tagged and non-tagged sub-entities that make up partially-tagged entities. The above-disclosed methods, apparatus and articles of manufacture de-duplicate unique audiences of the tagged, partially-tagged, and/or non-tagged sub-entities and entities to report accurate audience measurements.
Although certain example methods, apparatus and articles of manufacture have been disclosed herein, the scope of coverage of this patent is not limited thereto. On the contrary, this patent covers all methods, apparatus and articles of manufacture fairly falling within the scope of the claims of this patent.
This patent arises from a continuation of U.S. patent application Ser. No. 15/240,865, which was filed on Aug. 18, 2016, which claims the benefit of U.S. Provisional Application Ser. No. 62/206,810, which was filed on Aug. 18, 2015. U.S. patent application Ser. No. 15/240,865 and U.S. Provisional Application Ser. No. 62/206,810 are hereby incorporated herein by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
62206810 | Aug 2015 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 15240865 | Aug 2016 | US |
Child | 17160201 | US |