The invention relates generally to profiling of recipients, and more particularly, to a method for profiling recipients on the basis of responses given by the recipients to content items delivered to the recipients.
Conventional methods for delivering advertisement data typically involve broadcasting messages to mass markets. This is usually described as a “Spray and Pray” approach, wherein the advertisement data is delivered to a wide audience and it is hoped that the advertisement data will be received by a sufficient number of potential recipients that are appropriate targets of the advertisement. Although an advertiser may take steps to ensure that the advertisement data is delivered via channels that traditionally are expected to reach a significant concentration of potential recipients, there is nevertheless little or no means to guarantee that the advertisement data is delivered to most appropriate recipients. An example of conventional mass marketing strategy is delivery of advertisement data through television channels and inclusion of the advertising data into commonly visited Internet websites.
Direct mailing campaigns via traditional mail and via electronic mail are considered to be more accurate in delivering advertisement information to targeted individuals and/or groups. In addition to the conventional electronic mail it is possible to use other electronic message delivery means for delivery of advertisement data, for example SMS-messages (Short Message Service) or MMS-messages (Multi Media Service) that can be delivered via a mobile communication network. Sending advertisement messages to recipients via a mobile communication network in a large scale causes often a lot of situations in which a certain advertisement message is received by an individual that is far from an optimal target for that advertisement message. For example, a message advertising large cars such as suburban vehicles (SUV) may be received by an environmentally conscious person that has adopted an attitude of hostility to such cars. In order to avoid situations of the kind described above or at least to minimise the amount of such situations there is a need to profile the recipients in such a manner that advertisement messages can be targeted to suitable recipients.
The profiling of the recipients can be based on answers given by the recipients to questions that have been delivered to the recipients e.g. via a communication network. Furthermore, the profiling can be based on demographic data related to the recipients. The answers to the questions and possibly also the demographic data constitutes raw data with the aid of which the recipients are categorised. In a situation in which there is only one question or only a few questions, the profiling may be too coarse or, in some cases, even misleading. For example, a question may be “Do you think the environment is important: Yes/No?”. Most of the people would answer “Yes” to this question albeit their actions and/or attitudes do not support that because the answer “No” would indicate exceptional egomania. From the advertisement point of view this “Yes” answer would lead to addressing ecologically friendly products to such recipients who would actually, for example, drive a SUV with a large consumption of gas and/or practise other behaviour that is far from environmental. In a situation in which there are a large number of questions, the number of different answer combinations gets high. For example, if there are N questions each of which having M answer-alternatives, the number of different answer combinations is MN. From the viewpoint of practical needs, the number of different recipient categories into which the recipients will be profiled has to be substantially smaller than the number of different answer combinations (MN). Therefore, the different answer combinations have to be mapped to a lower number of recipient categories in a manner that provides a sufficiently veracious profiling of the recipients.
The following presents a simplified summary in order to provide a basic understanding of some aspects of various invention embodiments. The summary is not an extensive overview of the invention. It is neither intended to identify key or critical elements of the invention nor to delineate the scope of the invention. The following summary merely presents some concepts of the invention in a simplified form as a prelude to the more detailed description below.
In accordance with at least one embodiment of the invention, a method, a system, a database, and a computer program are provided for profiling recipients into pre-determined recipient categories. The profiling is based at least partly on link rankings that are defined according pre-determined rules for content items delivered to the recipients and for the recipient categories. The content items can be embodied as messages containing data prompting the recipient for a response. An example of a content item can be an advertisement, which can be considered to implicitly contain a question in the sense that it prompts the recipient to respond with what can be considered an answer that is intrinsically linked to the content of the question.
The supporting and/or the implementation of the functionality for profiling the recipients are/is achieved by a combination of features recited in each independent claim. Accordingly, dependent claims prescribe further detailed implementations of the present invention.
Various exemplifying embodiments of the invention together with additional objects and advantages will be best understood from the following description of exemplifying embodiments when read in connection with the accompanying drawings.
The exemplifying embodiments of the invention presented in this document are not to be interpreted to pose limitations to the applicability of the appended claims. The verb “to comprise” is used in this document as an open limitation that does not exclude the existence of also unrecited features. The features recited in depending claims are mutually freely combinable unless otherwise explicitly stated.
The exemplifying embodiments of the invention and their advantages are explained in greater detail below with reference to the accompanying drawings, in which:
a is a schematic diagram showing an example of questions and categories used to profile respondents;
b is a schematic diagram showing an example of a profiling network comprising links between the questions and categories of
c is a schematic diagram showing an alternative profiling network comprising links between the questions and categories of
a is a schematic diagram showing a first example of paths traced through the profiling network of
b is a schematic diagram showing a second example of paths traced through the profiling network of
a shows a diagram illustrating an exemplifying pre-determined rule according to which links pointing to questions and to recipient categories can be set in a method according to an embodiment of the invention;
b shows a diagram illustrating exemplifying links that have been set according to the pre-determined rule of
As can be seen from
As mentioned briefly above, it is to be noted that questions can be retrieved according to Phase 101 at any time, and indeed the design of the network can be amended according to Phase 102 at any time, either to account for newly added questions and/or to change the link rankings applied to existing questions. Indeed,
Phase 103 involves collecting answers to the questions and tracing a path through the network on the basis of the collected answers so as to assign category values to the respondents. As an alternative to tracing a path on the basis of actual answers, paths can be identified on the basis of a set of hypothetical answers which may, for example, be specified in a log file or similar. Tracing paths through the network N1 can be visualised as activating links through the network. An exemplary set of paths is shown in
Whilst the examples shown in
In embodiments of the invention it is assumed that the questions utilized to form a profiling network N1 are available from sources such as exemplary server S3, and thence stored in data storage 20 for retrieval by a profiling system S2 for formulating a profiling network N1 of questions and for delivery as messages M1 via the communications network 6, 10; similarly, the responses M2 to the questions can be received and stored in database 21, while the links defining a given profiling network, together with associated link rankings and link weightings can be stored in database 22. It is to be appreciated that while these databases are shown as distinct entities, they could alternatively be part of an integrated storage system. Similarly, while the profiling system S2 is shown as being embodied in a single server S2, it is to be understood that the profiling system could be distributed between different devices according to the functionality required to a) process the questions, b) form a profiling network, and c) receive and process responses according to the profiling network. Further these different devices on which embodiments of the invention are configured could include web servers and/or store and forward devices such as the SMSC 16 and MMSC 14 shown in
Whilst shown as a mobile network 6 and the Internet 10, the communications network can be a mobile communication network capable of supporting, for example, one or more of the following communication protocols: GSM (Global System Mobile), WCDMA (Wideband Code Division Multiple Access), GPRS (General Packet Radio Service). In addition to or instead of the mobile communication network, a local area network such as a Wireless Local area network (WLAN) or BlueTooth® (BT) and/or other technologies such as WiMax, Broadcasting over DVB-H (Digital Video Broadcasting-Handhelds), ISDB-T (Integrated Services Digital Broadcasting for Terrestrial television broadcasting), DMB (Digital Media Broadcasting) or broadcasting over cellular can be used. The communication network can be also a combination of two or more technologies i.e. hybrid communication network. The communication network can also be arranged to support generic Internet access using any transport methods. The questions and the answers given to questions can be transferred in the electrical communication network, for example, as SMS-messages (Short Message Service), MMS-messages (Multi Media Service), Wireless Application Protocol (WAP) pages, Internet pages, HTML (Hypertext Mark-up Language) pages, XHTML (eXtended HTML) pages, IP-datagrams (Internet Protocol), or email letters (electronic mail).
In some embodiments of the invention it is assumed that the user of the terminal 2 is a subscriber of the profiling service according to embodiments of the invention, and that subscribers have entered data indicative of at least some of demographic data, preferences and interests, these data being received and stored by the registration server S1 in the subscriber database 24. As described above, the subscriber database 24 can be associated with a HLR for the mobile network 6: in a preferred arrangement, the preference data can be stored in a logically distinct storage area to that in which the network services and subscription data are stored, thereby decoupling the storage of preference data from the storage of the profiling network data. Alternatively the user can choose not to enter any preference data, in which case messages can be selected at random and a profile built up on real time (on the fly) based on responses to the messages.
Turning now to
Further, the links can be derived on the basis of data related to the answers given to the questions, as identified by the message processing component 505, described below. Data related to an answer can include for example: the content of the answer, a location where a recipient was situated when giving the answer, a point of time (a time of day, a day of week, etc) when the answer has been given, and/or a temporal delay from a moment of delivering the question to a recipient to a moment when the answer has been given. Such automated derivation of the links based on feedback from responses to questions can be performed by the linking component 503, which can additionally set several links between questions Q1, Q2, Q3, Q4 . . . on the basis of data related to an answer given to the question Q1. The information that indicates how the links are to be set can be included, for example, in metadata associated with the question Q1. The linking rules so derived can be stored in the database 20, for future use by the network generating component 501 or can trigger the network generating component 501 to perform real time generation of a profiling network N1. Yet further, links can be associated with time-to-live conditions. For example, a link may be defined to be valid only for a limited time interval after setting the link and to be removed after the limited time interval has elapsed.
Turning now to the distribution of the questions to recipients, the message processing component 505 is arranged to retrieve questions from the database 20 and formulate messages M1 associated therewith for transmission to recipients via the communications network. In an arrangement according to an embodiment of the invention, the message processing component 503 is arranged to select one or more recipients to be targets of a predetermined action as a response to a situation in which rankings of the recipient categories fulfill a pre-determined condition. The pre-determined action can be, for example, an advertisement campaign related to a specified product or service, an offer to provide a specified product or service for a reduced price, or sending a set of pre-determined questions to the selected recipients in order to collect further information about the selected recipients. In addition the message processing component 505 is arranged to process received responses M2 to the questions (i.e. answers to questions), and to store the responses, in association with an identifier associated with the respondent, in a database 21 for use by the network processing component 507 and the linking component 503 in the manner described above.
a shows an exemplifying profiling network N1, according to which links to questions Q1-Q10 and to recipient categories C1-C3 have been set by the network generating component 501 on the basis of linking rules stored in the database 20. For example, a question Q1 can be answered with three alternative answers A1(1), A1(2), and A1(3). For example Q1 can be “Do you use milk products?”, A1(1) can be “Yes, very much”, A1(2) can be “Yes quite a lot”, and A1(3) can be “A little”. It can be seen from
In the foregoing, each link that set according to an answer to a question is a “positive link” that gives importance to a linked question or recipient category. It is also possible that a link is a “negative link” that decreases the importance of a linked question or recipient category, whereby a certain answer to a certain question decreases importance, i.e. ranking, of another question or a recipient category. Such link rankings can be pre-specified or specified on the basis of response messages; for example, in relation to the latter scenario, a lack of answer can be defined to represent a situation in which no link is set or the lack of answer can be defined to correspond to setting a link in a same manner as an answer. For example, a link from Q3 to Q6 can be conditional upon Q3 being answered with A3(1), links from Q3 to Q9 and to Q10 can be conditional upon Q3 being answered with A3(2), and a link from Q3 to Q8 can be conditional upon a null response in respect of Q3.
In the profiling network N1 created by the network generating component 501 and shown in
It is also possible to assign constant or question-specific initial rankings to all the answers to questions Q1-Q10 and to assign constant or recipient category specific initial rankings to all the recipient categories C1-C3. Without limiting generality, it can be assumed that for questions having no initial link from another question, R0(Qi)=0. The same applies for the recipient categories. It is also possible that a certain question or a certain recipient category has a negative initial ranking. A negative initial ranking means a purposive reduction of importance of an answer to a particular question or recipient category (and of answers to those questions and/or recipient categories that are pointed by that question or recipient category provided that a higher value of link ranking is defined to mean higher importance (by contrast if a lower value of link ranking were defined to mean higher importance, the situation would be reversed)).
As described above, whilst the network generating component 501 creates a network N1 of the form shown in
As an alternative, the profiling network N1 can be pre-processed, that is to say that hypothetical paths can be traced through the network N1, each representing a set of answers relating to one or more respondents, thereby enabling any given respondent to be profiled on the basis of his answers in an expedient fashion. Whilst either situation is possible, for illustrative purposes it will be assumed that the responses have been received from respondents tracing the paths indicated in
R(Q1)=R0(Q1),
R(Q2)=R0(Q2),
R(Q3)=R0(Q3),
R(Q4)=R0(Q4),
R(Q5)=R0(Q5),
R(Q6)=R0(Q6)+R(Q1)+R(A1(1))/3+R(Q3)+R(A3(1)),
R(Q7)=R0(Q7)+R(Q1)+R(A1(1))/3+R(Q2)+R(A2(2))/2+R(Q4)+R(A4(2))/2+R(Q6)+R(A6(2))/2,
R(Q8)=R0(Q8)+R(Q1)+R(A1(1))/3+R(Q2)+R(A2(2))/2,
R(Q9)=R0(Q9)+R(Q4)+R(A4(2))/2+R(Q8)+R(A8(2))/2, and
R(Q10)=R0(Q10)+R(Q5)+R(A5(3))/2. (1)
Two things are to be noted in relation to this example:
Working with the set of question rankings of equation (1), the category rankings R(C1)-R(C3) of the recipient categories C1-C2, respectively, can be calculated, for example, as follows:
R(C1)=R0(C1)+R(Q7)+R(A7(2))
R(C2)=R0(C2)+R(Q6)+R(A6(2))/2+R(Q10)+R(A10(1))/2, and
R(C3)=R0(C3)+R(Q8)+R(A8(2))/2+R(Q9)+R(A9(2))+R(Q10)+R(A10(1))/2+R0(Q5)+R(A5(3))/2. (2)
The network processing component 507 is arranged to profile respondents into the recipient categories C1-C3 on the basis of the category rankings R(C1)-R(C3) calculated from the responses, and a measure of correlation of a given respondent with each category is given by the values output in relation to equations (2). Thus the output of equations (2) indicates which one of the recipient categories C1-C3 matches best with the recipient or recipients.
If for example R(C1)>R(C2)>R(C3), the recipient category C1 matches best with the recipient or recipients and the recipient category C2 matches secondly best with the recipient or recipients (if a higher value of ranking is defined to mean higher importance). If for example R(C1)=R(C2)>R(C3) and there is a need to select one recipient category, the selection between C1 and C2 can be made, for example, on the basis of demographic or other data related to the recipient or recipients.
It is also possible to select a pricing structure that is used for pricing services or products on the basis of rankings of the recipient categories calculated for a recipient or recipients. For example, a mobile operator that is financed with e.g. commercials related to outdoor activities may use more customer-friendly pricing policy for those subscribers (recipients) whose ranking of a recipient category “interested in outdoor activities” is above a pre-determined limit value than for other subscribers in order to maintain and strengthen customer connections with those subscribers who are good targets for advertising campaigns distributed by the mobile operator. It is also possible to select or tailor an action that will be targeted to one or more recipients on the basis of a ranking assigned to a certain recipient category or rankings assigned to certain recipient categories.
For example, the recipient category may be “Environmental mindset”, as exemplified in
In a method according to an embodiment of the invention, the rankings of respondents in respect of the various categories are sent to an external device in order to enable the external device to select the one or more recipients to be targets of a pre-determined action as a response to a situation in which the category rankings of the recipient categories fulfil a pre-determined condition.
As described above, a profiling network N1 is created on the basis of linking information stored in the database 20 (or specified in real-time, as a network is being built). Thus, whilst the database 20 might contain a set of questions relating to a variety of different products, when a profiling network N1 is being generated in relation to a given product type, the linking rules are likely to specify a type of question, namely one suitable to the given product type, so as to generate a network of questions that are relevant to the product in question.
It is also possible that one or more questions are excluded from the network N1 because an advertiser, who has ordered an advertisement campaign in relation to a given product, is not willing to pay for messages containing questions Q8 and Q9 to be sent to recipients. For example, in the situation shown in
For the example shown in
R(Q1)=R0(Q1),
R(Q2)=R0(Q2),
R(Q3)=R0(Q3),
R(Q4)=R0(Q4),
R(Q5)=R0(Q5),
R(Q6)=R0(Q6)+R(A1(1))/2+R(Q1)+R(A3(1))+R(Q3),
R(Q7)=R0(Q7)+R(A1(1))/2+R(Q1)+R(A2(2))+R(Q2)+R(A4(2))+R(Q4)+R(Q6)+R(A6(2))/22, and
R(Q10)=R0(Q10)+R(A5(3))/2+R(Q5). (3)
The rankings of the recipient categories C1-C2 can be calculated, for example, as follows:
R(C1)=R0(C1)+R(A7(2))+R(Q7),
R(C2)=R0(C2)+R(A6(2))/2+R(Q6)+R(A10(1))/2+R(Q10), and
R(C3)=R0(C3)+R(A10(3))/2+R(Q10)+R(Q5(3))/2+R(Q5). (4)
By comparing equations (3) and (4) with equations (1) and (2) it can clearly be seen that the category rankings R(C1), R(C2), and R(C3) may have different values when Q8 and Q9 are omitted from the network N1 even if the initial link rankings R0(Q1)-R0(Q10) were the same in both networks.
As mentioned above, in addition or as an alternative to rankings, links may carry link weightings. When link weighting information is specified, the network processing component 507 is arranged to multiply link rankings and categories rankings with link weighting factors so as to generate category values.
Each link has been associated with a link weight factor, e.g. w(Q4, Q1) that can be used for increasing or decreasing a level of link ranking from a pointing question (or recipient category) to a pointed question (or recipient category). Values of the link weighting factors can be defined, for example, on the basis of demographic data related to the recipients. The equations for the rankings can be formulated, for example, as follows:
R(Q1)=R0(Q1),
R(Q2)=R0(Q2)+w(Q2,Q6)×R(Q6),
R(Q3)=R0(Q3),
R(Q4)=R0(Q4)+w(Q4,Q1)×R(Q1)+w(Q4,Q3)×R(Q3),
R(Q5)=R0(Q5)+w(Q5,Q1)×R(Q1)+w(Q5,Q2)×R(Q2),
R(Q6)=R0(Q6)+w(Q6,Q5)×R(Q5)+w(Q6,C2)×R(C2),
R(C1)=R0(C1)+w(C1,Q5)×R(Q5) and
R(C2)=R0(C2)+w(C2,Q4)×R(Q4)+w(C2,C1)×R(C1). (5)
Equations (5) cannot be solved directly in the same manner as equations (1) and (2), and equations (3) and (4), because the ranking R(Q2) depends on the question ranking R(Q6) that depends on the question ranking R(Q5) that depends in turn on the question ranking R(Q2), i.e. there is at least one closed loop of links Equations (5) can be presented in the matrix form:
(I−A)×R=R0, (6)
where R is a ranking vector [R(Q1), R(Q2), R(Q3), R(Q4), R(Q5), R(Q6), R(C1), R(C2)]T (T=transposition), R0 is a known initial ranking vector [R0(Q1), R0(Q2), R0(Q3), R0(Q4), R0(Q5), R0(Q6), R0(C1), R0(C2)]T, I is a unit matrix, and A is a matrix whose non-zero elements are the link weight factors, e.g. w(C2, C1), presented in equations (5). Equation (6) can be solved with standard methods of the linear algebra, e.g. by forming an inverse matrix (I−A)−1 or with an iterative method.
Whilst the link rankings associated with the various answers to the questions are not shown in
In this example the link rankings assigned to respective links between questions is calculated by the linking component 503 on the basis of the total number of responses received and numbers of responses matching the various possible answers to a respective question. Thus in the case of answers A1(1) and A1(2) to Q1, the link ranking associated with A1(1) is N1/(N1+M1) and that associated with A1(2) is M1/(N1+M1).
The equations for the question and category rankings can be formulated, for example, as follows:
R(Q1)=R0(Q1),
R(Q2)=R0(Q2),
R(Q4)=R0(Q4)+w(Q4,Q1)×N1/(N1+M1)+R(Q1),
R(Q5)=R0(Q5)+w(Q5,Q1)×M1/(N1+M1)+w(Q5,Q2)×N2/(N2+M2)+R(Q1)+R(Q2),
R(Q6)=R0(Q6)+w(Q6,Q2)×M2/(N2+M2)+R(Q2),
R(C1)=R0(C1)+w(C1,Q5)×N5/(N5+M5)+R(Q5), and
R(C2)=R0(C2)+w(C2,Q5)×M5/(N5+M5)+w(C2,Q6)+R(Q5)+R(Q6). (7)
Any given link weight factor, e.g. w(Q4, Q1), shown in
The link rankings that can be calculated from equations (7) can be interpreted to represent average link rankings for all those recipients that have answered the questions Q1, Q2, Q4, Q5, and Q6.
Whilst in the above embodiments the content items are shown as messages with content that can be paraphrased as questions, it is to be appreciated that the content items could comprise data having links (URL) to web sites and the like, and for which, clicking on a given URL has the effect of navigating the recipient to the web site. The web site can have rules associated therewith, which determine a response based on the user action. There might be several possible responses, each associated with a particular URL, which are stored in the database 22 and processed in the manner described above. Further, each of the content items, in this case URLs to web sites, can be linked to other URLs to form the profiling network N1 in any of the manners described above.
As described above, the profiling system S2 comprises a set of computer software components, and these can be e.g. created in accordance with a procedural programming language or an object oriented programming language.
The components so created can be stored in a computer readable medium and/or distributed over a network by means of conventional transport techniques. The computer readable medium can be e.g. a CD-ROM (Compact Disc Read Only Memory) or a RAM-device (Random Access Memory).
While there have been shown and described and pointed out fundamental novel features of the invention as applied to embodiments thereof, it will be understood that various omissions and substitutions and changes in the form and details of the processes and devices described may be made by those skilled in the art without departing from the inventive idea defined in the independent claims. For example, it is expressly intended that all combinations of those process steps or device elements which perform substantially the same function in substantially the same way to achieve the same results are within the scope of the invention. Moreover, it should be recognized that process steps and device elements shown and/or described in connection with any disclosed form or embodiment of the invention may be incorporated in any other disclosed or described or suggested form or embodiment as a general matter of design choice. The specific examples provided in the description given above should not be construed as limiting. Therefore, the invention is not limited merely to the embodiments described above, many variants being possible without departing from the inventive idea defined in the independent claims.
Number | Date | Country | Kind |
---|---|---|---|
0811799.6 | Jun 2008 | GB | national |
08167423 | Oct 2008 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2009/058135 | 6/29/2009 | WO | 00 | 8/18/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2009/156520 | 12/30/2009 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20040095907 | Agee et al. | May 2004 | A1 |
20050044147 | Yap | Feb 2005 | A1 |
20070271272 | McGuire et al. | Nov 2007 | A1 |
20090006083 | Bachand | Jan 2009 | A1 |
20090158211 | Gogolak | Jun 2009 | A1 |
20100153832 | Markus et al. | Jun 2010 | A1 |
Number | Date | Country | |
---|---|---|---|
20120066225 A1 | Mar 2012 | US |
Number | Date | Country | |
---|---|---|---|
61076395 | Jun 2008 | US |