This application relates to a method of optimizing a message, a method for generating a modified message, a communications server equipment, a computer program, and a computer program product.
We live in a world where increasingly things get pushed to us via call to actions—these call to actions are frequently immediately actionable. Examples of such actions is clicking on sponsored links that are advertised on the right hand side of Google search results (or other search engines), responding to an SMS call to action by clicking on a WAP link on a handset or via responding through the sending of an SMS, or via calling a number. Our world is getting more and more congested with short marketing sentences that prompt consumers into action. The gratification mechanic on those actions becomes more and more immediate through the use of technology. Direct marketing agencies have long now tested the effectiveness of different calls to actions for the same marketing item, in an attempt to optimize the response rate. It is widely accepted that optimizing the way you phrase a call to action can have different effect on people. However, so far, no analytical method has been applied to actually compose the optimal way to say something. When the marketing message is only just a few words, such an analytical exercise is feasible, as alternatives are finite within a language.
The invention is set out in the claims. In one embodiment, a method of the invention comprises:
receiving an input message;
identifying message components of the input message;
applying at least one rule to modify at least one message component to create at least one variation of the input message;
sending a plurality of message variants each to a respective sample of users;
measuring a response criterion for each message variant; and
selecting a message variant according to the measured response criterion.
Embodiments optimize the response rate of any short marketing message which contains a call to action. A short marketing message may comprise 300 characters. The purpose, given any marketing pitch, is to be able to compose the optimal way of phrasing the short marketing message in order to maximize response rates. The method works because a majority of people within a group (which is sampled) react in non-intuitive manner to prompts, yet do so in a very consistent way that can be well defined, and if analyzed, predicted.
Marketing agencies and online marketing help software tools test the effect of various ways to phrase a short message in order to maximize the response. None of these methods use a mathematically rigorous algorithm that examines a wide variety of possible alternatives, tests them and arrive to the optimal composition of the short message that delivers an optimized response.
The method will be described in connection with the optimization of marketing messages that consist of a few words (about 300 characters or less), leading to an explicit, or implicit call to action. Examples of these calls to action are the following:
The call to action is measurable and is usually of a digital nature.
An example of the method will now be described with reference to a dyslexia related website.
Upon receiving a marketing message that contains a call to action of a digital nature as described above, the method requires the identification of different possible ways to phrase the message and to solicit the same action.
The above is achieved through utilizing lexical, syntactical, and grammatical software tools to identify a plurality of possible alternatives of phrasing the original sentence that is written by the marketer/advertiser. In other words, original words are replaced with possible synonyms within the said language, and possible re-positioning and re-phrasings take place. This gives a finite number of alternatives to phrase the message. Some of these alternatives may actually not make any grammatical or common sense to the consumer.
For example, with reference to a marketing pitch for a dyslexia related website (traffic builder), the input message is:
“FOUR KEY SKILLS THAT HELP WITH DYSLEXIA AND IMPROVE LEARNING”
An example of a couple of synonym alternative is the following:
“four IMPORTANT DEXTERITIES that ASSIST with DYSLEXIA and ENHANCE learning”
“four IMPORTANT SKILLS that HELP with DYSLEXIA and IMPROVE KNOWLEDGE” etc.
After all the synonym alternatives are created, the syntactical alternatives are built. For example:
“HELP DYSLEXIA AND IMPROVE LEARNING WITH FOUR KEY SKILLS”
“IMPROVE LEARNING AND HELP DYSLEXIA WITH FOUR KEY SKILLS”
“HELP DYSLEXIA! IMPROVE LEARNING WITH FOUR KEY SKILLS”
The final step in generating all possible permutations is to eliminate possible words or paraphrase existing phrases. For example:
“HELP FOR DYSLEXIA! KEY SKILLS THAT IMPROVE LEARNING.” (four is omitted)
“FOUR KEY SKILLS THAT HELP WITH DYSLEXIA AND MAKE LEARNING EASIER”
The end result of the above step is a finite number of alternative messages which can act as marketing pitches for the same weblink that corresponds to the dyslexia website.
After all different alternatives are listed, the non-sensical alternatives are eliminated.
A grammatical check is performed to identify the non-sense alternatives. These need to be eliminated as possible ways to phrase the same marketing message whose optimized composition is being sought. Following the above example, the following computer generated alternative would have to be eliminated as it makes no sense in English:
“FOUR SKILLS IMPORTANT HELP THAT DYSLEXIA AND LEARNING IMPROVE”
The result of this step is to be left with a finite set of legitimate message alternatives for the same original input message.
The legitimate message alternatives are then tested on sample groups of consumers—the sample sizes are sufficient to provide “statistically significant” results given the expected response rate range.
Different message versions are tested in similar size groups that are statistically significant and representative of the ultimately targeted group. It is important to note that individual characteristics are pitted against its variations while keeping all other variables constant. In our original example, let's assume that after our test we derive the following:
“FOUR SKILLS THAT HELP WITH DYSLEXIA AND IMPROVE LEARNING”—1.8%
“FOUR SKILLS THAT HELP WITH DYSLEXIA AND ENHANCE LEARNING”—2.1%
The above proves that the word “ENHANCE” is more effective than the word “IMPROVE”, at least in that particular sequence of words. Alternatively a test can “pit” syntactical choices against others—choices on the ordering/sequencing of words in the said sentence. For example:
“FOUR SKILLS THAT IMPROVE LEARNING AND HELP WITH DYSLEXIA”—3%
“FOUR SKILLS THAT HELP WITH DYSLEXIA AND IMPROVE LEARNING”—1.1%
The above proves that talking about learning before dyslexia, at least with this particular choice of words makes a lot more sense and delivers better response rate. Similar testing is performed on a wide variety of variations to identify best of breed components (“winning ingredients”).
Different winning ingredients are combined to compose second and subsequent rounds of messages for testing till the optimization is complete—at that point the invention will have delivered the optimal way to deliver the particular message, in a few words, working with the limitations and information that the original marketer has put in.
So if we take into consideration the results in our example above, we can combine the two elements that we know are more powerful in a more “potent” message—one that uses the word ENHANCE and the sequence of words that is identified as better by the second test. Thus the composed “super message”, if all other variables were not to be taken into consideration would be the following:
“FOUR SKILLS THAT ENHANCE LEARNING AND HELP WITH DYSLEXIA”—3.5%
Every change is essentially reduced to what can be called an independent variable and is examined as a factor that affects the dependent variable which is the response rate of the marketing message. Additional rounds of testing are done (as many as needed) in order to derive the optimal message that would consist of the optimal choice of words, in the right sequence and with the necessary, if any, paraphrasing of the original message written by the marketer.
Numerous statistical methods and techniques can be used in order to derive the optimal message and to logically navigate the steps above—these include but are not limited to multivariate analysis, regression, correspondence analysis and redundancy analysis.
The above method allows testing of finite components. These are readily identified and controlled within the context of text advertising and within a constrained number of characters. The testing described allows for a) identification all possible message alternatives and b) the identification of the relative importance of the various message components (or independent variables) with respect to the response rate.
The method is based upon that understanding that small components and changes that “common sense” would dictate as not having any impact in the final response rate, actually do—and in fact up to an incredible degree.
Given the above broad explanation of the method disclosed herein, a more rigorous example will now be described to highlight some additional details.
The components of an input message are identified. These are treated as independent variables which each influence the response rate. The components are identified as A,B,C,D, etc. (for example A is how to say FRUIT)—synonyms is not the only way to go, sometimes related terms can achieve better response rates. The message components can comprise more than one word for example an alternative for the variable IMPROVE can be MAKE THINGS BETTER. Also, message component alternatives can be the call to action of the origination address. That is, the originating address can be considered as another message component, e.g. am SMS message addressed from Vodafone, can be compared to an SMS message addressed from the short number 444.
For each independent variable A, there can be finite alternatives, A(1) to A(a)—for example A(1) is FRUIT, A(2) is PRODUCE, etc.—the size of a depends on the alternatives that are found when implementing the method.
So a short message input into the system is reduced to a vector such as {A, B, C, . . . }. The lexical combinations then become {A(a), B(b), . . . }—essentially a vector of variable sized vectors.
The syntactical combinations that are possible come from the re-arrangement of different vectors within the vector of vectors. So {D(d), A(a), E(e) . . . } can become an alternative that simply signifies ordering the lexical parts of the short message in a different way. All the syntactical combinations of the lexical alternatives define the finite space within which we are optimizing.
Assuming that after constructing a sentence (which is a value of the vector of vectors), it is checked for whether it makes sense or not as it is computer generated. Only sensible alternatives are checked as the test subjects samples are real consumers, sampled from the group of people for which we wish to be optimizing. Keep in mind that this sampling can occur as sampling theory dictates, in a different way for say the subscriber base of a mobile operator compared to the people that are searching for BMW on Google in the United States.
In one embodiment, we first generate and test ALL lexical elements and define the most potent values for {A(a), B(b) . . . }—say that for a certain short message, after the lexical testing of components you reach {A(1),B(3),C(2),D(1)}. For simplicity purposes, let's assume that there are only 4 lexical components.
Then, we generate all ordering permutations, i.e. The syntactical combinations, but of the winning lexical ingredients only. From them we eliminate the ones that are non-sensical, i.e. fail the grammar check. Consider the example where the following message variants are derived as sensible:
These message variants are then each tested in respective groups of subjects to identify the most potent message.
In a further embodiment the testing of lexical and syntactical variations is performed at the same time.
In yet a further embodiment, different variations are tested in separate rounds of testing. Indeed, instead of testing messages and ingredients one at a time, particular combinations are tested and the various importances of the independent variables assessed. This can reduce the total amount of testing required.
The above described embodiments allow a short message to be reduced into various message components that affect the response rate. The message components are handled as independent variables. This allows for the testing of particular components and the combining of favourable traits. In this way a message can be optimized to have the best response criterion in much the same way that a species evolves to fill an ecological niche.
If the threshold criterion is not met, then the winning message is modified at 540 to create a variant message. The variant message is sent to a respective subset of users at 550 and a response criterion is measured at 560. At 572, the response criterion measured at 560 is compared to the response criteria of the first message measured at 530 (or the previous most successful message measured at 572 in a previous iteration) and a winning message selected at 574 or 576. The process then returns to 580 and a determination is made as to whether a threshold criterion has been met.
In this way, the process of
The method of
The identification of optimum components and creation of new generations of optimized message variants which are then tested can be repeated in a plurality of iterations until a threshold criterion is met and an optimized message is generated.
As discussed above, message variants may be created by defining key components of the message, treating the key components as variables, and identifying finite alternatives for such key components. In the embodiments described below, a key component that can be manipulated is referred to as a “gene.” A gene is a sentence part or message part that can be manipulated in one or more ways. As will be discussed below, there also can be “intangible genes” that represent attributes of a message.
Examples of types of genes are as follows:
As an example, take the message:
“Surprise! Are you ready for the real mobile internet? 500 MB of internet+unlimited Vodafone live+2 months free for only $3.99/month! Just send YES to 400!”
One way in which the above messages can be disassembled into genes is as follows:
“Intro” gene: Surprise!
“Call to Action” gene: Just send YES to 400!
“Product 1” gene: 500 MB of internet+
“Product 2” gene: unlimited Vodafone live+
“Price” gene: for only $3.99/month
An equivalent message may or may not be semantically the same as the original message, but it essentially conveys the same message. For example, if the original message is a product offer, an equivalent message would have the same product offer but might phrase it differently.
Message variants are created by applying the rules (step 830). The number of potential message variations grows exponentially with the number of genes. For example, if there are seven genes with five possible values each, then there are 78,000 variations of the message. It is not always practical to test a large number of variations, and, in such cases, a subset of message variants is identified to test (step 840). As will be discussed in more detail below, experimental design is used in one embodiment to identify the subset.
A supervised learning method is applied to identify the best combination of gene values for the message. Specifically, the identified message variations are tested by sending each variation to a representative sample of people (step 850), and the response rate is measured for each message variant (step 860). In one embodiment, the response rate is measured for each message variant by dividing the number of people who responded to the message by the number of people to whom the message was sent. What is considered a response varies and depends on the call to action in the message (e.g., call a number, send an SMS, click on a link). The response rate is used to identify the best message (step 870). As will be discussed with respect to
As stated above, when a message designer creates genes and values, the number of potential combinations grows exponentially with the number of genes and often gets to numbers that do not make sense to test. In one embodiment, a multivariate analysis method, such as experimental design, is used to choose a representative subset of message variants to test. For purposes of this discussion, a “design” for a message is the chosen representative subset along with the algorithm used to produce the subset. Types of designs that can be used include:
Those skilled in the art will appreciate that other forms of multivariate analysis can be used instead of experimental design.
As illustrated in
First, a message designer may test several complete messages to determine the starting message that will be optimized (step 1010). Initially, the message designer may send a variety of messages to a relatively small sample of recipients, and then send the “uppermost” messages (i.e., the messages with the best responses rates) to a larger population to confirm the message rankings. The message with the best response rate is usually chosen as the starting message.
The message designer then identifies genes and defines rules that specify alternate values or positions for the genes (step 1020). Variations of the message are created using the rules (step 1030), and an experimental design algorithm (e.g., a D-Optimal algorithm) is applied to select a representative subset of message variations to test (step 1040). The subset is then sent to a representative sample of people and response rates are measured for each message variations tested (step 1050). An algorithm, such as regression analysis, is applied to the response results to identify the best performing gene values and interactions between gene values (step 1055). Other types of algorithms that can be used include (but are not limited to) gradient descent and genetic algorithms.
The message designer then essentially repeats steps 1020-1055 by “drilling down” from coarse to fine on the best performing gene values. For example, the designer may concentrate on specific words instead of larger phrases or concentrate on other more subtle differences. In other words, a message designer may define genes that relate to specific words (instead of larger phrases) or other subtle differences.
In the preferred embodiment, steps 1020-1060 are used to determine the best value(s) for each gene. Then, in step 1070, various combinations of the best gene values are created, and such combinations are tested to find the combination that works best (step 1080). One reason that multiple combinations may be created and tested is to ensure that combinations of gene values work well together as a whole sentence or message. Individual gene values may test well, but may not work well together. This step is used to identify the combination of gene values that work the best. Another reason for testing multiple combinations is that sometime genes have two close values that performed well, and it is desirable to confirm the best values and interactions by testing them on a larger population. The combination with the best response rate is identified as the most potent message.
In one embodiment, the method illustrated in
The messages tested in each of phases 2-4 are based on the performance of messages tested in the previous phase. For example, the messages created and tested in phase 2 are based on the performance of messages in phase 1, the messages created and tested in phase 3 are based on the performance of the messages in phase 2, and the messages created and tested in phase 4 are based on the performance of the messages in phase 3. Those skilled in the art will appreciate that there may be less or more than four phases.
The method of
In one embodiment, values for genes are modelled by an ontology and are categorized into “families” of words. Words with equivalent meaning or effect (or are otherwise related according to the ontology) may be categorized into the same family. For example, opening words for a message, like “Congrats!” and “Felicitations!” would typically be in the same family.
If a phrase in a message is defined as a gene, such phrase can include additional genes. This is referred to as a “gene within a gene.”
The rules for replacing the string value of contained genes can either be different for each container gene value (step 1420a-1) or they can be the same for all container values (step 1420a-2).
Context-free grammars (or a similar construct) can be used to define rules for contained genes that apply across container gene values. Below is an example of a rule (taken from the example in
PRIZE gene→Surprise! You can <wingain>$40 tonight and <Car_Prize>. Call now!|Surprise! You can <wingain><Car_Prize> and $40 tonight. Call now!
Car_Prize→car|Volvo
Wingain→win|gain
The left side variable can be replaced by any string on the right side. On the right side, there can be a variable as well, thereby enabling a hierarchical structure. When there are multiple values for container genes, context-free grammars (or a similar construct) enable rules for manipulating container genes and contained genes to be written more efficiently.
In one embodiment, “intangible genes” are used to optimize a message. “Intangible genes” represent attributes of a message. Examples of attributes represented by intangible genes include the following:
Multiple intangible genes can be combined in a single message. For example, a sentence can be both in imperative and show loss aversion. In one embodiment, a statistician, using experimental design methods, gives the message designer a subset of all possible combinations of intangible genes. For instance, the statistician may provide the message designer with the following subset:
Message 1: Formal+question+loss aversion
Message 2: informal+questions+winning feeling
The message designer then creates messages with the above attributes that are as similar as possible in other attributes. An example of a message with the attributes of message 1 above is:
Dear Sir, would you like to win a unique 4 MB internet plan offer at $5/month? Please reply TES' to purchase.
The created messages are tested to find which attributes worked well and which combination of attributes worked well. Once the best attributes and combinations are identified, the wording of the message is tested in the ways described above with respect to
The message optimization system 1500 includes a graphical user interface 1510 for defining genes and values. The GUI is a software tool that enables a message designer to define genes and values, such as the GUI illustrated in
An Experimental Design Analyzer 1520 performs the experimental design analysis to derive a design. A message designer uses the GUI 1510 to create the message variants based on the design. Alternately, messages may be generated automatically by a software module that uses the design and gene values as input. The messages are sent to a representative sample of people via Message Sending Interface 1540. A Response Aggregator 1550 tracks and aggregates messages responses. A Regression Analysis module 1560 performs regression analysis on response rates for applicable messages.
In an alternate embodiment of the invention, the steps of defining genes and gene values, as well as creating combinations of the best gene values, can be performed automatically by a computer program instead of manually by a message designer.
The methods described herein are not limited to optimizing advertisement/promotional messages. They can be used to optimize articles, books, and other compilations of words.
The invention is not restricted to the features of the described embodiments. It will be readily apparent to those skilled in the art that it is possible to embody the invention in specific forms other than those of the described embodiments above.
Number | Date | Country | Kind |
---|---|---|---|
0922608.5 | Dec 2009 | GB | national |
This application is a continuation of U.S. patent application Ser. No. 13/517,032, titled “Message Optimization” filed on Jun. 18, 2012, which is a U.S. National Stage of PCT/EP2010/006920 under 35 USC 371 (filed on Nov. 12, 2010), which claims priority from United Kingdom Application 0922608.5 filed on Dec. 23, 2009. The contents of parent application Ser. No. 13/517,032 are incorporated by reference as if fully disclosed herein.
Number | Name | Date | Kind |
---|---|---|---|
4930077 | Fan | May 1990 | A |
5173854 | Kaufman et al. | Dec 1992 | A |
5652828 | Silverman | Jul 1997 | A |
6278967 | Akers et al. | Aug 2001 | B1 |
6400996 | Hoffberg et al. | Jun 2002 | B1 |
6470306 | Pringle et al. | Oct 2002 | B1 |
6647383 | August et al. | Nov 2003 | B1 |
6687404 | Hull et al. | Feb 2004 | B1 |
6760695 | Kuno et al. | Jul 2004 | B1 |
6934748 | Louviere et al. | Aug 2005 | B1 |
6978239 | Chu et al. | Dec 2005 | B2 |
7006881 | Hoffberg et al. | Feb 2006 | B1 |
7013427 | Griffith | Mar 2006 | B2 |
7130808 | Ranka et al. | Oct 2006 | B1 |
7133834 | Abelow | Nov 2006 | B1 |
7137126 | Coffman | Nov 2006 | B1 |
7167825 | Potter | Jan 2007 | B1 |
7181438 | Szabo | Feb 2007 | B1 |
7302383 | Valles | Nov 2007 | B2 |
7363214 | Musgrove et al. | Apr 2008 | B2 |
7406434 | Chang | Jul 2008 | B1 |
7493250 | Hecht et al. | Feb 2009 | B2 |
7769623 | Mittal et al. | Aug 2010 | B2 |
7797699 | Kagi et al. | Sep 2010 | B2 |
7945573 | Barnes et al. | May 2011 | B1 |
8260663 | Ranka et al. | Sep 2012 | B1 |
8689253 | Hallberg et al. | Apr 2014 | B2 |
9268769 | Shalit | Feb 2016 | B1 |
9741043 | Vratskides | Aug 2017 | B2 |
20020156688 | Horn et al. | Oct 2002 | A1 |
20020174144 | Wolpe | Nov 2002 | A1 |
20050027507 | Patrudu | Feb 2005 | A1 |
20050039116 | Slack-Smith | Feb 2005 | A1 |
20050060643 | Glass et al. | Mar 2005 | A1 |
20050076003 | DuBose et al. | Jul 2005 | A1 |
20050234850 | Buchheit et al. | Oct 2005 | A1 |
20060155567 | Walker et al. | Jul 2006 | A1 |
20060212524 | Wu et al. | Sep 2006 | A1 |
20060259360 | Flinn et al. | Nov 2006 | A1 |
20070033002 | Dymetman | Feb 2007 | A1 |
20070055931 | Zaima et al. | Mar 2007 | A1 |
20070127688 | Doulton | Jun 2007 | A1 |
20070153989 | Howell et al. | Jul 2007 | A1 |
20070168863 | Blattner et al. | Jul 2007 | A1 |
20070233566 | Zlotin et al. | Oct 2007 | A1 |
20070239444 | Ma | Oct 2007 | A1 |
20070260740 | Guan | Nov 2007 | A1 |
20070281286 | Palacios | Dec 2007 | A1 |
20080059149 | Martin | Mar 2008 | A1 |
20080109285 | Reuther et al. | May 2008 | A1 |
20080178073 | Gao et al. | Jul 2008 | A1 |
20080281627 | Chang et al. | Nov 2008 | A1 |
20080300857 | Barbaiani et al. | Dec 2008 | A1 |
20080313259 | Correa | Dec 2008 | A1 |
20090110268 | Dejean et al. | Apr 2009 | A1 |
20090150400 | Abu-Hakima et al. | Jun 2009 | A1 |
20090177750 | Lee et al. | Jul 2009 | A1 |
20090210899 | Lawrence-Apfelbaum et al. | Aug 2009 | A1 |
20090226098 | Takahashi et al. | Sep 2009 | A1 |
20090254836 | Bajrach | Oct 2009 | A1 |
20090276419 | Jones et al. | Nov 2009 | A1 |
20090313026 | Coffman | Dec 2009 | A1 |
20100100817 | Trotter | Apr 2010 | A1 |
20100241418 | Maeda et al. | Sep 2010 | A1 |
20100312769 | Bailey et al. | Dec 2010 | A1 |
20100312840 | Chestnut et al. | Dec 2010 | A1 |
20110040611 | Simmons et al. | Feb 2011 | A1 |
20120095831 | Aaltonen et al. | Apr 2012 | A1 |
20120259620 | Vratskides | Oct 2012 | A1 |
20130311269 | Forte | Nov 2013 | A1 |
Number | Date | Country |
---|---|---|
2001048666 | Jul 2001 | WO |
2011076318 | Jun 2011 | WO |
Entry |
---|
International Search Report and Written Opinion PCT/EP2010/006920, 10 pages, dated Feb. 7, 2011. |
International Preliminary Report on Patentability PCT/EP2010/006920, 9 pages, dated May 15, 2012. |
“Drop-Down list”, Dec. 21, 2008. |
Hill Associates, “Weighted and Round Robin Queuing”, Jul. 14, 2007. |
Katevenis, M., et al., “Weighted Round-Robin Cell Multiplexing in a General-Purpose ATM Switch Chip”, Oct. 1991, IEEE Journal on Selected Areas in Communication, vol. 9, No. 8. |
Lee, Jinkyu, et al., “Multiprocessor Real-Time Scheduling Considering Concurrency and Urgency”, Special Issue of the Work-in-Progress (WIP) Session at the 2009 IEEE Real-Time Systems Symposium, Washington, D.C., Dec. 1-4, 2009. |
Sawyer, Alan G., “Statistical Power and Effect Size in Consumer Research”, in NA—Advances in Consumer Research vol. 09, eds, 1982, pp. 1-7. |
“Slider (Computing)”, Sep. 27, 2007. |
Number | Date | Country | |
---|---|---|---|
20170316430 A1 | Nov 2017 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13517032 | US | |
Child | 15653974 | US |