The present technology is in the field of computer systems and, more specifically, related to serving additional content in response to a promoted phrase being spoken.
While an advertisement is being played, there is a short window of time during the advertisement in which a user may interact with the advertisement. Although the user may be interested in the advertisement in the future, the user may not be interested in the advertisement now. In this situation, an advertiser misses engagement opportunities due to the timing of the advertisement.
Systems and methods are disclosed that enable a user to speak a promoted phrase in response to a voice advertisement that includes the promoted phrase. The user will then be provided additional advertisement content. According to various examples, detection of the promoted phrase is enabled once the voice advertisement ends. According to various examples, the additional advertisement content is related to the voice advertisement or the promoted phrase. According to various examples, detection of the user speaking the promoted phrase is done within a time frame. Once the time frame has lapsed, detection of the user speaking the promoted phrase may be disabled. This avoids unexpected future delivery of the additional content triggered by the user or others. Additionally or alternatively, detection of the user speaking the promoted phrase can be disabled in response to detecting the user speaking the promoted phrase.
According to various examples, the user interacts with the device before detection of the spoken phrase. For example, the user presses a button on a device to enable detection of the spoken phrase. Releasing the button disables detection. By requiring a button for detection of the spoken phrase, ambient speech does not cause false positive detection if the phrase or a similar-sounding phrase is spoken. Avoiding false positives improves user experience and therefore positive reviews of devices and services, resulting in higher sales.
According to various examples, detection of the promoted phrase may be generally enabled or disabled. For example, the user may choose to have promoted phrase detection enabled as a feature within a device that supports voice interaction. In some systems, disabling phrase detection aids in preserving privacy or limiting access to content that might be inappropriate for some settings or for some types of people such as children. This enables sales to users and use case markets that otherwise could not be reached.
According to various examples, before additional advertisement content is served, a determination is made whether the same user that listened to the voice advertisement also spoke the promoted phrase. Detecting which user spoke the promoted phrase can be performed with voice fingerprinting; with beamforming using multiple microphones to detect the relative direction of speech; with presence detection using radio frequency or other types of electromagnetic signals, such as Bluetooth; or by camera, radar, or similar ways of sensing the presence of people. Additional advertisement content is served when the same user that spoke the promoted phrase also was served the voice advertisement. Delivering content conditionally, only to the same user that listened to the voice advertisement, ensures that the listener is aware of the context of the advertisement when receiving the additional content. That improves user satisfaction and advertisement success. According to various examples, serving additional advertisement content may depend on the other people and/or devices that could hear or record the additional advertisement content.
According to various examples, the additional advertisement includes a call to action (CTA). This encourages consumers to further engage with content from a vendor, which increases the likelihood of a sale conversion and, therefore, profitability for the vendor and the service that delivers the advertisements and additional content. According to various examples, the additional advertisement includes additional questions for the user and the response to the user is based on the additional questions. Additional questions also encourage user engagement and advertisement conversion to sales.
The following describes various examples of the present technology that illustrate various interesting aspects. Generally, examples can use the described aspects in any combination. Statements herein reciting principles, aspects, and examples are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents and equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.
It is noted that, as used herein, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Reference throughout this specification to “one,” “an,” “certain,” “various,” and “cases,” “examples” or similar language means that a particular aspect, feature, structure, or characteristic described in connection with the example is included in at least one embodiment of the invention. Thus, appearances of the phrases “in one case,” “in at least one example,” “in an example,” “in certain cases,” and similar language throughout this specification may, but do not necessarily, all refer to the same embodiment or similar embodiments. Furthermore, aspects and examples of the invention described herein are merely exemplary, and should not be construed as limiting of the scope or spirit of the invention as appreciated by those of ordinary skill in the art. The disclosed invention is effectively made or used in any example that includes any novel aspect described herein. Furthermore, to the extent that the terms “including”, “includes”, “having”, “has”, “with”, or variants thereof are used in either the detailed description or the claims, such terms are intended to be inclusive in a similar manner to the term “comprising.” In examples showing multiple similar elements, even if using separate reference numerals, some such examples may work with a single element filling the role of the multiple similar elements.
The following describes systems of process steps and systems of machines and components for serving additional advertising content in response to a promoted phrase being spoken by a user while the promoted phrase was part of a voice advertisement. Some implementations use computers that execute software instructions stored on non-transitory computer readable media. Examples below show design choices for various aspects of such systems. In general, design choices for different aspects are independent and can work together in any combination.
Referring now to
The promoted phrase can be any phrase within the content or advertisement that can be used to trigger follow-on content, advertisement, interaction, or one or more questions. For example, when the advertisement is “Melody Basset Hound Insurance, say ‘best friend’ to keep you safe” then the promoted phrase can be “best friend.” In another example, when the advertisement is “Acoustic Plott Hound Banking, just say ‘houndecoin’ to open your account”, then the promoted phrase can be “houndecoin”. According to various examples, the promoted phrase can be any sound a human can make. For example, the advertisement could be “for the best Halloween experience, scream” and the promoted phrase could be someone screaming. For another example, the advertisement could be “for the funniest comedy show you have seen, laugh” and the promoted phrase could be laughter. For another example, the advertisement could be “for your car to be as clean as a whistle, whistle” and the user whistling could be the promoted phrase. According to various examples, an advertisement may contain multiple promoted phrases. According to various examples, a user may specify the promoted phrase. For example, when the voice advertisement is “Melody Basset Hound Insurance, say ‘best friend’ to keep you safe. You can also say what word you would like to use to get more information”, then the user can say “insurance” to specify the promoted phrase. Later, when the user says “insurance”, “insurance” will be understood to be the promoted phrase.
According to various examples, the voice advertisement may include a time frame to speak the promoted phrase. For example, the voice advertisement may be “Melody Basset Hound Insurance, say ‘best friend’ within two hours to keep you safe.” In another example, the voice advertisement may be “Acoustic Plott Hound Banking, just say ‘houndecoin’ any time today to open your account.”
According to various examples, the voice advertisement with promoted phrase may be created by any source. For example, a sales writer may write and/or speak the voice advertisement with promoted phrase and indicate what the promoted phrase is. For another example, the advertisement may be created by a machine learning algorithm trained with data about the user, other users, and/or potential users.
At step 104, a user speaks the promoted phrase from the voice advertisement. For example, when the advertisement is “Acoustic Plott Hound Banking, just say ‘houndecoin’ to open your account” and the user says “houndecoin”, an additional advertisement from Acoustic Plott Hound Banking about opening an account is served. According to various examples, a variation of the promoted phrase can be recognized as the promoted phrase. For example, when the promoted phrase is “home mortgage”, phrases such as “house mortgage”, “home loan”, “mortgage”, “mortgage rate”, etc. may be recognized as the promoted phrase. According to various examples, part of the promoted phrase may be recognized as the promoted phrase. For example, when the promoted phrase is “home mortgage”, the phrases “home”, “mort”, “hm”, etc. may be recognized as the promoted phrase. According to various examples, the promoted phrase may be a reference to an aspect of the advertisement. For example, when the user says “tell me more about the banking advertisement” then this could be recognized as the promoted phrase of the advertisement. For another example, when the user says “replay that mortgage ad” then this could be recognized as the promoted phrase of the mortgage advertisement. According to various examples, synonyms of the promoted phrase can be recognized as the promoted phrase. For example, when “home mortgage” is the promoted phrase, “house mortgage” could be recognized as the promoted phrase. According to various examples, similar meanings may be used to recognize the promoted phrase. For example, when “home mortgage” is the promoted phrase, the phrase “condo mortgage” may be recognized as the promoted phrase. According to various examples, the nationality and/or language of the user may be used to determine what is recognized as the promoted phrase. For example, when the promoted phrase is “home mortgage” and the user speaks Spanish, then “casa mortgage” may be added to the recognized promoted phrases.
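As one illustrative, non-limiting sketch, recognition of variations of a promoted phrase could be approximated by comparing a normalized transcript against a set of accepted phrases. The function names (normalize, build_accepted_phrases, matches_promoted_phrase) and the whole-word matching rule are assumptions made for illustration only; the present description does not prescribe any particular matching technique, and the sketch assumes a speech recognizer has already produced a text transcript.

```python
import re


def normalize(text: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    return " ".join(re.sub(r"[^a-z0-9\s]", " ", text.lower()).split())


def build_accepted_phrases(promoted_phrase, variations=()):
    """Return the normalized set of phrases treated as the promoted phrase."""
    candidates = [promoted_phrase, *variations]
    return {normalize(c) for c in candidates if normalize(c)}


def matches_promoted_phrase(utterance, accepted):
    """True if any accepted phrase occurs as a whole-word sequence."""
    padded = f" {normalize(utterance)} "
    return any(f" {phrase} " in padded for phrase in accepted)


# Hypothetical configuration: synonyms, similar meanings, and a
# language-specific variant are all supplied as plain variations.
accepted = build_accepted_phrases(
    "home mortgage",
    variations=["house mortgage", "home loan", "condo mortgage", "casa mortgage"],
)
```

Under these assumptions, a transcript such as “Tell me about a Home Loan, please” matches, while “I finished my homework” does not, because matching is done on whole words rather than on substrings.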
According to various examples, one device may deliver the voice advertisement and another device may listen for the promoted phrase. For example, an earphone may be used to deliver the advertisement and the microphone on a smart phone may listen for the promoted phrase to be spoken.
According to various examples, when unable to determine which promoted phrase was spoken by the user, the user may be prompted to clarify. For example, when a voice advertisement was served for both an automotive insurance advertisement and home insurance, and the user says the promoted phrase “insurance”, then the user may be prompted if they mean the automotive insurance or home insurance. The user's response will determine which additional advertisement to serve.
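The clarification behavior described above can be sketched, under stated assumptions, as follows. The ActiveAd record, the resolve function, and the returned action labels (“serve”, “clarify”, “ignore”) are hypothetical names introduced only for this illustration, and exact string comparison stands in for whatever phrase recognition a given implementation uses.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ActiveAd:
    ad_id: str            # hypothetical identifier for a served advertisement
    description: str      # human-readable label used in the clarifying prompt
    promoted_phrase: str


def resolve(spoken, active_ads):
    """Decide whether to serve content, prompt for clarification, or ignore."""
    spoken = spoken.strip().lower()
    candidates = [ad for ad in active_ads if ad.promoted_phrase.lower() == spoken]
    if len(candidates) == 1:
        return ("serve", candidates[0].ad_id)
    if len(candidates) > 1:
        options = " or ".join(ad.description for ad in candidates)
        return ("clarify", f"Do you mean {options}?")
    return ("ignore", None)


ads = [
    ActiveAd("auto-1", "automotive insurance", "insurance"),
    ActiveAd("home-1", "home insurance", "insurance"),
]
result = resolve("insurance", ads)
```

In this sketch, the user's answer to the clarifying prompt would then be used to select which additional advertisement to serve.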
At step 106, additional advertisement content is served. According to various examples, the additional advertisement content is delivered in response to the user speaking a promoted phrase. According to various examples, the additional advertisement content is related to the voice advertisement.
Referring now to
In accordance with various aspects and embodiments of the invention, the user speaks the promoted phrase and includes additional delivery information to allow for delivery of the additional content at a different time and via different means. For example, the user speaks the promoted phrase and states a date and time for re-delivery of the voice content or delivery of additional content. In accordance with various aspects and embodiments of the invention, the user can provide instructions on how to receive re-delivery or delivery of additional content, such as via any one or more delivery methods, including: a text message; an email message; a phone call; or mail to a physical address. The user's information may be stored and automatically provided or it may be provided by the user as part of the user's spoken words with the promoted phrase.
Referring now to
At step 304, the voice advertisement ends. According to various examples, step 304 may be the same or similar to step 204.
At step 306, a user speaks the promoted phrase from the voice advertisement within a time frame. According to various examples, the time frame is referenced to an event within the voice advertisement. According to various examples, the time frame is referenced to and relative to the intended duration of the voice advertisement. For example, the time frame is defined from the end of the advertisement, the beginning of the advertisement, the midpoint of the advertisement, the total length of time allotted for the advertisement, or when the promoted phrase is spoken, etc. According to various examples, the time frame may be defined from any source capable of defining the time frame. For example, the time frame may be specified globally for every advertisement, an advertiser may specify the time frame on a per-advertisement basis and/or an advertisement-group basis, the user may specify the time frame, the time frame may be determined based on user preferences and/or previous actions, etc. According to various examples, the time frame may be inclusive or exclusive of the end points. For example, if the time frame is set to be five minutes and the promoted phrase is spoken at five minutes, then the spoken promoted phrase may be determined to have been spoken within the time frame (i.e., inclusive) or not to have been spoken within the time frame (i.e., exclusive). According to various examples, speaking the promoted phrase may be the same or similar to step 104.
At step 308, additional advertisement content related to the promoted phrase is served. According to various examples, once the time frame has expired, when a user speaks the promoted phrase, the additional advertisement is not served. For example, if the time frame is specified as one hour after the voice advertisement ends, then if a user speaks the promoted phrase thirty minutes after the advertisement ends, the user would be served the additional advertisement. On the other hand, if a user speaks the promoted phrase two hours after the voice advertisement ends, the user will not be served the additional advertisement. According to various examples, step 308 is the same or similar to step 208.
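The time frame check, including the inclusive and exclusive end point variants, can be sketched as follows. This is a minimal illustration rather than a required implementation; the function name within_time_frame, its parameters, and the choice to reject a phrase spoken before the reference event are assumptions introduced here.

```python
from datetime import datetime, timedelta


def within_time_frame(spoken_at, reference, window, inclusive=True):
    """Return True if the phrase was spoken within `window` of `reference`.

    `reference` is whichever event the time frame is measured from (for
    example, the end of the voice advertisement). A phrase spoken before
    the reference event is rejected here, which is an assumption.
    """
    elapsed = spoken_at - reference
    if elapsed < timedelta(0):
        return False
    return elapsed <= window if inclusive else elapsed < window


ad_end = datetime(2024, 1, 1, 12, 0)   # hypothetical end of the advertisement
one_hour = timedelta(hours=1)

served_at_30_min = within_time_frame(ad_end + timedelta(minutes=30), ad_end, one_hour)
served_at_2_hours = within_time_frame(ad_end + timedelta(hours=2), ad_end, one_hour)
```

With a one-hour time frame, the phrase spoken thirty minutes after the advertisement ends falls within the time frame and the phrase spoken two hours afterward does not; a phrase spoken at exactly one hour is accepted only when the time frame is treated as inclusive.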
Referring now to
At step 406, a user interacts with the device. According to various examples, the user interaction informs the device to listen for a promoted phrase. According to various examples, any user interaction with the device may be considered an interaction. For example, a user can press a button, speak a wake-up phrase, move the device (e.g., shake, turn over, etc.), make a gesture toward the device (e.g., wave a hand), etc. It is appreciated that the device would have hardware to implement the user interaction. For example, to determine if a device is moved would require one or more sensors such as an accelerometer, a gyroscope, a camera, etc.
At step 408, a user speaks the promoted phrase from the voice advertisement. According to various examples, the user presses a button before speaking the promoted phrase. According to various examples, the user presses and holds a button before speaking the promoted phrase. According to various examples, step 408 may be the same or similar to step 104.
At step 410, additional advertisement content related to the promoted phrase is served. According to various examples, step 410 may be the same or similar to step 208.
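As a hedged sketch of the press-and-hold variant, detection can be gated on a button state so that ambient speech is ignored while the button is released. The class name ButtonGatedDetector and its methods are hypothetical, and simple substring comparison on a transcript stands in for the actual phrase recognition.

```python
class ButtonGatedDetector:
    """Detects the promoted phrase only while a button is held (illustrative)."""

    def __init__(self, promoted_phrase):
        self.promoted_phrase = promoted_phrase.lower()
        self.listening = False

    def press(self):
        # Pressing the button enables detection.
        self.listening = True

    def release(self):
        # Releasing the button disables detection.
        self.listening = False

    def hear(self, utterance):
        """Return True only if listening and the phrase is in the utterance."""
        return self.listening and self.promoted_phrase in utterance.lower()


detector = ButtonGatedDetector("houndecoin")
ambient = detector.hear("someone nearby says houndecoin")   # button not pressed
detector.press()
intended = detector.hear("Houndecoin")                      # button held
detector.release()
after_release = detector.hear("houndecoin")                 # button released
```

Other interactions described above, such as a wake-up phrase or a gesture, could drive the same listening flag in place of the button.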
Referring now to
At step 504, a voice advertisement that includes a promoted phrase is served from an electronic device. According to various examples, the user may speak a phrase to enable detection of promoted phrases. For example, the user may say “more” during the voice advertisement to enable detection of promoted phrases. According to various examples, the user may speak a phrase to disable detection of promoted phrases. For example, the user may say “no” during the advertisement to disable detection of promoted phrases. According to various examples, step 504 may be the same or similar to step 102.
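The spoken enable and disable commands can be sketched as a small switch that is updated while the advertisement plays. The class name DetectionSwitch, the default of leaving detection disabled until enabled, and exact-word comparison on a transcript are assumptions made for illustration; the words “more” and “no” are the examples given above.

```python
class DetectionSwitch:
    """Tracks whether promoted phrase detection is enabled (illustrative)."""

    def __init__(self, enable_word="more", disable_word="no", enabled=False):
        self.enable_word = enable_word
        self.disable_word = disable_word
        self.enabled = enabled

    def during_advertisement(self, utterance):
        """Update the switch from an utterance spoken during the advertisement."""
        word = utterance.strip().lower()
        if word == self.enable_word:
            self.enabled = True
        elif word == self.disable_word:
            self.enabled = False
        return self.enabled


switch = DetectionSwitch()
state_after_more = switch.during_advertisement("More")
state_after_other = switch.during_advertisement("what time is it")
state_after_no = switch.during_advertisement("no")
```

Utterances other than the two command words leave the current setting unchanged in this sketch.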
At step 506, the voice advertisement ends. According to various examples, step 506 may be the same or similar to step 204. At step 508, a user speaks the promoted phrase from the voice advertisement. According to various examples, step 508 may be the same or similar to step 104. At step 510, additional advertisement content related to the promoted phrase is served. According to various examples, step 510 may be the same or similar to step 208.
Referring now to
Referring now to
At step 708, a determination is made whether the user that spoke the promoted phrase is the same user that was served the voice advertisement with the promoted phrase. According to various examples, voice recognition may be used to determine if the same user that was served the voice advertisement spoke the promoted phrase. For example, when a user starts to engage with content using a voice command, the device recognizes the user's voice and the device stores that the user was the person served the voice advertisement. Then when a user speaks the promoted phrase, the user's voice is compared to that of the user that was served the voice advertisement. For another example, the device could have the user log in before using the device. A login could be speaking the user's name, clicking the user's avatar, etc. The identity of a user could be confirmed with a password, a personal identification number (PIN) (e.g., a four-digit PIN), etc. A potential benefit of determining if the same user spoke the promoted phrase as was served the voice advertisement is to protect a child from hearing an advertisement intended for an adult, such as an adult beverage advertisement. Another potential benefit of determining if the same user spoke the promoted phrase as was served the voice advertisement is better-targeted advertising. For example, a child could not open a bank account. Another potential benefit of determining if the same user spoke the promoted phrase as was served the voice advertisement is to avoid disclosing personal information. For example, a user may be shopping for a birthday present and not want a product the user was shopping for to be served as an advertisement while in the presence of the intended recipient of the present.
According to various examples, determining if the same user that was served the voice advertisement spoke the promoted phrase also includes determining whether other people are present. For example, a user may be searching for a new home and may not want the user's current roommate to know about the new home search. Determining whether other people are present can be performed by any way of detecting the presence of other people. For example, recognizing voices in the room, using one or more cameras to detect other people, using proximity sensors to detect other people, detecting radio signal emissions from electronic devices of other people (e.g., another person's smart phone), etc.
According to various examples, determining if the same user that was served the voice advertisement spoke the promoted phrase also includes determining the age and/or gender of other people present. For example, in a room with all adults, an adult beverage would be more appropriate than in a room with both adults and children. Age detection can be done, for example, by voice characterization or camera image analysis.
According to various examples, determining if the same user that was served the voice advertisement spoke the promoted phrase also includes determining if the user is listening via a private method. For example, if the user is using headphones, then other people may not be able to hear the advertisement, and thus an advertisement may be served.
When, in step 708, the user that spoke the promoted phrase is the same user that was served the voice advertisement with the promoted phrase, step 710 is performed. When the user that spoke the promoted phrase is different from the user that was served the voice advertisement with the promoted phrase, step 706 is repeated. According to various examples, when one or more promoted phrases are required to be spoken within a time frame and the one or more time frames have passed, the device may stop listening for the one or more promoted phrases.
At step 710, additional advertisement content related to the promoted phrase is served. According to various examples, step 710 can be the same or similar to step 208.
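One possible way to combine the same-user determination with the presence and private-listening considerations is sketched below. The ListeningContext record, the should_serve function, and the particular policy for sensitive content are assumptions introduced for illustration; user identifiers are assumed to come from whatever identification method is used (voice recognition, login, and so on), which is outside the sketch.

```python
from dataclasses import dataclass


@dataclass
class ListeningContext:
    served_user_id: str            # user that was served the voice advertisement
    speaker_user_id: str           # user identified as speaking the promoted phrase
    others_present: bool = False   # e.g., detected by voices, cameras, or radio signals
    private_listening: bool = False  # e.g., the user is wearing headphones


def should_serve(ctx, sensitive=False):
    """Return True if additional advertisement content should be served."""
    if ctx.speaker_user_id != ctx.served_user_id:
        # Different user: keep listening rather than serving content.
        return False
    if sensitive and ctx.others_present and not ctx.private_listening:
        # Assumed policy: withhold sensitive content when others could hear it.
        return False
    return True


same_user = should_serve(ListeningContext("alice", "alice"))
different_user = should_serve(ListeningContext("alice", "bob"))
```

In this sketch, a False result for a different speaker corresponds to continuing to listen for the promoted phrase, and a True result corresponds to proceeding to serve the additional advertisement content.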
Referring now to
At step 808, additional advertisement content related to the promoted phrase with a call to action (CTA) is served. According to various examples, a CTA is the next action an advertiser prefers the user to take. For example, a CTA can include scanning a quick response (QR) code, receiving a discount code via a short message service (SMS), signing up for a newsletter, setting an appointment with the advertiser's sales team, setting up an account, buying a product, adding a product to a shopping cart, getting a sample product, etc.
Referring now to
At step 908, the user is asked one or more questions to provide additional information the user is interested in. For example, when the user speaks a promoted phrase related to refinancing a house mortgage, questions can be asked about loan balance, current interest rate, desired loan term (e.g., thirty-year mortgage, fifteen-year mortgage), etc. For another example, when the user speaks a promoted phrase related to automotive insurance, questions can be asked about location of the automobile, type of automobile, type of insurance, insurance limits, insurance deductibles, etc.
At step 910, additional advertisement content is served based on the user response to the one or more questions. For example, when the user speaks a promoted phrase related to refinancing a house mortgage, the advertisement content may include an estimated new mortgage payment. For another example, when the user speaks a promoted phrase related to automotive insurance, the advertisement content may include a new estimated monthly automobile insurance payment. According to various examples, the additional advertisement may include a CTA.
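As a purely illustrative sketch of how answers to the questions could feed an estimate, the standard fixed-rate amortization formula can be applied to the loan balance, interest rate, and loan term collected from the user. The present description does not specify how an estimated payment is computed; the function name monthly_payment and its parameters are assumptions, and a real estimate would depend on the advertiser's own terms and fees.

```python
def monthly_payment(balance, annual_rate_percent, term_years):
    """Estimate a fixed monthly payment with the standard amortization formula.

    payment = P * r * (1 + r)**n / ((1 + r)**n - 1)
    where P is the balance, r the monthly rate, and n the number of payments.
    """
    n = term_years * 12
    r = annual_rate_percent / 100 / 12
    if r == 0:
        # With no interest, the balance is simply spread over the payments.
        return balance / n
    growth = (1 + r) ** n
    return balance * r * growth / (growth - 1)


# Hypothetical answers gathered from the user's responses.
estimate = monthly_payment(balance=300_000, annual_rate_percent=6.0, term_years=30)
```

The resulting figure could then be included in the additional advertisement content, optionally alongside a CTA.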
According to various examples, the user enters a conversation dialog with the advertiser to get personalized results.
Referring now to
According to various examples, detection of the promoted phrases by voice advertisement module 1002 may be delayed until the voice advertisement with promoted phrase has ended. According to various examples, the additional advertisement content served by voice advertisement module 1002 is related to the promoted phrase.
According to various examples, voice advertisement module 1002 listens for the promoted phrase for a time frame.
According to various examples, detection of the promoted phrase by voice advertisement module 1002 may be enabled or disabled.
According to various examples, detection of the promoted phrase by voice advertisement module 1002 includes determining whether the promoted phrase was spoken by the same user that was served the voice advertisement with the promoted phrase. The promoted phrase is determined to have been spoken when the user that spoke the promoted phrase is the same user that was served the voice advertisement.
According to various examples, serving additional advertisement by voice advertisement module 1002 includes a CTA.
According to various examples, after voice advertisement module 1002 determines a promoted phrase was spoken by a user, voice advertisement module 1002 asks one or more additional questions using sound output device 1004. After voice advertisement module 1002 receives the answers to these questions via sound input device 1006, voice advertisement module 1002 provides advertisement content based on the user's response to the one or more questions.
According to various examples, voice advertisement module 1002, sound output device 1004, and sound input device 1006 may perform any step, both in part and in whole, from any of the
Referring now to
Referring now to
According to various examples, steps or functions, both in part and in whole, for the methods and/or apparatuses of
According to various examples, when the voice advertisement with a promoted phrase is “Are you still making mortgage payments based on high interest rates? Just say ‘home mortgage’ in the next 2 hours and I'll show you how much others are saving through refinancing options.” then when the user says “home mortgage”, the user is served a longer video with examples and a CTA.
According to various examples, when the voice advertisement with a promoted phrase is “did you know 59 seconds can save you 19% or more on car insurance? Just say ‘Acme Insurance’ into your voice remote in the next 2 hours to calculate your new rates” then when the user says “Acme Insurance”, the ad delivery system asks follow-up questions, the user answers about location, type of car, etc., and the user receives a rate quote from Acme Insurance.
According to various examples, when the voice advertisement with a promoted phrase is “why did the mascot cross the road? Press the voice button and say ‘mascot’ in the next 2 hours for the answer and tips on how to bundle and save on car insurance” then when the user says “mascot”, the user is served a longer advertisement with the punchline, tips, and a CTA.
According to various examples, when the voice advertisement with a promoted phrase is “did you know that your skin renews itself every 28 days? Just say ‘new skin’ in the next two hours and I'll give you some skin care advice based on your skin type.” then when the user says “new skin”, the user is able to interact with the advertiser by answering questions and getting custom advice for skin care products.
According to various examples, when the voice advertisement with a promoted phrase is “did you know that people who take care of their skin are more likely to make other healthy choices? Say ‘glow’ in the next 2 hours and I'll let you in on some beauty secrets.” then when the user says “glow” the user is served a longer video advertisement with tips and a CTA.
Referring now to
Referring now to
In certain examples, a non-transitory computer-readable storage medium may be provided that stores instructions to implement any of the described examples herein. The non-transitory computer readable medium may comprise one or more of a rotating magnetic disk, a rotating optical disk, a flash random access memory (RAM) chip, and other mechanically moving or solid-state storage media.
Various examples are methods that use the behavior of either or a combination of humans and machines. Method examples are complete wherever in the world most constituent steps occur. Some examples are one or more non-transitory computer readable media arranged to store such instructions for methods described herein. Whatever machine holds non-transitory computer readable media comprising any of the necessary code may implement an example. Some examples may be implemented as: physical devices such as semiconductor chips; hardware description language representations of the logical or functional behavior of such devices; and one or more non-transitory computer readable media arranged to store such hardware description language representations. Descriptions herein reciting principles, aspects, and embodiments encompass both structural and functional equivalents thereof.
Referring now to
Referring now to
Practitioners skilled in the art will recognize many possible modifications and variations. The modifications and variations include any relevant combination of the disclosed features. Descriptions herein reciting principles, aspects, and examples encompass both structural and functional equivalents thereof.
Various embodiments are methods that use the behavior of either or a combination of humans and machines. The behavior of either or a combination of humans and machines (instructions that, when executed by one or more computers, would cause the one or more computers to perform methods according to examples described and claimed and one or more non-transitory computer readable media arranged to store such instructions) embody methods described and claimed herein. Each of more than one non-transitory computer readable medium needed to practice the invention described and claimed herein alone embodies the invention. Method embodiments are complete wherever in the world most constituent steps occur. Some embodiments are one or more non-transitory computer readable media arranged to store such instructions for methods described herein. Whatever entity holds non-transitory computer readable media comprising most of the necessary code holds a complete embodiment. Some embodiments are physical devices such as semiconductor chips; hardware description language representations of the logical or functional behavior of such devices; and one or more non-transitory computer readable media arranged to store such hardware description language representations.
Although the invention has been shown and described with respect to a certain preferred embodiment or embodiments, it is apparent that equivalent alterations and modifications will occur to others skilled in the art upon the reading and understanding of this specification and the drawings. Practitioners skilled in the art will recognize many modifications and variations. The modifications and variations include any relevant combination of the disclosed features. In particular regard to the various functions performed by the above described components (assemblies, devices, systems, etc.), the terms (including a reference to a “means”) used to describe such components are intended to correspond, unless otherwise indicated, to any component which performs the specified function of the described component (i.e., that is functionally equivalent), even though not structurally equivalent to the disclosed structure which performs the function in the herein illustrated exemplary embodiments. In addition, while a particular feature may have been disclosed with respect to only one of several embodiments, such feature may be combined with one or more other features of the other embodiments as may be desired and advantageous for any given or particular application.
Some embodiments of physical machines described and claimed herein are programmable in numerous variables, combinations of which provide essentially an infinite variety of operating behaviors. Some embodiments herein are configured by software tools that provide numerous parameters, combinations of which provide for essentially an infinite variety of physical machine embodiments of the invention described and claimed. Methods of using such software tools to configure hardware description language representations embody the invention described and claimed. Physical machines can embody machines described and claimed herein, such as: semiconductor chips; hardware description language representations of the logical or functional behavior of machines according to the invention described and claimed; and one or more non-transitory computer readable media arranged to store such hardware description language representations.
In accordance with the teachings herein, a client device, a computer and a computing device are articles of manufacture. Other examples of an article of manufacture include: an electronic component residing on a motherboard, a server, a mainframe computer, or other special purpose computer each having one or more processors (e.g., a Central Processing Unit, a Graphical Processing Unit, or a microprocessor) that is configured to execute a computer readable program code (e.g., an algorithm, hardware, firmware, and/or software) to receive data, transmit data, store data, or perform methods.
An article of manufacture or system, in accordance with an embodiment of the invention, is implemented in a variety of ways: with one or more distinct processors or microprocessors, volatile and/or non-volatile memory and peripherals or peripheral controllers; with an integrated microcontroller, which has a processor, local volatile and non-volatile memory, peripherals and input/output pins; discrete logic which implements a fixed version of the article of manufacture or system; and programmable logic which implements a version of the article of manufacture or system which can be reprogrammed either through a local or remote interface. Such logic could implement a control system either in logic or via a set of commands executed by a processor.
Furthermore, examples and conditional language recited herein are principally intended to aid the reader in understanding the principles of the invention and the concepts contributed by the inventors to furthering the art and are to be construed as being without limitation to such specifically recited examples and conditions. Moreover, statements herein reciting principles, aspects, and embodiments of the invention, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents and equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.
The scope of the invention, therefore, is not intended to be limited to the exemplary embodiments or the various aspects shown and described herein. Rather, the scope and spirit of the present invention is embodied by the appended claims.