The present invention generally relates to inserting an advertisement object into a multimedia object, and more particularly, to a method, system, and computer program product for contextually inserting the advertisement object into a multimedia object.
In today's competitive world, businesses are consistently looking for new ways to market their products and services. They generally want as many consumers as possible to view their advertisements. Businesses consider it more likely that a potential customer will view advertisements presented during the customer's viewing of a multimedia object, such as a television show, a movie, or other audio/video (A/V). For example, providing an advertisement object along with a videocast, podcast, or other web-based A/V product is an effective method of presentation. One reason for this is that the potential consumer is most likely interested in viewing the multimedia object, and will willingly continue viewing through the advertisement object in order to view the remainder of the multimedia object.
The present invention provides a method for inserting a contextually relevant advertisement object into a multimedia object. The method includes identifying context information from one or more portion of the multimedia object, selecting the advertisement object based on the identified context information, determining an appropriate position within the portion of the multimedia object to insert the advertisement object and inserting the selected advertisement object into the multimedia object at the determined appropriate position.
The present invention further provides system for inserting a contextually relevant advertisement object in a multimedia object. The system includes a context identifier module to identify context information from one or more portions of the multimedia object, an advertisement selection module to select the advertisement object based on the identified context information and a position identifier module to determine an appropriate position within the portion of the multimedia object to insert the advertisement object.
The present invention further provides a computer program product for inserting a contextually relevant advertisement object in a multimedia object. The computer program product includes computer readable program for identifying context information from one or more portion of the multimedia object, computer readable program for selecting the advertisement object based on the identified context information and computer readable program for determining an appropriate position within the portion of the multimedia object to insert the advertisement object.
The present invention further provides a method for inserting an advertisement object into a multimedia object. The method includes processing the multimedia object to identify context information and an appropriate position for insertion of the advertisement object in each of the multimedia object, searching one or more advertisement objects for an advertisement object having a context relevant to the identified context information of the multimedia object, and inserting the advertisement object in the multimedia object at the appropriate position. The method further includes charging an advertiser an additional advertising fee when context of the advertisement object closely matches with the context information.
So that the manner in which the above recited features of the present invention can be understood in detail, a more particular description of the invention, briefly summarized above, may be had by reference to embodiments, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only typical embodiments of this invention and are therefore not to be considered limiting of its scope, for the invention may admit to other equally effective embodiments.
In the following description, for purposes of explanation, specific numbers, materials and configurations are set forth in order to provide a thorough understanding of the invention. It will be apparent, however, to one having ordinary skill in the art, that the invention may be practiced without these specific details. In some instances, regular features may be omitted or simplified so as not to obscure the present invention. Furthermore, reference in the specification to “one embodiment” or “an embodiment” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the invention. The appearances of the phrase “in an embodiment” in various places in the specification are not necessarily all referring to the same embodiment.
The headings used herein are for organizational purposes only and are not meant to be used to limit the scope of the description or the claims. As used throughout this application, the word “may” is used in a permissive sense (i.e., meaning having the potential to), rather than the mandatory sense (i.e., meaning must). Similarly, the words “include,” “including,” and “includes” mean including, but not limited to. Also, the terms “advertisement object” and “advertising” are used interchangeably, unless otherwise indicated.
The present invention provides a method, system, and computer program product for inserting a contextually relevant advertisement object in a multimedia object. The multimedia object in the present invention is a media that uses one or more forms of information content and information processing including, without limitation, text, graphics, audio, video, and animation. In various embodiments of the invention, the system identifies context information of a portion of the multimedia object. Further, the system may use the context information to select an advertisement object contextually related to the portion of the multimedia object.
In another embodiment of the invention, the system 100, multimedia sources 105 and advertisement sources 110 may co-exist within a single network or even on a single data processing device. In such cases, the system 100, multimedia sources 105, and advertisement sources 110 would then be interconnected by appropriate communication links 107a, 107b. For example, if all the components are co-located on a single personal computer device, the communication links 107a, 107b may be an internal data bus, SCSI-II connection, or the like.
In one embodiment of the invention, the system 100 comprises a context identifier module 115. The context identifier module 115 identifies context information from the multimedia object. In various embodiments of the invention, the context identifier module 115 identifies the context information of one or more portions of the multimedia object. The context identifier module further provides the position of the context information in the multimedia object, hereinafter referred to as the “context position”. The context position is the location of the corresponding context information in the multimedia object. An exemplary embodiment of the context identifier module 115 is described in detail with reference to
Further, the system 100 includes an advertisement selection module 120 to select an advertisement object based on the context information. In various embodiments of the invention, the advertisement selection module 120 does a keywords based search to select the advertisement objects that match the context information. An exemplary embodiment of the advertisement selection module 120 is described in detail with reference to
The system 100 further includes a position identifier module 125 to determine an appropriate position within the portion of the multimedia object to insert the advertisement object. The appropriate position is a location in the multimedia object where the system 100 may insert an advertisement object, and it can be determined in relation to the portion of the multimedia object of the corresponding context information. Alternatively, the appropriate position may be determined relative to the context position of the context information in the multimedia object. For example, an appropriate position for an advertisement related to sport shoe in a news A/V may be immediately after sports news is presented. The selected advertisement object is then inserted into the multimedia object at the determined appropriate position. The system 100 includes an insertion module 130 to insert the advertisement object to the determined position. In various embodiments of the invention, the insertion module 130 may transform the advertisement object prior to insertion in the multimedia object. Such a transform is intended to improve the presentation of the inserted advertisement object in the multimedia object. The transform may include, without limitation, normalizing volume differences between the advertisement object and multimedia object, render text from the advertising for insertion into the video stream, equalizing file types (e.g. wma, mp3, AAC, etc.) of advertisement object and multimedia object, normalizing different bit-rates across advertisement object and multimedia object.
In an operational example, multimedia source 105[1] provides a multimedia object, OBJECT1, to the system 100. The context identifier 115 identifies context information of one or more portions of OBJECT1, and provides the identified context information to the advertisement selection module 120. The context identifier module 115 further provides the position of the context information in the multimedia object hereinafter referred to as ‘context position’. The advertisement selection module 120 selects the advertisement object most closely matching the context information of the portion of the OBJECT1. The position identifier module 125 determines an appropriate position within OBJECT1 to insert the selected advertisement object. The insertion module then inserts the advertisement object into OBJECT1 at the appropriate position.
The context identifier module 115, the advertisement selection module 120, the position identifier module 125, and the insertion module 130 reside in the memory of one or more data processing devices. Those skilled in the art will appreciate that various forms of data processing devices may be employed in various embodiments the present invention including, without limitation, personal computers, servers, mainframes, and the like. An exemplary embodiment of a data processing device is described in detail with reference to
As a business model of the present invention, the above system may be implemented in various ways. One of the ways is by using the above system as a service based model using various service providers. The service provider is defined as entity that can perform the task of one or more of creating, maintaining, supporting, and the like of a computer infrastructure that performs one or more process steps of the invention for customers. In return, the service provider can receive payment from the customer(s) under some predetermined criteria, such as a subscription, fee agreement, or the like. In one embodiment of the invention, a service provider may process the multimedia object to identify context information and an appropriate position for insertion of the advertisement object into the multimedia object, search one or more advertisement objects for an advertisement object having a context relevant to the identified context information of the multimedia object, and insert the advertisement object into the multimedia object at the appropriate position. The service provider may further select the advertising object based on criteria selected from a group consisting of demographics, geography, user behavior, business rules and agreements. Companies or individuals interested in availing their service may opt for any one or all of the above.
In another embodiment of the invention various service providers may come together to provide the services provided by the various modules of the present invention. For example, a first service provider may specifically process the multimedia object to identify context information and an appropriate position for insertion of the advertisement object in each of the multimedia object. A second service provider may specifically search one or more advertisement objects for an advertisement object having a context relevant to the identified context information of the multimedia object. The second service provider may further select the advertising object based on criteria selected from a group consisting of demographics, geography, behavior, business rules and agreements. A third service provider may specifically insert the advertisement object in the multimedia object at the appropriate position. In one embodiment of the invention, the first service provider and the second service provider are one entity. In another embodiment of the invention, the second service provider and the third service provider may be a single entity.
In various embodiments of the invention, the service provider may charge an additional advertising fee to an advertiser when the context of the advertisement object matches closely with the context information extracted from the multimedia object. The advertiser may be a company or individual benefiting by the advertisement object. In one embodiment of the invention, the service provider may grade the relevance of the advertisement object with the context information on a varying scale. The service provider may charge an additional advertising fee to an advertiser based on the relevance scale determined for the advertisement object.
In one embodiment of the invention, the first text converter 205 disregards lower confidence word results from the textual data. Usually the speech recognition engine determines a confidence value for each translated word/phrase/sentence. The words having lower confidence value have higher probability of incorrectly identified. In various embodiments of the invention, the words with the confidence value below a cutoff are defined as lower confidence words.
In various embodiments of the invention, the first text converter 205 disregards stop word results from the textual data. The stop words may be the general English words used to complete the structure of a sentence. Examples of the stop words include, without limitation, ‘a’, ‘to’, ‘and’, and ‘the’.
The first text converter 205 may further include a domain specific vocabulary engine, comprising a set of words, terms, and codes specific to a domain. For example, a domain specific vocabulary for the food industry comprises words, phrases, terminologies specific to the industry. When a particular set of extracted keywords represents multiple meanings at a time, the domain specific vocabulary may distinctly identify the word relevant to a specific domain. In one embodiment of the invention, the domain specific vocabulary may be dynamically updated to as per the word usage patterns in the domain. In one embodiment of the invention, the domain specific vocabulary may be updated using text from standard web pages of the domain. For example, to create a sports specific vocabulary a sports website may be used. In one embodiment of the invention, the speech recognition engine is based on the Hidden Markov Model. Those skilled in the art will appreciate that various models of the speech recognition engine may be employed in various embodiments of the present invention including, without limitation, Hidden Markov Model based speech recognition, Neural Network-based speech recognition, Dynamic Time-wrapping based speech recognition, Knowledge based speech recognition, and the like.
In one embodiment of the invention, the first text converter comprises an Optical-Character Recognition (OCR). The OCR is used to translate images of handwritten or typewritten text into machine-editable text.
The context identifier module 115 includes a keyword-extracting module 210 to extract keywords and keyword locations from the textual data. The location of the keyword in the textual data may be used to determine location of the context information in the multimedia object.
The context identifier module 115 further includes a segment identifying module 215, and a context-building module 220. The segment identifying module 215 identifies at least one segment in the textual data. The segment may comprise a sub-set of extracted keywords along with the position of the extracted keywords in the multimedia object. The context-building module 220 determines the context information for the segment using the corresponding sub-set of keywords. In various embodiments of the invention, the context-building module may abstract or derive context information for the segment based on the sub-set of keywords. Various embodiments of the invention advantageously allow the system to select an advertisement object which is not associated with any specific extracted keyword. For example, the sub-set of keywords may mention words like “tiger wood”, “champion”, “open”, “masters”, etc., that are related to context information “Golf”. In such an example, the context-building module may determine the context information of the segment as “Golf”. In one embodiment of the invention, the segment identifying module 215 may identify a continuous segment related to particular context information based on the context information.
In various embodiments of the invention, the keyword-extracting module may further augment additional keywords to determine the context information. In one embodiment of the invention, the keyword-extracting module 210 may augment the additional keywords from metadata information associated with the multimedia object. Metadata information includes, without limitation, text contents associated with the multimedia object (e.g. title text, description text), Rich Site Summary (RSS) feed associated to the multimedia object, and tags provided by the source. In one embodiment of the invention, the keyword-extracting module 210 may further augment the additional keywords from the content on the webpage hosting the multimedia object. In one embodiment of the invention, the keyword-extracting module 210 may further augment the additional keywords from the content on the webpage in the domain of the multimedia object. The context-building module 220 may then correlate the addition keywords with the keywords extracted by the first text converter.
In one embodiment of the invention, the second text converter comprises an Optical-Character Recognition (OCR). The OCR is used to translate images of handwritten or typewritten text into machine-editable text.
Continuing with step 515, the position identifier module determines an appropriate position within the portion of the multimedia object to insert the advertisement object. At step 520, the insertion module inserts the advertisement object into the multimedia object at the determined position. In one embodiment of the invention, the insertion module transforms the advertisement object before inserting it into the multimedia object. For example, if the advertisement object and multimedia object are in different file formats, the insertion module may equalize the format of advertisement object with the format of the multimedia object before the insertion. Other types of transformations are also relevant, such as matching audio volume or tone, video color or tinting, or other factors.
At step 620, the segment identifier module identifies at least one segment in the textual data. The segment comprises a sub-set of keywords along with the corresponding position in the multimedia object. At step 625, the context building module determines and builds context of the segment. The context building module uses a sub-set of keywords to build the context for the segment.
The foregoing descriptions of specific embodiments of the present invention have been presented for purposes of illustration and description. They are not intended to be exhaustive or to limit the invention to the precise forms disclosed, and obviously many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, to thereby enable others skilled in the art to best utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated. It is understood that various omissions, substitutions of equivalents are contemplated as circumstance may suggest or render expedient, but is intended to cover the application or implementation without departing from the spirit or scope of the claims of the present invention.
While the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.