The present invention provides a method, system and apparatus for accessing product data that is displayed or otherwise shown on visual displays, including, but not limited to, televisions, movies, personal computers, personal digital assistants (PDA) and the like.
Traditional forms of marketing and advertisement have primarily relied on commercials. Thirty or sixty-second spots are strategically placed throughout programs. Through the use of demographics and other well know marketing methods advertisements are shown during programs in hopes of attracting purchasers. New technology, including new VCR recording devices, make it increasingly easy for viewers to discard the commercials. As viewers become accustomed to movies on demand or advertising free content, which is made at least in part possible by the Internet, the reliance on traditional commercial advertising is no longer ideal.
Movies, which lack the ability to stop and play a commercial, have bypassed the use of commercials through the use of product placements. Products are strategically placed in a movie, often for a predetermined price, such that the viewers will notice the product and want to purchase it. Well-known examples of product placements include BMW's placement of automobiles and motorcycles in several of the James Bond films. The problem that occurs is that absent blatant product placements that make the viewer aware of the product, its maker and where it may be purchased, product placement adverting is limited. Although it may be applied to traditional television programs, its applicability is limited. Products that do not have immediate visual cues as to maker, name and model are not well suited for product placement use.
Industry has tried to marry the Internet with traditional forms of advertising by placing banner ads in shows. As predetermined by the show's producer, web links may appear in which the user is directed to go to the link if background information is desired. Banner ads have been primarily used for background information and have not been used as advertisements. Banner ads do not provide effective advertisements, as they require visual interruptions.
The present invention solves this and other problems by providing a unique method of creating a dynamic product placement database that can be accessed in real-time or on a delayed basis, and provides the viewer with key marketing information about a specific product. In doing so, the need for commercials that interrupt a program is reduced and increased revenue can be recognized by groups producing and displaying programs.
VISUAL IMAGE MARKETING (VIM) is a system whereby the visual image on a film or television program is utilized to market that specific product. In VIM, film\television is combined with computer technology to provide real-time or delayed access to product data. Viewers identify a product or object displayed in a video, television show or movie and by merely pointing and clicking on the object to obtain marketing and other data.
One embodiment the present invention is integrated into a set-top box that provides access to programs that are specially designed to work with VIM as well as access to programs which were subsequently catalogued so as to work with VIM. A user is permitted to watch a movie and with the use of a selection device, which may include a mouse, highlight a product. The user is then provided with selected marketing data associated with that product. For example, if the user is watching a sitcom and likes the shirt that the main character is wearing, the user can click on the shirt as it appears in real-time and obtain instantaneous marketing data.
In another embodiment, a database is created that can be accessed on a delay basis. The database contains search items, such as show name, character name, approximate time into program, article description, etc., which may be used to facilitate retrieval of market data.
It is expressly contemplated that the present invention may be operated either in conjunction with the displaying of video, such as a television, or that it may be separately provided. In addition, it is expressly contemplated that the Internet, satellite networks or other networking technology may be used to facilitate access to the product database.
By providing access to marketing information on virtually all of the items displayed in a video, television show, movie or the like, the need for commercials is reduced, viewers have access to increased marketing information and additional revenues sources may be realized.
The present invention is described with reference to the following figures:
The present invention may be implemented using a variety of hardware. The present invention is preferably designed so that it is usable with a variety of different hardware configurations. Homes containing a traditional television and computer may utilize the VIM database, whereas homes that have an integrated television-VIM apparatus may access real-time marketing data as well as delayed data.
1. Hardware
As shown in
An alternate hardware configuration is shown in
2. Method of Selecting Objects
The present invention combines the ability to visually select items that are displayed. In a preferred embodiment the video is digitally recorded such that pixel data can be recorded that corresponds to each item for which marketing data will be available. Information on the objects in a video may be recorded on the side of the film traditionally used for sound data. Alternatively, the object information may recorded on an interleaved into or between one of the 30 frames per second which make up a video such that the computing means may retrieve the data while the viewer does not notice the data transmission.
In another invention, the video is scanned by an object extraction device and the object data is provided either in toto or in an as needed basis to the computing means. The object extraction device may on its most simple level include an overlay screen under which video plays. The overlay screen is divided into X and Y coordinates and used to mark the position of objects in the video. Object location and time data is recorded and made available to the computing means.
As shown in
As the show is run 16, the images on the show are compared to the known outlines. Known methods of digital signal processing, such as through the use of wavelet filtering, may be used to assist in outline recognition. It is expressly contemplated that the show may be run through the process several times using a variety of known filtering techniques to assist in identifying products and their placement 18.
The present system is also designed, however, to work with existing video. In one such embodiment, a video grid overlay is used, as shown in
As shown in
Upon selection of an object, the computing means retrieves predetermined marketing data. The type of marketing data may vary with the show, time, expected demographics, and the like.
3. VIM Demo
By way of example only, a VIM demo, also known as the diamond head project, has been created using a prerecorded video on a dedicated PC platform. The present invention is not limited to the VIM demo configuration and features. Rather, one possible embodiment has been implemented in the VIM demo to assist in describing the VIM apparatus and method. The demo was created to run on a stand-alone PC, although it is expressly contemplated that the VIM computing means may be incorporated into a set-top box or into a television.
In the demo the ASF file format was used. ASF is a file format that stores audio and video information and is specially designed to run over networks like the Internet. It is a highly flexible and compressed format that contains streaming audio, video, slide shows, and synchronized events.
The compelling feature of Advanced Streaming Format (ASF) streams is that they can deliver script commands to the Microsoft® Windows MediaT Player control, along with the audio and video streams. These script commands are pairs of UnicodeT strings synchronized with a particular time in the multimedia stream. The first string identifies the type of command being sent, and the second specifies the command to process. When the stream reaches the time associated with a command, the control sends a ScriptCommand event to the web page which contains it. An event-handling routine can then respond to this event. The script command strings are passed to the event handler as parameters of the ScriptCommand event.
These synchronized events are used in this project. The position (rectangular co-ordinates) of the car are stored in the ASF file and the definition of the car and the URL are also stored in the ASF file. In this project two global variables are used for the car position and for the car description which are always updated by the event-handler routine. When a user, viewing the ASF file via a web browser or other media player, clicks in the car position it will show the prestored message or goto the URL which are stored in the global variables.
The two files used by this project include:
The Diamond.asf file is created by converting an AVI file. Microsoft Windows Media Encoder has been used to this conversion. Microsoft Windows Media Encoder is a component of “Windows Media Tools” which can be downloaded form the following site:
http.//www.microsoft.com/windows/windowsmedia/en/download/default.a-sp Windows Media Tools also has a component “Windows Media ASF Indexer” which is used to edit and create script commands in the Diamond.asf. Additional information concerning script commands can be found at the MSDN Libray-January 2000.fwdarw.Platform SDK.fwdarw.Graphics and Multimedia Services.fwdarw.Windows Media Player Control.fwdarw.Using the Windows Media Player Control.fwdarw.Processing Embedded Script Command
There are two types of user defined script commands that are used in Diamond.asf. One is “DHO” and other one is “DHC”. “DHO” is used for the definition of the object or the URL of the object. Here, at the beginning of the parameter of “DHO” type script command, “URL” is used to define that it is a URL and the value of the URL is follows by it with a separator “.vertline.”.
In “DHC” type script command, values of the co-ordinates of the current object are kept in the parameter. In this exemplary embodiment, the coordinate values that are kept are the upper-left corner of the object and lower right corner of the object in sequence.
In Index.html, a “Windows Media Player” ActiveX control is used to view the diamond.asf file. The code is as follows:
In the above code, the MediaPlayer1_ScriptCommand(sType, sParam) is the method which hooks the script_command event of media player. When a script command is found from the diamond.asf this method executes and if it is “DHO” type then the value of the parameter is saved in a string type variable. If the command is “DHC” type then it is saved in the four variables X1, Y1, X2, Y2 which are the coordinates of the rectangle in which the object resides.
If the user clicks on the view panel of the media player, then the MediaPlayer1_Click method is executed and if the mouse point is on the rectangle of the object (i.e., the car), then the corresponding action is triggered.
4. Product Database
An illustrative version of the marketing database 5 is shown in
Locating information, including, but not limited to: Show name; Airing date; Channel; Length; Start time; End Time; Commercial breaks; Story line information; Character information; or Products coordinates (X,Y).
Product Information, including but not limited to: Name; Distributor; Price; Link to store; Link to vendor web site; or versions based on demographics. The database is designed to be accessible through the Internet or other known networks by all individuals, including individuals that do not have access computing means or other real-time access methods. As shown in
This increased accessibility permits requires that users who are manually searching for the product information have sufficient show based location data that permits them to reasonably locate the item desired.
For example, if a viewer sees a lamp in the living room scene of a sit-com and wants to get more information. The viewer can access the database that is connected to the Internet. The viewer may identify the show name, the date of viewing, the channel the program was seen on and enter the word lamp. If there are too many lamps, the user may also specify that the lamp was during the first half of the show or after the first commercial break. The user may indicate that the lamp or product was seen within the first 5-10 minutes of the show.
It is expressly contemplated that once the user retrieves an item, as shown in
5. Acquiring the Image and Object Location Data
The present invention contemplates a variety of functionally equivalent ways to identify the market-related items and their respective locations in a sequence of video or movie frames. These different techniques for identifying the products' locations can be used individually or in combination with one another. While an individual can manually review images and identify products within these images, the present invention also relies on automated methods so that someone is not required to identify the region of each image that corresponds to each product.
Conventional image capturing electronics and cameras include technology with digital signal processing already built into the camera (e.g., CCD image sensors). Alternatively, the image processing capability can be provided by equipment parallel to the image capturing functionality of the camera so that both occur relatively simultaneously. Similarly, any image processing could also occur subsequent to the image capture. Using this last alternative, previously acquired film and video can be processed to identify product related regions even if not originally captured by appropriately configured cameras and equipment.
One particular image-region identification technique contemplated by the present invention uses an infrared camera located at a predetermined location to capture the same scene as a more traditional camera. Because the infrared camera is at a known location relative to the conventional camera, the infrared-image can be easily coordinate-transformed onto the visible image to identify those regions of the image occupied by the various actors or other individuals. Another alternative technique, would be to use a camera having sensitivity in both the visible and infrared range; in this alternative, no coordinate transforms are necessary to locate image regions occupied by people that might be wearing or otherwise using products of interest.
Another technique for roughly identifying the location of items in an image is through the use of attached transponders for which a receiver (incorporated in the camera or operating in conjunction with the camera) can detect their location. This technique is similar in practice to that used by video game designers to provide realistic animation. Athletes, or other participants, are outfitted with transponders on various body locations and then filmed while performing different physical activities. These films are then converted into animation that closely mimics the athlete's motion.
Another alternative for locating a product in an image is to use laser pointers similar to the technology of laser-guided ordnance. During filming of a scene, a laser is targeted on a product, or products, and an appropriate receiver tracks that laser target during the scene so as to correspond with the product's location during the scene. Alternatively, later processing equipment could scan a previously captured film for the laser target information to identify objects of interest.
Regardless of the technique or technology used to capture a video image composed of various items which will eventually have marketing information associated therewith, virtually any conventional image processing and recognition method can be used to automate the identification of the separate items within the sequence of images.
Using conventional contour representation, the contour of a region or of an object within an image can be described as one of several compact representations that facilitate manipulation of the object. Examples of conventional contour representations can include chain codes, crack codes and run codes. These object outlines can be used individually or with shape recognition software to easily identify which pixels within an image correspond to different objects within the image.
One of ordinary skill will recognize an be able to apply considerations such as smoothing (or filtering) operations, pixel size and formation, camera sampling rates, and aspect ratio all play a role in accurately identifying those regions of an image that correspond to a particular object.
Other alternatives for segmenting an image into regions of interest can rely on such techniques as color (or chromaticity) regions. Using this segmenting method a viewer can be permitted to query for a “yellow sweater” or a “red car” and appropriate regions of the image will be detected and the VIM information associated therewith will be made available to the viewer.
Other image processing methods can include segmentation thresholding or edge finding.
Regardless of the techniques or methods used, the end result is a number of regions are identified in each image that correspond to a particular product which has associated marketing information. This marketing information can be made available to a viewer who, when viewing the sequence of images, selects a particular region of the viewing screen. When a region is selected the associated object is identified whose location coincides with the selected region and then this object identification information is used to search and retrieve appropriate marketing information from a database of information.
6. Distributing VIM Data
There are a variety of methods in which the video images, the object location information and the marketing information database can be distributed to viewers. In particular, HDTV is one current distribution technique, utilizing terrestrial as well as satellite transmitters, that has available bandwidth for auxiliary information in addition to the digitally encoded image data. This auxiliary information such as item identifiers and item pixel ranges in each image frame can, thus, be distributed to a viewer's equipment (e.g., television, computer, or video terminal) having an appropriate decoder for separating the auxiliary information from the image data.
Alternatively, if the video data is transmitted via MPEG-2, for example, over ATM, the additional information about the various items and their pixel locations can be included through the use of the adaptation layer of the cell-based transport. As known to those of ordinary skill, in order to carry data units other than the 48-octet payload size in ATM cells, an adaptation layer is needed. The ATM Adaptation layer (AAL) provides for segmentation and reassembly of higher-layer data units and detection of errors in transmission. For digital television transmission, the MPEG-2 transport standard is the conventional format being considered and both AAL1 and AAL5 have been used as a design for packaging digital video.
Another possible alternative contemplated by the present invention includes image sequences that are captured on film. Film conventionally includes the image data made up of individual frames sequentially arranged along with audio tracks and other identifying data encoded on the sides of the frames so as not to be interrupted by the film's sprocket holes. The auxiliary VIM information about products and their locations within a frame can be encoded in one of the many audio channels or other “track” areas on the sides of the film. An appropriate projector is then used, upon displaying the film, to detect and decode the auxiliary VIM information during the showing or playback of the film. Presented below is one exemplary embodiment which uses a particular formatting and encoding sequence developed by Sony known as SDDS. However, one of ordinary skill would easily recognize that the present invention contemplates, and can be modified to include, variations that can involve track placement on the film, encoding/decoding algorithms, the number of encoded tracks, decoding hardware, etc.
The SDDS system developed by Sony can be modified to incorporate, or substitute, the features of the present invention. In particular in the SDDS system, as shown in
As mentioned above, but not illustrated, two decoders may be used, one providing conventional SDDS audio information and the other providing the VIM information. Similar to an SDDS audio decoder, the VIM decoder 914 will receive data from the reader 912, optionally perform some type of error checking or error correcting, and then extract the VIM information for a number of different purposes that may include inputting to a viewer's computer system, transmitting along with the image frames, or some other similar use. If the original VIM information encoded on the film is first compressed, then the VIM decoder 914 can also include decompression hardware and software to retrieve the compressed information before outputting the VIM information.
The present invention is not limited to the above describes examples and may be modified as would be appreciated by one of ordinary skill in the art.
This application is a continuation application of U.S. application Ser. No. 14/503,918 filed Oct. 1, 2014, which is a continuation application of U.S. application Ser. No. 13/023,657, filed Feb. 9, 2011, now U.S. Pat. No. 8,856,830, which is a continuation application of U.S. application Ser. No. 10/885,067, filed Jul. 7, 2004, now U.S. Pat. No. 7,899,705, which is a divisional application of U.S. application Ser. No. 09/961,392, filed Sep. 25, 2001, which is a nonprovisional application and claims benefit of the filing date of U.S. Provisional Application No. 60/234,981, filed Sep. 25, 2000, the contents of each of which are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
5343239 | Lappington et al. | Aug 1994 | A |
5561708 | Remillard | Oct 1996 | A |
5706049 | Moghadam et al. | Jan 1998 | A |
5708845 | Wistendahl et al. | Jan 1998 | A |
5765176 | Bloomberg | Jun 1998 | A |
5918012 | Astiz et al. | Jun 1999 | A |
6006265 | Rangan et al. | Dec 1999 | A |
6297853 | Sharir et al. | Oct 2001 | B1 |
6332139 | Kaneko et al. | Dec 2001 | B1 |
6381583 | Kenney | Apr 2002 | B1 |
6411725 | Rhoads | Jun 2002 | B1 |
6415307 | Jones et al. | Jul 2002 | B2 |
6496981 | Wistendahl et al. | Dec 2002 | B1 |
6570586 | Kamen et al. | May 2003 | B1 |
6868415 | Kageyama | Mar 2005 | B2 |
7000242 | Haber | Feb 2006 | B1 |
7139767 | Taylor et al. | Nov 2006 | B1 |
7356830 | Dimitrova | Apr 2008 | B1 |
8745657 | Chalozin et al. | Jun 2014 | B2 |
20040109087 | Robinson | Jun 2004 | A1 |
20040250297 | Fuisz | Dec 2004 | A1 |
Number | Date | Country |
---|---|---|
9-274554 | Oct 1997 | JP |
WO 9737497 | Oct 1997 | WO |
WO 9830025 | Jul 1998 | WO |
WO 9910822 | Mar 1999 | WO |
WO 9940506 | Aug 1999 | WO |
WO 9963514 | Dec 1999 | WO |
WO 0042768 | Jul 2000 | WO |
WO 0043899 | Jul 2000 | WO |
Entry |
---|
Sammon, Rick, Digital Imaging Takes Off, St. Louis Post Dispatch, Jan. 17, 1999, Travel & Leisure section, p. T9. |
Understanding ATM, downloaded Nov. 23, 2009, from http://www.hn-networks.co.uk/atm.htl. |
Number | Date | Country | |
---|---|---|---|
20150264451 A1 | Sep 2015 | US |
Number | Date | Country | |
---|---|---|---|
60234981 | Sep 2000 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 09961392 | Sep 2001 | US |
Child | 10885067 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14503918 | Oct 2014 | US |
Child | 14728956 | US | |
Parent | 13023657 | Feb 2011 | US |
Child | 14503918 | US | |
Parent | 10885067 | Jul 2004 | US |
Child | 13023657 | US |