1. Field of the Invention
The invention is generally related to the area of augmented reality. In particular, the invention is related to techniques for overlaying corresponding augmented reality onto an image or a video being shown on a TV device.
2. The Background of Related Art
Augmented Reality (AR) is a type of virtual reality that aims to duplicate the world's environment in a computer device. An augmented reality system generates a composite view for a user that is the combination of a real scene viewed by the user and a virtual scene generated by the computer device that augments the scene with additional information. The virtual scene generated by the computer device is designed to enhance the user's sensory perception of the virtual world the user is seeing or interacting with. The goal of Augmented Reality is to create a system in which the user cannot tell the difference between the real world and the virtual augmentation of it. Today Augmented Reality is used in entertainment, military training, engineering design, robotics, manufacturing and other industries.
The recent development of mobile devices and cloud computing allows software developers to create many AR applications or programs to overlay virtual objects and/or additional 2D/3D multi-media information in a captured image. In order to display AR contents such as virtual objects in a real screen area that displays a real image, a user is required to scan AR specific markers (e.g. a QR code) or marker equivalent images to retrieve AR contents through the server.
Implementing AR for a television (TV) program presents several difficulties. Because viewers usually sit on a couch at some distance from the TV screen, an AR marker or a marker-equivalent image that is placed on the screen and related to the specific TV program is difficult to detect correctly at the time the program is shown. Another issue is that a TV broadcasting company might not agree to add continuous visual images to a TV program just to realize the AR function for a TV show. A TV program also poses a specific difficulty for AR implementation: the broadcasting schedule is inherently subject to change due to possible natural disasters or other emergency situations. Thus there is a need for techniques of providing interactive augmented reality content to an ongoing television program.
This section is for the purpose of summarizing some aspects of the present invention and to briefly introduce some preferred embodiments. Simplifications or omissions may be made to avoid obscuring the purpose of the section. Such simplifications or omissions are not intended to limit the scope of the present invention.
In general, the present invention is related to techniques of displaying any augmented reality (AR) based multi-media information corresponding to a television (TV) program on a TV screen without scanning any AR markers or related images to retrieve specific AR contents. According to one aspect of the present invention, an AR system for TV broadcasting programs comprises a mobile device, a digital TV or an Internet TV set, and a cloud computing based TV-AR management server. The TV-AR management server is configured to provide correct AR contents for the TV program that is being broadcast and received in a TV set being used by a user at the time.
Depending on implementation, the present invention may be implemented as a method, an apparatus or part of a system. According to one embodiment, it is a method for providing augmented reality (AR) content, the method comprises: receiving a request from a mobile device to download the AR content in accordance with an image being displayed on a display screen of a TV device, where the mobile device is communicating wirelessly with the TV device to receive detailed information about the image being displayed thereon; searching appropriate AR content from a database in accordance with the detailed information about the image, wherein the appropriate AR content is synchronized in time with the image being shown on the TV device; and releasing the appropriate AR content to the mobile device for displaying the AR content on top of the image.
According to another embodiment, it is a method for providing augmented reality (AR) content, the method comprises: sending a request from a mobile device to a server to obtain an appropriate AR content for overlaying the AR content onto an image being displayed on a display screen of a TV device, wherein the mobile device is communicating wirelessly with the TV device to receive detailed information about the image being displayed thereon; and displaying the appropriate AR content on top of the image.
One of the objects, features and advantages of the present invention is to provide a lot of flexibility in displaying corresponding AR content on an image being displayed on a TV device. The use of a mobile device is to facilitate the retrieval of correct AR content corresponding to the TV program being displayed on a TV device.
Other objects, features, benefits and advantages, together with the foregoing, are attained in the exercise of the invention in the following description and resulting in the embodiment illustrated in the accompanying drawings.
These and other features, aspects, and advantages of the present invention will become better understood with regard to the following description, appended claims, and accompanying drawings where:
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. However, it will become obvious to those skilled in the art that the present invention may be practiced without these specific details. The description and representation herein are the common means used by those experienced or skilled in the art to most effectively convey the substance of their work to others skilled in the art. In other instances, well-known methods, procedures, components, and circuitry have not been described in detail to avoid unnecessarily obscuring aspects of the present invention.
Reference herein to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the invention. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Further, the order of blocks in process flowcharts or diagrams representing one or more embodiments of the invention does not inherently indicate any particular order nor imply any limitations in the invention.
Embodiments of the present invention are discussed herein with reference to
According to one embodiment, the synchronization between a TV program guide (e.g., Internet Electronic Program Guide (IEPG)) and the built-in clock of a mobile device is utilized for the mobile device to download corresponding AR contents from a dedicated server (e.g., a cloud server), where the AR contents are exactly matched with the TV program currently being broadcast or watched by a user.
The mobile device 102 is caused to communicate with a cloud server 106 to retrieve AR content corresponding to the program being shown in the channel. The cloud server 106 is configured to be coupled to a server 108 (referred to as an IEPG server herein) providing the IEPG or the TV program currently being selected and viewed on the TV set 104. As shown in
According to one embodiment as shown in
According to one embodiment, a software module or program is developed and executed in the TV-AR management server 106. The module is configured to acquire the IEPG data from the server 108 run by the TV broadcasting company. In one embodiment, the IEPG data is in the XMLTV format maintained by the XMLTV project, where XMLTV is an open-source, widely used XML-based file format for describing TV program listings. XMLTV also serves as an interface between programs that emit guide data and programs that consume it, and includes a collection of software tools to obtain, manipulate, and search updated TV program listings.
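As a minimal sketch of how such a module might read XMLTV data, the following parses `programme` elements with Python's standard library. The sample listing, the channel id, and the function names are illustrative assumptions, and the timestamp parsing assumes the `YYYYMMDDhhmmss +ZZZZ` form; real feeds may omit the timezone offset and would need extra handling.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

# Hypothetical sample listing in the XMLTV layout (one <programme> per show).
SAMPLE_XMLTV = """<tv>
  <programme start="20130419200000 +0000" stop="20130419210000 +0000" channel="ch5.example">
    <title>Evening News</title>
    <desc>Daily news round-up.</desc>
  </programme>
</tv>"""


def parse_xmltv_time(value):
    """Parse an XMLTV timestamp; assumes an explicit timezone offset is present."""
    return datetime.strptime(value, "%Y%m%d%H%M%S %z")


def parse_listings(xml_text):
    """Return one dict per <programme> element with the fields used for AR lookup."""
    root = ET.fromstring(xml_text)
    listings = []
    for programme in root.iter("programme"):
        start = parse_xmltv_time(programme.get("start"))
        stop = parse_xmltv_time(programme.get("stop"))
        listings.append({
            "channel": programme.get("channel"),
            "start": start,
            "duration": stop - start,
            "title": programme.findtext("title", default=""),
            "description": programme.findtext("desc", default=""),
        })
    return listings


listings = parse_listings(SAMPLE_XMLTV)
```

The resulting dictionaries carry the start time, duration, title, and description of each program, which is the kind of data the server would need to match a request against the schedule.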
In one embodiment, the TV-AR management server 106 is designed to have several Comma Separated Values (CSV) files in its server environment to contain descriptions of each TV channel program. The attributes for the IEPG dataset corresponding to each TV channel shall have at least the following information:
Date and time of day when the TV program will start.
Duration or total running time for the described program.
Title that the program should show for the described program.
Description that the program should show during on-air time.
The number of attributes for IEPG could be increased, depending on the application of AR contents and the timing of the display on the mobile device.
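The four attributes listed above could be stored one row per program in a per-channel CSV file. The sketch below is only an illustration: the column names (`start`, `duration_minutes`, `title`, `description`), the date format, and the sample rows are assumptions and are not specified by this description.

```python
import csv
import io
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class ProgramEntry:
    start: datetime        # date and time of day when the program starts
    duration: timedelta    # total running time
    title: str
    description: str

    @property
    def end(self):
        return self.start + self.duration


# Hypothetical CSV content for one TV channel.
SAMPLE_CSV = """start,duration_minutes,title,description
2013-04-19 20:00,60,Evening News,Daily news round-up
2013-04-19 21:00,30,Cooking Hour,Seasonal recipes
"""


def load_channel_schedule(csv_text):
    """Parse CSV text into a list of ProgramEntry records."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        ProgramEntry(
            start=datetime.strptime(row["start"], "%Y-%m-%d %H:%M"),
            duration=timedelta(minutes=int(row["duration_minutes"])),
            title=row["title"],
            description=row["description"],
        )
        for row in reader
    ]


schedule = load_channel_schedule(SAMPLE_CSV)
```

Additional attributes would simply become additional columns and fields.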
In operation, a mobile device is caused to send a request with data including the current time and the TV channel to the TV-AR management server in a cloud computing network. In return, the mobile device downloads the AR contents corresponding to the TV program. The TV broadcasting station server uploads continuously the updated TV program dataset to the TV-AR management server through the Internet. If the mobile device successfully downloads the correct AR content for the TV program, then an image processing application is executed to determine the local 3D coordinates of the TV frame by using the video camera of the mobile device. Once the local coordinates of TV frame are determined, the mobile device displays the AR contents to fit into the currently captured video view including the TV screen frame.
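The server-side lookup described in this paragraph can be sketched as a search of the channel's schedule for the program whose time window contains the time sent by the mobile device. The schedule contents, channel name, AR content identifiers, and function name below are hypothetical, and the half-open interval (start inclusive, end exclusive) is an assumed convention.

```python
from datetime import datetime, timedelta

# Hypothetical in-memory schedule: channel -> (start, duration, title, AR content id).
SCHEDULE = {
    "channel-5": [
        (datetime(2013, 4, 19, 20, 0), timedelta(minutes=60), "Evening News", "ar-news-001"),
        (datetime(2013, 4, 19, 21, 0), timedelta(minutes=30), "Cooking Hour", "ar-cook-007"),
    ],
}


def find_ar_content(channel, now):
    """Return the program on air at `now` for `channel`, or None if nothing matches."""
    for start, duration, title, ar_content_id in SCHEDULE.get(channel, []):
        # A program is on air from its start up to, but not including, its end.
        if start <= now < start + duration:
            return {"title": title, "ar_content_id": ar_content_id}
    return None


# The mobile device would send its current time and the selected channel.
match = find_ar_content("channel-5", datetime(2013, 4, 19, 20, 30))
```

A request that falls outside every time window, or names an unknown channel, yields `None`, which the mobile device could treat as "no AR content available".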
The TV broadcasting company that performs terrestrial, cable, or satellite digital TV broadcasting could provide its own IEPG data. The IEPG has an adaptive function to accommodate a sudden change in the original TV program schedule caused by incidents such as emergency news or natural disasters. It updates the time table of the TV program by (1) receiving an alert notice from the TV company and displaying it on the smart phone, and (2) updating the rescheduled TV time table. The IEPG data includes program descriptions, transmission schedules (start time and finish time), and flags to indicate the state thereof.
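One possible way to model such a schedule change is sketched below. The push-back policy (every program not yet finished is delayed by the length of the emergency broadcast), the flag values, the alert wording, and all names are assumptions made for illustration; the sketch also assumes the emergency broadcast starts on a program boundary and rejects other cases.

```python
from dataclasses import dataclass, replace
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Slot:
    title: str
    start: datetime
    finish: datetime
    flag: str = "scheduled"  # hypothetical states: scheduled, delayed, inserted


def insert_emergency(slots, title, start, duration):
    """Insert an emergency program and delay later programs by its duration.

    Returns the updated time table and an alert message for the mobile device.
    """
    updated = []
    for slot in slots:
        if slot.start < start < slot.finish:
            raise ValueError("this sketch only handles changes at a program boundary")
        if slot.finish <= start:
            updated.append(slot)  # already finished, unchanged
        else:
            updated.append(replace(slot, start=slot.start + duration,
                                   finish=slot.finish + duration, flag="delayed"))
    updated.append(Slot(title, start, start + duration, "inserted"))
    updated.sort(key=lambda s: s.start)
    alert = f"Schedule changed: {title} at {start:%H:%M}"
    return updated, alert


slots = [
    Slot("Evening News", datetime(2013, 4, 19, 20, 0), datetime(2013, 4, 19, 21, 0)),
    Slot("Cooking Hour", datetime(2013, 4, 19, 21, 0), datetime(2013, 4, 19, 21, 30)),
]
updated, alert = insert_emergency(slots, "Emergency Bulletin",
                                  datetime(2013, 4, 19, 21, 0), timedelta(minutes=15))
```

Here the alert string stands in for the notice displayed on the smart phone, and the returned list stands in for the rescheduled time table.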
The TV broadcasting company continuously updates its TV program schedule and uploads the IEPG data to the TV-AR management server. The TV-AR management server identifies the correct AR contents corresponding to the TV program at the time. The mobile device downloads the AR content selected by the TV-AR management server. After the AR content is successfully downloaded to the mobile device, the mobile device overlays the AR content on a camera captured image being displayed on the screen of the mobile device.
By utilizing the IEPG for digital TV broadcasting, the AR content management located on a cloud computing server is an entirely new approach to displaying a broad array of AR contents, because the identification of correct AR contents does not require any conventional image processing method, such as conventional markers (e.g., a black and white rectangular image), QR codes, or other image pieces used to retrieve the correct AR contents from the cloud server.
According to one embodiment, an image processing algorithm is designed to determine the local 3D coordinates of a visually identified 3D object in the reference 3D coordinates (i.e., world coordinates). The image processing algorithm is referred to as the simultaneous localization and mapping (SLAM) algorithm, which is a well-known method in the field of computer vision for building a 3D map while at the same time localizing the mobile camera within that map. The purpose is to eventually obtain the 3D coordinates of a captured 3D object (e.g., a TV frame) in a camera view. The SLAM based TV frame tracking algorithm creates a point cloud (3D map) of distinctive object features in the camera scene including the TV frame and determines the local 3D coordinates of the TV frame. It is also beneficial to provide the SLAM algorithm with prior knowledge about the size of the TV frame (e.g., the actual size of the TV display screen) for efficient initialization of the SLAM based 3D tracking.
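The following sketch does not implement SLAM. It only illustrates, under a simple pinhole camera model, why knowing the physical size of the TV frame helps: a known width fixes the metric scale, so the distance to the screen and the pixel positions of its corners follow directly. The focal length, principal point, screen dimensions, and the assumption that the screen faces the camera squarely are all hypothetical.

```python
import math


def project_point(point, focal_px, principal):
    """Project a 3D point in camera coordinates (meters) to pixel coordinates."""
    x, y, z = point
    if z <= 0:
        raise ValueError("point must be in front of the camera")
    return (focal_px * x / z + principal[0], focal_px * y / z + principal[1])


def tv_corners_3d(width_m, height_m, center):
    """Corners of a screen facing the camera squarely, centered at `center`."""
    cx, cy, cz = center
    hw, hh = width_m / 2, height_m / 2
    return [(cx - hw, cy - hh, cz), (cx + hw, cy - hh, cz),
            (cx + hw, cy + hh, cz), (cx - hw, cy + hh, cz)]


def estimate_distance(width_m, width_px, focal_px):
    """Distance to the screen from its known width and its apparent width in pixels."""
    return focal_px * width_m / width_px


# Hypothetical values: a 1.0 m x 0.5 m screen 2 m away, 1000 px focal length.
corners = tv_corners_3d(1.0, 0.5, (0.0, 0.0, 2.0))
pixels = [project_point(c, 1000.0, (640.0, 360.0)) for c in corners]
apparent_width = pixels[1][0] - pixels[0][0]
distance = estimate_distance(1.0, apparent_width, 1000.0)
```

The projected corner positions define the region of the camera view into which AR content would be fitted once the TV frame's local coordinates are known.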
According to one embodiment,
According to one embodiment, there are optional modes for displaying the AR contents.
The default mode, or Display Mode 1 of AR contents may be implemented as functional steps as follows:
The optional mode, or Display Mode 2, of the AR content shall start after a successful initial image capture of the TV frame by the video camera. Once the AR content is displayed, the user does not have to continuously capture the TV frame to maintain the display of AR content. The AR content continues to be displayed and updated without further image capture of the TV frame by the video camera.
In the other optional mode, or Display Mode 3, the AR contents shall be displayed independently of any image capture of the TV frame. When the mobile device completes the download of the AR content, the AR content shall be displayed on the screen of the mobile device regardless of the currently captured image status of the video camera.
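The display conditions of the two optional modes can be sketched as a small decision function. The enum, the function name, and the boolean inputs are hypothetical; the sketch covers only Display Mode 2 and Display Mode 3 as described above, and assumes that in both modes nothing is shown until the download has completed.

```python
from enum import Enum


class DisplayMode(Enum):
    MODE_2 = 2  # needs one successful capture of the TV frame, then persists
    MODE_3 = 3  # independent of any capture of the TV frame


def should_display(mode, downloaded, frame_captured_once):
    """Decide whether AR content is shown on the mobile device screen."""
    if not downloaded:
        return False
    if mode is DisplayMode.MODE_2:
        # Display starts after an initial capture and does not require
        # the TV frame to remain in the camera view afterwards.
        return frame_captured_once
    if mode is DisplayMode.MODE_3:
        return True
    raise ValueError(f"unsupported mode: {mode}")
```

For example, `should_display(DisplayMode.MODE_3, True, False)` evaluates to `True`, while the same inputs in Display Mode 2 evaluate to `False` until the TV frame has been captured once.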
According to one embodiment, the content of a TV program by a TV broadcasting company may vary from one location to another. Therefore, without one embodiment of the present invention, a user would receive correct AR content at one location, but may receive incorrect AR content at another location.
The invention is preferably implemented in software, but can also be implemented in hardware or a combination of hardware and software. The invention can also be embodied as computer readable code on a computer readable medium. The computer readable medium is any data storage device that can store data which can thereafter be read by a computer system. Examples of the computer readable medium include read-only memory, random-access memory, CD-ROMs, DVDs, magnetic tape, optical data storage devices, and carrier waves. The computer readable medium can also be distributed over network-coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.
The processes, sequences or steps and features discussed above are related to each other and each is believed independently novel in the art. The disclosed processes and sequences may be performed alone or in any combination to provide a novel and unobvious system or a portion of a system. It should be understood that the processes and sequences in combination yield an equally independently novel combination as well, even if combined in their broadest sense; i.e. with less than the specific manner in which each of the processes or sequences has been reduced to practice.
The present invention has been described in sufficient detail with a certain degree of particularity. It is understood by those skilled in the art that the present disclosure of embodiments has been made by way of examples only and that numerous changes in the arrangement and combination of parts may be resorted to without departing from the spirit and scope of the invention as claimed. Accordingly, the scope of the present invention is defined by the appended claims rather than the foregoing description of embodiments.
This application claims the benefits of U.S. Provisional Application No. 61/854,162, filed Apr. 19, 2013, and entitled “Software method to provide interactive augmented reality information corresponding to television programs”, which is hereby incorporated by reference for all purposes.
Number | Date | Country
---|---|---
61854162 | Apr 2013 | US