The invention relates to a method for automatically tagging media content.
Known media management systems, such as User Generated Content applications or collaborative tools, make extensive use of tags, also called metadata, to describe media items. These tags allow for later search or reference among a large amount of content. They are added either manually or by automated tagging engines using information or metadata available at the level of the tagging engine, such as, for instance, titles of movies, names of authors, and so on.
However, some information that could be used for tagging may originate from the media capture process itself, especially in interactive environments. For example, in a videoconference such information could be which conferee is speaking at a given time or which user interactions are used. This information is only visible during the media capture, which is generally handled by media servers, and is not available at the level of a tagging engine.
It is an object of the invention to overcome this drawback.
The method according to the invention realizes this object in that it is a method for automatically tagging media content, said method including the steps of a media server obtaining capture tagging information from said media content upon capturing thereof and sending said capture tagging information and said media content to an application server, and of said application server tagging said media content with tagging information comprising at least part of said capture tagging information.
In this way, the information that is only available from the media content upon its capture by the media server is made available to the application server, where the tagging engine resides, for tagging of the media data.
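By way of illustration only, the following Python sketch mirrors these steps; the names MediaServer, ApplicationServer, capture and tag are hypothetical stand-ins chosen for this sketch and do not denote any actual interface of the invention.

```python
# Minimal sketch of the claimed method; all names and values are illustrative.

class MediaServer:
    """Captures media and, at the same time, extracts capture tagging
    information that is only visible during the capture process."""

    def capture(self):
        media = b"...recorded audio/video..."
        # Information only observable at capture time, e.g. the active speaker.
        capture_tags = [{"time": "00:01:12", "event": "speaker", "who": "conferee-1"}]
        return media, capture_tags


class ApplicationServer:
    """Hosts the tagging engine and tags the media with tagging information
    comprising at least part of the capture tagging information."""

    def tag(self, media, capture_tags):
        return {"media": media, "tags": list(capture_tags)}


ms, aps = MediaServer(), ApplicationServer()
media, capture_tags = ms.capture()      # the media server obtains tags upon capture
tagged = aps.tag(media, capture_tags)   # the application server tags the content
```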
An additional feature of an embodiment of the method according to the invention is that said application server in addition obtains complementary tagging information from an external source and includes said complementary tagging information in said tagging information. Such complementary tagging information can, for instance, in the case where the media data is a movie, be the title of the movie or the names of the actors, as mentioned earlier.
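Continuing the sketch above, the combination could be as simple as concatenating both sets of tags; the merge rule and the example values below are assumptions made purely for illustration.

```python
def combine_tags(capture_tags, complementary_tags):
    # The tagging information comprises the capture tags plus externally
    # obtained complementary tags (e.g. a movie title or actor names).
    return list(capture_tags) + list(complementary_tags)

tags = combine_tags(
    [{"event": "speaker", "who": "conferee-1"}],          # from the media server
    [{"title": "Some Movie"}, {"actor": "Some Actor"}],   # from an external source
)
```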
Another additional feature of an embodiment of the method according to the invention is that said media server obtains said capture tagging information under control of said application server. In this way the capturing of the tagging information is made interactive.
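Such control could, for instance, take the form of the application server telling the media server which events to extract; the control message shape below is an assumption for illustration only.

```python
# Hypothetical control message sent by the application server:
control = {"extract": {"speaker_change", "key_press"}}

def capture_under_control(observed_events, control):
    # The media server only retains the capture tagging information
    # that the application server asked for.
    return [e for e in observed_events if e["event"] in control["extract"]]

events = [
    {"event": "speaker_change", "who": "conferee-2"},
    {"event": "camera_pan"},  # not requested, hence dropped
]
filtered = capture_under_control(events, control)
```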
The invention also relates to a media server for realizing the above method, said media server including media content capturing means adapted to capture media content, characterized in that said media server additionally includes processing means adapted to obtain capture tagging information from said media content upon capturing thereof and sending means to send said capture tagging information and said media content to an application server.
An additional feature of an embodiment of the media server according to the invention is that said media server is part of a videoconference system and that said content capturing means is adapted to record and mix audio and video flows of a videoconference, said processing means being adapted to obtain, as said capture tagging information, information related to actions performed by at least one participant of said videoconference.
In this environment the capture tagging information can, for instance, be who is speaking at a given time or what keyboard keys are pressed by whom.

Another additional feature of an embodiment of the media server according to the invention is that it additionally includes receiving means adapted to receive control information, said processing means being additionally adapted to obtain said capture tagging information based on said control information.
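For illustration, such capture tagging information could be represented as timed participant events, which makes later search over the recorded conference straightforward; the field names and values below are assumptions, not a prescribed format.

```python
# Assumed shape of capture tags derived while recording and mixing:
capture_tags = [
    {"t": "00:01:12", "event": "speaking",  "participant": "UE2"},
    {"t": "00:03:45", "event": "key_press", "participant": "UE4", "key": "5"},
]

def who_spoke(tags):
    # A later query such as "who was speaking, and when" becomes trivial.
    return [(tag["t"], tag["participant"]) for tag in tags
            if tag["event"] == "speaking"]
```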
The invention also relates to an application server for realizing the above method, said application server including receiving means adapted to receive media content from a media server, characterized in that said receiving means are also adapted to receive capture tagging information from said media server, and that said application server additionally includes tagging means adapted to tag said media content with tagging information that comprises at least part of said capture tagging information to obtain tagged media content.

A feature of an embodiment of the application server according to the invention is that said receiving means are additionally adapted to receive additional tagging information and that said application server also includes processing means to combine said capture tagging information with said additional tagging information to obtain said tagging information.
An additional feature of an embodiment of the application server according to the invention is that said application server is part of a videoconference system and that said capture tagging information is information related to actions performed by at least one participant of said videoconference. As mentioned above, in this environment the capture tagging information can, for instance, be who is speaking at a given time or what keyboard keys are pressed by whom. The complementary tagging information can, for instance, be information on the geolocalization of conferees or on their presence status.
Embodiments of a method and of servers in accordance with embodiments of the present invention are hereafter described, by way of example only, and with reference to the accompanying figures.
The videoconference system as shown in the accompanying figure includes a media server MS and an application server AS, as well as an application entity AE and a repository MR used for storing the media, interconnected over a network. AE controls AS over that network and can be accessed by user entities UE1, UE2, UE3 and UE4 via which participants (not shown) have access to the video conference.
In alternative embodiments (not shown) the user entities directly control the application server, or the application server or the application entity is controlled by a third party that is not necessarily a participant in the conference.
MS mixes the audio and video flows and performs the conference “floor management”, such as selecting which speaker is to be seen. How a video conference is realized and managed is not described in detail since this is not the subject of the invention. More details thereon can, for instance, be found at http://www.rfc-editor.org/rfc/rfc4722.txt (chapter 5).
Upon request from AE, AS controls MS to record the video conference.
Under control of AS, MS also obtains capture tagging information from the conference upon capturing thereof, for instance which participant is speaking at a given time or which keyboard keys are pressed by whom.
The thus obtained information is sent to AS together with the recorded media (arrows 4 and 6). This can be done on-the-fly, or MS can collect all data and send a complete file to AS after the video conference. AS then forwards the data to MR for storage.
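The two delivery modes could be sketched as follows; send stands in for whatever transport is used between MS and AS and is assumed here purely for illustration.

```python
def send_on_the_fly(tag_stream, send):
    # Each capture tag is forwarded to AS as soon as it is produced.
    for tag in tag_stream:
        send(tag)

def send_as_file(tag_stream, send):
    # Alternatively, MS collects all data and sends one complete file
    # to AS after the video conference.
    send(list(tag_stream))
```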
AS also receives metadata from an external source (not shown), for instance from a geolocalization server or a User Profile HSS, and combines this complementary information with the capture tagging information to tag the recorded media.
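Such enrichment could be sketched as a per-participant lookup; the profile source, field names and values below are hypothetical placeholders, not actual HSS data.

```python
# Hypothetical metadata from an external source such as a geolocalization
# server or a user profile server:
profiles = {"UE2": {"location": "Paris", "presence": "online"}}

def enrich(capture_tags, profiles):
    # Attach complementary per-participant metadata to each capture tag.
    return [dict(tag, **profiles.get(tag.get("participant"), {}))
            for tag in capture_tags]

enriched = enrich([{"event": "speaking", "participant": "UE2"}], profiles)
```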
In another embodiment, tagging information related to a recognized picture, such as the picture size and its position in the camera field, is obtained by a media server MSI when recording the picture provided by a user entity UE, being a camera, via interface I1. This tagging information is communicated, together with the picture, to an application server IAS via the I2 and I3 interfaces. IAS tags the picture with the obtained information and possibly with additional information obtained externally (arrow I4).
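In this embodiment the capture tagging information could, for instance, take the following shape; the field names and values are assumptions chosen for illustration.

```python
# Illustrative capture tags that MSI could extract while recording a
# recognized picture from the camera UE (all values hypothetical):
picture_tags = {
    "width": 640, "height": 480,   # picture size
    "x": 120, "y": 80,             # position in the camera field
}
```

IAS would then attach these tags, possibly together with externally obtained information, to the stored picture.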
The capturing by MSI is controlled by the application server, whereby the capturing of the tagging information is again made interactive.
It has to be remarked that the above described functions may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Other hardware, conventional and/or custom, may also be included.
Also, the above description and drawing merely illustrate the principles of the invention. It will thus be appreciated that, based on this description, those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the invention and are included within its spirit and scope. Furthermore, the examples recited herein are principally intended expressly to be only for pedagogical purposes to aid the reader in understanding the principles of the invention and the concepts contributed by the inventor(s) to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Moreover, all statements herein reciting principles, aspects, and embodiments of the invention, as well as specific examples thereof, are intended to encompass equivalents thereof.
Number | Date | Country | Kind |
---|---|---|---|
10305247.8 | Mar 2010 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371(c) Date
---|---|---|---|---|
PCT/EP2011/053359 | 3/7/2011 | WO | 00 | 9/11/2012 |