The present invention generally relates to the processing of multimedia content. In particular, the present invention relates to a method and apparatus for classifying a multimedia file.
In the computer processing of multimedia content, there is a need in some applications to classify a multimedia file according to one or more criteria. The result of the classification can be used, for example, in the searching and customized service providing of the multimedia content.
For example, some current systems classify or characterize audiovisual content (database online, VOD catalogs, TV guide . . . ) with one or more genres. The information generated by the classification is relevant to the audiovisual content and can be used to provide the user with features such as searching or providing the audiovisual content by genres. Some available online databases (websites) to date which have such features are listed below:
Some of the above websites, such as http://www.filmotv.fr, associate only one genre to a movie. Some other websites, http://www.imdb.com for example, may combine several genres into one movie for the classification. However, the genres for one movie are not prioritized. That is, all the genres for one movie are at the same level.
The website http://www.vodmania.com offers to search a movie by a genre and then filter the result by a sub-genre. But the genre and the sub-genre cannot be combined for a searching, in which case no result will be generated.
Known technologies in this field only used a limited number of criteria to classify multimedia content. In addition, the level of relevancy of the criteria to the multimedia content is not considered or used effectively.
In view of the above problem in the conventional technologies, the invention provides a method and apparatus which can classify multimedia content taking into account of more than one criteria and their level of relevancy to the multimedia content.
According to one aspect of the invention, a method for classifying a first multimedia file is provided. The method comprises the steps of: obtaining at least one feature of the first multimedia file; determining at least one second multimedia file which is relevant to the first multimedia file according to the obtained at least one feature; attributing at least one parameter for classification of the first multimedia file to the first multimedia file, wherein the at least one parameter is weighted as a statistical function of the occurrence of the at least one parameter for the at least one second multimedia file.
According to one aspect of the invention, an apparatus for classifying a first multimedia file is provided. The apparatus comprises: a processor configured to: obtain at least one feature of the first multimedia file; determine at least one second multimedia file which is relevant to the first multimedia file according to the obtained at least one feature; attribute at least one parameter for classification of the first multimedia file to the first multimedia file, wherein the at least one parameter is weighted as a statistical function of the occurrence of the at least one parameter for the at least one second multimedia file.
According to one aspect of the invention, a computer program product downloadable from a communication network and/or recorded on a medium readable by computer and/or executable by a processor is provided. The computer program product comprises program code instructions for implementing the steps of the method according to one aspect of the invention.
According to one aspect of the invention, a Non-transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor. The Non-transitory computer-readable medium includes program code instructions for implementing the steps of a method according to one aspect of the invention.
It is to be understood that more aspects and advantages of the invention will be found in the following detailed description of the present invention.
The accompanying drawings are included to provide further understanding of the embodiments of the invention together with the description which serves to explain the principle of the embodiments. The invention is not limited to the embodiments.
In the drawings:
An embodiment of the present invention will now be described in detail in conjunction with the drawings. In the following description, some detailed descriptions of known functions and configurations may be omitted for conciseness.
An embodiment of the invention provides a method for classifying multimedia content with a plurality of criteria and wherein each of the plurality of criteria is attributed with a weighting value.
Next, the method of the embodiment of the present invention will be described in the context of classifying a movie with a plurality of weighted genres.
As shown in
Next, at step S102, it obtains information on the directors and the actors of the selected movie. The information on the directors and the actors of the selected movie can be obtained from the video catalog of the metadata source mentioned in the step S101, the website www.themoviedb.org in this example. It can be appreciated that normally the metadata provided by the metadata source will provide information on the director, actor list, release date, production country, genre attribution of the movie, and so on. In the example described above, information on the directors and actors of the movie “The Curious Case of Benjamin Button” can be obtained from the metadata provided by the website www.themoviedb.org. In a normal case like this example, a movie only has one director. But it can be appreciated that the relevant information about the additional director(s) can also be obtained from the metadata if a movie has more than one director. Normally a movie will be acted by a plurality of actors. It can be appreciated that in the step S102 it is not necessary to obtain the information of the total casting including background actors of the movie. It can obtain only the major actors of the movie for purpose of reducing the amount of operation of the following steps. For the movie “The Curious Case of Benjamin Button” in this example, it can obtain from the video catalog of the website www.themoviedb.org that the movie is directed by David Fincher and the major actors are Cate Blanchett, Brad Pitt and Tilda Swinton.
At step S103, it obtains, for each of the determined directors and actors, the relevant movies which are directed by the director or in which the actor is acting and the genre attribution for the relevant movies from the metadata source. In this example, a two dimensional table can be used to record and analyze said relevant movies to obtain the genre attribution of the relevant movies. Each table has in row the list of relevant movies which were directed by a director or in which an actor is playing and in column the list of genres provided by the metadata source, that is, the website www.themoviedb.org. The value of a cell (movie ‘i’, genre ‘j’) of the table is set “1” when the genre ‘j’ is indicated as genre of the movie ‘i’ by the metadata source.
It can be appreciated that, for each director and actor determined in the step S102, a table described above is generated.
At the next step S104, it calculates, for each director and actor, a global distribution of genres according to the above occurrences of the genres in the table. It can be appreciated that, when all relevant movies in the tables generated in the step S103 have been filled, a number of occurrences of the genre of relevant movies can be obtained for each genre. Based on this information, a global distribution of genres for each director or actor can be calculated. As shown in
At the next step S105, it attributes weighted genres to the selected movie according to the statistical result of the global distribution of genres of the determined directors and actors.
The selected movie can be attributed with the same genres provided by the metadata source to this movie or with an extended list of available genres.
In this example, the website www.themoviedb.org classifies the movie “The Curious Case of Benjamin Button” with five genres, which are Drama, Fantasy, Mystery, Romance and Thriller. The method of the embodiment of the invention can attribute these five genres to this movie, with each genre assigned a weighting value according to the statistical result of the global distribution of genres of the determined directors and actors in the relevant movies.
The attribution of genres to the selected movie can also be based on an extended list of genres over the five genres which are attributed to the movie “The Curious Case of Benjamin Button” by the website www.themoviedb.org. As shown in the first row from the bottom of the table shown in
In the above example, each actor has the same weight for determining weighted genres according to the statistical result of the global distribution of genres of the determined directors and actors in the relevant movies. However, in another example, a weight can be assigned to each actor according to his/her presence time in the movie for determining the weighted genres according to the statistical result of the global distribution of genres of the actor. The weight assigned to an actor can be in directly proportional to the presence time in the movie.
As described above, an embodiment of the present invention provide a method for classifying a movie with a plurality of weighted genres. The genres and their weights are determined by a statistical calculation of the genre distribution of the directors and actors of the selected movie in the relevant movies.
The weighted genres attributed to a movie according to the embodiment of the invention can be used to compute a distance between two movies. Based on the distance, similar movies within a catalog can be determined, which are those with the smaller distance between them compare to all the other ones.
Below is an example for computing the distance.
Let:
The distance between these two movies can be determined by the following formula: Distance=Sum of the gaps in absolute value on each of the genre.
That is
Preferably, the Euclidean distance will give a better result in the respect, as below:
As shown in
(x[genre(i), movie 1]−x[genre(i), movie 2])2
The row of distance (the sum of squared deviations) shows the result of the computation of:
The row of absolute value of the difference shows the result of the computation of:
|x[genre(i), movie 1]−x[genre(i), movie 2]|
The row of distance (the sum of the differences in absolute value) shows the result of the computation of:
An embodiment of the invention provides a corresponding apparatus for implementing the method for classifying a multimedia file as described above. Generally, the apparatus comprises means for obtaining a feature of the multimedia file. As described above, the multimedia file can be a movie and the feature obtained can be the directors and actors of the movie.
With the user interface 830, a user can select a first multimedia file for the classification from a metadata source. As described in the method according to the embodiment of the invention, the first multimedia file can be a movie. A movie can be selected for example from a VOD catalog provider or a website which references movies.
The processor 810 comprises a first unit for obtaining at least one feature of the multimedia file. In the context of a movie, the at least one feature of the movie is for example the information on the directors and actors of the movie. The above information can also be obtained from the above metadata source. In one example, the result can be provided to the user by the user interface 830.
The processor 810 further comprises a second unit for determining at least one second multimedia file which is relevant to the first multimedia file according to the obtained at least one feature. In the context of a movie, the second multimedia file can be a relevant movie of the given movie which is directed by the directors or acted by the actors of the given movie.
The apparatus further comprises a third unit for attributing at least one parameter for classification of the first multimedia file to the first multimedia file. The at least one parameter is weighted as a statistical function of the occurrence of the at least one parameter for the at least one second multimedia file. In this example, the parameter can be a genre attributed to a movie for classifying the movie. A genre is weighted as a statistical function of its distribution for the at least one relevant movies.
An embodiment of the invention provides a computer program product downloadable from a communication network and/or recorded on a medium readable by computer and/or executable by a processor, comprising program code instructions for implementing the steps of the method described above.
An embodiment of the invention provides a non-transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of a method described above.
The invention can be implemented on a server which for example provides on-line multimedia service to user over Internet or local broadband network. The invention can classify multimedia content with more than one criteria and also considering the level of relevancy of the criteria to the multimedia content. The criteria for the classification and their level of relevancy to the multimedia can be determined according to the metadata from an independent metadata source, for example an online video website. However, the classification according to the invention will somehow adapt or improve the initial classification provided by the independent metadata source.
It is to be understood that the present invention may be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof. Moreover, the software is preferably implemented as an application program tangibly embodied on a program storage device. The application program may be uploaded to, and executed by, a machine comprising any suitable architecture. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units (CPU), a random access memory (RAM), and input/output (I/O) interface(s). The computer platform also includes an operating system and microinstruction code. The various processes and functions described herein may either be part of the microinstruction code or part of the application program (or a combination thereof), which is executed via the operating system. In addition, various other peripheral devices may be connected to the computer platform such as an additional data storage device and a printing device.
It is to be further understood that, because some of the constituent system components and method steps depicted in the accompanying figures are preferably implemented in software, the actual connections between the system components (or the process steps) may differ depending upon the manner in which the present invention is programmed. Given the teachings herein, one of ordinary skill in the related art will be able to contemplate these and similar implementations or configurations of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
13306709.0 | Dec 2013 | EP | regional |