The present invention relates to method and apparatus for generating a sequence of a plurality of images. In particular it relates to method and apparatus for generating a sequence of a plurality of images to be displayed whilst accompanied by an audio item.
The price of the storage devices has significantly dropped in the last few years. As a result, users have collections of thousands of images (photographs) which are tedious and difficult to browse and view. This has resulted in an increasing demand for new and different ways to present such images.
Sharing memorable moments with friends and family has shifted from more traditional albums and photo frames to digital media, such as personal computers, television sets and digital photo frames which present their own difficulties. People tend to take a lot of similar pictures of the same objects so they can ensure that there will be one with the right lighting, colours and composition. However, with the low price of storage devices, they rarely seem to delete the redundant photos. So that, the former pleasurable activity of sharing memories with others now has turned into silent watching of endless monotonous slide shows.
There has, therefore, been an increasing demand for delivering more engaging presentations which combine music and photos allowing consumers to once again enjoy the experience of photo viewing alone or with family and friends.
Many systems have been developed to combine music and image presentation. In particular, changing the images according to the beat of the music as disclosed, for example, by US20070101355. However, this system does not necessarily provide a visually pleasing display.
The present invention seeks to provide a display of images which is visually more pleasurable.
This is achieved according to a first aspect by a method of generating a sequence of a plurality of images to be displayed whilst accompanied by an audio item, the method comprising the steps of: extracting at least one feature of an audio item; extracting at least one feature of each of a plurality of images; and determining the next image to generate a sequence of selected ones of the plurality of images to be displayed whilst accompanied by the audio item on the basis of the extracted at least one feature of the audio item and on the basis of the extracted at least one feature of the image.
This is also achieved by a second aspect by apparatus for generating a sequence of a plurality of images to be displayed whilst accompanied by an audio item, the apparatus comprising: a first extractor for extracting at least one feature of an audio item; a second extractor for extracting at least one feature of each of a plurality of images; a processor for determining the next image to generate a sequence of selected ones of the plurality of images to be displayed whilst accompanied by the audio item on the basis of the extracted at least one feature of the audio item and on the basis of the extracted at least one feature of the image.
In this way, both characteristics of the audio and content of each image are taken into account to generate the sequence of images to be displayed whilst accompanied by the audio items, providing a more pleasurable viewing experience.
Further, the duration of display of each image of the sequence of the selected set of the plurality of images (304_1 to 304—n, 306_1, 306—n) may be determined on the basis of the extracted at least one feature of the audio.
In an embodiment a slideshow may be created according to the pace of music. The choice of photo view time and/or which photo to display next is carried out based on a combination of a numerical measure of music pace and a numerical representation of the distance or similarity between photos and/or groups of photos. For example, in the case of fast paced music, images may be chosen to be very different from each other while if the music is slow, images may be chosen to be similar. As a result, very similar images are clustered to present smoother transitions of the images which may also be displayed longer to compliment slow-paced music and, further, present a sequence of dissimilar images at a faster pace of music. Consequently, a natural flow of view of the images is created that follows the music rhythm.
For a more complete understanding of the present invention, reference is made to the following description in conjunction with the accompanying drawings, in which:
With reference to
The apparatus of the embodiment of the present invention is shown in
The first storage device 101 is connected to a first extractor 105. The second storage device 103 is connected to a second extractor 107. The outputs of the first and second extractors 105, 107 are connected to respective inputs of a processor 109. The output of the processor is connected to a display 111 such as a computer monitor, display of a handheld device, projector screen, television, digital photo frame etc. The first storage device 101 is connected to a loudspeaker 113.
Operation of the apparatus of
The first extractor 105 extracts at least one feature from the retrieved audio item, step 209, such as tempo (number of beats per minute), rhythm (beat's structure), rhythm change or melody, for example to determine the pace of the audio item.
The second extractor 107 extracts at least one feature from the retrieved images, step, 203, such as colour, texture, capture time, capture date, capture location, presence and identity of faces using known facial recognition techniques. A distance measure between each image is computed, step 205. This distance measure is a measure of the similarity and reflects how similar or related images are and can be based on one or a combination of the extracted feature(s).
A set (303, 305) of a plurality of images (304_1 to 304—n, 306_1 to 306—n) is then selected, step 211, on the basis of the extracted features of the audio item and the images. This may, of course, result in all the images being selected.
During the display of a sequence of the retrieved images, for example, during a slideshow (or when preparing a slideshow offline), for each image, the duration of display of each image is determined and which image to show next in the sequence is determined by the processor 109, steps 213, 215.
The display duration of each image is determined as to the amount of time the image is shown on the screen and is short for a fast pace audio item and longer for slow paced audio item.
In order to determine the next image within the sequence to be shown, images that are significantly different (dissimilar—e.g., within a large distance) 306_1 to 306—n are selected for fast paced music e.g. the extracted pace is above a threshold, as shown, for example, in the group 305 of
In addition to the basic system, different transitions can be used within the slideshow. For example, when the music is fast paced, abrupt transitions between two photos can be used. If the music is slow-paced, slow dissolves between photos can be used instead.
A further embodiment can include predefined mood sets (e.g. happy, relaxing, emotional, festive, etc.) where both the music and the images are trying to convey a certain mood. For example classical music and landscape pictures can be in a relaxing set, while jazz and pictures with a lot of faces and can be in an emotional set.
Although embodiments of the present invention have been illustrated in the accompanying drawings and described in the foregoing detailed description, it will be understood that the invention is not limited to the embodiments disclosed, but is capable of numerous modifications without departing from the scope of the invention as set out in the following claims.
‘Means’, as will be apparent to a person skilled in the art, are meant to include any hardware (such as separate or integrated circuits or electronic elements) or software (such as programs or parts of programs) which reproduce in operation or are designed to reproduce a specified function, be it solely or in conjunction with other functions, be it in isolation or in co-operation with other elements. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the apparatus claim enumerating several means, several of these means can be embodied by one and the same item of hardware. ‘Computer program product’ is to be understood to mean any software product stored on a computer-readable medium, such as a floppy disk, downloadable via a network, such as the Internet, or marketable in any other manner.
Number | Date | Country | Kind |
---|---|---|---|
08165964.1 | Oct 2008 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB09/54234 | 9/28/2009 | WO | 00 | 4/5/2011 |