The present invention generally relates to the field of providing additional data in a media signal and more particularly to methods, devices, a signal and an information storage medium related to embedding of additional data in a media signal.
With the evolution of the Internet it has become possible to access or retrieve a virtually limitless amount of informational content. This content is provided by different content providers in the form of media signals of varying shapes and forms. Media signals can for instance be provided as audio signals, image signals or video signals, each in either compressed or uncompressed form. In order to prevent media content from being unlawfully obtained by persons not entitled to it, or illegal copies of content from being made, content owners need to protect their content. To do this they often need to provide additional information in the media signals. Additional information can also be provided for other reasons, for instance to provide text in relation to a piece of audio (e.g., lyrics).
One field of use where additional data is provided in media signals is Digital Rights Management (DRM), where additional data in the form of watermarks is used to indicate the origin of media content, and possibly the user, in order to inhibit unlawful tampering with the media content.
The possibility of correct and effective watermark detection depends heavily on the method used for embedding the data into the host signal and on the properties of this signal. One frequently used type of watermark embedding is so-called multiplicative watermarking, where the media signal to be watermarked is multiplied with the watermark in question. A media signal normally has many different frequency components, but sometimes it has only a few. When the components are few it can be hard to detect a watermark that has been embedded using multiplicative watermarking.
International patent application WO-A-02/15587 describes how additional data, like a watermark, is added to a media signal. The signal is here described in relation to a sine wave. A binary code is added to the signal in a high frequency band through either adding noise or not adding noise in this high frequency band. Upon detection, the sequence of digits (i.e., zeroes and ones) obtained represents (a coded version of) the watermark information. The document thus describes a technique for additive watermarking, which is not applicable in a multiplicative watermarking environment. Besides, since the additional information is only provided in a high frequency band, which can easily be filtered away using a simple low-pass filter, it is fragile and therefore not suitable when robustness is an important condition.
In a more robust, multiplicative watermarking scheme, a plurality of circular shifted chip sequences of real numbers is multiplied with a properly scaled version of the media signal and added back to the original media signal. Upon detection, the distances between the diverse correlation peaks carry (a coded version of) the watermark information. If the host signal contains few frequency components, the correlation will be weak. There is thus a need for enabling a higher level of detectability for additional data that has to be embedded in a media signal with few frequency components using a multiplicative embedding technique.
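The known multiplicative scheme outlined above can be illustrated with a minimal sketch. It is not taken from any cited document and rests on several assumptions: the host is represented by spectral magnitudes, the chip sequence consists of hypothetical +/-1 values, the scaling factor `alpha` is 0.2, and the payload is the distance between two circular shifts. The function names are illustrative only. The sketch compares the correlation peak obtained for a host with many frequency components against a host with only three.

```python
import numpy as np

def embed(host, chips, shifts, alpha=0.2):
    """Add alpha * host * (sum of circularly shifted chip sequences) to the host."""
    watermark = sum(np.roll(chips, s) for s in shifts)
    return host + alpha * host * watermark

def correlate(signal, chips):
    """Correlate the signal with every circular shift of the chip sequence."""
    return np.array([signal @ np.roll(chips, s) for s in range(len(chips))])

def peak_distance(corr):
    """Circular distance between the two largest correlation values."""
    a, b = (int(i) for i in np.argsort(corr)[-2:])
    d = abs(a - b)
    return min(d, len(corr) - d)

def peak_strength(corr, shift):
    """Height of corr[shift] above the mean, in standard deviations."""
    return (corr[shift] - corr.mean()) / corr.std()

rng = np.random.default_rng(0)
n = 2048
chips = rng.choice([-1.0, 1.0], size=n)   # assumed +/-1 chip sequence
shifts = (100, 137)                        # assumed payload: shift distance 37

# Dense host: spectral magnitudes of white noise (many frequency components).
dense = np.abs(np.fft.rfft(rng.standard_normal(2 * n)))[:n]
# Sparse host: the same total magnitude concentrated in three components.
sparse = np.zeros(n)
sparse[[50, 300, 700]] = dense.sum() / 3

corr_dense = correlate(embed(dense, chips, shifts), chips)
corr_sparse = correlate(embed(sparse, chips, shifts), chips)

recovered_distance = peak_distance(corr_dense)
strength_dense = peak_strength(corr_dense, shifts[0])
strength_sparse = peak_strength(corr_sparse, shifts[0])
```

Under these assumptions the dense host yields two clear peaks whose distance returns the payload, whereas for the sparse host the value at the embedded shift does not stand out from the values at other shifts.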
It is thus an object of the present invention to provide multiplicative embedding of additional data in a media signal that is more robust (i.e., has a higher level of detectability of the additional data), especially in sections of the media signal that have few frequency components.
According to a first aspect of the present invention, this objective is achieved by a method of embedding additional data in a media signal comprising the steps of:
obtaining a media signal,
mixing at least one section of said media signal with a noise signal for providing a modified media signal, and
combining said additional data with said modified media signal for providing a first host modifying media signal.
According to a second aspect of the present invention, this objective is also achieved by a device for embedding additional data in a media signal comprising:
a first adding unit for mixing at least one section of said media signal with a noise signal in order to provide a modified media signal, and
a combiner unit for combining said additional data with said modified media signal for providing a first host modifying media signal.
According to a third aspect of the present invention, this objective is furthermore achieved by a media signal comprising:
at least one section of modified media signal comprising media signal mixed with a noise signal, where additional data has been combined with this modified media signal.
According to a fourth aspect of the present invention, this objective is also achieved by an information storage medium comprising:
a media signal including at least one section with modified media signal comprising:
media signal mixed with a noise signal,
where additional data has been combined with this modified media signal.
The present invention is furthermore directed towards providing a technique for (automatically) switching between the media signal and a modified version of the media signal, in order to selectively enhance the detectability of information that is multiplicatively embedded in this new host signal.
According to a fifth aspect of the present invention, this objective is achieved by a method of embedding additional data in a media signal comprising the steps of:
obtaining a media signal,
analysing the media signal,
mixing at least one section of said media signal with a noise signal for providing a modified media signal, and
combining, for different sections of the media signal, said additional data with said modified media signal for providing a first host modifying media signal or with said media signal in dependence of the analysis.
According to a sixth aspect of the present invention, this objective is also achieved by a device for embedding additional data in a media signal comprising:
a first adding unit for mixing at least one section of said media signal with a noise signal in order to provide a modified media signal,
a combiner unit for combining said additional data with said modified media signal for providing a first host modifying media signal or with said media signal, and
an analysing unit arranged to analyse said media signal and control, for different sections of said media signal, the provision of said media signal mixed with noise or said media signal to the combiner unit in dependence of the analysis.
Claims 2 and 16 are directed towards performing the combining using multiplication.
Claims 5 and 17 are directed towards shaping the noise signal based on a model of human perception. This has the advantage of making sure that the added noise is not perceptible.
Claims 6 and 18 are directed towards shaping also the modified media signal that is combined with said additional data with a signal shaping function based on a model of human perception. This has the advantage of making sure that both the added noise and the embedded watermark are not perceptible.
Claims 8, 9, 10, 20, 21 and 22 are directed towards scaling the added noise, adding the media signal to the modified media signal that is combined with said additional data and adding the unscaled noise signal to the media signal that is combined with the additional data. This has the advantage of providing a more predictable control mechanism for the embedding of additional data.
Claims 12 and 23 are directed towards analysing the media signal and combining the additional data with sections of the media signal or the media signal mixed with noise in dependence of the analysis.
The present invention has the advantage of providing better detectability of additional data when it is embedded in a media signal having few frequency components, e.g. highly tonal signals like excerpts of pitch-pipe or harpsichord. With the invention it is for instance possible to embed a more easily detectable watermark in a modified media signal compared with an ordinary media signal having these properties. Because of this higher level of detectability the additional data remains detectable even if the quality of the media signal is degraded, i.e. the probability of a correct detection has increased. It is then easier to perform for instance forensic tracking of a processed media signal.
The general idea behind the invention is thus to mix a media signal with a noise signal and combine the additional data with the media signal that has been modified in this way.
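This general idea can be sketched as follows. The sketch is a simplified illustration under stated assumptions, not the embodiment itself: the signals are represented by spectral magnitudes, the chip sequence is a hypothetical +/-1 sequence, the combination is taken to be `host + alpha * host * chips`, and the noise level is chosen for a clear single-frame demonstration rather than for imperceptibility. All names and parameter values are illustrative.

```python
import numpy as np

def embed(host, chips, shift, alpha=0.2):
    """Multiplicative embedding: host + alpha * host * shifted chips."""
    return host + alpha * host * np.roll(chips, shift)

def peak_strength(signal, chips, shift):
    """Correlation value at the embedded shift, in standard deviations."""
    corr = np.array([signal @ np.roll(chips, s) for s in range(len(chips))])
    return (corr[shift] - corr.mean()) / corr.std()

rng = np.random.default_rng(1)
n, shift = 2048, 321
chips = rng.choice([-1.0, 1.0], size=n)   # assumed +/-1 chip sequence

# Media signal with few frequency components: three spectral lines.
media = np.zeros(n)
media[[40, 80, 120]] = 1.0

# Noise floor (assumed level: mean magnitude of 0.1 per component).
noise = np.abs(np.fft.rfft(rng.standard_normal(2 * n)))[:n]
noise *= 0.1 / noise.mean()

# Step 1: mix the media signal with the noise signal.
modified = media + noise

# Step 2: combine the additional data with the plain and the modified signal.
plain = peak_strength(embed(media, chips, shift), chips, shift)
improved = peak_strength(embed(modified, chips, shift), chips, shift)
```

In this sketch `improved` is considerably larger than `plain`, because the noise gives the chip sequence many more components to modulate.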
These and other aspects of the invention will be apparent from and elucidated with reference to the embodiments described hereinafter.
The present invention will now be explained in more detail in relation to the enclosed drawings, where
The present invention relates to the field of providing additional data in media signals having a sparse frequency content in at least parts of the signal. In the field of audio such signals can include the sound from instruments like harpsichord and pitch pipe. The invention is however not limited to audio but can be applied to other media signals, for instance video or digital images. The additional data is preferably provided in the form of a watermark. It should however be realised that the invention is not limited to watermarks; the additional data can be any additional data that needs to be detected in a media signal, for instance additional text in relation to a song.
However, also signals that have many different frequency components may benefit from this type of embedding, especially by insertion of noise shaping in the higher frequency range. This will not significantly improve robustness of the watermark, but for unprocessed watermarked audio it may yield significantly better detection reliabilities.
The above described frequency domain combiner unit can be modified in many ways. It is for instance possible to remove the branch including the amplifying unit and also to remove the scaling unit, although this would degrade the signal quality.
The above described combiner units are just examples of multiplicative combiner units that can be used in the present invention. It should be realised that many other types of multiplicative combiner units can be used instead.
The thus described watermarking technique shown in
A block schematic of a device for performing embedding of a watermark into a media signal according to a second embodiment of the invention is shown in
It is possible to further vary the device according to the invention by also including a second signal shaping unit using a signal shaping function M2, which is also based on information from the filter control unit 38. A device according to this third embodiment is shown in a block schematic in
It is possible to vary the function used. As an alternative, when the media signal is an audio signal, a so-called threshold-in-quiet (TQ) function can be used instead of the functions M1 and/or M2 above. In this case the noise is pre-filtered such that it falls below the hearing threshold. Similar functions can be used for image signals and/or video.
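A threshold-in-quiet pre-filter of this kind could, for instance, be sketched as below. The curve is a widely used analytic approximation of the hearing threshold attributed to Terhardt; its use here is an assumption, as are the calibration of digital full scale to 96 dB SPL, the 6 dB safety margin and the 20 dB SPL ceiling that keeps the very high and very low frequency components bounded. The sketch sets the level of each frequency component individually, whereas a complete psychoacoustic model would integrate energy over critical bands.

```python
import numpy as np

def threshold_in_quiet_db(freq_hz):
    """Approximate hearing threshold in quiet, in dB SPL (Terhardt's formula)."""
    # Frequency in kHz, clamped at 20 Hz so the curve stays finite at DC.
    f = np.maximum(np.asarray(freq_hz, dtype=float), 20.0) / 1000.0
    return 3.64 * f ** -0.8 - 6.5 * np.exp(-0.6 * (f - 3.3) ** 2) + 1e-3 * f ** 4

def shape_noise_below_threshold(noise, sample_rate, full_scale_db_spl=96.0,
                                margin_db=6.0, ceiling_db_spl=20.0):
    """Keep the random phases of the noise, set each component below the threshold."""
    n = len(noise)
    spectrum = np.fft.rfft(noise)
    freqs = np.fft.rfftfreq(n, d=1.0 / sample_rate)
    # Target level per component: threshold (capped by the ceiling) minus a margin.
    level_db = np.minimum(threshold_in_quiet_db(freqs), ceiling_db_spl) - margin_db
    amplitude = 10.0 ** ((level_db - full_scale_db_spl) / 20.0)
    phase = spectrum / np.maximum(np.abs(spectrum), 1e-12)
    # A sinusoid of amplitude A has an rfft bin magnitude of A * n / 2.
    return np.fft.irfft(phase * amplitude * n / 2.0, n=n)

rng = np.random.default_rng(2)
sample_rate = 44100
white = rng.standard_normal(4096)
shaped = shape_noise_below_threshold(white, sample_rate)
```

The resulting noise carries least energy around 3 to 4 kHz, where the approximated threshold is lowest, and more towards the low and high ends of the spectrum.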
The device and method according to the third embodiment of the invention shown in
As mentioned above the noise signal is added for enabling safer detection of the watermark when the host or media signal has few frequency components, which can be sound frequency components when the signal is an audio signal or spatial frequency components when the signal is an image signal. An audio signal is however seldom made up only of spectrally sparse sounds; often it has few frequency components in just some passages or sections of a piece of music. There may therefore be no need to use the above-described embodiments of the invention on a whole media signal, but only on some pieces or sections of it. There is thus a need to be able to embed a watermark according to the above-described embodiments of the invention as well as according to known principles, depending on the properties of the media signal.
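One conceivable way of deciding, section by section, whether the noise-mixed host should be used is sketched below. The criterion is an assumption made for illustration only: the spectral flatness (geometric mean divided by arithmetic mean of the power spectrum) of each section is compared with a hypothetical threshold of 0.1, and sections below the threshold are treated as having few frequency components. The section length and function names are likewise illustrative.

```python
import numpy as np

def spectral_flatness(frame):
    """Geometric mean over arithmetic mean of the power spectrum (0..1)."""
    window = np.hanning(len(frame))
    power = np.abs(np.fft.rfft(frame * window)) ** 2 + 1e-12  # avoid log(0)
    return np.exp(np.mean(np.log(power))) / np.mean(power)

def sections_needing_noise(signal, frame_len=1024, threshold=0.1):
    """Return one flag per section: True if the section is spectrally sparse."""
    n_frames = len(signal) // frame_len
    frames = signal[: n_frames * frame_len].reshape(n_frames, frame_len)
    return np.array([spectral_flatness(f) < threshold for f in frames])

rng = np.random.default_rng(3)
t = np.arange(4096) / 44100.0
tonal = np.sin(2 * np.pi * 440.0 * t)      # passage with a single partial
broadband = rng.standard_normal(4096)      # passage with many components
signal = np.concatenate([tonal, broadband])

use_modified_host = sections_needing_noise(signal)
```

Here the four tonal sections are flagged for the noise-mixed host and the four broadband sections are left to be watermarked according to known principles.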
It should be realised that the switching does not have to be soft or graceful, although this is preferred. In case no soft switching is performed, it might be sufficient to only provide one switch, which either connects the modified host signal, or the unmodified host signal to the watermark combiner unit 14. When a single switch is used it is furthermore possible to provide it in any position which achieves the proper switching of the signals, like for instance before the first adding unit 12.
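Soft switching can be thought of as a cross-fade between the unmodified and the modified host signal. The following sketch shows one possible realisation and is not the switching arrangement of the embodiment: the per-sample decision is smoothed with a moving average of assumed length `ramp_len`, and the resulting gain weights the two signals. Hard switching corresponds to using the unsmoothed decision as the gain.

```python
import numpy as np

def soft_switch_gain(use_modified, ramp_len):
    """Smooth a boolean per-sample decision into a gain between 0 and 1."""
    kernel = np.ones(ramp_len) / ramp_len
    gain = np.convolve(use_modified.astype(float), kernel, mode="same")
    return np.clip(gain, 0.0, 1.0)

def host_for_combiner(media, noise, gain):
    """Cross-fade between the media signal and the media signal mixed with noise."""
    modified = media + noise
    return gain * modified + (1.0 - gain) * media

rng = np.random.default_rng(4)
n = 1000
media = np.sin(2 * np.pi * 0.01 * np.arange(n))
noise = 0.05 * rng.standard_normal(n)

use_modified = np.zeros(n, dtype=bool)
use_modified[400:600] = True               # section judged to be sparse

gain = soft_switch_gain(use_modified, ramp_len=100)
host = host_for_combiner(media, noise, gain)
```

Since the modified signal is the media signal plus noise, the cross-fade is equivalent to fading the noise in and out with the gain, which avoids abrupt changes at section boundaries.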
The output signal y can be provided on a storage medium, of which one 72 in the form of a CD disc is shown in
There has thus been described a device and a method for multiplicatively embedding additional data in a media signal when the media signal has few frequency components. With the invention it is possible to embed a watermark in such a media signal that is easier to detect than a watermark embedded in an ordinary media signal having these properties. The second embodiment makes sure that the added noise is not perceptible and the third embodiment makes sure that both the added noise and the embedded watermark are not perceptible. The fourth embodiment has the advantage of providing a more predictable control mechanism for the embedding of a watermark. A higher level of detectability furthermore has the advantage that the additional data remains detectable even if the quality of the media signal is degraded. It is then easier to perform for instance copy control or forensic tracking of a processed media signal.
The invention can be varied in many ways. It is for instance possible to make the noise signal include data, for example by letting one random sequence represent a “zero” and another represent a “one”. In this way additive and multiplicative watermarks can be integrated into a single system. As mentioned before, the watermark can be embedded in the time domain as well as in the frequency domain, and the media signal can be any type of media signal, such as an audio, video or image signal. In the case of audio it can be uncompressed audio such as PCM. The invention can however also be applied to compressed media, which in the case of audio can be an MP3 bitstream. The noise then has to be appropriately converted to the bitstream. Therefore the present invention is only to be limited by the following claims.
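The variation in which the noise signal itself carries data can be illustrated with a short sketch. It is a simplified example under stated assumptions: the two random sequences are Gaussian sequences generated from hypothetical seeds, one bit is carried per section of 8192 samples, the noise gain is 0.1, and the reader decides each bit by comparing the correlations with the two sequences. Only the additive part is shown; the multiplicative embedding is left out.

```python
import numpy as np

SECTION_LEN = 8192  # assumed number of samples carrying one bit

def noise_sequences(seed_zero=10, seed_one=11):
    """Two fixed random sequences, one representing "zero" and one "one"."""
    seq_zero = np.random.default_rng(seed_zero).standard_normal(SECTION_LEN)
    seq_one = np.random.default_rng(seed_one).standard_normal(SECTION_LEN)
    return seq_zero, seq_one

def data_bearing_noise(bits, gain=0.1):
    """Build the noise signal by selecting one sequence per bit."""
    seq_zero, seq_one = noise_sequences()
    return gain * np.concatenate([seq_one if b else seq_zero for b in bits])

def read_bits(signal):
    """Decide each bit by comparing the correlations with both sequences."""
    seq_zero, seq_one = noise_sequences()
    sections = signal.reshape(-1, SECTION_LEN)
    return [int(s @ seq_one > s @ seq_zero) for s in sections]

bits = [1, 0, 1, 1, 0, 0, 1, 0]
t = np.arange(len(bits) * SECTION_LEN)
media = np.sin(2 * np.pi * 440.0 * t / 44100.0)   # tonal media signal

modified = media + data_bearing_noise(bits)
decoded = read_bits(modified)
```

With these settings the decoded bits match the embedded ones, while the same noise remains available as a carrier for the multiplicative watermark.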
Number | Date | Country | Kind |
---|---|---|---|
03101792.4 | Jun 2003 | EP | regional |

Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB04/50906 | 6/15/2004 | WO | | 12/14/2005 |