The invention relates to watermarking of multimedia signals, and in particular to watermarking with sampled watermarks.
Digital watermarking is a technology that may be used for a variety of purposes, such as proof of copyright ownership, tracing of illegal copies, controlling copy control equipment, broadcast monitoring, authenticity verification, adding auxiliary information into multimedia signals, etc.
In consumer digital devices, like the CD, the nominal sampling frequency is 44.1 kHz. In designing audio watermarking algorithms this is the sample rate of choice. However, for high-end audio equipment one finds 48 kHz and higher sampling rates, and also lower sampling rates may be chosen for given purposes. At these rates (i.e. rates other than 44.1 kHz) optimization of the watermark done for a sample rate of 44.1 kHz may result in the watermark not being detected properly or the watermark channel not being optimally used.
A solution is to re-sample the input and output signals by a non-integer factor and use a high-quality band-bass filter. However, this extra computational overhead is quite expensive.
Another solution is to match the watermark sampled and optimized at a given frequency to another frequency includes zero-padding of the watermark, however such a method wastes watermark channel by carrying less information than possible.
The published US patent application 2003/0004589 discloses methods of embedding and detecting a watermark in an information signal which are robust for sample rate conversions. A method is disclosed where the watermark is embedded in the information signal sampled at a first sampling rate and where the watermark is to be detected at a second sampling rate. In order to provide a watermarking scheme which is robust against sample rate conversion, a watermark is generated which have special properties in the frequency domain. The disclosure is an example of the practice that watermarks typically are optimized for the sampling rate of the information signal into which it is to be embedded. Optimization of the watermark to a first sampling rate is a computational heavy task, application of the optimized watermark at a second sampling rate, typically requires re-optimization. There is therefore a need in the art for providing a solution other than straight-forward re-sampling or zero-padding for adapting a watermark already generated for a given sampling frequency for embedding and detection at a different sampling frequency.
The inventors of the present invention have had the insight that a watermark sampled at a first frequency can be matched to a signal of a second frequency, by approximate re-sampling using a number of integer re-scale factors. In general, the present invention seeks to provide an improved way of handling watermarks generated for a given sampling frequency to be embedded and/or detected at a different sampling frequency. Preferably, the invention alleviates, mitigates or eliminates one or more of the above or other disadvantages singly or in any combination.
According to a first aspect of the present invention there is provided, a method of matching a watermark sampled at a first sampling rate to multimedia host signal sampled at a second sampling rate, the method comprising:
receive the watermark sampled at the first sampling rate, the watermark being based on a number of watermark sequences, each watermark symbol of each watermark sequence being repeated by a first integer width;
determinate the scaling factor between the first sampling rate and the second sampling rate, and determine a first re-scale width of the watermark symbols so as to approximate the watermark sequences to the second sampling rate, and set at least two integer re-scale widths, wherein at least a second re-scale width being larger than or equal to the first re-scale width and at least a third re-scale width being smaller than or equal to the first re-scale width;
generate a modified watermark based on the number of watermark sequences, wherein the watermark symbols of the modified watermark being of either the at least second or third re-scale width, so as to substantially match the modified watermark sequences to the second sampling rate.
The invention is particularly but not exclusively advantageous for providing a solution of matching a watermark to a different sampling frequency than the sampling frequency to which it was generated. That is to transform a watermark obtained at a reference frequency to a target frequency. In the present invention, a method is proposed that combines the simplicity of matching the watermark pattern to the sampling frequency at embedding and transmitting the maximum watermark energy allowed at a given audio quality.
In an advantageous embodiment, a modified watermark window may be calculated so that a circular buffer of modified watermark sequences is generated. The circular buffer may be generated so that the number of sub-windows of the modified watermark window is the minimum number of sub-windows, under the constraint that a boundary error is minimized. By generating a circular buffer, accumulation of errors of the modified watermark sequences is avoided, so that the modified watermark sequences may be repeated indefinitely. By applying a minimal buffer the embedding process is rendered less complex, since the smallest buffer is applied.
In advantageous embodiments, the modified sequence of watermark symbols is convoluted with a window shaping function. The convolution is performed so as to form a smoothly varying signal, in addition the width and/or order of the symbols of the modified sequence and the offset of the watermark window's sub-windows or window shaping function may advantageously be chosen under the constraint that a boundary error is minimized. The boundary error may be the error obtained at a sub-window boundary, such as at a local maximum, when comparing the modified watermark, or modified watermark window, with the watermark, or watermark window, obtained with direct re-sampling.
In a second aspect of the invention, an apparatus for matching a watermark sampled at a first sampling rate to multimedia host signal sampled at a second sampling rate, the apparatus comprising:
a receiver unit for receiving the watermark sampled at the first sampling rate, the watermark being based on a number of watermark sequences, each watermark symbol of each watermark sequence being repeated by a first integer width;
a determination unit for determining the scaling factor between the first sampling rate and the second sampling rate, and determine a first re-scale width of the watermark symbols so as to approximate the watermark sequences to the second sampling rate, and set at least two integer re-scale widths, wherein at least a second re-scale width being larger than or equal to the first re-scale width and at least a third re-scale width being smaller than or equal to the first re-scale width;
a modifier unit for generating a modified watermark based on the number of watermark sequences, wherein the watermark symbols of the modified watermark being of either the at least second or third re-scale width, so as to substantially match the modified watermark sequences to the second sampling rate.
In a third aspect a watermark host signal is provided, where the watermark comprise a number of watermark sequences, wherein the watermark symbols being of either an at least second or third re-scale width, so as to substantially match the watermark sequences to the sampling rate of the host signal.
In a fourth aspect of the invention is provided a computer readable code for implementing the first aspect of the invention.
The invention in accordance with the various aspects may in general be used for sample-rate dependent signal processing to synchronize between transmitted signal and carrier by scaling transmitted signal to a given target rate of the carrier.
In general the various aspects of the invention may be combined and coupled in any way possible within the scope of the invention. These and other aspects, features and/or advantages of the invention will be apparent from and elucidated with reference to the embodiments described hereinafter.
Embodiments of the invention will be described, by way of example only, with reference to the drawings, in which
The generation, embedding and detection of a watermark into a multimedia signal, may be done at a number of ways. The published patent applications WO 03/083858, WO 03/083860 and WO 05/029466 disclose such methods, and they are hereby incorporated by reference. In the present invention, a watermark sampled at a first sampling rate is matched to a multimedia host signal sampled at a second sampling rate. Having matched the watermark to the sample frequency of the multimedia signal, the matched watermark may be embedded into the multimedia signal by a known embedding technique, e.g. as disclosed by the three mentioned published patent applications. The watermark may be embedded in continuation of the matching process at the same location and possible by the same equipment, however, the matched watermark may also be transmitted via a communication line, such as the Internet or other computer network or via a record carrier, for later implementation at another site.
A watermark window may be provided based on a given ordering and construction of the reference sequence and the one or more shifted sequences.
A watermark window in the context of this application corresponds to a sequence of partially superposed sub-windows (in
Also in
In an embodiment, let the sequence of
A re-sampled watermark window (or a modified watermark window) in accordance with embodiments of the present invention is schematically illustrated in
The re-sampling of the watermark window is explained in connection with
In a first step 41 the watermark sampled at the first sampling rate is received or accessed.
In a next step 42, a scaling factor between the first sampling rate and the second sampling rate is determined, here being 1.088, resulting in a single scaling factor or width, referred to as the first re-scale width, of the watermark symbols, here being 8.707, so as to match the watermark sequences to the second sampling rate. Applying this scaling width would result in the watermark window as shown in
In a next step 43, a modified watermark is generated, so that the corresponding watermark symbols of the modified watermark being of either the second or third re-scale widths, so as to substantially match the watermark sequences to the second sampling rate.
In an embodiment, the modified watermark is calculated so that a circular buffer of modified watermark sequences is generated. The total number of sequences in the modified watermark sequences may be provided, such that the total number is the minimum number of sequences needed to provide a circular buffer under the constraint that the errors obtained at boundaries, e.g. sub-window local maxima, are minimized.
Moreover, the modified sequence of watermark symbols may be convoluted with a window shaping function so as to form a smoothly varying signal. The width of the window shaping function is adapted to the width of the symbols of the modified watermark sequence.
And even further, the window shaping function for at least some of the symbols of the modified watermark sequence may be offset by an integer value. The offset may in an embodiment be in the range of the integral of half the smaller re-scale width, incremented by 1 or decreased by 1.
The sequence of modified watermark symbols which represents the minimum number of elements in order to provide a circular buffer, may be provided under the constraint that boundary errors are minimized at local window maxima, by properly choosing the order of the second and third re-scale widths and by properly choosing the presence and order of offsets of window shaping functions.
In an embodiment, the watermark sequence of a circular buffer is generated by using a repeating method. The result for the watermark window of
In a first step, the width is set to 9, since the error made at the window maximum 32 is smaller for a width of 9 than for a width of 8, as compared to the corresponding window maximum of the re-sampled version 21. To minimize the error made at the next window maximum 33 as compared to the re-sampled version 22, one may chose an offset of 4 or 5 and a width of 8 or 9. The minimum error is for an offset of 4 (as indicated by reference numeral 34) and a width of 9, for the next window, an offset of 9 and a width of 8 is found. In principle, boundary errors may be minimized at any given boundary along a window, window maxima are chosen since, after the application of a window shaping function, the watermark energy is maximum at the window maxima, thus the probability of detecting the watermark is maximal there, and the best conditions for ensuring proper detection is typically provided by minimizing errors at window maxima.
The window offsets, widths and errors may be calculated by the following C-code resulting in the values as shown in TABLE 1.
The C-code for generating the numbers of TABLE 1 is the following:
The code is not generalized to all conditions, however the skilled person is able to adapt the code for a specific condition if necessary.
TABLE 1 shows the sequence number, i, the width, W, of the window function, the offset, o, and the error, e, made at window maxima. The first 14 sub-windows of TABLE 1 are shown in
The error is always limited to a maximum of plus or minus ¼ of a sample.
The sequence repeats indefinitely without accumulation errors if a minimum of number of windows is taken into account. This number is given by the reference frequency divided by the great common divider between the reference and the target frequencies.
These windows are stored in memory. Conversely, one can store only the two base windows and the list of widths and offsets. Another option could be to run the given algorithm to find out the current window width and offset.
The modified watermark sequence obtained would for the watermark of
The window shaping function may have an anti-symmetric temporal behavior or a bi-phase behavior. The bi-phase window may comprise at least to Hanning windows of opposite polarities. The use of such window shaping functions may offer improved performance, both with respect to audibility and robustness as disclosed in the published patent applications WO 03/083858, WO 03/083860 and WO 05/029466.
In an initialization process, a watermark sampled at a first sampling frequency is filled into a watermark payload buffer 50, so that a watermark sequence w[f0] including the payload is generated 51, f0 referring to the first sampling frequency. The watermark w[f0] is frequency-matched and stored in a watermark payload buffer 52 by application of the method as explained in connection with the
In the embodiment illustrated in
The buffer 52 is filled with each of e.g. 1,024 watermark symbols for each sequence repeated a number of times (the respective shaping window length) for as many sub-windows as the minimum given in the description, say 147. Resulting in about 61,000 values for 48 kHz. If memory can be a problem, one may prefer to calculate the respective watermark value on the fly with the given C-code, and one can reduce the circular buffer to 1,024 times the number of unique sequences (1, 3 or 7).
To this end, the generation of the modified watermark signal may comprise generating a number of circularly shifted sequences of symbols, the sequences circularly shifted with respect to a non-shifted sequence and generating the modified watermark signal by adding the values of the shifted sequences. That is in a similar way as a payload may be embedded into the watermark at the first frequency.
A more detailed description of embedding a watermark into a multimedia signal can be found in the published patent applications WO 03/083858, WO 03/083860 and WO 05/029466. In those disclosures only a reference sequence and a single shifted sequence are disclosed. However, the skilled person would be able to extend the disclosure to the one as presented here, in connection with the figures.
The watermark may be detected and the payload extracted in a process including the steps of receiving the multimedia signal that may potentially be watermarked by a watermark signal modifying the host multimedia signal. An estimate of the watermark may be extracted from the received signal, and the estimate may be processed with a respect to a reference version of the watermark so as to determine whether the received signal is watermarked. The processing may include a correlation processing. Again, a more detailed description of performing the tasks may be found in the published patent applications WO 03/083858, WO 03/083860 and WO 05/029466.
In
A re-sampled watermark window at 32 kHz with either 40 or 41 samples is illustrated in
The invention can be implemented in any suitable form including hardware, software, firmware or any combination of these. The invention or some features of the invention can be implemented as computer software running on one or more data processors and/or digital signal processors. The elements and components of an embodiment of the invention may be physically, functionally and logically implemented in any suitable way. Indeed, the functionality may be implemented in a single unit, in a plurality of units or as part of other functional units. As such, the invention may be implemented in a single unit, or may be physically and functionally distributed between different units and processors.
While the above embodiments have been described with reference to an audio signal, it will be appreciated that the present invention can be applied to other types of signal, for instance video and data signals.
In summary, the invention deals with matching of a watermark to a host sampling rate of a multimedia signal. A watermark sampled at a first sampling rate is matched to multimedia host signal sampled at a second sampling rate, in a process where the watermark sampled at the first sampling rate is received, a scaling factor between the first sampling rate and the second sampling rate is determined, and re-scale widths of the watermark symbols are set. A modified watermark is generated wherein the watermark symbols of the modified watermark being of re-scale widths, so as to substantially match the modified watermark sequences to the second sampling rate.
Although the present invention has been described in connection with the specified embodiments, it is not intended to be limited to the specific form set forth herein. Rather, the scope of the present invention is limited only by the accompanying claims. In the claims, the term “comprising” does not exclude the presence of other elements or steps. Additionally, although individual features may be included in different claims, these may possibly be advantageously combined, and the inclusion in different claims does not imply that a combination of features is not feasible and/or advantageous. In addition, singular references do not exclude a plurality. Thus, references to “a”, “an”, “first”, “second” etc. do not preclude a plurality. Furthermore, reference signs in the claims shall not be construed as limiting the scope.
Number | Date | Country | Kind |
---|---|---|---|
06125886.9 | Dec 2006 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2007/054960 | 12/7/2007 | WO | 00 | 6/8/2009 |