The present invention relates to methods of detecting watermarks and thereby determining a measure of scale factor; in particular, but not exclusively, the invention concerns a method of detecting one or more watermarks in sequences of images where the one or more watermarks are changed in the sequence, for example shifted and/or mirrored and/or rotated, in some of the images of the sequence relative to other images thereof Moreover, the invention also relates to watermark detectors capable of detecting the presence of such relatively changed watermarks. Furthermore, the invention also relates to programme content including one or more relatively changed watermarks, for example shifted and/or mirrored and/rotated watermarks in the content. Yet further, the invention concerns software for executing on computing hardware for implementing the aforesaid methods of detecting watermarks.
Watermarks are often included in a subtle manner in audio-video programme content either as a mechanism of validating authenticity of the content or for forensic purposes of detecting unauthorized pirating or copying of the content. In order for the watermarks not to perceptibly degrade or distort programme content when embedded thereinto, the watermarks are only faintly added to the content whose composition can vary considerable from image to image therein. In consequence, it is established practice to accumulate watermark information over numerous images of a sequence in one or more buffers and then apply correlation techniques using one or more expected watermark templates in order to confirm presence or alternatively absence of the watermarks in the audio-video content.
It is known that spatial correlation based watermark detection is extremely difficult, if not virtually impossible, to implement unless an original scale of the video content is known. Many contemporary watermarking schemes employ watermark patterns embedded in video content such that the watermark patterns are repeated in a tiled manner, namely tiled according to a known spatial reference grid repeatedly throughout images in video content. It is conventional practice to retrieve a measure of the original scale by correlating adjacent watermark tiles extracted from the video content; the measure of original scale factor corresponds to a position of a correlation peak thereby derived.
Watermark schemes using tiled watermarks that are conventionally employed have several disadvantages as follows:
A particular problem encountered by the inventors concerns a watermark embedder which is operable to tile a 128 element×128 element watermark pattern over video fields. A watermark detector compatible with the embedder is capable of retrieving a measure of horizontal scale factor by correlating two horizontally adjacent 128 element×128 element watermark tiles; similarly, the detector is capable of retrieving a measure of vertical scale factor by correlating two vertically adjacent such watermark tiles. The measure of the scale factor is determined from the position of highest correlation peaks in a first row and column of an associated correlation field.
The detector applying the above approach to scale factor determination is capable of coping with full-D1 watermarked video. However, in practice, video programme content data conveyed by DVD's as data carriers is often subjected to scaling down and subsequent conversion to corresponding data in DIVX format. Such processing can pose problems for watermark detectors for reliably determining occurrence of watermark tiles in the converted DIVX data. For example, original video programme content includes a sequence of images in wide screen format, namely having an aspect ratio of 16:9 (width: height) corresponding to dimensions of 720×480 elements. The video content subsequently has its black borders removed so as to generate cropped video content where images then have a format of 720×280 elements (width: height). Sub-sampling is then applied to the cropped video content to yield image frame dimensions of 520×200 elements in cropped sub-sampled video content. As a result, the sub-sampled images each comprise two sub-fields of 512×100 elements. If the original video content includes two vertically adjacent watermark tiles, the cropped sub-sampled video content presented to a watermark detector expecting one or more 128×128 element watermark tiles results in the detector trying to determine a measure of scale factor based on correlating two 128×50 element tiles; in consequence, the detector's watermark detection performance is drastically reduced which causes considerable difficulties in determining a measure of vertical scale factor. If the detector is unable to determine vertical scale factor correctly, it is then virtually impossible for it to detect a data payload associated with the watermark included in the original video content. This data payload includes, for example, information indicative of routes of distribution, this information being useable for forensic tracking.
In a Japanese patent application no. JP2000151984, it is proposed to alternately embed in audio-video content an original watermark and a shifted version of that watermark; the shifted watermark is translated by a known amount relative to the original watermark. Subsequently, watermark detectors arranged to correlate a test watermark corresponding to the original watermark and the shifted watermark are able to detect the watermark shift in the audio-video content. If the content has been scaled, the shift vector will have been altered by a similar scale factor. Thus, detection of the shift enables a scale factor to which the audio-video content has been scaled to be determined.
In order to address the aforementioned problems encountered with tile watermark detection, the inventors have developed the present invention. The present invention is found by the inventors to function especially well for wide-screen DIVX movies.
An object of the present invention is to provide an improved method of detecting watermarks and thereby deriving a measure of scale factor.
According to a first aspect of the present invention, there is provided a method of determining a measure of scale factor (MSF) by detecting a watermark included in a sequence of images wherein the watermark is included in several mutually changed forms in the sequence, the method comprising steps of:
The invention is of advantage in that it is capable of providing a more accurate and/or reliable measure of scale factor in comparison to contemporary approaches.
Preferably, in the method, the one or more changes detected include at least one of: a translation change, a mirror-transposition change, and a rotation change. Although the method is found to function well for watermarks subject to transposition corresponding to lateral translation, these other forms of transposition are beneficially alternatively or additionally utilized in the method, especially mirror transposition.
Preferably, in the method, the analysis in step (d) includes steps of:
Such processing enables image information corresponding to un-watermarked programme content to be distinguished from watermark information on account of these being more clearly mutually distinguished in the spatial format. Preferably, the method further includes a step of normalizing the processed data prior to converting the processed data to the spatial format.
Preferably, the method includes steps of subjecting contents of the buffers to high-pass spatial filtration prior to converting them to corresponding data sets. Such high-pass filtration is of benefit in that it is capable of attenuating image content substantially irrelevant for purposes of watermark detection.
Preferably, the method includes steps of:
Preferably, the method further comprises a step of cropping the processed data in spatial format to remove spurious peripheral correlation arising from correlation of image programme content prior to identifying the one or more peaks for determining the measure of scale factor (MSF). Such cropping is capable of enhancing reliability of watermark identification and thereby enhancing accuracy and/or reliability of watermark detection.
Preferably, the method is adapted for determining measures of scale factor (MSF) in a plurality of mutually different image directions. Application of the method in more than one direction is relevant when determining scale factor of 2-dimensional images which have been subject to mutually different scale-factor change in substantially vertical and horizontal image directions.
Preferably, in the method, the one or more changes applied to the watermark in the sequence includes a translation in at least one direction of magnitude s arranged to be less than half a width L of the watermark in the at least one direction, such that the measure of scale factor is calculated substantially from z=(Hp/s) where Hp is a determined position of a peak of correspondence. Alternatively, in the method, the one or more changes applied to the sequence of images includes a translation in at least one direction of magnitudes arranged to be more than half a width L of the watermark in the at least one direction, such that the measure of scale factor is calculated substantially from z=(Hp−L)/(s−L) where Hp is a determined position of a peak of correspondence.
According to a second aspect of the invention, there is provided a watermark detector for determining a measure of scale factor (MSF) by detecting a watermark in a sequence of images wherein the watermark is included in several mutually changed forms in the sequence, the detector including:
According to a third aspect of the present invention, there is provided video data comprising programme content including a sequence of images, wherein the images alternate between corresponding groups depending on one or more relative changes applied to one or more watermarks included in the images, the one or more changes being useable for determining a measure of scale factor (MSF) for the sequence.
Preferably, in the video data, the one or more relative changes includes at least one of: a translation change, a mirror-transposition change, and a rotation change.
Preferably, the video data is stored on a data carrier.
According to a fourth aspect of the invention, there is provided software executable in computer hardware for implementing the method according to the first aspect of the invention.
It will be appreciated that features of the invention are susceptible to being combined in any combination without departing from the scope of the invention.
Embodiments of the invention will now be described, by way of example only, with reference to the following diagrams wherein:
FIGS. 4 to 10 are illustrations of transformed watermark patterns for correlation using the process of
In overview, the present invention is concerned with temporally changing certain geometrical watermark properties in video programme content comprising a sequence of images grouped in time slots for enabling a watermark detector to process images from different time slots and analyze them for deriving a measure of scale factor. For example, a watermark embedder is beneficially arranged to embed a standard watermark pattern in an original position in the first 600 image frames in the sequence of images. In 600 subsequent image frames in the sequence, the embedder is operable to embed the watermark in a transposed format, for example mirrored and/or spatially translated and/or rotated. In a next 600 image frames, the embedder is arranged to embed the watermark in the original position and so on in a repetitive alternating manner as depicted in
In
Alternating a normal and a changed version of the watermark provides sufficient information for enabling a watermark detector processing the sequence 10 to identify the watermark embedded therein, for example for determining measures of scale factor, provided that the detector knows the magnitude of the change applied, for example how many pixels the original watermark was shifted to generate the changed watermark. The detector is preferably able to determine one or more vectors associated with the change.
In circumstances where the change corresponds to shifting the watermark in an alternating manner from time slot to time slot, the detector collects an arbitrary part of the sequence 10, for example it extracts data corresponding to a 128×128 element tile from frame images in a time slot and accumulates them in a first buffer. Subsequently, the detector continues such extraction for image frames in a next time slot and accumulates them in a second buffer. The detector then correlates accumulated data in the first and second buffers, for example using a SPOMF procedure, and outputs correlation results to a correlation buffer; a peak corresponding to greatest correlation is identified by the detector by analyzing correlation results from the correlation buffer. From this peak of greatest correlation, a measure of scale factor is then derivable using the detector.
In overview, the present invention has potential application in at least one of the following technical fields:
The present invention is especially pertinent to determining a measure of scale factor in upcoming HD video content format. On account of such a high quality format, watermark patterns are only lightly embedded into HD video content to preserve outstanding HD quality. However, when lightly watermarked HD video content is subjected to a long processing path, namely subjecting to numerous transformation operations, it is desirable that lightly embedded watermark information is still detectable in processed HD video content output from the path. An example of a long processing path involves steps of:
In order to further elucidate the present invention, embodiments of the invention will now be described.
A detector for determining a measure of scale factor from watermarks included in the sequence 10 will now be described with reference to
The detector is preferably implemented in computer hardware executing software to perform functions as depicted in
Operation of the process depicted in
In the first buffer 130, the process accumulates watermark tile data from the window 118 in a first 600 video fields, namely 300 image frames. The window 118 can have any size, depending upon requirements; however, the process of
Accumulated data from the first and second buffers 130, 180 are subject in the filter functions 140, 190 respectively to spatial high-pass filtration and thereafter to a transformation to the spatial frequency domain. Output from the Fourier function 150 is subject to generation of corresponding complex conjugates in the conjugate function 160.
The high pass functions 140, 190 are preferably implemented using a 3×3 matrix filter having coefficients as provided in Equation 1 (Eq. 1):
A correlation of contents of the buffer 130 with contents of the buffer 180 is preferably implemented using a SPOMF approach which includes the normalization function 220 therein. Correlation in the point-wise multiplication function 210 is preferably implemented using a SPOMF approach which involves point-wise multiplication. Results of this point-wise multiplication are normalized in the normalization function 220 followed by translation from the spatial frequency domain to corresponding spatial information which is cropped to remove spurious edge artefacts before being analyzed in the function 250 to determine spatial position of highest correlation peak as described earlier.
Normalization in the function 220 for an input array z of complex values is achieved by applying Equations 2 and 3 (Eq. 2, 3) to replace complex values in the array with normalized equivalents:
Regarding the cropping function 240, the inventors identified during experiments when devising the present invention that high spurious peaks occurred at borders of correlation array output from the inverse transform function 230. These spurious peaks arise as a result of strong correlation of underlying video content and are not generated by any consequence of scale factor change. Since these spurious peaks potentially mask in magnitude correlation peaks relating to watermark content of interest, these spurious peaks are set to zero value, namely they are subjected to a cropping operation. In other words, correlation values corresponding to elements of the window 118 close to its borders are effectively set to zero. For example, first and last five rows and columns of a 128×128 element output correlation field from the point-wise multiplication function 210 are set to zero.
A highest correlation peak identified by the function 250 is used to determine a measure of scale factor (MSF) as illustrated by several one-dimensional examples in FIGS. 4 to 10.
In
If the patterns 400, 410 are both spatially scaled by a zoom factor z, for example due to reformatting the watermarked video content in a temporally alternating manner as depicted in
As a further example, there is shown in
In
In
In
In
In
From FIGS. 4 to 10, it is observed for the re-scaled patterns that one or two correlation peaks are obtained:
One solution to achieve such a single correlation peak is to select the shift s pursuant to Equation 4 (Eq. 4):
L−s·z>>s·z−L·z+L Eq. 4
wherein L is the size of the watermark, namely 128 elements as depicted in FIGS. 4 to 10. From Equation 4, it will be appreciated for this solution that the shift s is preferably much less than L/2, for preferable s<L/4. A scale factor z can then be derived using Equation 5 (Eq. 5):
wherein Hp is the position of the highest correlation peak generated in the point-wise multiplication function 210.
Another solution to achieve such a single correlation peak is to select the shift s pursuant to Equation 6 (Eq. 6):
L×s·z<<s·z−L·z+L Eq. 6
wherein L is the size of the watermark, namely 128 elements as depicted in FIGS. 4 to 10. From Equation 4, it will be appreciated for this solution that the shift s is preferably much greater than L/2. A scale factor z can then be derived using Equation 7 (Eq. 7):
In implementing watermarking of video content so as to be optimally compatible with the process depicted in
These conditions (a)-(c) potentially mutually contradict and often a compromise is required. Conveniently, for L=128 elements, for example pixels, a practical compromise is:
Similar considerations pertain to mirror-transposed watermarks and rotation transposed watermarks.
The inventors have further appreciated that use of alternating watermark patterns as depicted in
In the accompanying claims, numerals and other symbols included within brackets are included to assist understanding of the claims and are not intended to limit the scope of the claims an any way.
It will be appreciated that embodiments of the invention described in the foregoing are susceptible to being modified without departing from the scope of the invention as defined by the accompanying claims.
Expressions such as “comprise”, “include”, “incorporate”, “contain”, “is” and “have” are to be construed in a non-exclusive manner when interpreting the description and its associated claims, namely construed to allow for other items or components which are not explicitly defined also to be present. Reference to the singular is also to be construed in be a reference to the plural and vice versa.
Number | Date | Country | Kind |
---|---|---|---|
04102007.4 | May 2004 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB05/51453 | 5/4/2005 | WO | 11/6/2006 |