Watermark detection

Description

FIELD OF THE INVENTION

The invention relates to a method of detecting a watermark in an information signal that has possibly been watermarked by modifying values of said information signal in accordance with (temporally or spatially) corresponding values of a watermark pattern. The invention also relates to an arrangement for detecting a watermark.

BACKGROUND OF THE INVENTION

A prior art method as defined in the opening paragraph is disclosed in International Patent Application WO-A-98/03014. The watermark is detected by computing the correlation of the suspect information signal with an applied watermark pattern, and comparing the correlation with a predetermined threshold. If the correlation is larger than the threshold, the watermark is said to be present, otherwise it is said to be absent.

OBJECT AND SUMMARY OF THE INVENTION

It is an object of the invention to provide a suitable criterion for setting the threshold.

To this end, the invention provides a method of detecting a given watermark in an information signal, comprising the steps of: computing the correlation of said watermark and said information signal for a plurality of positions of said watermark with respect to said information signal; and detecting whether at least one of the respective correlation values exceeds a given threshold; characterized in that the method comprises the step of determining the standard deviation of the respective correlation values, wherein said given threshold is a given multiple of said standard deviation.

The invention exploits the insight that watermark detectors need to compute the correlation value for a plurality of (temporal or spatial) positions of the watermark with respect to the information signal (for example, an image) in practice, because the position of the watermark with respect to image is not absolutely known and/or information is embedded in shifts of one or more watermark patterns. The detection thus yields a series of correlation values, and it is the occurrence of relative peaks in such a series which is of interest rather than absolute correlation values.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1

shows schematically an arrangement for embedding a watermark in a signal.

FIGS. 2 and 3

show diagrams to illustrate the operation of the embedder which is shown in FIG.

1

.

FIG. 4

shows schematically an arrangement for detecting the embedded watermark.

FIGS. 5

,

6

A and

6

B show diagrams to illustrate the operation of the detector which is shown in FIG.

4

.

FIG. 7

shows a device for playing back a video bit stream with an embedded watermark.

FIG. 8

shows schematically a preferred embodiment of the arrangement for detecting the embedded watermark.

FIGS. 9A and 9B

show diagrams to illustrate the operation of the detector which is shown in FIG.

8

.

FIG. 10

shows schematically a further embodiment of the arrangement for detecting the embedded watermark.

DESCRIPTION OF PREFERRED EMBODIMENTS

For the sake of convenience, the watermarking scheme in accordance with the invention will be described as a system for attaching invisible labels to video contents but the teachings can obviously be applied to any other contents, including audio and multimedia. We will hereinafter often refer to this method as JAWS (Just Another Watermarking System).

FIG. 1

shows a practical embodiment of the watermark embedder to provide background information. The embedder comprises an image source

11

which generates an image P, and an adder

12

which adds a watermark W to the image P. The watermark W is a noise pattern having the same size as the image, e.g. N

1

pixels horizontally and N

2

pixels vertically. The watermark W represents a key K, i.e. a multi-bit code which is to be retrieved at the receiving end.

To avoid that the watermark detection process needs to search the watermark W over the large N

1

×N

2

space, the watermark is generated by repeating, and if necessary truncating, smaller units called “tiles” W(K) over the extent of the image. This “tiling” operation (

15

) is illustrated in FIG.

2

. The tiles W(K) have a fixed size M×M. The tile size M should not be too small: smaller M implies more symmetry in W(K) and therefore a larger security risk. On the other hand M should not be too large: a large value of M implies a large search space for the detector and therefore a large complexity. In JAWS we have chosen M=128 as a reasonable compromise.

Then, a local depth map or visibility mask λ(P) is computed (

16

). At each pixel position, λ(P) provides a measure for the visibility of additive noise. The map λ(P) is constructed to have an average value equal to 1. The extended sequence W(K) is subsequently modulated (

17

) with λ(P), i.e. the value of the tiled watermark W(K) at each position is multiplied by the visibility value of λ(P) at that position. The resulting noise sequence W(K,P) is therefore dependent on both the key K and the image content of P. We refer to W(K,P) as an adaptive watermark as it adapts to the image P.

Finally, the strength of the final watermark is determined by a global depth parameter d which provides a global scaling (

18

) of W(K,P). A large value of d corresponds to a robust but possibly visible watermark. A small value corresponds to an almost imperceptible but weak watermark. The actual choice of d will be a compromise between the robustness and perceptibility requirements. The watermarked image Q is obtained by adding (

12

) W=d×W(K,P) to P, rounding to integer pixel values and clipping to the allowed pixel value range.

In order to embed the multi-bit code K in the watermark W, every tile W(K) is built up from a limited set of uncorrelated basic or primitive tiles {W

1

. . . W

n

} and shifted versions thereof, in accordance with

W (K) = \sum_{i, j} s_{i_{j}} shift (W_{i}, k_{i_{j}})

where “shift(W

i

, k

i

j

)” represents a spatial shift of a basic M*M tile W

i

over a vector k

i

j

with cyclic wrap around. The signs sε{−1,+1} and the shifts k depend on the key K via an encoding function E (

13

). It is the task of the detector to reconstruct K after retrieving the signs s

i

and the shifts k

i

. Note that each basic tile W

i

may occur several times. In

FIG. 1

, the encoder

13

generates W(K)=W

1

+W

2

−W

2

′ where W

2

′ is a shifted version of W

2

.

FIG. 3

illustrates this operation.

FIG. 4

shows a schematic diagram of the watermark detector in accordance with the invention. The watermark detector receives possibly watermarked images Q. Watermark detection in JAWS is not done for every single frame, but for groups of frames. By accumulating (

21

) a number of frames the statistics of detection is improved and therefore also the reliability of detection. The accumulated frames are subsequently partitioned (

22

) into blocks of size M×M (M=128) and all the blocks are stacked (

23

) in a buffer q of size M×M. This operation is known as folding.

FIG. 5

illustrates this operation of folding.

The next step in the detection process is to assert the presence in buffer q of a particular noise pattern. To detect whether or not the buffer q includes a particular watermark pattern W, the buffer contents and said watermark pattern are subjected to correlation. Computing the correlation of a suspect information signal q with a watermark pattern w comprises computing the inner product d=<q,w> of the information signal values and the corresponding values of the watermark pattern. For a one-dimensional information signal q={q

n

} and watermark pattern W={W

n

}, this can be written in mathematical notation as:

d = \frac{1}{N} \sum_{n = 1}^{N} q_{n} w_{n},

For the two-dimensional M×M image q={q

ij

} and watermark pattern W={w

ij

}, the inner product is:

d = \frac{1}{M^{2}} \sum_{i = 1}^{M} \sum_{j = 1}^{M} q_{ij} w_{ij} .

In principle, the vector k

i

by which a tile W

i

has been shifted can be found by successively applying W

i

with different vectors k to the detector, and determining for which k the correlation is maximal. However, this brute force searching algorithm is time consuming. Moreover, the image Q may have undergone various forms of processing (such as translation or cropping) prior to the watermark detection, so that the detector does not know the spatial location of the basic watermark pattern W

i

with respect to the image Q.

Instead of brute force searching JAWS exploits the structure of the patterns W(K). The buffer q is examined for the presence of these primitive patterns, their signs and shifts. The correlation d

k

of an image q and a primitive pattern w being shifted by a vector k (k

x

pixels horizontally and k

y

pixels vertically is:

d_{k} = \frac{1}{M^{2}} \sum_{i = 1}^{M} \sum_{j = 1}^{M} q_{ij} w_{i + k_{x}, j + k_{y}} .

The correlation values d

k

for all possible shift vectors k of a basic pattern W

i

are simultaneously computed using the Fast Fourier transform. As shown in

FIG. 4

, both the contents of buffer q and the basic watermark pattern W

i

are subjected to a Fast Fourier Transform (FFT) in transform circuits

24

and

25

, respectively. These operations yield:

{circumflex over (q)}=FFT

(

q

)

and

ŵ=FFT

(

w

),

where {circumflex over (q)} and ŵ are sets of complex numbers.

Computing the correlation is similar to computing the convolution of q and the conjugate of W

i

. In the transform domain, this corresponds to:

{circumflex over (d)}={circumflex over (q)}{circle around (x)}

conj (

ŵ

)

where the symbol {circle around (x)} denotes pointwise multiplication and conj( ) denotes inverting the sign of the imaginary part of the argument. In

FIG. 4

, the conjugation of ŵ is carried out by a conjugation circuit

26

, and the pointwise multiplication is carried out by a multiplier

27

. The set of correlation values d={d

k

} is now obtained by inverse Fourier transforming the result of said multiplication:

d=IFFT

(

{circumflex over (d)}

)

which is carried out in

FIG. 4

by an inverse FFT circuit

28

. The watermark pattern W

i

is detected to be present if a correlation value d

k

is larger than a given threshold.

FIG. 6A

shows a graph of correlation values d

k

if the presence of watermark pattern W

1

(see

FIGS. 1 and 3

) in image Q is being checked. The peak

61

indicates that W

1

is indeed found. The position (

0

,

0

) of this peak indicates that the pattern W

1

applied to the detector happens to have the same spatial position with respect to the image Q as the pattern W

1

applied to the embedder.

FIG. 6B

shows the graph of correlation values if watermark pattern W

2

is applied to the detector. Two peaks are now found. The positive peak

62

at (

0

,

0

) denotes the presence of watermark W

2

, the negative peak

63

at (48,80) denotes the presence of watermark −W

2

′. The relative position of the latter peak

63

with respect to peak

62

(or, what is similar, peak

61

) reveals the relative position (in pixels) of W

2

′ with respect to W

2

, i.e. the shift vector k. The embedded data K is derived from the vectors thus found.

The embedded information may identify, for example, the copy-right holder or a description of the content. In DVD copy-protection, it allows material to be labeled as ‘copy once’, ‘never copy’, ‘no restriction’, ‘copy no more’, etc.

FIG. 7

shows a DVD drive for playing back an MPEG bitstream which is recorded on a disc

71

. The recorded signal is applied to an output terminal

73

via a switch

72

. The output terminal is connected to an external MPEG decoder and display device (not shown). It is assumed that the DVD drive may not play back video signals with a predetermined embedded watermark, unless other conditions are fulfilled which are not relevant to the invention. For example, watermarked signals may only be played back if the disc

71

includes a given “wobble” key. In order to detect the watermark, the DVD drive comprises a watermark detector

74

as described above. The detector receives the recorded signal and controls the switch

72

in response to whether or not the watermark is detected.

The Fourier coefficients d are complex numbers, that is, they have a real part and an imaginary part, or a magnitude and a phase. The inventors have found that the reliability of the detector is significantly improved if the magnitude information is thrown away and the phase is considered only.

FIG. 8

shows a preferred embodiment of the detector's correlation circuitry. The embodiment differs from the one shown in

FIG. 4

in that a magnitude normalization circuit

30

has been inserted between the multiplier

27

and the inverse Fourier Transform circuit

28

. The operation of the normalization circuit comprises pointwise dividing each coefficient by its magnitude. In mathematical notation:

{circumflex over (d)}:={circumflex over (d)}Φabs

(

{circumflex over (d)}

) (1)

where Φ denotes pointwise division and abs( ) denotes:

abs

(

{circumflex over (d)}

k

)=

R

(

{circumflex over (d)}

k

)

2

+I

(

{circumflex over (d)}

k

)

2

(2)

where R( ) and I( ) denote the real and imaginary part of the argument, respectively.

Said normalization of magnitudes is referred to as Symmetrical Phase Only Matched Filtering (SPOMF).

FIGS. 9A and 9B

illustrate the effect of SPOMF correlation. More particularly,

FIG. 9A

shows the correlation values d

k

when using linear correlation, i.e. without the magnitude normalization circuit

30

. The correlation value d

00

, expressed in units of standard deviation of the whole matrix, amounts to 9.79.

FIG. 9B

shows the correlation values when using SPOMF correlation. The correlation value d

00

is now 62.77 times the standard deviation. It will be appreciated that the peak in

FIG. 9B

can more reliably be detected than the peak in FIG.

9

A.

Because normalizing the magnitudes of {circumflex over (d)} is equivalent to normalizing the magnitudes of both {circumflex over (q)} and ŵ, the normalization circuit

30

in

FIG. 8

may be replaced by two normalization circuits after the FFT circuits

24

and

25

. However, the embedded watermark will already have a reasonably white (flat) frequency spectrum because it is a pseudo-random noise pattern in practice, in which each sample is independently and identically drawn from a normal distribution. In view hereof, normalizing the magnitude of the information signal only has been found to suffice.

FIG. 10

shows such an embodiment. The magnitude normalization circuit

30

is now located between the FFT circuit

24

and the multiplier

27

. In this embodiment, the magnitudes of {circumflex over (d)} are not exactly, but substantially, the same.

It should further be noted that the FFT and the conjugation of the applied watermark W

1

(c.f. circuits

25

and

26

, respectively, in

FIGS. 4

,

8

and

10

), as well as the optional normalization of the magnitudes of ŵ, can be pre-computed and stored in a memory.

This invention addresses the aspect of determining criteria for correlation values d

k

to be peaks. To this end we consider the decision variable d

k

as a stochastic variable. A way of formulating “largeness” of d

k

is by comparison to the standard deviation σ

d

of d. If a particular measurement d

k

is larger than Tσ

d

for some suitably chosen threshold T, then we say that d

k

is an statistical outlier. This will be interpreted as the presence of a watermark.

Experiments have shown that d can be modeled to a very good approximation as a normal distribution. This holds both for linear correlations as well as for SPOMF correlations. This allows us to associate false positive rates to the threshold T. In particular the probability that a measurement d

k

is larger than Tσ

d

can be computed as erfc(T), where erfc is the error function

erfc (T) = \frac{1}{2 π} \int_{T}^{\infty} ⅇ^{\frac{- x^{2}}{2}} ⅆ x

A threshold value T=5 is generally considered to be safe and corresponds with a false alarm probability of 2.8×10

−7

.

Each of the values d

k

can be seen as the correlation of a fixed watermark {w

j

} with an image q

k

, where q

k

is a shifted version of q (with cyclic wrap around). Therefore we can view the matrix d

k

as a matrix of instantiations of the stochastic process d. Experiments have shown that d can be modeled to a very good approximation as a normal distribution. This holds both for linear correlations as well as for SPOMF correlations. This allows us to associate false positive rates to the threshold T. In particular the probability that a measurement d

k

is larger than Tσ

d

can be computed as erfc(T), where erfc is the error function

erfc (T) = \frac{1}{2 π} \int_{T}^{\infty} ⅇ^{\frac{- x^{2}}{2}} ⅆ x

A threshold value T=5 is generally considered to be safe and corresponds with a false alarm probability of 2.8×10

−7

. The actual false alarm rate is orders of magnitudes lower due to the way watermark information K is encoded as a combination of signs and peak positions. The probability that for T=5 a legal combination of peaks and signs occurs by chance is vanishingly small.

The actual false alarm rate is orders of magnitudes lower due to the way watermark information K is encoded as a combination of signs and peak positions. The probability that for T=5 a legal combination of peaks and signs occurs by chance is vanishingly small.

For linear correlations as described above, it can be shown that σ

d

can be directly expressed in terms of the standard deviation σ

q

of the image q and the standard deviation σ

w

of the watermark w:

σ_{d} \approx \frac{1}{M} σ_{q} σ_{w}

In practice, this implies that σ

d

may be assumed to have a fixed predetermined value.

The value of σ

d

can also be estimated directly from the matrix {d

k

}, viz.

σ_{d} \approx \sqrt{\frac{1}{M^{2}} \sum_{k} d_{k}^{2}}

The latter method of computing σ

d

is particularly useful when the correlation values are computed using SPOMF because for SPOMF correlation there are no easy theoretical formulas for σ

d

. With SPOMF detection, peak heights in {d

k

} are compared with this measured standard deviation and judged relevant if larger than 5σ

d

.

In summary, a method and arrangement for detecting a watermark in an information signal is disclosed. The method comprises the steps of computing (24-28,30) the correlation (d

k

) of said watermark (W

i

) and said information signal (e.g. an image Q) for a plurality of positions (k) of said watermark with respect to said information signal, and detecting (

29

) whether at least one of the respective correlation values exceeds a given threshold. The step of detecting (

29

) comprises determining the standard deviation (σ

d

) of the respective correlation values (d

k

), and setting the threshold to a given multiple (T) of said standard deviation. The multiple (T) is derived form a desired false alarm rate (watermark detected when there is none, or no watermark detected when there is one).

Claims

1. A method of detecting a given watermark (w) in an information signal (q), comprising the steps of:computing a correlation (dk) of said watermark and said information signal for a plurality of positions (k) of said watermark with respect to said information signal; detecting whether at least one of the respective correlation values exceeds a given threshold; and determining the standard deviation (σd) of the respective correlation values (dk), wherein said given threshold is a given multiple (T) of said standard deviation.
2. A method as claimed in claim 1, comprising the step of computing said standard deviation (σd) from the correlation values (dk) in accordance with σd≈1G⁢∑k⁢dk2where G is the number of correlation values.
3. A method as claimed in claim 1, wherein said standard deviation is assumed to have a predetermined fixed value.
4. A method as claimed in claim 1, wherein said multiple (T) is derived from a desired false detection probability p in accordance with p=erfc(T), where erfc(T) is the error function erfc⁢(T)=12⁢π⁢∫T∞⁢ⅇ-x22⁢ⅆx.
5. An arrangement for detecting a given watermark (w) in an information signal (q), comprising:means for computing the correlation (dk) of said watermark and said information signal for a plurality of positions (k) of said watermark with respect to said information signal; means for detecting whether at least one of the respective correlation values exceeds a given threshold; and means for determining the standard deviation (σd) of the respective correlation values (dk), wherein said given threshold is a given multiple (T) of said standard deviation.
6. An arrangement as claimed in claim 5, comprising:means for computing said standard deviation (σd) from the correlation values (dk) in accordance with σd≈1G⁢∑k⁢dk2where G is the number of correlation values.
7. An arrangement as claimed in claim 5, wherein said standard deviation has a predetermined value.
8. An arrangement as claimed in claim 5, wherein said multiple (T) is derived from a desired false detection probability p in accordance with p=erfc(T), where erfc(T) is the error function erfc⁢(T)=12⁢π⁢∫T∞⁢ⅇ-x22⁢ⅆx.
9. A detector for detecting a given watermark (w) in an information signal (q), the detector comprising:a first device configured to compute the correlation (dk) of said watermark and said information signal for a plurality of positions (k) of said watermark with respect to said information signal; a threshold detector configured to detect if at least one of the respective correlation values exceeds a given threshold; and a second device configured to determine the standard deviation (σd) of the respective correlation values (dk), wherein said given threshold is derived from said standard deviation.
10. A detector as claimed in claim 9, comprising a third device configured to compute said standard deviation (σd) from the correlation values (dk) in accordance with σd≈1G⁢∑k⁢dk2where G is the number of correlation values.
11. A detector as claimed in claim 9, wherein said standard deviation has a predetermined value.
12. A detector as claimed in claim 9, wherein said multiple (T) is derived from a desired false detection probability p in accordance with p=erfc(T), where erfc(T) is the error function erfc⁢(T)=12⁢π⁢∫T∞⁢ⅇ-x22⁢ⅆx.

Priority Claims (1)

Number	Date	Country	Kind
98200656	Mar 1998	EP

PCT Information

Filing Document	Filing Date	Country	Kind
PCT/IB99/00358		WO	00

Publishing Document	Publishing Date	Country	Kind
WO99/45706	9/10/1999	WO	A

US Referenced Citations (6)

Number	Name	Date	Kind
4669344	Herrig	Jun 1987	A
5960081	Vynne et al.	Sep 1999	A
6198832	Maes et al.	Mar 2001	B1
6246775	Nakamura et al.	Jun 2001	B1
6252972	Linnartz	Jun 2001	B1
6341350	Miyahara et al.	Jan 2002	B1

Foreign Referenced Citations (2)

Number	Date	Country
WO9803014	Jun 1997	WO
WO-9912347	Mar 1999	WO

Non-Patent Literature Citations (2)

Entry
US 2002/0009208 A1, Alattar et al., Jan. 24, 2002, Authentication of Physical and Electronic Media Objects Using Digital Watermarks.*
US 2002/0015509 A1, Nakamura et al., Feb. 7, 2002, Method and Apparatus for Superposing a Digital Watermark and Method and Apparatus for Detecting a Digital Watermark.

Watermark detection

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications