The invention relates to digital watermarking.
Digital watermarking is a process for modifying physical or electronic media to embed a hidden machine-readable code into the media. The media may be modified such that the embedded code is imperceptible or nearly imperceptible to the user, yet may be detected through an automated detection process. Most commonly, digital watermarking is applied to media signals such as images, audio signals, and video signals. However, it may also be applied to other types of media objects, including documents (e.g., through line, word or character shifting), software, multi-dimensional graphics models, and surface textures of objects.
Digital watermarking systems typically have two primary components: an encoder that embeds the watermark in a host media signal, and a decoder that detects and reads the embedded watermark from a signal suspected of containing a watermark (a suspect signal). The encoder embeds a watermark by subtly altering the host media signal. The reading component analyzes a suspect signal to detect whether a watermark is present. In applications where the watermark encodes information, the reader extracts this information from the detected watermark.
Several particular watermarking techniques have been developed. The reader is presumed to be familiar with the literature in this field. Particular techniques for embedding and detecting imperceptible watermarks in media signals are detailed in the assignee's co-pending application Ser. No. 09/503,881 and U.S. Pat. No. 6,122,403, which are hereby incorporated by reference.
These watermarking techniques can be applied to embed auxiliary data in the compressed domain of multimedia signals. One challenge, however, is that some compression methods use a technique called “motion compensation” that may interfere with the watermark signal or create artifacts in the host signal in which the watermark is embedded. Motion compensation refers to a process often used in compression where a signal from a particular frame is predicted from one or more other frames. This technique enhances compression efficiency because the compressed version need only include a “key” frame (called the I frame), and predicted frames, coded as the difference between the frame and a predicted version of that frame.
“Drift compensation” refers to a method that prevents accumulation of visual distortions due to additive watermarking in compression schemes with motion compensation. Unless compensated properly, watermark error propagates both temporally and spatially as a result of motion compensation. Propagation of watermark signal to the consecutive frames has two potential disadvantages: it degrades the visual quality, and may interfere (most probably deconstructive interference) with the consecutive watermark signals reducing detection performance.
The invention provides a methods, systems and devices for enhancing watermark embedding in time varying signals. One particular implementation applies to digital watermarking of compressed video stream in the MPEG compression format. However, the invention is not limited to a particular compression format. For example, the method may be applied to watermark embedding of video in the spatial domain and uncompressed video.
One aspect of the invention is a system of placing a watermark in a video stream. The system estimates motion between frames in the video stream, and computes a representative motion for a frame. The system spatially adjusts a digital watermark by the representative motion before embedding it in the frame. This approach is particularly suited for embedding a watermark in a video stream compressed using motion estimation. The system can use the motion data in the video stream to compute a dominant motion for a frame, and then shift the watermark by this dominant motion before embedding it in the frame. Additional aspects of the invention include methods executed by programmable computing devices, and device implementations.
Further features will become apparent with reference to the following detailed description and accompanying drawings.
This section describes a method for placing a watermark in compressed video. The following assumptions are made as a context for this method:
In general, a watermarking process with drift compensation may be represented by the following equations. Superscript denotes the frame number. The letter I refers to an intra-coded Video Object Plane (VOP), and P refers to an inter-coded VOP with forward prediction.
I
k
w
=I
k
+W
k
P
k+1
w
=P
k+1
+W
k+1
−D
k,
Where Dk is the drift compensation component, which is a motion compensated version of the cumulative error induced on the previous frame, i.e. the difference between decoded original and watermarked frames.
After drift compensation, each watermarked and reconstructed frame will only differ from the original reconstructed frame by the watermark amount.
Rk=Ik
R
k+1
=R
k
MC
+P
k+1
=I
k
MC
+P
k+1
R
k
=I
k
+W
k
R
k+1
=R
k
MC
+P
k+1
+W
k+1
−D
k
=RkMC+WkMC+Pk+1+Wk+1−WkMC
=IkMC+Pk+1+Wk+1
Note that, for each P-VOP the modification includes adding a signal with two components, namely Wk+1 and Dk. In scenes where there is a dominant motion, motion compensation in Dk translates to a spatial shift. Under assumption (3), we can say that Wk+1 and Dk are uncorrelated, even if Wk=Wk+1. As a result, the variance of the sum is expected to be larger than the variance of Wk+1. Under assumption (5), this translates to a higher increase in bit-rate. Note that, if Wk=Wk+1 and there is no motion, i.e. Dk=Wk, then the additive modification to the signal is zero. Therefore, there is no need for additional bits.
Above, we established the relation between the bit-rate increase due to watermarking and the drift compensation. The method explained here provides an improvement in terms of bit-rate and/or watermark detection, when there is a dominant motion in the sequence, e.g., panning.
As detailed below, our particular implementation spatially shifts the watermark signal in synchronization with the dominant motion in the sequence. When the watermark signal, or part of it, is constant (Assumption.4), the proposed method minimizes the variance of the additive modification signal, Wk+1−Dk. As a result, the same watermark detection strength is achieved with minimum increase in the bit-rate of the compressed video. Equivalently, this may translate to better detection for bit-rate constrained applications.
As an additional advantage, the proposed method improves the detection of the watermark in reconstructed B-VOP's, even if those frames (or the error residuals) have not been watermarked. In essence, we prevent possible de-constructive interference from two reference frames on the B-VOP.
We would like to determine a global spatial shift value that will coincide with the motion of the most number of blocks of the frame. The compressed domain implementation has a unique advantage at this point. Motion vectors for each block (macro-block) of the frame may be obtained by simply decoding the motion vector VLC's, without employing a costly motion estimation algorithm. Under this observation, pseudo-code of the method is given below. This method is also illustrated in
The method described above predicts how the watermark in the previous frame will propagate to the current frame. In order to achieve this we have used the motion vectors for the image blocks, and weigh them equally. We implicitly assume that same amount of watermark has been added to all blocks. When and if a local scaling is used for the watermark, the method may be implemented so as to weigh different image blocks with respect to the watermark energy they carry.
Having described and illustrated the principles of the technology with reference to specific implementations, it will be recognized that the technology can be implemented in many other, different, forms. To provide a comprehensive disclosure without unduly lengthening the specification, applicants incorporate by reference the patents and patent applications referenced above.
The methods, processes, and systems described above may be implemented in hardware, software or a combination of hardware and software. For example, the auxiliary data encoding processes may be implemented in a programmable computer or a special purpose digital circuit. Similarly, auxiliary data decoding may be implemented in software, firmware, hardware, or combinations of software, firmware and hardware. The methods and processes described above may be implemented in programs executed from a system's memory (a computer readable medium, such as an electronic, optical or magnetic storage device).
The particular combinations of elements and features in the above-detailed embodiments are exemplary only; the interchanging and substitution of these teachings with other teachings in this and the incorporated-by-reference patents/applications are also contemplated.
This application is a continuation of U.S. application Ser. No. 10/300,921, filed Nov. 19, 2002, which claims benefit of U.S. Provisional Application 60/404,038 filed Aug. 15, 2002, which is hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
60404038 | Aug 2002 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10300921 | Nov 2002 | US |
Child | 12543414 | US |