The present invention relates to embedding watermarks in CABAC (Context-based Adaptive Binary Arithmetic Coding) video streams.
Today, the demand for digital watermarking as an antipiracy technology is strong. To make it more difficult for pirates to circumvent watermarks it is important for many potential watermarks to be proposed and used. However, it is important for watermarks to not interfere with the intended viewing experience for the intended audience. As such, a need exists for more efficient watermarking techniques. As such, a goal of this invention is to generate a list of possible changes generally associated with watermarking that are CABAC (Context-based Adaptive Binary Arithmetic Coding)/AVC (Advanced Video Coding) compliant, and yet do not create visible artifacts, thereby ultimately providing an efficacious method for embedding watermarks in a CABAC video stream.
A method comprises accessing potential changes that can be changeable syntax elements in a coded data stream, which can be a coded video stream; determining detectability and/or visibility of the changes to an observer, prior to applying the changes; determining recoverability of the changes for a watermarking detector, prior to applying the changes; determining embedibility of the changes for the coding means which can include compliance with standards or various rules or constraint, prior to applying the changes; and generating a list of changes meeting threshold criteria for detectability and recoverability. The method can comprise determining a feature vector for the changeable syntax elements, wherein the feature vector is a function of the detectability, recoverability, and embedibility, a limit can be set for the feature vector, and only changes meeting the limit for the feature vector are added to the list. The method can further comprise establishing balance criterion between said detectability, recoverability, and embedibility and adding only the changes meeting the balance criterion. The method can also include accessing the video data that is divided into blocks and characterizing at least one block by at least one of the following: luminance, before and after applying changes, mean square error between the video before and after applying changes, block pixel variance, before and after applying changes, or blockiness. The method can further include combinations in which propagation maps for the changes prior to applying the changes are made and the propagation maps are used for the selection of the changes to add to the list. The selection criterion can include size of the propagation map; maximum luminance change of all blocks in the propagation map; maximum mean square error of all blocks in the propagation map; and number of blocks in the propagation path that satisfy detectability criteria.
An additional method comprises accessing changes which can be watermarks in a list for coded data, the changes having an syntax element, an original value and an candidate alternative value; determining for a subset a group of compliant changes from the changes, the complaint changes being compliant with a coding protocol such as a CABAC encoding protocol; and selecting for the subset only compliant changes that result in the subset having only one candidate alternative value for each syntax elements and result in only compliant changes also meeting at least one performance criterion. The method can include determining fidelity, recoverability, or robustness of the watermarks and removing or preventing watermarks from being in the subset based on the fidelity, recoverability, or robustness, wherein fidelity, recoverability, and robustness are performance criteria. The method can include determining at least two performance values for the watermarks, determining some collective metric of the at least two performance values, and removing or preventing watermarks from being in the subset based on the some collective metric.
Another method comprises accessing, generating, or compiling changes or watermarks in a list for coded data, the changes having an syntax element, an original value and an candidate alternative value; determining for a subset a group of compliant changes from the changes, the complaint changes being compliant with a coding protocol; and selecting for the subset only compliant changes having at least one performance criterion other than the coding protocol. The coded data can be in a transport stream and changes can be removed or prevented from being in the subset when the changes have syntax elements that cross a transport stream packet boundary. Further steps can include generating a propagation map for the changes prior to applying the changes; and removing or preventing changes from being in the subset that have any block that falls in the propagation path of a previously selected change. The method can further include determining fidelity, recoverability, or robustness of the changes and removing or preventing changes from being in the subset based on the fidelity, recoverability, or robustness, wherein fidelity, recoverability, and robustness are performance criteria.
An apparatus comprises a means for accessing or generating changes such as watermarks in a list for coded data such as video stream, the changes having a syntax element, an original value and a candidate alternative value; a means for determining for a subset a group of compliant changes from the changes, the complaint changes being compliant with a coding protocol; and a means for selecting for the subset only compliant changes that result in the subset having one or more candidate alternative value for each syntax elements and result in only compliant changes also meeting at least one performance criterion. The coding protocol can be a CABAC encoding protocol. The apparatus can further comprise a means for determining fidelity, recoverability, or robustness of the watermarks and a means for removing or preventing watermarks from being in the subset based on the fidelity, recoverability, or robustness, wherein fidelity, recoverability, and robustness are performance criteria. Additionally, the apparatus can comprise a means for determining at least two performance values for the watermarks, a means for determining some collective metric of the at least two performance values, and a means for removing or preventing watermarks from being in the subset based on the some collective metric.
The invention will now be described by way of example with reference to accompanying drawings.
Embodiments of the invention will now be described generally within the context of CABAC encoded H.264/AVC video streams. However, the embodiments can have broader applications.
The changes can be watermarks and these changes can be applied by embedding changing data bytes in a CABAC coded video stream. The method involves identifying changeable syntax elements in a H.264 coded video stream which can be modified into a candidate list of changes for watermark embedding. A subset of the changeable syntax elements list is used for watermark embedding. The embodiments can include implementations of steps that address at least the problem of selecting which elements of the list that will be in the subset used for watermarking.
Herein will be described a method for modifying a CABAC-encoded H.264/AVC stream and a method for generating a list of CABAC/AVC compliant changes. Each entry in the resulting list identifies a specific syntax element, its original value, and a candidate alternative value. A syntax element that appears in this list is considered a changeable syntax element and can appear in the list more than once, each time with a different candidate alternative value.
Embodiments can also include the feature of a subset of the entries in this list being selected and used for watermarking. One choice of subset is to select one and only one candidate alternative value for each changeable syntax element. Another choice of subset is to select more than one candidate alternative values for each changeable syntax element, where each selection may represent different information to embed in the watermark. However, the list can contain changes that, although CABAC and AVC compliant, will not serve the goals of the particular application.
Furthermore, the current disclosure describes at least one implementation that selects a subset of entries in the list when the application is watermarking. The selection step is performed to find the best subset in a given set of watermarking goals that include high fidelity, high robustness, and high capacity. Herein, the selection process or step is referred to as Changeable Block Selection (CBS).
Turning to watermarking algorithms, it is important to point out several of their properties. One property is the visual impact of the watermark embedding, i.e. fidelity. For many watermarking applications, the visual impact should be as small as possible. Another property is the effectiveness of the watermark after embedding. This describes likelihood that a watermark detector will be able to recover a watermark immediately after embedding. For most applications, a very high effectiveness is required. If the watermarked content is to be subject to attacks between the time of embedding and detection, many watermarking applications require that the watermark data still be recoverable after such attacks. This leads to a third property, which is robustness. Finally, a watermarking algorithm can be characterized by the amount of data that can be embedded. This property is called the capacity.
Performance of these four properties is often traded one for another depending on the application. In embodiments of the invention, the trading or balancing of features can be thought of in two steps as illustrated in
Each changeable syntax element in the list includes a set of candidate alternative values. The syntax element value can be changed to any value in the set without interfering with the AVC/CABAC compliance of the bitstream. Replacing the value of the syntax element to a candidate alternative value will change the reconstructed pixel values in the block in which the syntax element resides. Therefore, for each candidate alternative value, several block features are evaluated. Some examples of block features include:
The substitution of a candidate alternative value of a changeable syntax element will change the imagery data at the target block (T) where the syntax element resides. Because a whole set of inter-dependency is present in the coded video stream, blocks other than T may also be affected by the substitution. In other words, a modification introduced into block T can propagate to other blocks in the decoded sequence. In order to truly access the impact of a candidate change on the fidelity, robustness, effectiveness, and capacity, a good selection process considers the pixel value changes due to propagation as well as the direct changes to block T. The building of a propagation map that indicates all of the blocks affected by a single change to block, T, can be extremely helpful and even paramount in determining the suitability of substitutions.
An example propagation map or path 200 is illustrated in
When a change affects blocks other than the target block, the features considered should assess the impact in all affected blocks, not just the target block. Thus, a propagation map is generated and used to show the entire impact of the change as opposed to just considering the target block. Some examples of block features include:
To understand why usage of propagation maps is important, the consideration of a propagation map fidelity test in the selection steps can show which changes are acceptable if all the blocks in the propagation path pass a block-based fidelity test. In other words, a change will be unacceptable if it results in a visible artifact anywhere in its propagation path.
In general, a key feature of the invention is selection of a subset of the candidate alternatives. The selection process is based on the evaluation of a set of features as described above. The general process 305 is a tool that evaluates each candidate alternative in light of the feature values and the application requirements to do the subset selection.
Three application properties of the watermark are considered in the selection process. These three are detectability, fidelity, and robustness. For a change to be acceptable, it generally must satisfy application requirements in each of these properties.
The tests of
Fidelity selection can be based on a simple thresholding test applied to one or more of the generated features. Candidate elements that pass the threshold tests are deemed to have sufficiently high fidelity. Those that fail one or more of the thresholding tests are assumed to introduce visible artifacts too severe for the application. These candidates are removed from the list of potential changes.
In at least one embodiment, the feature vector includes the worst case, of all the blocks in the propagation path, of the sum (over all pixels in the block) of absolute luminance change that results from the change. This feature is compared to a luminance threshold. Any candidate which results in a block anywhere in its propagation path that has a sum of absolute luminance change greater than the threshold will be removed from the list.
At least one embodiment of the feature vector also includes the worst case, of all the blocks in the propagation path, of a blockiness measure indicating the amount of blockiness introduced by the change. This feature is compared to a blockiness threshold. Any candidate which results in a block anywhere in its propagation path that has a blockiness score greater than the blockiness threshold will be removed from the list.
A third possible fidelity test is based on the size of the propagation map. Here it is assumed that larger propagation maps are more likely to introduce visible artifacts. The size of the propagation map need not be listed as a feature since it is easily obtained directly from the data structure that contains the propagation map. Any candidate that has a propagation map which is larger than a threshold will be removed from the list.
In at least one embodiment, the recovery or robustness is based on the change in luminance in the block where the syntax element change is made. In other embodiments, the recovery can be based on the change in luminance in one or more blocks in the propagation path.
Thus, a simple measure of robustness is the amount of luminance change introduced by the candidate change. In this simple model, one assumes that candidate changes that result in higher luminance changes will be more robust.
In at least one embodiment, the feature vector includes the luminance change that will result from the candidate change. This value is compared to a robustness threshold. Any candidate for which the change in luminance is below the robustness threshold will be removed from the list.
When recovery is based on the entire propagation path, the size of the propagation map can be used to estimate the robustness of a change. Here it is assumed that larger propagation maps are more likely to survive processing of the marked video. Any candidate that has a propagation map which is smaller than a threshold will be removed from the list.
The final selection can be based on a number of different application requirements. One example of an application requirement is that, in a transport stream, the change must fully reside within a single transport stream packet. Any candidate change that would result in the modification of a syntax element that crosses transport stream packet boundaries will be removed from the list.
In at least one embodiment, the final selection process examines all of the candidate changes that have passed all of the previous tests in a slice. For a given syntax element, there may be a number of possible alternative values that satisfy the other tests, but only one can be selected for the final output. This choice may be based on the same fidelity and robustness features in the feature vector (e.g., selecting the value with the highest fidelity). This part of the selection could also be done in either of the other two selection processes.
In at least one embodiment, no change is made to any block that falls in the propagation path of a previously selected change. This rule is implemented in the final selection process, but could also be implemented elsewhere.
In at least one embodiment, no change is made if its propagation map would intersect with that of a previously selected change. This rule is implemented in the final selection process, but could also be implemented elsewhere.
The final selection is illustrated in
The implementations described herein may be implemented in, for example, a method or process, an apparatus, a software program, a datastream, or a signal. Even if only discussed in the context of a single form of implementation such as being discussed only as a method, the implementation or features discussed may also be implemented in other forms such as an apparatus or program. An apparatus may be implemented in appropriate hardware, software, and firmware. The methods may be implemented in an apparatus such as a computer or other processing device. Additionally, the methods may be implemented by instructions being performed by a processing device or other apparatus, and such instructions may be stored on a computer readable medium such as a CD, or other computer readable storage device, or an integrated circuit. Further, a computer readable medium may store the data values produced by an implementation.
As should be evident to one of skill in the art, implementations may also produce a signal formatted to carry information that can be stored or transmitted. The information can include instructions for performing a method, or data produced by one of the described implementations. For example, a signal may be formatted to carry a watermarked stream, an unwatermarked stream, a fidelity measure, or other watermarking information.
Additionally, many implementations may be implemented in one or more of an encoder, a decoder, a post-processor processing output from a decoder, or a pre-processor providing input to an encoder. Further, other implementations are contemplated by this disclosure. For example, additional implementations may be created by combining, deleting, modifying, or supplementing various features of the disclosed implementations.
This application claims the benefit, under 35 U.S.C. §365 of International Application PCT/US2009/004706 and filed Aug. 18, 2009, which was published in accordance with PCT Article 21(2) on Feb. 25, 2010, in English and which claims the benefit of U.S. provisional patent application No. 61/189,551, filed on Aug. 20, 2008, in English.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2009/004706 | 8/18/2009 | WO | 00 | 2/18/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/021694 | 2/25/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5867109 | Wiedeman | Feb 1999 | A |
6009176 | Gennaro et al. | Dec 1999 | A |
6064661 | Benn | May 2000 | A |
6341350 | Miyahara et al. | Jan 2002 | B1 |
6373960 | Conover et al. | Apr 2002 | B1 |
6687384 | Isnardi | Feb 2004 | B1 |
6894628 | Marpe et al. | May 2005 | B2 |
6900748 | Marpe et al. | May 2005 | B2 |
7113612 | Sugahara et al. | Sep 2006 | B2 |
7159117 | Tanaka | Jan 2007 | B2 |
7197164 | Levy | Mar 2007 | B2 |
7286710 | Marpe et al. | Oct 2007 | B2 |
7646881 | Zarrabizadeh | Jan 2010 | B2 |
7839312 | Tanaka et al. | Nov 2010 | B2 |
7865034 | So | Jan 2011 | B2 |
7974714 | Hoffberg | Jul 2011 | B2 |
8121341 | Tapson et al. | Feb 2012 | B2 |
8189854 | Watson | May 2012 | B2 |
8559501 | Chen et al. | Oct 2013 | B2 |
20020097892 | Oami et al. | Jul 2002 | A1 |
20020136428 | Sugahara et al. | Sep 2002 | A1 |
20030070075 | Deguillaume | Apr 2003 | A1 |
20040017852 | Garrido et al. | Jan 2004 | A1 |
20040168110 | Fuldseth et al. | Aug 2004 | A1 |
20040247154 | Bodo et al. | Dec 2004 | A1 |
20050044411 | Somin et al. | Feb 2005 | A1 |
20050069169 | Zarrabizadeh | Mar 2005 | A1 |
20050123207 | Marpe et al. | Jun 2005 | A1 |
20050207499 | Hwang et al. | Sep 2005 | A1 |
20060078292 | Huang et al. | Apr 2006 | A1 |
20060222344 | Ukai et al. | Oct 2006 | A1 |
20060236130 | Ito et al. | Oct 2006 | A1 |
20060269096 | Kumar et al. | Nov 2006 | A1 |
20070053438 | Boyce et al. | Mar 2007 | A1 |
20070110033 | Tu et al. | May 2007 | A1 |
20070242862 | Watson et al. | Oct 2007 | A1 |
20080009272 | Toledano | Jan 2008 | A1 |
20080063071 | Suzuki | Mar 2008 | A1 |
20080165849 | Moriya et al. | Jul 2008 | A1 |
20080247469 | Vadapalli et al. | Oct 2008 | A1 |
20090290750 | Tapson et al. | Nov 2009 | A1 |
20100176610 | He et al. | Jul 2010 | A1 |
20110129116 | Thorwirth | Jun 2011 | A1 |
20110176610 | He et al. | Jul 2011 | A1 |
20110222723 | He et al. | Sep 2011 | A1 |
20110293016 | Suzuki | Dec 2011 | A1 |
20120237078 | Watson et al. | Sep 2012 | A1 |
20130058395 | Nilsson et al. | Mar 2013 | A1 |
20130058405 | Zhao et al. | Mar 2013 | A1 |
20130208814 | Argyropoulos et al. | Aug 2013 | A1 |
Number | Date | Country |
---|---|---|
101218830 | Jul 2008 | CN |
1515506 | Mar 2005 | EP |
1909508 | Apr 2008 | EP |
11331622 | Nov 1999 | JP |
11341450 | Dec 1999 | JP |
11346302 | Dec 1999 | JP |
2001119557 | Apr 2001 | JP |
2003125227 | Apr 2003 | JP |
2003134329 | May 2003 | JP |
2003179740 | Jun 2003 | JP |
2003529297 | Sep 2003 | JP |
2004221715 | Aug 2004 | JP |
2005533410 | Nov 2005 | JP |
2006279992 | Oct 2006 | JP |
2006287364 | Oct 2006 | JP |
2006303580 | Nov 2006 | JP |
2007053687 | Mar 2007 | JP |
2007525074 | Aug 2007 | JP |
WO2004066206 | Aug 2004 | WO |
WO2007067168 | Jun 2007 | WO |
WO2008065814 | Jun 2008 | WO |
WO2008118145 | Oct 2008 | WO |
WO2008154041 | Dec 2008 | WO |
Entry |
---|
Dekun Zou, et al: “H.264/AVC stream replacement technique for video watermarking”, Acoustics,Speech and Signal Processing 2008, Mar. 31, 2008, pp. 1749-1752, XP031250910, Piscataway, NJ, USA. |
Mobasseri, B.G., et al: Authentication of H.264 streams by direct watermarking of CAVLC blocks, The International Society for Optical Engineering, Spie, US, pp. 1-5, Feb. 27, 2007. |
European Search Report dated Jan. 2, 2010. |
Nguyen et al., “A Fast Watermarking System for H.264/AVC Video,” 2006 IEEE, Dept. of Electronic Engineering, La Trobe University, Bundoora, Australia, pp. 81-84. |
Seo et al., “Low-Complexity Watermarking Based on Entropy Coding in H.264/AVC,” IEICE Trans. Fundamentals, vol. E91-A, No. 8, Aug. 2008. |
Noorkami, “Secure and Robust Compressed-Domain Video Watermarking for H.264,” A Thesis Presented to The Academic Faculty at Georgia Institute of Technology, 124 pages, Aug. 2007. |
Song et al., “A Data Embedded Video Coding Scheme for Error-Prone Channels”, IEEE Transactions on Multimedia, vol. 3, No. 4, Dec. 1, 2001, pp. 415-423. |
Liu et al., “Data Hiding in Inter and Intra Prediction Modes of h.264/AVC”, IEEE Int'l. Symposium on Circuits and Systems, 2008 (ISCAS 2008), May 18, 2008, pp. 3025-3028. |
Profrock et al., “H.264/AVC Video Authentication using Skipped Macroblocks for an Erasable Watermark”, Visual Communications and Image Processing, 2005 SPIE, Bellinigham, WA 2005. |
Hu, “Information Hiding Based on Intra Predictioin Modes for H.264 AVC”, Multimedia and Expo, 2007 IEEE, International Conference, IEEE PI, Jul. 1, 2007, pp. 1231-1234. |
Winkler, “Preceptual Quality Assessments for Video Watermarking”, Proceedings from the IEEE Conference on Standardizaton and Innovation in Information Technology, Oct. 18, 2002, pp. 90-94. |
Number | Date | Country | |
---|---|---|---|
20110158465 A1 | Jun 2011 | US |
Number | Date | Country | |
---|---|---|---|
61189551 | Aug 2008 | US |