The present invention relates to a process for generating and using propagation maps in AVC watermarking.
A video picture is encoded into one or more non-overlapping slices and each slice contains an integer number of macroblocks. There are five types of coded slices in H.264 (see Table 6.1 of lain E. G. Richardson, H.264 and MPEG-4 Video Compression: Video Coding for Next-generation Multimedia, John Wiley and Sons, 2003). Further, there are some video processing applications that modify an H.264/AVC encoded video. Digital watermarking is one such application. Unfortunately, the application of digital watermarking can cause objectionable artifacts in display video images. Consequently, a need exists to predict, characterize and prevent objectionable artifacts associated with digital watermarking.
A method comprises accessing encoded data which can be divided into blocks, accessing a list of possible changes to encoded data, constructing propagation maps of resultant changes to encoded data that will be caused by implementing at least one of the possible changes, and applying one of the possible changes based on the propagation maps. The encoded data can be video data in which the frames are divided into slices which are further divided into blocks, and the changes can represent potential watermarks. The method can further include constructing the propagation maps which reflect propagation due to inter-prediction, wherein resultant changes include a direct block and at least one neighbor block. The inter-prediction can be temporal inter-predicted, wherein resultant changes can include a direct block and at least one neighbor block. The method can further incorporate intra-prediction to construct propagation maps, wherein resultant changes can include a direct block and at least one neighbor block. The resultant propagation maps can then be used, for example, to determine impact on the fidelity that would be caused by the possible changes, thereby providing a means of eliminating possible changes in relation to fidelity thresholds.
The invention will now be described by way of example with reference to accompanying drawings.
Embodiments of the invention will now be described that address the problem of artifact generation associated with the application of digital watermarking where the watermarking process modifies the motion vectors in the compressed stream. These artifacts introduced by motion vector (mv) modifications not only change the reconstructed pixels of the current block, i.e., the block to which the motion vector belongs, but can also cause luminance or chrominance changes in other blocks, because the reconstructed my or the reconstructed pixels of the current block can serve as a reference for other blocks in the motion vector prediction or block prediction. These changes in other blocks can further propagate.
In particular, the invention creates a propagation map that indicates how a single change will propagate through a picture. A representation of a propagation map and an algorithm to build a propagation map to track the affected blocks and their changes is presented in the following text. The invention contemplates the incorporation of the propagation map construction into an H.264 decoder. Such a propagation map is useful in many aspects. The propagation map can be used to (1) examine the visual distortion resulting from a change, (2) examine changes that result in overlapping propagation maps, (3) examine changes that fall in the propagation path of a previous change, (4) identify multiple changes that combine such that a third block is affected, to improve the detection region when these changes are employed for watermarking, and (5) sometimes to avoid changes. A propagate path is how a single change in one block propagates to other blocks.
The invention recognizes that there are some video processing applications that modify an H.264/AVC encoded video such as digital watermarking and that a fidelity change predictor could be a parameter to ensuring that a selected watermarking is not diminishing video quality. The invention further provides a means for characterizing fidelity changes associated with decoding of areas in which the change was directly made, but further capture some changes that can propagate to other areas of the picture. As such, a map according to the invention indicates the areas to which a change will propagate and can apply the fidelity assessment to all of those areas. Furthermore, it is contemplated that in watermarking applications that multiple changes in a slice can be made. With this in mind, the invention was devised to apply to watermarking when there is a need to know what effect a change will have on the decoded imagery in which a given slice could experience a plurality of changes and there is a means for expressing the composite change on the unmarked version of the content. However, if a previous change has already propagated to the current region and another change is made, the resulting decoded imagery can include the effects of both changes. If the first change is known, then the result can be predicted, but possibly without knowledge a priori of whether or not the first change will be employed. If a change can be mapped indicating all of the areas to which a change will propagate, then making other changes inside of that propagation path can be avoided. A combination of these problems can also occur. If a region of a picture is modified indirectly, because a change in a different region has propagated to the current region, then the current region in an assessment of the fidelity impact of that change can be examined or determined. However, it is possible that there are multiple changes, all of which can propagate into the current region. If the propagation paths of all the changes are mapped, these regions of overlapping propagation can be identified and all combinations of impacts can be considered.
The inventions can be utilized in H.264 standard for block change prediction. Inter-prediction macroblocks are coded as the sum of a prediction from a reference frame/picture and a residual. The motion vector defines the block of pixels in the reference frame to be used as the prediction. Additionally, in the H.264 standard, pictures/frames are divided into slices. A slice is a sequence of macroblocks, wherein the macroblocks can be addressed/numbered in raster scan order within a frame. Each macroblock is comprised of one 16×16 luma array and can be divided in one of four ways: one 16×16 macroblock, two 8×16 macroblock partitions, two 16×8 macroblock partitions, or four 8×8 macroblock partitions. If the 8×8 mode is chosen, each of the four 8×8 macroblock partitions can be further divided into one of four sub-macroblock partitions: one 8×8, two 4×8, two 8×4, or four 4×4. The arrays can be associated chroma data also. The macroblock partitions and sub-macroblock partitions are indexed as shown in
The motion vector of a partition (mv) is coded as the sum of a motion vector prediction (MVp) and a motion vector differential (mvd). The prediction is based on the motion vectors of the neighboring partitions in the same frame/picture: A, B, C, and D as shown in
If a block is encoded with direct prediction mode, there will be no motion vector transmitted for the block. Instead, its motion vector will be calculated based on previously decoded blocks. There are two modes of direct prediction: spatial direct mode and temporal direct mode.
In the spatial direct mode, the first reference frame in a list 1 is first considered, which stores a number of different reference frames for one direction prediction in a bi-prediction mode. The co-located block in that frame can be indentified and its motion vectors examined or evaluated. If motion vectors are within the range [−1, 1] and their reference index is 0 (along with some other conditions), it means the motion is stationary and small, and its motion vector is set to 0. Otherwise, the motion is significant, and the motion vector is predicted in the same way as described above in predicting motion vectors from neighbors.
In the temporal direct mode, the motion vector of the current block can be directly derived from the motion vectors of the co-located block in the first reference of list 1. Since this mode does not involve neighbor block motion vectors, it will not be a source of propagation in B slices.
Regarding intra-prediction, intra-predicted macroblocks are coded as the sum of a prediction from within the current frame/picture and a residual. If one or more of the reference blocks are on the propagation map of a change, then the prediction can be affected by that change, in which case the current block would be also on the propagation map. There can be three types of intra-prediction: intra—4×4, intra—8×8, and intra—16×16. In the Intra—4×4 mode, the macroblock is predicted for each of the 16 4×4 blocks. There are a total of 8 modes (per table 8-2 of ITU-T Recommendation H.264 f ISO/IEC 14496-10 International Standard with Amendment 1) involving all 4 of the neighboring blocks, A, B, C, and D shown in
In the Intra—8×8 mode, the macroblock is predicted for each of the four 8×8 blocks. There are 8 modes (per table 8-3 of ITU-T Recommendation H.264|ISO/IEC 14496-10 International Standard with Amendment 1) involving all 4 of the neighboring blocks, A, B, C, and D as shown in
In the Intra—16×16 mode, the macroblock is predicted as a whole. There are 4 modes (per table 8-4 of ITU-T Recommendation H.264|ISO/IEC 14496-10 International Standard with Amendment 1) involving 3 neighboring blocks, A, B, and D as shown in
Specifically, the reference components are neighbor A's rightmost column, neighbor B's last row, neighbor C's last row, and neighbor D's last (lower-right) pixel as shown in
In an H.264/AVC watermarking system, a previous step has created a list of potential modifications. Each potential modification consists of a block identifier, an indication of which motion vector can be modified, and the alternative value for that motion vector. Note that, at this point, there can be a number of potential modifications for a single block. In a later step, the list of potential modifications will be pruned so that no block has more than one modification. Each entry in the list represents a change to a motion vector associated with a B-slice inter-predicted macroblock. Changing the motion vector of an inter-predicted block will have the effect of reconstructing the block with a different reference than what was intended during encoding, thus will likely change the decoded pixel values. This change can propagate in 2 ways: (1) if a second block is coded using inter-prediction and predicts its motion vector from the current, then that second block will also use a different reference than what was intended; and (2) if a second block is coded using intra-prediction and predicts its pixel values from the current, then the reconstructed pixels of that second block will be different from what was intended. The first kind of propagation, propagation to a neighboring motion vector, can again propagate to the next set of neighbors in the same way. The second kind of propagation, propagation directly to the pixel values, can only propagate further to neighboring blocks that also use intra-prediction.
In the H.264/AVC watermarking system, the potential changes can be evaluated one at a time. For each entry, one can assume that the potential change is made and then track how that change will propagate to other blocks in the frame/picture. The propagation map is the tool used to represent this information. Later, when assessing the fidelity impact of a potential change, there is only a need to consider the blocks on the propagation path of the potential change, because no other blocks will be affected. Similarly, it is possible to quickly determine if a potential change is in the propagation path of any previously selected change or if the propagation map of a potential change overlaps with that of any previously selected change.
a) illustrates one example of a propagation map. This propagation map 500 is associated with one B-slice block 510 whose motion vector has been directly changed. The other blocks 520 in the figure are blocks that will be indirectly changed due to propagation. When a block changes, either due to a direct modification or because it falls in the propagation path of another change, this change has the potential to further propagate to its neighbors.
The “head_node” uniquely identifies the changed block in terms of position and an alternative value of the motion vector that initiated the changes. All of the nodes in the propagation map P will have the same “head_node.” The element “mode” indicates the prediction mode of the current block, which can be either intra-prediction or inter-prediction. The element “cur_node” records the information about the current block. It contains the original and new motion vectors for inter-predicted blocks, and intra-prediction mode and reference blocks for intra-prediction blocks.
The integration of propagation map construction into an H.264 decoder will now be described. This integration is presented in the context of an H.264/AVC watermarking system similarly as described before.
The propagation map P is represented as a linked list, which, as a whole, will identify the macroblocks/partitions affected by the potential modification. As the decoder processes the macroblocks of the B slice in raster scan order, adding affected macroblocks/partitions to the corresponding linked lists can be continued. The detailed integrated algorithm will be described with reference to
Given a list of potential modifications in a B-slice, containing l entries, each corresponding to a modifiable motion vector with one alternative, l linked lists (710) can be constructed and each list can be initialized to contain at least one node (720), which can be the potential modification itself. The data structure of the node p is shown in Table 4, which contains the location information of the macroblock/partition, original and new motion vector information. Since the process of building propagation map is built into the AVC decoder, most of this information can be directly obtained from the decoder. Other information can be added based on the application of the propagation map.
Propagation map construction takes the original encoded video stream as one input. The other input is the list of all potential modifications. The process can be described as a series of three steps in
Propagation map initialization 835 is illustrated in
A propagation map builder in
In
For updating a propagation map according to only intra-prediction, all intra-prediction blocks with any of the three modes/types 1021 (i.e. 4×4, 8×8 or 16×16) are examined by the propagation map update process 1022 described in
The final step shown in
It is important to note that in the above algorithm that one can envision a need to go through all the nodes of the l propagation maps for every macroblock partition. This can incur a high computational cost. To speed up the process, an improved algorithm is formulated based on two observations. The first is that the parents of the current partition (identified neighbors) can only reside within the same row of macroblocks or the row above; as such, the nodes of list i which are more than one row away from the current block can be excluded during a parent search. The second is that if list i has not been updated for an entire row of macroblocks, the effect of modification on changeable block will not be able to propagate to the remaining blocks in the slice. Thus, the propagation map i is complete and there is no need to check it for future blocks within the current slice. The modified algorithm is presented in
For updating a propagation map according to intra-prediction and inter-prediction,
Several of the implementations and features described in this application can be used in the context of the H.264/MPEG-4 AVC (AVC) standard. However, these implementations and features can be used in the context of another standard (existing or future), or in a context that does not involve a standard. However, features and aspects of described implementations can also be adapted for other implementations.
The implementations described herein can be implemented in, for example, a method or process, an apparatus, a software program, a datastream, or a signal. Even if only discussed in the context of a single form of implementation (for example, discussed only as a method), the implementation or features discussed can also be implemented in other forms (for example, an apparatus or program). An apparatus can be implemented in, for example, appropriate hardware, software, and firmware. The methods can be implemented in, for example, an apparatus such as a computer or other processing device. Additionally, the methods can be implemented by instructions being performed by a processing device or other apparatus, and such instructions can be stored on a computer readable medium such as a CD, or other computer readable storage device, or an integrated circuit. Further, a computer readable medium can store the data values produced by an implementation.
As should be evident to one of skill in the art, implementations can also produce a signal formatted to carry information that can be, for example, stored or transmitted. The information can include, for example, instructions for performing a method, or data produced by one of the described implementations. For example, a signal can be formatted to carry a watermarked stream, an unwatermarked stream, or watermarking information.
Additionally, many embodiments can be implemented in one or more of an encoder, a decoder, a post-processor processing output from a decoder, or a pre-processor providing input to an encoder. Further, other implementations are contemplated by this disclosure. For example, additional implementations can be created by combining, deleting, modifying, or supplementing various features of the disclosed implementations.
This application claims the benefit, under 35 U.S.C. §365 of International Application PCT/US 2009/004715, filed Aug, 19, 2009 which was published in accordance with PCT Article 21(2) on Feb. 25, 2010 in English and which claims the benefit of U.S. provisional patent application No. 61/189,370 filed Aug, 19, 2008.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2009/004715 | 8/19/2009 | WO | 00 | 1/25/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/021700 | 2/25/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5867109 | Wiedeman | Feb 1999 | A |
6009176 | Gennaro et al. | Dec 1999 | A |
6064661 | Benn | May 2000 | A |
6341350 | Miyahara et al. | Jan 2002 | B1 |
6373960 | Conover et al. | Apr 2002 | B1 |
6415041 | Oami et al. | Jul 2002 | B1 |
6553127 | Kurowski | Apr 2003 | B1 |
6687384 | Isnardi | Feb 2004 | B1 |
6894628 | Marpe et al. | May 2005 | B2 |
6900748 | Marpe et al. | May 2005 | B2 |
7113612 | Sugahara et al. | Sep 2006 | B2 |
7159117 | Tanaka | Jan 2007 | B2 |
7197164 | Levy | Mar 2007 | B2 |
7286710 | Marpe et al. | Oct 2007 | B2 |
7646881 | Zarrabizadeh | Jan 2010 | B2 |
7839312 | Tanaka et al. | Nov 2010 | B2 |
7865034 | So | Jan 2011 | B2 |
7974714 | Hoffberg | Jul 2011 | B2 |
8121341 | Tapson et al. | Feb 2012 | B2 |
8189854 | Watson et al. | May 2012 | B2 |
8559501 | Chen et al. | Oct 2013 | B2 |
8571256 | Tourapis et al. | Oct 2013 | B2 |
8588459 | Bloom et al. | Nov 2013 | B2 |
8824727 | Zou et al. | Sep 2014 | B2 |
20020071593 | Muratani | Jun 2002 | A1 |
20020097892 | Oami et al. | Jul 2002 | A1 |
20020136428 | Sugahara et al. | Sep 2002 | A1 |
20030070075 | Deguillaume et al. | Apr 2003 | A1 |
20030152225 | Kunisa | Aug 2003 | A1 |
20040017852 | Garrido et al. | Jan 2004 | A1 |
20040168110 | Fuldseth et al. | Aug 2004 | A1 |
20040247154 | Bodo et al. | Dec 2004 | A1 |
20050044411 | Somin et al. | Feb 2005 | A1 |
20050069169 | Zarrabizadeh | Mar 2005 | A1 |
20050123207 | Marpe et al. | Jun 2005 | A1 |
20050207499 | Hwang et al. | Sep 2005 | A1 |
20060078292 | Huang et al. | Apr 2006 | A1 |
20060222344 | Ukai et al. | Oct 2006 | A1 |
20060236130 | Ito et al. | Oct 2006 | A1 |
20060269096 | Kumar et al. | Nov 2006 | A1 |
20070053438 | Boyce et al. | Mar 2007 | A1 |
20070110033 | Tu et al. | May 2007 | A1 |
20070242862 | Watson et al. | Oct 2007 | A1 |
20080009272 | Toledano | Jan 2008 | A1 |
20080063071 | Suzuki | Mar 2008 | A1 |
20080165849 | Moriya et al. | Jul 2008 | A1 |
20080247469 | Vadapalli et al. | Oct 2008 | A1 |
20090279603 | Chen et al. | Nov 2009 | A1 |
20090290750 | Tapson et al. | Nov 2009 | A1 |
20110129116 | Thorwirth | Jun 2011 | A1 |
20110176610 | He et al. | Jul 2011 | A1 |
20110222723 | He et al. | Sep 2011 | A1 |
20110293016 | Suzuki | Dec 2011 | A1 |
20120237078 | Watson et al. | Sep 2012 | A1 |
20130058395 | Nilsson et al. | Mar 2013 | A1 |
20130058405 | Zhao et al. | Mar 2013 | A1 |
20130208814 | Argyropoulos et al. | Aug 2013 | A1 |
Number | Date | Country |
---|---|---|
101218830 | Jul 2008 | CN |
1515506 | Mar 2005 | EP |
1909508 | Apr 2008 | EP |
11331822 | Nov 1999 | JP |
11341450 | Dec 1999 | JP |
11346302 | Dec 1999 | JP |
2001119557 | Apr 2001 | JP |
2003125227 | Apr 2003 | JP |
2003134329 | May 2003 | JP |
2003179740 | Jun 2003 | JP |
2003244419 | Aug 2003 | JP |
2003529297 | Sep 2003 | JP |
2004221715 | Aug 2004 | JP |
2005533410 | Nov 2005 | JP |
2006279992 | Oct 2006 | JP |
2006287364 | Oct 2006 | JP |
2006303580 | Nov 2006 | JP |
2007053687 | Mar 2007 | JP |
2007525074 | Aug 2007 | JP |
2004066206 | Aug 2004 | WO |
2007067168 | Jun 2007 | WO |
WO2008065814 | Jun 2008 | WO |
2008118145 | Oct 2008 | WO |
2008154041 | Dec 2008 | WO |
Entry |
---|
Profrock et al., “H.264/AVC Video Authentication Using Skipped Macroblocks for an Erasable Watermark”, Visual Communications and Image Processing, 2005, SPIE, Bellingham, WA, 2005. |
Nguyen et al., “A Fast Watermarking System for H.264/AVC Video”, 2006 IEEE, 2006. |
Zou et al., “H.264/AVC Stream Replacement Technique for Video Watermarking”, ICASSP 2008, 2008 IEEE, 2008. |
Hu, “Information Hiding Based on Intra Prediction Modes for H.264 AVC”, Multimedia and Expo, 2007 IEEE, International Conference, IEEE PI, Jul. 1, 2007, pp. 1231-1234. |
Seo et al., “Low-Complexity Watermarking Based on Entropy Coding in H.264 AVC,” IEICE Trans. Fundamentals, vol. E91-A, No. 8, Aug. 2008. |
Noorkami, “Secure and Robust Compressed-Domain Video Watermarking for H.264,” A Thesis Presented to the Academic Faculty at Georgia Institute of Technology, 124 pages, Aug. 2007. |
Song et al., “A Data Embedded Video Coding Scheme for Error-Prone Channels”, IEEE Transactions on Multimedia, vol. 3, No. 4, Dec. 1, 2001. pp. 415-423. |
Liu et al., “Data Hiding in Inter and Intra Prediction Modes of h.264/AVC”, IEEE Int'l Symposium on Circuits and Systems, 2008 (ISCAS 2008), May 18, 2008, pp. 3025-3028. |
Winkler, “Preceptual Quality Assessments for Video Watermarking”, Proceedings from the IEEE Conference on Standardizaton and Innovation in Information Technology, Oct. 18, 2002, pages. |
Mobasseri et al., “Authentication of H.264 Streams by Direct Watermarking of CAVLC Blocks”, Security, Steganography, and Watermarking of Multimedia Contents IX, Proceedings of SPIE—IS&T Electronic Imaging, SPIE, vol. 6505, 65951W, 2007. |
Number | Date | Country | |
---|---|---|---|
20110176610 A1 | Jul 2011 | US |
Number | Date | Country | |
---|---|---|---|
61189370 | Aug 2008 | US |