The present invention relates to embedding and detection of a digital watermark for audio content, and particularly to an embedding method and detection method of a digital watermark high in robustness to deterioration of the content.
Nowadays, as means for preventing secondary use of audio contents converted into digital data, such as illicit copy and modification thereof, a digital watermark technology for embedding specific information in the audio contents is widely utilized.
In the digital watermark technology, the same information (watermark information) is repeatedly embedded in a plurality of spots of a piece of audio content. Then, when detecting the watermark information, values detected from the respective spots embedding the watermark information are accumulated in a buffer, intensified together, and then subjected to processing such as error correction. Thereafter, a detection result is outputted.
As a general technique of the digital watermark technology for embedding watermark information in audio content, a technique is employed, in which a pseudo random number sequence is generated by use of data called a key, a frequency component in data of the audio contents is processed by use of this pseudo random number sequence to create a signal (watermark signal) containing desired watermark information, and the signal is added to the data of the original audio content. Then, when detecting the watermark information, a technique is employed, in which the frequency component of the data of the audio contents is processed by use of a pseudorandom number sequence generated by the same key, detected values as a result of the processing are accumulated in the buffer, then the watermark signal is extracted from the accumulated values, and an embedded message (watermark information) is decoded.
The following documents are considered:
A length (time) of the accumulation of the detected values when detecting the watermark information is usually one type of fixed length. For example, a detection apparatus is designed such that 30 seconds are set as a cycle of the accumulation and the detection result of the watermark signal is outputted for every 30 seconds. Moreover, in a digital watermark technology for digital contents of a motion picture, a technique of varying the length of the accumulation of detected values when detecting watermark information is proposed (for example, refer to Patent Document 1). In this technology, watermark signals are weakly embedded so as not to deteriorate quality of the motion picture, and at the time of detection of the watermark information, the detected values are accumulated in a buffer until the detected values reach intensity sufficient for detecting the watermark information.
Moreover, in audio contents, there is one composed of a plurality of channels, such as one recorded in stereo. When the digital watermark is embedded in such audio contents, in general, one pseudo random number sequence is generated by use of one key, audio data in the respective channels is processed by use of this one pseudo random number, and thus the embedding is performed. Specifically, the same watermark signals are embedded in the audio data in the respective channels. In this case, when detecting the digital watermark, a technique is employed, in which the watermark signals are detected from the audio data in the respective channels and are synthesized, and an embedded message (watermark information) is decoded. When the digital watermarks are embedded in the respective channels, detected values from the respective channels highly correlate with one another, and accordingly, a component of the message in the detected values is intensified, thus facilitating the message to be restored. Furthermore, in the case of using the digital watermark technology for the purpose of ensuring security, a technique is proposed, in which a plurality of digital watermarks are created by use of different keys depending on features of contents and a passage of time thereof and are embedded in signals to be processed in order to enhance maintainability (for example, refer to Patent Document 2).
Meanwhile, audio contents converted into digital data are in themselves delivered through a broadcast and a network, or distributed by being recorded in a variety of recording media. In addition, audio contents are provided by being processed in various ways such as used as a piece of BGM (background music) of other contents and a jingle for a program. Hence, there are also audio contents which are extremely short in terms of time (for example, approximately two seconds), ones which are deteriorated due to superposition of another sound thereon, and the like.
Considering the existence of audio contents which are short in terms of time, it is preferable that the embedding of a digital watermark in audio contents also be performed for a short time span of the audio contents. On the other hand, in order to detect a digital watermark from audio contents, which are subjected to the superposition of another sound thereon and then deteriorated by being used as a piece of BGM and the like, it is necessary that detected values from the audio contents for a somewhat long time (for example, approximately 30 seconds) be accumulated (specifically, samples of the detected values be increased) and the watermark signal be intensified and then extracted.
However, when an accumulation cycle of the detected values is prolonged, the digital watermark embedded in the short audio contents cannot be detected. For example, even when attempting to detect a digital watermark from audio contents of approximately two seconds in the accumulation cycle set at 30 seconds, detected values, which come from sounds other than the intended audio contents, are included in the accumulated detected values for approximately 28 (=30−2) seconds. Accordingly, the message (watermark information) embedded in the audio contents cannot be correctly detected.
The above-mentioned prior art, in which the accumulation cycle of the detected values varies, has an aspect to intensify and combine the weakly embedded watermark signals by accumulating the signals until the signals reach the intensity sufficient for detecting the watermark information. In the prior art, to set an appropriate accumulation cycle for detecting the watermark information individually from the short audio contents and the deteriorated audio contents is left out of consideration.
If audio contents are a stereo-recorded audio composition or the like, the same watermark signals are embedded in the audio data in the respective channels, as mentioned above. When the digital watermark is detected, the watermark signals are detected from the audio data in the respective channels and synthesized, and then the message is restored.
However, when such audio contents are used as a piece of BGM of a narration, a sound of the narration superimposed on the audio contents has a signal analogous to a monaural one in many cases, and the correlation between the audio data of the narration in the respective channels is high. Hence, when the detected values from the respective channels are synthesized together to intensify components of the highly correlated message, components of the narration sounds are also intensified. Accordingly, it is difficult to distinguish between the message components and the noise components (narration sounds) in the detected values, thereby making it difficult to restore the message.
In order to detect the watermark signals in such a case, it is necessary to set a threshold value (a degree of correlation) for identifying the components of the watermark signals among the detected values, to a large one. However, when this threshold value is set to a large one, a much higher correlation between the watermark signals in the respective channels will be required in order to detect the digital watermark, and robustness to the deterioration of the digital watermark will be reduced.
The above-mentioned prior art which creates the watermark signals by use of the different keys in response to the features of the contents and the passage of time and embeds the created watermark signals in the signals to be processed does not consider the deterioration when a sound analogous to a monaural one, such as a narration, is superimposed on the audio contents having a plurality of channels though the prior art embeds the different watermark signals in response to the features of the contents and the passage of time. Hence, when the digital watermarks are embedded in the audio data in the respective channels, the same watermark signals using one key are likewise embedded in the audio data in the respective channels. Accordingly, the above-described problem can not be solved.
To solve the above-described problems, it is an aspect of the present invention to improve robustness of a digital watermark embedded in audio contents which are to be processed in various ways.
Moreover, it is another aspect of the present invention to provide a method and a system for appropriately detecting the digital watermark particularly from audio contents short in terms of time or deteriorated audio contents.
Furthermore, it is still another aspect of the present invention to provide a method and a system for embedding and detecting a digital watermark particularly having a strong robustness to deterioration due to superposition of another sound on audio contents.
In order to achieve the above-described aspects, the present invention is realized as a digital watermark detection apparatus constituted as below. A digital watermark detection apparatus includes: a plurality of watermark signal detection units for individually calculating, for PCM data in the respective channels of audio contents, detected values which are correlation coefficients between frequency components of the PCM data and pseudo random number sequences generated by a plurality of keys used to embed the digital watermark; a plurality of detected value adding units for adding the detected values calculated by the plurality of watermark signal detection units and corresponding to the respective channels and the respective keys for each possible combination of the respective channels and the respective keys; a comparison/selection unit for receiving and comparing the respective results of the addition by the plurality of detected value adding units with one another, and selecting and outputting one result of the addition; and a message restoration unit for restoring a message embedded as a digital watermark based on the result of the addition of the detected values, the result being outputted from the comparison/selection unit, and detecting the audio contents in which the digital watermark is embedded.
According to the present invention, the robustness of the digital watermarks embedded in audio contents processed in various ways can be improved. Specifically, a method and system for appropriately detecting a digital watermark particularly from audio contents short in terms of time or deteriorated audio contents can be provided. Moreover, a method and system embedding and detecting a digital watermark particularly having a strong robustness to deterioration due to superposition of another sound on audio contents.
For a more complete understanding of the present invention and the advantage thereof, reference is now made to the following description taken in conjunction with the accompanying drawings, in which:
The present invention provides systems, apparatus and methods to improve robustness of a digital watermark embedded in audio contents which are to be processed in various ways. Moreover, the present invention provides a method and a system for appropriately detecting the digital watermark particularly from audio contents short in terms of time or deteriorated audio contents. Furthermore, the present invention provides a method and a system for embedding and detecting a digital watermark particularly having a strong robustness to deterioration due to superposition of another sound on audio contents.
In an example embodiment, the present invention is realized as a digital watermark detection apparatus. A digital watermark detection apparatus includes: a plurality of watermark signal detection units for individually calculating, for PCM data in the respective channels of audio contents, detected values which are correlation coefficients between frequency components of the PCM data and pseudo random number sequences generated by a plurality of keys used to embed the digital watermark; a plurality of detected value adding units for adding the detected values calculated by the plurality of watermark signal detection units and corresponding to the respective channels and the respective keys for each possible combination of the respective channels and the respective keys; a comparison/selection unit for receiving and comparing the respective results of the addition by the plurality of detected value adding units with one another, and selecting and outputting one result of the addition; and a message restoration unit for restoring a message embedded as a digital watermark based on the result of the addition of the detected values, the result being outputted from the comparison/selection unit, and detecting the audio contents in which the digital watermark is embedded.
Here, more preferably, in the plurality of detected value adding units, a detected value adding unit for outputting the detected value calculated by one of the watermark signal detection units as is, without adding the detected value to detected values calculated by the other watermark signal detection units, is included.
Moreover, from the results of the addition by the plurality of detected value adding units, the comparison/selection unit selects the following as the added results to be outputted.
One in which a sum of absolute values of the detected values is maximum.
One in which a sum of squares of the detected values is the maximum.
One in which the number of bits, an error has occurred therein, is minimum after performing error correction for the results of the addition.
One in which a signal-to-noise ratio (SNR) is maximum found as a result of performing error correction for the results of the addition and calculating the SNRs by obtaining correct symbols of individual bits.
Furthermore, a configuration can be adopted, in which orders of processing by the above-mentioned comparison/selection unit and processing by the message restoration unit are changed. In this case, a plurality of message restoration units are prepared, and the plurality of message restoration units individually receive the results of the addition by a plurality of detected value adding units, and restore the messages embedded as the digital watermarks. Then, the comparison/selection unit receives and compares results of restoring the messages by the plurality of message restoration units, and selects and outputs a series of messages.
Moreover, another embodiment of the present invention achieving the above-described aspects is realized as a digital watermark detection apparatus constituted as below. The digital watermark detection apparatus includes: a watermark signal detection unit for calculating a detected value which is a correlation coefficient between a frequency component of PCM data of audio contents and a pseudo random number sequence generated by a key used to embed the digital watermark; a plurality of message restoration units for accumulating the detected value calculated by the watermark signal detection unit in accumulation cycles different from one another, restoring the message embedded as the digital watermark from the accumulated detected values, and moreover, performing boundary detections of the audio contents, and detecting the audio contents in which the digital watermark is embedded; and a detection result output unit for receiving the respective processing results by the plurality of message restoration units, and synthesizing and outputting detection results of the audio contents in the respective processing results, with the digital watermarks being embedded in the audio contents.
Here, in more detail, when the audio contents in which the digital watermark is embedded are detected by any of the message restoration units, the detection result output unit determines that the detected audio contents are the ones in which the digital watermark is embedded, and determines a portion where the audio contents in which the digital watermark is embedded are not detected by any of the message restoration units as audio contents in which the digital watermark is not embedded. Moreover, when the audio contents in which the digital watermarks are embedded are detected by the plurality of message restoration units, a detection result by the message restoration unit of which accumulation cycle is shorter is prioritized. Furthermore, when the audio contents in which the digital watermark is embedded is detected by a predetermined one of the message restoration units, a detection result by the message restoration unit is outputted without waiting for detections by the other message restoration units of which accumulation cycles are longer than that of the message restoration unit.
Moreover, still another embodiment of the present invention achieving the above-described aspects is realized as a digital watermark detection method as below for detecting a digital watermark embedded in audio contents by a computer. Specifically, the digital watermark detection method includes: a first step of individually calculating, for PCM data in the respective channels of audio contents, detected values which are correlation coefficients between frequency components of the PCM data and pseudo random number sequences generated by a plurality of keys used to embed the digital watermark, and of storing the calculated detected values in storing means; a second step of adding the calculated detected values corresponding to the respective channels and the respective keys for each possible combination of the respective channels and the respective keys, and of storing the added detected values in the storing means; a third step of receiving and comparing the respective results of the addition for each combination of the respective channels and the respective keys, and of selecting and outputting one result of the addition; and a fourth step of restoring a message embedded as the digital watermark based on the selected and outputted result of the addition of the detected values, and moreover, of performing boundary detection of the audio contents, and detecting the audio contents in which the digital watermark is embedded.
Furthermore, another digital watermark detection method according to the present invention includes: a first step of calculating a detected value which is a correlation coefficient between a frequency component of PCM data of audio contents and a pseudo random number sequence generated by a key used to embed the digital watermark, and of storing the detected value in storing means; a second step of accumulating the calculated detected value in accumulation cycles different from one another, of restoring a message embedded as the digital watermark from the accumulated detected values, and moreover, of performing boundary detection of the audio contents, of detecting the audio contents in which the digital watermark is embedded, and of storing detection results in the storing means; and a third step of receiving the detection results based on the plurality of accumulation cycles, and synthesizing and outputting detection results of the audio contents in the respective processing results, with the digital watermarks being embedded in the audio contents.
Moreover, the present invention is realized as a program for controlling a computer to execute processing corresponding to the respective steps of the above-described digital watermark detection methods, or as a program for allowing the respective functions of the above-mentioned digital watermark detection apparatuses to be realized by the computer. This program is provided by being stored in a magnetic disk, an optical disc, a semiconductor memory or other recording media and distributed, or by being distributed through a network.
According to the present invention constituted as described above, the robustness of the digital watermarks embedded in audio contents processed in various ways can be improved. Specifically, a method and system for appropriately detecting a digital watermark particularly from audio contents short in terms of time or deteriorated audio contents can be provided. Moreover, a method and system embedding and detecting a digital watermark particularly having a strong robustness to deterioration due to superposition of another sound on audio contents.
A best mode for carrying out the present invention (hereinafter, an embodiment) is described below in detail with reference to the accompanying drawings.
Here, the following two embodiments are described in order to enhance robustness of audio contents in consideration that the audio contents are provided in various modes.
An embodiment in which a digital watermark is detected by use of a plurality of accumulation cycles is described.
When considering a mode of use of audio contents, audio contents deteriorated due to superposition of another sound thereon, such as audio contents used as a BGM, are audio contents having some length in terms of time. Meanwhile, audio contents of approximately two seconds, which are extremely short in terms of time, are used as a jingle or a sound effect for a broadcast program and hardly superposed on another sound. In other words, it can be said that, while it is necessary to accumulate detected values in a long accumulation cycle in order to cope with deterioration of audio contents having some length in terms of time and thus to restore a message (watermark information), the message can be restored relatively satisfactorily even from detected values accumulated in a short accumulation cycle, in the audio contents short in terms of time.
Moreover, as described in the “Problems to be Solved by the Invention,” when the accumulation cycle of the detected values is prolonged, it becomes difficult to restore the message from audio contents short in terms of time.
Accordingly, in this embodiment, detected values of watermark signals are accumulated in a plurality of different accumulation cycles. Audio contents which are short in terms of time and resistant to deterioration and audio contents which have some length in terms of time and have a possibility of deterioration are coped with by means of appropriate accumulation cycles, and the message is restored.
The computer apparatus shown in
Note that
Referring to
The watermark signal detection units 11 are realized, for example, by the CPU 101 which is shown in
As shown in
The calculated detected value is temporarily stored in a storage device, for example, such as the main memory 103 and a cache memory of the CPU 101 in
The detected value adding unit 12 is realized, for example, by the CPU 101 which is shown in
Each of the message restoration units 13 is realized, for example, by the CPU 101 and the storing means such as the main memory 103 which are shown in
Moreover, the message restoration units 13 detect boundaries of the audio contents based on the respective restoration results of the messages. The restoring of the messages and the detection of the boundaries of the audio contents are performed, and thus the audio contents in which the digital watermarks are embedded are detected. Portions which are not determined to be audio contents in which the digital watermarks are embedded in the PCM data are determined to be audio contents in which the digital watermarks are not embedded.
In this embodiment, as shown in
As described above, audio contents deteriorated due to superposition of another sound thereon have some length in terms of time, and accordingly, the message restoration units 13 of which accumulation cycles are long are provided in order to cope with such audio contents having a possibility of deterioration. Therefore, as accumulation cycles of message restoration units 13 become higher, threshold values for restoring the messages from detected values may be set higher. Thus, the message restoration unit 13b becomes less prone to be affected by an influence of the deterioration caused by superposition of another sound thereon than the message restoration unit 13a, the message restoration unit 13c becomes less prone than the message restoration unit 13b, and noise due to the deterioration is removed appropriately, thus making it possible to restore the messages correctly.
The messages restored by the respective message restoration units 13 are individually held temporarily in storage device such as, for example, the main memory 103 and the cache memory of the CPU 101 in
The detection result output unit 14 is realized, for example, by the CPU 101 which is shown in
Rule 1: When the audio contents in which the digital watermark is embedded are detected by any of the message restoration units 13, the detection result output unit 14 determines the audio contents as audio contents in which the digital watermark is embedded, and outputs the detection result. Specifically, only when the audio contents in which the digital watermark is embedded are not detected by any of the message restoration units 13, the detection result output unit 14 determines the portion concerned as the audio contents in which the digital watermark is not embedded.
Rule 2: When the audio contents in which the digital watermark is embedded are detected by the plurality of message restoration units 13, the detection result output unit 14 prioritizes the detection result by a message restoration unit 13 of which accumulation cycle is shorter. Specifically, when contents of the messages restored by a plurality of message restoration units 13 contradict one another, the detection result output unit 14 employs the message restored by a message restoration unit 13 of which accumulation cycle is shorter as the detection result.
Rule 3: When the audio contents in which the digital watermark is embedded is detected by a predetermined message restoration unit 13, the detection result output unit 14 outputs the detection result without waiting for detection of the audio contents in which the digital watermark is embedded by the other message restoration unit 13 of which accumulation cycle is longer than that of the predetermined message restoration unit 13. This is because the detection result by a message restoration unit 13 of which accumulation cycle is shorter is prioritized by Rule 2, and thus the detection result by a message restoration unit 13 of which accumulation cycle is longer does not affect the final detection result by the detection result output unit 14.
Moreover, in the integration processing of the detection results, the detection result output unit 14 integrates results of the boundary detection of the audio contents by the respective message restoration units 13, and performs appropriate boundary detection.
The following case is assumed: where there is PCM data in which audio contents [1] having no digital watermark embedded therein, audio contents [2] having a message A embedded therein, and audio contents [3] having a message B embedded therein continue in such an order of [1], [3], [1], [3] and [2].
In this case, referring to
However, the message B in the audio contents [3] concerned is detected by the message restoration unit 13b having a longer accumulation cycle. Hence, according to the above-described Rule 1, the boundary of the audio contents is determined based on the detection result by the message restoration unit 13b.
Meanwhile, it is understood that the message restoration unit 13b was not able to detect the audio contents [2] which appeared finally in the PCM data. This means the following: the audio contents [3] which appeared immediately before the audio contents [2] concerned are contained in the accumulation cycle because the audio contents [2] are extremely short; the message A in the audio contents [2] and the message B in the audio contents [3] are mixed in the detected value; and thus the message A was not able to be restored. Note that, as a general action of the message restoration units 13, a current state is maintained until a message different from a current message is restored or until it is settled that a different message is not restored, and accordingly, in the detection result by the message restoration unit 13b in
However, the message A in the audio contents [2] concerned is detected by the message storing unit 13a having the short accumulation cycle. Therefore, according to the above-mentioned Rule 2, the boundary of the audio contents is determined based on the detection result by the message restoration unit 13a.
An action of the detection result output unit 14 is described below in detail by giving a specific detection example.
In
In Section I, no message is restored in any of the respective message restoration units 13a, 13b and 13c. Accordingly, a final detection result of Section I also becomes the audio contents [1] in which no message is embedded.
In Section II, the message A is restored by the message restoration unit 13a, and accordingly, a final detection result of Section II becomes the audio contents [2] in which the message A is embedded according to the above-mentioned Rules 2 and 3. Moreover, a length of Section II is determined based on the boundary detection result by the message restoration unit 13a.
In Section III, no message is restored in any of the respective message restoration units 13a, 13b and 13c. Accordingly, a final detection result of Section III also becomes the audio contents [1] in which no message is embedded.
In Section IV, the messages B are restored by the message restoration units 13b and 13c, and accordingly, a final detection result of Section IV becomes the audio contents [3] in which the message B is embedded according to the above-mentioned Rule 1. Moreover, a length of Section IV is determined in a manner that a section of the audio contents [3] (section where the message 6B is restored) by a boundary detection result of the message restoration unit 13c and a section of the audio contents [3] by a boundary detection result of the message restoration unit 13b are superposed on each other.
In Section V, no message is restored in any of the respective message restoration units 13a, 13b and 13c. Accordingly, a final detection result of Section V also becomes the audio contents [1] in which no message is embedded.
In Section VI, the messages B are restored by the message restoration units 13a and 13b, and accordingly, a final detection result of Section VI becomes the audio contents [3] in which the message B is embedded according to the above-mentioned Rule 1. A length of Section VI is explained in combination with that of the next Section VII.
In Section VII, the message A is restored by the message restoration unit 13a. Moreover, in a part of Section VII, the message B is restored by the message restoration unit 13b. Detection results by the message restoration unit 13a and the message restoration unit 13b contradict each other. However, the detection result by the message restoration unit 13a of which accumulation cycle is shorter is prioritized according to the above-mentioned Rule 2, and a final detection result of Section VII becomes the audio contents [2] in which the message A is embedded.
Here, the lengths of Sections VI and VII are described. When assuming a section formed by adding Sections VI and VII, the message restoration unit 13b restores the message B from Section VI to a part of Section VII, and performs the boundary detection while regarding this described section as a section of the audio contents [3]. Meanwhile, the message restoration unit 13a restores the message B and the message A, and performs the boundary detection while regarding the section where the message B is restored as a section of the audio contents [3] and the section where the message A is restored as a section of the audio contents [2]. Rules 2 and 3 are applied to these boundary detection results, and the lengths of Sections VI and VII are determined based on the boundary detection results by the message restoration unit 13a.
In Section VIII, no message is restored in any of the respective message restoration units 13a, 13b and 13c. Accordingly, a final detection result of Section VIII also becomes the audio contents [1] in which no message is embedded.
As described above, in this embodiment, messages embedded in audio contents by digital watermarks are restored by use of the plurality of message restoration units 13 having different accumulation cycles. Thus, a message restoration unit 13 of which accumulation cycle is shorter copes with audio contents short in terms of time, and a message restoration unit 13 of which accumulation cycle is longer copes with audio contents which have some length and a possibility of deterioration, thus making it possible to restore the messages embedded in the audio contents.
Moreover, because the audio contents short in terms of time have less possibility of deterioration due to superposition of another sound thereon, this embodiment prioritizes the detection result by the message restoration unit 13 of which accumulation cycle is shorter, which copes with the audio contents short in terms of time. Then, when the message is restored and the digital watermark is detected by the message restoration unit 13 of which accumulation cycle is shorter, the detection result is outputted without waiting for the restoring of the message by the other message restoration units 13. Therefore, a time required for detecting the digital watermark can be shortened by not waiting for the restoring of the message by the message restoration units 13 of which accumulation cycles are longer.
Next, an embodiment of embedding and detecting a digital watermark by use of a plurality of pseudo random number patterns is described.
A digital watermark embedding apparatus and a digital watermark detection apparatus according to this embodiment, are realized, for example, by the computer apparatus as in
Referring to
When the watermark signals are created, the adding unit 22 adds together the PCM data as the aspect to be processed and the watermark signals in the time domain, which are created by the watermark signal creating unit 21, thus creating PCM data in which the digital watermarks are already embedded (Step 706).
Here, in this embodiment, the digital watermarks are embedded in the PCM data of the respective channels according to the above-described procedure, and in Step 704, the watermark signal creating unit 21 creates the pseudo random number sequence for the respective channels by use of a plurality of keys, thus creating the watermark signals. Hence, the digital watermarks embedded in the respective channels differ in the watermark signals added to the original PCM data though messages thereof are the same. Note that, in order to allow the watermark signals added to the data of the respective channels to entirely differ from one another, the number of keys is equalized to the number of channels of the audio contents.
In this embodiment, as described above, the digital watermarks are embedded in the plurality of channels of the audio contents by the watermark signals created by use of the plurality of keys. Accordingly, the digital watermark detection apparatus 30 of this embodiment detects the watermark signals from the data of the respective channels of the audio contents by use of the respective keys used for embedding the digital watermarks. Then, the digital watermark detection apparatus 30 synthesizes the obtained detected values in various ways of possible combinations, selects appropriate combinations, and restores the messages. Hereinafter, for simplicity, it is assumed that the audio contents are recorded in two-channel stereo (channels L and R), and description is made by taking as an example the case where digital watermarks are embedded in the data of the respective channels by use of watermark signals created by two keys (K1 and K2).
The watermark signal detection units 31 are realized, for example, by the CPU 101 which is shown in
Here, in this embodiment, it is necessary to detect the watermark signals created in the data of the two channels (L and R) by use of the keys (K1 and K2) different from each other. It is not known, in the data of each of the two channels, which of the keys is used for the embedding of watermarks. Accordingly, for the data of each channel, a watermark signal detection unit 31 which detects watermark signals by use of the key K1 and a watermark signal detection unit 31 which detects watermark signals by use of the key K2 are individually prepared. In
Note that, though the four watermark signal detection units 31 are illustrated in
Moreover, a configuration is also conceivable, in which information indicating which of the keys is used for the embedding of digital watermarks is provided in the data of the respective channels from the digital watermark embedding apparatus 20 to the digital watermark detection apparatus 30, and thus the number of watermark signal detection units 31 are reduced. However, when considering that the data of the respective channels change places with each other by only changing cables on instruments on a path where the audio contents are distributed, it is preferable to prepare the four watermark signal detection units 31 described above.
The detected value adding units 32 are realized, for example, by the CPU 101 which is shown in
Here, when the watermarks are embedded in the data of the two channels (L and R) by use of the keys (K1 and K2) different from each other, it is understood that the different keys are used for the data of the respective channels, and accordingly, possible combinations are two, which are: a combination of the detected values by the watermark signal detection units 31LK1 and 31RK2; and a combination of the detected values by the watermark signal detection units 31LK2 and 31RK1. Hence, as shown in
In general, a combination of the same channels or the same keys has no possibility to be present, and accordingly, the number of possible combinations is n!, and it is necessary to prepare n! pieces of detected value adding units 32.
Incidentally, when the audio contents from which digital watermarks are to be detected are received as one-channel monaural contents though the audio contents are originally two-channel stereo contents, possible combinations differ from those in the above-described case.
In the monaural contents, digital watermarks are embedded by the watermark signals created by use of the key K1 or the key K2 as in the above-described case, and accordingly, two watermark signal detection units 31 are used, which are: a watermark signal detection unit 31MK1 which detects watermark signals from the one-channel (channel M) data by use of the key K1; and a watermark signal detection unit 31MK2 which detects watermark signals therefrom by use of the key K2.
As a mode in which the two-channel stereo contents are converted into the one-channel monaural content, there are two cases, which are: a case where the data of the two channels are added together; and a case where any one of the data of the two channels is deleted. Therefore, as cases of monaural contents, there are cases which are: a case where both of the digital watermarks using the key K1 and the key K2 are embedded; a case where digital watermarks using the key K1 is embedded; and a case where digital watermarks using the key K2 is embedded. Accordingly, a detected value adding unit 32 which adds together the detected values by the watermark signal detection units 31MK1 and 31MK2 in combination with each other, a detected value adding unit 32 which receives the detected value by the watermark signal detection unit 31MK1 and outputs the detected value as is, and a detected value adding unit 32 which receives the detected value by the watermark signal detection unit 31MK2 and outputs the detected value as is, are prepared. Note that watermark signal detection units 32 which output one of the detected values by the watermark signal detection unit 31MK1 and the detected value by the watermark signal detection unit 31MK2 as is are not essential constituent elements, and it is also possible to allow a storage device to hold the detected values by one of the watermark signal detection unit 31MK1 and the watermark signal detection unit 31MK2 as is and to allow the comparison/selection unit 33 to be described later to process the detected values.
The comparison/selection unit 33 is realized, for example, by the CPU 101 which is shown in
Although various methods are conceivable as a technique of selecting the best detected values, for example, the following methods can be adopted.
The message restoration unit 34 is realized, for example, by the CPU 101 which is shown in
In such a way, the embedding of digital watermarks using a plurality of pseudo random number patterns and the detection of such digital watermarks are realized. As described above, digital watermarks embedded by use of pseudo random number patterns by use of a plurality of keys are detected from data processed by the pseudo random number patterns by the keys for use when the digital watermarks are embedded. Specifically, in order to detect watermark signals from the data of the plurality of channels, different pieces of processing by different keys are implemented for the data of the respective channels in the watermark signal detection units 31. Therefore, even if a sound analogous to a monaural sound, such as a narration, is superposed on audio contents, the superposed sound becomes data sequences completely different from each other for each of the channels through the processing by pseudo random number patterns generated by use of the keys, and the correlation therebetween becomes lowered. Hence, while components of watermark signals in the detected values are intensified together by being added together in detected value adding units 32, components of audio data superposed on the audio contents are not intensified together, and accordingly, it becomes easy to detect watermark signals.
In this embodiment, digital watermarks are embedded in respective pieces of data of a plurality of channels of audio contents by watermark signals created by use of keys different from each other. However, when audio contents are two-channel stereo contents, a configuration may be adopted, in which a pseudo random number sequence generated by use of a key is allowed to change sign and to be used as two pseudo random number sequences, and watermark signals are created and embedded in the data of the respective channels. In such a way, when detecting the digital watermarks, differences between the detected values by the watermark signal detection units 31 in the respective channels are subtracted, and thus influences of the monaural sounds superposed on the audio contents are cancelled each other out. Only the components of the watermark signals in the detected values can be intensified together, and it becomes easier to detect the watermark signals.
Moreover, when embedding watermark signals in the data of the respective channels, it is also possible to perform a so-called permutation encryption. In this case, permutation is performed and decryption of the watermark information is performed when the detected values by the watermark signal detection units 31 are added together in detected value adding units 32. In such a way, security of digital watermarks can be enhanced.
Moreover, when completely different audio contents are individually recorded in a plurality of channels, there is also a possibility that the components of the watermark signals in the detected values are not appropriately intensified together even if the detected values from the data of the respective channels are added together. Accordingly, in a use environment where there is a possibility of receiving such audio contents as aspects from which digital watermarks are to be detected, it is preferable to set not only the values obtained by adding together the detected values from the data of the respective channels but also the detected values from the data of the respective channels as they are as aspects to be processed in the comparison/selection unit 33.
In the functional configuration of the digital watermark detection apparatus 30 according to this embodiment, which is shown in
A plurality of message restoration units 41 are realized, for example, by the CPU 101 which is shown in
A comparison/selection unit 42 is realized, for example, by the CPU 101 which is shown in
As above, Embodiment 1 which performs the detection of the digital watermark by use of the plurality of accumulation cycles and Embodiment 2 which performs the embedding and detection of the digital watermark by use of the plurality of pseudo random number patterns are described. It is also possible to constitute a digital watermark detection apparatus by combining these embodiments.
Note that the watermark signal detection units 31, the detected value adding units 32 and the comparison/selection unit 33 are similar to the watermark signal detection units 31, the detected value adding units 32 and the comparison/selection unit 33, which are described in Embodiment 2, and the message restoration units 13 and the detection result output unit 14 are similar to the message restoration units 13 and the detection result output unit 14, which are described in Embodiment 1. Hence, the same reference numerals are added to these components, and detailed description thereof is omitted.
In the digital watermark detection apparatus of
The message restoration units 51 are realized, for example, by the CPU 101 which is shown in
Comparison/selection units 52 are realized, for example, by the CPU 101 which is shown in
Although the preferred embodiments of the present invention have been described in detail, it should be understood that various changes, substitutions and alternations can be made therein without departing from spirit and scope of the inventions as defined by the appended claims.
Variations described for the present invention can be realized in any combination desirable for each particular application. Thus particular limitations, and/or embodiment enhancements described herein, which may have particular advantages to a particular application need not be used for all applications. Also, not all limitations need be implemented in methods, systems and/or apparatus including one or more concepts of the present invention.
The present invention can be realized in hardware, software, or a combination of hardware and software. A visualization tool according to the present invention can be realized in a centralized fashion in one computer system, or in a distributed fashion where different elements are spread across-several interconnected computer systems. Any kind of computer system—or other apparatus adapted for carrying out the methods and/or functions described herein—is suitable. A typical combination of hardware and software could be a general purpose computer system with a computer program that, when being loaded and executed, controls the computer system such that it carries out the methods described herein. The present invention can also be embedded in a computer program product, which comprises all the features enabling the implementation of the methods described herein, and which—when loaded in a computer system—is able to carry out these methods.
Computer program means or computer program in the present context include any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after conversion to another language, code or notation, and/or reproduction in a different material form.
Thus the invention includes an article of manufacture which comprises a computer usable medium having computer readable program code means embodied therein for causing a function described above. The computer readable program code means in the article of manufacture comprises computer readable program code means for causing a computer to effect the steps of a method of this invention. Similarly, the present invention may be implemented as a computer program product comprising a computer usable medium having computer readable program code means embodied therein for causing a function described above. The computer readable program code means in the computer program product comprising computer readable program code means for causing a computer to effect one or more functions of this invention. Furthermore, the present invention may be implemented as a program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for causing one or more functions of this invention.
It is noted that the foregoing has outlined some of the more pertinent aspects and embodiments of the present invention. This invention may be used for many applications. Thus, although the description is made for particular arrangements and methods, the intent and concept of the invention is suitable and applicable to other arrangements and applications. It will be clear to those skilled in the art that modifications to the disclosed embodiments can be effected without departing from the spirit and scope of the invention. The described embodiments ought to be construed to be merely illustrative of some of the more prominent features and applications of the invention. Other beneficial results can be realized by applying the disclosed invention in a different manner or modifying the invention in ways known to those familiar with the art.
Number | Date | Country | Kind |
---|---|---|---|
2004-99592 | Mar 2004 | JP | national |
Number | Date | Country | |
---|---|---|---|
Parent | 11087298 | Mar 2005 | US |
Child | 12140750 | US |