The present invention relates to an apparatus and method for embedding and extracting a capturing-resistant audio watermark based on discrete wavelet transform, and a copyright management system using the same; and more particularly, to a copyright protection scheme for audio contents, which embeds a capturing-resistant watermark including user information in an audio content, and extracts the capturing-resistant watermark and uses the user information included in the extracted watermark when a digital rights management (DRM) packaged audio content is captured after passing a copyright authentication process and the captured audio content is illegally duplicated and distributed.
Encryption based digital rights management (DRM) is a copyright protection and management scheme that encodes files and gives a right to only an authenticated user to decode the coded files. However, the encryption based DRM cannot prevent the copyright infringement made by a capturing attack after passing an authentication process. In order to overcome such a shortcoming of the encryption based DRM, a conventional watermarking method for protecting the copyright was introduced. The watermarking method embeds a watermark having user information in digital contents, and traces a person who illegally distributes the digital contents based on the user information in the watermark when the digital contents are illegally distributed.
However, conventional audio watermarking methods cannot accurately extract the user information due to various distortions, sampling rate conversion, compression, format conversion and filter.
A method of amplitude-scaling resilient audio watermarking by quantizing audio frequency component is disclosed in Korean Patent Publication No. 2005-0020040 (Application No. 2003-0057682). Although this conventional method is robust against lossy compression or amplitude variation, which may occur during capturing audio contents, it is weak in sampling variation.
A method for protecting the copyright of video content by embedding a watermark into video content when the video content is reproduced is disclosed in Korean Patent Publication No. 2006-0017420 (Application No. 2004-0066085). That is, it is a copyright protection and tracking method for video content. However, the present invention relates to a capturing resistant audio watermarking method that protects the copyright of audio contents by embedding the capturing resistant watermark into an audio content when the audio content is reproduced.
A conventional method of automatically synchronizing using discreet wavelet transform (DWT) was introduced in an article by S. Wu, J. Huang, D. Huang, and Y. Q. Shi, entitled “Self-synchronized audio watermarking in DWT domain”, in Proc. IEEE Int. Sym. On Circuits and Systems, vol. 5, pp. 712-715, May 2004. This article teaches the method of extracting a watermark although a watermarked signal loses a synchronization point. Differently from the conventional method, the apparatus and method for embedding and extracting a capturing-resistant watermark according to the present invention embeds an artificial synchronization signal into an audio content when a watermark is embedded, and uses the inserted artificial synchronization signal with a synchronization detecting function for the synchronization.
It is an object of the present invention to provide an apparatus and method for embedding a wavelet based watermark robust against lossy compression by embedding a watermark by changing the mean of lowest subband coefficients according to a watermark by transforming an audio signal based on discrete wavelet transform (DWT).
It is another object of the present invention to provide an apparatus and method for embedding a wavelet based watermark for providing the inaudibility of a watermark by controlling a degree of embedding power after applying an audio signal and a discrete wavelet transformed signal to a human auditory model.
It is still another object of the present invention to provide an apparatus and method for embedding a wavelet based watermark for providing a robust characteristic against synchronization point loss by inserting an artificial synchronization signal when a watermark is embedded and using a synchronization detecting function.
To achieve these objects and other advantages and in accordance with the purpose of the invention, as embodied and broadly described herein, there is provided an apparatus for embedding a wavelet based audio watermark, including: a framing unit for dividing an input audio signal into small signals with a regular length; a discrete wavelet transform unit for calculating an mean value of wavelet coefficients by transforming the small signals based on a discrete wavelet transform; and an embedding unit for changing the calculated mean value according to a watermark where a synchronization signal is inserted and inserting the watermark into the audio signal.
The apparatus may further include a human auditory model unit for controlling an insertion level of the watermark using the input audio signal and the discrete wavelet transformed signal.
The apparatus may further include an encoding unit for inserting an additional signal to the watermark and transferring the watermark with the additional signal to the embedding unit in order to detect a synchronization of a watermark.
In another aspect of the present invention, there is provided an apparatus for extracting a watermark based on a discrete wavelet transform including: a synchronization unit for receiving an audio signal with a watermark inserted and searching a synchronization point; an extracting unit for extracting inserted information using a mean value of wavelet coefficients by wavelet transform of an audio signal having the watermark of the searched synchronization point; and a decoding unit for decoding a watermark using the extracted information.
The mean of the wavelet coefficients in the extracting unit may be a mean of lowest subband wavelet coefficients.
The synchronization unit may search a synchronization point using information of adjacent frame, and the information of the adjacent frame may be information about combination of a mean and variance
The decoding of the watermark may extract a watermark using difference information of extracted values between adjacent frames.
In still another aspect of the present invention, there is provided a method for extracting a wavelet based audio watermark including the steps of: a) dividing input audio signal into small signals with a regular length; d) calculating a mean value of wavelet coefficients by performing a discrete wavelet transform on the small signals; and c) inserting a watermark in the audio signal by changing the calculated mean value of discrete wavelet coefficients.
The step c) may include the step of c-1) controlling a degree of embedding the watermark using the input audio signal and the discrete wavelet transformed signal.
The method may further include the step of d) inserting an additional signal into the watermark for detecting a synchronization of the inserted watermark, and transferring the watermark to the step c).
In further still another aspect of the present invention, there is provided a method for extracting a wavelet based audio watermark including the steps of: a) receiving a watermarked audio signal and searching a synchronization point; b) extracting inserted information from the watermarked audio signal using a mean of wavelet coefficients by discrete wavelet transform of the watermarked audio signal having the searched synchronization point; and c) decoding a watermark using the extracted information.
In the step a), the synchronization point may be searched using information about adjacent frames.
The mean of the wavelet coefficients in the step a) may be a mean of lowest subband wavelet coefficients.
In the step c), the watermark may be extracted using difference information of extracted values between adjacent frames.
In even still further another aspect of the present invention, there is provided a copyright management system using an apparatus for embedding and extracting a wavelet based audio watermark, including: a watermark embedding apparatus for embedding the watermark into the audio signal by changing a mean value of wavelet coefficients according to a watermark by performing a discrete wavelet transform on an audio signal which is divided into small signals with a regular length; and a watermark extracting apparatus for searching a synchronization point of a watermarked audio signal, extracting inserted information using a mean of wavelet coefficients by wavelet transforming the watermarked audio signal, and decoding a watermark using the extracted information.
An apparatus and method for embedding and extracting a capturing-resistant audio watermark based on discrete wavelet transform according to the present invention and a copyright managing system using the same can extract a watermark after capturing because it is robust against various distortions that may be made during capturing. Also, the apparatus and method according to the present invention inserts user information into an audio content as a watermark when a DRM packaged audio content is reproduced. Therefore, the copyright of the audio contents can be protected by tracking an illegal distributor using the user information included in the watermark when the audio content is illegally duplicated and distributed. The introduced audio watermarking method according to the present invention uses the simple human auditory model. Also, in order to find a synchronization point, a mean value is made to zero when the watermark is inserted, and a detection function is used. Therefore, the inaudibility of the watermark is achieved and the synchronization point can be detected easily.
The capturing resistant characteristic may be confirmed through experimental results in below tables. The capturing resistant characteristics for sampling rate variation, synchronization loss and compression, which are representative distortions generated during capturing, are confirmed, and then, a watermark is inserted while capturing a DRM packaged file for various types of capturing attacks.
As shown in Tables 1 and 2, the strong capturing-resistant characteristics are shown, and the extraction may fail according to a network environment or a system performance. It is expected that the extraction may fail rarely if experimental results are obtained from a sufficiently longer region using a longer file, as like as the real service, compared to a length of a file used at a test.
Also, as an inaudibility test, a preference test is performed using a watermarked file and a file without a watermark. As shown in Table 3, a user cannot recognize different two files. Therefore, the inaudibility characteristic can be provided by the present invention.
1extracted at a random location,
2extracted after compressing into MP3 at 128 kbps,
3extracted after re-sampling at 32 kHZ,
4extracted after capturing as MP3 32 kHz at 128 kbps.
1capturing as MP3 128 kbps 44.1 kHz,
2capturing as MP3 128 kbps 32 kHz,
3capturing as MP3 128 kbps 48 kHz, and
4capturing WMA 96 kbps 44.1 kHz.
The accompanying drawings, which are included to provide a further understanding of the invention, are incorporated in and constitute a part of this application, illustrate embodiments of the invention and together with the description serve to explain the principle of the invention. In the drawings;
Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings
Referring to
The framing unit 110 divides an original audio signal into small sized frames in order to embed a watermark in the original audio signal. The discrete wavelet transform (DWT) is applied using the divided audio frames from the framing unit 110.
The DWT unit 130 may use one of well-known wavelet filters.
The DWT divides a signal into a low-band signal and a high-band signal using a wavelet filter as shown in a below diagram, where H[n] denotes a high pass filter, and G[n] denotes a low pass filter. The signal can be disassembled with a low band signal with a center, and the disassembled signal can be restored into the original signal through the inverse DWT. The below diagram shows two steps of DWT.
In the present embodiment, a watermark is embedded into an audio signal by changing a mean value of lowest subband coefficients using the DWT.
The human auditory model unit 120 decides a degree of embedding a watermark after applying the human auditory model using the original audio signal and the DWT applied signal.
The human auditory model unit 120 controls the degree of embedding the watermark using the ratio between the energy S1 of the input audio signal and the energy S2 of a high frequency signal in a lowest subband coefficient of the DWT. In this case, inaudibility can be obtained by controlling the degree of embedding the watermark according to the input signal. Also, using the human auditory model with the original audio signal and the discrete wavelet transformed audio signal requires less computation amount compared to the human auditory model used in the conventional MP3.
The encoding unit 150 encodes the watermark before inserting the watermark in order to enhance the performance of detecting a synchronization signal and a watermark.
The operation of encoding the watermark will be described with reference to
An original watermark is expressed as 1 and −1, and these are mapped to a bit 1 and a bit 0, respectively.
The encoding unit 150 inserts a bit having a different sign compared to the uppermost bit of the original watermark signal (a) as shown in (b) of
The embedding unit 140 embeds the encoded watermark block from the encoding unit 150 using the decided inserting degree decided by the human auditory model unit 120 by changing a mean of lowest subband coefficients transformed by DWT at the DWT unit 130.
The mean of lowest subband coefficients changes according to the information of the encoded watermark.
If the information of the encoded watermark is 1, the mean value changes to a positive value, and if the information of the encoded watermark is −1, the mean value changes to a negative value. The changing of the mean of the lowest subband coefficients can be expressed as a below equation.
Cmk[n]
denotes a nth coefficient value of a transformed kth frame.
,Cok[n]
denotes a nth coefficient of a kth original frame. m denotes an mean value of kth frame. Pak denotes an insertion level of a kth frame. H[n] denotes a window function.
The inverse discrete wavelet transform (IDWT) unit 160 restores an original signal with a watermark inserted by transforming watermarked frames based on the IDWT.
The wavelet based audio watermark extracting apparatus according to the present invention includes a synchronization unit 210 for finding a synchronization point, an extraction unit 220 for extracting inserted information when a synchronization point is selected, and a decoding unit 230 for decoding a watermark using the extracted information.
The synchronization unit 210 finds a synchronization point using a 0 signal which is used for encoding the watermark. The synchronization unit 210 finds a synchronization point using characteristics that the mean value of the lowest subband wavelet coefficients between adjacent frames is close to 0 at the synchronization point. The synchronization unit 210 uses a function h[n], which uses the sum and variance of the adjacent frames in order to extract a synchronization which is less effected by various variations such as noises.
h[n]=α*f[n]+β*g[n]
α
and
β
denote weights, f[n] denotes a wavelet coefficient mean of adjacent N frames, and g[n] denotes a wavelet coefficient variance of adjacent N frames.
The extraction unit 220 calculates a mean value of lowest subband coefficients of DTF for each frame of each audio signal through DTF using the same method for inserting the watermark after selecting the synchronization point by the synchronization unit 210. According to its sign, the inserted information is extracted. At first, a middle value of the repeated values is extracted from the repeatedly inserted data. This information is sensitive to noise or amplitude variation. On the contrary, since the difference between adjacent frames is less sensitive to peripheral environment, the watermark is extracted using the difference information.
Also, the extracted value is normalized by estimating a degree of embedding a watermark is estimated from a signal to extract. Although the estimated embedding degree is different from a real embedding degree used to insert the watermark, small variation may be made after inserting a watermark. That is, large variation is not made because of using a ratio between the energy of the original signal and a high band coefficient among wavelet coefficient.
The decoding unit 230 restores a watermark using information extracted by the extraction unit 220. The decoding unit 230 restores a watermark using information related to the difference of the extraction unit 220. That is, the decoding unit 230 restores information about 1 and −1 of the original watermark using values corresponding to the difference of the extraction unit 220.
In order to accurately extract a watermark, it is preferable to use a mean value between frames by gathering several watermark blocks. Using the mean value can prevent failure of information extraction because information is lost during capturing or noise is inserted during capturing.
The DRM client module 700 includes a decryption unit 710 for assigning a user license to access an encoded file through an authentication process when a DRM packaged file inputs, and an embedding unit 720 for embedding information about a user such as User ID to raw data with the access right assigned.
As described above, supplementary information with the user information is inserted into an audio content as a watermark at the moment of releasing the encryption according to the present invention. Therefore, the copyright of the audio content can be conveniently protected by tracking an illegal distributor using user information included in the audio content when the audio content is illegally duplicated and distributed.
It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention. Thus, it is intended that the present invention covers the modifications and variations of this invention provided they come within the scope of the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
10-2006-0055953 | Jun 2006 | KR | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/KR2007/002803 | 6/11/2007 | WO | 00 | 10/29/2008 |