The invention relates to a method and an apparatus for transmitting watermark data bits using a spread spectrum, and to a method and an apparatus for regaining watermark data bits embedded in a spread spectrum.
‘Watermarking’ means imperceptible insertion of information into multimedia data, e.g. audio data and/or video data. The insertion of additional information data, such as a number or a text, into multimedia data is performed through slight modification of the original multimedia data. Watermarking can be used for e.g. copyright protection, labelling (e.g. URL of a site or a site's logo), monitoring, tamper proofing, or conditional access.
Applying ‘spread spectrum’ in a (RF) communications system, means that a small baseband signal bandwidth is intentionally spread over a larger bandwidth by injecting or adding a higher-frequency signal, or spreading function. As a direct consequence, the energy used for transmitting the signal is spread over a wider bandwidth, and appears as noise.
Spread spectrum technology and the related inserted or added information signal can be used for implementing watermarking of e.g. digital audio signals, whereby the spread spectrum can use the complete audio spectrum from 0 Hz to one half of the sampling frequency. This spectrum carries the information of one bit. In a modification of such systems shorter spread spectrum sequences are used leading to band limited spread spectrum signals, so that several ones of the band limited spread spectrum signals can be added at different centre frequencies to the audio spectrum, at which centre frequencies the original audio signal has been notch filtered, in order to increase the bitrate of the watermark signals and/or to prevent attacks on the watermarked signals. In this watermark system the spread spectrum signals are modulated on a carrier.
A known processing for retrieving at receiver or decoder side the watermark signal information bit from the spread spectrum is convolving the received or replayed spectrum with a spreading function that is time-inverse with respect to the original spreading function, which kind of processing is also called ‘applying a matched filter’. If BPSK modulation was used for applying the spread spectrum function, the output of this process is a peak at the middle of the sequence of correlation values, whereby the sign of such peak represents the value of the desired watermark signal information bit, c.f.
This decoding processing works fine in case undisturbed signal are received by the decoder. However, if the received input signals contain multipath or echo or reverberation distortions, the convolved output signals will contain more than one peak per watermark signal information bit (i.e. per convolution result) to be decoded so that, e.g. depending on the amplitude or power of the distortion peaks, it is difficult or in many cases even impossible to retrieve the correct watermark information bits.
A problem to be solved by the invention is to increase the robustness of spread spectrum systems against echo and reverberation distortions, and to reduce the number of erroneously demodulated watermark signal information bits. This problem is solved by the methods disclosed in claims 1 to 3. Corresponding apparatuses which utilise these methods are disclosed in claims 6 to 8, respectively.
In a first embodiment of the invention, two or more orthogonal spreading sequences or functions are combined at transmitter or encoder or source side with the original or encoded audio signal in baseband, i.e. without modulating the spreading sequences or functions on a carrier before combining them with the original or encoded audio signal. ‘Orthogonal’ spreading sequences or functions means that the cross-correlation of such sequences yields a zero-value result, or a very small-value result. When applying the corresponding time-inverse orthogonal spreading sequences or functions at receiver or decoder side, echoes that are longer than each one of spreading sequence's or function's lengths can be fully removed.
In a second embodiment of the invention the time-inverse versions of not necessarily orthogonal spreading sequences or functions are modified at receiver or decoder side according to pre-known or estimated echo delay values and fading parameters. In case of estimated echo delay values the delay time period measurements/calculations can be repeated for several succeeding audio signal frames before a valid delay time period value is formed.
Advantageously the number of watermark signal bit errors due to echoes caused by multipath or reverberated reception conditions is substantially decreased.
The features of the first and second embodiment can also be combined in that two or more orthogonal baseband spreading sequences or functions are used which are being modified at decoder side according to echo delay values and fading parameters.
In principle, the inventive method is suited for transmitting watermark data bits using a spread spectrum, said method including the steps:
In principle, the inventive method is suited for regaining watermark data bits embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated at encoder side on an encoder pseudo-noise sequence and said modulated encoder pseudo-noise sequence was transformed into the frequency domain and shaped in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information was transmitted or transferred, and said shaped encoder pseudo-noise frequency domain sequence was transformed back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence was one Nth of the length of a frame of said audio signal, N being an integer number greater one, wherein N orthogonal encoder pseudo-noise sequences were used per frame of said audio signal for carrying out said combining for corresponding sections of a current frame, said method including the steps:
In principle, the inventive method is also suited for regaining watermark data bits embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated at encoder side on an encoder pseudo-noise sequence and said modulated encoder pseudo-noise sequence was transformed into the frequency domain and shaped in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information was transmitted or transferred, and said shaped encoder pseudo-noise frequency domain sequence was transformed back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence corresponded to the length of a frame of said audio signal and said encoder pseudo-noise sequence was used for carrying out said combining for a current frame, said method including the steps:
In principle the inventive apparatus is suited for transmitting watermark data bits using a spread spectrum, said apparatus including:
In principle the inventive apparatus is suited for regaining watermark data bits embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated at encoder side on an encoder pseudo-noise sequence and said modulated encoder pseudo-noise sequence was transformed into the frequency domain and shaped in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information was transmitted or transferred, and said shaped encoder pseudo-noise frequency domain sequence was transformed back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence was one Nth of the length of a frame of said audio signal, N being an integer number greater one, wherein N orthogonal encoder pseudo-noise sequences were used per frame of said audio signal for carrying out said combining for corresponding sections of a current frame, said apparatus including:
In principle the inventive apparatus is suited for regaining watermark data bits embedded in a spread spectrum, whereby the corresponding original watermark data bits were modulated at encoder side on an encoder pseudo-noise sequence and said modulated encoder pseudo-noise sequence was transformed into the frequency domain and shaped in amplitude according to the masking level curve of an audio signal together with which the watermark data bit information was transmitted or transferred, and said shaped encoder pseudo-noise frequency domain sequence was transformed back into the time domain and was combined with a current frame of data of said audio signal, wherein the length of said encoder pseudo-noise sequence corresponded to the length of a frame of said audio signal and said encoder pseudo-noise sequence was used for carrying out said combining for a current frame, said apparatus including:
Advantageous additional embodiments of the invention are disclosed in the respective dependent claims.
Exemplary embodiments of the invention are described with reference to the accompanying drawings, which show in:
In the watermark signal encoder section in
According to the first embodiment of the invention these different sequences ENCPNSEQ_1 to ENCPNSEQ_N are orthogonal. ‘Orthogonal’ means that any pair of sequences out of the N sequences has a cross-correlation that has an output value of zero, or a very small output value near zero. According to the second embodiment of the invention a single encoder pseudo-noise sequence ENCPNSEQ is used.
The pieces of watermark signals WATS resulting from stage BVMOD are combined with, or added to, corresponding frame sections of spectral audio data in baseband fashion in the watermark shaping and embedding stage WATSE. This is performed in stage WATSE as follows. A current encoder pseudo-noise sequence section ENCPNSEQ_i is transformed into the frequency domain. In the frequency domain, this sequence is ‘shaped’ according to, i.e. its amplitudes envelope is made conforming to, the corresponding frame section masking level shape or curve in masking level threshold information MLAUD. In case there are two encoder pseudo-noise sequence sections per audio frame, sequence ENCPNSEQ_1 is shaped according to the masking level shape or curve in the first half of the audio frame and sequence ENCPNSEQ_2 is shaped according to the masking level shape or curve in the second half of the audio frame.
Following such shaping the encoder pseudo-noise sequence section ENCPNSEQ_i is inversely transformed back into the time domain. The inversely transformed sequence sections ENCPNSEQ_1 to ENCPNSEQ_N are added or combined with the time domain sample values of the current audio frame.
As an alternative, the encoder pseudo-noise sequence section ENCPNSEQ_i as shaped in the frequency domain can be combined with the frequency domain coefficient values of the current audio frame, whereby an encoded audio signal is transmitted via channel WATAUTRMCH.
The output signal of stage WATSE passes through transmitter stage TRM (which includes e.g. a D/A converter and/or an amplifier) and channel WATAUTRMCH to a watermark signal decoder or receiver.
Unintended, in the watermarked audio transmission channel WATAUTRMCH a noise or reverberation or echo signal NRE is added. This channel can be represented by an acoustic connection between a loudspeaker and a microphone.
In the first-embodiment watermark signal decoder section in
In
According to the invention, after the watermark signal decoder receiver part is synchronised, in order to remove a negative echo peak signal in the DECPNSEQ_2 correlation result, either the ‘wrong’ position or the smaller amplitude >‘−1’ (or <‘+1’, respectively) or both facts are used in stage DRECMF to not considering such echo peak signals as valid watermark data bits, or to remove such echo peaks leading to a correlation output signal according to
Stage DRECMF provides the watermark signal decoder output watermark data OWATD which, despite the NRE added on the transmission channel, correspond 100% or nearly 100% to the input watermark data IWATD.
In the second embodiment of the invention a single encoder pseudo-noise sequence ENCPNSEQ is used in the watermark signal encoder and a single correspondingly time-inverse decoder pseudo-noise sequence DECPNSEQ is used in the watermark signal decoder. Apart from that, the watermark signal encoder operates like in the first embodiment.
The watermark signal decoder stages REC, SYNC, and DRECMF operate like in the first embodiment. However, the output signal of receiver stage REC is also fed to an echo detection stage EDET in which echo, multipath or reverberation distortions are detectable and the related delays are calculated. The delay can be estimated using different known methods, e.g. by correlation of the received signal with the unmodified decoder pseudo-noise sequence for one or more frames.
In stage EDET a modified decoder spread spectrum or pseudo-noise sequence MDECPNSEQ is formed by shifting or multiple shifting the position of the original decoder pseudo-noise sequence DECPNSEQ according to the calculated delay or delays, respectively. The output modified decoder spread spectrum sequence MDECPNSEQ is the sum of the original sequence DECPNSEQ and correspondingly delayed (and possibly amplified due to fading) versions of the original sequence, whereby the corresponding cut-off tails of the delayed versions are not considered.
In stage DRECMF the delay measurements, or the correlation/convolution results, for several (succeeding) audio frames are evaluated before a final result on the echo delay is formed.
As an alternative, the encoder pseudo-noise sequence section ENCPNSEQ_i or ENCPNSEQ, respectively, as shaped in the frequency domain can be combined with the frequency domain coefficient values of the current audio frame, whereby an encoded audio signal is transmitted via channel WATAUTRMCH and is correspondingly decoded in a watermark signal decoder.
The pseudo-noise sequences used are calculated by a given algorithm based on a start value. In order to transmit secret watermark data, the start value or even that algorithm can be encrypted and transmitted to the watermark signal decoder wherein it is used to calculate the decoder pseudo-noise sequences DECPNSEQ_i and the modified decoder pseudo-noise sequence MDECPNSEQ.
Instead of audio signals, video signals can be used correspondingly for transmitting watermark data according to the invention.
Number | Date | Country | Kind |
---|---|---|---|
03090431.2 | Dec 2003 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP04/10222 | 9/13/2004 | WO | 6/6/2006 |