1. Field of the Invention
The present invention relates to a method for coding signals for allowing users to play back preview (trial) data, and for implementing high-quality recording/playback operations by adding a small amount of data if a user decides to purchase corresponding data.
2. Description of the Related Art
According to known software distribution methods, audio-visual data is broadcasted by encrypting corresponding signals, or data is recorded in recording media, and only users who have purchased a certain key are allowed to view or listen to the data. As an encryption method, the initial value of a random number sequence is given to a bit string of a pulse code modulation (PCM) audio signal as a key signal, and a bit string obtained by performing an exclusive OR of the generated 0/1 random number sequence and the above-described PCM bit string is transmitted or recorded in a recording medium. According to this method, only the users who have obtained the key signal are allowed to correctly play back the audio signal, and those who have not obtained the key signal are unable to play back the audio signal, and only noise is heard. A more complicated encryption method may be employed by using, for example, the Data Encryption Standard (DES).
Details of DES are described in Federal Information Processing Standards Publication 46, Specifications for the DATA ENCRYPTION STANDARD, Jan. 15, 1977.
Methods for compressing audio signals and then broadcasting corresponding data or recording the data in a recording medium are available. Accordingly, recording media, such as magneto-optical disks, are widely used for recording coded audio signals. There are various techniques for coding audio signals with high efficiency. For example, in a block-less frequency-band division technique, i.e., a so-called “sub-band coding (SBC)”, an audio signal in the time domain is divided into a plurality of frequency bands and coded without dividing them into blocks. In a block frequency-band division technique, i.e., a so-called “transform coding”, a signal in the time domain is transformed (spectrum transform) into a signal in a frequency domain so as to be divided into a plurality of frequency bands. The signal components are then coded in each band. Another high-efficiency coding technique, which is a combination of the above-described sub-band coding and transform coding, has also been considered. In this case, for example, after sub-band division is performed in the above-described SBC, signal components in each sub band are transformed into signal components in the frequency domain, and are then coded in each band.
Filters used in the above-described high-efficiency coding methods include quadrature mirror filters (QMF), details of which are described in R. E. Crochiere, Digital coding of speech in subbands, Bell Syst. Tech. J., vol. 55, No. 8, 1976.
An equal-bandwidth filtering technique is described in ICASSP 83, BOSTON, Polyphase Quadrature filters—A new subband coding technique, Joseph H. Rothweiler.
As the above-described spectrum transform, for example, an input audio signal is formed into blocks in predetermined time units (frames), and discrete Fourier transform (DFT), discrete cosine transform (DCT), or modified DCT (MDCT) is performed on the signal components in each block, thereby transforming a time-domain signal into a frequency-domain signal.
Details of MDCT are described in ICASSP 1987, Subband/Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing Cancellation, J. P. Princen, A. B. Bradley, Univ. of Surrey, Royal Melbourne Inst. of Tech.
In the spectrum transform using the above-described DFT or DCT, when the spectrum transform is performed in a time block consisting of M samples, M items of independent real-number data are obtained. Generally, in order to reduce distortion at the connections between time blocks, M1 samples overlap between adjacent blocks, and thus, on average, in DFT or DCT, M items of real-number data are quantized and coded for (M-M1) samples.
In contrast, in the spectrum transform using the above-described MDCT, M items of independent real-number data are obtained from 2M samples, in which M samples overlap with half of adjacent blocks. Accordingly, in MDCT, M items of real-number data are quantized and coded for M samples. In a decoding apparatus, coded data obtained by performing MDCT is inverse-transformed in each block, and the resulting waveform components are added together while interfering with each other so as to reconstruct a waveform signal.
Generally, the spectrum frequency resolution is enhanced as the time block for spectrum transform becomes longer, thereby allowing energy to be concentrated in specific spectral components. As described above, in MDCT, the spectrum transform is performed with an increased block length by overlapping samples between adjacent blocks, and the number of spectral signal components remains the same as the original number of samples. By using such MDCT, coding can be performed with higher efficiency than by using DFT or DCT. Also, by allowing a sufficiently long overlapping portion between adjacent blocks, inter-block distortion of the waveform signal can be reduced.
By quantizing signal components divided into sub bands by using a filter or spectrum transform, bands in which quantizing noise is generated can be controlled, and high-efficiency coding can be performed by utilizing the masking effect. Before performing quantizing, if signal components in each band are normalized by the maximum of the absolute values of the signal components in the corresponding band, higher efficiency coding can be performed.
Signal components are divided into frequency bands with bandwidths considering, for example, human acoustic characteristics. That is, generally, an audio signal is divided into a plurality of bands (for example, 25 bands) so that the bandwidth of the higher bands, which are referred to as the “critical bands”, becomes greater. Then, data in each band is coded according to a predetermined bit distribution or an adaptive bit allocation. For example, coefficient data obtained by the above-described MDCT processing in each block is coded by the number of adaptively allocated bits.
The following two bit allocation techniques are known.
One technique is disclosed in Adaptive Transform Coding of Speech Signals, R. Zelinski and P. Noll, IEEE Transactions of Acoustics, Speech, and Signal Processing, Vol. ASSP-25, No. 4, August 1977. In this technique, bit allocation is performed according to the magnitude of the signal in each band, and thus, the quantizing noise spectrum becomes smooth to minimize the noise energy. However, since the masking effect is not employed, the actual sound is not acoustically optimal.
The other technique is disclosed in ICASSP 1980, The Critical band coder—digital encoding of the perceptual requirements of the auditory system, M. A. Kransner, MIT. In this method, by utilizing the masking effect, fixed bit allocation is performed by obtaining a signal-to-noise (S/N) ratio required for each band. However, due to the fixed bit allocation, even when the characteristic of a sinusoidal wave input is measured, a precise value cannot be obtained.
In order to overcome the above drawbacks, the following high-efficiency coding apparatus has been proposed. Fixed bit allocation is partially performed on some blocks, and adaptive bit allocation is partially performed so that bits determined by the magnitudes of the signal components in the other blocks are allocated to the corresponding blocks. The division ratio of the two types of bit allocations is determined by an input signal, and the division ratio of the fixed bit allocation becomes higher as the signal spectrum becomes smoother.
According to the above-described coding apparatus, many bits can be allocated to blocks containing specific spectral components, such as sinusoidal waves, in which energy is concentrated, thereby making it possible to considerably improve the overall S/N ratio characteristics. Generally, the human acoustic characteristics are extremely sensitive to signals having sharp spectral components. Accordingly, an improved S/N ratio by using this method is effective not only in enhancing precise measurements, but also in improving the sound quality.
Many other bit allocation techniques have been proposed, and acoustic models are becoming increasingly precise. Accordingly, if the performance of a coding apparatus becomes higher, even higher efficiency coding is possible. In these methods, the bit-allocation real-number reference value is determined so that the calculated S/N ratio can be faithfully achieved, and the integer approximating the reference value is used as the number of allocation bits.
In International Publication No. WO94/28633 (corresponding to U.S. Pat. No. 5,717,821) filed by the present inventors, another coding method has been proposed in which tone components that are particularly important in an acoustic sense, i.e., signal components in which energy is concentrated, are extracted from a spectrum signal, and are separately coded from the other spectral components. According to this coding method, audio signals can be efficiently coded with a high compression ratio with very little degradation.
In forming code strings, quantizing-precision information and normalizing-coefficient information are coded with a predetermined number of bits in each band, and the resulting normalized and quantized spectrum signal is coded.
A high-efficiency coding method in which the number of bits representing the quantizing precision differs according to the band is described in ISO/IEC 11172-3: 1993(E), 1993. In this standard, the number of bits indicating the quantizing-precision information becomes smaller as the band becomes higher.
Instead of directly coding quantizing precision information, the quantizing-precision information may be determined from the normalizing-coefficient information in a decoding apparatus. According to this method, however, the relationship between the normalizing-coefficient information and the quantizing-precision information is determined when the standard is set, which makes it impossible to introduce the quantizing precision based on more precise acoustic models in the future. Additionally, if the compression ratio has a range, the relationship between the normalizing-coefficient information and the quantizing-precision information has to be determined according to each range.
Another known coding method is disclosed in D. A. Huffman: A Method for Construction of Minimum Redundancy Codes, Proc. I.R.E., 40, p.1098 (1952). In this method, a quantized spectrum signal is coded more efficiently by using variable codes.
The signal coded as described above can be encrypted and distributed, as in PCM signals, in which case, those who have not obtained the corresponding key are unable to play back the original signal. Alternatively, instead of encrypting a coded bit string, a PCM signal may be converted into a random signal, which is then coded for compression. It is also impossible for users who have not obtained the corresponding key to play back the original signal.
In this so-called-scrambling method, however, without the key, the software data cannot be checked. Also, if the user plays back the data with regular playback means, only noise is heard. Accordingly, this method cannot be used for, for example, the following application. A disk in which music with a relatively low audio quality is recorded is distributed, and after listening to the disk, the user purchases the key only for a music piece that he or she likes and is able to play back that music piece with high audio quality. Alternatively, after listening to the distributed disk, the user is able to purchase a new disk in which music is recorded at high quality.
When encrypting signals subjected to high-efficiency coding, it is very difficult to maintain the compression efficiency while providing code strings that are meaningful for regular playback means. That is, when a scrambled code string is played back, as described above, only noise is heard, and also, playback means may not operate at all if the scrambled code string is not compatible with the original high-efficiency code standard. Also, if a scrambled PCM signal is coded with high efficiency by decreasing the amount of information by utilizing the acoustic characteristics, the scrambled PCM signal cannot always be reproduced when the coded signal is decoded. Thus, it is difficult to descramble the signal. Accordingly, a method for precisely descrambling the signal must be employed by sacrificing the compression efficiency.
U.S. Pat. No. 6,081,784 or Japanese Unexamined Patent Application Publication No. 10-135944 filed by the present inventors discloses the following audio coding method. In this method, among spectral signal components coded from a music signal, signal components only in higher bands are encrypted, thereby enabling users to play back a preview file (trial file) without a corresponding key. More specifically, in this method, signal components only in higher bands are encrypted, and also, high-band bit allocation information is replaced by dummy data, true bit allocation information being recorded in a position ignored by decoders. According to this method, the users are able to enjoy music pieces that please them with high audio quality after listening to the preview file.
In this method, however, since the security is uniquely dependent on encryption, if the data is deciphered, high-quality music can be illegally played back.
In order to overcome this drawback, International Publication No. WO02/065449 discloses the following method. Part of the information to be recorded in a recording medium is replaced by dummy data so that it can be played back at relatively low quality, and when it becomes necessary to play back the information with high quality, the dummy data is replaced by true data, thereby eliminating the possibility of the data being deciphered. Additionally, data can be played back by regular playback devices regardless of whether the data is recorded with high quality or low quality. According to this method, the data can be played back with high quality after checking the content while enhancing the security compared to a method employing an encryption key.
According to the above method, however, dummy data must be replaced by high quality data, and the amount of such data is large, though it is smaller than a preview file. Accordingly, it takes time to transmit the required data, thereby increasing the overall time.
Thus, in International Application No. PCT/JP03/04526 or Japanese Patent Application No. 2002-107084, the present inventors have proposed that part of the dummy data be contained in preview data, and the amount of true data in a high quality file required for replacing the dummy data when the data is played back with high quality is reduced, thereby decreasing the overall time. To implement this, when the number of coding units indicating the number of bands to be coded is contained in an original code string, a small value is written in the original code string as dummy data so that the code string is played back in accordance with the dummy data, and part of the true coding information is recorded in a position ignored by a decoder. With this method, the amount of data required for achieving a high-quality playback operation can be reduced.
If high quality data is encrypted and then sold, network transactions are possible.
According to the above-described method, the true coding information (true band information) has to be recorded in a high-quality file. If the bandwidth is changed in every frame when coding the data, such a change becomes noticeable as noise. Thus, the true band information is normally set to the same value without being changed. Then, when decrypting the key information of the high-quality file, it is highly likely that the same band information appears in regular positions. Such an analysis can be automatically performed by using, for example, acomputer, thereby increasing the possibility of the high-quality file being deciphered.
Accordingly, in order to solve the above-described problems, it is an object of the present invention to prevent important data from being deciphered by separating regularly appearing data items in an encrypted file from the important data and by not encrypting such data items, or by encrypting such data items separately from the important data.
In order to achieve the above object, according to one aspect of the present invention, there is provided a code-string generating method including: a generation step of generating a first code string and a second code string from at least part of an original code string obtained by coding a signal, the first code string containing a code string having a low playback quality and the second code string being used for increasing the quality of a playback signal of the first code string; and an encryption step of encrypting the second code string. In the encryption step, a first portion of the second code string having an exactly or almost regular pattern is encrypted by a first encryption process, or the first portion is not encrypted, and a second portion, which is part of the second code string other than the first portion, is encrypted by a second encryption process different from the first encryption process.
In the above method, “having an exactly or almost regular pattern” means that all the data items can be found by predetermined processing, or a predetermined ratio, for example, 90%, of the data items can be found by predetermined processing, for example, the values of data items are the same or are increased by one at regular intervals.
The first encryption process and the second encryption process can be differentiated by changing the algorithm or the encryption key. The length of the processing unit of the first and/or the second portion of the second code string can be set to an integral multiple of the length of the processing unit of the encryption processing algorithm, thereby increasing the encryption processing rate.
The first portion of the second code string may include at least one of playback band information and synchronizing signal information. The first and second code strings do not have to be completely separated into different bit streams. For example, the first or second portion of the second code string may be embedded into a portion of the first code string which is ignored by a normal signal playback step. In this case, it is preferable that the first or second portion of the second code string be identified separately from the data of the first code string by, for example, being embedded in a predetermined position of the first code string, or by adding an ID code, or by including information indicating the position at which the first or second portion is embedded in the second code string.
The above-described generation step may include a replacement step of replacing at least part of the original code string by dummy data. Then, the first code string may contain the dummy data and the original code which is not replaced by the dummy data, and the second code string may contain part of or all of true data of the original code string which is replaced by the dummy data.
In this case, various portions, such as portions corresponding to spectrum coefficients of the original code string and coding parameters, can be replaced by dummy data. If the present invention is applied to, for example, the distribution of audio signals, various portions in audio code strings, such as intermediate-spectrum coefficients, high-band normalizing coefficient information and/or quantizing precision information, and quantizing units, can be replaced by dummy data. Replacement of such data by dummy data can be made in units of frames, and within the same frame, only part of or the whole frame can be replaced. Also, according to the type of true data to be replaced by dummy data, the configuration of the dummy data may be changed, or the true data may be replaced in various manners.
Part of or all of the true data of the original code string, which is different from the true data contained in the second code string, may be contained at a position in the first code string and that is different from the dummy data. That is, in the present invention, the true data does not have to be wholly contained in the second code string, that is, part of the true data may be contained in the second code string, and the rest of the true data may be contained in the first code string. In this case, it is preferable that the true data contained in the first code string may be encrypted or scrambled. Alternatively, it may be contained in the first code string at a position that cannot be played by a normal decoder, thereby preventing the true data being played back by a simple playback operation of the first code string.
According to another aspect of the present invention, there is provided a code-string generating apparatus including: a generator for generating a first code string and a second code string from at least part of an original code string obtained by coding a signal, the first code string containing a code string having a low playback quality and the second code string being used for increasing the quality of a playback signal of the first code string: and an encryption unit for encrypting the second code string. In the encryption unit, a first portion of the second code string having an exactly or almost regular pattern is encrypted by a first encryption process, or the first portion is not encrypted, and a second portion, which is part of the second code string other than the first portion, is encrypted by a second encryption process different from the first encryption process.
According to still another aspect of the present invention, there is provided a code-string decrypting method including: a first decryption step of decrypting by a first decryption process, or not decrypting, a first encrypted code string corresponding to a first portion of a code string which has an exactly or almost regular pattern before being encrypted; and a second decryption step of decrypting a second encrypted code string corresponding to a second portion, which is part of the code string other than the first portion, by a second decryption process different from the first decryption process. In this case, the length of the processing unit of the first and/or the second portion can be set to an integral multiple of the length of the processing unit of the encryption processing algorithm, thereby increasing the decryption processing rate.
According to a further aspect of the present invention, there is provided a code-string decrypting apparatus including: a first decryption unit for decrypting by a first decryption process, or for not decrypting, a first encrypted code string corresponding to a first portion of a code string which has an exactly or almost regular pattern before being encrypted; and a second decryption unit for decrypting a second encrypted code string corresponding to a second portion, which is part of the code string other than the first portion, by a second decryption process different from the first decryption process.
According to a yet further aspect of the present invention, there is provided a code-string encrypting method including the steps of: separating a code string into a first code-string portion having an exactly or almost regular pattern and a second code-string portion other than the first code-string portion; and encrypting, or not encrypting, the first code-string portion by a first process, and encrypting the second code-string portion by a second process different from the first process.
According to a further aspect of the present invention, there is provided a code-string encrypting apparatus including a unit for separating a code string into a first code-string portion having an exactly or almost regular pattern and a second code-string portion other than the first code-string portion, and for encrypting, or not encrypting, the first code-string portion by a first process, and encrypting the second code-string portion by a second process different from the first process.
According to a further aspect of the present invention, there is provided a recording medium including: a first code-string portion having an exactly or almost regular pattern encrypted or not encrypted by a first process; and a second code-string portion other than the first code-string portion encrypted by a second process different from the first process.
According to the present invention, the possibility of an encrypted file being deciphered becomes low. The security of a high-quality file for increasing the quality of audio code strings of a preview file can be enhanced. The security when encrypting audio code strings themselves can also be enhanced.
The present invention is described in detail below with reference to the accompanying drawings through illustration of preferred embodiments.
The optical head 53 is formed of a laser light source, such as a laser diode, optical elements, such as a collimator lens, an object lens, a polarizing beam splitter, and a cylindrical lens, and a photodetector provided with a light-receiving portion having a predetermined pattern. The optical head 53 is disposed such that it faces the magnetic head 54 with the magneto-optical disk 1 therebetween. When recording data on the magneto-optical disk 1, the magnetic head 54 is driven by a magnetic-head driving circuit 66 of a recording system, which is described below, so as to apply a modulated magnetic field corresponding to the recording data, and also, laser light is applied to a target track of the magneto-optical disk 1 by the optical head 53, thereby performing thermomagnetic recording according to the magnetic modulation method. The optical head 53 also detects light reflected by the target track irradiated with laser light so as to detect focus errors by, for example, the astigmatism, and detect tracking errors by, for example, the push-pull method. When playing back data from the magneto-optical disk 1, the optical head 53 detects the above-described focus errors and tracking errors, and also detects the difference of the polarizing angle (Kerr rotation angle) of the reflected light from the target track, thereby generating a playback signal.
An output of the optical head 53 is supplied to an RF circuit 55. The RF circuit 55 extracts the above-described focus error signal and tracking error signal from the output of the optical head 53, and supplies them to a servo control circuit 56. The RF circuit 55 also binarizes the playback signal and supplies it to a decoder 71 of a playback system, which is described below.
The servo control circuit 56 is formed of, for example, a focus servo control circuit, a tracking servo control circuit, a spindle-motor servo control circuit, and a sled servo control circuit. The focus servo control circuit performs focus control on the optical system of the optical head 53 so that the above-described focus error signal becomes zero. The tracking servo control circuit performs tracking control on the optical system of the optical head 53 so that the above-described tracking error signal becomes zero. The spindle-motor servo control circuit controls the spindle motor 51 so that the magneto-optical disk 1 is rotated at a predetermined rotational speed (for example, constant linear speed). The sled servo control circuit moves the optical head 53 and the magnetic head 54 to the position of a target track designated by the system controller 57. The servo control circuit 56, which performs the above-described various control operations, sends information indicating the operating statuses of the individual elements to the system controller 57.
A key input operation device 58 and a display device 59 are connected to the system controller 57. The system controller 57 controls the recording system and the playback system upon receiving operation information input via the key input operation device 58. The system controller 57 controls the recording position and the playback position on a recording track traced by the optical head 53 and the magnetic head 54 based on address information of each sector, which is played back from the recording track by using the header time or sub-code Q data. The system controller 57 also controls the display device 59 to display the playback time based on the data compression ratio of the recording/playback apparatus shown in
The playback time is calculated as follows. The address information (absolute time information) of each sector, which is played back from the recording track of the magneto-optical disk 1 based on the header time or the sub-code Q data, is multiplied by the reciprocal of the data compression ratio (for example, if the compression ratio is 1/4, the reciprocal is 4). The resulting actual time information is displayed on the display device 59. Not only during the playback operation, but also during the recording operation, if absolute time information is pre-recorded (pre-formatted) in a recording track of a magneto-optical disk, it may be read and then multiplied by the reciprocal of the data compression data, thereby making it possible to display the current position by the actual recording time.
In the recording system of this recording/playback apparatus, an analog audio input signal Ain from an input terminal 60 is supplied to an analog-to-digital (A/D) converter 62 via a low-pass filter 61, and is quantized in the A/D converter 62. A digital audio signal output from the A/D converter 62 is supplied to an adaptive transform coding (ATC) encoder 63. Meanwhile, a digital audio signal Din from an input terminal 67 is supplied to the ATC encoder 63 via a digital input interface circuit 68. The ATC encoder 63 performs bit compression (data compression) according to a predetermined data compression ratio on the digital audio PCM data at a predetermined transfer rate, which is obtained by quantizing the input signal Ain by the A/D converter 62. The compressed data (ATC data) output from the ATC encoder 63 is supplied to a memory 64. If the data compression ratio is, for example, 1/8, the data transfer rate is reduced to 1/8 (9.375 sectors/second) of the data transfer rate (75 sectors/second) of the standard CD-DA.
The reading and writing of data from and into the memory 64 is controlled by the system controller 57, and the memory 64 serves as a buffer memory for temporarily storing the ATC data supplied from the ATC encoder 63 and for recording it on the disk 1 if necessary. More specifically, if the data compression ratio is, for example, 1/8, the data transfer rate of the compressed audio data supplied from the ATC encoder 63 is reduced to 9.375 sectors/second, which is 1/8 of the data transfer rate (75 sectors/second) of the standard CD-DA format, and this compressed data is continuously written into the memory 64. It is sufficient that only one eighth of the sectors of this compressed data (ATC data) be recorded. In practice, however, it is almost impossible to record data every eight sectors, and thus, the consecutive-sector recording, which is described below, is performed.
In this recording, burst-recording is performed with a recording pause at a data transfer rate (75 sectors/second), which is the same as that of the standard CD-DA format, by using a cluster consisting of a predetermined plurality of sectors (for example, 32 sectors plus several sectors) as the recording unit. That is, in the memory 64, the ATC audio data having a 1/8 data compression ratio continuously written into the memory 64 at a low transfer rate of 9.375 (=75/8) is read at the above-described transfer rate of 75 sectors/second. Although the overall transfer rate of the data read and recorded as described above including a recording pause is as low as 9.375 sectors/second, the instantaneous data transfer rate during the burst-recording operation time results in 75 sectors/second. Accordingly, when the disk rotational speed is the same as the transfer rate (constant linear speed) of the standard CD-DA format, recording is performed with the same recording density and the same storage pattern as those of the standard CD-DA format.
The ATC audio data, i.e., recording data, read from the memory 64 at an instantaneous transfer rate of 75 sectors/second is supplied to an encoder 65. The amount of data continuously recorded into the encoder 65 at one time is one cluster consisting of a plurality of sectors (for example, 32 sectors) and a few cluster-connecting sectors disposed before and after the cluster. The cluster-connecting sectors are set to be longer than the interleaving length in the encoder 65 so as to prevent the cluster-connecting sectors from influencing the other clusters even if interleaving is performed.
The encoder 65 performs error-correcting coding (parity addition and interleaving) and eight-to-fourteen modulation (EFM) coding on the recording data supplied from the memory 64. The coded recording data is supplied to the magnetic-head driving circuit 66. The magnetic-head driving circuit 66 is connected to the magnetic head 54 so as to drive it to apply a modulated magnetic field corresponding to the recording data to the magneto-optical disk 1.
As described above, the system controller 57 controls the reading and writing of data from and into the memory 64, and also controls the recording positions so that the data read from the memory 64 is continuously recorded into the recording tracks of the magneto-optical disk 1. More specifically, the system controller 57 controls the recording position by supplying a control signal indicating the recording position on a recording track of the magneto-optical disk 1 to the servo control circuit 56.
The playback system is now described below. The playback system plays back the data continuously recorded on the recording tracks of the magneto-optical disk 1 by the above-described recording system. The playback system is provided with the decoder 71. A playback output obtained by tracing the recording tracks of the magneto-optical disk 1 with laser light by the optical head 53 is binarized in the RF circuit 55 and is supplied to the decoder 71. In this case, not only magneto-optical disks, read only optical discs, such as compact discs, can be read.
The decoder 71 corresponds to the encoder 65 of the recording system, and performs error-correcting decoding and EFM decoding on the playback output binarized by the RF circuit 55. The ATC audio data having a data compression rate of 1/8 is played back at a transfer rate of 75 sectors/second, which is higher than the normal transfer rate. The playback data obtained by the decoder 71 is supplied to a memory 72.
The reading and writing of data from and into the memory 72 is controlled by the system controller 57. Burst-writing is performed on playback data supplied from the decoder 71 at a transfer rate of 75 sectors/second into the memory 72 at a transfer rate of 75 sectors/second. The playback data written into the memory 72 at the transfer rate of 75 sectors/second is continuously read from the memory 72 at a transfer rate of 9.375 sectors/second, which is equivalent to the data compression ratio of 1/8.
The system controller 57 controls the playback data to be written into the memory 72 at a transfer rate of 75 sectors/second and also to be continuously read from the memory 72 at a transfer rate of 9.375 sectors/second. The system controller 57 also controls the playback positions so that the playback data written into the memory 72 can be continuously played back from the recording tracks of the magneto-optical disk 1. More specifically, the system controller 57 controls the playback position by supplying a control signal indicating the playback position on a recording track of the magneto-optical disk 1 or the optical disc 1 to the servo control circuit 56.
ATC audio data obtained as the playback data continuously read from the memory 72 at a transfer rate of 9.375 sectors/second is supplied to an ATC decoder 73. The ATC decoder 73, which corresponds to the ATC encoder 63 of the recording system, performs data decompression (bit decompression) on the ATC data by, for example, eight times, thereby playing back 16-bit digital audio data. The digital audio data is then supplied to a digital-to-analog (D/A) converter 74.
The D/A converter 74 converts the digital audio data supplied from the ATC decoder 73 into an analog signal so as to form an analog audio output signal Aout. The analog audio signal Aout is then output from an output terminal 76 via a low-pass filter 75.
Details of high-compression coding are described below. High-efficiency coding operations on input digital signals, such as audio PCM signals, using techniques, such as SBC, ATC, and adaptive bit allocation, are discussed below with reference to
The coding efficiency achieved by the above-described coding method can further be improved. For example, among quantized spectral components, a relatively short code length can be assigned to signal components which appear with higher frequency, and a relatively long code length can be assigned to signal components which appear with lower frequency, thereby enhancing the coding efficiency. Also, by increasing the transform block length, the amount of sub information, such as quantizing-precision information and normalizing-coefficient information, can be relatively reduced, and also, the frequency resolution can be increased, thereby making it possible to more precisely control the quantizing precision in the frequency domain. As a result, the coding efficiency can be enhanced.
In International Publication No. WO94/28633 (corresponding to U.S. Pat. No. 5,717,821) filed by the present inventors, another coding method has been proposed in which tone components that are particularly important in an acoustic sense, i.e., signal components having energy being concentrated in specific frequencies, are extracted from a spectrum signal, and are separately coded from the other spectral components. According to this coding method, audio signals can be efficiently coded with a high compression ratio with very little degradation.
International Publication No. WO02/065449 discloses a coding method shown in
In the example shown in
Although in the above-described example both the quantizing-precision information and the normalizing-coefficient information are partially replaced by dummy data, dummy data may be used for only one item of the information. If only the quantizing-precision information is replaced by 0-bit dummy data, narrow-band data having a spectrum, such as that shown in
A determination as to whether quantizing-precision information or normalizing-coefficient information is to be replaced by dummy data can be made by the possibility of true values being predicted. In terms of the security, the best is to replace both the quantizing-precision information and the normalizing coefficient information by dummy data since no data is then available for predicting the true values. If dummy data is used for only the quantizing-precision information, the possibility of quantizing-precision information being predicted from the normalizing-coefficient information becomes higher if the quantizing-precision information has been determined based on the normalizing coefficients according to a bit allocation algorithm. In contrast, it is relatively difficult to predict the normalizing coefficient information from the quantizing-precision information. Thus, if dummy data is used for only one of the normalizing-coefficient information and the quantizing-precision information, the normalizing-coefficient information is preferably replaced by dummy data in terms of the security. The quantizing-precision information and the normalizing-coefficient information may be selectively replaced by dummy data depending on the band.
Regardless of whether the quantizing-precision information or the normalizing-coefficient information is replaced by dummy data, it is difficult to predict a relatively large amount of data than to decipher a relatively short key used for normal encryption. Accordingly, for example, the possibility of protecting the copyright of music can be increased. Even if true values of a certain piece of music are predicted from dummy data, there is no danger of expanding such damage to other pieces of music, unlike when an encryption algorithm is deciphered. Thus, the above-described method achieves higher security than an encryption method.
The coding methods applicable to the present invention have been described. By applying one of the above-described coding methods to an audio recording/playback operation, relatively low-quality audio data can be freely previewed or listened to as a preview file, and high-quality audio data can be viewed or listened to by, for example, purchasing an additional file having a data amount smaller than the preview file.
The data amount of an additional file for achieving a high-quality playback operation can further be decreased, as disclosed in International Application No. PCT/JP03/04526 or Japanese Patent Application No. 2002-107084 (which was not published when the present application was filed). By decreasing the data amount of an additional file, the time required for obtaining the additional data by communication means can be reduced, and also, the overall time for obtaining high-quality audio data after the user has decided to purchase it can also be decreased.
The method for reducing the data amount of an additional file is described below, assuming that tone components are not separated. However, to further develop such a method, tone components can be separated.
In this example, for easily identifying the start position of a code string, the quantizing-precision information and the normalizing-coefficient information are sequentially coded from a lower band to a higher band from the end of the frame. Such information may be disposed in another order. However, particularly when the frame length is fixed, it is very convenient to specify the position of a code string if true data is coded from the end of the frame. Also in this example, part of the spectrum-coefficient information in the intermediate bands is replaced by dummy data, and this data is ignored while playing back the preview file, thereby preventing the occurrence of noise during the playback operation. In this example, variable-length codes are used for coding the spectrum-coefficient information. Accordingly, if intermediate-band data is unknown to the user, the spectrum-coefficient information higher than the intermediate-band data cannot be read, thereby making it possible to enhance the security.
In step S11, a code string containing dummy data is decomposed. Then, it is determined in step S12 whether a high-quality playback operation is to be performed. If the outcome of step S12 is yes, the process proceeds to step S13 in which the number of true coding units is embedded into the code string. If a high-quality playback operation is not performed, the process proceeds to step S19. In step S14, true quantizing-precision information and true normalizing-coefficient information are read. Then, in step S15, true spectrum-coefficient information is read. Subsequently, in step S16, spectrum-coefficient information containing dummy data is read, and then, in step S17, the true quantizing-precision information and the true normalizing-coefficient information are embedded into the code string. Thereafter, in step S18, true spectrum-coefficient information is generated from the information read in steps S15 and S16, and is then embedded into the code string from the position subsequent to the position of the true quantizing-precision information and the true normalizing-coefficient information. In step S19, the resulting code string is decoded into signal components, and in step S20, the decoded signal components are transformed into a time-series signal. The processing is then completed.
In step S21, it is determined whether a high-quality recording operation is to be performed. If the outcome of step S21 is yes, the process proceeds to step S22 in which a code string containing dummy data is decomposed. If a high-quality recording operation is not performed, the process proceeds to step S29. In step S23, the number of true coding units is embedded into a code string. In step S24, true quantizing-precision information and true normalizing-coefficient information are read. Then, in step S25, true spectrum-coefficient information is read. Subsequently, in step S26, spectrum-coefficient information containing dummy data is read, and then, in step S27, the true quantizing-precision information and the true normalizing-coefficient information are embedded into the code string. Thereafter, in step S28, true spectrum-coefficient information is generated from the information read in steps S25 and S26, and is then embedded into the code string from the position subsequent to the position of the true quantizing-precision information and the true normalizing-coefficient information. In step S29, the resulting code string is recorded. The processing is then completed.
Although in this example the information of the number of true coding units is coded as part of the high-quality file, it may be coded in an area, which can be ignored by a decoder, of a code string of each frame of the original preview file. In this case, to reserve such an area, the number of bits used for coding the corresponding frame with high quality is reduced by the information for the number of coding units. The method of the present invention also encompasses such a modification.
The present inventors have proposed a method for performing a recording/playback operation by changing part of a first code string containing dummy data based on a second code string (high-quality file), as disclosed in Japanese Patent Application No. 2002-146731 (which was not published when this application was filed). In this method, true data that replaces dummy data in the first code string is embedded in only part of the frames of the first code string. According to this method, the data amount of the second code string (high-quality file) can be reduced while maintaining a high level of security. The method of the present invention can be applied to such a method.
Meanwhile, the code string portions 883 having almost a regular pattern are supplied to an encryption device A 1883, and are encrypted by using a key 889 different from the key 890 used in the encryption device B 1884, the key 889 also being generated by the controller 1886. An encrypted code string 885 is supplied to the high-quality code-string synthesizer 1885. As stated above, however, the encryption device A 1883 may be omitted, and the code string portions 883 having almost a regular pattern may be supplied to the high-quality code-string synthesizer 1885 as plaintext. The high-quality code-string synthesizer 1885 combines the code strings 885 and 886 to create a high-quality file 887, such as that shown in
However, if the encryption device A 1883 is omitted, the code-string portions 902 are not encrypted. In this case, the decryption device A 1902 is omitted, and the code-string portions 902 are directly input to the high-quality processor 1904 as the code string 904. A code string 906, which is equivalent to the preview file 888 shown in
For sending and receiving the decryption keys 910 and 911 between the server for a high-quality file and the high-quality processor 1904, the following known processing can be performed, assuming that a common private key and a random-number generator are provided for the two elements. The server first generates a random number, encrypts it with a private key shared with the high-quality processor 1904 by using DES, and then sends the encrypted random number to the high-quality processor 1904. Then, the high-quality processor 1904 decrypts the random number by using the private key, encrypts the value obtained by adding one to the decrypted value by using the private key according to DES, and sends the encrypted value to the server. The server then decrypts the encrypted value received from the server to check whether the decrypted value is one greater than the random number generated by the server. If this verification succeeds, the server determines that the high-quality processor 1904 is an authenticated device that possesses the correct private key. Subsequently, the above-described verification is performed by changing roles, that is, the high-quality processor 1904 checks whether the server is an authenticated device.
After verifying authentication of both the high-quality processor 1904 and the server, the high-quality processor 1904 generates a random number R, encrypts the random number R with, for example, the random number that was previously sent to the server, according to DES, and sends the encrypted random number R to the server. The server then decrypts the random number R, and sends a key to encrypt a high-quality file by using the random number R as a key to the high-quality processor 1904. If there are a plurality of keys to encrypt the high-quality file, such keys may be repeatedly sent by varying the value of the random number R. In the above-described example, the private key is stored in a highly secure ROM in the controller 1905, and the controller 1905 contains a built-in random-number generator. Also in the server, the private key is highly secure, and the server has a built-in random-number generator.
If the code string portions having almost a regular pattern are not encrypted, in step S41, the code string portions having almost a regular pattern and the other code string portions are separated, and step S42 is omitted. Then, in step S44, a code string that can be played back with high quality is generated from the preview file, the code string portions having almost a regular pattern, and the other code string portions decrypted in step S43. The key required for decrypting the code string can be received from the server according to a method similar to the above-described method.
The method for coding and encrypting the number of coding units and the intermediate-spectrum coefficient has been described. This method is also applicable to the method proposed in Japanese Patent Application No. 2002-160996 (which was not published when the present invention was filed) by the present inventors. In this method, in some frames, a first signal indicating an original music signal and a second signal indicating a message, such as “this is a preview signal”, are superimposed and coded. If the user desires to purchase the music signal, the frames in which the first and second signals are superimposed are replaced by a code string indicating only the first signal.
The method of the present invention has been described as an encryption method for a high-quality file that achieves a high-quality playback operation from a preview file. The method of the present invention is also applicable to an encryption method for a music-code-string file itself. That is, an entire music code string may be encrypted with a format such as that shown in
Although audio signals are employed in the above-described embodiment, the present invention can be used for image signals. For example, when an image signal is coded with MPEG-1, the start code of the picture layer is fixed. If a code string containing such a start code is encrypted, it may be deciphered based on the start code of the picture layer for a reason similar to that described above. The method of the present invention can also be effectively employed in such a case.
The method of the present invention is also effective not only when recording code strings, but also when transmitting code strings. The present invention is applicable to, for example, audio signals to be broadcasted. In this case, the security of code strings that achieve high-quality playback from code strings of a preview signal can be enhanced, and the security of the preview signal can also be enhanced.
In the description of the present invention, code strings encrypted with different methods include code strings encrypted with different keys.
Number | Date | Country | Kind |
---|---|---|---|
2002-249263 | Aug 2002 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
5423003 | Berteau | Jun 1995 | A |
5546379 | Thaweethai et al. | Aug 1996 | A |
5602991 | Berteau | Feb 1997 | A |
5717821 | Tsutsui et al. | Feb 1998 | A |
6081784 | Tsutsui | Jun 2000 | A |
Number | Date | Country |
---|---|---|
10-135944 | May 1998 | JP |
2003-304158 | Oct 2003 | JP |
2003-337599 | Nov 2003 | JP |
WO9428633 | Jun 2000 | WO |
WO 02065449 | Aug 2002 | WO |
WO 03085836 | Oct 2003 | WO |
Number | Date | Country | |
---|---|---|---|
20040141612 A1 | Jul 2004 | US |