1. Technical Field of the Invention
The present invention relates to a sound signal encoding apparatus for and a sound signal encoding method of encoding and transmitting a sound signal, and more particularly to a sound signal encoding apparatus for and a sound signal encoding method of encoding and transmitting a sound signal such as a music sound signal data in a manner that the sound signal is encoded at a relatively high quality and smoothly transmitted to other electrically operating units via computer network.
2. Description of the Related Art
There have so far been proposed a wide variety of sound signal encoding apparatuses of this type one typical example of which is shown in
The conventional sound signal encoding apparatus further comprises second sound signal sections analyzing means 105, frequency components calculating means 106, and quantization bit numbers allocating means 107. The second sound signal sections analyzing means 105 is operative to analyze a masking threshold level for each of the divided sound signal sections based on a psycho acoustic model obtained by taking advantage of human's hearing characteristics.
The frequency components calculating means 106 is operative to calculate frequency components with two different information consisting of first and second signals for each of the sound signal sections sampled at the selected sampling rate, the above first signal being indicative of intensities, while the above second signal being indicative of frequencies. The quantization bit numbers allocating means 107 is designed to allocate quantization bit numbers for each of the calculated frequency components for each of the sound signal sections.
The conventional sound signal encoding apparatus further comprises first frequency components compressing means 108, and second frequency components compressing means 109. The first frequency components compressing means 108 is adapted to compress the frequency components for each of the sound signal sections with two different information consisting of first and second signals. The second frequency components compressing means 109 is operative to compress the frequency components for each of the sound signal sections with two different information consisting of first and second signals. The above first signal is indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal, while the above second signal is indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal.
The conventional sound signal encoding apparatus further comprises frequency components quantizing means 110 and frequency components encoding means 111. The frequency components quantizing means 110 is operative to quantize each of the frequency components for each of the sound signal sections at the predetermined quantization bit numbers. The frequency components encoding means 111 is operative to encode the quantized frequency components for each of the sound signal sections to a multiplexed bit stream. The multiplexed bit stream is constituted by the sound signals for each of the sound signal sections and general information needed for the sound signals to be encoded and decoded.
The sound signal is encoded by the conventional sound signal encoding apparatus in accordance with the MPEG2 AAC (Advanced Audio Coding) decided by the Motion Picture Experts Group, and is then transmitted at a predetermined transmitting bit rate to other electrically operating units via computer network.
The conventional sound signal encoding apparatus thus constructed in the above encounters such a problem that the sound signal tends to be encoded at a relatively low quality. The reason is due to the fact that the first frequency components compressing means 108 is operative to compress the frequency components for each of the sound signal sections with two different information consisting of the first and second signals. The above second signal is intended to indicate the intensity ratio of one of the frequency components for the first channel signal and the frequency components for the second channel signal to the other of the frequency components for the first channel signal and the frequency components for the second channel signal. The compression of the sound signal thus performed by the first frequency components compressing means 108 results in an excessive compression to the sound signal, and contributing to a wasteful load to computers building the network and deteriorating a music sound quality when the sound signal is decoded.
It is, therefore, an object of the present invention to provide a sound signal encoding apparatus which can prevent the quality of the decoded sound signal from deteriorating resulting from the excessive compression to the sound signal.
It is another object of the present invention to provide a delivery system for delivering sound signal data related to a music at a relatively high quality irrespective of either the compressed sound signal or the non-compressed sound signal.
The one aspect of the sound signal encoding apparatus according to present invention comprises sound signals dividing means for dividing each of the two different sound signals into a plurality of sound signal sections along a time axis for each of the sound signals to be taken for receiving therein, the sound signals consisting of a first channel signal and a second channel signal; first sound signal sections analyzing means for analyzing each of the divided sound signal sections based on the sound signal characteristics inherent in the sound signal; sampling rate selecting means for selecting one arbitrary sampling rate for each of the analyzed sound signal sections from among predetermined sampling rates; sound signal sampling means for sampling each of the analyzed sound signal sections at the sampling rate selected by the sampling rate selecting means; second sound signal sections analyzing means for analyzing each of the divided sound signal sections based on a psycho acoustic model obtained by taking advantage of human's hearing characteristics; frequency components calculating means for calculating frequency components with two different information consisting of first and second signals for each of the sound signal sections sampled at the selected sampling rate, the above first signal being indicative of intensities, and the above second signal being indicative of frequencies; quantization bit numbers allocating means for allocating quantization bit numbers for each of the calculated frequency components for each of the sound signal sections; compression level calculating means for calculating a compression level for each of the sound signal sections; compression level judging means for judging whether or not the calculated compression level for each of the sound signal sections exceeds a predetermined threshold compression value; first frequency components compressing means for compressing the frequency components for each of the sound signal sections with two different information consisting of first and second signals, the above first signal being indicative of the intensities and the above second signal being indicative of ratio of one of the frequency components for the first channel signal and the frequency components for the second channel signal to the other of the frequency components for the first channel signal and the frequency components for the second channel signal; second frequency components compressing means for compressing the frequency components for each of the sound signal sections with two different information consisting of first and second signals, the above first signal being indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal, and the above second signal being indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal; frequency components quantizing means for quantizing each of the frequency components for each of the sound signal sections at predetermined quantization bit numbers under two different states consisting of a first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means when the compression level judging means is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value and a second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means when the compression level judging means is operative to judge that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value; and frequency components encoding means for encoding the quantized frequency components for each of the sound signal sections to a multiplexed bit stream with a predetermined bit rate under two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means when the compression level judging means is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value and the second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means when the compression level judging means is operative to judge that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value, the multiplexed bit stream with the predetermined bit rate being constituted by the sound signals for each of the sound signal sections and general information needed for the sound signals to be encoded and decoded.
The another aspect of the sound signal encoding apparatus according to present invention comprises sound signals dividing means for dividing each of the two different sound signals into a plurality of sound signal sections along a time axis for each of the sound signals to be taken for receiving therein, the sound signals consisting of a first channel signal and a second channel signal; first sound signal sections analyzing means for analyzing each of the divided sound signal sections based on the sound signal characteristics inherent in the sound signal; sampling rate selecting means for selecting one arbitrary sampling rate for each of the analyzed sound signal sections from among predetermined sampling rates; sound signal sampling means for sampling each of the divided sound signal sections at the sampling rate selected by the sampling rate selecting means; second sound signal sections analyzing means for analyzing each of the divided sound signal sections based on a psycho acoustic model obtained by taking advantage of human's hearing characteristics; frequency components calculating means for calculating frequency components with two different information consisting of first and second signals for each of the sound signal sections sampled at the selected sampling rate, the above first signal being indicative of intensities, and the above second signal being indicative of frequencies; quantization bit numbers allocating means for allocating quantization bit numbers for each of the calculated frequency components for each of the sound signal sections; energy ratio calculating means for calculating an energy ratio for each of the sound signal sections; energy ratio judging means for judging whether or not the energy ratio for each of the sound signal sections exceeds a predetermined threshold energy value; first frequency components compressing means for compressing the frequency components for each of the sound signal sections with two different information consisting of first and second signals, the above first signal being indicative of the intensities and the above second signal being indicative of ratio of one of the frequency components for the first channel signal and the frequency components for the second channel signal to the other of the frequency components for the first channel signal and the frequency components for the second channel signal; second frequency components compressing means for compressing the frequency components for each of the sound signal sections with two different information consisting of first and second signals, the above first signal being indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal, and the above second signal being indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal; frequency components quantizing means for quantizing each of the frequency components for each of the sound signal sections at predetermined quantization bit numbers under two different states consisting of a first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means when the energy ratio judging means is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value and a second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means when the energy ratio judging means is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value; and frequency components encoding means for encoding the quantized frequency components for each of the sound signal sections to a multiplexed bit stream with a predetermined bit rate under two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means when the energy ratio judging means is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value and the second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means when the energy ratio judging means is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value, the multiplexed bit stream with the predetermined bit rate being constituted by the sound signals for each of the sound signal sections and general information needed for the sound signals to be encoded and decoded.
The further aspect of the sound signal encoding apparatus according to present invention comprises sound signals dividing means for dividing each of the two different sound signals into a plurality of sound signal sections along a time axis for each of the sound signals to be taken for receiving therein, the sound signals consisting of a first channel signal and a second channel signal; first sound signal sections analyzing means for analyzing each of the divided sound signal sections based on the sound signal characteristics inherent in the sound signal; sampling rate selecting means for selecting one arbitrary sampling rate for each of the analyzed sound signal sections from among predetermined sampling rates; sound signal sampling means for sampling each of the divided sound signal sections at the sampling rate selected by the sampling rate selecting means; second sound signal sections analyzing means for analyzing each of the divided sound signal sections based on a psycho acoustic model obtained by taking advantage of human's hearing characteristics; frequency components calculating means for calculating frequency components with two different information consisting of first and second signals for each of the sound signal sections sampled at the selected sampling rate, the above first signal being indicative of intensities, and the above second signal being indicative of frequencies; quantization bit numbers allocating means for allocating quantization bit numbers for each of the calculated frequency components for each of the sound signal sections; compression level calculating means for calculating a compression level for each of the sound signal sections; threshold energy value selecting means for selecting one arbitrary threshold energy value for each of the sound signal sections from among predetermined threshold energy values based on the compression level calculated by the compression level calculating means; energy ratio calculating means for calculating an energy ratio for each of the sound signal sections; energy ratio judging means for judging whether or not the energy ratio for each of the sound signal sections exceeds the threshold energy value selected by the threshold energy value selecting means; first frequency components compressing means for compressing the frequency components for each of the sound signal sections with two different information consisting of first and second signals, the above first signal being indicative of the intensities and the above second signal being indicative of ratio of one of the frequency components for the first channel signal and the frequency components for the second channel signal to the other of the frequency components for the first channel signal and the frequency components for the second channel signal; second frequency components compressing means for compressing the frequency components for each of the sound signal sections with two different information consisting of first and second signals, the above first signal being indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal, and the above second signal being indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal; frequency components quantizing means for quantizing each of the frequency components for each of the sound signal sections at predetermined quantization bit numbers under two different states consisting of a first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means when the energy ratio judging means is operative to judge that the energy ratio for each of the sound signal sections exceeds the selected threshold energy value and a second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means when the energy ratio judging means is operative to judge that the energy ratio for each of the sound signal sections does not exceed the selected threshold energy value; and frequency components encoding means for encoding the quantized frequency components for each of the sound signal sections to a multiplexed bit stream with a predetermined bit rate under two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means when the energy ratio judging means is operative to judge that the energy ratio for each of the sound signal sections exceeds the selected threshold energy value and the second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means when the energy ratio judging means is operative to judge that the energy ratio for each of the sound signal sections does not exceed the selected threshold energy value, the multiplexed bit stream with the predetermined bit rate being constituted by the sound signals for each of the sound signal sections and general information needed for the sound signals to be encoded and decoded.
The still further aspect of the sound signal encoding apparatus according to present invention comprises a sound signals dividing step of dividing each of the two different sound signals into a plurality of sound signal sections along a time axis for each of the sound signals to be taken for receiving therein, the sound signals consisting of a first channel signal and a second channel signal; a first sound signal sections analyzing step of analyzing each of the divided sound signal sections based on the sound signal characteristics inherent in the sound signal; a sampling rate selecting step of selecting one arbitrary sampling rate for each of the analyzed sound signal sections from among predetermined sampling rates; a sound signal sampling step of sampling each of the analyzed sound signal sections at the sampling rate selected by the sampling rate selecting step; a second sound signal sections analyzing step of analyzing each of the divided sound signal sections based on a psycho acoustic model obtained by taking advantage of human's hearing characteristics; a frequency components calculating step of calculating frequency components with two different information consisting of first and second signals for each of the sound signal sections sampled at the selected sampling rate, the above first signal being indicative of intensities, and the above second signal being indicative of frequencies; a quantization bit numbers allocating step of allocating quantization bit numbers for each of the calculated frequency components for each of the sound signal sections; a compression level calculating step of calculating a compression level for each of the sound signal sections; a compression level judging step of judging whether or not the compression level for each of the sound signal sections exceeds the predetermined threshold compression value; a first frequency components compressing step of compressing the frequency components for each of the sound signal sections with the two different information consisting of the first and second signals, the above first signal being indicative of the intensity and the above second signal being indicative of ratio of one of the frequency components for the first channel signal and the frequency components for the second channel signal to the other of the frequency components for the first channel signal and the frequency components for the second channel signal; a second frequency components compressing step of compressing the frequency components for each of the sound signal sections with two different information consisting of the first and second signals, the above first signal being indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal, and the above second signal being indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal; a frequency components quantizing step of quantizing each of the frequency components for each of the sound signal sections at the predetermined quantization bit numbers under the two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing step when the compression level judging step is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value and the second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing step when the compression level judging step is operative to judge that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value; and a frequency components encoding step of encoding the quantized frequency components for each of the sound signal sections to the multiplexed bit stream with the predetermined bit rate under the two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing step when the compression level judging step is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value and the second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing step when the compression level judging step is operative to judge that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value, the multiplexed bit stream with the predetermined bit rate being constituted by the sound signals for each of the sound signal sections and general information needed for the sound signals to be encoded and decoded.
The objects, features and advantages of the present invention will become apparent as the description proceeds when taken in conjunction with the accompanying drawings, in which:
The embodiments of the sound signal encoding apparatus according to the present invention will be described in detail hereinafter.
The first embodiment of the sound signal encoding apparatus 100 according to the present invention is shown in
The sound signals thus inputted to the sound signal encoding apparatus 100 include first and second channel signals represented by reference legends “CH1” and “CH2”, respectively, as shown in
Similarly to the division of the first channel signal CH1, the division of the second channel signal CH2 is performed as follows.
Each of the sound signal sections for the second channel signal CH2 is sequentially represented by reference numbers 5 to 7 as shown in
The first sound signal sections analyzing means 102 is adapted to analyze each of the divided sound signal sections 1 to 3 for the second channel signal CH1 and the divided sound signal sections 5 to 7 for the second channel signal CH2 based on the sound signal characteristics inherent in the sound signal. The sampling rate selecting means 103 is adapted to select one arbitrary sampling rate for each of the sound signal sections from among predetermined sampling rates based on each of the sound signal sections analyzed by the first sound signal analyzing means 102. The sound signal sampling means 104 is adapted to sample each of the sound signal sections at the selected sampling rate.
Each of the divided sound signal sections 1 to 3 for the first channel signal CH1 is sequentially sampled at the selected sampling rate as shown in
Similarly to the division of the first channel signal CH1, the division of the second channel signal CH2 is performed as follows.
Each of the divided sound signal sections 5 to 7 for the second channel signal CH2 is sequentially sampled at the selected sampling rate as shown in
The sound signal encoding apparatus 100 further comprises second sound signal sections analyzing means 105, frequency components calculating means 106, and quantization bit numbers allocating means 107. The second sound signal sections analyzing means 105 is operative to analyze each of the sound signal sections based on a psycho acoustic model obtained by taking advantage of human's hearing characteristics. The frequency components calculating means 105 is operative to calculate the frequency components for each of the sound signal sections sampled at the selected sampling rate based on the Modified Discrete Cosine Transformation.
Each of the frequency components for the sound signal sections 1 is represented by the reference legends “f11”, “f12”, and “f1n”.
Similarly to the division of the first channel signal CH1, the division of the second channel signal CH2 is performed as follows.
Each of the frequency components for the sound signal sections 2 is represented by the reference legends “f21”, “f22”, and “f2n”.
The quantization bit numbers allocating means 106 is operative to allocate quantization bit numbers for each of the calculated frequency components for each of the sound signal sections of both the first channel signal CH1 and the second channel signal CH2 based on the second sound signal sections analyzing means 105.
The sound signal encoding apparatus 100 further comprises compression level calculating means 121 and compression level judging means 122. The compression level calculating means 121 is operative to calculate the compression level for each of the sound signal sections. The compression level judging means 122 is operative to judge whether or not the compression level for each of the sound signal sections exceeds a predetermined threshold compression value. The compression level judging means 122 has a flag with two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means 108 when the compression level judging means 122 is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value and the second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means 108 when the compression level judging means 122 is operative to judge that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value.
The sound signal encoding apparatus 100 further comprises first frequency components compressing means 107 and second frequency components compressing means 108. The first frequency components compressing means 107 is operative to compress the frequency components f11, f12, and f1n (n=1 to N) for each of the sound signal sections 1 and the frequency components f21, f22, and f2n (n=1 to N) for each of the sound signal sections 2 with two different information consisting of first and second signals based on the second sound signal section analyzing means 105. The above first signal is indicative of the intensity signal based on both each of the frequency components f11, f12, and f1n (n=1 to N) for the first channel signal CH1 and each of the frequency components f21, f22, and f2n (n=1 to N) for the second channel signal CH2. The above second signals is indicative of ratio of one of the frequency components f11, f12, and f1n (n=1 to N) for the first channel signal CH1 and the frequency components f21, f22, and f2n (n=1 to N) for the second channel signal CH2 to the other of the frequency components f11, f12, and f1n (n=1 to N) for the first channel signal CH1 and the frequency components f21, f22, and f2n (n=1 to N) for the second channel signal CH2.
The second frequency components compressing means 108 is operative to compress the frequency components f11, f12, and f1n (n=1 to N) for each of the sound signal sections 1 and the frequency components f21, f22, and f2n (n=1 to N) for each of the sound signal sections 2 with two different information consisting of a first and second signals based on the second sound signal section analyzing means 105. The above first signal is indicative of the addition f11+f21, f12+f22, and f1n+f2n (n=1 to N) for each of the frequency components f11, f12, and f1n (n=1 to N) for the first channel signal CH1 and each of the frequency components f21, f22, and f2n (n=1 to N) for the second channel signal CH2, and while the above second signal being indicative of the difference f11−f21, f12−f22, and f1n−f2n (n=1 to N) between each of the frequency components f11, f12, and f1n (n=1 to N) for the first channel signal CH1 and each of the frequency components f21, f22, and f2n (n=1 to N) for the second channel signal CH2.
The sound signal encoding apparatus 100 further comprises frequency components quantizing means 109 and frequency components encoding means 110. The frequency components quantizing means 109 is operative to quantize both the frequency components f11, f12, and f1n (n=1 to N) for each of the sound signal sections of the first channel signal CH1 and the frequency components f21, f22, and f2n (n=1 to N) for each of the sound signal sections of the second channel signal CH2 at the predetermined quantization bit numbers based on the second sound signal sections analyzing means 105 under two different states consisting of a first state in which the frequency components f11, f12, and f1n (n=1 to N) for each of the sound signal sections of the first channel signal CH1 and the frequency components f21, f22, and f2n (n=1 to N) for each of the sound signal sections of the second channel signal CH2 are compressed by the first frequency components compressing means 107 when the compression level judging means 122 is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value, and a second state in which the frequency components f11, f12, and f1n (n=1 to N) for each of the sound signal sections of the first channel signal CH1 and the frequency components f21, f22, and f2n (n=1 to N) for each of the sound signal sections of the second channel signal CH2 are not compressed by the first frequency components compressing means 107 when the compression level judging means 122 is operative to judge that the compression level for each the sound signal sections does not exceed the predetermined threshold compression value.
The frequency components encoding means 110 is operative to encode the quantized frequency components for each of the sound signal sections of both the first channel signal CH1 and the second channel signal CH2 to a multiplexed bit stream under two different states consisting of the first state in which the sound signals for the sound signal sections are compressed by the first frequency components compressing means 107 when the compression level judging means 122 is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value and the second state in which the frequency components for the sound signal sections are compressed by the first frequency components compressing means 107 when the compression level judging means 122 is operative to judge that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value, the multiplexed bit stream being constituted by the sound signals for each of the sound signal sections and general information needed for the sound signals to be encoded and decoded.
The following description will be directed to the operation of the first embodiment of the sound signal encoding apparatus 100 according to the present invention with reference to
Each of the two different sound signals CH1 and CH2 is initially divided into a plurality of the sound signal sections along the time for each of the sound signals to be taken for receiving therein in sound signals dividing step S101. Each of the divided sound signal sections is then analyzed based on the sound signal characteristics inherent in the sound signal in first sound signal sections analyzing step S102. The one arbitrary sampling rate for each of the sound signal sections is then selected from among predetermined sampling rates based on each of the analyzed sound signal sections in sampling rate selecting step S103. Each of the sound signal sections is then sampled at the selected sampling rate in sound signal sampling step S104.
Each of the sound signal sections is then analyzed in second sound signal sections analyzing step S105 based on a psycho acoustic model obtained by taking advantage of human's hearing characteristics. Each of the frequency components for each of the sampled sound signal sections is then calculated in the frequency components calculating step S105 based on the Modified Discrete Cosine Transformation. The quantization bit numbers for each of the frequency components for each of the sound signal section is then allocated in the quantization bit numbers allocating step S107 based on the second sound signal sections analyzing step S105.
The compression level for each of the sound signal sections is then calculated in compression levels calculating step S121. The judgment is made whether or not the compression level for each of the sound signal sections exceeds a predetermined threshold compression level in compression level judging step S122. When the answer in compression level judging step S122 is in the affirmative “YES”, i.e., the compression level for each of the sound signal sections exceeds a predetermined threshold compression value, the compression level judging step S122 goes to the first frequency components compressing step S108. When the answer in compression level judging step S122 is in the negative “NO”, i.e., the compression level for each of the sound signal sections does not exceed a predetermined threshold compression value, the compression level judging step S122 goes to the second frequency components compressing step S109.
Each of the frequency components for each of the sound signal sections is then compressed with two different information consisting of a first and second signal based on the second sound signal sections analyzing means S105 in the first sound signal sections compressing step S108. The above first signal is indicative of the intensity signal jointed with each of the frequency components for each of the sound signal sections 1 to 3 and frequency components the sound signal sections 5 to 7. The above second signal is indicative of ratio of one of the frequency components for each of the sound signal sections 1 to 3 and frequency components the sound signal sections 5 to 7.
Each of the frequency components for each of the sound signal sections is then compressed with two different information consisting of a first and second signal based on the second sound signal sections analyzing means S105 in the first sound signal sections compressing step S109. The above first signal is indicative of the addition f11+f21 of each of the frequency components the first channel signal CH1 and each of the frequency components for the second channel signal CH2, while the above second signal being indicative of the difference f11−f21 between each of the frequency components for the first channel signal CH1 and each of the frequency components for the second channel signal CH2 based on the sound signal sections 1 to 3 and the sound signal sections 5 to 7.
Each of the frequency components for each of the sound signal sections is then quantized at the quantization bit numbers based on the second sound signal sections analyzing step S105 in the frequency components quantizing step S109 under two different states consisting of a first state in which the frequency components for each of the sound signal sections are compressed in the first frequency components compressing step S108 when the compression level judging step S122 is of judging that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value and a second state in which the frequency components for each of the sound signal sections are not compressed in the first frequency components compressing step S108 when the compression level judging step S122 is of judging that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value.
The quantized frequency components for each of the sound signal sections is then encoded to a multiplexed bit stream in the frequency components encoding step S111 under two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed in the first frequency components compressing step S108 when the compression level judging step S122 is of judging that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value and the second state in which the frequency components for each of the sound signal sections are compressed in the first frequency components compressing step S108 when the compression level judging step S122 is of judging that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value. The multiplexed bit stream is constituted by the sound signals for each of the sound signal sections and general information needed for the sound signals to be encoded and decoded.
The following description will now be directed to the calculation of the compression level of the first embodiment of the sound signal encoding apparatus 100 according to the present invention with reference to
The compression level calculating step S121 is of calculating the compression level for each of the sound signal sections with the compression ratio of the selected sampling rate to the bit rate at which the multiplexed bit stream is outputted in the frequency components encoding step S111. The calculation of the compression level for each of the sound signal sections is performed by the following equation (1) for calculating the compression level for each of the sound signal sections with the compression ratio of the selected sampling rate to the bit rate.
compression level=quantization bit numbers×sampling rate×channel number/multiplexed bit rate (1)
wherein the sampling rate is a value selected in the sampling rate selecting step S103, the multiplexed bit rate is a bit rate of the encoded sound signals outputted in the frequency components encoding step S111, the quantization bit numbers and the channel number are each a fixed value, for example, 16 bits, 2 channels respectively. The compression level for each of the sound signal sections CH1 and CH2 is calculated without the consideration of the channel numbers and the quantization bit numbers of the sound signals, as will be seen from the following equation.
compression level=sampling rate/multiplexed bit rate (2)
When the ratio of the selected sampling rate to the multiplexed bit rate of the encoded sound signal outputted by the frequency components encoding step S111 is larger than the predetermined threshold compression value to allow the flag of the first sound signal sections compressing step S109 to be set, the flag of the first sound signal sections compressing step S109 is set to allow the first sound signal sections compressing S109 to start. When the ratio of the sampling rate to the multiplexed bit rate of the sound signals, on the other hand, is smaller than the predetermined compression value to inhibit the flag of the first sound signal sections compressing step S109 to be reset, the flag of the first sound signal sections compressing step S109 is set to inhibit the first sound signal sections compressing step S109 to start.
From the above detailed description, it will be understood that the sound signal can be encoded at a relatively high quality under two different states consisting of first and second states. In the above first state, the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means 108 when the compression level judging means 122 is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value. In the above second state, the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means 108 when the compression level judging means 122 is operative to judge that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value. The compression of the sound signal performed by the sound signal encoding apparatus thus constructed makes it possible to encode the sound signal at a relatively high quality with the inhibition of the first compression calculating process in the event that the compression rate of the sound signal is larger than the threshold compression value.
Although there has been described in the above about the first embodiment of the sound signal encoding apparatus according to the present invention, this embodiment may be replaced by the second to sixth embodiments of the sound signal encoding apparatus according to the present invention in order to attain the objects of the present invention. The second to sixth embodiments of the sound signal encoding apparatus will then be described hereinafter.
Referring then to
The following description will be directed to the constitution elements and the steps of the second embodiment of the sound signal encoding apparatus 200 different from those of the first embodiment of the sound signal encoding apparatus.
In addition to sound signals dividing means 101, first sound signal sections analyzing means 102, sampling rate selecting means 103, sampling rate selecting means 104, second sound signal sections analyzing means 105, frequency components calculating means 106, quantization bit numbers allocating means 107, first frequency components compressing means 108, and second frequency components compressing means 109, frequency components quantizing means 110, and frequency components encoding means 111, the second embodiment of the sound signal encoding apparatus according to the present invention is shown in
The operation of the second embodiment of the sound signal encoding apparatus according to the present invention is carried out through a load condition judging process shown in
The compression level calculating step S221 is shown in
The compression level for each of the sound signal sections CH1 and CH2 is calculated without the consideration of the channel numbers and the quantization bit numbers of the sound signals by the equation (2) appearing for describing the first embodiment of the sound signal encoding apparatus.
When the ratio of the selected sampling rate to the multiplexed bit rate of the encoded sound signal outputted by the frequency components encoding step S111 is larger than the predetermined threshold compression value to allow the flag for the second frequency components compressing step S109 to be set, the flag for the second frequency components compressing step S109 is set to allow the second sound signal sections compressing S109 to start. When the ratio of the sampling rate to the multiplexed bit rate of the sound signals, on the other hand, is smaller than the predetermined compression value to inhibit the flag for the second sound signal sections compressing step S109 to be reset, the flag for the second sound signal sections compressing step S109 is set to inhibit the second sound signal sections compressing step S109 to start.
From the above detailed description, it will be understood that the sound signal can be encoded at a relatively high quality under two different states consisting of first and second states. In the first state, the frequency components for each of the sound signal sections are compressed by the second frequency components compressing means 109 when the compression level judging means 222 is operative to judge that the compression level for each of the sound signal sections exceeds the predetermined threshold compression value. In the second state, the frequency components for each of the sound signal sections are not compressed by the second frequency components compressing means 109 when the compression level judging means 222 is operative to judge that the compression level for each of the sound signal sections does not exceed the predetermined threshold compression value. The compression of the sound signal performed by the sound signal encoding apparatus thus constructed makes it possible to encode the sound signal at a relatively high quality.
The following description will now be directed to the constitution elements and the steps of the third embodiment of the sound signal encoding apparatus 300 different from those of the first and second embodiments of the sound signal encoding apparatus.
In addition to sound signals dividing means 101, first sound signal sections analyzing means 102, sampling rate selecting means 103, sampling rate selecting means 104, second sound signal sections analyzing means 105, frequency components calculating means 106, quantization bit numbers allocating means 107, first frequency components compressing means 108, and second frequency components compressing means 109, frequency components quantizing means 110, and frequency components encoding means 111, the third embodiment of the sound signal encoding apparatus according to the present invention is shown in
The operation of the third embodiment of the sound signal encoding apparatus according to the present invention is carried out through a load condition judging process shown in
When the ratio of the selected sampling rate to the multiplexed bit rate of the encoded sound signal outputted by the frequency components encoding step S111 is larger than the predetermined threshold compression value to allow the flag for the first frequency components compressing step S108 to be set, the flag for the first frequency components compressing step S108 is set to allow the first sound signal sections compressing S108 to start. When the ratio of the sampling rate to the multiplexed bit rate of the sound signals, on the other hand, is smaller than the predetermined compression value to inhibit the flag for the first sound signal sections compressing step S108 to be reset, the flag for the first sound signal sections compressing step S108 is set to inhibit the first sound signal sections compressing step S108 to start.
From the above detailed description, it will be understood that the sound signal can be encoded at a relatively high quality under two different states consisting of first and second states. In the first state, the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means 108 when the energy ratio judging means 324 is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value. In the second state, the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means 108 when the energy ratio judging means 324 is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value. The compression of the sound signal performed by the sound signal encoding apparatus thus constructed makes it possible to encode the sound signal at a relatively high quality.
The following description will be directed to the constitution elements and the steps of the fourth embodiment of the sound signal encoding apparatus 400 different from those of the first to third embodiments of the sound signal encoding apparatus.
In addition to sound signals dividing means 101, first sound signal sections analyzing means 102, sampling rate selecting means 103, sampling rate selecting means 104, second sound signal sections analyzing means 105, frequency components calculating means 106, quantization bit numbers allocating means 107, first frequency components compressing means 108, and second frequency components compressing means 109, frequency components quantizing means 110, and frequency components encoding means 111, the fourth embodiment of the sound signal encoding apparatus according to the present invention is shown in
The operation of the third embodiment of the sound signal encoding apparatus according to the present invention is carried out through a load condition judging process shown in
When the ratio of the selected sampling rate to the multiplexed bit rate of the encoded sound signal outputted by the frequency components encoding step S111 is larger than the predetermined threshold compression value to allow the flag for the second frequency components compressing step S109 to be set, the flag for the second frequency components compressing step S109 is set to allow the second sound signal sections compressing S109 to start. When the ratio of the sampling rate to the multiplexed bit rate of the sound signals, on the other hand, is smaller than the predetermined compression value to inhibit the flag for the second sound signal sections compressing step S109 to be reset, the flag for the second sound signal sections compressing step S109 is set to inhibit the second sound signal sections compressing step S109 to start.
From the above detailed description, it will be understood that the sound signal can be encoded at a relatively high quality under two different states consisting of first and second states. In the first state, the frequency components for each of the sound signal sections are compressed by the second frequency components compressing means 109 when the energy ratio judging means 424 is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value. In the second state, the frequency components for each of the sound signal sections are not compressed by the second frequency components compressing means 109 when the energy ratio judging means 424 is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value. The compression of the sound signal performed by the sound signal encoding apparatus thus constructed makes it possible to encode the sound signal at a relatively high quality.
The following description will be directed to the constitution elements and the steps of the fifth embodiment of the sound signal encoding apparatus 500 different from those of the first to fourth embodiments of the sound signal encoding apparatus.
In addition to sound signals dividing means 101, first sound signal sections analyzing means 102, sampling rate selecting means 103, sampling rate selecting means 104, second sound signal sections analyzing means 105, frequency components calculating means 106, quantization bit numbers allocating means 107, first frequency components compressing means 108, and second frequency components compressing means 109, frequency components quantizing means 110, and frequency components encoding means 111, the fifth embodiment of the sound signal encoding apparatus according to the present invention is shown in
The energy ratio calculating means 523 is operative to calculate five different information consisting of first to fifth signals. The above first signal is indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal The above second signal is indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal. The above third signal is indicative of the energy level with the above first signal; the above fourth signal is indicative of the energy level with the above second signal. The above fifth signal is indicative of the energy ratio of the above third signal to the above fourth signal. The energy ratio judging means 526 is operative to judge whether or not the energy ratio for each of the sound signal sections exceeds a predetermined threshold energy value. The energy ratio judging means 526 has a flag with two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means 108 when the energy ratio judging means 526 is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value and the second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means 109 when the energy ratio judging means 526 is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value.
The energy ratio judging means 526 has a flag with two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means 108 when the energy ratio judging means 526 is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value and the second state in which the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means 109 when the energy ratio judging means 526 is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value.
The operation of the fifth embodiment of the sound signal encoding apparatus according to the present invention is carried out through a load condition judging process shown in
The compression level calculating step S521 is shown in
The compression level for each of the sound signal sections CH1 and CH2 is calculated without the consideration of the channel numbers and the quantization bit numbers of the sound signals by the equation (2) appearing for describing the first embodiment of the sound signal encoding apparatus.
The threshold energy value selecting step S525 is of selecting one arbitrary threshold energy value for each of said sound signal sections from among predetermined threshold energy values based on said compression level calculated in said compression level calculating step S521. The energy ratio calculating step S523 is of calculating five different information consisting of first to fifth signals. The above first signal is indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal. The above second signal is indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal. The above third signal is indicative of the energy level with the above first signal. The above fourth signal is indicative of the energy level with the above second signal. The above fifth signal is indicative of the energy ratio of the above third signal to the above fourth signal. The energy ratio judging step S526 is of judging whether or not the energy ratio for each of the sound signal sections exceeds a predetermined threshold energy value.
When the ratio of the selected sampling rate to the multiplexed bit rate of the encoded sound signal outputted by the frequency components encoding step S111 is larger than the predetermined threshold compression value to allow the flag for the first frequency components compressing step S108 to be set, the flag for the first frequency components compressing step S108 is set to allow the first sound signal sections compressing S108 to start. When the ratio of the sampling rate to the multiplexed bit rate of the sound signals, on the other hand, is smaller than the predetermined compression value to inhibit the flag for the first sound signal sections compressing step S108 to be reset, the flag for the first sound signal sections compressing step S108 is set to inhibit the first sound signal sections compressing step S108 to start.
From the above detailed description, it will be understood that the sound signal can be encoded at a relatively high quality under two different states consisting of first and second states. In the first state, the frequency components for each of the sound signal sections are compressed by the first frequency components compressing means 108 when the energy ratio judging means 124a is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value. In the second state, the frequency components for each of the sound signal sections are not compressed by the first frequency components compressing means 108 when the energy ratio judging means 124a is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value. The compression of the sound signal performed by the sound signal encoding apparatus thus constructed makes it possible to encode the sound signal at a relatively high quality.
The following description will be directed to the constitution elements and the steps of the sixth embodiment of the sound signal encoding apparatus 600 different from those of the first to fifth embodiments of the sound signal encoding apparatus.
In addition to sound signals dividing means 101, first sound signal sections analyzing means 102, sampling rate selecting means 103, sampling rate selecting means 104, second sound signal sections analyzing means 105, frequency components calculating means 106, quantization bit numbers allocating means 107, first frequency components compressing means 108, and second frequency components compressing means 109, frequency components quantizing means 110, and frequency components encoding means 111, the sixth embodiment of the sound signal encoding apparatus according to the present invention is shown in
The energy ratio calculating means 623 is operative to calculate five different information consisting of first to fifth signals. The above first signal is indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal. The above second signal is indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal. The above third signal is indicative of the energy level with the above first signal. The above fourth signal is indicative of the energy level with the above second signal. The above fifth signal is indicative of the energy ratio of the above third signal to the above fourth signal. The energy ratio judging means 626 is operative to judge whether or not the energy ratio for each of the sound signal sections exceeds a predetermined threshold energy value.
The energy ratio judging means 626 has a flag with two different states consisting of the first state in which the frequency components for each of the sound signal sections are compressed by the second frequency components compressing means 109 when the energy ratio judging means 626 is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value and the second state in which the frequency components for each of the sound signal sections are not compressed by the second frequency components compressing means 108 when the energy ratio judging means 626 is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value.
The compression level calculating step S621 is shown in
The compression level for each of the sound signal sections CH1 and CH2 is calculated without the consideration of the channel numbers and the quantization bit numbers of the sound signals by the equation (2) appearing for describing the first embodiment of the sound signal encoding apparatus.
The operation of the sixth embodiment of the sound signal encoding apparatus according to the present invention is carried out through a load condition judging process shown in
The compression level for each of the sound signal sections CH1 and CH2 is calculated without the consideration of the channel numbers and the quantization bit numbers of the sound signals by the equation (2) appearing for describing the first embodiment of the sound signal encoding apparatus.
The threshold energy value selecting step S625 is of selecting one arbitrary threshold energy value for each of said sound signal sections from among predetermined threshold energy values based on said compression level calculated in said compression level calculating step S621. The energy ratio calculating step S623 is of calculating five different information consisting of first to fifth signals. The above first signal is indicative of the addition of each of the frequency components for the first channel signal and each of the frequency components for the second channel signal. The above second signal is indicative of the difference between each of the frequency components for the first channel signal and each of the frequency components for the second channel signal. The above third signal is indicative of the energy level with the above first signal. The above fourth signal is indicative of the energy level with the above second signal. The above fifth signal is indicative of the energy ratio of the above third signal to the above fourth signal. The energy ratio judging step S626 is of judging whether or not the energy ratio for each of the sound signal sections exceeds a predetermined threshold energy value.
When the ratio of the selected sampling rate to the multiplexed bit rate of the encoded sound signal outputted by the frequency components encoding step S111 is larger than the predetermined threshold compression value to allow the flag for the second frequency components compressing step S109 to be set, the flag for the second frequency components compressing step S109 is set to allow the second sound signal sections compressing S109 to start. When the ratio of the sampling rate to the multiplexed bit rate of the sound signals, on the other hand, is smaller than the predetermined compression value to inhibit the flag for the second sound signal sections compressing step S109 to be reset, the flag for the second sound signal sections compressing step S109 is set to inhibit the second sound signal sections compressing step S109 to start.
From the above detailed description, it will be understood that the sound signal can be encoded at a relatively high quality under two different states consisting of first and second states. In the first state, the frequency components for each of the sound signal sections are compressed by the second frequency components compressing means 109 when the energy ratio judging means 624 is operative to judge that the energy ratio for each of the sound signal sections exceeds the predetermined threshold energy value. In the second state, the frequency components for each of the sound signal sections are not compressed by the second frequency components compressing means 109 when the energy ratio judging means 124b is operative to judge that the energy ratio for each of the sound signal sections does not exceed the predetermined threshold energy value. The compression of the sound signal performed by the sound signal encoding apparatus thus constructed makes it possible to encode the sound signal at a relatively high quality.
While the subject invention has been described with relation to the preferred embodiments, various modifications and adaptations thereof will now be apparent to those skilled in the art as far as such modifications and adaptations fall within the scope of the appended claims intended to be covered thereby.
Number | Date | Country | Kind |
---|---|---|---|
2000-358070 | Nov 2000 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
5469474 | Kitabatake | Nov 1995 | A |
5481614 | Johnston | Jan 1996 | A |
5684922 | Miyakawa et al. | Nov 1997 | A |
5736943 | Herre et al. | Apr 1998 | A |
6456968 | Taniguchi et al. | Sep 2002 | B1 |
Number | Date | Country | |
---|---|---|---|
20020064284 A1 | May 2002 | US |