This application claims priority from Korean Patent Application No. 10-2007-0027271, filed on Mar. 20, 2007 in the Korean Intellectual Property Office, the disclosure of which is incorporated herein in its entirety by reference.
1. Field of the Invention
Apparatuses and methods consistent with the present invention relate to encoding and decoding of an audio signal, and more particularly, to encoding and decoding in which the audio signal is divided into a plurality of bands and an effective coding method is applied to each band.
2. Description of the Related Art
Methods of encoding an audio signal can be classified into parametric coding methods and time-frequency coding methods. The parametric coding method has high encoding efficiency when the bit rate of data is low; in other words, its encoding efficiency decreases as the bit rate increases. The time-frequency coding method is more effective than the parametric coding method when the sound quality of the audio signal is high, that is, when the bit rate is high. However, the time-frequency coding method is ineffective when the bit rate is low, since information on all frequency indices must be transmitted.
Thus, in order to improve the encoding efficiency, the related art approach of applying only one of the parametric coding method and the time-frequency coding method needs to be improved.
Exemplary embodiments of the present invention overcome the above disadvantages and other disadvantages not described above. Also, the present invention is not required to overcome the disadvantages described above, and an exemplary embodiment of the present invention may not overcome any of the problems described above.
The present invention provides a method and apparatus for encoding an audio signal, in which the audio signal is divided into a plurality of bands and an efficient coding method is applied for each of the bands, and a computer readable recording medium having recorded thereon a program for executing the above described method.
The present invention also provides a method and apparatus for decoding an audio signal, in which a bit stream generated by the encoding method is decoded for each band, and a computer readable recording medium having recorded thereon a program for executing the above described decoding method.
According to an aspect of the present invention, there is provided a method of encoding an audio signal, the method comprising: dividing an input audio signal into a plurality of audio bands; selecting a coding method for each of the audio bands; encoding audio data included in each of the audio bands according to the selected coding method for each of the bands; and generating a bit stream including all the encoded audio data for each of the audio bands, wherein the selecting of the coding method comprises selecting a coding method providing smaller encoded data from among a parametric coding method and a time-frequency coding method.
The selecting of the coding method for each audio band may include: calculating a number of sinusoidal signals included in a corresponding audio band; selecting the time-frequency coding method when the number of sinusoidal signals is equal to or greater than a predetermined value; and selecting the parametric coding method when the number of sinusoidal signals is smaller than the predetermined value.
According to another aspect of the present invention, there is provided an apparatus for encoding an audio signal including: a band divider which divides an input audio signal into a plurality of audio bands; a coding method selector which selects a coding method for each of the audio bands; an audio encoder which encodes audio data included in each of the audio bands according to the selected coding method for each of the bands; and a bit stream generator which generates a bit stream including all the encoded audio data for each of the audio bands, wherein the coding method selector selects a coding method providing smaller encoded data from among a parametric coding method and a time-frequency coding method.
The coding method selector may select the time-frequency coding method when the number of sinusoidal signals included in an audio band is equal to or greater than a predetermined value, and may select the parametric coding method when the number of sinusoidal signals is smaller than the predetermined value.
In the method and apparatus for encoding the audio signal, the parametric coding method may be a Sinusoidal Coding (SSC) method and the time-frequency coding method may be an Advanced Audio Coding (AAC) method.
According to another aspect of the present invention, there is provided a method of encoding an audio signal including: dividing an input audio signal into a plurality of audio bands; encoding audio data included in each of the audio bands by applying a parametric coding method and a time-frequency coding method respectively; selecting, for each of the audio bands, the smaller encoded audio data from among the audio data encoded using the parametric coding method and the audio data encoded using the time-frequency coding method; and generating a bit stream including all the encoded audio data selected for each of the audio bands.
According to another aspect of the present invention, there is provided a method of decoding an audio signal including: dividing an input bit stream into audio data encoded for a plurality of audio bands; extracting information on a coding method used by an encoding apparatus for encoding the audio data, for each of the audio bands; decoding the encoded audio data for each of the audio bands, according to the coding method on the basis of the extracted information; and generating the audio signal by combining the decoded audio data for the respective audio bands, wherein the coding method is a coding method providing smaller encoded data that is selected from among a parametric coding method and a time-frequency coding method.
According to another aspect of the present invention, there is provided an apparatus for decoding an audio signal including: a bit stream divider which divides an input bit stream into audio data encoded for a plurality of audio bands; a coding method extractor which extracts information on a coding method used by an encoding apparatus for encoding the audio data, for each of the audio bands; an audio decoder which decodes the encoded audio data for each of the audio bands, according to the coding method on the basis of the extracted information; and an audio signal generator which generates the audio signal by combining the decoded audio data for each of the respective audio bands, wherein the coding method is a coding method providing smaller encoded data that is selected from among a parametric coding method and a time-frequency coding method.
In the methods and apparatuses for decoding the audio signal, the time-frequency coding method may be selected as the coding method when the number of sinusoidal signals included in the corresponding audio band is equal to or greater than a predetermined value, and the parametric coding method may be selected when the number of sinusoidal signals is smaller than the predetermined value.
In the decoding method and apparatus, the parametric coding method may be an SSC method and the time-frequency method may be an AAC method.
The above and other aspects of the present invention will become more apparent by describing in detail exemplary embodiments thereof with reference to the attached drawings in which:
Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the appended drawings.
Referring to
Referring to
The coding method selector 120 selects a coding method for each audio band (S110). The coding method selector 120 selects, from among a parametric coding method and a time-frequency coding method, the more effective encoding method for a corresponding band. Here, the more effective encoding method is the one that produces smaller encoded data than the other method.
A method of selecting the coding method according to an exemplary embodiment of the present invention will now be described. First, the number of sinusoidal signals included in the audio band for which a coding method is to be selected is calculated. When the calculated number of sinusoidal signals is equal to or greater than a predetermined value, the time-frequency coding method is selected. When the calculated number of sinusoidal signals is smaller than the predetermined value, the parametric coding method is selected. This coding method selecting method will be explained in more detail with reference to
The audio encoder 130 encodes each audio band according to the coding method selected for that audio band (S120).
When the parametric coding method is selected for a corresponding audio band, an audio signal included in the corresponding audio band is encoded by using the parametric coding method. An SSC method may be an example of the parametric coding method.
When the time-frequency coding method is selected for the corresponding audio band, an audio signal included in the corresponding audio band is encoded by using the time-frequency coding method. The time-frequency coding method denotes a coding method which converts data in the time domain into values in the frequency domain. An AAC method may be an example of the time-frequency coding method.
The bit stream generator 140 generates a bit stream 2 which includes all of the encoded data for each audio band (S130).
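The flow of operations S110 through S130 can be illustrated by the following minimal sketch. It is not the implementation of the exemplary embodiment: the function names, the one-byte method flag, the two-byte length field, and the caller-supplied encoder callables are all assumptions introduced for illustration, and the number of sinusoidal signals per band is taken as an input because the manner of calculating it is not specified here.

```python
import struct
from typing import Callable, Dict, Sequence

# Hypothetical method identifiers (not defined in the specification).
PARAMETRIC, TIME_FREQUENCY = 0, 1

Band = Sequence[float]
Encoder = Callable[[Band], bytes]


def select_coding_method(num_sinusoids: int, threshold: int) -> int:
    """S110: time-frequency at or above the threshold, parametric below it."""
    return TIME_FREQUENCY if num_sinusoids >= threshold else PARAMETRIC


def encode_bands(
    bands: Sequence[Band],
    sinusoid_counts: Sequence[int],
    threshold: int,
    encoders: Dict[int, Encoder],
) -> bytes:
    """Encode each band with its selected method and concatenate the results.

    Assumed layout per band: 1-byte method flag, 2-byte big-endian payload
    length, then the payload produced by the selected encoder.
    """
    stream = bytearray()
    for band, count in zip(bands, sinusoid_counts):
        method = select_coding_method(count, threshold)      # S110
        payload = encoders[method](band)                     # S120
        stream += struct.pack(">BH", method, len(payload))   # S130
        stream += payload
    return bytes(stream)
```

In this sketch the method flag is written in front of each band's payload so that a decoder can recover, for each band, which coding method was used.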
Referring to
Referring to
The coding method extractor 220 extracts information on the coding method for each of the audio bands (S210). The coding method is a method used for encoding audio data of the corresponding audio band in an encoding apparatus. As described above, the encoding apparatus selects a method that provides smaller encoded data from among the parametric coding method and the time-frequency coding method, for each audio band. As explained above, according to an exemplary embodiment of the present invention, the encoding apparatus calculates the number of sinusoidal signals included in an audio band to select a coding method, and selects the time-frequency coding method when the calculated number of sinusoidal signals is equal to or greater than a predetermined value or selects the parametric coding method when the calculated number of sinusoidal signals is smaller than the predetermined value.
The audio decoder 230 decodes the encoded audio data for each audio band according to the coding method indicated by the extracted information (S220).
When the information on a coding method for the corresponding audio band indicates the parametric coding method, encoded audio data for the corresponding audio band is decoded by using the parametric coding method. The SSC method is an example of the parametric coding method.
When the information on a coding method for the corresponding audio band indicates the time-frequency coding method, encoded audio data for the corresponding audio band is decoded by using the time-frequency coding method. The AAC method is an example of the time-frequency coding method.
The audio signal generator 240 generates an output audio signal 12 by combining audio data decoded for each audio band (S230).
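The decoding operations S210 through S230 can be illustrated by the following minimal sketch. The bit stream layout (a one-byte method flag and a two-byte length field before each band's payload), the decoder callables, and the combination of bands by sample-wise summation of equal-length signals are assumptions made for illustration only; the specification states only that the decoded audio data for the bands is combined.

```python
import struct
from typing import Callable, Dict, List

# Hypothetical method identifiers (not defined in the specification).
PARAMETRIC, TIME_FREQUENCY = 0, 1

Decoder = Callable[[bytes], List[float]]


def decode_stream(stream: bytes, decoders: Dict[int, Decoder]) -> List[List[float]]:
    """Split the stream into per-band records and decode each one."""
    bands: List[List[float]] = []
    offset = 0
    while offset < len(stream):
        # S210: read the coding method information for this band.
        method, length = struct.unpack_from(">BH", stream, offset)
        offset += 3
        payload = stream[offset:offset + length]
        offset += length
        # S220: decode with the method the encoder used for this band.
        bands.append(decoders[method](payload))
    return bands


def combine_bands(bands: List[List[float]]) -> List[float]:
    """S230 (assumed): sum equal-length time-domain band signals per sample."""
    return [sum(samples) for samples in zip(*bands)]
```

A call such as `combine_bands(decode_stream(stream, decoders))` would then yield the output audio signal under these assumptions.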
A selection of the coding method according to the number of sinusoidal signals will now be explained in detail, with reference to
In the time-frequency coding method, a fundamental frequency is set and amplitude values and phase values of all frequencies which are multiples of the fundamental frequency are extracted and encoded. Accordingly, the size of the encoded data stays the same since information on the same number of frequencies is encoded regardless of the number of sinusoidal signals included in the audio signal, as indicated by a horizontal line 30 parallel to the X-axis.
Meanwhile, in the parametric coding method, information on a frequency, an amplitude, and a phase value for each sinusoidal signal is encoded. Accordingly, as the number of sinusoidal signals increases, the size of encoded data increases, as indicated by a straight line 32 heading towards the top right hand side in
Accordingly, as shown in
There are various ways to determine the value N.
The value N is the number of sinusoidal signals where the size of the data encoded by using the parametric coding method and the size of data encoded by using the time-frequency coding method are the same. Accordingly, the number of frequencies used in the time-frequency coding method, namely, the number of frequency indices, may be selected as the value N. The value N will be slightly less than the number of frequency indices, since information on a frequency is not encoded in the time-frequency coding method.
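One way of determining the value N can be illustrated by the following minimal sketch, which assumes that the time-frequency coding method spends a fixed number of bits per frequency index and that the parametric coding method spends a fixed number of bits per sinusoidal signal. The function name, the parameter names, and the bit costs in the usage example are hypothetical and are not taken from the specification.

```python
def crossover_threshold(
    num_freq_indices: int,
    bits_per_tf_index: int,
    bits_per_sinusoid: int,
) -> int:
    """Return the smallest sinusoid count N at which parametric data is
    at least as large as time-frequency data.

    Time-frequency size is constant: num_freq_indices * bits_per_tf_index.
    Parametric size grows linearly: n * bits_per_sinusoid.
    """
    tf_size = num_freq_indices * bits_per_tf_index
    # Ceiling division without floating point.
    return -(-tf_size // bits_per_sinusoid)


# Usage with assumed costs: amplitude and phase at 8 bits each per index,
# and frequency, amplitude, and phase at 8 bits each per sinusoid.
n = crossover_threshold(num_freq_indices=1024,
                        bits_per_tf_index=16,
                        bits_per_sinusoid=24)
```

Because the parametric coding method additionally encodes a frequency for each sinusoidal signal, its per-sinusoid cost exceeds the per-index cost of the time-frequency coding method, so the value N obtained this way is less than the number of frequency indices.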
Alternatively, instead of determining the value N in advance, both the parametric coding method and the time-frequency coding method may be applied to a corresponding audio band, and the smaller of the two resulting pieces of encoded data may be selected.
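This alternative can be illustrated by the following minimal sketch, in which a band is encoded by both methods and the smaller result is kept. The function name and the encoder callables are hypothetical, and resolving a tie in favor of the time-frequency coding method is an assumption chosen to mirror the "equal to or greater than" rule described above.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical method identifiers (not defined in the specification).
PARAMETRIC, TIME_FREQUENCY = 0, 1

Band = Sequence[float]
Encoder = Callable[[Band], bytes]


def encode_with_smaller_output(
    band: Band,
    parametric_encode: Encoder,
    time_frequency_encode: Encoder,
) -> Tuple[int, bytes]:
    """Encode the band both ways and return (method, payload) for the smaller."""
    parametric_data = parametric_encode(band)
    time_frequency_data = time_frequency_encode(band)
    if len(parametric_data) < len(time_frequency_data):
        return PARAMETRIC, parametric_data
    # Tie or smaller time-frequency output (tie handling is an assumption).
    return TIME_FREQUENCY, time_frequency_data
```

This approach requires encoding each band twice, but does not depend on a predetermined value N.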
The invention can also be embodied as computer readable codes on a computer readable recording medium, where the computer includes all devices having data processing functions. The computer readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices.
As described above, in the methods and apparatuses for encoding an audio signal, and the methods and apparatuses for decoding an audio signal according to exemplary embodiments of the present invention, the audio signal is divided into a plurality of bands and, for each band, the coding method that yields the smaller encoded data is selected, so that encoding is more efficient than when one coding method is applied to the entire audio data. In other words, the exemplary embodiments of the present invention provide a method in which the time-frequency coding method and the parametric coding method are mixed and used according to each audio band.
While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the present invention as defined by the following claims.
Number | Date | Country | Kind |
---|---|---|---|
10-2007-0027271 | Mar 2007 | KR | national |