Method and apparatus for encoding and decoding stereo signal and multi-channel signal

Information

  • Patent Grant
  • 8625811
  • Patent Number
    8,625,811
  • Date Filed
    Monday, February 6, 2012
    12 years ago
  • Date Issued
    Tuesday, January 7, 2014
    10 years ago
Abstract
Provided are a method and apparatus for encoding and decoding a stereo signal or a multi-channel signal. According to the method and apparatus, a stereo signal or a multi-channel signal can be encoded and/or decoded by generating parameters based on a mono signal.
Description
BACKGROUND OF THE INVENTION

1. Field of the Invention


The present invention relates to encoding and decoding of a stereo signal and a multi-channel signal, and more particularly, to a method and apparatus for encoding and decoding a stereo signal or a multi-channel signal by using a parameter generated based on a mono signal.


2. Description of the Related Art


Conventionally, a stereo signal and a multi-channel signal are generally encoded by encoding information related to the differences between these signals for each channel. For example, the differences between the intensities, coherences, and phases of signals for each channel are extracted and then information related to the differences is encoded. A decoding terminal receives the encoded information, and decodes it into the stereo signal and the multi-channel signal by using the related information.


However, there is a need to encode or decode a stereo signal and a multi-channel signal, based on the differences between the stereo signal and a mono signal and between the multi-channel signal and the mono signal.


SUMMARY OF THE INVENTION

The present invention provides a method and apparatus for encoding or decoding a stereo signal or a multi-channel signal by generating parameters based on a mono signal.


Additional aspects and utilities of the present general inventive concept will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the general inventive concept.


According to an aspect of the present invention, there is provided a method of encoding a stereo signal, comprising: encoding the stereo signal by downmixing the stereo signal to a mono signal; generating and encoding a parameter that represents a ratio of the amplitude of at least one of signals contained in the stereo signal to the size of the mono signal; and generating and encoding a parameter that represents the difference between phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided a method of transmitting parameters, comprising: transmitting a parameter that represents a ratio of the amplitude of at least one of signals contained in a stereo signal to the amplitude of a mono signal; and transmitting a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided a method of decoding a stereo signal, comprising: decoding a parameter that represents a ratio of the amplitude of at least one of signals contained in the stereo signal to the amplitude of a mono signal; decoding a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal; and upmixing the mono signal to the stereo signal by using the decoded parameters.


According to another aspect of the present invention, there is provided a method of receiving parameters, comprising: receiving a parameter that represents a ratio of the amplitude of at least one of signals contained in a stereo signal to the amplitude of a mono signal; and receiving a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided a computer readable medium having recorded thereon a method of encoding a stereo signal, comprising: encoding the stereo signal by downmixing the stereo signal to a mono signal; generating and encoding a parameter that represents a ratio of the amplitude of at least one of signals contained in the stereo signal to the size of the mono signal; and generating and encoding a parameter that represents the difference between phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided a computer readable medium having recorded thereon a method of transmitting parameters, comprising: transmitting a parameter that represents a ratio of the amplitude of at least one of signals contained in a stereo signal to the amplitude of a mono signal; and transmitting a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided a computer readable medium having recorded thereon a method of decoding a stereo signal, comprising: decoding a parameter that represents a ratio of the amplitude of at least one of signals contained in the stereo signal to the amplitude of a mono signal; decoding a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal; and upmixing the mono signal to the stereo signal by using the decoded parameters.


According to another aspect of the present invention, there is provided a computer readable medium having recorded thereon a method of receiving parameters, comprising: receiving a parameter that represents a ratio of the amplitude of at least one of signals contained in a stereo signal to the amplitude of a mono signal; and receiving a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided an apparatus for encoding a stereo signal, comprising: a signal encoding unit encoding the stereo signal by downmixing the stereo signal to a mono signal and encoding; a size encoding unit generating and encoding a parameter that represents a ratio of the amplitude of at least one of signals contained in the stereo signal to the amplitude of the mono signal; and a phase encoding unit generating and encoding a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided an apparatus for transmitting parameters, comprising: a size parameter transmission unit transmitting a parameter that represents a ratio of the amplitude of at least one of signals contained in a stereo signal to the amplitude of a mono signal; and a phase parameter transmission unit transmitting a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided a n apparatus for decoding a stereo signal, comprising: a size parameter decoding unit decoding a parameter that represents a ratio of the amplitude of at least one of signals contained in the stereo signal to the amplitude of a mono signal; a phase parameter decoding unit decoding a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal; and an upmixing unit upmixing the mono signal to the stereo signal by using the decoded parameters.


According to another aspect of the present invention, there is provided an apparatus for receiving parameters, comprising: a size parameter receiving unit receiving a parameter that represents a ratio of the amplitude of at least one of signals contained in a stereo signal to the amplitude of a mono signal; and a phase parameter receiving unit receiving a parameter that represents the difference between the phases of at least one of the signals contained in the stereo signal and the mono signal.


According to another aspect of the present invention, there is provided a method of encoding a multi-channel signal, comprising: encoding multi-channel signal containing two or more signals by downmixing the multi-channel signal to one or more signals; generating and encoding a parameter that represents a ratio of the amplitude of at least one of the signals of the multi-channel signal to the amplitude of at least one of the downmixed signals; and generating and encoding a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.


According to another aspect of the present invention, there is provided a method of transmitting parameters, comprising: transmitting a parameter that represents a ratio of the amplitude of at least one of signals contained in a multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises at least two signals; and transmitting a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.


According to another aspect of the present invention, there is provided a method of decoding a multi-channel signal, comprising: decoding a parameter that represents a ratio of the amplitude of at least one of the signals contained in the multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises at least two signals; decoding a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals; and upmixing the downmixed signals to the multi-channel signal by using the decoded parameters.


According to another aspect of the present invention, there is provided a method of receiving parameters, comprising: receiving a parameter that represents a ratio of the amplitude of at least one of signals contained in a multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises at least two signals; and receiving a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.


According to another aspect of the present invention, there is provided a computer readable medium having recorded thereon a method of encoding a multi-channel signal, comprising: encoding multi-channel signal containing two or more signals by downmixing the multi-channel signal to one or more signals; generating and encoding a parameter that represents a ratio of the amplitude of at least one of the signals of the multi-channel signal to the amplitude of at least one of the downmixed signals; and generating and encoding a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.


According to another aspect of the present invention, there is provided a computer readable medium having recorded thereon a method of transmitting parameters, comprising: transmitting a parameter that represents a ratio of the amplitude of at least one of signals contained in a multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises at least two signals; and transmitting a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.


According to another aspect of the present invention, there is provided a computer readable medium having recorded thereon a method of decoding a multi-channel signal, comprising: decoding a parameter that represents a ratio of the amplitude of at least one of the signals contained in the multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises at least two signals; decoding a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals; and upmixing the downmixed signals to the multi-channel signal by using the decoded parameters.


According to another aspect of the present invention, there is provided a computer readable medium having recorded thereon a method of receiving parameters, comprising: receiving a parameter that represents a ratio of the amplitude of at least one of signals contained in a multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises at least two signals; and receiving a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.


According to another aspect of the present invention, there is provided an apparatus for encoding a multi-channel signal, comprising: a signal encoding unit encoding the multi-channel signal by downmixing the multi-channel signal to one or more signals, wherein the multi-channel signal contains at least two signals; a size parameter encoding unit generating and encoding a parameter that represents a ratio of the amplitude of at least one of the signals of the multi-channel signal to the amplitude of at least one of the downmixed signals; and a phase parameter encoding unit generating and encoding a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.


According to another aspect of the present invention, there is provided an apparatus for transmitting parameters, comprising: a size parameter transmission unit transmitting a parameter that represents a ratio of the amplitude of at least one of signals contained in a multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises two or more signals; and a phase parameter transmitting a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.


According to another aspect of the present invention, there is provided an apparatus for decoding a multi-channel signal, comprising: a size parameter decoding unit decoding a parameter that represents a ratio of the amplitude of at least one of the signals contained in the multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises at least two signals; a phase parameter decoding unit decoding a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals; and an upmixing unit upmixing the downmixed signals to the multi-channel signal by using the decoded parameters.


According to another aspect of the present invention, there is provided a method of receiving parameters, comprising: a size parameter receiving unit receiving a parameter that represents a ratio of the amplitude of at least one of signals contained in a multi-channel signal to the amplitude of at least one of downmixed signals, where the multi-channel signal comprises at least two signals; and a phase parameter receiving unit receiving a parameter that represents the difference between phases of least one of the signals of the multi-channel signal and at least one of the downmixed signals.





BRIEF DESCRIPTION OF THE DRAWINGS

These and/or other aspects and utilities of the present general inventive concept will become apparent and more readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:



FIG. 1 is a block diagram of an apparatus for encoding a stereo signal, according to an embodiment of the present invention;



FIG. 2 is a block diagram of an apparatus for encoding a stereo signal, according to another embodiment of the present invention;



FIG. 3 is a block diagram of an apparatus for encoding a stereo signal, according to another embodiment of the present invention;



FIG. 4 is a block diagram of an apparatus for encoding a stereo signal, according to another embodiment of the present invention;



FIG. 5 is a block diagram of a parameter extraction unit included in an apparatus for encoding a stereo signal, according to an embodiment of the present invention;



FIG. 6 is a block diagram of a parameter extraction unit included in an apparatus for encoding a stereo signal, according to another embodiment of the present invention;



FIG. 7 is a block diagram of a size parameter extraction unit included in an apparatus for encoding a stereo signal, according to an embodiment of the present invention;



FIG. 8 is a block diagram of a phase parameter extraction unit included in an apparatus for encoding a stereo signal, according to an embodiment of the present invention;



FIG. 9 is a block diagram of an enhancement parameter extraction unit included in an apparatus for encoding a stereo signal, according to an embodiment of the present invention;



FIG. 10 is a block diagram of an apparatus for decoding a stereo signal, according to an embodiment of the present invention;



FIG. 11 is a block diagram of an apparatus for decoding a stereo signal, according to another embodiment of the present invention;



FIG. 12 is a block diagram of an apparatus for decoding a stereo signal, according to another embodiment of the present invention;



FIG. 13 is a block diagram of an apparatus for decoding a stereo signal, according to another embodiment of the present invention;



FIG. 14 is a block diagram of parameter decoding units included in an apparatus for decoding a stereo signal, according to an embodiment of the present invention;



FIG. 15 is a block diagram of parameter decoding units included in an apparatus for decoding a stereo signal, according to another embodiment of the present invention;



FIG. 16 is a block diagram illustrating in detail an up-mixing unit included in an apparatus for encoding a stereo signal, according to an embodiment of the present invention;



FIG. 17 is a block diagram illustrating in detail an up-mixing unit included in an apparatus for encoding a stereo signal, according to another embodiment of the present invention;



FIG. 18 is a graph illustrating a method of generating a left-channel signal and a right-channel signal from a mono signal by using a method and apparatus for decoding a stereo signal mono signal, according to an embodiment of the present invention;



FIG. 19 is a conceptual diagram illustrating a method of generating a left-channel signal and a right-channel signal from a mono signal by using a method and apparatus for decoding a stereo signal mono signal, according to an embodiment of the present invention;



FIG. 20 is a flowchart illustrating a method of encoding a stereo signal, according to an embodiment of the present invention;



FIG. 21 is a flowchart illustrating a method of encoding a stereo signal, according to another embodiment of the present invention;



FIG. 22 is a flowchart illustrating a method of encoding a stereo signal, according to another embodiment of the present invention;



FIG. 23 is a flowchart illustrating a method of encoding a stereo signal, according to another embodiment of the present invention;



FIG. 24 is a flowchart illustrating in detail operation 2030, 2140, 2250, or 2360 included in a method of encoding a stereo signal, according to an embodiment of the present invention;



FIG. 25 is a flowchart illustrating in detail operation 2030, 2140, 2250, or 2360 included in a method of encoding a stereo signal, according to another embodiment of the present invention;



FIG. 26 is a flowchart illustrating in detail operation 2400 illustrated in FIG. 24 or 25, according to an embodiment of the present invention;



FIG. 27 is a flowchart illustrating in detail operation 2420 illustrated in FIG. 25, according to an embodiment of the present invention;



FIG. 28 is a flowchart illustrating in detail operation 2420 illustrated in FIG. 25, according to another embodiment of the present invention;



FIG. 29 is a flowchart illustrating a method of decoding a stereo signal, according to an embodiment of the present invention;



FIG. 30 is a flowchart illustrating a method of decoding a stereo signal, according to another embodiment of the present invention;



FIG. 31 is a flowchart illustrating a method of decoding a stereo signal, according to another embodiment of the present invention;



FIG. 32 is a flowchart illustrating a method of decoding a stereo signal, according to another embodiment of the present invention;



FIG. 33 is a flowchart illustrating operation 2920, 3030, 3120 or 3230 included in a method of decoding a stereo signal, according to an embodiment of the present invention;



FIG. 34 is a flowchart illustrating operation 2920, 3030, 3120 or 3230 included in a method of decoding a stereo signal, according to another embodiment of the present invention;



FIG. 35 is a flowchart illustrating in detail operation 2930, 3040, 3130 or 3240 included in a method of decoding a stereo signal, according to an embodiment of the present invention;



FIG. 36 is a flowchart illustrating in detail operation 2930, 3040, 3130 or 3240 illustrated in FIG. 35 by using the graph illustrated in FIG. 18;



FIG. 37 is a flowchart illustrating operation 2930, 3040, 3130 or 3240 included in a method of decoding a stereo signal, according to another embodiment of the present invention; and



FIG. 38 is a flowchart illustrating in detail operation 2930, 3040, 3130 or 3240 illustrated in FIG. 37 by using the graph illustrated in FIG. 18.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

A method and apparatus for encoding and decoding a stereo signal and a multi-channel signal according to the present invention will now be described more fully with reference to the accompanying drawings, in which exemplary embodiments of the invention are shown.



FIG. 1 is a block diagram of an apparatus for encoding a stereo signal, according to an embodiment of the present invention. The apparatus includes a transformation unit 100, a down-mixing unit 110, a mono signal encoding unit 120, a parameter extraction unit 130, and a multiplexing unit 140.


The transformation unit 100 transforms a stereo signal received via an input terminal IN into a predetermined domain by using an analysis filterbank. Here, the predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed in the time domain as spectra for each of the sub bands at predetermined frequency units, such as in a time-frequency domain.


The down-mixing unit 110 downmixes the stereo signal that is transformed into the predetermined domain to a mono signal.


In this case, the amplitude of the downmixed mono signal may be equal to the average of the amplitudes of a left-channel signal and a right-channel signal. Also, the mono signal may be generated on a half-sum vector of the left-channel signal and the right-channel signal.


The mono signal encoding unit 120 encodes the mono signal obtained by downmixing by the down-mixing unit 110.


The parameter extraction unit 130 extracts parameters from the stereo signal and encodes the parameters, where a decoding terminal uses the parameters in upmixing the mono signal to the stereo signal. The parameters are information for generating the left-channel signal and the right-channel signal based on the mono signal.


The parameters include a size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter for enhancing information contained in the size parameter and the phase parameter, using a decorrelated signal that is a vertical vector component of the mono signal.


The parameters may be generated for each frame and in band units.


The multiplexing unit 140 multiplexes the parameters encoded by the parameter extraction unit 130 and the mono signal encoded by the mono signal encoding unit 120 into a bitstream, and transmits the bitstream to the decoding terminal via an output terminal OUT.



FIG. 2 is a block diagram of an apparatus for encoding a stereo signal, according to another embodiment of the present invention. The encoding apparatus includes a transformation unit 200, a down-mixing unit 210, an inverse transformation unit 220, a mono signal encoding unit 230, a parameter extraction unit 240, and a multiplexing unit 250.


The transformation unit 200 transforms a stereo signal received via an input terminal IN into a predetermined domain, using an analysis filterbank. Here, the predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed the time domain for each of the sub bands at predetermined frequency units, using a Quadrature Mirror Filterbank (QMF) and/or Lapped Orthogonal Transform (LOT).


The down-mixing unit 210 downmixes the stereo signal that is transformed into the predetermined domain to a mono signal.


In this case, the amplitude of the downmixed mono signal may be equal to the average of the amplitudes of a left-channel signal and a right-channel signal. Also, the mono signal may be generated on a half-sum vector of the left-channel signal and the right-channel signal.


The inverse transformation unit 220 inversely transforms the domain of the mono signal in the reverse manner of that which transformation unit 200 performs, using a synthesis filterbank. For example, the inverse transformation unit 220 performs inverse transformation such that the mono signal that is expressed in the time domain as spectra for each of the sub bands at predetermined frequency units is expressed as a time series only in the time domain.


The mono signal encoding unit 230 encodes the mono signal that is inversely transformed by the inverse transformation unit 220.


The parameter extraction unit 240 extracts parameters from the stereo signal and encodes them, where a decoding terminal uses the parameters in upmixing the mono signal to the stereo signal. The parameters are information for generating the left-channel signal and the right-channel signal based on the mono signal.


The parameters include a size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contain information for enhancing information contained in the size parameter and the phase parameter, using a decorrelated signal that is a vertical vector component of the mono signal.


The parameters may be generated for each frame and in band units.


The multiplexing unit 250 multiplexes the parameters encoded by the parameter extraction unit 240 and the mono signal encoded by the mono signal encoding unit 230 into a bitstream, and transmits the bitstream to the decoding terminal via an output terminal OUT.



FIG. 3 is a block diagram of an apparatus for encoding a stereo signal, according to another embodiment of the present invention. The encoding apparatus includes a transformation unit 300, a phase adjustment unit 310, an adjusted information encoding unit 320, a down-mixing unit 330, a mono signal encoding unit 340, a parameter extraction unit 350, and a multiplexing unit 360.


The transformation unit 300 transforms a stereo signal received via an input terminal IN into a predetermined domain by using an analysis filterbank. Here, the predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed as spectra in the time domain for each of the sub bands at predetermined frequency units.


If the difference between the phases of a left-channel signal and a right-channel signal contained in the stereo signal transformed into the predetermined domain falls within a predetermined range, the phase adjustment unit 310 adjusts the phases of the left-channel signal and the right-channel signal by a predetermined phase. This is because the nearer the sum of the vectors of the left-channel signal and the right-channel signal approximates zero, the nearer the difference between the phases of the left-channel signal and the right-channel signal approximates 180 degrees. The predetermined range may be determined based on 180 degrees.


The phase adjustment unit 310 adjusts the phases of the left-channel signal and the right-channel signal by the same phase. For example, if the phase of the left-channel signal is adjusted by an angle of θ°, the phase of the right-channel signal is adjusted by an angle of −θ°.


A method of performing phase adjustment by the phase adjustment unit 310 according to an embodiment of the present invention will now be described. First, Sn,k is calculated as follows:











S

n
,
k


=



L

n
,
k


+

R

n
,
k



2


,




(
1
)







wherein Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


Next, Gn,k is calculated by substituting Sn,k into the following:










G

n
,
k


=


2




S

n
,
k









L

n
,
k




+



R

n
,
k










(
2
)







The phase adjustment unit 310 determines whether to adjust the phases of the left-channel signal and the right-channel signal, depending on whether Gn,k is less than 10−3 that is a predetermined threshold.


If Gn,k is less than 10−3, the phase adjustment unit 310 determines that the phases of the left-channel signal and the right-channel signal are to be adjusted. If Gn,k is equal to or greater than 10−3, the phase adjustment unit 310 determines that the phases of the left-channel signal and the right-channel signal are not to be adjusted.


If Gn,k is less than 10−3, phase adjustment is performed by transforming Sn,k as follows:











S

n
,
k


=




L

n
,
k








+


R

n
,
k






-





2


,




(
3
)







wherein Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, θ denotes a predetermined phase value, e.g., π/100, n denotes a frame number, and k denotes a band number.


Thereafter, the final Sn,k is calculated using Sn,k, as follows:











S

n
,
k


=


S


n
,
k








min






{

2
,







L

n
,
k




2

+




R

n
,
k




2



2





S

n
,
k




2





}



,




(
4
)







wherein Sn,k of the right-hand side of Equation (4) denotes a phasor calculated by Equation (3), Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


As will later be described in detail, the down-mixing unit 330 produces a mono signal by using Sn,k calculated by Equation (4), as follows:











M

n
,
k


=


S

n
,
k










k
=
1

N










L

n
,
k




2


+




R

n
,
k




2



4





k
=
1

N










S

n
,
k




2







,




(
5
)







wherein Mn,k denotes the mono signal, Sn,k denotes the phasor calculated by Equation (4), Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


If the phase adjustment unit 310 adjusts the phases of the left-channel signal and the right-channel signal, the adjusted information encoding unit 320 encodes information related to the adjusted phases. For example, if the phase adjustment unit 310 adjusts the phase of the left-channel signal by an angle of θ° and the phase of right-channel signal by an angle of −θ°, the adjusted information encoding unit 320 encodes information related to the value of the angle of θ°.


If the phase adjustment unit 310 adjusts the phase of the stereo signal, the down-mixing unit 330 downmixes the adjusted stereo signal to a mono signal. If the phase adjustment unit 310 does not adjust the phase of the stereo signal, the transformation unit 300 downmixes the stereo signal that is transformed into the predetermined domain to a mono signal.


The amplitude of the downmixed mono signal may be equal to the average of the amplitudes of a left-channel signal and a right-channel signal. Also, the mono signal may be generated on a half-sum vector of the left-channel signal and the right-channel signal.


The mono signal encoding unit 340 encodes the mono signal obtained by downmixing by the down-mixing unit 330.


The parameter extraction unit 350 extracts parameters from the stereo signal and encodes the parameters, where a decoding terminal uses the parameters in upmixing the mono signal to the stereo signal. The parameters are information for generating the left-channel signal and the right-channel signal based on the mono signal.


The parameters include a size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter, using a decorrelated signal that is a vertical vector component of the mono signal.


The parameters may be generated for each frame and in band units.


The multiplexing unit 360 multiplexes the parameters encoded by the parameter extraction unit 350 and the mono signal encoded by the mono signal encoding unit 340 into a bitstream, and transmits the bitstream to the decoding terminal via an output terminal OUT. Also, if the phase adjustment unit 310 adjusts the phase of the stereo signal, the multiplexing unit 360 also multiplexes information related to the adjusted phase, which is encoded by the adjusted information encoding unit 320.



FIG. 4 is a block diagram of an apparatus for encoding a stereo signal, according to another embodiment of the present invention. The encoding apparatus includes a transformation unit 400, a phase adjustment unit 410, an adjusted information encoding unit 420, a down-mixing unit 430, an inverse transformation unit 440, a mono signal encoding unit 450, a parameter extraction unit 460, and a multiplexing unit 470.


The transformation unit 400 transforms a stereo signal received via an input terminal IN into a predetermined domain by using an analysis filterbank. Here, the predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed as a time domain for each of sub bands in predetermined frequency units.


If the difference between the phases of a left-channel signal and a right-channel signal contained in the stereo signal transformed into the predetermined domain falls within a predetermined range, the phase adjustment unit 410 adjusts the phases of the left-channel signal and the right-channel signal by a predetermined phase. This is because the nearer the difference between the phases of the left-channel signal and the right-channel signal approximates 180 degrees, the nearer the sum of the vectors of the left-channel signal and the right-channel signal approximates zero. The predetermined range may be determined based on 180 degrees.


The phase adjustment unit 410 adjusts the phases of the left-channel signal and the right-channel signal by the same phase. For example, if the phase of the left-channel signal is adjusted by an angle of θ°, the phase of the right-channel signal is adjusted by an angle of −θ°.


A method of performing phase adjustment by the phase adjustment unit 410 according to an embodiment of the present invention will now be described. First, Sn,k is calculated by the following:











S

n
,
k


=



L

n
,
k


+

R

n
,
k



2


,




(
6
)







wherein Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


Next, Gn,k is calculated by substituting Sn,k into the following:










G

n
,
k


=


2




S

n
,
k









L

n
,
k




+



R

n
,
k










(
7
)







The phase adjustment unit 410 determines whether to adjust the phases of the left-channel signal and the right-channel signal, depending on whether Gn,k is less than 10−3 that is a predetermined threshold.


If Gn,k is less than 10−3, the phase adjustment unit 410 determines that the phases of the left-channel signal and the right-channel signal are to be adjusted. If Gn,k is equal to or greater than 10−3, the phase adjustment unit 410 determines that the phases of the left-channel signal and the right-channel signal are not to be adjusted.


If Gn,k is less than 10−3, phase adjustment is performed by transforming Sn,k as follows:











S

n
,
k


=




L

n
,
k








+


R

n
,
k






-





2


,




(
8
)







wherein Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, θ denotes a predetermined value, e.g., π/100, n denotes a frame number, and k denotes a band number.


Thereafter, Sn,k is calculated using Sn,k, as follows:











S

n
,
k


=


S


n
,
k








min






{

2
,







L

n
,
k




2

+




R

n
,
k




2



2





S

n
,
k




2





}



,




(
9
)







wherein Sn,k of the right-hand side of Equation (9) denotes a phasor calculated by Equation (8), Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


As will later be described in detail, the down-mixing unit 430 produces a mono signal by using Sn,k calculated by Equation (9), as follows:











M

n
,
k


=


S

n
,
k










k
=
1

N










L

n
,
k




2


+




R

n
,
k




2



4





k
=
1

N










S

n
,
k




2







,




(
10
)







wherein Mn,k denotes the mono signal, Sn,k denotes the phasor calculated by Equation (9), Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


If the phase adjustment unit 410 adjusts the phases of the left-channel signal and the right-channel signal contained in the stereo signal since the difference between the phases falls within the predetermined range, the adjusted information encoding unit 420 encodes information related to the adjusted phases. For example, if the phase adjustment unit 420 adjusts the phase of the left-channel signal by an angle of θ° and the phase of right-channel signal by an angle of −θ°, the adjusted information encoding unit 320 encodes information related to the value of the angle of θ°.


If the phase adjustment unit 410 adjusts the phase of the stereo signal, the down-mixing unit 430 downmixes the adjusted stereo signal to a mono signal. If the phase adjustment unit 310 does not adjust the phase of the stereo signal, the transformation unit 300 downmixes the stereo signal that is transformed into the predetermined domain to a mono signal.


The amplitude of the downmixed mono signal may be equal to the average of the amplitudes of a left-channel signal and a right-channel signal. Also, the mono signal may be generated on a half-sum vector of the left-channel signal and the right-channel signal.


The inverse transformation unit 440 inversely transforms the domain of the mono signal downmixed by the downmixing unit 430 in the reverse manner that the transformation unit 400 performs transformation, using a synthesis filterbank. For example, the inverse transformation unit 440 performs inverse transformation such that the mono signal that is expressed as spectra in the time domain for each of sub bands at predetermined frequency units is expressed as a time series only in a time domain.


The mono signal encoding unit 450 encodes the mono signal inversely transformed by the inverse transformation unit 440.


The parameter extraction unit 460 extracts parameters from the stereo signal and encodes the parameters, where a decoding terminal uses the parameters in upmixing the mono signal to the stereo signal. The parameters are information for generating the left-channel signal and the right-channel signal based on the mono signal.


The parameters include a size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitudes of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter, using a decorrelated signal that is a vertical vector component of the mono signal.


The parameters may be generated for each frame and in band units.


The multiplexing unit 470 multiplexes the parameters encoded by the parameter extraction unit 460 and the mono signal encoded by the mono signal encoding unit 450 into a bitstream, and transmits the bitstream to the decoding terminal via an output terminal OUT. Also, if the phase adjustment unit 410 adjusts the phase of the stereo signal, the multiplexing unit 470 also multiplexes information related to the adjusted phase, which is encoded by the adjusted information encoding unit 420.



FIGS. 5 and 6 are block diagrams illustrating in detail the parameter extraction unit 350 illustrated in FIG. 3, which is included in an apparatus for encoding a stereo signal according to embodiments of the present invention. As illustrated in FIG. 5, the parameter extraction unit 350 includes a size parameter extraction unit 500 and a phase parameter extraction unit 510. Alternatively, as illustrated in FIG. 6, the parameter extraction unit 350 may further include an enhancement parameter extraction unit 520.


The size parameter extraction unit 500 extracts and encodes a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of a mono signal.


The phase parameter extraction unit 510 extracts and encodes a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal, and to the mono signal. Here, the phase parameter extraction unit 510 may extract the phase parameter that represents the difference between the phases of the left-channel signal and the mono signal, the difference between the phases of the right-channel signal and the mono signal, or the difference between the phases of each of the left-channel signal and the right-channel signal and the mono signal.


The enhancement parameter extraction unit 520 extracts and encodes an enhancement parameter that enhances and controls the phase indicated by the phase parameter, using a decorrelated signal that is a vertical vector component of the mono signal.



FIG. 7 is a block diagram block illustrating in detail the size parameter extraction unit 500 illustrated in FIG. 5 or 6, which is included in an apparatus for encoding a stereo signal according to an embodiment of the present invention. The size parameter extraction unit 500 includes a gain calculation unit 700 and a size parameter encoding unit 710.


The gain calculation unit 700 calculates a gain that minimizes the difference between the energy levels of an actual stereo signal and a stereo signal that is to be produced from a mono signal by applying the calculated gain in order to minimize an error between the amplitudes of the actual stereo signal and a stereo signal that is to be decoded by a decoding terminal, on an assumption that the amplitude of a left-channel signal has a predetermined relation to the amplitude of a right-channel signal.


The calculated gain is used to determine the amplitudes of the left-channel signal and the right-channel signal when the decoding terminal upmixes the mono signal to a stereo signal.


For example, if it is assumed that the predetermined relation between the amplitudes of the left-channel signal and the right-channel signal is that the amplitude of the mono signal is equal to the average of the amplitudes of the left-channel signal and the right-channel signal, the left-channel signal and the right-channel signal can be expressed as follows:

ãn,kL=gman,kM
ãn,kR=(2−gm)an,kM  (11)


wherein ãn,kL denotes the amplitude of the left-channel signal when the gain calculated by the gain calculation unit 700 is applied, ãn,kR denotes the amplitude of the right-channel signal to which the gain is applied, gm denotes the gain used to calculate the amplitude of a signal, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


The difference between the energy levels of the actual stereo signal and the stereo signal obtained by applying the calculated gain can be calculated by the following Equation (12) into which Equation (11) has been substituted:













E

n
,
k


L





R


=






n








(



a
~


n
,
k

L

-

a

n
,
k

L


)

2


+



n








(



a
~


n
,
k

R

-

a

n
,
k

R


)

2










=






n








(



g
m



a

n
,
k

M


-

a

n
,
k

L


)

2


+



n








(



(

2
-

g
m


)



a

n
,
k

M


-

a

n
,
k

R


)

2




,







(
12
)







wherein En,kLR denotes the difference between the energy levels of the actual stereo signal and the stereo signal to which the calculated gain is applied, ãn,kL denotes the amplitude of the left-channel signal to which the calculated gain is applied, ãn,kR denotes the amplitude of the right-channel signal to which the calculated gain is applied, an,kL denotes the amplitude of an actual left-channel signal, an,kR denotes the amplitude of an actual right-channel signal, gm denotes the gain used to calculate the amplitude of a signal, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


Equation (12) into which Equation (11) has been substituted can be expressed with respect to the gain gm, as follows:











g
m

=

1
+





n









k








a

n
,
k

M



a

n
,
k

L




-



n









k








a

n
,
k

M



a

n
,
k

R






2




n









k








(

a

n
,
k

M

)

2







,




(
13
)







wherein gm denotes the gain used to calculate the amplitude of a signal, an,kL denotes the amplitude of the actual left-channel signal, an,kR denotes the amplitude of the actual right-channel signal, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


Thus, the gain calculation unit 700 can calculate the gain that minimizes the difference between the energy levels of the actual stereo signal and the stereo signal that is produced from the mono signal by applying the gain by substituting the actual left-channel signal amplitude an,kL, the actual right-channel signal amplitude an,kR, and the mono signal amplitude an,kM into Equation (13).


The size parameter encoding unit 710 encodes the gain.



FIG. 8 is a block diagram block illustrating in detail the phase parameter extraction unit 510 illustrated in FIG. 5 or 6, which is included in an apparatus for encoding a stereo signal according to an embodiment of the present invention. The phase parameter extraction unit 510 includes a phase difference calculation unit 800 and a phase parameter encoding unit 810.


The phase difference calculation unit 800 calculates a phase difference that minimizes the difference between the phases of an actual stereo signal and a stereo signal that is to be generated by applying a phase difference that is to be calculated by the phase difference calculation unit 800, in order to minimize an error between the phases of the actual stereo signal and a stereo signal that is to be decoded by a decoding terminal, on an assumption that the phase of a left-channel signal has a predetermined relation to the phase of a right-channel signal.


The difference between the energy levels of the actual stereo signal and the stereo signal that is to be generated can be calculated by:

En,kLR=2(an,kR)2[1−cos(φn,kR−φn,kMn,kR)]+2(an,kL)2[1−cos(φn,kM−φn,kLn,kL)]  (14),


wherein En,kLR denotes the difference between the energy levels of the actual stereo signal and the stereo signal that is to be generated, an,kR denotes the amplitude of an actual right-channel signal, an,kL denotes the amplitude of an actual left-channel signal, φn,kR denotes the phase of the actual right-channel signal, φn,kM denotes the phase of a mono signal, φn,kL denotes the phase of the actual left-channel signal, ψn,kR denotes the difference between the phases of the mono signal and the right-channel signal, ψn,kL denotes the difference between the phases of the mono signal and the left-channel signal, n denotes a frame number, and k denotes a band number.


If it is assumed that the difference between the phases of the left-channel signal and the mono signal is equal to that between the phases of the right-channel signal and the mono signal in Equation (14), that is, if it is assumed that ψn,kR and ψn,kL has the same value, e.g., ψn,k, Equation (14) can be expressed by:











t






g


(

ψ

n
,
k


)



=








n









k









(

a

n
,
k

R

)

2


sin


(


φ

n
,
k

M

-

φ

n
,
k

R


)




+








n









k









(

a

n
,
k

L

)

2



sin


(


φ

n
,
k

L

-

φ

n
,
k

M


)















n









k









(

a

n
,
k

R

)

2



cos


(


φ

n
,
k

M

-

φ

n
,
k

R


)





+








n









k









(

a

n
,
k

L

)

2



cos


(


φ

n
,
k

L

-

φ

n
,
k

M


)










,




(
15
)







wherein ψn,k denotes the difference between the phases of the mono signal and the stereo signal, an,kR denotes the amplitude of the actual right-channel signal, an,kL denotes the amplitude of the actual left-channel signal, φn,kR denotes the phase of the actual right-channel signal, φn,kM denotes the phase of the mono signal, φn,kL denotes the phase of the actual left-channel signal, n denotes a frame number, and k denotes a band number.


Thus, the phase difference calculation unit 800 can calculate the phase difference that minimizes the difference between the energy levels of the actual stereo signal and the stereo signal that is to be generated, by substituting the actual left-channel signal amplitude an,kL, the actual right-channel signal amplitude an,kR, the actual left-channel signal phase φn,kL, the actual right-channel signal phase φn,kR, and the mono signal phase φn,kM into Equation (15).


The phase parameter encoding unit 810 encodes the phase difference calculated by the phase difference calculation unit 800.



FIG. 9 is a block diagram block illustrating in detail the enhancement parameter extraction unit 520 illustrated in FIG. 6, which is included in an apparatus for encoding a stereo signal according to an embodiment of the present invention. The enhancement parameter extraction unit 520 includes a phase calculation unit 900 and an enhancement parameter encoding unit 910.


The phase calculation unit 900 calculates a second phase for enhancing and controlling a first phase indicated by a phase parameter encoded by the parameter extraction unit 510, using a decorrelated signal that is a vertical vector component of a mono signal.


For example, the phase calculation unit 900 calculates the second phase for enhancing and controlling the first phase, using the following:











t






g


(

α
k

)



=

min
[

1
,



2


(





n
=

b
k




b

k
+
1


-
1










(

a

n
,
k

L

)

2



(

1
-

cos


(


φ

n
,
k

L

-

φ

n
,
k

M

-

ψ

n
,
k



)



)



+








n
=

b
k




b

k
+
1


-
1










(

a

n
,
k

R

)

2



(

1
-

cos


(


φ

n
,
k

R

-

φ

n
,
k

M

-

ψ

n
,
k



)



)




)







n
=

b
k




b

k
+
1


-
1









(

a

n
,
k

L

)

2


+




n
=

b
k




b

k
+
1


-
1









(

a

n
,
k

R

)

2






]


,




(
16
)







wherein an,kL denotes the amplitude of an actual left-channel signal, φn,kL denotes the phase of the actual left-channel signal, φn,kM denotes the phase of the mono signal, ψn,k denotes the difference between the phases of the mono signal and the stereo signal, an,kR denotes the amplitude of an actual right-channel signal, φn,kR denotes the phase of the actual right-channel signal, bk denotes a band border value, n denotes a frame number, and k denotes a band number.


Thus, the phase calculation unit 900 can calculate the second phase by using the actual left-channel signal amplitude an,kL, the actual left-channel signal phase φn,kL, the mono signal phase φn,kM the difference ψn,k between the phases of the mono signal and the stereo signal, the actual right-channel signal amplitude an,kR, and the actual right-channel signal phase φn,kR.



FIG. 10 is a block diagram of an apparatus for decoding a stereo signal, according to an embodiment of the present invention. The decoding apparatus includes an inverse multiplexing unit 1000, a mono signal decoding unit 1010, a parameter decoding unit 1020, an up-mixing unit 1030, and an inverse transformation unit 1040.


The inverse multiplexing unit 1000 receives a bitstream from an encoding terminal (not shown) via an input terminal IN, and inversely multiplexes the bitstream. The bitstream contains parameters necessary to upmix a mono signal generated by an encoding apparatus (not shown), and the mono signal encoded by the encoding apparatus.


The mono signal decoding unit 1010 decodes the encoded mono signal inversely multiplexed by the inverse multiplexing unit 1000.


The parameter decoding unit 1020 decodes the parameters inversely multiplexed by the inverse multiplexing unit 1000. The decoded parameters include a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal. The parameters may be produced for each frame and in band units.


The up-mixing unit 1030 upmixes the decoded mono signal to a stereo signal by using the decoded parameters, such as the size parameter, the phase parameter, and the enhancement parameter. When the up-mixing unit 1030 upmixes the mono signal to a stereo signal containing a left-channel signal and a right-channel signal, the amplitudes of the left-channel signal and the right-channel signal are determined using the mono signal according to the size parameter, the phases of the left-channel signal and the right-channel signal are determined using the mono signal according to the phase parameter, and the determined phases of the left-channel signal and the right-channel signal are enhanced and controlled using a decorrelated signal according to the enhancement parameter.


The inverse transformation unit 1040 inversely transforms the domain of the stereo signal that was upmixed by the up-mixing unit 1030 in the reverse manner of that transformed by the transformation unit 100 illustrated in FIG. 1 performs transformation, by using the synthesis filterbank, and then outputs the result of inverse transformation via an output terminal OUT. For example, the inverse transformation unit 1040 performs inverse transformation such that the mono signal expressed in the time domain as spectra for each of the sub bands at predetermined frequency units is expressed only in the time domain.



FIG. 11 is a block diagram of an apparatus for decoding a stereo signal, according to another embodiment of the present invention. The decoding apparatus includes an inverse multiplexing unit 1100, a mono signal decoding unit 1110, a transformation unit 1120, a parameter decoding unit 1130, an up-mixing unit 1140, and an inverse transformation unit 1150.


The inverse multiplexing unit 1100 receives a bitstream from an encoding terminal (not shown) via an input terminal IN, and inversely multiplexes the bitstream. The bitstream contains parameters necessary to upmix a mono signal generated by an encoding apparatus (not shown), and the mono signal encoded by the encoding apparatus.


The mono signal decoding unit 1110 decodes the encoded mono signal demultiplexed from the inverse multiplexing unit 1100.


The transformation unit 1120 transforms the decoded mono signal into a predetermined domain by using an analysis filterbank. The predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed as spectra in the time domain for each of the sub bands at predetermined frequency units.


The parameter decoding unit 1130 decodes the parameters multiplexed by the inverse multiplexing unit 1100. The parameters include a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal. The parameters may be produced for each frame and in band units.


The up-mixing unit 1140 upmixes the decoded mono signal to a stereo signal by using the decoded parameters, such as the size parameter, the phase parameter, and the enhancement parameter. When the up-mixing unit 1130 upmixes the mono signal to a stereo signal containing a left-channel signal and a right-channel signal, the amplitudes of the left-channel signal and the right-channel signal are determined using the mono signal according to the size parameter, the phases of the left-channel signal and the right-channel signal are determined using the mono signal according to the phase parameter, and the determined phases of the left-channel signal and the right-channel signal are enhanced and controlled using a decorrelated signal according to the enhancement parameter.


The inverse transformation unit 1150 inversely transforms the domain of the stereo signal that was upmixed by the up-mixing unit 1140 in the reverse manner of that performed by transformation unit 1120, using the synthesis filterbank, and then outputs the result of inverse transformation via an output terminal OUT. For example, the inverse transformation unit 1150 performs inverse transformation such that the mono signal expressed in the time domain as spectra for each of the sub bands at predetermined frequency units is expressed as a time series only in the time domain.



FIG. 12 is a block diagram of an apparatus for decoding a stereo signal, according to another embodiment of the present invention. The decoding apparatus includes an inverse multiplexing unit 1200, a mono signal decoding unit 1210, a parameter decoding unit 1220, an up-mixing unit 1230, an adjusted information decoding unit 1240, a phase adjustment unit 1250, and an inverse transformation unit 1260.


The inverse multiplexing unit 1200 receives a bitstream from an encoding terminal (not shown) via an input terminal IN, and inversely multiplexes the bitstream. The bitstream contains parameters necessary to upmix a mono signal generated by an encoding apparatus (not shown), and the mono signal encoded by the encoding apparatus. If the encoding apparatus has adjusted the phase of the stereo signal due to the difference between the phases of a left-channel signal and a right-channel signal contained in the stereo signal fell within a predetermined range, the bitstream further contains information regarding the phase of the stereo signal, which is adjusted by the encoding apparatus.


The mono signal decoding unit 1210 decodes the inversely multiplexed mono signal.


The parameter decoding unit 1220 decodes the parameters that were inversely multiplexed by the inverse multiplexing unit 1200. The parameters include a size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal. The parameters may be produced for each frame and in band units.


The up-mixing unit 1230 upmixes the decoded mono signal to a stereo signal by using the decoded parameters, such as the size parameter, the phase parameter, and the enhancement parameter. When the up-mixing unit 1230 upmixes the mono signal to a stereo signal containing the left-channel signal and the right-channel signal, the amplitude of the left-channel signal and the right-channel signal are determined using the mono signal according to the size parameter, the phases of the left-channel signal and the right-channel signal are determined using the mono signal according to the phase parameter, and the determined phases of the left-channel signal and the right-channel signal are enhanced and controlled using a decorrelated signal according to the enhancement parameter.


If the encoding apparatus adjusted the phases of the left-channel signal and the right-channel signal because the difference between the phases of the left-channel signal and the right-channel signal fell within the predetermined range, that is, if the inversely multiplexed bitstream contains the information regarding the adjusted phases, the adjusted information decoding unit 1240 decodes the information regarding the adjusted phases. For example, if the encoding apparatus adjusts the phase of the left-channel signal by an angle of θ° and the phase of the right-channel signal by an angle of −θ°, the information regarding the adjusted phases indicates the angle of θ°.


If the inversely multiplexed bitstream contains the information regarding the adjusted phases, the phase adjustment unit 1250 respectively adjusts the phases of the left-channel signal and the right-channel signal that are upmixed to the stereo signal, by the adjusted phases. However, if the inversely multiplexed bitstream does not contain the information regarding the adjusted phases, the phase adjustment unit 1250 does not adjust the phases of the left-channel signal and the right-channel signal that are upmixed to the stereo signal.


If the inversely multiplexed bitstream contains the information regarding the adjusted phases, the inverse transformation unit 1260 inversely transforms the domain of the stereo signal adjusted by the phase adjustment unit 1250 in the reverse manner that the transformation unit 300 illustrated in FIG. 3 performs transformation, using the synthesis filterbank, and then outputs the result of transformation via an output terminal OUT. For example, the inverse transformation unit 1260 inversely transforms the mono signal, which is expressed in the time domain as spectra for each of the sub bands in predetermined frequency units, only as a time domain.


However, if the inversely multiplexed bitstream does not contain the information regarding the adjusted phases, the inverse transformation unit 1260 inversely transforms the stereo signal upmixed by the up-mixing unit 1230.



FIG. 13 is a block diagram of an apparatus for decoding a stereo signal, according to another embodiment of the present invention. The decoding apparatus includes an inverse multiplexing unit 1300, a mono signal decoding unit 1310, a transformation unit 1320, a parameter decoding unit 1330, an up-mixing unit 1340, an adjusted information decoding unit 1350, a phase adjustment unit 1360, and an inverse transformation unit 1370.


The inverse multiplexing unit 1300 receives a bitstream from an encoding terminal (not shown) via an input terminal IN, and inversely multiplexes the bitstream. The bitstream contains parameters necessary to upmix a mono signal generated by an encoding apparatus (not shown), and the mono signal encoded by the encoding apparatus. If the encoding apparatus has adjusted the phase of the stereo signal due to the difference between the phases of a left-channel signal and a right-channel signal contained in the stereo signal falling within a predetermined range, the bitstream further contains information regarding the phase of the stereo signal, which is adjusted by the encoding apparatus.


The mono signal decoding unit 1320 decodes the inversely multiplexed mono signal.


The transformation unit 1320 transforms the mono signal, which was decoded by mono signal decoding unit 1320, into a predetermined domain by using the analysis filterbank. The predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed in the time domain as spectra for each of the sub bands at predetermined frequency units.


The parameter decoding unit 1330 decodes the parameters that were inversely multiplexed by the inverse multiplexing unit 1300. The parameters include a size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal. The parameters may be produced for each frame and in band units.


The up-mixing unit 1340 upmixes the transformed mono signal to a stereo signal by using the decoded parameters, such as the size parameter, the phase parameter, and the enhancement parameter. When the up-mixing unit 1340 upmixes the mono signal to a stereo signal containing the left-channel signal and the right-channel signal, the amplitude of the left-channel signal and the right-channel signal are determined using the mono signal according to the amplitude parameter, the phases of the left-channel signal and the right-channel signal are determined using the mono signal according to the phase parameter, and the determined phases of the left-channel signal and the right-channel signal are enhanced and controlled using a decorrelated signal according to the enhancement parameter.


If the encoding apparatus adjusted the phases of the left-channel signal and the right-channel signal because the difference between the phases of the left-channel signal and the right-channel signal fell within the predetermined range, that is, if the inversely multiplexed bitstream contains the information regarding the adjusted phases, the adjusted information decoding unit 1350 decodes the information regarding the adjusted phases. For example, if the encoding apparatus adjusts the phase of the left-channel signal by an angle of θ° and the phase of the right-channel signal by an angle of −θ°, the information regarding the adjusted phases indicates the angle of θ°.


If the inversely multiplexed bitstream contains the information regarding the adjusted phases, the phase adjustment unit 1360 respectively adjusts the phases of the left-channel signal and the right-channel signal that are upmixed to the stereo signal, by the adjusted phases. However, if the inversely multiplexed bitstream does not contain the information regarding the adjusted phases, the phase adjustment unit 1360 does not adjust the phases of the left-channel signal and the right-channel signal that are upmixed to the stereo signal.


If the inversely multiplexed bitstream contains the information regarding the adjusted phases, the inverse transformation unit 1370 inversely transforms the domain of the stereo signal adjusted by the phase adjustment unit 1360 in the reverse manner of that performed by the transformation unit 1320, using the synthesis filterbank and then outputs the result of transformation via an output terminal OUT. For example, the inverse transformation unit 1370 inversely transforms the mono signal, which is expressed in the time domain as spectra for each of the sub bands at predetermined frequency units, as a time series only in the time domain.


However, if the inversely multiplexed bitstream does not contain the information regarding the adjusted phases, the inverse transformation unit 1370 inversely transform the stereo signal upmixed by the up-mixing unit 1340.



FIGS. 14 and 15 are block diagrams illustrating in detail the parameter decoding unit 1020, 1130, 1220 or 1330 that is included in an apparatus for encoding a stereo signal, according to embodiments of the present invention. The parameter decoding unit 1020, 1130, 1220 or 1330 includes a size parameter decoding unit 1400 and a phase parameter decoding unit 1410 as illustrated in FIG. 14 but may further include an enhancement parameter decoding unit 1430 as illustrated in FIG. 15.


The size parameter decoding unit 1400 decodes a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of a mono signal.


The phase parameter decoding unit 1410 decodes a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal, and the mono signal.


The enhancement parameter extraction unit 1420 decodes an enhancement parameter for enhancing and controlling the phase indicated by the phase parameter, using a decorrelated signal that is a vertical vector component of the mono signal.



FIG. 16 is a block diagram illustrating in detail the up-mixing unit 1030, 1140, 1230 or 1340 that is included in an apparatus for decoding a stereo signal, according to an embodiment of the present invention. The up-mixing unit 1030, 1140, 1230 or 1340 includes a amplitude calculation unit 1600, a phase calculation unit 1610, and a signal generation unit 1620.


The amplitude calculation unit 1600 calculates the amplitudes of a left-channel signal and a right-channel signal based on a mono signal, using the size parameter decoded by the size parameter decoding unit 1400 illustrated in FIG. 14 or 15. Here, the size parameter is a gain calculated by an encoding apparatus (not shown) so that the difference between the energy levels of an actual stereo signal and a stereo signal that is to be decoded by a decoding terminal (not shown) can be minimized in order, which minimizes an error between the amplitudes of the actual stereo signal and the stereo signal.


If it is assumed that the relation between the left-channel signal and the right-channel signal is set so that the amplitude of the mono signal can be equal to the average of the amplitudes of the left-channel signal and the right-channel signal, the amplitude calculation unit 1600 can calculate the amplitudes of the left-channel signal and the right-channel signal by using the following:

ãn,kL=gman,kM
ãn,kR=(2−gm)an,kM  (17),


wherein ãn,kL and ãn,kR respectively denote the amplitudes of the left-channel signal and the right-channel signal calculated by the amplitude calculation unit 1600, gm denotes the gain, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


The phase calculation unit 1610 calculates the phases of the left-channel signal and the right-channel signal, based on the phase of the mono signal by using the phase parameter decoded by the phase parameter decoding unit 1410 illustrated in FIG. 14 or 15. Here, the phase parameter is a phase difference ψn,k calculated so that the difference between the energy levels of the actual stereo signal and the stereo signal to which the calculated phase difference is to be applied can be minimized in order to minimize an error between the phases of the actual stereo signal and the stereo signal that is to be decoded.


If the phase parameter is the phase difference ψn,k on an assumption that both the encoding apparatus and the decoding apparatus have predetermined that the difference between the phases of the left-channel signal and the mono signal is equal to the difference between the phases of the right-channel signal and the mono signal, the phase calculation unit 1610 can calculate the phase of the left-channel signal by adding ψn,k to the phase of the mono signal and the phase of the right-channel signal by subtracting ψn,k from the phase of the mono signal.


The signal generation unit 1620 generates the stereo signal by generating the left-channel signal and the right-channel signal, based on the amplitudes of the left-channel signal and the right-channel signal, which are calculated by the amplitude calculation unit 1600, and the phases of the left-channel signal and the right-channel signal, which are calculated by the phase calculation unit 1610.


For example, referring to FIG. 18, a left-channel signal {tilde over (L)}n,k and a right-channel signal {tilde over (R)}n,k are produced by determining the amplitudes of them by applying the gain gm, based on a mono signal Mn,k, and then respectively determining the phases of the left-channel signal {tilde over (L)}n,k, and the right-channel signal {tilde over (R)}n,k by applying the phase difference θ, that is, by respectively rotating the mono signal Mn,k by an angle of θ° and an angle of −θ°.



FIG. 17 a block diagram illustrating in detail the up-mixing unit 1030, 1140, 1230 or 1340 that is included in an apparatus for encoding a stereo signal, according to another embodiment of the present invention. The up-mixing unit 1030, 1140, 1230 or 1340 includes a amplitude calculation unit 1700, a phase calculation unit 1710, a decorrelator 1720, a phase adjustment unit 1730, and a signal generation unit 1740.


The amplitude calculation unit 1700 calculates the amplitudes of a left-channel signal and a right-channel signal based on a mono signal, using the size parameter decoded by the size parameter decoding unit 1400 illustrated in FIG. 14 or 15. Here, the size parameter is a gain calculated by an encoding apparatus (not shown) so that the difference between the energy levels of an actual stereo signal and a stereo signal that is to be decoded by a decoding terminal (not shown) can be minimized in order to minimize an error between the amplitudes of the actual stereo signal and the stereo signal.


If it is assumed that the relation between the left-channel signal and the right-channel signal is set so that the amplitude of the mono signal can be equal to the average of the amplitudes of the left-channel signal and the right-channel signal, the amplitude calculation unit 1700 can calculate the amplitudes of the left-channel signal and the right-channel signal by using the following:

ãn,kL=gman,kM
ãn,kR=(2−gm)an,kM  (18),


wherein ãn,kL and ãn,kR respectively denote the amplitudes of the left-channel signal and the right-channel signal that are calculated by the amplitude calculation unit 1700, gm denotes the gain, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


The phase calculation unit 1710 calculates the phases of the left-channel signal and the right-channel signal, based on the phase of the mono signal by using the phase parameter decoded by the phase parameter decoding unit 1410 illustrated in FIG. 14 or 15. Here, the phase parameter is a phase difference ψn,k calculated so that the difference between the energy levels of the actual stereo signal and the stereo signal that is to be decoded can be minimized in order to minimize an error between the phases of the actual stereo signal and the stereo signal that is to be decoded.


If the phase parameter is a phase difference ψn,k on an assumption that both the encoding apparatus and the decoding apparatus have predetermined that the difference between the phases of the left-channel signal and the mono signal is equal to the difference between the phases of the right-channel signal and the mono signal, the phase calculation unit 1710 can calculate the phase of the left-channel signal by adding ψn,k to the phase of the mono signal and the phase of the right-channel signal by subtracting ψn,k from the phase of the mono signal.


The decorrelator 1720 produces a decorrelated signal that is a vertical vector component of the mono signal.


The phase adjustment unit 1730 adjusts the left-channel signal and the right-channel signal by enhancing the phases of the left-channel signal and the right-channel signal calculated by the phase calculation unit 1710 based on the decorrelated signal and the mono signal, using the enhancement parameter decoded by the enhancement parameter decoding unit 1420 illustrated in FIG. 15. If it is assumed that the enhancement parameter is αm calculated by the encoding apparatus, it is possible to adjust the left-channel signal by using Equation (19) and the right-channel signal by using Equation (20), as follows:














L
^


n
,
k


=






L
~


n
,
k




cos


(

α
m

)



+


g
m







n
,
k





D

n
,
k




sin


(

α
m

)











=





g
m



M

n
,
k








n
,
k




cos






(

α
m

)


+


g
m







n
,
k





D

n
,
k




sin


(

α
m

)





,







(
19
)







wherein {circumflex over (L)}n,k denotes the left-channel signal adjusted by the phase adjustment unit 1730, {tilde over (L)}n,k denotes the left-channel signal obtained by applying the amplitude and phase of the left-channel signal that are respectively calculated by the amplitude calculation unit 1700 and the phase calculation unit 1710, gm denotes the gain, ψn,k denotes the phase difference indicated by the phase parameter, Dn,k denotes the amplitude of the decorrelated signal, αm denotes the phase indicated by the enhancement parameter, and Mn,k denotes the amplitude of the mono signal.














R
^


n
,
k


=






R
~


n
,
k




cos


(

α
m

)



-


(

2
-

g
m


)





-



n
,
k






D

n
,
k




sin


(

α
m

)











=





(

2
-

g
m


)



M

n
,
k






-



n
,
k





cos






(

α
m

)


-


(

2
-

g
m


)





-



n
,
k






D

n
,
k




sin


(

α
m

)





,







(
20
)







wherein {circumflex over (R)}n,k denotes the right-channel signal adjusted by the phase adjustment unit 1730, {tilde over (R)}n,k denotes a right-channel signal obtained by applying the amplitude and phase of the right-channel signal that are respectively calculated by the amplitude calculation unit 1700 and the phase calculation unit 1710, gm denotes the gain, ψn,k denotes a phase difference indicated by phase parameter, Dn,k denotes the amplitude of the decorrelated signal, αm denotes the phase indicated by the enhancement parameter, and Mn,k denotes the amplitude of the mono signal.


The signal generation unit 1740 generates the stereo signal by generating the left-channel signal and the right-channel signal, based on the amplitude of the left-channel signal and the right-channel signal, which are calculated by the amplitude calculation unit 1700, the phases of the left-channel signal and the right-channel signal, which are calculated by the phase calculation unit 1710, and the phases of the left-channel signal and the right-channel signal adjusted by the phase adjustment unit 1730.


For example, referring to FIG. 18, a first left-channel signal {tilde over (L)}n,k and a first right-channel signal {tilde over (R)}n,k are produced by determining their amplitudes by the gain gm based on a mono signal Mn,k and respectively determining their phases of the first left-channel signal {tilde over (L)}n,k and the first right-channel signal {tilde over (R)}n,k by rotating the mono signal Mn,k by an angle of θ° and an angle of −θ° by applying the phase difference θ.


Next, a second left-channel signal Lk is produced by adjusting the amplitude of the first left-channel signal {tilde over (L)}n,k to |{tilde over (L)}n,k|cos(αm), a second decorrelated signal Dn,kL is produced by rotating the first decorrelated signal Dn,k by the phase difference ψn,k, a third left-channel signal Ln,k′ is produced by adjusting the amplitude of the second left-channel signal Ln,k to |{tilde over (L)}n,k|sin(αm)(=Mngm sin(αm)=gm|Mn|sin(αm)=gm|Dn|sin(αm)), and then, a fourth left-channel signal {circumflex over (L)}n,k is produced by vector addition of the second left-channel signal Ln,k and the third left-channel signal Ln,k′. A fourth right-channel signal {circumflex over (R)}n,k is produced in the same way that the left-channel signal {circumflex over (L)}n,k is produced.


As illustrated in FIG. 19, the up-mixing unit 1030, 1140, 1230 or 1340 can receive the mono signal Mn,k, produce the decorrelated signal Dn,k by the decorrelator 1720, and then, produce the left-channel signal {circumflex over (L)}n,k and the right-channel signal {circumflex over (R)}n,k based on the mono signal Mn,k and the decorrelated signal Dn,k by using the gain gm determined from the size parameter, the phase difference ψn,k determined from the phase parameter, and the phase αm determined from the enhancement parameter.


The method, illustrated in FIG. 19, of generating a left-channel signal and a right-channel signal can be simply expressed by:













[





L
^


n
,
k








R
^


n
,
k





]

=




[








n
,
k





0




0





-



n
,
k







]



[




g
m



0




0



(

2
-

g
m


)




]












[




cos






(

α
m

)





sin






(

α
m

)







cos






(

α
m

)






-
sin







(

α
m

)





]





[




M

n
,
k







D

n
,
k





]









=



[





g
m







n
,
k





cos


(

α
m

)







g
m







n
,
k





sin


(

α
m

)









(

2
-

g
m


)





-



n
,
k






cos


(

α
m

)







-

(

2
-

g
m


)






-



n
,
k






sin


(

α
m

)






]















[




M

n
,
k







D

n
,
k





]

,










(
21
)







wherein {circumflex over (L)}n,k denotes the finally generated left-channel signal, {circumflex over (R)}n,k denotes the finally generated right-channel signal, ψn,k denotes the phase difference represented by the phase parameter, gm denotes the gain, αm denotes the phase represented by the enhancement parameter, Mn,k denotes the mono signal, and Dn,k denotes the decorrelated signal.


According to Equation (21), first, rotation transformation is performed on the mono signal Mn,k and the decorrelated signal Dn,k size transformation is performed, and then, phase adjustment is performed, but the present invention is not limited thereto.



FIG. 20 is a flowchart illustrating a method of encoding a stereo signal, according to an embodiment of the present invention.


First, a received stereo signal is transformed into a predetermined domain by using an analysis filterbank (operation 2000). Here, the predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed in the time domain as spectra for each of the sub bands at predetermined frequency units.


Next, the stereo signal that is transformed into the predetermined domain is downmixed to a mono signal (operation 2010).


The amplitude of the downmixed mono signal may be equal to the average of the amplitudes of a left-channel signal and a right-channel signal, and the mono signal may be generated on a half sum vector of the left-channel signal and the right-channel signal.


Next, the downmixed mono signal is encoded (operation 2020).


Next, parameters necessary to upmix the mono signal to the stereo signal by a decoding process are extracted from the stereo signal, and the extracted parameters are encoded (operation 2030). The extracted parameters are information for producing the left-channel signal and the right-channel signal, based on the mono signal.


The parameters include a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal.


The parameters, which are extracted in operation 2030, may be produced for each frame, in band units.


Thereafter, the parameters encoded in operation 2030 and the mono signal encoded in operation 2020 are multiplexed together, thereby obtaining a bitstream (operation 2040).



FIG. 21 is a flowchart illustrating a method of encoding a stereo signal, according to another embodiment of the present invention.


First, a received stereo signal is transformed into a predetermined domain by using an analysis filterbank (operation 2100). Here, the predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed in the time domain as spectra for each of the sub bands at predetermined frequency units. For example, the transformation may be achieved by using a Quadrature Mirror Filterbank (QMF) and/or Lapped Orthogonal Transform (LOT).


Next, the stereo signal that is transformed into the predetermined domain is downmixed to a mono signal operation (operation 2110).


The amplitude of the downmixed mono signal may be equal to the average of the amplitudes of a left-channel signal and a right-channel signal, and the mono signal may be generated on a half sum vector of the left-channel signal and the right-channel signal.


Next, an inverse operation of the transformation performed in operation 2100 is performed on the domain of the downmixed mono signal, that is, the domain is inversely transformed using the synthesis filterbank (operation 2120). For example, in operation 2120, inverse transformation is performed so that the mono signal, which is expressed in the time domain for each of the sub bands at predetermined frequency units, can be expressed as a time series only in the time domain.


Next, the inversely transformed mono signal is encoded (operation 2130).


Next, parameters necessary to upmix the mono signal to the stereo signal by a decoding process are extracted from the stereo signal, and the extracted parameters are encoded (operation 2140). The extracted parameters are information for producing the left-channel signal and the right-channel signal, based on the mono signal.


The parameters include a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal.


The parameters, which are extracted in operation 2140, may be produced for each frame, in band units.


Thereafter, the parameters encoded in operation 2130 and the mono signal encoded in operation 2130 are multiplexed together, thereby obtaining a bitstream (operation 2150).



FIG. 22 is a flowchart illustrating a method of encoding a stereo signal, according to another embodiment of the present invention.


First, a received stereo signal is transformed into a predetermined domain by using an analysis filterbank (operation 2000). Here, the predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed in the time domain as spectra for each of the sub bands at predetermined frequency units.


Next, it is determined whether the difference between the phases of a left-channel signal and a right-channel signal contained in the stereo signal, which is transformed into the predetermined domain, falls within a predetermined range (operation 2205). This is because the nearer the difference between the phases of the left-channel signal and the right-channel signal approximates 180 degrees, the nearer the sum of the vectors of the left-channel signal and the right-channel signal approximates zero. The predetermined range may be determined based on 180 degrees.


In an embodiment of the present invention, operation 2205 is performed as follows. First, Sn,k is calculated by:











S

n
,
k


=



L

n
,
k


+

R

n
,
k



2


,




(
22
)







wherein Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


Next, Gn,k is calculated by substituting Sn,k into the following:











G

n
,
k


=


2




S

n
,
k









L

n
,
k




+



R

n
,
k







,




(
23
)







In operation 2205, it is determined whether to adjust the phases of the left-channel signal and the right-channel signal, depending on whether Gn,k is less than 10−3 that is a predetermined threshold.


If Gn,k is less than 10−3, the phases of the left-channel signal and the right-channel signal are determined to be adjusted. If Gn,k is equal to or greater than 10−3, the phases of the left-channel signal and the right-channel signal are determined not to be adjusted.


If it is determined in operation 2205 that the difference between the phases of the left-channel signal and the right-channel signal falls within the predetermined range, the phases of the left-channel signal and the right-channel signal are adjusted by a predetermined phase (operation 2210).


In operation 2210, the phases of the left-channel signal and the right-channel signal are adjusted by the same phase. If the phase of the left-channel signal is adjusted by an angle of θ°, the phase of the right-channel signal is adjusted by an angle of −θ°.


Returning to operation 2205, phase adjustment is performed by transforming Sn,k as follows:











S

n
,
k


=




L

n
,
k








+


R

n
,
k






-





2


,




(
24
)







wherein Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, θ denotes a predetermined value, e.g., π/100, n denotes a frame number, and k denotes a band number.


Next, information regarding the phases adjusted in operation 2210 is encoded (operation 2220). For example, if the phases of the left-channel signal and the right-channel signal are respectively adjusted by the angle of θ° and the angle of −θ° in operation 2210, information regarding the angle of θ° is encoded.


Next, the stereo signal whose phase is adjusted in operation 2210 or the stereo signal transformed into the predetermined domain in operation 2200 is downmixed to the mono signal (operation 2230).


Returning to operation 2205, the final Sn,k is calculated by:











S

n
,
k


=


S

n
,
k







min






{

2
,







L

n
,
k




2

+




R

n
,
k




2



2





S

n
,
k




2





}



,




(
25
)







wherein Sn,k of the right-hand side of Equation (25) denotes a phasor calculated by Equation (3), Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


In operation 2230, the mono signal is produced a mono signal by using Sn,k calculated by Equation (25), as follows:











M

n
,
k


=


S

n
,
k










k
=
1

N






L

n
,
k




2


+




R

n
,
k




2



4





k
=
1

N






S

n
,
k




2







,




(
26
)







wherein Mn,k denotes the mono signal, Sn,k denotes the phasor calculated by Equation (25), Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


The amplitude of the downmixed mono signal may be equal to the average of the amplitudes of a left-channel signal and a right-channel signal, and the mono signal may be generated on a half sum vector of the left-channel signal and the right-channel signal.


Next, the downmixed mono signal is encoded (operation 2240).


Next, parameters necessary to upmix the mono signal to the stereo signal by a decoding process are extracted from the stereo signal, and the extracted parameters are encoded (operation 2250). The extracted parameters are information for producing the left-channel signal and the right-channel signal, based on the mono signal.


The parameters include a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal.


The parameters, which are extracted in operation 2250, may be produced for each frame, in band units.


Thereafter, the parameters encoded in operation 2250 and the mono signal encoded in operation 2240 are multiplexed together, thereby obtaining a bitstream (operation 2260). Also, in operation 2260, if the phase of the stereo signal is adjusted in operation 2210, the information regarding the adjusted phases, which is encoded in operation 2220, is multiplexed together with the parameters and the mono signal.



FIG. 23 is a flowchart illustrating a method of encoding a stereo signal, according to another embodiment of the present invention.


First, a received stereo signal is transformed into a predetermined domain by using an analysis filterbank (operation 2300). Here, the predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed in the time domain as spectra for each of the sub bands at predetermined frequency units.


Next, it is determined whether the difference between the phase of a left-channel signal and a right-channel signal contained in the stereo signal, which is transformed into the predetermined domain, falls within a predetermined range (operation 2305). This is because the nearer the difference between the phases of the left-channel signal and the right-channel signal approximates 180 degrees, the nearer the sum of the vectors of the left-channel signal and the right-channel signal approximates zero. The predetermined range may be determined based on 180 degrees.


In an embodiment of the present invention, operation 2305 is performed as follows. First, Sn,k is calculated by:











S

n
,
k


=



L

n
,
k


+

R

n
,
k



2


,




(
27
)







wherein Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


Next, Gn,k is calculated by substituting Sn,k into the following:










G

n
,
k


=


2




S

n
,
k









L

n
,
k




+



R

n
,
k










(
28
)







In operation 2305, it is determined whether to adjust the phases of the left-channel signal and the right-channel signal, depending on whether Gn,k is less than 10−3 that is a predetermined threshold.


If Gn,k is less than 10−3, the phases of the left-channel signal and the right-channel signal are determined to be adjusted. If Gn,k is equal to or greater than 10−3, the phases of the left-channel signal and the right-channel signal are determined not to be adjusted.


If it is determined in operation 2305 that the difference between the phases of the left-channel signal and the right-channel signal falls within the predetermined range, the phases of the left-channel signal and the right-channel signal are adjusted by a predetermined phase. This is because the nearer the difference between the phases of the left-channel signal and the right-channel signal approximates 180 degrees, the nearer the sum of the vectors of the left-channel signal and the right-channel signal approximates zero. The predetermined range may be determined based on 180 degrees.


In operation 2310, the phases of the left-channel signal and the right-channel signal are adjusted by the same phase. For example, if the phase of the left-channel signal is adjusted by an angle of θ°, the phase of the right-channel signal is adjusted by an angle of −θ°.


Returning to operation 2305, phase adjustment is performed by transforming Sn,k as follows:











S

n
,
k


=




L

n
,
k








+


R

n
,
k






-





2


,




(
29
)







wherein Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, θ denotes a predetermined value, e.g., π/100, n denotes a frame number, and k denotes a band number.


Next, information regarding the phases adjusted in operation 2310 is encoded (operation 2320). For example, if the phases of the left-channel signal and the right-channel signal are respectively adjusted by the angle of θ° and the angle of −θ° in operation 2310, information regarding the angle of θ° is encoded.


Next, the stereo signal whose phase is adjusted in operation 2310, or the stereo signal transformed into the predetermined domain in operation 2300 is downmixed to the mono signal (operation 2330).


Returning to operation 2305, the final Sn,k is calculated by:











S

n
,
k


=


S

n
,
k







min






{

2
,







L

n
,
k




2

+




R

n
,
k




2



2





S

n
,
k




2





}



,




(
30
)







wherein Sn,k of the right-hand side of Equation (29) denotes a phasor calculated by Equation (29), Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


In operation 2330, the mono signal is produced a mono signal by using Sn,k calculated by Equation (30), as follows:











M

n
,
k


=


S

n
,
k










k
=
1

N






L

n
,
k




2


+




R

n
,
k




2



4





k
=
1

N






S

n
,
k




2







,




(
31
)







wherein Mn,k denotes the mono signal, Sn,k denotes the phasor calculated by Equation (30), Ln,k denotes the left-channel signal, Rn,k denotes the right-channel signal, n denotes a frame number, and k denotes a band number.


The amplitude of the downmixed mono signal may be equal to the average of the amplitudes of a left-channel signal and a right-channel signal, and the mono signal may be generated on a half sum vector of the left-channel signal and the right-channel signal.


Next, an inverse operation of the transformation performed in operation 2300 is performed on the domain of the mono signal downmixed in operation 2330, that is, the domain is inversely transformed using the synthesis filterbank (operation 2340). For example, in operation 2340, inverse transformation is performed so that the mono signal, which is expressed in the time domain for each of the sub bands at predetermined frequency units, can be expressed as a time series only in the time domain.


Next, the mono signal that was inversely transformed in operation 2340 is encoded (operation 2350).


Next, parameters necessary to upmix the mono signal to the stereo signal by a decoding process are extracted from the stereo signal, and the extracted parameters are encoded (operation 2360). The extracted parameters are information for producing the left-channel signal and the right-channel signal, based on the mono signal.


The parameters include a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal.


The parameters, which are extracted in operation 2360, may be produced for each frame, in band units.


Thereafter, the parameters encoded in operation 2360 and the mono signal encoded in operation 2350 are multiplexed together, thereby obtaining a bitstream (operation 2370). Also, in operation 2370, if the phase of the stereo signal is adjusted in operation 2310, the information regarding the adjusted phases, which is encoded in operation 2320, is multiplexed together with the parameters and the mono signal.



FIGS. 24 and 25 are flowcharts illustrating in detail operation 2030, 2140, 2250, or 2306 included in a method of encoding a stereo signal, according to embodiments of the present invention. Operation 2030, 2140, 2250, or 2306 includes operation 2400 and 2410 as illustrated in FIG. 24, but may further include operation 2420 as illustrated in FIG. 25.


First, a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of a mono signal is extracted and encoded (operation 2400).


After operation 2400, a phase parameter that represents the difference between at least one of the left-channel signal and the right-channel signal and the mono signal is extracted and encoded (operation 2420). Alternatively, the phase parameter extracted in operation 2420 may represent the difference between the phases of the left-channel signal and the mono signal, the difference between the phases of the right-channel signal and the mono signal, or the difference among the phases of the left-channel signal and the right-channel signal and the mono signal.


In operation 2420 included in the embodiment illustrated in FIG. 25, an enhancement parameter for enhancing and controlling the phase indicated by the phase parameter using a decorrelated signal that is a vertical vector component of the mono signal is extracted and encoded.


However, the sequence of performing operations 2400 through 2420 is not limited.



FIG. 26 is a flowchart illustrating in detail operation 2400 illustrated in FIG. 24 or 25, according to an embodiment of the present invention.


First, on an assumption that the amplitude of a left-channel signal has a predetermined relation to that of a right-channel signal, a gain is calculated to minimize the difference between the energy levels of an actual stereo signal and a stereo signal that is to be generated from a mono signal by applying the calculated gain, so that an error between the amplitudes of the actual stereo signal and a stereo signal that is to be decoded by a decoding process can be minimized (operation 2600).


The calculated gain is used in determining the amplitude of the left-channel signal and the right-channel signal when the decoding terminal upmixes the mono signal to a stereo signal.


For example, if it is assumed that the predetermined relation between the left-channel signal and the right-channel signal is that the amplitude of the mono signal is equal to the average of the amplitudes of the left-channel signal and the right-channel signal, the left-channel signal and the right-channel signal can be expressed by:

ãn,kL=gman,kM
ãn,kR=(2−gm)an,kM  (32),


wherein ãn,kL denotes the amplitude of the left-channel amplitude to which the gain calculated in operation 2600 is applied, ãn,kR denotes the amplitude of the right-channel signal to which the calculated gain is to be applied, gm denotes the gain used to determine signal amplitude, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


The difference between the energy levels of the actual stereo signal and the stereo signal to which the calculated gain is applied, can be calculated by the following Equation (33) into which Equation (32) has been substituted:













E

n
,
k


L





R


=




n







(



a
~


n
,
k

L

-

a

n
,
k

L


)

2


+



n







(



a
~


n
,
k

R

-

a

n
,

R


)

2










=




n







(



g
m



a

n
,
k

M


-

a

n
,
k

L


)

2


+



n







(



(

2
-

g
m


)



a

n
,
k

M


-

a

n
,
k

R


)

2




,







(
33
)







wherein En,kLR denotes the difference between the energy levels of the actual stereo signal and the stereo signal to which the calculated gain is applied, ãn,kL denotes the amplitude of the left-channel signal to which the calculated gain is applied, ãn,kR denotes the amplitude of the right-channel signal to which the calculated gain is applied, an,kL denotes the amplitude of an actual left-channel signal, an,kR denotes the amplitude of an actual right-channel signal, gm denotes the gain used to calculate the amplitude of a signal, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


Equation (33) into which Equation (32) has been substituted can be expressed with respect to the gain gm, as follows:











g
m

=

1
+





n








k







a

n
,
k

M



a

n
,
k

L




-



n








k







a

n
,
k

M



a

n
,
k

R






2




n








k







(

a

n
,
k

M

)

2







,




(
34
)







wherein gm denotes the gain used to calculate the amplitude of a signal, an,kL denotes the amplitude of the actual left-channel signal, an,kR denotes the amplitude of the actual right-channel signal, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


Thus, in operation 2600, it is possible to calculate the gain that minimizes the difference between the energy levels of the actual stereo signal and the stereo signal to which the gain is applied by substituting the actual left-channel signal amplitude an,kL, the actual right-channel signal amplitude an,kR, and the mono signal amplitude an,kM into Equation (34).


Thereafter, the gain calculated in operation 2600 is encoded (operation 2610).



FIG. 27 is a flowchart illustrating in detail operation 2420 illustrated in FIG. 25, according to an embodiment of the present invention.


First, a phase difference that minimizes the difference between the phases of an actual stereo signal and a stereo signal that is to be generated by applying the phase difference is calculated in order to minimize an error between the phases of the actual stereo signal and a stereo signal that is to be decoded by a decoding terminal, on an assumption that the phase of a left-channel signal has a predetermined relation to the phase of a right-channel signal (operation 2700).


The difference between the energy levels of the actual stereo signal and the stereo signal that is to be generated can be calculated by:

En,kLR=2(an,kR)2[1−cos(φn,kR−φn,kMn,kR)]+2(an,kL)2[1−cos(φn,kM−φn,kLn,kL)]  (35),


wherein En,kLR denotes the difference between the energy levels of the actual stereo signal and the stereo signal that is to be generated, an,kR denotes the amplitude of an actual right-channel signal, an,kL denotes the amplitude of an actual left-channel signal, φn,kR denotes the phase of the actual right-channel signal, φn,kM denotes the phase of a mono signal, φn,kL denotes the phase of the actual left-channel signal, ψn,kR denotes the difference between the phases of the mono signal and the right-channel signal, ψn,kL denotes the difference between the phases of the mono signal and the left-channel signal, n denotes a frame number, and k denotes a band number.


If it is assumed that the difference between the phases of the left-channel signal and the mono signal is equal to the difference between the phases of the right-channel signal and the mono signal in Equation (35), that is, if it is assumed that ψn,kR and ψn,kL has the same value, e.g., ψn,k, Equation (35) can be expressed by:











t






g


(

ψ

n
,
k


)



=





n








k








(

a

n
,
k

R

)

2



sin


(


φ

n
,
k

M

-

φ

n
,
k

R


)





+












n








k








(

a

n
,
k

L

)

2



sin


(


φ

n
,
k

L

-

φ

n
,
k

M


)










n








k








(

a

n
,
k

R

)

2


cos


(


φ

n
,
k

M

-

φ

n
,
k

R


)




+







n








k








(

a

n
,
k

L

)

2



cos


(


φ

n
,
k

L

-

φ

n
,
k

M


)








,




(
36
)







wherein ψn,k denotes the difference between the phases of the mono signal and the stereo signal, an,kR denotes the amplitude of the actual right-channel signal, an,kL denotes the amplitude of the actual left-channel signal, φn,kR denotes the phase of the actual right-channel signal, φn,kM denotes the phase of the mono signal, φn,kL denotes the phase of the actual left-channel signal, n denotes a frame number, and k denotes a band number.


Thus, in operation 2700, the phase difference that minimizes the difference between the energy levels of the actual stereo signal and the stereo signal that is to be generated can be calculated by substituting the actual left-channel signal amplitude an,kL, the actual right-channel signal amplitude an,kR, the actual left-channel signal phase φn,kL, the actual right-channel signal phase φn,kR, and the mono signal phase φn,kM into Equation (36).


Thereafter, the calculated phase difference is encoded (operation 2710).



FIG. 28 is a flowchart illustrating in detail operation 2420 illustrated in FIG. 25, according to another embodiment of the present invention.


First, a second phase for enhancing and controlling a first phase indicated by a phase parameter encoded, is calculated using a decorrelated signal that is a vertical vector component of a mono signal (operation 2800).


For example, in operation 2800, the second phase for enhancing and controlling the first phase can be calculated by:











t






g


(

α
k

)



=

min
[

1
,



2


(





n
=

b
k




b

k
+
1


-
1






(

a

n
,
k

L

)

2



(

1
-

cos


(


φ

n
,
k

L

-

φ

n
,
k

M

-

ψ

n
,
k



)



)



+












n
=

b
k




b

k
+
1


-
1






(

a

n
,
k

R

)

2



(

1
-

cos


(


φ

n
,
k

R

-

φ

n
,
k

M

-

ψ

n
,
k



)



)




)







n
=

b
k




b

k
+
1


-
1





(

a

n
,
k

L

)

2


+




n
=

b
k




b

k
+
1


-
1





(

a

n
,
k

R

)

2






]


,




(
37
)







wherein an,kL denotes the amplitude of an actual left-channel signal, φn,kL denotes the phase of the actual left-channel signal, φn,kM denotes the phase of the mono signal, denotes the difference between the phases of the mono signal and the stereo signal, an,kR denotes the amplitude of an actual right-channel signal, φn,kR denotes the phase of the actual right-channel signal, bk denotes a band border value, n denotes a frame number, and k denotes a band number.


Thus, in operation 2800, the second phase can be calculated by using the actual left-channel signal amplitude an,kL, the actual left-channel signal phase φn,kL, the mono signal phase φn,kM the difference ψn,k between the phases of the mono signal and the stereo signal, the actual right-channel signal amplitude an,kR, and the actual right-channel signal phase φn,kR.


Thereafter, the second phase is encoded (operation 2810).



FIG. 29 is a flowchart illustrating a method of decoding a stereo signal, according to an embodiment of the present invention.


First, a bitstream is received from an encoding terminal, and inversely multiplexed (operation 2900). The bitstream contains parameters necessary to upmix a mono signal generated by an encoding apparatus, and the mono signal encoded by the encoding apparatus.


Next, the inversely multiplexed, encoded mono signal is decoded (operation 2910).


Next, the inversely multiplexed parameters are decoded (in operation 2920). The decoded parameters include a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal. The decoded parameters may be produced for each frame and in band units.


Next, the decoded mono signal is upmixed to a stereo signal by using the decoded parameters, such as the size parameter, the phase parameter, and the enhancement parameter (operation 2930). When the mono signal is upmixed to a stereo signal containing a left-channel signal and a right-channel signal in operation 2930, the amplitudes of the left-channel signal and the right-channel signal are determined using the mono signal according to the size parameter, the phases of the left-channel signal and the right-channel signal are determined using the mono signal according to the phase parameter, and the determined phases of the left-channel signal and the right-channel signal are enhanced and controlled using a decorrelated signal according to the enhancement parameter.


Next, an inverse operation of the transformation performed in operation 2000 illustrated in FIG. 20 is performed, that is, the domain of the stereo signal upmixed in operation 2930 is inversely transformed using the synthesis filterbank (operation 2940). For example, in operation 2940, the mono signal, which is expressed as spectra in the time domain for each of the sub bands at predetermined frequency units, is inversely transformed so that it can be expressed as a time series only in the time domain.



FIG. 30 is a flowchart illustrating a method of decoding a stereo signal, according to another embodiment of the present invention.


First, a bitstream is received from an encoding terminal and inversely multiplexed (operation 3000). The bitstream contains parameters necessary to upmix a mono signal generated by an encoding apparatus, and the mono signal encoded by the encoding apparatus.


Next, the inversely multiplexed, encoded mono signal is decoded (operation 3010).


Next, the decoded mono signal is transformed into a predetermined domain by using an analysis filterbank (operation 3020). The predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed in the time domain as spectra for each of the sub bands at predetermined frequency units.


Next, the inversely multiplexed parameters are decoded (operation 3030). The decoded parameters include a size parameter that represents the ratio of the amplitude of at least one of a left-channel signal and a right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal. The decoded parameters may be produced for each frame and in band units.


Next, the decoded mono signal is upmixed to a stereo signal by using the decoded parameters, such as the size parameter, the phase parameter, and the enhancement parameter (operation 3040). When the mono signal is upmixed to a stereo signal containing a left-channel signal and a right-channel signal in operation 3040, the amplitudes of the left-channel signal and the right-channel signal are determined using the mono signal according to the size parameter, the phases of the left-channel signal and the right-channel signal are determined using the mono signal according to the phase parameter, and the determined phases of the left-channel signal and the right-channel signal are enhanced and controlled using a decorrelated signal according to the enhancement parameter.


Thereafter, an inverse operation of the transformation performed in operation 3020 is performed, that is, the domain of the stereo signal upmixed in operation 3040 is inversely transformed using the synthesis filterbank (operation 3050). For example, in operation 3050, the mono signal, which is expressed in the time domain as spectra for each of the sub bands at predetermined frequency units, is inversely transformed so that it can be expressed as a time series only in the time domain.



FIG. 31 is a flowchart illustrating a method of decoding a stereo signal, according to another embodiment of the present invention.


First, a bitstream is received from an encoding terminal and inversely multiplexed (operation 3100). The bitstream contains parameters necessary to upmix a mono signal generated by an encoding apparatus, and the mono signal encoded by the encoding apparatus. If the encoding apparatus has adjusted the phase of the stereo signal because the difference between the phases of a left-channel signal and a right-channel signal contained in the stereo signal fell within a predetermined range, the bitstream further contains information regarding the phase of the stereo signal, which is adjusted by the encoding apparatus.


Next, the inversely multiplexed, encoded mono signal is decoded (operation 3110).


Next, the inversely multiplexed parameters are decoded (operation 3120). The decoded parameters include a size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal. The decoded parameters may be produced for each frame and in band units.


Next, the decoded mono signal is upmixed to a stereo signal by using the decoded parameters, such as the size parameter, the phase parameter, and the enhancement parameter. When the mono signal is upmixed to a stereo signal containing the left-channel signal and the right-channel signal in operation 3130, the amplitudes of the left-channel signal and the right-channel signal are determined using the mono signal according to the size parameter, the phases of the left-channel signal and the right-channel signal are determined using the mono signal according to the phase parameter, and the determined phases of the left-channel signal and the right-channel signal are enhanced and controlled using a decorrelated signal according to the enhancement parameter.


After operation 3130, it is determined whether the phases of the left-channel signal and the right-channel signal have been adjusted due to the difference between the phases of the left-channel signal and the right-channel signal falling within the predetermined range (operation 3140). In other words, it is determined whether the bitstream being inversely multiplexed in operation 3100 contains the information regarding the adjusted phases.


If it is determined in operation 3140 that the encoding apparatus has adjusted the phases of the left-channel signal and the right-channel signal, the information regarding the adjusted phases is decoded (operation 3145). For example, if the encoding apparatus adjusts the phase of the left-channel signal by an angle of θ° and the phase of the right-channel signal by an angle of −θ°, the information regarding the adjusted phase indicates the angle of θ°.


Next, the phases of the left-channel signal and the right-channel signal of the upmixed stereo signal are respectively adjusted by the adjusted phases (operation 3150).


If the inversely multiplexed bitstream contains the information regarding the adjusted phases, an inverse operation of the transformation performed in operation 2200 illustrated in FIG. 22 is performed, that is, the domain of the stereo signal that is upmixed in operation 3130 or is adjusted in operation 3150 is inversely transformed using the synthesis filterbank (operation 3160). For example, in operation 3160, the mono signal, which is expressed in the time domain as spectra for each of the sub bands at predetermined frequency units, is inversely transformed so that it can be expressed as a time series only in the time domain.



FIG. 32 is a flowchart illustrating a method of decoding a stereo signal, according to another embodiment of the present invention.


First, a bitstream is received from an encoding terminal and inversely multiplexed (operation 3200). The bitstream contains parameters necessary to upmix a mono signal generated by an encoding apparatus, and the mono signal encoded by the encoding apparatus. If the encoding apparatus adjusted the phase of the stereo signal due to the difference between the phases of a left-channel signal and a right-channel signal contained in the stereo signal falling within a predetermined range, the bitstream further contains information regarding the adjusted phase of the stereo signal.


Next, the inversely multiplexed, encoded mono signal is decoded (operation 3210).


Next, the decoded mono signal is transformed into a predetermined domain by using the analysis filterbank (operation 3210). The predetermined domain may have a complex-number format in which both the amplitude and phase of each signal can be expressed. For example, the predetermined domain allows each signal to be expressed in the time domain as spectra for each of the sub bands at predetermined frequency units.


Next, the inversely multiplexed parameters are decoded (operation 3230). The decoded parameters include a size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitude of the mono signal, and a phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal. The parameters may further include an enhancement parameter that contains information for enhancing information contained in the size parameter and the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal. The decoded parameters may be produced for each frame and in band units.


Next, the transformed mono signal is upmixed to a stereo signal by using the decoded parameters, such as the size parameter, the phase parameter, and the enhancement parameter (operation 3240). When the mono signal is upmixed to a stereo signal containing the left-channel signal and the right-channel signal in operation 3240, the amplitudes of the left-channel signal and the right-channel signal are determined using the mono signal according to the size parameter, the phases of the left-channel signal and the right-channel signal are determined using the mono signal according to the phase parameter, and the determined phases of the left-channel signal and the right-channel signal are enhanced and controlled using a decorrelated signal according to the enhancement parameter.


After operation 3240, it is determined whether the encoding apparatus has adjusted the phases of the left-channel signal and the right-channel signal because the difference between the phases of the left-channel signal and the right-channel signal fell within the predetermined range (operation 3250). That is, it is determined whether the inversely multiplexed bitstream contains the information regarding the adjusted phases.


If it is determined in operation 3250 that the encoding apparatus adjusted the phases of the left-channel signal and the right-channel signal, the information regarding the adjusted phases is decoded (operation 3255). For example, if the encoding apparatus adjusted the phase of the left-channel signal by an angle of θ° and the phase of the right-channel signal by an angle of −θ°, the information regarding the adjusted phases indicates the angle of θ°.


Next, the phases of the left-channel signal and the right-channel signal of the upmixed stereo signal are respectively adjusted, by the adjusted phases (operation 3260).


However, the inversely multiplexed bitstream does not contain the information regarding the adjusted phases, the phase adjustment unit 1360 does not adjust the phases of the left-channel signal and the right-channel signal that are upmixed to the stereo signal.


If the inversely multiplexed bitstream contains the information regarding the adjusted phases, an inverse operation of the transformation performed in operation 3220 is performed, that is, the domain of the stereo signal that is upmixed in operation 3240 or whose phase is adjusted in operation 3260 is inversely transformed using the synthesis filterbank (operation 3270). For example, in operation 3270, the mono signal, which is expressed in the time domain as spectra for each of the sub bands at predetermined frequency units, is inversely transformed so that it can be expressed as a time series only in the time domain.



FIGS. 33 and 34 are flowcharts illustrating in detail operation 2920, 3030, 3120, or 3230 included in a method of decoding a stereo signal, according to embodiments of the present invention. Operation 2920, 3030, 3120, or 3230 includes operation 3300 and operation 3320 as illustrated in FIG. 33, but may further include operation 3320 as illustrated in FIG. 34.


First, the size parameter that represents the ratio of the amplitude of at least one of the left-channel signal and the right-channel signal to the amplitude of the mono signal is decoded (operation 3300).


After operation 3300, the phase parameter that represents the difference between the phases of at least one of the left-channel signal and the right-channel signal and the mono signal is decoded (operation 3310).


In operation 3320 included in the embodiment illustrated in FIG. 34, the enhancement parameter for enhancing and controlling the phase indicated by the phase parameter by using a decorrelated signal that is a vertical vector component of the mono signal, is decoded.


However, the sequence of performing operations 3300 through 3320 is not limited.



FIG. 35 is a flowchart illustrating in detail operation 2930, 3040, 3130 or 3240 included in a method of decoding a stereo signal, according to an embodiment of the present invention.


First, the amplitudes of the left-channel signal and the right-channel signal are calculated based on the amplitude of the mono signal, using the size parameter decoded in operation 3300 illustrated in FIG. 33 or 34 (operation 3500). Here, the size parameter refers to a gain that an encoding apparatus calculates to minimize the difference between the energy levels of an actual signal and a stereo signal to which the gain is to be applied, in order to minimize an error between the amplitudes of the actual stereo signal and a stereo signal that is to be decoded by a decoding terminal.


If it is assumed that the relation between the left-channel signal and the right-channel signal is predetermined so that the amplitude of the mono signal can be equal to the average of the amplitudes of the left-channel signal and the right-channel signal, the amplitudes of the left-channel signal and the right-channel signal can be calculated by:

ãn,kL=gman,kM
ãn,kR=(2−gm)an,kM  (38),


wherein ãn,kL and ãn,kR respectively denote the amplitudes of the left-channel signal and the right-channel signal calculated in operation 3500, gm denotes the gain, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


Next, the phases of the left-channel signal and the right-channel signal are calculated using the phase parameter decoded in operation 3310 illustrated in FIG. 33 or 34, based on the phase of the mono signal (operation 3510). Here, the phase parameter is a phase difference ψn,k calculated so that the difference between the energy levels of the actual stereo signal and the stereo signal to which the calculated phase difference is to be applied can be minimized in order to minimize an error between the phases of the actual stereo signal and a stereo signal that is to be decoded by a decoding apparatus.


If the phase parameter is the phase difference ψn,k on an assumption that both the encoding apparatus and the decoding apparatus predetermine that the phase between the left-channel signal and the mono signal is equal to the phase between the right-channel signal and the mono signal, the phase of the left-channel signal is calculated by adding ψn,k to the phase of the mono signal and the phase of the right-channel signal is calculated by subtracting ψn,k from the phase of the mono signal in operation 3510.


Thereafter, the stereo signal is produced by generating the left-channel signal and the right-channel signal, based on the amplitudes of the left-channel signal and the right-channel signal, which are calculated in operation 3500, and the phases of the left-channel signal and the right-channel signal which are calculated in operation 3510 (operation 3520).



FIG. 36 is a flowchart illustrating in detail operation 2930, 3040, 3130 or 3240 illustrated in FIG. 35 by using the graph illustrated in FIG. 18.


First, the amplitudes of a left-channel signal {tilde over (L)}n,k and a right-channel signal {tilde over (R)}n,k are determined by applying the gain gm, based on a mono signal Mn,k (operation 3600).


Next, the phases of the left-channel signal {tilde over (L)}n,k and the right-channel signal {tilde over (R)}n,k are determined by applying the phase difference θ, that is, by respectively rotating the mono signal Mn,k by an angle of θ° and an angle of −θ° (operation 3610).


Then, the left-channel signal {tilde over (L)}n,k and the right-channel signal {tilde over (R)}n,k are produced using the amplitudes of the left-channel signal and the right-channel signal that are calculated in operation 3600 and the phases of the left-channel signal and the right-channel signal that are calculated in operation 3610 (operation 3620).



FIG. 37 is a flowchart illustrating in detail operation 2930, 3040, 3130 or 3240 included in a method of decoding a stereo signal, according to another embodiment of the present invention.


First, the amplitudes of the left-channel signal and the right-channel signal are calculated based on the amplitude of the mono signal, using the size parameter decoded in operation 3300 illustrated in FIG. 33 or 34 (operation 3700). Here, the size parameter refers to a gain that an encoding apparatus calculates to minimize the difference between the energy levels of an actual signal and a stereo signal to which the gain is to be applied, in order to minimize an error between the amplitudes of the actual stereo signal and a stereo signal that is to be decoded by a decoding terminal.


If it is assumed that the relation between the left-channel signal and the right-channel signal is predetermined so that the amplitude of the mono signal can be equal to the average of the amplitudes of the left-channel signal and the right-channel signal, the amplitudes of the left-channel signal and the right-channel signal can be calculated by:

ãn,kL=gman,kM
ãn,kR=(2−gm)an,kM  (39),


wherein ãn,kL and ãn,kR respectively denote the amplitudes of the left-channel signal and the right-channel signal that are calculated in operation 3700, gm denotes the gain, an,kM denotes the amplitude of the mono signal, n denotes a frame number, and k denotes a band number.


Next, the phases of the left-channel signal and the right-channel signal are calculated using the phase parameter decoded in operation 3310 illustrated in FIG. 33 or 34, based on the phase of the mono signal (operation 3710). Here, the phase parameter is a phase difference ψn,k calculated so that the difference between the energy levels of the actual stereo signal and the stereo signal to which the calculated phase difference is to be applied can be minimized in order to minimize an error between the phases of the actual stereo signal and a stereo signal that is to be decoded by a decoding apparatus.


If the phase parameter is the phase difference ψn,k on an assumption that both the encoding apparatus and the decoding apparatus have determined that the difference between the phases of the left-channel signal and the mono signal is equal to the difference between the phases of the right-channel signal and the mono signal, the phase of the left-channel signal is calculated by adding ψn,k to the phase of the mono signal and the phase of the right-channel signal is calculated by subtracting ψn,k from the phase of the mono signal in operation 3710.


Thereafter, a decorrelator produces a decorrelated signal that is a vertical vector component of the mono signal (operation 3720).


Next, the left-channel signal and the right-channel signal are adjusted by enhancing the phases of the left-channel signal and the right-channel signal that are calculated in operation 3710, based on the decorrelated signal and the mono signal by using the enhancement parameter decoded in operation 3320 illustrated in FIG. 33 (operation 730). If it is assumed that the enhancement parameter is αm calculated by the encoding apparatus, it is possible to adjust the left-channel signal by using Equation (40) and the right-channel signal by using Equation (41), as follows:














L
^


n
,
k


=




L
~


n
,
k




cos


(

α
m

)



+


g
m







n
,
k





D

n
,
k




sin


(

α
m

)











=



g
m



M

n
,
k








n
,
k





cos


(

α
m

)



+


g
m







n
,
k





D

n
,
k




sin


(

α
m

)





,







(
40
)







wherein {circumflex over (L)}n,k denotes the left-channel signal adjusted in operation 3730, {tilde over (L)}n,k denotes the left-channel signal obtained by applying the amplitude and phase of the left-channel signal that are respectively calculated in operations 3700 and 3710, gm denotes the gain, ψn,k denotes a phase difference indicated by the phase parameter, ψn,k denotes the amplitude of the decorrelated signal, αm denotes the phase indicated by the enhancement parameter, and Mn,k denotes the amplitude of the mono signal.














R
^


n
,
k


=






R
~


n
,
k




cos


(

α
m

)



-


(

2
-

g
m


)





-



n
,
k






D

n
,
k




sin


(

α
m

)










=





(

2
-

g
m


)



M

n
,
k






-



n
,
k






cos


(

α
m

)



-












(

2
-

g
m


)





-



n
,
k






D

n
,
k




sin


(

α
m

)



,








(
41
)







wherein {circumflex over (R)}n,k denotes the right-channel signal adjusted in operation 3730, {tilde over (R)}n,k denotes a right-channel signal obtained by applying the amplitude and phase of the right-channel signal that are respectively calculated in operations 3700 and 3710, gm denotes the gain, ψn,k denotes the phase difference indicated by phase parameter, Dn,k denotes the amplitude of the decorrelated signal, αm denotes the phase indicated by the enhancement parameter, and Mn,k denotes the amplitude of the mono signal.


Then, the stereo signal is produced by generating the left-channel signal and the right-channel signal, based on the amplitudes of the left-channel signal and the right-channel signal that are calculated in operation 3700, the phases of the left-channel signal and the right-channel signal that are calculated in operation 3710, and the phases of the left-channel signal and the right-channel signal that are adjusted in operation 3730 (operation 3740).



FIG. 38 is a flowchart illustrating in detail operation 2930, 3040, 3130 or 3240 illustrated in FIG. 37 by using the graph illustrated in FIG. 18.


First, the amplitudes of the left-channel signal and the right-channel signal are calculated by applying the gain gm, based on the mono signal Mn,k (operation 3800).


Next, the phases of the left-channel signal and the right-channel signal are calculated by applying the phase difference 8, that is, by respectively rotating the mono signal Mn,k by an angle of θ° and an angle of −θ° (operation 3810).


Next, the first left-channel signal {tilde over (L)}n,k and the first right-channel signal {tilde over (R)}n,k are produced using the amplitudes of the left-channel signal and the right-channel signal that are calculated in operation 3800 and the phases of the left-channel signal and the right-channel signal that are calculated in operation 3810 (operation 3820).


Next, a second left-channel signal Ln,k is produced by adjusting the amplitude of the first left-channel signal {tilde over (L)}n,k to |{tilde over (L)}n,k|cos(αm), and a second right-channel signal Rn,k is produced by adjusting the amplitude of the first right-channel signal {tilde over (R)}n,k to |{tilde over (R)}n,k|cos(αm) (operation 3830).


Next, a second decorrelated signal Dn,kL is produced by rotating the first decorrelated signal Dn,k by the phase difference ψn,k, a third left-channel signal Ln,k′ is produced by adjusting the amplitude of the second left-channel signal Ln,k to |{tilde over (L)}n,k|sin(αm)(=Mngm sin(αm)=gm|Mn|sin(αm)=gm|Dn|sin(αm)), and then, a third right-channel signal Rn,k′ is produced similarly (operation 3840).


Thereafter, a fourth left-channel signal {circumflex over (L)}n,k is produced by combining the second left-channel signal Ln,k and the third left-channel signal Ln,k′, and a fourth right-channel signal {circumflex over (R)}n,k is produced by combining the second right-channel signal Rn,k and the third right-channel signal Rn,k′ (operation 3850).


As illustrated in FIG. 19, in operation 2930, 3040, 3130 or 3240, the mono signal Mn,k is received, the decorrelated signal Dn,k is produced by the decorrelator 1720, and then, the left-channel signal {circumflex over (L)}n,k and the right-channel signal {circumflex over (R)}n,k are produced based on the mono signal Mn,k and the decorrelated signal Dn,k by using the gain gm represented by the size parameter, the phase difference ψn,k represented by the phase parameter, and the phase αm represented by the enhancement parameter.


The method, illustrated in FIG. 19, of generating a left-channel signal and a right-channel signal can be simply expressed by:













[





L
^


n
,
k








R
^


n
,
k





]

=




[








n
,
k





0




0





-



n
,
k







]



[




g
m



0




0



(

2
-

g
m


)




]












[




cos


(

α
m

)





sin


(

α
m

)







cos


(

a
m

)





-

sin


(

α
m

)






]



[




M

n
,
k







D

n
,
k





]








=



[





g
m







n
,
k





cos


(

α
m

)







g
m







n
,
k





sin


(

α
m

)









(

2
-

g
m


)





-



n
,
k






cos


(

α
m

)







-

(

2
-

g
m


)






-



n
,
k






sin


(

α
m

)






]











[




M

n
,
k







D

n
,
k





]

,








(
42
)







wherein {circumflex over (L)}n,k denotes the finally generated left-channel signal, {circumflex over (R)}n,k denotes the finally generated right-channel signal, ψn,k denotes the phase difference represented by the phase parameter, gm denotes the gain, αm denotes the phase represented by the enhancement parameter, Mn,k denotes the mono signal, and Dn,k denotes the decorrelated signal.


According to Equation (42), first, rotation transformation is performed on the mono signal Mn,k and the decorrelated signal Dn,k, size transformation is performed, and then, phase adjustment is performed, but the present invention is not limited thereto.


A method and apparatus for encoding and decoding a stereo signal according to the present invention have been described above with reference to FIGS. 1 through 38. Those of ordinary skill in the art may easily derive from FIGS. 1 through 38 a method and apparatus for encoding a multi-channel signal by downmixing three or more signals to one or less than the number of signals and encoding the downmixed signal(s), and a method and apparatus for decoding a multi-channel signal by upmixing one or more signals to three or more signals and decoding the upmixed signals.


The present invention can be embodied as code that can be read by a computer system (any device capable of processing information) in a computer readable medium. Here, the computer readable medium may be any recording apparatus capable of storing data that is read by the computer system, e.g., a read-only memory (ROM), a random access memory (RAM), a compact disc (CD)-ROM, a magnetic tape, a floppy disk, an optical data storage device, and so on.


In a method and apparatus for encoding and decoding a stereo signal and a multi-channel signal according to the present invention, a stereo signal or a multi-channel signal can be encoded or decoded by producing parameters based on a mono signal.


While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the present invention as defined by the following claims.

Claims
  • 1. A method of generating a stereo signal, the method comprising: receiving a down-mixed mono signal;decoding the down-mixed mono signal;decoding a parameter that represents an energy difference between channels of the stereo signal;decoding a parameter that represents a phase difference between channels of the stereo signal; andup-mixing the decoded down-mixed mono signal by using the decoded parameters, to generate the stereo signal.
  • 2. The method of claim 1, wherein the upmixing is performed by using a decorrelated signal.
  • 3. A method of generating a stereo signal, the method comprising: receiving a down-mixed mono signal;decoding the down-mixed mono signal;decoding a parameter that represents an energy difference between channels of the stereo signal;decoding a parameter that represents a phase difference between channels of the stereo signal;generating a parameter representing a phase difference between one of the stereo signal and the down-mixed mono signal; andup-mixing the decoded down-mixed mono signal by using the decoded parameters and the generated parameter, to generate the stereo signal.
  • 4. The method of claim 2, wherein the upmixing is performed by using a decorrelated signal.
Priority Claims (1)
Number Date Country Kind
2007-37165 Apr 2007 KR national
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a divisional application of U.S. Ser. No. 11/876,947, filed Oct. 23, 2007, now U.S. Pat. No. 8,111,829, which claims the benefit of Korean Patent Application No. 10-2007-0037165, filed on Apr. 16, 2007, in the Korean Intellectual Property Office, the disclosures of which are incorporated herein in their entirety by reference.

US Referenced Citations (4)
Number Name Date Kind
8498422 Oomen et al. Jul 2013 B2
20070255572 Miyasaka et al. Nov 2007 A1
20080091419 Yoshida et al. Apr 2008 A1
20090171676 Oh et al. Jul 2009 A1
Non-Patent Literature Citations (1)
Entry
Korean Office Action dated Aug. 20, 2013 issued in KR Application No. 10-2007-0037165.
Related Publications (1)
Number Date Country
20120134501 A1 May 2012 US
Divisions (1)
Number Date Country
Parent 11876947 Oct 2007 US
Child 13366455 US