The invention relates to an apparatus and a method for encoding an input signal and in particular an audio signal.
In a conventional audio digital signal communication system a digital signal is compressed at an encoder and the compressed information, i.e. bit stream, is then packetized and sent to a decoder through a communication channel frame by frame. An entity comprising an encoder and a decoder is also called a codec unit. The term “coding” comprises the encoding and the corresponding decoding. Audio compression may be used to reduce the number of bits that represent the respective audio signal, thereby reducing the bandwidth and/or bit rate needed for data transmission through the communication channel. However, audio compression of a signal may result in perceived quality degradation of the decompressed signal on the receiver side. In general, transmission of data with a higher bit rate results in a decoder signal of higher quality while transmission of a signal with a lower bit rate results in a decoded signal of lower quality.
In a conventional audio digital signal communication system a classification of the input signal is often performed to improve the coding efficiency. One possible way to classify the input signal is to distinguish between stationary signals and transient signals. A transient is generally defined as a sudden change in the input signal which occurs for a short period of time. A transient signal is usually characterized by a fast change in signal power and amplitude. This fast change can represent for example a so-called attack by instruments such as drums or simply the start of a talk spurt. A transient usually contains a high degree of non-periodic components and a higher magnitude of high frequency content. In general, transients are difficult to encode and some of the conventional signal coding models are not well adapted to this kind of signals. Accordingly, for encoding a transient signal a codec may adapt the coding model to the characteristics of such an input signal.
The invention provides according to a first aspect an apparatus for encoding an input signal applied to said apparatus, the apparatus comprising:
a transient detector adapted to detect whether the applied input signal comprises a transient;
at least two transient signal encoders adapted to encode the applied input signal if a transient is detected by said transient detector; and
a selection unit adapted to select a transient signal encoder among the at least two transient signal encoders according to at least one predetermined selection criterion.
In a first implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the transient detector is adapted to detect a transient position of a transient of the input signal applied to said apparatus.
In a further possible second implementation of the apparatus according to the first aspect of the present invention the transient detector is adapted to detect a transient energy of a transient of the input signal applied to said apparatus.
In a further possible third implementation of the apparatus according to the first aspect of the present invention the transient detector is adapted to detect a transient position and a transient energy of a transient of the input signal applied to said apparatus.
In a further possible fourth implementation of the first to third implementation of the apparatus according to the first aspect of the present invention a selection unit is adapted to perform a closed loop selection of said transient signal encoder.
In an alternative fifth implementation of the first to third implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the selection unit is adapted to perform an open loop selection of said transient signal encoder.
In a possible sixth implementation of the fourth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the selection unit is provided for performing a closed loop selection by selecting the transient signal encoder within a codec unit comprising the transient signal encoder and a corresponding transient signal decoder which supplies the selection unit with the decoded transient signal having the highest signal quality.
In a further possible seventh implementation of the fifth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the selection unit is provided for performing an open loop selection by selecting the transient signal encoder among the at least two transient signal encoders depending on the transient position of the applied input signal detected by said transient detector.
In a further possible eighth implementation of the fifth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the selection unit is provided for performing an open loop selection by selecting the transient signal encoder among the transient signal encoders depending on the transient energy of the applied input signal detected by said transient detector.
In a further possible ninth implementation of the fifth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the selection unit is provided for performing an open loop selection by selecting the transient signal encoder among the at least two transient signal encoders depending on the transient position and the transient energy of the applied input signal detected by said transient detector.
In a possible tenth implementation of the first to ninth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the at least two transient signal encoders are adapted to perform the encoding of the input signal by using different transient encoding mechanisms or algorithms.
In a possible eleventh implementation of the tenth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the different transient encoding mechanisms used by the at least two transient signal encoders comprise window switching, short window DCT or are based on transient parameters, sinusoidal parameters and time/frequency envelope extraction.
In a further possible twelfth implementation of the first to eleventh implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the at least one selection criterion used by said selection unit comprises a signal to noise ratio SNR.
In a further possible thirteenth implementation of the first to eleventh implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the at least one selection criterion used by said selection unit comprises a segmental signal to noise ratio SSNR.
In a further possible fourteenth implementation of the first to eleventh implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the at least one selection criterion used by said selection unit comprises a weighted segmental signal to noise ratio WSSNR.
In a further possible fifteenth implementation of the first to eleventh implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the at least one selection criterion used by said selection unit comprises a pre-echo signal energy.
In a further possible sixteenth implementation of the fourteenth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the selection unit is adapted to calculate the weighted segmental signal to noise ratio WSSNR using a weighting function stored in a local memory of said apparatus.
In a further possible seventeenth implementation of the sixteenth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the weighting function applies different weighting factors before and after the detected transient position detected by said transient detector.
In a possible eighteenth implementation of the first to seventeenth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the apparatus comprises a non-transient signal encoder adapted to encode the input signal if the transient detector does not detect a transient in said input signal. The non-transient signal encoder can be a stationary signal encoder.
In a possible nineteenth implementation of the first to eighteenth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the transient detector generates a transient indication, for example a transient flag, indicating whether a transient has been detected by said transient detector or not.
In a possible twentieth implementation of the first to nineteenth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the selection unit is adapted to control a switch to select a transient signal encoder among the at least two transient signal encoders according to the at least one predetermined selection criterion.
In a further possible twenty-first implementation of the nineteenth or twentieth implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the apparatus further comprises a multiplexer adapted to multiplex the encoded transient signal output by the selected transient signal encoder or the encoded non-transient signal, and the transient indication, for example a transient flag, generated by said transient detector onto a communication channel.
In a possible twenty-second implementation of the first to twenty-first implementation of the apparatus for encoding an input signal according to the first aspect of the present invention the selection unit generates a transient type indication, for example a transient type indicator, indicating the type of the selected transient signal encoder or codec.
In a possible twenty-third implementation of the twenty-second implementation of the apparatus for encoding an input signal according to the first aspect of the present invention, the apparatus further comprises a multiplexer adapted to multiplex the encoded signal, encoded by one of the at least two transient encoders, and the transient type indication associated to the transient encoder selected for encoding the input signal onto a communication channel or into an output bitstream or datastream.
The invention further provides according to a second aspect a method for encoding an input signal, the method comprising the steps of:
detecting whether the input signal comprises a transient; and
selecting a transient signal encoder among a group of transient signal encoders according to at least one predetermined selection criterion if a transient is detected in said input signal.
In a possible first implementation of the method for encoding an input signal according to the second aspect of the present invention the selecting of the transient signal encoder is performed in a closed loop selection.
In a further possible second implementation of the method for encoding an input signal according to the second aspect of the present invention the selecting of the transient signal decoder is performed as an open loop selection of said transient signal encoder.
According to a third aspect, the invention provides an apparatus for decoding, the apparatus comprising:
a receiver adapted to receive an encoded transient signal and a transient type indication, wherein the transient type indication indicates a type of a transient coding;
at least two transient signal decoders; and
a selection unit to select a transient signal decoder among the at least two transient signal decoders based on the transient type indication to decode the encoded transient signal.
According to a fourth aspect, the invention provides an apparatus for decoding, the apparatus comprising:
a receiver adapted to receive an encoded signal, a transient indication and a transient type indication, wherein the transient indication indicates, whether the encoded signal is a transient signal or a non-transient signal, and wherein the transient type indication indicates a type of transient coding incase the transient indication indicates that the encoded signal is a transient signal;
at least two transient signal decoders; and
a selection unit to select a transient signal decoder among the at least two transient signal decoders based on the transient type indication to decode the encoded transient signal in case the transient indication indicates that the encoded signal is a transient signal.
In a first possible implementation of the apparatus for decoding according to the fourth aspect, the apparatus further comprises:
a non-transient decoder;
wherein the selection unit is further adapted to select the non-transient decoder to decode the encoded signal in case the transient indication indicates that the encoded signal is a non-transient signal.
According to a fifth aspect of the invention, a method for decoding is provided, the method comprising:
receiving an encoded signal, a transient indication and a transient type indication, wherein the transient indication indicates, whether the encoded signal is a transient signal or a non-transient signal, and wherein the transient type indication indicates a type of transient coding in case the transient indication indicates that the encoded signal is a transient signal; and
selecting a transient signal decoder among at least two transient signal decoders based on the transient type indicator to decode the encoded transient signal in case the transient indication indicates that the encoded signal is a transient signal.
The transient indication and the transient type indication of any of the previous aspects and their implementation forms can be separate or joint indications. In case of separate indications the transient indication can be a transient flag and the transient type indication can be a separate transient type flag or transient type indicator comprising several bits to distinguish the different transient codecs. In case of a joint indication the transient indication and the transient type indication, or the related information, are combined into one joint transient indicator or joint transient indicator field, wherein a first value of the joint transient indicator indicates that the encoded signal is a non-transient signal, a second value indicates that the encoded signal is a transient signal encoded according to a first transient codec, a third value indicates that the encoded signal is a transient signal encoded according to a second transient codec, and so on.
In the following possible implementations and embodiments of the apparatus and method for encoding an input signal according to the first and second aspect of the present invention are described with reference to the enclosed figures.
The selection unit 2 is further adapted to output an transient type indicator (TTI) to the multiplexing unit 17, wherein the transient type indicator indicates the type of the selected transient encoding, for example, by comprising the index i of the codec 10-i as shown in
As can be seen in
The at least two transient signal encoders 7-i of the apparatus 1 are adapted to perform the encoding of the signal input 2 of the apparatus 1 by using different transient encoding mechanisms. These transient encoding mechanisms can comprise window switching, a short window DCT or can be based on transient parameters, sinusoidal parameter and time/frequency envelope extraction.
The selection unit 12 can use one or more selection criteria to select a transient signal encoder 7-i among the group of transient signal encoders. A possible selection criterion used by the selection unit 12 can be a signal to noise ratio SNR. A further possible selection criterion used by the selection unit 12 can be a segmental signal to noise ratio SSNR. A further possible selection criterion used by the selection unit 12 can be a weighted segmental signal to noise ratio or a pre-echo signal energy. In these cases, the decoded transient signal having the highest signal quality or the best quality will be defined as the decoded transient signal having the highest signal to noise ratio or segmental signal to noise ratio or the lowest pre-echo signal energy, which indicates a better reconstruction of the original signal with less quantization noise.
In a possible implementation the transient detector 3 of the apparatus 1 can be adapted to detect whether the input signal is a transient audio signal or not. If the input signal is classified by the transient detector 3 as comprising a transient, two or more transient encoding mechanisms as well as their corresponding local decoding mechanisms can be applied to the input signal and the quality of the decoded signals is compared by the selection unit 12 according to at least one predetermined quality selection criterion. Based on this selection criterion one of the encoding mechanisms as employed by the transient signal encoders 7-i is selected. The selection can be based on the signal to noise ratio SNR, the weighted signal to noise ratio or other objective quality measurement criteria. The codec unit 10-i providing the signal with the best quality is selected by the selection unit 12 and its parameters are used to generate the encoded signal. As can be seen in
In a possible implementation the selection criterion used by the selection unit 12 can be based on a segmental SNR in a current encoded frame, with more weight on the signal before the transient. In this implementation the position of the transient can be detected by the transient detector 3 as well. In this implementation the detected position of the transient is detected and sent to a quality evaluation and comparison module within the selection unit 12 to adapt the quality selection criterion. The segmental signal to noise ratio SSNR can be defined as an average of a signal to noise ratio SNR values over a plurality of signal segments L as follows:
wherein L is the number of segments, M is the number of samples per segments and s represents the input signal and n the quantization noise.
In a possible implementation in order to better adapt the objective selection criterion to a specific signal characteristic of a transient a different weight can be applied before and after the transient position as follows:
with w1+w2=2 and w1>w2, wherein M1 is the position of the transient in the current frame.
A temporal masking effect in the auditory system can affect the perception of the quantization noise before the transient. During a short period before the actual transient position a noise will not be perceived if its energy stays below a certain threshold. This property can also be used as a selection criterion. In a possible implementation the current frame can be decomposed in several segments for the computation of the segmental signal to noise ratio SSNR, wherein a different weight can be applied according to a pre-masking as follows:
where L1 is the position of the start of the pre-masking effect,
L2 is the position of the detected transient.
In a further possible embodiment of the apparatus 1 according to the first aspect of the present invention the selection unit 12 as shown in
In a further possible embodiment of the apparatus 1 for encoding an input signal according to the first aspect of the present invention as shown in the block diagram of
In a possible embodiment different selection mechanisms can be implemented in the selection unit 12 and the desired selection mechanism can be activated in response to a corresponding control signal. In this implementation the apparatus 1 can be switched between different operation modes within the selection unit 12 using different selection criteria and also different selection mechanisms depending on an operation mode of the apparatus 1. The closed loop selection approach used by the implementation shown in
In a further alternative embodiment of the apparatus 1 for encoding an input signal according to the first aspect of the present invention the selection unit 12 is adapted to perform an open loop selection of a transient signal encoder 7-i of a plurality N of different transient signal encoders 7-i, wherein i=1 . . . N.
The implementation shown in
As described with regards to
In a still further possible implementation the open and closed loop selection as shown in
In a first step S1 it is detected whether an input signal comprises a transient or not. This detection can for example be performed by a transient detector 3 as shown in
In a further step S2 a transient signal encoder 7-i among a group of transient signal encoders is selected according to at least one predetermined selection criterion if a transient is detected in the input signal in step S1. In a possible implementation of the method according to the second aspect of the present invention as shown in
As can be seen in the implementation of
In a possible embodiment a transient signal encoder 7-i within the group of N transient signal encoders 7-i performs window switching.
Although window switching is an efficient transient signal encoding mechanism it can be less efficient than other mechanisms under certain conditions and situations. By using window switching the perceptual model and lossless coding modules of the codec unit have to support different time resolutions which translate usually into increased complexity of the apparatus 1. In addition, when using lapped transforms such as the MDCT in order to satisfy the construction constraints window switching needs to insert transition windows between short and long blocks as illustrated in
Another possible transient signal encoder 7-i or transient signal encoding mechanism which can be used by the apparatus 1 according to the present invention can be based on transient modeling, sinusoidal parameter and time/frequency envelope extraction as illustrated in
The number and types of the transient encoders 7-i as employed by the apparatus 1 are not limited. They can be chosen among different encoding models such as transform coding, transient, sinusoidal coding, CELP, etc.
The transient encoders 7-i do ensure a reconstruction with the adjacent frames. In a possible implementation the selected transient encoder is signaled to the decoder in the bit stream with the corresponding quantized parameters. In this implementation the multiplexing unit 17 does also multiplex an indication or an identifier of the selected transient encoder into the bit stream supplied to the decoder via the communication channel 19.
An embodiment of an apparatus for decoding, or a decoder, receives the bit stream comprising the encoded signal, the transient flag TF and the transient type indicator TTI. Accordingly, an embodiment of the decoder comprises a receiver, at least two transient signal decoders and a selection unit. The receiver is adapted to receive the encoded signal, the transient indication and the transient type indication, wherein the transient indication indicates, whether the encoded signal is a transient signal or a non-transient signal, and wherein the transient type indication indicates a type of transient coding in case the transient indication indicates that the encoded signal is a transient signal. The selection unit is adapted to select a transient decoder among the at least two transient decoders based on the transient type indicator to decode the encoded transient signal in case the transient indication indicates that the encoded signal is a transient signal.
Different methods for encoding an input signal can be used for encoding an audio input signal. The apparatus and method allow codec units to provide an efficient reduction of pre-echo artifacts in case of transient audio signals and do improve the overall signal quality for this type of signal. The apparatus 1 as shown in the implementation of
In a possible implementation the transient detector 3 detects a transient within the applied input signal by performing a high-pass filtering of the input signal for example by means of a first order IIR filter. The high-pass filtered signal can then be sectioned into sub-frames and the energy for each sub-frame can be computed. For each sub-frame a comparison between a short term energy and a long term energy can be performed. A transient of a signal is then detected by the transient detector 3 whenever the energy ratio between the short term energy and the long term energy is above a certain threshold.
Although embodiments of the invention have been described primarily based on transient flags and transient type indicators being separate indicators, see for example
Furthermore, although embodiments have been described, wherein the encoded signal, the transition indication and the transition type indication are multiplexed into one bit stream, other embodiments of the encoder can be adapted to send these data via different channels, e.g. to the decoder or to a storage medium, and other embodiments of the decoder can be adapted to receive these data via different channels, e.g. from an encoder or from a storage medium.
This application is a continuation of International Application No. PCT/EP2011/071021, filed on Nov. 25, 2011, which is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/EP2011/071021 | Nov 2011 | US |
Child | 14285976 | US |