TEMPORAL NOISE SHAPING

TECHNICAL FIELD

Examples herein relate to encoding and decoding apparatus, in particular for performing temporal noise shaping (TNS).

KNOWN TECHNOLOGY

The following documents are in the known technology:

[1] Herre, Jurgen, and James D. Johnston. “Enhancing the performance of perceptual audio coders by using temporal noise shaping (TNS).” Audio Engineering Society Convention 101. Audio Engineering Society, 1996.
[2] Herre, Jurgen, and James D. Johnston. “Continuously signal-adaptive filterbank for high-quality perceptual audio coding.” Applications of Signal Processing to Audio and Acoustics, 1997. 1997 IEEE ASSP Workshop on. IEEE, 1997.
[3] Herre, Jurgen. “Temporal noise shaping, quantization and coding methods in perceptual audio coding: A tutorial introduction.” Audio Engineering Society Conference: 17^thInternational Conference: High-Quality Audio Coding. Audio Engineering Society, 1999.
[4] Herre, Juergen Heinrich. “Perceptual noise shaping in the time domain via LPC prediction in the frequency domain.” U.S. Pat. No. 5,781,888. 14 Jul. 1998.
[5] Herre, Juergen Heinrich. “Enhanced joint stereo coding method using temporal envelope shaping.” U.S. Pat. No. 5,812,971. 22 Sep. 1998.
[6] 3GPP TS 26.403; General audio codec audio processing functions; Enhanced aacPlus general audio codec; Encoder specification; Advanced Audio Coding (AAC) part.
[7] ISO/IEC 14496-3:2001; Information technology—Coding of audio-visual objects—Part 3: Audio.
[8] 3GPP TS 26.445; Codec for Enhanced Voice Services (EVS); Detailed algorithmic description.

Temporal Noise Shaping (TNS) is a tool for transform-based audio coders that was developed in the 90s (conference papers [1-3] and patents [4-5]). Since then, it has been integrated in major audio coding standards such as MPEG-2 AAC, MPEG-4 AAC, 3GPP E-AAC-Plus, MPEG-D USAC, 3GPP EVS, MPEG-H 3D Audio.

TNS can be briefly described as follows. At the encoder-side and before quantization, a signal is filtered in the frequency domain (FD) using linear prediction, LP, in order to flatten the signal in the time-domain. At the decoder-side and after inverse quantization, the signal is filtered back in the frequency-domain using the inverse prediction filter, in order to shape the quantization noise in the time-domain such that it is masked by the signal.

TNS is effective at reducing the so-called pre-echo artefact on signals containing sharp attacks such as e.g. castanets. It is also helpful for signals containing pseudo stationary series of impulse-like signals such as e.g. speech.

TNS is generally used in an audio coder operating at relatively high bitrate. When used in an audio coder operating at low bitrate, TNS can sometimes introduce artefacts, degrading the quality of the audio coder. These artefacts are click-like or noise-like and appear in most of the cases with speech signals or tonal music signals.

Examples in the present document permit to suppress or reduce the impairments of TNS maintaining its advantages.

Several examples below permit to obtain an improved TNS for low-bitrate audio coding.

SUMMARY

According to an embodiment, an encoder apparatus may have: a temporal noise shaping, TNS, tool for performing linear prediction, LP, filtering on an information signal including a plurality of frames; and a controller configured to control the TNS tool so that the TNS tool performs LP filtering with: a first filter whose impulse response has a higher energy; and a second filter whose impulse response has a lower energy, wherein the second filter is not an identity filter, wherein the controller is configured to choose between filtering with the first filter and filtering with the second filter on the basis of a frame metrics, wherein the controller is further configured to: modify the first filter so as to acquire the second filter in which the filter's impulse response energy is reduced.

According to another embodiment, a method for performing temporal noise shaping, TNS, filtering on an information signal including a plurality of frames may have the steps of: for each frame, choosing between filtering with a first filter and filtering with a second filter, whose impulse response has a lower energy, on the basis of a frame metrics, wherein the second filter is not an identity filter; filtering the frame using the filtering with the filtering chosen between filtering with the first filter and filtering with the second filter; and modify the first filter so as to acquire the second filter in which the filter's impulse response energy is reduced.

Another embodiment may have a non-transitory digital storage medium having a computer program stored thereon to perform the method for performing temporal noise shaping, TNS, filtering on an information signal including a plurality of frames, the method having the steps of: for each frame, choosing between filtering with a first filter and filtering with a second filter, whose impulse response has a lower energy, on the basis of a frame metrics, wherein the second filter is not an identity filter; filtering the frame using the filtering with the filtering chosen between filtering with the first filter and filtering with the second filter; and modify the first filter so as to acquire the second filter in which the filter's impulse response energy is reduced, when said computer program is run by a computer.

In accordance with examples, there is provided an encoder apparatus comprising:

- a temporal noise shaping, TNS, tool for performing linear prediction, LP, filtering on an information signal including a plurality of frames; and
- a controller configured to control the TNS tool so that the TNS tool performs LP filtering with:
  - a first filter whose impulse response has a higher energy; and
  - a second filter whose impulse response has a lower energy than the impulse response of the first filter, wherein the second filter is not an identity filter,
- wherein the controller is configured to choose between filtering with the first filter and filtering with the second filter on the basis of a frame metrics.

It has been noted that it is possible to remove artefacts on problematic frames while minimally affecting the other frames.

Instead of simply turning on/off the TNS operations, it is possible to maintain the advantages of the TNS tool while reducing its impairments. Therefore, an intelligent real-time feedback-based control is therefore obtained by simply reducing filtering where needed instead of avoiding it.