The present invention contains subject matter related to Japanese Patent Application JP 2006-136876 filed in the Japanese Patent Office on May 16, 2006, the entire contents of which are incorporated herein by reference.
1. Field of the Invention
The present invention relates to a band analysis apparatus and method for performing, using a filter bank, band analysis of each of a plurality of pictures forming a moving image and dividing each of the pictures into a plurality of subbands, to a band synthesis apparatus and method for performing, using a filter bank, band synthesis of each of a plurality of pictures divided into a plurality of subbands, to an image encoding apparatus and method for performing, using a filter bank, band analysis of each of a plurality of pictures forming a moving image and encoding each of the pictures that has been subjected to band analysis to generate an encoded code-stream, to an image decoding apparatus and method for decoding an encoded code-stream and performing, using a filter bank, band synthesis of the decoded code-stream to reconstruct a moving image, to a program, and to a recording medium.
2. Description of the Related Art
As a typical method for compressing images, a Joint Photographic Experts Group (JPEG) method, which is standardized by the International Organization for Standardization (ISO), is available. The JPEG method uses discrete cosine transform (DCT) and provides excellent encoded images and decoded images at a relatively high bit rate. However, when the encoding bit rate is reduced to a predetermined value or less, block noise, which is specific to DCT transform, is significantly increased. Thus, deterioration becomes conspicuous from a subjective point of view.
In recent years, research and development of methods for dividing an image into a plurality of subbands using a filter bank, which is a combination of a low-pass filter and a high-pass filter, and performing encoding of each of the plurality of subbands has been actively conducted. In such circumstances, wavelet-transform encoding has been regarded as a new promising technique that will take the place of DCT transform since wavelet-transform encoding does not have a disadvantage that block noise becomes conspicuous at high compression, unlike DCT transform.
The JPEG 2000, for which international standardization was completed in January 2001, adopts a method in which the above-mentioned wavelet transform and high-efficiency entropy coding (bit modeling and arithmetic coding for each bit-plane) are combined together. The JPEG 2000 achieves a significant improvement in encoding efficiency, compared with any other JPEG method.
For example, a technique described in Japanese Unexamined Patent Application Publication No. 2001-197499 has been suggested.
Basically, the JPEG 2000 is a standard for encoding static images. Application of the JPEG-2000 technology to satellite images, map images, images for identification photographs, and the like has been expected. The Motion JPEG 2000, which encodes each of a plurality of pictures forming a moving image in accordance with the JPEG 2000, has been standardized as Part3 of the JPEG 2000 standard.
However, in order to encode moving images, such as video signals, using the JPEG 2000 technique, it is necessary to encode each of a plurality of continuously input pictures in real time. In particular, in wavelet transform used in the JPEG 2000, in order to improve compression efficiency, subband division of each of a plurality of pictures is generally performed until a desired division level is reached. Thus, it is necessary to complete analysis filtering of the current picture at the final division level before the next picture is input.
Not only the JPEG 2000 method but also other image compression methods for dividing each of a plurality pictures forming a moving image into a plurality of subbands in accordance with wavelet transform and performing encoding of each of the plurality of subbands are performed taking into consideration the above-mentioned condition.
For dedicated hardware, this condition can be satisfied by increasing the number of processing clocks of the hardware to increase the operation speed of wavelet transform. However, increasing the number of processing clocks causes an increase in power consumption. In addition, since the number of processing clocks of programmable hardware, such as a field programmable gate array (FPGA) or a programmable logic device (PLD), is small, such programmable hardware does not satisfy the condition.
Accordingly, it is desirable to provide a band analysis apparatus and method for performing wavelet transform of a moving image signal in real time, a band synthesis apparatus and method for performing inverse wavelet transform of a moving image in real time, an image encoding apparatus and method for performing encoding while performing wavelet transform of a moving image signal in real time, an image decoding apparatus and method for performing decoding while performing inverse wavelet transform of a moving image signal in real time, a program, and a recording medium.
An image processing apparatus according to an embodiment of the present invention includes input means for inputting image data; and analysis filtering means for generating coefficient data of a plurality of subbands by performing analysis filtering of the image data input by the input means for each predetermined number of lines from an upper-end line to a lower-end line such that analysis filtering of a lower-end line of the current picture is completed before analysis filtering of an upper-end line of the next picture starts.
An image processing apparatus according to another embodiment of the present invention includes input means for inputting coefficient data generated by performing filtering of image data for each predetermined number of lines from an upper-end line to a lower-end line; and synthesis filtering means for generating the image data by performing vertical and horizontal synthesis filtering of the coefficient data input by the input means for each predetermined number of lines from an upper-end line to a lower-end line of each of a plurality of subbands such that synthesis filtering of a lower-end line of the current picture is completed before synthesis filtering of an upper-end line of the next picture starts.
Accordingly, since wavelet transform or inverse wavelet transform of the current picture is completed before wavelet transform or inverse wavelet transform of the next picture starts, wavelet transform or inverse wavelet transform of a moving image signal can be performed in real time.
Embodiments of the present invention will be described with reference to the drawings.
A band analysis apparatus according to a first embodiment that performs, using an analysis filter bank, band analysis of an input video signal to divide the video signal into a plurality of subbands will be described.
The image line input unit 11 receives a video signal D10 for each line, and supplies a data stream D11 for the image line to the line buffer unit 12.
Video signals are normally defined by a standard. For example, currently, television broadcasting is performed in accordance with a National Television Standards Committee (NTSC) system. In addition, a high definition television (HDTV) system is standardized as a standard number “SMPTE 274M” by the Society of Motion Picture and Television Engineers (SMPTE), which is a standard-setting organization in the United States. In the description below, the HDTV system (a resolution of 1920×1080) will be described as an example.
As described above, concerning a video signal, vertical blank signals are disposed before and after actual data.
Since the band analysis apparatus 10 performs wavelet transform in units of pictures (fields/frames) forming a video signal, it is necessary to detect an end point of a picture and to reset an operation of analysis filtering. Thus, the image line input unit 11 detects the end point of the picture by detecting a vertical blank signal for the video signal.
The line buffer unit 12 stores and holds data streams D11 for individual lines. The line buffer unit 12 continues to store and hold data streams D11 until data streams D11 for the number of lines (N lines) necessary for vertical filtering are stored, as shown in
The vertical analysis filter unit 13 sequentially reads line data D12 for N lines, and performs vertical low-pass analysis filtering and vertical high-pass analysis filtering. Due to the vertical filtering, a low-frequency component (L) and a high-frequency component (H) D13, which are obtained by vertical division, are generated, as shown in
Immediately after the number of samples in a horizontal direction of low-frequency and high-frequency components D13 reaches M necessary for horizontal filtering, the horizontal analysis filter unit 14 performs horizontal low-pass analysis filtering and horizontal high-pass analysis filtering. Due to the horizontal filtering, a low-frequency component (1LL) D14 and high-frequency components (1HL, 1LH, and 1HH) D15, which are obtained by horizontal division, are generated, as shown in
As a result of analysis filtering at division level 1, the horizontal analysis filter unit 14 generates the low-frequency component (1LL) D14 and the high-frequency components (1HL, 1LH, and 1HH) D15, as described above.
In wavelet transform, normally, a low-frequency component is hierarchically divided until a desired division level is reached. Thus, in the first embodiment, the low-frequency component (1LL) D14 is supplied to the line buffer unit 12 so as to be further divided by an analysis filter bank. Immediately after the number of lines necessary for vertical analysis filtering is buffered in the line buffer unit 12, analysis filtering at division level 2 is performed. A low-frequency component is repeatedly divided as described above since most of the energy of an image signal is concentrated in the low-frequency component.
In the analysis filtering at division level 2, the vertical analysis filter unit 13 sequentially reads line data D12 for N/2 lines, and performs vertical low-pass analysis filtering and vertical high-pass analysis filtering, as shown in
In order to further increase the division level, analysis filtering can be repeatedly performed for a low-frequency component.
The number N of lines of each of subbands stored and held in the line buffer unit 12 becomes twice every time the division level is decreased by 1. Thus, as shown in
As the most general arithmetic method of the above-mentioned analysis filtering, a method called convolutional operation is available. The convolutional operation is the most fundamental procedure for achieving a digital filter. As the convolutional operation, convolution multiplication of a filter tap coefficient by actual input data is performed. However, the convolutional operation generates a problem in which the calculation load increases as the tap length increases.
As a technique for solving the above-mentioned problem, a lifting technique for wavelet transform described in W. Sweldens, “The Lifting Scheme: A Custom-design Construction of Biorthogonal Wavelets”, Appl. Comput. Harmon. Anal., vol. 3, no. 2, pp. 186-200, 1996 is known.
Referring to
Due to analysis filtering in which the lifting technique is applied to the 9×7-analysis filter, high-frequency components are obtained by the processing of step S3 and low-frequency components are obtained by the processing of step S4. The processing of steps S1 to S4 is expressed using the following equations:
Step S1: di1=di0+α(si0+si+10)
Step S2: si1=si0+β(di−11+di1)
Step S3: di2=di1+γ(si1+si+11)
Step S4: si2=si1+δ(di−12+di2)
Since analysis at a division level can be performed by analysis filtering using the lifting structure shown in
In the description below, for example, in a display device or the like, scanning is started from the pixel in the upper-left corner of the screen. When scanning from the leftmost pixel to the rightmost pixel in a line is completed, a line is formed. When scanning from the uppermost line to the lowest line is completed, a screen is formed.
In the analysis filtering at division level 1, components are generated in the order of a high-frequency component (1), a low-frequency component (2), a high-frequency component (3), a low-frequency component (4), etc. in a direction from the top to the bottom in
As is clear from
As described above, in wavelet transform, in general, subband division of a picture is performed until a desired division level is reached. However, a timing at which a high-frequency component or a low-frequency component is generated is delayed by a factor of two as the division level increases, as described above. Thus, when wavelet transform is performed for a video signal D10, wavelet transform of the current picture may not be completed within the vertical blank period shown in
In order to solve the above-mentioned problem, in analysis filtering of the current picture at each division level, the band analysis apparatus 10 according to the first embodiment advances the timing of analysis filtering of a lower-end line, as shown in
A method for advancing the timing of analysis filtering of a lower-end line of the current picture at each division level is described next.
In analysis filtering of the lower-end line at division level 1, components (10), (11), and (12) are generated in that order, and then, a high-frequency component (13) and a low-frequency component (14) are generated, as shown in
Arrows pointing from the current picture to pixels in the blank period indicate that samples are expanded symmetrically. “Symmetric expansion” means that a supplementary sample is provided from an image area to a portion where a sample does not actually exist by symmetric expansion when analysis filtering is performed at a boundary between pictures or subbands. Thus, the provided supplementary sample and the original sample are in a mirror-image relationship with respect to each other. As shown in
Although analysis filtering at division level 4 is completed within the blank period in
In order to solve the above-mentioned problem, the band analysis apparatus 10 according to the first embodiment advances the timing of symmetric expansion at the lower-end line, and thus advances the timing of analysis filtering of the lower-end line of the current picture. That is, as shown in
Accordingly, even when the number of division levels is larger, the band analysis apparatus 10 is capable of completing analysis filtering of the current picture until the final division level before starting analysis filtering of the upper-end line of the next picture.
An image encoding apparatus according to a second embodiment that compresses and encodes coefficient data generated by wavelet transform will be described.
The analysis filter bank 21 has a configuration similar to the band analysis apparatus 10 shown in
The quantization unit 22 performs quantization by dividing the coefficient data D21 generated by the analysis filter bank 21 by, for example, a quantization step size, and generates quantized coefficient data D22.
The entropy-coding unit 23 performs source encoding of the quantized coefficient data D22 generated by the quantization unit 22, and generates a compressed encoded code-stream D23. As source encoding, for example, Huffman coding adopted in the JPEG and the Moving Picture Experts Group (MPEG) or high-precision arithmetic coding adopted in the JPEG 2000 can be used.
The rate controller 24 performs control so as to achieve a desired bit rate or compression rate. After performing rate control, the rate controller 24 outputs an encoded code-stream D24 whose rate has been controlled. For example, in order to achieve a higher bit rate, the rate controller 24 transmits to the quantization unit 22 a control signal D25 for decreasing the quantization step size. In contrast, in order to achieve a lower bit rate, the rate controller 24 transmits to the quantization unit 22 a control signal D25 for increasing the quantization step size.
A band synthesis apparatus according to a third embodiment that corresponds to the band analysis apparatus 10 according to the first embodiment will be described.
The column buffer unit 31 stores and holds a low-frequency component D30 and a high-frequency component D31 for each column. The column buffer unit 31 continues to store and hold low-frequency components D30 and high-frequency components D31 until low-frequency components D30 and high-frequency components D31 for M samples are stored. A low-frequency component D30 only for a lowest-frequency subband is input to the column buffer unit 31. Then, low-frequency components D35 generated by synthesis filtering are supplied from the vertical synthesis filter unit 34.
The horizontal synthesis filter unit 32 sequentially reads column data D32 for M samples, and performs horizontal low-pass synthesis filtering and horizontal high-pass synthesis filtering. Due to the horizontal filtering, low-frequency and high-frequency components D33, which are obtained by horizontal synthesis, are generated.
The line buffer unit 33 stores and holds low-frequency and high-frequency components D33, which are obtained by horizontal synthesis, for individual lines, and continues to store and hold low-frequency and high-frequency components D33 until low-frequency and high-frequency components D33 for N lines are stored.
The vertical synthesis filter unit 34 sequentially reads line data D34 for N lines, and performs vertical low-pass synthesis filtering and vertical high-pass synthesis filtering. Due to the vertical filtering, a low-frequency component D35, which is obtained by vertical synthesis, is generated. The low-frequency component D35 is supplied to the column buffer unit 31, and stored and held in the column buffer unit 31 until synthesis filtering at the next division level is performed.
In inverse wavelet transform, synthesis filtering is performed in a direction, for example, from division level 4 to division level 1, which is opposite to the direction of wavelet transform. By repeatedly performing processing for generating a low-frequency signal at a division level lower by one than the previous level from the low-frequency component D35 and the high-frequency component D31, image data stream is generated. The generated image data stream is supplied to the vertical blank signal insertion unit 35.
As shown in
A lifting technique can also be applied to synthesis filtering. However, in synthesis filtering adopting a lifting structure, the timing at which a high-frequency component or a low-frequency component is generated is delayed by a factor of two as the division level increases. Thus, inverse wavelet transform of the current picture may not be completed within the vertical blank period shown in
In order to solve the above-mentioned problem, the band synthesis apparatus 30 according to the third embodiment advances the timing of synthesis filtering of the lower-end line of the current picture at each division level, as shown in
A method for advancing the timing of synthesis filtering of the lower-end line of the current picture at each division level is described next.
In synthesis filtering of the lower-end line at division level 4, components (40), (41), and (42) are generated in that order, and then, a high-frequency component (43) and a low-frequency component (44) are generated, as shown in
As shown in
In order to solve the above-mentioned problem, the band synthesis apparatus 30 according to the third embodiment advances the timing of symmetric expansion processing at the lower-end line, and thus advances the timing of synthesis filtering of the lower-end line of the current picture. That is, as shown in
In addition, since the band synthesis apparatus 30 according to the third embodiment delays the timing of synthesis filtering of the next picture at predetermined one or more division levels, inverse wavelet transform of the current picture is prevented from temporally overlapping with inverse wavelet transform of the next picture, as shown in
Thus, even when the number of division levels is larger, the band synthesis apparatus 30 is capable of completing synthesis filtering of the current picture until division level 1 before starting synthesis filtering of the upper-end line of the next picture.
If temporal overlapping between inverse wavelet transform of the current picture and inverse wavelet transform of the next picture is prevented only by advancing the timing of symmetric expansion at the lower-end line, the timing of synthesis filtering of the next picture at a predetermined division level may not be delayed as shown in
An image decoding apparatus according to a fourth embodiment that corresponds to the image encoding apparatus 20 according to the second embodiment will be described.
The entropy-decoding unit 41 performs source decoding of a received encoded code-stream D40, and generates quantized coefficient data D41. As source decoding, Huffman decoding or high-efficiency arithmetic decoding can be used, as described above.
The dequantization unit 42 performs dequantization by multiplying the quantized coefficient data D41 by a quantization step size, and generates coefficient data D42. Normally, the quantization step size is described in the header of an encoded code-stream.
The synthesis filter bank 43 has a configuration similar to the band synthesis apparatus 30 shown in
The present invention is not limited to any of the first to fourth embodiments described above. Various changes and modification can be made to the present invention without departing from the spirit and scope of the present invention.
For example, although a case where the band analysis apparatus 10 according to the first embodiment performs horizontal filtering after performing vertical filtering has been described, the band analysis apparatus 10 may perform vertical filtering after performing horizontal filtering.
In the band analysis apparatus 50, an image line input unit 51 receives a video signal D50 for each line, and supplies a data stream D51 for the image line to a column buffer unit 52. The column buffer unit 52 stores and holds data streams D51 for individual columns, and continues to store and hold data streams D51 until data streams D51 for M samples are stored. A horizontal analysis filter unit 53 sequentially reads column data D52 for M samples, and performs horizontal low-pass analysis filtering and horizontal high-pass analysis filtering. Due to the horizontal filtering, low-frequency and high-frequency components D53, which are obtained by horizontal division, are generated. Immediately after the number of lines of the low-frequency and high-frequency components D53 reaches N, a vertical analysis filter unit 54 performs vertical low-pass analysis filtering and vertical high-pass analysis filtering. Due to the vertical filtering, a low-frequency component (1LL) D54 and high-frequency components (1HL, 1LH, and 1HH) D55, which are obtained by vertical division, are generated. The low-frequency component (1LL) D54 is supplied to the column buffer unit 52 to be subjected to analysis filtering at level 2.
As described above, a subband that is generated when horizontal filtering is performed after vertical filtering is performed is the same as a subband that is generated when vertical filtering is performed after horizontal filtering is performed.
Although hardware configurations have been described in the foregoing embodiments, a series of processing may be performed by software. In this case, a program constituting the software may be incorporated in advance in dedicated hardware of a computer, such as a read-only memory (ROM) or a hard disk, or installed from a network or a recording medium on a general-purpose personal computer capable of performing various functions by installing various programs. As the recording medium, for example, a package medium including a magnetic disk (flexible disk), an optical disk, such as compact disk-read only memory (CD-ROM) or a digital versatile disc (DVD), a magnetic optical disk, such as mini-disk (MD) (trademark), or a semiconductor memory can be used.
It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and alterations may occur depending on design requirements and other factors insofar as they are within the scope of the appended claims or the equivalents thereof.
Number | Date | Country | Kind |
---|---|---|---|
2006-136876 | May 2006 | JP | national |
Number | Name | Date | Kind |
---|---|---|---|
5917943 | Washizawa | Jun 1999 | A |
5999656 | Zandi et al. | Dec 1999 | A |
6091777 | Guetz et al. | Jul 2000 | A |
6898323 | Schwartz et al. | May 2005 | B2 |
6898325 | Gormish | May 2005 | B2 |
7072520 | Schwartz et al. | Jul 2006 | B2 |
Number | Date | Country |
---|---|---|
2001-258034 | Sep 2001 | JP |
2002-91943 | Mar 2002 | JP |
2002-304624 | Oct 2002 | JP |
2003-23632 | Jan 2003 | JP |
2004-56632 | Feb 2004 | JP |
2004-135070 | Apr 2004 | JP |
2004-336404 | Nov 2004 | JP |
2005-94212 | Apr 2005 | JP |
Number | Date | Country | |
---|---|---|---|
20070269122 A1 | Nov 2007 | US |