Communication terminal

Description

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a structure of a communication terminal provided with a coding function for coding a moving picture.

FIG. 2 is a block diagram showing a structure for coding a moving picture.

FIG. 3 shows an inter prediction processing according to the present invention.

FIG. 4 shows an intra prediction processing according to the present invention.

FIG. 5 shows an intra prediction processing according to the present invention.

FIG. 6 shows a coding processing on a chrominance differential signal according to the present invention.

DETAILED DESCRIPTION OF THE INVENTION

Hereinafter, an embodiment of the invention will be described with reference to the drawings.

FIG. 1 shows an exemplary structure of a communication terminal 10 which has a function to execute a moving picture coding process according to the present invention. The communication terminal 10 is, for example, a cellular phone, a personal computer, or the like. If the communication terminal 10 is a cellular phone or a personal computer, the communication terminal 10 includes a display, a key input unit and the like in addition to the structure shown in FIG. 1. However, since these features are not necessary for the description of the invention, they are not shown in FIG. 1.

In FIG. 1, the communication terminal 10 includes a picture input part 11 such as a camera, a coding part 12 to perform a moving picture coding process on the moving picture inputted from the picture input part 11, and a transmission part 13 to transmit the coded moving picture, which has been coded by the coding part 12, to a network, a storage, or the like. The coding part 12 includes a control part 121 to control a coding process, and a processing part 122 to perform the coding process for the inputted moving picture in accordance with the control of the control part 121.

As shown in FIG. 2, which is a block diagram showing an exemplary structure of the processing part 122, the processing part 122 includes a subtracting part 20, a DCT (Discrete Cosine Transform)/quantization part 21, an entropy coding part 22, an inverse quantization/inverse DCT part 23, an adding part 24, a deblocking filter 25, a frame memory 26, an inter prediction part 27, an intra prediction part 28, and a switching part 29.

The subtracting part 20 performs an operation to obtain a difference signal between a predictive signal outputted from the switching part 29 and a picture frame signal obtained from a moving picture inputted through the picture input part 11 by subtracting the predictive signal from the picture frame signal. The obtained difference is outputted as a difference signal for subsequent processing.

The DCT/quantization part 21 performs a DCT processing on the difference signal outputted from the subtracting part 20 to obtain DCT coefficients, and quantizes the DCT coefficients to obtain quantized coefficients.

The entropy coding part 22 performs an entropy coding processing on the quantized coefficients outputted from the DCT/quantization part 21. As the entropy coding, for example, a context adaptive variable length coding principle, or a context adaptive binary arithmetic coding principle is used.

Also, the quantized coefficients outputted from the DCT/quantization part 21 are inputted to the inverse quantization/inverse DCT part 23. In the inverse quantization/inverse DCT part 23, inverse quantization processing is performed on the inputted quantized coefficients to restore the DCT coefficients, and an inverse DCT processing is performed on the DCT coefficients to restore the difference signal.

The adding part 24 uses the difference signal outputted from the inverse quantization/inverse DCT part 23 and the same predictive signal as inputted to the subtracting part 20 by the switching part 29, and restores the picture frame signal. The restored picture frame signal is stored in the frame memory 26 via the deblocking filter 25, and is used for subsequent inter prediction.

The deblocking filter 25 performs a deblocking process on the restored picture frame signal output from the adding part 24, and performs a process to reduce distortion that has occurred between the blocks that serve as units of the coding process.

The inter prediction part 27 obtains an inter prediction evaluation value representing compression efficiency in inter prediction. The inter prediction evaluation value is calculated from the input signal (picture frame signal) from the input part 11 and the recovered past picture frame signal stored in the frame memory 26. The inter prediction part 27 also outputs, as an inter prediction signal, a partial picture frame signal extracted from the picture frame signal stored in the frame memory 26.

The intra prediction part 28 compresses a picture frame signal by using a correlation between blocks constituting the inputted one picture frame signal, and outputs an intra prediction signal obtained from a block to be coded and a block adjacent to the block to be coded, and an intra prediction evaluation value representing compression efficiency in this intra prediction.

In accordance with an instruction from the control part 121, the switching part 29 selects one of the inter coding and the intra coding to be used for coding the block divided from the picture frame signal as the input signal.

In the intra coding of H.264/AVC, there are three types of coding modes, that is, 4×4 intra prediction coding to perform coding on a luminance signal in 4×4 pixel units, 16×16 intra prediction coding to perform coding on the luminance signal in 16×16 pixel units, and intra prediction coding on a color-differential signal. Thus, counting the inter coding as well, there are four coding processes (coding modes) in total.

According to the present invention, in accordance with each of the coding modes, the discrete cosine transform and the quantization process are adaptively controlled, so that the loads of these processes are reduced without degradation of picture quality. Hereinafter, the discrete cosine transform and the quantization process executed according to the invention by the processing part 122 controlled by the control part 121 will be described.

First, the control part 121 performs a judgment to identify a coding mode. Then the control part 121 determines the content of the discrete cosine transform and the quantization according to the result of the judgment of the coding mode, which corresponds to one of: the inter prediction coding mode; the intra 4×4 prediction coding mode on the luminance; the intra 16×16 prediction coding mode on the luminance; and the intra prediction coding mode on the color difference, and the control part 121 issues an instruction to the processing part 122.

First, the case where the control part 121 has selected the inter prediction coding mode will be described with reference to FIG. 3.

As shown in FIG. 3, when the inter prediction coding mode is selected, the subtracting part 20 is controlled by the control part 121, and receives the inter prediction signal outputted from the inter prediction part 27 through the switching part 29 and the signal obtained by dividing the input signal to obtain a difference signal by subtracting the inter prediction signal from the signal obtained by dividing the input signal, and generates a difference picture (S11). When this difference picture is obtained, the control part 121 calculates an SAD (Sum of Absolute Difference) value representing a prediction error for each block including 4×4 pixels from the difference picture (S12).

Subsequently, when the calculation of the SAD value is ended, that is, when the prediction error is obtained, the control part 121 compares this prediction error with a predetermined threshold, and performs a judgment as to whether or not the prediction error is less than the threshold (S13). As a result of this judgment, in the case where the prediction error is less than the threshold, the DCT/quantization part 21 is controlled and zero values are substituted for all quantized coefficients (S14).

When the substitution process of the zero values is completed, the control part 121 checks whether or not a block index corresponding to the block on which the processing has been performed indicates a maximum value (S17), and in the case where the index indicates the maximum value, a CBP (coded_block_pattern) value is calculated for each block including 8×8 pixels, that is, for each block including four blocks each of which is the calculation object of the SAD value and includes 4×4 pixels (S18), and the inter prediction coding is ended.

Incidentally, the CBP value is information indicating whether or not there is a coefficient of an alternating current (AC) component of the quantized coefficient obtained by the quantization process, and/or a coefficient of a direct current (DC) component. When the CBP value is 0, it indicates that there is no coefficient in both the AC and DC components; when the CBP value is 1, it indicates that there is only the coefficient of the DC component; and when the CBP value is 2, it indicates that there is only the coefficient of the AC component or that there are coefficients of both the AC and DC components.

On the other hand, in the case where the block index is not the maximum value, process returns to S13, and the processes according to steps S13-S17 are repeated for a next block including 4×4 pixels until the block index reaches the maximum value.

At step S13, if the predictive error is equal to or exceeds the threshold, the control part 121 controls the DCT/quantization part 21 and causes the DCT process with integer precision to be performed in a unit of a block including 4×4 pixels (S15), and further causes the quantization process to be performed on the coefficient obtained by the DCT process with integer precision (S16).

Thereafter, as described above, the confirmation as to whether or not the block index indicates the maximum value (S17), and the calculation of the CBP (S18) are performed. When the above process is completed, the quantized coefficient outputted from the DCT/quantization part 21 is subjected to a variable length coding by the entropy coding part 22, and is outputted to the network or the like through the transmission part 13.

Next, the case where the control part 121 has selected the intra 4×4 prediction coding mode will be described with reference to FIG. 4.

As shown in FIG. 4, it is recognized that the mode is the intra 4×4 prediction mode, a prediction picture of a block including 4×4 pixels is generated based on the control by the control part 121 (S21). Here, the prediction picture of the block is generated by predicting based on pixels in a block existing around the block and located at the boundary position of the block (that is, using at least one block adjacent to the block to be coded).

When the generation of the predictive picture of the block is ended, the subtracting part 20 generates a difference picture by subtracting the prediction picture from the block to be coded included in the input signal (S22). When the difference picture is generated by the subtracting part 20, the control part 121 calculates an SAD value representing a predictive error from this difference picture (S23).

Subsequently, when the predictive error is obtained, the control part 121 compares this predictive error with a predetermined threshold (S24), and in the case where the predictive error is less than the threshold, the DCT process and the quantization process are not performed, and it is confirmed whether or not the block index corresponding to the block on which the processing has been performed indicates a maximum value (S28). In the case where the index indicates the maximum value, the processing is ended.

On the other hand, in the case where the block index is not the maximum value, the process returns to S21, and the processes according to S21-S28 are repeated for a next block including 4×4 pixels until the block index reaches the maximum value.

At step S24, if the predictive error is equal to or exceeds the threshold, the control part 121 controls the DCT/quantization part 21 to perform the DCT process on the block including 4×4 pixels (S25), and causes the quantization process to be performed on the coefficient obtained by this DCT process with integer precision (S26). The control part 121 calculates a CBP value based on the result of the processing performed in the DCT/quantization process part 21 (S27).

Thereafter, in a similar manner to the process described above, the control part 121 checks whether the block index according to the block on which the processing has been performed indicates a maximum value. In the case where the block index is not the maximum value, the process returns to S21, and the processes according to S21-S28 are repeated for a next block including 4×4 pixels until the block index reaches the maximum value. On the other hand, in the case where the index indicates the maximum value, the processing is ended.

When the above described processing is completed, the quantized coefficient outputted from the DCT/quantization part 21 is subjected to the variable length coding by the entropy coding part 22, and is outputted to the network or the like through the transmission part 13.

Next, the case where the control part 121 has selected the intra 16×16 prediction coding mode will be described with reference to FIG. 5.

As shown in FIG. 5, when it is recognized that the mode is the intra 16×16 prediction coding mode, based on the control by the control part 121, the intra prediction part 28 generates a prediction picture of a block including 16×16 pixels (S21). Here, similarly, the predictive picture is generated by predicting each pixel in the block to be coded by using pixels in a block existing around this block and located at the boundary position of the block (that is, using at least one block adjacent to the block to be coded).

Then, by controlling the subtracting part 20, a difference signal is obtained by subtracting the created predictive picture from the block which is obtained by dividing the input signal and which is 16×16 pixels, and a difference picture is generated (S32). When the difference picture is obtained, the control part 121 divides the difference picture including 16×16 pixels into blocks each including 4×4 pixels, and calculates an SAD value representing a predictive error in a unit of a block of 4×4 pixels (S33).

Subsequently, the control part 121 compares the calculated predictive error with a predetermined threshold (S34), and in the case where the predictive error is less than the threshold, the discrete cosine transform to obtain only a DC component is performed on the difference picture of 4×4 pixels (S35), and a zero value is substituted for an AC component (S36).

Incidentally, the above described discrete cosine transform to obtain only the DC component can be readily executed compared to the usual discrete cosine transform which obtains a DC component and an AC component.

On the other hand, in the case where the predictive error is equal to or exceeds the threshold, the DCT/quantization part 21 is controlled to perform the discrete cosine transform with integer precision on the difference picture including 4×4 pixels (S37), and the quantization process is performed on only the AC component obtained by this discrete cosine transform (S38).

When the processing of step S36 or step S38 is ended, the control part 121 confirms whether a block index indicated by the block of 4×4 pixels to be coded is a maximum value (S39). In the case where the index has not reached the maximum value, the process returns to step S34, and the same processing is performed on a next difference picture of 4×4 pixels.

In the case where the block index reaches the maximum value, the quantization process is performed on only the coefficient of the DC component (S40), and then, a CBP value is calculated and the processing is ended.

Incidentally, before the quantization process on the coefficient of the DC component is performed, an orthogonal transform such as a Hadamard transform is performed.

When the above processing is completed, the quantized coefficient outputted from the DCT/quantization part 21 is subjected to the variable length coding in the entropy coding part 22, and is outputted to the network or the like through the transmission part 13.

Next, the case where the control part 121 has selected that the coding processing is performed on a color-differential signal will be described with reference to FIG. 6.

As shown in FIG. 6, when the control part 121 selects that the coding is performed on the color-differential signal, the subtracting part 20 generates a difference picture by subtracting the predictive picture which is generated by the intra prediction part 28 from a block to be coded contained in an input signal (S51). Incidentally, in the case where the color-differential signal is encoded, since it is performed in a unit of a block including 8×8 pixels, the block as the coding object includes 8×8 pixels.

When the difference picture is obtained, the control part 121 divides the difference picture into blocks each including 4×4 pixels, calculates an SAD value representing a predictive error for each of the divided blocks (S52), and compares the predictive error with a predetermined threshold (S53).

As a result of this comparison, in the case where the predictive error is less than the threshold, the discrete cosine transform to obtain only a DC component is performed on the difference picture of 4×4 pixels (S54), and a zero value is substituted for an AC component (S55).

On the other hand, in the case where the predictive error is equal to or exceeds the threshold, the DCT/quantization part 21 is controlled to perform the discrete cosine transform with integer precision on the difference picture including 4×4 pixels (S56), and the quantization process is performed only on the AC component obtained by this discrete cosine transform (S57).

When the processing of step S55 or step S57 is ended, the control part 121 confirms whether the block index corresponding to the block of 4×4 pixels to be coded is a maximum value (S58). In the case where it is not the maximum value, the process returns to step S53, and the same processing is performed on a next difference picture of 4×4 pixels.

In the case where the block index is the maximum value, the quantization processing is performed on only the coefficient of the DC component (S59), and then, a CBP value is calculated and the processing is ended.

Incidentally, before the quantization process on the coefficient of the DC component is performed, an orthogonal transform such as a Hadamard transform is performed.

When the processing is completed, the quantized coefficient outputted from the DCT/quantization part 21 is subjected to the variable length coding by the entropy coding part 22, and is outputted to the network or the like through the transmission part 13.

As stated above, the discrete cosine transform and the quantization process are changed according to the coding mode, so that the processing load in the discrete cosine transform and the quantization process can be reduced while the degradation of picture quality is suppressed.

In addition, in the intra 16×16 prediction mode and in the mode of coding the color-differential signal, since the coefficient of the direct current component is liable to remain, it is also a feature that the quantization on the coefficient of the direct current component is performed.

Incidentally, in the comparison between the predictive error and the threshold, the judgment is performed on the basis of “predictive error<threshold” as described above; however, a judgment can be performed as to whether “predictive error>threshold” instead. That is, for example, in FIG. 3 at step S13 processing proceeds to step S14 if the prediction error is greater than or equal to the threshold, while processing proceeds to step S15 if the prediction error is less than the threshold. However, at step S13 a judgment may be performed as to whether the prediction error is greater than the threshold. In this case, processing would proceed to step S14 if the prediction error were greater than the threshold, while processing would proceed to step S15 if the prediction error were less than or equal to the threshold. A similar modification could be made at steps S24, S34 and S53 in FIGS. 4, 5 and 6.

Claims

1. A communication terminal comprising: a coding part for coding a moving picture by selectively using one of an intra-coding mode and an inter-coding mode;wherein, if the intra-coding mode is used by the coding part, the coding part: calculates a difference signal by subtracting, from a first block to be coded which is in a first size and is divided from the moving picture, a prediction block in the first size generated using at least one block adjacent to the first block to be coded;divides the difference signal into a plurality of second blocks in a second size;calculates a prediction error from the difference signal based on the second blocks;compares the prediction error with a predetermined threshold; andcalculates one of: (i) both a quantized AC component and a quantized DC component by executing a discrete cosine transform and quantization on the difference signal if the prediction error is greater than the threshold, and (ii) only a quantized DC component by executing the discrete cosine transform and quantization on the difference signal if the prediction error is less than or equal to the threshold.
2. The communication terminal according to claim 1, wherein if the prediction error is less than or equal to the threshold, the coding part substitutes a zero value for an AC component extracted by executing the discrete cosine transform so as to calculate the quantized DC component.
3. The communication terminal according to claim 1, wherein if the prediction error is less than or equal to the threshold, the coding part substitutes a zero value for an AC component extracted by executing the discrete cosine transform so as to calculate only the quantized DC component.
4. The communication terminal according to claim 1, wherein the first size is 16×16 pixels and the second size is 4×4 pixels.
5. The communication terminal according to claim 1, wherein the first size is 8×8 pixels and the second size is 4×4 pixels.
6. A communication terminal comprising: a coding part for coding a moving picture by selectively using one of an intra-coding mode and an inter-coding mode;wherein, if the intra-coding mode is used by the coding part, the coding part: calculates a difference signal by subtracting, from a first block to be coded which is in a first size and is divided from the moving picture, a prediction block in the first size generated using at least one block adjacent to the first block to be coded;divides the difference signal into a plurality of second blocks in a second size;calculates a prediction error from the difference signal based on the second blocks;compares the prediction error with a predetermined threshold; andcalculates ones of: (i) both a quantized AC component and a quantized DC component by executing a discrete cosine transform and quantization on the difference signal if the prediction error is greater than or equal to the threshold, and (ii) only a quantized DC component by executing the discrete cosine transform and quantization if the prediction error is less than the threshold.
7. The communication terminal according to claim 6, wherein if the prediction error is less than the threshold, the coding part substitutes a zero value for an AC component extracted by executing the discrete cosine transform so as to calculate the quantized DC component.
8. The communication terminal according to claim 6, wherein if the prediction error is less than the threshold, the coding part substitutes a zero value for an AC component extracted by executing the discrete cosine transform so as to calculate only the quantized DC component.
9. The communication terminal according to claim 6, wherein the first size is 16×16 pixels and the second size is 4×4 pixels.
10. The communication terminal according to claim 6, wherein the first size is 16×16 pixels and the second size is 4×4 pixels.
11. A communication terminal comprising: a coding part for coding a moving picture by selectively using one of an intra-coding mode and an inter-coding mode; anda memory configured to store at least one picture to be used for inter prediction;wherein if the inter-coding mode is used by the coding part, the coding part: calculates a difference signal by subtracting, from a first block that has a first size and is divided from the moving picture, a prediction block in the first size generated from the picture stored in the memory;calculates a prediction error from the difference signal;compares the prediction error with a predetermined threshold;calculates a quantized AC component and a quantized DC component by executing a discrete cosine transform and quantization on the difference signal if the prediction error is greater than the threshold; andsubstitutes a zero value for an AC component and a DC component extracted by executing the discrete cosine transform if the prediction error is less than or equal to the threshold.
12. A communication terminal comprising: a coding part for coding a moving picture by selectively using one of an intra-coding mode and an inter-coding mode;a memory configured to store at least one picture to be used for inter prediction;wherein if the inter-coding mode is used by the coding part, the coding part: calculates a difference signal by subtracting, from a first block that has a first size and is divided from the moving picture, a prediction block in the first size generated from the picture stored in the memory;calculates a prediction error from the difference signal,compares the prediction error with a predetermined threshold,calculates a quantized AC component and a quantized a DC component by executing a discrete cosine transform and quantization on the difference signal if the prediction error is greater than or equal to the threshold, andsubstitutes a zero value for an AC component and a DC component extracted by executing the discrete cosine transform if the prediction error is less than the threshold.
13. A method of controlling a coding part of a communication terminal to code a moving picture by selectively using an intra-coding mode, comprising: calculating a difference signal by subtracting, from a first block to be coded which is in a first size and is divided from the moving picture, a prediction block in the first size generated using at least one block adjacent to the first block to be coded;dividing the difference signal into a plurality of second blocks in a second size;calculating a prediction error from the difference signal based on the second blocks;comparing the prediction error with a predetermined threshold; andcalculating one of: (i) both a quantized AC component and a quantized DC component by executing a discrete cosine transform and quantization on the difference signal if the prediction error is greater than the threshold, and (ii) only a quantized DC component by executing the discrete cosine transform and quantization on the difference signal if the prediction error is less than or equal to the threshold;wherein the coding part is also operable in an inter-coding mode.
14. The method according to claim 13, wherein if the prediction error is less than or equal to the threshold, a zero value is substituted for an AC component extracted by executing the discrete cosine transform so as to calculate the quantized DC component.
15. The method according to claim 13, wherein if the prediction error is less than or equal to the threshold, a zero value is substituted for an AC component extracted by executing the discrete cosine transform so as to calculate only the quantized DC component.
16. The method according to claim 13, wherein the first size is 16×16 pixels and the second size is 4×4 pixels.
17. The method according to claim 13, wherein the first size is 8×8 pixels and the second size is 4×4 pixels.
18. A method of controlling a coding part of a communication terminal to code a moving picture by selectively using an intra-coding mode, comprising: calculating a difference signal by subtracting, from a first block to be coded which is in a first size and is divided from the moving picture, a prediction block in the first size generated using at least one block adjacent to the first block to be coded;dividing the difference signal into a plurality of second blocks in a second size;calculating a prediction error from the difference signal based on the second blocks;comparing the prediction error with a predetermined threshold; andcalculating one of: (i) both a quantized AC component and a quantized DC component by executing a discrete cosine transform and quantization on the difference signal if the prediction error is greater than or equal to the threshold, and (ii) only a quantized DC component by executing the discrete cosine transform and quantization if the prediction error is less than the threshold; wherein the coding part is also operable in an inter-coding mode.
19. The method according to claim 18, wherein if the prediction error is less than the threshold, a zero value is substituted for an AC component extracted by executing the discrete cosine transform so as to calculate the quantized DC component.
20. The method according to claim 18, wherein if the prediction error is less than the threshold, a zero value is substituted for an AC component extracted by executing the discrete cosine transform so as to calculate only the quantized DC component.
21. The method according to claim 18, wherein the first size is 16×16 pixels and the second size is 4×4 pixels.
22. The method according to claim 18, wherein the first size is 16×16 pixels and the second size is 4×4 pixels.
23. A method of controlling a coding part of a communication terminal to code a moving picture by selectively using an inter-coding mode, the communication terminal comprising a memory configured to store at least one picture to be used for inter prediction, the method comprising: calculating a difference signal by subtracting, from a first block that has a first size and is divided from the moving picture, a prediction block in the first size generated from the picture stored in the memory;calculating a prediction error from the difference signal;comparing the prediction error with a predetermined threshold;calculating a quantized AC component and a quantized DC component by executing a discrete cosine transform and quantization on the difference signal if the prediction error is greater than the threshold; andsubstituting a zero value for an AC component and a DC component extracted by executing the discrete cosine transform if the prediction error is less than or equal to the threshold;wherein the coding part is also operable in an intra-coding mode.
24. A method of controlling a coding part of a communication terminal to code a moving picture by selectively using an inter-coding mode, the communication terminal comprising a memory configured to store at least one picture to be used for inter prediction, the method comprising: calculating a difference signal by subtracting, from a first block that has a first size and is divided from the moving picture, a prediction block in the first size generated from the picture stored in the memory;calculating a prediction error from the difference signal;comparing the prediction error with a predetermined threshold;calculating a quantized AC component and a quantized a DC component by executing a discrete cosine transform and quantization on the difference signal if the prediction error is greater than or equal to the threshold; andsubstituting a zero value for an AC component and a DC component extracted by executing the discrete cosine transform if the prediction error is less than the threshold;wherein the coding part is also operable in an intra-coding mode.

Priority Claims (1)

Number	Date	Country	Kind
2006-187124	Jul 2006	JP	national

Communication terminal

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

US Classifications

International Classifications

Abstract

Description

Claims

Priority Claims (1)