The present invention relates to a data processing device that compresses data by vector quantization.
Heretofore, in the fields of information processing and multimedia, processing called pattern matching has often been used and vector quantization is one of such processing. The vector quantization is a technique used in, for example, data compression processing for image data and sound data. In typical image data compression using the vector quantization, individual image data corresponding to each block of a predetermined size (e.g. 4×4 pixels) is extracted from two-dimensionally arrayed image data and treated as a 16-dimensional input vector in case of the block of 4×4 pixels. In this case, a plurality of pixel patterns (template vectors) each having the same size as that of the block used in extracting the image data are prepared in advance.
Then, comparison is made between the input vector and each template vector to find out, among these template vectors, the template vector most similar to the input vector. Specifically, distance calculation between input vector data and template vector data is performed for every template vector data. One of the template vectors which gives a minimum distance as a result of the calculation is assumed to have the highest degree of similarity. In this manner, in the vector quantization, image data compression is carried out by quantizing the input vector by the use of the template vectors.
However, in the conventional vector quantization described above, the data compression processing for the image data or the sound data requires a very large amount of calculation. Accordingly, there arises a problem that a long time is required for the compression processing.
As one approach for solving the above-mentioned problem, there is known a method of utilizing feature quantities of vectors to reduce the calculation amount in the vector quantization. In this method, comparison is made between a feature quantity of the input vector and a feature quantity of each of the template vectors obtained in advance. By the use of the result of comparison, the number of those template vectors to be searched (to be subjected to distance calculation) is reduced. In this manner, it is possible to reduce the calculation amount of the distance calculation and the like in the vector quantization and to speed up the vector quantization.
However, in the above-mentioned method of utilizing the feature quantities of the vectors to reduce the calculation amount, there is a problem that, when plural kinds of feature quantities are used, feature quantity storage memories and feature quantity comparison circuits are increased in number, resulting in an increase in hardware scale required to realize the processing of the vector quantization. On the other hand, there is another problem that, when the number of kinds of the feature quantities is reduced on the contrary, the reduction of the calculation amount in the vector quantization becomes small and a high reduction effect of calculation cannot be obtained.
As another approach of solving the problem that a long time is required for the compression processing by the vector quantization, there is known a method of decreasing the number of steps required for distance calculation in the vector quantization by performing the distance calculation from an upper digit. In this method, upon realizing the processing of vector quantization by hardware, a calculation circuit supplied with input data from an upper digit and producing output data from an upper digit is provided as a circuit for calculating a distance between input vector data and template vector data. Thus, update judgment about the minimum distance based on the distance calculation is carried out successively from the upper digit and, at the time instant when it is judged that the minimum distance is not updated, the distance calculation can be stopped. Therefore, it is possible to reduce the number of the steps required for the distance calculation in the vector quantization to thereby speed up the vector quantization.
However, in the above-mentioned method of reducing the number of the steps required for the distance calculation in the vector quantization, search of the template vector data is carried out for all the template vector data. Therefore, even if the input vector data and the template vector data have patterns obviously different from each other, the distance calculation is executed (started) and a large amount of useless calculation is inevitably executed.
The present invention has been accomplished for solving the above-mentioned problems and has an object to enable data compression processing using vector quantization to be carried out by hardware at a high speed.
A data processing device of the present invention is a data processing device for outputting information indicative of a template pattern most similar to an input pattern among a plurality of template patterns prepared in advance, characterized by comprising first searching means for selecting template patterns to be subjected to calculation of the degree of similarity between the input pattern and the template patterns by the use of feature quantities of the input pattern and the template patterns upon searching the similar template pattern, and second searching means for performing the calculation in a bit serial fashion upon calculating the degree of similarity between the template patterns selected by the first searching means and the input pattern.
The data processing device of the present invention is further characterized in that the second searching means performs the calculation within a calculation unit time with respect to data inputted in a bit serial fashion and, based on a result of the calculation, judges per calculation unit time whether or not the degree of similarity is maximum.
The data processing device of the present invention is further characterized in that the first searching means reduces the number of the template patterns to be subjected to the calculation of the degree of similarity by the use of the feature quantities of the input pattern and the template patterns, and the second searching means performs the calculation of the degree of similarity between the input pattern and the template patterns selected by the first searching means in a bit serial fashion to thereby reduce the number of steps required for the calculation.
The data processing device of the present invention is further characterized in that each of the input pattern and the template patterns is one vector data composed of at least one element.
The data processing device of the present invention is further characterized in that each of the input pattern and the template patterns is composed of at least one element, the second searching means comprising similarity calculating means for calculating an absolute differential distance as the degree of similarity between the input pattern and each of the template patterns, the first searching means comprising similarity calculation judging means for judging, when an absolute value of a difference between the feature quantity of the input pattern and the feature quantity of a particular template pattern is not smaller than an absolute differential distance between the input pattern and a different template pattern different from the particular template pattern, not to perform calculation of an absolute differential distance between the different template pattern and the input pattern.
The data processing device of the present invention is further characterized in that the similarity calculating means is supplied with the template patterns selected by the first searching means and the input pattern both in a bit serial fashion from an upper digit of the data per calculation unit time, calculates the input data in a calculation unit time, and outputs an absolute differential distance as a result of the calculation in a bit serial fashion from an upper digit.
The data processing device of the present invention is further characterized in that the feature quantity is at least one of the sum of individual elements forming each of the input pattern and the template patterns, and the sum of the individual elements after a part of the individual elements have been inverted with respect to a reference value which is an intermediate value of possible values.
The data processing device of the present invention is further characterized in that the similarity calculating means is formed by a semiconductor circuit, the similarity calculating means being supplied with input data represented by a binary number system from an upper digit and producing output data represented by a redundant number system from an upper digit.
The data processing device of the present invention is further characterized in that the similarity calculating means comprises absolute differential value calculating means for calculating an absolute differential value between the elements of the input pattern and the template pattern per each element, and adding means for summing the absolute differential values for the respective elements calculated by the absolute differential value calculating means, the absolute differential value calculating means being equal in number to the elements of each pattern.
The data processing device of the present invention is further characterized in that the absolute differential value calculating means is supplied with input data in a binary number system from an upper digit, calculates differential values between the input data in a bit serial fashion, and outputs the differential values in a redundant number system from an upper digit.
The data processing device of the present invention is further characterized in that the absolute differential value calculating means further has a function of memorizing, as a state, a numerical value other than “0” that first appears in the differential value outputted in a bit serial fashion from the upper digit, inverting or non-inverting the differential value depending on the memorized state, and calculating the absolute differential value.
The data processing device of the present invention is further characterized in that the addition calculating section comprises delay means for giving a delay by a calculation unit time, the addition calculating section being supplied with input data in a redundant number system from an upper digit and outputting a calculation result in a redundant number system from an upper digit.
The data processing device of the present invention is further characterized in that the second searching means comprises holding means for holding a minimum absolute differential distance between the input pattern and the template pattern; and comparing means supplied with the minimum absolute differential distance held in the holing means in a bit serial fashion from an upper digit, and supplied with an absolute differential distance obtained by similarity calculating means in a bit serial fashion from an upper digit, the similarity calculating means being for calculating an absolute differential distance between a different template pattern different from the aforementioned template pattern and the input pattern, the comparing means being supplied with the absolute differential distances in the bit serial fashion from the upper digit and carrying out comparison in connection with the absolute differential distances; calculation for lower digits being stopped after it has been confirmed in the comparison by the comparing means that the absolute differential distance between the different template pattern different from the aforementioned template pattern and the input pattern is greater than the minimum absolute differential distance held in the holding means.
Now, embodiments of the present invention will be described with reference to the drawing.
As a data processing device according to each embodiment of the present invention is illustrated in the form of a data processing device which performs vector quantization as one example of pattern matching processing. It is noted here that, when the degree of similarity is estimated by an absolute differential distance in the vector quantization, an operation of searching out a template vector most similar to an input vector corresponds to an operation of searching out a template vector giving a minimum absolute differential distance.
In
Depending on the judgment result by the similarity calculation judging section 102, a similarity calculating section 103 performs the similarity calculation (distance calculation) between the input vector and the template vector. The feature quantity memorizing section 104 memorizes feature quantities of template vectors calculated in advance. A template memorizing section 105 memorizes a plurality of template vectors.
A maximum similarity searching section 106 compares the degree of similarity calculated by the similarity calculating section 103 and a maximum similarity obtained in the processing to the present and memorized in a maximum similarity memorizing section 107. After completion of the processing with respect to all the template vectors in the template memorizing section 105, an address designating section 108 outputs as an index an address of a particular template vector having the maximum similarity (most similar to the input vector) with reference to the comparison result in the maximum similarity searching section 106.
It is assumed here that each vector is n-dimensional. Then, the following relationship is established between a difference between sums of the elements in the input vector and sums of the elements in the template vector (right side of an expression (1) given below) and an absolute differential distance between the input vector and the template vector (left side of the expression (1) given below).
In the expression (1), il represents an I-th element of an input vector I and tk1 represents an l-th element of a k-th template vector Tk.
The expression (1) represents that an absolute differential distance between two vectors is equal to or greater than an absolute value of a difference between the respective sums of the elements in those vectors. Thus, a method using the expression (1) is a method of extracting and using a density in a whole block as a feature quantity.
Hereinafter, description will be made about a codebook searching procedure in the vector quantization using the relationship given by the expression (1), in which a template vector most similar to the input vector is found out from a codebook composed of a plurality of template vectors.
(Step 1) The total sum of elements in each template vector is calculated for all template vectors and memorized in the feature quantity memorizing section 104 in advance.
(Step 2) When the input vector I is given, a total sum of elements in the input vector I is calculated by the feature quantity calculating section 101. Then, an absolute differential distance between the first template vector T1 and the input vector I is calculated by the similarity calculating section 103 and memorized in the maximum similarity memorizing section 107 as a minimum absolute differential distance.
(Step 3) Then, a difference between a total sum of elements in the second template vector T2 and a total sum of the elements in the input vector I is calculated by the similarity calculation judging section 102. Further, the similarity calculation judging section 102 compares the calculated difference between the total element sums with the minimum absolute differential distance memorized in the maximum similarity memorizing section 107.
In case where, as a result of the above-mentioned comparison, the difference between the total element sums in both vectors is smaller than or equal to the minimum absolute differential distance, the similarity calculation judging section 102 judges that similarity calculation has to be carried out in the similarity calculating section 103. Following the above-mentioned judgment, the similarity calculating section 103 calculates an absolute differential distance between the template vector T2 and the input vector I.
The maximum similarity searching section 106 compares the absolute differential distance calculated by the similarity calculating section 103 and the minimum absolute differential distance. In case where, as a result of the comparison, the absolute differential distance calculated by the similarity calculating section 103 is smaller than the minimum absolute differential distance, an index of the template vector is stored. The absolute differential distance calculated by the similarity calculating section 103 is memorized in the maximum similarity memorizing section 107 as an updated minimum absolute differential distance to update the minimum absolute differential distance.
On the other hand, in case where the difference between the element sums is greater than the minimum absolute differential distance as a result of the comparison in the similarity calculation judging section 102, an absolute differential distance between the template vector and the input vector would not update the minimum absolute differential distance (become smaller than the minimum absolute differential distance). Therefore, the similarity calculation judging section 102 judges not to carry out similarity calculation in the similarity calculating section 103. Then, searching a next template vector is started without performing calculation of the absolute differential distance.
(Step 4) Subsequently, the step 3 is repeated to the last of all the template vectors memorized in the template memorizing section 105.
By performing the processing as described above, the first search section selects those template vectors to be searched (to be subjected to matching) with reference to the feature quantities. Thus, it is possible to reduce the number of times of matching between the input vector and the template vectors. Herein, calculation for obtaining the difference between the total sum of elements can be simply carried out with respect to all the template vectors but because of one-dimensional scalar calculation. On the other hand, calculation of the absolute differential distances can be realized by n-dimensional vector calculation. Accordingly, when n≧2, a large amount of n-dimensional vector calculation can be omitted by using simple scalar calculation so that the calculation amount can be decreased.
For example, as shown in
In this event, a difference between the total sum of the elements in the input vector I and a total sum of elements in the following k-th template vector Tk is equal to “18” which is smaller than the absolute differential distance “94” between the input vector I and the template vector Tk−1 as the minimum absolute differential distance. Therefore, calculation is carried out about an absolute differential distance between the input vector I and the following template vector Tk. The calculation gives an absolute differential distance of “21” which is smaller than the previous minimum absolute differential distance. Therefore, the minimum absolute differential distance is updated to “21”.
Next, a difference between the total sum of the elements in the input vector I and a total sum of elements in the next following (k+1)-th template vector Tk+1 is equal to “42” and is greater than the minimum absolute differential distance “21” between the input vector I and the template vector Tk+1 which is kept as the minimum absolute differential distance. Therefore, an absolute differential distance between the input vector I and the template vector Tk+1 may not be calculated. Thus, with respect to the template vector Tk+1, only the simple one-dimensional scalar calculation is carried out while the n-dimensional vector calculation is not carried out. Then, the processing proceeds to the next template vector.
Although a total sum of the elements in each vector is used as the feature quantity in the foregoing example, the feature quantity is not limited thereto.
For example, it is possible to use, as the feature quantity, a difference between total sums of elements in the input vector and the template vector both of which are inverted, as will be described hereinbelow. An inequality in an expression (2) holds between an absolute differential distance between the input vector and the template vector and a difference between total sums of elements in the respective vectors after inversion.
The expression (2) shows a case where a data value of each pixel falls within the range of “0” to “255” by way of example.
Herein, “inversion” means that the data values are subjected to white-black inversion with respect to an intermediate value as the center. For example, in case where a data value of each pixel falls within the range of “0” to “255”, the center value is equal to “127.5”. Therefore, a data value “128” is inverted into “127” and a data value “100” is inverted into “155”. In an example described hereinbelow, inversion is carried out by subtracting from “255” a data value of a pixel to be inverted.
Further, the inversion is not carried out with respect to all the pixels but the pixels are partly inverted. In the expression (2), 0-th to m-th pixels are non-inverted while (m+1)-th to n-th pixels are inverted. It is noted here that those pixels to be inverted need not be consecutively numbered. However, those pixels to be inverted should be common to the input vector and the template vector in element numbers.
The method using the expression (2) is a method of extracting and using a feature which appears in a direction of a luminance change in a block.
First, the feature quantity calculating section 101 inverts those pixels of the input vector corresponding to inversion by using the inversion pattern and calculates a total sum of elements after the inversion. For the template vector also, inversion is carried out by using the similar inversion pattern and a total sum of elements is calculated and memorized in the feature quantity memorizing section 104.
Then, the similarity calculation judging section 102 calculates a difference between the thus obtained total sum of the elements in the input vector after the inversion and the thus obtained total sum of the elements in the template vector after the inversion. In the example shown in
Although only one kind of inversion pattern is used in the example shown in
Like in the example shown in
In the example shown in
Thus, by increasing the number of the inversion patterns used for the feature quantity extraction, the calculation amount can further be reduced.
In the above-mentioned searching method using the feature quantities, the mathematical relationship is used which is established between the two vectors (the input vector and the template vector). Therefore, it is possible to omit or reduce an amount of calculation without causing decrease in accuracy.
Next, description will be made about second-stage processing for reducing the number of calculation cycles upon performing calculation of the degree of similarity of patterns. In the vector quantization, the degree of similarity is given by the absolute differential distance. The calculation of the degree of similarity, i.e. the absolute differential distance, is carried out in the similarity calculating section 103 shown in
The absolute differential value calculating section comprises two blocks, i.e. a block for performing difference calculation and another block for performing absolute value calculation. The difference calculation block 701 carries out subtraction per digit from the upper digit and outputs a result of the subtraction per digit. The subtraction result is latched by a temporary storage register 702 and supplied to a state control block 703. The state control block 703 is a state machine that checks the subtraction result from the upper digit and sends a sign inversion request signal to a signal inversion block if a number other than “0” that first appears is negative.
When the sign inversion request signal is outputted from the state control block 703, a differential value is negative. Therefore, an absolute value calculation block 704 carries out inversion to obtain an absolute value. On the other hand, when the sign inversion request signal is not outputted, a differential value is “0” or positive. Therefore, data is outputted without any inversion (as it is). By providing a plurality of the absolute differential value calculating sections shown in
Next, all the absolute differential values outputted by the absolute differential value calculating section 601 from the upper digit are summed by the addition calculating section from the upper digit. Thus, an absolute differential distance is calculated. In order to carry out the addition from the upper digit, the SD number system of the binary-coded ternary notation is used. Using the SD number system of the binary-coded ternary notation provides such a characteristic that propagation of a carry signal in the addition calculation is limited to two digits at maximum. Utilizing this characteristic, the calculation is executed with reference to a first lower digit and a second lower digit. Thus, it is possible to perform the addition calculation from the upper digit.
The addition calculating section shown in
Although the data inputted is equal in number to two in the addition calculation section shown in
Next, the maximum similarity searching section 106 will be described.
A “large” or “small” relationship between numbers of the SD number system of the binary-coded ternary notation can be determined at a time instant when a difference of “2” or more appears at a certain digit or its upper digit while numerical values are being checked from the upper digit. It is assumed that, with respect to two particular numbers, one of the numbers is greater by “2” at an m-th or upper digit, this number is greater than the other number.
In order to find out a minimum value from many numerical values inputted in a serial fashion from the upper digit, three states are defined. Upon reception of a particular digit, a numerical value judged to be greater than a numerical value supposed to be minimum is defined as “Larger”. A numerical value greater than the minimum by one is defined as “Can't Decide”. A numerical value supposed to be the minimum is defined as “Smaller”. Then, a circuit is designed on the basis of a state transition chart shown in
The “State Indicator” 1201 is reset into the “Smaller” state as an initial state. Data to be compared is inputted in a serial fashion from the upper digit. At a time instant when one is in the “Smaller” state while all the others are in the “Larger” state, the “Winner observer” block 1203 outputs a comparison result and a judgment end flag.
Summarizing the processing described above, the processing in this embodiment is realized according to a flow chart shown in
In
Next, a second embodiment will be described.
As one method of vector quantization, there is a method of removing mean values of blocks as a process prior to pattern matching.
Hereinbelow, description will be made about one example of the method of removing the mean values of the blocks.
As shown in expressions (3) and (4), a certain vector X is divided into its mean value m and a remainder or remnant R left after the mean value m is removed. Pattern matching is carried out using the remainder R as a new vector.
However, removing the mean value information of the blocks results in unifying mean values among all the blocks so that variation in mean value information becomes small. Here, the method of reducing the calculation amount utilizing the relationship given in the expression (1) is a method utilizing that variation in mean value information of the vectors is large and, therefore, does not work effectively when the variation in mean value information of the vectors is equalized.
However, even if the mean values are removed, the information related to a variation of luminance of the blocks is not lost. Therefore, the method of reducing the calculation amount utilizing the relationship given in the expression (2) is effective.
(Effect of the Invention)
As described above, according to the present invention, in pattern matching processing in vector quantization used in, for example, data compression processing of image data or sound data, it is possible to reduce the number of templates required to be subjected to matching, by utilizing feature quantities of an input pattern and template patterns. Further, by performing calculation upon matching from the upper digit, it is possible to reduce the number of cycles of the calculation. Thus, by combining the two kinds of calculation skipping processing, a high calculation skipping rate can be achieved and the data compression processing using the vector quantization can be carried out at a high speed.
Further, in case where a method of removing information about mean values of blocks is used in the vector quantization, it is possible to reduce the number of the templates required to be subjected to matching by the use of a plurality of kinds of feature quantities.
The present invention is applicable to a communication system in which image or sound data is compressed at a high speed and transmitted, a transmission device used in the system, and so forth. In the foregoing embodiment, the description has been made on the assumption that the data processing device shown in
In the foregoing, the description has been made only about the illustrative embodiments of the present invention. However, it will readily be understood by a person skilled in the art that the embodiments can be modified and changed in various manners without significantly departing from the novel structure and effect of the present invention. Therefore, all of those modifications and changes are covered by the scope of the present invention.
Number | Date | Country | Kind |
---|---|---|---|
2002-44054 | Feb 2002 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP03/01629 | 2/17/2003 | WO |