Information
-
Patent Application
-
20040252125
-
Publication Number
20040252125
-
Date Filed
March 29, 200420 years ago
-
Date Published
December 16, 200420 years ago
-
CPC
-
US Classifications
-
International Classifications
Abstract
A memory device, a signal processing apparatus and an image signal processing apparatus and signal processing methods capable of performing matching processing with a small amount of calculation and accurately detecting motion vectors, provided with an ME memory 13 for converting by using a feature including a pixel value in a certain block range having a focused pixel at its center as an address based on information of a reference frame Fr stored in a second frame memory 12; and a matching portion 14 for reading storage information of the ME memory 13 by using a feature of a focused pixel included in information of a current frame Fc supplied from a first frame memory 11 as a feature address, calculating a distance between the focused pixel in the current frame and a feature address (position coordinates) read from the ME memory 13, and detecting differential coordinates based on position information having the minimum distance from a plurality of candidates as motion vectors (Vx, Vy) of the focused pixel.
Description
TECHNICAL FIELD
[0001] The present invention relates to a memory device, a signal processing apparatus, an image signal processing apparatus and signal processing methods applicable to processing requiring matching, such as a motion detection device and an object search device, etc. used, for example, in a motion image compression apparatus, etc.
BACKGOUND ART
[0002] A matching processing of a signal, particularly, a matching processing of an image signal has a problem of generally causing a large amount of calculation. A motion vector detection by block matching as described below is an example of using the matching processing.
[0003] Namely, in an image signal processing apparatus, motion detection for obtaining a motion vector indicating a motion of an image is one major technique for effectively performing the motion image compression. There are some proposed methods of obtaining the motion vector, but one major method is a method called block a matching algorithm.
[0004]
FIG. 1 is a block diagram showing an example of a configuration of a motion detection device in an image signal processing apparatus of the related art, wherein the block matching algorithm is applied.
[0005] The motion detection device 1 comprises frame memories 2 and 3 and a motion vector detection portion 4.
[0006] In the motion detection device 1, when an image signal is input from an input terminal TIN, an information of one frame is stored in the frame memory 2.
[0007] When an information of the next image is input, the previous (previously input) information in the frame memory 2 is stored in the frame memory 3, and the currently (this time) input information is stored in the frame memory 2.
[0008] Namely, an information of a current frame Fc is stored in the frame memory 2, and an information of a reference frame Fr is stored in the frame memory 3.
[0009] Next, the information of the current frame Fc and the reference frame Fr is sent to the vector detection portion 4 and divided to blocks in the motion vector detection portion 4 where a motion vector (Vx, Vy) is detected and output from a terminal TOUT.
[0010]
FIG. 2 is a view for explaining an outline of block matching algorithm. Below, the outline of algorithm will be explained with reference to FIG. 2.
[0011] In this algorithm, a motion vector of a focused pixel Fc (x, y) in the current frame Fc is obtained by calculating a differential absolute value sum of corresponding pixels in pixels in a reference block range (L×L) having the focused pixel Fc (x, y) at its center and pixels in the same block range as the above block range (L×L) in a search area SR in the reference frame Fr.
[0012] The above calculation is repeated while moving the block range extracted in the search area SR by one pixel, and a differential vector of the center position of a block having the smallest differential absolute value sum in all blocks and the focused pixel position is regarded as the solution (motion vector).
[0013] Next, a processing procedure of detecting a motion vector of the pixel Fc (x, y) in the current frame Fc will be explained in detail with reference to FIG. 3.
[0014] Step ST1:
[0015] In the step ST1, a search area SR using as a reference the same position in the reference frame as the position (x, y) of the focused pixel is determined after starting processing ST0.
[0016] Step ST2:
[0017] In the step ST2, the maximum value of a calculation formula is substituted to initialize a variable “min” storing the minimum value of a calculation result. When assuming that one pixel is 8 bits and the number of pixels in a block is 16, 28×16=4096 is assigned to the variable “min”.
[0018] Step ST3:
[0019] In the step ST3, a counter variable “n” for counting blocks in a search area SR is initialized to be
[0020] Step ST4:
[0021] In the step ST4, a variable “sum” for being substituted a calculation result is initialized to be “0”.
[0022] Step ST5:
[0023] In the step ST5, assuming that a range of a reference block is L×L, a pixel in a certain block in the current frame is Fc (i, j), and a pixel in the k-th block in the search area SR of the reference frame Fr is Frk (i, j), calculation of a differential absolute value sum of corresponding pixels, that is, the formula 1 below is performed and the calculation result is substituted for
1
[0024] Step ST6
[0025] In the step ST6, the relationship of large and small is distinguished between the calculated differential absolute value sum “sum” and the minimum value “min” of the differential absolute value sum. When the calculated differential absolute value sum “sum” is smaller, the procedure proceeds to a step ST7, while when it is larger (including being equal), the calculation result is not the minimum value, so that the step ST7 is skipped and the procedure proceeds to a step ST8.
[0026] Step ST7:
[0027] In the step ST7, the calculation result “sum” is updated by the minimum value “min”, and a counter value “n” of the block as a motion vector number is set.
[0028] Step ST8:
[0029] In the step ST8, if the block counter value “n” is the total number of blocks in the search area SR, that is, the final block comes, it means the finish, so the procedure proceeds to a step ST10, while if it is not the final block, the procedure proceeds to a step ST9.
[0030] Step ST9:
[0031] In the step ST9, the block counter value “n” is incremented to “n+1”, and the procedure proceeds to the step ST4 to repeat the calculation.
[0032] Step ST10:
[0033] In the step ST10, a motion vector is obtained from the center pixel and (x, y) of a block having a number stored in a motion number and output the same.
[0034] Since the block matching algorithm explained above repeats the calculation of the formula (1), there is a disadvantage that an amount of the calculation becomes enormous and most of the time of image compression processing, such as MPEG, is spent for that.
DISCLOSURE OF THE INVENTION
[0035] An object of the present invention is to provide a memory device, a signal processing apparatus, an image processing apparatus and signal processing methods capable of performing matching processing, etc. only with a small amount of calculation and accurately detecting a motion vector, etc.
[0036] To attain the above object, a first aspect of the present invention is a memory device for storing information, characterized by comprising an input/output means for receiving or outputting the information; and a storage portion for storing the information; wherein the storage portion is at least controlled by an address corresponding to a feature of a focused data in a predetermined signal; and the information is a position information of the focused data in the predetermined signal.
[0037] Preferably, the memory device is a semiconductor device.
[0038] A second aspect of the present invention is a signal processing apparatus for performing matching processing by using a first signal including a plurality of first data and a second signal including a plurality of second data, characterized by comprising a first feature extraction means for extracting a feature as focused data being data at a focused position in the first signal; and a storage means for storing position information of the second data at a position specified by an address corresponding to the feature of each of the second data; wherein the position information of the second data corresponding to the focused data is obtained by reading the position information to the second data stored in the storage means at an address corresponding to the feature of the focused data.
[0039] Preferably, the storage means stores the position information of the second data in advance.
[0040] Also, the above signal processing apparatus further comprises a second feature extraction means for extracting a feature of each of second data in the second signal.
[0041] Preferably, the first feature extraction means further comprises a second feature extraction means for extracting a feature of each of second data in the second signal.
[0042] Also, the first or second signal is an image signal.
[0043] Also, the feature is extracted by using a predetermined number of first or second data.
[0044] Also, the feature is a distribution of values of a predetermined number of first or second data.
[0045] Also, the first or second data is expressed by a plurality of bits, and the feature is a distribution of values of the predetermined number of first or second data wherein predetermined bits are omitted from the plurality of bits.
[0046] Preferably, the predetermined bits are lower side bits of the first or second data.
[0047] Also, the feature is a distribution of values of predetermined number of re-quantized first or second data. A feature is a distribution of values of predetermined number of first or second data subjected to adaptive quantization. Also, the adaptive quantization is an ADRC.
[0048] Preferably, when a plurality of position information of the second data corresponding to a feature of the focused data are read from the storage means, the position information having the shortest distance to a position of the focused data is regarded as position information of the second data corresponding to the focused data.
[0049] A third aspect of the present invention is an image signal processing device for detecting motion vectors by using a first image signal including a plurality of first data and a second image signal including a plurality of second data, characterized by comprising a first feature extraction means for extracting a feature as focused data being data at a focused position in the first image signal; a storage means for storing position information of the second data at a position specified by an address corresponding to the feature of each of the second data; and a motion vector calculation means for obtaining position information of the second data corresponding to the focused data by reading position information of the second data stored in the storage means at an address corresponding to the feature of the focused data and calculating a motion vector of the focused data by using position information of the focused data and the obtained position information of the second data.
[0050] Preferably, when a plurality of position information of the second data corresponding to a feature of the focused data are read from the storage means, the motion vector calculation means regards the position information having the shortest distance to a position of the focused data as position information of the second data corresponding to the focused data.
[0051] A fourth aspect of the present invention is a signal processing method for performing matching processing by using a first signal including a plurality of data and a second signal including a plurality of second data, comprising a first step for extracting a feature as focused data being data at a focused position in the first signal; a second step for storing position information of the second data at a position specified by an address corresponding to the feature of each of the second data; and a third step for obtaining position information of the second data corresponding to the focused data by reading position information to the second data stored in the second step at an address corresponding to the feature of the focused data.
[0052] A fifth aspect of the present invention is an image signal processing method for detecting a motion vector by using a first image signal including a plurality of first data and a second image signal including a plurality of second data, comprising a first step for extracting a feature as a focused data being data of a focused position in the first images signal; a second step for storing position information of the second data at a position specified by an address corresponding to the feature of each of the second data; a third step for obtaining position information of the second data corresponding to the focused data by reading position information to the second data stored in the storage means at an address corresponding to the feature of the focused data; and a fourth step for calculating a motion vector of the focused data by using position information of the focused data and the obtained position information of the second data.
BRIEF DESCRIPTION OF DRAWINGS
[0053]
FIG. 1 is a block diagram showing an example of a configuration of a motion detection device of the related art wherein block matching algorithm is applied.
[0054]
FIG. 2 is a view for explaining an outline of block matching algorithm.
[0055]
FIG. 3 is a flowchart for explaining a processing procedure of detecting a motion vector of a pixel Fc (x,
[0056] y) in a current frame FC.
[0057]
FIG. 4 is a block diagram showing a signal processing apparatus having a basic configuration of a motion detection device as a principal part of an image signal processing apparatus according to the present invention.
[0058]
FIG. 5 is a block diagram showing an example of a basic configuration of first and second data storing portions in FIG. 4.
[0059]
FIG. 6 is a block diagram showing an example of a configuration of a matching portion in FIG. 4.
[0060]
FIG. 7 is a block diagram showing a first embodiment of a motion detection device according to the present invention.
[0061]
FIG. 8 is a flowchart showing an outline of motion vector detection according to the present embodiment.
[0062]
FIG. 9 is a view for explaining the configuration of a motion memory in a feature address system according to the present embodiment.
[0063]
FIG. 10 is a view for explaining a procedure of storing in the motion memory in a feature address system according to the present embodiment.
[0064]
FIG. 11 is a flowchart for explaining an operation of motion detection in a feature address system according to the present embodiment.
[0065]
FIG. 12 is a block diagram showing a second embodiment of a motion detection device according to the present invention.
[0066]
FIG. 13 is a block diagram showing another example of a configuration of a motion detection device according to the second embodiment.
[0067]
FIG. 14A to FIG. 14D are views showing an example of taking a class tap.
[0068]
FIG. 15A and FIG. 15B are views for explaining the fact that using an ADRC quantization code is superior to using a surrounding pixel value.
[0069]
FIG. 16 is a view showing a correspondence relationship of decimal number and hexadecimal number of a luminance value.
BEST MODE FOR CARRYING OUT THE INVENTION
[0070] Below, preferred embodiments of the present invention will be explained with reference to the attached drawings.
[0071]
FIG. 4 is a block diagram showing a signal processing device having a basic configuration of a motion detection device as a principal part of an image signal processing apparatus according to the present invention.
[0072] The signal processing apparatus is for accurately estimating position information (a motion vector) with a small amount of calculation by performing processing of obtaining (determining) position information of second data corresponding to focused data by reading the position information of second data included in a second signal stored in a motion detection memory (hereinafter, referred to as an ME memory) at an address corresponding to a feature of the focused data being data at a focused position in a first signal including a plurality of first data.
[0073] Note that the ME memory (a feature address, i) indicates data stored at a position specified by an address (a feature address, i) in the ME memory.
[0074] Below, the specific configuration and function of the signal processing apparatus will be explained in detail with reference to the drawings.
[0075] The signal processing apparatus 10 comprises a first data storing portion 11, a second data storing portion 12, an ME memory 13 and a matching portion 14.
[0076] Note that the first data storing portion 11 configures a first feature extraction means according to the present invention, the second data storing portion 12 configures a second feature extraction means according to the present invention, the ME memory 13 configures a storage means according to the present invention, and the matching portion 14 configures a motion vector calculation means (matching means) according to the present invention.
[0077] The first data storing portion 11 stores a first signal including a plurality of first data input from an input terminal TIN.
[0078] The first data storing portion 11 receives as an input the next first signal and stores this next first signal and outputs the previously stored first signal as a second signal including a plurality of second data to the second data storing portion 12 and the matching portion 14.
[0079] Also, the first data storing portion 11 extracts a feature of a focused data being data at a focused position from the stored first signal and supplies feature information including position information of the focused data to the matching portion 14.
[0080] The second data storing portion 12 stores the previous (for example, one time before) second signal stored in the first data storing portion 11, extracts a feature of each of the second data from the stored second signal and stores in the ME memory 13.
[0081]
FIG. 5 is a block diagram showing an example of a configuration of the first and second data storing portions. As shown in FIG. 5, the first and second data storing portions respectively comprises a memory 111 and a feature extraction portion 112 for extracting a feature from the first or second signal stored in the memory 111.
[0082] The ME memory 13 receives features of second data of the second signal in the second data storing portion 12 and stores position information of the second data at positions specified by addresses corresponding to the features of the second data.
[0083] The matching portion 14 obtains (determines) position information of second data corresponding to the focused data by reading the position information of the second data included in the second signal stored in the ME memory 13 at an address corresponding to a feature of the focused data being data at a focused position input from the first data storing portion 11.
[0084] When a plurality of position information of the second data corresponding to a feature of the focused data is read from the ME memory 13, the matching portion 14 ragards the above position information closest to a position of the focused data as position information of second data corresponding to the focused data.
[0085]
FIG. 6 is a block diagram showing an example of a configuration of a matching portion 14.
[0086] The matching portion 14 comprises, as shown in FIG. 6, an input portion 141, a read portion 142 for reading the position information of the second data included in the second signal stored in the ME memory 13 at an address corresponding to a feature of the focused data being data of a focused position based on feature information from the input portion 141, and a determination portion 143 for determining position information of the second data corresponding to the focused data based on the position information of the first data from the input portion 141 and the information read from the ME memory 13. When used in motion detection, the determination portion 413 determines as a motion vector and output the same.
[0087] Note that the first and second signals are, for example, image signals.
[0088] Also, for example, feature extraction in the first and second data storing portions 11 and 12 is performed by using predetermined number of first and second data. Also, the feature is a distribution of values of the predetermined number of first or second data. Alternately, the first and second data is expressed by a plurality of bits, and the feature is a distribution of values of the above predetermined number of first or second data excluding predetermined bits from the above plurality of bits. Here, the predetermined bits are lower side bits of the first or second data. Also, the feature is a distribution of values of predetermined number of re-quantized first or second data. Alternately, the feature is a distribution of values of predetermined number of first or second data subjected to adaptive quantization, and the adaptive quantization is ADRC (Adaptive Dynamic Range Coding).
[0089] Below, embodiments of a motion detection device as a principal part of an image signal processing apparatus based on a signal processing apparatus having the above configuration will be explained.
[0090] First Embodiment
[0091]
FIG. 7 is a block diagram showing a first embodiment of a motion detection device as a principal part of an image processing apparatus according to the present invention.
[0092] The motion detection device is provided with an ME memory for storing position information by using a feature as an address and performs matching processing by using a surrounding pixel value as a feature, so that motion vectors can be accurately estimated with a little amount of calculation.
[0093] Below, the specific configuration and function of the motion detection device will be explained in detail with reference to the drawings.
[0094] The motion detection device 20 comprises a second frame memory 21 as a first data storing portion and a second frame memory 22 as a second data storing portion, an ME memory 23 and a matching portion 24 as a matching portion.
[0095] Note that the first frame memory 21 configures the first feature extraction means according to the present invention, the second frame memory 22 configures the second feature extraction means according to the present invention, the ME memory 23 configures the storing means according to the present invention, and the matching portion 24 configures the motion vector calculation means (matching means) according to the present invention.
[0096] The first frame memory 21 stores an information of one frame of an image signal input from an input terminal TIN.
[0097] When the next frame information is input, the first frame memory 21 stores previously stored frame information, that is, the information of the current frame Fc and outputs the information of the current frame Fc to the second frame memory 22 and the matching portion 24.
[0098] Also, the first frame memory 21 supplies a feature of the focused pixel, that is, address information together with information of the current frame Fc to the matching portion 24.
[0099] The second frame memory 22 stores previous (for example, one time before) frame information stored in the first frame memory 21 as information of a reference frame Fr.
[0100] The ME memory 23 stores information including position coordinates of the focused pixel corresponding to a feature by using the feature as an address as a pixel value of a certain block range having the focused pixel at its center based on the information of the reference frame Fr stored in the second frame memory 22.
[0101] The matching portion 24 reads storage information in the ME memory 23 by using the feature of the focused pixel included in the information of the current frame Fc supplied from the first frame memory 21 as a feature address. Differential coordinates based on the focused pixel in the current frame and the stored information read from the ME memory 23 are output as a motion vector (Vx, Vy) of the focused pixel from a terminal TOUT. Note that when there are a plurality of stored information (position coordinates) in a feature address corresponding to the feature of the focused pixel, a motion vector by which a position and distance of the focused pixel become the minimum among the plurality of candidates is output.
[0102]
FIG. 8 is a flowchart showing an outline of motion vector detection according to the present embodiment.
[0103] In this example, first, data of the reference frame Fr is stored in the ME memory 23 (step ST21). Next, the matching portion 24 reads data from the ME memory 13 at a feature address in accordance with a feature of a pixel Ln in the current frame Fc (Step ST22).
[0104] The matching portion 24 determines the position information of data corresponding to the pixel Ln in the read data (step ST 23). Then, motion vectors are calculated until motion vectors of all pixels are obtained (steps ST23 and ST24).
[0105] Below, the configuration and function of the ME memory 23 wherein a feature address system as a characteristic of the present embodiment will be explained further in detail with reference to FIG. 9 and FIG. 10.
[0106]
FIG. 9 is a view showing an example of a configuration of the ME memory wherein the feature address system is applied. Also, FIG. 10 is a flowchart for explaining a procedure of storing information of the reference frame in the ME memory.
[0107] The conventional memory is for storing pixel value by using the position information of a pixel as an address, while the ME memory 23 stores for every feature position information of a pixel having the feature in flag addresses FRGA1, 2, . . . , that is, B, C . . . in FIG. 9 successively.
[0108] In the present embodiment, one cell ME-B1 has a storage capacity of an amount of position information. Also, the number of position information stored for the feature is stored in a flag address FRGA0.
[0109] The feature is a pixel value in a certain block having a focused pixel at its center. For example, when assuming that a block range is 3×3, the vertical direction is “i”, the horizontal direction is “j”, and a pixel value of a position (i, j) is L(i, j), a feature in this case becomes as in the next formula (2).
{L(i−1, j−1), L(i−1, j), L(i−1, j+1), L(i, j−1), L(i, j), L(i, j+1), L(i+1, j−1), L(i+1, j), L(i+1, j+1)} (2)
[0110] Next, a procedure of storing information of the reference frame to the ME memory will be explained with reference to the flowchart in FIG. 10.
[0111] When information of the reference frame Fr is stored in the frame memory 22, the processing starts.
[0112] Step ST101:
[0113] In the step ST101, all data in the ME memory is initialized to be “0” by writing “0” or turning on a reset signal.
[0114] Step ST102:
[0115] In the step ST102, a counter variable “n” for counting pixels in one frame memory is initialized to be “0”.
[0116] Step ST103:
[0117] In the step ST103, a pixel value of a certain block range having the focused pixel Ln at its center is regarded as a feature (a feature address) in the frame memory 21 in FIG. 4.
[0118] Step ST104:
[0119] In the step ST104, a content in the ME memory (a feature, 0) is incremented by one.
[0120] Step ST105:
[0121] A feature address is set to be the feature in the step ST103 and the ME memory (a feature, 0) as a content of the ME memory 23 when a flag address is “0” is set to a read flag address. In the step ST105, the position information of the focused pixel Ln is written to a content of the ME memory (a feature, flag address) as a content of the ME memory 13.
[0122] Step ST106:
[0123] In the step ST106, the counter variable “n” is incremented.
[0124] Step ST107:
[0125] In the step ST107, whether the focused pixel Ln is the final pixel in the frame or not is judged. When it is not the final pixel, the procedure proceeds to the step ST103 and the same processing is repeated on the next pixel.
[0126] While, when it is the final pixel, the procedure proceeds to a step ST109 to finish the processing.
[0127] Next, a processing procedure of motion vector detection according to the present embodiment will be explained with reference to FIG. 7 and FIG. 11.
[0128] Note that FIG. 11 is a flowchart for explaining the processing procedure of motion vector detection according to the present embodiment.
[0129] Step ST201:
[0130] In the step ST201, after storing information of the current frame Fc and the reference frame Fr respectively in the frame memories 21 and 22, information of the reference frame is stored in the ME memory 23 while converting it to a feature address. The details are explained above (the steps ST100 to ST109).
[0131] Step ST202:
[0132] In the step ST202, the counter variable “n” for counting pixels in one frame is initialized to be “0”.
[0133] Step ST203:
[0134] In the step ST203, a feature of the focused pixel Ln in the first current frame 21 is a pixel value of a certain block range having the focused pixel at its center, so it is sent as a feature to the matching portion 24.
[0135] Step ST204:
[0136] In the step ST204, the matching portion 24 reads a value of a content ME memory (a feature address, 0) from the ME memory 23 by using the received feature as a feature address and substitutes for a variable “kn” meaning the number of options.
[0137] Also, a variable “k” meaning a counter of the number of options is initialized to be “1”, a variable “min” meaning the minimum value of a distance is initialized to be ∞ or a sufficiently large value, and a variable “L” meaning a distance is initialized to be “0”.
[0138] Step ST205:
[0139] In the step ST205, a distance between the focused pixel Ln in the current frame and an ME memory content (a feature address, k)=position coordinates read from the ME memory 23 is calculated and substituted for the variable “L”.
[0140] Step ST206:
[0141] In the step ST206, the distinction of large and small between the distance L obtained in the step ST205 and the minimum value “min” of the distance are performed.
[0142] As a result, when min>L, the procedure proceeds to a step ST207 for updating the minimum value L of the distance, while when min<L, the update step is skipped and the procedure proceeds to a step ST208.
[0143] Step ST207:
[0144] In the step ST207, the minimum value “min” of the distance is updated to be L. At this time, a flag address value, that is, “k” is stored to a variable “ans”.
[0145] Step ST208:
[0146] In the step ST208, whether the candidate counter indicates the number of candidates or not is judged, and when it is the number of candidates, the procedure proceeds to a step ST210, while when there are still other candidates, the procedure proceeds to the step ST209.
[0147] Step ST209:
[0148] In the step ST209, the candidate counter “k” is incremented, then, the procedure proceeds to the step ST 205.
[0149] Step ST210:
[0150] In the step ST210, the position information having the minimum distance to the pixel Ln in the current frame, that is, a value of a content ME memory (a feature address, ans) of the ME memory 23 is read and the differential coordinates are regarded as a motion vector.
[0151] Step ST211:
[0152] In the step ST211, a motion vector of the focused pixel is output.
[0153] Step ST212:
[0154] In the step ST 212, a counter variable “n” of pixels is incremented.
[0155] Step ST213:
[0156] In the step ST213, whether the focused pixel is the final pixel in the current frame or not is judged. As a result of the judgment, when it is the final pixel the procedure proceeds to a step ST214, while not, the procedure proceeds to the step ST203 to obtain a motion vector of the next pixel.
[0157] As to a pixel value, for example, when one pixel=8 bits, matching processing can be performed with full bit (8 bits) information on an image such as computer graphics (CG), while, since frames are uneven in a natural image, the matching processing is preferably performed by omitting predetermined bits from a plurality of bits. Specifically, several low bits may be masked for use or the number of bits may be decreased for re-quantization.
[0158] Namely, it is preferable to decrease the number of bits in nonlinear/linear quantization (decrease the number of quantization bits).
[0159] As explained above, according to the present first embodiment, a first frame memory 21 for storing information of a current frame Fc and outputting address information as a feature of a focused pixel together with the information of the current frame Fc; a second frame memory 12 for storing previous (one time before) frame information stored in the first frame memory 21 as information of a reference frame Fr; an ME memory 23 for converting a feature including a pixel value of a certain block range having the focused pixel at its center based on the information of the reference frame Fr stored in the second frame memory 22 and storing information including position information after the conversion; and a matching portion 24 for reading storage information in the ME memory 23 by using a feature of the focused pixel included in the information of the current frame Fc supplied from the first frame memory 21 as a feature address, calculating a distance between the focused pixel in the current frame and the feature address (position coordinates) read from the ME memory 23, and detecting the differential coordinates based on the position information having the minimum distance in a plurality of candidates as a motion vector (Vx, Vy) of the focused pixel, as a result, effects below can be obtained.
[0160] Namely, in the first embodiment, spatial pattern information in a block area is used as a feature and calculation and comparison of a distance are performed only for exactly the number of candidates, so that there is an advantage that highly accurate motion vector detection becomes possible with a smaller amount of calculation than that in the conventional method.
[0161] Note that when there are a large number of candidates, storage information in the ME memory 23 does not have to be the whole information of one frame and may be divided to some areas.
[0162] Second Embodiment
[0163]
FIG. 12 is a block diagram showing a second embodiment of a motion detection device as an image processing apparatus according to the present invention.
[0164] A different point in the present second embodiment from the first embodiment explained above is that matching with a preferable feature becomes possible by providing class generation portions 25-1 and 25-2 as feature generation means for obtaining a feature. Note that, as shown in FIG. 13, the configuration of providing one class generation portion 25 as in FIG. 13 is also possible. As a function of the circuit, it also has functions of the class generation portion 25-1 and 25-2.
[0165] The class generation portion 25-1 receives information of a current frame Fc of the first frame memory 21, generates a quantization code based on the ADRC as a feature of the current frame and outputs to a matching portion 24A.
[0166] Also, the class generation portion 25-2 receives information of a reference frame Fr of the second frame memory 22, generates a quantization code based on the ADRC as a feature of the current frame and outputs to the ME memory 23.
[0167] The matching portion 24A reads storage information of the ME memory 23 by using a feature of the current frame as a feature address and performs matching of a focused pixel in a current frame and quantization codes of the ADRC included in the feature address read from the ME memory 23 to detect a motion vector of the focused pixel.
[0168] As explained above, the ADRC is used for generating a feature in the class generation portions 25-1 and 25-2 according to the second embodiment. The ADRC (Adaptive Dynamic Range Coding) is an adaptive quantization method developed for high performance coding of a VTR (Video Tape Recorder) and is capable of effectively expressing a partial pattern of a signal level with a short word length, so that the ADRC is used for code generation of spatial class division in the second embodiment.
[0169] The ADRC is an algorithm for performing re-quantization by evenly dividing between the maximum value MAX and the minimum value MIN by a specified bit length by the formula 3 below, wherein a dynamic range of a spatial class tap is DR, a bit assignment is n, a data level of a pixel of a spatial class tap is L and a re-quantization code is Q.
DR
=MAX−MIN+1
Q
={(L−MIN+0.5)×2n÷DR} (3)
[0170] Note that { } indicates rounding processing. A matching processing flow will be omitted because it is equivalent to that using the feature as a quantization code of the ADRC in the above explanation of FIG. 11.
[0171] As an example of taking a spatial class tap, when a block size is 3×3, all pixels may be used as shown in FIG. 14A or it may be configured to be the shape of a cross, and it may be determined within a limit of an information amount given to the class code.
[0172] Similarly, as an example of the case where the block size is 5×5, embodiments shown in FIG. 14C and FIG. 14D can be applied.
[0173] The example in FIG. 14C is the case of configuring to be the shape of a cross, and the example in FIG. 14D is the case of configuring to be the shape of a cross and, furthermore, using pixels at corners.
[0174] Next, the fact that the case of using a surrounding pixel value is superior to the case of using the ADRC quantization code will be explained with reference to FIG. 15A and FIG. 15B.
[0175]
FIG. 15A and FIG. 15B show a pixel value when certain one line of an image is moved from a reference frame to a current frame for facilitating understanding. Also, FIG. 16 shows a correspondence relationship of decimal number expression and hexadecimal number expression of luminance values.
[0176] Normally, in the case of a natural image, even when the same pattern is moved, there is a low possibility to be the same pixel value and a pixel level deviates as shown in FIG. 15A and FIG. 15B.
[0177] In this case, the point is whether or not it is possible to correctly detect as the same pattern. A coding result in the case of masking lower bits to suppress an effect of noise components when using a surrounding pixel value as a feature is listed.
[0178] As described above, there are some cases of erroneous detection even being the same pattern.
[0179] On the other hand, quantization coding by the ADRC is resistible to small level fluctuations due to capability of effectively expressing a partial pattern of a signal level with a short word length, and the same coding result can be obtained.
[0180] Specifically, an ADRC code of a certain line of the reference frame is “01101”, and an ADRC code of a certain line of the current frame is also “01101”, so that the two match.
[0181] As explained above, according to the second embodiment, by using the ADRC quantization code as a feature, there is an advantage that motion vector detection at higher accuracy than conventional case becomes possible.
[0182] In the first and second embodiments, a memory device corresponding to motion vector detection was explained. Therefore, a memory device for storing spatial coordinates by using each feather quantity as an address was taken as an example.
[0183] However, the present invention can be applied to systems performing matching other than motion vector detection.
[0184] This idea can be expressed “a memory device for storing information on first data, characterized by comprising an input means for receiving as an input a first feature indicating a first feature (address) of focused data in second data being different from the above first data; and
[0185] a memory means for storing a second feature indicating a second feature (coordinates) of the above certain data in the above first data at a position corresponding to a third feature indicating the above first feature (address) of certain data in the above first data;
[0186] wherein the above second feature as the above second feature of the above first data is output from a position corresponding to the above first feature of the above memory means”.
[0187] Namely, a second feature indicating a feature B (second feature) of data may be stored by using each first feature indicating a feature A (first feature) of data as an address. The feature A and feature B can be suitably set/changed in accordance with an object of a system/device performing matching. For example, in the present embodiment, an explanation was made assuming that the feature A: a pixel value pattern/the ADRC code, and the feature B: coordinates, but other features may be also used.
INDUSTRIAL APPLICABILITY
[0188] According to a memory device, a signal processing apparatus, an image signal processing apparatus and a signal processing methods of the present invention, highly accurate motion vector detection becomes possible, so that they are applicable to processing requiring matching, such as a motion detection device and an object search device used in a moving image compression apparatus, etc.
Claims
- 1. A memory device for storing information, characterized by comprising
an input/output means for receiving or outputting said information; and a storage portion for storing said information; wherein said storage portion is at least controlled by an address corresponding to a feature of a focused data in a predetermined signal; and said information is a position information of said focused data in said predetermined signal.
- 2. A memory device as set forth in claim 1, characterized in that said memory device is a semiconductor device.
- 3. A signal processing apparatus for performing matching processing by using a first signal including a plurality of first data and a second signal including a plurality of second data, characterized by comprising:
a first feature extraction means for extracting a feature as focused data being data at a focused position in said first signal; and a storage means for storing position information of said second data at a position specified by an address corresponding to said feature of each of said second data; wherein: the position information of said second data corresponding to said focused data is obtained by reading the position information to said second data stored in said storage means at an address corresponding to said feature of said focused data.
- 4. A signal processing apparatus as set forth in claim 3, characterized in that said storage means stores the position information of said second data in advance.
- 5. A signal processing apparatus as set forth in claim 3, characterized in that said signal processing apparatus further comprises a second feature extraction means for extracting a feature of each of second data in said second signal.
- 6. A signal processing apparatus as set forth in claim 3, characterized in that said first feature extraction means further comprises a second feature extraction means for extracting a feature of each of second data in said second signal.
- 7. A signal processing apparatus as set forth in claim 3, characterized in that said first or second signal is an image signal.
- 8. A signal processing apparatus as set forth in claim 3, characterized in that said feature is extracted by using a predetermined number of first or second data.
- 9. A signal processing apparatus as set forth in claim 8, characterized in that said feature is a distribution of values of a predetermined number of first or second data.
- 10. A signal processing apparatus as set forth in claim 9, characterized in that:
said first or second data is expressed by a plurality of bits; and said feature is a distribution of values of said predetermined number of first or second data wherein predetermined bits are omitted from said plurality of bits.
- 11. A signal processing apparatus as set forth in claim 10, characterized in that said predetermined bits are lower side bits of said first or second data.
- 12. A signal processing apparatus as set forth in claim 8, characterized in that said feature is a distribution of values of predetermined number of re-quantized first or second data.
- 13. A signal processing apparatus as set forth in claim 8, characterized in that said feature is a distribution of values of predetermined number of first or second data subjected to adaptive quantization.
- 14. A signal processing apparatus as set forth in claim 13, characterized in that said adaptive quantization is an ADRC.
- 15. A signal processing apparatus as set forth in claim 3, characterized in that, when a plurality of position information of said second data corresponding to a feature of said focused data are read from said storage means, said position information having the shortest distance to a position of said focused data is regarded as position information of said second data corresponding to said focused data.
- 16. An image signal processing apparatus for detecting motion vectors by using a first image signal including a plurality of first data and a second image signal including a plurality of second data, characterized by comprising:
a first feature extraction means for extracting a feature as focused data being data at a focused position in said first image signal; a storage means for storing position information of said second data at a position specified by an address corresponding to said feature of each of said second data; and a motion vector calculation means for obtaining the position information of said second data corresponding to said focused data by reading the position information of said second data stored in said storage means at an address corresponding to said feature of said focused data and calculating a motion vector of said focused data by using position information of said focused data and the obtained position information of said second data.
- 17. An image signal processing apparatus as set forth in claim 16, characterized in that
said image signal processing apparatus further comprises a second feature extraction means for extracting a feature of each of second data in said second image signal.
- 18. An image signal processing apparatus as set forth in claim 16, characterized in that
said first feature extraction means further comprises a second feature extraction means for extracting a feature of each of second data in said second image signal.
- 19. An image signal processing apparatus as set forth in claim 16, characterized in that
when a plurality of position information of said second data corresponding to a feature of said focused data are read from said storage means, said motion vector calculation means regards said position information having the shortest distance to a position of said focused data as position information of said second data corresponding to said focused data.
- 20. A signal processing method for performing matching processing by using a first signal including a plurality of data and a second signal including a plurality of second data, characterized by comprising:
a first step for extracting a feature as focused data being data at a focused position in said first signal; a second step for storing position information of said second data at a position specified by an address corresponding to said feature of each of said second data; and a third step for obtaining the position information of said second data corresponding to said focused data by reading the position information to said second data stored in said second step at an address corresponding to said feature of said focused data.
- 21. A signal processing method as set forth in claim 20, further including a fourth step for extracting a feature of each of second data in said second signal.
- 22. A signal processing method as set forth in claim 20, characterized in that
said first step further includes a fourth step for extracting a feature of each of second data in said second signal.
- 23. A signal processing method as set forth in claim 20, characterized in that said first or second signal is an image signal.
- 24. A signal processing method as set forth in claim 20, characterized in that said feature is extracted by using a predetermined number of first or second data.
- 25. A signal processing method as set forth in claim 24, characterized in that said feature is a distribution of values of a predetermined number of first or second data
- 26. A signal processing method as set forth in claim 25, characterized in that:
said first or second data is expressed by a plurality of bits; and said feature is a distribution of values of said predetermined number of first or second data wherein predetermined bits are omitted from said plurality of bits.
- 27. A signal processing method as set forth in claim 26, characterized in that said predetermined bits are lower side bits of said first or second data.
- 28. A signal processing method as set forth in claim 24, characterized in that said feature is a distribution of values of predetermined number of re-quantized first or second data.
- 29. A signal processing method as set forth in claim 24, characterized in that said feature is a distribution of values of a predetermined number of first or second data subjected to adaptive quantization.
- 30. A signal processing method as set forth in claim 29, characterized in that said adaptive quantization is an ADRC.
- 31. A signal processing device as set forth in claim 20, characterized in that, when a plurality of position information of said second data corresponding to a feature of said focused data are read from said storage means, said position information having the shortest distance to a position of said focused data is regarded as position information of said second data corresponding to said focused data.
- 32. An image signal processing method for detecting a motion vector by using a first image signal including a plurality of first data and a second image signal including a plurality of second data, characterized by comprising:
a first step for extracting a feature as a focused data being data of a focused position in said first images signal; a second step for storing position information of said second data at a position specified by an address corresponding to said feature of each of said second data; a third step for obtaining position information of said second data corresponding to said focused data by reading position information to said second data stored in said storage means at an address corresponding to said feature of said focused data; and a fourth step for calculating a motion vector of said focused data by using position information of said focused data and the obtained position information of said second data.
- 33. An image signal processing method as set forth in claim 32, characterized in that said image signal processing method further includes a fifth step for extracting a feature of each of second data in said second image signal.
- 34. An image signal processing method as set forth in claim 32, characterized in that said first step further includes a fifth step for extracting a feature of each of second data in said second image signal.
- 35. An image signal processing method as set forth in claim 32, characterized in that, when a plurality of position information of said second data corresponding to a feature of said focused data is read from said storage means, said third step regards said position information having the shortest distance to a position of said focused data as position information of said second data corresponding to said focused data.
Priority Claims (1)
Number |
Date |
Country |
Kind |
2002-222008 |
Jul 2002 |
JP |
|
PCT Information
Filing Document |
Filing Date |
Country |
Kind |
PCT/JP03/09637 |
7/30/2003 |
WO |
|