The present invention relates to image processing devices having a function of identifying a position in a captured image pointed at with an image capture object by using image data for the captured image.
It is well known that image display devices built around various devices, such as mobile phones or PDAs (Personal Digital Assistants), and equipped with a liquid crystal display device as an image display section (hereinafter, “liquid crystal display devices”) are in popular use. Especially, the PDA traditionally contains touch sensors to enable a touch input whereby the user can input information by directly touching the liquid crystal display device with, for example, a finger. It is expected that broad ranges of mobile phones and like devices will also adopt a liquid crystal display device which come with touch sensors.
Patent Literature 1 discloses technology as an example of the liquid crystal display device incorporating touch sensors.
This conventional liquid crystal display device primarily includes an edge detection circuit, a touch/non-touch determining circuit, and a coordinate calculation circuit. The edge detection circuit is adapted to detect an edge of a captured image to obtain an edge image.
The touch/non-touch determining circuit is adapted to determine from the edge image obtained by the edge detection circuit whether or not an object has touched a display screen. The touch/non-touch determining circuit is adapted to examine the direction of motion of each edge (temporal changes of the coordinates of each edge) of the object and if there are edges moving in opposite directions, determines that the object has touched the display screen. This is an exploitation of the fact that the edges do not move in opposite directions unless the object is in contact with something. Specifically, the circuit is adapted to improve precision in the determination by so determining when the amount of motion in opposite directions is greater than or equal to a predetermined threshold.
Furthermore, the coordinate calculation circuit is adapted to calculate the center of mass of the edge as the coordinate position of the object when the object is determined to have come in contact with the surface. The circuit is thus prevented from calculating the coordinate position before the object comes into contact, allowing for improvement of precision in the calculation of the position.
The conventional liquid crystal display device, however, needs to retain image data or edge data throughout two or more frames because the circuit uses the edges moving in opposite directions (object in an image changing with time) in order to detect a touch/non-touch.
The touch/non-touch detection thus requires information for at least two frames or even more, which in turn disadvantageously requires large memory.
Another problem is that the identification of the touch position is time-consuming because the device is adapted to calculate the center of mass of the edge as the coordinate position of the object when the object is determined to have come in contact with the surface so that the coordinate position of the object can be calculated after the touch/non-touch detection.
Furthermore, in the conventional liquid crystal display device, the amount of image processing can be possibly reduced to allow for reducing the time taken by the calculation of the coordinate position of the object, by reducing the resolution of the captured image. That will however entail another problem of poor precision in the coordinate position detection because the center of mass of the edge is calculated as the coordinate position of the object.
Patent Literature 1
Japanese Patent Application Publication, Tokukai, No. 2006-244446 (Publication Date: Sep. 14, 2006)
Patent Literature 2
Japanese Patent Application Publication, Tokukai, No. 2004-318819 (Publication Date: Nov. 11, 2004)
Patent Literature 3
Japanese Patent Application Publication, Tokukai, No. 2007-183706 (Publication Date: Jul. 19, 2007)
The present invention, conceived in view of these conventional problems, has an objective of providing an image processing device, etc. capable of detection of a position in a captured image pointed at with an image capture object with small memory and short processing time by performing pattern matching using image data for only one frame, irrespective of detection of a touch/non-touch of the captured image with the image capture object, and also capable of both reduction in the amount of image processing in the calculation of the coordinate position and maintaining of precision in the coordinate position detection with small memory.
The image capture device in accordance with the present invention is, to address the problems, characterized in that it is an image processing device having a function of calculating a position in a captured image pointed at with an image capture object by using image data for the captured image, the device including:
gradient calculation means for calculating, for each pixel in the image data, a vertical-direction gradient quantity and a horizontal-direction gradient quantity for a pixel value of that pixel from the pixel value and pixel values of adjoining pixels;
gradient direction identifying means for identifying, for each pixel, either a gradient direction or null direction based on the vertical-direction gradient quantity and the horizontal-direction gradient quantity calculated by the gradient calculation means, the pixel having null direction if both the vertical-direction gradient quantity and the horizontal-direction gradient quantity or a gradient magnitude calculated from the vertical-direction gradient quantity and the horizontal-direction gradient quantity is less than a predetermined threshold;
correspondence degree calculation means for matching a matching region with a predetermined model pattern, the matching region being a region, around a target pixel, containing a predetermined number of pixels, and for calculating a number of pixels for which a gradient direction for each pixel in the matching region matches a gradient direction for each pixel in the model pattern as an correspondence degree which is a degree of matching of the matching region with the model pattern;
peak pixel search means for searching a search area containing a predetermined number of pixels around the target pixel for a peak pixel which is a pixel for which the correspondence degree calculated by the correspondence degree calculation means is a maximum; and
coordinate calculation means for calculating the position in the captured image pointed at with the image capture object by using the correspondence degree for each pixel in a peak pixel region which is a region containing a predetermined number of pixels centered around the peak pixel found by the peak pixel search means.
The method of controlling an image capture device in accordance with the present invention is characterized in that it is a method of controlling an image processing device having a function of calculating a position in a captured image pointed at with an image capture object by using image data for the captured image, the method including:
the gradient calculation step of calculating, for each pixel in the image data, a vertical-direction gradient quantity and a horizontal-direction gradient quantity for a pixel value of that pixel from the pixel value and pixel values of adjoining pixels;
the gradient direction identifying step of identifying, for each pixel, either a gradient direction or null direction based on the vertical-direction gradient quantity and the horizontal-direction gradient quantity calculated in the gradient calculation step, the pixel having null direction if both the vertical-direction gradient quantity and the horizontal-direction gradient quantity or a gradient magnitude calculated from the vertical-direction gradient quantity and the horizontal-direction gradient quantity is less than a predetermined threshold;
the correspondence degree calculation step of matching a matching region with a predetermined model pattern, the matching region being a region, around a target pixel, containing a predetermined number of pixels, and for calculating a number of pixels for which a gradient direction for each pixel in the matching region matches a gradient direction for each pixel in the model pattern as an correspondence degree which is a degree of matching of the matching region with the model pattern;
the peak pixel search step of searching a search area containing a predetermined number of pixels around the target pixel for a peak pixel which is a pixel for which the correspondence degree calculated by the correspondence degree calculation means is a maximum; and
the coordinate calculation step of calculating the position in the captured image pointed at with the image capture object by using the correspondence degree for each pixel in a peak pixel region which is a region containing a predetermined number of pixels centered around the peak pixel found in the peak pixel search step.
According to the configuration or method, the gradient calculation means or step calculates, for each pixel in the image data, a vertical-direction gradient quantity and a horizontal-direction gradient quantity for a pixel value of that pixel from the pixel value and pixel values of adjoining pixels.
The gradient direction identifying means or step identifies, for each pixel, either a gradient direction or null direction based on the vertical-direction gradient quantity and the horizontal-direction gradient quantity calculated by the gradient calculation means or in the gradient calculation step, the pixel having null direction if both the vertical-direction gradient quantity and the horizontal-direction gradient quantity or a gradient magnitude calculated from the vertical-direction gradient quantity and the horizontal-direction gradient quantity is less than a predetermined threshold.
Having null direction is defined here as “being less than a predetermined threshold.” Alternatively, it may be defined as “being less than or equal to a predetermined threshold.”
The advance labeling as “having null direction” limits occurrences of numerous unwanted gradient directions which would otherwise be caused by noise and other factors. The advance labeling also leads to reducing matching targets to gradient directions near the edge, allowing for more efficient matching.
The vertical-direction gradient quantity, the horizontal-direction gradient quantity, the gradient direction, the gradient magnitude, etc. for the pixel value are quantities obtained from a single-frame captured image. In addition, these quantities are obtainable irrespective of detection of a touch/non-touch of the captured image with the image capture object.
Next, the correspondence degree calculation means or step matches a matching region with a predetermined model pattern, the matching region being a region, around a target pixel, containing a predetermined number of pixels, and calculates an correspondence degree which is a degree of matching of the matching region with the model pattern from a number of pixels for which a gradient direction contained in the matching region matches a gradient direction contained in the model pattern.
A scalar quantity, such as a pixel value (density level), could possibly be used as the quantity used in the matching of a matching region with a predetermined model pattern (hereinafter, may be referred to as the “pattern matching”). It is however difficult to set up model patterns in advance because the scalar quantity, even when quantized (values within a predetermined range are treated by equally regarding them as a particular constant), is ever variable depending on, for example, the condition of the image capture object.
Meanwhile, the gradient of the pixel value is a vector quantity with both a magnitude (gradient magnitude) and a direction (gradient direction). Especially, the gradient direction (orientation), for example, when quantized into 8 directions, enables discretization of any potential states for the pixels with as few as 8 states (or 9 if null direction is included), which is an extremely small number. Furthermore, the discretized states render different directions readily distinguishable.
The gradient directions generally match a direction either from an edge part in the captured image to near the center of an area surrounded by the edge part or radially from near the center toward the edge part, for example, for the finger surface or like soft surface which forms a round contact face upon contact with another surface and for the round-tipped pen or like surface which forms a round contact face despite its hardness. For contact faces of other shapes, the gradient directions again generally match a direction either from an edge part in the captured image to the inside of an area surrounded by the edge part or from the inside of an area surrounded by an edge part toward the outside of the area. This tendency does not change much with the condition of the image capture object, for example. The gradient direction is hence a suitable quantity for pattern matching.
Hence, pattern matching using image data for only one frame becomes possible irrespective of detection of a touch/non-touch of the captured image with the image capture object. That in turn enables pattern matching with small memory and short processing time.
The peak pixel search means searches a search area containing a predetermined number of pixels around the target pixel for a peak pixel which is a pixel for which the correspondence degree calculated by the correspondence degree calculation means is a maximum.
According to the configuration, a search area containing a predetermined number of pixels is searched. Hence, the processing cost and the memory size are reduced when compared to searching the image data region containing the total pixel count for a peak pixel.
The coordinate calculation means calculates the position in the captured image pointed at with the image capture object by using the correspondence degree for each pixel in a peak pixel region which is a region containing a predetermined number of pixels centered around the peak pixel found by the peak pixel search means. For example, when the pointing position is to be obtained from the position of the center of mass of the edge image, the calculation would become increasingly difficult with deformation of the captured image.
However, in the image processing device in accordance with the present invention, the position in the captured image pointed at with the image capture object is calculated by using the correspondence degree for each pixel in the peak pixel region obtained by pattern matching. Even if the captured image is deformed, the neighborhood of a maximum of the correspondence degree in the pattern matching would be regarded as exhibiting a substantially similar tendency in distribution to the tendency before the deformation where the correspondence degree decreases radially from the neighborhood of the maximum.
Therefore, the position in the captured image pointed at with the image capture object can be calculated by predetermined procedures (for example, calculation of a center of mass for the correspondence degree in the peak pixel region) regardless of whether or not the captured image is deformed. Hence, the amount of image processing, the processing cost, and the memory size are all reduced in the calculation of the pointing position while maintaining precision in the coordinate position detection.
Hence, the image processing device, as an example, is provided which, irrespective of detection of a touch/non-touch of the captured image with the image capture object, can detect the position in the captured image pointed at with the image capture object with small memory and short processing time by performing the pattern matching using image data for only one frame and which can also both reduce the amount of image processing in the calculation of the coordinate position and maintain precision in the coordinate position detection with small memory.
Additional objectives, advantages and novel features of the invention will be set forth in part in the description which follows, and in part will become apparent to those skilled in the art upon examination of the following or may be learned by practice of the invention.
b) depicts the pattern shown in
a) depicts an exemplary model pattern subsequent to matching efficiency improvement in a dark environment.
b) depicts an exemplary model pattern subsequent to matching efficiency improvement in a bright environment.
b) depicts an exemplary correspondence degree calculation method for the pattern matching.
b) depicts another exemplary pattern correspondence degree calculation process.
a) depicts an exemplary pattern correspondence degree calculation process.
b) depicts another exemplary pattern correspondence degree calculation process.
c) depicts a further exemplary pattern correspondence degree calculation process.
a) depicts still another exemplary pattern correspondence degree calculation process.
b) depicts yet another exemplary pattern correspondence degree calculation process.
c) depicts further yet another exemplary pattern correspondence degree calculation process.
a) depicts the operation in the case of the coordinate calculation determining section in the image processing device determining that there is no peak pixel.
b) depicts the operation in the case of the coordinate calculation determining section in the image processing device determining that there is a peak pixel.
The following will describe an embodiment of the present invention in reference to
First, referring to
First, an overview of the configuration of the image processing device 1 and an image capturing mechanism for the image processing device 1 will be described. The image processing device 1 is similar to general liquid crystal display devices in that the former has a display function and includes a liquid crystal display device (display device) containing a plurality of pixels and a backlight illuminating the liquid crystal display device.
The liquid crystal display device in the image processing device 1 differs from general liquid crystal display devices in that the former contains a built-in light sensor (image capture sensor) in each pixel so that it can capture, by the light sensors, an image of, for example, an external object (image capture object) approaching the display screen of the liquid crystal display device and acquire as image data (image data produced by the image capture sensors).
The liquid crystal display device may contain a built-in light sensor in each of a predetermined number of all the pixels. Preferably, however, each of all the pixels includes a built-in light sensor for better captured image resolution obtained with the light sensors.
The liquid crystal display device in the image processing device 1, as in a general liquid crystal display device, includes a display section containing a plurality of scan lines and a plurality of signal lines intersecting the plurality of scan lines, pixels with various capacitances formed at the intersections, and thin film transistors and further includes driver circuits driving the scan lines and driver circuits driving the signal lines.
The liquid crystal display device in the image processing device 1 is adapted to contain a built-in photodiode (image capture sensor) in, for example, each pixel as an image capture sensor. The photodiode is connected to a capacitor and adapted to change the electric charge of the capacitor according to changes in quantity of the light that is incident to the display screen and received by the photodiode. Voltage across both ends of the capacitor is detected to generate image data for image capturing (acquiring). This is the image capturing mechanism by the liquid crystal display device in the image processing device 1.
The image capture sensor is not limited to a photodiode and may be anything that relies on photoelectric effect for its operation and that can be built in each pixel in, for example, the liquid crystal display device.
In this configuration, the image processing device 1 is adapted to have, in addition to an inherent display function by which the liquid crystal display device displays images, an image capture function by which the display device captures images of an external object (image capture object) approaching the display screen. The image processing device can hence be adapted to enable a touch input on the display screen of the display device.
Now, referring to
a) depicts image capturing for a finger pad in a dark environment.
The captured image 61 in
Next,
In this case, the captured image 62 in
e) depicts image capturing for a pen tip in a dark environment.
In this case, the captured image 63 in
Next,
In this case, the captured image 64 in
These gradient directions generally match a direction either from an edge part in the captured image to near the center of an area surrounded by the edge part or radially from near the center toward the edge part, for example, for the finger surface or like soft surface which forms a round contact face upon contact with another surface and for the round-tipped pen or like surface which forms a round contact face despite its hardness. For contact faces of other shapes, the gradient directions again generally match a direction either from an edge part in the captured image to the inside of an area surrounded by the edge part or from the inside of an area surrounded by an edge part toward the outside of the area. This tendency does not change much with the condition of the image capture object, for example. The gradient direction is hence a suitable quantity for pattern matching.
Next, referring to
The image processing device 1 has a function of identifying a position in a captured image pointed at with an image capture object from image data for the captured image as illustrated in
The resolution reduction section 2 reduces the resolution of image data for a captured image.
The pixel-value vertical-gradient-quantity calculation section 3a and the pixel-value horizontal-gradient-quantity calculation section 3b calculates, for each pixel in the image data, a vertical-direction gradient quantity and a horizontal-direction gradient quantity for a pixel value of a target pixel from the pixel value of the target pixel and the pixel values of adjoining pixels. Specifically, an edge extraction operator, such as the Sobel operator or the Prewitt operator, may be used.
As an example, the Sobel operator is described. The local vertical-direction gradient Sy and the horizontal-direction gradient Sx at pixel position x(i,j) of a pixel are given by a pair of equations (1) below:
S
x
=x
i+1j−1
−x
i−1j−1+2xi+1j−2xi−1j+xi+1j+1−xi−1j+1
S
y
=x
i−1j+1
−x
i−1j−1+2xij+1−2xij−1+xi+1j+1−xi+1j−1 (1)
where xij is the pixel value at pixel position x(i,j), i is the position of the pixel in the horizontal direction, j is the position of the pixel in the vertical direction, and i and j are positive integers.
Equations (1) are equivalent to applying the 3×3 Sobel operators (matrix operators Sx and Sy) in equations (2) and (3) to 3×3 pixels including the target pixel at pixel position x(i,j)
From the vertical-direction gradient Sy and the horizontal-direction gradient Sx, the gradient magnitude ABS(S) and the gradient direction ANG(S) at pixel position x(i,j) are given below. Note that throughout the following description, the vertical-direction gradient quantity and the horizontal-direction gradient quantity obtained by applying the vertical-direction gradient Sy and the horizontal-direction gradient Sx as operators to a pixel may be called respectively as the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx for convenience.
ABS(S)=(Sx2+Sy2)1/2 (4)
ANG(S)=tan−1(Sy/Sx) (5)
The edge extraction section 4 extracts (identifies) edge pixels (first edge pixels), or pixels in an edge part in the captured image, from results of calculation of the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx for the pixels performed by the pixel-value vertical-gradient-quantity calculation section 3a and the pixel-value horizontal-gradient-quantity calculation section 3b.
An edge pixel is a pixel forming a part (edge) of the image data at which brightness changes abruptly. More specifically, an edge pixel is a pixel for which both the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx or the gradient magnitude ABS(S) is greater than or equal to a predetermined first threshold.
The purpose of extracting the first edge pixels is to enable the gradient direction/null direction identifying section 5 to identify a gradient direction for the extracted first edge pixels and to regard and identify all the pixels that are not the first edge pixels as equally having null direction.
The important information in pattern matching is the gradient direction for the first edge pixels in the edge part.
Therefore, by regarding the gradient direction for pixels of relatively low importance as equally having null direction, the pattern matching efficiency is further improved. This scheme also reduces memory size and processing time in detecting a position in the captured image pointed at with an image capture object (discussed later), further reducing the cost for the detection of the pointing position.
Apart from the function above, the edge extraction section 4 has a function of generating an edge mask. The edge mask is binary data obtained by binarization of the image data generated by, for example, specifying a second threshold greater than the first threshold and setting the gradient magnitude ABS(S) calculated from the vertical-direction gradient quantity and the horizontal-direction gradient quantity to 1 when the gradient magnitude ABS(S) is in excess of (or greater than or equal to) the second threshold and 0 when the gradient magnitude ABS(S) is less than or equal to (or less than) the second threshold. This edge mask is referenced to identify the pixels at positions with a gradient magnitude ABS(S) of 1 as the second edge pixels.
The gradient direction/null direction identifying section 5 is adapted to identify a gradient direction for the extracted second edge pixels and to regard and identify the pixels that are not the second edge pixels as equally having null direction.
Alternatively, of the first edge pixels extracted based on the first threshold, those first edge pixels located at the positions where the edge mask value is 1 may be regarded as being valid, and those first edge pixels located at the positions where the edge mask value is 0 as being invalid so that the valid first edge pixels can be selected for pattern matching.
The gradient direction/null direction identifying section 5 identifies, for each pixel, either a gradient direction ANG(S) or null direction where both the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx or the gradient magnitude ABS(S) is less than the predetermined threshold, from the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx calculated by the pixel-value vertical-gradient-quantity calculation section 3a and the pixel-value horizontal-gradient-quantity calculation section 3.
Having null direction is defined here as “being less than a predetermined threshold.” Alternatively, it may be defined as “being less than or equal to a predetermined threshold.”
The advance labeling as “having null direction” limits occurrences of numerous unwanted gradient directions which would otherwise be caused by noise and other factors. The advance labeling also leads to reducing matching targets to gradient directions near the edge, allowing for more efficient matching.
Preferably, the gradient direction/null direction identifying section 5 identifies a gradient direction for the edge pixels identified by the edge extraction section 4 and identifies the pixels that are not the edge pixels by regarding those pixels as having null direction. It may be said that the important information in pattern matching is the gradient direction for the edge pixels in the edge part.
Therefore, by regarding the gradient direction for pixels of relatively low importance as equally having null direction in pattern matching, the pattern matching efficiency is further improved.
The gradient direction ANG(S) is a continuous quantity varying from 0 rad to 2π rad. In the present embodiment, the gradient direction ANG(S) is quantized into 8 directions which will be used as gradient directions, or the characteristic quantity (hereinafter, may be referred to as the “characteristic quantity”), for use in pattern matching. The gradient direction ANG(S) may be quantized into 16 directions for higher precision pattern matching. A specific process for quantization of direction will be detailed later. By quantization of direction, it is meant that the gradient direction ANG(S) within a predetermined range is treated by equally regarding it as a particular gradient direction.
The matching efficiency improving section 6 allows for more efficient matching of a matching region which is a region, around the target pixel, containing a predetermined number of pixels with a predetermined model pattern (hereinafter, may be referred to as the “pattern matching”).
The matching pixel count calculation section 7, for example, matches the matching region with the model pattern to calculate the number of pixels for which the gradient direction contained in the matching region matches the gradient direction contained in the model pattern (hereinafter, the “matching pixel count”).
The model pattern and comparative matching pattern storage section 8 stores the model patterns and the comparative matching patterns predetermined by analyzing matching patterns between the gradient direction for each pixel in the matching region and the gradient direction for each pixel in the model pattern. The model pattern and comparative matching pattern storage section 8 may be, for example, a tape, such as a magnetic tape or a cassette tape; a magnetic disk, such as a floppy® disk or a hard disk, or an optical disc, such as a CD-ROM/MO/MD/DVD/CD-R; a card, such as an IC card (memory card) or an optical card; or a semiconductor memory, such as a mask ROM/EPROM/EEPROM/flash ROM.
The pattern correspondence degree calculation section 9 calculates a pattern correspondence degree which is a degree of similarity of the matching pattern between the gradient direction for each pixel in the matching region and the gradient direction for each pixel in the model pattern to the predetermined comparative matching pattern.
The score calculation section 10 calculates an correspondence degree which is a degree of matching of the matching region with the model pattern from the matching pixel count calculated by the matching pixel count calculation section 7 and the pattern correspondence degree calculated by the pattern correspondence degree calculation section 9. The score calculation section 10 may be adapted to use either one of the matching pixel count calculated by the matching pixel count calculation section 7 and the pattern correspondence degree calculated by the pattern correspondence degree calculation section 9.
The score calculation section 10 may be adapted to calculate the correspondence degree if the number of types of corresponding gradient directions in the matching region is greater than or equal to a preset value.
The gradient direction has the general tendency described above. The tendency does not change much with the condition of the image capture object, for example. Therefore, for example, if the number of types of gradient directions is 8, the number of types of matching gradient directions in pattern matching should be close to 8. Hence, if the correspondence degree is calculated when the number of types of corresponding gradient directions in the matching region is greater than or equal to a preset value, the detection of the pointing position requires smaller memory and less processing time. That in turn further reduces the cost for the detection of the pointing position.
The light entering the built-in image capture sensors in the liquid crystal display device may be a mixture of reflection of the backlight and external light coming from the outside.
When this is the case, it is difficult to separate effects of the reflection of the backlight and effects of the external light coming from the outside from the captured image.
In backlight reflection base, the image obtained from the reflection of the backlight off the image capture object shows a blurred white round figure, for example, for a finger pad. Accordingly, in this case, the first threshold is set to a relatively low value so that the edge extraction section 4 can identify the first edge pixels.
On the other hand, in shadow base, the captured image is blurred (low contrast) if the image capture object (for example, the finger pad) is positioned off the panel surface (non-touch) and sharp (high contrast) if the image capture object is in contact with the panel surface. Therefore, in shadow base, the second threshold is set to a relatively high value so that the edge extraction section 4 can identify the second edge pixels in accordance with a more stringent edge determining standard than for the first threshold.
Pattern matching is thus carried out between the image data in which the first edge pixels are identified and a first model pattern predetermined in backlight reflection base and also between the image data in which the second edge pixels are identified and a second model pattern predetermined in shadow base, to obtain the first number of pixels and the second number of pixels. In this case, the score calculation section 10 can use, for example, the sum of the first number of pixels and the second number of pixels as the correspondence degree.
The score calculation section 10, as discussed above, calculates the correspondence degree from the first number of pixels for which the gradient directions of the first edge pixels contained in the matching region match the gradient directions contained in the predetermined first model pattern and the second number of pixels for which the gradient directions of the second edge pixels contained in the matching region match the gradient directions contained in the predetermined second model pattern.
Therefore, this single configuration can carry out processes compatible with both backlight reflection base and shadow base without switching the processes between backlight reflection base and shadow base. The embodiment hence provides an image processing device capable of identifying the position pointed at with the image capture object both under good and poor illumination.
The position identifying section 11 identifies the position in the captured image pointed at with the image capture object from the position of a pixel for which the correspondence degree calculated by the score calculation section 10 is a maximum (hereinafter, “peak pixel”). The section 11 includes a peak search section (peak pixel identifying means, position identifying means) 12, a coordinate calculation determining section (coordinate calculation determining means, position identifying means) 13, and a coordinate calculation section (coordinate calculation means, position identifying means) 14.
The peak search section 12 searches a search area containing a predetermined number of pixels around the target pixel (hereinafter, may be referred to as “first area”) for a peak pixel which is a pixel for which the correspondence degree calculated by the score calculation section 10 is a maximum.
The coordinate calculation determining section 13 causes the coordinate calculation section 14 to calculate the position in the captured image pointed at with the image capture object if the section 13 has determined that the peak pixel found by the peak search section 12 is present in a sub-area which contains a predetermined number of pixels that is less than the number of pixels in the search area and which is also completely enclosed in the search area (hereinafter, may be referred to as “second area”).
The coordinate calculation section 14 calculates the position in the captured image pointed at with the image capture object by using the correspondence degree for each pixel in a peak pixel region which is a region containing a predetermined number of pixels centered around the peak pixel found by the peak search section 12.
In the configuration discussed above, the pixel-value vertical-gradient-quantity calculation section 3a and the pixel-value horizontal-gradient-quantity calculation section 3b calculate, for each pixel in the image data, the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx from the pixel value for that pixel and the pixel values of adjoining pixels to the pixel.
In addition, from the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx calculated by the pixel-value vertical-gradient-quantity calculation section 3a and the pixel-value horizontal-gradient-quantity calculation section 3b, the gradient direction/null direction identifying section 5 identifies, for each pixel, either a gradient direction (direction quantized according to ANG(S) value; similar description will be omitted in the following) or null direction where both the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx or the gradient magnitude ABS(S) calculated from the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx is less than the predetermined threshold.
The vertical-direction gradient quantity Sy, the horizontal-direction gradient quantity Sx, the gradient direction, the gradient magnitude ABS(S), etc. for the pixel value are quantities obtained from a single-frame captured image. In addition, these quantities are obtainable irrespective of detection of a touch/non-touch of the captured image with the image capture object.
Next, the score calculation section 10 matches the matching region with the model pattern to calculate the correspondence degree which is a degree of matching of the matching region with the model pattern from the number of pixels (matching pixel count) for which the gradient direction contained in the matching region matches the gradient direction contained in the model pattern.
A scalar quantity, such as a pixel value (density level), could possibly be used as the quantity used in the matching of a matching region with a predetermined model pattern (pattern matching). It is however difficult to set up model patterns in advance because the scalar quantity, even when quantized (values within a predetermined range are treated by equally regarding them as a particular constant), is ever variable depending on, for example, the condition of the image capture object.
Meanwhile, the gradient of the pixel value is a vector quantity with both magnitude (gradient magnitude ABS(S)) and direction (gradient direction ANG(S)). Especially, the gradient direction (orientation), for example, when quantized into 8 directions, enables discretization of any potential states for the pixels with as few as 8 states (or 9 if null direction is included), which is an extremely small number. Furthermore, the discretized states render different directions readily distinguishable.
The gradient direction has the general tendency described above. The tendency does not change much with the condition of the image capture object, for example. The gradient direction is hence a suitable quantity for pattern matching.
Pattern matching is therefore possible by using image data for only one frame, irrespective of detection of a touch/non-touch of the captured image with the image capture object. Pattern matching is thus possible with small memory and short processing time.
Next, the position identifying section 11 identifies the position in the captured image pointed at with the image capture object from the position of the target pixel (peak pixel) for which the correspondence degree calculated by the score calculation section 10 is a maximum.
The gradient direction has the general tendency described above. Therefore, the neighborhood of the maximum of the correspondence degree would be regarded as indicating the neighborhood of the position in the captured image pointed at with the image capture object. Therefore, taking the tendency of the gradient direction into consideration, by setting up model patterns in advance for each image capture object (for example, for each illumination environment (bright or dark) for an image capture object for which the gradient direction is distributed like a doughnut in the image data or for each size of the image capture object (for example, the finger pad is large, whereas the pen tip small)), the position in the captured image pointed at with the image capture object can be identified from the position of the peak pixel obtained in the pattern matching.
Hence, the image processing device 1, as an example, is provided which, irrespective of detection of a touch/non-touch of the captured image with the image capture object, can detect the position in the captured image pointed at with the image capture object with small memory and short processing time by using image data for only one frame.
Next, referring to
The configuration is the same as in 1. Configuration of Image Processing Device (Electronic Apparatus) except those points raised in 2. Overview of Operation of Image Processing Device (Electronic Apparatus). For convenience in description, members of the present embodiment that have the same function as members depicted in the drawings referred to in 1. Configuration of Image Processing Device (Electronic Apparatus) are indicated by the same reference numerals and description thereof is omitted. The following description is, where necessary, divided into distinct sections, under which these special notes will not be repeated.
The resolution should be reduced as much as possible for high speed processing. To obtain necessary edge and other information, however, a preferred resolution reduction limit for 320×240 pixel (150 dpi) image data, as an example, is 80×60 pixels (resolution reduction ratio=¼). In addition, for high precision processing, the resolution is better not reduced at all, or if reduced to any extent, should not go below 160×120 pixels (resolution reduction ratio=½).
This image data resolution reduction allows for reduction in processing cost, memory size, and processing time in the pattern matching.
In S102, the pixel-value vertical-gradient-quantity calculation section 3a and the pixel-value horizontal-gradient-quantity calculation section 3b calculate the vertical-direction gradient quantity Sy and the horizontal-direction gradient quantity Sx for each pixel in the image data. Then, after the gradient direction/null direction identifying section 5 completes up to either the identifying of a gradient direction or the labeling as having null direction for each pixel (gradient direction/null direction identification process), the operation proceeds to S103.
In S103, for the case the matching efficiency improving section 6 matches the matching region with the model pattern, it is selected whether or not the matching efficiency for the matching region and the model pattern (matching efficiency improvement) is to be improved. If the matching efficiency improvement is to be carried out (Yes), the operation proceeds to S104 where the matching efficiency improving section 6 carries out the matching efficiency improvement before further proceeding to S105. If the matching efficiency improvement is not to be carried out (No), the operation continues at S107 where the matching efficiency improving section 6 performs no process at all on the data (image data, or if the resolution reduction section 2 has performed the resolution reduction, post-resolution-reduction image data), thereby leaving the data unchanged, before the operation further proceeding to S105.
The matching pixel count calculation section 7, in S105, matches the matching region with the model pattern to calculate the matching pixel count, and the pattern correspondence degree calculation section 9 calculates the pattern correspondence degree. Then, after the score calculation section 10 completes up to the calculating of the correspondence degree from the matching pixel count calculated by the matching pixel count calculation section 7 and the pattern correspondence degree calculated by the pattern correspondence degree calculation section 9 (pattern matching process), the operation proceeds to S106.
In S106, the position identifying section 11 identifies the position in the captured image pointed at with the image capture object from the position of a pixel for which the correspondence degree calculated by the score calculation section 10 is a maximum (hereinafter, “peak pixel”) (pointing position identification process), thereby ending the operation.
That is an overview of the entire operation of the image processing device 1. The following is a description of the operation of the image processing device 1 in the gradient direction/null direction identification process, the matching efficiency improvement, the pattern matching process, and the pointing position identification process.
First, referring to
In the flow chart in
In S201, the edge extraction section 4 determines whether or not the gradient magnitude ABS(S) (“gradient power” in
If the operation has proceeded to S210, the gradient direction/null direction identifying section 5 labels (identifies) a target pixel (pixel that is not the first edge pixels) as having null direction and moves to a next pixel before the operation returns to S201.
In S202, the gradient direction/null direction identifying section 5 determines whether or not the horizontal-direction gradient quantity Sx is 0. If Sx≠0, the operation returns to S203 (Yes); if Sx=0, the operation returns to S206 (No).
The gradient direction/null direction identifying section 5, in S203, determines whether or not the horizontal-direction gradient quantity Sx is positive. If Sx>0, the operation returns to S204 (Yes). Then, in accordance with the table in
Next, in S206, the gradient direction/null direction identifying section 5 determines whether or not the vertical-direction gradient quantity Sy is 0. If Sy≠0, the operation proceeds to S207 (Yes); if Sy=0, the operation proceeds to S210 (No) where the pixel (pixel that is neither the first edge pixels nor the second edge pixels) is labelled as having null direction. The process then moves to a next pixel before the operation returns to S201.
The gradient direction/null direction identifying section 5, in S207, determines whether or not the vertical-direction gradient quantity Sy is positive. If Sy>0, the operation continues at S208 (Yes) where the pixel (first edge pixel/second edge pixel) is set to the upward gradient direction before the operation returns to S201. In contrast, if Sy<0, the operation continues at S209 (No) where the pixel (first edge pixel/second edge pixel) is set to the downward gradient direction. The process then moves to a next pixel before the operation returns to S201. These steps are repeated until every pixel is either assigned a gradient direction or labelled as having null direction.
The important information in pattern matching is the gradient direction for the edge pixels (first edge pixels/second edge pixels) in the edge part.
Therefore, by regarding the gradient direction (pixel that is neither the first edge pixels nor the second edge pixels) for pixels of relatively low importance as equally having null direction in the operation, the pattern matching efficiency is further improved. The scheme also enables the detection of the position in the captured image pointed at with the image capture object with small memory and short processing time, further reducing the cost for the detection of the pointing position.
Next, referring to
The matching efficiency improving section 6 shown in
The score calculation section 10 matches the matching region with the model pattern with the efficiency as improved by the matching efficiency improving section 6 to calculate the number of matches of the gradient direction contained in each divisional region in the matching region with the gradient direction contained in the model pattern as the correspondence degree.
The gradient direction has the general tendency described above. The tendency does not change much with the condition of the image capture object, for example. Therefore, if the number of pixels in each divisional region is not set to a very large value, the positions of the pixels for the gradient direction in the divisional regions are not very important information in the pattern matching using the gradient direction.
Accordingly, by replacing, for each divisional region, the gradient direction/null direction information for each pixel contained in that divisional region with the gradient direction/null direction information contained in the divisional region, the matching efficiency improvement is accomplished, while maintaining precision in the pattern matching. In addition, the efficiency improvement results in reduction in the cost of the detection of the position in the captured image pointed at with the image capture object.
Hence, the image processing device 1, as an example, is provided which improves the matching efficiency and reduces the cost in the detecting of the position in the captured image pointed at with the image capture object, while maintaining precision in the pattern matching.
Referring to
As shown in
b) depicts the same image data as shown in
As shown in
For example, in one of the 2×2-pixel regions obtained by dividing the 14×14-pixel region shown in
Next, referring to
a) depicts an exemplary model pattern prior to matching efficiency improvement in a dark environment. The model pattern in
The model pattern in
The pixels are arranged in an odd number of rows by an odd number of columns (13×13) so that there is one central pixel. The central pixel is placed over a target pixel in the image data and shifted by one pixel at a time to implement the pattern matching.
In this case, since the matching is carried out for each pixel, the matching needs to be carried out on 13×13=169 pixels (the matching pixel count needs to be calculated 169 time).
Meanwhile,
Next,
b) depicts an exemplary model pattern subsequent to matching efficiency improvement in a bright environment.
a) depicts another exemplary model pattern subsequent to matching efficiency improvement in a dark environment. This model pattern is similar to the model pattern in
b) depicts another exemplary model pattern subsequent to matching efficiency improvement in a bright environment.
Now, referring to
Referring to
Next, in terms of the relationship with the matching efficiency improving section 6, the variations can be divided into those implemented on image data prior to matching efficiency improvement and those implemented on image data subsequent to matching efficiency improvement.
In terms of the relationship with the score calculation section 10, the variations can be divided into those calculating the score (correspondence degree) from the matching pixel count calculated by the matching pixel count calculation section 7 and those calculating the score (correspondence degree) from the pattern correspondence degree calculated by the pattern correspondence degree calculation section 9.
As described in the foregoing, the pattern matching has many variations. Any of the variations may be carried out either singly or in combination to calculate the score.
In S301, the matching pixel count calculation section 7 matches the matching region with the model pattern to calculate the number of pixels (matching pixel count) for which the gradient direction contained in the matching region matches the gradient direction contained in the model pattern. The operation then proceeds to S302.
In S302, the matching efficiency improving section 6 (the gradient direction/null direction identifying section 5 if no matching efficiency improving section 6 is included) determines whether to calculate also a pattern correspondence degree for the gradient direction. If it is determined to calculate the pattern correspondence degree, the pattern correspondence degree calculation section 9 is notified before proceeding to S303 (Yes). On the other hand, If it is determined not to calculate the pattern correspondence degree, the score calculation section 10 is notified before proceeding to S304.
The description here assumes that the matching pixel count is always calculated. This is, however, not intended to be limiting the invention. A configuration may be employed where the pattern correspondence degree is only calculated.
The pattern correspondence degree is a quantity indicative of a similarity of the matching pattern between the gradient direction for each pixel in the matching region and the gradient direction for each pixel in the model pattern to the predetermined comparative matching pattern stored in the model pattern and comparative matching pattern storage section 8.
In S303, the pattern correspondence degree calculation section 9 is notified by either the gradient direction/null direction identifying section 5 or the matching efficiency improving section 6 of the determination to calculate the pattern correspondence degree and calculates the pattern correspondence degree, before the operation proceeds to S304.
In S304, the pattern correspondence degree calculation section 9, if not having calculated the pattern correspondence degree, calculates the matching pixel count calculated by the matching pixel count calculation section 7 as the correspondence degree which is a degree of matching of the matching region with the model pattern. On the other hand, the pattern correspondence degree calculation section 9, if having calculated the pattern correspondence degree, calculates a combined quantity of the matching pixel count calculated by the matching pixel count calculation section 7 and the pattern correspondence degree calculated by the pattern correspondence degree calculation section 9 as the correspondence degree which is a degree of matching of the matching region with the model pattern.
The gradient directions generally match a direction either from an edge part in the captured image to near the center of an area surrounded by the edge part or radially from near the center toward the edge part, for example, for the finger surface or like soft surface which forms a round contact face upon contact with another surface and for the round-tipped pen or like surface which forms a round contact face despite its hardness. For contact faces of other shapes, the gradient directions again generally match a direction either from an edge part in the captured image to the inside of an area surrounded by the edge part or from the inside of an area surrounded by an edge part toward the outside of the area.
When the image capture object does not touch on the captured image, for example, when the image capture object is a finger pad, edges may in some cases result from a large blurry shadow of those fingers which are not in contact. In addition, for example, when the input device (photo sensor) or the sensing circuit has a defect, the defect may cause a band or line of noise with accompanying edges.
If these pattern-matching-disrupting edges (hereinafter, “unnecessary edges”) have occurred, the matching pixel count may be increased locally (only in one or two directions) even when the number of pixels in the model pattern is increased. Therefore, when such an unnecessary edge is present, the matching pixel count alone would be insufficient to achieve correct recognition and suitable pattern matching.
Accordingly, for example, when the finger or the pen has come in contact, if the matching pixel count and the correspondence pattern (for example, the number of types of gradient directions) are used together based on an assumption that at least 6 or more types of gradient directions, if not all the 8 directions (which would be ideal), should appear, the cases where the correspondence degree is increased due to the local increases in the matching pixel count (only in one or two directions) can be excluded.
Therefore, robustness to noise and deformation in image input would be improved by using both the matching pixel count and the pattern correspondence degree in the pattern matching.
Hence, the image processing device 1, as an example, is provided which, irrespective of detection of a touch/non-touch of the captured image with the image capture object, can detect the position in the captured image pointed at with the image capture object with small memory and short processing time by performing the pattern matching using image data for only one frame and which can also improve the robustness to noise in image input and deformation of the captured image in the pattern matching.
Next, referring to
a) depicts pattern matching between a matching region and a model pattern in a dark environment prior to matching efficiency improvement.
a) indicates results of pattern matching between the matching region in
The matching pattern in
Next, the calculation of the matching pixel count in
Next, the normalized matching pixel count shown in
Here, the normalized matching pixel count is defined by equation (6) below:
Normalized Matching Pixel Count=Appropriate Constant×(Matching Pixel Count/Number of Elements Having Directional Component in Model) (6)
The “appropriate constant” is determined in a suitable manner in consideration of convenience in calculation and other factors. The constant is set here to 10 so that the normalized matching pixel count falls in a range of 0 to 10. The normalized matching pixel count is used also in the following example of pattern matching, of which description is omitted.
The normalized matching pixel count for the case of
Normalized Matching Pixel Count=10×(85/136)=6.25≈6
Next,
a) indicates results of pattern matching between a matching region in
The matching pattern in
Next, the calculation of the matching pixel count in
In another example, at row 2, column 1, in the matching region, there are “two” pixels for which the gradient direction points at the “lower right” and “one” pixel for which the gradient direction points at the “right.” In contrast, there is one gradient direction in the model pattern which points at the “lower right.” There are “two” matching “lower right” gradient directions and no matching “right” gradient directions. Therefore, the matching pixel count in this case is calculated to be “2.” The pixels determined to have null direction here is excluded throughout the calculation.
Performing this calculation on all the pixels yields a result indicating that the matching pixel count for the meshed parts is “91.” The matching pixel count may be used as the score (correspondence degree) with or without the following normalization of the matching pixel count.
Here, the normalized matching pixel count defined by equation (7) below:
Normalized Matching Pixel Count=Appropriate Constant×(Matching Pixel Count/4 Times Number of Elements Having Directional Component in Model) (7)
where the constant is set here to 10.
The normalized matching pixel count for the case of
Normalized Matching Pixel Count=10×(91/176)=5.17≈5
Next,
a) indicates results of pattern matching between the matching region in
The matching pattern in
Next, the calculation of the matching pixel count in
In another example, at row 2, column 1, in the matching region, there are “two” pixels for which the gradient direction points at the “lower right” and “one” pixel for which the gradient direction points at the “right.” In contrast, there are two gradient directions in the model pattern: one pointing at the “right” and the other pointing at the “lower right.” There are “one” “right” matching gradient direction and “two” “lower right” matching gradient directions. Therefore, the matching pixel count in this case is calculated to be “3.” The pixels determined to have null direction here is excluded throughout the calculation.
Some numerals are underscored while the others are not. The underscored numerals indicate that the matching pixel count is increased over the case of
This results demonstrate that the use of the model pattern in
Performing this calculation on all the pixels yields a result indicating that the matching pixel count for the meshed parts is “119.” The matching pixel count may be used as the score (correspondence degree) with or without the following normalization of the matching pixel count.
The normalized matching pixel count for the case of
Normalized Matching Pixel Count=10×(119/176)=6.76≈7
Next, referring to
In
In S403, the matching pixel count calculation section 7 and the pattern correspondence degree calculation section 9 carry out gradient direction matching, etc. for each pixel (including those pixels having been subjected to matching efficiency improvement). The operation then proceeds to S404.
A configuration may be employed which is used together with a case where the edge extraction section 4 determines valid pixels using an edge mask immediately before S403. In that case, a single device enables pattern matching both in backlight reflection base and in shadow base.
In S404, if directions match (Yes), the operation continues at S405 where the matching pixel count calculation section 7 adds the number of elements with matching directions (“1” when no efficiency improvement is carried out) to the matching pixel count. The operation then proceeds to S406. On the other hand, if there are no pixels at all for which the directions match (No), the operation returns to S401.
In S406, the pattern correspondence degree calculation section 9 updates the matching gradient direction to “Available” before the operation proceeds to S407.
In S407, if the matching pixel count calculation section 7 and the pattern correspondence degree calculation section 9 have completed the matching for all the elements (pixels) in the model pattern (Yes), the operation proceeds to S408; if the sections 7 and 9 have not completed the matching (No), the operation returns to S403.
In S408, the pattern correspondence degree calculation section 9 checks the matching pattern. The operation then proceeds to S409. The checking of the matching pattern will be described later in detail.
In S409, the pattern correspondence degree calculation section 9 determines whether it is a “allowed pattern” in reference to the model pattern and comparative matching pattern storage section 8. If it is an allowed pattern (Yes), the operation proceeds to S410. On the other hand, if it is not an allowed pattern (No), the operation returns to S404. In this case, the pattern correspondence degree calculation section 9 may set the pattern correspondence degree to “1” if it is an “allowed pattern” and to “0” if it is not an “allowed pattern” so that the score calculation section 10 can multiply the matching pixel count calculated by the matching pixel count calculation section 7 by these values.
In S410, the score calculation section 10 calculates the normalized matching pixel count from the matching pixel count calculated by the matching pixel count calculation section 7 as the score (correspondence degree) for the pattern matching.
Next, referring to
a) depicts an exemplary pattern correspondence degree calculation process.
The description here assumes 8 gradient directions and a threshold (DN) of 5 for the number of types of gradient directions.
As shown in
On the other hand, if the number of “Available” (number of types of gradient directions) in the matching pattern is less than 5, the operation proceeds to S503 where the pattern correspondence degree calculation section 9 disallows matching pattern.
The flow from S601 to S603 in
Next, referring to
a) depicts an exemplary pattern correspondence degree calculation process.
In
In
In
From the results above, in the case of
Next, referring to
a) depicts still another exemplary pattern correspondence degree calculation process.
In
In
In
From the results above, in the case of
As described in the foregoing, the score calculation section 10 matches the matching region with the model pattern and calculates the score (correspondence degree) from the number of pixels (matching pixel count) for which the gradient direction contained in the matching region matches the gradient direction contained in the model pattern and a pattern correspondence degree which is a degree of similarity of the matching pattern between the gradient direction for each pixel in the matching region and the gradient direction for each pixel in the model pattern to the predetermined comparative matching pattern.
A scalar quantity, such as a pixel value (density level), could possibly be used as the quantity used in the matching of a matching region with a predetermined model pattern (hereinafter, may be referred to as the “pattern matching”). It is however difficult to set up model patterns in advance because the scalar quantity, even when quantized (values within a predetermined range are treated by equally regarding them as a particular constant), is ever variable depending on, for example, the condition of the image capture object.
Meanwhile, the gradient of the pixel value is a vector quantity with both a magnitude (gradient magnitude) and a direction (gradient direction). Especially, the gradient direction (orientation), for example, when quantized into 8 directions, enables discretization of any potential states for the pixels with as few as 8 states (or 9 if null direction is included), which is an extremely small number. Furthermore, the discretized states render different directions readily distinguishable.
The gradient directions generally match a direction either from an edge part in the captured image to near the center of an area surrounded by the edge part or radially from near the center toward the edge part, for example, for the finger surface or like soft surface which forms a round contact face upon contact with another surface and for the round-tipped pen or like surface which forms a round contact face despite its hardness. For contact faces of other shapes, the gradient directions again generally match a direction either from an edge part in the captured image to the inside of an area surrounded by the edge part or from the inside of an area surrounded by an edge part toward the outside of the area.
When the image capture object does not touch on the captured image, for example, when the image capture object is a finger pad, edges may in some cases result from a large blurry shadow of those fingers which are not in contact. In addition, for example, when the input device (photo sensor) or the sensing circuit has a defect, the defect may cause a band or line of noise with accompanying edges.
If these pattern-matching-disrupting edges (hereinafter, “unnecessary edges”) have occurred, the matching pixel count may be increased locally (only in one or two directions) even when the number of pixels in the model pattern is increased. Therefore, when such an unnecessary edge is present, the matching pixel count alone would be insufficient to achieve correct recognition and suitable pattern matching.
Accordingly, for example, when the finger or the pen has come in contact, if the matching pixel count and the correspondence pattern (for example, the number of types of gradient directions) are used together based on an assumption that at least 6 or more gradient directions, if not all the 8 directions (which would be ideal), should appear, the cases where the correspondence degree is increased due to the local increases in the matching pixel count (only in one or two directions) can be excluded.
Therefore, robustness to noise and deformation in image input would be improved by using both the matching pixel count and the pattern correspondence degree in the pattern matching.
In such a situation, considering image capture environment, it is preferable to set up a threshold in backlight reflection base on an assumption that the number of types of gradient directions is greater than or equal to 6 and to set up a threshold in shadow base on an assumption that the number of types of gradient directions is greater than or equal to 4. This is because, as described in the following in reference to
Hence, the image processing device 1, as an example, is provided which, irrespective of detection of a touch/non-touch of the captured image with the image capture object, can detect the position in the captured image pointed at with the image capture object with small memory and short processing time by performing the pattern matching using image data for only one frame and which can also improve the robustness to noise in image input and deformation of the captured image in the pattern matching.
Therefore, the robustness to noise in image input and deformation of the captured image is improved in the pattern matching.
If the matching pixel count and the number of successive matches are used together based on an assumption that at least 6 or more successive matches should appear similarly to the number of types of corresponding directions, the cases where the correspondence degree is increased due to the local increases in the matching pixel count (only in one or two directions) can be excluded.
Therefore, the robustness to noise in image input and deformation of the captured image is improved in the pattern matching. In addition, the use of the number of successive matches in place of the number of types of gradient directions in the calculation of the pattern correspondence degree enables more rigorous pattern matching and more reliable exclusion of wrong recognition.
As mentioned earlier, the comparison matching pattern is preferably the number of types of corresponding directions for the gradient direction for each pixel in the matching region and the gradient direction for each pixel in the model pattern.
If the matching pixel count and the number of types of gradient directions are used together based on an assumption that at least 6 or more gradient directions should appear as in the aforementioned example, wrong recognition can be excluded by excluding the cases where the correspondence degree is increased due to the local increases in the matching pixel count (only in one or two directions).
Therefore, the robustness to noise in image input and deformation of the captured image is improved in the pattern matching.
In addition, the comparison matching pattern is preferably the number of successive matches (number of successive matches of types of corresponding directions for the gradient direction for each pixel in the matching region and the gradient direction for each pixel in the model pattern).
If the matching pixel count and the number of successive matches are used together based on an assumption that at least 6 or more successive matches should appear similarly to the number of types of corresponding directions, the cases where the correspondence degree is increased due to the local increases in the matching pixel count (only in one or two directions) can be excluded.
Therefore, the robustness to noise in image input and deformation of the captured image is improved in the pattern matching. In addition, the use of the number of successive matches in place of the number of types of gradient directions in the calculation of the pattern correspondence degree enables more rigorous pattern matching and more reliable exclusion of wrong recognition.
Next, referring to
In S701, the peak search section 12 searches a first area (search area) containing a predetermined number of pixels around the target pixel for a peak pixel which is a pixel for which the correspondence degree calculated by the score calculation section 10 is a maximum. Upon the section 12 finding such a peak pixel, the operation proceeds to S702. If the peak search section 12 cannot find the peak pixel (not shown), the target pixel is shifted by a predetermined number (for example, the shortest path from the target pixel in the first area to a pixel on an edge (length of a side of a second area)). The operation then returns to S701.
If the coordinate calculation determining section 13 has in S702 determined that the peak pixel found by the peak search section 12 is present in the second area (sub-area) which contains the same target pixel as does the first area, which contains a predetermined number of pixels that is less than the number of pixels in the first area, and which is also completely enclosed in the first area, the operation then continues at S703 where the coordinate calculation determining section 13 determines “it has found the peak pixel.” The operation then proceeds to S704. On the other hand, if the coordinate calculation determining section 13 has determined that the peak pixel found by the peak search section 12 is not present in the second area (sub-area), the operation continues at S705 where the coordinate calculation determining section 13 determines “it has found no peak pixel.” the target pixel is shifted by a predetermined number (for example, the shortest path from the target pixel in the first area to a pixel on an edge (length of a side of a second area)). The operation then returns to S701.
This operation is repeated until the coordinate calculation section 14 calculates the pointing (interpolation) position.
In S704, the coordinate calculation section 14 calculates the position in the captured image pointed at with the image capture object by using the score for each pixel in a peak pixel region which is a region containing a predetermined number of pixels centered around the peak pixel found by the peak search section 12, which brings the operation to the “END.”
The above description assumes that the operation is repeated until the coordinate calculation section 14 calculates the pointing (interpolation) position. Alternatively, two or more pointing (interpolation) positions may be calculated, in which case, the first and second areas are moved until the operation as shown in the flow chart in
Next, referring to
a) depicts the operation in the case of the coordinate calculation determining section 13 in the image processing device 1 determining that there is no peak pixel.
The solid line in
In the example in
On the other hand, in the example in
In the example above, the difference in the number of pixels between the first area and the second area is set up so that the peak pixel can always move into the second area, by moving the first area and the second area by “5 pixels” which is the shortest path from the target pixel in the first area to a pixel on an edge (length of a side of a second area), if the first area contains a peak pixel whilst the second area contains no peak pixel.
Next, referring to
a) depicts a peak pixel region used for the calculation of a position in a captured image pointed at with an image capture object in the image processing device 1.
a) shows a case where the coordinate calculation determining section 13 has determined “there is a peak coordinate” as in the case of
a) shows both the first and the second area as areas bounded by broken lines. Meanwhile, the 5×5-pixel region bounded by solid lines is the peak pixel region which is a region containing a predetermined number of pixels centered around a peak pixel.
In example in
Next, referring to
This example assumes that when the image data contains 320×240 pixels, the resolution reduction section 2 shown in
Therefore, the entire area of the score image scaled up by 8 corresponds to the entire area of the image data. Therefore, interpolation quantity (scale-up ratio)=8.
The following will describe a specific calculation method. First, the sum of scores is calculated for each row in the peak pixel region (19, 28, 33, 24, and 11 in
Next, assuming that the score in the peak pixel region corresponds to a mass distribution, the coordinates of the center of mass in the entire area of the score image is obtained. That is followed by scaling up by 8, to yield the coordinates as in equations (8) and (9) below:
Next, by calibrating the positions of the scale marks with pixel size taken into consideration, the pointing coordinates (X, Y) are given by equation (10) below:
Math. 4
(X, Y)=(X′+8×0.5,Y′+8×0.5)=(61,43) (10)
From the description above, the peak search section 12 searches the first area (search area). Hence, the processing cost and the memory size are reduced over searching the image data region containing the total pixel count for a peak pixel.
For example, a small number of pixels being contained in the first area means that the scores for the entire data image (score image) (=all pixels) do not need to be stored in a buffer and also that the memory size does not need to be greater than required by the first area where a peak search is executed (for example, a line buffer for 9 lines for a 9×9-pixel first area).
This memory size reduction effect by way of implementation with a line buffer is achievable not only with a peak search, but also with temporarily storage for the vertical and horizontal gradient quantities, temporarily storage for gradient directions, and any like implementation where buffer memory is used to given data over to a later process.
The coordinate calculation section 14 calculates the pointing position by using the score for each pixel in the peak pixel region which is a region containing a predetermined number of pixels centered around the peak pixel found by the peak search section 12. For example, when the pointing position is to be obtained from its center of mass position by using its edge image, the calculation would become increasingly difficult with deformation of the captured image.
However, in the image processing device 1, the pointing position is calculated by using the score for each pixel in the peak pixel region obtained by pattern matching. Even if the captured image is deformed, the neighborhood of a maximum of the score in the pattern matching would be regarded as exhibiting a substantially similar tendency in distribution to the tendency before the deformation where the correspondence degree decreases radially from the neighborhood of the maximum.
Therefore, the pointing position can be calculated by predetermined procedures (for example, calculation of a center of mass for the score in the peak pixel region) regardless of whether or not the captured image is deformed. Hence, the amount of image processing, the processing cost, and the memory size are all reduced in the calculation of the pointing position while maintaining precision in the coordinate position detection.
Hence, the image processing device 1, as an example, is provided which, irrespective of detection of a touch/non-touch of the captured image with the image capture object, can detect the pointing position with small memory and short processing time and can also reduce the amount of image processing, while maintaining precision in the detection of the pointing position, and the memory size in the calculation of the pointing position, by performing the pattern matching using image data for only one frame.
The coordinate calculation section 14 preferably calculates the pointing position if the coordinate calculation determining section 13 has determined that the peak pixel found by the peak search section 12 is present in the second area (sub-area) which contains the same target pixel as does the first area, which contains a predetermined number of pixels that is less than the number of pixels in the first area, and which is also completely enclosed in the first area.
The peak pixel region is a region around a peak pixel (as a target pixel) that is present in the second area. The peak pixel region therefore contains many common pixels to the first area. In addition, since the score has already been calculated for the common pixels for the peak pixel region and the first area, the coordinate calculation section 14 can calculate the pointing position if the score is examined for the non-common pixels.
The peak pixel region can be included in the first area if the number of pixels is regulated in both the peak pixel region and the first area. In that case, since the score for each pixel in the peak pixel region is already known, the yet-to-be-known score for each pixel does not need to be examined for the calculation of the pointing position.
Hence, the amount of image processing and the memory size are further reduced in the calculation of the pointing position. In addition, the buffer size can be reduced (for example, only 9 lines, not the entire image) for the storage of the scores referenced in, for example, dealing with the case where a streak of rising scores exists toward the outside of the first area in peak coordinate determination and pipelining for each processing module in hardware implementation, etc.
Next, an embodiment will be described in which the image capture object is determined to have touched the liquid crystal display device in the image processing device 1.
First, the score calculation section 10 preferably determines that the image capture object has touched the liquid crystal display device if a maximum of the score which the section 10 calculates exceeds a predetermined threshold.
The score calculation section 10 is assumed here to have such a function. Alternatively, a separate determining section with the same function may be provided.
In the configuration above, the image capture object is determined to have touched the liquid crystal display device if a maximum of the score exceeds a predetermined threshold. The configuration thus restrains wrong detection which could occur if the image capture object is regarded as having touched the liquid crystal display device whenever the score is calculated.
In addition, the score calculation section 10 preferably determines that the image capture object has touched the liquid crystal display device if the correspondence degree which the section 10 calculates exceeds a predetermined threshold.
The score calculation section 10 determines that the image capture object is in contact with the liquid crystal display device if the section 10 has calculated a score in excess of a predetermined threshold (sufficient correspondence degree), in other words, if image information from which similar features to a model pattern are obtained is input.
Therefore, the configuration can make a decision as to touch/non-touch in the image processing in which the pointing position is identified, without a dedicated device or a processing section being provided to determine touch/non-touch.
The edge extraction section 4 preferably determines that the image capture object has touched the liquid crystal display device if the section 4 has identified either the first edge pixels or the second edge pixels. In the present embodiment, the edge extraction section 4 is assumed to have the function. Alternatively, a separate touch/non-touch determining section with the same function may be provided.
As mentioned earlier, the light entering the built-in image capture sensors in the liquid crystal display device may be a mixture of reflection of the backlight and external light coming from the outside.
When this is the case, it is difficult to separate effects of the reflection of the backlight and effects of the external light coming from the outside from the captured image.
In backlight reflection base, the image obtained from the reflection of the backlight off the image capture object shows a blurred white round figure, for example, for a finger pad. Accordingly, in this case, the first threshold may be set to a relatively low value so that the touch/non-touch determining means can determine that the image capture object has touched the liquid crystal display device if the edge pixel identification means has identified the first edge pixels.
On the other hand, in shadow base, the captured image is blurred (low contrast) if the image capture object (for example, the finger pad) is positioned off the panel surface (non-touch) and sharp (high contrast) if the image capture object is in contact with the panel surface. Therefore, in shadow base, the second threshold may be set to a relatively high value so that the touch/non-touch determining means can determine that the image capture object has touched the liquid crystal display device if the edge pixel identification means has identified the second edge pixels in accordance with the second threshold that is more stringent (greater) than the first threshold.
Hence, the touch/non-touch detection becomes possible in backlight reflection base and in shadow base by simply setting up the relatively low first threshold and the relatively stringent second threshold. In addition, the determination as to a touch/non-touch can be made in the image processing in which the pointing position is identified, without a dedicated device or a processing section being provided to determine as to a touch/non-touch.
The present invention is not limited to the examples above of the image processing device (electronic apparatus), but may be altered by a skilled person within the scope of the claims. An embodiment based on a proper combination of technical means disclosed in different embodiments is encompassed in the technical scope of the present invention.
Finally, the blocks of the image processing device 1, especially, the resolution reduction section 2, the pixel-value vertical-gradient-quantity calculation section 3a, the pixel-value horizontal-gradient-quantity calculation section 3b, the edge extraction section 4, the gradient direction/null direction identifying section 5, the matching efficiency improving section 6, the matching pixel count calculation section 7, the pattern correspondence degree calculation section 9, the score calculation section 10, and the position identifying section 11, may be implemented by hardware or software executed by a CPU as follows:
The image processing device 1 includes a CPU (central processing unit) and memory devices (storage media). The CPU executes instructions contained in control programs, realizing various functions. The memory devices may be a ROM (read-only memory) containing computer programs, a RAM (random access memory) to which the programs are loaded, or a memory containing the programs and various data. The objective of the present invention can be achieved also by mounting to the image processing device 1 a computer-readable storage medium containing control program code (executable programs, intermediate code programs, or source programs) for the image processing device 1, which is software implementing the aforementioned functions, in order for a computer (or CPU, MPU) to retrieve and execute the program code contained in the storage medium.
The storage medium may be, for example, a tape, such as a magnetic tape or a cassette tape; a magnetic disk, such as a floppy® disk or a hard disk, or an optical disc, such as a CD-ROM/MO/MD/DVD/CD-R; a card, such as an IC card (memory card) or an optical card; or a semiconductor memory, such as a mask ROM/EPROM/EEPROM/flash ROM.
The image processing device 1 may be arranged to be connectable to a communications network so that the program code may be delivered over the communications network. The communications network is not limited in any particular manner, and may be, for example, the Internet, an intranet, extranet, LAN, ISDN, VAN, CATV communications network, virtual dedicated network (virtual private network), telephone line network, mobile communications network, or satellite communications network. The transfer medium which makes up the communications network is not limited in any particular manner, and may be, for example, a wired line, such as IEEE 1394, USB, an electric power line, a cable TV line, a telephone line, or an ADSL; or wireless, such as infrared (IrDA, remote control), Bluetooth®, 802.11 wireless, HDR, a mobile telephone network, a satellite line, or a terrestrial digital network. The present invention encompasses a carrier wave, or data signal transmission, in which the program code is embodied electronically.
The image processing device in accordance with the present invention, being provided with the foregoing features, preferably further includes coordinate calculation determining means for causing the coordinate calculation means to calculate the pointing position if the coordinate calculation determining means has determined that the peak pixel found by the peak pixel search means is present in a sub-area which contains the same target pixel as does the search area, which contains a predetermined number of pixels that is less than a number of pixels in the search area, and which is also completely enclosed in the search area.
According to the configuration, the coordinate calculation determining means causes the coordinate calculation means to calculate the pointing position if the coordinate calculation determining means has determined that the peak pixel found by the peak pixel search means is present in a sub-area which contains a predetermined number of pixels that is less than a number of pixels in the search area and which is also completely enclosed in the search area.
The peak pixel region is a region around a peak pixel (as a target pixel) that is present in the sub-area. The peak pixel region therefore contains many common pixels to the search area. In addition, since the correspondence degree has already been calculated for the common pixels for the peak pixel region and the search area, the coordinate calculation means can calculate the pointing position if the correspondence degree is examined for the non-common pixels.
The peak pixel region can be included in the search area if the number of pixels is regulated in both the peak pixel region and the search area. In that case, since the correspondence degree for each pixel in the peak pixel region is already known, the yet-to-be-known correspondence degree for each pixel does not need to be examined for the calculation of the coordinates.
Hence, the amount of image processing and the memory size are further reduced in the calculation of the coordinate position.
The image processing device in accordance with the present invention, being provided with the foregoing features, preferably further includes edge pixel identification means for identifying first edge pixels for which both the vertical-direction gradient quantity and the horizontal-direction gradient quantity or the gradient magnitude is greater than or equal to a first threshold, wherein the gradient direction identifying means identifies a gradient direction for the first edge pixels identified by the edge pixel identification means and regards and identifies pixels that are not the first edge pixels as having null direction.
The first edge pixel is a pixel forming a part (edge) of the image data at which brightness changes abruptly. More specifically, the first edge pixel is a pixel for which both the vertical-direction gradient quantity and the horizontal-direction gradient quantity or the gradient magnitude is greater than or equal to a predetermined first threshold.
The purpose of extracting the first edge pixels is to enable the gradient direction identifying means to identify a gradient direction for the extracted first edge pixels and to regard and identify all the pixels that are not the first edge pixels as equally having null direction.
The important information in pattern matching is the gradient direction for the first edge pixels in the edge part.
Therefore, by regarding the gradient direction for pixels of relatively low importance as equally having null direction, the pattern matching efficiency is further improved. The scheme also reduces memory size and processing time in detecting the position in the captured image pointed at with the image capture object, further reducing the cost for the detection of the pointing position.
The image processing device in accordance with the present invention, being provided with the foregoing features, preferably further includes a display device containing pixels a predetermined number of which each include a built-in image capture sensor, wherein the image data is obtained by image capturing by the image capture sensors.
According to the configuration, the image processing device enables a touch input on the display screen of the display device.
The image processing device in accordance with the present invention, being provided with the foregoing features, is preferably such that:
the display device is a liquid crystal display device and includes a backlight illuminating the liquid crystal display device;
the edge pixel identification means identifies second edge pixels for which both the vertical-direction gradient quantity and the horizontal-direction gradient quantity or the gradient magnitude is greater than or equal to a second threshold which is greater than the first threshold;
the gradient direction identifying means identifies a gradient direction for the second edge pixels identified by the edge pixel identification means and regards and identifies pixels that are not the second edge pixels as having null direction; and
the correspondence degree calculation means calculates the correspondence degree from a first number of pixels for which gradient directions of the first edge pixels contained in the matching region match gradient directions contained in a predetermined first model pattern and a second number of pixels for which gradient directions of the second edge pixels contained in the matching region match gradient directions contained in a predetermined second model pattern.
The light entering the built-in image capture sensors in the liquid crystal display device may be a mixture of reflection of the backlight and external light coming from the outside.
When this is the case, it is difficult to separate effects of the reflection of the backlight and effects of the external light coming from the outside from the captured image.
When the image processing device is in dark environment (hereinafter, may be referred to as “in backlight reflection base”), the image obtained from the reflection of the backlight off the image capture object shows a blurred white round figure, for example, for a finger pad. Accordingly, in this case, the first threshold is set to a relatively low value so that the edge pixel identification means can identify the first edge pixels.
On the other hand, when the image processing device is in a bright environment (hereinafter, may be referred to as “in shadow base”), the captured image is blurred (low contrast) if the image capture object (for example, the finger pad) is positioned off the panel surface (non-touch) and sharp (high contrast) if the image capture object is in contact with the panel surface. Therefore, in shadow base, the second threshold is set to a relatively high value so that the edge pixel identification means can identify the second edge pixels in accordance with the second threshold that is more stringent (greater) than the first threshold.
Pattern matching is thus carried out between the image data in which the first edge pixels are identified and the first model pattern predetermined in backlight reflection base and also between the image data in which the second edge pixels are identified and the second model pattern predetermined in shadow base, to obtain the first number of pixels and the second number of pixels. In this case, the correspondence degree calculation means can use, for example, the sum of the first number of pixels and the second number of pixels as the correspondence degree.
Therefore, this single configuration can carry out processes compatible with both backlight reflection base and shadow base without switching the processes between backlight reflection base and shadow base. The invention hence provides an image processing device capable of identifying the position pointed at with the image capture object both under good and poor illumination.
The image processing device in accordance with the present invention, being provided with the foregoing features, preferably further includes touch/non-touch determining means for determining that the image capture object has touched the display device if the correspondence degree calculated by the correspondence degree calculation means has a maximum in excess of a predetermined threshold.
According to the configuration, the image capture object is determined to have touched the display device if a maximum of the correspondence degree exceeds a predetermined threshold. The configuration thus restrains wrong detection which could occur if the image capture object is regarded as having touched the display device whenever the correspondence degree is calculated.
The image processing device in accordance with the present invention, being provided with the foregoing features, preferably further includes touch/non-touch determining means for determining that the image capture object has touched the display device if the correspondence degree calculation means has calculated an correspondence degree in excess of a predetermined threshold.
The touch/non-touch determining means determines that the image capture object is in contact with the display device if the correspondence degree calculation means has calculated an correspondence degree in excess of a predetermined threshold (sufficient correspondence degree), in other words, if image information from which similar features to a model pattern are obtained is input.
Therefore, the configuration can make a decision as to touch/non-touch in the image processing in which the pointing position is identified, without a dedicated device or a processing section being provided to determine touch/ non-touch.
The image processing device in accordance with the present invention, being provided with the foregoing features, preferably further includes touch/non-touch determining means for determining that the image capture object has touched the display device if the edge pixel identification means has identified either the first edge pixels or the second edge pixels.
As mentioned earlier, the light entering the built-in image capture sensors in the liquid crystal display device may be a mixture of reflection of the backlight and external light coming from the outside.
When this is the case, it is difficult to separate effects of the reflection of the backlight and effects of the external light coming from the outside from the captured image.
In backlight reflection base, the image obtained from the reflection of the backlight off the image capture object shows a blurred white round figure, for example, for a finger pad. Accordingly, in this case, the first threshold may be set to a relatively low value so that the touch/non-touch determining means can determine that the image capture object has touched the display device if the edge pixel identification means has identified the first edge pixels.
On the other hand, in shadow base, the captured image is blurred (low contrast) if the image capture object (for example, the finger pad) is positioned off the panel surface (non-touch) and sharp (high contrast) if the image capture object is in contact with the panel surface. Therefore, in shadow base, the second threshold may be set to a relatively high value so that the touch/non-touch determining means can determine that the image capture object has touched the display device if the edge pixel identification means has identified the second edge pixels in accordance with the second threshold that is more stringent (greater) than the first threshold.
Hence, the touch/non-touch detection becomes possible in backlight reflection base and in shadow base by simply setting up the relatively low first threshold and the relatively stringent second threshold. In addition, the determination as to a touch/non-touch can be made in the image processing in which the pointing position is identified, without a dedicated device or a processing section being provided to determine as to a touch/non-touch.
The image processing device in accordance with the present invention, being provided with the foregoing features, is preferably such that the correspondence degree calculation means calculates the correspondence degree if a number of types of corresponding gradient directions in the matching region is greater than or equal to a preset value.
The gradient direction has the general tendency described above. The tendency does not change much with the condition of the image capture object, for example. Therefore, for example, if the number of types of gradient directions is 8, the number of types of matching gradient directions in pattern matching should be close to 8. Hence, if the correspondence degree is calculated when the number of types of corresponding gradient directions in the matching region is greater than or equal to a preset value, the detection of the pointing position requires smaller memory and less processing time. That in turn further reduces the cost for the detection of the pointing position.
The electronic apparatus in accordance with the present invention, being provided with the foregoing features, preferably includes the image processing device.
According to the configuration, the image processing device in accordance with the present invention becomes applicable to general electronic apparatus.
The image processing device may be computer-implemented. When that is the case, the present invention encompasses a control program executed on a computer to realize the image processing device by manipulating the computer as the individual means. The invention also encompasses a computer-readable storage medium containing the program.
The image processing device in accordance with the present invention is applicable to such devices (e.g., mobile phones and PDAs) that a user can manipulate or enter a command by touching a display on the liquid crystal or like display device. Specifically, the display device may be, for example, an active matrix liquid crystal display device, an electrophoretic display device, a twist-ball display device, a reflective display device using a fine prism film, and a display device using a digital mirror device or like optical modulation element, as well as a field emission display device (FED) and a plasma display device which contain luminance-variable, light-emitting elements, such as organic EL light-emitting elements, inorganic EL light-emitting elements, or LEDs (light-emitting diodes).
Number | Date | Country | Kind |
---|---|---|---|
2007-094993 | Mar 2007 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2008/056220 | 3/28/2008 | WO | 00 | 9/29/2009 |