The present invention relates to ranging camera apparatuses for recognizing an object in an imaged area.
Z=(B×f)/d (1)
where B is a base-line distance between the centers of optical axes (COA) of two cameras for taking images of the SOI from two different view points, f is a focal distance between a lens and an imaging element of the cameras, and d is a distance between corresponding points of the two images taken by the cameras (parallax).
In this way, information about the three-dimensional position of the SOI can be readily calculated. The ranging camera apparatus is capable of calculating the three-dimensional position of the subject only to the extent that the subject is present in both of the images taken by the two cameras. More specifically, the ranging camera apparatus calculates the three-dimensional position by utilizing luminance information of the photographed images.
When calculating the parallax from the photographed images, the images taken by the cameras from different viewpoints are partitioned into blocks, and a matching process is carried out on the blocks in terms of luminance. The simplest and fastest one of the methods using such a block matching process is a method based on a city block distance calculation by which a correspondence degree is calculated from the sum of absolute values of corresponding pixels by the sum of absolute difference (SAD) method, as discussed in Patent Document 1.
There is a need for automatic recognition of a forward-direction situation by taking pictures of objects ahead of a motor vehicle using an onboard camera. Patent Document 2 discusses a technology that enables the recognition of a road condition or the road edges, which are difficult to recognize with the conventional luminance information alone, by utilizing polarization ratio information. The technology according to Patent Document 1 also enables three-dimensional recognition of a forward-direction situation by utilizing the parallax information. Thus, there is a need for acquiring both polarization information and parallax information simultaneously.
When it is desired to perform automatic recognition of a forward-direction situation by taking pictures of objects ahead based on an image taken by the on-board camera, and control the vehicle based on the image, real-time processing is required.
In Patent Document 1, because the parallax is calculated from the city block distance by performing the SAD method on the luminance information (brightness information), a mismatch is caused if the cameras for acquiring the luminance images do not have the same sensitivity. If there is a mismatch, the calculation of the parallax would be affected, resulting in a ranging error. In order to prevent such an error, various matching algorithms have been proposed. Some of the algorithms involve normalizing or encoding the luminance information of the images prior to matching. However, such algorithms are complicated and the processing speed may be reduced. On the other hand, from the hardware point of view, a method may be employed that would electrically control the sensitivity of the imaging elements by selecting only those imaging elements having a predetermined sensitivity, or through a calibration step. However, these methods require the selecting or adjusting step for maintaining a uniform sensitivity of the cameras, resulting in a cost increase when the ranging camera apparatuses are to be mass-produced.
When calculating the parallax by partitioning the luminance images taken by the cameras from the different viewpoints into blocks, and then performing matching on the blocks, the images in the blocks used for parallax calculation need to have a sufficient luminance difference. For example, if there is no luminance difference in the image because the image is so dark, all of the blocks would have the same characteristics, so that a mismatch can be caused. In order to avoid such a mismatch, the duration of exposure time may be extended or a gain offset process may be performed on the imaging elements so that they can take images with high sensitivity under any conditions. However, this results in an increase in cost and processing time.
When polarization ratio information alone is utilized as discussed in Patent Document 2, no depth information of the forward-direction environment is available, so that it is difficult to separate objects that appear overlapping in a two-dimensional image.
The disadvantages of the prior art may be overcome by the present invention which, in one aspect, is a ranging camera apparatus that includes an imaging device configured to produce polarization images having a phase difference by imaging a subject; and a processing unit configured to perform parallax calculation using polarization information of the polarization images having the phase difference.
In another aspect, the invention provides a ranging camera apparatus that includes an imaging device configured to output first polarization image data having one phase and second polarization image data having another phase by imaging a subject; an operating process unit to which the first and the second polarization image data are fed; a memory connected to the operating process unit; and an image processing unit connected to the memory. The operating process unit includes first and second polarization ratio information processing units and a parallax calculating unit. The first polarization ratio information processing unit is configured to receive the first polarization image data from the imaging device and configured to calculate first polarization ratio information image data and first luminance information image data. The second polarization ratio information processing unit is configured to receive the second polarization image data from the imaging device and configured to calculate second polarization ratio information image data and second luminance information image data. The parallax calculating unit is configured to receive the first and the second polarization ratio information image data and configured to generate parallax information image data. The memory is configured to store the first and the second polarization ratio information image data and the first and the second luminance information image data from the first and the second polarization ratio information processing units, and the parallax information image data from the parallax calculating unit. The image processing unit is configured to recognize the subject based on the first and the second polarization ratio information image data, the first and the second luminance information image data, and the parallax information image data stored in the memory, and configured to calculate a three-dimensional position of the subject based on the parallax information image data.
A complete understanding of the present invention may be obtained by reference to the accompanying drawings, when considered in conjunction with the subsequent detailed description, in which:
Referring back to
The second imaging unit 21b similarly includes a second hood portion 22b, a second lens portion 23b, a second region dividing filter 24b, and a second imaging element 25b disposed on a printed circuit board 26b. The second region dividing filter 24b includes two polarizer regions configured to transmit an S polarization component or a P polarization component of light incident on the second region dividing filter 24b via the second lens portion 23b. Thus, the second region dividing filter 24b separates the light into S polarization component light and P polarization component light. The S and P polarization component lights are then incident on the second imaging element 25b. The second imaging element 25b outputs second polarization raw image data 27b to the operating process unit 31 of the image processor 3.
The operating process unit 31 includes first and second polarization ratio information processing units 33a and 33b, and a parallax calculating unit 34. The first polarization ratio information processing unit 33a generates first polarization ratio (which may be hereafter referred to as “PR”) information image data 35a by calculating a polarization ratio PR of the P polarization component and the S polarization component based on the first polarization raw image data 27a, in accordance with the following equation (2), and outputs the PR information image data 35a to the parallax calculating unit 34 and the memory 32.
PR=P/S (2)
where P is the P polarization component, and S is the S polarization component.
The PR is calculated in order to detect a characteristics difference between the acquired polarization components having different phase differences. Therefore, the polarization ratio PR may be calculated in accordance with any of the following equations (3) through (5):
PR=P−S (3)
PR=(P/S)/(P+S) (4)
PR=(P−S)/(P+S) (5)
Although equation (3) calculates a difference, the results of the calculations using the polarization information having a phase difference are collectively referred to as a polarization ratio.
The denominator in equations (4) and (5) is a normalizing portion. Alternatively, normalization may be based on a difference between P and S. Although the P polarization information and the S polarization information are utilized in obtaining the polarization ratio information in the present embodiment, circular polarization components may be utilized because it is only required that there be a phase difference.
The first polarization ratio information processing unit 33a generates first luminance information image data 36a by summing the P polarization component and the S polarization component in accordance with the following equation (6), and outputs the first luminance information image data 36a to the memory 32.
Luminance information image data=P+S (6)
On the other hand, the second polarization ratio information processing unit 33b generates second polarization ratio information image data 35b by calculating the polarization ratio PR based on the second polarization raw image data 27b, and outputs the second polarization ratio information image data 35b to the parallax calculating unit 34 and the memory 32. The second polarization ratio information processing unit 33b also generates second luminance information image data 36b by summing the P and S polarization components, and outputs the second luminance information image data 36b to the memory 32.
The parallax calculating unit 34, using the first and the second polarization ratio information image data 35a and 35b, calculates a total (“RSAD”) of the luminance differences in the image blocks of the images in accordance with the following equation (7), thereby obtaining a correspondence evaluation value. The correspondence evaluation value is evaluated such that the smaller the correspondence evaluation value, the higher the degree of correspondence between the blocks. The evaluation provides parallax information image data 37 that is outputted to the memory 32.
where i and j indicate pixel positions in the blocks, and I and T indicate luminance values of left and right pixels.
Thus, the parallax calculating unit 34 determines a block centered around a pixel of interest in the first polarization ratio information image data 35a, and determines a block of the same size in the second second polarization ratio information image data 35b. The parallax calculating unit 34 then calculates a correlation value each time one block is shifted from the other by one pixel. The parallax calculating unit 34 determines the distance to the pixel at the center of the block having the greatest correlation as a parallax. This step is performed for all of the pixels (or at certain intervals of the pixels) of the first polarization ratio information image data 35a. For the calculation of the correlation value, a variety of algorithms may be used, of which equation (4) above may be the most conventional example. The method according to the present embodiment may be applied to many other parallax calculating algorithms.
The MPU 41 may be configured to perform various recognizing processes by using the parallax information image data 37, the first and the second luminance information image data 36a and 36b, the first and the second polarization ratio information image data 35a and 35b stored in the memory 32.
The lines dividing the first and the second polarizer regions 241 and 242 may be inclined at such an angle that a change of one pixel of the first imaging element 25a (25b) in the lateral direction corresponds to a change of two pixels of the first (second) imaging element 25a (25b) in the vertical direction. Thus, the polarization ratio can be calculated without being easily affected by a position error between the first (second) imaging element 25a (25b) and the first (second) region dividing filter 24a (24b).
The first and the second polarizer regions 241 and 242 of the first and the second region dividing filters 24a and 24b may include polarizers made of photonic crystal. In the first polarizer region 241 of the first and the second region dividing filters 24a and 24b, for example, first transparent medium layers 244 having a high refractive index and second transparent medium layers 245 having a low refractive index are alternately layered on a transparent substrate 243 having periodic grooves, while a shape of the interface is preserved, as illustrated in
As illustrated in
In the first polarizer region 241, the direction of the grooves is parallel to the Y axis direction, as illustrated in
While the first and the second region dividing filters 24a and 24b are provided with the two kinds of concave/convex-shaped grooves in the illustrated example, the concave/convex shaped grooves may be oriented in three or more directions. By thus forming the first and the second polarizer regions 241 and 242 with a photonic crystal, superior resistance to degradation by ultraviolet rays can be obtained, thus enabling the apparatus to be used stably for a long period of time.
An opening area and the transmission axis of the first and the second polarizer regions 241 and 242 of the first and the second region dividing filters 24a and 24b can be freely designed by controlling the size or direction of the pattern of the grooves on the transparent substrate 243. The groove pattern may be formed by various methods, such as electron beam lithography, photolithography, interference exposure, and nanoprinting. In any of the methods, the direction of grooves can be highly accurately determined in each micro region. Thus, a polarizer region in which fine polarizers having different transmission axes are combined may be formed, and a polarizer consisting of an arrangement of plural of such fine polarizers may be formed. Because only specific regions having the concave/convex pattern perform the polarizer operation, a surrounding region may be flatly formed or provided with a concave/convex pattern that is isotropic in the plane so that the surrounding region has no polarization dependency. In this case, the light is transmitted by the surrounding region, so that a polarizer can be built only within a specific region.
The first and the second region dividing filters 24a and 24b are disposed adjacently to the first and the second imaging elements 25a and 25b, respectively. Preferably, the first and the second region dividing filters 24a and 24b may be bonded to the first and the second imaging elements 25a and 25b, respectively, which are die-mounted, with a filter structure surface of the filters facing the imaging element surface side, using an adhesive or the like. Generally, light from a lens travels toward an imaging element as converging finite light. Thus, if the first and the second region dividing filters 24a and 24b and the first and the second imaging elements 25a and 25b are spaced apart respectively from one another, the light near the boundary of the first or the second region dividing filter 24a or 24b may produce crosstalk noise in each region. Such crosstalk can be prevented and the imaging device 2 can perform stably by disposing the first and the second region dividing filters 24a and 24b adjacently to the first and the second imaging elements 25a and 25b, respectively.
Much time is required for the processes of extracting the S and P polarization components from the first and the second polarization raw image data 27a and 27b, generating the first and the second polarization ratio information image data 35a and 35b according to equations (2) and (3), and generating the first and the second luminance information image data 36a and 36b according to equation (6), using the first and the second polarization ratio information processing units 33a and 33b. Much time is also required for generating the parallax information image data 37 by performing a parallax calculation on the first and the second polarization ratio information image data 35a and 35b using the parallax calculating unit 34.
As mentioned above, it is very difficult to simultaneously output the three kinds of information including the first and the second polarization ratio information image data 35a and 35b, the first and the second luminance information image data 36a and 36b, and the parallax information image data 37 by performing complicated calculations using the polarization ratio information processing units 33a and 33b and the parallax calculating unit 34.
After a block of data for the parallax calculation is stored in the first and the second line buffers 92a and 92b, the parallax calculating unit 34 reads the block of data, such as four pixels vertically×four pixels laterally, and performs the parallax calculation according to equation (7), thereby generating the parallax information image data 37 which are then stored in the memory 32. Thus, a pipeline process is performed using the first and the second line buffers 91a, 91b, 92a, and 92b, so that the calculation result can be stored in the memory 32 with only several lines of delay. The above processes may be implemented by a field programmable gate array (FPGA) or an application specific IC (ASIC). Such a hardware structure enables the ranging camera apparatus 1 mounted on the vehicle 5 to process data in a real-time manner.
The pixel size used in the parallax calculation or the polarization ratio calculation may be dynamically determined. In this case, the first and the second line buffers 92a and 92b may also be configured to dynamically store the image data. When the imaging element comprises a CMOS sensor, several pixels vertically×several pixels laterally may be dynamically allocated to the buffers, instead of on a line by line basis. Such configuration may be dynamically varied depending on the imaging conditions.
In the second parallax image 371b of
The ranging camera apparatus 1b eliminates the need for taking into consideration a geometric positioning error and the like of the polarizing element and an improved resolution of an image can be obtained, although the ranging camera apparatus 1b may cost more than the ranging camera apparatus 1 or 1a.
Preferably, the number of the left- and right-side imaging units 210a and 210b of the ranging camera apparatus 1b may be increased in order to acquire finer polarization components or perform a stereoscopic parallax calculation.
In one method of preparing the wire grid, a metal film may be formed on a substrate, and then the metal film may be patterned by lithography in order to leave thin lines of metal. In another method, grooves may be formed in the substrate by lithography, and then a film of metal may be formed by performing vacuum vapor deposition from a direction perpendicular to the direction of the grooves and inclined from the normal to the substrate (i.e., from a direction inclined with respect to the substrate surface). In the case of vacuum vapor deposition, particles emitted by a source of vapor deposition travel from the source to the substrate along a straight line while hardly colliding with the other molecules or atoms. Thus, the film can be formed only on the convex portions of the grooves, while hardly any film is formed on the bottom (concave) portions of the grooves as the particles are blocked by the convex portions. Thus, by controlling the amount of film formed, a metal film can be only formed on the convex portions of the grooves on the substrate.
Preferably, the metal wire of the wire grid type polarizer may comprise aluminum or silver. Other metals, such as tungsten, may also be used and the same phenomenon may be realized. Lithography may include optical lithography, electron beam lithography, and X-ray lithography. Preferably, electron beam lithography or X-ray lithography may be used given the intervals of the thin lines on the order of 100 nm for an operation with visible light. While vacuum deposition may be preferably used for forming the film of metal, sputtering in a high-vacuum atmosphere or collimation sputtering using a collimator may be performed given the relative importance of directionality of the particles incident on the substrate. Because the wire grid type polarizer can be produced by a semiconductor process as in the case of the polarizer using a photonic crystal, a boundary of two regions, for example, can be accurately produced.
Although this invention has been described in detail with reference to certain embodiments, variations and modifications exist within the scope and spirit of the invention as described and defined in the following claims.
The present application is based on Japanese Priority Application No. 2009-239946 filed Oct. 19, 2009, the entire contents of which are hereby incorporated by reference.
Number | Date | Country | Kind |
---|---|---|---|
2009-239946 | Oct 2009 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2010/068537 | 10/14/2010 | WO | 00 | 3/22/2012 |