The present invention relates to an image capturing apparatus, an image capture system, an information processing apparatus, and a computer-readable storage medium.
Conventional image capturing apparatuses that capture images in all directions or spherically employ a hyperboloidal mirror or a fisheye lens, for example. These image capturing apparatuses are specific optical systems that need a non-linear image processing such as distortion correction or projective transformation so that the captured image with the apparatus can be visualized to a user.
One of known technologies is that a semispherical image is obtained with a wide-angle capture employing a fisheye lens and recorded as a distorted circular image. A part of the distorted circular image is cut out and image processing is performed on the image with a computer, whereby the distorted image is transformed into a plane regular image.
Another known technology is that while performing the image processing above, when the center position of the distorted circular image does not correctly correspond to the vertical point direction, a user specifies a parameter of a tilt angle. The load while performing distortion correction on the image is therefore reduced.
As described above, when images are captured in a state of the image capturing apparatus being tilted, an omni-directional spherical image in the wrong vertical direction may be generated. Some inventions to solve this problem have been already known.
Japanese Patent Application Laid-open No. 2003-223633 and Japanese Patent Application Laid-open No. 2006-059202 disclose a technology of generating an omni-directional spherical image having the correct vertical direction. This is achieved by adding rotational transformation according to the tilt of a camera to the process of non-linear image processing.
Japanese Patent Application Laid-open No. H11-309137 and Japanese Patent No. 4175832 disclose a technology of generating an omni-directional spherical image having the correct vertical direction at high speed, for the purpose of reducing the time required for generating the omni-directional spherical image having the correct vertical direction or reducing the cost on arithmetic operations. This technology is achieved by adding rotational transformation according to the tilt of the camera, in addition to distortion correction or projective transformation, preparing a conversion table used for non-linear transformation in advance, and performing batch transformation through the conversion table when capturing images.
With the structure of previous spherical image capturing apparatuses, however, users are required to specify a tilt angle of the apparatus during capture when combining a plurality of images captured at once from a plurality of image capturing directions, when transforming the combined image into a plane regular image, and when displaying the resulting image. That is to say, there was a problem in that a necessary parameter for correction could not be obtained during capture for automatically correcting the image.
Additionally, with the previously performed method for capturing images in all directions or spherically through a conversion table according to the tilt, if the amount or orientation of the tilt changes, the conversion table needs to be re-made from the beginning. In this case, processing requires a relatively long time to generate an omni-directional spherical image having the correct vertical direction according to any tilt of the image capturing apparatus.
Japanese Patent Application Laid-open No. H11-309137 and Japanese Patent No. 4175832 disclose in particular a technology generating images using a conversion table that includes rotational transformation according to the tilt of the camera. The conversion table is made so as to correspond to the amount of tilt determined in advance. Accordingly, if the amount or orientation of the tilt changes so as to differ from the predetermined amount or orientation, the conversion table needs to be re-made from the beginning. Also in this case, the processing still requires a relatively long time to generate an omni-directional spherical image having the correct vertical direction according to any tilt of the image capturing apparatus, therefore the problem is not yet solved.
Therefore, there is a need to provide an image capturing apparatus capable of generating an omni-directional spherical image having the correct vertical direction, according to any tilt of the image capturing apparatus.
It is an object of the present invention to at least partially solve the problems in the conventional technology.
According to an embodiment, there is provided an image capturing apparatus that includes a tilt detection unit configured to detect a tilt in a vertical direction; conversion data used for transforming plane coordinates into spherical coordinates; a correction unit configured to correct the conversion data according to the tilt; a plurality of image capturing units; a coordinate transformation unit configured to transform plane coordinates of a plurality of pixels included in images captured by the image capturing units into spherical coordinates according to the conversion data corrected by the correction unit; and a combining unit configured to combine the images including the pixels transformed into spherical coordinates by the coordinate transformation unit.
The above and other objects, features, advantages and technical and industrial significance of this invention will be better understood by reading the following detailed description of presently preferred embodiments of the invention, when considered in connection with the accompanying drawings.
Exemplary embodiments according to the present invention are described below with reference to the accompanying drawings. In the drawings and descriptions below, the same or equivalent components are designated by the same reference numerals, and overlapped explanation thereof will be simplified or omitted as appropriate. In the embodiments, when generating an omni-directional spherical image, the omni-directional spherical image capturing apparatus detects the vertical direction and corrects a conversion table used for image processing according to the vertical direction. In the embodiments, the omni-directional spherical image is generated through the corrected conversion table. Accordingly, the conversion table does not need to be re-made from the beginning, thereby reducing the processing time. The image capturing apparatus refers to a stand-alone apparatus, such as a digital camera; an image capture system refers to a system in which a plurality of apparatuses such as a digital camera and an information processing apparatus are utilized separately. In the embodiments, unless otherwise specified, the image capturing apparatus conceptually includes an image capture system.
A multi-eye image capturing apparatus will now be described that captures images in all directions spherically at once.
For example, the multi-eye image capturing apparatus illustrated on the left in
The image capturing apparatus with two-eye structure will be described hereafter, for convenience of description. The basic idea of the present invention may be applied to the image capturing apparatus with the four-eye structure illustrated on the right in
In the image capturing apparatus, it is assumed that the central axis of the image capturing apparatus in the vertical direction corresponds to the vertical axis of the sphere. Specifically, an image obtained through each of two fisheye lenses is presented as illustrated in
The borderline of 180 degrees illustrated in
With the structure employing four lenses illustrated on the right in
For example, when distortion correction is performed on a plurality of images and the images are combined with each other in the same manner as in Mercator projection, an image illustrated in
When the image capturing apparatus is placed so as to be correctly upright and images are captured without the apparatus being tilted, a correct image can be obtained having the straight horizontal line as illustrated in
If distortion correction and combination are performed on images captured with the image capturing apparatus in a state of being tilted without consideration of the tilt, the horizontal line is curved like a graph of a trigonometric function as illustrated in
The principles of measurement of a tilt angle will now be described with reference to
An acceleration sensor as illustrated in
The image capturing apparatus according to the embodiment will now be described in detail.
In the embodiment, two capturing elements (i.e., the two-eye structure) are employed so as to obtain omni-directional images, however, three or more capturing elements may be employed. When three capturing elements are employed, a lens corresponding to the image capturing element, which will be described with reference to
The controller 10 includes a central processing unit (CPU) 101, a static RAM (SRAM) 102, a read-only memory (ROM) 103, an image processing block 104, an SDRAM interface (I/F) 105, an external storage device I/F 106, and an external sensor I/F 107.
In the description of the embodiment, the image processing block 104 performs typical image processing such as distortion correction and pixel defect correction, and the CPU reads a predetermined table or a computer program, thereby performing correction processing in the vertical direction according to the tilt of the digital camera 100. It is to be understood, however, that the image processing block 104 may perform correction processing in the vertical direction.
During capture, pieces of digitized image data are input to the image processing block 104 of the controller 10 through the image capturing element 1 (reference numeral: 109) and the image capturing element 2 (reference numeral: 110). The pieces of image data input are image-processed using the image processing block 104, the CPU 101, the SRAM 102, and the SDRAM 111, for example, and finally stored in the external storage device 112. Examples of an external storage device include CompactFlash (registered trademark) or a Secure Digital (SD) memory.
In the controller 10, a universal serial bus (USB) connection interface for connecting to an external device or a wired or wireless network I/F for connecting to a network may be provided. A conversion table, which will be described later, a correction processing program for the conversion table, and a processing program for calculation of vertical correction are stored in the SRAM 102 or the SDRAM 111.
The acceleration sensor 113 is used for detecting a tilt of the digital camera 100 during capture. This enables detection in the tilting direction of the digital camera instantly and readily.
The acceleration sensor 113 is a three-axis acceleration sensor that detects acceleration of the digital camera 100 in the three directions perpendicular to each other, i.e., the up and down direction, the right and left direction, and the back and front direction. When a user holds the digital camera 100 in its hand(s) so that the digital camera 100 remains still, the acceleration sensor 113 detects the gravity acceleration only.
When acceleration is detected only in the down direction in the up and down direction, it is found that the up and down direction of the digital camera 100 corresponds to the top and bottom direction with respect to the ground. In other words, it is known that the digital camera is held horizontally in the same manner usually operating a digital camera.
When the image capturing apparatus tilts in the top and bottom direction, the acceleration sensor 113 detects acceleration in the right and left direction and the back and front direction according to the actual tilting direction. A tilt angle of the digital camera 100 can be obtained according to the magnitude of acceleration in the up and down direction, the right and left direction, and the back and front direction.
The omni-directional spherical image capturing apparatus will now be described.
The present embodiment aims to generate the omni-directional spherical image using the omni-directional spherical image capturing apparatus capable of capturing images in all directions from a capture point. That is to say, the omni-directional spherical image capturing apparatus (digital camera) can capture images in all directions viewed from the capture point.
The digital camera 100 (illustrated in
Operations of the omni-directional spherical image capturing apparatus according to the embodiment will now be described.
The acceleration sensor 113 detects a tilt angle of the digital camera 100 at Step (hereinafter, noted as “S”) 301. At S302, the controller 10 reads out a conversion table stored in the SDRAM 111, for example, according to a tilt angle of the digital camera 100 detected at S301. The controller 10 then corrects the conversion table in a predetermined correction method. The correction method for the conversion table will be described later.
At S303, two digitized fisheye images captured through the image capturing element 1 (reference numeral: 109) and the image capturing element 2 (reference numeral: 110) are input to the image processing block 104. The image processing block 104 performs typical image processing such as distortion correction. At S304, the controller 10 converts the two captured fisheye images (like the image as illustrated in
At S305, the controller 10 generates a composed omni-directional spherical image utilizing a superimposed field between the two images converted at S304. At S306, the controller 10 stores the omni-directional spherical image generated at S305 in the external storage device 112 through the external storage device I/F 106.
A projection relation of the fisheye lens will now be described as an example of wide-angle lenses employed in the digital camera according to the embodiment.
Examples of a technique (function) of projective transformation include central projection, stereographic projection, equidistant projection, equi-solid-angle projection, and orthographic projection. The central projection is employed when capturing images using a digital camera including a typical angle of view. Other four methods above are employed in a digital camera including a wide-angle lens with an ultra wide angle of view such as a fisheye lens.
A format (form of equation) of the omni-directional spherical image will now be described that is captured by the digital camera according to the embodiment.
Plane coordinates of images captured through the fisheye lens and spherical coordinates of the omni-directional spherical image can be related to each other using the projection function f (h=f(θ)) illustrated in
The process of generating the omni-directional spherical image will now be described with reference to an actual image captured through the fisheye lens.
The respective images illustrated in
Subsequently, the processing at 5305 illustrated in
The correction method for the conversion table at S304 illustrated in
A post-conversion image is generated from the captured image (pre-conversion image) according to the conversion table illustrated in
An explanation will now be given on processing of correcting the conversion table for the omni-directional spherical image captured by the digital camera according to the embodiment, according to the tilt of the digital camera.
At S801, the controller 10 obtains a camera tilt parameter (α, β) according to a tilt value of the digital camera 100 detected by the acceleration sensor 113 (
At S802, input values (θ1, φ1) of the conversion table are set. In
At S803, the input values (θ1, φ1) in the global coordinate system are transformed into the values (θ0, φ0) in the camera coordinate system through calculation of vertical correction by the controller 10. This calculation of vertical correction will be described later.
At S804, converted values (θ0, φ0) in the camera coordinate system are transformed into the coordinates (x, y) of the pre-conversion image through the conversion table not yet corrected (
At S805, the controller 10 stores input values (θ1, φ1) in the global coordinate system and coordinates (x, y) not yet corrected, which are calculated in the end above, as a set of coordinates corresponding to each other in the conversion table already corrected.
At S806, the controller 10 determines whether any outstanding input values (θ1, φ1) remain. In other words, the controller 10 determines whether any outstanding input values (θ1, φ1) in the global coordinate system, in which the corresponding coordinates (x, y) not yet corrected is not calculated remain. If the controller 10 determines that outstanding input values (θ1, φ1) remain (Yes at S806), the processing returns to S802 so that input values (θ1, φ1) in the global coordinate system are set as a subsequent value.
If the controller 10 determines that no outstanding input values (θ1, φ1) remain (No at S806), the processing ends. In this case, the controller 10 completes calculation of coordinates (x, y) not yet corrected corresponding to the pixels in the format of the omni-directional spherical image that have input values (θ1, φ1) in the global coordinate system as coordinates.
The tilt of the digital camera 100 according to the embodiment will now be described.
In
A tilt angle α in the gravity vector and a gradient angle β in the xy plane are obtained from the equation below using an output of the acceleration sensor. In the equation, Ax refers to a component value in the x0-axis direction in the camera coordinate system of the acceleration sensor, Ay refers to a component value in the y0-axis direction in the camera coordinate system of the acceleration sensor, and Az refers to a component value in the z0-axis direction in the camera coordinate system of the acceleration sensor.
α=Arc tan(Ax/Ay)
β=Arc cos(Az/√{square root over (Ax2+Ay2+Az2))}
The calculation of vertical correction will now be described.
In
Spherical coordinates (θ1, φ1) are transformed into spherical coordinates (θ0, φ0) through the equations illustrated in
Subsequently, a camera tilt parameter (α, β) is used to transform coordinates (x1, y1, z1) in the global coordinate system into coordinates (x0, y0, z0) in the camera coordinate system through the rotational coordinate transformation illustrated in
This means that the camera coordinate system is obtained by rotating the global coordinate system by α around the z-axis, and then rotating the global coordinate system by β around the x-axis. Finally, the three-dimensional Cartesian coordinates (x0, y0, z0) in the camera coordinate system are transformed back to spherical coordinates (θ0, φ0) in the camera coordinate system through Equations (5) and (6) illustrated in
Another example of calculation of vertical correction will now be described.
In the present embodiment, the calculation of vertical correction is accelerated. Equations (1) to (6) described above and illustrated in
Specifically, the rotation α and rotation γ around the z-axis is the rotation θ itself in spherical coordinates (θ, φ). Calculation for rotational transformation can be performed with simple addition or subtraction operations, without transforming into Cartesian coordinate system, thereby accelerating the calculation. Accordingly, transformation with Cartesian coordinate system is required only for the rotational transformation of the rotation β around the x-axis. This speeds up the calculation.
An explanation will now be given on another processing of correcting the conversion table for the omni-directional, spherical image captured by the digital camera according to the embodiment, according to the tilt of the digital camera.
In the present embodiment, the correction processing on the conversion table can be accelerated. In the embodiment illustrated in
With reference to
At S1202, the conversion table corresponding to the value of the camera tilt parameter (α, β) is obtained, then the processing ends. Specifically, the calculation of vertical correction is omitted by storing a plurality of conversion tables so that the conversion tables provide different values depending on the camera tilt parameter (α, β).
The camera tilt parameter (α, β) is a three-dimensional real vector, in principle. If conversion tables are provided only for specific camera tilt parameters (α, β), and the conversion table closest to the detected camera tilt parameter (α, β) is used, all parameters are covered with the provided tables. Alternatively, it is also effective that a plurality of tables close to the detected camera tilt parameter (α, β) are extracted and an interpolation operation such as weighting or obtaining difference. This enables correction of the conversion table only with an interpolation operation, which is relatively simple, thereby suppressing the processing on the operation.
The overall structure of the image capture system according to another embodiment will now be described.
As illustrated in
The tilt correction processing, however, requires tilt information on the spherical image capturing apparatus. The tilt information on the spherical image capturing apparatus can be detected only in the image capturing apparatus itself, thus the tilt information needs to be transmitted along with the images before correction. The conversion table may be stored in the spherical image capturing apparatus and transmitted along with the images. Alternatively, the conversion table may be transmitted to the destination device in advance as preparation for capturing (e.g., when the destination device recognizes the image capturing apparatus, or when the image capturing apparatus recognizes the destination device).
The conversion table does not need to be transmitted along with the images every time. The conversion table may be transmitted when the destination device requests the conversion table or checks whether the conversion table is the latest one. In addition, the conversion table may not be transmitted necessarily from the image capturing apparatus. For example, the conversion table may be uploaded to a website of the manufacturer that sells the image capturing apparatus so as to be downloaded and stored in the destination device. The conversion table transmitted from the image capturing apparatus may be customized so as to suit the image capturing apparatus.
An explanation will now be given on the destination device 200 in an image capture system according to another embodiment.
The controller 140 includes a CPU 1401, an SRAM 1402, a ROM 1403, an image processing block 1404, an SDRAM I/F 1405, and an external storage device I/F 1406. The controller 140 also includes a USB connection interface 1407 for connecting to an external device and a wired or wireless network I/F 1408 for connecting to a network.
In the description of the embodiment above, the image processing block 1404 performs typical image processing such as distortion correction and pixel defect correction, and the CPU 1401 reads a predetermined table or a computer program, thereby performing correction processing in the vertical direction according to the tilt of the digital camera 100. It is to be understood, however, that the image processing block 1404 may perform correction processing in the vertical direction.
Operations of the image capture system according to another embodiment will now be described.
Processing is performed in the digital camera 100 firstly. At S1501, a tilt angle of the digital camera 100 is detected by the acceleration sensor 113 (
As described above, the conversion table may be transmitted in advance when the digital camera 100 and the destination device 200 recognize each other. The conversion table may be transmitted to the destination device 200 only once, that is, the conversion table does not need to be transmitted every time.
The conversion table is stored in the SDRAM 1411 coupled to the digital camera 100, for example, and read out therefrom, then transmitted. This is the end of processing performed in the digital camera 100. The subsequent steps are performed in the destination device 200.
At S1503, the conversion table is corrected in the controller 140 through a predetermined correction method according to the tilt information, which is the transmitted angular information of the digital camera. The correction method for the conversion table is the same in the embodiment described above. At S1504, the transmitted fisheye images captured through the two capturing elements are input to the image processing block 1404 of the destination device 200.
In the image processing block 1404, typical image processing such as distortion correction is performed. At S1505, the two fisheye images are transformed by the controller 140 through the conversion table corrected at S1503. The transformation method is the same as in the embodiment described above.
At S1506, a composed omni-directional spherical image is generated by the controller 140 utilizing the superimposed field of the two images transformed at S1505. At S1507, the omni-directional spherical image generated at S1506 is stored by the controller 140 in the external storage device 1412 through the external storage device I/F 1406.
The operations relating to the flowcharts according to the embodiments illustrated in
As described above, in the omni-directional spherical image capturing apparatus, or in the image capture system, the vertical direction is detected, and correction is performed on a conversion table used for image processing, and an omni-directional spherical image is generated through the corrected conversion table. This eliminates the necessity of re-making the conversion table from the beginning, thereby reducing the processing time.
Although the invention has been described with respect to specific embodiments for a complete and clear disclosure, the appended claims are not to be thus limited but are to be construed as embodying all modifications and alternative constructions that may occur to one skilled in the art that fairly fall within the basic teaching herein set forth.
Number | Date | Country | Kind |
---|---|---|---|
2012-053190 | Mar 2012 | JP | national |
2012-263542 | Nov 2012 | JP | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2013/057338 | 3/11/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2013/133456 | 9/12/2013 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20010015751 | Geng | Aug 2001 | A1 |
20030142203 | Kawakami et al. | Jul 2003 | A1 |
20040104996 | Hayashi et al. | Jun 2004 | A1 |
20060029256 | Miyoshi | Feb 2006 | A1 |
20070269188 | Shiga | Nov 2007 | A1 |
20090167886 | Tonomura | Jul 2009 | A1 |
20100097444 | Lablans | Apr 2010 | A1 |
20100208032 | Kweon | Aug 2010 | A1 |
20110115879 | Homma et al. | May 2011 | A1 |
20130235149 | Tanaka | Sep 2013 | A1 |
20140071227 | Takenaka | Mar 2014 | A1 |
20140176542 | Shohara | Jun 2014 | A1 |
20150103197 | Djordjevic | Apr 2015 | A1 |
20160073024 | Yamamoto | Mar 2016 | A1 |
Number | Date | Country |
---|---|---|
1725857 | Jan 2006 | CN |
11-084482 | Mar 1999 | JP |
11-309137 | Nov 1999 | JP |
2003-223633 | Aug 2003 | JP |
2003-524306 | Aug 2003 | JP |
2003-524306 | Aug 2003 | JP |
2005-056295 | Mar 2005 | JP |
2005-063141 | Mar 2005 | JP |
2005-175952 | Jun 2005 | JP |
2006-33570 | Feb 2006 | JP |
2006-059202 | Mar 2006 | JP |
2007-311904 | Nov 2007 | JP |
4175832 | Aug 2008 | JP |
2008-205811 | Sep 2008 | JP |
2009-064225 | Mar 2009 | JP |
2009-176273 | Aug 2009 | JP |
2009-267792 | Nov 2009 | JP |
2010-239221 | Oct 2010 | JP |
2011-040898 | Feb 2011 | JP |
2011-061511 | Mar 2011 | JP |
2011-114496 | Jun 2011 | JP |
WO 9951027 | Oct 1999 | WO |
WO 2005015494 | Feb 2005 | WO |
Entry |
---|
Extended European Search Report issued Jun. 26, 2015 in Patent Application No. 13757254.1. |
Shigang Li, “Sky-ground Representation for Local Scene Description” Proceedings of the 17th International Conference on Pattern Recognition, vol. 3, XP010724647, Aug. 23, 2004, pp. 1-4. |
Mauro Turturici, et al., “Low-power Embedded System for Real-Time Correction of Fish-Eye Automotive Cameras” Design, automation and test in Europe, EDA Consortium, XP058021618A, Mar. 12, 2012, pp. 1-2. |
International Search Report issued on May 21, 2013 in PCT/JP2013/057338 filed on Mar. 11, 2013. |
Office Action issued Jul. 1, 2016 in Korean Patent Application No. 10-2016-7008975 (with English language translation). |
Office Action issued on Nov. 4, 2016 in Chinese Patent Application No. 201380012472.2 (with English language translation). |
Number | Date | Country | |
---|---|---|---|
20150062363 A1 | Mar 2015 | US |