A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The disclosed embodiments relate generally to image processing and computer vision, and more particularly, but not exclusively, to rectifying a wide-angle image.
Wide-angle lenses, such as fisheye lenses, can be used in various recreational and commercial applications where a wide angle of view is beneficial. For example, fisheye lenses are useful in products such as panoramic cameras and vision systems (e.g. for parking or security monitoring). However, most wide-angle lens applications are not satisfactory, due to the need for efficiently eliminating geometry distortion in the wide-angle images. This is the general area that embodiments of the disclosure are intended to address.
Described herein are systems and methods that provide a technical solution for rectifying a wide-angle image captured using a wide-angle lens. The system can obtain a projection model for the wide-angle lens, wherein the projection model associates the wide-angle image with a plurality of target image portions in a target image. Furthermore, the system can determine, based on the projection model, a plurality of reference pixels in the wide-angle image for a target pixel in the plurality of target image portions. Then, the system can calculate one or more pixel values for said target pixel based on the plurality of reference pixels in the wide-angle image.
Also described herein are systems and methods that provide a technical solution for rectifying a fisheye image captured using a fisheye lens. The system can determine, for a set of target pixels in a target image, one or more reference points in the fisheye image. Furthermore, the system can obtain a subsection of the fisheye image based on the one or more reference points. Then, the system can calculate one or more pixel values for each said target pixel in the set of target pixels, based on pixel values of one or more pixels in the subsection of the fisheye image.
The disclosure is illustrated, by way of example and not by way of limitation, in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” or “some” embodiment(s) in this disclosure are not necessarily to the same embodiment, and such references mean at least one.
The following description of the disclosure uses a fisheye lens as an example of a wide-angle lens. It will be apparent to those skilled in the art that other types of wide-angle lenses can be used without limitation.
In accordance with various embodiments, a technical solution can be provided for rectifying a wide-angle image captured using a wide-angle lens. The system can obtain a projection model for the wide-angle lens, wherein the projection model associates the wide-angle image with a plurality of target image portions in a target image. Furthermore, the system can determine, based on the projection model, a plurality of reference pixels in the wide-angle image for a target pixel in the plurality of target image portions. Then, the system can calculate one or more pixel values for said target pixel based on the plurality of reference pixels in the wide-angle image.
In accordance with various embodiments, a technical solution can be provided for rectifying a fisheye image captured using a fisheye lens. The system can determine, for a set of target pixels in a target image, one or more reference points in the fisheye image. Furthermore, the system can obtain a subsection of the fisheye image based on the one or more reference points. Then, the system can calculate one or more pixel values for each said target pixel in the set of target pixels, based on pixel values of one or more pixels in the subsection of the fisheye image. In this disclosure, an image being rectified (such as the fisheye image) is also referred to as a “raw image,” and the pixel value for a target pixel is also referred to as a “target pixel value.”
For example, the camera may have a fisheye lens 110 with a 180-degree angle of view and be capable of capturing images of a hemispherical field in front of the camera. Alternatively, the fisheye lens 110 may have a different angle of view. Thus, the image device 101 can capture images with different fields of view. In the example as shown in
In accordance with various embodiments, compared with conventional images, a fisheye image 102 can provide significantly more imaging information due to a larger field of view. Thus, it is advantageous to use an image device 101 with a fisheye lens 110 in various applications. In accordance with various embodiments, the fisheye image 102 can be used in various applications 103 (such as computer vision). For example, instead of combining multiple conventional cameras with narrow angles of view, an unmanned aerial vehicle (UAV) can rely on a fisheye lens camera or image sensor for performing various computer vision based navigation tasks while keeping the weight and complexity of the system under control.
On the other hand, wide-angle lenses, including ultra-wide lenses such as fisheye lenses, may have inherent distortions that are difficult to approximate using the conventional pinhole camera model. As shown in
In order to support various applications, e.g. different computer vision algorithms, the system can use a rectification process for correcting or eliminating the geometry distortion in the captured fisheye image. In some instances, various algorithms can be used for rectifying the fisheye images. For example, a spherical coordinate method with longitude mapping can be used for rectifying the fisheye images. However, the accuracy of this method is not satisfactory. Also, the spherical coordinate method may need to abandon the edge portion of the fisheye image, which limits its application in the field of computer vision. On the other hand, a polynomial coordinate transformation method requires complex and intensive computation for fitting high-order polynomials. Additionally, a rectification method based on the spherical perspective projection constraint needs to use a nonlinear optimization method to solve for the tangential and radial deformation parameters (i.e. the accuracy of this method depends on the selection of initial values). Both of these methods are computationally intensive, which limits their application in the field of computer vision.
In accordance with various embodiments, a projection model 210 can be used for projecting a wide-angle image, such as the fisheye image 201, into multiple image portions 212 of a target image 202. For example, a cubic surface projection model can project a fisheye image 201 (e.g. with at least a portion of a spherical view such as a hemispherical view) into five image portions, e.g. the front, back, left, right, and bottom (or top) portions of a cubic surface. Alternatively, a projection model can be configured to project the fisheye image 201 into any number of image portions.
In accordance with various embodiments, the pixel value(s) for each pixel in the target image 202 can be determined based on the projection model 210. Here, the projection of a target pixel in the target image 202 may not be located exactly on a pixel in the fisheye image 201. Thus, the pixel values of multiple fisheye image pixels in a neighboring block 203 may be used for calculating the pixel value of a pixel 204 in the target image 202.
$$f(\rho) = a_0 + a_1\rho + a_2\rho^2 + a_3\rho^3 + a_4\rho^4,$$
where $\rho = \sqrt{u^2 + v^2}$ for each point $(u, v)$ in an image coordinate system (U-V) in the fisheye image 401, and $f(u, v) = f(\rho)$ is the corresponding mapping value 405 in a reference coordinate system (X-Y-Z) by an affine transformation.
In accordance with various embodiments, the coefficients in the above polynomial function can be determined via a calibration of the fisheye lens 403. For example, the calibration process may include using the fisheye lens 403 to capture images of a checkerboard, and extracting corner points based on the number of horizontal and vertical cells, the cell size, and other parameters. Then, the calibration process can obtain the calibration parameters, such as the polynomial coefficients, by optimizing an objective function. Also, the calibration process can adjust the image center position to optimize the calibration results.
Referring back to
As shown in
As shown in
Thus, each point (u, v) in the image circle can be projected to a point (x, y, z) on the spherical surface 402 in the reference coordinate system X-Y-Z.
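By way of a non-limiting illustration, the mapping from an image point to the spherical surface can be sketched in Python as follows. The sketch assumes that (u, v) is already expressed relative to the image center, and that the direction (u, v, f(ρ)) is normalized to unit length to obtain the point (x, y, z). Both assumptions, as well as the function name pixel_to_sphere and the coefficient list coeffs, are introduced here for illustration only and are not taken from the disclosure.

```python
import math


def pixel_to_sphere(u, v, coeffs):
    """Map an image point (u, v) to a unit vector (x, y, z).

    coeffs holds the calibrated polynomial coefficients [a0, a1, a2, a3, a4].
    Assumption: (u, v) is measured from the image center, and the ray
    direction is (u, v, f(rho)) scaled to unit length.
    """
    rho = math.hypot(u, v)

    # Horner evaluation of f(rho) = a0 + a1*rho + ... + a4*rho**4.
    f = 0.0
    for a in reversed(coeffs):
        f = f * rho + a

    norm = math.sqrt(u * u + v * v + f * f)
    if norm == 0.0:
        raise ValueError("the point does not define a direction")
    return (u / norm, v / norm, f / norm)


# Hypothetical coefficients, chosen only to exercise the function.
x, y, z = pixel_to_sphere(3.0, 4.0, [0.0, 1.0, 0.0, 0.0, 0.0])
```

In this sketch the returned vector always has unit length, so it can be treated as a point on a unit sphere in the reference coordinate system.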
Referring back to
For example, as shown in
with x={0,−90,0,90,0}, y={90,0,0,0,−90}, and z=0.
Thus, each point in the fisheye image 401 can be projected to a point on the hemispherical surface 402, which in turn can be projected to a point on a cubic surface accordingly. As a result, a target image, computed using such a cubic surface projection model, can include a square area facing directly to the fisheye image 401 and four rectangular areas for the side angles. In the case of hemispherical view, only a half of each side cubic surface is used for projection from the hemispherical surface 402. In other examples, different sizes or portions of cubic surfaces can be used for projection from the surface 402.
In the example as shown in
In accordance with various embodiments, each point in the target image 502 can be obtained based on a projection model. In the example as shown in
As shown in
$$p(i+g,\, j+h) = (1-g)(1-h)\,p(i,j) + (1-g)\,h\,p(i,j+1) + g\,(1-h)\,p(i+1,j) + g\,h\,p(i+1,j+1)$$
Here, p(i, j) represents the pixel value at coordinate (i, j) in the fisheye image 501, and g and h indicate the fractional offsets (i.e. each with a value between 0 and 1) from the corresponding fisheye image pixel (i, j), since the projected coordinate may not be located at an integer pixel location (i, j).
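As a minimal sketch, the weighted sum above can be written directly in Python. The function name bilinear_sample and the nested-list image layout p[i][j] are assumptions made for this illustration; the disclosure does not prescribe a data layout.

```python
def bilinear_sample(p, i, j, g, h):
    """Evaluate p(i+g, j+h) from the four pixels surrounding the location.

    p is indexed as p[i][j]; g and h are the fractional offsets in [0, 1]
    along the first and second index, respectively.
    """
    return ((1 - g) * (1 - h) * p[i][j]
            + (1 - g) * h * p[i][j + 1]
            + g * (1 - h) * p[i + 1][j]
            + g * h * p[i + 1][j + 1])


# A 2x2 block of hypothetical pixel values.
block = [[10, 20],
         [30, 40]]
center = bilinear_sample(block, 0, 0, 0.5, 0.5)  # 25.0, the mean of the block
```

When g and h are both zero the result reduces to p(i, j), and the four weights always sum to one.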
In accordance with various embodiments, the projection model can be determined for each fisheye lens after calibration. Also, the projected coordinates for each pixel in the target image 502 can be computed and stored. Thus, any image captured using the calibrated fisheye lens can be rectified by applying the pre-stored mapping relationship, i.e. the pixel value(s) for each target pixel in the target image 502 can be estimated based on the stored projected coordinates.
In accordance with various embodiments, the rectification of the fisheye image 801 can be based on a mesh-based approach. Instead of directly computing the projected coordinates in the fisheye image 801 for each target pixel in the target image 802, the rectification process 800 can take advantage of a predetermined mesh in the target image 802. For example, the system can compute the coordinates for a mesh of target pixels (or mesh pixels 812) in the target image 802. In some instances, the mesh pixels 812 can be evenly distributed in the target image 802. Alternatively, the distribution of the mesh pixels 812 may not be even, and can be configured following certain rules (e.g. using a logarithmic function). In some instances, multiple mesh pixels 812 can form various geometric shapes, e.g. polygon shapes such as rectangles.
Instead of directly computing and pre-storing the projected coordinates in the fisheye image for each target pixel in the target image 802, the system can first compute the projected coordinate 811 for each mesh pixel 812 (i.e. determining the mapping relationship for each mesh pixel). Here, the system may only pre-store the projected coordinates, in the fisheye image, for each mesh pixel 812. Then, the system can estimate the projected coordinates for the target pixels (that are not mesh pixels 812) using interpolation based on the mesh pixels 812.
As shown in
For example, for a pixel t 804 at location (i, j), in a mesh cell with a size N*N (e.g. 4×4) in the target image 802, the projected coordinates 803 in the fisheye image 801 can be calculated using the following formula.
$$x_t = (N-i)\big((N-j)\,x_a + j\,x_b\big) + i\big((N-j)\,x_c + j\,x_d\big)$$
$$y_t = (N-i)\big((N-j)\,y_a + j\,y_b\big) + i\big((N-j)\,y_c + j\,y_d\big)$$
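The interpolation over a mesh cell can be sketched in Python as follows. Two points in the sketch are assumptions rather than statements of the disclosure: (i, j) is taken as the offset of pixel t within the cell (from 0 to N), and the weighted sum is divided by N² so that the four weights sum to one, a factor that the formula as printed does not show. The function name interpolate_mesh_coord and the corner arguments a, b, c, d (the projected coordinates of the four mesh pixels of the cell) are likewise hypothetical.

```python
def interpolate_mesh_coord(i, j, n, a, b, c, d):
    """Estimate the projected coordinate (x_t, y_t) of a pixel in a mesh cell.

    a, b, c, d are (x, y) projected coordinates of the four mesh pixels.
    i weights between the (a, b) pair and the (c, d) pair; j weights within
    each pair. Assumption: the sum is normalized by n * n.
    """
    def blend(va, vb, vc, vd):
        weighted = ((n - i) * ((n - j) * va + j * vb)
                    + i * ((n - j) * vc + j * vd))
        return weighted / (n * n)

    return (blend(a[0], b[0], c[0], d[0]),
            blend(a[1], b[1], c[1], d[1]))


# Hypothetical projected coordinates for the corners of a 4x4 cell.
xt, yt = interpolate_mesh_coord(2, 2, 4,
                                (0.0, 0.0), (4.0, 0.0),
                                (0.0, 8.0), (4.0, 8.0))  # (2.0, 4.0)
```

With this normalization, a pixel that coincides with a mesh pixel receives exactly the stored coordinate of that mesh pixel.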
In accordance with various embodiments, the pixel value for a target pixel 804 in the target image 802 can be determined based on the projected coordinate 803 in the fisheye image 801. In the example as shown in
For example, the coordinates for the pixels in a neighboring block 910 in the fisheye image can be determined using the following formula.
$$x_l = \lfloor x_t \rfloor,\quad x_u = x_l + 1$$
$$y_l = \lfloor y_t \rfloor,\quad y_u = y_l + 1$$
$$x_d = x_t - x_l$$
$$y_d = y_t - y_l$$
Then, the pixel value for the pixel t at location (i, j) can be determined using the following formula.
$$I_t = (1-y_d)\big((1-x_d)\,I(x_l,y_l) + x_d\,I(x_l+1,y_l)\big) + y_d\big((1-x_d)\,I(x_l,y_l+1) + x_d\,I(x_l+1,y_l+1)\big)$$
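The two steps above (locating the neighboring block, then blending its four pixels) can be combined and applied to a stored table of projected coordinates, as in the following Python sketch. The names sample_at and rectify, the row-major layout image[y][x] for I(x, y), and the clamping of indices at the image border are assumptions introduced for this illustration.

```python
import math


def sample_at(image, xt, yt):
    """Return the interpolated value I_t at the fractional location (xt, yt)."""
    height, width = len(image), len(image[0])

    xl, yl = math.floor(xt), math.floor(yt)
    xd, yd = xt - xl, yt - yl

    # Assumption: indices are clamped so that border locations stay valid.
    x0 = min(max(xl, 0), width - 1)
    x1 = min(max(xl + 1, 0), width - 1)
    y0 = min(max(yl, 0), height - 1)
    y1 = min(max(yl + 1, 0), height - 1)

    upper = (1 - xd) * image[y0][x0] + xd * image[y0][x1]
    lower = (1 - xd) * image[y1][x0] + xd * image[y1][x1]
    return (1 - yd) * upper + yd * lower


def rectify(image, coord_map):
    """Build a target image from a table of projected (xt, yt) coordinates."""
    return [[sample_at(image, xt, yt) for (xt, yt) in row]
            for row in coord_map]


# Hypothetical 2x2 raw image and a one-row coordinate table.
raw = [[0, 10],
       [20, 30]]
target = rectify(raw, [[(0.0, 0.0), (0.5, 0.5)]])  # [[0.0, 15.0]]
```

Because the coordinate table depends only on the calibrated lens, it can be computed once and reused for every image captured with that lens.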
In accordance with various embodiments, using the projection model 1010, the rectification process 1000 can determine the mapping relationship 1020 between the mesh pixels 1022 and the projected coordinates 1021 in the fisheye image. Here, the projected coordinates 1021 are determined as the reference points for an image subsection 1012 in the target image 1002. For example, the rectification process 1000 can compute the projected coordinates 1021 for each mesh pixel 1022 using the projection model 1010. Then, the rectification process 1000 can determine an image area (e.g., an image subsection 1011) in the fisheye image 1001 based on the projected coordinates 1021 for the mesh points 1022. For example, this image subsection 1011 can be a minimum bounding rectangular area that encloses the projected coordinates 1021 for the mesh pixels 1022.
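A minimal sketch of deriving such a bounding rectangle from the projected coordinates of the mesh pixels is given below. The function name bounding_subsection, the inclusive (x0, y0, x1, y1) return convention, and the one-pixel margin on the upper side (added so that the neighbors used for interpolation fall inside the rectangle) are assumptions of this illustration.

```python
import math


def bounding_subsection(projected_coords, width, height, margin=1):
    """Return an inclusive rectangle (x0, y0, x1, y1) in the fisheye image.

    projected_coords is an iterable of (x, y) projected coordinates for the
    mesh pixels of one target image subsection. The rectangle is clipped to
    an image of the given width and height.
    """
    coords = list(projected_coords)
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]

    x0 = max(math.floor(min(xs)), 0)
    y0 = max(math.floor(min(ys)), 0)
    # Assumption: extend by `margin` so the (floor + 1) neighbors are covered.
    x1 = min(math.floor(max(xs)) + margin, width - 1)
    y1 = min(math.floor(max(ys)) + margin, height - 1)
    return (x0, y0, x1, y1)


# Hypothetical projected coordinates for four mesh pixels.
rect = bounding_subsection(
    [(10.2, 5.7), (14.9, 5.1), (10.0, 9.3), (15.5, 9.9)], 100, 100)
# rect == (10, 5, 16, 10)
```

Only the pixels inside the returned rectangle would then need to be read into memory in order to compute the target pixels of the corresponding subsection.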
In accordance with various embodiments, the mesh-based approach can reduce the consumption of memory and input/output (I/O) bandwidth. In some instances, for a mesh with a cell size of m*m, the number of pixels for which the coordinate mapping needs to be stored can be estimated as (W/m+1)*(H/m+1), where W is the image width and H is the image height. In the example as shown in
In accordance with various embodiments, in order to compute the pixel values for the target pixels in each individual image block 1-7, a processor 1111 can read in a corresponding block of fisheye image 1101 into the memory 1110 for supporting the computation. The processor 1111 can compute the projected coordinate for each target pixel within an individual image block, based on the pre-stored projected coordinates for selected mesh points in the respective image block.
Referring back to
In accordance with various embodiments, the division of the target image 1102 can take into consideration the distortion of the fisheye image at the edge portion. In some instances, the system can alleviate the memory and I/O consumption by applying a division scheme on the target image. For example, the top and bottom portions of the target image 1102 can be further divided into multiple blocks, e.g. blocks (1, 2) and (6, 7) respectively, in order to reduce the width of the image block for performing the rectification calculation. Alternatively, depending on the division scheme, the left and right portions of the target image 1102 can be further divided into multiple blocks (e.g. into vertical stacks of blocks). Thus, the processor 1111 can avoid the need for handling a large area of pixel values in the fisheye image when the rectification is performed. Afterwards, the different portions of the target image 1102 can be combined together for output.
Many features of the present disclosure can be performed in, using, or with the assistance of hardware, software, firmware, or combinations thereof. Consequently, features of the present disclosure may be implemented using a processing system (e.g., including one or more processors). Exemplary processors can include, without limitation, one or more general purpose microprocessors (for example, single or multi-core processors), application-specific integrated circuits, application-specific instruction-set processors, graphics processing units, physics processing units, digital signal processing units, coprocessors, network processing units, audio processing units, encryption processing units, and the like.
Features of the present disclosure can be implemented in, using, or with the assistance of a computer program product which is a storage medium (media) or computer readable medium (media) having instructions stored thereon/in which can be used to program a processing system to perform any of the features presented herein. The storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data.
Stored on any one of the machine readable medium (media), features of the present disclosure can be incorporated in software and/or firmware for controlling the hardware of a processing system, and for enabling a processing system to interact with other mechanism utilizing the results of the present disclosure. Such software or firmware may include, but is not limited to, application code, device drivers, operating systems and execution environments/containers.
Features of the disclosure may also be implemented in hardware using, for example, hardware components such as application specific integrated circuits (ASICs) and field-programmable gate array (FPGA) devices. Implementation of the hardware state machine so as to perform the functions described herein will be apparent to persons skilled in the relevant art.
Additionally, the present disclosure may be conveniently implemented using one or more conventional general purpose or specialized digital computers, computing devices, machines, or microprocessors, including one or more processors, memory and/or computer readable storage media programmed according to the teachings of the present disclosure. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
While various embodiments of the present disclosure have been described above, it should be understood that they have been presented by way of example, and not limitation. It will be apparent to persons skilled in the relevant art that various changes in form and detail can be made therein without departing from the spirit and scope of the disclosure.
The present disclosure has been described above with the aid of functional building blocks illustrating the performance of specified functions and relationships thereof. The boundaries of these functional building blocks have often been arbitrarily defined herein for the convenience of the description. Alternate boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Any such alternate boundaries are thus within the scope and spirit of the disclosure.
The foregoing description of the present disclosure has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the disclosure to the precise forms disclosed. The breadth and scope of the present disclosure should not be limited by any of the above-described exemplary embodiments. Many modifications and variations will be apparent to the practitioner skilled in the art. The modifications and variations include any relevant combination of the disclosed features. The embodiments were chosen and described in order to best explain the principles of the disclosure and its practical application, thereby enabling others skilled in the art to understand the disclosure for various embodiments and with various modifications that are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalents.
This application is a continuation of International Application No. PCT/CN2016/108720, filed on Dec. 6, 2016, the entire content of which is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
20190287213 A1 | Sep 2019 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2016/108720 | Dec 2016 | US |
Child | 16432378 | US |