The embodiments described herein relate generally to electronic image and video processing, and particularly to correcting images severely distorted by wide-angle lenses.
Wide-angle lenses are commonly used in cameras amongst other image and video capture devices where a large viewing angle is required or preferred. There also exist so-called Ultra Wide-Angle (UWA) lenses, with Field of Views (FOV) of up to or even larger than 180°. A fisheye lens is an exemplary UWA that may cover up to 180° (2π steradians) of angular FOV. As such, having two fisheye lenses sitting back to back and facing opposite directions will in principle cover a 360° view of the surroundings.
Image capture systems equipped with UWA lenses have many useful applications. Examples are surveillance and security monitoring cameras, video-conferencing systems, specialty photographic cameras, and vehicle camera systems. On a negative side, these lenses exhibit significant amount of distortions that pose a number of challenges. For example, the output of a typical fisheye lens is normally a two-dimensional (2D) circular (oval for a general wide-angle lens) mapping of a 3D visually familiar Euclidian space where objects lie in real world. Inevitably, horizontal and vertical lines would look skewed and appear curvilinear, more noticeably for the objects farther away from the center of the lens. The relative sizes of objects closer to the lens look exaggeratingly larger, so that different parts of an object look out of proportion. These effects are generally known as the fisheye distortions that make it difficult to judge distances and object relationships. This is in contrast with standard camera lenses which maintain the scene perspective and linearity by capturing a planar view. Moreover, to achieve a larger angular FOV, manufactures may use a combination of several lenses. That may further introduce optical and geometric distortions.
Several techniques are known in the art to address the fisheye distortions. The curvilinear images produced by a fisheye lens can be remapped onto a conventional rectilinear display. In a series of prior art patents from OmniView and IPIX Corp. a portion of the full viewing sphere is selected and corrected for fisheye distortions. Although this approach provides a powerful tool for viewing up to 360°, only a selected portion is available at a time. The rest of the image is saved that could be available upon further selections.
U.S. Pat. No. 7,961,980B2 utilizes cylindrical projections in connection with circular fisheye mappings. As is well known, cylindrical projections stretch distances east-west. In particular, the stretching diverges to infinity at the poles. This solution would be useful for creating panoramic images when the polar areas can be chopped off, again meaning loss of information. Although it is possible to take different orientation cylinders to build several images, still this does not provide a single zero-content-loss (ZCL) image, that is, an image containing all content from the original UWA captured scene, with no loss of information. In fact, it is physically impossible to achieve full perspective correction for all portions of the image at the same time by applying any transformation.
Therefore, the main drawback of prior art is loss of image content and information pertaining to objects and their relative positioning. In certain applications, such as a security or monitoring camera, it would be desired to be able to select areas of interest and correct for distortions, and yet be able to preserve the full content of the image at all times. The present invention discloses an intermediate approach, whereby image data for all pixels of the entire UWA lens captured image is maintained to prevent any loss of information after the mapping that corrects for distortions to a certain level, as much as theoretically allowed. Furthermore, a locally adaptive transformation can be applied to the regions of interest that aims to partially restore the distortions caused by the UWA lens. Partial image restoration has the objective of giving a reasonable indication of the object and its relationships to the surroundings, not a strict full-image perspective correction. This solution will be particularly useful for video monitoring applications where a full view of the surveillance must be available for every image frame, yet it would be possible to transform objects of interest.
The embodiments described herein provide in one aspect, a method for transforming an input image captured through at least one ultra wide-angle (UWA) lens, said UWA lens characterized by a corresponding transformation that maps 3D object space onto a plane that displays the input image, said method comprising:
The embodiments described herein provide in another aspect an electronic processor for transforming an image captured through at least one ultra wide-angle (UWA) lens, said UWA lens characterized by a corresponding transformation that maps 3D object space onto a plane that displays the input image said processor comprising:
For a better understanding of the embodiments and/or related implementations described herein and to show more clearly how they may be carried into effect, reference will now be made, by way of example only, to the accompanying drawings which show at least one exemplary embodiment and/or related implementation in which:
It will be appreciated that for simplicity and clarity of illustration, elements shown in the figures have not necessarily been drawn to scale. For example, the dimensions of some of the elements may be exaggerated relative to other elements for clarity. Further, where considered appropriate, reference numerals may be repeated among the figures to indicate corresponding or analogous elements.
It will be appreciated that numerous specific details are set forth in order to provide a thorough understanding of the exemplary embodiments described herein.
However, it will be understood by those of ordinary skill in the art that the embodiments and/or implementations described herein may be practiced without these specific details. In other instances, well-known methods, procedures and components have not been described in detail so as not to obscure the embodiments and/or implementations described herein. Furthermore, this description is not to be considered as limiting the scope of the embodiments described herein, but rather to describe the structure and operation of the various embodiments and/or implementations described herein.
It is well understood that an image acquired through a UWA lens cannot be unfolded on a normal (usually rectangular) display in its entirety, while having it perfectly corrected for perspective distortions. As mentioned, the prior art solutions have to sacrifice portions of the image content, perhaps deemed as less important, in order to correct for the rest. In accordance with an embodiment of the present invention, an intermediate solution is provided where the entire UWA lens acquired image is unfolded via a transformation that provides a high degree of correction, yet maintains all image information. A localized and content adaptive correction can further be applied to the key regions of interest, if required, which aims to partially restore some of the perspective lost or severely distorted by the UWA lens. Such a full-view transformation, with possible locally adaptive perspective corrections, will be referred to as Zero-Content-Loss (ZCL) correction. Partial restoration aims to give a reasonable indication of objects and their relationships, and not a strict perspective correction where loss of content would be unavoidable. The restoration process can also be viewed as redistributing the original distortions to regions of less importance.
To describe the embodiments, mainly an exemplary 180° lens will be used as the UWA lens to describe the method. Although it should be understood that this approach is applicable generally to any UWA lens or a combination of lenses covering up to a 360° of field of view. In particular, the embodied methods are applicable to custom UWA lenses which do not necessarily image as a circle, but may rather image as an ellipse or oval, such as the Panomorph lens from ImmerVision.
Selecting the FOV-envelope and mapping of the output image to the surface of the FOV-envelope determine the amount and areas of the perspective transformations. Therefore if one requires local perspective adjustments to the transformations 180, one could re-select the envelope 130 and start over in order to modify the displayed output 170 accordingly. Alternatively, a different envelope transformation 140 can be applied with the same envelope (no need to alter 130) to tweak the output image. The local adjustments can be performed manually via an operator by selecting one or more areas of interest that need to be corrected, or it could be automated in response to changes in the captured image. For instance, a security camera might be equipped with a motion detector. Once a movement is detected, areas around the moving object (e.g. face of a person) can be selected as an area of interest for the local adjustments 180. Additional processing could also be applied to the selected areas, such as zooming in/out, rotation and reflection operations. This would assist image analytic techniques for post-processing, e.g. face recognition.
The imaging of a standard camera (SLR type), with a standard non-UWA lens (example 35 mm lens) can be described using the ideal pinhole camera model. Though real cameras are much more complicated, this simple model suffices for the description below. A point in the 3D real world space with coordinates (x,y,z) is mapped to a point in the image (u,v) given by:
where f is the lens focal length and the optical axis is assumed to be along the z-axis. Equation (1) is referred to as the lens mapping equation for a standard lens; it maps a real world point to a point in the captured image. The image can be viewed as placed at a distance f from the lens center along the optical axis. The standard lens transformation preserves relative distances and angles of objects that are in the same z-plane, i.e. a plane perpendicular to the z-axis at some z coordinate, hence perspective is maintained. The resulting image is rectangular. Furthermore, rectangular coordinates are a natural coordinate system for describing non-UWA lens imaging. In contrast, a UWA lens is best described using a spherical/polar coordinate system. Let be (r,θ,φ) the spherical coordinates of a point in the 3D real world space (its rectangular coordinates being (x,y,z)) as shown in
r′=FrL(θ,φ),φ′=FθL(θ,φ) (2)
where the superscript L refers to “Lens” and (r′,φ′) are the polar coordinates in the image related to rectangular coordinates by: x′=r′ cos φ′ and y′=r′ sin φ′. In terms of rectangular coordinates (2) can be written as:
It is understood that the inverse trig operations need to take into account quadrants and handle x=0. The functions are general with no dependence on the radial coordinate r; in particular they are also applicable to lenses which form elliptical or oval images. UWA lenses with no radial dependence will be referred to as conical-UWA lens. For special case of an ideal fisheye, these equations simplify to:
r′=αθ,φ′=φ, αa constant (4)
Equations (2) to (4) are referred to as the conical-UWA lens and fisheye lens mapping equations respectively. Imaging obeying Equation (2) or (4) does not maintain relative distances and angles on fixed planes, leading to much distorted images. For example, for Equation (4), all points on a cone at a fixed θ map onto a circle in the image plane. The resulting fisheye image is circular as opposed to rectangular.
The benefit of the UWA lens, offsetting the severe distortion, is its much larger FOV. A fisheye field-of-view is defined as twice the maximum angle θ that can be captured, which gives a cone from the lens center:
Fisheye FOV=2θmax (5)
For a 180° fisheye θmax=90, where the lens can capture a full hemispherical field of view. A standard lens, even with a wide angle variation is usually restricted to <=120° horizontally and much smaller vertically. We define a conical-UWA lens FOV by:
This definition implies that θmax may depend on φ, and the minimum is taken. For most practical cases, θmax is independent of φ and equation (5) can also be taken as the conical-UWA lens FOV.
Understanding the FOV is critical to implementing a zero-content-loss correction. To this extent we define a FOV-envelope for a lens, as mentioned in relation to coordinate system of
Conical-UWA FOV-envelope={r=1,0≤θ≤θmax,0≤φ<2π} (7)
Every point on the FOV-envelope, which corresponds to a real world ray, has its position in the captured image determined by the lens mapping equations. Given an envelope, an envelope mapping (or transformation) 140 can be defined, which combined with the lens mapping 120 results in a ZCL transformation 150. This is described in the following, referencing
Let (u,v) be the coordinates of the 2D input image data 110 captured by the UWA lens. Since image sensors (used in digital cameras, etc.) are rectangular, this will be a rectangular frame with pixel resolution (width×height) W0×H0, containing within it the oval or circular UWA image. Let (xf, yf) be the coordinates of the 2D image after the distortion correction, where the output image 170 is assumed to be a rectangular image of pixel resolution W1×H1. In most cases, the input and output resolutions are the same: W1×H1=W0×H0. However, the notation is kept general to allow for a change in resolution as well. An envelope mapping 140 is defined as a mapping (FxC,FyC,FzC) of the output coordinates (xf, yf) onto a basic (or any equivalent) FOV-envelope of the lens ((x,y,z) denote coordinates on the envelope):
x=FxC(xf,yf)
y=FyC(xf,yf)
z=FzC(xf,yf) (8)
An equivalent set of spherical coordinate equations can also be written:
r=FrC(xf,yf)
θ=FθC(xf,yf)
φ=FφC(xf,yf) (9)
The two sets related in the usual manner:
Maps (8) and (9) are also referred to as the envelope covering; hence the superscript “C” is used herein. Every 3D point on the FOV-envelope has one point in the output 2D image which maps onto it. Further, every point on the envelope will have a unique mapped point in (xf,yf). By definition of the FOV-envelope, this ensures that every point seen in the captured image will also be seen in the output image hence providing no loss of content. A zero-content-loss (ZCL) transformation 150 is obtained by concatenating the envelope mapping 140 with the lens mapping 120. This provides a mapping between the output and the input image containing all the original content, which is used to determine the pixel value at (xf,yf). The equations for a ZCL transformation take the form:
u=FuL(FxC(xf,yf),FyC(xf,yf),FzC(xf,yf))≡FuZCL(xf,yf)
v=FvL(FxC(xf,yf),FyC(xf,yf),FzC(xf,yf))≡FvZCL(xf,yf) (11)
Analogous equations can be written in spherical/polar coordinates.
The zero-content-loss transformation is designed to restore a certain level of perspective in areas of interest. Therefore it is also referred to as a zero-content-loss correction. As mentioned above, this is not strict perspective correction (which is theoretically impossible for the full view), but an approximation to provide a better viewing experience and an improved sense of spatial placement of objects. The above framework provides with two means of modifying the correction:
According to one embodiment, the ZCL correction in the present approach has the ability to adapt to the image content for further correction of local perspective and other transformations in a specific region. In what follows, a systematic approach to constructing local perspective adjustments will be described. A correction with this property will be referred to as being locally-perspective-adaptive (LPA). Preferably, the FOV-envelope and the envelope transform are built in a parameterized manner, so that they can be quickly modified by changing specific parameters. This can often be achieved by starting with an envelope and transformation, whose mathematical properties are simple to handle. Once a ZCL correction is obtained either in equation form or as a set of grid points, it can be applied to images or video signals at real-time rates. For example, a highly efficient image warping hardware as described in the U.S. Pat. No. 7,324,706 can be employed for achieving this.
To illustrate the embodied method, a 180° conical-UWA lens is described as an exemplary UWA lens, with the lens mapping simplified by restricting the conical-UWA lens to a fisheye lens. For a 180° conical-UWA lens, the basic FOV-envelope with θmax=90 is a half sphere, as shown in
Having identified a FOV-envelope, the next step is to construct the envelope transformation 140, which is the mapping of the output image onto the envelope. In an exemplary illustration within the framework of ZCL, a good starting point would be to consider standard projections onto the globe used by cartographers to build flat 2D maps. In particular, the most relevant is the cylindrical projection, whose mathematics is readily understandable. The polar divergence trouble of cylindrical transformations, as described above, makes cylindrical projections less useful for constructing ZCL transformations. Regardless of the limitations, it is informative to see the equations for a ZCL transformation built via cylindrical projection.
There exist various types of cylindrical projections. Lambert projection is one example, employed here without loss of generality. With reference to
where the range of (xf,yf) is [0,W1]×[0,H1], i.e. the output image, θy is the polar (zenith) angle relative to the +y axis and φxz is the azimuthal angle in the xz-plane measured from the +x axis, with clockwise towards the +z axis. The choice of using (θy,φxz) is dictated by the orientation of the cylinder relative to the envelope. This can be converted to the standard coordinates using (even though r=1 on the basic envelope its dependence via r=R is explicitly shown, which will be useful when modifying envelopes):
y=Rcos θy
Rxz≡√{square root over (R2−y2)}
x=Rxz cos φxz=R sin θy cos φxz
z=Rxz sin φxz=R sin θy sin φxz (13)
Converting to the standard spherical coordinates:
It should be understood that the arc-tangent needs to take into account quadrants and handle x=0. Equations (12), (13) and (14), all give the same envelope transform based on cylindrical projection. This transformation is illustrated in
where RI is the radius of the circular fisheye input image captured by the lens; the equation for the proportionality factor α is that of a circular fisheye.
Referring to
By modifying the envelop transform (12), within the framework of cylindrical projections, variations of the ZCL transformation can be obtained. However, the same “infinite stretching” limitation is encountered for all such cases. Two exemplary variations, which amount to modifying the second relation in (12), are given below:
Equation (15) continues to hold since it is based on the lens mapping equation and does not change. Combining (15) with the new substitutions (16), the zero-content-loss transformation can be computed. The first variation, which is labeled zenith linear, changes the scale in the y direction in comparison with the Lambert projection; the same infinite stretching is still seen at the poles. The second variation is actually a Mercator cylindrical projection, where sf is a scale factor that controls the vertical stretching. This becomes a zero-content-loss transformation only in the limit sf→∞, in which case there is infinite vertical stretching. Nevertheless, even for small (sf≥4), which may be “zero-content-loss” for most practical purposes, the distortion introduced is very severe.
Referring to
According to a preferred embodiment, a different envelope transform can be constructed to avoid the above artifacts resulted from cylindrical projections. This approach is specifically tailored towards ZCL transformation with perspective correction, using a parameterized envelope transform that can be modified to adjust the regions of, and amounts of, residual distortion. Furthermore, it includes a correction mechanism for recovering true local perspective. This technique supports any UWA lens, including non-circular conical-UWA lenses. Supporting arbitrary lenses in the context of ZCL correction is a direct consequence of the embodied method (as described in reference to
Accordingly, consider a conical-UWA lens with basic envelope that may cover greater than 180° FOV and is not limited to capturing circular images. An equivalent envelope is given by the partial surface of an ellipsoid shown in
The envelope 1100 is the part of the ellipsoid that lies above θmax. Here rx,y,z denote the semi-axes lengths. The ellipsoid generalizes a sphere and provides three parameters, the semi-axes lengths, for modifying the envelope transform. The lens is again assumed to be sitting at the origin looking towards the +z axis. rx and ry provide scaling controls in the x and y directions respectively, whereas rz gives control of scaling in the z direction corresponding to a magnification. The ellipsoid envelope 1100 of
Referring to
Similarly there are the 4 obvious boundaries of the output image (not shown), referred to as the image boundaries, or i-boundaries:
Top i-boundary:yf=0,0≤xf≤W1
Bottom i-boundary:yf=H1,0≤xf≤W1
Left i-boundary:xf=0,0≤yf≤H1
Right i-boundary:xf=W1,0≤yf≤H1 (19)
The i-boundaries are mapped to the corresponding e-boundaries (top to top, etc.) by finding a function that takes x or y to φ; the boundary mapping. For the top and bottom boundaries, any monotonic function of x can be used and for the left and right any monotonic function of y. To maintain symmetry about the horizontal and vertical center axes (which is common for most lenses); maps which are symmetric about the boundary centers are preferred, specifically the i-boundary centers are mapped to e-boundary centers. An exemplary boundary map can be given by the following linear functions:
Symmetry implies the following useful relations between the maps:
φT(xf)=−φB(xf)
φL(yf)=π−φR(yf) (21)
From the maps above, the x-y-z coordinates can be computed using:
x=rxyz sin θmax cos φ
y=rxyz sin θmax sin φ
z=rxyz cos θmax (22)
where rxyz is the distance from the center to the point on the e-boundary which, unlike the sphere, varies with the polar angles. It can be computed by plugging (22) into (17) to be:
Equation (22) gives the envelope transform for the boundary 1250.
More complicated boundary maps in place of (20) can be used to obtain different transformations. Another example is to select φ to depend on x or y, such that an equal change in x or y, corresponds to equal distance movements along the boundary curve. This requires computing the arc-length, and an example shall be provided below for the mapping of interior (non-boundary) points. Regardless of the map, equation (22) remains valid.
The next stage consists of extending the boundary map to all interior points, hence getting the full envelope transformation. The idea is to map image scan-lines (pixel rows) to the envelope by wrapping them around the ellipsoid, with their end-points lying on the left and right e-boundaries. Consider a pixel row in the output image:
Row at yf=yf0,0≤xf≤W1 (24)
The end-points of this row, (0, yf0) and (W1, yf0), have their mapping to the envelope already determined by the boundary mapping 1260. We label these end-points on the envelope boundary as follows:
Note that (21) implies:
xL=−xR
yL=yR
zL=zR (26)
From left-right symmetry about the y axis, the center of the pixel row (and for all rows) is mapped to the x=0 partial ellipse on the ellipsoid; which will give a function θ(yf). This map is mapping of the pixel column: xf=W1/2, and will be referred to as the polar map. Mapping the row centers to the x=0 curve is consistent with the boundary map, which maps the centers of the top and bottom rows to the points (θ=θmax,φ=π/2) and (θ=θmax,φ=−π/2) respectively on the x=0 curve. As for the boundary maps, an arbitrary monotonic map can be used. An example linear polar map is given by:
This map is continuous with the boundary maps described above, and maps the center of the image to the ellipsoid pole on the +z axis. On the y-z plane, it is convenient to define an azimuthal angle γ taken from the +y axis, with positive towards the +z axis. This angle is related to θ by:
The angle γ has a range of [π/2−θmax,π/2+θmax]. From (22) and (28), the coordinates on the envelope for the center vertical line (i.e. the pixel row centers) is given by:
x=0
y=ryz cos γ
z=ryz sin γ (29)
Note that this is the x=0 curve on the ellipsoid, hence x=0. The y-z plane distance from the center ryz is again not constant as for a sphere; instead it is computed from (17) to be a function of γ:
The points on the envelope corresponding to the row centers will be denoted as (xC,yC,zC).
The computations have thus provided three points on the envelope for every pixel row, corresponding to the left/right end-points and the center:
{right arrow over (x)}=(xL,yL,zL)
{right arrow over (x)}R=(xR,yR,zR)
{right arrow over (x)}C=(xC,yC,zC) (31)
The remaining row points can now be mapped by any well-behaved curve lying on the ellipsoid envelope interpolating these three points. We refer to these curves as the row mapping curves or the row mapping. A well-behaved means any curve that maintains the geometry of the pixel-row, in particular the curve should be mathematically smooth and the totality of all the curves (for all pixel rows) should form a smooth surface covering the ellipsoid. This is a general statement that provides extensive flexibility for varying the envelope transform.
Referring to
The points (31) lie on the ellipsoid as well as the intersecting plane. This leads to various constraints that simplify the construction. The normal to the plane, {right arrow over (n)}=(nx,ny,nz), can be computed from the points (31) to be:
Using (26) in (32) it can be shown that nx=0. A vector from {right arrow over (x)}C to any point (x,y,z) on the curve must be perpendicular to the normal since it lies on the intersecting plane:
(x−xC,y−yC,z−zC)·(0,ny,nz)=0 (33)
This leads to a relation between y and z on the curve:
The special case of nx=0 needs to be handled separately; however in this case the ellipse lies completely in the x-z plane and can be easily determined from the 2D ellipse equation and the points (31). Explicitly one finds for (34):
The equation of the ellipsoid (17) can be used in turn to determine y as a function of x. Substituting (35), equation (17) becomes:
Solving (36) gives:
The intersection ellipse is now completely determined by (35) and (37) in terms of x. The range of x is between the end-points of the ellipse:
xL≤x≤xR (38)
Recall that the end-points on the ellipse are mapped to the end-points of the pixel row.
It remains to map 0≤xf≤W1, the pixels along a given row in the output, to (38). Any monotonic smooth function can be used, such as a simple linear map:
Different functions lead to different local scaling along the x direction. In one exemplary embodiment, another approach is outlined based on traversing equal distances on the envelope, which is similar to the relation (12) for cylindrical projections, though the mathematics is complicated by the non-spherical envelope. On the intersecting plane 1420, the curves 1440 are standard ellipses. Therefore the curves can be analyzed by going to a rectangular two-dimensional coordinate system, {tilde over (x)},{tilde over (y)}, defined on this plane 1420. In addition, because there is no scaling involved in moving to the coordinate system, all distances are maintained. Define the {tilde over (y)}-axis to be going through the point {right arrow over (x)}C and perpendicular to the line segment through: {right arrow over (x)}L{right arrow over (x)}R, with positive direction determined by the sign of yC. This is the line that sits in the ellipsoid y-z plane. Note the line segment {right arrow over (x)}L{right arrow over (x)}R also lies in the intersection plane. The {tilde over (x)}-axis can be chosen either (a) to be such that the ellipse origin is at the center, or (b) along the {right arrow over (x)}L{right arrow over (x)}R line segment. The ellipse equation for these two cases takes the form:
The two differ only by a shift be. Additional points can be computed on the ellipse by using the fact that in the x-y-z system, these points must satisfy the ellipsoid equation. In particular, for case (b) we can compute the intercepts with the {tilde over (y)}-axis. One of the intercepts is associated with the point {right arrow over (x)}C, which will be denoted y1, and the other opposite intercept denoted y2. Also in case (b), the intercepts along the {tilde over (x)}-axis axis are found to be −xR (negative) and xR (positive). In terms of the intercepts the unknown quantities in (40) can be written as:
These quantities can be used to determine the arc-length of the ellipse (of the part on the ellipsoid). The arc-length sarc is more easily computed in the coordinate system (a), and takes the form:
Here
The negative sign and shift of the lower limit is required by the ordering of xf, namely as xf increases,
x={tilde over (r)}xy(
Note that the integrals (42) and (43) can be computed numerically using well known techniques, such as the quadrature method. Equations (35), (37) and (44), along with the maps (20) and (27), give the final envelope transform, mapping the output image to a general ellipsoid envelope.
The envelope boundary, defined by θmax, has been taken to be independent of φ. However, the method disclosed is equally applicable to having θmax(φ) depend on φ as in (6). The main difference being that the constant θmax needs to be replaced by θmax(φ), with φ first determined by the boundary maps. Another subtle issue for ellipsoid envelopes is that the envelope boundary may not lie in a plane, whereas the row mapping curves (ellipses) are planar. As the ellipses approach the top/bottom e-boundaries, they need to smoothly join to the boundary curves. This can be handled by generating the ellipses from a few rows below and above the top and bottom pixels rows respectively, which are then interpolated with the boundary curves over the remaining rows. This is not an issue for spherical envelopes, where the envelope boundary is planar.
The above method includes several parameters or means for customization:
Both cases use an ellipsoid envelope, with the second case being closer to a spherical envelope.
The zero-content-loss correction transformation is again built by combining the envelope mapping with the lens mapping. As mentioned before, computing the envelope transform requires no knowledge of the lens mapping. The only information required from the lens was its field of view characterization, or θmax, and that it be a conical-UWA lens. This implies that the same envelope transform can be used for a circular and a non-circular UWA lens; provided they have the same θmax value. In other words, the method is independent of the imaging behavior of the lens. This also means that distortions introduced during the manufacturing of UWA lenses or from other sources, causing non-ideal behavior for that lens, can be accounted for provided they are characterized in the lens mapping equation. For the lens mapping equation (4), the zero-content-loss correction becomes:
Only the main equations are listed in (45). Applying (45), with the 180° envelope transform (see
By varying the envelope parameters, variations of the correction can be made, including obtaining ZCL images which allow panoramic stitching, without “blowing up” the poles. The transformations obtained using cylindrical projection can also be recovered with an appropriate choice of parameters. Thus the zero-content-loss images of
Using lens mapping of (46) (instead of (4)) in (45) leads to the same image as in
According to an embodiment of the invention, the full view ZCL correction can be combined with local exact perspective corrections. As mentioned in regard to standard non-wide angle lens cameras, the basic envelope for a standard perspective camera is a plane encompassing the FOV, and perpendicular to the optical axis. The associated envelope transform is a linear mapping of the 2D output image coordinates to the 2D coordinates on the envelope plane. This suggests that to obtain local perspective correction for a UWA lens, along a specific direction specified by (θp1,φp1), the envelope locally needs to be a plane perpendicular to the axis through (θp1,φp1). To maintain the ZCL correction away from the (θp1,φp1) neighborhood, the envelope should remain an ellipsoid, with the transition being smooth. Accordingly, the method for local perspective correction consists of smoothly stitching the two envelopes (ellipsoid and plane) and their transformations together in the region of interest, and retaining the ZCL transformation outside the region of interest. Let {right arrow over (F)}e1 and {right arrow over (F)}e2 be two envelope transformations defined over (xf,yf), mapping (xf,yf) to (x,y,z). (For the local perspective correction formulation, we take the origin in (xf,yf) to be the center of the image, with positive to the right and the top.) The full (xf,yf) domain will be denoted AD. The domain is split into three disjoint sections: a region of interest AI, a transition region AT surrounding AI, and a base region AB containing all points not in AI or AT. The stitching builds a new envelope transform {right arrow over (F)}es, that is identical to {right arrow over (F)}e1 on AB, identical to {right arrow over (F)}e2 on AI, and interpolates between {right arrow over (F)}e1 and {right arrow over (F)}e2 on AT. This is described by the following equation:
where f1,f2 are appropriate interpolating functions, which we take as scalar functions. Note since the definition of the envelop transform includes a defining envelope, equation (47) implicitly defines the new stitched envelope. It is also useful to define the regions in the (x,y,z) space via a global transformation from (xf,yf) to (x,y,z). If {right arrow over (G)}(xf,yf) is such a globally defined map, the regions and stitching takes the form:
Here the bar distinguishes the two approaches. This stitching can be viewed as moving the pixels in the region of interest to recover perspective, which in turn twists and distorts the pixels in the transition region, and retaining the original ZCL correction in the base region. Thus we push the residual distortions out of the region of interest to the transition region. The function {right arrow over (F)}e1 becomes the ZCL envelope transform (45), which is re-written as:
The function {right arrow over (F)}e2 is taken as the perspective envelope transform, which is constructed below.
To construct the perspective envelope transform, we first take the +z-axis as the optical axis (θp1=0,φp1=0) for simplifying the construction. In this case the envelope would be the plane at z=zm1, which is orthogonal to the z-axis. By varying zm1, the amount of magnification can be changed, hence the z position is kept variable. Corresponding to the direction (θp1,φp1), we can associate the unique point in the output image coordinate space (xf,yf), denoted (xfp1,yfp1), to which that direction is mapped under the ZCL envelope transform (45). By definition of the envelop transform this point exists and can be computed from (45). The point (xfp1,yfp1) is the center of the region of interest AI, about which local perspective correction is required. Knowing (θp1,φp1), (xfp1,yfp1) can be computed and vice-versa. With these definitions, the perspective envelope transform about (θp1=0,φp1=0) can be written as:
The scale factors (sx,sy) are shown determined by the perspective field of view 2′pFOV, and the size lp, of the perspective corrected region in (xf,yf). The perspective envelope transform about an arbitrary direction (θp1,φp1)≠(0,0), can be computed by rotating (50). To bring the (0, 0) direction to an arbitrary (θp1,φp1) direction, a rotation of ±θp1 degrees about the negative y-axis (i.e. θp1 degrees counter-clockwise in the xz-plane, when viewed from the negative y-axis) followed by a rotation of
The angle φ0p1 can be adjusted to control amount of “twisting” in the transition region; normally a value of φ0p1≈−φp1 gives good results.
For UWA lenses with a conical FOV defined by (6), it is natural to use circular regions of interest about a given direction in the (x,y,z) space. The reason being that the (x,y,z) space gives the real world direction in which the lens is capturing, along which a perspective view is sought. These circular regions are basically the intersection of a cone of some angle with the perspective envelope plane. Using (48) and (50), the circular region of interest, and the transition and base regions, can be written as:
The rotation in (51) ensures that these regions are transformed to the correct circular regions in the required direction. The interpolation can now be done in r . As an example, a linear function combined with an adjustable exponent κe is used for the interpolation. The final zero-content-loss envelop transform, with local perspective correction about (θp1,φp1), or equivalently (,xfp1,yfy1), is:
The two component transforms being, {right arrow over (F)}e1 as in (45) and {right arrow over (F)}e2 as in (51).
The above examples have worked with small circular regions of interest. It is also possible to locally perspective correct large regions and use non-circular regions of interest. A useful non-circular geometry is defining rectangular regions on the output image, in which perspective correction is needed, as follows:
(xf,yf)ϵAD=AB∪AT∪AI
AI={(xf,yf) such that |xf−xfp1|≤lIx,|yf−yfp1|≤lIy}
AT={(xf,yf) such that lIx<|xf−xfp1|≤lTx,lIy<|yf−yfp1|≤lTy}
AB={(xf,yf) such that lTx<|xf−xfp1|,lTy<|yf−yfp1|} (54)
Here {lIx,lIy,lTx,lTy} define the sizes of the rectangular regions.
It should be noted that the above corrections being zero-content-loss, include all content from the original scene (
According to another embodiment, the disclosed method is used to facilitate optional follow-up video content analysis (VCA), also known as video analytics. In order to perform object and pattern recognition in an image, or generally to analyze a video automatically; good input images are required. That means all sorts of distortions must be minimized. For instance, a video monitoring camera can be equipped with one or more of motion sensors light/sound sensors, etc. that can trigger recording events for a period of time. In that case, the content of the recorded video will be needed to recognize the moving object such as the face of a person as illustrated in
In one preferred embodiment, the methods disclosed above are implemented in an electronic processor. The processor can be integrated to an image capture, image display or an image processing device. The disclosed methods can also be implemented as software for use in a computer-readable medium.
While the above description provides examples of the embodiments, it will be appreciated that some features and/or functions of the described embodiments are susceptible to modification without departing from the spirit and principles of operation of the described embodiments. Accordingly, what has been described above has been intended to be illustrative of the invention and non-limiting and it will be understood by persons skilled in the art that other variants and modifications may be made without departing from the scope of the invention as defined in the claims appended hereto.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2012/027189 | 3/1/2012 | WO | 00 | 9/13/2013 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2013/130082 | 9/6/2013 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6754379 | Xiong et al. | Jun 2004 | B2 |
20080150956 | Maeiwa et al. | Jun 2008 | A1 |
20110069160 | Ning | Mar 2011 | A1 |
20120098926 | Kweon | Apr 2012 | A1 |
Number | Date | Country |
---|---|---|
2011005024 | Jan 2011 | WO |
Entry |
---|
Supplementary European Search Report, completed Nov. 30, 2015, issued in corresponding EP Application No. 12870125.7, 8 pages. |
Robert Carroll et al., “Optimizing Content-Preserving Projections for Wide-Angle Images”, ACM Transactions on Graphics (Proceedings of SIGGraph 2009), 2009, pp. 43:1-43:9, 28(3). |
International Search Report for PCT/US2012/027189 dated May 30, 2012. |
Number | Date | Country | |
---|---|---|---|
20140368606 A1 | Dec 2014 | US |