The present disclosure relates generally to determining an optical flow associated with one or more images.
A camera or other image capture device can observe a three dimensional scene and project such scene onto a detector such that a series of two dimensional images are created over time. When the camera and the scene are in relative motion, the two dimensional images can change over time. The problem of tracking each point in space using such changing two dimensional images is generally known as computing the optical flow.
For example, optical flow can generally refer to a change in x-axis position and a change in y-axis position for each point within a pair of two dimensional images. An optical flow vector can describe such change in x and change in y in vector form and an optical flow field can aggregate the optical flow vectors for each point in the images. Such optical flow fields can be computed over a series of sequential images and prove useful in numerous applications, including real-time applications.
Existing methods for computing optical flow have various starting points and characteristics. For example, the Lucas-Kanade method and its derivative works use assumptions about images in grayscale. In particular, the Lucas-Kanade method assumes that optical flow is essentially constant in a local neighborhood of pixels and solves a basic optical flow equation for all pixels in the neighborhood together in a single computation. However, by assuming constant flow over a neighborhood of pixels, the Lucas-Kanade method fails to consider subpixel or single pixel changes in optical flow. Further, determining optical flow over a neighborhood of pixels using a single calculation can be computationally demanding and reduce opportunities for computational parallelizability, making such method undesirable for real-time applications. In addition, the use of grayscale images ignores color as a source of informational truth.
Other methods for computing optical flow, such as the Horn-Schunck method, include an assumption or global constraint with respect to smoothness. As a result, such methods attempt to minimize distortions in flow and prefer solutions which exhibit higher levels of smoothness. However, such assumptions and preferences with respect to smoothness inherently defeat the use of optical flow for applications such as edge detection or object segmentation.
Aspects and advantages of embodiments of the present disclosure will be set forth in part in the following description, or may be learned from the description, or may be learned through practice of the embodiments.
One example aspect of the present disclosure is directed to a computer-implemented method of determining an optical flow. The method includes obtaining, by one or more computing devices, a plurality of input color vectors respectively associated with a plurality of input pixels, each input pixel having a known position within an image. The method further includes obtaining, by the one or more computing devices, a color rate of change associated with each input pixel in the image. The method further includes determining, by the one or more computing devices, an optical flow estimate associated with a subject pixel within the image based at least in part on the plurality of input color vectors and the color rates of change associated with the input pixels in the image.
Other example aspects of the present disclosure are directed to systems, apparatus, tangible, non-transitory computer-readable media, user interfaces, memory devices, and electronic devices for determining an optical flow.
These and other features, aspects and advantages of various embodiments will become better understood with reference to the following description and appended claims. The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments of the present disclosure and, together with the description, serve to explain the related principles.
Detailed discussion of embodiments directed to one of ordinary skill in the art are set forth in the specification, which makes reference to the appended figures, in which:
Reference now will be made in detail to embodiments, one or more examples of which are illustrated in the drawings. Each example is provided by way of explanation of the embodiments, not limitation of the present disclosure. In fact, it will be apparent to those skilled in the art that various modifications and variations can be made to the embodiments without departing from the scope or spirit of the present disclosure. For instance, features illustrated or described as part of one embodiment can be used with another embodiment to yield a still further embodiment. Thus, it is intended that aspects of the present disclosure cover such modifications and variations.
Example aspects of the present disclosure are directed to determining an optical flow associated with one or more images. In particular, the optical flow can be determined for a subject pixel in an image based at least in part on a plurality of input color vectors and color rates of change respectively associated with a plurality of input pixels associated with the image. Input color vectors and color rates of change can be obtained for a plurality of input pixels in the image. An optical flow estimate can then be determined for the subject pixel. The optical flow estimate can describe an estimated displacement per unit time of the subject pixel in continuous time. In some implementations, the optical flow estimate can be a least squares estimate of the optical flow associated with the subject pixel.
In some implementations, the input pixels can represent a quadrant of four pixels of the subject image. The input pixels can each have a color represented by a color vector associated with the input pixel. The color vector can have a plurality of components that describe the color of the input pixel. For example, the color vector can have a red intensity component, a green intensity component, and a blue intensity component respectively describing the intensities of red, green, and blue which form the color of the input pixel. As another example, the color vector can have an ultraviolet component, an infrared component, and/or other suitable components. In this manner, the color vector can be associated with any suitable color space. In addition, a color rate-of-change vector associated with a pixel in the image can have a red intensity rate-of-change component, a green intensity rate-of-change component, and a blue intensity rate-of-change component respectively describing the intensity rate-of-change of red, green, and blue which form the color rate-of-change of the input pixel. Furthermore, the color rate-of-change vector can have an ultraviolet rate-of-change component, an infrared rate-of-change component, and/or other suitable components. In this manner, the color rate-of-change vector can be associated with any suitable color space. The color spaces for the intensity and intensity rate-of-change are typically the same.
The input pixels can reside at known locations within the subject image. As indicated, an image capture device in relative motion with the scene depicted in the subject image can capture one or more subsequent images of the scene, such that the colors of corresponding input pixels (e.g. input pixels having the same position) within the one or more subsequent sequential images can change over time. In this manner, a color rate of change associated with the respective colors of the input pixels (e.g. the input color rates of change) can be determined. For instance, in some implementations, the color rate of change can be determined by one or more sensors associated with the image capture device configured or constructed to directly monitor (e.g. using one or more color rate of change sensors) the color rates of change with respect to the sequence of images captured by the image capture device. In some implementations, the color rate of change can be determined based at least in part on one or more subsequent images relative to the subject image. For instance, the color rate of change can be determined based at least in part on a difference of the color vectors of the corresponding input pixels in one or more subsequent images and one or more respective times at which the one or more subsequent images were captured relative to the subject image.
A subject pixel can be selected within the subject image. For instance, the selected subject pixel can reside at a position (x,y) within the bounds of the plurality of input pixels. The optical flow estimate with respect to the subject pixel can be determined based at least in part on the color vectors of the input pixels and the color rates of change of the input pixels. In some implementations, bilinear interpolation can be performed to interpolate a color rate of change associated with the subject pixel based at least in part on the color rates of change associated with the input pixels.
In some implementations, the subject pixel can be selected to be the pixel in the center of the quadrant formed by the input pixels. For instance, the input pixels can have relative coordinates at (0, 0), (0, 1), (1, 0), and (1, 1). The subject pixel can be selected to have a relative coordinate position of (x=0.5, y=0.5). In this manner, bilinear interpolation performed with respect to the subject pixel can correspond to an average of the input pixel values. In such implementations, the color rate of change associated with the subject pixel (e.g. the subject color rate of change) can be determined by taking an average of the input color rates of change.
According to example aspects of the present disclosure, an optical flow equation used to determine an optical flow estimate associated with the subject pixel within the input quadrant according to example aspects of the present disclosure can be can be derived by equating a measured (e.g. interpolated) bilinear formula associated with the input pixels with an estimated bilinear formula associated with the input pixels. The measured bilinear formula can correspond to the interpolated subject color vector. Both sides of the equation can be differentiated, as will be described in more detail below. Such differentiation can result in three equations having two unknowns (e.g. the optical flow), which can be solved for, for instance, using a least squares fit. In particular, the least squares fit can provide an estimated optical flow (x′, y′) associated with the subject pixel. As indicated, the estimated optical flow can correspond to a displacement of the subject pixel with respect to time. In this manner, the estimated optical flow can correspond to an optical flow vector describing the displacement of the subject pixel. A least squares error associated with the optical flow can be determined by plugging the optical flow (x′, y′) into the original equation and subtracting the measured bilinear formula.
Executing the optical flow equation for each pixel within the subject image can result in a plurality of optical flow vectors that, when aggregated, form an optical flow field depicting total optical flow between the two images. Furthermore, repeating the above described method for each pixel within a plurality of sequential images can generate an optical flow field for each sequential image. The resulting plurality of optical flow fields can depict optical flow over a substantial length of time.
It will be appreciated that the example optical flow determination techniques in accordance with example aspects of the present disclosure can be implemented in a plurality of applications. For instance, such techniques can be implemented by a real-time tracking loop associated with an image capture device. The optical flow determined for images captured by the image capture device can be used to adjust a motion of the image capture device. As another example, outputs of the present disclosure (e.g. optical flow fields, etc.) can be used to construct depth maps of areas, for instance as used in aerial 3D reconstruction, satellite imagery, etc. Such optical flow can further be used in time-to-collision estimates associated with objects depicted in the subject images. Such optical flows can further be used in various stereoscopy applications, or other suitable applications.
With reference now to the figures, example aspects of the present disclosure will be discussed in greater detail. For instance,
The color vector generator 102 can be configured to measure the color contained within images, and to determine color vectors associated with a plurality of pixels in the images. For instance, the color vector generator 102 can be configured to access image data 108 to obtain one or more images. The image data 108 can be stored locally by the opto-mechanical system, and/or in one or more databases remote from the opto-mechanical system. For instance, the one or more databases can be stored in one or more server devices. In some implementations, the image data can be produced and processed in real time without the need for intervening storage. The one or more images accessed by the color vector generator can include a plurality of pixels that define the images. The color vector generator 102 can be configured to determine color vectors associated with at least a subset of the plurality of pixels in the one or more images. The color vectors can have a plurality of components that define the color of the pixels using any suitable color space, such as, for instance, RGB, CMY, HSV, YUV, RGB and infrared, RGB and ultraviolet, RGB and ultraviolet and infrared, or other suitable color spaces, including any combination of such spaces, whether trichromatic, tetrachromatic, or hyperspectral. In the particular case of the trichromatic color spaces RGB, CMY, HSV, and YUV, the transformation to a different color space will transform into a different coordinate system with the same number of dimensions, which may modify or improve the results of the optical flow computation slightly, but does not contribute any additional information to the algorithm. Conversely, adding an additional color, such as ultraviolet, to the input data, has been shown to improve the effectiveness of the algorithm.
For instance, color vector generator 102 can access an example image 120 depicted in
Referring back to
Upon a determination of the input color vectors 132-138 and the input color rates of change by the color vector generator 102 and the color rate of change determiner 104, respectively, the optical flow estimator 106 can be configured to determine an estimated optical flow associated with a subject pixel 130. For instance, the optical flow estimator 106 can receive the input color vectors 132-138 and the input rates of change as input. The optical flow estimator 106 can select a subject pixel 130. The subject pixel 130 can be any suitable pixel in the image 120 located within the quadrat formed by the input pixels 122-128. For instance, in some implementations, the subject pixel 130 can be the pixel having coordinates (xm+0.5, yn+0.5). In this manner, the subject pixel 130 can be the pixel directly in the middle of the quadrant formed by the input pixels 122-128. In some implementations, the optical flow estimator 106 can select a subject pixel location prior to a processing of the input color vectors and color rates of change associated with the input pixels.
The optical flow estimator 106 can be configured to determine the estimated optical flow associated with the subject pixel 130 based at least in part on the input color vectors 132-138 and the input color rates of change. For instance, the optical flow estimator 106 can be configured to execute an optical flow equation in accordance with example aspects of the present disclosure. In some implementations, such optical flow equation can be the following equation:
wherein the “^” term signifies a vector quantity. More particularly, as used herein, the “^” term is interchangeable with the “→” term to signify a vector quantity. The terms of the above optical flow equation can be defined as follows:
{right arrow over (C′xy)}={right arrow over (b′1)}+{right arrow over (b′2)}x+{right arrow over (b′3)}y+{right arrow over (b′4)}x y,
{right arrow over (b24y)}−({right arrow over (b2)}+{right arrow over (b4)}y), and
{right arrow over (b34x)}=({right arrow over (b3)}+{right arrow over (b4)}x)
wherein:
{right arrow over (b1)}={right arrow over (f00)}
{right arrow over (b2)}={right arrow over (f10)}−{right arrow over (f00)}
{right arrow over (b3)}={right arrow over (f01)}−{right arrow over (f00)}
{right arrow over (b4)}={right arrow over (f00)}−{right arrow over (f10)}−{right arrow over (f01)}+{right arrow over (f11)}
and wherein:
{right arrow over (b′1)}={right arrow over (f′00)}
{right arrow over (b′2)}={right arrow over (f′10)}−{right arrow over (f′00)}
{right arrow over (b′3)}={right arrow over (f′01)}−{right arrow over (f′00)}
{right arrow over (b′4)}={right arrow over (f′00)}−{right arrow over (f′10)}−{right arrow over (f′01)}+{right arrow over (f′11)}
In the above, {right arrow over (f00)}, {right arrow over (f01)}, {right arrow over (f10)}, and {right arrow over (f11)} represent the input color vectors of the input pixels associated with the subject pixel, and {right arrow over (f′00)}, {right arrow over (f′01)}, {right arrow over (f′10)}, and {right arrow over (f′11)} represent the color rates of change associated with the input pixels. The terms {right arrow over (f00)}, {right arrow over (f01)}, {right arrow over (f10)}, and {right arrow over (f11)} and {right arrow over (f′00)}, {right arrow over (f′01)}, {right arrow over (f′10)}, and {right arrow over (f′11)} are vector-valued quantities wherein the number of components of the vector is equal to the number of colors in the color space under consideration. For instance, when processing RGB data, the above vector terms will each have three quantities or components representing the red, green, and blue color intensities and corresponding color rates of change. As another example, when processing data associated with a color space using 4 colors, such as RGB+ultraviolet, the above vector terms will have 4 quantities or components.
The above optical flow equation can be derived according to example aspects of the present disclosure. As one example, referring particularly to
{right arrow over (b1)}+{right arrow over (b2)}x+{right arrow over (b3)}y+{right arrow over (b4)}x y
wherein {right arrow over (b1)} equals input color vector 132 (e.g., {right arrow over (f00)}); {right arrow over (b2)} equals input color vector 134 minus input color vector 132 (e.g., {right arrow over (f10)}−{right arrow over (f00)}); {right arrow over (b3)} equals input color vector 136 minus input color vector 132 (e.g., {right arrow over (f01)}−{right arrow over (f00)}); and {right arrow over (b4)} equals input color vector 132 minus input color vector 134 minus input color vector 136 plus input color vector 138 (e.g., {right arrow over (f00)}−{right arrow over (f10)}−{right arrow over (f01)}+{right arrow over (f11)}). Such bilinear formula outputs an estimated color vector for any suitable position (x,y) by bilinearly interpolating input color vectors 132, 134, 136, and 138. For instance, the bilinear formula can output an estimated color vector for the position (xm+0.5, yn+0.5) of the subject pixel 130.
According to example aspects of the present disclosure, the optical flow of the subject pixel 130 can be determined by setting the bilinear formula equal to itself as follows:
{right arrow over (b1)}+{right arrow over (b2)}x+{right arrow over (b3)}y+{right arrow over (b4)}xy={right arrow over (b1)}+{right arrow over (b2)}x+{right arrow over (b3)}y+{right arrow over (b4)}xy
wherein the left hand side of the equation corresponds to the measured (e.g. interpolated) output color of the subject pixel 130, and wherein the right hand side of the equation corresponds to a predicted or estimated optical flow associated with the subject pixel 130. Both sides of the equation can be differentiated with respect to time. In particular, the left hand side of the equation can be differentiated, allowing the colors to vary with time while holding x and y (e.g. the position of the subject pixel) constant. The right hand side of the equation can be differentiated, allowing x and y to vary, while holding the colors constant. For instance, the differentiated equation can take the following form:
{right arrow over (b′1)}+{right arrow over (b′2)}x+{right arrow over (b′3)}y+{right arrow over (b′4)}x y={right arrow over (b2)}x′+{right arrow over (b3)}y′+{right arrow over (b4)}x′y+{right arrow over (b4)}x y′
wherein {right arrow over (b′1)} is the rate of change of b1 with respect to time (dt), and x′ is the rate of change of x with respect to time (dt). In this manner, b′1, b′2, b′3, and b′4, and b1, b2, b3, and b4 are known (vector-valued) inputs, while x′ and y′ are (scalar-valued) unknowns representing the optical flow. For instance, b1, b2, b3, and b4 correspond to the input color vectors determined by the color vector generator 102, and b′1, b′2, b′3, and b′4 correspond to the input color rates of change determined by the color rate of change determiner 104. In addition, x and y represent the known position of the subject pixel 130. The optical flow can be determined, for instance, by solving for x′ and y′ in terms of the other quantities.
The above equation can be cast into matrix form to perform a least squares fit on the equation. For instance, the terms can be grouped around x′ and y′ as follows:
{right arrow over (b′1)}+{right arrow over (b′2)}x+{right arrow over (b′3)}y+{right arrow over (b′4)}x y=({right arrow over (b2)}+{right arrow over (b4)}y)x′+({right arrow over (b3)}+{right arrow over (b4)}x)y′
and the following terms can be defined:
{right arrow over (C′xy)}={right arrow over (b′1)}+{right arrow over (b′2)}x+{right arrow over (b′3)}y+{right arrow over (b′4)}x y,
{right arrow over (b24y)}=({right arrow over (b2)}+{right arrow over (b4)}y); and
{right arrow over (b34x)}=({right arrow over (b3)}+{right arrow over (b4)}x)
Substituting such terms into the above equation gives:
{right arrow over (C′xy)}={right arrow over (b24y)}x′+{right arrow over (b34x)}y′
The above equation can be expanded into a plurality of scalar equations. The number of scalar equations can be determined based at least in part on a color space associated with the color vectors 132-138. For instance, if an RGB color space is used, the above equation can be expanded into three scalar equations as follows:
C′xyR=b24yRx′+b34xRy′
C′xyG=b24yGx′+b34xGy′
C′xyB=b24yBx′+b34xBy′
wherein “R” is a subscript for red intensity, “G” is a subscript for green intensity, and “B” is a subscript for blue intensity. It will be appreciated that various other suitable color spaces can be used.
The scalar equations can then be cast into matrix form as follows:
As indicated, a least squares fit can be performed on the matrix equation to determine the optical flow. For instance, a standard least squares formula of
{circumflex over (β)}=(XTX)−1XTy=X+y
Can be used to solve for x′ and y′ as follows:
or in a more succinct vector form:
wherein the “^” term signifies a vector quantity.
According to example aspects of the present disclosure, the optical flow estimator 106 can execute such equation for x′ and y′ to determine the optical flow estimate for the subject pixel 130. It will be appreciated that such equation can be solved using standard matrix operations. The optical flow (x′, y′) can be linearly proportional to the rate of change of the colors, which can facilitate an accommodation of an optical flow determination at any speed. In this manner, the optical flow determination can be determined independent of the frame rate of the sequential images captured by the opto-mechanical system. It will be appreciated that such optical flow determination technique can be extended for use on color vectors having more than three colors (e.g. 4 colors, 5 colors, etc.).
The least squares error can then be determined based at least in part on the optical flow (x′, y′) and the original color output {right arrow over (C′xy)}. For instance, the least squares error can be determined by plugging [x′, y′] back into the original formula, and subtracting {right arrow over (C′xy)}. It will be appreciated that the determined error may have an error, but the error determined by such single iteration can be sufficient in determining the optical flow. In some implementations, if the determined error is greater than an error threshold, one or more additional iterations can be performed.
In some implementations, the optical flow of a subject pixel can be determined based at least in part on more than four input pixels. For instance,
At (202), method (200) can include selecting a subject pixel location in a subject image for which an optical flow is to be determined. For instance, the subject image can include a plurality of pixels that define the image. The subject pixel location can be a location associated with any suitable pixel within the subject image.
At (204), method (200) can include obtaining a plurality of input color vectors respectively associated with a plurality of input pixels in a subject image. The input pixels can be determined such that an optical flow of the subject pixel can be determined based at least in part on the color vectors of the input pixels. In this manner, the subject pixel can be located proximate the input pixels (e.g. located within a grouping of the input pixels), such that bilinear interpolation can be performed to determine a color vector describing the color of the subject pixel based on the input color vectors. In some implementations, the input pixels can form one or more quadrants. For instance, four input pixels can be evenly spaced relative to each other and can form a square quadrant. In this manner, the quadrant can be formed around the subject pixel. In some implementations, the subject pixel can be the pixel located equidistant from the input pixels. For instance, the input pixels can be located at coordinates of (0, 0), (0, 1), (1, 0), and (1, 1). In such instance, the selected subject pixel can be the pixel located at (0.5, 0.5). In this manner, the bilinear interpolation performed to determine the color rate of change vector of the subject pixel can correspond to an average of the input color rate of change vectors. Such selection of the subject pixel can facilitate a simpler and more efficient determination of the interpolated color rate of change vector of the subject pixel. The average color at (0.5, 0.5) may not be explicitly computed as part of the algorithm execution, although it may be implicitly present as part of the derivation.
The input color vectors can respectively describe a color of the input pixels based on any suitable color space. For instance, the input color vectors can describe the colors using RGB, CMY, HSV, YUV, RGB and infrared, RGB and ultraviolet, RGB and ultraviolet and infrared, or other suitable color spaces, including any combination of such spaces, whether trichromatic, tetrachromatic, or hyperspectral.
At (206), method (200) can include obtaining an input color rate of change associated with each input pixel. The color rates of change can describe how the colors of the input pixels change with respect to time. For instance, the subject image can be captured by an image capture device in relative motion with a scene. In this manner, the subject image can depict the scene. One or more subsequent images captured by the image capture device at subsequent times relative to the subject image while the image capture device was in relative motion with the scene can depict the scene at the subsequent times. The color rates of change of input pixels can describe a manner in which the colors of the input pixels change with respect to the one or more subsequent images relative to the subject image, or can be measured directly by the imaging device. For instance, the color rates of change can be directly determined by one or more sensors associated with the image capture device. In some implementations, the color rates of change can be determined based at least in part on the subsequent images captured by the image capture device. For instance, the color rates of change can be determined based at least in part on the color vectors of the pixels of the subsequent image(s) respectively corresponding to the input pixels of the subject image, and a time at which the subsequent images were captured.
In some implementations, the steps (204) and (206) can be performed substantially simultaneously. In this manner, the method (200) can include obtaining the plurality of input color vectors and the plurality of input color rate of change vectors respectively associated with the input color vectors in a single step. Further, as indicated above, various other suitable method steps described herein can be adapted, rearranged, expanded, omitted, or modified in various ways without deviating from the scope of the present disclosure.
At (208), the method (200) can include determining an estimated optical flow associated with the subject pixel based at least in part on the input color vectors and the input color rates of change. The optical flow estimate can be determined by executing an optical flow equation derived according to example aspects of the present disclosure. For instance, the optical flow estimate for the subject pixel can be determined by executing the following formula:
wherein the “^” term signifies a vector quantity. The quantities in the above equation are defined above. In this manner, one or more processors can be configured to perform one or more computations to execute such equation with respect to the subject pixel. As will be appreciated by those skilled in the art, such equation can be executed using standard matrix-based computations.
As will be described in greater detail with regard to the method (300) depicted in
At (302), method (300) can include accessing an optical flow equation having a measured bilinear formula on a first side of the equation and a predicted bilinear formula on a second side of the equation. For instance, the optical flow equation can correspond to the following:
{right arrow over (b1)}+{right arrow over (b2)}x+{right arrow over (b3)}y+{right arrow over (b4)}x y={right arrow over (b1)}+{right arrow over (b2)}x+{right arrow over (b3)}y+{right arrow over (b4)}x y
wherein {right arrow over (b1)} equals a first input color vector, {right arrow over (b2)} equals a second input color vector minus the first input color vector, {right arrow over (b3)} equals a third input color vector minus the first input color vector 132, and {right arrow over (b4)} equals the first input color vector minus the second input color vector minus the third input color vector plus a fourth input color vector (e.g. as described above with regard to
At (304), method (300) can include evaluating the optical flow equation to provide a plurality of scalar equations. The scalar equations can be associated with a subject color space. For instance, evaluating the optical flow equation can include determining or a differentiated optical flow as follows:
{right arrow over (b′1)}+{right arrow over (b′2)}x+{right arrow over (b′3)}y+{right arrow over (b′4)}x y={right arrow over (b2)}x′+{right arrow over (b3)}y′+{right arrow over (b4)}x′y+{right arrow over (b4)}x y′
wherein {right arrow over (b′1)} is the rate of change of b1 with respect to time (dt), and x′ is the rate of change of x with respect to time (dt). In this manner, b′1, b′2, b′3, and b′4, and b1, b2, b3, and b4 are known (vector-valued) inputs derived according to example aspects of the present disclosure, while x′ and y′ are (scalar-valued) unknowns representing the optical flow, and are outputs of the algorithm.
Such differentiated equation can yield the following:
{right arrow over (C′xy)}={right arrow over (b24y)}x′+{right arrow over (b34x)}y′
wherein:
{right arrow over (C′xy)}={right arrow over (b′1)}+{right arrow over (b′2)}x+{right arrow over (b′3)}y+{right arrow over (b′4)}x y,
{right arrow over (b24y)}=({right arrow over (b2)}+{right arrow over (b4)}y), and
{right arrow over (b34x)}=({right arrow over (b3)}+{right arrow over (b4)}x)
The differentiated equation can then be expanded into the following scalar equations:
C′xyR=b24yRx′+b34xRy′
C′xyG=b24yGx′+b34xGy′
C′xyB=b24yBx′+b34xBy′
At (306), method (300) can include performing a least squares fit on the plurality of scalar equations to determine an estimated optical flow associated with the subject pixel. For instance, the scalar equations can be cast into matrix form as follows:
A least squares fit can be performed on the matrix equation, which can yield the following:
or in a more succinct vector form:
wherein the “^” term signifies a vector quantity, and the formula generalizes to any number of colors.
Such equation can be solved for x′ and y′ to determine the optical flow. Such optical flow determination technique is intended for illustrative purposes. It will be appreciated that various suitable techniques taking into account the input color vectors and the input color rates of change of the input pixels with respect to the subject pixel can be used to determine the optical flow of the subject pixel.
At (308), method (300) can include determining a least squares error associated with the estimated optical flow. For instance, in some implementations, determining the least squares error can include plugging x′ and y′ back into {right arrow over (C′xy)}={right arrow over (b24y)}x′+{right arrow over (b34x)}y′. Such error can then be converted from the color rate of change space to an optical flow space. It will be appreciated that the least squares error can be determined using any suitable technique.
At (402), method (400) can include obtaining one or more images depicting a scene captured by an image capture device in relative motion with the scene. For instance, a data collection device, such as an image capture device can capture a plurality of sequential images based upon visual imagery associated with the scene collected by a sensor of the image capture device. In particular, the images can be captured such that they depict the scene at different times. Such one or more obtained images can be one or more images from the plurality of sequential images. The plurality of sequential images can be stored locally on one or more memory devices associated with the image capture device, and/or in one or more remote databases separate from the image capture device.
At (404), method (400) can include obtaining, for each image, a plurality of color vectors respectively associated with a plurality of pixels associated with the image. Each image can include a plurality of pixels defining the visual imagery depicted in the image. Color vectors for at least a subset of the plurality of pixels in each image can be obtained. As indicated, the color vectors can describe the color of the pixel in any suitable color space.
At (406), method (400) can include obtaining, for each image, a plurality of color rates of change respectively associated with the plurality of pixels of the image. The color rates of change can describe how the colors of the pixels change over time, and can be determined based at least in part on the relative motion between the image capture device and the scene. In some implementations, the color rates of change can be determined using one or more sensors associated with the image capture device.
At (408), method (400) can include determining, for each image, a plurality of optical flow estimates respectively specifying a displacement of the plurality of pixels over time. For instance, the optical flow estimates can take the form of optical flow vectors associated with each pixel, and can be determined in accordance with example aspects of the present disclosure.
At (410), method (400) can include aggregating, for each image, the plurality of optical flow estimates associated with the image to determine an optical flow field for the image. The optical flow field can describe an aggregate optical flow of the image. The optical flow field can include optical flow vectors associated with each pixel of the image and, therefore, depict total optical flow associated with the image.
The system 500 includes a data capture device 510, The data capture device 510 can be, or can be associated with, any suitable computing device, such as a general purpose computer, special purpose computer, laptop, desktop, mobile device, navigation system, smartphone, tablet, wearable computing device, a display with one or more processors, or other suitable computing device. In some implementations, the data capture device 510 can be an opto-mechanical device, and can include an image capture device configured to sense and collect visual imagery information, including color information. The data capture device 510 can have one or more processors 512 and one or more memory devices 514. The data capture device 510 can also include a network interface used to communicate with one or more remote devices 530 over the network 540. The network interface can include any suitable components for interfacing with one more networks, including for example, transmitters, receivers, ports, controllers, antennas, or other suitable components.
The one or more processors 512 can include any suitable processing device, such as a microprocessor, microcontroller, integrated circuit, logic device, graphics processing units (GPUs) dedicated to efficiently rendering images or performing other specialized calculations, and/or other suitable processing device. The one or more memory devices 514 can include one or more computer-readable media, including, but not limited to, non-transitory computer-readable media, RAM, ROM, hard drives, flash drives, or other memory devices. The one or more memory devices 514 can store information accessible by the one or more processors 512, including computer-readable instructions 516 that can be executed by the one or more processors 512. The instructions 516 can be any set of instructions that when executed by the one or more processors 512, cause the one or more processors 512 to perform operations. For instance, the instructions 516 can be executed by the one or more processors 512 to implement the color vector generator 102, the color rate of change determiner 104 and the optical flow estimator 106 described with reference to
As shown in
The data capture device 510 can include various input/output devices for providing and receiving information from a user, such as a touch screen, touch pad, data entry keys, speakers, and/or a microphone suitable for voice recognition. For instance, the remote device 530 can have a display device for presenting a user interface.
The data capture device 510 can exchange data with one or more remote devices 530 over the network 540. For instance, a remote device can be a server, such as a web server. Although two remote devices 530 are illustrated in
Similar to the data capture device 510, a remote device 530 can include one or more processor(s) 532 and a memory 534. The one or more processor(s) 532 can include one or more central processing units (CPUs), and/or other processing devices. The memory 534 can include one or more computer-readable media and can store information accessible by the one or more processors 532, including instructions 536 that can be executed by the one or more processors 532 and data 538.
In some implementations, one or more of the color vector generator 102, the color rate of change determiner 104, and the optical flow estimator 106 can be implemented within the remote device 530. In this manner, the functionality associated with the one or more of the color vector generator 102, the color rate of change determiner 104, and the optical flow estimator 106 can be performed by the remote device 530. For instance, the data capture device 510 can communicate with the remote device 530 to implement example aspects of the present disclosure.
The remote device 530 can also include a network interface used to communicate with one or more remote computing devices (e.g. data capture device 510) over the network 540. The network interface can include any suitable components for interfacing with one more networks, including for example, transmitters, receivers, ports, controllers, antennas, or other suitable components.
The network 540 can be any type of communications network, such as a local area network (e.g. intranet), wide area network (e.g. Internet), cellular network, or some combination thereof. The network 540 can also include a direct connection between a remote device 530 and the data capture device 510. In general, communication between the data capture device 510 and a remote device 530 can be carried via network interface using any type of wired and/or wireless connection, using a variety of communication protocols (e.g. TCP/IP, HTTP, SMTP, FTP), encodings or formats (e.g. HTML, XML), and/or protection schemes (e.g. VPN, secure HTTP, SSL).
The technology discussed herein makes reference to servers, databases, software applications, and other computer-based systems, as well as actions taken and information sent to and from such systems. One of ordinary skill in the art will recognize that the inherent flexibility of computer-based systems allows for a great variety of possible configurations, combinations, and divisions of tasks and functionality between and among components. For instance, server processes discussed herein may be implemented using a single server or multiple servers working in combination. Databases and applications may be implemented on a single system or distributed across multiple systems. Distributed components may operate sequentially or in parallel.
While the present subject matter has been described in detail with respect to specific example embodiments thereof, it will be appreciated that those skilled in the art, upon attaining an understanding of the foregoing may readily produce alterations to, variations of, and equivalents to such embodiments. Accordingly, the scope of the present disclosure is by way of example rather than by way of limitation, and the subject disclosure does not preclude inclusion of such modifications, variations and/or additions to the present subject matter as would be readily apparent to one of ordinary skill in the art.
Number | Name | Date | Kind |
---|---|---|---|
5987164 | Szeliski et al. | Nov 1999 | A |
6018349 | Szeliski et al. | Jan 2000 | A |
6044181 | Szeliski et al. | Mar 2000 | A |
6148056 | Lin et al. | Nov 2000 | A |
6157677 | Martens et al. | Dec 2000 | A |
6795578 | Kotani | Sep 2004 | B1 |
6917721 | Elad et al. | Jul 2005 | B2 |
7116831 | Mukerjee et al. | Oct 2006 | B2 |
7317839 | Holcomb | Jan 2008 | B2 |
7352905 | Mukerjee et al. | Apr 2008 | B2 |
7620109 | Srinivasan | Nov 2009 | B2 |
7715589 | Freeman et al. | May 2010 | B2 |
7852936 | Mukerjee et al. | Dec 2010 | B2 |
8064520 | Mukerjee et al. | Nov 2011 | B2 |
9008363 | Klier | Apr 2015 | B1 |
9819841 | Denisenko | Nov 2017 | B1 |
20040091170 | Cornog | May 2004 | A1 |
20050041842 | Frakes et al. | Feb 2005 | A1 |
20070122058 | Kitaura | May 2007 | A1 |
20080232643 | Leichter et al. | Sep 2008 | A1 |
20090079876 | Takeshima et al. | Mar 2009 | A1 |
20090310681 | Gaude | Dec 2009 | A1 |
20100124361 | Gaddy | May 2010 | A1 |
20110050864 | Bond | Mar 2011 | A1 |
20110063473 | Tsunekawa et al. | Mar 2011 | A1 |
20110206236 | Center, Jr. | Aug 2011 | A1 |
20120050074 | Bechtel et al. | Mar 2012 | A1 |
20120148110 | Chen | Jun 2012 | A1 |
20130176445 | Streeter | Jul 2013 | A1 |
20160104268 | Levin | Apr 2016 | A1 |
Entry |
---|
Aires et al., “Optical Flow Using Color Information: Preliminary Results”. Proceedings of the 2008 ACM Symposium on Applied Computing, Mar. 16-20, 2008, Fortaleza, Ceara, Brazil, pp. 1607-1611. |
Andrews et al., “Color Optical Flow”. Proceedings of APRS Workshop on Digital Image Computing, Brisbane, Australia, vol. 1, Feb. 2003, pp. 135-139. |
Barron et al., “Quantitative Color Optical Flow”, Proceedings of the 16th International Conference on Pattern Recognition, Quebec, Canada, vol. 4, Aug. 11-15, 2002, pp. 251-255. |
Golland et al., “Motion from Color”, Computer Vision and Image Understanding, vol. 68, No. 3, Dec. 1997, pp. 346-362. |
Li et al., “Learning for Optical Flow Using Stochastic Optimization”, Proceedings of the 10th European Conference on Computer Vision: Part II, Marseille, France, Oct. 12-18, 2008, pp. 379-391. |
Mathews et al., “Module for Fixed Point Iteration and Newton's Method in 2D and 3D”, http://math.fullerton.edu/mathews/n2003/FixPointNewtonMod.html, 9 pages. |
Schoueri, “Optical Flow from Motion Blurred Color Images”, Canadian Conference on Computer and Robot Vision, May 25-27, 2009, pp. 1-7. |
Number | Date | Country | |
---|---|---|---|
20180158194 A1 | Jun 2018 | US |