Learning from high quality depth measurements

Information

  • Patent Grant
  • 9470778
  • Patent Number
    9,470,778
  • Date Filed
    Tuesday, March 29, 2011
    14 years ago
  • Date Issued
    Tuesday, October 18, 2016
    8 years ago
Abstract
A depth camera computing device is provided, including a depth camera and a data-holding subsystem holding instructions executable by a logic subsystem. The instructions are configured to receive a raw image from the depth camera, convert the raw image into a processed image according to a weighting function, and output the processed image. The weighing function is configured to vary test light intensity information generated by the depth camera from a native image collected by the depth camera from a calibration scene toward calibration light intensity information of a reference image collected by a high-precision test source from the calibration scene.
Description
BACKGROUND

Measuring distances with a depth camera can be difficult. Measurement accuracy and precision may be influenced by camera hardware, such as illumination light and photodetector sensitivity, the effectiveness of which may be convoluted with environmental factors, such as contrast ratio and ambient light levels.


SUMMARY

Various embodiments of methods and hardware for improving the accuracy of depth measurements derived from images collected by a depth camera are disclosed. In one example, raw images collected by a depth camera are converted to processed images by applying a weighting function to raw image data. In this example, the weighting function is generated from an image of a calibration scene collected by the depth camera and an image of the calibration scene collected by a high-precision test source.


This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. Furthermore, the claimed subject matter is not limited to implementations that solve any or all disadvantages noted in any part of this disclosure.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows an embodiment of a depth camera assembly system for assembling, calibrating, and packaging a depth camera.



FIG. 2 shows an embodiment of a method for calibrating a depth camera with a high-resolution test source and for outputting a processed image from a depth camera so-calibrated.



FIG. 3 schematically shows an embodiment of a calibration station for generating a reference image from a calibration scene using a high-resolution test source.



FIG. 4 schematically shows, at a later time, the embodiment of the calibration station depicted in FIG. 3 for generating a native image from the calibration scene using a depth camera.



FIG. 5 schematically shows another embodiment of a calibration station for calibrating a depth camera.



FIG. 6 schematically shows another embodiment of a calibration station for calibrating a depth camera.





DETAILED DESCRIPTION

The entertainment value provided by an interactive media experience may be amplified by immersing the user in the media experience. For example, a user's enjoyment of a video game may be intensified when a user controls the gaming experience using natural, physical gestures. As a more specific example, a user may block an incoming attack using a sudden parry gesture with the user's arm instead of a button press to a handheld game controller. Physical gestures can be detected with a depth camera oriented to monitor the user's body position. However, because some gestures may be small by nature, it can be difficult to provide a depth camera that detects such gestures with a precision and accuracy that is compatible with a user-friendly experience without including complicated illumination and imaging systems in the camera. Systems of such complexity may be delicate, making them hard to use in a typical home environment.


Accordingly, various embodiments are described herein that improve the accuracy of depth measurements made by a depth camera. In one example, raw images are processed by the application of a weighting function. In this example, the weighting function is generated in a calibration step in a depth camera manufacturing process. As used herein, calibration may have different meanings according to various embodiments. For example, in some embodiments, calibration may refer to tuning adjustments made to the image processing modules of depth cameras during assembly. Thus, such calibrations may be repeated as each camera is manufactured and tuned prior to shipment.


Additionally or alternatively, in some embodiments, calibration may refer to adjustments made to an image processing module during an image processing module development phase. For example, the calibration method described herein may be performed during testing and manufacturing development of a next-generation image processing module for a next-generation depth camera. It will be appreciated that, in some embodiments, the scale and scope of such calibrations may be comparatively greater than the calibrations performed at the camera level during volume manufacturing and may reflect development objectives for such next-generation depth cameras. In such embodiments, the calibration, once performed, may be incorporated into production depth cameras without additional calibration during a subsequent volume manufacturing phase.


In that step, images of a calibration scene are collected by the depth camera and by a high-precision test source. Differences between the images are used to develop the weighting function. In some examples, a plurality of weighting functions may be developed for application in particular use conditions. Because the calibration step may be performed under controlled conditions, the high-precision test source may achieve levels of accuracy and precision that would be incompatible with a consumer-friendly depth camera. Further, because a single set of reference images may be used to calibrate a virtually unlimited number of depth cameras during high-volume manufacturing, the comparatively higher cost of the high-precision test source (relative to the depth camera) may be distributed over many depth cameras, potentially reducing the unit cost of the depth camera compared to depth cameras including more complicated and expensive illumination and imaging subsystems.



FIG. 1 shows an embodiment of a depth camera assembly system 100 for assembling, calibrating, and packaging a depth camera. As explained above, depth camera assembly system 100 may be used to assemble, calibrate, and package depth cameras during test and development of a depth camera and/or during volume manufacturing of a depth camera. As shown in FIG. 1, depth camera assembly system 100 includes a depth camera assembly station 102 for assembling a depth camera 110. For example, an illumination subsystem and an imaging subsystem may be installed in depth camera 110 at depth camera assembly station 102.


At depth camera calibration station 104, depth camera 110 collects, in a calibration mode, one or more native images 132 from one or more calibration scenes 120. Native image 132 includes test light intensity information that describes the light intensity reflected from various portions and locations in calibration scene 120. The test light intensity information is useable to derive a native depth map for calibration scene 120, the native depth map describing, for a particular pixel of a photosensor included in the imaging system of depth camera 110, a respective physical location in the three-dimensional space of calibration scene 120 from which light incident at the particular pixel was reflected. Thus, the native depth map describes a constellation of physical points corresponding to various reflective surfaces included in calibration scene 120.


As will be described in more detail below, the native depth map for a particular calibration scene is used, in combination with a reference depth map derived from a reference image 130 collected from the particular calibration scene by a high-precision test source, to tune processing of a raw image collected by depth camera 110 in a later-performed operation mode (for example, when a depth camera purchaser collects images with depth camera 110). In this way, a processed image output by depth camera 110 during the operation mode may have, in some circumstances, image characteristics that are comparatively similar to those exhibited by the reference image.


Once depth camera 110 is calibrated at depth camera calibration station 104, depth camera 110 is passed to depth camera packaging station 106, where depth camera 110 is packaged. Packaging may include preparation for shipment to consumers (e.g., in a volume manufacturing scenario) and/or preparation for product testing (e.g., in a next-generation depth camera test/development scenario), concluding the manufacturing process. First use of depth camera 110 by a user (such as a consumer or a product tester) subsequent to manufacturing marks the start of the depth camera operation mode described above. However, it will be appreciated that, in some embodiments, depth camera calibration may be performed at any suitable time, such as in a scenario where depth camera 110 is serviced, inspected, and/or refurbished.



FIGS. 2A and 2B show a flow chart illustrating an embodiment of a method 200 for calibrating a depth camera. In the embodiment shown in FIGS. 2A and 2B, portions of method 200 performed during a calibration mode are performed at an image processing module of a computing device compatible with a depth camera, such as a depth camera calibration computing device operatively connected with the depth camera during calibration, while portions of method 200 performed during an operation mode are performed at an image processing module of the depth camera. However, it will be appreciated that, in some embodiments, method 200 may be performed at a depth camera alone, and that, in some other embodiments, portions of method 200 performed during the calibration mode may be performed by the depth camera calibration computing device and the depth camera.


Turning to FIG. 2A, at 202, method 200 includes, during a calibration mode for a depth camera, receiving a reference image from a high-precision test source, and at 204, deriving a reference depth map from calibration light intensity information included in the reference image. As introduced above, a reference image is collected for a particular calibration scene using a test source capable of generating depth information for the calibration scene with a high degree of accuracy and/or precision within an acceptable tolerance. In some embodiments, the high-precision test source may be able to generate depth information for a particular calibration scene that is reliably more accurate than depth information generated by the depth camera.


For simplicity, the embodiments described herein relate to a high-precision test source including a high-resolution time-of-flight depth camera system. In such systems, calibration light is emitted by the depth camera system at one time. The calibration light is reflected from a surface of an object being imaged and captured at a photosensor of the depth camera system at later time. Depth information from the reflected calibration light is derived from the difference between the emission and capture times. However, it will be understood that any suitable test source may be employed without departing from the scope of the present disclosure. Non-limiting examples high-precision test sources include high-resolution time-of-flight depth camera systems, three-dimensional triangulation systems (such as 3-D laser scanners), and laser light detection and ranging (LIDAR) systems.



FIG. 3 schematically shows an embodiment of depth camera calibration station 104 at a time T0. As shown in FIG. 3, high-precision test source 300, located at an image collection position 304 (shown as position (X0, Y0, Z0)), collects a reference image 130 of a motorcycle located at a scene position 302 (shown as position (X1, Y1, Z1)), scene position 302 and image collection position 304 separated by a suitable distance. As explained above, calibration light intensity data included in reference image 130 is used to derive a reference depth map 330. Reference depth map 330 describes, for a particular pixel of a photosensor included in high-precision test source 300, a respective physical location in the three-dimensional space of calibration scene 120 from which calibration light incident at the particular pixel was reflected. For example, the embodiment shown in FIG. 3 depicts reference image distance information 306 (shown as “DR”) derived from time-of-flight light reflectivity information for a particular point on the front fairing of the motorcycle.


In some embodiments, high-precision test source 300 may collect a plurality of reference images 130 from a plurality of calibration scenes 120. The plurality of calibration scenes 120 may include scenes generated by varying lighting conditions for a particular object (e.g., where illumination source temperature, intensity, and/or position are varied), by varying object distance and/or size, and by varying object surface reflectivity (e.g., surfaces causing specular or diffuse reflections). The plurality of calibration scenes 120 may be selected in consideration of how the depth camera may be used. For example, if a depth camera is configured to be used with a game console, conditions for a plurality of calibration scenes 120 may be selected to resemble conditions for rooms configured for leisure and social activities in a user's home.


Turning back to FIG. 2A, at 206, method 200 includes receiving a native image from the depth camera being calibrated and, at 208, deriving a native depth map from test light intensity information included in the reference image. For simplicity, the depth camera described herein is a time-of-flight depth camera configured to collect a native image by emitting test light and capturing reflected test light, such as test light reflected from a surface of an object included in a calibration scene. However, it will be appreciated that any suitable depth camera may be employed without departing from the scope of the present disclosure. One non-limiting example includes a three-dimensional scanner configured to emit structured light (such as infrared light).



FIG. 4 schematically shows an embodiment of the depth camera calibration station 104 shown in FIG. 3. In FIG. 4, depth camera calibration station is shown at a later time T1. As shown in FIG. 4, depth camera 110 is located at the same image collection position 304 (shown as position (X0, Y0, Z0)) at which high-precision test source 300 was positioned when reference image 130 was collected. Depth camera 110 collects a native image 132 of a motorcycle located at the same scene position 302 (shown as position (X1, Y1, Z1)) as shown in FIG. 3. Thus, in the embodiment shown in FIG. 4, depth camera 110 and high-resolution test source 300 are positioned so that the perspective of depth camera 110 matches, within an acceptable tolerance, the perspective of high-precision test source 300. Being so positioned, a native depth map 332 derived from test light intensity data included in native image 132 may be used, in conjunction with reference depth map 330, to calibrate depth camera 110, as described in more detail below.


In embodiments where a plurality of reference images 130 are collected from a plurality of calibration scenes 120, depth camera 110 collects corresponding native images 132 for the plurality of calibration scenes 120, being positioned to match the respective perspective of high-precision test source 300 for each native image 132 collected.


While the example provided above describes one or more native images 132 being collected from the same camera position and perspective from which respective reference images 130 were collected, it will be understood that, in some embodiments, acceptable position variation may exist to compensate for differences in perspective between high-precision test source 300 and depth camera 110 (e.g., physical differences and/or operational differences). Further, in some embodiments, acceptable perspective variation may be encountered from variations encountered during calibration (e.g., variation in calibration station configuration and/or operation). Regardless of the source of the position and/or perspective variation, in such embodiments, suitable position and/or perspective measurements may be performed to compensate for such variations.


As explained above, a native depth map and a reference depth map derived from images collected from a particular calibration scene may be used to calibrate the depth camera. Because the respective depth maps are derived from images of the same (within an acceptable tolerance) calibration scene, the algorithms used to derive a native depth map from a native image may be adjusted, in the calibration mode, to generate depth information that resembles the reference depth map for that scene. Set in the context of FIGS. 3 and 4, described above, a weighting function may be derived from reference depth map 330 and native depth map 332 so that, for a particular pixel or pixel region, native image distance information 308 (shown as “DN”) approaches reference image distance information 306 (shown as “DR”). For example, the weighting function may be configured so that a difference DR−DN approaches zero.


Turning back to FIG. 2A, at 210, method 200 includes fitting a local weighting function to vary test light intensity information toward the calibration light intensity information for the calibration scene. Because depth information for an image may be derived from light intensity information, in some embodiments, a weighting function may be generated to vary the native test light intensity for a particular pixel of the depth camera toward the calibration light intensity for the same pixel during the calibration mode. Subsequently, in the operation mode, the weighting function may act as a scalar for light intensity levels received from that particular pixel of the depth camera for inclusion in a processed image (in contrast with un-scaled light intensity information present in the corresponding raw image). Once scaled, the resulting depth information for the processed image may be more accurate and/or precise than depth information for the raw image.


For example, let LK(I,J) represent test light intensity information registered by a pixel in an I-th row and a J-th column (“pixel (I,J)”) of a photosensor of a K-th native image provided by the depth camera. Thus, if L*K(I,J) represents an intensity of a pixel in an I-th row and a J-th column of the processed image, L*K(I,J) may be generated from the weighting function and the raw light intensity information:








L
K
*



(

I
,
J

)


=




M
,

N

A










W


(


I
+
M

,

J
+
N


)





L


(


I
+
M

,

J
+
N


)


K








where W(I+M, J+N) is a weighting function and where A represents a neighborhood of pixels in the reference and native images including pixel (I,J).


In some embodiments, the weighting function may be fit so that a difference between the test light intensity information and the calibration light intensity information is minimized. In some of such embodiments, minimizing the difference may include minimizing a difference between the native depth map for a particular pixel of the native image and the reference depth map for a respective pixel included in the respective reference image. Thus, the weighting function may be fit by minimizing an objective function O. An example of such embodiments is described below, though it will be appreciated that other suitable fitting and optimization schemes may be employed without departing from the scope of the present disclosure.


For example, let O be a function of the processed light intensity information for pixel (I,J), L*K(I,J), and of the calibration light intensity information for pixel (I,J), TLK(I,J) of a K-th reference image provided by the high-precision test source:






O
=



TL
K



(

I
,
J

)


-


L
K
*



(

I
,
J

)









O
=




I
,
J
,
K









[



TL
K



(

I
,
J

)


-




M
,

N

A










W


(


I
+
M

,

J
+
N


)





L


(


I
+
M

,

J
+
N


)


K




]

2






In some embodiments, the local weighting function may be configured as a function of a set of N features of pixels in a neighborhood A of a particular pixel, so that the weighting function is fit according to variance, moments, and/or total variation in the feature set. In such embodiments, the weighting function may be a function of one or more of a variance, a moment, and a total variation between a pixel included in the native image and a respective pixel included in the respective reference image. However, it will be appreciated that other suitable methods of varying light intensity data for a particular pixel and/or for neighboring pixels proximal to that particular pixel may be employed without departing from the scope of the present disclosure.


While the weighting functions described above relate to processing performed on a raw image in an image processor of the depth camera (e.g., after the raw image is collected), it will be appreciated that, in some embodiments, a weighting function may be configured to vary hardware characteristics of the depth camera. For example, the weighting function may alter a gain for a photomultiplier operatively coupled with that particular pixel, so that the light intensity information generated by a pixel is scaled during generation of the image.


At 212, method 200 determines whether another local weighting function is to be fit for a different calibration condition. As explained above, in some embodiments, a plurality of reference and native images may be collected from a plurality of calibration scenes in consideration of various anticipated use conditions for the depth camera during the operation mode. In such embodiments, a plurality of local weighting functions may be generated according to such use conditions, so that a particular local weighting function tailored to a particular use condition may be selected. For example, in one scenario, a low-light local weighting function (or functions) may be generated from a calibration scene having a low-light condition. During an operation mode, the low-light local weighting function may be selected, programmatically or in response to user input, when low-light conditions exist. Thus, in some embodiments, a particular calibration scene may be one of a plurality of calibration scenes and a particular weighting function may be one of a plurality of local weighting functions. In such embodiments, each local weighting function may be fit to vary test light intensity information toward the calibration light intensity for a particular calibration scene of the plurality of calibration scenes.


In the embodiment shown in FIG. 2, if additional local weighting functions are to be fit, method 200 returns to 202, though it will be appreciated that in some embodiments, method 200 may return to any other suitable step, such as receiving a native image 206 or fitting a local weighting condition 210. If no additional local weighting functions are to be fit, method 200 continues to 214, shown in FIG. 2B.


Turning to FIG. 2B, method 200 includes, at 214, generating a global weighting function from a plurality of local weighting functions. In some embodiments, a global weighting function may be generated from a plurality of local weighting functions. With reference to the low-light condition scenario described above, in some of such embodiments, the global weighting function may be applied regardless of a condition of a use environment during the operation mode. In others of such embodiments, the global weighting function may be configured to emphasize the application of a low-light local weighting function in low-light conditions, while de-emphasizing the application of other local weighting functions, during the operation mode.


For clarity of explanation, FIG. 2B includes a metaphorical boundary 216, indicating a transition from a calibration phase to an operation mode for the depth camera. As explained above, in the operation mode, a raw image collected by the depth camera is converted to a processed image. In one example, during conversion, the weighting function (global or local) scales light intensity levels for a particular pixel of the raw image to generate the processed image. As explained when introducing method 200, in the embodiment of FIGS. 2A and 2B, portions of method 200 performed during an operation mode are performed at an image processing module of the depth camera.


At 218, method 200 includes receiving a raw image from the depth camera, and, at 220, converting the raw image into a processed image according to a weighting function. At 222, method 200 includes outputting the processed image. In some embodiments, outputting the processed image may include providing the processed image to a game console, though it will be appreciated that any suitable output may be performed without departing from the scope of the present embodiment.



FIG. 5 schematically shows an embodiment of a depth camera calibration station 104, including a depth camera calibration computing device 502, a data-holding subsystem 504, and a logic subsystem 506. Depth camera calibration computing device 502 also includes an image processing calibration module 508. In the embodiment shown in FIG. 5, image processing calibration module 508 is configured to receive, in a calibration mode, a plurality of reference images 130 from a high-precision test source for a plurality of calibration scenes 120, and a plurality of native images 132 output from depth camera 110 for the same set of calibration scenes 120. For each calibration scene, image processing calibration module 508 fits a local weighting function 512. In the embodiment shown in FIG. 5, local weighting function 512 is fit using an objective function 510. Depth camera calibration computing device 502 then outputs the plurality of local weighting functions 512 to depth camera 110.


As shown in the embodiment depicted in FIG. 5, depth camera 110 includes a logic subsystem 522 and a data-holding subsystem 520. Also included is an illumination subsystem 524 for illuminating an object to be imaged with illumination light. Illumination subsystem 524 may include a suitable light source (not shown) such as a laser configured to emit infrared light and suitable optics (not shown) such as diffraction gratings for tuning projected illumination light. Also included is an imaging subsystem 526 for receiving reflected illumination light. Imaging subsystem 526 may include various light filters (not shown) such as optical filters and band pass filters for filtering reflected illumination light prior to detection at a photosensor (not shown). Depth camera 110 also includes an image processing conversion module 528 configured to convert a raw image into a processed image using weighting functions 512 during an operation mode. Depth camera calibration computing device 502 and depth camera 110 may also include suitable peripheral input/output connections (not shown) configured to exchange reference images, native images, weighting functions, and processed images as described herein. Non-limiting examples of suitable connections include USB, USB 2.0, IEEE 1394, HDMI, 802.11x, and/or virtually any other suitable wired or wireless connection. In some embodiments, depth camera 110 may be included in a game console 550, which may include its own data-holding subsystem 552 and/or logic subsystem 554.


It will be appreciated that, in some embodiments, a portion or all of the elements described in depth camera calibration computing device 502 may be included in the depth camera. For example, FIG. 6 schematically shows another embodiment of a depth camera calibration station 104, image processing calibration module 508 and image processing conversion module 528 being included in depth camera 110. Depth camera 110 may also include suitable peripheral input/output connections (not shown) configured to receive reference images and output processed images as described herein.


Aspects of this disclosure will now be described by example and with reference to the illustrated embodiments listed above. Components, process steps, and other elements that may be substantially the same in one or more embodiments are identified coordinately and are described with minimal repetition. It will be noted, however, that elements identified coordinately may also differ to some degree. It will be further noted that the drawing figures included herein are schematic and generally not drawn to scale. Rather, the various drawing scales, aspect ratios, and numbers of components shown in the figures may be purposely distorted to make certain features or relationships easier to see.


It will be understood that he embodiments of computing devices shown in FIGS. 5 and 6 are shown in simplified form, and that virtually any computer architecture may be used without departing from the scope of this disclosure. Logic subsystems, such as logic subsystems 506, 522, and 554, may include one or more physical devices configured to execute one or more instructions. For example, the logic subsystem may be configured to execute one or more instructions that are part of one or more applications, services, programs, routines, libraries, objects, components, data structures, or other logical constructs. Such instructions may be implemented to perform a task, implement a data type, transform the state of one or more devices, or otherwise arrive at a desired result.


The logic subsystem may include one or more processors that are configured to execute software instructions. Additionally or alternatively, the logic subsystem may include one or more hardware or firmware logic machines configured to execute hardware or firmware instructions. Processors of the logic subsystem may be single core or multicore, and the programs executed thereon may be configured for parallel or distributed processing. The logic subsystem may optionally include individual components that are distributed throughout two or more devices, which may be remotely located and/or configured for coordinated processing. One or more aspects of the logic subsystem may be virtualized and executed by remotely accessible networked computing devices configured in a cloud computing configuration.


Data-holding subsystems, such as data-holding subsystems 504, 520, and 552, may include one or more physical, non-transitory, devices configured to hold data and/or instructions executable by the logic subsystem to implement the herein described methods and processes. When such methods and processes are implemented, the state of the data-holding subsystem may be transformed (e.g., to hold different data).


The data-holding subsystem may include removable media and/or built-in devices. The data-holding subsystem may include optical memory devices (e.g., CD, DVD, HD-DVD, Blu-Ray Disc, etc.), semiconductor memory devices (e.g., RAM, EPROM, EEPROM, etc.) and/or magnetic memory devices (e.g., hard disk drive, floppy disk drive, tape drive, MRAM, etc.), among others. The data-holding subsystem may include devices with one or more of the following characteristics: volatile, nonvolatile, dynamic, static, read/write, read-only, random access, sequential access, location addressable, file addressable, and content addressable. In some embodiments, the logic subsystem and the data-holding subsystem may be integrated into one or more common devices, such as an application specific integrated circuit or a system on a chip.


In some embodiments, the data-holding subsystem may include removable computer-readable storage media (not shown), which may be used to store and/or transfer data and/or instructions executable to implement the herein described methods and processes. Removable computer-readable storage media may take the form of CDs, DVDs, HD-DVDs, Blu-Ray Discs, EEPROMs, and/or floppy disks, among others.


It is to be appreciated that the data-holding subsystem includes one or more physical, non-transitory devices. In contrast, in some embodiments aspects of the instructions described herein may be propagated in a transitory fashion by a pure signal (e.g., an electromagnetic signal, an optical signal, etc.) that is not held by a physical device for at least a finite duration. Furthermore, data and/or other forms of information pertaining to the present disclosure may be propagated by a pure signal.


The term “module” may be used to describe an aspect of the computing devices described herein that is implemented to perform one or more particular functions. In some cases, a module may be instantiated via a logic subsystem executing instructions held by a data-holding subsystem. It is to be understood that different modules may be instantiated from the same application, service, code block, object, library, routine, API, function, etc. Likewise, the same module may be instantiated by different applications, services, code blocks, objects, routines, APIs, functions, etc. The term “module” is meant to encompass individual or groups of executable files, data files, libraries, drivers, scripts, database records, etc.


It is to be understood that the configurations and/or approaches described herein are exemplary in nature, and that these specific embodiments or examples are not to be considered in a limiting sense, because numerous variations are possible. The specific routines or methods described herein may represent one or more of any number of processing strategies. As such, various acts illustrated may be performed in the sequence illustrated, in other sequences, in parallel, or in some cases omitted. Likewise, the order of the above-described processes may be changed.


The subject matter of the present disclosure includes all novel and nonobvious combinations and subcombinations of the various processes, systems and configurations, and other features, functions, acts, and/or properties disclosed herein, as well as any and all equivalents thereof.

Claims
  • 1. At an image processing module of a computing device compatible with a depth camera, a method of calibrating the depth camera, the method comprising: receiving from the depth camera a native image including test light intensity information useable to derive a native depth map for a calibration scene;receiving from a high-precision test source a respective reference image taken from a same perspective as the native image and including calibration light intensity information useable to derive a reference depth map for the calibration scene independently from the native depth map; andfitting a weighting function to vary test light intensity information toward the calibration light intensity information for the calibration scene, the weighting function configured to scale light intensity information for a plurality of raw images received by the depth camera during a subsequent operation mode wherein the depth camera is operated independently from the high-precision test source.
  • 2. The method of claim 1, wherein the calibration scene is one of a plurality of calibration scenes, and wherein the weighting function is one of a plurality of local weighting functions, each local weighting function being fit to vary test light intensity information toward the calibration light intensity information for a particular calibration scene of the plurality of calibration scenes.
  • 3. The method of claim 2, further comprising generating a global weighting function for the depth camera from the plurality of local weighting functions.
  • 4. The method of claim 1, wherein receiving the respective reference image from the high-precision test source includes receiving the respective reference image from a high-resolution time-of-flight camera system.
  • 5. The method of claim 1, wherein receiving the respective reference image includes receiving the respective reference image from a 3-D triangulation system.
  • 6. The method of claim 1, wherein receiving the respective reference image includes receiving the respective reference image from a laser light detection and ranging system.
  • 7. The method of claim 1, wherein the weighting function is a function of one or more of a variance, a moment, and a total variation between a pixel included in the native image and a respective pixel included in the respective reference image.
  • 8. The method of claim 1, wherein fitting the weighting function includes minimizing a difference between the test light intensity information and the calibration light intensity information.
  • 9. The method of claim 8, wherein minimizing the difference includes minimizing a difference between the native depth map for a particular pixel of the native image and the reference depth map for a respective pixel included in the respective reference image.
  • 10. A computing device configured to calibrate a depth camera, the computing device comprising: a depth camera input configured to receive an image collected by the depth camera;a logic subsystem; anda data-holding subsystem holding instructions executable by the logic subsystem to: receive from the depth camera a native image including test light intensity information useable to derive a native depth map for each of a plurality of calibration scenes,receive from a high-precision test source that is different from the depth camera, a respective reference image taken from the same perspective as the native image including calibration light intensity information useable to derive a reference depth map for each of the plurality of calibration scenes,for each of the plurality of calibration scenes, fit a local weighting function to vary test light intensity information toward the calibration light intensity information, andgenerate a global weighting function for the depth camera from the local weighting functions.
  • 11. The computing device of claim 10, wherein the global weighting function is a function of one or more of a variance, a moment, and a total variation between a pixel included in the native image and a respective pixel included in the respective reference image.
  • 12. The computing device of claim 10, wherein fitting the local weighting function includes minimizing a difference between the test light intensity information and the calibration light intensity information.
  • 13. The computing device of claim 12, wherein minimizing the difference includes minimizing a difference between the native depth map for a particular pixel of the native image and the reference depth map for a respective pixel included in the respective reference image.
  • 14. The computing device of claim 10, wherein the high-precision test source includes one of a high-resolution time-of-flight camera system, a 3-D triangulation system, and a laser light detection and ranging system, and wherein the high-precision test source is configured to generate depth information for a particular calibration scene that is reliably more accurate than depth information generated by the depth camera.
  • 15. A depth camera computing device including an image processing module configured to output a processed image by application of a global weighting function to a raw image, the depth camera computing device comprising: a depth camera; anda data-holding subsystem holding instructions executable by a logic subsystem to: receive a raw image of a use environment from the depth camera,convert the raw image into a processed image according to a weighting function and without receiving a corresponding reference image of the use environment, the weighting function generated by varying test light intensity information generated by the depth camera from a native image previously collected by the depth camera from a calibration scene toward calibration light intensity information of a reference image collected by a high-precision test source, different than the depth camera, from a same perspective as the native image, andoutput the processed image.
  • 16. The depth camera of claim 15, wherein the test light intensity information is useable to derive a native depth map for the native image, and wherein the calibration light intensity information is useable to derive a reference depth map for the reference image.
  • 17. The depth camera of claim 16, wherein the weighting function is configured to adjust depth information for a particular pixel of the native image toward depth information for a respective pixel of the reference image included in the reference depth map.
  • 18. The depth camera of claim 16, wherein the calibration scene is one of a plurality of calibration scenes for which a native image and a respective reference image are collected, wherein the weighting function is a global weighting function generated from a plurality of local weighting functions, each local weighting function being fit to vary test light intensity information of a particular native image toward the calibration light intensity information of a particular reference image for a particular calibration scene of the plurality of calibration scenes.
  • 19. The depth camera of claim 18, wherein the instructions are further configured to select a particular local weighting function to be used instead of the global weighting function to process the native image according to an image quality of the native image prior to the native image being so processed.
  • 20. The depth camera of claim 15, wherein the weighting function is a function of one or more of a variance, a moment, and a total variation between a pixel included in the native image and a respective pixel included in the respective reference image.
US Referenced Citations (185)
Number Name Date Kind
4627620 Yang Dec 1986 A
4630910 Ross et al. Dec 1986 A
4645458 Williams Feb 1987 A
4695953 Blair et al. Sep 1987 A
4702475 Elstein et al. Oct 1987 A
4711543 Blair et al. Dec 1987 A
4751642 Silva et al. Jun 1988 A
4796997 Svetkoff et al. Jan 1989 A
4809065 Harris et al. Feb 1989 A
4817950 Goo Apr 1989 A
4843568 Krueger et al. Jun 1989 A
4893183 Nayar Jan 1990 A
4901362 Terzian Feb 1990 A
4925189 Braeunig May 1990 A
5101444 Wilson et al. Mar 1992 A
5148154 MacKay et al. Sep 1992 A
5184295 Mann Feb 1993 A
5229754 Aoki et al. Jul 1993 A
5229756 Kosugi et al. Jul 1993 A
5239463 Blair et al. Aug 1993 A
5239464 Blair et al. Aug 1993 A
5288078 Capper et al. Feb 1994 A
5295491 Gevins Mar 1994 A
5320538 Baum Jun 1994 A
5347306 Nitta Sep 1994 A
5359385 Ishida et al. Oct 1994 A
5385519 Hsu et al. Jan 1995 A
5405152 Katanics et al. Apr 1995 A
5417210 Funda et al. May 1995 A
5423554 Davis Jun 1995 A
5454043 Freeman Sep 1995 A
5469740 French et al. Nov 1995 A
5495576 Ritchey Feb 1996 A
5516105 Eisenbrey et al. May 1996 A
5524637 Erickson et al. Jun 1996 A
5534917 MacDougall Jul 1996 A
5563988 Maes et al. Oct 1996 A
5577981 Jarvik Nov 1996 A
5580249 Jacobsen et al. Dec 1996 A
5594469 Freeman et al. Jan 1997 A
5597309 Riess Jan 1997 A
5616078 Oh Apr 1997 A
5617312 Iura et al. Apr 1997 A
5638300 Johnson Jun 1997 A
5641288 Zaenglein Jun 1997 A
5682196 Freeman Oct 1997 A
5682229 Wangler Oct 1997 A
5690582 Ulrich et al. Nov 1997 A
5703367 Hashimoto et al. Dec 1997 A
5704837 Iwasaki et al. Jan 1998 A
5715834 Bergamasco et al. Feb 1998 A
5875108 Hoffberg et al. Feb 1999 A
5877803 Wee et al. Mar 1999 A
5913727 Ahdoot Jun 1999 A
5933125 Fernie Aug 1999 A
5980256 Carmein Nov 1999 A
5989157 Walton Nov 1999 A
5995649 Marugame Nov 1999 A
6005548 Latypov et al. Dec 1999 A
6009210 Kang Dec 1999 A
6054991 Crane et al. Apr 2000 A
6066075 Poulton May 2000 A
6072494 Nguyen Jun 2000 A
6073489 French et al. Jun 2000 A
6077201 Cheng et al. Jun 2000 A
6098458 French et al. Aug 2000 A
6100896 Strohecker et al. Aug 2000 A
6101289 Kellner Aug 2000 A
6128003 Smith et al. Oct 2000 A
6130677 Kunz Oct 2000 A
6141463 Covell et al. Oct 2000 A
6147678 Kumar et al. Nov 2000 A
6152856 Studor et al. Nov 2000 A
6159100 Smith Dec 2000 A
6173066 Peurach et al. Jan 2001 B1
6181343 Lyons Jan 2001 B1
6188777 Darrell et al. Feb 2001 B1
6215890 Matsuo et al. Apr 2001 B1
6215898 Woodfill et al. Apr 2001 B1
6226396 Marugame May 2001 B1
6229913 Nayar et al. May 2001 B1
6256033 Nguyen Jul 2001 B1
6256400 Takata et al. Jul 2001 B1
6283860 Lyons et al. Sep 2001 B1
6289112 Jain et al. Sep 2001 B1
6299308 Voronka et al. Oct 2001 B1
6308565 French et al. Oct 2001 B1
6316934 Amorai-Moriya et al. Nov 2001 B1
6363160 Bradski et al. Mar 2002 B1
6384819 Hunter May 2002 B1
6411744 Edwards Jun 2002 B1
6430997 French et al. Aug 2002 B1
6476834 Doval et al. Nov 2002 B1
6496598 Harman Dec 2002 B1
6503195 Keller et al. Jan 2003 B1
6539931 Trajkovic et al. Apr 2003 B2
6570555 Prevost et al. May 2003 B1
6633294 Rosenthal et al. Oct 2003 B1
6640202 Dietz et al. Oct 2003 B1
6661918 Gordon et al. Dec 2003 B1
6681031 Cohen et al. Jan 2004 B2
6714665 Hanna et al. Mar 2004 B1
6731799 Sun et al. May 2004 B1
6738066 Nguyen May 2004 B1
6765726 French et al. Jul 2004 B2
6788809 Grzeszczuk et al. Sep 2004 B1
6801637 Voronka et al. Oct 2004 B2
6873723 Aucsmith et al. Mar 2005 B1
6876496 French et al. Apr 2005 B2
6937742 Roberts et al. Aug 2005 B2
6950534 Cohen et al. Sep 2005 B2
7003134 Covell et al. Feb 2006 B1
7036094 Cohen et al. Apr 2006 B1
7038855 French et al. May 2006 B2
7039676 Day et al. May 2006 B1
7042440 Pryor et al. May 2006 B2
7050606 Paul et al. May 2006 B2
7058204 Hildreth et al. Jun 2006 B2
7060957 Lange et al. Jun 2006 B2
7113918 Ahmad et al. Sep 2006 B1
7121946 Paul et al. Oct 2006 B2
7170492 Bell Jan 2007 B2
7184048 Hunter Feb 2007 B2
7202898 Braun et al. Apr 2007 B1
7222078 Abelow May 2007 B2
7227526 Hildreth et al. Jun 2007 B2
7259747 Bell Aug 2007 B2
7308112 Fujimura et al. Dec 2007 B2
7317836 Fujimura et al. Jan 2008 B2
7348963 Bell Mar 2008 B2
7359121 French et al. Apr 2008 B2
7367887 Watabe et al. May 2008 B2
7379563 Shamaie May 2008 B2
7379566 Hildreth May 2008 B2
7389591 Jaiswal et al. Jun 2008 B2
7412077 Li et al. Aug 2008 B2
7421093 Hildreth et al. Sep 2008 B2
7430312 Gu Sep 2008 B2
7436496 Kawahito Oct 2008 B2
7450736 Yang et al. Nov 2008 B2
7452275 Kuraishi Nov 2008 B2
7460690 Cohen et al. Dec 2008 B2
7489812 Fox et al. Feb 2009 B2
7536032 Bell May 2009 B2
7555142 Hildreth et al. Jun 2009 B2
7560679 Gutierrez Jul 2009 B1
7560701 Oggier et al. Jul 2009 B2
7570805 Gu Aug 2009 B2
7574020 Shamaie Aug 2009 B2
7576727 Bell Aug 2009 B2
7590262 Fujimura et al. Sep 2009 B2
7593552 Higaki et al. Sep 2009 B2
7598942 Underkoffler et al. Oct 2009 B2
7607509 Schmiz et al. Oct 2009 B2
7620202 Fujimura et al. Nov 2009 B2
7668340 Cohen et al. Feb 2010 B2
7680298 Roberts et al. Mar 2010 B2
7683954 Ichikawa et al. Mar 2010 B2
7684592 Paul et al. Mar 2010 B2
7701439 Hillis et al. Apr 2010 B2
7702130 Im et al. Apr 2010 B2
7704135 Harrison, Jr. Apr 2010 B2
7710391 Bell et al. May 2010 B2
7729530 Antonov et al. Jun 2010 B2
7746345 Hunter Jun 2010 B2
7760182 Ahmad et al. Jul 2010 B2
7787105 Hipp Aug 2010 B2
7809167 Bell Oct 2010 B2
7834846 Bell Nov 2010 B1
7852262 Namineni et al. Dec 2010 B2
RE42256 Edwards Mar 2011 E
7898522 Hildreth et al. Mar 2011 B2
8035612 Bell et al. Oct 2011 B2
8035614 Bell et al. Oct 2011 B2
8035624 Bell et al. Oct 2011 B2
8072470 Marks Dec 2011 B2
20080026838 Dunstan et al. Jan 2008 A1
20100046802 Watanabe et al. Feb 2010 A1
20100208034 Chen Aug 2010 A1
20110025827 Shpunt et al. Feb 2011 A1
20110032341 Ignatov et al. Feb 2011 A1
20110069064 Zhang et al. Mar 2011 A1
20110096832 Zhang et al. Apr 2011 A1
20110211045 Bollano et al. Sep 2011 A1
20110254923 Choi et al. Oct 2011 A1
Foreign Referenced Citations (6)
Number Date Country
201254344 Jun 2010 CN
0583061 Feb 1994 EP
08044490 Feb 1996 JP
9310708 Jun 1993 WO
9717598 May 1997 WO
9944698 Sep 1999 WO
Non-Patent Literature Citations (31)
Entry
Kanade et al., “A Stereo Machine for Video-rate Dense Depth Mapping and Its New Applications”, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1996, pp. 196-202,The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
Miyagawa et al., “CCD-Based Range Finding Sensor”, Oct. 1997, pp. 1648-1652, vol. 44 No. 10, IEEE Transactions on Electron Devices.
Rosenhahn et al., “Automatic Human Model Generation”, 2005, pp. 41-48, University of Auckland (CITR), New Zealand.
Aggarwal et al., “Human Motion Analysis: A Review”, IEEE Nonrigid and Articulated Motion Workshop, 1997, University of Texas at Austin, Austin, TX.
Shao et al., “An Open System Architecture for a Multimedia and Multimodal User Interface”, Aug. 24, 1998, Japanese Society for Rehabilitation of Persons with Disabilities (JSRPD), Japan.
Kohler, “Special Topics of Gesture Recognition Applied in Intelligent Home Environments”, In Proceedings of the Gesture Workshop, 1998, pp. 285-296, Germany.
Kohler, “Vision Based Remote Control in Intelligent Home Environments”, University of Erlangen-Nuremberg/Germany, 1996, pp. 147-154, Germany.
Kohler, “Technical Details and Ergonomical Aspects of Gesture Recognition applied in Intelligent Home Environments”, 1997, Germany.
Hasegawa et al., “Human-Scale Haptic Interaction with a Reactive Virtual Human in a Real-Time Physics Simulator”, Jul. 2006, vol. 4, No. 3, Article 6C, ACM Computers in Entertainment, New York, NY.
Qian et al., “A Gesture-Driven Multimodal Interactive Dance System”, Jun. 2004, pp. 1579-1582, IEEE International Conference on Multimedia and Expo (ICME), Taipei, Taiwan.
Zhao, “Dressed Human Modeling, Detection, and Parts Localization”, 2001, The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
He, “Generation of Human Body Models”, Apr. 2005, University of Auckland, New Zealand.
Isard et al., “Condensation—Conditional Density Propagation for Visual Tracking”, 1998, pp. 5-28, International Journal of Computer Vision 29(1), Netherlands.
Livingston, “Vision-based Tracking with Dynamic Structured Light for Video See-through Augmented Reality”, 1998, University of North Carolina at Chapel Hill, North Carolina, USA.
Wren et al., “Pfinder: Real-Time Tracking of the Human Body”, MIT Media Laboratory Perceptual Computing Section Technical Report No. 353, Jul. 1997, vol. 19, No. 7, pp. 780-785, IEEE Transactions on Pattern Analysis and Machine Intelligence, Caimbridge, MA.
Breen et al., “Interactive Occlusion and Collusion of Real and Virtual Objects in Augmented Reality”, Technical Report ECRC-95-02, 1995. European Computer-Industry Research Center GmbH, Munich, Germany.
Freeman et al., “Television Control by Hand Gestures”, Dec. 1994, Mitsubishi Electric Research Laboratories, TR94-24, Caimbridge, MA.
Hongo et al., “Focus of Attention for Face and Hand Gesture Recognition Using Multiple Cameras”, Mar. 2000, pp. 156-161, 4th IEEE International Conference on Automatic Face and Gesture Recognition, Grenoble, France.
Pavlovic et al., “Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review”, Jul. 1997, pp. 677-695, vol. 19, No. 7, IEEE Transactions on Pattern Analysis and Machine Intelligence.
Azarbayejani et al., “Visually Controlled Graphics”, Jun. 1993, vol. 15, No. 6, IEEE Transactions on Pattern Analysis and Machine Intelligence.
Granieri et al., “Simulating Humans in VR”, The British Computer Society, Oct. 1994, Academic Press.
Brogan et al., “Dynamically Simulated Characters in Virtual Environments”, Sep./Oct. 1998, pp. 2-13, vol. 18, Issue 5, IEEE Computer Graphics and Applications.
Fisher et al., “Virtual Environment Display System”, ACM Workshop on Interactive 3D Graphics, Oct. 1986, Chapel Hill, NC.
“Virtual High Anxiety”, Tech Update, Aug. 1995, pp. 22.
Sheridan et al., “Virtual Reality Check”, Technology Review, Oct. 1993, pp. 22-28, vol. 96, No. 7.
Stevens, “Flights into Virtual Reality Treating Real World Disorders”, The Washington Post, Mar. 27, 1995, Science Psychology, 2 pages.
“Simulation and Training”, 1994, Division Incorporated.
Oprisescu, et al., “Measurements with ToF Cameras and Their Necessary Corrections”, Retrieved at <<http://www.artts.eu/publications/oprisescu—et—al—ieee.pdf>>, In the Proceedings of ISSCS, Jul. 12-13, 2007, p. 4.
Ringbeck, et al., “A 3D Time of Flight Camera for Object Detection”, Retrieved at <<http://www.ifm.com/obj/O1D—Paper-PMD.pdf>> Jul. 9-12, 2007, p. 10.
Joodaki, et al., “Using Neural Networks for High Resolution Distance Measurements Radar” Retrieved at <<http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=00928274>> , In the Proceedings of the 18th IEEE Instrumentation and Measurement Technology Conference, May 21-23, 2001, pp. 1242-1246.
Falie, et al., “Distance errors correction for the Time of Flight (ToF) Cameras” Retrieved at <<http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=04611675>>, In the Proceedings of 4th European Conference on Circuits and Systems for Communications, Jul. 10-11, 2008, pp. 193-196.
Related Publications (1)
Number Date Country
20120249738 A1 Oct 2012 US