Many computing systems include at least one display and at least one input device. The display may include, for example, a monitor, a screen, or the like. Example input devices include a mouse, a keyboard, a touchpad, or the like. Some computing systems include a touch-sensitive display to both display output of the computing system and receive physical (e.g., touch) input.
The following detailed description references the drawings, wherein:
In addition to the input devices mentioned above, a camera is another example of an input device for a computing system. In some examples, a computing system may capture video or a still image with the camera. The video or still image may be stored or provided to another computing system via a suitable computer network. In other examples, an image captured by the camera may be analyzed, and the computing system may utilize the content of the image as input to the computing system. For example, the computing system may analyze an object represented in the captured image, and determine input to the computing system based on characteristics (e.g., location, position, orientation, shape, etc.) of the object represented in the image.
In such examples, the computing system may perform a process on the captured image to extract an image of at least one foreground object represented in the captured image. This process may be referred to herein as “segmentation”. In some examples, the result of such a segmentation process may be an image of the foreground object separated from at least a background represented in the captured image.
In some examples, a segmentation process may comprise determining a segmentation boundary for an object represented in a captured image. As used herein, a “segmentation boundary” for an object represented in an image may be information representing an estimate of which portion(s) of the image represent the object and which portion(s) of the image represent features other than the object (e.g., a background). In some examples, a segmentation boundary for an object represented in an image may include information representing at least one outer edge of the object as represented in the image. When performing a segmentation process, a computing system may use the segmentation boundary to extract an image of the object from a larger captured image (e.g., also representing portions of a background).
However, it may be difficult to accurately determine a segmentation boundary for an object based on an image captured by a single camera, as certain conditions may make it difficult to accurately distinguish the foreground object from the background in the image. For example, it may be difficult to accurately determine a segmentation boundary based on an image captured by a color camera (e.g., an RGB camera) in the presence of shadows, or when the foreground object and the background are very similar in color.
To address these issues, examples described herein may determine a segmentation boundary based on multiple images captured by cameras of different types. Examples described herein may include an infrared (IR)-absorbing surface, and an IR camera disposed above and pointed at the IR-absorbing surface to capture IR image representing an object disposed between the IR camera and the IR-absorbing surface based on IR light reflected by the object. Examples may also include an RGB camera to capture an RGB image representing the object disposed between the RGB camera and the IR-absorbing surface, and a segmentation engine to determine a segmentation boundary representing at least one outer edge of the object based on the IR image and the RGB image.
By using multiple images from cameras of different types, examples described herein may more accurately determine a segmentation boundary, as conditions affecting segmentation performed on images from one type of camera may not affect segmentation on images from camera of a different type. For example, an image captured by an IR camera may not be affected by either shadows or color similarity. Additionally, examples described herein may capture images of objects over an IR-absorbing surface, which may increase the contrast between the background (e.g., the IR-absorbing surface) and foreground objects exhibiting greater reflection of IR light, which may thereby improve segmentation performed based, at least in pert, on IR images captured with the IR-absorbing surface as the background.
Referring now to the drawings.
Computing device 150 may comprise any suitable computing device complying with the principles disclosed herein. As used herein, a “computing device” may comprise an electronic display device, a smartphone, a tablet, a chip set, an al-in-one computer (e.g., a device comprising a display device that also houses processing resource(s) of the computer), a desktop computer, a notebook computer, workstation, server, any other processing device or equipment, or a combination thereof. In this example, device 150 is an all-in-one computer having a central axis or center line 155, first or top side 150A, a second or bottom side 1505 axially opposite the top side 150A, a front side 150C extending axially between sides 150A and 150B, a rear side 150D also extending axially between sides 150A and 150B and generally radially opposite front side 150C. A display 152 is disposed along front side 150C and defines a viewing surface of computing system 100 to display images for viewing by a user of system 100. In examples described herein, a display may include components of any technology suitable for displaying images, video, or the like.
In some examples, display 152 may be a touch-sensitive display. In examples described herein, a touch-sensitive display may include, for example, any suitable technology (e.g., components) for displaying images, video, or the like, and may include any suitable technology (e.g., components) for detecting physical contact (e.g., touch input), such as, for example, a resistive, capacitive, surface acoustic wave, infrared (IR), strain gauge, optical imaging, acoustic pulse recognition, dispersive signal sensing, or in-cell system, or the like. In examples described herein, display 152 may be referred to as a touch-sensitive display 152. Device 150 may further include a camera 154, which may be a web camera, for example. In some examples, camera 154 may capture images of a user positioned in front of display 152. In some examples, device 150 may also include a microphone or other device to receive sound input (e.g., voice input from a user).
In the example of
Upright member 140 includes a first or upper end 140A, a second or lower end 1408 opposite the upper end 140A, a first or front side 140C extending between the ends 140A and 1408, and a second or rear side 1400 opposite the front side 140C and also extending between the ends 140A and 1408. Lower end 1408 of member 140 is coupled to rear end 1208 of base 120, such that member 140 extends substantially upward from support surface 15.
Top 160 includes a first or proximate end 160A, a second or distal end 160B opposite the proximate end 160A, a top surface 160C extending between ends 160A and 1608, and a bottom surface 1600 opposite the top surface 160C and also extending between ends 160A and 1608. Proximate end 160A of top 160 is coupled to upper end 140A of upright member 140 such that distal end 160B extends outward from upper end 140A of upright member 140. As such, in the example shown in
IR-absorbing surface 200 may include a central axis or centerline 205, a first or front side 200A, and a second or rear side 200B axially opposite the front side 200A. In the example of
As described above, surface 200 may be aligned with base 120 of structure 110 to assist with proper alignment of surface 200 (e.g., at least during operation of system 100). In the example of
In some examples, region 202 of surface 200 and device 150 may be communicatively connected (e.g., electrically coupled) to one another such that user inputs received by region 202 may be communicated to device 150. Region 202 and device 150 may communicate with one another via any suitable wired or wireless communication technology or mechanism, such as, for example, WI-FI. BLUETOOTH, ultrasonic technology, electrical cables, electrical leads, electrical conductors, electrical spring-loaded pogo pins with magnetic holding force, or the like, or a combination thereof. In the example of
Referring to
Referring to
Referring again to
Referring still to
Sensor bundle 164 includes a plurality of sensors (e.g., cameras, or other types of sensors) to detect, measure, or otherwise acquire data based on the state of (e.g., activities occurring in) a region between sensor bundle 164 and surface 200. The state of the region between sensor bundle 164 and surface 200 may include object(s) on or over surface 200, or activit(ies) occurring on or near surface 200. In the example of
In some examples, RGB camera 164A may be a camera to capture color images (e.g., at least one of still images and video). In some examples, RGB camera 164A may be a camera to capture images according to the RGB color model, which may be referred to herein as “RGB images”. In some examples, RGB camera 164A may capture images with relatively high resolution, such as a resolution on the order of multiple megapixels (MPs), for example. As an example, RGB camera 164A may capture color (e.g., RGB) images with a resolution of 14 MPs. In other examples, RBG camera 164A may capture images with a different resolution. In some examples, RGB camera 164A may be pointed toward surface 200 and may capture image(s) of surface 200, object(s) disposed between surface 200 and RGB camera 164A (e.g., on or above surface 200), or a combination thereof.
IR camera 164B may be a camera to detect intensity of IR light at a plurality of points in the field of view of the camera 164B. In examples described herein. IR camera 164B may operate in conjunction with an IR light projector 166 (see FIG. TA) of system 100 to capture IR images. In such examples, each IR image may comprise a plurality of pixels each representing an intensity of IR light detected at a point represented by the pixel. In some examples, top 160 of system 100 may include an IR light projector 166 (see
As noted above, surface 200 may be an IR-absorbing surface 200. In examples described herein, an “IR-absorbing” surface may be a surface to at least partially absorb projected IR light such that an IR camera (e.g., IR camera 164B) may detect significantly less intensity of IR light reflected from the IR-absorbing surface than from a non-IR-absorbing object under the same IR light projection conditions. For example, an IR-absorbing surface may be a surface including, coated with, or otherwise prepared with an IR light absorbing substance (e.g., ink) to at least partially absorb IR light such that an IR camera may detect significantly less intensity of IR light reflected from the IR-absorbing surface than from another object not including, coated with, or otherwise prepared with such an IR absorbing substance. In some examples, the IR absorbing substance may be substantially transparent to visible light. In some examples, IR-absorbing surface 200 may appear white. In such examples, it may be difficult to detect the edges of a sheet of white paper from an RGB image of the white paper disposed on the white surface 200. In examples described herein, IR camera 184B may capture an IR image of the sheet of paper on surface 200 (e.g., as a background) in which the pixels of the IR image representing portions of the IR-absorbing background have significantly lower IR light intensity values than pixels representing portions of the sheet of paper, such that the difference in intensity values provide sufficient contrast to reliably detect the edges of the sheet of paper in the IR image.
Depth camera 164C may be a camera (sensor(s), etc.) to detect the respective distance(s) (or depth(s)) of portions of object(s) in the field of view of depth camera 164C. As used herein, the data detected by a depth camera may be referred to herein as “distance” or “depth” data. In examples described herein, depth camera 164C may capture a multi-pixel depth image (e.g., a depth map), wherein the data of each pixel represents the distance or depth (measured from camera 164C) of a portion of an object at a point represented by the pixel. Depth camera 164C may be implemented using any suitable technology, such as stereovision camera(s), a single IR camera sensor with a uniform flood of IR light, a dual IR camera sensor with a uniform flood of IR light, structured light depth sensor technology, time-of-flight (TOF) depth sensor technology, or a combination thereof. In some examples, depth sensor 164C may indicate when an object (e.g., a three-dimensional object) is on surface 200. In some examples, depth sensor 164C may detect at least one of the presence, shape, contours, motion, and the respective distance(s) of an object (or portions thereof) placed on surface 200.
Ambient light sensor 1640 may be arranged to measure the intensity of light in the environment surrounding system 100. In some examples, system 100 may use the measurements of sensor 164D to adjust other components of system 100, such as, for example, exposure settings of sensors or cameras of system 100 (e.g., cameras 164A-164C), the intensity of the light emitted from light sources of system 100 (e.g., projector assembly 184, display 152, etc.), or the like.
In some examples, sensor bundle 164 may omit at least one of sensors 164A-164D. In other examples, sensor bundle 164 may comprise other camera(s), sensor(s), or the like in addition to sensors 164A-164D, or in lieu of at least one of sensors 164A-164D. For example, sensor bundle 164 may include a user interface sensor comprising any suitable device(s) (e.g., sensor(s), camera(s)) for tracking a user input device such as, for example, a hand, stylus, pointing device, etc. In some examples, the user interface sensor may include a pair of cameras which are arranged to stereoscopically track the location of a user input device (e.g., a stylus) as it is moved by a user about the surface 200 (e.g., about region 202 of surface 200). In other examples, the user interface sensor may additionally or alternatively include IR camera(s) or sensor(s) arranged to detect infrared light that is either emitted or reflected by a user input device.
In examples described herein, each of sensors 164A-1640 of bundle 164 is communicatively connected (e.g., coupled) to device 150 such that data generated within bundle 164 (e.g., images captured by the cameras) may be provided to device 150, and device 150 may provide commands to the sensor(s) and camera(s) of sensor bundle 164. Sensors 164A-164D of bundle 164 may be communicatively connected to device 150 via any suitable wired or wireless communication technology or mechanism, examples of which are described above. In the example of
Referring to
In some examples, cameras 164A-164C of sensor bundle 164 are arranged within system 100 such that the field of view of each of cameras 164A-164C includes a space 168 of surface 200 that may overlap with some or all of display space 188 or may be coterminous with display space 188. In examples described herein, the field of view of cameras 164A-164C may be said to include space 168, though at times surface 200 may be at least partially occluded by object(s) on or over surface 200. In such examples, the object(s) on or over surface 200 may be in the field of view of at least one of cameras 164A-164C. In such examples, sensors of sensor bundle 164 may acquire data based on the state of (e.g., activities occurring in, object(s) disposed in) a region between sensor bundle 164 and space 168 of surface 200. In some examples, both space 188 and space 168 coincide or correspond with region 202 of surface 200 such that functionalities of touch sensitive region 202, projector assembly 184, and sensor bundle 164 are all performed in relation to the same defined area. A field of view 165 of cameras 164A-164C is schematically illustrated in
Referring now to
As an example, when a user interacts with region 202 of surface 200 (e.g., with a hand 35, as shown in
In some examples, sensors of sensor bundle 164 may also generate system input which may be provided to device 150 for further processing. For example, system 100 may utilize at least sensor(s) of bundle 164 and segmentation engine 170 detect at least one of the presence and location of a user's hand 35 (or a stylus 25, as shown in
In some examples, system 100 may capture two-dimensional (2D) image(s) or create a three-dimensional (3D) scan of a physical object such that an image of the object may then be projected onto region 202 for further use and manipulation thereof. For example, as shown in
Computing device 150 (or any other computing device implementing segmentation engine 170) may include at least one processing resource. In examples described herein, a processing resource may include, for example, one processor or multiple processors included in a single computing device or distributed across multiple computing devices. As used herein, a “processor” may be at least one of a central processing unit (CPU), a semiconductor-based microprocessor, a graphics processing unit (GPU), a field-programmable gate array (FPGA) configured to retrieve and execute instructions, other electronic circuitry suitable for the retrieval and execution instructions stored on a machine-readable storage medium, or a combination thereof.
Each of engines 170, 172, 174, 176, 178, and any other engines of computing device 150, may be any combination of hardware and programming to implement the functionalities of the respective engine. Such combinations of hardware and programming may be implemented in a number of different ways. For example, the programming may be processor executable instructions stored on a non-transitory machine-readable storage medium and the hardware may include a processing resource to execute those instructions. In such examples, the machine-readable storage medium may store instructions that, when executed by the processing resource, implement the engines. The machine-readable storage medium storing the instructions may be integrated in the same computing device (e.g., device 150) as the processing resource to execute the instructions, or the machine-readable storage medium may be separate from but accessible to the computing device and the processing resource. The processing resource may comprise one processor or multiple processors included in a single computing device or distributed across multiple computing devices.
In some examples, the instructions can be part of an installation package that, when installed, can be executed by the processing resource to implement the engines of system 100. In such examples, the machine-readable storage medium may be a portable medium, such as a compact disc, DVD, or flash drive, or a memory maintained by a server from which the installation package can be downloaded and installed. In other examples, the instructions may be part of an application or applications already installed on a computing device including the processing resource (e.g., device 150). In such examples, the machine-readable storage medium may include memory such as a hard drive, solid state drive, or the like.
As used herein, a “machine-readable storage medium” may be any electronic, magnetic, optical, or other physical storage apparatus to contain or store information such as executable instructions, data, and the like. For example, any machine-readable storage medium described herein may be any of a storage drive (e.g., a hard drive), flash memory, Random Access Memory (RAM), any type of storage disc (e.g., a compact disc, a DVD, etc.), and the like, or a combination thereof. Further, any machine-readable storage medium described herein may be non-transitory.
Examples of segmentation engine 170 are described below in relation to
IR camera 164B may capture an IR image 704 (see
Depth camera 164C may capture a depth image 706 (see
In some examples, IR camera 164B may be a low-resolution IR camera, and captured IR image 704 may be a low-resolution IR image. Also, in some examples, depth camera 164C may be a low-resolution depth camera and captured depth image 706 may be a low-resolution depth image. For example, one or both of IR camera 164B and depth camera 164C may have video graphics array (VGA) resolution, quarter video graphics array (QVGA) resolution, or any other resolution that is significantly less than the resolution of RGB camera 164A. In such examples, the captured IR and depth images 704 and 706 may have the resolution of the camera that captured it (e.g., VGA resolution, QVGA resolution, etc.).
In some examples described herein, the “resolution” of a camera or image may refer to a two-dimensional pixel resolution of the camera or image. Such a two-dimensional pixel resolution may be expressed as A pixels×B pixels, where A and B are positive integers. In examples described herein, a “low-resolution” camera of a computing system may be a camera having a significantly lower resolution than another camera of the computing system, and a “low-resolution” image captured by a camera of the computing system may be an image with a significantly lower resolution than an image capable of being captured with another camera of the computing system. Additionally, in examples described herein, a “high-resolution” camera of a computing system may be a camera having a significantly higher resolution than another camera of the computing system, and a ‘high-resolution’ image captured by a camera of the computing system may be an image with a significantly higher resolution than a maximum resolution of images capable of being captured with another camera of the computing system. For example, computing system 100 may comprise a low-resolution IR camera 164B to capture a low-resolution IR image 704, a low-resolution depth camera 164C to capture a low-resolution depth image 706, and a high-resolution RGB camera 164A to capture a high-resolution RGB image 702.
In some examples, the two-dimensional pixel resolution of RGB camera 184A and RGB images captured by it (including RGB image 702, for example) may be at least ten times greater, in each dimension, than the two-dimensional pixel resolution of IR camera 164B and IR images captured by it (including IR image 704, for example). For example, IR camera 164B and IR images captured by it may have a resolution of approximately 320 pixels×240 pixels, while RGB camera 164A and RGB images captured by it may have a resolution of approximately 4416 pixels×3312 pixels. In other examples, the cameras and images may have different resolutions.
In some examples, the two-dimensional pixel resolution of RGB camera 164A and RGB images captured by it may also be at least ten times greater. In each dimension, than the two-dimensional pixel resolution of depth camera 164C and depth images captured by it (including depth image 706, for example). For example, depth camera 164C and depth images captured by it may have a resolution of approximately 320 pixels×240 pixels, while RGB camera 164A and RGB images captured by it may have a resolution of approximately 4416 pixels×3312 pixels. In other examples, the cameras and images may have different resolutions.
In some examples, segmentation engine 170 may determine a segmentation boundary 708 based on captured IR image 704, captured depth image 706, and captured RGB image 702. A schematic diagram of an example segmentation boundary 708 is illustrated in
In some examples, computing device 150 may use segmentation boundary 708 to extract an image of object 35 from a captured image representing more than object 35 (e.g., RGB image 702). For example, computing device 150 may extract the portion of RGB image 702 corresponding to object representation 728 of segmentation boundary 708 to obtain a segmented image of the portion of RGB image 702 representing object 35. In the example of
In some examples, cameras 164A-164C may be at different physical locations. As such, cameras 164A-164C may capture respective images of the same scene (e.g., viewing surface 200 from above) from slightly different angles. In such examples, segmentation engine 170 may geometrically align images 702, 704, and 706 captured by cameras 164A-164C. For example, segmentation engine 170 may construct at least one homography (or other mapping(s)) for the pixels of cameras 164A-164C such that pixels corresponding to the same image features (e.g., object 35) may be identified in each of images 702, 704, and 706. The homography or other mapping may be determined in any suitable manner. In some examples, segmentation engine 170 may map the pixels of each of images 702, 704, and 706 to a common set of coordinates to geometrically align the images. In some examples, engine 170 may perform such geometric alignment prior to performing other functionalities for a segmentation process described below in relation to engines 172, 174, 176, and 178.
In the example of
Boundary engine 174 may determine a preliminary segmentation boundary for object 35 based on vector image 190. For example, engine 174 may analyze vector image 190 to estimate the location of edges of object 35 represented in vector image 190, based on the IR intensity and distance data at the pixels of vector image 190 to determine the preliminary segmentation boundary. The preliminary segmentation boundary may be a segmentation boundary, as described above, and may be represented in any suitable manner as described above. Engine 174 may perform the edge estimation (or edge detection) process in any suitable manner.
As an example, engine 174 may run a gradient filler over both the IR intensity and distance data of vector image 190 to detect portions of vector image 190 having relatively high gradient magnitudes to estimate at least the edge(s) of object 35. In such examples, engine 174 may determine the preliminary segmentation boundary based at least in part on at least one boundary 725 between the lesser IR light intensity values of IR image 704 (e.g., of surface representation 714) and the greater IR light intensity values of IR image 704 (e.g., of object representation 724). As described below, segmentation boundary 708 may be determined based on the preliminary segmentation boundary. As such, segmentation engine 170 may determine segmentation boundary 708 based at least in part on at least one boundary 725 between the lesser IR light intensity values of image 704 and the greater IR light intensity values of image 704.
In some examples, engine 174 may utilize the IR intensity and distance data together in any suitable manner. For example, engine 174 may estimate that a portion of vector image 190 represents an edge of object 35 if either one of IR intensity data and the distance data suggests (or otherwise indicates) the presence of an edge. In other examples, engine 174 may not estimate that a portion of vector image 190 represents an edge of object 35 unless both the IR intensity data and the distance data suggests (or otherwise indicates) the presence of an edge. In some examples, engine 174 may additionally or alternatively utilize various heuristic(s), rule(s), or the like, for estimating the presence of edges of object 35 based on the IR intensity and distance data of vector image 190.
In some examples, the preliminary segmentation boundary may have (or otherwise correspond to) the two-dimensional pixel resolution of at least one of IR image 704 and depth image 706 (i.e., a low resolution relative to RGB image 702). For example, the preliminary segmentation boundary may have approximately a VGA resolution (e.g., 640 pixels×480 pixels), a QVGA resolution (e.g., 320 pixels×240 pixels), or another low resolution relative to the resolution of RGB image 702. In such examples, upsample engine 176 may upsample the preliminary segmentation boundary to the resolution of RGB image 702 (described above), or to approximately that resolution. Engine 176 may upsample the preliminary segmentation boundary in any suitable manner. For example, engine 176 may scale up the preliminary segmentation boundary from the relatively low initial resolution to the relatively high resolution of RGB image 702 (or approximately that resolution). In such examples, engine 176 may scale up the preliminary segmentation boundary such that the determined preliminary segmentation boundary is represented with the higher number of pixels of the relatively high resolution of RGB image 702 (or approximately that resolution).
In some examples, the higher-resolution RGB image 702 may include greater may represent at least some portions of edge(s) of object 35 with more detail than either IR image 704 or depth image 706. As such, RGB image 702 may be used to improve the accuracy of the upsampled preliminary segmentation boundary for object 35 and thereby produce segmentation boundary 708. For example, refine engine 178 may refine the upsampled preliminary segmentation boundary based on RGB image 702 to obtain segmentation boundary 708. For example, engine 178 may filter or otherwise alter the upsampled preliminary segmentation boundary based on RGB image 702 in any suitable manner. As examples, engine 178 may utilize a bi-lateral filtering technique or a cross bi-lateral filtering technique to filter the upsampled preliminary segmentation boundary based on RGB image 702.
In other examples, segmentation engine 170 may determine segmentation boundary 708 in another manner. For example, after engine 170 geometrically aligns images 702, 704, and 706, upsample engine 176 may upsample the relatively low resolution IR and depth images 704 and 706 to the relatively high resolution of RGB image 702 (or approximately that resolution). In such examples, combine engine 172 may combine RGB image 702 and the upsampled IR and depth images 704 and 706 into a single vector image. In such examples, the vector image may comprise five components of information at each pixel, including IR intensity data from the upsampled IR image 704, distance data from the upsampled depth image 706, and each of red, blue, and green values from the RGB image 702. In such examples, boundary engine 174 may determine segmentation boundary 708 based on the vector image, in any manner as described above in relation to the preliminary segmentation boundary, but using all five components of information in the vector image of this example.
In examples described herein, segmentation engine 170 may determine segmentation boundary 708 based on RGB, IR, and depth images 702, 704, and 706, as described above, and independent of (e.g., without reference to) any prior image of object 35 captured by any of RGB, IR, and depth cameras 164A-164C. In such examples, segmentation engine 170 does not rely on tracking object 35 over time (i.e., based on images taken by any of cameras 164A-164C over time) to determine segmentation boundary 708. Additionally, as noted above projection assembly 184 may project visible image(s) on IR-absorbing surface 200 and object 35. In such examples, segmentation engine 170 may determine segmentation boundary 708 based on RGB, IR, and depth, images 702, 704, and 706 captured during the projection of the visible image(s).
In some examples, computing system 100 may omit at least one of cameras 164A-164C.
In the example of
In some examples, instructions 322 may acquire IR image 704 from IR camera 164B, acquire depth image 706 from depth camera 164C, and acquire RGB image 702 from RGB camera 164A. In examples described herein, instructions 322 may acquire the images from the camera actively (e.g., by retrieving, requesting, accessing, etc., the images) or passively (e.g., receiving the images, etc.).
In some examples, instructions 324 may geometrically align the images 702, 704, and 706 as described above prior to the functionalities described below in relation to computing device 350. In some examples, instructions 324 may determine a preliminary segmentation boundary for object 35 based on IR image data (e.g., IR intensity values of IR image 704) and depth image data (e.g., distance values of depth image 706), as described above in relation to engines 172 and 174. For example, instructions 342 may combine IR and depth images 704 and 706 into a single vector image (e.g., vector image 190 of
Instructions 326 may upsample the preliminary segmentation boundary to the resolution of RGB image 702, and instructions 328 may refine the upsampled preliminary segmentation boundary based on RGB image 702 to obtain a segmentation boundary 708 for object 35, as described above in relation to engines 176 and 178. As described above, in some examples, the two-dimensional pixel resolution of RGB image 702 is at least ten times greater than the two-dimensional pixel resolution of IR image 704 in each dimension, and at least ten times greater than the two-dimensional pixel resolution of depth image 706 in each dimension. In other examples, images 702, 704, and 706 may have respective resolutions according to any example described above in relation to
At 905 of method 900, IR camera 164B, disposed above and pointing at IR-absorbing surface 200 of computing system 100, may capture a low-resolution IR image 704 (see
At 920, engine 172 may combine IR image 704 and depth image 706 into a single vector image 190 comprising data from IR image 704 and from depth image 706 at each pixel, as described above. At 925, engine 174 may determine a preliminary segmentation boundary for object 35 based on vector image 190, as described above. At 930, engine 176 may upsample the preliminary segmentation boundary to the resolution of RGB image 702, as described above. At 935, engine 178 may refine the upsampled preliminary segmentation boundary based on RGB image 702, as described above, to determine a segmentation boundary 706 for object 35.
Although the flowchart of
At 1005 of method 1000, IR camera 164B, disposed above and pointing at IR-absorbing surface 200 of computing system 100, may capture a QVGA resolution IR image 704 (see
At 1020, engine 172 may combine IR image 704 and depth image 706 into a single vector image 190 comprising data from IR image 704 and from depth image 706 at each pixel, as described above. In some examples, prior to combining images 704 and 706, images 702, 704, and 706 may be geometrically aligned, as described above. At 1025, engine 174 may detect edges in vector image 190 where the presence of an edge is indicated by the either of the data from IR image 704 and the data from depth image 706. In such examples, engine 174 may determine a preliminary segmentation boundary for object 35 based on the detected edge(s). At 1030, engine 176 may upsample the preliminary segmentation boundary to the resolution of RGB image 702, as described above. At 1035, engine 178 may filter the upsampled segmentation boundary based on RGB image 702 to determine a refined segmentation boundary 708, as described above.
Although the flowchart of
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2013/061412 | 9/24/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2015/047225 | 4/2/2015 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6044165 | Perona et al. | Mar 2000 | A |
6246395 | Goyins et al. | Jun 2001 | B1 |
6633671 | Munich et al. | Oct 2003 | B2 |
7023536 | Zhang et al. | Apr 2006 | B2 |
7038846 | Mandella et al. | May 2006 | B2 |
7088440 | Buermann et al. | Aug 2006 | B2 |
7110100 | Buermann et al. | Sep 2006 | B2 |
7113270 | Buermann et al. | Sep 2006 | B2 |
7161664 | Buermann et al. | Jan 2007 | B2 |
7203384 | Carl et al. | Apr 2007 | B2 |
7268956 | Mandella et al. | Sep 2007 | B2 |
7433024 | Garcia et al. | Oct 2008 | B2 |
7474809 | Carl et al. | Jan 2009 | B2 |
7561146 | Hotelling | Jul 2009 | B1 |
7599561 | Wilson et al. | Oct 2009 | B2 |
7710391 | Bell et al. | May 2010 | B2 |
7729515 | Mandella et al. | Jun 2010 | B2 |
7826641 | Mandella et al. | Nov 2010 | B2 |
7834855 | Hotelling et al. | Nov 2010 | B2 |
7961909 | Mandella et al. | Jun 2011 | B2 |
8121640 | Russ et al. | Feb 2012 | B2 |
8134637 | Rossbach et al. | Mar 2012 | B2 |
8199117 | Izadi et al. | Jun 2012 | B2 |
8294047 | Westerman et al. | Oct 2012 | B2 |
8348747 | Arezina et al. | Jan 2013 | B2 |
8358815 | Benkley et al. | Jan 2013 | B2 |
8401225 | Newcombe et al. | Mar 2013 | B2 |
8736583 | Anderson et al. | May 2014 | B2 |
8897494 | Mandella et al. | Nov 2014 | B2 |
20020065121 | Fukunaga et al. | May 2002 | A1 |
20050078092 | Clapper | Apr 2005 | A1 |
20050168437 | Carl et al. | Aug 2005 | A1 |
20060139314 | Bell | Jun 2006 | A1 |
20070216894 | Garcia et al. | Sep 2007 | A1 |
20070268273 | Westerman et al. | Nov 2007 | A1 |
20070279494 | Aman et al. | Dec 2007 | A1 |
20080018591 | Pittel et al. | Jan 2008 | A1 |
20080196945 | Konstas | Aug 2008 | A1 |
20090129674 | Lin | May 2009 | A1 |
20090309838 | Adan et al. | Dec 2009 | A1 |
20100302376 | Boulanger et al. | Dec 2010 | A1 |
20110001903 | Kang | Jan 2011 | A1 |
20110227876 | Ilmonen | Sep 2011 | A1 |
20110227915 | Mandella et al. | Sep 2011 | A1 |
20110242054 | Tsu | Oct 2011 | A1 |
20110285910 | Bamji | Nov 2011 | A1 |
20110293179 | Dikmen et al. | Dec 2011 | A1 |
20120038549 | Mandella et al. | Feb 2012 | A1 |
20120127070 | Ryoo et al. | May 2012 | A1 |
20120262407 | Hinckley et al. | Oct 2012 | A1 |
20120320157 | Junuzovic et al. | Dec 2012 | A1 |
20120320158 | Junuzovic et al. | Dec 2012 | A1 |
20120327089 | Lee et al. | Dec 2012 | A1 |
20130050133 | Brakensiek et al. | Feb 2013 | A1 |
20130057515 | Wilson | Mar 2013 | A1 |
20130077236 | Becze et al. | Mar 2013 | A1 |
20130136358 | Dedhia et al. | May 2013 | A1 |
20130194418 | Gonzalez-Banos et al. | Aug 2013 | A1 |
20130222287 | Bae et al. | Aug 2013 | A1 |
20130230237 | Schlosser et al. | Sep 2013 | A1 |
20130234992 | Hodges et al. | Sep 2013 | A1 |
20130246861 | Colley et al. | Sep 2013 | A1 |
20130300659 | Kang et al. | Nov 2013 | A1 |
20140029788 | Kang | Jan 2014 | A1 |
20140168367 | Kang | Jun 2014 | A1 |
Number | Date | Country |
---|---|---|
1191714 | Mar 2005 | CN |
200913673 | Mar 2009 | TW |
201120684 | Jun 2011 | TW |
201222288 | Jun 2012 | TW |
201301081 | Jan 2013 | TW |
201314582 | Apr 2013 | TW |
WO-2010135809 | Dec 2010 | WO |
WO-2012041419 | Apr 2012 | WO |
WO-2012173001 | Feb 2015 | WO |
WO-2015016864 | Feb 2015 | WO |
WO-2015076811 | May 2015 | WO |
Entry |
---|
Agarwal et al., “High Precision Multi-touch Sensing on Surfaces using Overhead Cameras,” Tabletop'07 IEEE, 2007 ˜ 4 pages. |
Au et al., “Skeleton Extraction by Mesh Contraction,” 2008, Proceedings for SIGGAPH 2008, 10 pages. |
Bergh, M.V.D. et al., Combining RGB and ToF Cameras for Real-time 3D Hand Gesture Interaction, (Research Paper), Oct. 24, 2010, 7 pages. |
Choi et al., “Extraction of the Euclidean skeleton based on a connectivity criterion,” 2003, Pattern Recognition 36, No. 3, pp. 721-729. |
Chung et al., “MirrorTrack—A Real-Time Multiple Camera Approach for Multi-touch interactions on Glossy Display Surfaces,” 37th IEEE AIPR'08, pp. 1-5. |
Fisher et at, “Skeletonization/Medial Axis Transform,” Nov 26, 2012, <http://web.archive.org ˜ 6 pages. |
Gao, Rui et al; Microsoft Research—Mobile Surface; Microsoft Research; 2010; http://research.microsoft.com ˜ 1 page. |
Hand, Randall; Infinite Z Launches zSpace Virtual Holographic 3D Display for Designers; VizWorld.com; Dec. 13, 2011; 2 pages. |
Harrison, B et al; Bringing Toys to Life: Intel Labs OASIS Project; Augmented Engineering; Jan. 26, 2011; 1 page. |
Harrison, Chris et al; OmniTouch: Wearable Multitouch Interaction Everywhere; UIST'11; Oct. 16, 2011; 10 pages. |
Hartmann, Bjorn et al; Pictionaire: Supporting Collaborative Design Work by Integrating Physical and Digital Artifacts; CSCW 2010; Feb. 6, 2010 ˜ 4 pages. |
Hinckley, Ken et al; Pen + Touch = New Tools; UIST'10; Oct. 3, 2010 ˜ 10 pages. |
Junuzovic, Sasa et al; Microsoft Research—IllumiShare; Microsoft Research 2012; http://delivery.acm.org ˜ 2 pages. |
Kane, Shaun K. et al; Bonfire: A Nomadic System for Hybrid Laptop—Tabletop Interaction; UIST'09; Oct. 4, 2009 ˜ 10 pages. |
Katz, I. et al., A Multi-touch Surface Using Multiple Cameras, (Research Paper), Jun. 3, 2007, http://wsnl2.stanford.edu ˜ 12 pages. |
Linder, Natan et al; LuminAR: Portable Robotic Augmented Reality Interface Design and Prototype; UIST'10, Oct. 3, 2010; 2 pages. |
Litomisky, K., Consumer RGB-D Cameras and Their Applications, (Research Paper), Jul. 13, 2012 ˜ 20 pages. |
Melanson, Donald; Microsoft Research Working on Portable Surface; Mar. 2, 2010; http://www.engadget.com ˜ 2 pages. |
Melanson, Donald; Wiimote Repurposed for Multi-Point Interactive Whiteboard; Dec. 10, 2007; http://www.engadget.com ˜ 2 pages. |
Salamati et al., “Semantic image Segmentation Using Visible and Near-Infrared Channels,” Jan. 2012, Computer Vision—ECCV 2012, pp. 461-471. |
Sato et at, “TEASAR: Tree-structure Extraction Algorithm for Accurate and Robust Skeletons,” 2000, Proc 8th Pacific Conf Computer Graphics and Applications, 6 pages. |
Shahram et al., “C-Slate: A Multi-Touch and Object Recognition System for Remote Collaboration using Horizontal Surfaces,” IEEE International Workshop, 2007, 8 pgs. |
Sidik et al., A Study on Natural Interaction for Human Body Motion Using Depth Image Data, (Research Paper), May 15-16, 2011, pp. 97-102. |
Simonite, Tom; A Kitchen Countertop With a Brain; MIT Technology Review; Jul. 2, 2010; ˜ 2 pages. |
SoftKinetic, “DS525 Datasheet,” Short Range Module, Mar. 8, 2013, <http://www.tiii.be ˜ 2 pages. |
Sturm, J. et al., “Towards a benchmark for RGB-D SLAM evaluation,” Jun. 2011, RGB-D Workshop ˜RSS ˜ 2 pages. |
Van den Bergh et al., “Haarlet-based hand gesture recognition for 3D interaction,” IEEE WACV 2009, Dec. 2009 ˜ 9 pages. |
Westerman, W., Hand Tracking, Finger Identification, and Chordic Manipulation on a Multi-touch Surface, (Research Paper), Sep. 2, 2003 ˜ First 30 pages of 363 pages. |
Wikipedia, “Sensor fusion,” Jul. 31, 2013, <https://en.wikipedia.org ˜ 4 pages. |
Wikipedia, “Leap Motion,” Aug. 15, 2013, retrieved from: <http://en.wikipedia.org/wiki/Leap_Motion> ˜ 3 pages. |
Wilson, Andrew D. et al; Combining Multiple Depth Cameras and Projectors for Interactions on, Above, and Between Surfaces; UIST'10; Oct. 3, 2010; 10 pages. |
Wilson, Andrew D.; Using a Depth Camera as a Touch Sensor; ITS 2010: Devices & Algorithms; Nov. 7, 2010 ˜ 4 pages. |
Hanping, Mao et al., “Image segmentation method based on multi-spectral image fusion and morphology reconstruction”, Jun. 2008, Transactions of the CSAE, vol. 24, No. 6. |
Number | Date | Country | |
---|---|---|---|
20160231866 A1 | Aug 2016 | US |