Two-dimensional and three-dimensional scanning technology allows for the digital capture or acquisition of the shape, contours, and other features of a physical object. For example, in a three-dimensional scanning application, various hardware and software may be used to capture an object such as a user's hand to display the object on a device or monitor.
In a computing system, a projector may be employed as both a display source and an illumination source. For example, a projector may project text, documents, images, video, or other digital media onto a surface and also illuminate an object, e.g., a hand, on or near the surface. In the case where the object is to be captured by, e.g., a camera, 3D sensor, infrared sensor, or other sensor, the object may have text or images superimposed on it, preventing a clean capture of the object by the sensor. In such cases, the projector does not optimally serve the dual purposes of a display source and an illumination source when digitally capturing an object.
As one example, in the case of a digital presentation or digital slide projected onto a surface where a user's hand is present or hovering between the projector and the surface, a digital capture of the hand would capture the slide and the hand, including the slide superimposed on the hand. In such examples, aside from preventing a clean capture of the hand, the projector already has a digital image of the slide it is projecting and may not need to capture it using a 3D sensor.
According to one example for outputting image data, an image comprising a surface and an object is captured on a sensor. An object mask based on the captured image is created on a processor. A first composite image based on the object mask and a source content file is created. In an example, the first composite image is projected onto the surface.
In some examples, mat 112 and display 110 may display source content 114 and 120. In one example, the source content may be a digital presentation slide stored on or accessible by device 100. Source content 114 may be projected by the projector 108, which may be cantilevered over mat 112, while source content 120 may be displayed on display 110.
Device 100 may be used to capture or scan an object such as the hand 116 of user 102 within the field of view of a sensor 106, including a device in a user's hand such as a stylus 118 (referred to collectively herein as “object 116”). In some examples, a real-time representation 122 of the object 116 being captured or scanned may be displayed on display 110 as, e.g., part of source content 120 or overlaid on source content 120.
Device 100 in general may comprise any suitable computing device such as a desktop computer, laptop computer, notebook, netbook, all-in-one computer, tablet, or smartphone capable of interfacing with at least one sensor. In some examples, the housing of display 110 may comprise a processor, memory, and/or storage for use in device 100.
Display 110 may also be supported by a support structure (not shown), which may include a base and an upright member. The support structure may support the weight of display 110, as well as sensor cluster 104 and sensors 106, which may be cantilevered such that sensors 106 hover over mat 112. In the example shown in
In examples where mat 112 is touch-sensitive, mat 112 may comprise any suitable touch-sensitive technology for detecting and tracking one or multiple touch inputs by a user in order to allow the user to interact with software being executed by device 100 or some other computing device (not shown). For example, mat 112 may utilize known touch-sensitive technologies such as resistive, capacitive, acoustic wave, infrared, strain gauge, optical, acoustic pulse recognition, or some combination thereof.
In some examples, mat 112 and display 110, and/or a processor, memory, and storage of device 100, are electrically coupled to one another such that user inputs received by mat 112 are communicated to display 110 and/or the processor, memory, and storage housed in display 110 or external to display 110. Any suitable wireless or wired electrical coupling or connection may be used between mat 112 and display 110, or generally within device 100.
Sensor cluster 104 may comprise one or more sensors 106 and/or one or more illumination sources, such as a projector 108. Projector 108 may comprise any suitable digital light projector assembly for receiving data from a computing device and projecting an image or images that correspond with that input data. In some examples, projector 108 comprises a digital light processing (DLP) projector or a liquid crystal on silicon (LCoS) projector, which are advantageously compact and power-efficient projection engines capable of multiple display resolutions and sizes. Projector 108 may be electrically coupled to device 100 in order to receive data therefrom for producing light and images during operation, such as through an electric conductor, WI-FI, BLUETOOTH®, NFC, an optical connection, an ultrasonic connection, or some combination thereof.
In some examples, sensor cluster 104 may also comprise a folding mirror, which may be a highly reflective surface disposed along a bottom surface of sensor cluster 104 and/or positioned to reflect images and/or light projected from projector 108 toward mat 112. The mirror may be a standard front surface vacuum metalized aluminum coated glass mirror. In other examples, the mirror may have a complex aspherical curvature to act as a reflective lens element to provide additional focusing power or optical correction. The mirror may allow projector 108 to be located off of the sensor cluster 104, with light reflected from the projector 108 onto the mirror of sensor cluster 104, such that illumination will appear to user 102 to be coming from the sensor cluster 104.
Sensors 106 may include a plurality of sensors and/or cameras to measure and/or detect various parameters. For example, sensors 106 may comprise at least one of a camera, an ambient light sensor, a depth sensor, and a three-dimensional (3D) user interface sensor.
More specifically, in an example, a camera 106 may be a high-resolution camera, a low-resolution camera, an infrared (“IR”) camera, or other camera type. Camera 106 may be arranged to capture a still image or a video of an object or other items disposed on mat 112 or generally below sensors 106, including projected content.
In an example, an ambient light sensor 106 may be arranged to measure the intensity of light of the environment surrounding device 100, in order to, in some examples, adjust exposure settings of another sensor in sensor cluster 104, and/or adjust the intensity of the light emitted from other sources throughout the device such as, for example, projector 108 or display 110.
In an example, a depth sensor 106 may indicate when a 3D object is on a work surface, such as on mat 112 or, in other examples, a table or other surface suitable for scanning. In particular, depth sensor 106 may sense or detect the presence, shape, contours, motion, and/or the 3D depth of an object, or specific feature(s) of an object. Thus, in some examples, depth sensor 106 may employ any suitable sensor or camera arrangement to sense and detect a 3D object and/or the depth values of each pixel, whether infrared, color, or other, disposed in the sensor's field-of-view. In some examples, depth sensor 106 may comprise a single IR camera sensor with a uniform flood of IR light, a dual IR camera sensor with a uniform flood of IR light, structured light depth sensor technology, time-of-flight (TOF) depth sensor technology, or some combination thereof.
In an example, a user interface sensor 106 may comprise any suitable device or devices (e.g., sensor or camera) for tracking a user input device such as, for example, a hand, stylus, or pointing device.
In various examples, sensor cluster 104 may comprise other sensors and/or cameras either in lieu of or in addition to the sensors described above, and/or in different configurations, such as for use with a desktop, tablet, or smartphone.
Sensors 106 in sensor cluster 104, or any sensors accessible by device 100 in general, may be electrically and communicatively coupled to one another and/or device 100 or components of device 100 such that data generated within sensor cluster 104 may be transmitted to device 100, and commands issued by device 100 may be communicated to the sensors 106 through any suitable electrical and/or communicative coupling.
In block 302, an image is captured by, e.g., a sensor 106. The captured image may represent the projected source content with an object such as a user's hand or a stylus on or on top of the projected source content, as shown in the example of
In block 304, an object mask is created, as described below in more detail with respect to
In block 306, a composite image is created. In an example, the object mask and the source content are combined, e.g., via compensation or a compositor, to create a first composite image, as shown in the example of
In the example of block 308, the first composite image is projected, e.g., onto mat 112 or other surface, such that the object mask prevents source content from being projected on the object 116, e.g., a user's hand or stylus.
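As an illustration only, the compositing of block 306 might be sketched as follows. The sketch assumes that the object mask is already registered to the projector's pixel grid and that masked pixels are replaced with a single fill color; the names `composite_for_projection`, `fill`, `slide`, and `hand_mask` are hypothetical and do not appear in the disclosure. Plain Python lists are used to keep the sketch self-contained, whereas a practical implementation would likely use an array library.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]   # (R, G, B), each 0-255
Image = List[List[Pixel]]      # rows of pixels
Mask = List[List[bool]]        # True where an object was detected


def composite_for_projection(source: Image, mask: Mask,
                             fill: Pixel = (0, 0, 0)) -> Image:
    """Return a copy of `source` in which masked pixels are replaced by `fill`.

    The fill color is an assumption: black projects no content onto the
    object, while a neutral color would keep the object illuminated.
    """
    if len(source) != len(mask) or any(
            len(src_row) != len(mask_row)
            for src_row, mask_row in zip(source, mask)):
        raise ValueError("source and mask must have the same dimensions")
    return [
        [fill if is_object else pixel
         for pixel, is_object in zip(src_row, mask_row)]
        for src_row, mask_row in zip(source, mask)
    ]


# Usage: a 2x3 "slide" with an object covering the middle column.
slide = [[(200, 10, 10)] * 3 for _ in range(2)]
hand_mask = [[False, True, False],
             [False, True, False]]
first_composite = composite_for_projection(slide, hand_mask)
```

In this sketch the source content is left unmodified and a new image is returned, so the same source content file could be recomposited with an updated mask on a later pass.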
In some examples, the flow of
In some examples, at least one element of the re-captured image such as a user's hand may be displayed on a display, e.g., display 110 as shown in
In some examples, multiple objects, e.g., multiple hands, may be captured or re-captured, such that the image output to, e.g., a digital file, contains more than one object, including in collaboration use-cases where users may be remotely located.
In some examples, device 100 may comprise a real-time camera with a 3D shape of the image captured by the camera including at least one object, e.g., the user's hand on, or on top of, the projected source content. In some examples, the surface on which the source content is projected, e.g., mat 112 or other surface, may be known or assumed to be flat, such that a 3D depth map collected by the camera may be used to determine which parts of the capture do not fit the plane, e.g., which parts of the scene represent the object or hand. In some examples, a plane fitting algorithm may be used to determine which parts do not fit the plane, and in some examples, the plane fitting algorithm may reject or exclude outliers.
In block 402A, the 3D shape/depth map or other sensor information is accessed, and in block 404A, in an example, a plane fitting algorithm with outlier rejection is applied to the map.
In block 406A, the location or map of an object or objects within the captured image or scene is determined based on the 3D shape and plane fitting with outlier rejection algorithm. In some examples, other algorithms may be used to determine the location of objects.
In block 408A, an initial object mask is created based on the location/map of the object determined in block 406A. The initial object mask may be used as an input to, e.g., block 306 discussed above to create a composite image of the source content and object mask, or may be used as an input to, e.g., block 802 discussed below if the object mask is to be further refined.
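One possible reading of blocks 402A through 408A is sketched below, purely as an assumption. RANSAC is used here as one well-known form of plane fitting with outlier rejection; the disclosure does not name a specific algorithm. The depth map is assumed to be a grid of distances from the sensor, and the function names, the `tolerance` parameter, and the sample data are hypothetical.

```python
import random
from typing import List, Optional, Tuple

Point = Tuple[float, float, float]
Plane = Tuple[float, float, float, float]  # unit normal (a, b, c) and offset d


def plane_through(p1: Point, p2: Point, p3: Point) -> Optional[Plane]:
    """Plane through three points, or None if they are (nearly) collinear."""
    ux, uy, uz = p2[0] - p1[0], p2[1] - p1[1], p2[2] - p1[2]
    vx, vy, vz = p3[0] - p1[0], p3[1] - p1[1], p3[2] - p1[2]
    a, b, c = uy * vz - uz * vy, uz * vx - ux * vz, ux * vy - uy * vx
    norm = (a * a + b * b + c * c) ** 0.5
    if norm < 1e-9:
        return None
    a, b, c = a / norm, b / norm, c / norm
    return (a, b, c, -(a * p1[0] + b * p1[1] + c * p1[2]))


def distance_to_plane(plane: Plane, p: Point) -> float:
    a, b, c, d = plane
    return abs(a * p[0] + b * p[1] + c * p[2] + d)


def fit_plane_ransac(points: List[Point], tolerance: float,
                     iterations: int = 200, seed: int = 0) -> Plane:
    """Keep the candidate plane that has the most points within `tolerance`."""
    rng = random.Random(seed)
    best_plane, best_count = None, -1
    for _ in range(iterations):
        plane = plane_through(*rng.sample(points, 3))
        if plane is None:
            continue
        count = sum(1 for p in points
                    if distance_to_plane(plane, p) <= tolerance)
        if count > best_count:
            best_plane, best_count = plane, count
    if best_plane is None:
        raise ValueError("could not fit a plane to the depth map")
    return best_plane


def initial_mask_from_depth(depth_map: List[List[float]],
                            tolerance: float = 0.5) -> List[List[bool]]:
    """Mark pixels whose depth does not fit the dominant plane."""
    points = [(float(x), float(y), float(z))
              for y, row in enumerate(depth_map) for x, z in enumerate(row)]
    plane = fit_plane_ransac(points, tolerance)
    return [[distance_to_plane(plane, (float(x), float(y), float(z))) > tolerance
             for x, z in enumerate(row)]
            for y, row in enumerate(depth_map)]


# Usage: a flat 6x6 surface at depth 100 with a 2x2 object 10 units closer.
depth = [[100.0] * 6 for _ in range(6)]
for y in (2, 3):
    for x in (2, 3):
        depth[y][x] = 90.0
depth_mask = initial_mask_from_depth(depth)
```

The sketch relies on the surface covering most of the field of view, so that the plane with the most inliers is the surface itself and the object points are rejected as outliers.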
In some examples, device 100 may compare the captured image with the source content to determine, e.g., where an object or hand is located based on a difference between the captured image and the source content, e.g., based on a color difference.
In block 402B, in an example, the source content file is accessed, and in block 404B, the captured image is accessed, e.g., as captured by block 302. In block 406B, the source content is compared to the captured image. Various algorithms, such as algorithms related to image change and/or 3D segmentation, may be applied.
In block 408B, an initial object mask is created based on the comparison of the source content to the captured image in block 406B. As discussed above with respect to
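The comparison of blocks 402B through 408B could take many forms; a minimal sketch based on a per-pixel color difference follows. It assumes the captured image has already been aligned to the source content so that pixels correspond one to one, and it uses Euclidean distance in RGB with a fixed threshold. The function names, the `threshold` value, and the sample pixels are hypothetical.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]   # (R, G, B), each 0-255
Image = List[List[Pixel]]


def color_distance(p: Pixel, q: Pixel) -> float:
    """Euclidean distance between two RGB pixels."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5


def initial_mask_from_difference(source: Image, captured: Image,
                                 threshold: float = 60.0) -> List[List[bool]]:
    """Mark pixels where the captured image departs from the source content."""
    if len(source) != len(captured) or any(
            len(s_row) != len(c_row)
            for s_row, c_row in zip(source, captured)):
        raise ValueError("source and captured image must have the same dimensions")
    return [
        [color_distance(s, c) > threshold for s, c in zip(s_row, c_row)]
        for s_row, c_row in zip(source, captured)
    ]


# Usage: a white slide captured slightly dimmer, with one hand-colored pixel.
source_image = [[(255, 255, 255)] * 3 for _ in range(3)]
captured_image = [[(240, 240, 240)] * 3 for _ in range(3)]
captured_image[1][1] = (200, 150, 120)
difference_mask = initial_mask_from_difference(source_image, captured_image)
```

The threshold tolerates the small, uniform dimming in the captured image while still flagging the pixel that differs strongly in color. A fixed threshold is a simplification; lighting and camera response would likely need to be accounted for in practice.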
In various examples, the methods of
Sensors 106, e.g., a camera or 3D sensor, may be used to capture an image within the field-of-view of the sensor 106. In some examples, the captured image may represent the projected source content 502, and an object or objects 504 such as the user's hand and/or stylus. The capture of
In some examples, the captured image may be captured by more than one sensor and fused or otherwise combined, and/or the object mask may be created by use of inputs from more than one sensor. For example, as mentioned above with respect to
In the example of
In some examples, an initial object mask, as discussed below with respect to
In an example, in block 802, an initial object mask, e.g., 904A, is accessed, and in block 804, a captured image is accessed. A filter, e.g., a bilateral cross filter, may be applied in block 806, and a refined object mask, e.g., 904B, may be output in block 808. The refined object mask may be used as input to, e.g., block 306 discussed above.
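The refinement of blocks 802 through 808 might look like the following sketch. It interprets the "bilateral cross filter" as a joint (cross) bilateral filter, in which the mask is smoothed using weights taken from the captured image so that mask edges move toward intensity edges in the capture. That interpretation, the grayscale guide image, and all names and parameter values are assumptions.

```python
import math
from typing import List


def refine_mask_cross_bilateral(mask: List[List[bool]],
                                guide: List[List[float]],
                                radius: int = 2,
                                sigma_space: float = 2.0,
                                sigma_range: float = 20.0,
                                threshold: float = 0.5) -> List[List[bool]]:
    """Smooth `mask` with weights from pixel distance and guide similarity."""
    height, width = len(mask), len(mask[0])
    refined = []
    for y in range(height):
        row = []
        for x in range(width):
            total = weight_sum = 0.0
            for ny in range(max(0, y - radius), min(height, y + radius + 1)):
                for nx in range(max(0, x - radius), min(width, x + radius + 1)):
                    # Spatial term: nearby pixels count more.
                    spatial = ((ny - y) ** 2 + (nx - x) ** 2) / (2 * sigma_space ** 2)
                    # Range term: pixels that look alike in the capture count more.
                    tonal = (guide[ny][nx] - guide[y][x]) ** 2 / (2 * sigma_range ** 2)
                    weight = math.exp(-(spatial + tonal))
                    total += weight * (1.0 if mask[ny][nx] else 0.0)
                    weight_sum += weight
            # weight_sum >= 1 because the center pixel always has weight 1.
            row.append(total / weight_sum > threshold)
        refined.append(row)
    return refined


# Usage: a dark object fills columns 0-2 of the capture, but the initial
# mask stops one column short of the true edge.
guide_image = [[50.0, 50.0, 50.0, 200.0, 200.0] for _ in range(5)]
initial_mask = [[True, True, False, False, False] for _ in range(5)]
refined_mask = refine_mask_cross_bilateral(initial_mask, guide_image)
```

In this usage the refined mask extends to column 2, where the capture shows the object's actual edge, while the bright background columns remain unmasked.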
The above discussion is meant to be illustrative of the principles and various embodiments of the present invention. Numerous variations and modifications will become apparent to those skilled in the art once the above disclosure is fully appreciated. It is intended that the following claims be interpreted to embrace all such variations and modifications.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2014/065465 | 11/13/2014 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2016/076874 | 5/19/2016 | WO | A |
Number | Date | Country | |
---|---|---|---|
20170316594 A1 | Nov 2017 | US |