There are a variety of conventional displays that offer an interactive experience supported by a computing system. Computer displays, for example, display images, which often have visualizations of controls embedded within the image. The user may provide user input by interacting with these controls using a keyboard, mouse, controller, or another input device. The computing system receives that input, and in some cases affects the state of the computing system, and further in some cases, affects what is displayed.
In some cases, the computer display itself acts as an input device using touch or proximity sensing on the display. Such will be referred to herein as “touch” displays. There are even now touch displays that can receive user input from multiple touches simultaneously. When the user touches the display, that event is fed to the computing system, which processes the event, and makes any appropriate change in computing system state and potentially the displayed state. Such displays have become popular as they give the user intuitive control over the computing system at literally the touch of the finger.
For instance, touch displays are often mechanically incorporated into mobile devices such as tablet device or smartphone, which essentially operate as a miniature computing system. That way, the footprint dedicated for input on the mobile device may be smaller, and even perhaps absent altogether, while still allowing the user to provide input. As such, mobile devices are preferably small and the display area is often also quite small.
Embodiments described herein relate to the ability to interact with different projected images that are pre-edited so that when projected, the image is better suited for viewing from a particular perspective. Thus, a variety of images might be projected such that some are suitable for one perspective, some are suitable for another perspective, and so forth. For instance, one image might be edited so that when projected, the projected first image is presented for better viewing from a first perspective. Another image might be edited so that when projected, the projected second image is presented for better viewing from a second perspective.
This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
In order to describe the manner in which the above-recited and other advantages and features can be obtained, a more particular description of various embodiments will be rendered by reference to the appended drawings. Understanding that these drawings depict only sample embodiments and are not therefore to be considered to be limiting of the scope of the invention, the embodiments will be described and explained with additional specificity and detail through the use of the accompanying drawings in which:
The principles described herein relate to the projection of interactive images such that different images are pre-edited so that when projected, the image is better suited for viewing from a particular perspective. Thus, a variety of images might be projected such that some are suitable for one perspective, some are suitable for another perspective, and so forth. For instance, one image might be edited so that when projected, the projected first image is presented for better viewing from a first perspective. Another image might be edited so that when projected, the projected second image is presented for better viewing from a second perspective.
The projected images 140 are each projected on the same surface, which is positioned within the field of projection 145 of the projection system 120. Although the images 140 are illustrated as different images, some or all of the images 140 might be based on the same image, but with different pre-editing to allow for better viewing from respective different perspectives. Some or all of the projected images 140 might be projected at the same time, and some or all of the projected images 140 might be projected one after the other. Although the projected images 140 are illustrated one over the other in
The system 100 may perform the method 200 so as to make each of the projected images more suitable for one perspective than for other perspectives. For instance, the system 100 causes the image 141 to be more suitable for viewing from perspective 151 (abstractly represented as a circle) than from perspective 152 (abstractly represented as a square). The system 100 causes the image 142 to be more suitable for viewing from perspective 152 than from perspective 151. Also, the system causes the image 143 to be more suitable for viewing from perspective 151 than from perspective 152. Thus, one or more of the images projected by the projection system 130 have a first perspective as the best perspective (“best” meaning out of the possible perspectives that the projection system 130 may aim for optimizing), one or more of the projected images may have a second perspective as the best perspective, and so on, for possible other numbers of perspectives.
Referring to
The controller 110 may perform its functions by using hardware, firmware, software, or a combination thereof. In one embodiment, in which the controller 110 uses software, the controller 110 may be a computing system. Accordingly, a basic structure of a computing system will now be described with respect to the computing system 300 of
As illustrated in
In the description that follows, embodiments are described with reference to acts that are performed by one or more computing systems. If such acts are implemented in software, one or more processors of the associated computing system that performs the act direct the operation of the computing system in response to having executed computer-executable instructions. For example, such computer-executable instructions may be embodied on one or more computer-readable media that form a computer program product. An example of such an operation involves the manipulation of data. The computer-executable instructions (and the manipulated data) may be stored in the memory 304 of the computing system 300. Computing system 300 may also contain communication channels 308 that allow the computing system 300 to communicate with other message processors over, for example, network 310.
Embodiments described herein may comprise or utilize a special purpose or general-purpose computer including computer hardware, such as, for example, one or more processors and system memory, as discussed in greater detail below. Embodiments described herein also include physical and other computer-readable media for carrying or storing computer-executable instructions and/or data structures. Such computer-readable media can be any available media that can be accessed by a general purpose or special purpose computer system. Computer-readable media that store computer-executable instructions are physical storage media. Computer-readable media that carry computer-executable instructions are transmission media. Thus, by way of example, and not limitation, embodiments of the invention can comprise at least two distinctly different kinds of computer-readable media: computer storage media and transmission media.
Computer storage media includes RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other tangible medium which can be used to store desired program code means in the form of computer-executable instructions or data structures and which can be accessed by a general purpose or special purpose computer.
A “network” is defined as one or more data links that enable the transport of electronic data between computer systems and/or modules and/or other electronic devices. When information is transferred or provided over a network or another communications connection (either hardwired, wireless, or a combination of hardwired or wireless) to a computer, the computer properly views the connection as a transmission medium. Transmissions media can include a network and/or data links which can be used to carry desired program code means in the form of computer-executable instructions or data structures and which can be accessed by a general purpose or special purpose computer. Combinations of the above should also be included within the scope of computer-readable media.
Further, upon reaching various computer system components, program code means in the form of computer-executable instructions or data structures can be transferred automatically from transmission media to computer storage media (or vice versa). For example, computer-executable instructions or data structures received over a network or data link can be buffered in RAM within a network interface module (e.g., a “NIC”), and then eventually transferred to computer system RAM and/or to less volatile computer storage media at a computer system. Thus, it should be understood that computer storage media can be included in computer system components that also (or even primarily) utilize transmission media.
Computer-executable instructions comprise, for example, instructions and data which, when executed at a processor, cause a general purpose computer, special purpose computer, or special purpose processing device to perform a certain function or group of functions. The computer executable instructions may be, for example, binaries, intermediate format instructions such as assembly language, or even source code. Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the described features or acts described above. Rather, the described features and acts are disclosed as example forms of implementing the claims.
Those skilled in the art will appreciate that the invention may be practiced in network computing environments with many types of computer system configurations, including, personal computers, desktop computers, laptop computers, message processors, hand-held devices, multi-processor systems, microprocessor-based or programmable consumer electronics, network PCs, minicomputers, mainframe computers, mobile telephones, PDAs, pagers, routers, switches, and the like. The invention may also be practiced in distributed system environments where local and remote computer systems, which are linked (either by hardwired data links, wireless data links, or by a combination of hardwired and wireless data links) through a network, both perform tasks. In a distributed system environment, program modules may be located in both local and remote memory storage devices.
The method 200 of
For instance, in the context of
Several examples of pre-editing for specific perspectives will now be described. The nature of the perspectives that the controller 110 pre-edits images for may depend on the projection system 100 and its deployment.
Keystoning
For instance, in an embodiment described further below, the image is projected onto a surface that is not perpendicular to the direction of projection. An example of this might be if the system 100 is an accessory to an image generation device, such as a smart phone, which assessor actually sits on the same surface as the surface onto which the accessory is projecting. In this case, keystoning, may occur. For instance, the width of the projected image will increase the further away the surface is from the projection source, thus resulting in a trapezoid-like shape, or a keystone-line shape.
Suppose now that there are three users that sit around a table, which is the surface onto which the projection is occurring. The keystoning will be observed differently for each of those sitting around the table. Accordingly, the pre-editing of the image may reduce the effects of keystoning when taking into consideration which of the three users is prioritized for viewing the particular image. For instance, if this were a game, in which the three were taking turns in the game, the image might be optimized for the user whose turn it presently is within the game. Thus, for one user who is viewing the projected surface from one angle, the image may be edited in a manner in which keystoning is reduced when viewed from that angle. For another user who is viewing the projected surface from another angle, the image may be edited in a manner in which keystoning is reduced when viewing from that angle, and so forth.
Object Adornment
Another example of pre-editing may be performed in response to the detection of an object within a field of projection of the projected image. For instance, suppose that a human hand of a user is inserted into the field of projection of the image. Upon detecting this, the image may be pre-edited such that the portion of the image corresponding to a location of the detected object is modified.
As an example, the image could be modified so as to colorize the object that has been placed into the field of projection. For instance, a hand coming in from one side of the projection might be colorized blue, a hand coming in from another side of the projection might be colorized red, and a hand coming in from yet another side of the projection might be colorized green. This may be accomplished by editing that portion of the image which emits upon the object such that the portion is of the color that the object is to be colorized. Of course, the object is not limited to a human hand, but might include other objects as well, such as a game piece.
As a further example, rather than only colorize the object inserted into the field of projection, one or more controls may be emitted onto the object inserted into the projection. Such may be accomplished by pre-editing the image to include one or more controls corresponding to the portion of the image that emits on the inserted object. The user might interact with the controls on the inserted object to thereby cause data representing a user input event to be captured by the camera system. For instance, in the case of a human hand, if the user were to insert their hand into the field of projection with the hand open and palm facing down, the image might be pre-edited so that each finger is adorned with a projected control. The control might be activated by, for example, bending that finger.
Transparency Emulation
The object inserted into the field of view may also be made to appear transparent to a particular user from a particular point of view by using pre-editing of the image. This might be accomplished by modifying the image during pre-editing such that the detected object has displayed thereon image data that is obscured by the detected object from the particular perspective. For instance, suppose that a game board is being displayed, and that a user inserts his hand into the field of projection. The image might then be pre-edited so that those portions of the projected game board that the user cannot see due to the presence of the hand, are instead projected on the hand itself. If done well enough, it will appear to the user that the user's hand goes in and out of existence when inserted into the field of projection.
In other embodiments, perhaps most of the object is emulated transparent in this way, but there might be instances in which it is desirable to have one or more portions of the object not be transparent. For instance, suppose the user preferences indicate that the user uses her index finger to do touch events on the surface on which the image is projected. In that case, perhaps all of the hand is emulated as transparent, except for the last inch of the index finger of the user. This allows the user to see what they are selecting, and understand where their selecting finger is, while still allowing the projection to appear to emit through the remainder of the hand.
Multiple Projectors
In some embodiments, there might be multiple projectors in the projections system 120, each projecting from a different angle and having a different field of projection, but still projecting the same image so that the fields of projection converge on the surface. In this case, if an object is detected in either or both of the fields of projection, then the copy of the image to be projected for one or both of the projectors might be blanked out in the area corresponding to the inserted object so as to not provide non-convergent versions of the image on the detected object. Of course, as described above, one of the copies of the image might be edited to perform colorization or adornment of the detected object also. The projectors may be positioned so as to reduce shadowing caused by objects inserted into the field of projection. For instance, a shadow created by the object in a first field of projection may be covered by a second field of projection of the image.
Each of the projected images might also be a three-dimensional image such that a portion of the frames of the corresponding image are to be viewed by a left eye of the corresponding user through the corresponding shuttering system, and such that a portion of the frames of the corresponding image are to be viewed by a right eye of the corresponding user through the corresponding shuttering system. For instance, the following Table 1 represents how the frames could be projected, and how the shuttering system would work to present three (3) three-dimensional images to corresponding three (3) users in which each three-dimensional frame is refreshed every 1/60 seconds.
In Table 1, the first six rows represent the projection and viewing by the respective user of the first frame of each of the three-dimensional images. The last six rows represent the projection and viewing by the respective user of the second frame of each of the three-dimensional images. A “Yes” for the first frame (and a “YES” for the second frame) represents that during this time frame, the particular image for the particular eye of the respective user is being projected, and thus the particular shutter for that particular eye of the respective user is open. The shutter of the other eye for that respective user, and all shutters for all of the other users are closed (as represented by the corresponding column being blank for that time frame). In this manner, three individuals can see entirely different three-dimensional images being projected on a surface. Of course, this principle might extend to any number of users and any number of projected images. Furthermore, some of the images might be two-dimensional for one or more of the users, and some of the images might be three-dimensional for one or more of the users. Whether or not something presents in two-dimensions or three dimensions might be a user preference.
The shuttering system described above allows different users to see different images entirely. The shuttering system also allows the same image to be viewed by all, but perhaps with a customized “fog of war” placed upon each image suitable for the appropriate state. For instance, one image might involve removing image data from one portion of the image (e.g., a portion of a game terrain that the user has not yet explored), while one image might involve removing image data from another portion of the image (e.g., a portion of the same game terrain that the other user has not yet explored).
Accordingly, the principles described herein allow for complex interactive projection of one or more images onto a surface. In one embodiment, the system 100 is an accessory to another image generation device.
A communication interface is provided between the image generation device 501 and the accessory 510. For instance, the accessory 510 includes input communication interface 511 that receives communications (as represented by arrow 521) from the image generation device 501, and an output communication interface 512 that provides communications (as represented by arrow 522) to the image generation device 501. The communication interfaces 511 and 512 may be wholly or partially implemented through a bi-directional communication interface though not required. Examples of communication interfaces include wireless interfaces, such as provided by 802.xx wireless protocols, or by close proximity wireless interface such as BLUETOOTH®. Examples of wired communication interface include USB and HDMI. However, the principles described herein are not limited to these interfaces, nor are they limited to whether or not such interfaces now exist, or whether they are developed in the future.
An image generation device accessory 600 also includes a processing module 610 that includes a post-processing module 611 that receives the input image as represented by arrow 642. The processing module 610 is an example of the controller 110 of
The projector system 612 projects (as represented by arrow 644) at least the derived image of the input image onto a surface 620. The projector system 612 is an example of the projection system 120 of
In the case of projecting on the same surface on which the accessory sits, there might be some post-processing of the input image to compensate for expected distortions, such as keystoning, when projecting at an acute angle onto a surface. Furthermore, although not required, the projector might include some lensing to avoid blurring at the top and bottom portions of the projected image. Alternatively, a laser projector might be used to avoid such blurring when projecting on a non-perpendicular surface.
Returning to
The control visualizations may perhaps both be generated within the original input image. Alternatively, one or both of the control visualizations may perhaps be generated by the post-processing module 611 (hereinafter called “inserted control visualization”). For instance, the inserted control visualizations might include a keyboard, or perhaps controls for the projection system 612. The inserted control visualizations might also be mapped to control visualizations provided in the original input image such that activation of the inserted control visualization results in a corresponding activation of the original control visualization within the original image.
The accessory 600 also includes a camera system 621 for capturing data (as represented by arrow 551) representing user interaction with the projected image. The camera system 621 is an example of the camera system 130 of
If the control visualization that the user interfaced with was one of the control visualizations in the original input image, or does not correspond to a control that the processing system 610 itself handles, the detection mechanism 622 sends (as represented by arrow 653) the input event to the output communication interface 602 for communication (as represented by arrow 654) to the image generation device.
In particular, the accessory receives an input image from the image generation device (act 701). This is represented by arrow 641 leading into input communication interface 601 in
The camera system captures data representing user interface with the projected image (act 901). For instance, the camera system might capture such data periodically, such as perhaps at 60 Hz or 120 Hz. Several examples of such a camera system will now be described. A first camera system will be referred to as a “light plane” camera system. A second camera system will be referred to as a “structured light” camera system. Each of these camera systems not only capture light, but also emit light so that resulting reflected light may be captured by one or more cameras. In these examples, the light emitted from the camera system is not in the visible spectrum, although that is not a strict requirement. For instance, the emitted light may be infra-red light.
The light plane camera system is particular useful in an embodiment in which the accessory sits on the same surface on which the image is projected. The camera system of the accessory might emit an infrared light plane approximately parallel to (and in close proximity to) the surface on which the accessory rests. More regarding an example light plane camera system will be described below with respect to
The infra-red image fed by the camera system 621 to the detection module 622. In the structured light camera system example, that image includes the reflected structured light that facilitates capture of depth information. The detection module 622 may detect the depth information, and be able to distinguish objects placed within the field of camera view. It may thus recognize the three-dimensional form of a hand and fingers placed within the field of view.
This information may be used for any number of purposes. One purpose is to help the post-processing unit 611 black out those areas of the input image that corresponds to the objected placed in the field of view. For instance, when a user places a hand or arm into the projected image, the projected image will very soon be blacked out in the portions that project on the hand or arm. The response will be relatively fast such that it seems to the user like he/she is casting a shadow within the projection whereas in reality, the projector simply is not emitting in that area. The user then has the further benefit of not being distracted by images emitting onto his hands and arm.
Another user of this depth information is to allow complex input to be provided to the system. For instance, in three-dimensional space, the hand might provide three positional degrees of freedom, and 3 rotational degrees of freedom, providing potentially up to 6 orthogonal controls per hand. Multiple hands might enter into the camera detection area, thereby allowing a single user to use both hands to obtain even more degrees of freedom in inputting information. Multiple users may provide input into the camera detection area at any given time.
The detection module 622 may further detect gestures corresponding to movement of the object within the field of camera view. Such gestures might involve defined movement of the arm, hands, and fingers of even multiple users. As an example, the detection module 622 might have the ability to recognize sign language as an alternative input mechanism to the system.
Another use of the depth information might be to further improve the reliability of touch sensing in the case in which both the structured light camera system and the light plane camera system are in use. For instance, suppose the depth information from the structured light camera system suggests that there is a human hand in the field of view, but that this human hand is not close to contacting the projection surface. Now suppose a touch event is detected via the light plane camera system. The detection system might invalidate the touch event as incidental contact. For instance, perhaps the sleeve, or side of the hand, incidentally contacted the projected surface in a manner not to suggest intentional contact. The detection system could avoid that turning into an actual change in state. The confidence level associated with a particular same event for each camera system may be fed into a Kalman filtering module to arrive at an overall confidence level associated with the particular event.
Other types of camera systems include depth camera and 3-D camera. The captured data representing user interaction with the projected image may then be provided (as represented by arrow) to a detection system 623 which applies semantic meaning to the raw data provided by the camera system. Specifically, the detection system 623 detects an image input event using the captured data from the camera system (act 902). For instance, the detection system 623 might detect a touch event corresponding to particular coordinates. As an example only, this touch event may be expressed using the Human Interface Device (HID) protocol.
In the light plane camera system example, the detection system 623 might receive the infra-red image captured by the infra-red camera and determine where the point of maximum infrared light is. From this information, and with the detection system 623 understanding the position and orientation of each infra-red camera, the detection system 623 can apply trigonometric mathematics to determine what portion of the image was contacted.
In making this calculation, the detection system 623 might perform some auto-calibration by projecting a calibration image, and asking the user to tap on certain points. This auto-calibration information may be used also to apply some calibration adjustment into the calculation of which portion of the projected image the user intends to contact.
The detection system 623 might also apply auto-calibration after the initial calibration process, when the user is actually interacting with a projected image. For instance, if the system notices that the user seems to select a certain position, and then almost always later correct by selecting another position slightly offset in a consistent way, the system might infer that this consistent offset represent an unintended offset within the initial selection. Thus, the detection system might auto-calibrate so as to reduce the unintended offset.
Returning to
The input event may take the form of floating point value representations of the detecting contact coordinates, as well as a time stamp when the contact was detected. The image generation device receives this input event via the receive socket level connection. If the receive socket level connection is managed by the operating system, then the event may be fed directly into the portion of the operating system that handles touch events, which will treat the externally generated touch event in the same manner as would a touch event directly to the touch display of the image generation device. If the receive socket level connection is managed by the application, the application may pass the input event into that same portion of the operating system that handles touch events.
As previously mentioned, the post-processing module 611 may perform color compensation of the input image prior to projecting the image. As the accessory may be placed on all types of surfaces including non-white surfaces, non-uniformly colored surfaces, and the like, the characteristics of the surface will impact the colorization of the viewed image. The color compensation component 630 accounts for this by comparing the color as viewed to the color as intended, and performing appropriate adjustments. This adjustment may be performed continuously. Thus, the system may respond dynamically to any changes in the surface characteristics. For instance, if the accessory is moved slightly during play, the nature of the surface may be altered.
The principles described herein are not limited to any particular physical deployment. However, three example physical deployments will now be described in further detail In the first described physical deployment, the controller, the projection system, and the camera system are all integrated, and are designed to sit on a same flat surface as the surface on which the projection system projects. In the second described embodiment, the projector system is mounted to a ceiling. In the third described physical deployment, the projection system is suitable for connection within a ceiling to emit a projection downward onto a horizontal surface (such as a floor, table, or countertop).
First Physical Embodiment
The light plane camera system (described above) is particular useful in an embodiment in which the accessory sits on the same surface on which the image is projected. The camera system of the accessory might emit an infrared light plane approximately parallel to (and in close proximity to) the surface on which the accessory rests. For instance, referring to
An infrared camera system may be mounted in an elevated portion of the accessory to capture reflections of the infra-red light when the user inserts an object into the plane of the infra-red light. For instance, referring to
Referring to
In the example of the light plane camera system, when an object is positioned to touch the surface in the area of the projected image, the object will also break the infra-red plane. One or both of the infra-red cameras 1203 or 1204 will then detect a bright infra-red light reflecting from the object at the position in which the object breaks the infra-red plane. As an example, the object might be a pen, a stylus, a finger, a marker, or any other object.
In the structured light camera system, infra-red light is again emitted. In the example of
The structured light might, for example, be some predetermined pattern (such as a repeating grid pattern) that essentially allows for discrete sampling of depth information along the full extent of the combined scope of the infra-red emitter 1212 and the infra-red cameras 1203 and 1204. As an example only, the infra-red emitter 1212 might emit an array of dots. The infra-red cameras 1203 and 1204 will receive reflections of those dots, wherein the width of the dot at each sample point correlates to depth information at each sample point. A visible range camera 1210 captures the projected images.
Second Physical Embodiment
Third Physical Embodiment
Accordingly, the principles described herein describe embodiments in which a dynamic interactive image may be projected on a surface by an accessory to the device that actually generates the image, thereby allowing interaction with the projected image, and thereby causing interactivity with the image generation device. As an example, the accessory may be an accessory to a smartphone or tablet, or any other image generation device.
The present invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.