This invention relates generally to user interfaces for computerized systems, and specifically to user interfaces that are based on three-dimensional sensing.
Many different types of user interface devices and methods are currently available. Common tactile interface devices include the computer keyboard, mouse and joystick. Touch screens detect the presence and location of a touch by a finger or other object within the display area. Infrared remote controls are widely used, and “wearable” hardware devices have been developed, as well, for purposes of remote control.
Computer interfaces based on three-dimensional (3D) sensing of parts of the user's body have also been proposed. For example, PCT International Publication WO 03/071410, whose disclosure is incorporated herein by reference, describes a gesture recognition system using depth-perceptive sensors. A 3D sensor provides position information, which is used to identify gestures created by a body part of interest. The gestures are recognized based on a shape of a body part and its position and orientation over an interval. The gesture is classified for determining an input into a related electronic device.
As another example, U.S. Pat. No. 7,348,963, whose disclosure is incorporated herein by reference, describes an interactive video display system, in which a display screen displays a visual image, and a camera captures 3D information regarding an object in an interactive area located in front of the display screen. A computer system directs the display screen to change the visual image in response to changes in the object.
Documents incorporated by reference in the present patent application are to be considered an integral part of the application except that to the extent any terms are defined in these incorporated documents in a manner that conflicts with the definitions made explicitly or implicitly in the present specification, only the definitions in the present specification should be considered.
The description above is presented as a general overview of related art in this field and should not be construed as an admission that any of the information it contains constitutes prior art against the present patent application.
There is provided, in accordance with an embodiment of the present invention a method, including presenting, on a display coupled to a computer, an image of a keyboard comprising multiple keys, receiving a sequence of three-dimensional (3D) maps including a hand of a user positioned in proximity to the display, processing an initial portion of the sequence of 3D maps to detect a transverse gesture performed by a hand of a user positioned in proximity to the display, presenting, on the display, a cursor at a position indicated by the transverse gesture, and selecting, while presenting the cursor in proximity to the one of the multiple keys, one of the multiple keys upon detecting a grab gesture followed by a pull gesture followed by a release gesture in a subsequent portion of the sequence of 3D maps.
There is also provided, in accordance with an embodiment of the present invention an apparatus, including a sensing device, a display, and a computer coupled to the sensing device and the display, and configured to present, on the display, an image of a keyboard comprising multiple keys, to receive a sequence of three-dimensional (3D) maps including a hand of a user positioned in proximity to the display coupled to the computer, to process an initial portion of the sequence of 3D maps to detect a transverse gesture performed by a hand of a user positioned in proximity to the display, to present, on the display, a cursor at a position indicated by the transverse gesture, and to select, while presenting the cursor in proximity to the one of the multiple keys, one of the multiple keys upon detecting a grab gesture followed by a pull gesture followed by a release gesture in a subsequent portion of the sequence of 3D maps.
There is further provided, in accordance with an embodiment of the present invention a computer software product, including a non-transitory computer-readable medium, in which program instructions are stored, which instructions, when read by a computer, cause the computer to present, on a display coupled to a computer, an image of a keyboard comprising multiple keys, to receive a sequence of three-dimensional (3D) maps including a hand of a user positioned in proximity to the display, to process an initial portion of the sequence of 3D maps to detect a transverse gesture performed by a hand of a user positioned in proximity to the display, to present, on the display, a cursor at a position indicated by the transverse gesture, and to select, while presenting the cursor in proximity to the one of the multiple keys, one of the multiple keys upon detecting a grab gesture followed by a pull gesture followed by a release gesture in a subsequent portion of the sequence of 3D maps.
There is additionally provided, in accordance with an embodiment of the present invention a method, including receiving, by a computer, a sequence of three-dimensional (3D) maps containing at least a hand of a user positioned in proximity to a display coupled to the computer, detecting, in the 3D maps, a pointing gesture directed toward a region external to the display and adjacent to an edge of the display, and presenting, in response to the pointing gesture, one or more interactive objects on the display.
There is also provided, in accordance with an embodiment of the present invention an apparatus, including a sensing device, a display, and a computer coupled to the sensing device and the display, and configured to receive a sequence of three-dimensional (3D) maps containing at least a hand of a user positioned in proximity to the display, to detect, in the 3D maps, a pointing gesture directed toward a region external to the display and adjacent to an edge of the display, and to present, in response to the pointing gesture, one or more interactive objects on the display.
There is further provided, in accordance with an embodiment of the present invention a computer software product, including a non-transitory computer-readable medium, in which program instructions are stored, which instructions, when read by a computer, cause the computer to receive a sequence of three-dimensional (3D) maps containing at least a hand of a user positioned in proximity to a display coupled to the computer, to detect, in the 3D maps, a pointing gesture directed toward a region external to the display and adjacent to an edge of the display, and to present, in response to the pointing gesture, one or more interactive objects on the display.
There is additionally provided, in accordance with an embodiment of the present invention a method, including detecting, by a computer at least two hands of at least one user of the computer, assigning, based on a position of each of the hands, a respective ranking value to each of the hands, selecting a hand from among the at least two hands responsively to the respective ranking values, receiving a sequence of three-dimensional (3D) maps containing at least the selected hand positioned in proximity to a display coupled to the computer; and analyzing the 3D maps to detect a gesture performed by the selected hand.
There is also provided, in accordance with an embodiment of the present invention an apparatus, including a sensing device, a display, and a computer coupled to the sensing device and the display, and configured to detect at least two hands of at least one user of the computer to assign, based on a position of each of the hands, a respective ranking value to each of the hands, to select a hand from among the at least two hands responsively to the respective ranking values, to receive a sequence of three-dimensional (3D) maps containing at least the selected hand positioned in proximity to a display coupled to the computer, and to analyze the 3D maps to detect a gesture performed by the selected hand.
There is further provided, in accordance with an embodiment of the present invention a computer software product, including a non-transitory computer-readable medium, in which program instructions are stored, which instructions, when read by a computer, cause the computer to detect at least two hands of at least one user of the computer, to assign, based on a position of each of the hands, a respective ranking value to each of the hands, to select a hand from among the at least two hands responsively to the respective ranking values, to receive a sequence of three-dimensional (3D) maps containing at least the selected hand positioned in proximity to a display coupled to the computer, and to analyze the 3D maps to detect a gesture performed by the selected hand.
The disclosure is herein described, by way of example only, with reference to the accompanying drawings, wherein:
When using physical tactile input devices such as buttons, rollers or touch screens, a user typically engages and disengages control of a user interface by touching and/or manipulating the physical device. Embodiments of the present invention provide methods and mechanisms for interacting with a display coupled to a computer executing a non-tactile zoom-based user interface that includes three-dimensional (3D) sensing, by a 3D sensor, of motion or change of position of one or more body parts, typically a hand or a finger, of the user.
In some embodiments the zoom-based user interface utilizes a ZoomGrid control scheme that enables the user to select a given interactive object from multiple interactive objects presented on a display. The ZoomGrid control scheme described hereinbelow utilizes a hierarchical data structure having multiple levels with multiple nodes, wherein non-leaf nodes may represent categories (e.g., movies and music), for example, while leaf nodes represent content (e.g., media files and software applications.
Using embodiments described herein, a user can perform 3D gestures to traverse the hierarchical data structure in order to find a specific node storing content and perform an operation on the content. In some embodiments, if the content comprises a movie, the user can perform 3D gestures to manipulate on-screen media controls for operations such as volume control, pause, seek, etc. In additional embodiments of the present invention, the user can perform 3D gestures to enter text via an on-screen keyboard, and point to areas just outside a display to select “hidden icons”.
When interacting with a computer executing a non-tactile zoom-based user interface, a user may be positioned so that both of the user's hands are positioned within a field of view of a 3D optical sensor coupled to the computer. Additionally, there may be more than one user positioned within the sensor's field of view. In embodiments of the present invention, the computer can analyze a position of each hand within the field of view, and identify which of the hands is most likely intentionally interacting with the non-tactile user interface by performing 3D gestures.
Computer 26, executing zoom-based interface 20, processes data generated by device 24 in order to reconstruct a 3D map of user 22. The term “3D map” refers to a set of 3D coordinates measured, by way of example, with reference to a generally horizontal X-axis 32 in space, a generally vertical Y-axis 34 in space and a depth Z-axis 36 in space, based on device 24. The 3D coordinates represent the surface of a given object, in this case the user's body. In embodiments described below, as user 22 moves hand 30 along Z-axis 36 and an X-Y plane 40, computer 26 is configured to process the inputs received from the user in order to control location of a cursor 42 presented on display 28. The Z-direction, i.e., the direction perpendicular to the plane of display 28, is referred to in the present description and in the claims as the longitudinal direction, while directions within an X-Y plane, parallel to the plane of display, are referred to as transverse directions.
In one embodiment, device 24 projects a pattern of spots onto the object and captures an image of the projected pattern. Computer 26 then computes the 3D coordinates of points on the surface of the user's body by triangulation, based on transverse shifts of the spots in the pattern. Methods and devices for this sort of triangulation-based 3D mapping using a projected pattern are described, for example, in PCT International Publications WO 2007/043036, WO 2007/105205 and WO 2008/120217, whose disclosures are incorporated herein by reference. Alternatively, interface 20 may use other methods of 3D mapping, using single or multiple cameras or other types of sensors, as are known in the art.
Computer 26 typically comprises a general-purpose computer processor, which is programmed in software to carry out the functions described hereinbelow. The software may be downloaded to the processor in electronic form, over a network, for example, or it may alternatively be provided on non-transitory tangible media, such as optical, magnetic, or electronic memory media. Alternatively or additionally, some or all of the functions of the image processor may be implemented in dedicated hardware, such as a custom or semi-custom integrated circuit or a programmable digital signal processor (DSP). Although computer 26 is shown in
As another alternative, these processing functions may be carried out by a suitable processor that is integrated with display 28 (in a television set, for example) or with any other suitable sort of computerized device, such as a game console or media player. The sensing functions of device 24 may likewise be integrated into the computer or other computerized apparatus that is to be controlled by the sensor output.
In operation, as user 22 traverses tree 50 and accesses a given interactive object 38, computer 26 presents a given ZoomGrid surface 52 comprising sub-objects 38 (i.e., children nodes in tree 50) of the given interactive object. In the example shown in
As described hereinbelow, starting from a given surface 52, user 22 can traverse data structure 50 toward a given interactive object 38 (e.g., interactive object 38L) by performing a Find gesture (also referred to herein as a Transverse gesture), followed by a Grab gesture, a Pull Gesture and a Release gesture. Likewise, starting from the given surface, user 22 can traverse data structure 50 toward root interactive object 38A by performing a Find gesture, followed by a Grab gesture, a Push Gesture and a Release gesture. The terms grab, push and release are used in the present description and in the claims in their literal senses, to describe hand motions that would be used to graph, push and release a physical object, respectively, although in embodiments of the present invention these gestures are generally performed with respect to an interactive object, without there being any actual physical object in the hand.
In a presentation step 62, computer 26 presents, on display 28, a subset (i.e., a given surface 52) of interactive objects 38 that are associated with one or more child nodes of a first interactive object 38. As described supra, user 22 traverses tree 50 by accessing a given interactive object 38. Therefore, when initiating an interaction with ZoomGrid surfaces 52, user initially accesses interactive object 38A, and computer 26 presents interactive objects 38B, 38C and 38D. Using embodiments described herein, user 22 can then traverse tree 50.
In a receive step 64, computer 26 receives, from sensing device 24, a sequence of 3D maps that include at least a part of hand 30 positioned in proximity to display 28, and in an identification step 66, the computer identifies, in the sequence of 3D maps, a Find gesture followed by a Grab gesture followed by a longitudinal gesture followed by an Execute gesture. The term Execute gesture is used in the present description and in the claims in their literal senses, to describe a hand motion that user 22 performs subsequent to the longitudinal gesture in order to instruct computer 26 to perform an operation on a selected interactive object 38.
Examples of the Execute gesture include:
The Find gesture is described in U.S. Patent Application Publication 2012/0223882, whose disclosure is incorporated herein by reference. To perform the Find gesture, user 22 moves hand 30 along X-Y plane 40, and computer 26 can position cursor 42 on display 28 in response to the motion of the hand.
The Grab and the Release gestures are described in U.S. Patent Application Publication 2012/0204133, whose disclosure is incorporated herein by reference. To perform the Grab gesture, user 22 closed hand 30 by folding one or more fingers of hand 30 toward a palm of the hand. To perform the Release gesture, user 22 opens hand 30 from a closed or folded state.
Longitudinal gestures include a Push gesture and a Pull gesture, are also described in U.S. Patent Application Publication 2012/0204133, referenced above. User 22 can perform the Push gesture by moving hand 30 along Z-axis 36 toward display 28. Likewise, user 22 can perform the Pull gesture by moving hand 30 along Z-axis 36 away from display 28.
In a selection step 68, in response to identifying the Find gesture, computer 26 selects a second interactive object 38 from the subset of interactive objects 38. For example, to select the second interactive object, user can move hand 30 along X-Y plane 40, and upon computer 26 responsively positioning cursor 42 over (or in proximity to) the second interactive object, the user can either transition to a longitudinal gesture or keeping the hand relatively steady for a specific period of time.
Finally, in a performance step 70, in response to detecting the Release gesture, computer 26 performs an operation on the selected second interactive object. Examples of operations that computer 26 can perform on the second interactive object include, but are not limited to:
After selecting the second interactive object in response to the Find gesture performed by the user, the computer can “zoom in” (i.e., increase the size) of the second interactive object in response to detecting user 22 performing a Grab gesture followed by a Pull gesture. While increasing the size of the second interactive object, computer 22 can present context information about the second interactive object. For example, if the second interactive object comprises a movie, computer 22 can present context information such as a plot and a list of actors, as the computer increases the size of the second interactive object.
Two stabilization mechanisms described U.S. patent application Ser. No. 13/541,786 (referenced above) comprise comfort zones and funnels. While interacting with a multi-level ZoomGrid, computer 26 can define certain zoom levels as “comfort zones,” because they enable computer 26 to present interactive objects 38 in a aesthetic manner (for example, with an integer number of rows and columns of icons, with no icons cut off at the edges of the display). In other words computer 26 can “lock” user 22 into a comfort zone (i.e., a comfortable zoom level) while the user browses horizontally (e.g., using a Find gesture).
While in a comfort zone, if user 22 transversely moves hand (i.e., along X-axis 32 and/or Y-axis 34) while display 28 is in a comfort zone, the zoom may be locked, so that only significant motions along Z-axis 36 motions changes the zoom level. Specifically, computer 26 can be configured to assign less significance to hand motion detected along Z-axis 36 than hand motion detected along X-axis 32 and Y-axis 34, while user 22 is performing a Find gesture. In other situations, the zoom levels can be biased in order to drive the display into a comfort zone in response to relatively small movement of hand 30 along Z-axis 36. For example, if a given interactive object 38 comprises a folder of sub-objects 38, then computer 26 can enlarge the given interactive object (and thereby display the sub-objects) upon detecting significant motion of hand 30 away from display 28.
The “Funnel” mechanism enables computer 26 to accommodate any inadvertent transverse motion while user 22 is performing a longitudinal gesture. Specifically, computer 26 can be configured to assign less significance to hand motion detected along X-axis 32 and/or Y-axis 34 than hand motion detected on Z-axis 36, while user 22 is performing a Pull or a Push gesture. Limiting the significance of any transverse motion as computer 26 enlarges (i.e., “zooms in” on) the active interactive object as the user performs a Pull gesture can create a “funnel” like sensation that can help guide the user towards a given interactive object 38 that the user intends to select.
The significance of the transverse motion can be inversely related to a location of hand 30 while performing a Pull gesture. In other words, computer 26 can assign less significance to any detected transverse motion of hand 30 as the distance between the hand and display 28 increases. In operation, if computer 26 “suspects” that user 22 has identified a given interactive object and detects the user starting to perform a Pull gesture, the computer can start to limit the significance of any detected transverse motion of hand 30. As the Pull gesture progresses (and computer 26 further zooms in on the given interactive object) the computer can responsively decrease the significance of any detected transverse motion.
In some embodiments, the “funnel” paradigm can be extended to inhibit the association of hand 30 with a different interactive object 38 when the associated interactive object has been enlarged beyond a predetermined threshold size, responsively to user 22 moving hand 30 away from display 28. In other words, upon computer 26 presenting the associated object at a size equal or greater to the predetermined size, the computer can substantially ignore any transverse movement of hand 30 along X-axis 32 and/or Y-axis 34.
As described supra, after selecting the second interactive object in response to a Find gesture performed by the user, the computer can “zoom in” (i.e., increase the size) of the second interactive object in response to detecting user 22 performing a Grab gesture followed by a Pull gesture. In embodiments of the present invention, as computer 26 increases the size of the second interactive object, the computer can also present a preview of the second interactive object. For example, in addition (or as an alternative) to increasing the size of the second interactive object, computer 26 can present a “preview” of the second interactive object's child nodes, similar to “peeking” through a glass door.
This glass door metaphor is based on a front glass wall of a store that includes sliding doors that open as a customer approaches the glass. In embodiments of the present invention, a “zoomable object” (i.e., a given interactive object 38 having child nodes) may use the glass door to control how a user interacts with data associated with the zoomable object. To view the data associated with the zoomable object, the user can pull hand 30 back away from the zoomable object, and either open the zoomable object (i.e., open the glass door) or “peek” inside the zoomable object. When the user opens the glass door, the user interface can transition to a comfort zone associated with the zoomable object's associated data.
For example, a given zoomable object may comprise an icon representing a collection of movies in a specific category (e.g., drama or comedy). As the user pulls his hand back away from the icon, computer 26 presents a listing of the movies in the category (i.e., data associated with the icon). Once the user “opens” the glass door, the user can perform an operation on the data associated with the icon (e.g., start playing one of the movies in the category)
Actions that can “open” the Glass Door include:
In embodiments of the present invention, the user can “peek” inside a given glass door without “opening” the door. When peeking inside, the user can view interactive objects 38 at that level, but may not be able to instruct computer 26 to perform an operation on the interactive objects at that level. For example, by pulling the hand back from the documentary movie icon (i.e., interactive object 381), the user can peek at the documentary movie selection, but the user can only select one of the movies after “opening” the glass door. User 22 can then pull hand 30 back to see the movies in a given category, and then push the hand forward to return to the different movie categories. By opening glass doors, user 22 can transition between different hierarchical levels of the zooming user interface.
For example, to start playing interactive object 38L, in response to user 22 opening a first glass door to select Movies (i.e., interactive object 38C), computer 26 responsively presents icons representing different categories of movies (i.e., interactive objects 38H, 381 and 38J). As user 22 peeks into the Documentary category (i.e., interactive object 381), computer 26 presents a grid of all the movies (represented by icons) in the Documentary category.
While surface 52D in the example shown in
In other words, the user can see all the objects at the next hierarchical level when peeking at the level, even though the number of objects shown may limit the visibility of the objects. However of the user opens a glass door to a next hierarchical level, then a smaller (and scrollable) set of objects are shown that enable the user to easily interact with the presented objects.
Methods of peeking that computer 26 can implement to view content (i.e., interactive objects 38) at a given surface 52 include:
As described supra, when user 22 peeks into a given surface 52, computer 26 can present the interactive objects of the given surface in a “fanned out” format, in a manner similar to a fanned out pile of magazines. When presenting objects at a given hierarchical level, computer 26 can use the following formula to present objects while looking at a first glass door and peeking inside a second glass door (i.e., at the next hierarchical level:
Ratio=(Amount of detail on glass door)/(Amount of detail in collapsed state)
where the numerator represents a number of interactive objects 38 at an adjacent lower hierarchical level (i.e., inside a given glass door) and the denominator represents a number of interactive objects 38 that computer 26 presents in detail at the current hierarchical level (i.e., when looking at the given glass door).
Similar to the manner in which the glass door can be used to moderate a zoom-in gesture (i.e., a Pull gesture), a back door can be used to moderate a zoom-out gesture (i.e., a Push gesture). For example, if the user is at a given hierarchical level in
Examples of back door implementations include:
As described supra, depending on a configuration of a given comfort zone, computer 26 can present a group of interactive objects 38 (e.g., icons) either in a pile, a grid or spread out in order to see details on each of the objects. In some embodiments, computer 26 can present the interactive objects in an interpolated layout. For example, a given interpolated layout may present a pile of 25 movies, with four of the movies fanned out.
In some embodiments, a given layout (e.g., interactive objects stacked or fanned out, as shown in
In additional embodiments, layout parameters can be changed to mix (i.e., interpolate) a given layout between presenting all the interactive objects as a “fanned out” pile and presenting the interactive objects as pile 80. In some embodiments, computer 26 can dynamically present the interpolation as an animation.
In further embodiments, the interpolated layouts can be nested, thereby enabling computer 26 to mix three or more layouts. Examples of layout that computer 26 can mix include, but are not limited to:
While playing a media file, computer 26 can present media controls that user 22 can engage using embodiments described herein. Media controls may consist of controls for actions such as play/pause, seek, mute, volume up/down, next/previous track, fast-forward, rewind etc. In embodiments of the present invention, while computer 26 plays a media file (either in the foreground or in the background), the computer can place media player controls an “invisible” ZoomGrid surface 52. During the playback, when the user performs a longitudinal gesture, the ZoomGrid surface containing the imaginary controls gradually becomes visible. User 22 can select the media controls in a manner similar to selecting any other interactive objects 38 presented on a ZoomGrid surface.
In operation, while playing a media file (i.e., subsequent to performing the operation on the selected interactive object 38 in step 70), computer 26 can receive an additional set of 3D maps and detect, in the additional set of 3D maps, an additional Grab gesture followed by an additional Find gesture followed by a further Grab gesture followed by an additional Release gesture. Computer 26 can present one or more media controls in response to the additional Grab gesture, and position cursor 42 in response to the additional Find gesture. Upon detecting the further Grab gesture, computer 26 can identify one of the one or more media controls presented in proximity to cursor 42, and perform an operation associated with the one of the one or more media controls (e.g., increasing the volume) upon detecting the additional Release gesture.
A special behavior of the ZoomGrid based player controls mechanism is that upon selecting a given control, the controls surface can retract back and “disappear” (i.e., as if a spring connects it to the surface on which the media is playing). Toggle controls like mute/unmute or pause/play or buttons can be implemented again by the same embodiments as those used to select the interactive objects (i.e., the object selection triggers a given operation). Computer 26 can implement continuous controls such as volume and seek by identifying further gestures, as explained hereinbelow.
As described supra, computer 26 can implement continuous controls such as volume and seek by identifying further gestures performed by hand 30. For example, if computer 26 identifies volume slider icon 102 in response to the additional Pull gesture, computer 26 can adjust an audio volume level (and reposition the volume slider icon) in response to detecting, in the additional set of 3D maps, a further Find gesture subsequent to the additional Pull gesture and prior to the additional Release gesture. Upon detecting the additional Release gesture subsequent to the further Find gesture, computer 26 can maintain the audio volume level indicated by a position of the volume slider icon on display 28.
In embodiments of the present invention, computer 26 presents an image of keyboard 120 on display 28, and receives a sequence of three-dimensional (3D) maps including a hand 30. Upon processing an initial portion of the sequence of 3D maps and detecting hand 30 performing a Find gesture, computer 26 positions, on display 28, cursor 42 at a position indicated by the Find gesture. While presenting cursor 42 in proximity to a given key 128, and detecting in a subsequent portion of the sequence of 3D maps, a Grab gesture followed by a Pull gesture followed by a Release gesture, computer 26 selects the given key.
In operation, as user 22 performs a Find gesture by moving hand 30 along X-Y plane 40, computer 26 responsively highlights a given key 128. Upon performing a Grab gesture followed by a Pull gesture, computer 26 can convey visual feedback such as increasing the size of the given key (“C” as shown in the figure), and upon detecting the Release gesture, the computer can present a character associated with the given key in text input area 124. In some embodiments, upon user 22 transitioning the hand's motion from the X-Y plane to the Z-axis (i.e., pulling the hand back), computer 26 can convey visual feedback such as increasing the size of the given key, as shown in
In some configurations, computer 26 can select a given key upon detecting, in the 3D maps, a Find gesture followed by a Grab gesture. In other words, while computer 26 is highlighting a given key 128, the computer can select the given key upon detecting a Grab gesture while the given key is highlighted.
In the configuration shown in
While interacting with non-tactile 3D user interface 20, user 22 typically manipulates interactive objects 38 presented on display 28. However, when using a computer or a device controlled by a tactile user interface, there may be controls that are positioned outside the display. For example, Android™ smartphones have buttons positioned below the touch screen.
In embodiments of the present invention, upon computer 26 receiving a sequence of three-dimensional (3D) maps containing at least hand 30 positioned in proximity to display 28 and detecting, in the 3D maps, a pointing gesture directed toward a region external to the display and adjacent to an edge of the display, the computer can present one or more interactive objects 38 on display in response to the pointing gesture. In some embodiments, computer 26 can present the one or more interactive objects along the edge of the display that is adjacent to the regions.
A pointing gesture typically comprises user 22 pointing a finger of hand 30 toward display 28 to select a given interactive object 38 presented on the display, and are described in PCT International Publication WO 2012/107892, whose disclosure is incorporated herein by reference. In embodiments of the present invention, user 22 may perform a pointing gesture to a region outside display 28.
While user 22 may have two hands 30 in proximity to sensing device 24, the user interacts with the computer one of the hands at any given time. Additionally, there may be multiple individuals in a field of view of sensing device 24. Embodiments of the present invention provide methods and systems for rating each hand 30 within the field of view of sensing device 24 in order to identify which of the hands is most likely intentionally interacting with non-tactile 3D user interface 20.
In operation, computer 26 detects at least two hands 30 of at least one user 22, and assigns a respective ranking value (also referred to herein as a rating) to each of the hands based on a position of each of the hands. Although the configuration of sensing device 24 shown in
Computer 26 can then select a given hand 30 from among the at least two hands responsively to the respective ranking values, and upon receiving a sequence of three-dimensional (3D) maps containing at least the selected hand positioned in proximity to display 28, computer 26 can analyze the 3D maps to detect a gesture performed by the selected hand. In embodiments where computer 26 identifies hands 30 from an initial sequence of 3D maps, the 3D maps that the computer analyzes to detect a gesture comprise a set of 3D maps that computer 26 receives subsequent to receiving the initial set of 3D maps that were used to detect the two or more hands.
In some embodiments, the computer can identify hands 30 within the field of view, detect poses of the user(s) positioned within the field of view, and assign a rating based on the position of each the hands and the pose of each of the users. If the rating for a given hand 30 exceeds a defined threshold, then the computer can accept gestures from the given hand (i.e., the hand is active). During times where there is no user 22 interacting with the system, the computer may require a more overt gesture in order to select a given hand 30 as being active.
In the example shown in
For example, computer 26 may be configured to identify gestures performed with both of the hands, and the rating can be used to identify which of the individuals is most likely interacting with user interface 20. Additionally or alternatively, the hand ratings can be used for session management and for reducing false positives (i.e., reducing chances that the computer interprets a motion of a given hand 30 as a gesture, when the user did not intend to perform a gesture).
It will be appreciated that the embodiments described above are cited by way of example, and that the present invention is not limited to what has been particularly shown and described hereinabove. Rather, the scope of the present invention includes both combinations and subcombinations of the various features described hereinabove, as well as variations and modifications thereof which would occur to persons skilled in the art upon reading the foregoing description and which are not disclosed in the prior art.
This application is a continuation-in-part of U.S. patent application Ser. No. 13/541,786, filed Jul. 5, 2012, which claims the benefit of U.S. Provisional Patent Application 61/504,339, filed Jul. 5, 2011, of U.S. Provisional Patent Application 61/521,448, filed Aug. 9, 2011, and of U.S. Provisional Patent Application 61/523,349, filed Aug. 14, 2011. This application also claims the benefit of U.S. Provisional Patent Application 61/652,899, filed May 30, 2012. All of the above related applications are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4550250 | Mueller et al. | Oct 1985 | A |
4789921 | Aho | Dec 1988 | A |
4988981 | Zimmerman et al. | Jan 1991 | A |
5264836 | Rubin | Nov 1993 | A |
5495576 | Ritchey | Feb 1996 | A |
5588139 | Lanier et al. | Dec 1996 | A |
5594469 | Freeman et al. | Jan 1997 | A |
5846134 | Latypov | Dec 1998 | A |
5852672 | Lu | Dec 1998 | A |
5862256 | Zetts et al. | Jan 1999 | A |
5864635 | Zetts et al. | Jan 1999 | A |
5870196 | Lulli et al. | Feb 1999 | A |
5917937 | Szeliski et al. | Jun 1999 | A |
5973700 | Taylor et al. | Oct 1999 | A |
6002808 | Freeman | Dec 1999 | A |
6005548 | Latypov et al. | Dec 1999 | A |
6064387 | Canaday et al. | May 2000 | A |
6072494 | Nguyen | Jun 2000 | A |
6084979 | Kanade et al. | Jul 2000 | A |
6111580 | Kazama et al. | Aug 2000 | A |
6191773 | Maruno et al. | Feb 2001 | B1 |
6215890 | Matsuo et al. | Apr 2001 | B1 |
6229541 | Kamen et al. | May 2001 | B1 |
6243054 | DeLuca | Jun 2001 | B1 |
6252988 | Ho | Jun 2001 | B1 |
6256033 | Nguyen | Jul 2001 | B1 |
6262740 | Lauer et al. | Jul 2001 | B1 |
6345111 | Yamaguchi et al. | Feb 2002 | B1 |
6345893 | Fateh et al. | Feb 2002 | B2 |
6452584 | Walker et al. | Sep 2002 | B1 |
6456262 | Bell | Sep 2002 | B1 |
6507353 | Huard et al. | Jan 2003 | B1 |
6512838 | Rafii et al. | Jan 2003 | B1 |
6519363 | Su et al. | Feb 2003 | B1 |
6559813 | DeLuca et al. | May 2003 | B1 |
6581068 | Bensoussan et al. | Jun 2003 | B1 |
6681031 | Cohen et al. | Jan 2004 | B2 |
6686921 | Rushmeier | Feb 2004 | B1 |
6690370 | Ellenby et al. | Feb 2004 | B2 |
6741251 | Malzbender | May 2004 | B2 |
6791540 | Baumberg | Sep 2004 | B1 |
6803928 | Bimber et al. | Oct 2004 | B2 |
6853935 | Satoh et al. | Feb 2005 | B2 |
6857746 | Dyner | Feb 2005 | B2 |
6951515 | Ohshima et al. | Oct 2005 | B2 |
6977654 | Malik et al. | Dec 2005 | B2 |
7003134 | Covell et al. | Feb 2006 | B1 |
7013046 | Kawamura et al. | Mar 2006 | B2 |
7023436 | Segawa et al. | Apr 2006 | B2 |
7042440 | Pryor et al. | May 2006 | B2 |
7042442 | Kanevsky et al. | May 2006 | B1 |
7151530 | Roeber et al. | Dec 2006 | B2 |
7170492 | Bell | Jan 2007 | B2 |
7215815 | Honda | May 2007 | B2 |
7227526 | Hildreth et al. | Jun 2007 | B2 |
7257237 | Luck et al. | Aug 2007 | B1 |
7259747 | Bell | Aug 2007 | B2 |
7264554 | Bentley | Sep 2007 | B2 |
7289227 | Smetak et al. | Oct 2007 | B2 |
7289645 | Yamamoto et al. | Oct 2007 | B2 |
7295697 | Satoh | Nov 2007 | B1 |
7301648 | Foxlin | Nov 2007 | B2 |
7302099 | Zhang et al. | Nov 2007 | B2 |
7333113 | Gordon | Feb 2008 | B2 |
7340077 | Gokturk | Mar 2008 | B2 |
7340399 | Friedrich et al. | Mar 2008 | B2 |
7348963 | Bell | Mar 2008 | B2 |
7358972 | Gordon et al. | Apr 2008 | B2 |
7370883 | Basir et al. | May 2008 | B2 |
7427996 | Yonezawa et al. | Sep 2008 | B2 |
7428542 | Fink et al. | Sep 2008 | B1 |
7433024 | Garcia | Oct 2008 | B2 |
7474256 | Ohta et al. | Jan 2009 | B2 |
7508377 | Pihlaja et al. | Mar 2009 | B2 |
7526120 | Gokturk et al. | Apr 2009 | B2 |
7536032 | Bell | May 2009 | B2 |
7573480 | Gordon | Aug 2009 | B2 |
7576727 | Bell | Aug 2009 | B2 |
7580572 | Bang et al. | Aug 2009 | B2 |
7590941 | Wee et al. | Sep 2009 | B2 |
7688998 | Tuma et al. | Mar 2010 | B2 |
7696876 | Dimmer et al. | Apr 2010 | B2 |
7724250 | Ishii et al. | May 2010 | B2 |
7762665 | Vertegaal et al. | Jul 2010 | B2 |
7774155 | Sato et al. | Aug 2010 | B2 |
7812842 | Gordon | Oct 2010 | B2 |
7821541 | Delean | Oct 2010 | B2 |
7834847 | Boillot | Nov 2010 | B2 |
7840031 | Albertson et al. | Nov 2010 | B2 |
7844914 | Andre et al. | Nov 2010 | B2 |
7925549 | Looney et al. | Apr 2011 | B2 |
7971156 | Albertson et al. | Jun 2011 | B2 |
8018579 | Krah | Sep 2011 | B1 |
8150142 | Freedman et al. | Apr 2012 | B2 |
8154781 | Kroll et al. | Apr 2012 | B2 |
8166421 | Magal et al. | Apr 2012 | B2 |
8183977 | Matsumoto | May 2012 | B2 |
8194921 | Kongqiao et al. | Jun 2012 | B2 |
8214098 | Murray et al. | Jul 2012 | B2 |
8218211 | Kroll et al. | Jul 2012 | B2 |
8249334 | Berliner et al. | Aug 2012 | B2 |
8368647 | Lin | Feb 2013 | B2 |
8390821 | Shpunt et al. | Mar 2013 | B2 |
8396252 | El Dokor | Mar 2013 | B2 |
8400494 | Zalevsky et al. | Mar 2013 | B2 |
8405604 | Pryor et al. | Mar 2013 | B2 |
8416276 | Kroll et al. | Apr 2013 | B2 |
8446459 | Fang et al. | May 2013 | B2 |
8448083 | Migos et al. | May 2013 | B1 |
8462199 | Givon | Jun 2013 | B2 |
8493496 | Freedman et al. | Jul 2013 | B2 |
8514221 | King et al. | Aug 2013 | B2 |
8514251 | Hildreth et al. | Aug 2013 | B2 |
8625882 | Backlund et al. | Jan 2014 | B2 |
8660869 | Maclntyre et al. | Feb 2014 | B2 |
8935631 | Leonard et al. | Jan 2015 | B2 |
8990733 | Deutsch et al. | Mar 2015 | B2 |
8996173 | Itkowitz | Mar 2015 | B2 |
9015606 | Zaman et al. | Apr 2015 | B2 |
9030528 | Pesach | May 2015 | B2 |
9030529 | Chen | May 2015 | B2 |
9052820 | Jarrett et al. | Jun 2015 | B2 |
9058307 | Tien et al. | Jun 2015 | B2 |
9075441 | St. Hilaire | Jul 2015 | B2 |
9075460 | Mak et al. | Jul 2015 | B2 |
9098931 | Shpunt | Aug 2015 | B2 |
9104271 | Adams | Aug 2015 | B1 |
9104440 | Jarrett et al. | Aug 2015 | B2 |
9158445 | Wong et al. | Oct 2015 | B2 |
9207955 | Wenig et al. | Dec 2015 | B2 |
9213468 | Zaman et al. | Dec 2015 | B2 |
20020041327 | Hildreth | Apr 2002 | A1 |
20020057383 | Iwamura | May 2002 | A1 |
20020071607 | Kawamura et al. | Jun 2002 | A1 |
20020158873 | Williamson | Oct 2002 | A1 |
20030057972 | Pfaff et al. | Mar 2003 | A1 |
20030063775 | Rafii et al. | Apr 2003 | A1 |
20030071814 | Jou et al. | Apr 2003 | A1 |
20030088463 | Kanevsky | May 2003 | A1 |
20030156756 | Gokturk et al. | Aug 2003 | A1 |
20030185444 | Honda | Oct 2003 | A1 |
20030227453 | Beier et al. | Dec 2003 | A1 |
20030235341 | Gokturk et al. | Dec 2003 | A1 |
20040046744 | Rafii et al. | Mar 2004 | A1 |
20040104935 | Williamson | Jun 2004 | A1 |
20040135744 | Bimber et al. | Jul 2004 | A1 |
20040155962 | Marks | Aug 2004 | A1 |
20040174770 | Rees | Sep 2004 | A1 |
20040183775 | Bell | Sep 2004 | A1 |
20040184640 | Bang et al. | Sep 2004 | A1 |
20040184659 | Bang et al. | Sep 2004 | A1 |
20040193413 | Wilson et al. | Sep 2004 | A1 |
20040222977 | Bear et al. | Nov 2004 | A1 |
20074025831 | Hashimoto | Dec 2004 | |
20050031166 | Fujimura et al. | Feb 2005 | A1 |
20050088407 | Bell et al. | Apr 2005 | A1 |
20050089194 | Bell | Apr 2005 | A1 |
20050110964 | Bell et al. | May 2005 | A1 |
20050122308 | Bell et al. | Jun 2005 | A1 |
20050162381 | Bell et al. | Jul 2005 | A1 |
20050190972 | Thomas et al. | Sep 2005 | A1 |
20050254726 | Fuchs et al. | Nov 2005 | A1 |
20050265583 | Covell et al. | Dec 2005 | A1 |
20060010400 | Dehlin et al. | Jan 2006 | A1 |
20060020905 | Wroblewski | Jan 2006 | A1 |
20060092138 | Kim et al. | May 2006 | A1 |
20060110008 | Vertegaal et al. | May 2006 | A1 |
20060115155 | Lui et al. | Jun 2006 | A1 |
20060139314 | Bell | Jun 2006 | A1 |
20060149737 | Du et al. | Jul 2006 | A1 |
20060159344 | Shao et al. | Jul 2006 | A1 |
20060187196 | Underkoffler et al. | Aug 2006 | A1 |
20060239670 | Cleveland | Oct 2006 | A1 |
20060248475 | Abrahamsson | Nov 2006 | A1 |
20070060336 | Marks | Mar 2007 | A1 |
20070078552 | Rosenberg | Apr 2007 | A1 |
20070130547 | Boillot | Jun 2007 | A1 |
20070154116 | Shieh | Jul 2007 | A1 |
20070230789 | Chang et al. | Oct 2007 | A1 |
20070285554 | Givon | Dec 2007 | A1 |
20080030460 | Hildreth et al. | Feb 2008 | A1 |
20080059915 | Boillot | Mar 2008 | A1 |
20080062123 | Bell | Mar 2008 | A1 |
20080094371 | Forstall et al. | Apr 2008 | A1 |
20080123940 | Kundu et al. | May 2008 | A1 |
20080150890 | Bell et al. | Jun 2008 | A1 |
20080150913 | Bell et al. | Jun 2008 | A1 |
20080170776 | Albertson et al. | Jul 2008 | A1 |
20080236902 | Imaizumi | Oct 2008 | A1 |
20080252596 | Bell et al. | Oct 2008 | A1 |
20080256494 | Greenfield | Oct 2008 | A1 |
20080260250 | Vardi | Oct 2008 | A1 |
20080281583 | Slothouber et al. | Nov 2008 | A1 |
20080287189 | Rabin | Nov 2008 | A1 |
20090009593 | Cameron et al. | Jan 2009 | A1 |
20090027335 | Ye | Jan 2009 | A1 |
20090027337 | Hildreth | Jan 2009 | A1 |
20090031240 | Hildreth | Jan 2009 | A1 |
20090033623 | Lin | Feb 2009 | A1 |
20090040215 | Afzulpurkar et al. | Feb 2009 | A1 |
20090073117 | Tsurumi et al. | Mar 2009 | A1 |
20090077504 | Bell | Mar 2009 | A1 |
20090078473 | Overgard et al. | Mar 2009 | A1 |
20090083122 | Angell et al. | Mar 2009 | A1 |
20090083622 | Chien et al. | Mar 2009 | A1 |
20090096783 | Shpunt et al. | Apr 2009 | A1 |
20090183125 | Magal et al. | Jul 2009 | A1 |
20090195392 | Zalewski | Aug 2009 | A1 |
20090228841 | Hildreth | Sep 2009 | A1 |
20090256817 | Perlin et al. | Oct 2009 | A1 |
20090284542 | Baar et al. | Nov 2009 | A1 |
20090297028 | De Haan | Dec 2009 | A1 |
20100002936 | Khomo et al. | Jan 2010 | A1 |
20100007717 | Spektor et al. | Jan 2010 | A1 |
20100034457 | Berliner et al. | Feb 2010 | A1 |
20100036717 | Trest | Feb 2010 | A1 |
20100053151 | Marti et al. | Mar 2010 | A1 |
20100053304 | Underkoffler et al. | Mar 2010 | A1 |
20100071965 | Hu et al. | Mar 2010 | A1 |
20100083189 | Arlein et al. | Apr 2010 | A1 |
20100095773 | Shaw et al. | Apr 2010 | A1 |
20100103106 | Chui | Apr 2010 | A1 |
20100118123 | Freedman | May 2010 | A1 |
20100149096 | Migos et al. | Jun 2010 | A1 |
20100164897 | Morin et al. | Jul 2010 | A1 |
20100177933 | Willmann et al. | Jul 2010 | A1 |
20100199228 | Latta et al. | Aug 2010 | A1 |
20100199231 | Markovic et al. | Aug 2010 | A1 |
20100229125 | Cha | Sep 2010 | A1 |
20100234094 | Gagner et al. | Sep 2010 | A1 |
20100235034 | Higgins | Sep 2010 | A1 |
20100235786 | Meizels et al. | Sep 2010 | A1 |
20100295781 | Alameh et al. | Nov 2010 | A1 |
20110006978 | Yuan | Jan 2011 | A1 |
20110007035 | Shai | Jan 2011 | A1 |
20110018795 | Jang | Jan 2011 | A1 |
20110029918 | Yoo et al. | Feb 2011 | A1 |
20110052006 | Gurman et al. | Mar 2011 | A1 |
20110081072 | Kawasaki et al. | Apr 2011 | A1 |
20110087970 | Swink et al. | Apr 2011 | A1 |
20110141053 | Bulea et al. | Jun 2011 | A1 |
20110144543 | Tsuzuki et al. | Jun 2011 | A1 |
20110164032 | Shadmi | Jul 2011 | A1 |
20110164141 | Tico et al. | Jul 2011 | A1 |
20110193939 | Vassigh et al. | Aug 2011 | A1 |
20110205186 | Newton et al. | Aug 2011 | A1 |
20110211044 | Shpunt | Sep 2011 | A1 |
20110211754 | Litvak et al. | Sep 2011 | A1 |
20110225536 | Shams et al. | Sep 2011 | A1 |
20110227820 | Haddick et al. | Sep 2011 | A1 |
20110231757 | Haddick et al. | Sep 2011 | A1 |
20110242102 | Hess | Oct 2011 | A1 |
20110248914 | Sherr | Oct 2011 | A1 |
20110254765 | Brand | Oct 2011 | A1 |
20110254798 | Adamson et al. | Oct 2011 | A1 |
20110260965 | Kim et al. | Oct 2011 | A1 |
20110261058 | Luo | Oct 2011 | A1 |
20110279397 | Rimon et al. | Nov 2011 | A1 |
20110291926 | Gokturk et al. | Dec 2011 | A1 |
20110292036 | Sali et al. | Dec 2011 | A1 |
20110293137 | Gurman et al. | Dec 2011 | A1 |
20110296353 | Ahmed et al. | Dec 2011 | A1 |
20110310010 | Hoffnung et al. | Dec 2011 | A1 |
20120001875 | Li et al. | Jan 2012 | A1 |
20120035934 | Cunningham | Feb 2012 | A1 |
20120038550 | Lemmey et al. | Feb 2012 | A1 |
20120078614 | Galor et al. | Mar 2012 | A1 |
20120089950 | Tseng et al. | Apr 2012 | A1 |
20120117514 | Kim et al. | May 2012 | A1 |
20120124516 | Friedman | May 2012 | A1 |
20120169583 | Rippel et al. | Jul 2012 | A1 |
20120173067 | Szczerba et al. | Jul 2012 | A1 |
20120200494 | Perski | Aug 2012 | A1 |
20120202569 | Maizels et al. | Aug 2012 | A1 |
20120204133 | Guendelman et al. | Aug 2012 | A1 |
20120223882 | Galor et al. | Sep 2012 | A1 |
20120249416 | Maciocci et al. | Oct 2012 | A1 |
20120268369 | Kikkeri | Oct 2012 | A1 |
20120275680 | Omi | Nov 2012 | A1 |
20120295661 | Kim et al. | Nov 2012 | A1 |
20120309535 | Langridge | Dec 2012 | A1 |
20120313848 | Galor et al. | Dec 2012 | A1 |
20120320080 | Giese et al. | Dec 2012 | A1 |
20130002801 | Mock | Jan 2013 | A1 |
20130014052 | Frey et al. | Jan 2013 | A1 |
20130044053 | Galor et al. | Feb 2013 | A1 |
20130055120 | Galor et al. | Feb 2013 | A1 |
20130055150 | Galor | Feb 2013 | A1 |
20130058565 | Rafii et al. | Mar 2013 | A1 |
20130106692 | Maizels et al. | May 2013 | A1 |
20130107021 | Maizels et al. | May 2013 | A1 |
20130127854 | Shpunt et al. | May 2013 | A1 |
20130155070 | Luo | Jun 2013 | A1 |
20130207920 | Mccann et al. | Aug 2013 | A1 |
20130222239 | Galor | Aug 2013 | A1 |
20130263036 | Berenson et al. | Oct 2013 | A1 |
20130265222 | Berenson et al. | Oct 2013 | A1 |
20130283208 | Bychkov et al. | Oct 2013 | A1 |
20130283213 | Bychkov et al. | Oct 2013 | A1 |
20140108930 | Asnis | Apr 2014 | A1 |
20140237432 | Geurts | Aug 2014 | A1 |
20160026265 | Shaw et al. | Jan 2016 | A1 |
Number | Date | Country |
---|---|---|
9935633 | Jul 1999 | WO |
03071410 | Aug 2003 | WO |
2004107272 | Dec 2004 | WO |
2005003948 | Jan 2005 | WO |
2005094958 | Oct 2005 | WO |
2007078639 | Jul 2007 | WO |
2007135376 | Nov 2007 | WO |
2012107892 | Aug 2012 | WO |
Entry |
---|
U.S. Appl. No. 12/683,452 Office Action dated Nov. 21, 2014. |
U.S. Appl. No. 14/055,997 Office Action dated Nov. 21, 2014. |
U.S. Appl. No. 13/592,352 Office Action dated Oct. 2, 2014. |
Scharstein, D., “Stereo vision for view synthesis”, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 852-858, year 1996. |
Zhu et al., “Generation and Error Characterization of Pararell-Perspective Stereo Mosaics from Real Video”, In-Video Registration, Springer, US, chapter 4,pp. 72-105, year 2003. |
Chai et al., “Parallel Projections for Stereo Reconstruction”, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition ,vol. 2, pp. 493-500, year 2000. |
Evers et al., “Image-based rendering of complex scenes from multi-camera rig”, IEEE Proceedings on Vision, Image and Signal Processing, vol. 152, No. 4, pp. 470-480, Aug. 5, 2005. |
Evers et al,. “Image-based Interactive rendering with view dependent geometry”, Computer Graphics Forum, (Eurographics '03), vol. 22, No. 3, pp. 573-582, year 2003. |
Kauff et al., “Depth map creation and image-based rendering for advanced 3DTV Services Providing Interoperability and Scalability”, Signal Processing: Image Communication, vol. 22, No. 2, pp. 217-234, year 2007. |
U.S. Appl. No. 13/161,508 Office Action dated Dec. 23, 2014. |
U.S. Appl. No. 13/244,490 Office Action dated Dec. 6, 2013. |
U.S. Appl. No. 13/423,314 Office Action dated Dec. 4, 2013. |
U.S. Appl. No. 13/423,322 Office Action dated Nov. 1, 2013. |
Bleiwess et al., “Fusing Time-of-Flight Depth and Color for Real-Time Segmentation and Tracking”, Dyn3D 2009, Lecture Notes in Computer Science 5742, pp. 58-69, Jena, Germany, Sep. 9, 2009. |
Bleiwess et al., “Markerless Motion Capture Using a Single Depth Sensor”, SIGGRAPH Asia 2009, Yokohama, Japan, Dec. 16-19, 2009. |
Bevilacqua et al., “People Tracking Using a Time-Of-Flight Depth Sensor”, Proceedings of the IEEE International Conference on Video and Signal Based Surveillance, Sydney, Australia, Nov. 22-24, 2006. |
Bradski, G., “Computer Vision Face Tracking for Use in a Perceptual User Interface”, Intel Technology Journal, vol. 2, issue 2 (2nd Quarter 2008). |
Comaniciu et al., “Kernel-Based Object Tracking”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, No. 5, pp. 564-577, May 2003. |
Gesturetec Inc., “Gesture Control Solutions for Consumer Devices”, Canada, 2009. |
Gokturk et al., “A Time-Of-Flight Depth Sensor—System Description, Issues and Solutions”, Proceedings of the 2004 Conference on Computer Vision and Patter Recognition Workshop (CVPRW'04), vol. 3, pp. 35, Jun. 27-Jul. 2, 2004. |
Grest et al., “Single View Motion Tracking by Depth and Silhouette Information”, SCIA 2007—Scandinavian Conference on Image Analysis, Lecture Notes in Computer Science 4522, pp. 719-729, Aalborg, Denmark, Jun. 10-14, 2007. |
Haritaoglu et al., “Ghost 3d: Detecting Body Posture and Parts Using Stereo”, Proceedings of the IEEE Workshop on Motion and Video Computing (Motion'02), pp. 175-180, Orlando, USA, Dec. 5-6, 2002. |
Haritaoglu et al., “W4S : A real-time system for detecting and tracking people in 2<1/2>D”, ECCV 98—5th European conference on computer vision, vol. 1407, pp. 877-892, Freiburg , Germany, Jun. 2-6, 1998. |
Harville, M., “Stereo Person Tracking with Short and Long Term Plan-View Appearance Models of Shape and Color”, Proceedings of the IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSSS—2005), pp. 522-527, Como, Italy, Sep. 15-16, 2005. |
Holte, M., “Fusion of Range and Intensity Information for View Invariant Gesture Recognition”, IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW '08), pp. 1-7, Anchorage, USA, Jun. 23-28, 2008. |
Kaewtrakulpong et al., “An Improved Adaptive Background Mixture Model for Real-Time Tracking with Shadow Detection”, Proceedings of the 2nd European Workshop on Advanced Video Based Surveillance Systems (AVBS'01), Kingston, UK, Sep. 2001. |
Kolb et al., “ToF-Sensors: New Dimensions for Realism and Interactivity”, Proceedings of the IEEE Conference on Computer Vision and Patter Recognition Workshops, pp. 1-6, Anchorage, USA, Jun. 23-28, 2008. |
Kolsch et al., “Fast 2D Hand Tracking with Flocks of Features and Multi-Cue Integration”, IEEE Workshop on Real-Time Vision for Human Computer Interaction (at CVPR'04), Washington, USA, Jun. 27-Jul. 2, 2004. |
Krumm et al., “Multi-Camera Multi-Person Tracking for EasyLiving”, 3rd IEEE International Workshop on Visual Surveillance, Dublin, Ireland, Jul. 1, 2000. |
Leens et al., “Combining Color, Depth, and Motion for Video Segmentation”, ICVS 2009—7th International Conference on Computer Vision Systems, Liege, Belgium Oct. 13-15, 2009. |
MacCormick et al., “Partitioned Sampling, Articulated Objects, and Interface-Quality Hand Tracking”, ECCV '00—Proceedings of the 6th European Conference on Computer Vision—Part II , pp. 3-19, Dublin, Ireland, Jun. 26-Jul. 1, 2000. |
Malassiotis et al., “Real-Time Hand Posture Recognition Using Range Data”, Image and Vision Computing, vol. 26, No. 7, pp. 1027-1037, Jul. 2, 2008. |
Morano et al., “Structured Light Using Pseudorandom Codes”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, issue 3, pp. 322-327, Mar. 1998. |
Munoz-Salinas et al., “People Detection and Tracking Using Stereo Vision and Color”, Image and Vision Computing, vol. 25, No. 6, pp. 995-1007, Jun. 1, 2007. |
Nanda et al., “Visual Tracking Using Depth Data”, Proceedings of the 2004 Conference on Computer Vision and Patter Recognition Workshop, vol. 3, Washington, USA, Jun. 27-Jul. 2, 2004. |
Scharstein et al., “High-Accuracy Stereo Depth Maps Using Structured Light”, IEEE Conference on Computer Vision and Patter Recognition, vol. 1, pp. 195-2002, Madison, USA, Jun. 2003. |
Shi et al., “Good Features to Track”, IEEE Conference on Computer Vision and Pattern Recognition, pp. 593-600, Seattle, USA, Jun. 21-23, 1994. |
Siddiqui et al., “Robust Real-Time Upper Body Limb Detection and Tracking”, Proceedings of the 4th ACM International Workshop on Video Surveillance and Sensor Networks, Santa Barbara, USA, Oct. 27, 2006. |
Softkinetic S.A., IISU™—3D Gesture Recognition Platform for Developers of 3D Applications, Belgium, Brussels, 2007-2010. |
Sudderth et al., “Visual Hand Tracking Using Nonparametric Belief Propagation”, IEEE Workshop on Generative Model Based Vision at CVPR'04, Washington, USA, Jun. 27-Jul. 2, 2004. |
Tsap, L. “Gesture-Tracking in Real Time with Dynamic Regional Range Computation”, Real-Time Imaging, vol. 8, issue 2, pp. 115-126, Apr. 2002. |
Xu et al., “A Multi-Cue-Based Human Body Tracking System”, Proceedings of the 5ths International Conference on Computer Vision Systems (ICVS 2007), Germany, Mar. 21-24, 2007. |
Xu et al., “Human Detecting Using Depth and Gray Images”, Proceedings of the IEE Conference on Advanced Video and Signal Based Surveillance (AVSS'03), Miami, USA, Jul. 21-22, 2003. |
Yilmaz et al., “Object Tracking: A Survey”, ACM Computing Surveys, vol. 38, No. 4, article 13, Dec. 2006. |
Zhu et al., “Controlled Human Pose Estimation From Depth Image Streams”, IEEE Conference on Computer Vision and Patter Recognition Workshops, pp. 1-8, Anchorage, USA, Jun. 23-27, 2008. |
International Application PCT/IB2010/051055 Search Report dated Sep. 1, 2010. |
La Viola, J. Jr., “Whole-Hand and Speech Input in Virtual Environments”, Computer Science Department, Florida Atlantic University, USA, 1996. |
Martell, C., “Form: An Experiment in the Annotation of the Kinematics of Gesture”, Dissertation, Computer and Information Science, University of Pennsylvania, 2005. |
U.S. Appl. No. 12/352,622 Official Action dated Mar. 31, 2011. |
Prime Sense Inc., “Prime Sensor™ NITE 1.1 Framework Programmer's Guide”, Version 1.2, year 2009. |
Primesense Corporation, “PrimeSensor Reference Design 1.08”, USA, year 2010. |
International Application PCT/IB2011/053192 Search Report dated Dec. 6, 2011. |
U.S. Appl. No. 12/352,622 Official Action dated Sep. 30, 2011. |
U.S. Appl. No. 13/541,786 Office Action dated Feb. 13, 2014. |
U.S. Appl. No. 13/584,831 Office Action dated Mar. 20, 2014. |
U.S. Appl. No. 13/314,207 Office Action dated Apr. 3, 2014. |
U.S. Appl. No. 12/683,452 Office Action dated Jan. 22, 2014. |
U.S. Appl. No. 12/314,210 Office Action dated Jan. 10, 2014. |
U.S. Appl. No. 13/592,352 Office Action dated Feb. 13, 2014. |
Nakamura et al, “Occlusion detectable stereo-occlusion patterns in camera matrix”, Proceedings of the 1996 Conference on Computer Vision and Pattern Recognition (CVPR '96), pp. 371-378, Jun. 1996. |
U.S. Appl. No. 13/423,322 Office Action dated Apr. 7, 2014. |
U.S. Appl. No. 13/584,831 Office Action dated Jul. 8, 2014. |
U.S. Appl. No. 13/423,314 Office Action dated Jul. 31, 2014. |
U.S. Appl. No. 12/683,452 Office Action dated Jul. 16, 2014. |
U.S. Appl. No. Advisory Action dated Jun. 26, 2014. |
Slinger et al, “Computer—Generated Holography as a Generic Display Technology”, IEEE Computer, vol. 28, Issue 8, pp. 46-53, Aug. 2005. |
Hilliges et al, “Interactions in the air: adding further depth to interactive tabletops”, Proceedings of the 22nd annual ACM symposium on User interface software and technology, ACM, pp. 139-148, Oct. 2009. |
U.S. Appl. No. 13/592,352 Office Action dated May 7, 2014. |
U.S. Appl. No. 12/721,582 Office Action dated Apr. 17, 2014. |
U.S. Appl. No. 14/055,997 Office Action dated May 28, 2014. |
Hart, D., U.S. Appl. No. 09/616,606 “Method and System for High Resolution , Ultra Fast 3-D Imaging” filed Jul. 14, 2000. |
International Application PCT/IL2007/000306 Search Report dated Oct. 2, 2008. |
International Application PCT/IL2007/000574 Search Report dated Sep. 10, 2008. |
International Application PCT/IL2006/000335 Preliminary Report on Patentability dated Apr. 24, 2008. |
Avidan et al., “Trajectory triangulation: 3D reconstruction of moving points from amonocular image sequence”, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol. 22, No. 4, pp. 348-3537, Apr. 2000. |
Leclerc et al., “The direct computation of height from shading”, The Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 552-558, USA, Jun. 1991. |
Zhang et al., “Shape from intensity gradient”, IEEE Transactions on Systems, Man and Cybernetics—Part A: Systems and Humans, vol. 29, No. 3, pp. 318-325, May 1999. |
Zhang et al., “Height recovery from intensity gradients”, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 508-513, Jun. 21-23, 1994. |
Horn, B., “Height and gradient from shading”, International Journal of Computer Vision, vol. 5, No. 1, pp. 37-76, Aug. 1990. |
Bruckstein, A., “On shape from shading”, Computer Vision, Graphics & Image Processing, vol. 44, pp. 139-154, year 1988. |
Zhang et al., “Rapid Shape Acquisition Using Color Structured Light and Multi-Pass Dynamic Programming”, 1st International Symposium on 3D Data Processing Visualization and Transmission (3DPVT), Italy, Jul. 2002. |
Besl, P., “Active, Optical Range Imaging Sensors”, Machine vision and applications, vol. 1, pp. 127-152, year 1988. |
Horn et al., “Toward optimal structured light patterns”, Proceedings of International Conference on Recent Advances in 3D Digital Imaging and Modeling, pp. 28-37, Ottawa, Canada, May 1997. |
Goodman, J.W., “Statistical Properties of Laser Speckle Patterns”, Laser Speckle and Related Phenomena, pp. 9-75, Springer-Verlag, Berlin Heidelberg, 1975. |
Asada et al., “Determining Surface Orientation by Projecting a Stripe Pattern”, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol. 10, No. 5, pp. 749-754, Sep. 1988. |
Winkelbach et al., “Shape from Single Stripe Pattern Illumination”, Luc Van Gool (Editor), (DAGM 2002), Lecture Notes in Computer Science 2449, p. 240-247, Springer 2002. |
Koninckx et al., “Efficient, Active 3D Acquisition, based on a Pattern-Specific Snake”, Luc Van Gool (Editor), (DAGM 2002), Lecture Notes in Computer Science 2449, pp. 557-565, Springer 2002. |
Kimmel et al., “Analyzing and synthesizing images by evolving curves with the Osher-Sethian method”, International Journal of Computer Vision, vol. 24, No. 1, pp. 37-56, year 1997. |
Zigelman et al., “Texture mapping using surface flattening via multi-dimensional scaling”, IEEE Transactions on Visualization and Computer Graphics, vol. 8, No. 2, pp. 198-207, Apr. 2002. |
Dainty, J.C., “Introduction”, Laser Speckle and Related Phenomena, pp. 1-7, Springer-Verlag, Berlin Heidelberg, 1975. |
Mendlovic et al., “Composite harmonic filters for scale, projection and shift invariant pattern recognition”, Applied Optics Journal, vol. 34, No. 2, Jan. 10, 1995. |
Fua et al., “Human Shape and Motion Recovery Using Animation Models”, 19th Congress, International Society for Photogrammetry and Remote Sensing, Amsterdam, The Netherlands, Jul. 2000. |
Allard et al., “Marker-less Real Time 3D modeling for Virtual Reality”, Immersive Projection Technology, Iowa State University, year 2004. |
Howe et al., “Bayesian Reconstruction of 3D Human Motion from Single-Camera Video”, Advanced in Neural Information Processing Systems, vol. 12, pp. 820-826, USA 1999. |
Li et al., “Real-Time 3D Motion Tracking with Known Geometric Models”, Real-Time Imaging Journal, vol. 5, pp. 167-187, Academic Press 1999. |
Grammalidis et al., “3-D Human Body Tracking from Depth Images Using Analysis by Synthesis”, Proceedings of the IEEE International Conference on Image Processing (ICIP2001), pp. 185-188, Greece, Oct. 7-10, 2001. |
Segen et al., “Shadow gestures: 3D hand pose estimation using a single camera”, Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 479-485, Fort Collins, USA, 1999. |
Vogler et al., “ASL recognition based on a coupling between HMMs and 3D motion analysis”, Proceedings of IEEE International Conference on Computer Vision, pp. 363-369, Mumbai, India, 1998. |
Nam et al., “Recognition of Hand Gestures with 3D, Nonlinear Arm Movements”, Pattern Recognition Letters, vol. 18, No. 1, pp. 105-113, Elsevier Science B.V. 1997. |
Nesbat, S., “A System for Fast, Full-Text Entry for Small Electronic Devices”, Proceedings of the 5th International Conference on Multimodal Interfaces, ICMI 2003, Vancouver, Nov. 5-7, 2003. |
Ascension Technology Corporation, “Flock of Birds: Real-Time Motion Tracking”, 2008. |
Segen et al., “Human-computer interaction using gesture recognition and 3D hand tracking”, ICIP 98, Proceedings of the IEEE International Conference on Image Processing, vol. 3, pp. 188-192, Oct. 4-7, 1998. |
Dekker, L., “Building Symbolic Information for 3D Human Body Modeling from Range Data”, Proceedings of the Second International Conference on 3D Digital Imaging and Modeling, IEEE computer Society, pp. 388-397, 1999. |
Holte et al., “Gesture Recognition using a Range Camera”, Technical Report CVMT-07-01 ISSN 1601-3646, Feb. 2007. |
Cheng et al., “Articulated Human Body Pose Inference from Voxel Data Using a Kinematically Constrained Gaussian Mixture Model”, CVPR EHuM2: 2nd Workshop on Evaluation of Articulated Human Motion and Pose Estimation, 2007. |
Microvision Inc., “PicoP® Display Engine—How it Works”, 1996-2012. |
Primesense Corporation, “PrimeSensor Nite 1.1”, USA, year 2010. |
Arm Ltd., “AMBA Specification: AHB”, Version 2, pp. 35-92, year 1999. |
Commission Regulation (EC) No. 1275/2008, Official Journal of the European Union, Dec. 17, 2008. |
Primesense, “Natural Interaction”, YouTube Presentation, Jun. 9, 2010 http://www.youtube.com/watch?v=TzLKsex43zl˜. |
Manning et al., “Foundations of Statistical Natural Language Processing”, chapters 6,7,9 and 12, MIT Press 1999. |
U.S. Appl. No. 12/762,336 Official Action dated May 15, 2012. |
Tobii Technology, “The World Leader in Eye Tracking and Gaze Interaction”, Mar. 2012. |
Noveron, “Madison video eyewear”, year 2012. |
International Application PCT/IB2012/050577 Search Report dated Aug. 6, 2012. |
U.S. Appl. No. 12/683,452 Official Action dated Sep. 7, 2012. |
Koutek, M., “Scientific Visualization in Virtual Reality: Interaction Techniques and Application Development”, PhD Thesis, Delft University of Technology, 264 pages, Jan. 2003. |
Azuma et al., “Recent Advances in Augmented Reality”, IEEE Computer Graphics and Applications, vol. 21, issue 6, pp. 34-47, Nov. 2001. |
Breen et al., “Interactive Occlusion and Collision of Real and Virtual Objects in Augmented Reality”, Technical Report ECRC-95-02, ECRC, Munich, Germany, 22 pages, year 1995. |
Burdea et al., “A Distributed Virtual Environment with Dextrous Force Feedback”, Proceedings of Interface to Real and Virtual Worlds Conference, pp. 255-265, Mar. 1992. |
Gordon et al., “The use of Dense Stereo Range Date in Augmented Reality”, Proceedings of the 1st International Symposium on Mixed and Augmented Reality (ISMAR), Darmstadt, Germany, pp. 1-10, Sep. 30-Oct. 1, 2002. |
Agrawala et al., “The two-user Responsive Workbench :support for collaboration through individual views of a shared space”, Proceedings on the 24th conference on computer graphics and interactive techniques (SIGGRAPH 97), Los Angeles, USA, pp. 327-332 , Aug. 3-8, 1997. |
Harman et al., “Rapid 2D-to 3D conversion”, Proceedings of SPIE Conference on Stereoscopic Displays and Virtual Reality Systems, vol. 4660, pp. 78-86, Jan. 21-23, 2002. |
Hoff et al., “Analysis of head pose accuracy in augmented reality”, IEEE Transactions on Visualization and Computer Graphics, vol. 6, No. 4, pp. 319-334, Oct.-Dec. 2000. |
Poupyrev et al., “The go-go interaction technique: non-liner mapping for direct manipulation in VR”, Proceedings of the 9th annual ACM Symposium on User interface software and technology (UIST '96), Washington, USA, pp. 79-80, Nov. 6-8, 1996. |
Wexelblat et al., “Virtual Reality Applications and Explorations”, Academic Press Inc., San Diego, USA, 262 pages, year 1993. |
U.S. Appl. No. 13/161,508 Office Action dated Apr. 10, 2013. |
U.S. Appl. No. 12/683,452 Office Action dated Jun. 7, 2013. |
Miller, R., “Kinect for XBox 360 Review”, Engadget, Nov. 4, 2010. |
U.S. Appl. No. 13/161,508 Office Action dated Sep. 9, 2013. |
International Application PCT/IB2013/052332 Search Report dated Aug. 26, 2013. |
U.S. Appl. No. 13/314,210 Office Action dated Jul. 19, 2013. |
U.S. Appl. No. 13/314,207 Office Action dated Aug. 5, 2013. |
Sun et al., “SRP Based Natural Interaction Between Real and Virtual Worlds in Augmented Reality”, Proceedings of the International Conference on Cyberworlds (CW'08), pp. 117-124, Sep. 22-24, 2008. |
Schmalstieg et al., “The Studierstube Augmented Reality Project”, Presence: Teleoperators and Virtual Environments, vol. 11, No. 1, pp. 33-54, Feb. 2002. |
Ohta et al., “Share-Z: Client/Server Depth Sensing for See-Through Head-Mounted Displays”, Presence: Teleoperators and Virtual Environments, vol. 11, No. 2, pp. 176-188, Apr. 2002. |
Gobbetti et al., “VB2: an Architecture for Interaction in Synthetic Worlds”, Proceedings of the 6th Annual ACM Symposium on User Interface Software and Technology (UIST'93), pp. 167-178, Nov. 3-5, 1993. |
Gargallo et al., “Bayesian 3D Modeling from Images Using Multiple Depth Maps”, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), vol. 2, pp. 885-891, Jun. 20-25, 2005. |
Zrro Ltd., “TeleTouch Device”, year 2011 (http://www.zrro.com/products.html). |
Berliner et al., U.S. Appl. No. 61/732,354, filed Dec. 12, 2012. |
Shpunt et al., U.S. Appl. No. 61/764,554, filed Feb. 14, 2013. |
U.S. Appl. No. 13/592,352 Office Action dated Jan. 29, 2015. |
U.S. Appl. No. 13/161,508 Office Action dated Aug. 18, 2015. |
Bailly et al., “Gesture-aware remote controls: guidelines and interaction technique”, Proceedings of the 13th international conference on multimodal interfaces, pp. 263-270, 2011. |
U.S. Appl. No. 13/904,050 Office Action dated Jan. 15, 2016. |
Number | Date | Country | |
---|---|---|---|
20130263036 A1 | Oct 2013 | US |
Number | Date | Country | |
---|---|---|---|
61652899 | May 2012 | US | |
61504339 | Jul 2011 | US | |
61521448 | Aug 2011 | US | |
61523349 | Aug 2011 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13541786 | Jul 2012 | US |
Child | 13904052 | US |