This application is a U.S. National Stage Application under 35 U.S.C. 371 of International Patent Application No. PCT/EP2020/074950, filed Sep. 7, 2020, which is incorporated herein by reference in its entirety.
This application claims the benefit of European Patent Application No. 19306153, filed Sep. 20, 2019, which is incorporated herein by reference in its entirety.
The present disclosure relates generally to user interfaces and in particular to a user interface for Virtual Reality (VR) and Augmented Reality (AR) environments.
This section is intended to introduce the reader to various aspects of art, which may be related to various aspects of the present disclosure that are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present disclosure. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.
VR and AR technology provides an immersive experience to users running such applications on suitable displays such as smartTVs, PCs, projectors, head-mounted displays (HMDs), etc. Through a user interface, the user is typically able to interact with the VR or AR environment, for example to handle objects therein. To do this, it is necessary to receive user input, which for example can be done by user tracking systems using sensors (wearable or not) or cameras that detect color and depth. For instance, a connected smartTV equipped with a color/depth camera could be able to track the shape and the position of each element of the user's body, which could enable the use of hand gestures in a natural interaction system. This could give a user a way of interacting with a AR or VR scene or object in for example a broadcast TV program, a content item from the Internet or content shared during a video conference.
Conventional user interfaces can overlay images of the user's hands, which makes interaction with buttons and other menu items relatively straightforward. Interaction with objects is more difficult since the objects do not always react as in the real world.
Another conventional solution provides on the screen indications of the user's hand position and enable the user to interact with a virtual object to zoom in/out and rotate the object.
However, in particular when interacting with object, such user interfaces tend to be difficult to use, in particular for new users as they are not always intuitive.
It will thus be appreciated that there is a desire for a solution that addresses at least some of the shortcomings of user interfaces for AR and VR. The present principles provide such a solution.
In a first aspect, the present principles are directed to a method in a device for rendering a virtual reality or augmented reality environment. The method comprises in at least one hardware processor of the device rendering an overlay on a display, calculating a position of a centre indicator of the overlay based on detected positions of a user's hands, enabling at least one action based on the position of the centre indicator relative to a neutral zone, and performing, in response to user hand movement, an enabled action.
In a second aspect, the present principles are directed to a device for rendering a virtual reality or augmented reality environment, the device comprising at least one hardware processor configured to render an overlay on a display, calculate a position of a centre indicator of the overlay based on positions of a user's hands, enable at least one action based on the position of the centre indicator relative to a neutral zone, and perform, in response to user hand movement, an enabled action.
In a third aspect, the present principles are directed to a computer program product which is stored on a non-transitory computer readable medium and includes program code instructions executable by a processor for implementing the steps of a method according to any embodiment of the second aspect.
Features of the present principles will now be described, by way of non-limiting example, with reference to the accompanying drawings, in which:
The rendering device 110, the display 120 and the camera 130 have been depicted as separate entities, but in different variants, they are combined in every possible way. For example the three entities can be incorporated into a single device such as a television, the display 120 and the rendering device 110 to be together while the camera 130 is separate, or for the rendering device 110 to be separate from a display including a camera.
The display 120 is operative to display to a user (not shown) video received from the rendering device. The display 120 can for example be a television, a computer screen, a head-mounted display (HMD), a tablet or a smartphone.
The camera 130, which can be a depth and colour camera, is operative to capture images and/or video of whatever is in front of its aperture. In interactive system 100, the camera 130 will typically be arranged to capture an environment including the user. Other gesture detectors detecting positions of a user's hands are also possible, such as one or more sensors worn by the user.
The rendering device 110 can for example be a computer, a set-top box, a gaming console, a tablet, a smartphone or the like. The rendering device 110 typically includes a user input interface 111, at least one hardware processor (“processor”) 112, memory 113, a network interface 114 and an output interface, for example a display interface 115 or a speaker. The interfaces are at least partly implemented in hardware. It will be appreciated that parts of the device 110 that are not useful for understanding the present principles are not illustrated for the sake of brevity of description.
The user input interface 111 can be wireless and configured to receive input from an input device (not shown), but it can also be a touch screen, an interface to a keyboard and/or a mouse, and the like. The user input interface is configured to provide received input to the processor 112. The processor 112 is configured to execute program code instructions to perform a method according to the present principles described with reference to
A salient aspect of the present principles is the use of an overlay on top of the VR/AR scene.
In step S210 the user interface (120 in
Upon detection of a match, the user interface renders, in step S220, a overlay on the AR/VR scene.
The overlay 300 can include a centre indicator 310, a neutral zone 320 (diagonally hatched in grey) and a crown 330 that includes a first and a second lock position 341, 342.
The centre indicator 310 indicates the centre of the crown 330. When the centre indicator 310 is located inside the neutral zone 320, certain user actions are inhibited, as will be described. Conversely, actions can be enabled only when the centre indicator 310 is located inside the neutral zone 320. The neutral zone 320 can be invisible (or more transparent) when the centre indicator 310 is located inside, and made visible or less transparent when the centre indicator 310 is outside.
The lock positions 341, 342, preferably hand-shaped, indicate where the user has to put its hands to lock or unlock a scene or object, as will be described.
The positions of the user's hands are represented by a right hand indicator 351 and a left hand indicator 352, illustrated in exemplary positions.
In step S230, the user interface waits for the user to lock on to an object by a specific hand gesture, which in the present example is putting the hands over (or at least essentially over) the lock positions 341, 342 for a given period of time; in other words, to place the hand indicators 351, 352 on top of the lock positions 341, 342 for a given period of time, which can be 0.5 seconds, 1 second, 1.5 seconds and so on.
In an embodiment, the lock positions 341, 342 blinks when there is no lock. In addition, the user interface can render text to inform the user how to lock on to an object.
Upon lock on to the object, the user interface detects and renders, preferably essentially continuously, in step S240, the position of the centre indicator 310. In particular, the user interface can detect whether the centre indicator 310 lies inside or outside the neutral zone 320. When locked, the centre indicator 310 typically corresponds, possibly with some lag, to the centre of the user's two hands, i.e. the centre of the hand indicators 351, 352. It is noted that the crown 330 typically follows the movement of the centre indicator 310.
In step S250, the user interface enables and/or disables user actions depending on the position of the centre indicator 310 relative to the neutral zone 320.
For example, when inside, enabled user actions can include:
As will be appreciated, the neutral zone can make the AR/VR scene less prone to for example accidental rotations since small horizontal or vertical hand movements will have no effect while the centre indicator 310 is in the neutral zone 320.
The user interface can indicate the possibilities by for example rendering icons indicative of enabled actions. Additionally, icons can be rendered in a different aspect (for example greyed or faded) for disabled actions, as is conventional in user interfaces. The icons can be rendered for a limited time period before disappearing or fading away. Preferably, the icons are rendered in a peripheral area so as not to hide much of the scene or object.
It is noted that other user actions, such as moving an object, can be implemented by the user interface.
In step S260, the user interface responds to user actions tracked through hand position detection. The user actions can include the ones described with reference to step S250. Further actions can include unlocking (to be described with reference to step S270), and hiding and displaying the overlay 300 when the user crosses the hands, i.e. brings the right hand to the left of the left hand (and it is noted that this can be disabled when the user performs a rotation through circular movement of the hands, so as to enable great circular rotation, e.g. 180°. Alternatively, the hand indicators 351, 352 can be prevented from switching sides so that the left is always to the left of or at the same vertical position as the right (except, for example, when performing a circular movement).
It is noted that horizontal or vertical rotation of an object can be temporarily disabled as the centre indicator 310 passes through the neutral zone 320 on its way to the other side.
In step S270, the user interface detects unlock of the object or scene. This can be done by the user putting the hand indicators 351, 352 on top of the lock positions 341, 342 for a given time. As can be seen, the user can do the same as for locking on to the object or scene. The unlock may also require that the center indicator 310 is in the neutral zone 320.
In a variant suitable for one-hand use, the overlay can include a single lock position over which the user should put a hand indicator for a given time in order to lock the scene or object, while unlocking is done the same way. In the variant, enabled user actions can be limited to moving an object.
It will thus be appreciated that the present principles can be used to provide a AR/VR system that provides a user with information on enabled user actions and also, through the use of the neutral zone, to provide an AR/VR experience that is less subject to ‘jitter’ caused by small hand movements.
It should be understood that the elements shown in the figures may be implemented in various forms of hardware, software or combinations thereof. Preferably, these elements are implemented in a combination of hardware and software on one or more appropriately programmed general-purpose devices, which may include a processor, memory and input/output interfaces.
The present description illustrates the principles of the present disclosure. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the disclosure and are included within its scope.
All examples and conditional language recited herein are intended for educational purposes to aid the reader in understanding the principles of the disclosure and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions.
Moreover, all statements herein reciting principles, aspects, and embodiments of the disclosure, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.
Thus, for example, it will be appreciated by those skilled in the art that the block diagrams presented herein represent conceptual views of illustrative circuitry embodying the principles of the disclosure. Similarly, it will be appreciated that any flow charts, flow diagrams, and the like represent various processes which may be substantially represented in computer readable media and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.
The functions of the various elements shown in the figures may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Moreover, explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (DSP) hardware, read only memory (ROM) for storing software, random access memory (RAM), and non-volatile storage.
Other hardware, conventional and/or custom, may also be included. Similarly, any switches shown in the figures are conceptual only. Their function may be carried out through the operation of program logic, through dedicated logic, through the interaction of program control and dedicated logic, or even manually, the particular technique being selectable by the implementer as more specifically understood from the context.
In the claims hereof, any element expressed as a means for performing a specified function is intended to encompass any way of performing that function including, for example, a) a combination of circuit elements that performs that function or b) software in any form, including, therefore, firmware, microcode or the like, combined with appropriate circuitry for executing that software to perform the function. The disclosure as defined by such claims resides in the fact that the functionalities provided by the various recited means are combined and brought together in the manner which the claims call for. It is thus regarded that any means that can provide those functionalities are equivalent to those shown herein.
Number | Date | Country | Kind |
---|---|---|---|
19306153 | Sep 2019 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/074950 | 9/7/2020 | WO |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2021/052800 | 3/25/2021 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5568603 | Chen | Oct 1996 | A |
6874126 | Lapidous | Mar 2005 | B1 |
8274535 | Hildreth | Sep 2012 | B2 |
9122311 | Galor | Sep 2015 | B2 |
9229235 | Ballard | Jan 2016 | B2 |
9317130 | Krepec | Apr 2016 | B2 |
9383894 | Schwesinger et al. | Jul 2016 | B2 |
9460314 | Ballard | Oct 2016 | B2 |
9529513 | Balan et al. | Dec 2016 | B2 |
9582091 | Kim et al. | Feb 2017 | B2 |
9727211 | Ballard | Aug 2017 | B2 |
9829989 | Mcgrath et al. | Nov 2017 | B2 |
9996221 | Ballard | Jun 2018 | B2 |
10254846 | Kinstner | Apr 2019 | B1 |
10254920 | Reily | Apr 2019 | B2 |
10261594 | Marcolina | Apr 2019 | B2 |
10466858 | Ballard | Nov 2019 | B2 |
10551932 | Ito | Feb 2020 | B2 |
10558325 | Athey | Feb 2020 | B2 |
10692237 | Sompura | Jun 2020 | B2 |
10852913 | Yoganandan | Dec 2020 | B2 |
11334212 | Ravasz | May 2022 | B2 |
11475661 | Carter | Oct 2022 | B2 |
20030020755 | Lemelson | Jan 2003 | A1 |
20030210227 | Smith | Nov 2003 | A1 |
20080018595 | Hildreth | Jan 2008 | A1 |
20130055150 | Galor | Feb 2013 | A1 |
20130103446 | Bragdon | Apr 2013 | A1 |
20130127825 | Joshi | May 2013 | A1 |
20130135353 | Wheeler | May 2013 | A1 |
20130246967 | Wheeler | Sep 2013 | A1 |
20140282274 | Everitt | Sep 2014 | A1 |
20150153571 | Ballard | Jun 2015 | A1 |
20150153826 | Ballard | Jun 2015 | A1 |
20150153912 | Reily | Jun 2015 | A1 |
20150153913 | Ballard | Jun 2015 | A1 |
20150153922 | Ballard | Jun 2015 | A1 |
20150156028 | Ballard | Jun 2015 | A1 |
20150156803 | Ballard | Jun 2015 | A1 |
20160011724 | Wheeler | Jan 2016 | A1 |
20160091982 | Mongia et al. | Mar 2016 | A1 |
20160098579 | Ballard | Apr 2016 | A1 |
20160224123 | Antoniac | Aug 2016 | A1 |
20160274732 | Bang et al. | Sep 2016 | A1 |
20160306431 | Stafford et al. | Oct 2016 | A1 |
20160350971 | Athey | Dec 2016 | A1 |
20170017361 | Ballard | Jan 2017 | A1 |
20170097687 | Pinault | Apr 2017 | A1 |
20170354883 | Benedetto | Dec 2017 | A1 |
20180088677 | Zhang et al. | Mar 2018 | A1 |
20180143693 | Calabrese | May 2018 | A1 |
20180217672 | Ito | Aug 2018 | A1 |
20180284962 | Ballard | Oct 2018 | A1 |
20190079589 | Spalla | Mar 2019 | A1 |
20190107896 | Stafford | Apr 2019 | A1 |
20190371066 | Shiff | Dec 2019 | A1 |
20210248371 | Carter | Aug 2021 | A1 |
20220301231 | Eirinberg | Sep 2022 | A1 |
20230038709 | Ramani | Feb 2023 | A1 |
20230139977 | Carter | May 2023 | A1 |
Number | Date | Country |
---|---|---|
WO 2018071004 | Apr 2018 | WO |
Entry |
---|
Hingorani Mohit, “Gesture based Data Interaction using the Kinect”, May 16, 2014, YouTube, https://www.youtube.com/watch?v=Ew29FzodkxE, 2 pages. |
Anonymous, “Interaction Engine 1.0—Object Interactions, UI Toolkit, Handheld Controller Support and More”, Leap Motion Blog, Jun. 7, 2017, URL: http://blog.leapmotion.com/interaction-engine/, 18 pages. |
Blaga et al., “[POSTER] Usability Analysis of an Off-the-Shelf Hand Posture Estimation Sensor for Freehand Physical Interaction in Egocentric Mixed Reality”, Institure of Electronics and Electrical Engineers (IEEE), 2017 IEEE International Symposium on Mixed and Augmented Reality Adjunct Proceedings, Nantes, France, Oct. 9, 2017, 4 pages. |
Majd Jordan, “Hand-tracked-natural-interaction-in-VR”, Blue Penguin Blog, Apr. 24, 2017, URL: https://blog.bluepengu.in/hand-tracked-natural-interaction-in-vr-5e5ae43ca3ff, 6 pages. |
Number | Date | Country | |
---|---|---|---|
20220342485 A1 | Oct 2022 | US |