This disclosure relates to virtual reality (VR), augmented reality (AR), and mixed reality (MR) technologies.
Mixed reality (MR), sometimes referred to as hybrid reality, is the term commonly applied to the merging of real or physical world and virtual worlds to produce new environments and visualizations where physical and digital objects co-exist and interact. Mixed reality visualizations and environments can exists in the physical world, the virtual world, and can include a mix of reality, virtual reality (VR), and augmented reality (AR) via immersive technology including interactive environments and interactive three-dimensional (3D) virtual objects. Users of MR visualizations and environments can move around the MR visualizations and interact with virtual objects within the virtual environment.
Interactive 3D virtual objects can be complex and contain large amounts of information that describe different features of the virtual objects, including the geometry, appearance, scenery, and animation of the virtual objects. Particular features of a virtual object may include shape, surface geometry, color, texture, material type, light sources, cameras, peripheral objects, animation, physical properties, and kinematics.
MR, VR, and AR (or similar) devices can provide complex features and high-fidelity of representations of a physical world that can be useful in instruction or various types of training curricula or programs.
An aspect of the disclosure provides a method for operating a virtual environment provided by a mixed reality platform. The method can include receiving, at the platform, first position and orientation information from a first user device based on a first position and orientation of the first user device. The method can include importing, at the platform, data related to a first virtual object from the virtual environment based on the first position and orientation information. The method can include rendering the first virtual object as a rendered object relative to the first position and orientation. The method can include generating a two dimensional representation of the rendered object. The method can include transmitting the two dimensional representation to the first user device. The method can include causing a user device to display the generated visual representation.
The method can include receiving second position and orientation information from the first user device indicating a second position and orientation of the first user device.
The method can include rendering the first virtual object relative to the second position and orientation. The method can include generating a second two dimensional representation of the rendered object based on the second position and orientation. The method can include transmitting the second two dimensional representation to the first user device.
The method can include generating a visual representation of the rendered object relative to a new field of view from each of one or more possible combinations of position and orientation to which the user can move from the first position and orientation.
The method can include transmitting visual representations associated with the one or more possible combinations of position and orientation. The method can include causing the user device to display the visual representations generated for a possible position and orientation that matches a second position and orientation different from the first position and orientation.
The generating can include generating the two dimensional representation of portions of the rendered virtual object that user would see relative to a field of view of that user.
The two dimensional representation of the rendered object comprises a point cloud or a two dimensional image of the rendered object.
Another aspect of the disclosure provides a non-transitory computer-readable medium for operating a virtual environment provided by a mixed reality platform. The non-transitory computer-readable medium can have instructions that when executed by one or more processors, cause the one or more processors to receive first position and orientation information from a first user device based on a first position and orientation of the first user device. The instructions can further cause the one or more processors to import data related to a first virtual object from the virtual environment based on the first position and orientation information. The instructions can further cause the one or more processors to render the first virtual object as a rendered object relative to the first position and orientation. The instructions can further cause the one or more processors to generate a two dimensional representation of the rendered object. The instructions can further cause the one or more processors to transmit the two dimensional representation to the first user device. The instructions can further cause the one or more processors to cause a user device to display the generated visual representation.
The details of embodiments of the present disclosure, both as to their structure and operation, can be gleaned in part by study of the accompanying drawings, in which like reference numerals refer to like parts, and in which:
This disclosure relates to different approaches for using a virtual reality (VR) device to emulate user experience of an augmented reality (AR) device.
Reference throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics of each of the described embodiments may be combined in any suitable manner in one or more embodiments.
As shown in
Some of the sensors 124 (e.g., inertial, optical, and location sensors) are used to track the pose (e.g., position and orientation) of a user in virtual environments and physical environments. Tracking of user position and orientation (e.g., of a user head or eyes) is commonly used to determine fields of view, and each field of view is used to determine what virtual content is to be rendered using the processor 126 for presentation to the user on a display of a user device. Rendering can refer to performing the calculations necessary to assimilate a virtual scene for a given virtual environment. The output of the rendering can be a collection of scene data which includes geometry, viewpoint, texture, lighting, and shading information. The scene data is used to generate a pixelated version to display on 3D capable user device. In some examples, the rendering can also generate a photorealistic or non-photorealistic image from a 2D or 3D model. Tracking the positions and orientations of the user or any user input device (e.g., a handheld device) may also be used to determine interactions with virtual content. In some embodiments, an interaction with virtual content (e.g., a virtual object) includes a modification (e.g., change color or other) to the virtual content that is permitted after a tracked position of the user or user input device intersects with a point of the virtual content in a geospatial map of a virtual environment, and after a user-initiated command is provided to make the desired modification. Positions in a physical environment may be tracked in different ways, including positioning using Global Navigation Satellite Systems (GNSS), Bluetooth, WiFi, an altimeter, or any other known way to estimate the position of a thing (e.g., a user) in a physical environment.
Some of the sensors 124 (e.g., cameras and other optical sensors of AR devices) may also be used to capture information about a physical environment, which is used to generate virtual representations of that information, or to generate geospatial maps of the physical environment that can be used to determine where and how to present virtual content among physical objects of the physical environment. Such virtual representations and geospatial maps may be created using any known approach. In one approach, many two-dimensional images are captured by a camera of an AR device, those two-dimensional images are used to identify three-dimensional points in the physical environment, and the three-dimensional points are used to determine relative positions, relative spacing and structural characteristics (e.g., surfaces and depths) of physical objects in the physical environment. Other optical sensors may be used in addition to a camera (e.g., a depth sensor). Textures, colors and other features of physical objects or physical environments can be determined by analysis of individual images.
Examples of the user devices 120 include VR, AR, MR and general computing devices with displays, including head-mounted displays, sensor-packed wearable devices with a display (e.g., glasses), mobile phones, tablets, desktop computers, laptop computers, or other computing devices that are suitable for carrying out the functionality described in this disclosure.
The methods or processes outlined and described herein and particularly those that follow below, can be performed by one or more processors of the platform 110 either alone or in connection or cooperation with the user device(s) 120. The processes can also be performed using distributed or cloud-based computing.
This disclosure is related to rendering or pre-rendering virtual content on the platform 110 (e.g., a server) for display on a user device 120. The system (e.g., the platform 110 in conjunction with the user device 120) can determine the direction a user and/or associated user device is/are moving so that images displayed via the respective AR/VR/MR device can also be pre-rendered on the server. This can reduce processing requirements at the user device. For example, a user device having limited processing power may be participating in a high-resolution virtual collaboration session. Accordingly, the user device may need to perform complicated calculations on large virtual objects to determine which portion of the object to display as the user moves about the (virtual) space. Systems and methods disclosed herein allow the user device to offload that complexity onto a server (e.g., the platform 110). While some servers have few limitations on processing power, network connections can impose certain speed limitations on how fast the server can calculate the rendered image and then transmit associated data back to the user device. Processing power limitations on the user device on the other hand may limit the ability to quickly render new images so that the associated operator or user wearing the AR/VR/MR device does not experience video delayed from actual or virtual motion. This can lead to negative user experience and even motion sickness, caused by even the slightest jitter in the virtual environment or lag between motion and visual display adjustments.
In some examples, the platform 110 can perform certain complicated calculations to produce (e.g., render) the scene data (geometry, viewpoint, texture, lighting and shading). The platform can then use the resulting scene data to produce a 2D image/video and send that to the device for display.
The systems and methods disclosed herein allow platform 110 to “guess” or otherwise estimate which direction and to where the user might be moving next and pre-calculate or render virtual content before arrival at the user device. For example, the server can provide a relatively constant stream of (rendered) images (image data) to the user device. The user device can then provide motion information back to the server related to where the user/user device actually moved. The server can then guess (e.g., estimate) again where the user will move next and pre-render the virtual object information (images) and transmit the data the user device 120.
At block 230 the server can receive information from a user device operated by a user (e.g., position and orientation of the user).
At block 235, the server can import a virtual object to render (e.g., in response to a request by the user, or other reason).
At block 240, the server can render the virtual object relative to a field of view of the user originating from the position and orientation of the user.
At block 245, the server can generate a visual representation (e.g., a 2D image, a point cloud) of portions of the rendered object the user would see relative to the field of view of the user, and transmit the visual representation for display on the user device.
At block 250, for each of one or more possible combinations of position and orientation to which the user can move from the user's current position and orientation, the server can generate a visual representation of the rendered object relative to a field of view originating from that possible combination of position and orientation.
The method of
The previously described methods provide visual representations (e.g., a 2D image, a point cloud) of virtual objects (e.g., three-dimensional virtual objects) to VR, AR and/or MR user devices instead of rendering the virtual objects using the VR, AR and/or MR user devices. Each visual representation may be of different quality, including a level of visual quality (e.g., resolution, detail, etc.) that matches the quality of a rendered version of the virtual object. The visual representations may be generated by capturing an image of the rendered object with a virtual camera from the position and orientation of that user, or by using a different approach. The visual representation may be presented to users of the VR, AR and/or MR user devices instead of a rendered version of the virtual object. The visual representation may be presented to users of the VR, AR and/or MR user devices in addition to other visual representations of other virtual objects, or in addition to rendered versions of other virtual objects. Each visual representation may be generated based on a rendered version of a virtual object that is rendered by a processor (e.g., a server) that is remote from each of the VR, AR and/or MR user devices, and that has greater processing capability than each of the VR, AR and/or MR user devices.
Use of the visual representation offers several advantages, including: (i) reduced bandwidth use between a server and each of the VR, AR and/or MR user devices (e.g., where transmission of the visual representation uses less data than transmission of the three-dimensional virtual object); (ii) ability to transmit image data of a virtual object to a user device above a minimum threshold speed that is needed to prevent an adverse user experience (e.g., to prevent the user from feeling sick when viewing delayed updates to changing image data); (iii) ability to display high-quality visual representations of three-dimensional virtual objects on devices with limited processing capability when that limited processing capability would prohibit rendering the three-dimensional virtual objects and displaying a rendered version of the three-dimensional virtual objects; (iv) elimination of expensive and bulky processing hardware in the VR, AR and/or MR user devices since less processing capability is needed to display the visual representation compared to the three- dimensional virtual object (which reduces weight, size and battery usage of VR, AR and/or MR user devices); (v) increased security by only transmitting images of certain portions of a virtual object that are in view of a user while not transmitting other portions (e.g., sensitive or confidential portions) of the virtual object (e.g., components inside the virtual object) that would otherwise be transmitted if the virtual object were provided to the VR, AR and/or MR user devices; and (vi) ability to display the same or similar visual representations of virtual objects on different VR, AR or MR user devices when the same or similar virtual object cannot be displayed on each of those different VR, AR or MR user devices. Each of these advantages are technical solutions to technical problems (e.g., data transmission, processing limitations, security, reduction of cost, reduction of battery usage, improved user experience, increased consumer use of smaller and/or lighter devices, etc.).
Each method of this disclosure can be used with virtual reality (VR), augmented reality (AR), and/or mixed reality (MR) technologies. Virtual environments and virtual content may be presented using VR technologies, AR technologies, and/or MR technologies.
Methods of this disclosure may be implemented by hardware, firmware or software.
One or more non-transitory machine-readable media embodying program instructions that, when executed by one or more machines, cause the one or more machines to perform or implement operations comprising the steps of any of the methods or operations described herein are contemplated. As used herein, machine-readable media includes all forms of machine-readable media (e.g. non-volatile or volatile storage media, removable or non-removable media, integrated circuit media, magnetic storage media, optical storage media, or any other storage media) that may be patented under the laws of the jurisdiction in which this application is filed, but does not include machine-readable media that cannot be patented under the laws of the jurisdiction in which this application is filed.
By way of example, machines may include one or more computing device(s), processor(s), controller(s), integrated circuit(s), chip(s), system(s) on a chip, server(s), programmable logic device(s), other circuitry, and/or other suitable means described herein (e.g., the platform 110, the user device 120) or otherwise known in the art. One or more machines that are configured to perform the methods or operations comprising the steps of any methods described herein are contemplated.
Systems that include one or more machines and the one or more non-transitory machine-readable media embodying program instructions that, when executed by the one or more machines, cause the one or more machines to perform or implement operations comprising the steps of any methods described herein are also contemplated.
Method steps described herein may be order independent, and can therefore be performed in an order different from that described. It is also noted that different method steps described herein can be combined to form any number of methods, as would be understood by one of skill in the art. It is further noted that any two or more steps described herein may be performed at the same time. Any method step or feature disclosed herein may be expressly restricted from a claim for various reasons like achieving reduced manufacturing costs, lower power consumption, and increased processing efficiency. Method steps can be performed at any of the system components shown in the figures.
Systems comprising one or more modules that perform, are operable to perform, or adapted to perform different method steps/stages disclosed herein are also contemplated, where the modules are implemented using one or more machines listed herein or other suitable hardware. When two things (e.g., modules or other features) are “coupled to” each other, those two things may be directly connected together, or separated by one or more intervening things. Where no lines and intervening things connect two particular things, coupling of those things is contemplated in at least one embodiment unless otherwise stated. Where an output of one thing and an input of another thing are coupled to each other, information sent from the output is received by the input even if the data passes through one or more intermediate things. Different communication pathways and protocols may be used to transmit information disclosed herein. Information like data, instructions, commands, signals, bits, symbols, and chips and the like may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, or optical fields or particles.
The words comprise, comprising, include, including and the like are to be construed in an inclusive sense (i.e., not limited to) as opposed to an exclusive sense (i.e., consisting only of). Words using the singular or plural number also include the plural or singular number, respectively. The word or and the word and, as used in the Detailed Description, cover any of the items and all of the items in a list. The words some, any and at least one refer to one or more. The term may is used herein to indicate an example, not a requirement—e.g., a thing that may perform an operation or may have a characteristic need not perform that operation or have that characteristic in each embodiment, but that thing performs that operation or has that characteristic in at least one embodiment.
This application claims priority to U.S. Provisional Patent Application Ser. No. 62/633,579, filed Feb. 21, 2018, entitled “SYSTEMS AND METHODS FOR GENERATING VISUAL REPRESENTATIONS OF A VIRTUAL OBJECT FOR DISPLAY BY USER DEVICES,” U.S. Provisional Patent Application Ser. No. 62/633,581, filed Feb. 21, 2018, entitled “SYSTEMS AND METHODS FOR GENERATING DIFFERENT LIGHTING DATA FOR A VIRTUAL OBJECT,” and to U.S. Provisional Patent Application Ser. No. 62/638,567, filed Mar. 5, 2018, entitled “SYSTEMS AND METHODS FOR GENERATING OR SELECTING DIFFERENT LIGHTING DATA FOR A VIRTUAL OBJECT,” the contents of which are hereby incorporated by reference in their entirety.
Number | Date | Country | |
---|---|---|---|
62633579 | Feb 2018 | US | |
62633581 | Feb 2018 | US | |
62638567 | Mar 2018 | US |