The present disclosure generally relates to virtual reality, and more particularly relates to systems and methods for integrating a virtual reality experience into a vehicle using multi-view cameras, sensor fusion, and point of interest information.
Vehicle perception systems include a number of cameras. The cameras are integrated into the vehicle body and capture the surrounding environment of the vehicle. Output from the cameras is analyzed in order to control the vehicle in an autonomous or partial autonomous manner.
The view of a rear seat passenger is typically restricted by the window size and location relative to the passenger. Thus, the passenger is unable to enjoy the full landscape along a road trip. Accordingly, it is desirable to provide systems and methods for leveraging the vehicle cameras in order to provide a virtual reality view of the landscape. It is further desirable to provide additional information to the passenger along the road trip. Furthermore, other desirable features and characteristics of the present disclosure will become apparent from the subsequent detailed description and the appended claims, taken in conjunction with the accompanying drawings and the foregoing technical field and background.
Systems and methods are provided for entertaining a passenger of a vehicle by providing an immersive experience. In one embodiment, a method includes: receiving image data from a plurality of camera devices coupled to the vehicle, wherein the image data depicts an environment surrounding the vehicle; receiving point of interest data associated with the environment of the vehicle; fusing, by a processor, the image data and the point of interest data using a localization method; orienting, by a processor, the fused image data based on a position of a user device; and rendering, by a processor, the oriented, fused data on a virtual reality display of the user device.
In various embodiments, the fusing is based on a probabilistic optimization method. In various embodiments, the fusing is further based on a fusing of inertia measurement unit data, global positioning system data, and the image data to determine a location, orientation, and speed of the vehicle in a first coordinate system. In various embodiments, the fusing is further based on fusing image data and inertia measurement data to obtain a result and fusing the result with global positioning system data.
In various embodiments, the fusing is based on a graph pose optimization. In various embodiments, the fusing is further based on fusing global positioning system data and inertia measurement unit data to obtain a result and fusing the result with the image data.
In various embodiments, the fusing is based on a graph pose optimization and an extended Kalman filter.
In various embodiments, the fusing further includes: fusing global positioning system data, inertia measurement unit data, camera data, and point of interest data into a single coordinate system; and transforming the fused data into a second coordinate system, wherein the second coordinate system is a of the user device. In various embodiments, the orienting comprises orienting the transformed data from the second coordinate system to a third coordinate system, where the third coordinate system is based on an orientation of the user device.
In various embodiments, the point of interest data includes at least one of a name, a logo, an address, contact information, sales information, hours of operation, historical facts relative to the point.
In another embodiment, a virtual reality system for a vehicle is provided. The virtual reality system includes a plurality of camera devices configured to be distributed about the vehicle, the camera devices sense an environment associated with the vehicle; and a controller that is configured to, by a processor, receive image data from the plurality of camera devices coupled to the vehicle, wherein the image data depicts an environment surrounding the vehicle; receive point of interest data associated with the environment of the vehicle; fuse the image data and the point of interest data; orient the fused image data based on a position of a user device; and render the oriented, fused data on a virtual reality display of the user device.
In various embodiments, the controller fuses based on a probabilistic optimization method. In various embodiments, the controller fuses further based on a fusing of inertia measurement unit data, global positioning system data, and the image data to determine a location, orientation, and speed of the vehicle in a first coordinate system. In various embodiments, the controller fuses further based on fusing image data and inertia measurement data to obtain a result and fusing the result with global positioning system data.
In various embodiments, the controller fuses based on a graph pose optimization. In various embodiments, the controller fuses further based on fusing global positioning system data and inertia measurement unit data to obtain a result and fusing the result with the image data.
In various embodiments, the controller fuses further based on fusing global positioning system data, inertia measurement unit data, camera data, and point of interest data into a single coordinate system; and transforming the fused data into a second coordinate system, wherein the second coordinate system is a of the user device. In various embodiments, the controller orients based on orienting the transformed data from the second coordinate system to a third coordinate system, wherein the third coordinate system is based on an orientation of the user device.
In various embodiments, the point of interest data includes at least one of a name, a logo, an address, contact information, sales information, hours of operation, historical facts relative to the point.
In another embodiment, a vehicle is provided. The vehicle includes a plurality of camera devices distributed about the vehicle, the camera devices sense an environment associated with the vehicle; and a controller that is configured to, by a processor, receive image data from the plurality of camera devices coupled to the vehicle, wherein the image data depicts an environment surrounding the vehicle; receive point of interest data associated with the environment of the vehicle; fuse the image data and the point of interest data; orient the fused image data based on a position of a user device; and render the oriented, fused data on a virtual reality display of the user device.
The exemplary embodiments will hereinafter be described in conjunction with the following drawing figures, wherein like numerals denote like elements, and wherein:
The following detailed description is merely exemplary in nature and is not intended to limit the application and uses. Furthermore, there is no intention to be bound by any expressed or implied theory presented in the preceding technical field, background, brief summary or the following detailed description. As used herein, the term module refers to any hardware, software, firmware, electronic control component, processing logic, and/or processor device, individually or in any combination, including without limitation: application specific integrated circuit (ASIC), an electronic circuit, a processor (shared, dedicated, or group) and memory that executes one or more software or firmware programs, a combinational logic circuit, and/or other suitable components that provide the described functionality.
Embodiments of the present disclosure may be described herein in terms of functional and/or logical block components and various processing steps. It should be appreciated that such block components may be realized by any number of hardware, software, and/or firmware components configured to perform the specified functions. For example, an embodiment of the present disclosure may employ various integrated circuit components, e.g., memory elements, digital signal processing elements, logic elements, look-up tables, or the like, which may carry out a variety of functions under the control of one or more microprocessors or other control devices. In addition, those skilled in the art will appreciate that embodiments of the present disclosure may be practiced in conjunction with any number of systems, and that the systems described herein is merely exemplary embodiments of the present disclosure.
For the sake of brevity, conventional techniques related to signal processing, data transmission, signaling, control, and other functional aspects of the systems (and the individual operating components of the systems) may not be described in detail herein. Furthermore, the connecting lines shown in the various figures contained herein are intended to represent example functional relationships and/or physical couplings between the various elements. It should be noted that many alternative or additional functional relationships or physical connections may be present in an embodiment of the present disclosure.
With reference to
As depicted in
In various embodiments, the vehicle 10 may be an autonomous vehicle and the virtual reality system 100 may be incorporated into the autonomous vehicle. The autonomous vehicle is, for example, a vehicle that is automatically controlled (fully or partially) to carry passengers from one location to another. The vehicle 10 is depicted in the illustrated embodiment as a passenger car, but it should be appreciated that any other vehicle including motorcycles, trucks, sport utility vehicles (SUVs), recreational vehicles (RVs), marine vessels, aircraft, etc., can also be used.
As shown, the vehicle 10 generally includes a propulsion system 20, a transmission system 22, a steering system 24, a brake system 26, a sensor system 28, an actuator system 30, at least one data storage device 32, at least one controller 34, and a communication system 36. The propulsion system 20 may, in various embodiments, include an internal combustion engine, an electric machine such as a traction motor, and/or a fuel cell propulsion system. The transmission system 22 is configured to transmit power from the propulsion system 20 to the vehicle wheels 16-18 according to selectable speed ratios. According to various embodiments, the transmission system 22 may include a step-ratio automatic transmission, a continuously-variable transmission, or other appropriate transmission. The brake system 26 is configured to provide braking torque to the vehicle wheels 16-18. The brake system 26 may, in various embodiments, include friction brakes, brake by wire, a regenerative braking system such as an electric machine, and/or other appropriate braking systems. The steering system 24 influences a position of the of the vehicle wheels 16-18. While depicted as including a steering wheel for illustrative purposes, in some embodiments contemplated within the scope of the present disclosure, the steering system 24 may not include a steering wheel.
The sensor system 28 includes one or more sensing devices 31a-31n that sense observable conditions of the exterior environment and/or the interior environment of the autonomous vehicle 10. The sensing devices 31a-31n can include, but are not limited to, radars, lidars, global positioning systems, optical cameras, thermal cameras, ultrasonic sensors, inertial measurement units, microphones, and/or other sensors. The actuator system 30 includes one or more actuator devices 42a-42n that control one or more vehicle features such as, but not limited to, the propulsion system 20, the transmission system 22, the steering system 24, and the brake system 26. In various embodiments, the vehicle features controlled by the one or more actuator devices 42a-42n can further include interior and/or exterior vehicle features such as, but are not limited to, doors, a trunk, and cabin features such as air, music, lighting, etc. (not numbered).
In various embodiments, one or more of the sensing devices 31a-31n are the vehicle cameras 102 or other imaging devices. The camera devices 102 are coupled to an exterior of the body 14 of the vehicle 10 and/or coupled to an interior of the vehicle 10 such that they may capture images of the environment surrounding the vehicle 10. For example, an exemplary embodiment of sensing devices 31a-31j that include camera devices 102 distributed about the vehicle 10 is shown in
In the provided example, a first sensing device 31a is positioned at the front left (or driver) side of the vehicle 10 and is oriented 45° counterclockwise relative to the longitudinal axis of the vehicle 10 in the forward direction, and another sensor device 31c may be positioned at the front right (or passenger) side of the vehicle 10 and is oriented 45° clockwise relative to the longitudinal axis of the vehicle 10. Additional sensing devices 31i, 31j are positioned at the rear left and right sides of the vehicle 10 and are similarly oriented at 45° counterclockwise and clockwise relative to the vehicle longitudinal axis, along with sensing devices 31d and 31h positioned on the left and right sides of the vehicle 10 and oriented away from the longitudinal axis so as to extend along an axis that is substantially perpendicular to the vehicle longitudinal axis. The illustrated embodiment also includes a group of sensing devices 31e-31g positioned at or near the vehicle longitudinal axis and oriented to provide forward direction signals in line with the vehicle longitudinal axis.
With reference back to
The data storage device 32 stores data for use in automatically controlling the vehicle 10. In various embodiments, the data storage device 32 stores defined maps of the navigable environment. In various embodiments, the data storage device includes the data storage device 104 and the defined maps include information associated with various points of interest. Such information can include, but is not limited to, names, logos, address, contact information, sales information, hours of operation, historical facts, and/or any other information relative to the points. In various embodiments, the POI information can include images depicting the information and that can be rendered over a virtual reality scene to provide an augmented reality. In various embodiments, the POI information can be separated into different classifications. The classifications can then be selected based on a viewer's interests.
In various embodiments, the defined maps and/or POI information may be predefined by and obtained from a remote system. For example, the defined maps and/or POI information may be assembled by the remote system and communicated to the vehicle 10 (wirelessly and/or in a wired manner) and stored in the data storage device 32. As can be appreciated, the data storage device 32 may be part of the controller 34, separate from the controller 34, or part of the controller 34 and part of a separate system.
The controller 34 includes at least one processor 44 and a computer readable storage device or media 46. The processor 44 can be any custom made or commercially available processor, a central processing unit (CPU), a graphics processing unit (GPU), an auxiliary processor among several processors associated with the controller 34, a semiconductor based microprocessor (in the form of a microchip or chip set), a macroprocessor, any combination thereof, or generally any device for executing instructions. The computer readable storage device or media 46 may include volatile and nonvolatile storage in read-only memory (ROM), random-access memory (RAM), and keep-alive memory (KAM), for example KAM is a persistent or non-volatile memory that may be used to store various operating variables while the processor 44 is powered down. The computer-readable storage device or media 46 may be implemented using any of a number of known memory devices such as PROMs (programmable read-only memory), EPROMs (electrically PROM), EEPROMs (electrically erasable PROM), flash memory, or any other electric, magnetic, optical, or combination memory devices capable of storing data, some of which represent executable instructions, used by the controller 34 in controlling the vehicle 10.
The instructions may include one or more separate programs, each of which comprises an ordered listing of executable instructions for implementing logical functions. The instructions, when executed by the processor 44, receive and process signals from the sensor system 28, perform logic, calculations, methods and/or algorithms for automatically controlling the components of the autonomous vehicle 10, and generate control signals to the actuator system 30 to automatically control the components of the vehicle 10 based on the logic, calculations, methods, and/or algorithms. Although only one controller 34 is shown in
In various embodiments, the controller 34 includes the controller 106 and includes one or more instructions embodied in a virtual reality module 110 that, when executed by the processor 44, receive image data from the sensing devices 31-31j such as the camera devices, receive GPS and/or IMU data, and process the received data to localize the vehicle 10, fuse the data into a 360 degree virtual reality scene with point of interest information. The instructions, when executed by the processor, further cause the virtual reality scene having augmented reality content to be displayed to a passenger wearing the user display device 108.
In various embodiments, the user display device 108 receives the virtual scene and displays a portion of the scene based on an orientation of the user device relative to the environment. The scene depicts a virtual reality of the environment around the vehicle and information relating to certain points of interest. The information can be selectively displayed based on an interest of the passenger. For example, a child passenger may be interested in different points of interest than an adult passenger.
Referring now to
The localization module 112 receives as input GPS data 130, IMU data 132, and camera data 134. The localization module 112 determines an actual location (x={position, orientation, speed}) of the vehicle 10 with respect to the camera images (localizes the vehicle 10). In various embodiments, the localization module 112 localizes the vehicle 10 based on an Extended Kalman filter (EKF). As can be appreciated, the usage of EKF is just merely an example. It should be understood that other means of probabilistic inference optimization, such as Particle Filter, Pose Graph optimization, among many other alternatives, can be applied to solve the targeted problem. In one example, the localization module 112 fuses all sensor outputs including IMU data 132 (IMU), the GPS data 130 (GPS), and the camera data 136-146 (VO) to the EKF as:
In another example, the localization module 112 fuses the data using a multi-tiered approach, where the GPS data 130 (GPS) and the IMU data 132 (IMU) are fused first and the output (VIO) of that fusion is then fused with the camera data 136-146 (VO) as:
In various embodiments, the localization module 112 localizes the vehicle 10 based on a Pose Graph Optimization methods. For example, the localization module 112 fuses the IMU data 132 (IMU), the GPS data 130 (GPS), and the camera data 136-146 (VO) using a pose graph optimization.
In another example, the localization module 112 fuses the data using a multi-tiered approach, where the camera data 136-146 (VO) and the IMU data 132 (IMU) are fused first and the output (GPS/IMU) of that fusion is then fused with the GPS data 130.
The map matching module 114 receives a POI map 148, and the determined location of the vehicle 10. The map matching module 114 retrieves POI information that is located near the vehicle 10. For example, the POI information can be retrieved for a defined radial proximity from the location of the vehicle 10. In various embodiments, the defined radial proximity may be selected based on the current speed of the vehicle 10. For example, the faster the vehicle 10 is going, the greater the defined radius. In another example, the slower the vehicle 10 is traveling, the slower the defined radius.
The video stitching module 116 receives the camera data 136-146 from the camera devices 102 of the vehicle 10. The video stitching module 116 stitches the image data to provide a 360 degree view of the environment surrounding the vehicle 10. The video stitching module 116 stitches the image data based on a location of each camera relative to the vehicle 10 and one or more pixel blending techniques. In various embodiments, the video stitching module 116 stitches image data based on feature matching and random sample consensus (RANSAC) method. For example, the image features includes Harris corners, ORB features, SIFT features and SURF features. The RANSAC with DLT method is used to compute the homography. Then the stitched image data is projected to spherical or cylindrical surface.
The coordinate transformation module 118 receives the image data and the associated POI information data and transforms the coordinates of each into a coordinate system of the user display device 108. In various embodiments, the coordinate transformation module 118 transforms the POI objects from world coordinate system to camera coordinate system. For example, the stitched camera's location and direction in world coordinate are computed in the localization module 112 by using SLAM method. The POI object's location in world coordinate is extracted from module 114. Then the module 118 transforms the world coordinate to camera coordinate based on the camera's intrinsic and extrinsic parameters.
The encoding module 120 receives the transformed image data and the associated POI information data and encodes the data for transmission. In various embodiments, the encoding can be, for example, according to WI-FI or wired communication protocols. The encoding module 120 then transmits the encoded data to the user display device 108.
The decoding module 122 receives as input the encoded data and decodes the received data. In various embodiments, the decoding can be, for example, according to WI-FI or wired communication protocols.
The orientation transformation module 124 receives the decoded data. The orientation transformation module 124 transforms the coordinates of the image data and the corresponding POI information into coordinates that are based on a current orientation of the user display device 108. For example, an orientation of the user display device 108 can be determined relative to a defined location when the user is wearing the device 108 and looking to the left, looking to the right, looking behind the vehicle, looking to the front of the vehicle, etc. and the image data that is displayed is oriented based on the direction the user is looking.
The rendering module 126 receives the oriented data and renders the data for display by the user display device 108.
As shown in more detail with regard to
In one example, the method may begin at 405. Sensor data is received from the camera devices, the IMU, and the GPS at 410. The vehicle 10 is localized at 420, for example, based on an extended Kalman Filter and/or a pose graph optimization as discussed above. The location of the vehicle 10 is matched to a location on the POI map and the corresponding POI information is retrieved at 430. The camera data is stitched to provide a 360 degree view at 440. The POI map information and the camera data are then transformed into a coordinate system of the user device at 450 and encoded for transmission at 460. Thereafter, the user display device 108 receives and decodes the received data at 470 and transforms the data based on the orientation of the user display device 108 at 480 as discussed above and rendered at 490. The method then continues with receiving the sensor data and processing the sensor data in order to display augmented virtual reality content to the user. In this manner, the method provides for a way to entertain a user through displaying to the user an augmented virtual reality of the environment that the vehicle 10 is currently traveling.
While at least one exemplary embodiment has been presented in the foregoing detailed description, it should be appreciated that a vast number of variations exist. It should also be appreciated that the exemplary embodiment or exemplary embodiments are only examples, and are not intended to limit the scope, applicability, or configuration of the disclosure in any way. Rather, the foregoing detailed description will provide those skilled in the art with a convenient road map for implementing the exemplary embodiment or exemplary embodiments. It should be understood that various changes can be made in the function and arrangement of elements without departing from the scope of the disclosure as set forth in the appended claims and the legal equivalents thereof.