The present invention relates generally to VR/AR systems, and more particularly to 6DoF inside-out tracking game controllers, head-mounted device inside-out tracking and multi-agent interaction, and robot position and posture tracking, route planning, and collision avoidance.
The virtual reality (VR) and augmented reality (AR) markets are expected to continue growing rapidly. The development of new hardware and software technology could help the AR/VR market grow even faster. As more applications adopt the technology, systems are required to run faster and to localize more accurately and without drift.
The SLAM (simultaneous localization and mapping) algorithm is widely adopted to improve such systems. However, there are three issues with the SLAM algorithm: the scale-factor problem, the drift problem (even with a stereo camera), and long processing time. The state-of-the-art solutions for drift are on-line loop closure and on-line re-localization (used in ORB-SLAM). Both are based on a bag-of-words approach (storing every patch), but updating this bag of words is very time- and CPU-consuming.
Further, six-degree-of-freedom (6DoF) data of a game controller are needed for the AR/VR system. However, today's game controllers are not efficient and fast enough to produce the 6DoF data in real time. In particular, the three translational dimensions in 3D space are not provided by the game controllers on the market.
Enhancement and improvement of the tracking game controller are required.
Methods and apparatus are provided for 6DoF inside-out tracking game control. In one novel aspect, a multi-processor structure is used for VI-SLAM. In one embodiment, the apparatus obtains overlapping image frames and sensor inputs of the apparatus, wherein the sensor inputs comprise gyrometer data, accelerometer data, and magnetometer data; splits the computation work onto a plurality of vector processors to obtain six-degree-of-freedom (6DoF) outputs of the apparatus based on a splitting algorithm; and performs a localization process to generate 6DoF estimations and a mapping process to generate a cloud of three-dimensional points associated with the descriptors of the map. In one embodiment, the splitting algorithm involves dividing a current frame into N equal parts, and each of a set of selected vector processors processes a portion of the current frame based on a split-by-corner rule, wherein the split-by-corner rule determines whether each pixel is a corner and converts each pixel determined to be a corner into a compressed descriptor by converting the sub-image centered on the pixel into a 16-float descriptor using a base matrix. In one embodiment, the localization process and the mapping process are configured to run sequentially, wherein the localization process is split over all of the vector processors and the mapping process is split over all of the vector processors. In another embodiment, the localization process and the mapping process are configured to run in parallel, wherein the localization process is split over a first subset of the vector processors and the mapping process is split over the remaining subset of the vector processors. In one embodiment, the 6DoF output is in a format selected from an output format group comprising: six floating-point values, with three for the translated 3D space and three for the rotation space; twelve floating-point values, with three for the translated 3D space and nine for the rotation space; six fixed-point values, with three for the translated 3D space and three for the rotation space; and twelve fixed-point values, with three for the translated 3D space and nine for the rotation space. A sketch of these output formats is given below.
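As a minimal illustration of the four output formats (a sketch only; the function name, the fixed-point scaling, and the choice of 3-value rotation parameterization are assumptions, as the text does not fix them):

```python
import numpy as np

def pack_6dof(translation, rotation, fixed_point=False, frac_bits=16):
    """Pack a pose into one of the four 6DoF output formats:
    - rotation of length 3 (e.g. Euler angles)     -> 6 values
    - rotation of length 9 (3x3 matrix, row-major) -> 12 values
    emitted either as floats or as fixed-point integers."""
    vals = np.concatenate([np.asarray(translation, float).ravel(),
                           np.asarray(rotation, float).ravel()])
    assert vals.size in (6, 12)
    if fixed_point:
        # Fixed point: scale by 2^frac_bits and round to integers.
        return np.round(vals * (1 << frac_bits)).astype(np.int32)
    return vals.astype(np.float32)

# Example: twelve floating-point values (translation + rotation matrix).
pose12 = pack_6dof([0.1, 0.0, 1.5], np.eye(3))
```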
In one novel aspect, a map of the background environment is generated in advance. This reference map is a batch of visual features, each with a pre-estimated 3D position and a visual feature description. The map is used for real-time localization. During the localization process, the 3D positions of the features are not updated, so the map is static. Because the map is known, there is no need to map the environment constantly, and because the map is static, the localization does not drift. A potential issue of this approach is a failure of the localization when the device moves too far from the reference map. This problem is solved using a light SLAM algorithm. A sketch of the reference-map structure is given below.
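A minimal sketch of such a reference map as a data structure (the class and field names are illustrative assumptions, not taken from the text):

```python
from dataclasses import dataclass
import numpy as np

@dataclass(frozen=True)      # frozen: map entries are never updated
class MapFeature:
    position: np.ndarray     # pre-estimated 3D position (x, y, z)
    descriptor: np.ndarray   # visual feature description (16 floats)

# The reference map is simply a batch of such features, built in
# advance; keeping it static is what prevents localization drift.
reference_map: list[MapFeature] = []
```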
In one embodiment, a client-server topology is used to deploy the mapping and localization technology, which makes the client computationally lighter and less power-hungry. One or more clients can operate on the server network, or a client can operate on its own without a server at the cost of higher power consumption.
In another embodiment, tracking and localization are based on a known map. This allows fast processing speed, which is useful for VR/AR applications. A calibrated stereo camera is provided in this approach to fix the scale-factor problem.
Other embodiments and advantages are described in the detailed description below. This summary does not purport to define the invention. The invention is defined by the claims.
The accompanying drawings, where like numerals indicate like components, illustrate embodiments of the invention.
Reference will now be made in detail to some embodiments of the invention, examples of which are illustrated in the accompanying drawings.
Game controller 100 also includes an inertial measurement unit (IMU) 131, an optional external memory card (SD card) 132, and one or more wireless interfaces 133, such as a WiFi interface and a Bluetooth interface. An interface module 111 communicates with and controls the sensors, IMU 131, SD card 132, and the wireless interfaces, such as WiFi 133 and Bluetooth 134. A hardware accelerator and image signal processing unit 112 assists the image processing of the sensor inputs. IMU 131 detects the movements, rotations, and magnetic heading of game controller 100. In one embodiment, IMU 131 is an integrated 9-axis sensor for the detection of movements, rotations, and magnetic heading. It comprises a triaxial low-g acceleration sensor, a triaxial angular rate sensor, and a triaxial geomagnetic sensor. IMU 131 senses the orientation, angular velocity, and linear acceleration of game controller 100. In one embodiment, game controller 100 processes IMU data at a frame rate of at least 500 Hz.
In one embodiment, a plurality of cameras is mounted on the outer case of the game controller to generate overlapping views for the game controller. Using multiple cameras with overlapping views has many advantages compared to a monocular solution: the scale factor of the 3D motion does not drift; the 3D points seen in the overlapping area can be triangulated without any motion of the device; the matching in the overlapping area is faster and more accurate using epipolar geometry; and the global field of view is wider, which increases accuracy and reduces jitter. A sketch of such motion-free triangulation is given below.
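For illustration, a point visible in two overlapping views can be triangulated with standard linear (DLT) triangulation without any motion of the device; this sketch assumes calibrated 3×4 projection matrices and is not taken from the text:

```python
import numpy as np

def triangulate(P1, P2, x1, x2):
    """Linear (DLT) triangulation of a 3D point seen in two overlapping
    views; no device motion is needed because the baseline between the
    cameras is known from calibration.
    P1, P2: 3x4 camera projection matrices; x1, x2: 2D pixel coords."""
    A = np.vstack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    # The homogeneous 3D point is the right null vector of A.
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]
    return X[:3] / X[3]
```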
In one novel aspect, the VI-SLAM algorithm is split to run on a plurality of processors based on a splitting algorithm and the sensor inputs.
In one embodiment, the feature detection and extraction procedure 510 is split to run on N vector processors following the splitting rule. Step 511 divides the current frame to be processed into N equal parts. Step 512 assigns each frame part to a corresponding vector processor. Each processor processes one part of the frame following a predefined algorithm. First, corners are detected. For each pixel pi, described by a 2D coordinate in the image, and an adjustable threshold t, pi is determined to be a corner if there exists a set of K contiguous pixels in the neighboring circle that are all brighter than (pi+t) or all darker than (pi−t). In some embodiments, the threshold t is in the range 5<t<200. In another embodiment, K is in the range 5<K<13. In yet another embodiment, the neighboring circle has a radius of three pixels. Subsequently, in the second step, each corner pixel pi is classified, using an n×n sub-image centered on pi, into a compressed descriptor. This is done using a base matrix to convert each sub-image into a 16-float descriptor. The base matrix is computed with a singular value decomposition over a large set of selected features. In one embodiment, the n×n sub-image is 11×11. Let P=(p1, . . . , pn) be the list of feature points (2D coordinates in the image) detected in the current frame, and let D=(d1, . . . , dn) be the list of descriptors pairing each feature point with its associated descriptor. A sketch of this per-processor corner test and descriptor compression is given below.
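The following is a minimal sketch of the corner test and descriptor compression described above, assuming an 8-bit grayscale image held in a NumPy array, pixels away from the image border, and a precomputed 16×n² base matrix; the function names are illustrative:

```python
import numpy as np

# Offsets of the 16 pixels on the radius-3 circle around a pixel.
CIRCLE = [(0,-3),(1,-3),(2,-2),(3,-1),(3,0),(3,1),(2,2),(1,3),
          (0,3),(-1,3),(-2,2),(-3,1),(-3,0),(-3,-1),(-2,-2),(-1,-3)]

def is_corner(img, x, y, t=20, K=9):
    """Pixel (x, y) is a corner if K contiguous pixels on the radius-3
    circle are all brighter than center+t or all darker than center-t."""
    c = int(img[y, x])
    ring = [int(img[y + dy, x + dx]) for dx, dy in CIRCLE]
    for sign in (+1, -1):               # check brighter, then darker
        run = 0
        for v in ring + ring[:K - 1]:   # wrap to catch runs crossing 0
            run = run + 1 if sign * (v - c) > t else 0
            if run >= K:
                return True
    return False

def compress_descriptor(img, x, y, base, n=11):
    """Convert the n-by-n sub-image centered on (x, y) into a 16-float
    descriptor by projecting it onto the SVD-derived base matrix."""
    h = n // 2
    patch = img[y - h:y + h + 1, x - h:x + h + 1].astype(np.float32)
    return base @ patch.ravel()         # base: 16 x (n*n)
```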
In another embodiment, the matching procedure 520 is split onto N vector processors. Step 521 splits the descriptor list into N parts. In one embodiment, the descriptor list is split into N equal parts. Step 522 performs descriptor matching for each descriptor range Di by matching Di with a subset of the map descriptors. The descriptors are split into N equal ranges, and each processor i (0<i<N+1) runs the matching algorithm on its range Di. The descriptors Di are matched with a subset of the descriptors of the map, LocalMap, using the cross-matching method: each match is a pair of descriptors (da, db) such that da is the best candidate for db among the descriptors Di of the current frame and db is the best candidate for da among the descriptors of LocalMap. Some of the descriptors of the map are associated with 3D points geo-referenced in the world (this 3D estimation is performed by the mapping algorithm), so the matching associates each descriptor di of D with a 3D point p3d of LocalMap. The output of the matching is a list of descriptor pairs associating the feature points P with the 3D points of the map: Mi=((p1, p3d1), . . . , (pn, p3dn)). A sketch of the cross-matching method is given below.
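A minimal sketch of the cross-matching (mutual best match) step, assuming descriptors are stored as rows of NumPy arrays; each of the N processors would run this on its own range Di against LocalMap:

```python
import numpy as np

def cross_match(frame_desc, map_desc):
    """Cross-matching: keep a pair (a, b) only if map descriptor b is
    the best candidate for frame descriptor a AND a is the best
    candidate for b.  Descriptors are 16-float row vectors."""
    # Pairwise L2 distances between every frame and map descriptor.
    dist = np.linalg.norm(frame_desc[:, None, :] - map_desc[None, :, :],
                          axis=2)
    best_in_map = dist.argmin(axis=1)     # best map index per frame desc
    best_in_frame = dist.argmin(axis=0)   # best frame index per map desc
    return [(i, j) for i, j in enumerate(best_in_map)
            if best_in_frame[j] == i]
```

Pairs whose map descriptor carries a geo-referenced 3D point then yield the (pi, p3di) entries of Mi.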
In yet another embodiment, the 6DoF estimation procedure 530 is split onto N processors. The input of this step is the N lists Mi (from the matching). The 6DoF estimation minimizes, for each pair (pi, p3di) in M, the 2D difference between the projection of p3di into the current frame and pi. This minimization is performed with the Levenberg-Marquardt non-linear least-squares algorithm combined with the Geman-McClure M-estimator (a robust method). The robust Levenberg-Marquardt method is run on the N processors. Once split, each processor i computes the reprojection errors Ei of all the elements of Mi and the Jacobians Ji of the error function for all the elements of Mi. Subsequently, the N error vectors Ei are merged by concatenation into E and the N Jacobians Ji into J. The median absolute deviation (MAD) of E is computed. The estimation of the 6DoF is obtained by solving the linear system (JᵀJ)X = JᵀE·MAD, where X is the update of the 6DoF and the MAD supplies the robust scale of the M-estimator. A sketch of one such robust update step is given below.
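A minimal sketch of one robust update step under these definitions (assuming E and J have already been merged by concatenation; the damping term, the weight normalization, and the function name are illustrative assumptions):

```python
import numpy as np

def robust_6dof_update(E, J, damping=1e-3):
    """One robust Levenberg-Marquardt step.
    E: (m,) concatenated reprojection errors from the N processors.
    J: (m, 6) concatenated Jacobians w.r.t. the six pose parameters."""
    # Robust scale from the median absolute deviation (MAD) of E.
    mad = np.median(np.abs(E - np.median(E)))
    s2 = max(1.4826 * mad, 1e-9) ** 2
    # Geman-McClure M-estimator weights: w(e) = s^2 / (s^2 + e^2)^2.
    w = s2 / (s2 + E**2) ** 2
    JW = J * w[:, None]
    # Damped, weighted normal equations: (J^T W J + lambda*I) X = J^T W E.
    A = JW.T @ J + damping * np.eye(J.shape[1])
    b = JW.T @ E
    return np.linalg.solve(A, b)   # X: the 6DoF update
```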
In one novel aspect, using the multi-processor architecture, the efficiencies of the localization process and the mapping process are greatly improved.
Although the present invention has been described in connection with certain specific embodiments for instructional purposes, the present invention is not limited thereto. Accordingly, various modifications, adaptations, and combinations of various features of the described embodiments can be practiced without departing from the scope of the invention as set forth in the claims.
This application is a continuation, and claims priority under 35 U.S.C. § 120 from nonprovisional U.S. patent application Ser. No. 15/874,842, entitled “6DOF INSIDE-OUT TRACKING GAME CONTROLLER”, filed on Jan. 18, 2018, the subject matter of which is incorporated herein by reference. Application Ser. No. 15/874,842, in turn, claims priority under 35 U.S.C. § 119 from U.S. Provisional Application No. 62/447,867 entitled “A MULTI AGENT STEROSCOPIC CAMERA BASED POSITION AND POSTURE TRACKING SYSTEM FOR PORTABLE DEVICE” filed on Jan. 18, 2017, the subject matter of which is incorporated herein by reference.