This relates generally to graphics processing. Graphics processing is the processing of electronic data for its display on a display screen, such as a computer monitor or television.
The Gilbert-Johnson-Keerthi (GJK) algorithm was invented by Elmer G. Gilbert, Daniel W. Johnson, and S. Sathiya Keerthi in 1988. See Gilbert, E. G. et al. “A Fast Procedure for Computing the Distance Between Complex Objects in Three Dimensional Space,” IEEE Journal of Robotics and Automation, Vol. 4, Issue 2, April 1988, pages 199-203.
The GJK algorithm determines the minimum distance between two convex sets. A convex set is basically a depiction of an object. The GJK algorithm uses a Minkowski sum of the two convex shapes. The smooth algorithm modifications obtain the closest pair of points for two convex sets A and B.
Referring to
The V-pipe 38 is coupled to an arithmetic logic unit 36 and the 32 vector registers 40 of sixteen general purpose registers 42. The input to the U-pipe 32 and the V-pipe 38 comes from a data cache 47 and an instruction cache 45 that feeds an instruction fetching, and picking unit 44.
A data cache 47 receives the output from various processing units 34 and 36 and provides data over a two-way way bus to a level 2 or L2 cache 48. The L2 cache 48 is coupled by a ring bus 46 to main memory. A clocking (CLX) unit 49 provides clocking signals to the bus between the data cache and the L2 cache and the bus between the L2 cache and the ring bus.
The processor core 30, shown in
Advantageously, the SIMD processor core 30 works with vectored or aligned data. The processor exploits data level parallelism by efficiently utilizing the SIMD hardware to improve performance in some embodiments. Thus, “aligned, vectored” data refers to data structures that are efficient for a parallel SIMD architecture because their starting addresses are multiples of the executing SIMD processor's width. The registers are 512 bit wide SIMD registers in one embodiment.
Referring to
The data structure, shown in
Referring to
“Convex” refers to the actual shape of the item within the bounds depicted by the objects A and B in
In accordance with some embodiments, the GJK algorithm is adapted to operate on aligned, vectored data suitable for multiple core, parallel processors, such as the one depicted in
The sequence for applying the GJK algorithm, according to one embodiment, is shown in
Thus, as shown at block 10, initially, the aligned, vectored data is prepared. Next, the data is processed using an iterative vectored GJK algorithm to compute the minimum distance between the two objects A and B. The vectored support mapping is implemented in the context of fully vectored GJK implementation. The instructions enable branch avoiding with the help of masked operations. Any “if-else” statement can be expressed as linear code using masked operations in one embodiment.
The vectored GJK algorithm contains only two loops in one embodiment. The first loop, indicated in block 12 supports the mapping function. This loop processes all points in a given set. The second loop, indicated in block 14, repeats the algorithm until the optimum point (i.e. shortest distance between objects) in the algorithm is identified.
The pseudo code for the algorithm uses the Minkowski sum for A and B sets for the objects A and B. Thus, the sum of the two objects results in their combination. Namely, A+B={a+b: a in A, b in B}. The Minkowski difference for A and B sets is a new set: A−B={a−b: a in A, b in B}=A+(−B). CH(S) denotes a convex hull of S vertices.
The input to the algorithm is a convex hull of the Minkowski difference of the sets A and B, which is M. First, an arbitrary simplex Q is chosen from M. Then a point P is computed, closest to the origin in the convex hull of Q vertices. If P is the origin, then exit. In such case, a zero is returned.
Otherwise, Q is reduced to the smallest subset Q′ of Q, such that P is in the convex hull of Q′ vertices. Then V is equal to the support map computation (Sc) of the furthest vertex along a given direction for −P, which is the supporting point in the direction −P. If V is no more extreme in direction −P, then P itself can exit and return ∥P∥. Next, add V to Q and then go back to computing the point P closest to the origin in the convex hull of Q vertices.
In some embodiments, the vectored approach enables processing pairs of sets employing SIMD units and simultaneously using multi-threaded processor capabilities. A significant performance increase may be achieved in multi-core processors in some embodiments. The greatest performance boost may be achieved by processing sets with the same number of points. In this case, the memory utilization is most effective. For games, this is a most likely estimation because even complex bodies have no more than a few dozen vertices.
The computer system 130, shown in
In the case of a software implementation, the pertinent code may be stored in any suitable semiconductor, magnetic, or optical memory, including the main memory 132 or any available memory within the graphics processor. Thus, in one embodiment, the code to perform the sequence 139 of
The techniques described herein apply to any convex object, including two, three, and higher dimensional surfaces. While a linear time algorithm is used to calculate the support map in the embodiment described above, other algorithms may also be used.
The graphics processing techniques described herein may be implemented in various hardware architectures. For example, graphics functionality may be integrated within a chipset. Alternatively, a discrete graphics processor may be used. As still another embodiment, the graphics functions may be implemented by a general purpose processor, including a multicore processor.
References throughout this specification to “one embodiment” or “an embodiment” mean that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one implementation encompassed within the present invention. Thus, appearances of the phrase “one embodiment” or “in an embodiment” are not necessarily referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be instituted in other suitable forms other than the particular embodiment illustrated and all such forms may be encompassed within the claims of the present application.
While the present invention has been described with respect to a limited number of embodiments, those skilled in the art will appreciate numerous modifications and variations therefrom. It is intended that the appended claims cover all such modifications and variations as fall within the true spirit and scope of this present invention.