Claims
- 1. A parallel processor, comprising:clusters of processing elements PEs for a d-dimensional regular torus of size 4 per dimension defined by: reshape (∏q=1d-1(I4q-1⊗G⊗I4d-q-1)vec(T),4,4,⋯ ,4⏟d)andcluster switches connected to multiplex inter PE communication paths between said clusters, thereby providing inter-PE connectivity equivalent to that of a torus-connected array, wherein reshape is an inverse operator which creates a tensor from a vector, wherein Π defines a product operator, wherein I defines an identity matrix, wherein G defines a permutation matrix, wherein {circle around (X)} defines a Kronecker product, wherein T defines a tensor representing the d-dimensional regular torus, wherein vec(T) is an operator which utilizes the tensor T to return a vector by stacking up elements of T along the dimensions of the tensor T.
- 2. The parallel array processor of claim 1 wherein said cluster switches are further connected to provide direct communications between PEs in a transpose PE pair within a cluster.
- 3. The parallel processor of claim 2 wherein said clusters are scalable by combining said clusters while maintaining the multiplexing or said cluster switch.
- 4. The parallel processor of claim 3 wherein said cluster switches are further connected to provide direct communications between PEs in a hypercube complement pair within a cluster.
- 5. The parallel processor of claim 1 wherein the permutation matrix G is defined as: G=[S00000S10000S20000S3]and the diagonal blocks of G are the powers of S, wherein S which is defined as: S=[0100001000011000]
- 6. A method of forming a parallel processor, comprising the steps of:arranging processing elements in N clusters of M processing elements, such that each cluster is defined by: reshape(GNvec(T),N,N) so that clusters communicate only in mutually exclusive directions with the processing elements of at least one other cluster; andmultiplexing said mutually exclusive direction communications, wherein reshape is an inverse operator which creates a tensor from a vector; wherein T defines a tensor representing a regular torus, wherein GN defines a permutation matrix, wherein vec(T) is an operator which utilizes the tensor T to return a vector by stacking up elements of T along the dimensions of the tensor T.
- 7. The of claim 6 wherein the permutation matrix G is defined as: GN=[SN00⋯00SN1⋯0⋮ ⋰⋮00⋯SNN-1]and the diagonal blocks of G are the powers of S, wherein S which is defined as: SN=[010⋯0001⋯0⋮⋮ ⋰⋮100⋯0]
Parent Case Info
This application is a continuation of application Ser. No. 09/949,122 filed Oct. 10, 1997, now U.S. Pat. No. 6,167,502.
US Referenced Citations (7)
Number |
Name |
Date |
Kind |
5566342 |
Denneau et al. |
Oct 1996 |
A |
5682491 |
Pechanek et al. |
Oct 1997 |
A |
5717943 |
Barker et al. |
Feb 1998 |
A |
5784632 |
Pechanek et al. |
Jul 1998 |
A |
5794059 |
Barker et al. |
Aug 1998 |
A |
6023753 |
Pechanek et al. |
Feb 2000 |
A |
6041398 |
Pechanek et al. |
Mar 2000 |
A |
Non-Patent Literature Citations (5)
Entry |
Kim et al. “Optimal Embeddings of Multiple Graphs into a Hypermesh” 1997.* |
Pechanek et al. “The Folded Axon/Dendrite Tree Neuron Model” 1994.* |
Gonzalez et al. “Executing Algorithms with Hypercube Topology on Torus Multicomputers” 1995.* |
Moller et al. “Architectural Simulation System for M.F.A.S.T.” 1996.* |
Pechanek et al. “A Massively Parallel Diagonal-Fold Array Processor”. |
Continuations (1)
|
Number |
Date |
Country |
Parent |
08/949122 |
Oct 1997 |
US |
Child |
09/707209 |
|
US |