The present invention relates to the field of user programmable logic devices (PLDs). More specifically, the present invention is directed to are configurable logic device having custom, embedded application-specific functional blocks.
Reconfigurable devices/fabrics, such as D-Fabrix (disclosed in, for example, U.S. Pat. No. 6,353,841, U.S. Pat. No. 6,252,792, US2002/0157066) are commonly made up of a plurality of interconnected user programmable logic blocks or tiles, the fundamental building blocks of the system. This arrangement facilitates the use of an easily scalable configuration mechanism of equally regular structure.
Each user programmable tile is connected to a programmable routing network which can implement arbitrary connectivity between the tiles. Because each tile is connected to the routing network in the same way, the resulting device has a high degree of homogeneity. That is to say that, the way in which a specific subset of tiles of an array can be used and interconnected will be the same, regardless of where on the array that subset is located. Such a homogeneous array can be found disclosed in, for example, U.S. Pat. No. 6,252,792.
The use of application-specific functional blocks in conjunction with a reconfigurable device/fabric is a known way of enhancing the performance of the device/fabric within specific application contexts. For example, a reconfigurable logic array can be used in conjunction with Random Access Memory (RAM) blocks and Digital Signal Processors (DSP) blocks.
Prior art methods of using these application-specific functional blocks have included the insertion of the blocks within the array in ways that led to modifications of the routing networks, creating a non-homogeneous structure. In one such embodiment, the array of programmable logic tiles is split at a specific point (e.g., along a row or column of logic tiles), thereby creating a gap in the fabric. The application-specific functional block is then inserted in that gap.
This method hinders or precludes direct connections between tiles on either side of the gap. These tiles on either side of the gap are typically connected directly to the application specific functional block and, in some cases also connected to the dedicated switches which allow the possibility of further connections to the tiles located on the other far side of the application-specific functional block.
Such gaps therefore create a significant disruption to the routing network, which will cause a decrease in the homogeneity of the array. As a result, a greater degree of care must be taken during fabrication of the device itself, as the structure of the programmable logic device is no longer regular. Moreover, the placement and routing phases of application mapping are made more difficult, as these disruptions to the homogeneity of the array of programmable logic tiles will pose extra constraints on where specific elements of a design can be placed on the array (placement), as well as how those elements can be connected to other parts of the array/circuit (routing).
There is therefore a clear need for a novel method of embedding an application specific functional block in a reconfigurable array of programmable logic tiles.
In order to solve the problems associated with the prior art, the present invention provides a programmable logic device which comprises:
a uniform routing network;
an array of user programmable tiles connected to the uniform routing network; and
at least one functional block arranged to span at least one tile and further arranged to be connected to the uniform routing network.
Preferably, each tile comprises a plurality of processing units and the same number of associated routing sections; and
the at least one functional block is arranged to physically replace at least one of the processing units of at least one tile.
Preferably, the uniform routing network comprises a grid of uniformly distributed multi-bit buses, the grid comprising a first plurality of parallel buses and a second plurality of parallel buses, the first plurality being substantially perpendicular to the second plurality; and
each routing section comprises a plurality of switches, each switch being arranged to selectively connect a bus from the first plurality of parallel buses to a bus of the second plurality of parallel buses.
Preferably, a plurality of the switches in each routing section are arranged to connect the routing section to its associated processing unit; and
a plurality of the switches in each routing section are arranged to connect the routing section to other routing sections.
Preferably, the at least one of the switches in the plurality of routing sections of a tile having at least one of its processing units replaced by the at least one functional block is used to connect at least one of an input or an output of the at least one functional block to the uniform routing network.
Preferably, each switch in each routing section comprises a driver for driving an electrical signal to one of a processing unit, a routing section or a functional block.
Preferably, the strength of the driver is a function of the distance between the driver and the one of a processing unit, a routing section or a functional block.
Preferably, the multi-bit buses are four-bit buses.
As will be appreciated, the present invention provides several advantages over the prior art. For example, the present invention allows the substitution of an arbitrary number of logic tiles, whereas prior art solutions only allow the substitution of a fixed number of positions. Moreover, because the overall routing architecture is not disrupted, it is easier to predict delays and routability. Furthermore, activities related to the fabrication of the device (layout, verification, manufacturing test, etc) are made significantly simpler, as the routing network patterns can be considered identical for all tiles.
Specific embodiments of the invention will now be described with reference to the accompanying drawings, in which:
Each ALU can perform standard arithmetic operations (such as ADD, SUBTRACT) as well as standard logic operations (such as AND, NAND, OR, NOR) on a set number of bits.
As can be seen from
The significant disadvantage associated with this technique is that the dedicated switching circuit modifies the pattern of the routing network of the array, and must therefore be taken into consideration when trying to predict routing delays. The homogeneity of the routing network is therefore significantly disrupted.
Although switches 15 and 8 appear different in the figures, it will be appreciated that these switches are similar in terms of function. As will also be appreciated, not all of the switches 15 which are adjacent to the custom block 30 are used to connect the switching sections to the custom block. Some of the switches can simply not be used. The unused switches 13 are shown in
When ALUs 2 are replaced with custom block 30, the switches 8 which were used to connect the routing network to the ALUs 2 will either be used as a switch 15 to connect the custom block 30 to the routing network or will be an unused switch 13. The switches 7 however will remain unchanged.
In the embodiment of
Accordingly, if a signal needs to be routed from one side of the application specific functional block 30 to the other side, it will not be necessary for a router to incorporate information related to the presence of application specific functional block 30 while trying to connect unrelated blocks. Furthermore, for the same router, it will not be necessary to incorporate delay information relating to a dedicated, external switching circuit, provided that the custom block 30 sits within the same physical footprint as the ALUs it replaces. In effect, the application specific functional block will become invisible to a router for the purposes of connecting signals across it, or calculating delays.
A second embodiment of the present invention is shown in
In the embodiment of
The physical size of the custom block 31 of the embodiment of
Accordingly, if a signal needs to be routed from one side of the application specific functional block 30 to the other side, it will not be necessary for a router to incorporate delay information relating to a dedicated, external switching circuit, provided that, if the size of the custom block exceeds the footprint of the ALUs it replaces, the switches on either side of the custom block 31 have drivers which are arranged to decrease the delay across the custom block 31 to a level similar to the delay between similar switches on either side of an ALU 2 in the fabric.
Whilst, in the embodiment of
As can be understood from the embodiments described above, any number of ALUs which are replaced by a custom block 30, 31 in accordance with the present invention will depend on the number of inputs and outputs that the particular block 30, 31 must have connected to the routing network. Moreover, the blocks themselves can be of any dimension, provided that any added delay relating to the difference in the distance between switches on either side of the custom block 30, 31, and the distance between any two adjoining tiles 20 in the fabric 1 is compensated for by the addition of more powerful drivers on either side of the custom blocks 30, 31.
Number | Date | Country | Kind |
---|---|---|---|
10153725 | Feb 2010 | EP | regional |