The present invention relates generally to arbitration of access to a shared resource. More particularly, the present invention relates to bus arbitration using dynamic priorities based on the waiting periods of the requests for the bus.
In many technologies, and especially in the arena of electronic computers, a scarce resource is shared among competing interests. For example, a shared bus in a computer is shared among several requestors. In such an environment, an efficient and simple arbitration scheme is desirable in order to increase the utilization of the bus, to increase bus access for the requestors, and to reduce the cost of the computer.
One conventional arbitration scheme simply assigns a fixed priority to each requestor. According to this scheme, access to the bus is always granted to the requestor having the highest priority. One disadvantage of this approach is that the low-priority requestors rarely, if ever, gain access to the bus.
In general, in one aspect, the invention features an arbitration circuit for granting access to a shared resource among requestors, comprising a plurality of request shapers each comprising an input unit to receive a request from one of a plurality of the requestors, a priority unit to assign a respective predetermined one of a plurality of priority levels to each of the requests, and an age unit to assign an age to each of the requests when the request is received by the request shaper; and an arbiter core to receive the requests from the request shapers, and to grant access to the shared resource to each of the requestors corresponding to the requests; wherein each of the age units increases the age of a respective one of the requests when the corresponding one of the requestors is not granted access to the shared resource; and wherein each of the priority units increases the priority level of a respective one of the requests, when the corresponding one of the requestors is not granted access to the shared resource, according to the age of the respective one of the requests.
Particular implementations can include one or more of the following features. The arbiter core comprises grant logic to grant access to the shared resource to one of the requestors according to the priority levels and the ages of the requests, comprising a priority encoder to select the one or more of the requests having the highest of the priority levels of the requests, and an arbitration unit to select the one, of the one or more of the requests selected by the priority encoder, having the greatest of the ages. Each of the requests has one of a plurality of delta periods of time, and each of the request shapers further comprises a priority adjuster to cause the respective priority unit to increase the priority level of the respective one of the requests when the age of the request has increased by the delta period of the request and the requestor corresponding to the request has not been granted access to the shared resource. The requests are received during a first interval, wherein the arbiter core further comprises a mask circuit to grant access to the shared resource to all of the requestors corresponding to the requests having one of the priority levels before granting access to the shared resource to requestors corresponding to any further requests having the one of the priority levels and received during a subsequent second interval. The mask circuit comprises a plurality of mask registers each corresponding to a respective one of the priority levels; wherein each of the mask registers stores a plurality of mask bits each corresponding to a respective one of the requestors; and a mask logic to set each of the mask bits when no corresponding request has been received having a corresponding one of the priority levels and the corresponding requestor has not been granted access to the shared resource; wherein the mask logic clears each of the mask bits when a corresponding request is received having a corresponding one of the priority levels. The arbiter core further comprises level filter logic to pass each of the requests to the grant logic only when the mask bit corresponding to the request is set. The shared resource is a shared communication bus; and wherein the requestors are communication units sharing the communication bus to exchange data.
In general, in one aspect, the invention features a method and computer-readable media for granting access to a shared resource among requestors. It comprises receiving a request from each of a plurality of the requestors; assigning a respective predetermined one of a plurality of priority levels to each of the requests; assigning an age to each of the requests when the request is received; and granting access to the shared resource to each of the requests, comprising granting access to the shared resource to the requestor corresponding to the one of the requests having the highest priority level and the greatest age, increasing the age of each of the requests corresponding to requestors that were not granted access to the shared resource, and increasing the priority level of each of the requests corresponding to requestors that were not granted access to the shared resource according to the age of the request.
Particular implementations can include one or more of the following features. Granting access to the shared resource to each of the requests comprises, when only one of the requests has a highest one of the priority levels of the requests, granting access to the shared resource to the requestor corresponding to the one of the requests having the highest priority level of the requests, and when more than one of the requests has the highest one of the priority levels of the requests, granting access to the shared resource to the requestor corresponding to the one of the requests having the highest one of the priority levels of the requests and the greatest age. Each of the requests has one of a plurality of delta periods of time, and wherein increasing the priority level of each of the requests corresponding to requestors that were not granted access to the shared resource according to the age of the request comprises increasing the priority level of each of the requests corresponding to requestors that were not granted access to the shared resource when the age of the request has increased by the delta period of the request. The requests are received during a first interval, and implementations further comprise receiving one or more further requests during a subsequent second interval; and granting access to the shared resource to all of the requestors corresponding to the requests having one of the priority levels before granting access to the shared resource to any of the requestors corresponding to the further requests having the one of the priority levels.
The details of one or more implementations are set forth in the accompanying drawings and the description below. Other features will be apparent from the description and drawings, and from the claims.
Further areas of applicability will become apparent from the description provided herein. It should be understood that the description and specific examples are intended for purposes of illustration only and are not intended to limit the scope of the present disclosure.
The drawings described herein are for illustration purposes only and are not intended to limit the scope of the present disclosure in any way.
The leading digit(s) of each reference numeral used in this specification indicates the number of the drawing in which the reference numeral first appears.
Logic circuit 208 grants access to bus 104 to one of requestors 102 in the following manner. Logic circuit 208 examines registers 204 to select the request having the highest priority level among the received requests (step 306). If only one of the requests has the highest priority of the received requests (step 308), logic circuit 208 grants access to bus 104 to the requestor 102 corresponding to the selected request (step 310). However, if more than one of the requests has the highest priority of the received requests (step 308), logic circuit 208 examines the counters 206 for those requests to select the request having greatest age among the received requests having the highest priority level (step 312). Logic circuit 208 then grants access to bus 104 to the requestor 102 corresponding to the selected request (step 310).
Arbiter 200 of
Arbiter core 402 selects one of the requestors 102 based on the REQ and PRI signals, and sends a GRANT signal to the selected requestor 102, and to the corresponding request shaper 404, which clears the request.
Equality comparator 504 compares the count in counter 502 to the delta period signal DP(N), which represents a value that is programmable. When the count in the counter 502 reaches the delta period, the output of equality comparator 504 goes high, resetting counter 502 and incrementing counter 508. Thus with the expiration of each delta period, the priority of a request is increased by one, up to the maximum priority level. Of course, the priority can be increased by other values instead. The count of counter 508 is output as signal PRI(N).
When the request is granted, counter 508 is reset.
In response to signals GRANT, LVLREQ and LVLACTV, mask logic 606 modifies the contents of mask store 602, as described in detail below. The contents of mask store 602 are provided to level filter logic 604.
Bus monitor 610 monitors the status of bus 104. When bus 104 is idle, bus monitor 610 causes a signal ALLOW_NEXT_ARB to be high. Signal LVLREQ is also provided to grant logic 608. When signal ALLOW_NEXT_ARB is high, and in response to signal LVLREQ, grant logic 608 modifies the GRANT signal, which comprises N signals GRANT(N), one for each requestor 102, thereby granting bus 104 to one of the requestors 102. When signal ALLOW_NEXT_ARB is low, indicating that bus 104 is not idle, grant logic 608 does not modify the GRANT signal. This method prevents the interruption of a current bus access by the requestor 102 previously granted access to bus 104.
Level filter 802A corresponds to the lowest priority level (PRI=0) and comprises N equality comparators 804A through 804N and N AND gates 806A through 806N, each pair representing one of the N requestors 102, and an OR gate 808. Each equality comparator 804 receives a LVL signal that indicates the priority level I served by that level filter. Each equality comparator also receives a respective one of the PRI signals. When arbiter core 402 receives a request having PRI=0, the output of the equality comparator 804 in level filter 802A (which corresponds to PRI=0) that corresponds to the requestor 102 sending the request goes high. If the REQ signal for that requestor 102 and the mask bit MASK(N,I) for that request are also high, the output of the corresponding AND gate 806 goes high. For example, if arbiter core 402 receives from requestor 102N a request (REQ(7)=1) with a priority level of 0 (PRI(7)=0), and the corresponding mask bit MASK(7,0) is set, then the output of AND gate 806N of level filter 802A, which is a signal LVLREQ(7,0) goes high, indicating that arbiter core 402 has received a request from requestor 102N at priority level 0 that is not masked.
All of the signals LVLREQ(N,I) produced by level filter 802A are fed to OR gate 808, which outputs a signal LVLACTV(0) that goes high when arbiter core 402 receives a request having a priority level of PRI=0.
Logic unit 902(N,I) comprises inverters 904A, 904B, and 904C, AND gates 906A and 906B, NOR gate 908, OR gate 912, and flip-flops 910A and 910B, which are clocked by a clock signal CLK. Flip-flop 910A receives signal LVLACTV(I) and provides a delayed version of that signal to inverter 904B, which provides its output to AND gate 906A. AND gate 906A also receives signal LVLACTV(I), and receives signal LVLREQ(N,I) after inversion by inverter 904C. NOR gate 908 receives the GRANT(I) signal and the output of AND gate 906A. AND gate 906B receives the outputs of NOR gate 908 and flip-flop 910B, and receives the signal LVLACTV(I) after inversion by inverter 904A. OR gate 912 receives the output of AND gate 906B and signal LVLACTV(I). The output of flip-flop 910B is the signal LVLMASK(N,I), which sets and clears the corresponding mask bit MASK(N,I) in mask store 602. On a system reset, the SYSTEM_RESET signal is asserted, which resets all of the flip-flops 910B in mask logic 900. This causes all of the LVLMASK signals to go high, which sets all of the mask bits in mask store 602, thereby permitting all requests to pass through level filter logic 604 to grant logic 608 until the mask bits are modified by level filter logic 604.
MUX 1004 receives I signals LVLREQ(I), where each signal LVLREQ(I) is an N-bit signal comprising the corresponding N signals LVLREQ(N,I). For example, in a system having 8 requestors and 16 priority levels, signal LVLREQ(5) for priority level 3 is an 8-bit signal comprising the corresponding 8 signals LVLREQ(0,5) through LVLREQ(7,5).
Priority encoder 1002 receives the I signals LVLACTV(0) through LVLACTV(I). As described above, each LVLACTV signal when high indicates that there is a pending request (that is, a request received and not masked, but not yet granted) of the corresponding priority level. Priority encoder 1002 selects the highest priority having a pending request, and passes that selection to MUX 1004 as signal LVLSEL, which causes MUX 1004 to pass to arbitration unit 1006 the signal LVLREQ(I) corresponding to the priority level I selected by priority encoder 1002. Priority encoder 1002 is preferably implemented as a conventional logic circuit according to well-known techniques.
The signal LVLREQ(I) received by arbitration unit 1006 represents all of the pending requests having the highest priority of the pending requests. Arbitration unit 1006 selects one of those requests according to a conventional priority scheme such as a fixed priority scheme, a fairness priority scheme, and the like, and issues a GRANT signal to the requestor 102 corresponding to the selected request. Arbitration unit 1006 is preferably implemented as a conventional logic circuit according to well-known techniques.
Arbiter 400 of
The invention can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. Apparatus of the invention can be implemented in a computer program product tangibly embodied in a machine-readable storage device for execution by a programmable processor; and method steps of the invention can be performed by a programmable processor executing a program of instructions to perform functions of the invention by operating on input data and generating output. The invention can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. Each computer program can be implemented in a high-level procedural or object-oriented programming language, or in assembly or machine language if desired; and in any case, the language can be a compiled or interpreted language. Suitable processors include, by way of example, both general and special purpose microprocessors. Generally, a processor will receive instructions and data from a read-only memory and/or a random access memory. Generally, a computer will include one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magneto-optical disks; and optical disks. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of non-volatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM disks. Any of the foregoing can be supplemented by, or incorporated in, ASICs (application-specific integrated circuits).
A number of implementations of the invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Accordingly, other implementations are within the scope of the following claims.
This present disclosure is a continuation of U.S. patent application Ser. No. 13/274,674 (now U.S. Pat. No. 8,307,137), filed Oct. 17, 2011, which is a continuation of U.S. patent application Ser. No. 12/758,849 (now U.S. Pat. No. 8,041,870), filed on Apr. 13, 2010, which is a continuation of U.S. patent application Ser. No. 11/390,627 (now U.S. Pat. No. 7,698,486), filed on Mar. 28, 2006, which is a continuation of U.S. patent application Ser. No. 10/390,431 (now U.S. Pat. No. 7,062,582), filed on Mar. 14, 2003. The entire disclosures of the above applications are incorporated herein by reference.
Number | Name | Date | Kind |
---|---|---|---|
4342995 | Shima | Aug 1982 | A |
4493036 | Boudreau et al. | Jan 1985 | A |
4642758 | Teng | Feb 1987 | A |
4672536 | Giroir et al. | Jun 1987 | A |
4682282 | Beasley | Jul 1987 | A |
4901234 | Heath et al. | Feb 1990 | A |
5025370 | Koegel et al. | Jun 1991 | A |
5241632 | O'Connell et al. | Aug 1993 | A |
5280608 | Beaverson et al. | Jan 1994 | A |
5392033 | Oman et al. | Feb 1995 | A |
5428796 | Iskiyan et al. | Jun 1995 | A |
5444855 | Thompson | Aug 1995 | A |
5463624 | Hogg et al. | Oct 1995 | A |
5528767 | Chen | Jun 1996 | A |
5537400 | Diaz et al. | Jul 1996 | A |
5544332 | Chen | Aug 1996 | A |
5546545 | Rich | Aug 1996 | A |
5560016 | Fiebrich et al. | Sep 1996 | A |
5778200 | Gulick | Jul 1998 | A |
5787482 | Chen et al. | Jul 1998 | A |
5809278 | Watanabe et al. | Sep 1998 | A |
5862355 | Logsdon | Jan 1999 | A |
5896380 | Brown et al. | Apr 1999 | A |
5909686 | Muller et al. | Jun 1999 | A |
5938736 | Muller et al. | Aug 1999 | A |
5944792 | Yamato et al. | Aug 1999 | A |
5958036 | Burns et al. | Sep 1999 | A |
5983302 | Christiansen et al. | Nov 1999 | A |
6044061 | Aybay et al. | Mar 2000 | A |
6078338 | Horan et al. | Jun 2000 | A |
6088751 | Jaramillo | Jul 2000 | A |
6092137 | Huang et al. | Jul 2000 | A |
6141344 | Delong | Oct 2000 | A |
6160812 | Bauman et al. | Dec 2000 | A |
6185221 | Aybay | Feb 2001 | B1 |
6209053 | Kurts | Mar 2001 | B1 |
6295553 | Gilbertson et al. | Sep 2001 | B1 |
6330647 | Jeddeloh et al. | Dec 2001 | B1 |
6343351 | Lackman et al. | Jan 2002 | B1 |
6363452 | Lach | Mar 2002 | B1 |
6363466 | Anand | Mar 2002 | B1 |
6385678 | Jacobs et al. | May 2002 | B2 |
6424659 | Viswanadham et al. | Jul 2002 | B2 |
6430194 | Ilyadis et al. | Aug 2002 | B1 |
6442648 | Genduso et al. | Aug 2002 | B1 |
6539451 | Zani et al. | Mar 2003 | B1 |
6578112 | Ono | Jun 2003 | B2 |
6606692 | Hill et al. | Aug 2003 | B2 |
6647449 | Watts | Nov 2003 | B1 |
6674720 | Passint et al. | Jan 2004 | B1 |
6738836 | Kessler et al. | May 2004 | B1 |
6763418 | Chou et al. | Jul 2004 | B1 |
6772256 | Regev et al. | Aug 2004 | B1 |
6802064 | Yao et al. | Oct 2004 | B1 |
6810455 | Wyland | Oct 2004 | B2 |
6842423 | Erimli et al. | Jan 2005 | B1 |
6898187 | Perlman et al. | May 2005 | B2 |
6915396 | Wiens et al. | Jul 2005 | B2 |
6956854 | Ganesh et al. | Oct 2005 | B2 |
6961803 | Purcell et al. | Nov 2005 | B1 |
7062582 | Chowdhuri | Jun 2006 | B1 |
7107386 | Purcell et al. | Sep 2006 | B1 |
7120715 | Chauvel et al. | Oct 2006 | B2 |
7139267 | Lu et al. | Nov 2006 | B2 |
7239633 | Chiou | Jul 2007 | B1 |
7363406 | Chai et al. | Apr 2008 | B2 |
7400580 | Synnestvedt et al. | Jul 2008 | B1 |
7417637 | Donham et al. | Aug 2008 | B1 |
7461190 | Subramanian et al. | Dec 2008 | B2 |
7698486 | Chowdhuri | Apr 2010 | B1 |
8041870 | Chowdhuri | Oct 2011 | B1 |
20020023186 | Kim | Feb 2002 | A1 |
20020069310 | Scandurra et al. | Jun 2002 | A1 |
20020133654 | Yamamoto | Sep 2002 | A1 |
20020176428 | Ornes et al. | Nov 2002 | A1 |
20020181440 | Norman et al. | Dec 2002 | A1 |
20030033461 | Malik et al. | Feb 2003 | A1 |
20040059879 | Rogers | Mar 2004 | A1 |
20040210695 | Weber et al. | Oct 2004 | A1 |
Number | Date | Country |
---|---|---|
3374464 | Dec 1987 | DE |
340347 | Nov 1989 | EP |
981093 | Feb 2000 | EP |
1182550 | Feb 2002 | EP |
1187029 | Mar 2002 | EP |
1418505 | May 2004 | EP |
63109695 | May 1988 | JP |
01265340 | Oct 1989 | JP |
01279354 | Nov 1989 | JP |
04246744 | Feb 1991 | JP |
03053338 | Mar 1991 | JP |
04035540 | Feb 1992 | JP |
05053980 | Mar 1993 | JP |
07182181 | Jul 1995 | JP |
WO 9629838 | Sep 1996 | WO |
Entry |
---|
“NA900644: Dynamic Workload Balancing,” Jun. 1, 1990, IBM, IBM Technical Disclosure Bulletin, vol. 33, Issue 1A, pp. 44-46. |
“NN71 033032: Line Scanner Providing Priority Controls,” Mar. 1, 1971, IBM, IBM Technical Disclosure Bulletin, vol. 13, Issue 10, pp. 3032-3033. |
“NN8207722: Page Replacement Method and Mechanism,” Jul. 1, 1982, IBM, IBM Technical Disclosure Bulletin, vol. 25, Issue 2, pp. 722-736. |
Chen et al., “A Priority Assignment Strategy of Processing Elements Over an On-Chip Bus,” Mar. 2007, ACM, Proceedings of the 2007 ACM Symposium on Applied Computing, pp. 1176-1160. |
Das, R.; Mutlu, O.; Moscibroda, T.; Das, C., “Aergia: A Network-on-Chip Exploiting Packet Latency Slack,” Micro, IEEE, vol. 31, No. 1, pp. 29-41, Jan.-Feb. 2011. |
Galles, M., “Spider: a high-speed network interconnect,” Micro, IEEE, vol. 17, No. 1, pp. 34-39, Jan./Feb. 1997. |
Kanodia et al., “Distributed Priority Scheduling and Medium Access in Ad Hoc Networks,” Sep. 2002, Kluwer Academic Publishers, vol. 3, Issue 5, pp. 455-466. |
Ilitzky, D.A.; Hoffman, J.D.; Chun, A.; Esparza, B.P., “Architecture of the Scalable Communications Core's Network on Chip,” Micro, IEEE, vol. 27, No. 5, pp. 62-74, Sep.-Oct. 2007. |
Meyerowitz et al., “A Tool for Describing and Evaluating Hierarchical Real-Time Bus Scheduling Policies,” Jun. 2003, ACM, Proceedings of the 40th Annual Design Automation Conference, pp. 312-317. |
Ramasubramanian, N.; Krishnan, P.; Kamakoti, V., “Studies on the Performance of Two New Bus Arbitration Schemes for MultiCore Processors,” Advance Computing Conference, 2009. IACC 2009. IEEE International, pp. 1192-1196, Mar. 6-7, 2009. |
Shin et al., “Round-Robin Arbiter Design and Generation,” Oct. 2002, ACM, Proceedings of the 15th International Symposium of System Synthesis, pp. 243-248. |
Number | Date | Country | |
---|---|---|---|
Parent | 13274674 | Oct 2011 | US |
Child | 13668791 | US | |
Parent | 12758849 | Apr 2010 | US |
Child | 13274674 | US | |
Parent | 11390627 | Mar 2006 | US |
Child | 12758849 | US | |
Parent | 10390431 | Mar 2003 | US |
Child | 11390627 | US |