The field of invention relates generally to computing systems; and, more specifically, to I/O configuration messaging within a link-based computing system.
a shows a depiction of a bus 120. A bus 120 is a “shared medium” communication structure that is used to transport communications between electronic components 101a-10Na and 110a. Shared medium means that the components 101a-10Na and 110a that communicate with one another physically share and are connected to the same electronic wiring 120. That is, wiring 120 is a shared resource that is used by any of components 101a-10Na and 110a to communicate with any other of components 101a-10Na and 110a. For example, if component 101a wished to communicate to component 10Na, component 101a would send information along wiring 120 to component 10Na; if component 103a wished to communicate to component 110a, component 103a would send information along the same wiring 120 to component 110a, etc.
Computing systems have traditionally made use of busses. For example, with respect to certain IBM compatible PCs, bus 120 corresponds to a PCI bus where components 101a-10Na correspond to “I/O” components (e.g., LAN networking adapter cards, MODEMs, hard disk storage devices, etc.) and component 110a corresponds to an I/O Control Hub (ICH). As another example, with respect to certain multiprocessor computing systems, bus 120 corresponds to a “front side” bus where components 101a-10Na correspond to microprocessors and component 110a corresponds to a memory controller.
Owing to an artifact referred to as “capacitive loading”, busses are less and less practical as computing system speeds grow. Basically, as the capacitive loading of any wiring increases, the maximum speed at which that wiring can transport information decreases. That is, there is an inverse relationship between a wiring's capacitive loading and that same wiring's speed. Each component that is added to a wire causes that wire's capacitive loading to grow. Thus, because busses typically couple multiple components, bus wiring 120 is typically regarded as being heavily loaded with capacitance.
In the past, when computing system clock speeds were relatively slow, the capacitive loading on the computing system's busses was not a serious issue because the degraded maximum speed of the bus wiring (owing to capacitive loading) still far exceeded the computing system's internal clock speeds. The same cannot be said for at least some of today's computing systems. That is, with the continual increase in computing system clock speeds over the years, the speed of today's computing systems are reaching (and/or perhaps exceeding) the maximum speed of wires that are heavily loaded with capacitance such as bus wiring 120.
Therefore computing systems are migrating to a “link-based” component-to-component interconnection scheme.
Each point-to-point link can be constructed with copper or fiber optic cabling and appropriate drivers and receivers (e.g., single or differential line drivers and receivers for copper based cables; and LASER or LED E/O transmitters and O/E receivers for fiber optic cables;, etc.). The network 140 observed in
Consistent with this trend,
Often I/O is viewed from the perspective of the computing system's processor(s) and system memory rather than the entire computing system as a whole. From this perspective, I/O is viewed as that portion of the computing system's functionality that can send information at least to and/or from the computing system's system memory. Thus, non-volatile storage devices such as disk storage devices (e.g., magnetic disc drive, CD ROM, etc.) and/or “flash cards” are often included in the list of a computing system's I/O units (along with the I/O units mentioned above). The later perspective of I/O is used by this application unless otherwise indicated.
The link-based I/O segment of
An I/O segment may comprise an access point through which information between the I/O segment and the rest of the computing system flows (note that the root complex 201 is the access point for the I/O segment 200 of
In the case of link-based computing systems, if multiple I/O segments are designed into the same computing system, it is important that configuration transaction packets targeted for a particular I/O segment (whether link-based or bus-based) reach their appropriate destination. Unfortunately, at least the PCI and PCI Express standards fail to specify configuration transactions that are designed to target any one of a plurality of different I/O segments within a same computing system. Hence, a system design that ensures the proper routing of an I/O configuration transaction packet to the correct target I/O segment, amongst a plurality of I/O segments that exist within the link-based computing system, is needed.
The present invention is illustrated by way of example, and not limitation, in the figures of the accompanying drawings in which like references indicate similar elements and in which:
a shows components interconnected through a bus;
b shows components interconnected through a mesh of point-to-point links;
Components 3012 and 3014 at least behave as gateways for the access point of I/O segments 3001 and 3002, respectively. As such, configuration transaction packets directed to I/O segment 3001 should be sent to component 3012 and configuration transaction packets directed to I/O segment 3002 should be sent to component 3014. Network 340 is the network of the link-based computing system. During operation, either of components 3011 or 3013 may seek to initiate a configuration transaction to an I/O unit associated with either one of I/O segments 3001 and 3002. At least two types of configuration transactions exist: 1) a write; and, 2) a read.
For a read configuration transaction, a packet is sent from the component executing the configuration software (e.g., one of components 3011 and 3013) to the I/O segment that the I/O unit that is targeted for the read is connected into. As mentioned above, in order to send a packet to the targeted I/O unit, the packet should be routed over the network 340 to the gateway component for the access point of the targeted I/O segment. The packet includes content (e.g., in the packet payload) that among other possible items of information: 1) identifies the target I/O unit; and, 2) identifies the register within the target I/O from which information is to be read. The I/O segment that the targeted I/O unit is connected into understands the packet's content and reads the information from the identified register within the identified target I/O unit. The register information that was read from the I/O unit is then placed into the payload of a second packet that is sent over network 340 to the component that initiated the transaction.
For a write configuration transaction, a packet is sent from the component executing the configuration software (e.g., one of components 3011 and 3013) to the I/O segment that the I/O unit that is targeted for the write is connected into. The packet includes content that among other possible items of information: 1) identifies the target I/O unit; 2) identifies the register within the target I/O to which information is to be written; and, 3) the information to be written into the identified register. The I/O segment that the targeted I/O unit is connected into understands the packet's content and writes the information into the identified register within the identified target I/O unit. According to at least one possible embodiment, a response (e.g., indicating a successful write) is then placed into the payload of a second packet that is sent over the network 340 to the component that initiated the transaction.
Transactions executed over a network 340 in link-based computing systems may be made identified with an address. That is, for example, each specific type of transaction may be given a unique address which is executed on the component that initiates the transaction. In order to initiate a specific transaction, the transaction's address is decoded by the component's hardware into the actions needed to perform the transaction.
Thus, in the case of a configuration read transaction, as address that corresponds to a configuration read transaction is decoded by the hardware of the component that initiates the transaction into the actions of sending a packet to the targeted I/O segment having content to be interpreted by the targeted I/O segment as a read. In the case of a configuration write transaction, as address that corresponds to a configuration write transaction is decoded by the hardware of the component that initiates the transaction into the actions of sending a packet to the targeted I/O segment having content to be interpreted by the targeted I/O segment as a write.
Because at least the PCI and PCI Express standards fail to specify configuration transactions that are designed to target any one of a plurality of different I/O segments within a same computing system, an address decoding process that is capable of targeting any one of a plurality of I/O segment needs to be designed into the operations of the components that initiate I/O unit configuration transactions.
Those of ordinary skill will recognize the Bus/Device/Function/Extended_Reg/Reg portion 505a of the configuration transaction address 505 as the standard format for a PCI, PCI-X or PCI_Express configuration transaction. Here, the “Bus” parameter identifies which PCI bus (in the case, PCI, PCI-X and PCI_Express) or PCI_Express link (in the case of PCI_Express) within the I/O segment is targeted for the configuration transaction. The “Device” parameter identifies which I/O unit on the targeted bus/link is targeted for the configuration transaction. The “Function” parameter identifies the function to be performed by the configuration transaction (e.g., read or write). The “Extended_Reg” (if available) and “Reg” parameters define the register space of the targeted I/O unit to be affected by the configuration transaction.
The Segment parameter 505a is a novel feature that identifies which I/O segment within the link based computing system is targeted by the configuration transaction. Note that the entire configuration transaction may include more information/parameters than the just the Segment/Bus/Device/Function/Extended_Reg/Reg structure 505. For purposes of identifying a memory mapped address decoding process that is sufficient for identifying a target I/O unit connected to any one of a plurality of I/O segments within a link-based computing system, however, only the Segment/Bus/Device/Function/Extended_Reg/Reg portion 505 of the transaction address needs to be shown.
The Segment parameter 505b, in identifying the targeted I/O segment for the transaction, serves as an input parameter to a source address decoder 402 that determines the specific network node (NodeID) of the gateway component for the targeted I/O segment (e.g., referring to
As alluded to above, the configuration transaction addressing space (and therefore memory map 413) may be partitioned so that a first address range is reserved for configuration transaction addresses whose corresponding configuration transactions all target the same “first” I/O segment (e.g., I/O segment 3001); a second address range is reserved for configuration transaction addresses whose corresponding configuration transactions all target the same “second” I/O segment (e.g., I/O segment 3002), etc. In an alternate embodiment, a plurality of parallel source address decoders are implemented to improve performance (i.e., improve the number of look-ups per second). According to a further embodiment, a first source address decoder is used to identify the NodeID for a first gateway component and a second address decoder is used to identify the NodeID for a second gateway component.
Regardless of how the look-up is performed and the manner in which the final NodeID output is determined (e.g., be pulled directly from the memory map, being determined from information found within the memory map, etc.), the NodeID output 406 is combined with the rest of the information 405b needed to fully characterize the configuration transaction (e.g., the Bus/Device/Function/Extended_Reg/Reg portion 505b) at the networking layer 403 of the component 401. The networking layer 403 is responsible for creating and sending a packet 503 over the link-based computing system's network 440.
Pertinent information 605 for purposes of explaining the response is observed in
The response is combined with the identification 605a of the component 3011 that initiated the transaction request (=SourceID1). As the NodeID specifies the identity of the destination component for a packet and as the SourceID specifies the identity of the sending component for the packet, setting the NodeID for the packet produced by the gateway component 3012 equal to the SourceID of the packet received by the gateway component 3012 (i.e., setting NodeID1=SourceID1) causes the response to be automatically sent to the component 3011 that initiated the configuration transaction. The SourceID for the response packet is the identity of the gateway component (i.e., SourceID2). Thus, the header 604a of the response packet is NodeID1/SourceID2. The payload 604b of the response packet is the response 606 from the executed configuration function.
In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made thereto without departing from the broader spirit and scope of the invention as set forth in the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.