Aggregating unoccupied PCI-e links to provide greater bandwidth

Description

BACKGROUND

The development of cheaper, more capable integrated circuits have led to the development of portable computing systems featuring smaller, sleeker designs while retaining relatively sophisticated computing capabilities. These computing systems refer primarily to laptops and netbooks, but also include smart phones, and portable audio devices, portable video devices and portable video game consoles. However, as the recent trend of miniaturizing portable computing systems continues, the space available for hardware for these designs has progressively decreased. As a result, the optimization of hardware design and architecture has become of primary importance.

Typical computing devices include at least a collection of microprocessors or a central processing unit (CPU), some memory, a motherboard (e.g., central printed circuit board) featuring a chipset, and at least one graphics processing unit for generating video output to a display. In some conventional motherboard designs, the chipset is arranged into two separate component hubs, which are commonly referred to as the “northbridge” and “southbridge,” respectively. The northbridge typically handles communications among the CPU, random access memory (RAM), video output interfaces, and the southbridge. In many contemporary netbook and laptop implementations, the video output interface is implemented as an integrated graphics processing unit. The southbridge, on the other hand, is one or more chips that provide a platform to support a plurality of peripheral components, such as input/output devices and mass storage devices. In many implementations, the southbridge may also include integrated peripherals, such as audio controllers, network interface cards, universal serial bus (USB) and PCI-express connections, etc.

Traditionally, netbooks and laptops have used integrated graphics solutions such as integrated graphics processing units (GPUs) coupled to the northbridge. Integrated graphics processing units are graphics processors that utilize a portion of a computer's system memory rather than having its own dedicated memory. In general, integrated GPUs are cheaper to implement than dedicated or “discrete” GPUs, and offer relatively improved battery life and lower power usage, but at the cost of reduced capability and performance levels relative to discrete GPUs. Advantageously, manufacturers of netbooks and laptops have begun to offer configurations with higher graphics processing capabilities by providing computer systems that include additional discrete graphics processing units in addition to the integrated graphics processors.

Discrete or “dedicated” GPUs are distinguishable from integrated GPUs by having higher performance and also having local memory dedicated for use by the GPU that the GPU does not share with the underlying computer system. Commonly, discrete GPUs are implemented on discrete circuit boards called “video cards” which include, among other components, a GPU, the local memory, communication buses and various output terminals. In conventional applications, these video cards typically interface with the main circuit board (e.g., motherboard) of a computing system through a PCI Express (PCI-e) interface, upon which the video card may be mounted. In general, discrete GPUs are capable of significantly higher performance levels relative to integrated GPUs but typically require and consume higher levels of power relative to integrated graphics solutions. Portable computing devices with both integrated and discrete graphics processing solutions often offer a mechanism or procedure that enables the user to alternate usage between the particular solutions so as to manage performance and battery life according to situational needs or desired performance levels.

As mentioned above, in typical netbooks and laptops, the PCI Express interface is a component of the southbridge. However, unlike PCI-e interfaces in other computing systems such as desktops, the PCI-e interface of a portable computing device is often of a reduced size and, consequently, of a reduced capacity. In a typical configuration, the PCI-e interface of any computing device comprises a plurality of links, with each link comprising a further plurality of “lanes,” and being configured to independently couple to a peripheral device. The number of lanes in a link coupled to a peripheral device correlates with the bandwidth of the connection, and thus, couplings between a peripheral device and a link with larger amounts of lanes have greater bandwidth than couplings with links comprised of only single lanes. Traditionally, the number of links in a PCI-e interface of a portable computing device may be configured by the manufacturer in separate configurations to suit specific hardware implementations.

In a popular configuration, the links in PCI-e interface of a portable computing device may be arranged in either of two combinations totaling up to four lanes. For example, implementations can comprise either a single link of four lanes (1×4), thereby offering relatively greater bandwidth for a coupled device. Alternatively, implementations may feature four separate links, with each link capable of being coupled to a separate device but limited to a single lane (4×1) with a correspondingly low bandwidth. Thus, whenever the PCI-e interface is coupled to one device, the single link (1×4) configuration may be optimal, but multiple devices require additional links that adversely impact the amount of bandwidth and throughput of each connection.

Unfortunately, since netbooks and laptops are often intended to be used with network connections, chipset manufacturers of computing devices that will include a discrete GPU will invariably manufacture southbridges (and/or motherboards in general) with PCI-e interfaces having four separate links of one lane each, one of which is occupied by a network controller (e.g., a network interface card). This results in the extremely inefficient configuration wherein one link is coupled to the network controller, another link is coupled to the graphics processing unit, and the other two links remaining unoccupied (or coupled to additional devices). While the bandwidth from a link with only one lane may be sufficient to run certain applications on certain devices, for usage in graphics processing a link having only a single lane is often insufficient and likely to drastically and adversely impact the performance of the discrete graphics processing unit. Moreover, this configuration results not only in substandard performance for discrete graphics processing units, but also commonly results in a waste of the remaining unoccupied links.

SUMMARY

This Summary is provided to introduce a selection of concepts in a simplified form that is further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.

Embodiments of the claimed subject matter are directed to systems and a method that allows the aggregation of multiple interfaces of a data communication bus to provide greater bandwidth for communication between a peripheral device and system memory within a computing system. In one embodiment, unoccupied interfaces of the data communication bus are combined with an interface coupled to a peripheral device to increase the bandwidth of data transfer requests between the peripheral device and the system memory.

In another embodiment, a process is provided that enables the distribution of requests for accessing system memory (e.g., direct memory access requests) initiated by a discrete graphics processing unit among aggregated links of a PCI-e interface. The process comprises receiving the requests in a link aggregator, parsing the requests to correspond to the number of aggregated links, and distributing the requests among the links evenly. In further embodiments, the requests may be distributed in a round robin fashion.

In yet another embodiment, an apparatus is provided for aggregating unoccupied links of a PCI-e interface to increase the bandwidth to a discrete graphics processing unit in a system with two or more graphics processing units. According to some embodiments, the system includes a printed circuit board with: a first and second graphics processing units; system memory; and a plurality of peripheral components including a PCI-e interface, wherein the PCI-e interface is comprised of a plurality of links which may be coupled to a plurality of devices, including the second graphics processing unit, to facilitate the transfer of data between the devices and the system memory. According to this embodiment, a link aggregator will aggregate the unoccupied links of the PCI-e interface to increase the bandwidth of requests from the second graphics processing unit and the system memory.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and form a part of this specification, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention:

FIG. 1 depicts a block diagram of an exemplary hardware configuration of a central printed circuit board featuring a PCI-e interface with one link, in accordance with various embodiments of the present invention.

FIG. 2 depicts a block diagram of an alternate exemplary hardware configuration of a central printed circuit board featuring a PCI-e interface with multiple PCI-e links, in accordance with various embodiments of the present invention.

FIG. 3 depicts a block diagram of an alternate exemplary hardware configuration of a central printed circuit board featuring a PCI-e interface with multiple links that are aggregated by a bandwidth aggregator, in accordance with various embodiments of the present invention.

FIG. 4 depicts an exemplary flowchart of a process of distributing memory access requests from a peripheral device over an aggregated PCI-e link, in accordance with various embodiments of the present invention.

FIG. 5 depicts a block diagram of a basic computing system, in accordance with various embodiments of the present invention.

DETAILED DESCRIPTION

Reference will now be made in detail to several embodiments. While the subject matter will be described in conjunction with the alternative embodiments, it will be understood that they are not intended to limit the claimed subject matter to these embodiments. On the contrary, the claimed subject matter is intended to cover alternative, modifications, and equivalents, which may be included within the spirit and scope of the claimed subject matter as defined by the appended claims.

Furthermore, in the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the claimed subject matter. However, it will be recognized by one skilled in the art that embodiments may be practiced without these specific details or with equivalents thereof. In other instances, well-known processes, procedures, components, and circuits have not been described in detail as not to unnecessarily obscure aspects and features of the subject matter.

Portions of the detailed description that follow are presented and discussed in terms of a process. Although steps and sequencing thereof are disclosed in figures herein (e.g., FIG. 4) describing the operations of this process, such steps and sequencing are exemplary. Embodiments are well suited to performing various other steps or variations of the steps recited in the flowchart of the figure herein, and in a sequence other than that depicted and described herein.

Some portions of the detailed description are presented in terms of procedures, steps, logic blocks, processing, and other symbolic representations of operations on data bits that can be performed on computer memory. These descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. A procedure, computer-executed step, logic block, process, etc., is here, and generally, conceived to be a self-consistent sequence of steps or instructions leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated in a computer system. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.

It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the following discussions, it is appreciated that throughout, discussions utilizing terms such as “accessing,” “writing,” “including,” “storing,” “transmitting,” “traversing,” “associating,” “identifying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.

Link Configurations

Specific configurations of the central printed circuit board (e.g., motherboard) in different portable computing devices may vary according to design and/or manufacturer preference, but often include: a central processing unit (CPU), system memory, a chipset that enables communication between various components within the computing device and the central printed circuit board specifically, as well as one or more graphics processing units. These graphics processing units may be implemented as integrated and/or discrete. For computing devices that include discrete graphics processing units, data may be transferred between the discrete graphics processing units and the system memory or the CPU via a communication bus. A popular communication bus standard is referred to as PCI Express or, “PCI-e,” alternatively. The PCI-e interface of a typical portable computing device comprises a plurality of sub-interfaces (referred to as “links”) which may be arranged according to either of two combinations totaling up to four lanes. For example, implementations can comprise either a single sub-interface (link) of four serially arranged lanes (1×4), thereby offering relatively greater bandwidth for a coupled device but allowing a coupling of only one device to the PCI-e interface, or, alternatively, four sub-interfaces (links) of a single lane each that allows up to four devices to be coupled to the interface, but at lower data transfer rates.

FIG. 1 displays a block diagram of an exemplary hardware configuration 100 of a central printed circuit board featuring a PCI-e interface with one link, in accordance with various embodiments of the present invention. As depicted, FIG. 1 depicts a chipset comprising two separate chips (e.g., northbridge 101 and southbridge 109, respectively) operating as hubs for various components. As displayed, the two chips (e.g., northbridge 101 and southbridge 109) are communicatively coupled. Additional features of FIG. 1 may include system memory (e.g., memory 105) and a plurality of graphics processing devices (e.g., iGPU 107 and dGPU 115).

In one embodiment, a central processing device (e.g., CPU 103) and system memory 105 are coupled to (or even disposed on) one of the chips. In further embodiments, the CPU 103 and the system memory may be coupled to (or disposed on) the same chip. As shown, both CPU 103 and memory 105 are coupled to the northbridge 101. According to some embodiments, the configuration 100 may include a plurality of graphics processing devices. The plurality of graphics processing devices may include, for example, an integrated graphics processing unit (e.g., iGPU 107) coupled to the northbridge 101 and also coupled to a display device (e.g., display 117). In one embodiment, the display device 117 is coupled to an output interface of the integrated graphics processing unit, and display data generated by other components (e.g., at a dGPU) must be passed to the display device 117 through the iGPU. According to some embodiments, the display device 117 may be implemented as, for example, a discrete monitor or the display panel of a portable computing device.

According to some embodiments, a chip of the printed circuit board may include one or more integrated data communication buses. As depicted in FIG. 1, a data communication bus (e.g., PCI-e 111) is coupled with the southbridge 109. In some embodiments, the data communication bus may include an interface to couple with one or more peripheral devices, such as a discrete graphics processing unit (e.g., dGPU 115). In a typical embodiment, the interface may comprise a plurality of links, with each link comprising a plurality of lanes. As depicted in FIG. 1, the PCI-e 111 interface is configured to provide a single link (e.g., link 113) comprising four lanes.

As presented, the link 113 couples the PCI-e 111 interface (and therefore the southbridge 109) with the dGPU 115, and enables the transfer of data between the dGPU and other components of the printed circuit board. In alternate embodiments, other peripheral devices utilizing the same communication standard, that is, other devices compatible with the PCI-e interface may be used in place of a dGPU. For example, a network interface card is a common peripheral device used in many mobile computing devices that is typically compatible with the PCI-e data transfer standard.

FIG. 2 displays a block diagram of an alternate exemplary hardware configuration 200 of a central printed circuit board featuring a PCI-e interface with multiple PCI-e links, rather than the single, larger PCI-e link as was featured in FIG. 1, in accordance with various embodiments of the present invention. As depicted, FIG. 2 depicts the chipset, comprising the northbridge 101 and southbridge 109; system memory (e.g., memory 105); display 117; and plurality of graphics processing devices (e.g., iGPU 107 and dGPU 115) as disclosed above with reference to FIG. 1. FIG. 2 however, depicts an alternate configuration, wherein the PCI-e interface 111 shared with FIG. 1 is no longer coupled to only one peripheral device (and thus requiring only one link) as in FIG. 1, but instead is coupled to multiple devices. For example, dGPU 115 is coupled to PCI-e interface 111 via link 201 and Network Device 209 is coupled to PCI-e interface 111 via link 207, thereby necessitating the configuration with four separate links, e.g., links 201, 203, 205, and 207.

According to some embodiments, network device 209 may be implemented as a network controller, such as a network interface card. Unfortunately, in typical configurations, when the number of peripheral devices coupled to the PCI-e interface is less than the number of links provided, the unoccupied links and their corresponding capability for data transport is wasted. For example, while link 201 is used by the dGPU 115 to couple to the southbridge 109, and link 207 of the PCI-e interface 111 is used by the Network Device 209 to couple to the southbridge 109, links 203 and 205 are unoccupied. Consequently, until such a time as additional peripheral devices are added to the PCI-e interface, links 203 and 205 are wasted.

Link Aggregation

According to embodiments of the present invention, a system, a method, and an apparatus that allows the aggregation of multiple links of a data transfer interface (e.g., a communication bus) to provide greater bandwidth for communication between a peripheral device and system memory within a computing system are provided. In a typical embodiment, a data transfer interface such as an exemplary PCI-e interface having a plurality of occupied and unoccupied links will have the unoccupied links aggregated by a hardware aggregator with a link directly coupled to a peripheral device, such as a discrete graphics unit. Such link aggregation increases the bandwidth of requests to access data (e.g., in the system memory) of the particular peripheral device. In one embodiment, the bandwidth of direct memory access requests between a dGPU and system memory may be increased by the incorporation of the bandwidth aggregator device. Accordingly, increased throughput and data transfer rates in a portable computing system may be advantageously increased to improve user experience. Moreover, while the invention is described herein with specificity to the PCI-e interface, the invention is operable over communication standards other than PCI-e, which is being described herein for exemplary purposes only.

FIG. 3 displays a block diagram of an alternate exemplary hardware configuration 300 of a central printed circuit board featuring a PCI-e interface with multiple links that are aggregated by a bandwidth aggregator, in accordance with various embodiments of the present invention. As depicted, FIG. 3 depicts the chipset, comprising the northbridge 101 and southbridge 109; system memory (e.g., memory 105); display 117; and plurality of graphics processing devices (e.g., iGPU 107 and dGPU 115) as disclosed above with reference to FIG. 1. FIG. 3, however, depicts an alternate embodiment, wherein the PCI-e interface 111 is coupled to a bandwidth aggregator 301 and an additional peripheral device. As depicted, links 201, 203 and 205 are coupled to the aggregator 301, which is subsequently coupled to the dGPU 115. The remaining link of the PCI-e interface 111 remains coupled to Network Device 209 via link 207, as described with reference to FIG. 2.

According to some embodiments, the bandwidth aggregator 301 is provided to couple to a plurality of unused interfaces of a data communication bus interface (e.g., one or more unoccupied single-lane PCI-e links) with an interface coupled to a peripheral device to increase the bandwidth of data transfer between the communication bus and the peripheral device. By aggregating the unused links with an occupied link, the bandwidth available for a peripheral device (such as a discrete graphical processing unit) can be increased significantly, thereby allowing greater rates of data transfer and a corresponding increase in processing performance. In some embodiments, an original equipment manufacturer (OEM) of the printed circuit board (e.g., motherboard) may manufacturer a bandwidth aggregator to couple to the desired number of interfaces of the data communication bus.

Communicating Data Over an Aggregated PCI-e Interface

With reference to FIG. 4, an exemplary flowchart 400 of a process for communicating data over an aggregated PCI-e interface is depicted, in accordance with various embodiments of the present invention. In one embodiment, the process is performed in a computing system comprising at least a system memory, a discrete graphics processing unit, a data communication bus comprising multiple interfaces (such as a PCI-e interface featuring a plurality of links) to transfer data between the discrete graphics processing unit and the system memory, and a bandwidth aggregator for combining some or all of the interfaces of the data communication bus into a single, aggregated interface coupled to the graphics processing unit. Steps 401-407 describe exemplary steps of the flowchart 400 in accordance with the various embodiments herein described. In one embodiment, flowchart 400 is provided to distribute requests evenly between the combined portions of the interface.

At step 401, one or more direct memory access requests are initiated by the discrete graphics processing unit. The direct memory access requests may comprise, for example, requests for data corresponding to a desired display output. Direct memory access requests allow the dGPU to read and write to and from the system memory without severely taxing the CPU of the system as would a traditional memory access request, thus allowing the CPU to perform other tasks simultaneously and potentially achieving greater efficiency of system resources.

At step 403, the memory access requests initiated in step 401 are received by the bandwidth aggregator. According to some embodiments, the bandwidth aggregator may be coupled directly between the data communication interface and the graphics processing unit, and memory access requests initiated by the graphics processing unit in step 401 may be received directly by the data aggregator at step 403. At step 405, the bandwidth aggregator may parse the received memory access requests from, for example, a stream of contiguous data received from the graphics processing unit into individual memory access requests suitable for distribution and communication over the data communication interface.

At step 407, the memory access requests initiated by the dGPU at step 401 and received and parsed by the aggregator at steps 403 and 405, respectively, are distributed evenly over the number of aggregated interfaces (e.g., links). In one embodiment, distribution may be performed according to a round robin schedule. According to further embodiments, the aggregator may monitor the distribution of requests such that exceptionally large or delayed requests occupying one link or interface may be allocated and distributed through an alternate link or interface. According to some embodiments, instructions to the dGPU initiated by the CPU of the system are communicated only through a link originally coupled to the dGPU and not through the other, previously unoccupied links that have been aggregated.

Exemplary Computing Device

As presented in FIG. 5, an exemplary system for implementing embodiments includes a general purpose computing system environment, such as computing system 500. In its most basic configuration, computing system 500 typically includes at least one processing unit 501 and memory, and an address/data bus 509 (or other interface) for communicating information. Depending on the exact configuration and type of computing system environment, memory may be volatile (such as RAM 502), non-volatile (such as ROM 503, flash memory, etc.) or some combination of the two.

Computer system 500 may also comprise an optional graphics subsystem 505 for presenting information to the computer user, e.g., by displaying information on an attached display device 510, connected by a video cable 511. According to embodiments of the present claimed invention, a bandwidth aggregator 515 is coupled to the graphics subsystem 505 and a communication bus 509 (e.g., a PCI-e interface) for aggregating unused portions of the interface and increasing data transfer rates to and from the graphics subsystem 505. In alternate embodiments, display device 510 may be integrated into the computing system (e.g., a laptop or netbook display panel) and will not require a video cable 511. In one embodiment, process 500 may be performed, in whole or in part, by graphics subsystem 505 in conjunction with bandwidth aggregator 515 and memory 502, with any resulting output displayed in attached display device 510.

Additionally, computing system 500 may also have additional features/functionality. For example, computing system 500 may also include additional storage (removable and/or non-removable) including, but not limited to, magnetic or optical disks or tape. Such additional storage is illustrated in FIG. 6 by data storage device 504. Computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. RAM 502, ROM 503, and data storage device 504 are all examples of computer storage media.

Computer system 500 also comprises an optional alphanumeric input device 506, an optional cursor control or directing device 507, and one or more signal communication interfaces (input/output devices, e.g., a network interface card) 508. Optional alphanumeric input device 506 can communicate information and command selections to central processor 501. Optional cursor control or directing device 507 is coupled to bus 509 for communicating user input information and command selections to central processor 501. Signal communication interface (input/output device) 508, also coupled to bus 509, can be a serial port. Communication interface 509 may also include wireless communication mechanisms. Using communication interface 509, computer system 500 can be communicatively coupled to other computer systems over a communication network such as the Internet or an intranet (e.g., a local area network), or can receive data (e.g., a digital television signal).

Although the subject matter has been described in language specific to structural features and/or processological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.

Claims

1. An electronic system, comprising: a printed circuit board;a processor disposed on the printed circuit board;a memory disposed on the printed circuit board;a data transfer interface, disposed on the printed circuit board and comprising a plurality of links for transferring data through the data transfer interface;a peripheral device, coupled to the data transfer interface over a first link of the plurality of links and configured to perform a plurality of instructions from the processor; anda bandwidth aggregator, coupled to the peripheral device and unoccupied links of the plurality of links, the bandwidth aggregator being operable to distribute memory access requests to and from the peripheral device,wherein the unoccupied links of the plurality of links are combined with the first link to form an aggregated data transfer link operable to communicate data between the peripheral device and the memory with a greater bandwidth than the first link alone, further wherein, the plurality of instructions from the processor are sent to the peripheral device solely through the first link of the aggregated data transfer link.
2. The electronic system of claim 1, wherein the peripheral device comprises a discrete graphics processing unit (dGPU), and wherein further the processor comprises a central processing unit (CPU).
3. The electronic system according to claim 2, wherein the data transfer interface is substantially compliant with the PCI-e interface standard.
4. The electronic system of claim 1, wherein the data transfer interface comprises four links.
5. The electronic system of claim 4, wherein each link of the four links is configurable to be coupled to a peripheral device.
6. The electronic system of claim 1, further comprising an integrated graphical processing unit (iGPU) disposed on the printed circuit board.
7. The electronic system of claim 1, further comprising a network interface card.
8. The electronic system of claim 7, wherein the network interface card occupies a link of the data transfer interface.
9. The electronic system according to claim 8, wherein direct memory access requests initiated by the peripheral device are distributed across the links comprising the aggregated data transfer link according to a round robin schedule.
10. The electronic system of claim 1, wherein the bandwidth aggregator is operable to receive direct memory access requests initiated by the peripheral device and alternately distributes the requests across the links comprising the aggregated data transfer link.
11. The electronic system according to claim 1, wherein requests initiated from the processor are communicated to the peripheral device through the first link.
12. The electronic system according to claim 1, wherein the bandwidth aggregator is manufactured to directly couple a desired number of sub-interfaces of the data transfer interface.
13. The electronic system according to claim 1, wherein the system comprises a mobile computing system.
14. The method according to claim 1, wherein a desired number of sub-interfaces of the data transfer interface is directly coupled by the bandwidth aggregator by an original equipment manufacturer of the printed circuit board.
15. A method for communicating data over an aggregated PCI-e interface, the method comprising: receiving a plurality of programmed instructions in a discrete graphics processing unit from a processor of a computing system;initiating a plurality of memory access requests in the discrete graphics processing unit;receiving the plurality of memory access requests in a bandwidth aggregator coupled to the discrete graphics processing unit and a data communication interface, the bandwidth aggregator aggregating a plurality of sub-interfaces of the data communication interface with a first sub-interface of the data communication interface coupling the data communication interface with the discrete graphics processing unit;parsing the plurality of memory access requests to correspond to the aggregated plurality of sub-interfaces;alternately distributing the plurality of memory access requests across the aggregated plurality of sub-interfaces of a data communication interface; andmonitoring the plurality of memory access requests being distributed by the bandwidth aggregator,wherein the aggregated plurality of sub-interfaces of the data communication interface is operable to communicate data between the discrete graphics processing unit and the data communication interface with a greater bandwidth than a single sub-interface of the data communication interface,wherein, the plurality of programmed instructions from the processor are sent to the peripheral device solely through the first sub-interface of the aggregated plurality of sub-interfaces.
16. The method according to claim 15, wherein the aggregated plurality of sub-interfaces comprises a sub-interface of the data communication interface coupled to the discrete graphics processing unit and a plurality of unoccupied sub-interfaces of the data communication interface.
17. The method according to claim 16, further comprising: receiving, in the discrete graphics processing unit, instructions from a CPU of the system via the sub-interface of the data communication interface coupled to the discrete graphics processing unit.
18. The method according to claim 15, wherein the plurality of memory access requests comprises at least one of the group comprising: a memory read request and a memory write request.
19. The method according to claim 15, wherein the data communication interface is substantially compliant with a PCI-e standard and wherein the monitoring the plurality of memory access requests is performed in response to alternately distributing the plurality of memory access requests across an aggregated plurality of sub-interfaces of a data communication interface.
20. The method according to claim 15, wherein receiving the plurality of memory access requests in the bandwidth aggregator comprises receiving the plurality of memory access requests as a stream of contiguous data from the discrete graphics processing unit.
21. The method according to claim 20, wherein parsing the plurality of memory access requests to correspond to the aggregated plurality of sub-interfaces comprises parsing the stream of contiguous data into a plurality of individual memory access requests.
22. An apparatus for enabling the electronic coupling of a plurality of components in an electronic device, the apparatus comprising: a motherboard comprising a chipset, the chipset comprising a northbridge and a southbridge;a central processing unit coupled to the northbridge;a first graphics processing unit, wherein the first graphics processing unit isa system memory disposed on the motherboard, the system memory electronically coupled to the northbridge;a PCI-E interface integrated on the southbridge, the PCI-E interface comprising a plurality of links;a second graphics processing unit coupled to the motherboard through a first link of the plurality of links of the PCI-E interface; anda link aggregator operable to distribute and monitor memory access requests to the second graphics processing unit through an aggregated data transfer link comprising an unoccupied portion of the plurality of links aggregated with the first link of the plurality of links, and coupling the second graphics processing unit and the PCI-E interface, the link aggregator being operable to increase bandwidth for communication between the second graphics processing unit and the system memory,wherein, the central processing unit is configured to send a plurality of instructions to at least one of: the first graphics processing unit and the second graphics processing unit,further wherein the plurality of instructions from the central processing unit are sent to the second graphics processing unit solely through the first link of the aggregated data transfer link.
23. The apparatus according to claim 22, wherein a link of the plurality of links is coupled to a network interface card.
24. The apparatus according to claim 22, wherein the link aggregator is disposed on the motherboard between the PCI-E interface and the second graphical processing unit.
25. The apparatus according to claim 22, wherein the second graphical processing unit is capable of higher performance than the first graphical processing unit.
26. The system according to claim 1, wherein the processor is configured to send graphics rendering instructions to the peripheral device through the data transfer interface.
27. The apparatus according to claim 22, wherein the motherboard further comprises a processor operable to send graphics rendering instructions to the second graphics processing unit through the PCI-E interface.

US Referenced Citations (148)

Number	Name	Date	Kind
3940740	Coontz	Feb 1976	A
4541075	Dill et al.	Sep 1985	A
4773044	Sfarti et al.	Sep 1988	A
4885703	Deering	Dec 1989	A
4951220	Ramacher et al.	Aug 1990	A
4985988	Littlebury	Jan 1991	A
5036473	Butts et al.	Jul 1991	A
5125011	Fung	Jun 1992	A
5276893	Savaria	Jan 1994	A
5379405	Ostrowski	Jan 1995	A
5392437	Matter et al.	Feb 1995	A
5448496	Butts et al.	Sep 1995	A
5455536	Kono et al.	Oct 1995	A
5513144	O'Toole	Apr 1996	A
5513354	Dwork et al.	Apr 1996	A
5578976	Yao	Nov 1996	A
5630171	Chejlava, Jr. et al.	May 1997	A
5634107	Yumoto et al.	May 1997	A
5638946	Zavracky	Jun 1997	A
5671376	Bucher et al.	Sep 1997	A
5694143	Fielder et al.	Dec 1997	A
5705938	Kean	Jan 1998	A
5766979	Budnaitis	Jun 1998	A
5768178	McLaury	Jun 1998	A
5805833	Verdun	Sep 1998	A
5884053	Clouser et al.	Mar 1999	A
5896391	Solheim et al.	Apr 1999	A
5909595	Rosenthal et al.	Jun 1999	A
5913218	Carney et al.	Jun 1999	A
5937173	Olarig et al.	Aug 1999	A
5956252	Lau et al.	Sep 1999	A
5996996	Brunelle	Dec 1999	A
5999990	Sharrit et al.	Dec 1999	A
6003083	Davies et al.	Dec 1999	A
6003100	Lee	Dec 1999	A
6049870	Greaves	Apr 2000	A
6065131	Andrews et al.	May 2000	A
6067262	Irrinki et al.	May 2000	A
6069540	Berenz et al.	May 2000	A
6072686	Yarbrough	Jun 2000	A
6085269	Chan et al.	Jul 2000	A
6094116	Tai et al.	Jul 2000	A
6219628	Kodosky et al.	Apr 2001	B1
6249288	Campbell	Jun 2001	B1
6255849	Mohan	Jul 2001	B1
6307169	Sun et al.	Oct 2001	B1
6323699	Quiet	Nov 2001	B1
6348811	Haycock et al.	Feb 2002	B1
6363285	Wey	Mar 2002	B1
6363295	Akram et al.	Mar 2002	B1
6366968	Hunsaker	Apr 2002	B1
6370603	Silverman et al.	Apr 2002	B1
6377898	Steffan et al.	Apr 2002	B1
6388590	Ng	May 2002	B1
6389585	Masleid et al.	May 2002	B1
6392431	Jones	May 2002	B1
6429288	Esswein et al.	Aug 2002	B1
6429747	Franck et al.	Aug 2002	B2
6433657	Chen	Aug 2002	B1
6437657	Jones	Aug 2002	B1
6486425	Seki	Nov 2002	B2
6504841	Larson et al.	Jan 2003	B1
6530045	Cooper et al.	Mar 2003	B1
6535986	Rosno et al.	Mar 2003	B1
6598194	Madge et al.	Jul 2003	B1
6629181	Alappat et al.	Sep 2003	B1
6662133	Engel et al.	Dec 2003	B2
6700581	Baldwin et al.	Mar 2004	B2
6701466	Fiedler	Mar 2004	B1
6717474	Chen et al.	Apr 2004	B2
6718496	Fukuhisa et al.	Apr 2004	B1
6734770	Aigner et al.	May 2004	B2
6738856	Milley et al.	May 2004	B1
6741258	Peck, Jr. et al.	May 2004	B1
6747483	To et al.	Jun 2004	B2
6782587	Reilly	Aug 2004	B2
6788101	Rahman	Sep 2004	B1
6794101	Liu et al.	Sep 2004	B2
6806788	Marumoto	Oct 2004	B1
6823283	Steger et al.	Nov 2004	B2
6825847	Molnar et al.	Nov 2004	B1
6849924	Allison et al.	Feb 2005	B2
6850133	Ma	Feb 2005	B2
6879207	Nickolls	Apr 2005	B1
6938176	Alben et al.	Aug 2005	B1
6956579	Diard et al.	Oct 2005	B1
6982718	Kilgard et al.	Jan 2006	B2
7020598	Jacobson	Mar 2006	B1
7058738	Stufflebeam, Jr.	Jun 2006	B2
7069369	Chou et al.	Jun 2006	B2
7069458	Sardi et al.	Jun 2006	B1
7075542	Leather	Jul 2006	B1
7075797	Leonard et al.	Jul 2006	B1
7085824	Forth et al.	Aug 2006	B2
7099969	McAfee et al.	Aug 2006	B2
7136953	Bisson et al.	Nov 2006	B1
7170315	Bakker et al.	Jan 2007	B2
7174407	Hou et al.	Feb 2007	B2
7174411	Ngai	Feb 2007	B1
7185135	Briggs et al.	Feb 2007	B1
7187383	Kent	Mar 2007	B2
7225287	Wooten	May 2007	B2
7246274	Kizer et al.	Jul 2007	B2
7260007	Jain et al.	Aug 2007	B2
RE39898	Nally et al.	Oct 2007	E
7293125	McAfee et al.	Nov 2007	B2
7293127	Caruk	Nov 2007	B2
7305571	Cranford, Jr. et al.	Dec 2007	B2
7324458	Schoenborn et al.	Jan 2008	B2
7340541	Castro et al.	Mar 2008	B2
7363417	Ngai	Apr 2008	B1
7383412	Diard	Jun 2008	B1
7398336	Feng et al.	Jul 2008	B2
7412554	Danilak	Aug 2008	B2
7415551	Pescatore	Aug 2008	B2
7424564	Mehta et al.	Sep 2008	B2
7469311	Tsu et al.	Dec 2008	B1
7478187	Knepper et al.	Jan 2009	B2
7480757	Atherton et al.	Jan 2009	B2
7480808	Caruk et al.	Jan 2009	B2
7496742	Khatri et al.	Feb 2009	B2
7500041	Danilak	Mar 2009	B2
7525986	Lee et al.	Apr 2009	B2
7536490	Mao	May 2009	B2
7539801	Xie et al.	May 2009	B2
7562174	Danilak	Jul 2009	B2
7594061	Shen et al.	Sep 2009	B2
7600112	Khatri et al.	Oct 2009	B2
7617348	Danilak	Nov 2009	B2
7631128	Sgrosso et al.	Dec 2009	B1
7663633	Diamond et al.	Feb 2010	B1
7705850	Tsu	Apr 2010	B1
7756123	Huang et al.	Jul 2010	B1
7777748	Bakalash et al.	Aug 2010	B2
7782325	Gonzalez et al.	Aug 2010	B2
7788439	Tsu et al.	Aug 2010	B1
7793029	Parson et al.	Sep 2010	B1
7793030	Jenkins et al.	Sep 2010	B2
7849235	Ihara et al.	Dec 2010	B2
8132015	Wyatt	Mar 2012	B1
8532098	Reed et al.	Sep 2013	B2
8687639	Kumar	Apr 2014	B2
20070011383	Berke et al.	Jan 2007	A1
20080072098	Hunsaker et al.	Mar 2008	A1
20090006708	Lim	Jan 2009	A1
20090086747	Naven et al.	Apr 2009	A1
20090254692	Feehrer	Oct 2009	A1
20100309918	Kumar	Dec 2010	A1

Non-Patent Literature Citations (2)

Entry
Dictionary.com. definition of “monitor.” Viewed Jun. 15, 2011.
PCI-SIG. PCI Express Base Specification. Revision 1.1. Mar. 28, 2005.

Related Publications (1)

	Number	Date	Country
	20110145468 A1	Jun 2011	US

Aggregating unoccupied PCI-e links to provide greater bandwidth

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

CPC

Field of Search

US

CPC

International Classifications

Abstract

Description

Claims

US Referenced Citations (148)

Non-Patent Literature Citations (2)

Related Publications (1)