Aggregating unoccupied PCI-e links to provide greater bandwidth

Information

  • Patent Grant
  • 9176909
  • Patent Number
    9,176,909
  • Date Filed
    Friday, December 11, 2009
    15 years ago
  • Date Issued
    Tuesday, November 3, 2015
    9 years ago
  • CPC
  • Field of Search
    • US
    • 710 022000
    • 710 104000
    • 710 105000
    • 710 107000
    • 710 301000
    • 710 305000
    • 710 307000
    • 710 316000
    • 710 317000
    • CPC
    • G06F2213/0026
    • G06F3/1438
    • G06T2210/52
    • G09G2360/06
  • International Classifications
    • G06F13/40
    • G06F13/20
Abstract
Embodiments of the claimed subject matter are directed to systems and a method that allows the aggregation of multiple interfaces of a single data communication bus to provide greater bandwidth for communication between a peripheral device and system memory within a computing system. In one embodiment, a system is provided wherein the unoccupied interfaces of the data communication bus is aggregated with an occupied interface coupled to a peripheral device to increase the bandwidth of data transfer requests between the peripheral device and the system memory.
Description
BACKGROUND

The development of cheaper, more capable integrated circuits have led to the development of portable computing systems featuring smaller, sleeker designs while retaining relatively sophisticated computing capabilities. These computing systems refer primarily to laptops and netbooks, but also include smart phones, and portable audio devices, portable video devices and portable video game consoles. However, as the recent trend of miniaturizing portable computing systems continues, the space available for hardware for these designs has progressively decreased. As a result, the optimization of hardware design and architecture has become of primary importance.


Typical computing devices include at least a collection of microprocessors or a central processing unit (CPU), some memory, a motherboard (e.g., central printed circuit board) featuring a chipset, and at least one graphics processing unit for generating video output to a display. In some conventional motherboard designs, the chipset is arranged into two separate component hubs, which are commonly referred to as the “northbridge” and “southbridge,” respectively. The northbridge typically handles communications among the CPU, random access memory (RAM), video output interfaces, and the southbridge. In many contemporary netbook and laptop implementations, the video output interface is implemented as an integrated graphics processing unit. The southbridge, on the other hand, is one or more chips that provide a platform to support a plurality of peripheral components, such as input/output devices and mass storage devices. In many implementations, the southbridge may also include integrated peripherals, such as audio controllers, network interface cards, universal serial bus (USB) and PCI-express connections, etc.


Traditionally, netbooks and laptops have used integrated graphics solutions such as integrated graphics processing units (GPUs) coupled to the northbridge. Integrated graphics processing units are graphics processors that utilize a portion of a computer's system memory rather than having its own dedicated memory. In general, integrated GPUs are cheaper to implement than dedicated or “discrete” GPUs, and offer relatively improved battery life and lower power usage, but at the cost of reduced capability and performance levels relative to discrete GPUs. Advantageously, manufacturers of netbooks and laptops have begun to offer configurations with higher graphics processing capabilities by providing computer systems that include additional discrete graphics processing units in addition to the integrated graphics processors.


Discrete or “dedicated” GPUs are distinguishable from integrated GPUs by having higher performance and also having local memory dedicated for use by the GPU that the GPU does not share with the underlying computer system. Commonly, discrete GPUs are implemented on discrete circuit boards called “video cards” which include, among other components, a GPU, the local memory, communication buses and various output terminals. In conventional applications, these video cards typically interface with the main circuit board (e.g., motherboard) of a computing system through a PCI Express (PCI-e) interface, upon which the video card may be mounted. In general, discrete GPUs are capable of significantly higher performance levels relative to integrated GPUs but typically require and consume higher levels of power relative to integrated graphics solutions. Portable computing devices with both integrated and discrete graphics processing solutions often offer a mechanism or procedure that enables the user to alternate usage between the particular solutions so as to manage performance and battery life according to situational needs or desired performance levels.


As mentioned above, in typical netbooks and laptops, the PCI Express interface is a component of the southbridge. However, unlike PCI-e interfaces in other computing systems such as desktops, the PCI-e interface of a portable computing device is often of a reduced size and, consequently, of a reduced capacity. In a typical configuration, the PCI-e interface of any computing device comprises a plurality of links, with each link comprising a further plurality of “lanes,” and being configured to independently couple to a peripheral device. The number of lanes in a link coupled to a peripheral device correlates with the bandwidth of the connection, and thus, couplings between a peripheral device and a link with larger amounts of lanes have greater bandwidth than couplings with links comprised of only single lanes. Traditionally, the number of links in a PCI-e interface of a portable computing device may be configured by the manufacturer in separate configurations to suit specific hardware implementations.


In a popular configuration, the links in PCI-e interface of a portable computing device may be arranged in either of two combinations totaling up to four lanes. For example, implementations can comprise either a single link of four lanes (1×4), thereby offering relatively greater bandwidth for a coupled device. Alternatively, implementations may feature four separate links, with each link capable of being coupled to a separate device but limited to a single lane (4×1) with a correspondingly low bandwidth. Thus, whenever the PCI-e interface is coupled to one device, the single link (1×4) configuration may be optimal, but multiple devices require additional links that adversely impact the amount of bandwidth and throughput of each connection.


Unfortunately, since netbooks and laptops are often intended to be used with network connections, chipset manufacturers of computing devices that will include a discrete GPU will invariably manufacture southbridges (and/or motherboards in general) with PCI-e interfaces having four separate links of one lane each, one of which is occupied by a network controller (e.g., a network interface card). This results in the extremely inefficient configuration wherein one link is coupled to the network controller, another link is coupled to the graphics processing unit, and the other two links remaining unoccupied (or coupled to additional devices). While the bandwidth from a link with only one lane may be sufficient to run certain applications on certain devices, for usage in graphics processing a link having only a single lane is often insufficient and likely to drastically and adversely impact the performance of the discrete graphics processing unit. Moreover, this configuration results not only in substandard performance for discrete graphics processing units, but also commonly results in a waste of the remaining unoccupied links.


SUMMARY

This Summary is provided to introduce a selection of concepts in a simplified form that is further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter.


Embodiments of the claimed subject matter are directed to systems and a method that allows the aggregation of multiple interfaces of a data communication bus to provide greater bandwidth for communication between a peripheral device and system memory within a computing system. In one embodiment, unoccupied interfaces of the data communication bus are combined with an interface coupled to a peripheral device to increase the bandwidth of data transfer requests between the peripheral device and the system memory.


In another embodiment, a process is provided that enables the distribution of requests for accessing system memory (e.g., direct memory access requests) initiated by a discrete graphics processing unit among aggregated links of a PCI-e interface. The process comprises receiving the requests in a link aggregator, parsing the requests to correspond to the number of aggregated links, and distributing the requests among the links evenly. In further embodiments, the requests may be distributed in a round robin fashion.


In yet another embodiment, an apparatus is provided for aggregating unoccupied links of a PCI-e interface to increase the bandwidth to a discrete graphics processing unit in a system with two or more graphics processing units. According to some embodiments, the system includes a printed circuit board with: a first and second graphics processing units; system memory; and a plurality of peripheral components including a PCI-e interface, wherein the PCI-e interface is comprised of a plurality of links which may be coupled to a plurality of devices, including the second graphics processing unit, to facilitate the transfer of data between the devices and the system memory. According to this embodiment, a link aggregator will aggregate the unoccupied links of the PCI-e interface to increase the bandwidth of requests from the second graphics processing unit and the system memory.





BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and form a part of this specification, illustrate embodiments of the invention and, together with the description, serve to explain the principles of the invention:



FIG. 1 depicts a block diagram of an exemplary hardware configuration of a central printed circuit board featuring a PCI-e interface with one link, in accordance with various embodiments of the present invention.



FIG. 2 depicts a block diagram of an alternate exemplary hardware configuration of a central printed circuit board featuring a PCI-e interface with multiple PCI-e links, in accordance with various embodiments of the present invention.



FIG. 3 depicts a block diagram of an alternate exemplary hardware configuration of a central printed circuit board featuring a PCI-e interface with multiple links that are aggregated by a bandwidth aggregator, in accordance with various embodiments of the present invention.



FIG. 4 depicts an exemplary flowchart of a process of distributing memory access requests from a peripheral device over an aggregated PCI-e link, in accordance with various embodiments of the present invention.



FIG. 5 depicts a block diagram of a basic computing system, in accordance with various embodiments of the present invention.





DETAILED DESCRIPTION

Reference will now be made in detail to several embodiments. While the subject matter will be described in conjunction with the alternative embodiments, it will be understood that they are not intended to limit the claimed subject matter to these embodiments. On the contrary, the claimed subject matter is intended to cover alternative, modifications, and equivalents, which may be included within the spirit and scope of the claimed subject matter as defined by the appended claims.


Furthermore, in the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the claimed subject matter. However, it will be recognized by one skilled in the art that embodiments may be practiced without these specific details or with equivalents thereof. In other instances, well-known processes, procedures, components, and circuits have not been described in detail as not to unnecessarily obscure aspects and features of the subject matter.


Portions of the detailed description that follow are presented and discussed in terms of a process. Although steps and sequencing thereof are disclosed in figures herein (e.g., FIG. 4) describing the operations of this process, such steps and sequencing are exemplary. Embodiments are well suited to performing various other steps or variations of the steps recited in the flowchart of the figure herein, and in a sequence other than that depicted and described herein.


Some portions of the detailed description are presented in terms of procedures, steps, logic blocks, processing, and other symbolic representations of operations on data bits that can be performed on computer memory. These descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. A procedure, computer-executed step, logic block, process, etc., is here, and generally, conceived to be a self-consistent sequence of steps or instructions leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated in a computer system. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.


It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the following discussions, it is appreciated that throughout, discussions utilizing terms such as “accessing,” “writing,” “including,” “storing,” “transmitting,” “traversing,” “associating,” “identifying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.


Link Configurations


Specific configurations of the central printed circuit board (e.g., motherboard) in different portable computing devices may vary according to design and/or manufacturer preference, but often include: a central processing unit (CPU), system memory, a chipset that enables communication between various components within the computing device and the central printed circuit board specifically, as well as one or more graphics processing units. These graphics processing units may be implemented as integrated and/or discrete. For computing devices that include discrete graphics processing units, data may be transferred between the discrete graphics processing units and the system memory or the CPU via a communication bus. A popular communication bus standard is referred to as PCI Express or, “PCI-e,” alternatively. The PCI-e interface of a typical portable computing device comprises a plurality of sub-interfaces (referred to as “links”) which may be arranged according to either of two combinations totaling up to four lanes. For example, implementations can comprise either a single sub-interface (link) of four serially arranged lanes (1×4), thereby offering relatively greater bandwidth for a coupled device but allowing a coupling of only one device to the PCI-e interface, or, alternatively, four sub-interfaces (links) of a single lane each that allows up to four devices to be coupled to the interface, but at lower data transfer rates.



FIG. 1 displays a block diagram of an exemplary hardware configuration 100 of a central printed circuit board featuring a PCI-e interface with one link, in accordance with various embodiments of the present invention. As depicted, FIG. 1 depicts a chipset comprising two separate chips (e.g., northbridge 101 and southbridge 109, respectively) operating as hubs for various components. As displayed, the two chips (e.g., northbridge 101 and southbridge 109) are communicatively coupled. Additional features of FIG. 1 may include system memory (e.g., memory 105) and a plurality of graphics processing devices (e.g., iGPU 107 and dGPU 115).


In one embodiment, a central processing device (e.g., CPU 103) and system memory 105 are coupled to (or even disposed on) one of the chips. In further embodiments, the CPU 103 and the system memory may be coupled to (or disposed on) the same chip. As shown, both CPU 103 and memory 105 are coupled to the northbridge 101. According to some embodiments, the configuration 100 may include a plurality of graphics processing devices. The plurality of graphics processing devices may include, for example, an integrated graphics processing unit (e.g., iGPU 107) coupled to the northbridge 101 and also coupled to a display device (e.g., display 117). In one embodiment, the display device 117 is coupled to an output interface of the integrated graphics processing unit, and display data generated by other components (e.g., at a dGPU) must be passed to the display device 117 through the iGPU. According to some embodiments, the display device 117 may be implemented as, for example, a discrete monitor or the display panel of a portable computing device.


According to some embodiments, a chip of the printed circuit board may include one or more integrated data communication buses. As depicted in FIG. 1, a data communication bus (e.g., PCI-e 111) is coupled with the southbridge 109. In some embodiments, the data communication bus may include an interface to couple with one or more peripheral devices, such as a discrete graphics processing unit (e.g., dGPU 115). In a typical embodiment, the interface may comprise a plurality of links, with each link comprising a plurality of lanes. As depicted in FIG. 1, the PCI-e 111 interface is configured to provide a single link (e.g., link 113) comprising four lanes.


As presented, the link 113 couples the PCI-e 111 interface (and therefore the southbridge 109) with the dGPU 115, and enables the transfer of data between the dGPU and other components of the printed circuit board. In alternate embodiments, other peripheral devices utilizing the same communication standard, that is, other devices compatible with the PCI-e interface may be used in place of a dGPU. For example, a network interface card is a common peripheral device used in many mobile computing devices that is typically compatible with the PCI-e data transfer standard.



FIG. 2 displays a block diagram of an alternate exemplary hardware configuration 200 of a central printed circuit board featuring a PCI-e interface with multiple PCI-e links, rather than the single, larger PCI-e link as was featured in FIG. 1, in accordance with various embodiments of the present invention. As depicted, FIG. 2 depicts the chipset, comprising the northbridge 101 and southbridge 109; system memory (e.g., memory 105); display 117; and plurality of graphics processing devices (e.g., iGPU 107 and dGPU 115) as disclosed above with reference to FIG. 1. FIG. 2 however, depicts an alternate configuration, wherein the PCI-e interface 111 shared with FIG. 1 is no longer coupled to only one peripheral device (and thus requiring only one link) as in FIG. 1, but instead is coupled to multiple devices. For example, dGPU 115 is coupled to PCI-e interface 111 via link 201 and Network Device 209 is coupled to PCI-e interface 111 via link 207, thereby necessitating the configuration with four separate links, e.g., links 201, 203, 205, and 207.


According to some embodiments, network device 209 may be implemented as a network controller, such as a network interface card. Unfortunately, in typical configurations, when the number of peripheral devices coupled to the PCI-e interface is less than the number of links provided, the unoccupied links and their corresponding capability for data transport is wasted. For example, while link 201 is used by the dGPU 115 to couple to the southbridge 109, and link 207 of the PCI-e interface 111 is used by the Network Device 209 to couple to the southbridge 109, links 203 and 205 are unoccupied. Consequently, until such a time as additional peripheral devices are added to the PCI-e interface, links 203 and 205 are wasted.


Link Aggregation


According to embodiments of the present invention, a system, a method, and an apparatus that allows the aggregation of multiple links of a data transfer interface (e.g., a communication bus) to provide greater bandwidth for communication between a peripheral device and system memory within a computing system are provided. In a typical embodiment, a data transfer interface such as an exemplary PCI-e interface having a plurality of occupied and unoccupied links will have the unoccupied links aggregated by a hardware aggregator with a link directly coupled to a peripheral device, such as a discrete graphics unit. Such link aggregation increases the bandwidth of requests to access data (e.g., in the system memory) of the particular peripheral device. In one embodiment, the bandwidth of direct memory access requests between a dGPU and system memory may be increased by the incorporation of the bandwidth aggregator device. Accordingly, increased throughput and data transfer rates in a portable computing system may be advantageously increased to improve user experience. Moreover, while the invention is described herein with specificity to the PCI-e interface, the invention is operable over communication standards other than PCI-e, which is being described herein for exemplary purposes only.



FIG. 3 displays a block diagram of an alternate exemplary hardware configuration 300 of a central printed circuit board featuring a PCI-e interface with multiple links that are aggregated by a bandwidth aggregator, in accordance with various embodiments of the present invention. As depicted, FIG. 3 depicts the chipset, comprising the northbridge 101 and southbridge 109; system memory (e.g., memory 105); display 117; and plurality of graphics processing devices (e.g., iGPU 107 and dGPU 115) as disclosed above with reference to FIG. 1. FIG. 3, however, depicts an alternate embodiment, wherein the PCI-e interface 111 is coupled to a bandwidth aggregator 301 and an additional peripheral device. As depicted, links 201, 203 and 205 are coupled to the aggregator 301, which is subsequently coupled to the dGPU 115. The remaining link of the PCI-e interface 111 remains coupled to Network Device 209 via link 207, as described with reference to FIG. 2.


According to some embodiments, the bandwidth aggregator 301 is provided to couple to a plurality of unused interfaces of a data communication bus interface (e.g., one or more unoccupied single-lane PCI-e links) with an interface coupled to a peripheral device to increase the bandwidth of data transfer between the communication bus and the peripheral device. By aggregating the unused links with an occupied link, the bandwidth available for a peripheral device (such as a discrete graphical processing unit) can be increased significantly, thereby allowing greater rates of data transfer and a corresponding increase in processing performance. In some embodiments, an original equipment manufacturer (OEM) of the printed circuit board (e.g., motherboard) may manufacturer a bandwidth aggregator to couple to the desired number of interfaces of the data communication bus.


Communicating Data Over an Aggregated PCI-e Interface


With reference to FIG. 4, an exemplary flowchart 400 of a process for communicating data over an aggregated PCI-e interface is depicted, in accordance with various embodiments of the present invention. In one embodiment, the process is performed in a computing system comprising at least a system memory, a discrete graphics processing unit, a data communication bus comprising multiple interfaces (such as a PCI-e interface featuring a plurality of links) to transfer data between the discrete graphics processing unit and the system memory, and a bandwidth aggregator for combining some or all of the interfaces of the data communication bus into a single, aggregated interface coupled to the graphics processing unit. Steps 401-407 describe exemplary steps of the flowchart 400 in accordance with the various embodiments herein described. In one embodiment, flowchart 400 is provided to distribute requests evenly between the combined portions of the interface.


At step 401, one or more direct memory access requests are initiated by the discrete graphics processing unit. The direct memory access requests may comprise, for example, requests for data corresponding to a desired display output. Direct memory access requests allow the dGPU to read and write to and from the system memory without severely taxing the CPU of the system as would a traditional memory access request, thus allowing the CPU to perform other tasks simultaneously and potentially achieving greater efficiency of system resources.


At step 403, the memory access requests initiated in step 401 are received by the bandwidth aggregator. According to some embodiments, the bandwidth aggregator may be coupled directly between the data communication interface and the graphics processing unit, and memory access requests initiated by the graphics processing unit in step 401 may be received directly by the data aggregator at step 403. At step 405, the bandwidth aggregator may parse the received memory access requests from, for example, a stream of contiguous data received from the graphics processing unit into individual memory access requests suitable for distribution and communication over the data communication interface.


At step 407, the memory access requests initiated by the dGPU at step 401 and received and parsed by the aggregator at steps 403 and 405, respectively, are distributed evenly over the number of aggregated interfaces (e.g., links). In one embodiment, distribution may be performed according to a round robin schedule. According to further embodiments, the aggregator may monitor the distribution of requests such that exceptionally large or delayed requests occupying one link or interface may be allocated and distributed through an alternate link or interface. According to some embodiments, instructions to the dGPU initiated by the CPU of the system are communicated only through a link originally coupled to the dGPU and not through the other, previously unoccupied links that have been aggregated.


Exemplary Computing Device


As presented in FIG. 5, an exemplary system for implementing embodiments includes a general purpose computing system environment, such as computing system 500. In its most basic configuration, computing system 500 typically includes at least one processing unit 501 and memory, and an address/data bus 509 (or other interface) for communicating information. Depending on the exact configuration and type of computing system environment, memory may be volatile (such as RAM 502), non-volatile (such as ROM 503, flash memory, etc.) or some combination of the two.


Computer system 500 may also comprise an optional graphics subsystem 505 for presenting information to the computer user, e.g., by displaying information on an attached display device 510, connected by a video cable 511. According to embodiments of the present claimed invention, a bandwidth aggregator 515 is coupled to the graphics subsystem 505 and a communication bus 509 (e.g., a PCI-e interface) for aggregating unused portions of the interface and increasing data transfer rates to and from the graphics subsystem 505. In alternate embodiments, display device 510 may be integrated into the computing system (e.g., a laptop or netbook display panel) and will not require a video cable 511. In one embodiment, process 500 may be performed, in whole or in part, by graphics subsystem 505 in conjunction with bandwidth aggregator 515 and memory 502, with any resulting output displayed in attached display device 510.


Additionally, computing system 500 may also have additional features/functionality. For example, computing system 500 may also include additional storage (removable and/or non-removable) including, but not limited to, magnetic or optical disks or tape. Such additional storage is illustrated in FIG. 6 by data storage device 504. Computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. RAM 502, ROM 503, and data storage device 504 are all examples of computer storage media.


Computer system 500 also comprises an optional alphanumeric input device 506, an optional cursor control or directing device 507, and one or more signal communication interfaces (input/output devices, e.g., a network interface card) 508. Optional alphanumeric input device 506 can communicate information and command selections to central processor 501. Optional cursor control or directing device 507 is coupled to bus 509 for communicating user input information and command selections to central processor 501. Signal communication interface (input/output device) 508, also coupled to bus 509, can be a serial port. Communication interface 509 may also include wireless communication mechanisms. Using communication interface 509, computer system 500 can be communicatively coupled to other computer systems over a communication network such as the Internet or an intranet (e.g., a local area network), or can receive data (e.g., a digital television signal).


Although the subject matter has been described in language specific to structural features and/or processological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.

Claims
  • 1. An electronic system, comprising: a printed circuit board;a processor disposed on the printed circuit board;a memory disposed on the printed circuit board;a data transfer interface, disposed on the printed circuit board and comprising a plurality of links for transferring data through the data transfer interface;a peripheral device, coupled to the data transfer interface over a first link of the plurality of links and configured to perform a plurality of instructions from the processor; anda bandwidth aggregator, coupled to the peripheral device and unoccupied links of the plurality of links, the bandwidth aggregator being operable to distribute memory access requests to and from the peripheral device,wherein the unoccupied links of the plurality of links are combined with the first link to form an aggregated data transfer link operable to communicate data between the peripheral device and the memory with a greater bandwidth than the first link alone, further wherein, the plurality of instructions from the processor are sent to the peripheral device solely through the first link of the aggregated data transfer link.
  • 2. The electronic system of claim 1, wherein the peripheral device comprises a discrete graphics processing unit (dGPU), and wherein further the processor comprises a central processing unit (CPU).
  • 3. The electronic system according to claim 2, wherein the data transfer interface is substantially compliant with the PCI-e interface standard.
  • 4. The electronic system of claim 1, wherein the data transfer interface comprises four links.
  • 5. The electronic system of claim 4, wherein each link of the four links is configurable to be coupled to a peripheral device.
  • 6. The electronic system of claim 1, further comprising an integrated graphical processing unit (iGPU) disposed on the printed circuit board.
  • 7. The electronic system of claim 1, further comprising a network interface card.
  • 8. The electronic system of claim 7, wherein the network interface card occupies a link of the data transfer interface.
  • 9. The electronic system according to claim 8, wherein direct memory access requests initiated by the peripheral device are distributed across the links comprising the aggregated data transfer link according to a round robin schedule.
  • 10. The electronic system of claim 1, wherein the bandwidth aggregator is operable to receive direct memory access requests initiated by the peripheral device and alternately distributes the requests across the links comprising the aggregated data transfer link.
  • 11. The electronic system according to claim 1, wherein requests initiated from the processor are communicated to the peripheral device through the first link.
  • 12. The electronic system according to claim 1, wherein the bandwidth aggregator is manufactured to directly couple a desired number of sub-interfaces of the data transfer interface.
  • 13. The electronic system according to claim 1, wherein the system comprises a mobile computing system.
  • 14. The method according to claim 1, wherein a desired number of sub-interfaces of the data transfer interface is directly coupled by the bandwidth aggregator by an original equipment manufacturer of the printed circuit board.
  • 15. A method for communicating data over an aggregated PCI-e interface, the method comprising: receiving a plurality of programmed instructions in a discrete graphics processing unit from a processor of a computing system;initiating a plurality of memory access requests in the discrete graphics processing unit;receiving the plurality of memory access requests in a bandwidth aggregator coupled to the discrete graphics processing unit and a data communication interface, the bandwidth aggregator aggregating a plurality of sub-interfaces of the data communication interface with a first sub-interface of the data communication interface coupling the data communication interface with the discrete graphics processing unit;parsing the plurality of memory access requests to correspond to the aggregated plurality of sub-interfaces;alternately distributing the plurality of memory access requests across the aggregated plurality of sub-interfaces of a data communication interface; andmonitoring the plurality of memory access requests being distributed by the bandwidth aggregator,wherein the aggregated plurality of sub-interfaces of the data communication interface is operable to communicate data between the discrete graphics processing unit and the data communication interface with a greater bandwidth than a single sub-interface of the data communication interface,wherein, the plurality of programmed instructions from the processor are sent to the peripheral device solely through the first sub-interface of the aggregated plurality of sub-interfaces.
  • 16. The method according to claim 15, wherein the aggregated plurality of sub-interfaces comprises a sub-interface of the data communication interface coupled to the discrete graphics processing unit and a plurality of unoccupied sub-interfaces of the data communication interface.
  • 17. The method according to claim 16, further comprising: receiving, in the discrete graphics processing unit, instructions from a CPU of the system via the sub-interface of the data communication interface coupled to the discrete graphics processing unit.
  • 18. The method according to claim 15, wherein the plurality of memory access requests comprises at least one of the group comprising: a memory read request and a memory write request.
  • 19. The method according to claim 15, wherein the data communication interface is substantially compliant with a PCI-e standard and wherein the monitoring the plurality of memory access requests is performed in response to alternately distributing the plurality of memory access requests across an aggregated plurality of sub-interfaces of a data communication interface.
  • 20. The method according to claim 15, wherein receiving the plurality of memory access requests in the bandwidth aggregator comprises receiving the plurality of memory access requests as a stream of contiguous data from the discrete graphics processing unit.
  • 21. The method according to claim 20, wherein parsing the plurality of memory access requests to correspond to the aggregated plurality of sub-interfaces comprises parsing the stream of contiguous data into a plurality of individual memory access requests.
  • 22. An apparatus for enabling the electronic coupling of a plurality of components in an electronic device, the apparatus comprising: a motherboard comprising a chipset, the chipset comprising a northbridge and a southbridge;a central processing unit coupled to the northbridge;a first graphics processing unit, wherein the first graphics processing unit isa system memory disposed on the motherboard, the system memory electronically coupled to the northbridge;a PCI-E interface integrated on the southbridge, the PCI-E interface comprising a plurality of links;a second graphics processing unit coupled to the motherboard through a first link of the plurality of links of the PCI-E interface; anda link aggregator operable to distribute and monitor memory access requests to the second graphics processing unit through an aggregated data transfer link comprising an unoccupied portion of the plurality of links aggregated with the first link of the plurality of links, and coupling the second graphics processing unit and the PCI-E interface, the link aggregator being operable to increase bandwidth for communication between the second graphics processing unit and the system memory,wherein, the central processing unit is configured to send a plurality of instructions to at least one of: the first graphics processing unit and the second graphics processing unit,further wherein the plurality of instructions from the central processing unit are sent to the second graphics processing unit solely through the first link of the aggregated data transfer link.
  • 23. The apparatus according to claim 22, wherein a link of the plurality of links is coupled to a network interface card.
  • 24. The apparatus according to claim 22, wherein the link aggregator is disposed on the motherboard between the PCI-E interface and the second graphical processing unit.
  • 25. The apparatus according to claim 22, wherein the second graphical processing unit is capable of higher performance than the first graphical processing unit.
  • 26. The system according to claim 1, wherein the processor is configured to send graphics rendering instructions to the peripheral device through the data transfer interface.
  • 27. The apparatus according to claim 22, wherein the motherboard further comprises a processor operable to send graphics rendering instructions to the second graphics processing unit through the PCI-E interface.
US Referenced Citations (148)
Number Name Date Kind
3940740 Coontz Feb 1976 A
4541075 Dill et al. Sep 1985 A
4773044 Sfarti et al. Sep 1988 A
4885703 Deering Dec 1989 A
4951220 Ramacher et al. Aug 1990 A
4985988 Littlebury Jan 1991 A
5036473 Butts et al. Jul 1991 A
5125011 Fung Jun 1992 A
5276893 Savaria Jan 1994 A
5379405 Ostrowski Jan 1995 A
5392437 Matter et al. Feb 1995 A
5448496 Butts et al. Sep 1995 A
5455536 Kono et al. Oct 1995 A
5513144 O'Toole Apr 1996 A
5513354 Dwork et al. Apr 1996 A
5578976 Yao Nov 1996 A
5630171 Chejlava, Jr. et al. May 1997 A
5634107 Yumoto et al. May 1997 A
5638946 Zavracky Jun 1997 A
5671376 Bucher et al. Sep 1997 A
5694143 Fielder et al. Dec 1997 A
5705938 Kean Jan 1998 A
5766979 Budnaitis Jun 1998 A
5768178 McLaury Jun 1998 A
5805833 Verdun Sep 1998 A
5884053 Clouser et al. Mar 1999 A
5896391 Solheim et al. Apr 1999 A
5909595 Rosenthal et al. Jun 1999 A
5913218 Carney et al. Jun 1999 A
5937173 Olarig et al. Aug 1999 A
5956252 Lau et al. Sep 1999 A
5996996 Brunelle Dec 1999 A
5999990 Sharrit et al. Dec 1999 A
6003083 Davies et al. Dec 1999 A
6003100 Lee Dec 1999 A
6049870 Greaves Apr 2000 A
6065131 Andrews et al. May 2000 A
6067262 Irrinki et al. May 2000 A
6069540 Berenz et al. May 2000 A
6072686 Yarbrough Jun 2000 A
6085269 Chan et al. Jul 2000 A
6094116 Tai et al. Jul 2000 A
6219628 Kodosky et al. Apr 2001 B1
6249288 Campbell Jun 2001 B1
6255849 Mohan Jul 2001 B1
6307169 Sun et al. Oct 2001 B1
6323699 Quiet Nov 2001 B1
6348811 Haycock et al. Feb 2002 B1
6363285 Wey Mar 2002 B1
6363295 Akram et al. Mar 2002 B1
6366968 Hunsaker Apr 2002 B1
6370603 Silverman et al. Apr 2002 B1
6377898 Steffan et al. Apr 2002 B1
6388590 Ng May 2002 B1
6389585 Masleid et al. May 2002 B1
6392431 Jones May 2002 B1
6429288 Esswein et al. Aug 2002 B1
6429747 Franck et al. Aug 2002 B2
6433657 Chen Aug 2002 B1
6437657 Jones Aug 2002 B1
6486425 Seki Nov 2002 B2
6504841 Larson et al. Jan 2003 B1
6530045 Cooper et al. Mar 2003 B1
6535986 Rosno et al. Mar 2003 B1
6598194 Madge et al. Jul 2003 B1
6629181 Alappat et al. Sep 2003 B1
6662133 Engel et al. Dec 2003 B2
6700581 Baldwin et al. Mar 2004 B2
6701466 Fiedler Mar 2004 B1
6717474 Chen et al. Apr 2004 B2
6718496 Fukuhisa et al. Apr 2004 B1
6734770 Aigner et al. May 2004 B2
6738856 Milley et al. May 2004 B1
6741258 Peck, Jr. et al. May 2004 B1
6747483 To et al. Jun 2004 B2
6782587 Reilly Aug 2004 B2
6788101 Rahman Sep 2004 B1
6794101 Liu et al. Sep 2004 B2
6806788 Marumoto Oct 2004 B1
6823283 Steger et al. Nov 2004 B2
6825847 Molnar et al. Nov 2004 B1
6849924 Allison et al. Feb 2005 B2
6850133 Ma Feb 2005 B2
6879207 Nickolls Apr 2005 B1
6938176 Alben et al. Aug 2005 B1
6956579 Diard et al. Oct 2005 B1
6982718 Kilgard et al. Jan 2006 B2
7020598 Jacobson Mar 2006 B1
7058738 Stufflebeam, Jr. Jun 2006 B2
7069369 Chou et al. Jun 2006 B2
7069458 Sardi et al. Jun 2006 B1
7075542 Leather Jul 2006 B1
7075797 Leonard et al. Jul 2006 B1
7085824 Forth et al. Aug 2006 B2
7099969 McAfee et al. Aug 2006 B2
7136953 Bisson et al. Nov 2006 B1
7170315 Bakker et al. Jan 2007 B2
7174407 Hou et al. Feb 2007 B2
7174411 Ngai Feb 2007 B1
7185135 Briggs et al. Feb 2007 B1
7187383 Kent Mar 2007 B2
7225287 Wooten May 2007 B2
7246274 Kizer et al. Jul 2007 B2
7260007 Jain et al. Aug 2007 B2
RE39898 Nally et al. Oct 2007 E
7293125 McAfee et al. Nov 2007 B2
7293127 Caruk Nov 2007 B2
7305571 Cranford, Jr. et al. Dec 2007 B2
7324458 Schoenborn et al. Jan 2008 B2
7340541 Castro et al. Mar 2008 B2
7363417 Ngai Apr 2008 B1
7383412 Diard Jun 2008 B1
7398336 Feng et al. Jul 2008 B2
7412554 Danilak Aug 2008 B2
7415551 Pescatore Aug 2008 B2
7424564 Mehta et al. Sep 2008 B2
7469311 Tsu et al. Dec 2008 B1
7478187 Knepper et al. Jan 2009 B2
7480757 Atherton et al. Jan 2009 B2
7480808 Caruk et al. Jan 2009 B2
7496742 Khatri et al. Feb 2009 B2
7500041 Danilak Mar 2009 B2
7525986 Lee et al. Apr 2009 B2
7536490 Mao May 2009 B2
7539801 Xie et al. May 2009 B2
7562174 Danilak Jul 2009 B2
7594061 Shen et al. Sep 2009 B2
7600112 Khatri et al. Oct 2009 B2
7617348 Danilak Nov 2009 B2
7631128 Sgrosso et al. Dec 2009 B1
7663633 Diamond et al. Feb 2010 B1
7705850 Tsu Apr 2010 B1
7756123 Huang et al. Jul 2010 B1
7777748 Bakalash et al. Aug 2010 B2
7782325 Gonzalez et al. Aug 2010 B2
7788439 Tsu et al. Aug 2010 B1
7793029 Parson et al. Sep 2010 B1
7793030 Jenkins et al. Sep 2010 B2
7849235 Ihara et al. Dec 2010 B2
8132015 Wyatt Mar 2012 B1
8532098 Reed et al. Sep 2013 B2
8687639 Kumar Apr 2014 B2
20070011383 Berke et al. Jan 2007 A1
20080072098 Hunsaker et al. Mar 2008 A1
20090006708 Lim Jan 2009 A1
20090086747 Naven et al. Apr 2009 A1
20090254692 Feehrer Oct 2009 A1
20100309918 Kumar Dec 2010 A1
Non-Patent Literature Citations (2)
Entry
Dictionary.com. definition of “monitor.” Viewed Jun. 15, 2011.
PCI-SIG. PCI Express Base Specification. Revision 1.1. Mar. 28, 2005.
Related Publications (1)
Number Date Country
20110145468 A1 Jun 2011 US