USB schedule prefetcher for low power

Abstract
A circuit for monitoring future Universal Serial Bus (USB) activities is described. Specifically, the circuit may comprise a Direct Memory Access (DMA) engine schedule prefetcher. The DMA engine schedule prefetcher accesses linked list schedule structures in main memory. The structures are checked for future frames where the linked list has USB activity scheduled. A periodic DMA engine subsequently accesses main memory only during frames where USB traffic is scheduled.
Description
FIELD

The present invention pertains to the field of computer system design. More particularly, the present invention relates to a USB schedule prefetcher that allows a processor to enter a power-saving mode during periods when no USB activities are scheduled.


BACKGROUND

A computer system may be equipped with a Universal Serial Bus (USB). USB ports allow USB-enabled devices to connect and communicate with the computer system. Examples of electronic devices that communicate with computer systems through USB ports include digital cameras, keyboards, hard drives, and printers.


A USB host is in charge of the USB bus in a computer system. The USB host is a collection of software and hardware inside the computer system that supports the USB bus. The USB host is typically responsible for identifying devices that are connected to a USB port. The USB host may then load any needed device drivers dynamically. Finally, the USB host may periodically poll each of the attached devices for data communications.




BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is an embodiment of a computer system having a chipset that comprises a USB schedule prefetcher.



FIG. 2 is an embodiment of a circuit for monitoring and storing frames having scheduled USB activities.



FIG. 3
a is an embodiment of a flowchart for a prefetch engine filling a future activity vector.



FIG. 3
b is an embodiment of a flowchart of a periodic direct memory access (DMA) engine accessing memory.



FIG. 4 is another embodiment of a circuit for monitoring and storing frames having scheduled USB activities.




DETAILED DESCRIPTION

In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the invention. However, it will be understood by those skilled in the art that the present invention may be practiced without these specific details. In other instances, well-known methods, procedures, components and circuits have not been described in detail so as not to obscure the present invention.


USB data may be delivered isochronously. Software usually schedules a USB periodic list to communicate data transfer and interrupt information to a USB host controller. Such a periodic linked list schedule structure may be stored in the main memory of a computer system.


The USB host controller may be located in a chipset/controller hub. USB data may be transferred in approximately 125 microsecond-granular frames. However, the granularity of the frames is not so limited. As an exemplary range, frames may have a temporal length of 75 to 175 microseconds. The USB host controller may access the linked list structure directly from main memory. However, continuous access of main memory may require continuous snooping of a processor's cache. As a result, the processor may be prevented from being placed in a low power mode.


A processor is in the “C0” state if the processor is operating at full power. The processor is in the “C1” state if the processor gates some internal clocks. The processor is in the “C2” state if an external device drives a pin to the processor to stop internal clocks. However, in the C2 state, the processor cache may still be snooped. The “C3” state is similar to the “C2” state. In the C3 state, however, the cache may not be snooped. Finally, the processor is in the “C4” state if internal clocks are stopped and the processor voltage level is decreased. The C0, C1, C2, C3, and C4 states may be similar to or equal to the processor states defined by the Advanced Configuration and Power Interface (ACPI) specification.


For one embodiment of the invention, FIG. 1 depicts a computer system having a chipset that comprises a USB schedule prefetcher. The computer system of FIG. 1 comprises a processor 110, a chipset 120, a memory 130, and a USB port 140. Processor 110 is coupled to chipset 120. A chipset typically comprises a plurality of controller hubs, which may be located on a plurality of integrated circuits. As a specific example, a chipset includes a memory controller hub (MCH) for communicating with at least a memory and a processor and an input/output (I/O) controller hub for communicating with input/output devices, such as USB devices. Chipset 120 has a USB schedule prefetcher 125. USB port 140 and memory 130 are coupled to chipset 120. The computer system of FIG. 1 may be compliant with USB 1.0, USB 1.1, or USB 2.0 specifications.


Software of the computer system schedules a USB periodic list. The periodic list instructs a USB host controller when to run interrupt and isochronous transfers to and from a USB port. The periodic list is stored in memory 130. Memory 130 may be Dynamic Random Access Memory (DRAM) or any other commonly used random access memory (RAM). USB data may be transferred from chipset 120 to USB port 140 in approximately 125 microsecond granular frames. For this embodiment of the invention, the schedule prefetcher 125 in chipset 120 checks for frames where the periodic list has activity scheduled. Furthermore, schedule prefetcher 125 tracks the frames where USB activities are scheduled. Memory 130 is subsequently accessed by a periodic DMA engine during frames having scheduled USB activities. The periodic DMA engine will be discussed in more detail in reference to FIG. 2. Thus, during pre-determined periods of USB inactivity, processor 110's cache need not be snooped; allowing processor 110 to be placed in a C3, C4, or other power management state.



FIG. 2 is an embodiment of a circuit for monitoring and storing frames having scheduled USB activities. FIG. 2 comprises a schedule prefetcher 210, a future activity vector 220, a frame index counter 230, a pause logic 240, a periodic DMA engine 250, and a memory bus 260. Schedule prefetcher 210 may be a direct memory access (DMA) engine schedule prefetcher. Schedule prefetcher 210 is coupled to future activity vector 220, frame index counter 230, and memory bus 260. Future activity vector 220 is coupled to pause logic 240. Pause logic 240 is coupled to periodic DMA engine 250. Periodic DMA engine 250 is coupled to memory bus 260. For one embodiment of the invention, schedule prefetcher 210, future activity vector 220, frame index counter 230, pause logic 240, and periodic DMA engine 250 may be part of a chipset.


Schedule prefetcher 210 may access structures in memory via memory bus 260. The memory bus 260 may be coupled to a dynamic random access memory (DRAM). Schedule prefetcher 210 may read the structures and check for scheduled activity in frames, the frames being pointed to by a software generated periodic list with USB scheduled activities. Schedule prefetcher 210 may then mark frames having USB activities scheduled as “active” and frames not having USB activities schedules as “not active.” Schedule prefetcher 210 may store the results in the future activity vector 220. The results may be stored as bits, such as activity bits that represent whether activity for a corresponding frame is scheduled (active) or not scheduled (inactive/not active).


For example, each bit represents a 125 microsecond granular frame. Schedule prefetcher 210, if a frame has USB activity scheduled, sets a corresponding bit in the future activity vector 220. Similarly, schedule prefetcher 210, if the frame has no USB activity scheduled, clears the corresponding bit in future activity vector 220. Future activity vector 220 may be a circular first in first out (FIFO) structure having associated pointers to track storage locations in the FIFO structure.


Frame index counter 230 tracks the frames accessed-by periodic DMA engine 250. Frame index counter 230 may advance, or retard based on the design, the frame index approximately every 125 microseconds. Pause logic 240 reads the frame data from future activity vector 220 as the frame index counter 230 advances the frame index. The activity bit associated with the current frame index is used to determine whether or not to disable/pause the periodic DMA engine 250 for that frame.


Pause logic 240 pauses the data communication to the periodic DMA engine whenever the current frame's activity bit is cleared. As a result, the periodic DMA engine 250 may access memory only during frames where USB traffic is scheduled. By creating long enough idle times on a processor and memory, in the presence of connected, but inactive USB devices, the processor may enter a C3, a C4, or other power management state. As a specific example, if an activity bit within future activity vector 220 is set/cleared to represent no scheduled activity for the current frame by schedule prefetcher 210, then pause logic 240 disables/pauses periodic DMA engine 250. Therefore, periodic DMA engine 250 does not access memory and potentially cause a cache snoop that would disallow processor 110 from entering a low power state.


Schedule prefetcher 210 opportunistically refills future activity vector 220 while the processor and memory are busy anyway. As an example, prefetcher 210 accesses system memory and refills the future activity vector 220 in bursts during times when the system memory is in use anyway, thereby allowing long periods of idle time on the memory and cache interfaces.



FIG. 3
a is an embodiment of a flowchart for a schedule prefetcher filling a future activity vector. The system is powered-up in operation 310. An operating system may schedule interrupt and isochronous transfers for a USB port in operation 320. The USB transfers may be scheduled in approximately 125 microsecond granular frames and stored in main memory. Next, a schedule prefetcher may access and parse the USB periodic linked list schedule structures in main memory in operation 330. The transfer frames are marked as “active” or “inactive” in operation 340, based on whether activity is scheduled in the structures of the linked list. The results are then stored in a future activity vector. If the future activity vector is full then schedule prefetcher is paused in operation 355. However, if the future activity vector is not full, then the frame index counter advances, as normal, and the schedule prefetcher accesses and parses the next frame in memory. Once an opportunistic prefetch occurs in operation 365 or the future activity vector is no longer full, schedule prefetcher accesses and parses the next structure in memory, returning to operation 330.


Turning to FIG. 3b, an embodiment of a flowchart of a periodic direct memory access (DMA) engine accessing memory is shown. Just as in FIG. 3, the system powers up in operation 310, software schedules interrupt and isochronous transfers for a USB port in operation 320, and schedule prefetcher accesses and parses schedule structures in memory. Furthermore, in operation 340, the schedule prefetcher marks frames as “active: or “inactive” based on the parsing of structures in memory.


Once frame index counter is advanced in operation 370, which occurs approximately every 125 microseconds, data is read from the future activity vector by the periodic DMA engine in operation 375. If there is not USB traffic scheduled in the frame, as represented by the future activity vector, the frame index counter is advanced again in a return to operation 370 without the periodic DMA engine accessing memory. In contrast, if activity is scheduled in the frame, as represented by the future activity vector, memory is accessed in operation 385. The ability to not access memory in a frame where no activity is scheduled enables a processor to be placed in a low power mode, because unnecessary memory accesses and cache snoops are not committed.



FIG. 4 depicts another embodiment of a circuit for monitoring and storing frames having scheduled USB activities. This embodiment of the invention comprises schedule prefetcher 410, random access memory 415, future activity vector 420, frame index counter 430, pause logic 440, periodic DMA engine 450, and memory bus 460. Schedule prefetcher 410 is coupled to random access memory 415, memory bus 460, future activity vector 420, and frame index counter 430. Future activity vector 430 is further coupled to pause logic 440. Pause logic 440 is coupled to frame index counter 430 and periodic DMA engine 450.


DMA engine schedule prefetcher 410 may access structures in memory via memory bus 460. Memory bus 460 may be coupled to a DRAM. DMA engine schedule prefetcher 410 may read the structures and check for frames where a software generated periodic list has USB activities scheduled. DMA engine schedule prefetcher 410 may then mark frames having USB activities scheduled as “active” and frames not having USB activities schedules as “not active.” DMA engine schedule prefetcher 410 may store the results in future activity vector 420. DMA engine schedule prefetcher 410 may set a bit in future activity vector 420, if a frame is marked active. Similarly, DMA engine schedule prefetcher 410 may clear a bit in future activity vector 420, if a frame is marked not active. Future activity vector 420 may be a circular first in first out (FIFO) structure having associated pointers to track storage locations in the FIFO structure.


In addition to storing active and inactive bits in future activity vector 420, identified active control structures may be stored in random access memory 415.


The frame index counter 430 tracks the current frame for Periodic DMA Engine 450 and its associated Pause Logic 440. The frame index information is also passed to Schedule Prefetcher 410 in order to update Future Activity FIFO 420 before it is needed. Frame index counter 430 may advance the frame index approximately 125 microseconds at a time. Pause logic 440 reads the frame data from future activity vector 420, as frame index counter 430 advances the frame index. The data is passed from pause logic 440 to periodic DMA engine 450. However, pause logic 440 may pause the data communication to periodic DMA engine 450 whenever the current frame's activity bit is cleared. As a result, periodic DMA engine 450 may access memory during frames where USB traffic is scheduled. However, DMA engine 450 may read active control structures from random access memory 415 rather then re-reading the structures through memory bus 460. By creating long enough idle times on a processor and memory complex in the presence of connected, but inactive USB devices, the processor may enter a C3 a C4, or other power management state.


If DMA engine schedule prefetcher 410 fills future activity vector 420, pause logic 420 may pause schedule prefetcher 355. Alternatively, if future activity vector 420 is not full, DMA engine schedule prefetcher accesses and parses the next frame in memory 415.


In the foregoing specification the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modification and changes may be made thereto without departure from the broader spirit and scope of the invention as set forth in the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than restrictive sense.

Claims
  • 1. A computer system, comprising: a processor capable of entering a plurality of power states; and a chipset coupled to the processor comprising a direct memory access (DMA) schedule prefetcher, the DMA schedule prefetcher for accessing a memory to check for scheduled Universal Serial Bus (USB) activities.
  • 2. The computer system of claim 1, wherein the chipset further comprises a future activity vector, and wherein the schedule prefetcher marks a first USB frame as “active” in the future activity vector, if a structure in memory for the first USB frame comprises future USB activity.
  • 3. The computer system of claim 2, wherein the chipset marks a second USB frame as “inactive” in the future activity vector, if the structure in memory for the second USB frame does not comprise future USB activity.
  • 4. The computer system of claim 3, wherein the chipset further comprises a periodic direct memory access (DMA) engine, the periodic DMA engine avoiding accesses to the memory during the second USB frame based on the marking of “inactive” in the future activity vector for the second frame.
  • 5. The computer system of claim 4, wherein the processor is placed in a low power mode if the memory is not accessed for a predefined period of time.
  • 6. The computer system of claim 5, wherein the predefined period of time is approximately one millisecond.
  • 7. The computer system of claim 5, wherein the low power mode is a C3 power management state.
  • 8. The computer system of claim 5, wherein the low power mode is a C4 power management state.
  • 9. The computer system of claim 1, wherein the memory is a Dynamic Random Access Memory (DRAM).
  • 10. The computer system of claim 1, wherein the computer system is USB 2.0 compliant.
  • 11. An apparatus, comprising: a direct memory access (DMA) engine schedule prefetcher to read data from a first memory, wherein the DMA engine schedule prefetcher marks frames having Universal Serial Bus activities scheduled; and a future activity vector coupled to the DMA engine schedule prefetcher to store frames marked by the DMA engine schedule prefetcher.
  • 12. The apparatus of claim 11, further comprising: a periodic DMA engine coupled to the future activity vector; and a frame index counter coupled to the DMA engine schedule prefetcher and to the periodic DMA engine to increment a frame index.
  • 13. The apparatus of claim 12, further comprising: a pause logic coupled between the future activity and the periodic DMA engine to read the marked frames from the future activity vector.
  • 14. The apparatus of claim 13, wherein the periodic DMA engine reads the marked frames from the pause logic, wherein the pause logic pauses the periodic DMA engine if a frame's activity bit is cleared.
  • 15. The apparatus of claim 11, further comprising: a second memory coupled to the DMA engine schedule prefetcher and the periodic DMA engine to store a copy of USB data read by the DMA engine schedule prefetcher from the first memory.
  • 16. An apparatus comprising: a schedule prefetcher to (1) check whether a current frame in a memory has universal serial bus (USB) activity scheduled and (2) fill a future activity vector, coupled to the DMA schedule prefetcher, with a first value, if the current frame has USB activity scheduled and a second value, if the current frame has no USB activity scheduled; a periodic DMA engine coupled to future activity vector to access the current frame in memory, if the future activity vector is filled with the first value; and pause logic coupled to the future activity vector, the DMA schedule prefetcher, and the periodic DMA engine to (1) pause the periodic DMA engine, if the future activity vector is filled with the second value and (2) pause the DMA schedule prefetcher, if the future activity vector is full.
  • 17. The apparatus of claim 16, wherein the schedule prefetcher is a direct memory access schedule prefetcher.
  • 18. The apparatus of claim 16, wherein filling the future activity vector with the first value comprises setting a first bit within the future activity vector.
  • 19. The apparatus of claim 18, wherein filling the future activity vector with the second value comprises clearing the first bit within the future activity vector.
  • 20. The apparatus of claim 16, wherein the current frame is incremented by a frame index counter.
  • 21. A method, comprising: reading linked list schedule structures from main memory; and checking for frames where the linked list schedule structures have Universal Serial Bus (USB) activity scheduled.
  • 22. The method of claim 21, further comprising: marking frames as “active” if a USB activity is scheduled; and marking frames as “inactive” if a USB activity is not scheduled.
  • 23. The method of claim 22, further comprising: storing the marked frames in a memory vector.
  • 24. The method of claim 23, further comprising: accessing main memory with a periodic Direct Memory Access (DMA) engine during active frames.
  • 25. The method of claim 24, further comprising: pausing the DMA engine if a current frame's activity bit is cleared.
  • 26. The method of claim 24, further comprising: pausing the prefetcher DMA engine if the memory vector is full.
  • 27. The method of claim 23, further comprising: incrementing a frame index counter.