Embodiments of the inventive subject matter generally relate to the field of data processing, or, more specifically, scheduling threads in a multiprocessor computer.
A thread is a unit of software execution on a multiprocessing computer. On such a computer, software programs are executed in units of execution called ‘processes’ that include all the processor registers, code segment and offset registers, data segment and offset registers, stack segment and offset registers, flag registers, instruction pointer registers, program counters, and so on, needed for execution of software programs. For efficiency, ‘processes’ are often organized further as threads, where each thread of a process individually possesses all the attributes needed for execution except that a thread shares memory with all the other threads of its process, thereby reducing the overhead of operating system switches from thread to thread (‘context switches’).
A ready queue contains all the threads of the system that are in the ‘ready’ state, waiting in priority order for dispatching to a processor. Threads are placed in the ready queue when they are first created and from a wait queue upon returns from system calls. When dispatched to a processor, each thread is typically authorized to occupy the processor for no more than a maximum amount of time referred to as a time ‘slice,’ after which the thread is said to be ‘preempted’ for return to the ready queue until other threads have a chance to run on the processor. Threads are also typically placed on the ready queue when they are preempted while running on a processor; that is, when a higher priority thread arrives in the ready queue or when a thread's time slice expires.
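For further explanation, the following is a minimal sketch, in Python, of a ready queue that holds threads in priority order and returns a preempted thread to the queue. The names `Thread`, `ReadyQueue`, `enqueue`, `dispatch`, and `preempt` are hypothetical, and the sketch assumes that a lower numeric value denotes a more favored priority and that threads of equal priority are served in arrival order; neither assumption is taken from this specification.

```python
import heapq
import itertools
from dataclasses import dataclass, field


@dataclass(order=True)
class Thread:
    priority: int                    # assumption: lower value = more favored
    seq: int                         # arrival order, breaks ties within a priority
    name: str = field(compare=False)


class ReadyQueue:
    """Hypothetical ready queue: threads wait in priority order for dispatch."""

    def __init__(self):
        self._heap = []
        self._arrivals = itertools.count()

    def enqueue(self, name, priority):
        # Used for newly created threads and for threads returning from a wait queue.
        heapq.heappush(self._heap, Thread(priority, next(self._arrivals), name))

    def dispatch(self):
        # Hand the most favored ready thread to a processor, or None if empty.
        return heapq.heappop(self._heap) if self._heap else None

    def preempt(self, thread):
        # A preempted thread re-enters the queue behind equal-priority peers.
        self.enqueue(thread.name, thread.priority)
```

In this sketch, a thread whose time slice expires is passed to `preempt`, which places it behind any waiting threads of the same priority so that those threads have a chance to run.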
Threads that are in the ‘wait’ state are maintained in a wait queue. Threads in the wait state are often waiting for input/output returns from peripheral devices such as user input devices, display devices, communications adapters, memory, and others as will occur to those of skill in the art. Threads running on a processor are moved to the wait queue and to the ‘wait’ state when they issue system calls. Such system calls are often requests for data input from or output to peripheral devices.
An interrupt is a mechanism by which a computer subsystem or module external to a processor may interrupt the otherwise normal flow of operations on the processor. In particular, in interrupt-driven input/output processing, interrupts are provided so that a thread sending or receiving data to or from a peripheral device need not block and wait. Instead, the thread issues a system call and suspends operation while waiting on the wait queue for its data. When the peripheral device has the data ready, the peripheral device triggers an interrupt by signaling the processor, usually by way of a system bus. The processor ‘catches’ the interrupt, saves the running thread's operating context, and then hands control over to an interrupt handler that ‘clears’ the interrupt by processing it. The interrupted thread's saved operating context is at least all information needed to resume thread processing at the point at which it was interrupted, that is, at least the processor status registers and the location of the next instruction to be executed in the interrupted thread, in addition to whatever other information is needed by the particular operating system.
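For further explanation, the following is a minimal sketch, in Python, of catching an interrupt: the running thread's context is saved, control passes to a handler, and the context is restored afterward. The names `Context`, `Cpu`, and `catch_interrupt` are hypothetical, and the saved context is reduced to a single status register and the location of the next instruction; a particular operating system may save considerably more.

```python
from dataclasses import dataclass


@dataclass
class Context:
    status_register: int
    next_instruction: int


class Cpu:
    """Hypothetical processor model with one status register and a program counter."""

    def __init__(self):
        self.status_register = 0
        self.program_counter = 0
        self.saved = []          # stack of saved operating contexts

    def catch_interrupt(self, handler):
        # Save the minimum needed to resume the interrupted thread.
        self.saved.append(Context(self.status_register, self.program_counter))
        # Hand control to the interrupt handler, which may overwrite the registers.
        handler(self)
        # Restore the interrupted thread's context so it resumes where it left off.
        ctx = self.saved.pop()
        self.status_register = ctx.status_register
        self.program_counter = ctx.next_instruction
```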
Modern interrupt handlers are typically split into two parts, a first level interrupt handler (“FLIH”) and a second level interrupt handler (“SLIH”). The first level interrupt handler discovers the cause of the interrupt. The first level interrupt handler typically does not, however, process the interrupt. The first level interrupt handler instead typically calls a second level interrupt handler to process the interrupt. The second level interrupt handler is often associated with the particular device which generated the interrupt. After being called by the first level interrupt handler, the second level interrupt handler sits in the ready queue until processor time becomes available to process the interrupt.
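For further explanation, the following is a minimal sketch, in Python, of the split between a first level and a second level interrupt handler. The device names, the `SLIH_TABLE` lookup, and the functions `first_level_handler` and `run_ready` are hypothetical, and the ready queue is simplified to a first-in, first-out queue that holds only second level handlers.

```python
from collections import deque


def disk_slih(log):
    # Hypothetical second level handler for a disk device.
    log.append("disk interrupt processed")


def nic_slih(log):
    # Hypothetical second level handler for a network adapter.
    log.append("network interrupt processed")


# Hypothetical mapping from interrupt source to its second level handler.
SLIH_TABLE = {"disk": disk_slih, "nic": nic_slih}


def first_level_handler(interrupt_source, ready_queue):
    """Discover the cause of the interrupt and queue the matching SLIH.

    The FLIH does not process the interrupt itself.
    """
    ready_queue.append(SLIH_TABLE[interrupt_source])


def run_ready(ready_queue, log):
    # Processor time becomes available: run the queued handlers in order.
    while ready_queue:
        ready_queue.popleft()(log)
```

In this sketch, no interrupt is processed until `run_ready` is called, which models the second level handler sitting in the ready queue until processor time becomes available.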
Second level interrupt handlers may be assigned a lower priority than a thread currently running on the processor and therefore may not have an opportunity to run for a relatively long period of time. In such situations, the second level interrupt handler often waits in the ready queue for some time before gaining access to the processor to process the interrupt. When processing generates many interrupts, the delay in processing those interrupts caused by the second level interrupt handler waiting in the ready queue diminishes efficiency.
Prior art solutions included binding interrupt processing exclusively to a single processor or to a subset of the processors on a system and refraining from assigning threads to processors reserved for interrupt processing. Such an approach is relatively static, however, leaving interrupt processing on a subset of processors and thread processing on a subset of processors even when other processors would otherwise be available to spread occasional large loads of thread processing or interrupt processing. There is therefore an ongoing need for improvement in scheduling threads in a multiprocessor computer system.
A computer program product for scheduling threads in a multiprocessor computer comprises computer program instructions configured to select a thread in a ready queue to be dispatched to a processor and determine whether an interrupt mask flag is set in a thread control block associated with the thread. If the interrupt mask flag is set in the thread control block associated with the thread, the computer program instructions are configured to select a processor, set a current processor priority register of the selected processor to least favored, and dispatch the thread from the ready queue to the selected processor.
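For further explanation, the following is a minimal sketch, in Python, of the dispatching steps summarized above. The class and function names, the constants `LEAST_FAVORED` and `DEFAULT_CPPR`, and the policy of selecting the first idle processor are all hypothetical. The sketch also assumes that a thread whose interrupt mask flag is not set is dispatched without changing the current processor priority register; the summary above does not address that case.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption: high interrupt priorities are represented by low values, so the
# highest available interrupt priority (least favored for interrupts) is 0.
LEAST_FAVORED = 0x00
DEFAULT_CPPR = 0xFF      # hypothetical value for a processor that accepts interrupts


@dataclass
class ThreadControlBlock:
    thread_id: int
    interrupt_mask_flag: bool = False


@dataclass
class Processor:
    cpu_id: int
    cppr: int = DEFAULT_CPPR                      # current processor priority register
    running: Optional[ThreadControlBlock] = None


def select_processor(processors):
    # Placeholder policy (an assumption): choose the first idle processor.
    for cpu in processors:
        if cpu.running is None:
            return cpu
    return None


def dispatch(ready_queue, processors):
    """Dispatch the thread at the head of the ready queue, if possible."""
    if not ready_queue:
        return None
    tcb = ready_queue[0]                          # select a thread to dispatch
    cpu = select_processor(processors)
    if cpu is None:
        return None                               # thread stays in the ready queue
    if tcb.interrupt_mask_flag:
        cpu.cppr = LEAST_FAVORED                  # mask interrupts on this processor
    ready_queue.pop(0)
    cpu.running = tcb
    return cpu
```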
The present embodiments may be better understood, and numerous objects, features, and advantages made apparent to those skilled in the art by referencing the accompanying drawings.
The description that follows includes exemplary systems, methods, techniques, instruction sequences and computer program products that embody techniques of the present inventive subject matter. However, it is understood that the described embodiments may be practiced without these specific details. In other instances, well-known instruction instances, protocols, structures and techniques have not been shown in detail in order not to obfuscate the description.
The present inventive subject matter is described to a large extent in this specification in terms of methods for scheduling threads in a multiprocessor computer. Persons skilled in the art, however, will recognize that any computer system that includes suitable programming means for operating in accordance with the disclosed methods also falls well within the scope of the present inventive subject matter. Suitable programming means include any means for directing a computer system to execute the steps of the method of the inventive subject matter, including for example, systems comprised of processing units and arithmetic-logic circuits coupled to computer memory, which systems have the capability of storing in computer memory, which computer memory includes electronic circuits configured to store data and program instructions, programmed steps of the method of the inventive subject matter for execution by a processing unit.
The inventive subject matter also may be embodied in a computer program product, such as a diskette or other recording medium, for use with any suitable data processing system. Embodiments of a computer program product may be implemented by use of any recording medium for machine-readable information, including magnetic media, optical media, or other suitable media. Persons skilled in the art will immediately recognize that any computer system having suitable programming means will be capable of executing the inventive subject matter as embodied in a program product. Persons skilled in the art will recognize immediately that, although most of the exemplary embodiments described in this specification are oriented to software installed and executing on computer hardware, nevertheless, alternative embodiments implemented as firmware or as hardware are well within the scope of the present inventive subject matter.
Scheduling Threads in a Multiprocessor Computer
Exemplary methods, systems, and computer program products for scheduling threads in a multiprocessor computer system according to embodiments of the present inventive subject matter are described with reference to the accompanying drawings, beginning with
The exemplary computer (134) of
The processors (156) of
The computer of
Also stored in RAM (168) is an operating system (154), which in turn includes a dispatcher (102) and an interrupt handler (118). Operating systems useful in computers according to embodiments of the present inventive subject matter include Unix™, Linux™, Microsoft NT™, and many others as will occur to those of skill in the art. Interrupt handler (118) is a software function in the operating system that processes interrupts. Although
The exemplary dispatcher (102) of
The term ‘least favored’ in this specification means least favored for interrupts. Setting a current processor priority register to least favored is often accomplished by storing the value of the highest available interrupt priority in the current processor priority register. There is currently no convention as to whether higher interrupt priorities are represented by high or low values. In some systems, high interrupt priorities are represented by low values, while in other systems, the high interrupt priorities are represented by high values. Any value system defining priorities for interrupts is well within the scope of the present inventive subject matter.
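For further explanation, the following is a minimal sketch, in Python, of how the ‘least favored’ register value depends on the value system used for interrupt priorities. The function names and the encoding of priorities as integers from 0 to `priority_levels - 1` are hypothetical. The sketch further assumes that a processor accepts an interrupt only when the interrupt's priority is strictly higher than the priority stored in the current processor priority register; that acceptance rule is an illustrative assumption and is not stated in this specification.

```python
def least_favored_value(priority_levels, low_is_high):
    """Return the register value for the highest available interrupt priority.

    low_is_high=True models systems where high priorities are low values.
    """
    if priority_levels < 1:
        raise ValueError("priority_levels must be at least 1")
    return 0 if low_is_high else priority_levels - 1


def interrupt_accepted(cppr, irq_priority, low_is_high):
    # Assumed rule: accept only interrupts strictly higher in priority than cppr.
    if low_is_high:
        return irq_priority < cppr
    return irq_priority > cppr
```

Under either value system, a register set to the value returned by `least_favored_value` leaves no interrupt priority that is strictly higher, so under the assumed acceptance rule the processor accepts no interrupts while the thread runs.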
The exemplary computer (134) of
The example computer (134) of
The example computer (134) of
For further explanation,
The exemplary thread control block of
In the example of
After the thread leaves the processor, either because the thread issued a system call, the thread's time slice expired, or otherwise, the method of
In the example of
The method of
The method of
In the example of
If the count (404) of the number of processors (156) having a current processor priority register (203) set to least favored is greater (410) than a threshold value (408), the method of
It will be understood from the foregoing description that modifications and changes may be made in various embodiments of the present inventive subject matter without departing from its true spirit. The descriptions in this specification are for purposes of illustration only and are not to be construed in a limiting sense. The scope of the present inventive subject matter is limited only by the language of the following claims.
This continuation application claims the benefit of U.S. patent application Ser. No. 12/059,461 filed Mar. 31, 2008, which claims benefit of U.S. Pat. No. 7,487,503 filed on Aug. 12, 2004.
Number | Date | Country | |
---|---|---|---|
20120260257 A1 | Oct 2012 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 12059461 | Mar 2008 | US |
Child | 13528645 | US | |
Parent | 10916976 | Aug 2004 | US |
Child | 12059461 | US |