1. Field of the Invention
This invention relates to data processing systems. More particularly, this invention relates to performance level setting in data processing systems capable of operating at a plurality of different performance levels.
2. Description of the Prior Art
It is known to provide data processing systems capable of operating at a plurality of different performance levels. A data processing system can typically switch between different processor performance levels at run-time. Lower performance levels are selected when running light workloads to save energy (power consumption) whereas higher performance levels are selected for more processing-intensive workloads. Typically, on a processor implemented in complimentary metal-oxide semi conductor (CMOS) technology, lower performance levels imply lower frequency and operating voltage settings.
However, if a workload spends most of its run-time running at close-to-peak performance levels there are likely to be only minor energy savings from switching to lower performance levels as a result of theoretical performance limits in computer scheduling theory (e.g. Amdahl's law). As a result of the difficulty of providing accurate performance prediction, situations are likely to occur whereby task deadlines are missed as a result of mispredictions. This is in turn detrimental to the processing performance of the data processing system and thus the quality of service experienced by the user. Thus there is a requirement to balance the energy savings achieved by reducing the operating frequency and voltage of a processor (according to current processing requirements) against the negative impact resulting from mispredictions that reduce the quality of service.
Viewed from one aspect the present invention provides a method of setting a processor performance level of a data processing apparatus, said method comprising:
selectively varying a processor performance level by selecting said processor performance level from a plurality of possible performance levels of a performance range having at least one performance-range limit;
dynamically varying said performance range by recalculating said at least one performance-range limit in dependence upon a quality of service value for a processing task.
The invention recognises that the likelihood of mispredictions of performance levels can be reduced by recalculating at least one performance-range limit in dependence upon a quality of service value for a processing task and by dynamically varying the performance range from which the current processor performance level can be selected. The recalculation of the performance-range limit enables the adaptive increase or reduction of the performance range in which the performance level needs to be predicted. The recalculation of the performance-range limit reduces the likelihood of mispredictions occurring and enables more efficient balancing of the energy savings acquired by reducing the performance level (i.e. processor frequency and voltage) against the negative impact on performance resulting from performance level mispredictions.
In one embodiment, the quality of service value depends on at least one task-specific value characteristic to the processing task. This allows the particular performance requirements of individual processing tasks to be taken into account in limiting the performance range.
In one embodiment, the task-specific value is a task deadline corresponding to a time interval within which the task should have been completed by the data processing apparatus. Task deadlines provide a convenient way of quantitatively assessing the quality of service, since if a given processing task does not meet its task deadline then there are likely to be implications for the quality of service such as delays in the supply of data generated by the given processing task and supplied as input to related processing tasks.
In one embodiment of this type, the task deadline is associated with an interactive task and corresponds to a smallest one of: (i) a task period; and (ii) a value specifying an acceptable response time for a user. This provides a convenient quality of service measure for applications where the response time of the data processing system to interactions with the user has an impact on the perceived quality of service.
In one embodiment, the performance-range limit is calculated in dependence upon a plurality of the task-specific values corresponding to a respective plurality of scheduled processing tasks. This allows the performance range limit to be set according to a plurality of concurrently scheduled processing tasks such that an overall quality of service is ensured, yet also takes account of individual requirements of individual processing tasks, which can vary widely in their quality of service requirements.
In one embodiment, the quality of service depends upon a task tolerance level giving an acceptable level of deviation from the task deadline for the processing task. This provides more flexibility in defining an acceptable performance range and enables a range of tolerances to be specified according to the particular processing task. In one such embodiment the task tolerance level corresponds to a time window containing the task deadline. This provides a convenient way of implementing an acceptable error margin in the tolerance level.
In one embodiment, the tolerance level corresponds to a probability measure associated with the task deadline. In one particular embodiment of this type the probability measure is one of a probability of hitting the task deadline and a probability of missing the task deadline. This enables mathematical models of the probability measure to be formulated and applied to make predictions about likelihoods of meeting and missing task deadlines. Thus the performance-range limit(s) are more accurately determined since run-time parameters of the data processing system are taken into account in estimating the probability measure. The probability measure can be assessed in dependence upon various system parameters such as the current processor frequency and the number of currently active tasks in the data processing system as well as task-specific parameters such as task deadlines and scheduler priority parameters for individual tasks. Other system parameters that can also be used to assess the probability measure are buffer size, and/or how full the buffer is, along with the rate at which the buffer is being drained. A buffer's parameters are particularly relevant as if a task has some real-time deadlines and it has already produced and stored in a buffer enough of the things it needs to (for example decoded music from a music stream), then if something were to go wrong, the data in the buffer can buy some recovery time without impacting on the real-time deadlines.
In one embodiment, the probability measure is calculated in dependence upon a state of an operating system of the data processing apparatus. It will be appreciated that the state of an operating system can be characterised by one or more of a number of different variables that reflect the current processing capability and workload of the data processing system, which in turn affects the quality of service for processing tasks.
In one embodiment, the probability measure is calculated in dependence upon at least one of:
a processor workload for a processing task;
a processor share allocated to the processing task;
a task switching period; and
a total number of scheduled tasks.
The values of these parameters are readily determined by the data processing system and provide an accurate reflection of processing conditions likely to affect the quality of service perceived by the user.
In one embodiment, the probability measure is calculated from a Poisson probability distribution model. A Poisson distribution is well understood mathematically and can be readily applied to model data processing tasks in a data processing system, since it represents a probability distribution that characterises discrete events occurring independently of one another in time.
In one embodiment, the performance limit is recalculated each time the processor performs a task scheduling operation. This ensures that the recalculated performance-range limit is accurate since it should then take account of all of the currently scheduled processing tasks.
It will be appreciated that the performance limit could correspond to any one of a number of different performance criteria of a processor such as a voltage and operating temperature. However, in one embodiment the performance limit corresponds to at least one of an upper limit and a lower limit for an operational frequency of the processor. For processors implemented in CMOS, a lower performance level implies lower frequency and operating voltage settings.
According to a second aspect the present invention provides a computer program product provided on a computer-readable medium, said computer program product comprising:
code for selectively setting a processor performance level by selecting said processor performance level from a plurality of possible performance levels of a performance range having at least one performance-range limit;
code operable to dynamically vary said at least one performance range by recalculating said at least one performance-range limit in dependence upon a quality of service value for a processing task.
According to a third aspect the present invention provides a data processing apparatus comprising:
logic for selectively varying a processor performance level by selecting said processor performance level from a plurality of possible performance levels of a performance range having at least one performance-range limit;
logic for dynamically varying said performance range by recalculating said at least one performance-range limit in dependence upon a quality of service value for a processing task.
The above, and other objects, features and advantages of this invention will be apparent from the following detailed description of illustrative embodiments which is to be read in connection with the accompanying drawings.
The operating system kernel 120 is the core that provides basic services for other parts of the operating system 110. The kernel can be contrasted with the shell (not shown) which is the outermost part of the operating system that interacts with user commands. The code of the kernel is executed with complete access privileges for physical resources, such as memory, on its host system. The services of the operating system kernel 120 are requested by other parts of the system or by an application program through a set of program interfaces known as a system core. The scheduler 122 determines which programs share the kernel's processing time and in what order. The supervisor 124 within the kernel 120 provides access to the processor by each process at the scheduled time.
The user processes layer 130 monitors processing work performed by the data processing system via system call events and processing task events including task switching, task creation and task exit events and also via application-specific data. The task events module 132 represents processing tasks performed as part of the user processes layer 130.
The intelligent energy management subsystem 150 is responsible for calculating and setting processor performance levels. The policy stack 154 comprises a plurality of performance level setting policies 156, 158 each of which uses a different algorithm to calculate a target performance level according to different characteristics according to different nm-time situations. The policy stack 154 co-ordinates the performance setting policies 156, 158 and takes account of different performance level predictions to select an appropriate performance level for a given processing situation at run-time. In effect the results of the two different performance setting policy modules 156, 158 are collated and analysed to determine a global estimate for a target processor performance level. In this particular embodiment the first performance setting policy module 156 is operable to calculate at least one of a maximum processor frequency and a minimum processor frequency in dependence upon a quality of service value for a processing task. The IEM subsystem 150 is operable to dynamically vary the performance range of the processor in dependence upon at least one of these performance limits (i.e. maximum and minimum frequencies). In the embodiment of
The operating system 110 supplies to the IEM kernel 152, information with regard to operating system events such as task switching and the number of active tasks in the system at a given moment. The IEM kernel 152 in turn supplies the task information and the operating system parameters to each of the performance setting policy modules 156, 158. The performance setting policy modules 156, 158 use the information received from the IEM kernel in order to calculate appropriate processor performance levels in accordance with the respective algorithm. Each of the performance setting policy modules 156, 158 supplies to the IEM kernel a calculated target performance level and the IEM kernel manages appropriate selection of a global target processor performance level. The performance level of the processor is selected from a plurality of different possible performance levels. However, according to the present technique, the range of possible performance levels that can be selected by the IEM kernel is varied dynamically in dependence upon run-time information about the required quality of service for different processing tasks. The frequency and voltage scaling hardware 160 supplies information to the IEM kernel with regard to the currently set operating frequency and voltage whereas the IEM kernel supplies the frequency and voltage scaling hardware with information regarding the required target frequency, which is dependent upon the current processing requirements. When the processor frequency is reduced, the voltage may be scaled down in order to achieve energy savings. For processors implemented in complimentary metal-oxide semiconductor (CMOS) technology, the energy used for a given work load is proportional to voltage squared.
In
When the processor is running at full capacity many processing tasks will be completed in advance of their deadlines and in this case, the processor is likely to be idle until the next scheduled task is begun. A larger idle time between the completion of execution of a task and the beginning of the next scheduled event corresponds to a less efficient system, since the processor is running at a higher frequency than necessary to meet performance targets. An example of a task deadline for a task that produces data is the point at which the generated data is required for use by another task. The deadline for an interactive task would be the perception threshold of the user (e.g. 50-100 milliseconds). A convenient quality of service measure for interactive tasks involves defining the task deadline to be the smaller of the task period and a value specifying an acceptable response time for a user. Thus for those processing tasks for which the response time is important in terms of a perceived quality of service, the task deadline can be appropriately set to a value smaller than the task period.
Going at full performance and then idling is less energy-efficient than completing the task more slowly so that the deadline is met more exactly. However, decreasing the CPU frequency below a certain value can lead to a decrease in the “quality of service” for processing applications. One possible way of measuring quality of service for a particular processing task is to monitor the percentage of task deadlines that were met during a number of execution episodes. For periodic applications or tasks having short periods, the task deadline is typically the start of the next execution episode. For a periodic applications or periodic applications with long periods, the task deadline depends on whether the application is interactive (shorter deadline) or performs batch processing (can take longer).
In the case of
Consider the case where the first performance setting policy 156 of the IEM subsystem 150 of
For the task associated with the probability curve of
Task scheduling events scheduled by the task scheduler 122 of
An equation describing the probability function such as the probability function plotted in
The probability of a given processing task hitting (or missing) its task deadline is calculated by the first performance setting policy module in dependence upon both operating system parameters and task-specific parameters.
For a task i scheduled for execution by the data processing system, the following task-specific parameters are relevant in order to calculate the minimum acceptable processor frequency (fmini) that enables the task deadline to be hit for the individual processing task i:
Ph
The system (global) parameters are:
Assuming that there are n tasks active in at seconds interval and a task switch occurs every τ seconds (τ is of an order of μs or ms), the number of periods a specific task is scheduled in (Nt), follows a Poisson distribution with the following probability mass function:
where λ is the rate or the expected number of task switches for a specific task in a time unit:
ρ is the CPU share allocated by the OS scheduler to a specific task. Note that for equal priority tasks, αi=1∀i=1 . . . n and ρ=1/n. α is a task priority parameter. It is an operating system (or scheduler) specific parameter associated to each task. It is not simple to calculate, whereas ρ can be statistically determined.
If M and N are two independent Poisson distributed random variables with λM and λN rates, the resulting M+N variable is Poisson distributed as well, with a λM+λN rate. This property of the Poisson variables simplifies the case when the number of tasks in the system is not constant in a period T, the resulting variable being Poisson distributed with a
rate.
The Poisson distribution, can be approximated by a normal distribution (the bigger λt, the better the approximation):
where μ=λt, σ2=λt and Fn(x) is the cumulative normal distribution function:
For small values of λt, the approximation can be improved using a 0.5 continuity correction factor.
A random normal variable X is equivalent to a standard normal variable
having the following cumulative distribution function:
where erf(x) is the error function (monotonically increasing):
with the following limits: erf(0)=0, erf(−∞)=−1, erf(∞)=1. The error function and its inverse can be found pre-calculated in various mathematical tables or can be determined using mathematics software packages such as Matlab®, Mathematica® or Octave®.
The approximated cumulative Poisson distribution function becomes:
where λ is the expected number of task switches for task i in a given time; Nt is the random number representing the scheduling events; k is the number of occurrences of the given event; and P(Nt≦k) is the probability that Nt is less than or equal to a given k, that is the probability of missing the deadline, i.e. the probability that the number of times a task was scheduled in is less than “k” the number required to complete its job—C cycles and t is time in seconds.
If the probability of a task i missing the deadline is Pm then it follows that the probability of hitting the deadline, Ph=1−Pm.
If C is the number of cycles that should be executed on behalf of a specific task in a period of time T then the number of times (k) that a task i needs to be scheduled so that the deadline is not missed is:
where τ is the task switching period in seconds and fCPU is the current processor frequency.
The probability of missing the deadline becomes:
In terms of an individual task i, the processor (CPU) workload W for task i is given by:
Since λ=ρ/τ (where λ is the expected number of task switches in a given time; ρ is the CPU share allocated by the OS scheduler 122 to task i; and τ is the task switching period in seconds), the probability Pm of missing the task deadline for an individual task is given by:
From the above equation for Pm, it can be seen that for tasks having the same priority and the same period, those tasks having a higher associated individual CPU workload have a greater likelihood of missing the task deadline. Furthermore, considering tasks having the same CPU workload, those tasks with longer periods (T) have a higher probability of missing the task deadline.
Since the probability of hitting the deadline (Ph=1−Pm) is fixed, the above equations lead to a linear equation in k:
where the inverse probability function zm for the probability of missing the task deadline is given by:
zm=Φ(Pm)=√{square root over (2)}erf1(2Pm−1) (eqn 15)
From the above equations, the CPU frequency for a given probability of missing the deadline is given by:
where C is the number of cycles that should be executed on behalf of a specific task in a period of time T; k is the number of times that a task i needs to be scheduled so that the deadline is not missed; T is a period of time corresponding to the task deadline (typically equal to the task period); ρ is the CPU share allocated by the OS scheduler 122 to task i; τ is the task switching period in seconds; and zm is the inverse probability function for the likelihood of missing the task deadline.
According to the algorithm implemented by the first performance setting policy module 156 of
For simplification, this probability can only take certain predetermined discrete values within a range. Based on these predetermined values, the inverse probability function zm (see eqn 15 above) is calculated and stored in memory (e.g. as a table) by the data processing system of
The first performance setting policy module 156 of
The constants τ (CPU share allocated by the OS scheduler to task i) and ρ (the task switching period in seconds) are statistically determined by the IEM subsystem at run-time.
Note that in alternative embodiments, an upper bound fCPUmax is calculated instead of or in addition to a lower bound. The upper bound fCPUmax is calculated based on task-specific maximum frequencies fmaxi, which are based on a specified upper bound for the required probability of meeting the task deadline associated with that task. The global value fCPUmax represents the smallest of the task-specific maximum frequencies fmaxi and should be larger than fcpumin to avoid increasing the probability of missing the deadline for some tasks. The goal of a good voltage-setting system is to arrive at a relatively stable set of predictions and avoid oscillations. The advantage of introducing an upper bound for the maximum frequency is that it helps the system arrive at a relatively stable set of predictions (avoiding or at least reducing oscillations). Oscillations waste energy, it is desirable to arrive at a correct stable prediction as early as possible.
Referring to the flow chart of
At stage 440, the task loop-index i is incremented and next at stage 450 it is determined whether or not i is less than or equal to the total number of tasks currently running in the system. If i exceeds the number of tasks in the system, then the process proceeds to stage 460 whereupon fCPUmin is fixed at its current value (corresponding to the maximum value for all i Of fmini) until the next task scheduling event and the process ends at stage 470. The policy stack 154 will then be constrained by the first performance setting policy to specifying to the IEM kernel a target processor performance level fCPUtarget that is greater than or equal to fCPUmin.
However, if at stage 450 it is determined that i is less than the total number of tasks currently running in the system then the process proceeds to stage 480 whereupon various task-specific parameters are estimated. In particular, the following task-specific parameters are estimated:
Once the task-specific parameters have been estimated, the process proceeds to stage 490 where the required (i.e. acceptable) probability to meet the deadline for the given task i is read from a database. The required probability will vary according to the type of task involved, for example, interactive applications will have different required probabilities from non-interactive applications. For some tasks, such as time-critical processing operations, the required probability of meeting the task deadline is very close to the maximum probability of one whereas for other tasks it is acceptable to have lower required probabilities since the consequences of missing the task deadline are less severe.
After the required probabilities have been established at stage 490, the process proceeds to stage 492, where the minimum processor frequency for task i (fmini) is calculated based on the corresponding required probability. The process then proceeds to stage 494, where it is determined whether or not the task-specific minimum processor frequency calculated at stage 492 is less than or equal to the current global minimum processor frequency fCPUmin.
If the task-specific minimum processor frequency fmini is greater than fCPUmin, then the process proceeds to stage 496 where fCPUmin, is reset to fmin. The process then returns to stage 440, where i is incremented and fmini is calculated for the next processing task.
On the other hand, if at stage 494 it is determined that fmini is less than or equal to the currently set global minimum frequency fCPUmin, then the process returns to stage 440 where the value of i is incremented and the calculation is performed for the next processing task. After stage 496 the process then returns to increment the current task at stage 440.
Although the described example embodiments use the probabilities that processing tasks will meet their task deadlines as a metric for the quality of service of the data processing system, alternative embodiments use different quality of service metrics. For example, in alternative embodiments the quality of service can be assessed by keeping track of the length, task deadline and speed for each execution episode for each processing task to establish the distribution of episode lengths. By speed “required speed” that would have been correct for an on-time execution of an episode is meant. After having executed an episode one can look back and figure out what the correct speed would have been in the first place. is then used to determine the minimum episode length and speed that is likely to save useful amounts of energy. If a performance level prediction lies above the performance-limit derived in this way then the processor speed is set in accordance with the prediction. On the other hand, if the prediction lies below the performance-limit then a higher minimum speed (performance-range limit) is set in order to reduce the likelihood of misprediction.
In the particular embodiment described with reference to the flow chart of
Although illustrative embodiments of the invention have been described in detail herein with reference to the accompanying drawings, it is to be understood that the invention is not limited to those precise embodiments, and that various changes and modifications can be effected therein by one skilled in the art without departing from the scope and spirit of the invention as defined by the appended claims.