The present invention relates to the field of packet based network and more precisely to packet processing linecards.
Packet processing linecards are used to process the aggregated data transmitted, at different layers of networking, from the Internet Protocol (IP) layer down to the physical layer in the network nodes.
Such process represents a major energy consumption in a packet transportation network and with a rising traffic tends to raise the operational and cooling costs and may also reduce the reliability of the routers.
Besides, the bursty aspect of data transportation corresponding to large fluctuations over different time scales makes the dimensioning of said packet processing linecards difficult and leads to a waste of energy during off-peak traffic.
Thus, it becomes essential to develop a solution allowing to reduce the energy consumption of the packet processing linecards.
One object of the present invention is to overcome the precited drawbacks of the state of the art and offer an alternative to the existing solutions of the state of the art to provide a reduction of the energy consumption in the packet processing linecards of the network nodes.
This is achieved by a method for reducing energy consumption in a packet processing linecard of a packet transmission network, said packet processing linecard comprising a plurality of microprocessors aimed at processing packet traffic wherein the number of active microprocessors is dynamically adjusted as a function of the computation of a traffic estimator based on a recurrent estimation of at least two statistical parameters including the average and a parameter representative of the statistical distribution of the packet traffic.
According to another aspect of the present invention, the parameter representative of the statistical distribution of the packet traffic comprises a statistical moment of the packet traffic.
According to a further aspect of the present invention, the statistical moment of the packet traffic comprises the standard deviation of the packet traffic.
According to an additional aspect of the present invention, the parameter representative of the statistical distribution of the packet traffic comprises a quantile of the packet traffic.
According to another aspect of the present invention, the recurrent estimation is achieved using multiple temporal ranges for the computation of the traffic estimator.
According to an additional aspect of the present invention, an inactive microprocessor is set in a low consumption mode.
According to a further aspect of the present invention, the driving voltage of the active microprocessors is dynamically adjusted in function of the computation of the traffic estimator.
According to another aspect of the present invention, the clock rate of the active microprocessors is dynamically adjusted in function of the computation of the traffic estimator.
According to an additional aspect of the present invention, the determination of the number of active microprocessors takes into account the maximum load of the microprocessors in order to respect a predetermined Quality of Service.
According to another aspect of the present invention, a delay is introduced before deactivating a microprocessor in order to reduce very short periods of inactivity of a microprocessor.
According to a further aspect of the present invention, packet traffic comprises packet flows and said packet flows are sorted in function of their quality of service (QoS) value and, in case of congestion, the most valuable flows are processed in priority by the microprocessors.
According to an additional aspect of the present invention, microprocessors are configured to process specific classes of packets such that the decision of activating or deactivating a microprocessor takes into account the packet class-specific configuration of said microprocessor.
According to another aspect of the present invention, the packet processing linecard is configured such that packets belonging to a common end-to-end flow are treated in the chronological order of the reception.
According to a further aspect of the present invention, the packet processing linecard is configured such that data synchronization is guaranteed by a regulation of the access of common resources.
The present invention also refers to a packet processing linecard wherein it comprises:
According to another aspect of the invention, the number of microprocessors of the packet processing linecard corresponds to the result of a computation using a priori traffic estimation based on at least two statistical parameters including the average and a parameter representative of the statistical distribution of the packet traffic and considering the maximum traffic to be processed by the packet processing linecard (1) and such that the overall energy efficiency remains maximum.
The embodiments of the present invention refer to the use, in packet processing linecards, of a plurality or array of microprocessors having a reduced processing clock rate such that the packet processing load is distributed among the plurality of microprocessors.
Such solution allows to reduce the energy consumption by adapting the number of active microprocessors in function of the traffic and by limiting the power dissipation of the microprocessors (due to their limited processing clock rates) and thus reducing the cooling expenses.
However, to obtain a significant reduction of energy consumption, the determination of the necessary number of microprocessors coupled with the determination, at any time, of the configuration of the array of microprocessors in function of the traffic is essential.
Thus, the embodiments of the present invention refer to the use of a traffic estimator allowing to determine dynamically the number of microprocessors of the array needed to be activated.
Moreover, statistical data concerning the traffic of packet processed in the pipeline 7 are sent as input to a traffic estimator 13. Based on the computation of the predicted traffic by said estimator 13, a microprocessor scheduler 15 controls the microprocessors 9 through task logic tools 17 and switches 19.
Furthermore, according to an embodiment of the present invention, the traffic estimator 13 directly controls the clock rate of the switches 19 and the driving voltage of the microprocessors 9 (represented by the dotted arrows 21 and 23). When all the processes are done, packets are forwarded as output packets 25 outside of the data exchange unit 7 through the packet FIFO equipment 5 to their destination.
It has to be noted that the array of microprocessors 9 represented in
The first step for using an array of microprocessors 9 is to dimension the packet processing linecard 1, that is to say, to determine the number of microprocessors 9 and the capacity of said microprocessors 9 to reach an optimal energy consumption. In the embodiments on the present invention, such dimensioning corresponds to a step of the linecard design and is achieved a priori and statically using traffic estimation and based on models of power consumption described in the literature:
The power consumption of a microprocessor 9 can be defined by
power=k·ν·V2
with k a constant of the microprocessor 9, ν the clock rate or clock frequency and V the driving voltage of the microprocessor 9.
Furthermore, ν and V are linked by the following model relation:
V=h
γ(ν)=a·νγ+b with νmin≦ν≦νmax and Vmin≦V≦Vmax
where γ is a parameter dependent of the microprocessor technology, Vmin and Vmax the respective minimal and maximal driving voltage functioning values for the microprocessor 9, and νmin and νmax the maximal clock rate functioning values corresponding to the respective voltages Vmin and Vmax for the microprocessor 9.
The hν(ν) function of the above equation is represented in
νn=νmax/N
which corresponds to the same processing capacity, the power consumption gain G (normalized bit speed divided by power consumption) is defined by:
G
N
=h
ν
2(νmax/N)/V2max
Such gain is represented in
Thus, based on
Besides, considering an aggregation of a plurality of independent packet flows and considering that the rate stochastic process of each partial aggregate of packet flows follows a Gaussian distribution, the processing pipeline rate (Pr) that can be reached is given by the Guerin's law:
Pr
p
=P·μ+α·√{square root over (P)}·α
with P the number of independent packet flow aggregates, μ and σ being respectively the average and the standard deviation of the Gaussian distribution of the packet flow aggregate rate and a α confidence factor, the value of which depends on the desired probability of not overloading the data exchange unit 7 (pipeline).
It has to be noticed that the Guerin's law allows to take into account the statistical multiplexing effect, that is to say, the fact that the aggregation of a large number of flows allows to attenuate the variations of the overall traffic (a flow peak compensating a flow off).
Thus, the use of a plurality of microprocessors corresponding to a parallelisation of the packet flow process leads to a reduction of the efficiency of the statistical multiplexing. Indeed, as in the case of a parallelisation of a plurality of microprocessors, the number of aggregated flows for a given microprocessor is reduced, the efficiency of the statistical multiplexing is also reduced. Such reduction of efficiency with respect to a single microprocessor is given by the rate loss factor RLF:
with N the number of microprocessors.
The plot of this rate loss factor, or normalized total packet processing speed, in function of the number of microprocessors for different values of the a parameter, corresponding to different desired probabilities of not overloading the data exchange unit 7 or pipeline, is shown in
The above part of the description shows that the replacement of a single microprocessor by an array of a plurality of microprocessors having a comparable processing capacity leads on one hand to the reduction of the power consumption (corresponding to the power consumption gain G) and on the other hand to a loss of the efficiency of the statistical multiplexing (corresponding to the rate loss factor RLF). As a consequence, in order to optimize the overall energy consumption, the goal is to determine the number of microprocessors corresponding to the optimal trade-off between both parameters and leading to a minimization of the energy consumption per packet, taking into account the loss due to statistical multiplexing when filling the data exchange units or pipes. Such trade-off can be defined by the maximization of an energy efficiency gain EEG defined by:
MaxN=(EEGN)=MaxN(Gn×RLFN)
A representation of this energy efficiency gain (normalized packet speed divided by power consumption) in function of the number of microprocessors for an α parameter corresponding to a probability of 10−8 and for different values of the γ parameter is shown in
From the different above representations, one can see that there is an optimal degree of parallelisation (or number of processors) in function of the processing traffic that leads to a maximisation of the energy efficiency, this degree depending on the technological properties of the microprocessors and on the variance of the packet traffic.
Thus, by considering the maximum amount of traffic that a linecard 1 is supposed to process (in practice considering a maximal average and a maximal standard deviation), an optimal dimensioning in terms of number of microprocessors 9 can be defined to minimize the energy consumption of the linecard 1.
The application of such optimization requires, in addition to the microprocessors features, the knowledge of statistical parameters of the flows to be processed which are the average and the standard deviation in the embodiment described previously.
The use of such a method using not only the average but also the standard deviation of the packet flows to be processed allows to take into account the loss of the statistical multiplexing due to the parallelisation in smaller data paths, leading to an improved determination of the optimized number of microprocessors 9.
Besides, the determination of the optimal number of microprocessors 9 in function of the amount of traffic can be used non only for dimensioning the linecard 1 during its conception process but also to dynamically determine the optimal number of microprocessors 9 that needs to be activated while in use, the non-activated microprocessors being set in a low consumption mode or sleep mode. Indeed, in order to further reduce the energy consumption, an idea of the embodiments of the present invention is to deactivate the idle microprocessors of the array when the traffic is reduced with respect to the peak periods in order to save energy. Thus, based on the previous equations, the traffic estimator 13 presented in
The recurrence can be periodical (for example, these statistics may be derived from the 200 latest 10 ms measurements of the instantaneous amount of packet input bytes and could be updated every 100 measurements, thus feeding the microprocessor scheduler 15 with new metrics every second), but the sample intervals can also vary with the time, for example the delay between two estimations can be reduced in case of large variations of the traffic. Furthermore, the choice of time intervals may depend on the nature of the traffic involved, for example, whether it is an aggregation of few or many flows. The size of the packet buffers available in the node may also influence the size of the interval.
In a typical implementation, the packet traffic b1, b2 . . . bn, in the m successive intervals t1-t0, t2-t1 . . . tm-tm-1 is monitored. The instantaneous rates ri=bi/(ti-ti-1) for i=1 . . . m are then computed to produce the statistical parameters:
It has also to be noted that other parameters or metrics representative of the statistical distribution of the packet traffic can be used by the traffic estimator 13. These metrics may refer to statistical moments of second or higher order defined by:
with p the order of the moment,
as well as auto-correlations at different time scales of the above mentioned rate sample sets. Metrics such as quantiles may also be used and a number of metrics higher than two (average and standard deviation) is used if necessary. Moreover, a temporal correlation of the packet traffic measurements may be additionally used, for example by using models that comprise the Hurst parameter, to produce an even improved prediction of the processing load.
Besides, independently of the metrics used for the computation, the values of the optimal number of microprocessors to be activated in function of the amount of traffic can be pre-calculated and stored in additional resource-planning look-up tables 14 attached to the traffic estimation unit 13, and different from the packet-forwarding lookup tables 11, as presented in
An example of additional look-up table is described in
In the same way as for the number of microprocessors 9, the optimal (in an energy efficiency way) clock rate and driving voltage of the active microprocessors can be determined based on the statistical parameters of the packet traffic and can be saved in additional look-up tables such that the traffic estimator 13 or the scheduler 15 controls said clock rate and driving voltage as described in
Thus, the dynamic estimation of the traffic allows to activate the right number of microprocessors to respect a predefined quality of service (QoS) such that the work load for each processor is not too high and not too low.
As described in
As a consequence, the overall consumption of the linecard varies as the steps of a stair in function of the traffic as described in
Besides, the array of microprocessors has to be configured such that the following constraints are respected:
Furthermore, according to an embodiment of the present invention, packet traffic is divided into packet classes and microprocessors are specialized to process a specific class of packet (using specific forwarding/queuing/dropping algorithms).
With such a configuration, the class specification of the microprocessors is taken into account to decide of the activation and the deactivation of the microprocessors.
According to a further embodiment, in case of congestion in the packet linecard, packet flows are sorted according to their quality-of-service (QoS) value, such that the most valuable flows are processed in priority.
Thus, the present invention, by replacing a single microprocessor of a packet linecard 1 by an array of microprocessors 9 wherein the dimensioning as well as the dynamic configuration (activation and deactivation of the microprocessors of the array) of said array are optimized by the use of an accurate traffic estimator 13. Said estimator 13 determining the processing load in function of different statistical parameters representative of the packet traffic while taking into account the reduction of statistical multiplexing due to the use of a plurality of microprocessors. Such traffic analysis allows to dynamically adjust the processing capacities to the amount of traffic and therefore leads to a significant reduction of the overall energy consumption of a packet processing linecard 1.
Number | Date | Country | Kind |
---|---|---|---|
10290171.7 | Mar 2010 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2011/054591 | 3/25/2011 | WO | 00 | 10/11/2012 |