This invention relates generally to data communications and more particularly to a method and apparatus for processing and rerouting the data packets in a data communication network
Data communication networks are commonly used to transport data between end users. A source device sends a group of data packets or a stream of data to a recipient device through a connection established in the data communication network. Nodes of the data communication network basically process the transport data of the packet streams at various level of granularity, that is, various traffic volume per aggregate and variable degrees of knowledge and effort required to handle the aggregate. For example, some VLAN low-layer aggregates have only local significance and very simple switching tables, while non-aggregated flows with end-to-end IP address pairs can need a huge routing table to find out what the network node should do. Conversely, an ideal network node would automatically reconfigure its resources so that all layers of networking, from highly power-efficient but wholesale optical wavelength switching to power-hungry but very fine-grained worldwide IP routing, receive exactly the required amount of hardware and power consumption, proportional to demand.
At the time being, new layers for next-generation networks should easily be integrated. Moreover, the packet grooming tasks that manage the multiplexing/demultiplexing of add and drop traffic with the transit traffic, at the optimal layer and pipe size, should operate at the highest efficiency by keeping as much traffic as possible in the lowest, most energy-efficient layers. The electronic implementations are nowadays mostly separate by layer. Parallel processors allow processing a huge routing talk by avoiding an overloading when the data load becomes very large.
But, the overall aggregation tasks are responsible for a large amount of the global consumption at the core network nodes. Conventional equipment often handles packets sequentially on specialized cards and shelves, one per layer of network, using power-hungry chip and module interconnections (backplane line drivers, serializer-deserializer, . . . ) To enhance the power efficiency, alternative solutions have to be proposed without compromising the layered organization of the network, however. A multi-layer control plane such as the Generalized Multi-Protocol Label Switching (GMPLS) is a standard integrating such new data plane solutions, across all the interfaces between equipments from different vendors.
It is useful now to develop a solution allowing reducing the energy consumption of the packet processing equipment.
One object of this invention is to improve the ability of the processing into the nodes of a data communication network and provide a reduction of the energy consumption in the packet processing equipment.
This is achieved by a method of processing of data received in a network communication node comprising a plurality of processors aimed at processing said data at a given layer of communication so as to route them towards a given destination, the method being characterized in that it comprises the steps of:
With this manner, the present invention proposes to design a better functional block, making its resources shareable and dynamically configurable on multiple layers. Such solution, among others, allows reducing the energy consumption by adapting the number of active processors in function to the layer of communication to process.
According to an aspect of the present invention, the selection of a given number of processors is performed periodically, the same processor being assigned to the processing of different layer of communication at different time. With this manner, a processor may be assigned according to the global need of the node at any time.
According to a further aspect of the present invention, the received data are associated with information identifying the at least one layer of communication to be processed by the said node. With this manner, the node easily identifies the layer of communication to process.
According to another aspect of the present invention, the received data are processed by several layers of communication; the step of selection comprises the selection of a first group of processors performing a first layer of communication and at least a second group of processors performing at least a second layer of communication. With this manner, the node configures its resources, i.e. the available processor according to the different layers of communication to perform. According to an additional aspect of the present invention, the numbers of processors in the group processing each layer are selected so that the time of data processing by groups of processors being appreciably the same one. With this manner, the number of used processors is optimized.
According to another aspect of the present invention, the processing at a given layer of communication comprises the aggregation of several data packets for sending together them towards the given destination. With this manner, several packets of data are processed together in the node and sent together to the next node.
According to a further aspect of the present invention, the processors are gathered in a plurality of clusters of processors located geographically near in the node, the processors selected for processing a given layer of communication being with priority selected in the same cluster. With this manner, the processing between the processors performing the same layer of communication is optimized.
The present invention also refers to a network communication node for processing data received comprising a plurality of processors aimed at processing said data at a given layer of communication so as to route them towards a given destination, the node being characterized in that it comprises:
The embodiments of the present invention refer to the use of a plurality of processors equipping a node of communication such that the energy consumption power is globally reduced.
In each intermediate node 102-105, the data packets are analyzed. According to the processing applied on the data, the data cross one or several logical layers above the physical layer. Above the optical layers that comprise the topology of fibers and the routes of wavelengths, there are different electronic layers of networking, such as Ethernet, Virtual Local Area Network (VLAN), MPLS, IP. . . . These layers are organized as shown in
In current equipments, the core routers for example, the packet forwarding process is performed independently at each layer, with at least a dedicated processing unit (card) per layer. Typically, an added or dropped packet flow traverses intermediate cards that correspond to the layer crossing jobs. It must be stressed here that the multiple interconnections between cards are a source of energy efficiency loss due to the need for driving copper tines, serializing parallel data, and protecting each data transfer by means of dedicated error correction techniques.
New network layers—corresponding to new packet headers—may be developed for next generations of networks; for example, the current Internet may become just a minor application on top of them. The hardware provides total flexibility, so that an arbitrary assignment of the current or future layers to processors can be made. Following power-saving design principles, many low-frequency low-voltage circuits advantageously replace few high-frequency processors.
According to the present invention, the interface modules, i.e. the line cards, commonly use many universally programmable processors driven by scheduling resources that is assigned to any type of packet a programmable local path through assigned processors.
The
According to the present invention, a certain number of processors are activated according to the layers crossed by the data. To make that, a generic pool of processing resources are shared to perform many different forwarding and cross-layer aggregation tasks: for example, processing at layer 2 and layer 3 use a single common processing platform while maintaining the logical layer separation.
The
The input and output line-cards successively go through the four basic functions:
More the processing is performed in high level of layer, more its contribution to resource consumption is high. The most important consumption of resource is used by the upper networking layers, such as IP routing. The present invention proposes to design a better functional block, making its resources shareable and dynamically configurable on multiple layers.
The
The intermediate nodes 102-105 comprise a plurality of processors that are close together on a packet processing card. In a card, or module, three sets of objects cooperate:
A packet on its way through the module can see a sequence of tasks and walk along a route to different processors. The switch matrix allows collecting data from different packets and put the data in a single packet. Then, this packet passes through the different steps of formatting and is placed in the flow.
At the level of an intermediate node, subsequent tasks can run on the same processor or on different ones that preferably are neighbors. In the simplest cases, designed to be the most frequent ones, the packet sees only one short task, one processor. The dynamic assignments of tasks to processors, and the assignments of packets to a route of pairs (task, processor), are the job of an optimizing task scheduler. The variable paths, or routes, of a packet in the module are a generalization of the well-known packet pipelines. Even if the design seemingly mixes all networking layers on the same chip, the tasks are designed to be logically independent per layer. For example, a task of layer N only touches the packet header for layer N; a task for the transition N+1 to N adds a piece of header for layer N; a task for the transition N to N+1 removes the header belonging to N. The software design neatly respects the layered organization of the network, while the hardware can do anything almost anywhere. This is the reason for the claimed power efficiency of the architecture, which eliminates long transmission lines and their serialization/deserialization between specialized boards.
At the left of the
The
The
The task scheduler determines the number of processors assigned to certain tasks according to several constraints:
The statistics of traffic, for example its burstiness, are continuously monitored. The number of processors available for processing each layer is overprovisioned with a safety margin that avoids buffering. There are a number of buffers on small on-chip memories; the overhead (delay, power consumption) of external RAMs would be too high. Advantageously, the intermediate node contains more processors than memory buffers to the chip. If the traffic demand exceeds an acceptable toad level, compatible with Quality of Service (QoS), and if no sleeping resources are available, any new traffic is discarded and appropriate signaling is sent to the relevant neighbors in the network and to the control plane.
Take an example: at a particular time, the intermediate node illustrated in the
The above numbers of processors assigned to each groups is a picture at a very short time (typically, one millisecond), the next moment, the number of processors for each group may be totally changed, and a given processor is assigned to the processing of another layer. The above example corresponds to a node interface where most of the traffic can be passed on in a lower layer (layer 2—Ethernet) : processing there requires only 10% of the time (and energy) per packet, compared to the higher layer (layer 3). It is the job of the intelligent control plane to manage a good sharing of “lower-layer bypass”. With this manner, the node electronics touches the packets at higher layers only when this is absolutely required, to perform the task of add, drop, access aggregation and de-aggregation. In the example above, the intermediate node is not overloaded since it has 7 processors in idle mode.
Multiplexing gain in shared processors, like the one in shared data flows, can schematically be expressed as follows, at a given time instant, processing resource needs for overall aggregation tasks are so that:
For each layer, the maximum number of processors is specified by the manufacturer for a given traffic at a given time instant. The number of processors of the pool is lower or equal than the sum of maximum numbers of processors specified for each layer for a given traffic at a given time instant. With the architecture disclosed for the invention, the number of processors may be optimized regarding the maximum numbers processing the layers for any time.
An enhanced control plane transmits GMPLS information identifying the type of processing task for each received packet flow. According to a simple mode of embodiment, the GMPLS information may be a binary flags field specifying the layers has to be processed by the intermediate node. Thank to this information, the task scheduler allocates resources for processing the identified layers at the level of intermediate node. With this manner, the task scheduler that shares the processors between the different layers is aware of which layers each data packet has to cross. According to a variant, the control plane may also supply the task scheduler with the required switching tables and packet handling rules. It can be coupled to a global on-line optimization software that maps the network's traffic-demand and physical resources onto power-efficient layered logical topologies. As far as possible, low-layer bypass paths are created and the task scheduler receives updated decision look-up tables from the control plane. Due to the top-down control, the same group of processors can handle packets on different cleanly separated layers.
According to a main aspect of the invention, the unused processors are turned off, so as to further decrease the power consumption. The task manager periodically scans the state of the processor by a traffic detector at each layer/sub-layer which allows the task scheduler to share the tasks between the processors, and switches the unused processor in an idle mode. Typically, the task manager scans the traffic 1000 times per second.
Packets can typically be handled as transit-traffic (input and output in the same layer), add-traffic (higher layer input, then multiplexing into a lower-layer output stream) or drop-traffic (low-layer aggregate is demultiplexed into higher layer subsets). The amount of saved power also depends on the quality of the above-mentioned optimizer software that creates and dimensions the low-layer paths in the first place. Such an optimization is not part of this description, but there are other efforts under way to integrate optical-path and packet-transport optimizers, and even to enhance the GMPLS control plane with relevant message types.
The task scheduler manages the analyzing of the input data and takes decision for the processing. The decisions are triggered by packet events: either incoming packets from a physical interface, or packets moving inside the chip along a “pipelined path”. The task manager performs the following tasks:
To simplify the processing of the task scheduler, the determination of the number of processors to activate is driven by lookup tables. The tower the layer, the simpler the table. For each received packet, the following task will be performed:
Based on the decision taken by the task scheduler, the pool of processor receives and emits specialized messages. In particular, most table entries are fed from the control plane's “Forwarding Information Bases”. Here are the events sent by the task scheduler:
Moreover, time-out events are performed at the level of each processors of the processor core. For example, if a processor or a buffer space is unused for more than a certain time, they are put to steep and the mappings from hash values to those pieces of hardware are deleted.
The
If the incoming of new data packets requires the activation of 8 processors, the task scheduler preferably activates 8 processors of the same cluster, by example: cluster 2,4,6,7 or 10. the cluster 6 will be selected in priority in order to occupy it to the maximum and thus to make it possible to leave the maximum of possibilities for the other clusters.
The
Packets are routed to a processor that handles their layer's forwarding job, with layer-specific program code and lookup data. Depending on configuration, a packet may change layers during processing and may be routed through more than one processor. For example, the packet enters on layer L1 through processor P3, which then hands it over on L2 to P4, and finally is routed to and processed in P1, using another variant of layer code L1. This corresponds to a moderately complex switching operation: de-multiplexing from a large L1 traffic aggregate, followed by re-multiplexing into another large aggregate. The task scheduler defines which processors are allocated to which layers or layer pairs (and eventually to which sub-program on the layer, if there are several branches).
For clarity, the interfaces to the control plane are not shown on
Moreover, old Internet protocols may be downgraded merely to specialized legacy applications on top of them. Equipment can be more compact and power efficient through resource sharing and re-use, while the logic of neatly separated networking layers and minimal interfaces is strictly preserved.
Where a component (e.g. an assembly, device, memory, etc.) is referred to above, unless otherwise indicated, reference to that component (including a reference to a “means”) should be interpreted as a reference to any component which performs the function of the described component (i.e. is functionally equivalent to the described component), including components which are not structurally equivalent to the disclosed structure which performs the function in the illustrated exemplary embodiments of the invention.
Where a step in a method is referred to above, unless otherwise indicated, reference to that step should be interpreted as a reference to any step which achieves the same result as the step (i.e. is functionally equivalent to the described step), including steps which achieve a stated result in different ways from those disclosed in the illustrated exemplary embodiments of the invention. As will be apparent to those skilled in the art in the light of the foregoing disclosure, many alterations and modifications are possible in the practice of this invention.
Number | Date | Country | Kind |
---|---|---|---|
11305165.0 | Feb 2011 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP12/52341 | 2/10/2012 | WO | 00 | 12/13/2013 |