The present invention relates to the telecommunication field and more in particular to the dynamic sharing of networks resources. Still more in particular, the invention concerns a method for deadlock detection.
This application is based on, and claims the benefit of, European Patent Application No. 05 290 643.5 filed on May 23, 2005 which is incorporated by reference herein.
Traffic engineering (TE) tools are used in modern telecommunication networks for calculating connections carrying user traffic, in order to balance the traffic load on the various segments of the connections and on the various network elements (routers, switches), so that none of these segments and network elements are over-utilized or under-utilized. This aim is achieved carrying user traffic over the connection calculated by the TE tool, crossing specific network elements; this connection is usually defined “intended” (or “nominal”) connection.
In this scenario, in case of a failure on a segment of the intended connection, a backup connection is calculated and activated for protecting the traffic over the intended connection, that is the user traffic is rerouted from the intended to the backup connection. The backup connection can be pre-calculated (and this process is usually defined “protection”) or calculated run-time (and it is usually defined “restoration”). In the first case the backup connection is calculated or assigned before the failure occurs and, if calculated, it is stored; it can be fully disjoint from the intended connection, that is intended and backup connections don't have common segments, or can be partially disjoint, that is intended and backup connections have at least one common segment. In case of fully disjoint backup connections, one backup connection is usually pre-calculated, because it can protect the intended connection after a failure of any segment of the intended connection, while in case of partially disjoint backup connections, for each possible failure on a segment is required one (or at least one) pre-calculated backup connection. On the contrary, in restoration the backup connection is calculated run-time, that is after the failure occurs. The advantage of protection compared to restoration is usually a smaller recovery time, while the disadvantage is to require more network resources (for example some segments are assigned in advance and are used only for protection or available segments are required for the disjoint backup connection).
When the failure clears, in traffic engineered networks the traffic is always reverted from the backup to the intended connection, in order to carry the user traffic over the intended connection calculated by the TE tool. Moreover, the reversion is required because in case of a further failure on the backup connection, the calculation of a further backup connection is more complex and this calculation could fail.
In traditional networks the calculation of the backup connection can be performed by a centralized network manager controlling network elements and controlling connections (intended and backup) crossing the network elements; the centralized manager is a software application running on a management station and performing network management functions and it is responsible for configuration of the connections, monitoring alarms and performance of the connections, for protecting the intended connections providing a backup connection in case of a failure on a segment of the intended connection. In order to perform these functions, it stores into a MIB (Management Information Base) the list of identifiers of the intended connections within the controlled domain and for each intended connection the source and destination network elements and the list of the intermediate network elements (or the list of segments, identified by two network elements). In a traffic engineered network including a protection mechanism, the central manager stores for each intended connection the list of the intermediate network elements calculated by the TE tool and in case of a backup connection activated for protecting the intended connection it stores also the list of the intermediate network elements of the backup connection. The centralized manager can be connected directly to the managed network elements or indirectly through channels carrying control data in the same physical network carrying user data or through a different physical network carrying only control data. In the new network architectures, based on the Automatically Switched Optical Network (ASON) defined in ITU-T G.808NY.1304 (11/2001), the backup connection is calculated by a control plane element (CPE), which controls one or more network elements, also defined Transport Plane Elements (TPE), for protecting the intended connections starting from the controlled network element (also defined source network element); in order to perform these functions, each CPE stores the list of identifiers of the intended connections starting from the controlled network element and stores for each intended connection the source and destination network elements, the list of the intermediate network elements (or the list of segments, identified by two network elements) of the intended connection calculated by the TE tool and in case of a backup connection activated for protecting the intended connection it stores also the list of the intermediate network elements of the backup connection. The CPEs are schematically indicated in
Referring to RSVP protocol, the base specification was designed to allow network elements (routers) to decide in advance, that is before the provisioning of the connection, if the network can meet the requirements of a Quality of Service (QoS) defined for the connection. The configuration of a new intended or backup connection is performed transmitting a Path message and receiving, in case of a successful configuration of the connection, a Resv message in the reverse direction of the Path message or receiving a Path_err message in case of an unsuccessful configuration (for example for lack of network resources). Reversion is performed in a similar way as a new intended connection, using Path and Resv messages, but in a ‘make-before-break’ manner, that is the intended connection is setup before the backup connection is torn down. A number of extensions were added to support provisioning and maintenance of explicitly routed connections (defined LSP=label switched paths). Finally, RSVP-TE allows the aggregation of connections, defined LSP tunnels, which share a common route and a common pool of shared network resources, reducing the amount of information carryed in the network. Moreover, when a failure is detected on a segment, it must be notified as fast as possible to all the source CPEs controlling the source network element of the intended connections crossing the failed segment, in order to route the traffic from the intended to a backup connection. A Notify message is defined for reporting of failures and it is described by Internet draft draft-ieff-mpls-generalized-rsvp-te-09.txt (Sept. 2002), issued by IETF and updated periodically; this message includes the address of the CPE controlling the network element which has detected the failure.
In traffic engineered network the concept of dynamic sharing of network resources applies: a network resource (a segment or part of the bandwidth of the segment) can be assigned to more that one backup connection, while it must be assigned to one intended connection. For example, after rerouting of user traffic from a first intended to a first backup connection, a second backup connection can use available segments of the first intended connection (that is segments not anymore used by the first intended connection and not affected by a failure) for protecting a second intended connection affected by a failure. This mechanism allows a better usage of network resources, but the concurrent usage of the same resources by different connections can create blocking situations, also defined as “deadlock”. The deadlock is more probable in an ASON network, because of the parallelism of the distributed managers, that is of the CPEs, which can require to use the same resources (segments) at nearly the same time, for example for restoration purposes.
Nowadays the deadlock is avoided, that is the deadlock situation is not reached but it is prevented according to two solutions. In the first solution partial sharing of network resources is performed: a group of segments is assigned in advance for providing backup connections and these segments are used for protecting different intended connections. The deadlock is avoided defining a rule for assigning the shared segments, for example defining a different priority for each intended connection within the network or using a first come-first serve rule. Referring to priority rule, a first example is when two different intended connections require to use the same segments for protection for a failure on a segment of the corresponding intended connection: these segments are assigned to the intended connection having the highest priority (this is also indicated with ‘contention’). In this first solution usage of network resources is not optimized (because only segments of the group are shared) and consequently more resources are required for protection of intended connections; moreover, it requires the definition of a rule, which can be complex in case of a network including many connections. In the second solution complete sharing of network resources is performed; in fact, as explained above, after routing of traffic from an intended to a backup connection, available segments of the intended connection can be used for backup connections, that is also segments of intended connections can be dynamically shared. In this second solution deadlock is avoided because an existing connection which was first successfully established, is intentionally torn down after (this is also indicated with ‘pre-emption’); referring to
In view of drawbacks of the known solutions, the main object of the present invention is to provide a method to support a better network resources usage; this is performed by a method according to claim 1. The basic idea is to:
Advantages of this solution are better network resources usage, automatically deadlock detection, small time required for the detection and not to require the definition of any rule for the connections. A correlated advantage is the detection of the intended connections involved in the deadlock; this is performed by a method according to claim 2.
In traffic engineered networks complete sharing of network resources is applied, in order to achieve the best usage of network resources, and consequently the deadlock can occur if no rules are defined. In this case nowadays the network operator could only detect from a network management station an alarm for indication of a reversion blocked status of an intended connection for indicating that although no failure is present on the intended connection, reversion from the backup to the intended connection is not possible because at least one segment of the intended connection is not available but now he also needs to know the reason of the reversion blocked status, which can be a deadlock or can be a different reason. For example the unavailable segment of a first intended connection can be used by a second backup connection protecting a second intended connection in a not revertible status indicating that the second backup connection is protecting a second intended connection for a failure on at least one segment on the second intended connection (this is shown in
Referring to
Referring to
The inventive method can be performed through a software program running on hardware of a central management station or running on hardware of a CPE of an ASON network. The central management station can be a workstation or a personal computer, while the hardware of the CPE can be a microprocessor, external or embedded into an ASIC (Application Specific Integrated Circuit) or into a FPGA (Field Programmable Gate Array). The programming language can be for example C, C++or Java.
For the network operator the result of the software program is a graphic user interface, which includes not only a first indication of the reversion blocked status of an intended connection, but also a second indication of the deadlock involving the intended connection. In case of detecting the deadlock, the graphic user interface can also include an indication of the intended connections involved in the deadlock. Referring for example to
In case of a central management station, it will start the deadlock detection for example after a request from the network operator, who detects an alarm indicating a reversion blocked status of an intended connection and starts the method from this intended connection. The network operator can detect the alarm from more than one intended connection and can start the deadlock detection procedure for example from the intended connection corresponding to the first alarm received. In case of a CPE, each source CPE will react automatically and will perform independently the method for deadlock detection, starting from the controlled intended connection. Referring to the example of
A signalling protocol can be used to carry messages between the CPEs and an extension of the protocol is required for performing the inventive method, for example defining a new message or defining a new field into a known message. In particular, the RSVP protocol can be used, defining a new object into the Path_err message and a new object into the Notify message. Moreover, each intermediate CPE further stores information of the connections crossing the controlled network element and specifically identifiers of the intended connections and for each intended connection the address of the source CPE (that is the CPE controlling the source network element of the intended connection); this is required for notification of a failure as fast as possible to all the source CPEs controlling the intended connections crossing the failed segment and also for notification of a not available segment to a source CPE requesting the not available segment for configuration of a connection. Storing of this information can be performed during set-up of the intended or backup connection when transmitting the Path message, using a new object of the Path message or using the already defined Notify Request object: after receiving the message carrying information of identifier of the intended connection and address of the source CPE, each intermediate CPE will store locally this information. Referring to
Afterwards CPE1 requests the status of the intended connections of the first list by transmitting to CPE2, CPE4 and CPE5 a Notify message, which includes a new object indicating the address of the source CPE of the first list, identifiers of the controlled intended connections and the request of the status of the controlled intended connections; thus the new object of the Notify message transmitted to CPE2 includes the address of CPE2, identifier of c2 and the indication of the request of the status of c2, while the new object of the Notify message transmitted to CPE4 includes the address of CPE4, identifier of c4 and the indication of the request of the status of c4 and finally the new object of the Notify message transmitted to CPE5 includes the address of CPE5, identifier of c5 and the indication of the request of the status of c5. CPE2, CPE4 and CPE5 will answer respectively to the request by transmitting to CPE1 a Path_err message indicating the status of the controlled intended connections, which can be not revertible for a failure or can be reversion blocked. In the first case the message will include the indication of a failure (and eventually the address of the CPE controlling the network element detecting the failure), the address of the source CPE controlling the intended connection affected by the failure and identifier of the intended connection affected by the failure. In the second case the message will include the indication of a reversion blocked status of an intended connection, the address of the source CPE controlling the intended connection in the reversion blocked status and identifier of the intended connection in the reversion blocked status; the message will also include the address of the source CPE controlling the intended connection protected by the backup connection blocking the reversion and identifier of the intended connection protected by the backup connection blocking the reversion. Specifically, CPE2 will transmit to CPE1 the new Path_err message comprising the new object including the address of CPE2, identifier of c2 and an indication of reversion blocked status of c2; the new object further includes the address of CPE1 and identifier of c1. CPE2 will also transmit to CPE1 the new Path_err message comprising the new object including the address of CPE2, identifier of c2 and an indication of reversion blocked status of c2; the new object further includes the address of CPE6 and identifier of c6. CPE4 will transmit to CPE2 the new Path_err message including an indication of a failure, the address of CPE4 controlling c4 affected by the failure and identifier of c4 affected by the failure. CPE5 will transmit to CPE2 the new Path_err message including an indication of a failure, the address of CPE5 controlling c5 affected by the failure and identifier of c5 affected by the failure. The result is a second list including addresses of source CPEs and identifiers of the controlled intended connections protected by the backup connections using segments of c2. CPE1, after receiving from CPE2 the Path_err message including the address of CPE2, identifier of c2 and indication of reversion blocked status of c2, the address of CPE1 and identifier of the controlled connection c1 protected by b1 blocking c2, detects the deadlock, which involves the intended connections c1 and c2. Finally CPE1, after receiving from CPE2 the Path_err message including the address of CPE2, identifier of c2 and indication of reversion blocked status of c2, the address of CPE6 and identifier of the controlled connection c6 protected by b6 blocking c2, CPE2 transmits to CPE6 the new Notify message including the address of CPE6, identifier of c6 and the indication of the request of the status of c6; afterwards CPE6 will transmit to CPE2 a not revertible status of c6, that is the new Path_err message including an indication of a failure, the address of CPE6 controlling the intended connection affected by the failure and identifier of c6 affected by the failure.
It must be noticed that CPE2, CPE4, CPE5 and CPE6 detect the reversion blocked status or the not revertible status as explained for CPE1, that is transmitting the Path message and receiving from the intermediate CPEs the Path_err message including the list of the intended connections protected by backup connections blocking the reversion or including an indication of a failure on the intended connection and address of CPE detecting the failure; for example intermediate CPE8 transmits to source CPE2 a Path_err message including the address of CPE1 and identifier of c1. Afterwards, CPE2, CPE4, CPE5 and CPE6 transmit to CPE1 the indication of reversion blocked or not revertible status as explained above.
The new messages of the signalling protocol indicated above are transmitted by the control plane elements; each CPE is a hardware device which includes hardware means (for example a transmitter) adapted to transmit the messages to the other CPEs and includes hardware means (for example a receiver) adapted to receive the messages from the other CPEs.
Number | Date | Country | Kind |
---|---|---|---|
05 290 643.5 | Mar 2005 | EP | regional |