1. Field of the Invention
The present invention relates to mesh networks and, more particularly, to a method and apparatus for cycle-based restoration in mesh networks.
2. Description of the Related Art
Data communication networks may include various computers, servers, nodes, routers, switches, hubs, proxies, and other devices coupled to and configured to pass data to one another. These devices will be referred to herein as “network devices.” Data is communicated through the data communication network by passing data packets (or cells, frames, or segments) between the network devices by utilizing one or more communication links. A particular packet may be handled by multiple network devices and cross multiple communication links as it travels between its source and its destination over the network. A subset of data packets destined to travel between the same end points on a network are referred to as a flow. Multiple flows may be carried by a single communication link.
Communications networks may experience local failure on a link for many reasons. To mitigate against the effect of such failures, it is desirable to be able to restore flows in the network with minimum delays, especially where the flows are associated with traffic of high importance or with revenue producing traffic. One way to restore the flows in the network is to cause the flows to be diverted around the failed area over one or more restoration paths until the area can be repaired.
Restoration paths may be created after a failure is detected utilizing a protocol exchange between network devices involved in handling traffic over the affected link. Unfortunately, protocol exchanges are time consuming. For example, it may take up to 30 seconds to establish a new path through an IP network using an Open Shortest Path First (OSPF) protocol exchange.
Alternatively, restoration paths may be determined ahead-of-time, before a failure takes place, and pre-programmed into the network devices handling the traffic on the network. By pre-programming the restoration paths into the network, a network device that senses a link failure may simply use the alternative restoration path to restore traffic in real time. One example of a network protocol that includes alternate restoration paths is SONET. Specifically, in SONET there are two choices (east/west). If east isn't available, the west choice is utilized. SONET restoration is thus very fast, since there is no need to notify the source to redirect the traffic—it is done by the SONET switch itself.
Networks have evolved since SONET was developed, so that now switches residing in mesh networks are able to choose between many different paths to restore traffic over a failed link. For example, if two nodes A and B are connected by three links, it is possible to set the maximum bandwidth on each link to be 66% and reserve 33% on each link as back-up. In the event that one of the links fails, the 66% traffic from the failed link may be split and passed over the 33% backup capacity on each of the other two links to achieve total restoration over the links.
To designate portions of a mesh network to back up particular links in the mesh, logical restoration rings, referred to herein as p-cycles, are defined. Defining a p-cycle enables the network devices to perform fast link restoration protection without requiring notification of the source or destination nodes. Thus, in a mesh network the working path will be carried over the mesh network, while the restoration path will be carried over the logical ring. Restoration of traffic from a failed arc, however, should not have an unplanned negative impact on traffic which normally uses the restoration path.
Several methods have been developed to attempt to create restoration paths through a mesh network, one of which is to attempt to restore protection cycles (p-cycles). Unfortunately, this method tends to establish back-up paths with many hops (arcs) which results in a less than optimal restoration path. Additionally, p-cycles do not account for flows or bandwidths on the various links used to establish a back-up path. Hence, p-cycles may result in restoring flows across a bottleneck in the network, which either causes additional connectivity displacement in the network or results in an ineffectual restoration of traffic from the failed link.
The present invention overcomes these and other drawbacks by providing a method of establishing restoration cycles on a mesh network that is able to account for individual flows and bandwidths on arcs in the mesh network. By considering flows and bandwidths over particular arcs or other sections of the network, it is possible to enhance restorability in the network and design redundancy into the network in a more effective manner.
According to an embodiment of the invention, a two step method for determining optimal P-cycles for a mesh network includes generating a set of possible restoration cycles, and selecting a subset of the cycles that optimize restorability links on the network, assuming a single link failure in the network. The subset of cycles may be selected by choosing one or more low-cost restoration cycles for each link to be restored. The cost of the restoration cycle may be based on the availability of capacity on the link to restore traffic with and/or without interference. The cost of the restoration cycle may also be based on network policy, such as whether the cost for the link should be based on the total cost for the entire restoration cycle or the average cost for the restoration cycle. Thus, embodiments of the invention provide flexibility in allowing for selection of which class, type, or priority of traffic to restore and which not to restore, whenever there is not enough capacity for complete restoration.
Aspects of the present invention are pointed out with particularity in the appended claims. The present invention is illustrated by way of example in the following drawings in which like references indicate similar elements. The following drawings disclose various embodiments of the present invention for purposes of illustration only and are not intended to limit the scope of the invention. For purposes of clarity, not every component may be labeled in every figure. In the figures:
The following detailed description sets forth numerous specific details to provide a thorough understanding of the invention. However, those skilled in the art will appreciate that the invention may be practiced without these specific details. In other instances, well-known methods, procedures, components, protocols, algorithms, and circuits have not been described in detail so as not to obscure the invention.
A(i), (iεN) will be defined as the set of arcs connected to node i. In the following discussion, it will be assumed that |A(i)|≧2; if this condition is not met there is only one arc connected to node i, thus preventing another arc from being used to restore traffic to node i. Note that when |A(i)|=2, both of the arcs emanating from node i will be part of the restoration cycle for that node.
Several other definitions are also useful to describe the network. Each arc ij has a capacity (bandwidth) cij, with ij belonging to the set A: ijεA. The network contains a set of source nodes No and sink nodes Nd, with NoεN and NdεN.
Tod will be defined as the traffic demand between a pair of nodes od, with o belonging to the set of source nodes (oεNo) and d belonging to the set of sink nodes (dεNd.). The path used to route traffic from o to d will be defined as Pod. It is assumed herein that these paths are known. To enable arcs on a particular path to be associated with the path, a quantity Yij,od is defined as: Yij,od=1 if arc ij is on path from Node o to Node d (Pod), otherwise Yij,od=0.
Thus, the traffic fij on a particular arc ij is the sum of the traffic between pairs of source and sink nodes that include that arc on the path between the source and sink nodes: fij=ΣodTodYij,od≦cij. As noted, the traffic on a particular arc must be less than or equal to the capacity cij for that arc. In a similar fashion a quantity fij,gh may be defined which represents the total flows which uses both arc ij and arc gh.
According to an embodiment of the invention, a two step method for determining optimal P-cycles for a mesh network includes generating a set of possible restoration cycles, and selecting a subset of low-cost restoration cycles on the network. The cost of a restoration cycle may be based, in part, on the availability of capacity on the link to restore traffic with and/or without interference, and network policy. Enabling the restoration cycles to be determined based on the bandwidth capacity of the restoring link, the flows carried by the other link, and other network policy considerations enables embodiments of the invention to account for traffic class, priority, and other traffic considerations when selecting restoration cycles on the network. As discussed below, by determining the cycles in the network and then selecting cycles which are to be used for restoration based at least in part on the flows on the arcs and the capacity of the arc, it is possible to more selectively and efficiently select restoration cycles in the network to restore high priority or high revenue flows in the event of a failure on the network.
Generating a Set of Possible Restoration Cycles in the Network
A planar graph is a graph in which no arcs cross each other. For any network that can be converted to a planar graph there are methods for determining all the cycles in that network. In this application, it will be assumed that the network is a planar graph, or that it is convertible into a planar graph and has been converted into a planar graph.
A p-cycle is a pre-configured cycle formed out of spare capacity arcs. A p-cycle is shown in bold lines in
One way to avoid excessively large p-cycles is disclosed by Wang, et al., U.S. patent application Ser. No. 09/703,715, the content of which is hereby incorporated herein by reference. In this application, Wang discloses a K-diversity line protection approach wherein an arc (i,j) is said to be K-diverse if there exists at least K disjoint paths from i to j. A set of nodes is then grouped according to the K-diversity category to obtain what is called K-diversity sub-network. A ring topology, for example, is a 2-diversity sub-network.
There are several known methods of generating all possible cycles in a planar graph network. For example, one well-known method was disclosed by Dogrusoz et al., Enumerating All Cycles Of A Planar Graph, Journal of Parallel Algorithms and Applications, No. 10, pages 21-36, (1996), the content of which is hereby incorporated herein by reference. In the following discussion it will be assumed that the cycles for a given planar graph network are known.
Selecting a Subset of Cycles
Once the set of cycles has been established, a subset of the cycles to be used for restoration is selected. According to one embodiment of the invention, costs are assigned to each cycle and a subset of cycles is selected to implement network policy to minimize the total cost, average cost, or other considerations, while also attempting to ensure that every arc is restorable upon failure. In the following discussion, the following variables will be used to describe the network and flows on the network:
Xk=0 or 1, ∀kεR
According to an embodiment of the invention, bj, and mkij are selected to enable network flows to be restored in addition to bandwidths on a link to be restored.
Consider the network G={N, A} and a pair of adjacent nodes i and j such that there exists an arc ijεA. For cycle k let there be Kij diverse routes for this node pair. For each diverse route v let wvk(ij), v=1, 2, . . . Kij, be all the arcs along this path (excluding arc ij).
The total amount of traffic which is restorable by cycle k without interference is determined by the arc on the restoration path that has the least unused capacity. Similarly, the amount of traffic that may be restored with interference is determined by the arc on the restoration path with the lowest total capacity. Stated mathematically:
dvk(ij)=min {cgh−fgh+fgh,ij}, ∀ghεwvk(ij), and evk(ij)=min {cgh}, ∀ghεwvk(ij).
If
when arc ij fails, then the total amount of traffic that is restorable without interference, is dk(ij) and the total amount of traffic that is restorable with interference is ek(ij).
When arc ij fails the maximum available spare capacity for restoration of its flow by all the cycles is
without interference and
with interference. To account for arcs K-diversity routes, wk(ij) is defined as the set of all the arcs in all the Kij diversity routes for arc
In a p-cycle, a spanning arc ij has only one restoration path, i.e. Kij=1, hence we have dk(ij)=d1k(ij), ek=e1k(ij), and wk(ij)=w1k(ij). However for a straddling arc ij there are two distinct paths, i.e. Kij=2, hence we have dk(ij)=d1k(ij)+d2k(ij), ek(ij)=e1k(ij)+e2k(ij), and wk(ij)=w1k(ij)∪w2k(ij). For example, assume, as shown in
c4,10=10, c4,5=10, c5,10=11, f4,10=9, f4,5=6, f5,10=11, f4,5:4,10=3, f5,10:4,10=4,
then we have dG(10,4)=d1G(10,4)=min {10−6+3; 11−5+4}=7 and eG(ij)=e1G(10,4)=min {10,10}=10. Thus, the maximum amount of traffic on arc (10,4) that is restorable without interference is 7, and the maximum amount of traffic on arc (10,4) that is restorable with interference is 10.
To ascertain the maximum available spare capacity, it is desirable to utilize a portion of the available spare capacity that may be used to restore the flow without interference, and a portion of the capacity that may be used to restore the flow with interference. Equation 1 allows a network manager or other entity to set the amount of each type of capacity to be utilized in the event of failure of a given arc.
mkij=αddk(ij)+αeek(ij), 0≦αd, αe≦1, αe=1−αd. Equation (1)
When αd=1, restoration traffic will not interfere with any other flows on the cycle, and when αd=0, restoration traffic will interfere with all flows on the cycle. The choice of αd, and hence αe, is made to implement network policy, such as by a network engineer or network policy software program, and the selection is based on the importance of the traffic on the failed arcs verses the importance of the traffic on the restoring paths. The relative importance may be judged in terms of revenue, traffic class, traffic sensitivity, or numerous other criteria deemed important in administering or establishing the network.
One general goal may be able to restore as much traffic as possible. One way of looking at this is to look at the ratios of flows that are restorable and not restorable. The maximum ratios of restorable flows with and without interference (rdk(ij) and rck(ij)) are defined as: rdk(ij)=min {1, dij/fij} and rek(ij)=min {1, eij/fij}. In the example discussed above in connection with
To determine the total cost of a protection cycle, the cost of each arc on the ring is evaluated. The total cost may include a sum of the costs for all arcs on the ring, an average cost for arcs on the ring, or some combination of the two depending on network policy. To calculate the costs, a quantity hk is defined as the number of hops on cycle k and let
be the set of all the arcs on cycle k, including ij, i.e. |wk|=hk. Depending on what is more important from a design perspective, it may be desirable to minimize the total cost for a given
or the average cost for the cycle
A weighted sum (Equation 2) is then used to enable a network engineer or other entity to select the relative importance of each cost metric on the particular implementation on the network.
bk=φ1bk1+φ2bk2, φ1≧0, φ2≧0, with φq>0 for either q=1 or 2 or both. Equation (2)
The ability to select φ1 and φ2 provides flexibility in deciding which traffic is to be restored. Specifically, it allows penalty costs to be established based on network policy and allows the penalty costs to be associated with non-restorability of flows on the network. The coefficients could be associated with delay, with penalty costs on partial links, or any other considerations. Additionally, if there is a particular arc that is not to be used to restore traffic, it is simple to select φ1 and φ2 such that the cost of going through the link is to high, so that other restoration paths will be preferred to that high cost path.
An additional advantage is that it is now possible to decide, during an arc failure, which traffic flows to restore, which traffic flows not to restore, and which traffic should preempt traffic on otherwise non-failing links. All of this allows more flexibility in designing the network and enables further flexibility in running the network.
After the set of possible restoration cycles has been ascertained, the software will determine traffic flows between nodes on the network 102. Ascertaining traffic flows between nodes, and optionally the type of traffic flowing between nodes, is useful to enable the software to take the traffic flows into account when ascertaining which arcs on the network have spare capacity and which are carrying traffic that may be preempted. The traffic flow information may contain information useful to the network engineer or other software program configured to set values as to which traffic may be preempted and which traffic should be afforded high priority status.
The software will then look to establish one or more restoration cycles for each of the arcs on the network. The mesh network will carry the traffic in the ordinary course, however the network devices forming the nodes will be configured to protection switch to one or more selected restoration paths upon occurrence of a failure on the arc destined to carry traffic from that node.
Accordingly, for each arc to be restored, the software will determine a set of possible restoration cycles on the network that may be used to restore flow over the particular arc to be restored 104. The set of possible restoration cycles may be determined in any convenient manner, such as by looking for all restoration cycles that include the restored arc as either a spanning arc or a straddling arc. Other methods may be utilized as well, and the invention is not limited to any particular manner of selecting a subset of possible restoration cycles.
The software will then select one of the possible restoration cycles having one or more arcs 106, and determine the penalty cost associated with the proposed restoration cycle 108.
The software, according to one embodiment of the invention, may implement several steps to ascertain the penalty cost of the particular restoration cycle of interest. For example, according to one embodiment of the invention, the software may ascertain, for each arc on the selected restoration cycle, the maximum available spare capacity for flow restoration without interference 110, and the maximum available spare capacity for flow restoration with interference 112. After these values are calculated, the software may obtain coefficients indicative of the amount of available spare capacity with and without interference that is to be used to restore flows over that particular arc 114. These values may be set by a network administrator or network policy software module taking into account the types of traffic on the respective arcs, the relative importance of the traffic, or any other number of criteria.
Once the coefficients have been selected and applied, the software will determine ratios of flows that are not restorable over the arc with and without interference 116. If there are additional arcs on the restoration cycle 118, the software will process each additional arc in a similar manner.
After relevant arcs on the restoration cycle have been evaluated, the software will determine the total cost for the restoration cycle and the average cost for the restoration cycle 120. Coefficients related to how the total cost and average cost are to be used to determine the penalty for the restoration cycle are then applied by the software 122 to enable a network administrator to select which cost metric should be utilized on the network. For example, the network administrator may determine that the total cost for a restoration path is more important than the average cost over the restoration path. Conversely, the network administrator may determine that it is important to maintain a low average cost on arcs on the network. Applying coefficients to enable these trade-offs to be made enables the software to be flexible to take into account network policy while accounting for flows on the network.
Once the penalty cost for a particular restoration cycle has been determined, the software determines if there are any additional potential restoration cycles for the arc to be restored 124. If there are additional restoration cycles to be evaluated, the software will select and evaluate restoration cycles until it has found a restoration cycle with a sufficiently low penalty cost or until it has evaluated all restoration cycles in the set of possible restoration cycles.
When there are no additional restoration cycles to evaluate for restoration of a particular arc, the software will select one or more restoration cycles with acceptable penalty costs as the restoration cycle(s) to be used in the event of failure on the particular arc 126. The software will continue to select restoration cycles for the other arcs on the network 128 until a set of restoration cycles has been established for the network 130.
The determination of restoration paths may also be used proactively during the network design phase or in connection with a network monitoring program, to help determine where to optimally place spare capacity in the network. For example, the software may identify arcs having relatively high penalty cost restoration cycles as candidate areas on the network that could benefit from additional capacity. This enables the network to be designed initially to increase restorability of connectivity between two nodes in the event a connecting arc fails. Likewise, this enables ongoing evaluation of the network to identify bottlenecks in the event of link failure to identify target areas for deployment of additional spare capacity.
The network device 50 includes conventional modules, such as a memory 58, and a protocol stack 60 containing instructions and data relevant to common communications protocols to enable the network device 50 to communicate with other network devices over one or more communications ports 62.
The restoration cycle software 56 may have one or modules configured to implement the functions and acts described above in connection with
Optionally, as illustrated in
The invention is not limited to the particular illustrated embodiment, but rather extends to all embodiments configured to implement the methods of the invention. Accordingly, the software need not be divided into the specific modules illustrated in
The control logic of
It should be understood that various changes and modifications of the embodiments shown in the drawings and described in the specification may be made within the spirit and scope of the present invention. For example, the invention has been described as looking at the amount of traffic that is restorable with and without interference, the invention is not limited to this embodiment but also includes embodiments wherein the type of traffic is considered, i.e. high priority traffic utilizes a first set of protection cycles and low priority traffic utilizes a second set of protection cycles. Accordingly, it is intended that all matter contained in the above description and shown in the accompanying drawings be interpreted in an illustrative and not in a limiting sense. The invention is limited only as defined in the following claims and the equivalents thereto.
This application claims the benefit of and claims priority to U.S. Provisional Patent Application. No. 60/451,898, filed Mar. 4, 2003, entitled “Cycle-Based Restoration Algorithm With Bandwidth and Flow Considerations for Mesh Networks: Developing the Objective Function”.
Number | Name | Date | Kind |
---|---|---|---|
5850505 | Grover et al. | Dec 1998 | A |
6377543 | Grover et al. | Apr 2002 | B1 |
6914880 | Grover et al. | Jul 2005 | B1 |
6982951 | Doverspike et al. | Jan 2006 | B2 |
7075892 | Grover et al. | Jul 2006 | B2 |
20020181393 | Grover et al. | Dec 2002 | A1 |
20030021222 | Boer et al. | Jan 2003 | A1 |
20030035379 | Zimmel et al. | Feb 2003 | A1 |
20030046378 | Zimmel et al. | Mar 2003 | A1 |
20040109407 | Grover et al. | Jun 2004 | A1 |
20040133663 | Grover et al. | Jul 2004 | A1 |
20040190445 | Dziong et al. | Sep 2004 | A1 |
Number | Date | Country | |
---|---|---|---|
60451898 | Mar 2003 | US |