The present invention relates to network resiliency schemes in connection oriented networks. It has particular application in multi-protocol label switched (MPLS), and connection oriented Ethernet networks.
Network resiliency schemes are generally arranged to compensate for failures in a network by detecting failure of one of the nodes, or inter-node connections, in the network, and re-routing traffic to bypass the failure. Networks generally have a number of edge nodes at which traffic can enter and leave the network, and a number of intermediate nodes through which traffic can pass to travel from any one edge node to any other. Customer equipment that is arranged to communicate over the network will generally communicate with one or more edge nodes. In the simplest type of scheme, any piece of customer equipment can only communicate with one edge node. Therefore any re-routing carried out by the resiliency scheme cannot bypass the ingress node at which traffic enters the network, or the egress node at which it leaves the network. In some systems, dual parenting schemes are known in which the customer equipment can communicate with more than one ingress or egress node. This has the advantage that, if one of the edge nodes suffers a failure, then the customer equipment can still communicate over the network. However, no such dual parenting scheme is currently available for MPLS or other connection oriented networks. This means that resiliency schemes for MPLS networks rely on path diversity between the same ingress and egress nodes, and are therefore limited in the degree of resilience they can provide.
Accordingly, the invention seeks to preferably mitigate, alleviate or eliminate one or more of the disadvantages mentioned above singly or in any combination.
According to a first aspect of the present invention there is provided a connection oriented communications network comprising a plurality of nodes including a plurality of edge nodes, wherein the network is arranged to define a primary tunnel connecting together a primary pair of the edge nodes, and a secondary tunnel connecting together a secondary pair of the edge nodes, which is different from the primary pair, and wherein the network is arranged to enable switching of traffic from the primary tunnel to the secondary tunnel in the event of detection of a failure affecting the primary tunnel.
The first and second pair of nodes may have a common node, but each have at least one node that is not common to both pairs.
According to a second aspect of the present invention there is provided a node for a connection oriented network having a plurality of edge nodes, the node being arranged to identify a primary tunnel connecting together a primary pair of the edge nodes, and a secondary tunnel connecting together a secondary pair of the edge nodes, which is different from the primary pair, and wherein the node is arranged to enable switching of traffic from the primary tunnel to the secondary tunnel in the event of detection of a failure in the primary tunnel.
Further features of the present invention are as claimed in the dependent claims.
The present invention beneficially allows for network resiliency in an MPLS network, or other connection oriented network with node diversity.
Preferred embodiments of the present invention will now be described by way of example only with reference to the accompanying drawings.
Referring to
As is conventional each of the nodes 12, 14, 16, 18, 20, 22 includes one or more processors and associated memory and is arranged to perform a number of functions relating to the operation of the network, and these functions are grouped, from an organisational point of view, into planes. The management plane provides overall management of the network and includes a network management system (NMS) that provides an interface for a user to provide manual control of the network. The control plane uses a number of protocols and allows for communication between the nodes and configuration of the forwarding plane. The forwarding plane performs the forwarding operations that transmit data across the network. The control plane protocols, which can include, for example, OSPF-TE and RSVP-TE, are also arranged to provide communication between the edge nodes 12, 14, 16, 18 and optionally the customer equipment 24, 26. One of the main functions of the control plane protocols is to determine the route that any communication will take across the network.
In order for the two customer equipment (CE) units 24, 26 to be able to communicate with each other, the control plane protocol of the network identifies one primary edge node 12 for one of the customer equipment units 24, and one primary edge node 14 for the other 26. For communication from the first unit 24 to the second 26, the first primary edge node 12 is defined as the primary ingress node and the second primary edge node 14 is defined as the primary egress node. The control plane protocols also define a number of communications tunnels each of which provides communication from one of the edge nodes 12, 14, 16, 18 to another. Each of the tunnels is a route between two nodes along which traffic can be transmitted in the network 10 such that the traffic, which is in the form of packets in an MPLS network, is opaque to intermediate nodes. In an MPLS network this can be achieved because the intermediate nodes only read the label on each packet and not the data within the packet. The ingress node 12 that directs the traffic across the network identifies the egress node 14 to which it needs to send the traffic, and then attaches labels to the information which identify which tunnel it is to be transmitted through. Each tunnel is defined by a number of parameters, which include the two edge nodes that the tunnel connects. Other defining parameters of the tunnels can include any one or more of the following, all of which are included in this embodiment: the characteristics of the traffic it can carry, the quality of service (QoS) scheme it operates, the service class associated to the MPLS tunnel, its maximum and mean bandwidth, its maximum burst size for example. It is therefore possible to have more than one tunnel between any pair of edge nodes, the two tunnels having different characteristics or parameters. Each tunnel may include only a single label switch path (LSP), or it may include a plurality of LSPs the most appropriate of which will be selected at any given time depending on the characteristics and performances required by the client service. It will be appreciated that the customer traffic transmitted in the tunnel can be encapsulated into pseudo-wires arranged to support Ethernet services as well as any type of service admitted in the network.
During communication over the network, a number of checks are carried out by the nodes as defined by the control plane protocols to identify failures in the network. Many methods of detecting failures are well known and will not be described in detail. A management information base (MIB), which is related to and assists the control plane, is arranged to store data relating to what faults are detected in the network. Also all of the nodes of the network are arranged to communicate with each other to check for, and communicate information about, failures in the network, which will affect which of the tunnels is available for use at any time. This form of checking is carried out when the CE unit 24 first transmits a communication for transmission across the network and will be described in more detail below. This allows selection of which of the tunnels is used depending on what faults are present.
For any type of communication between the CE unit 24 and the CE unit 26 a primary tunnel is defined which has one edge node 12 as its ingress node, and one edge node 14 as its egress node. In this embodiment the sending CE unit 24 is linked to, and can communicate with, a secondary ingress node 16, as well as the primary ingress node 12, and the receiving CE unit 26 is linked to, and can communicate with, a secondary egress node 18, as well as the primary egress node 14. The primary tunnel AC is therefore defined from the primary ingress node 12 to the primary egress node 14. Three secondary tunnels are also defined: one AD from the primary ingress node 12 to the secondary egress node 18, one BC from the secondary ingress node 16 to the primary egress node 14, and one BD from the secondary ingress node 16 to the secondary egress node 18. Therefore each of the tunnels is between a different pair of nodes, having one of its nodes in common with one other tunnel, another of its nodes in common with another tunnel, and no nodes in common with the remaining tunnel.
When the sending CE unit 24 wants to start communicating with the CE unit 26 it is arranged to direct the communication traffic to the primary ingress node 12. This traffic includes an identification of the customer equipment unit 26 that the communication is to be transmitted to, as well as information regarding the characteristics of the communication. On receiving this communication, the primary ingress node is arranged to start transmitting over the network to the primary egress node, and to trigger a number of communications between the edge nodes to check for any faults that might result in a need for a tunnel other than the primary tunnel to be used for the communication. It will be appreciated that these communications between the edge nodes can take a number of formats, but it is desirable that they are sufficient to identify which of the tunnels are available for use and which are not.
In this embodiment the primary ingress node 12, which is now the active ingress node, sends the traffic from the CE unit 24 to the primary egress node 14. The active egress node, on receiving the traffic, sends it on to the receiving CE unit 26 and monitors the traffic. Suitable instruments, using either standard messages or a proprietary protocol, are used to check whether or not the LSP tunnel is successfully transporting the traffic from the active ingress node to the active egress node. This enables the active ingress node 12 to determine if there is a fault in the primary tunnel AC. Provided no fault is detected on the primary tunnel, the traffic continues to be transmitted via the primary tunnel.
Also, when the communication begins, the primary and secondary edge nodes are arranged to exchange a series of messages so that each of them can determine which is the current active tunnel, which are the secondary or standby tunnels, and whether there are any faults in the primary or any of the secondary tunnels. It will be appreciated that these messages can take a number of forms. In this embodiment, the state message to verify LSP integrity is also sent to the standby ingress node 16. Proper signalling messages are also exchanged between the primary ingress node 12 and the secondary ingress node 16 to identify their respective status as ingress nodes. This enables the active ingress node 12 to check whether the secondary ingress node 16 has experienced a fault or not. The active ingress node 12 also sends a check signal to the secondary egress node 18 to enable it to identify its status as secondary egress node. The active ingress node 12 is enabled to check whether the standby egress node 18 is available for use if necessary, by means of keep-alive signalling messages, which are periodically exchanged between the two nodes. The active ingress node 12 is also arranged to send an acknowledgement signal back to the sending CE unit 24 to confirm that it is receiving the traffic and sending it over the network to the receiving CE unit 26. All of these signals and checks can be repeated during the communication to check continually on the status of each of the tunnels. These signals and checks can be exchanged among primary and standby edge nodes enabling them to check the status of the network.
For as long as the primary tunnel AC continues to function and provide the communication required, the communication will continue over it. However, if a fault is detected in either of the primary edge nodes 12, 16, or indeed in any intermediate nodes 20 forming part of the primary tunnel AC that cannot be bypassed within the primary tunnel, then the network is arranged to switch to one of the standby tunnels so that communication can continue. The most appropriate tunnel will be selected depending on which of the nodes and LSPs are functioning correctly. Each of the nodes 12, 14, 16, 18, 20, 22 is arranged to monitor for and identify faults during the communication, and if any of them identifies a fault that requires, or might require, a change of active tunnel, then it is arranged to communicate this to each of the other edge nodes. In this way all of the edge nodes can maintain an accurate record of which is the currently active tunnel, and whether there are any faults in any of the edge nodes 12, 14, 16, 18 or intermediate nodes 20, 22 that might affect the choice of active tunnel.
Referring to
If the active ingress node 12 receives a message indicating a fault affecting the primary tunnel, but is still functioning itself, then it can re-direct the traffic to the appropriate secondary tunnel, and the new active egress node can transmit the traffic on to the receiving CE 26. Therefore neither of the CE units 24, 26 needs to take an active part in the process, except that the receiving CE obviously needs to be able to receive data from the secondary egress node. If a fault occurs which results in a need to switch to the secondary ingress node 16, the communication is transported in the secondary tunnel.
The CE unit 24 needs to be able to transmit data to the secondary ingress node 16 as well as to the primary ingress node 12. Optionally, external means can be used to communicate with the sending CE unit 24, instructing it to switch to sending the traffic to the secondary ingress node 16. This means can be a proprietary message set or proprietary extensions to existing protocols (e.g. Spanning Tree), and is conceived to shorten the time, which is necessary for the CE unit to become aware of the necessity to transmit traffic to the secondary ingress node (without waiting for the learning process to take place). The CE unit 24 therefore needs to be arranged to respond to instructions from either of the ingress nodes 12, 16 to switch from directing traffic to one of them to sending traffic to the other. However, in this embodiment, the sending CE unit 24 does not take an active part in determining which ingress node it should communicate with. It is however not required that the sending CE unit 24 is aware of any fault occurring in the MPLS network.
Referring to
Referring to
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2006/009730 | 10/9/2006 | WO | 00 | 5/5/2009 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2008/043374 | 4/17/2008 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5483519 | Satomi et al. | Jan 1996 | A |
5835696 | Hess | Nov 1998 | A |
5848227 | Sheu | Dec 1998 | A |
6283465 | Etter | Sep 2001 | B1 |
7283465 | Zelig et al. | Oct 2007 | B2 |
7313087 | Patil et al. | Dec 2007 | B2 |
7345991 | Shabtay et al. | Mar 2008 | B1 |
7484010 | Sridhar et al. | Jan 2009 | B2 |
7505402 | Filsfils et al. | Mar 2009 | B2 |
7535828 | Raszuk et al. | May 2009 | B2 |
7644317 | Sajassi et al. | Jan 2010 | B1 |
7646769 | Arai et al. | Jan 2010 | B2 |
7983150 | Bruckman et al. | Jul 2011 | B2 |
20020112072 | Jain | Aug 2002 | A1 |
20030123446 | Muirhead | Jul 2003 | A1 |
20030140164 | Kitamura | Jul 2003 | A1 |
20030154259 | Lamberton et al. | Aug 2003 | A1 |
20030233595 | Charny et al. | Dec 2003 | A1 |
20040078632 | Infante et al. | Apr 2004 | A1 |
20040156310 | Fredette et al. | Aug 2004 | A1 |
20040165600 | Lee | Aug 2004 | A1 |
20050025069 | Aysan | Feb 2005 | A1 |
20050089010 | Rue et al. | Apr 2005 | A1 |
20050152269 | Liu | Jul 2005 | A1 |
20050243713 | Okuda | Nov 2005 | A1 |
20060002370 | Rabie et al. | Jan 2006 | A1 |
20060031490 | Provine et al. | Feb 2006 | A1 |
20060047851 | Voit et al. | Mar 2006 | A1 |
20060126495 | Guichard et al. | Jun 2006 | A1 |
20060164975 | Filsfils et al. | Jul 2006 | A1 |
20060209682 | Filsfils et al. | Sep 2006 | A1 |
20060221813 | Scudder et al. | Oct 2006 | A1 |
20070091794 | Filsfils et al. | Apr 2007 | A1 |
20070091795 | Bonaventure et al. | Apr 2007 | A1 |
20070091796 | Filsfils et al. | Apr 2007 | A1 |
20070121486 | Guichard et al. | May 2007 | A1 |
20070124464 | Lean et al. | May 2007 | A1 |
20070133406 | Vasseur | Jun 2007 | A1 |
20070140247 | Ansari | Jun 2007 | A1 |
20070201355 | Vasseur et al. | Aug 2007 | A1 |
20070217419 | Vasseur | Sep 2007 | A1 |
20070220175 | Khanna et al. | Sep 2007 | A1 |
20070268915 | Zelig et al. | Nov 2007 | A1 |
20070280102 | Vasseur et al. | Dec 2007 | A1 |
20080068983 | Dunbar et al. | Mar 2008 | A1 |
20080279110 | Hart et al. | Nov 2008 | A1 |
20080285442 | Bruckman et al. | Nov 2008 | A1 |
20090010182 | Tochio | Jan 2009 | A1 |
Number | Date | Country |
---|---|---|
1489861 | Dec 2004 | EP |
1653687 | May 2006 | EP |
11017686 | Jan 1999 | JP |
11041282 | Feb 1999 | JP |
11220486 | Aug 1999 | JP |
2003218911 | Jul 2003 | JP |
2004146989 | May 2004 | JP |
2005012812 | Jan 2005 | JP |
2005033485 | Feb 2005 | JP |
2005057514 | Mar 2005 | JP |
2006504186 | Feb 2006 | JP |
2006129094 | May 2006 | JP |
2006135872 | May 2006 | JP |
2006135972 | May 2006 | JP |
2007208794 | Aug 2007 | JP |
Entry |
---|
Lang, J. P. et al. “RSVP-TE Extensions in Support of End-to-End GMPLS-based Recovery.” CCAMP Working Group, Internet Draft, Feb. 2004. Available online at: http://tools.ietf.org/html/draft-ietf-ccamp-gmpls-recovery-e2e-signaling-03. XP002446707. |
Papadimitriou, P. “RE: Shared Mest Restoration—e2e Recovery Signaling Draft.” Jul. 26, 2006. Available online at: http://psg.com/lists/ccamp/ccamp.2006/msg00574.html. XP002446708. |
Sharma, V. et al. “Framework for Multi-Protocol Label Switching (MPLS)-based Recovery.” Network Working Group, Request for Comments: 3469, Feb. 2003. Available online at: http://www.ietf.org/rfc/rfc3469.txt. XP015009252. |
Bangnulo M. et al, “Multi-homing tunnel broker”, Euromicro Conference, 2004. Proceedings 30th Rennes, France Aug. 31-Sep. 3, 2004 Piscataway, NJ, USA, IEEE Aug. 31, 2004, pp. 282-280, XP010723602 ISBN: 978-0-7695-2199-2. |
Pelsser C. et al., Extending RSVP-TE to support inter-AS LSPs, High Performance Switching and Routing, 2003, HPSR. Workshop on Jun. 24-27, 2003, Piscataway, NJ, USA, IEEE Jun. 24, 2003, pp. 79-84, XP010654648 ISBN: 978-0-7803-7710-3. |
Higginson, Peter L. et al. “Development of Router Clusters to Provide Fast Failover in IP Networks.” Digital Technical Journal, vol. 9, No. 3, 1997, pp. 32-41. |
Lasserre, Marc et al. “Virtual Private Lan Services Using LDP.” 28 pages. Internet Engineering Task Force, Internet Draft Document, L2VPN Working Group, vol. 12vpn, No. 9, Jun. 2006. The Internet Society, Reston, VA. |
Muley, Praveen et al. “Pseudowire (PW) Redundancy.” 12 pages. Internet Engineering Task Force, Network Working Group, Internet Draft, Sep. 22, 2006. The Internet Society, Reston, VA. |
Li, T. et al. “Cisco Hot Standy Router Protocol (HSRP).” 16 pages. Internet Engineering Task Force, Network Working Group, Request for Comments: 2281, Mar. 1998. The Internet Society, Reston, VA. |
Network World, “Impact of Guaranteed Networks (advertisement)”, Network World, separate volume of Windows Server World, IDG Japan, Inc. No. 10, vol. 4, Apr. 1, 2005, pp. 6-7 (Published in Japanese Only). |
Number | Date | Country | |
---|---|---|---|
20100002578 A1 | Jan 2010 | US |