The disclosure relates to network protection, and more particularly, to methods and apparatus for network protection therefor.
Unless otherwise indicated herein, the approaches described in this section are not prior art to the claims in this application and are not admitted to be prior art by inclusion in this section.
Currently, storm protection is usually employed in layer 2/layer 3 Ethernet switches, to avoid excessive bandwidth consumption due to flooding of unknown-unicast/multicast/broadcast traffic (broadcast storm). The storm protection exercises an upper limit for unknown-unicast/multicast/broadcast traffic respectively on per port and/or Virtual Local Area Network (VLAN) basis.
However, the MAC learning speed of switch is not in line rate. Take a typical switch employing ERP (for example, ERICSSON SPO1400) as an example, 3 k/s MAC learning speed with 256 k dynamic MAC cache space, 256/3=85.3 s is needed if 256 k MAC involved. Unknown unicast traffic will be flood within unknown-unicast up limit, so, most packets are dropped because of storm protection. Even if protection switch is happened within 50 ms, the real traffic interruption time is coming to seconds instead of millisecond if related MAC number need learning is in thousands, since traffic interruption time is almost near to ERP switching time plus MAC learning time. A similar problem would arise in the MPLS-TP network protection and the other network protection methods employing storm protection.
The present disclosure proposes a method and apparatus for network protection.
In an aspect of the disclosure, there is provided a method for network protection, including the steps of: detecting a switch indicator in a network; setting a rate limit of storm protection, which is of a first value, as a second value, the second value being higher than the first value; and performing a flush operation of a Forwarding DataBase FDB.
Alternatively, the method further includes: starting a timer when the rate limit is set as the second value, wherein the rate limit is set back to the first value when the timer expires.
Alternatively, duration of the timer may be calculated based on a number of MAC needed to be learned, MAC learning speed, and protection switch time.
Alternatively, the method further includes: locking setting of the rate limit from being manually operated before setting the rate limit as the second value; and unlocking the setting of the rate limit after the rate limit is set back to the first value.
Alternatively, the network protection is an Ethernet Ring Protection ERP or Multi-Protocol Label Switching-Transport Profile MPLS-TP protection.
Alternatively, the switch indicator is a link failure indicator, a node failure indicator, an Operation Administration and Maintenance OAM indicator, an operator command, or a recovery event.
Alternatively, the second value is in a range of 50%-90%.
In another aspect of the disclosure, there is proposed an apparatus for network protection, including: a detecting unit configured to detect a switch indicator in a network; a setting unit configured to set a rate limit of storm protection, which is of a first value, as a second value, the second value being higher than the first value; and a flushing unit configured to perform a flush operation of a Forwarding DataBase FDB.
Alternatively, the apparatus further includes a timer configured to start when the rate limit is set as the second value, wherein the setting unit is further configured to set the rate limit back to the first value when the timer expires.
The above and other objects, features and advantages of the present disclosure will be clearer from the following detailed description about the non-limited embodiments of the present disclosure taken in conjunction with the accompanied drawings, in which:
Various embodiments of the present disclosure are now described with reference to the drawings, wherein like reference numerals are used to refer to like elements throughout. In the following description, numerous specific details are set forth for purposes of explanation, in order to provide a thorough understanding of one or more embodiments. It will be evident to one of ordinary skill in the art, however, that some embodiments of the present disclosure may be implemented or practiced without one or more of these specific details. In other instances, well-known structures and devices are shown in block diagram form in order to facilitate describing embodiments.
Reference is now made to
In step S110, the node detects a switch indicator in the network. By way of a non-limiting example, the switch indicator may be a link failure indicator, a node failure indicator, an Operation Administration and Maintenance (OAM) indicator, an operator command, or a recovery event.
In step S120, the node sets a rate limit of storm protection, which is of a first value, as a second value, and the second value is higher than the first value. In general, the first value is usually 10% in default. As a non-limiting example, the second value may be in a range of 50%-90%.
In step S130, the node performs a flush operation of a FDB in step S130.
It shall be noted that the method as noted above is not limited to the above sequence, and various other sequences may be applied therein. For example, step S120 may be performed after step S130, simultaneously with step S130, or during step S130.
By way of a non-limiting example, the method 100 may further include starting a timer when the rate limit is set as the second value, wherein the rate limit is set back to the first value when the timer expires.
For example, duration of the timer may be calculated based on some parameters or factors, such as the number of MACs needed to be learned, MAC learning speed, and protection switch time. Also, the duration may be configurable, for example by a system administrator during an initial configuration phase.
By way of a non-limiting example, the method 100 may further include locking setting of the rate limit from being manually operated before setting the rate limit as the second value (not shown), so as to prevent for example system administrator from changing the setting of the rate limit. Correspondingly, the method 100 may further include unlocking the setting of the rate limit after the rate limit is set back to the first value (not shown).
In the following, the present application will be described in detail by taking ERP as an example.
When a link failure occurs, for example a link 210 between ring node C and ring node D, ring node C and ring node D block ports 220 and 230 for the failed link 210, and send a switch indicator, i.e. a Ring-APS (R-APS) Signal Failure (SF) message, to indicate the link failure. The SF messages are circulated around the Ethernet ring 200 through a Ring APS channel (not shown). When RPL Owner node A and RPL Partner node F receive such a message, they unblock the ports 240 and 250 to a Ring Protection Link (RPL).
When a link failure is restored, for example, if the link failure between ring node C and ring node D in
With
At state 310, the Ethernet ring 200 operates in a normal state where no link failure has occurred. The Ring Protection Link (RPL) blocking is provided by R-APS channel or port blocking at both ends of the RPL.
At state 320, a single link failure occurs between ring node C and ring node D.
At state 330, ring node C detects the link failure event and after respecting the holdoff time, and blocks the failed R-APS channel or port. At this time, ring node C may change the storm protection setting, for example by setting the unknown-unicast rate limit of the storm protection from a default value to a higher value. Usually, the default value is 10%, and the higher value may be a value from the range of 50%-90%. Then, ring node C performs the FDB flush. There are similar actions occurring at ring node D.
By way of a non-limiting example, ring node C and ring node D may start a timer for a period, during which the unknown-unicast rate limit of the storm protection setting is being the higher value. For example, duration of the timer may be calculated based on a number of MACs needed to be learned, MAC learning speed, and protection switch time. Also, the duration may be configurable, for example by a system administrator during an initial configuration phase.
By way of a non-limiting example, ring node C and ring node D may lock setting of the rate limit from being manually operated, for example by system administrator, before setting the rate limit as the second value, so as to prevent the storm protection setting from being manually changed. Correspondingly, ring node C and ring node D may further unlock the setting of the rate limit after the rate limit is set back to the first value.
At state 340, ring node C and ring node D periodically send a R-APS Signal Failure (SF) message, on both ring ports (i.e., port 220 and port 230), as long as the single link failure persists.
At state 350, all Ethernet Ring Nodes receiving R-APS (SF) messages lock storm protection setting, set unknown-unicast rate limit to 50%-90%, start unknown-unicast-limit_tuning_timer and perform the FDB flush, respectively. For example, when RPL Owner Node A or RPL Partner Node F receives an R-APS (SF) message, they unblock respective end of the RPL, lock storm protection setting, set unknown-unicast rate limit to 50%-90%, start unknown-unicast-limit_tuning_timer and perform the FDB flush.
It should be noted that the sequence of setting unknown-unicast rate limit to 50%-90%, starting unknown-unicast-limit_tuning_timer and performing the FDB flush is not limited to that noted above. Instead, setting unknown-unicast rate limit to 50%-90% and starting unknown-unicast-limit_tuning_timer may be performed after, simultaneously with, or during the step of performing the FDB.
At state 360, all Ethernet Ring Nodes receiving a second R-APS (SF) message perform the FDB flush again due to the Node ID and BPR-based mechanism.
At state 370, Stable SF condition—R-APS (SF) messages keep circulating around the ring in period Further R-APS (SF) messages trigger no further action.
Then, all Ethernet Ring Nodes set unknown-unicast up limit back to a default value (10% for example) once unknown-unicast-limit_tuning_timer time out.
Furthermore, the storm protection setting would be unlocked after the unknown-unicast rate limit is set back to its default value such as 10%. It should be noted that the number range of 50%-90% here is just an example, and it may be any other percentage higher than the corresponding default value, such as 90%-100%.
It should be noted that although the exemplary method above is described for single link failure event, the method may be similarly applied for other link failure events or recovery events. For example, in case of a single link failure recovery event occurring between ring nodes C and D, ring nodes C and D may detect the recovery event, start the guard timer which prevents the reception of R-APS messages and send out R-APS (NR) messages. When RPL Owner node A receives this message, the WTR timer is started. Ring node D may receive the new R-APS (NR) message with higher Node ID (from ring node C) and unblock non-failed ring port. At expiration of the WTR timer, RPL Owner node A blocks the port to RPL, sends R-APS (NR, RB) messages, changes respective storm protection setting, and then flushes FDB. Ring node C receiving an R-APS (NR, RB) message unblocks the non-failed ring port and stops sending R-APS (NR) messages. On the other hand, when RPL Partner node F receives an R-APS (NR, RB) message, it blocks the port to RPL. In additional, ring nodes B and F change respective storm protection setting, and then flush FDB upon receiving an R-APS (NR, RB) message. After that, the ERP ring is back to the normal state.
In addition to the above link failure event (i.e., link failure indicator) and the recovery event, other switch indicators for which the method described above may be implemented, may include a node failure indicator, an Operation Administration and Maintenance OAM indicator, or an operator command.
The number of MAC addresses to be learned has obvious impact on traffic interruption time because of protection switch. If the number of MAC addresses to be learned comes to thousands, the recovery time is in seconds instead of million seconds. Taking a typical switch with default unknown-unicast rate limit (10%) as an example:
With the above method of the present application, still taking a typical switch with same setting but has the solution implemented as an example:
Consequently, by setting the limit rate of the storm protection as a higher value, the present application may reduce the amount of dropped packets during the MAC learning, thereby alleviating or eliminating impact of packet drop during the MAC learning and enabling the traffic interruption time to be smaller and come close to ERP switching time during the MAC learning.
In the following, the present application will be described in detail by taking MPLS-TP 1:1 line protection as an example.
As shown in
As shown in
It should be noted that the steps of changing the unknown-unicast rate limit of the storm protection setting and starting a timer may be performed after, simultaneously with, or during the FDB flush.
Although
As shown in
The detecting unit 610 is configured to detect a switch indicator in a network. By way of a non-limiting example, the switch indicator may be a link failure indicator, a node failure indicator, an Operation Administration and Maintenance OAM indicator, an operator command, or a recovery event.
The setting unit 620 is configured to set a rate limit of storm protection, which is of a first value, as a second value, the second value being higher than the first value. The first value is usually 10%. By way of a non-limiting example, the second value may be in a range of 50%-90%. By way of another non-limiting example, the second value may be in a range of 90%-100%.
The flushing unit 630 is configured to perform a flush operation of a Forwarding DataBase (FDB).
The timer 640 is configured to start when the setting unit 610 sets the rate limit as the second value.
By way of a non-limiting example, the setting unit 620 may be further configured to set the rate limit back to the first value when the timer 640 expires.
For example, duration of the timer 640 may be calculated based on a number of MAC needed to be learned, MAC learning speed, and protection switch time. Alternatively, the duration may be configurable.
The locking unit 650 is configured to lock setting of the rate limit from being manually operated, for example by system administrator, before the setting unit 610 sets the rate limit as the second value. The unlocking unit 660 is configured to unlock the setting of the rate limit after the setting 610 sets the rate limit back to the first value.
It should be noted that two or more different units in this disclosure may be logically or physically combined. For example, the locking unit 650 and the unlocking unit 660 may be combined as one unit.
According to this embodiment of the present application, the present application may reduce the amount of dropped packets during the MAC learning, thereby alleviating or eliminating impact of packet drop during the MAC learning and enabling the traffic interruption time to be smaller and come close to ERP switching time during the MAC learning.
Other arrangements of the present disclosure include software programs performing the steps and operations of the method embodiments, which are firstly generally described and then explained in detail. More specifically, a computer program product is such an embodiment, which includes a computer-readable medium with a computer program logic encoded thereon. The computer program logic provides corresponding operations to provide the above described network protection scheme when it is executed on a computing device. The computer program logic enables at least one processor of a computing system to perform the operations (the methods) of the embodiments of the present disclosure when it is executed on the at least one processor. Such arrangements of the present disclosure are typically provided as: software, codes, and/or other data structures provided or encoded on a computer-readable medium such as optical medium (e.g., CD-ROM), soft disk, or hard disk; or other mediums such as firmware or microcode on one or more ROM or RAM or PROM chips; or an Application Specific Integrated Circuit (ASIC); or downloadable software images and share database, etc., in one or more modules. The software, hardware, or such arrangements can be mounted on computing devices, such that one or more processors in the computing device can perform the technique described by the embodiments of the present disclosure. Software process operating in combination with e.g., a group of data communication devices or computing devices in other entities can also provide the nodes and host of the present disclosure. The nodes and host according to the present disclosure can also be distributed among a plurality of software processes on a plurality of data communication devices, or all software processes running on a group of mini specific computers, or all software processes running on a single computer.
There is little distinction left between hardware and software implementations of aspects of systems; the use of hardware or software is generally (but not always, in that in certain contexts the choice between hardware and software can become significant) a design choice representing cost vs. efficiency tradeoffs. There are various vehicles by which processes and/or systems and/or other technologies described herein can be effected (e.g., hardware, software, and/or firmware), and that the preferred vehicle will vary with the context in which the processes and/or systems and/or other technologies are deployed. For example, if an implementer determines that speed and accuracy are paramount, the implementer may opt for a mainly hardware and/or firmware vehicle; if flexibility is paramount, the implementer may opt for a mainly software implementation; or, yet again alternatively, the implementer may opt for some combination of hardware, software, and/or firmware.
The foregoing description gives only the embodiments of the present disclosure and is not intended to limit the present disclosure in any way. Thus, any modification, substitution, improvement or like made within the spirit and principle of the present disclosure should be encompassed by the scope of the present disclosure.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/CN2012/070460 | 1/17/2012 | WO | 00 | 6/13/2014 |