Methods and Apparatus for Fast Ethernet Link Switchover in the Event of a Link Failure

Information

  • Patent Application
  • 20070274204
  • Publication Number
    20070274204
  • Date Filed
    May 26, 2006
    18 years ago
  • Date Published
    November 29, 2007
    17 years ago
Abstract
An apparatus for fast failure switch over in an ETHERNET switch includes redundant switch (trunk) ports (a main and a backup) and hardware and software logic for redirecting traffic to the backup port when the main port (or the link associated with it) fails. The switchover is immediate and is based on the content of a local status register which indicates the port (link) status. Thus, frames addressed to the dead port are redirected to the backup port and few frames are lost. The STP function may proceed concurrently and eventually no more frames are addressed to the dead port.
Description

BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a high level block diagram of a provider premises ETHERNET switch incorporating the invention;



FIG. 2 is a high level block diagram illustrating the overall implementation of the invention in the switch of FIG. 1;



FIG. 3 is a high level block diagram illustrating the cross-over switch as a pair of multiplexers;



FIG. 4 is a state diagram illustrating the operation of one of the multiplexers;



FIG. 5 is a state diagram illustrating the operation of the other multiplexer;



FIG. 6 is a high level flow chart illustrating the software logic of FIGS. 1 and 2;



FIG. 7 is a high level flow chart illustrating the monitor link status function of FIG. 6;



FIG. 8 is a truth table illustrating the logic of the protection switching function



FIG. 9 is a high level block diagram of a subscriber premises ETHERNET switch incorporating the invention;



FIG. 10 is a high level block diagram illustrating the overall implementation of the invention in the switch of FIG. 9;



FIG. 11 is a high level flow chart illustrating the software logic of FIGS. 9 and 10; and



FIG. 12 is a high level flow chart illustrating the database switching function of FIG. 11.





DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Turning now to FIG. 1, a provider premises switch 10 incorporating the invention includes a plurality of access ports 12 (Ports 0 through m) which are coupled to user network interface switches (not shown) at subscriber premises. The access ports 12 are coupled to firmware logic 14 which is coupled to a forwarding information database (FIB) 16 and a class database (class DB) 18. The firmware logic 14 is also coupled to a plurality of CoS queues 20 which are coupled to schedulers 22, 24. The outputs of the schedulers are coupled via hardware logic 26 to switch ports 28, 30. The hardware logic 26 is partly controlled by software logic 32. According to the presently preferred embodiment, up to twenty-four access ports 12 are provided, each having a bandwidth of 10 Mbps or 100 Mbps and each of the two switch ports 28, 30 has a bandwidth 100 Mbps or 1,000 Mbps.


In operation, the firmware logic 14 receives an ETHERNET frame from one of the ports 12, examines the frame header and looks up information in the FIB 16 and class DB 18 to determine to which of the queues 20 the frame should be sent. The schedulers 22, 24 dequeue the frames from the queues 20 according to priority determined by CoS. The hardware logic 26 receives the frames from the schedulers and passes them to the switch ports 28, 30.


Turning now to FIG. 2, details of the hardware logic 26 are shown in more detail. The logic 26 includes two rate adaptation buffers 34, 36 (one for each switch port, each preferably sized to accommodate one ETHERNET frame), a cross-connect switch 38, and two switch port status registers 40, 42. As illustrated, the switch ports 28, 30 are coupled to an ETHERNET PHY device 44. The software logic 32 partially controls the cross-connect switch 38 based on the content of the status registers 40, 42. The contents of those registers are determined by reading registers on the PHY device 44.


As seen in FIG. 3, the cross-connect switch 38 can be implemented as two multiplexers 38a, 38b and two state machines 32a, 32b. The operation of the state machines is described in FIGS. 4 and 5.


Turning now to FIG. 4, starting at 50, the state machine 32a is initialized (S1). At 52 it is determined whether Port 0 (28 in FIGS. 1 and 2) is ready. If Port 0 is not ready, the state machine returns to S1. If Port 0 is ready as determined at 52, FIFO 0 is examined at 54 to determine whether a frame is available for transmission. If FIFO 0 is not empty, state 2 (S2) is entered at 56. In state 2, FIFO 0 is read until the end of the frame is detected at 58 (EOP=end of packet). If it was determined at 54 that FIFO 0 is empty, it is then determined at 60 whether alternate switching is turned on for Port 0, i.e. whether multiplexer 38a (FIG. 3) should be switched to allow a frame from FIFO 36 to exit through Port 0 rather than Port 1. Alternate switching is turned on and off by software logic (32 in FIGS. 1 and 2) as explained in detail with reference to FIGS. 6-8. If alternate switching is turned on, the content of FIFO 1 is determined at 62. If FIFO 1 is not empty, state three (S3) is entered at 64 where FIFO 1 is read until the end of the FRAME is detected at 66, then the machine returns to S1.



FIG. 5 illustrates the operation of state machine 32b. Starting at state one (S1) 70, the machine initializes and checks at 72 to see if Port 1 is ready. If Port 1 is ready, it is determined at 74 whether alternate switching is turned on for Port 1. Alternate switching is turned on and off by software logic (32 in FIGS. 1 and 2) as explained in detail with reference to FIGS. 6-8. If it is turned on and FIFO 0 is not empty as determined at 76, state two (S2) is entered at 78. In state two (S2), FIFO 0 is read until the end of the frame as determined at 80, the machine returns to (S1) 70. If it was determined at 74 that alternate switching was not turned on or at 76 that FIFO 0 was empty, the status of FIFO 1 is determined at 82. If FIFO 1 is not empty, state three (S3) is entered at 78. In state three FIFO 1 is read until the end of the frame as determined at 86. When FIFO 1 is empty as determined either at 82 or 86, the machine returns to (S1) 70.


Turning now to FIG. 6, the software logic (32 in FIGS. 1 and 2) runs on a host processor (not shown) and is based on an operating system timer, preferably set to 25 ms. When the timer counts to zero as determined at 100 in FIG. 6, the monitor link status function is performed at 102. This function is described in more detail in FIG. 7. The monitor link status function updates the link statuses at 104 which are stored with variables at 106. The protection switching function 108 examines the link statuses from the variables 106 and sets the Working_Link variable at 110 which is stored in variables 106. Using the Working_Link, revertive setting, and a restore timer value, the protection switching function sets a switch or no switch flag according to the truth table of FIG. 8. This flag is used to make the determinations 60 and 74 in FIGS. 4 and 5 respectively. As indicated at 112, the timer is watched again at 100 so that these functions are performed every 25 ms.



FIG. 7 illustrates how the monitor link status function sets the link status registers. Starting with phyID set to the base address (e.g. Port 0) variables are initialized at 120 based on constraints 122. It will be appreciated that a 32-bit register having regAddr=1 on PHY device having PhyID will be read and two items of information will be obtained by applying the masks listed in the constraints. Reading of the register takes place at 124 using two commands. “Set Command Register” reads the register on the PHY device and puts the contents in a local register called “Read Data Register”. The “Get Read Data Register (*regStatus) moves the contents to a software variable called “regStatus”. The Link Status is set at 126 by applying the two masks to the contents of the variable regStatus. It will be appreciated that the link failure could be due to a local hardware problem or due to a remote fault or both. At 128 it is determined whether both ports have been checked. If not, regStatus is set back to zero and phyID is incremented at 130 and the Port status register for Port 1 is read at 124 and Link status set for Port 1 at 126. It will then be determined at 128 that the status for both ports has been set and the function 102 will have completed as indicated at 132.


Those skilled in the art will appreciate that at the time of a link failure, there may be many frames residing in the CoS queues (20 in FIG. 1) which are destined for the failed link. According to the prior art practices, these frames will continue to flow toward the failed link until the spanning tree protocol changes the network topology and entries in the data base(s) are changed to direct frames to the backup link. Only after the data base(s) is updated will frames be sent to the backup link which otherwise would have been sent to the failed link. Since this may take some time, a serious interruption in service will be noticed, particularly in services such as video on demand, video conferencing, and voice over IP. According to the present invention, however, upon detecting a failure, the cross-over switch is activated in a matter of microseconds and all of the frames destined for the failed link are now automatically re-routed by hardware to the backup link and few frames are lost.



FIG. 8 illustrates the switch function action based on which link is the working link, whether reverting is allowed, whether restore time has elapsed (whether the failed link has been restored) and the link status. When Port 0 is the working link, a switch to Port 1 is made only when Link 0 status is OFF and Link 1 status is ON. There is no switch made at any other time. When Port 1 is the working link, a switch back to Port 0 (reversion) is made only when Port 0 becomes active after the restore time has elapsed.


Turning now to FIG. 9, a subscriber premises switch 210 incorporating the invention includes a plurality of access ports 212 (Ports 0 through m) which are coupled to user network devices at subscriber premises. The access ports 212 are coupled to firmware logic 214 which is coupled to an FIB 216 and class DB 218. The firmware logic 214 is also coupled to a plurality of CoS queues 220 which are coupled to schedulers 222, 224. The outputs of the schedulers are coupled via hardware logic 226 to uplink ports 228, 230. The hardware logic 226 is partly controlled by software logic 232. According to the presently preferred embodiment, access ports 12 each have a bandwidth of 10 Mbps or 100 Mbps and each of the two uplink ports 228, 30 has a bandwidth 10 Mbps or 100 Mbps.


In operation, the firmware logic 214 receives an ETHERNET frame from one of the ports 212, examines the frame header and looks up information in the FIB 216 and class DB 218 to determine to which of the queues 220 the frame should be sent. The schedulers 222, 224 dequeue the frames from the queues 220 according to priority determined by CoS. The hardware logic 226 receives the frames from the schedulers and passes them to the uplink ports 228, 230. According to an embodiment of the invention, a mirror FIB database 216′ is provided for the FIB database 216 and/or a mirror class database 218′ is provided for the class database 218.


Turning now to FIG. 10, the operation of the hardware logic 226 is shown in more detail. The logic 226 includes two rate adaptation buffers 234, 236 (one for each uplink port, each preferably sized to accommodate one ETHERNET frame), a cross connect switch 238, and two switch port status registers 240, 242. As illustrated, the switch ports 228, 230 are coupled to an ETHERNET PHY device 244. The software logic 232 partially controls the cross connect switch 238 based on the content of the status registers 240, 242. The contents of those registers are determined by reading registers on the PHY device 244. The switch 210 and associate logic operates in substantially the same manner as the switch 10 described above except for the mirror databases. The overall switching function is shown in FIG. 11 and the database switching function is shown in FIG. 12.


Referring now to FIG. 11 (which is similar to FIG. 6), the software logic (232 in FIGS. 9 and 10) runs on a host processor (not shown) and is based on an operating system timer, preferably set to 25 ms. When the timer counts to zero as determined at 300 in FIG. 11, the monitor link status function is performed at 302. This function is described in more detail in FIG. 7. The monitor link status function updates the link statuses at 304 which are stored with variables at 306. The protection switching function 308 examines the link statuses from the variables 306 and sets the Working_Link variable at 310 which is stored in variables 306. Using the Working_Link, revertive setting, and OS timer value, the protection switching function sets a switch or no switch flag according to the truth table of FIG. 8. This flag is used to make the determinations 60 and 74 in FIGS. 4 and 5 respectively. In addition, a database switching function 309 is provided which sets the working database(s) and working queue set at 311 based in part on the variables at 307. As indicated at 312, the timer is watched again at 300 so that these functions are performed every 25 ms.


Referring now to FIG. 12, queue status is initialized at 320 and the data base is switched to the mirror at 322. The status of the active queues is determined at 324 and it is determined at 326 whether the queues are all empty. If not, after a 10 ms wait at 328 the queues are checked again. If the queues are all empty as determined at 326, the schedulers are switched to working link queues at 328 and the function is complete at 330. Thus, the customer equipment keeps using the old database entries (from the mirrors) and the queues for the failed port (but redirected to the backup port) while the STP updates the main databases, then switches to the queues for the failed port and switches back to the main databases and makes new mirrors.


There have been described and illustrated herein several embodiments of methods and apparatus for fast ETHERNET switchover in the event of a link failure. While particular embodiments of the invention have been described, it is not intended that the invention be limited thereto, as it is intended that the invention be as broad in scope as the art will allow and that the specification be read likewise. Thus, while a particular number of ports and queues have been disclosed, it will be appreciated that different numbers could be used as well. In addition, while particular registers have been disclosed, it will be understood that in different implementations, different registers might be used. Furthermore, while particular switching circuits, state machines and software has been disclosed it will be understood that other circuits, state machines and software may achieve the same functions. It will therefore be appreciated by those skilled in the art that yet other modifications could be made to the provided invention without deviating from its spirit and scope as claimed.

Claims
  • 1. An apparatus for fast failure switch over in an ETHERNET switch, comprising: a main upstream port;a secondary upstream port;said ports adapted to be coupled to an ETHERNET PHY device which includes a register indicating the link status of the upstream ports;a first queue associated with the main upstream port;a second queue associated with the secondary upstream port;a switch coupled between said queues and said ports and operable to direct the outputs of the queues to one or both ports; andlogic arranged to accept input from the register and operate said switch to redirect the output of the first queue to the secondary upstream port when the link status of the main upstream port indicates that the link is down.
  • 2. The apparatus according to claim 1, wherein: said logic is arranged to operate said switch to redirect the output of the second queue to the main upstream port when the link status of the secondary upstream port indicates that the link is down.
  • 3. The apparatus according to claim 1, wherein: said logic is a combination of hardware and software.
  • 4. The apparatus according to claim 3, wherein: said hardware includes a state machine.
  • 5. The apparatus according to claim 3, wherein: said software includes a timer.
  • 6. The apparatus according to claim 5, wherein: the timer has a default value of 25 milliseconds.
  • 7. The apparatus according to claim 1, wherein: said queues are each sized to accommodate one ETHERNET frame.
  • 8. The apparatus according to claim 1, wherein: said switch comprises a pair of multiplexers each having two inputs and an output, each input of each multiplexer being coupled to an output of one of said queues and each output of said multiplexer being coupled to one of said ports.
  • 9. The apparatus according to claim 8, wherein: each multiplexer has a select input which is coupled to said logic.
  • 10. The apparatus according to claim 9, wherein: said logic includes two state machines, each controlling one of said multiplexers.
  • 11. A method for fast failure switch over in an ETHERNET switch, comprising: coupling a main upstream port to a first ETHERNET link;coupling a secondary upstream port to a second ETHERNET link;monitoring the status of the first and second ETHERNET links;associating a first queue with the main upstream port;associating a second queue with the secondary upstream port; andredirecting the output of the first queue to the secondary upstream port when the link status of the main upstream port indicates that the first ETHERNET link is down.
  • 12. The method according to claim 11, further comprising: redirecting the output of the second queue to the main upstream port when the link status of the second ETHERNET link indicates that the link is down.
  • 13. An apparatus for fast failure switch over in an ETHERNET switch comprising: a main upstream port;a secondary upstream port;said ports adapted to be coupled to an ETHERNET PHY device which includes a register indicating the link status of the main upstream port;a queue;a switch coupled between said queue and said ports and operable to direct the output of said queue to one of said ports; andlogic arranged to accept input from the register and operate said switch to redirect the output of the queue to the secondary upstream port when the link status of the main upstream port indicates that the link is down.
  • 14. The apparatus according to claim 13, wherein: said logic is a combination of hardware and software.
  • 15. The apparatus according to claim 14, wherein: said hardware includes a state machine.
  • 16. The apparatus according to claim 14, wherein: said software includes a timer.
  • 17. The apparatus according to claim 16, wherein: the timer has a default value of 25 milliseconds.
  • 18. The apparatus according to claim 13, wherein: said queue is sized to accommodate one ETHERNET frame.
  • 19. A method for fast failure switch over in an ETHERNET switch, comprising: coupling a main upstream port to a first ETHERNET link;coupling a secondary upstream port to a second ETHERNET link;monitoring the status of the first and second ETHERNET links; andredirecting the output of the first queue to the secondary upstream port when the link status of the main upstream port indicates that the first ETHERNET link is down.