The present invention relates generally to communication devices, and more particularly to automatically detecting and resetting a cable modem termination system upon detection of a lock-up condition.
Cable data systems are used to allow cable TV subscribers to use the Hybrid-Fiber-Coax network as a communication link between their home networks and the Internet. As a result, computer information (Internet Protocol packets) can be transmitted across the Hybrid-Fiber-Coax network between home computers and the Internet. The DOCSIS specification (defined by CableLabs) specifies the set of protocols that must be used to effect a data transfer across the Hybrid-Fiber-Coax network. Two fundamental pieces of equipment permit this data transfer: a cable modem (CM) which is positioned in the subscriber's home, and a Cable Modem Termination System (CMTS) which is positioned in the head end of the cable TV company.
In addition to data traffic, subscribers are more and more obtaining telephony voice services over networks other than the traditional public switched telephony network (“PSTN”). A multiple services operator (“MSO”) may provide such telephony services, in addition to data over cable service via DOCSIS. For example, CableLabs has established the PacketCable standard for providing telephony services over cable. A subscriber typically has a device that includes a DOCSIS cable modem for transmitting and receiving data and a media terminal adaptor (“MTA”) for processing voice traffic for transmission and reception over cable. The MTA, or embedded MTA (“EMTA”) known in the art, typically provides signals received from the cable network to the plain old telephone service (“POTS”) network in a user's home or office.
As with most complex electrical systems and equipment, a cable modem termination system is prone to becoming inoperable for a variety of reasons. For example, the card, or blade as referred to in the art, located at the CMTS that subscriber devices are coupled to may freeze, or lock-up, and cease operating normally. A customer recognizes this because service to his or her subscriber device stops working. If the subscriber is browsing the internet or sending e-mail, web pages may not be accessible or e-mail messages may not be delivered. If the subscriber is using a cable modem EMTA to carry telephony calls, the call may be dropped.
When a subscriber/user loses service, he or she does not know what caused the loss. However, a variety of scenarios can cause a freeze condition. For example, noise induced into an upstream channel, which is typically more susceptible to noise that downstream channels, can cause unrecognized information to arrive at the CAM, thus causing it to lock-up, or freeze. Or, if a user device, such as a computer, causes unrecognized information to be sent to the CAM, the CAM can freeze.
When a subscriber notices that service is not available, the subscriber may call the service provider using a telephone device and complain. This process may involve waiting for an available service representative. Furthermore, if the customer receives data, voice and video service from the same provider, a call to the provider may not be possible, since the call traffic would be directed to the locked-up CMTS blade.
Assuming that the customer/user/subscriber reaches a service representative, the service representative may check to determine where the problem lies. If it is determined that a blade, or cable access module (“CAM”) card, is not functioning, the service provider personnel may remotely send a message to the CMTS that resets the CAM card. Resetting typically involves removing power to the ‘frozen’ card and rebooting it. When this occurs, the user (or multiple users, since typically multiple users are coupled to a single CAM) waits until the CAM reboot process completes, which may take a few minutes.
Thus, there is a need in the art for a method and system that automatically detects when a CMTS blade is frozen, locked-up or otherwise inoperable. There is also a need for a method and system for automatically correcting the inoperable condition. In addition, there is a need for a method and system for providing the minimum corrective action needed to correct the inoperable condition so as to reduce time to operability.
As a preliminary matter, it will be readily understood by those persons skilled in the art that the present invention is susceptible of broad utility and application. Many methods, embodiments and adaptations of the present invention other than those herein described, as well as many variations, modifications, and equivalent arrangements, will be apparent from or reasonably suggested by the present invention and the following description thereof, without departing from the substance or scope of the present invention. It will be appreciated that the routines described herein may be implemented as executable software loaded an operating and/or operating in and/or on a broadband central device, such as a CMTS, at a service provider's head end location. In addition, the routines described herein may also be implemented in hardware or firmware.
Accordingly, while the present invention has been described herein in detail in relation to preferred embodiments, it is to be understood that this disclosure is only illustrative and exemplary of the present invention and is made merely for the purposes of providing a full and enabling disclosure of the invention. This disclosure is not intended nor is to be construed to limit the present invention or otherwise to exclude other embodiments, adaptations, variations, modifications and equivalent arrangements, the present invention being limited only by the claims appended hereto and the equivalents thereof.
Turning now to
Turning now to
Turning now to
At step 310 a timer counts down for a period equal to D. When the timer has counted from D to zero (or from zero to D), a call at step 315 to subroutine 400 is made. Subroutine 400 will be described in detail later. When control returns from subroutine 400, a new timer count is initialized and control returns to step 310. It will be appreciated that routine 300, subroutine 400 and other subroutines herein described may be implemented in the software programs that control and operate a communications network central device, such as, for example, a CMTS at a MSO-operated head end. However, other embodiments may include human performance of some of the steps herein described, for example, of corresponding steps in some aspects.
Turning now to
At step 410, a determination is made whether all of four criteria are true. If not, process 400 advances to step 415 and control of the process that monitors and determines whether to take action to correct a lockup condition of an interface module returns to step 325 shown in
If E=True, and if a downstream channel coupled to the broadband interface module under evaluation is in service, and if at least one upstream channel on the same interface module is in service and if the module is not in an overload condition, method 400 advance to step 420. At step 420, a determination is made whether an upstream threshold variable THflag, which has a default value of false, has been set to true. If THflag evaluates as true at step 420, the number of cable modems that are coupled and registered (a value ‘known’ to the CMTS operating system) to the upstream channel under evaluation is saved at step 425. The signal to noise ratio of the upstream channel under evaluation is also stored at step 425. It will be appreciated that these values may be stored to a memory coupled to the state machine or other device that performs the steps described herein.
After the count of the number of registered cable modems on the upstream channel under evaluation has been stored, and the signal to noise ratio of the channel under evaluation has been stored, a determination is made at step 430 whether the number of cable modems operably coupled, i.e., registered, to the upstream channel under evaluation is zero. If zero, then the determination is made at step 435 whether an amount of data greater than zero has passed from a cable modem to the interface module, a condition that may occur if a modem is unable to register (the DHCP server is inoperable, for example) but the upstream channel at the broadband interface module may be operational. This determination would be based on a information acquired since the previous call to routine 400 was at step 315 shown in
Returning to description of action at step 420, if THflag is set to true, a determination is made at step 455 whether the number of cable modems registered with the interface module is greater than the value TH. If the number of cable modems registered with the interface module does not exceed the threshold value TH, then the routine 400 advances to step 415 and returns to step 325 of method 300 shown in
If the number of cable modems registered with the interface module under evaluation is determined at step 455 to be greater than TH then THflag is set to equal true at step 465. Also at step 465 an upstream channel round robin counter (“UpRRC”) is set to zero and method 400 continues on to step 430.
If the number of cable modems registered is not equal to zero, as determined at step 430, then a PING process is initialized at step 470. At step 470, a variable named PINGstat is set to zero and a PING message is sent to a group of cable modems coupled to the upstream channel under evaluation. A PING message typically includes a message sent to a device requesting that the device return an acknowledgement message to the device that sent the PING message. An acknowledgement indicates to the sending device that the receiving device is operable. A typical initial group of cable modems includes one or two cable modems, although a group could comprise three or more cable modems. The round robin counter is used to sequentially send PING messages to the modems in the group. Next, Ping function subroutine 500 is called.
Turning now to
The passage of some bytes indicates that the upstream channel is operational. PINGstat is reset to 0 at step 540, at which step current and previous bandwidth usage values are store as well as current and previous SNR values. Subroutine 500 returns to step 325 shown in
Continuing with discussion of
Turning now to
At step 610, a determination is made whether a Boolean variable to enable recovery is set to ‘enable.’ It will be appreciated that this is a different variable from the ‘E’ variable that is initialized at step 305 in
Returning to the description of the determination made at step 610, as shown in
If at step 625 a determination is made that the recovery count value is not equal to one, routine 600 advances to step 630. At step 630, a determination is made whether recovery count value is equal to two. If so, the condition associated with either steps 450 or 550 is logged at step 635, as discussed above, and a command is caused to be generated and executed by an on-board interface module processor/controller to reset the PHY associated with the upstream channel under evaluation. Then, routine 600 returns to step 325 shown in
If a determination is made at step 630 that the recovery count value is not 2, then a determination is made at step 645 whether the recovery count value is equal to three. If so, the condition associated with either steps 450 or 550 is logged at step 650, as discussed above, and a command is caused to be generated and executed by an on-board interface module processor/controller to reset both upstream and downstream MACs of the interface module having the upstream channel being evaluated. Then, routine 600 returns to step 325 shown in
If a determination is made at step 645 that the recovery count value is not equal to three, then a determination is made at step 660 whether the recovery count value is equal to four. If so, the condition associated with either step 450 or 550 is logged at step 665, as discussed above, and a command is caused to be generated and executed by an on-board interface module processor/controller to reset the entire interface module associated with the upstream channel under evaluation. Then, routine 600 returns to step 325 shown in
Thus, escalating maintenance routine 600 escalatingly applies corrective action to the interface module based on recovery count value; the higher the recovery count variable value, the higher the more drastic the reset action the routine causes to occur.
In addition to causing a reset of the interface card at step 665, the command generated at step 665 may also include an instruction that all cable modems coupled to the CAM card in question be decoupled there from and coupled to a spare CAM card of the CMTS 10 shown in
After the instruction to reset the CAM card interface module and swap cable modems to a spare CAM (if this feature is used) is generated, method 600 advances to step 670 and returns to step 325 shown in
These and many other objects and advantages will be readily apparent to one skilled in the art from the foregoing specification when read in conjunction with the appended drawings. It is to be understood that the embodiments herein illustrated are examples only, and that the scope of the invention is to be defined solely by the claims when accorded a full range of equivalents.
This application priority under 35 U.S.C. 119(e) to U.S. provisional patent application No. 60/713,539 entitled “Method and system for detecting lock-ups in a cable modem termination system communication network,” which was filed Sep. 1, 2005, and is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
60713539 | Sep 2005 | US |