The present disclosure relates to the File Transfer Protocol (FTP) technology in the technical field of communications, particularly to a method and a device for FTP deadlock detection and self recovery.
In mobile communication equipment, data collected from base station equipment comprise performance data of the base station, detailed information data of each call, alarm data and baseband grasping frame data and so on; the collected data have features such as large volume, real-time property, continuity and so on. The base station is connected with an Operation Maintenance Centre (OMC) through the Ethernet, and an FTP with high data throughput is the first choice for data uploading. An Ethernet network environment is complex and FTP underlying protocol processing in a vxWorks real-time operation system produced by the Wind River System Inc. is unstable, therefore, such cases as close, write, read, select, abnormal deadlock or apparent death may suddenly occur in an FTP upload operation, which are characterized in that socket resources (a control connection socket and a data connection socket) cannot be released within operation time allowed by an operation system and exit this system scheduling, therefore, process scheduling of the real-time operation system goes out of control, a single board is reset, and robustness of the base station equipment is seriously affected; in view of this, it becomes very important to perform deadlock detection and deadlock recovery to the FTP upload operation.
With regard to deadlock detection of a process, the patent Method for detecting a deadlock on Semaphore (CN200510053781) describes whether a deadlock happens to semaphore is detected by the establishment of a list of relationships between processes and semaphore and according to whether a resource wait loop determination condition exists, so as to obtain the process corresponding to the semaphore to which a deadlock happens. The disadvantage of the patent is: a deadlock can not be determined until a resource wait loop appears; for an FTP upload operation, sudden apparent death of a process, which occurs due to network environment factors, cannot be detected.
The present disclosure provides a method and a device for FTP deadlock detection and self recovery, which can solve the problem of a sudden deadlock in an FTP upload task.
A method for FTP deadlock detection and self recovery in a communication system, which is provided by an embodiment of the present disclosure, includes the following steps:
In the abovementioned method, the step of determining by the daemon whether a deadlock occurs in the FTP upload task through the heartbeat detection mechanism may include:
In the abovementioned method, after determining that the deadlock occurs in the FTP upload task, the method may further include: resetting the heartbeat failure counter by the daemon, and returning to step A1 to initiate heartbeat detection in a next cycle, wherein a task priority to which the daemon belongs is higher than that of the FTP upload task, and the socket resources include a data connection socket, and/or a control connection socket.
Correspondingly, the present disclosure further provides a device for FTP deadlock detection and self recovery in a communication system, including: a heartbeat detection initiating module and a buffer queue managing module, wherein
In the abovementioned device, the heartbeat detection initiating module may include: a transmitting module and a determining module, wherein the transmitting module is configured to transmit a heartbeat detection message to the FTP upload task periodically; the determining module is configured to determine whether a response message of the heartbeat detection message is received within a certain period of time, if the response message is received, then reset a heartbeat detection timer, and notify the transmitting module to periodically transmit a heartbeat detection message to the FTP upload task, if the response message is not received, then add 1 to a heartbeat failure counter; and the determining module is further configured to determine whether a value of the heartbeat failure counter is greater than a preset value, if the value is greater than the preset value, then determine that the deadlock occurs in the FTP upload task, otherwise, periodically transmit a heartbeat detection message to the FTP upload task;
Correspondingly, the present disclosure further provides a system for FTP deadlock detection and self recovery, including: a daemon configured to determine that a deadlock occurs in an FTP upload task through a heartbeat detection mechanism, and then record socket resources used by an FTP upload task at a storage location in an assigned socket resource cycle queue, and start a process of ending the FTP upload task; and further configured to determine whether the socket resource cycle queue is full, if it is not full, then put socket used by the FTP upload task in the current deadlock into the socket resource cycle queue, otherwise, release earliest socket resources from the socket resource cycle queue, and put the resources used by the FTP upload task in the current deadlock into the socket resource cycle queue; an FTP upload task until configured to return a heartbeat response to the heartbeat detection of the daemon, acquire data from the demon, establish a connection the FTP and upload the acquired data to an FTP server;
In the abovementioned system, the daemon may include: a heartbeat detection initiating module configured to determine weather the deadlock occurs in the FTP upload task through the heartbeat detection mechanism; a buffer queue managing module configured to record the socket resources used by the FTP upload task at the storage location in the assigned socket resource cycle queue, and start the process of ending the FTP upload task; and further configured to determine whether the socket resource cycle queue is full, if it is not full, then put the socket resources used by the FTP upload task in the current deadlock into the socket resource cycle queue, if it is full, then release the earliest socket resources from the socket resource cycle queue, and put the socket resources used by the FTP upload task in the current deadlock into the socket resource cycle queue.
Compared with the prior art, embodiments of the present disclosure provide a method for detecting a deadlock by adopting a heartbeat detection mechanism, a method of releasing socket resources with delay from a socket resource cycle queue when a deadlock occurs, a self recovery method and the like. By adopting a heartbeat detection mechanism and a self recovery scheme in the embodiments of the present disclosure, problems, such as detection of a sudden deadlock in an FTP upload task, are solved, thereby abnormal resetting of a base station single board is decreased, this operation stability of base station equipment is improved, and the impact to system CPU, when a sudden situation appears, is decreased.
As shown in
After determining that the deadlock occurs in the FTP upload task, the daemon can reset the heartbeat failure counter, so as to initiate heartbeat detection in a next cycle.
It should be noted that, a task priority to which the daemon belongs is higher than that of the FTP upload task.
Correspondingly, the present disclosure further provides an embodiment of a system for FTP deadlock detection and self recovery; as shown in
Furthermore, the FTP upload task unit 22 comprises: a heartbeat ACK module, a socket resource reading and writing module with function of reading and storing when socket resources are recovered, and an FTP upload module; wherein the heartbeat ACK module sends direct feedback according to a heartbeat detection message initiated by a heartbeat; however, if an FTP upload task is being executed at the time, the heartbeat ACK module cannot respond to the heartbeat detection messages because it cannot be scheduled; each time initialization is carried out, the socket resource reading and writing module delays releasing according to earliest historical FTP deadlock information which is read by the socket resource cycle buffer queue managing module and socket information related to socket resources; the FTP upload task unit 22 reads data stored by the data receiving message interface, establishes a connection with the FTP server, and at the same time, stores the socket information related to socket resources and writes the socket information into a queue node assigned by the socket resource cycle buffer queue managing module.
Specifically, the daemon 21 is configured to determine whether a deadlock occurs in the FTP upload task through a heartbeat detection mechanism, if the deadlock occurs, the socket resources used by the FTP upload task are recorded at the storage location in the assigned socket resource cycle queue, and process of ending the FTP unload task is started; the daemon 21 is further configured to determine whether the socket resource cycle queue is full, if it is not full, socket resources used by the FTP upload task in the current deadlock is put into the socket resource cycle queue, if it is full, the earliest socket resources in the socket resource cycle queue are released, and the socket resources used by the FTP upload task in the current deadlock is put into the socket resource cycle queue.
It should be noted that, the daemon 21 is corresponding to a device for FTP deadlock detection and self recovery according to an embodiment of the present disclosure; the device mainly comprises: a heartbeat detection initiating module and a buffer queue managing module, wherein
In the abovementioned device, the heartbeat detection initiating module comprises: a transmitting module and a determining module, wherein the transmitting module is configured to transmit a heartbeat detection message to an FTP upload task periodically; the determining module is configured to determine whether an ACK of the heartbeat detection message is received within a certain period of time, if the ACK is received, then reset a heartbeat detection timer, and notify the transmitting module to periodically transmit a heartbeat detection message to the FTP upload task, if the ACK is not received, then add 1 to a heartbeat failure counter, and determine whether a value of the heartbeat failure counter is greater than a preset value, if the value is greater than the preset value, then determine the deadlock occurs in the FTP upload task, if the value is not greater than the preset value, the determining module periodically transmits a heartbeat detection message to the FTP upload task.
In the abovementioned device, the buffer queue managing module can be further configured to notify the determining module to reset the heartbeat failure counter after determining that the deadlock occurs in the FTP upload task.
Here, it should be explained that, the task priority to which the daemon belongs is higher than that of the FTP upload task.
The method, device and system for FTP deadlock detection and self recovery which are provided by the present disclosure are described briefly above. The technical solution of the present disclosure is described below with reference to specific embodiments.
As shown in
As shown in
It should be noted that, the FTP upload task is detected and monitored by a heartbeat detection message sent from the daemon in the whole life cycle all the time.
Compared with the prior art, the embodiments of the present disclosure introduce methods such as a method for detecting a deadlock through a heartbeat detection mechanism, a method for releasing socket resources with delay from a socket resource cycle queue when a deadlock occurs, a self recovery method, and the like. Furthermore, the heartbeat detection mechanism can monitor a deadlock, according to different deadlock determining policies (for example, modifying a value N), instead of taking presence of a semaphore waiting loop as the only basis for determination, thus not only improving the flexibility of coping with a sudden FTP deadlock of base station equipment, but also decreasing resetting times of a single board in abnormal situation and improving the reliability of the system. In addition, in the embodiments of the present disclosure, control connection socket resources and data connection socket resources are released with delay when a deadlock occurs, and the possibility of presence of a sudden abnormal situation is reduced, thereby ensuring the total amount of occupied socket resources in a base station system are as well as avoiding socket resources in use from overflowing; at the same time, abnormal data receiving and scheduling at a daemon, which is caused by an abnormal FTP upload task is prevented, thereby reducing the overstocking risk of all process messages of the task to which the daemon belongs, and playing a role of moderating the high utilization ratio of central processing unit (CPU) of a single board.
Obviously, various changes and modifications can be made to the present disclosure by those skilled in the art without departing from the spirit and scope of the present disclosure. Thus, if these modifications and equivalent replacements of the present disclosure are within the scope of the claims of the present disclosure and equal technology thereof, the present disclosure is also intended to include these changes and modifications.
Number | Date | Country | Kind |
---|---|---|---|
2009 1 0254305 | Dec 2009 | CN | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/CN2010/073194 | 5/25/2010 | WO | 00 | 5/29/2012 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/148882 | 12/29/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6609213 | Nguyen et al. | Aug 2003 | B1 |
6763479 | Hebert | Jul 2004 | B1 |
7010657 | Lowe et al. | Mar 2006 | B2 |
7076555 | Orman et al. | Jul 2006 | B1 |
7159234 | Murphy et al. | Jan 2007 | B1 |
7433943 | Ford | Oct 2008 | B1 |
7703077 | Rehof et al. | Apr 2010 | B2 |
7831686 | Tran et al. | Nov 2010 | B1 |
7962623 | Undery et al. | Jun 2011 | B2 |
8365018 | McIntosh et al. | Jan 2013 | B2 |
20020087912 | Kashyap | Jul 2002 | A1 |
20030204641 | Rehof et al. | Oct 2003 | A1 |
20040042412 | Fan | Mar 2004 | A1 |
20040044771 | Allred et al. | Mar 2004 | A1 |
20040085894 | Wang et al. | May 2004 | A1 |
20050021915 | Lowe et al. | Jan 2005 | A1 |
20050132030 | Hopen et al. | Jun 2005 | A1 |
20060159011 | Dalal et al. | Jul 2006 | A1 |
20060190948 | Burger et al. | Aug 2006 | A1 |
20060271681 | Apreutesei et al. | Nov 2006 | A1 |
20070005773 | Apreutesei et al. | Jan 2007 | A1 |
20070189316 | Qing et al. | Aug 2007 | A1 |
20070288645 | Kass | Dec 2007 | A1 |
20080025226 | Mogul et al. | Jan 2008 | A1 |
20080104252 | Henniger | May 2008 | A1 |
20080129464 | Frey et al. | Jun 2008 | A1 |
20080133712 | Friedman et al. | Jun 2008 | A1 |
20080313339 | Faucher et al. | Dec 2008 | A1 |
20090037998 | Adhya et al. | Feb 2009 | A1 |
20100149996 | Sun | Jun 2010 | A1 |
Number | Date | Country |
---|---|---|
1831779 | Sep 2006 | CN |
1904852 | Jan 2007 | CN |
101272402 | Sep 2008 | CN |
Entry |
---|
International Search Report in international application No. PCT/CN2010/073194, mailed on Sep. 16, 2010. |
English Translation of the Written Opinion of the International Search Authority in international application No. PCT/CN2010/073194, mailed on Sep. 16, 2010. |
Number | Date | Country | |
---|---|---|---|
20120240119 A1 | Sep 2012 | US |