The present disclosure relates to the File Transfer Protocol (FTP) technology in the technical field of communications, particularly to a method and a device for FTP deadlock detection and self recovery.
In mobile communication equipment, data collected from base station equipment comprise performance data of the base station, detailed information data of each call, alarm data and baseband grasping frame data and so on; the collected data have features such as large volume, real-time property, continuity and so on. The base station is connected with an Operation Maintenance Centre (OMC) through the Ethernet, and an FTP with high data throughput is the first choice for data uploading. An Ethernet network environment is complex and FTP underlying protocol processing in a vxWorks real-time operation system produced by the Wind River System Inc. is unstable, therefore, such cases as close, write, read, select, abnormal deadlock or apparent death may suddenly occur in an FTP upload operation, which are characterized in that socket resources (a control connection socket and a data connection socket) cannot be released within operation time allowed by an operation system and exit this system scheduling, therefore, process scheduling of the real-time operation system goes out of control, a single board is reset, and robustness of the base station equipment is seriously affected; in view of this, it becomes very important to perform deadlock detection and deadlock recovery to the FTP upload operation.
With regard to deadlock detection of a process, the patent Method for detecting a deadlock on Semaphore (CN200510053781) describes whether a deadlock happens to semaphore is detected by the establishment of a list of relationships between processes and semaphore and according to whether a resource wait loop determination condition exists, so as to obtain the process corresponding to the semaphore to which a deadlock happens. The disadvantage of the patent is: a deadlock can not be determined until a resource wait loop appears; for an FTP upload operation, sudden apparent death of a process, which occurs due to network environment factors, cannot be detected.
The present disclosure provides a method and a device for FTP deadlock detection and self recovery, which can solve the problem of a sudden deadlock in an FTP upload operation.
A method for FTP deadlock detection and self recovery in a communication system, which is provided by an embodiment of the present disclosure, includes the following steps:
In the abovementioned method, the step of determining by the daemon whether a deadlock occurs in the FTP upload task through the heartbeat detection mechanism may include:
In the abovementioned method, after determining that the deadlock occurs in the FTP upload task, the method may further include: resetting the heartbeat failure counter by the daemon, and returning to step A1 to initiate heartbeat detection in a next cycle, wherein a task priority to which the daemon belongs is higher than that of the FTP upload task, and the socket resources include a data connection socket, and/or a control connection socket.
Correspondingly, the present disclosure further provides a device for FTP deadlock detection and self recovery in a communication system, including: a heartbeat detection initiating module and a buffer queue managing module, wherein
In the abovementioned device, the heartbeat detection initiating module may include: a transmitting module and a determining module, wherein the transmitting module is configured to transmit a heartbeat detection message to the FTP upload task periodically; the determining module is configured to determine whether a response message of the heartbeat detection message is received within a certain period of time, if the response message is received, then reset a heartbeat detection timer, and notify the transmitting module to periodically transmit a heartbeat detection message to the FTP upload task, if the response message is not received, then add 1 to a heartbeat failure counter; and the determining module is further configured to determine whether a value of the heartbeat failure counter is greater than a preset value, if the value is greater than the preset value, then determine that the deadlock occurs in the FTP upload task, otherwise, periodically transmit a heartbeat detection message to the FTP upload task;
Correspondingly, the present disclosure further provides a system for FTP deadlock detection and self recovery, including: a daemon configured to determine that a deadlock occurs in an FTP upload task through a heartbeat detection mechanism, and then record socket resources used by an FTP upload operation at a storage location in an assigned socket resource cycle queue, and start an FTP upload task end process; and further configured to determine whether the socket resource cycle queue is full, if it is not full, then put socket resource information occupied by the current deadlock into the socket resource cycle queue, otherwise, release earliest socket resources from the socket resource cycle queue, and put the socket resource information occupied by the current deadlock into the socket resource cycle queue; an FTP upload task unit configured to return a heartbeat response in response to the heartbeat detection of the daemon, acquire data from the daemon, establish a connection with the FTP and upload the acquired data to an FTP server;
In the abovementioned system, the daemon may include: a heartbeat detection initiating module configured to determine whether the deadlock occurs in the FTP upload task through the heartbeat detection mechanism; a buffer queue managing module configured to record the socket resources used by the FTP upload operation at the storage location in the assigned socket resource cycle queue, and start the FTP upload task end process; and further configured to determine whether the socket resource cycle queue is full, if it is not full, then put the socket resource information occupied by the current deadlock into the socket resource cycle queue, if it is full, then release the earliest socket resources from the socket resource cycle queue, and put the socket resource information occupied by the current deadlock into the socket resource cycle queue.
Compared with the prior art, embodiments of the present disclosure provide a method for detecting a deadlock by adopting a heartbeat detection mechanism, a method of releasing socket resources with delay from a socket resource cycle queue when a deadlock occurs, a self recovery method and the like. By adopting a heartbeat detection mechanism and a self recovery scheme in the embodiments of the present disclosure, problems, such as detection of a sudden deadlock in an FTP upload operation, are solved, thereby abnormal resetting of a base station single board is decreased, the operation stability of base station equipment is improved, and the impact is to system CPU, when a sudden situation appears, is decreased.
As shown in
After determining that the deadlock occurs in the FTP upload task, the daemon can reset the heartbeat failure counter, so as to initiate heartbeat detection in a next cycle.
It should be noted that, a task priority to which the daemon belongs is higher than that of the FTP upload task.
Correspondingly, the present disclosure further provides an embodiment of a system for FTP deadlock detection and self recovery; as shown in
Furthermore, the FTP upload task unit 22 comprises: a heartbeat ACK module, a socket resource reading and writing module with function of reading and storing when socket resources are recovered, and an FTP upload module; wherein the heartbeat ACK module sends direct feedback according to a heartbeat detection message initiated by a heartbeat; however, if an FTP upload operation is being executed at the time, the heartbeat ACK module cannot respond to the heartbeat detection message because it cannot be scheduled; each time initialization is carried out, the socket resource reading and writing module delays releasing according to earliest historical FTP deadlock information which is read by the socket resource cycle buffer queue managing module and socket information related to socket resources; the FTP upload task unit 22 reads data stored by the data receiving message interface, establishes a connection with the FTP server, and at the same time, stores the socket information related to socket resources and writes the socket information into a queue node assigned by the socket resource cycle buffer queue managing module.
Specifically, the daemon 21 is configured to determine whether a deadlock occurs in the FTP upload task through a heartbeat detection mechanism, if the deadlock occurs, the socket resources used by the FTP upload operation are recorded at the storage location in the assigned socket resource cycle queue, and an FTP upload task end process is started; the daemon 21 is further configured to determine whether the socket resource cycle queue is full, if it is not full, socket resource information occupied by the current deadlock is put into the socket resource cycle queue, if it is full, the earliest socket resources in the socket resource cycle queue are released, and the socket resource information occupied by the current deadlock is put into the socket resource cycle queue.
It should be noted that, the daemon 21 is corresponding to a device for FTP deadlock detection and self recovery according to an embodiment of the present disclosure; the device mainly comprises: a heartbeat detection initiating module and a buffer queue managing module, wherein
In the abovementioned device, the heartbeat detection initiating module comprises: a transmitting module and a determining module, wherein the transmitting module is configured to transmit a heartbeat detection message to an FTP upload task periodically; the determining module is configured to determine whether an ACK of the heartbeat detection message is received within a certain period of time, if the ACK is received, then reset a heartbeat detection timer, and notify the transmitting module to periodically transmit a heartbeat detection message to the FTP upload task, if the ACK is not received, then add 1 to a heartbeat failure counter, and determine whether a value of the heartbeat failure counter is greater than a preset value, if the value is greater than the preset value, then determine the deadlock occurs in the FTP upload task, if the value is not greater than the preset value, the determining module periodically transmits a heartbeat detection message to the FTP upload task.
In the abovementioned device, the buffer queue managing module can be further configured to notify the determining module to reset the heartbeat failure counter after determining that the deadlock occurs in the FTP upload task.
Here, it should be explained that, the task priority to which the daemon belongs is higher than that of the FTP upload task.
The method, device and system for FTP deadlock detection and self recovery which are provided by the present disclosure are described briefly above. The technical solution of the present disclosure is described below with reference to specific embodiments.
As shown in
As shown in
It should be noted that, the FTP upload task is detected and monitored by a heartbeat detection message sent from the daemon in the whole life cycle all the time.
Compared with the prior art, the embodiments of the present disclosure introduce methods such as a method for detecting a deadlock through a heartbeat detection mechanism, a method for releasing socket resources with delay from a socket resource cycle queue when a deadlock occurs, a self recovery method, and the like. Furthermore, the heartbeat detection mechanism can monitor a deadlock according to different deadlock determining policies (for example, modifying a value N), instead of taking presence of a semaphore waiting loop as the only basis for determination, thus not only improving the flexibility of coping with a sudden FTP deadlock of base station equipment, but also decreasing resetting times of a single board in abnormal situation and improving the reliability of the system. In addition, in the embodiments of the present disclosure, control connection socket resources and data connection socket resources are released with delay when a deadlock occurs, and the possibility of presence of a sudden abnormal situation is reduced, thereby ensuring the total amount of occupied socket resources in a base station system are as well as avoiding socket resources in use from overflowing; at the same time, abnormal data receiving and scheduling at a daemon, which is caused by an abnormal FTP upload operation, is prevented, thereby reducing the overstocking risk of all process messages of the task to which the daemon belongs, and playing a role of moderating the high utilization ratio of central processing unit (CPU) of a single board.
Obviously, various changes and modifications can be made to the present disclosure by those skilled in the art without departing from the spirit and scope of the present disclosure. Thus, if these modifications and equivalent replacements of the present disclosure are within the scope of the claims of the present disclosure and equal technology thereof, the present disclosure is also intended to include these changes and modifications.
Number | Date | Country | Kind |
---|---|---|---|
200910254305.4 | Dec 2009 | CN | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/CN2010/073194 | 5/25/2010 | WO | 00 | 5/29/2012 |