This invention relates to automatic methods used to analyze and filter redundant alarms which are generated in the fault management system of the transceiver system. The method is utilized in 4G transmission stations (Evolved Universal Terrestrial Radio Access NodeB—eNodeB) and 5G transmission stations (Next Generation NodeB—gNodeB). Method support operators in monitoring faults are raised in operations process of transceiver system and help quickly find reasons of fault so that it reduces effort, time, cost of operation and maintenance procedures of telecommunication network which can interrupts network services.
Fault monitoring and management is the most important thing for operations of the telecommunication system. All faults must be supervised by a fault monitoring and management system. Fault sources would be from hardware, software, alarm of interruption of services. In the monitoring process, all abnormal cases would be shown in a fault interface in web or monitoring application in order to inform how the operations technician decides to resolve these faults. The problem is operations technicians must handle all faults as quickly as possible in order to prevent interruption of network services or decreases in the quality of service. In the worst cases, network services can be lost, the fault would not be resolved.
The operation of the telecommunication system is basically composed of an operation of EMS (Element Management System) and of the operation of base stations. An EMS manages and monitors base stations. Fault management functionality and raising alarms at the station would be sent to the central monitoring system—EMS. The operations technician must supervise all alarms of all transmission stations at EMS. Fault management of some stations will send SMS messages to a phone of a practical operations technician who is responsible for those stations.
As mentioned previously monitoring and maintaining all transceiver stations without raising fault is important to ensure faultless and clear communication. But in practice, there are many alarms in a station that are being raised by a fault and sent to EMS. The reason for this phenomenon is a consequence of an occurrence fault, for example, if a fault occurs in a hardware component, then software alarm and service alarm would be raised as a result of this. Therefore, the operations and maintenances will meet obstacles because this fault would not correctly be identified by these alarms and one would not know which alarm should be handled first. If the process of root fault handling is correct, then no alarm would be raised anymore. If the process of root fault is not correct, more time is spent for fault handling and service interruption time.
In addition, each alarm has an independent handling process and these processes would be handled separately by a different technician, so there are many interleaved handlings by different technicians which creates additional alarms. These make alarm monitoring and fault identifying more difficult. Work is that alarms must be filtered by the system and correct alarms with corresponding root fault will be showed to bring more effectiveness in the fault handling process.
To resolve this problem the inventor of the invention has researched and recommend an automatic analysis method and filtering out of alarm redundancies in the fault management system of the transceiver station. If the method in the invention is applied, less time is used for fault handling than not used and reduced personal cost and advance the quality of telecommunication services are achieved.
The invention provides an automatic analysis method and filtering alarm redundancy in the fault management system of the transceiver system. To implement this method, a correlation between alarms that are raised in the station would be found by the management system, so that the system identifies which alarms are important, and should be kept, and which are not important, with no need to keep any more. So these alarms are a consequence of important alarms and those that have no meaning will be filtered automatically. When applying this method, the number of alarms over time will be significantly reduced, making for easier operation and handling of errors.
In this invention, the design of a fault management system for radio transceiver stations includes the following components:
Fault Source Unit (FSU) for monitoring and detecting alarms. The location of this block is placed on all hardware components of radio stations such as distributed processing unit—DU (Distributed Unit), centralized processing unit—CU (Centralized Unit), processing block radio—RRU (Radio Remote Unit). In a system, there will be many monitoring and alarm detection blocks. The task of this block is to monitor and detect failures of hardware components, software failures, service failures, etc. The output of the block is all the raw alarms detected by the system.
Fault Analysis and Filtering Unit (FAFU). This block plays the most important role in the system. The location of this block is always located at the CU component. All alarms in alarm detection and monitoring blocks will be sent to this block for processing and filtering. The function of this block is based on the correlation, the relationship between the alarms, it will find out which is the most important root alarm to keep, other alarms will be filtered. To find the correlation between alarms, it is necessary to block the definition of alarm rules as input to the block that analyzes and filters the alarms.
Fault Rules Defined Unit (FRDU) block. The task of this block is to allow operators to define the relationships between alarms, in addition, to define other rules to filter out redundant alarms.
Fault Storage and Monitoring Unit (FSMU). This block has the function of storing the alarms in the database and displaying the alarms after filtering on the monitoring screen.
Specifically, the present invention provides a method to automatically analyze and filter out redundant alarms in the fault management system of radio transceiver stations, including the following steps:
Step 1: The mining operator will define the relationship between alarms, define rules to filter redundant alarms at the FRDU block. The present invention defines four types of relationships between two alarms: 1-1 relationship, 1-n relationship, n-1 relationship, and n-n relationship. Mining operators will use the EMS centralized management system interface to define a model for the relationships between alarms.
Step 2: The FSU block will detect the alarm and send it to the FAFU block. The alarm detection and monitoring unit (FSU) is located in the hardware components in the radio base station so that monitoring will detect the alarm and send the alarms to the analyzer and filter the residual alarm (FAFU).
Step 3: FAFU block will receive alarms from FSU block, based on the rules defined in FRDU block will analyze and filter out redundant alarms. At this step, after receiving the alarms from the FSU block, the FAFU block will rely on the rules defined in the FRDU block and the possible scenarios for the two related alarms, the system will analyze and filter out redundant alarms.
Step 4: The FSMU block will receive the alarms after being filtered, stored in the database, and also send these alarms to the EMS system. At this step, after receiving the alarms sent, the EMS system provides an interface to display a list of current alarms for timely handling by operators.
The following sections describe in detail the automatic analysis and filtering of redundancies in the base station fault management system.
Fault Source Unit (FSU) for monitoring and detecting alarms. The location of this block is placed on all hardware components of radio stations such as DU (Distributed Unit), CU (Centralized Unit), RRU (Radio Remote Unit). In a system, there will be many blocks that monitor and detect alarms. The task of this block is to monitor and detect failures of hardware components, software failures, service failures, etc. The output of the block is all the raw alarms detected by the system.
Fault Analysis and Filtering Unit (FAFU). This block plays the most important role in the system. The location of this block is always located at the CU component. All alarms in alarm detection and monitoring blocks will be sent to this block for processing and filtering. The function of this block is based on the correlation, the relationship between the alarms, it will find out which is the most important root alarm to keep, other alarms will be filtered. To find the correlation between alarms, it is necessary to block the definition of alarm rules as input to the block that analyzes and filters the alarms.
Fault Rules Defined Unit (FRDU) block. Function of this block is to allow operators to define relationships between alarms, in addition to define other rules to filter out redundant alarms.
Fault Storage and Monitoring Unit (FSMU). This block has the function of storing the alarms in the database and displaying the alarms after filtering on the monitoring screen.
Based on a system with the above functional blocks, the steps to implement the method are as follows:
Step 1: Operators will define relationship between alarms, define rules to filter redundant alarms at FRDU block.
To define the binding relationship between the alarms,
The first is a 1-1 relationship, which defines a parent alarm that will have only one child alarm. This means that when the system generates a parent alarm, there will always be a child alarm attached.
The second type of relationship is a 1-n relationship, one parent alarm will have more than one child alarm. This means that when the system generates a parent alarm, there will always be more than one child alarm associated with it.
The third type of relationship is the n-1 relationship, a child alarm can be the child of many different parent alarms. This means that when the system generates many different parent alarms, all of which are accompanied by the same child alarm
The final type of relationship is an n-n relationship, a parent alarm will have many different child alarms at the same time, a child alarm is also a child of many different parent alarms. The n-n relationship will be the most complex. The FRDU block will allow these relationships to be defined. The relationship definition will only need to be done once and will normally be defined during the development of the device manufacturer's alarms, the mining operator need only read the model instructions. describe it. During operation, error managers can still be given permission to change these relationships.
In practice, the relationship between alarms is not simply a relationship between two alarms, these relationships can be hierarchical according to a multi-level model as depicted for example in
Operators define these multi-level model relationships on the EMS interface in different ways such as using the drag and drop interface to create the model, using the Command Line Interface—CLI to add, edit, delete relationships.
Step 2: FSU block will detect alarm and send to FAFU block.
The alarm detection and monitoring unit will detect the alarm (FSU) located in the hardware components in the radio base station to monitor and send alarms to the analyzer and filter residual alarms (FAFU). An alarm will have the following properties: alarm name, alarm identifier, alarm object, alarm status, alarm severity, alarm time, and additional information to describe the cause of the occurrence as well as how to resolve the alarm, etc. The severity of the re-alarm includes levels from high to low as follows: critical, major, minor, warning. An alarm's life cycle will have three active states: new, changed, and cleared. When an alarm belonging to an object is detected, the FSU block will set the status of the alarm to the generated state, when the alarm persists but its properties are changed such as severity level. When the alarm's importance changes, the alarm's state will change to the changed state, once the alarm has been resolved, no longer exist the alarm state will change to cleared and end its life. When each state changes, the FSU block sends back alarm information through the FAFU block.
Step 3: FAFU block will receive alarms from FSU, based on the rules defined in FRDU block will analyze and filter out redundant alarms.
In order to be able to filter out redundant alarms, this step of the present invention explicitly describes a filtering mechanism based on the relationship between alarms. Suppose we have two alarms, A and B, where A is the parent alarm of alarm B. We will consider the cases where alarm A occurs first, then alarm B occurs later, the second case. alarm B appears first, then alarm A appears later, in the third case, alarm A goes to a cleared state before alarm B, the last case is alarm B goes to a cleared state before alarm A.
In case alarm A is detected first and has a new status, then alarm B is detected later, because alarm B is a child alarm, the consequence alarm of A, it will be filtered out, the system keeps only alarm A. Similarly for the second case, the system filters out the alarm that comes first as alarm B and keeps the alarm that comes later as alarm A. Alarm filtering This information in the processing system will not be displayed on the monitoring interface, but will still be stored in the database and assigned a status of hidden to distinguish it from unfiltered alarms.
In case alarm A goes to a cleared state before alarm B. The present invention provides modes of treatment for alarm B as follows:
The above operations between two parent and child alarms (A and B) will also be defined in the FRDU block. In the multilevel relationship model, the relationship between indirect parent and child alarms will be a non-tight relationship, so the way it works is to keep (KEEPING Mode).
In case alarm B changes to the cleared state before alarm A, it will not affect the status of alarm A, alarm A will still be displayed on the monitoring system.
The following is a detailed description of the execution flow at the FAFU block:
Step 4: the FSMU block will receive the alarms after being filtered, stored in the database and also send these alarms to the EMS system. The EMS system provides an interface to display a list current alarms for operators to handle timely.
Above are the described steps to implement the method of automatically analyzing and filtering redundant alarms in the fault management system of radio communication stations.
To demonstrate the effectiveness of the present invention, a method to automatically analyze and filter out redundant alarms in the fault management system of radio communication stations is implemented and integrated in the fault management system of the stations 5G gNodeB. The results of the invention will help to evaluate the effectiveness of the method proposed in the invention.
For the implementation of the present invention, the FSU blocks shall be placed on the hardware components of the gNodeB station such as the DU, RRU and CU hardware components. The FRDU, FAFU and FSMU blocks will be placed on the CU hardware component to receive alarms from the FSU.
To illustrate the effectiveness of the present invention, we will perform an experiment as follows: On DU hardware we will perform the withdrawal of the Small Form-factor Pluggable (SFP) module to generate an alarm SFP Not Present. As the relationship between the alarms is defined as shown in
While a preferred embodiment of the present invention has been shown and described, it will be apparent to those skilled in the art that many changes and modifications may be made without departing from the invention in its broader aspects. The appended claims are therefore intended to cover all such changes and modifications as fall within the true spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
1-2021-04079 | Jul 2021 | VN | national |