This is a Continuation of International Application No. PCT/FI03/00235 filed Mar. 27, 2003, which designated the U.S. and was published under PCT Article 21(2) in English.
The present invention relates to mobile telecommunication systems. In particular, the present invention relates to a novel and improved method, system and extended naming service framework for guaranteeing service availability.
Data communication and computer networks provide different kinds of services to parties requesting services. Service here refers to any instance, for example software or hardware components that are capable of offering some kind of useful information for other entities, users or clients.
An important property for a service is that it is available as much as possible. Some services are more important than others, and therefore the down time or fault time of the most important services should be minimised. Also an important aspect is that the state information (e.g. failure, down, etc.) should be available to the instances that might need the information, e.g. other service users that might send a service request to the service. In some occasions, a process provides only one service. However, there may be situations where a single process is offering one or more services.
It can be said that most of the computer and data communication systems have some kind of management system. The task of the management system is to monitor the functions of the system, report failures and manage possible recovery processes. The object to be monitored can be, e.g. a hardware component, e.g. a Computer Processing Unit (CPU) or a software component, service or process. Mobile networks comprise different kinds of Network Management Systems (NMS).
Conventional problem handling systems are often rule based solutions. This means that there are beforehand generated rules, and when a failure occurs, some information is acquired about the failure, and based on the rules affects of the failure are concluded. What is even more problematic is that the rules are often customer-specific. The main problem with the predetermined rules is that someone has generated the rules, and they are more or less based on before experienced situations. The weakness of the rule based solutions is that they are static by nature.
When a failure occurs, according to rule based solution, the predetermined rules are gone through, and it is tried to be decided what effects the failure causes. Imagine a situation where a certain failure has not occured ever, so there will not be any proper rule to apply. Therefore, we have a failure situation, and we do not any specific information what are the affects of the failure.
There exists high-availability carrier-grade server platforms. The use of mainstream hardware technologies and open-interface software components facilitate fast product creation. These network servers will eventually supersede, e.g. current mobile switches, enabling mobile networks to provide rich-call capabilities far beyond present voice and messaging-centric services.
Currently some systems use so called High Availability Services (HAS). The HAS provides services to build a system that is highly available i.e. provides good service. Clients are interested of service access points called Integration Reference Points (IRP). A client refers to an entity, e.g. a software process, which is able to use different services. A process can include many services, and thus many service access points.
A problem arises when the High Availability Services (HAS) finds that e.g. some process or physical equipment, e.g. a disk is broken. The problem is to determine which services are affected. Prior art solutions do not provide unambiguous answers to this important question. In current circuit switched based world the architecture was based on processes and communication between them. There the client knew their counterpart processes, and thus affect of some process failure was predictable, and it was possible to recover from the situation just with the HAS concept.
However, in the present architecture model the main points are services and their access points. The clients are not interested in the physical items like processes providing the services. Clients are only interested in the service access points (IRP) and their run-time references. This gives freedom to pack services in many ways, and thus provides very flexible architecture to support different kind of physical entities of the services.
The present invention describes a dynamic managing method, system and extended naming service framework for enabling service availability in a computer system. The computer system comprises at least one or more client applications, one or more server applications for providing one or more services, wherein the client applications use one or more services. The system further comprises one or more entities or physical components for providing one or more server applications and a management entity comprising information about the entities or pieces of physical equipment.
The extended naming service framework described in the present invention provides a solution to the problem of locating and managing services within a system. The framework provides in a preferred embodiment a centralised storage of information about all the services within a system. However, the extended naming service framework can also be a distributed solution as well. Source of information about the services are the service providers themselves or mapped information from the HA notifications concerning physical components, e.g. a LAN port etc.
Service availability must be transparent to service users. Service providers, even if they aid in service availability must not implement service availability themselves. The present invention provides a generic solution for implementing service availability.
The High Availability Services receive information e.g. of failures of physical component (Central Processing Units (CPU), Local Area Networks (LAN)), processes etc. Service users (client applications) are not interested in processes but in the service access points. Service providers can provide the “physical” details (e.g. process info) of their services to the extended naming service framework. By listening to the HAS notifications regarding physical components, the extended naming service framework can map the status of a service to the status of the physical component which provides the actual service.
The present invention has several advantages over the prior art solutions. The present invention provides a generic and dynamic solution for service availability. With the present invention it is possible to map processes or physical equipment to services provided by them.
The accompanying drawings, which are included to provide a further understanding of the invention and constitute a part of this specification, illustrate embodiments of the invention and together with the description help to explain the principles of the invention. In the drawings:
Reference will now be made in detail to the embodiments of the present invention, examples of which are illustrated in the accompanying drawings.
High Availability Services HAS receive information e.g. of failures of physical components, e.g. Central Processing Units (CPU), Local Area Networks (LAN), processes etc. The HAS manages the Recovery Units (RU), which include the processes. The recovery unit is usually responsible for a service but in practice it can include multiple service access interfaces or Integrated Reference Points (IRP).
Broken lines represent possible messaging routes. The main idea of the present invention is that the extended naming service framework ENS stores detailed information concerning client applications (CU1, CU2) and server applications (SP1, SP2). When a failure situation occurs the extended naming service framework ENS comprise all the needed information to conclude which services will suffer due to the failure.
The system in accordance with the present invention may also comprise Alarm Services AS (as in
The HAS will start the processes, and in the startup of a process the HAS will give the process physical location information and state of the process (active or standby) . So when the process is started, the current physical location will be told to it (like cluster-node-Recovery Unit) as well as the process state. After the process is started by the HAS, the process will register all service access points to the ENS. In these registration messages (or one big registration message) to the ENS, the process will add also the physical location of the service access point or integration reference point (IRP) and the state. Process can also give some other keys that could be used to subscribe the service later by the clients. After registering the ENS will have information like “IRP, IRP state (=process state), physical location, other search keys”.
Now, if something goes wrong with some CPU node or with some process, the HAS will send this information to the ENS. The ENS is now able to bind this information concerning the physical component to the real service access points. The ENS is able to change the statuses of the IRPs according to the physical component states.
The system represented in
In
In one embodiment of
The above mentioned means are in one embodiment implemented by using hardware and/or software components.
In one embodiment of
In one embodiment of
In one embodiment of
In one embodiment of
In one embodiment of
In another embodiment of
The service provider SP registers its services to the extended naming service framework ENS (20), e.g. using CORBA communication. In a preferred embodiment, the registration includes the following information:
After registration, the service is successfully registered to the ENS and is identified by a unique identity.
The service user SU subscribes to services from the framework ENS (22), using e.g. a CORBA call. In a preferred embodiment, the subscription includes the following information:
The service user SU has now successfully subscribed to services from the framework. If there are services available, which match the subscription, the service user is updated immediately (24). This update information has all the information about the service, including the service identifier, the service name, the IRP, the interface address of the service, physical information about the service as well as the service status. When a service registers, service users whose subscription matches to the registered service, are updated. In other words, a service user automatically receives service information with which it is involved with. To manage service subscription, the extended naming service framework ENS can build dynamic table concerning the relationship between clients and the services. By using this table the ENS is able to push information concerning the service status changes only to the clients that are interested of the IRPs in question.
The extended naming service framework ENS is up and running. The HAS initiates (30) the shutdown sequence of the service provider. Initiating a shutdown sequence means that new requests are not accepted any more. The service provider indicates (deregistration message) to the framework that it is gracefully shutting down (32). The extended naming service framework ENS removes the service information from its list of available services. It also updates all the users who are aware of this particular service instance with the new status of the service. Further, the extended naming service framework ENS informs the service provider SP that all service users being aware of the service have been updated (34). Finally, the service provider SP informs the HAS that the shutdown sequence has been completed and shuts down itself (36). Because all service users which were aware of this particular service instance are updated, the service users do not initiate any new transactions with the service instance.
The initial state of
The extended naming service framework is listening to HAS notifications regarding status change (active or standby etc.) of recovery units. It receives a notification that the status of a particular recovery unit has gone to standby mode (40). It maps this switchover information to the information regarding to the physical properties of the service instances as provided by the service providers themselves. In other words, the extended naming service framework ENS finds out which services were running on this particular recovery unit. As said before, the service user is assumed to be aware of both the active and standby instance of a particular service. The recovery unit which hosted the active instance of the service has switched over and gone to standby. Therefore, the extended naming service framework ENS updates the service user SU with the changed status of the service instance (active to standby) (42).
The extended naming service framework ENS now receives a notification that a particular recovery unit has switched to active state which can be mapped to the status of the services that are running on this recovery unit (44). The ENS then updates all the service users SU, which are subscribed to information about the service instances under question (46). The service users SU have now been informed of the new status of the services.
It is obvious to a person skilled in the art that with the advancement of technology, the basic idea of the invention may be implemented in various ways. The invention and its embodiments are thus not limited to the examples described above, instead they may vary within the scope of the claims.
| Number | Date | Country | Kind |
|---|---|---|---|
| 20020765 | Apr 2002 | FI | national |
| Number | Date | Country | |
|---|---|---|---|
| Parent | PCT/FI03/00235 | Mar 2003 | US |
| Child | 10966159 | Oct 2004 | US |