The invention relates to a method and means for ensuring high availability of services in a communication server. In particular the invention relates to a method and means for ensuring high availability of services in a software based communication server.
Packet switched communication is growing in importance everyday in comparison to traditional circuit communication. This development has resulted in a diversity of communication methods. Also, this development has resulted in many traditional hardware devices being replaced by substitutes that are typically software engineered on a simple communication server platform. For example, arrays of expensive digital signal processors were once needed in circuit switched servers, whereas now a server with a single processor may handle several packet switched connections.
For example corporate contact centres are today mainly realised with a server and associated software to handle a multitude of communication and contact management tasks. However, such contact centres are plagued by poor availability. For one reason or another an application required by an employer or a customer ceases to work at a critical moment. An IP-address is lost and cannot be promptly acquired to ensue communication. A server device crashes just when users are using applications on that particular server. Ensuring availability of services to users, even when some services fail at some network node is clearly a critical issue of growing importance.
One prior embodiment of a software call centre is exhibited in WO0215030A1, which is taken here as reference. This publication features a mechanism for improving availability, mirrored hot backup, where failed resources can be dynamically replaced by new resources in order to guarantee continued availability of the resource. This feature of the aforementioned document is demonstrated with reference to
Another prior art embodiment for ensuring the availability of a packet switched phone call is demonstrated in U.S. Pat. No. 6,389,005, where a PSTN phone connection is formed when the Quality of Service (QoS) for the packet switched connection deteriorates below a certain threshold value.
However, the prior art methods for ensuring availability have inherent disadvantages. In mirrored hot backup, a complete redundant set of hardware devices is required for the backup mirror. Mirroring as exhibited in WO0215030A1 is clearly a very costly alternative for ensuring availability, as the hardware and software investments for the mirror set of devices are nearly equal to those of the actual devices being used.
Another disadvantage in accordance with the prior art is the confusion that results with the reallocation of IP addresses, when services do crash and the mirror resource is being used. Typically in accordance with the prior art the mirror resources need to assume a new IP address. This introduces the added difficulty for external applications needing to decipher which IP address is related to which new resource.
The object of the invention is to overcome some of the drawbacks related to the methods mentioned above. Another object of the invention is to provide a new, simple and reliable method and device for ensuring high availability in software based contact centre. A further object of the invention is to reduce the costs and initial investments related to ensuring high availability of services in software based contact centre.
An even further object of the invention is to provide a new, simple and reliable method and device for transferring IP-addresses between resources in a software contact centre so that external applications only see an infinitesimal interruption in time, otherwise everything ensuring ceteris paribus.
In this application “software based contact centre” is taken to mean any contact centre designed to connect voice calls, VoIP (Voice over IP) calls, emails, facsimiles, SMS-messages (Short Message Service), MMS-messages (Multimedia Message Service), and/or web contacts for example. In addition to connecting aforementioned communication “software based contact centre” typically also manages the contact information needed for such communication.
Exemplary software based contact centre architecture is presented in
The server software 110 is typically arranged to run on at least one server platform 120, which is at the nearest interface with the server hardware 130. The server software 110 may perform intelligent routing functions such as ACD, Skill Based Routing (SBR), Call queues, IVR (Intelligent Voice Recognition). The server 110 may also feature a Gateway Controller, Communication Event Manager and/or a Contact Process Manager.
The software based contact centre 10 may for example connect and manage such communication as phone calls 140, emails 141, facsimiles 142, SMS 143, web contacts 144, messaging 145, mail 146.
The invention is based on a developed application of virtual backup units for ensuring availability in a software based contact centre. The software based contact centre will have a high availability controller, which will have logical connections to virtual backup units on the same server, or different, even distributed servers. The virtual backup unit will typically contain a set of resources that could potentially fail. Once a failure of a resource takes place, a virtual backup unit may be initialised at any server, and some or all parts of the virtual backup unit are arranged to replace the failed resource.
Thus the method of the invention is especially applicable to improving the availability and level of trust in software based contact centres.
In one preferred embodiment of the method of the invention, there are several virtual backup units distributed on several servers that can backup a particular service. In this embodiment virtual backup units can be started on any server to replace a failed resource.
In another preferred embodiment of the invention there is only one virtual backup unit on one server ensuring availability on several servers that are in use. This virtual backup unit may take the role of any server upon failure.
Consequently, a significant advantage of the invention is that it is possible to provide either significantly less or more virtual backup units than there are servers in use, and these virtual backup units may be realised on a random number of server machines. This allows for great flexibility in both hardware investments and the level of availability sought from the system.
The invention also has the advantage of allowing to capture or transfer the IP address of the failed resource to the virtual backup unit. Thus, the external agents only see a short interruption in service, and afterwards all services resume as normal.
A method for ensuring high availability in a software based contact centre in accordance with the invention, comprising at least one high availability control unit is characterised by the following steps,
An arrangement for ensuring high availability in a software based contact centre in accordance with the invention, comprising at least one high availability control unit is characterised in that,
A memory storage unit in accordance with the invention containing software based contact centre server software, further comprising at least one high availability control unit is characterised in that,
The best mode of the invention at this time is considered to be the application of several distributed virtual backup units in increasing the availability of a particular software based contact server.
The invention will be explained in detail below with reference to the accompanying drawings, in which
Some embodiments of the invention will be described in the dependent claims.
In
In phase 300 of
The application server 420 features typically at least one executable CEM 422, at least one executable Web Server 423 and at least one executable H323 gateway 421. The executable H 323 gateway is typically connected to at least one H323 phone. The application server 420 features also call control 470, which may connect for example the gateway server 430, at least one executable H323 gateway 421, at least one executable CEM 422, and/or at least one virtual phone 460 in some embodiments. The executable web server is typically connected to at least one database server 400.
Any of the typical resources of the outlined software based contact server may be duplicated in virtual backup units in any permutations or combinations. For example, in one non-restrictive and exemplary case, there are two database servers (primary and backup) running an SQL server with data files on a shared failsafe disk array. In addition, there are five application servers (three primaries and two backups) running IIS, CEM, Bridge and two gateway servers (primary and backups) with special gateway hardware and software.
The exemplary system will comprise also diverse software, such as one SQL server database system on a database server with the redundant backup unit. This will include also the configuration database. One gateway system on a gateway server is also included with one redundant backup unit, and two IIS's in two different configurations in any applications server. The system will also comprise three CEM software systems in three different configurations in any application server, and platform monitoring software on each machine.
The high availability controller 530 interfaces with one or several virtual units 560 and their resources 561, 562, 563, 564, or other external processes 540. The high availability controller 530 controls and monitors at least one virtual unit 560 and its resources, which may include in some embodiments IP-addresses, CEM/CD, gateways and bridges.
The high availability controller 530 and virtual unit 560 system, or a network of them is centrally configurable and manageable in many embodiments. This system 530, 560 will typically be able to fetch its configuration data for its managed processes from the central configuration database. In some embodiments the system configuration data is copied locally, and when the central configuration database 500 is unavailable, the local copy 550 will be used instead. In many embodiments the high availability controller 530 instances are uniquely identified for configuration and monitoring purposes. Likewise each high availability controller 530 is typically named uniquely, and this identity is used to associate configuration data and statistics.
The high availability controller 530 should be able to analyse the overall system status of a software based contact centre, and initiate corrective actions if problems are found. Upon a process failure, the failed unit could be automatically deactivated and a redundant backup unit activated. Additional automatic reconfigurations might also be taken to e.g. reroute some traffic through the software based contact centre or elsewhere in some embodiments.
Typically the shutting down, crashing, freezing or resetting of a high availability controller 530 instance will not prevent the production of a service offered by the managed processes unless so specifically desired in some embodiments. The high availability controller 530 is thus able to isolate failed processes in accordance with the invention.
The adapter and plug-in for proprietary services 660 interfaces service processes for statistical data via a shared memory mapped file in some embodiments. The adapter and plug-in for Windows services 670 is used to gather performance statistics from processes to high availability controller 640 using Windows Performance statistics collection mechanisms. The monitored process may publish the monitored statistics in order to use the data for status and health monitoring purposes. The adapter and plug-in for Windows services 670 may also be used to start, stop and restart a Windows process and monitor process existence and status, such as not installed, started, stopped, starting, stopping etc. In some embodiments the interface 670 allows the usage of Windows shell commands to execute custom monitoring and controlling mechanisms. The target shell command string is called to perform the actual tasks and return the results to this interface 670 in some embodiments.
The monitoring that the high availability controller 640 exhibits allows to monitor the status of the managed processes using configurable methods in some embodiments. In addition, high availability controller 640 is capable of collecting various operational statistics from the controlled processes in some embodiments. In many embodiments high availability controller 640 also features the intelligence relying on configuration and capabilities on deciding which statistics are collected and how. Monitoring activity will also typically include associating thresholds with the monitored statistics for noting a change in a status of a monitored target. The high availability controller 640 will also typically feature a timeout for monitoring procedures, so the high availability controller 640 should not get blocked indefinitely even if a monitoring thread does.
The controlling features that the high availability controller 640 exhibits allow the system to start and stop its controlled processes. The high availability controller 640 is able to initiate a reconfiguration of a controlled process in some embodiments. Also, the high availability controller 640 will be able to configure some attributes of its controlled processes. The high availability controller 640 will be able to set the entire configuration data for its controlled processes using configurable methods in some embodiments. Likewise, the high availability controller 640 is able to set the configuration data in Windows registry in many embodiments.
In some embodiments the high availability controller 640 is able to generate XML configuration files for its controlled processes. Likewise, the high availability controller 640 is able to generate custom configuration files for its controlled processes. The high availability controller 640 will also typically feature a timeout for monitoring procedures, so the high availability controller 640 should not get blocked indefinitely even if a monitoring thread does.
The high availability controller 640 may also issue alarms that signify the occurrence of unexpected circumstances. There are separate alarms for failed initialisation upon start-up, failing configuration, process termination and other conditions in accordance with the invention.
The high availability controller 640 may dispatch the alarms through SNMP (Simple Network Management Protocol). SNMP may also be used in other process and configuration monitoring and control in accordance with the invention.
The gateway primary 740 comprises a high availability controller for gateway pool, and at least one virtual unit with an IP-address 742, CEM/CD 743, and a series of gateways 744, 745, 746. The gateway backup node 750 comprises the same in many embodiments.
Direct commands to high availability controllers 711, 721, 731, 741, 751 may be given from the HAC control user interface 700. High availability controllers 711, 721, 731, 741, 751 may be forced to refresh its configuration data from the database in some embodiments. It is also possible that the high availability controllers 711, 721, 731, 741, 751 may also be forced manually to initiate a switch between backup unit(s). Especially in embodiments where high availability controllers 711, 721, 731, 741, 751 are independent of each other, the switch between backup units has to be typically done manually by inactivating the active 720, 740 and activating a standby backup unit 730, 750.
However, if configuration is read successfully from the local file, the system model is created from the data in phase 820. If model creation is successful a copy of the data is written to the registry 840. If the model creation fails backup data is read from registry and the system model is created 830, and if even this proves unsuccessful the process is paused 860 and retried. In phase 850 the created system model is finally used.
In some embodiments the high availability controller instances relay changes in the system model using the high availability controller protocol over a TCP/IP network. In addition version control mechanisms may be applied to the system model data, with which the changes in the model are propagated reliably to all instances automatically. This has an obvious benefit in that the local file need not be manually copied every time to different high availability controller instances, when changes are made to the system.
In the “view status” use case 911 the high availability controller 530, 640 is queried information about the current node status. In many embodiments the returned information contains all the managed entities, their status and health. The administrator 910 may use this feature to visually monitor the node. In the “alter current configuration manually” use case 912 the administrator may use the high availability controller 530, 640 user interface to manually change the current configuration. In some embodiments, the manual modifications are limited to role changes for the high availability controller 530, 640 and its managed entities, such as the virtual units 230, 240, 250. The modification typically has no effect on the configuration database 920 content and thus it is lost after a reset or restart of a high availability controller 530, 640 instance. In some embodiments overriding “save” conditions may be defined. However in some embodiments in order to make permanent changes, the administrator 910 needs to modify the central configuration database 920 and then signal the high availability controller 530, 640 to fetch the modified configuration data.
In the “reset HAC manually” use case 913 the high availability controller 530, 640 is reset, and the process is restarted and configuration is fetched from the configuration database. This use case is also available on the high availability controller user interface 510, 520 in some embodiments. In the “manage node” use case 901 the monitoring of all managed entities is initiated and control actions are generated accordingly. In some embodiments the high availability controller 530, 640 continuously manages its sub entities by executing entity monitoring, deducing necessary actions and implementing the actions via entity controlling mechanisms when needed. Typically the entity management initiates control actions to the managed entity.
In the “control system component” use case 902 control signals are issued to an entity, i.e. an internal high availability controller 530, 640 component, in order to change the status of a system component to a desired value. Each entity type will typically have a suitable control mechanism attached that interacts with the real system component represented by the entity and executes the actual control actions in some embodiments. In the “monitor system component” use case 903 system component 900 health and status is monitored. Each entity type will typically have a suitable monitoring mechanism attached that interacts with the real system component represented by the entity and executes the actual control actions in some embodiments.
In the “set entity role” use case 914 the role of an entity, i.e. an internal high availability controller 530, 640 component, is set or modified. The entity might be one of the managed entities or the, i.e. an internal high availability controller node itself. In the “get configuration” use case 921 the high availability controller 530, 640 fetches its configuration data from the central configuration database and applies the fetched configuration. A local copy of the data may also be made in some embodiments for being used later is the database is not available. In addition to the high availability controller 530, 640 configuration attributes, the configuration data includes the list of processes and configuration data for them in some embodiments.
The invention has been explained with reference to the aforementioned embodiments and several industrial and commercial advantages of the invention have been demonstrated. A significant advantage of the invention is that it is possible to provide either significantly less or more virtual backup units than there are servers in use, and these virtual backup units may be realised on a random number of server machines. This allows for great flexibility in both hardware investments and the level of availability sought from the system. The invention also has the advantage of allowing to capture or transfer the IP address of the failed resource to the virtual backup unit. Thus, the external agents only see a short interruption in service, and afterwards all services resume as normal.
The invention is not restricted only to the aforementioned embodiments, as many variants are conceivable without departing from the original inventive idea and scope of the attached claims.
Number | Date | Country | Kind |
---|---|---|---|
20021540 | Aug 2002 | FI | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/FI03/00620 | 8/25/2003 | WO | 6/30/2005 |