The present invention is based upon and claims the benefit of the priority of Japanese patent application No. 2018-151993, filed on Aug. 10, 2018, the disclosure of which is incorporated herein in its entirety by reference thereto.
The present invention relates to a fault recovery control method, a communication apparatus, a communication system, and a program.
With the advancement of virtualization technology and Software-Defined Networking (SDN), methods for constructing a network and providing services through it are changing dramatically. For example, unlike conventional network construction and service provision using a hardware-based network appliance(s), network functions are now separated from hardware and executed as software image(s) on general-purpose hardware using virtualization technology. In addition, these software-based network functions can be integrally managed by an SDN controller. As a result, overall operating costs can be reduced and a quick response to changes in service demand, for example, can be achieved.
While software-based network functions can be controlled from a host device such as an SDN controller, special orchestration functions are required when providing a service using multiple Virtual Network Functions (VNFs), such as Universal Customer Premises Equipment (uCPE) described below. As is well known, regarding network function virtualization (NFV), the NFV standardization organization ETSI (European Telecommunications Standards Institute) considers an architecture which is divided into three functional layers, i.e., virtualized infrastructure management (Virtualized Infrastructure Manager, VIM), VNF management (Virtual Network Function Manager, VNFM), and orchestration (NFV Orchestrator).
The following describes a uCPE, which is one use case of virtualization technology.
The individual sites 10A and 10B are connected to a data center 20 (cloud) via a wide area network (WAN) 30. For example, the WAN 30 may be the Internet, MPLS (Multi-Protocol Label Switching), or the like. The WAN 30 may be configured as a Software-Defined (SD)-WAN.
Upper apparatuses such as an orchestrator 202 and a VNF controller 203 are deployed in the data center 20. For example, the orchestrator 202 is configured as an orchestrator (NFV Orchestrator) in NFV MANO (Network Functions Virtualization Management and Network Orchestration). For example, the orchestrator (NFV Orchestrator) 202 performs lifecycle management (instantiation, monitoring, operation, removal, etc.) of network services configured by a plurality of VNFs and is in charge of integrated operation and management of an entire system.
The VNF controller 203 performs VNF management (VNF Manager: VNFM). The VNFM is in charge of VNF configuration, lifecycle management, and element management. In lifecycle management of an individual VNF, a VNF descriptor (VNFD) is used, which is a template describing the deployment and operation requirements, etc. of the VNF.
When the sites A and B are not distinguished from each other, either of the uCPE apparatuses 100A and 100B will be referred to as a uCPE apparatus 100, with the reference character A or B omitted. The individual uCPE apparatus 100 includes an NFV Infrastructure (Network Functions Virtualization Infrastructure: NFVI) that provides a virtual machine execution infrastructure for VNFs. For example, the NFVI provides a virtualization layer such as a hypervisor and computing, storage, and networking hardware components for hosting a VNF(s). Control of resources (physical resources and virtual resources) and lifecycle management of the computing, storage, and network of the NFVI are performed via a Virtualized Infrastructure Manager (VIM) in NFV MANO. The VIM may be provided in the uCPE apparatuses 100A and 100B, for example.
Service chaining is a mechanism in which various network functions such as a router, a firewall, and a load balancer are coordinated with each other and packets are exchanged in an appropriate order. Various network services can be provided to customers (users) at individual sites by operating a plurality of VNFs on the NFVIs of the uCPE apparatuses 100 and connecting VNFs with service chaining.
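By way of a non-limiting illustration, the service chaining concept described above may be sketched in Python as an ordered list of functions applied to a packet. The function names (`firewall`, `router`, `load_balancer`), the dictionary packet format, and the rules inside each function are hypothetical and are not taken from the embodiments.

```python
from typing import Any, Callable, Dict, List, Optional

Packet = Dict[str, Any]
# A network function returns the (possibly modified) packet,
# or None when the packet is dropped.
NetworkFunction = Callable[[Packet], Optional[Packet]]


def firewall(packet: Packet) -> Optional[Packet]:
    # Hypothetical rule: drop anything addressed to port 23.
    return None if packet.get("dst_port") == 23 else packet


def router(packet: Packet) -> Optional[Packet]:
    # Hypothetical rule: choose a next hop from the destination.
    next_hop = "lan" if packet.get("dst") == "local" else "wan"
    return {**packet, "next_hop": next_hop}


def load_balancer(packet: Packet) -> Optional[Packet]:
    # Hypothetical rule: pick one of two backends from the source port.
    return {**packet, "backend": int(packet.get("src_port", 0)) % 2}


def apply_chain(chain: List[NetworkFunction], packet: Packet) -> Optional[Packet]:
    """Pass the packet through each function in order; stop once it is dropped."""
    current: Optional[Packet] = packet
    for function in chain:
        if current is None:
            break
        current = function(current)
    return current
```

In this sketch, changing the order or the members of the list changes the service that is provided, which is the property the rearrangement of service chaining relies on.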
To implement such service chaining among VNFs on the individual uCPE apparatus 100, the data center 20 includes a controller (uCPE-PF (platform) controller) 201 that sets/controls a path(s) inside a hardware platform, a controller group (VNF controller) 203 that sets/controls the VNFs at individual sites, and the orchestrator 202 that coordinates individual controllers and provides final network services. Generally, the upper apparatus group such as the uCPE-PF controller 201, the VNF controller 203, and the orchestrator 202 is deployed in the data center 20.
In the uCPE system, the uCPE apparatus 100 which includes a plurality of VNFs, is deployed in each site 10. A group of upper apparatuses, such as a controller and an orchestrator, which control a plurality of VNFs and VNF service chains on the uCPE apparatus 100, are deployed in the data center 20.
Thus, for example, in a case of customers (users) such as global companies expanding their businesses globally, the site 10 and the data center 20 could be far away from each other by, for example, several tens of thousands of kilometers. In this case, a problem of a transmission delay between the site 10 and the data center 20 becomes apparent. For example, an SLA (Service Level Agreement) or the like regarding an intra-network delay time or the like is affected.
Regarding service chaining, the orchestrator 202 in the data center 20 grasps an overall system and transmits an instruction (referred to as “a first control signal”) to the uCPE-PF controller 201 and the VNF controller 203 in response to a network service request from a customer (user).
On reception of the instruction, the uCPE-PF controller 201 and the VNF controller 203 transmit a control signal (referred to as “a second control signal”) to the corresponding uCPE apparatus 100 in the user site to control the uCPE apparatus 100. In this way, control across a plurality of control planes is performed until the service chaining is configured in the uCPE apparatus 100.
Depending on processing capability and load status of the upper apparatus (orchestrator 202, uCPE-PF controller 201, and VNF controller 203), a delay is caused until the second control signal reaches the uCPE apparatus 100 from the uCPE-PF controller 201 and the VNF controller 203. When the first control signal from the orchestrator 202 to the uCPE-PF controller 201 and the VNF controller 203, and the second control signal from the uCPE-PF controller 201 and the VNF controller 203 to the uCPE apparatus 100 are respectively delayed, time is needed from occurrence of a fault in the uCPE apparatus 100 to recovery from the fault.
Patent Literature (PTL) 1 discloses that a large impact is caused when a control apparatus, which concentratively controls virtualized network service functions, malfunctions and there is a problem on availability of a virtualized network service function. According to PTL 1, to address this problem, each apparatus in a service chaining system autonomously performs alive monitoring, fault detection and fault recovery of a link between apparatuses, and fault detection and fault recovery of a link of each apparatus in a decentralized manner. In addition, in the service chaining system, SCF (Service Chaining Forwarder) apparatuses autonomously perform mutual exchange of service function statement advertisement information for enabling service chaining among individual apparatuses in a distributed manner. In addition, each SCF apparatus refers to a topology information table managed thereby and appropriately selects a forwarding destination SF (Service Function) based on “resource information” and “a total cost value”. As described above, according to PTL 1, the SCF apparatuses need to perform mutual exchange of the service function statement advertisement information for enabling service chaining among the individual apparatuses.
PTL 1: Japanese Unexamined Patent Application Publication No. 2016-46736
As described above, in a uCPE system, when a fault occurs in a VNF or the like on a uCPE apparatus deployed at a site,
Thus, as a result of a delay of time from occurrence of a fault in the uCPE apparatus to recovery from the fault, deterioration of the service level, e.g., unacceptable service down-time, etc., might occur.
The present invention has been made in view of the above problem, and it is an object of the present invention to provide a fault recovery control method, a communication apparatus, a communication system, a program, and a recording medium, each enabling a reduction in the fault recovery time of service chaining.
According to one aspect of the present invention, there is provided a fault recovery control method for a communication apparatus in a communication system, wherein the communication system includes: the communication apparatus that is arranged in a site and includes a plurality of virtual network functions used for service chaining; and at least one upper apparatus that is connected to the communication apparatus in the site via a network and manages the virtual network functions and the service chaining on the communication apparatus. The method includes:
setting, at least a part of functions of the upper apparatus for changing the service chaining on the communication apparatus in the site, to be deployed on the communication apparatus in the site; and
on occurrence of a fault in the communication apparatus in the site, the communication apparatus rearranging autonomously the service chaining thereon to perform recovery from the fault.
According to one aspect of the present invention, there is provided a communication system including: a communication apparatus arranged in a site, the communication apparatus including a plurality of virtual network functions and a service chaining with the virtual network functions connected; and at least one upper apparatus connected to the communication apparatus via a network, the upper apparatus managing the virtual network functions and the service chaining on the communication apparatus. The communication apparatus includes a control part that changes the service chaining on the communication apparatus, wherein the control part of the communication apparatus, on occurrence of a fault in the communication apparatus, rearranges the service chaining to perform recovery from the fault.
According to one aspect of the present invention, there is provided a communication apparatus arranged in a site and including a plurality of virtual network functions and service chaining with the virtual network functions connected, wherein the communication apparatus is connected to at least one upper apparatus that manages the virtual network functions and the service chaining on the communication apparatus via a network. The communication apparatus includes: a storage part that stores setting information about the service chaining connecting the virtual network functions; and a control part that, on occurrence of a fault, changes the service chaining, based on the setting information stored in the storage part to perform recovery from the fault.
According to one aspect of the present invention, there is provided a program, causing a computer that constitutes a communication apparatus that is arranged in a site, includes a plurality of virtual network functions and service chaining with the virtual network functions connected, and that is connected to at least one upper apparatus that manages the virtual network functions and the service chaining on the communication apparatus via a network, to execute processing including:
storing setting information about the service chaining connecting the virtual network functions in a storage part; and
on occurrence of a fault, changing the service chaining, based on the setting information stored in the storage part to perform recovery from the fault.
According to another mode of the present invention, there is provided a computer-readable recording medium storing a program, causing a computer that constitutes a communication apparatus that is arranged in a site, includes a plurality of virtual network functions and service chaining with the virtual network functions connected, and that is connected to at least one upper apparatus that manages the virtual network functions and the service chaining on the communication apparatus via a network, to execute processing including:
storing setting information about the service chaining connecting the virtual network functions in a storage part; and
on occurrence of a fault, changing the service chaining, based on the setting information stored in the storage part to perform recovery from the fault. For example, the recording medium is provided as a non-transitory computer-readable recording medium such as a semiconductor storage such as a RAM (Random Access Memory), a ROM (Read-Only Memory), or an EEPROM (Electrically Erasable and Programmable ROM), an HDD (Hard Disk Drive), a CD (Compact Disc), or a DVD (Digital Versatile Disc).
According to the present invention, the fault recovery time of service chaining can be reduced.
According to one of the embodiments of the present invention, a service chaining change function, which is originally performed by an upper apparatus such as an orchestrator (NFV Orchestrator) on occurrence of a fault, is deployed in an individual uCPE apparatus in an individual site. When a fault occurs, the uCPE apparatus autonomously performs fault recovery processing by changing the corresponding service chaining to recover from the fault in the uCPE apparatus, without waiting for an instruction from the upper apparatus such as the orchestrator. Thus, while time is conventionally needed from occurrence of a fault to recovery from the fault, the present invention can shorten the time needed to change the service chaining. That is, the present invention can reduce fault recovery time.
When a fault occurs in a uCPE apparatus 100, the uCPE apparatus 100 analyzes a fault notification (a management signal) to be transmitted from the uCPE apparatus 100 to the upper apparatus and grasps content of the fault. The uCPE apparatus 100 may be configured to detect a fault about a logical port, etc. of a VNF implemented as a virtual machine (VM) on the NFVI (NFV Infrastructure) of the uCPE apparatus 100 and a fault about a hardware platform and/or a software platform of the uCPE apparatus 100, for example.
After grasping the fault, the uCPE apparatus 100 performs recovery from the fault by deriving (calculating) service chaining that bypasses a location (for example, a VNF) in which the fault has occurred and rearranging the service chaining, based on the VNF configuration information and service chaining information stored in the uCPE apparatus 100.
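By way of a non-limiting illustration, the derivation of a service chaining that bypasses the fault location may be sketched in Python as follows. The sketch assumes, purely for illustration, that bypassing is achieved by substituting another instance of the same kind of VNF that is not already used in the chain; the embodiments do not prescribe this particular strategy. The function name `derive_bypass_chain` and the instance names in the usage example are hypothetical.

```python
from typing import Dict, List, Optional


def derive_bypass_chain(
    chain: List[str],
    vnf_kinds: Dict[str, str],
    faulty_vnf: str,
) -> Optional[List[str]]:
    """Return a chain that avoids faulty_vnf, or None if no substitute exists.

    chain      -- stored service chaining information (ordered VNF instance names)
    vnf_kinds  -- stored VNF configuration information (instance name -> kind)
    Assumption: a bypass swaps the faulty VNF for a spare instance of the same kind.
    """
    if faulty_vnf not in chain:
        return list(chain)  # the fault does not affect this chain
    kind = vnf_kinds.get(faulty_vnf)
    for candidate, candidate_kind in vnf_kinds.items():
        usable = candidate != faulty_vnf and candidate not in chain
        if usable and candidate_kind == kind:
            return [candidate if name == faulty_vnf else name for name in chain]
    return None  # rearranging the chain cannot recover from this fault
```

For example, with `{"fw-1": "firewall", "fw-2": "firewall", "rt-1": "router"}` as the configuration and `["rt-1", "fw-1"]` as the chain, a fault in `fw-1` yields `["rt-1", "fw-2"]`, whereas a fault in `rt-1` yields `None` because no spare router exists.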
After the recovery from the fault, the uCPE apparatus 100 transmits a setting change notification, for example, about the internal paths among the VNFs used in the service chaining to the upper apparatus(es) (the VNF controller 203, the orchestrator 202, etc.) and requests the upper apparatus(es) to update the setting information managed by the upper apparatus(es). As a result, no difference arises in the setting information about the VNFs between the upper apparatus (the VNF controller 203, the orchestrator 202, etc.) and the uCPE apparatus 100.
The communication part 110 in the uCPE apparatus 100 includes an interface (network interface) not illustrated that communicates with the uCPE-PF controller 201, the VNF controller 203, etc. in the data center 20 via the WAN 30.
The site management part 120 mediates a control signal between the uCPE-PF management part 130 or the VNF management part 140 and the upper apparatuses such as the uCPE-PF controller 201 and the VNF controller 203 in the data center 20, extracts setting information from the control signal, and stores the setting information in a storage part not illustrated.
In addition to the control signal, the site management part 120 also mediates a management signal, for example, a log or an SNMP (Simple Network Management Protocol) trap, by which an SNMP agent reports a change occurring in the SNMP agent system to an SNMP manager. When fault information is included in a management signal, the site management part 120 determines whether fault recovery processing is possible based on the stored setting information. If the site management part 120 determines that fault recovery processing is possible, the site management part 120, in place of the upper apparatuses, gives a service chaining switching instruction to the uCPE-PF management part 130 and the VNF management part 140.
The uCPE-PF management part 130 manages virtual machines (VMs) for implementing VNFs on the uCPE apparatus 100 and manages internal paths among VNFs for service chaining. The uCPE-PF management part 130 is controlled by the uCPE-PF controller 201, which is an upper apparatus.
The VNF management part 140 manages a VNF(s) deployed on a virtual machine(s) (VM(s)) created by the uCPE-PF management part 130. The VNF management part 140 is controlled by the VNF controller 203, which is an upper apparatus.
The signal analysis section 121 mediates control and management signals. When mediating a control signal, the signal analysis section 121 instructs the path management section 123 or the configuration management section 124 to store corresponding setting information. When mediating a management signal, the signal analysis section 121 gives a notification to the fault recovery control section 122.
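By way of a non-limiting illustration, the dispatching performed by the signal analysis section may be sketched in Python as follows. The class names, the `category` and `target` labels, and the use of in-memory lists in place of the path management section, the configuration management section, and the fault recovery control section are all hypothetical simplifications.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Signal:
    category: str  # "control" or "management" (hypothetical labels)
    target: str    # "platform" or "vnf" (hypothetical labels)
    payload: Dict[str, str] = field(default_factory=dict)


@dataclass
class SignalAnalysis:
    # Stand-ins for the sections that receive each kind of signal.
    path_settings: List[Dict[str, str]] = field(default_factory=list)
    vnf_settings: List[Dict[str, str]] = field(default_factory=list)
    fault_notifications: List[Dict[str, str]] = field(default_factory=list)

    def mediate(self, signal: Signal) -> str:
        """Route a signal and report which section handled it."""
        if signal.category == "management":
            # Management signals are passed on for fault determination.
            self.fault_notifications.append(signal.payload)
            return "fault_recovery_control"
        if signal.target == "platform":
            # Control signals for the platform carry internal path settings.
            self.path_settings.append(signal.payload)
            return "path_management"
        # Remaining control signals carry VNF settings.
        self.vnf_settings.append(signal.payload)
        return "configuration_management"
```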
The fault recovery control section 122 receives the notification from the signal analysis section 121 and determines whether a fault has occurred. If the fault recovery control section 122 determines that a fault has occurred, the fault recovery control section 122 acquires the setting information stored in the path management section 123 or the configuration management section 124 and calculates service chaining for fault recovery.
When the fault recovery control section 122 determines that the fault recovery processing is possible, the fault recovery control section 122 gives a setting change instruction to the uCPE-PF management part 130 or the VNF management part 140. After the uCPE-PF management part 130 or the VNF management part 140 completes setting change, the fault recovery control section 122 transmits a notification of change of the setting to the upper apparatuses such as the uCPE-PF controller 201, the VNF controller 203, etc.
Based on an instruction from the signal analysis section 121, the path management section 123 stores internal path information about service chaining in a storage part (not illustrated). When receiving a setting information acquisition request from the fault recovery control section 122, the path management section 123 transfers the internal path information stored in the storage part (not illustrated) to the fault recovery control section 122.
As with the path management section 123, the configuration management section 124 stores and transfers information about kinds of the VNFs implemented on the uCPE apparatus 100 and the virtual ports used in service chaining.
After receiving a control signal from an upper apparatus (the uCPE-PF controller 201 or the VNF controller 203), the communication part 110 in the uCPE apparatus 100 transfers the control signal to the uCPE-PF management part 130 or the VNF management part 140, depending on the control content. Next, the communication part 110 transfers the control signal to the site management part 120.
Upon reception of the control signal, the signal analysis section 121 of the site management part 120 analyzes the control signal.
When a result of the analysis in step S12 indicates that the control signal is a control signal relating to service chaining addressed to the uCPE-PF management part 130, the signal analysis section 121 forwards the control signal to the path management section 123, to cause the path management section 123 to store setting information. Next, the processing proceeds to step S14. If the result of the analysis in step S12 indicates that the control signal is a control signal about a VNF, the signal analysis section 121 forwards the control signal to the configuration management section 124, to cause the configuration management section 124 to store setting information. Next, the processing proceeds to step S16.
Upon reception of the control signal (a control signal relating to service chaining addressed to the uCPE-PF management part 130), the path management section 123 internally stores information about service-chaining-related internal paths in the uCPE apparatus 100 (path information about physical ports, logical ports, virtual switches, etc.) in a storage part.
After storing the internal path information, the path management section 123 forwards the control signal to the uCPE-PF management part 130, which is the original destination. The uCPE-PF management part 130 updates the uCPE-PF (uCPE platform) setting information.
Upon reception of the control signal (a control signal about a VNF), the configuration management section 124 stores information about a kind of the VNF, virtual ports used for the service chaining, etc. in a storage part (not illustrated).
After storing the information about the kind of the VNF, the virtual ports, etc., the configuration management section 124 forwards the control signal to the VNF management part 140, which is the original destination. The VNF management part 140 updates the VNF setting information stored in the storage part.
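By way of a non-limiting illustration, the store-then-forward handling of a control signal described above may be sketched in Python as follows. The class name `StoreAndForward` and the dictionary representation of setting information are hypothetical; the same shape could serve for either the path management section or the configuration management section.

```python
from typing import Callable, Dict, List

Setting = Dict[str, str]


class StoreAndForward:
    """Keeps a local copy of each setting before passing the signal on."""

    def __init__(self, forward: Callable[[Setting], None]) -> None:
        self._forward = forward  # the original destination of the signal
        self._stored: List[Setting] = []

    def handle(self, setting: Setting) -> None:
        self._stored.append(dict(setting))  # store a copy first
        self._forward(setting)              # then forward the signal unchanged

    def stored_settings(self) -> List[Setting]:
        # Return copies so callers cannot alter the stored information.
        return [dict(item) for item in self._stored]
```

Keeping the copy local is what later allows fault recovery to be calculated without querying an upper apparatus.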
When the communication part 110 in the uCPE apparatus 100 receives a management signal, which is to be transmitted to the upper apparatus (the uCPE-PF controller 201 and/or the VNF controller 203), from the uCPE-PF management part 130 and the VNF management part 140, the communication part 110 forwards the management signal to the site management part 120, before transmitting the management signal to the upper apparatus(es).
When the site management part 120 receives the management signal, the fault recovery control section 122 acquires service-chaining-related setting information used to analyze whether fault recovery processing is possible, from the path management section 123 and the configuration management section 124.
The fault recovery control section 122 performs analysis to determine whether a fault has occurred, based on the management signal. Next, if the fault recovery control section 122 determines that a fault has occurred, the fault recovery control section 122 determines whether reconfiguration of the service chaining, which bypasses the fault occurrence location and enables fault recovery, is possible, based on the various setting information acquired in step S22.
When the fault recovery control section 122 determines, as a result of the analysis in step S23, that fault recovery is possible by rearranging the service chaining, the fault recovery control section 122 performs fault recovery processing in step S26. When the fault recovery control section 122 determines that changing the service chaining will not achieve fault recovery, the processing proceeds to step S25 in which the fault recovery control section 122 transmits a fault notification to the upper apparatuses.
A fault notification indicating that fault recovery is not possible and that the fault is confirmed is transmitted from the communication part 110 of the uCPE apparatus 100 to the upper apparatuses (the uCPE-PF controller 201, the VNF controller 203, etc.). There are cases where the uCPE apparatus 100 cannot recover from a fault by changing the service chaining. For example, when there is a fault in a hardware apparatus, a network failure, or the like, a fault notification may be transmitted to the upper apparatus (e.g., the uCPE-PF controller 201), and necessary maintenance and recovery measures may be performed on the uCPE apparatus 100. A recovery completion notification may be transmitted to the upper apparatus(es).
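By way of a non-limiting illustration, the branching among ordinary handling, local fault recovery, and notification to the upper apparatuses may be sketched in Python as follows. The key `fault_location`, the action labels, and the `plan_bypass` callable (which stands in for the calculation of a bypassing service chaining) are hypothetical.

```python
from typing import Callable, Dict, List, Optional, Tuple

Plan = Optional[List[str]]


def decide_action(
    management_signal: Dict[str, str],
    plan_bypass: Callable[[str], Plan],
) -> Tuple[str, Plan]:
    """Return ("no_fault" | "recover_locally" | "notify_upper", plan)."""
    faulty = management_signal.get("fault_location")
    if faulty is None:
        return ("no_fault", None)      # ordinary management signal
    plan = plan_bypass(faulty)
    if plan is None:
        return ("notify_upper", None)  # local rearrangement cannot help
    return ("recover_locally", plan)
```

For example, passing a lookup such as `{"fw-1": ["rt-1", "fw-2"]}.get` as `plan_bypass` makes a fault in `fw-1` recoverable locally, while a fault in any other location results in a notification to the upper apparatuses.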
The fault recovery control section 122 calculates a service chaining that bypasses the fault occurrence location and calculates the setting changes needed to switch to the calculated service chaining.
The fault recovery control section 122 sends a setting change instruction about the setting calculated in step S31 to the uCPE-PF management part 130 and the VNF management part 140, to rearrange the service chaining.
When receiving a setting change completion notification from the uCPE-PF management part 130 and the VNF management part 140, the fault recovery control section 122 determines that the setting change has been completed. Next, the processing proceeds to step S34 in which a setting change notification is transmitted to the upper apparatuses (the uCPE-PF controller 201 and the VNF controller 203).
The setting change operation performed by the fault recovery control section 122 generates a difference between the setting information held by the uCPE-PF management part 130 and the VNF management part 140 and the corresponding setting contents held by the upper apparatuses (the uCPE-PF controller 201 and the VNF controller 203). The fault recovery control section 122 therefore transmits a setting change notification to the respective upper apparatuses (the uCPE-PF controller 201 and the VNF controller 203) so that no such difference remains between the setting contents held in the respective upper apparatuses and the actual setting contents.
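By way of a non-limiting illustration, the elimination of the difference between the local setting information and the setting contents held by an upper apparatus may be sketched in Python as follows. The notification format with `changed` and `removed` entries and both function names are hypothetical; the embodiments do not specify the format of the setting change notification.

```python
from typing import Dict

Settings = Dict[str, str]
Notification = Dict[str, Settings]


def build_change_notification(before: Settings, after: Settings) -> Notification:
    """Describe how the local settings changed during fault recovery."""
    changed = {key: value for key, value in after.items() if before.get(key) != value}
    removed = {key: before[key] for key in before if key not in after}
    return {"changed": changed, "removed": removed}


def apply_notification(upper: Settings, notification: Notification) -> Settings:
    """Update the copy held by an upper apparatus from a notification."""
    updated = {k: v for k, v in upper.items() if k not in notification["removed"]}
    updated.update(notification["changed"])
    return updated
```

When the upper apparatus held the same settings as the uCPE apparatus before the fault, applying the notification reproduces the settings in effect after recovery.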
As described above, calculation and setting of service chaining configured by a plurality of VNFs under a predetermined condition and in a predetermined order are originally performed in accordance with an instruction from an orchestrator that manages the corresponding lifecycle. However, due to a geographical distance between a uCPE apparatus in an individual site and the orchestrator and a load status of a control system, it takes time for a change of a setting of a service chaining to be reflected on the uCPE apparatus and a service down-time, etc., could be prolonged. Even in this situation, the above example embodiments can reduce the service down-time by implementing the site management part (120 in
In the above example embodiments, instead of performing control of the individual uCPE apparatus in an individual site only from the upper apparatus such as a controller in a data center, part of recovery control processing is autonomously and locally performed in the uCPE apparatus. The internal configuration and control method of the uCPE apparatus is not, as a matter of course, limited to what has been described in the above example embodiments. For example, the following modification or addition may be made to the configuration and control method described in the above example embodiments, as needed.
The present invention is applicable to, for example, hardware equipment such as a server and a network appliance in a virtualized environment, and provision of service using the hardware equipment.
The disclosure of PTL 1 cited above is incorporated herein in its entirety by reference thereto. It is to be noted that it is possible to modify or adjust the example embodiments or examples within the whole disclosure of the present invention (including the Claims) and based on the basic technical concept thereof. Further, it is possible to variously combine or select (or partially delete) a wide variety of the disclosed elements (including the individual elements of the individual claims, the individual elements of the individual example embodiments or examples, and the individual elements of the individual figures) within the scope of the disclosure of the present invention. That is, it is self-explanatory that the present invention includes any types of variations and modifications to be done by a skilled person according to the whole disclosure including the Claims, and the technical concept of the present invention. Particularly, any numerical ranges disclosed herein should be interpreted that any intermediate values or subranges falling within the disclosed ranges are also concretely disclosed even without specific recital thereof.
| Number | Date | Country | Kind |
|---|---|---|---|
| 2018-151993 | Aug 2018 | JP | national |

| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/JP2019/031353 | 8/8/2019 | WO | 00 |