The invention is related to the field of communication systems and, in particular, to networks that provide network functions.
Service providers have traditionally implemented a physical network architecture to deploy network functions, such as routers, switches, gateways, servers, etc. For example, network functions were traditionally deployed as physical devices, where software was tightly coupled with the proprietary hardware. These physical network functions have to be manually installed into the network, which creates operational challenges and prevents rapid deployment of new network functions. To address these issues, service providers are turning to a virtualized network architecture, which is referred to as Network Functions Virtualization (NFV). In NFV, a Virtualized Network Function (VNF) is the implementation of a network function using software that is decoupled from the underlying hardware. A VNF may include one or more virtual machines (VM) running software and processes on top of servers, switches, storage, a cloud computing infrastructure, etc., instead of having dedicated hardware appliances for each network function.
The use of VNFs comes with several management considerations. If resources (e.g., compute, storage, network, etc.) of the virtualized architecture become overloaded, new resources may be deployed as needed. For example, an orchestration layer may deploy one or more new VMs to add capacity to the system. Presently, growth and de-growth are reactive in nature and contingent upon arriving at some trigger point (such as measurements taken that show resource usage beyond the engineered limits, or critical alarms that state loss of system functionality in part or whole). Reactive procedures are often inefficient, and service providers are looking for better ways of managing a network architecture.
Embodiments described herein provide predictive auto-scaling of VNFs of a network. Instead of having a network operator react to an event in the network and scale the VNFs to manage the event, the embodiments described herein predict when an event is about to occur, and automatically scale one or more of the VNFs to avert the event from occurring. Therefore, management of the network is handled in a more efficient manner.
One embodiment comprises an adaptive engine configured to auto-scale a VNF for a network of a network operator based on a set of auto-scaling rules. The adaptive engine comprises a storage device that stores initial rules for the set of auto-scaling rules. The adaptive engine also includes a processor that implements a learning module that monitors behavior of the network operator in scaling the VNF in response to events, and generates learned rules for the set of auto-scaling rules based on the behavior of the network operator. The processor of the adaptive engine implements a prioritizing module that adjusts a sequence of the initial rules and the learned rules in the set of auto-scaling rules. The processor of the adaptive engine implements a predicting module that predicts a future event in the network that will activate scaling of the VNF, and auto-scales the VNF for the network based on the set of auto-scaling rules before occurrence of the future event.
In another embodiment, each rule in the set of auto-scaling rules includes a trigger for scaling the VNF, and one or more actions to perform in response to the trigger to scale the VNF.
In another embodiment, the processor of the adaptive engine further implements a validating module that validates one or more of the initial rules based on the behavior of the network operator. A weighted value is assigned to each rule in the set of auto-scaling rules indicating a validity of its associated rule.
In another embodiment, the validating module increases the weighted value of an initial rule based on the number of times that the initial rule was executed by the network operator.
In another embodiment, the validating module validates the initial rule(s) when behavior of the network operator indicates that the network operator follows the action(s) in response to the trigger.
In another embodiment, the predicting module assigns an upper hysteresis value to each rule in the set of auto-scaling rules, predicts the future event that will trigger a scale-up or scale-out of the VNF when the upper hysteresis value is reached, and performs the action(s) in response to the upper hysteresis value being reached.
In another embodiment, the predicting module assigns a lower hysteresis value to each rule in the set of auto-scaling rules, predicts the future event that will trigger a scale-down or scale-in of the VNF when the lower hysteresis value is reached, and performs the action(s) in reverse in response to the lower hysteresis value being reached.
In another embodiment, the processor of the adaptive engine implements an adapting module that modifies the set of auto-scaling rules based on the behavior of the network operator.
In another embodiment, the prioritizing module orders the sequence of the initial rules and the learned rules in the set of auto-scaling rules based on a preference of the network operator and/or network conditions.
Another embodiment comprises a method for auto-scaling a VNF for a network of a network operator based on a set of auto-scaling rules. The method includes storing initial rules for the set of auto-scaling rules, monitoring behavior of the network operator in scaling the VNF in response to events, and generating learned rules for the set of auto-scaling rules based on the behavior of the network operator. The method further includes adjusting a sequence of the initial rules and the learned rules in the set of auto-scaling rules. The method further includes predicting a future event in the network that will activate scaling of the VNF, and auto-scaling the VNF for the network based on the set of auto-scaling rules before occurrence of the future event.
Another embodiment comprises a non-transitory computer readable medium embodying programmed instructions executed by a processor to implement an adaptive engine that auto-scales a VNF for a network of a network operator based on a set of auto-scaling rules. The instructions direct the processor to store initial rules for the set of auto-scaling rules. The instructions direct the processor to implement a learning module that monitors behavior of the network operator in scaling the VNF in response to events, and generates learned rules for the set of auto-scaling rules based on the behavior of the network operator. The instructions direct the processor to implement a prioritizing module that adjusts a sequence of the initial rules and the learned rules in the set of auto-scaling rules. The instructions direct the processor to implement a predicting module that predicts a future event in the network that will activate scaling of the VNF, and auto-scales the VNF for the network based on the set of auto-scaling rules before occurrence of the future event.
The above summary provides a basic understanding of some aspects of the specification. This summary is not an extensive overview of the specification. It is intended to neither identify key or critical elements of the specification nor delineate any scope of the particular embodiments of the specification, or any scope of the claims. Its sole purpose is to present some concepts of the specification in a simplified form as a prelude to the more detailed description that is presented later.
Some embodiments of the invention are now described, by way of example only, and with reference to the accompanying drawings. The same reference number represents the same element or the same type of element on all drawings.
The figures and the following description illustrate specific exemplary embodiments. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the embodiments and are included within the scope of the embodiments. Furthermore, any examples described herein are intended to aid in understanding the principles of the embodiments, and are to be construed as being without limitation to such specifically recited examples and conditions. As a result, the inventive concept(s) is not limited to the specific embodiments or examples described below, but by the claims and their equivalents.
Architecture 100 also includes Virtualized Network Functions (VNFs) 121-125. Each VNF 121-125 may comprise one or more virtual machines (VM) running different software and processes on top of NFV infrastructure 110. A VM is an operating system or application environment that is installed on software which imitates dedicated hardware. Specialized software called a hypervisor emulates a CPU, memory, hard disk, network, and/or other hardware resources, which allows the virtual machines to share the hardware resources. Each VNF 121-125 in the embodiments described herein perform one or more network functions. A network function is a “well-defined functional behavior” within a network, such as firewalling, domain name service (DNS), caching, network address translation (NAT), etc. Individual VNFs may be linked or chained (i.e., service chaining) together in a way similar to building blocks to offer a full-scale networking communication service.
Architecture 100 also includes management and orchestration layer 130. Management and orchestration layer 130 provides for planned automation and provisioning tasks within the virtualized environment. The tasks of orchestration include configuration management of compute resources 112, storage resources 113, and network resources 114. The tasks of orchestration also include provisioning of VMs and application instances, such as for VNFs 121-125. The tasks of orchestration may also include security and compliance assessment, monitoring, and reporting.
In the embodiments described herein, predictive auto-scaling is performed to scale-up or scale-out individual VNFs or a group of VNFs, or to scale-down or scale-in individual VNFs or a group of VNFs. A scale-up refers to deploying more resources (e.g., CPU, memory, hard disk, and network) for one or more VNFs, while scale-out refers to adding more nodes to the system. A scale-down refers to decreasing resources for one or more VNFs, while scale-in refers to reducing the number of nodes used in the system. An adaptive engine is described that predicts impending events that trigger auto-scaling to engage, performs the auto-scaling, measures the auto-scaling efficiency during overloads, and adapts its behavior for future events.
Instead of reacting to an event occurring within network 320 that requires scaling of one or more VNFs 321-325, adaptive engine 200 predicts when an event will occur within network 320, and automatically scales (auto-scale) one or more VNFs 321-325. This concept is referred to as predictive auto-scaling. Adaptive engine 200 is proactive in scaling the VNFs 321-325 to avert problems in providing the network functions.
The steps of method 400 may be described as phases of predictive auto-scaling. Step 402 represents an initialization phase, where initial rules for auto-scaling are pre-loaded onto adaptive engine 200 and stored in a storage device 202 (see
H: If trigger A1 occurs, then perform action B followed by action C;
H: If trigger A2 occurs, then perform actions D, E, and F in sequence;
H: If trigger A3 occurs, then execute trigger A4 and perform actions G and H;
H: If trigger A4 occurs, then perform actions I and J and then execute trigger A5;
H: If trigger A5 occurs, then wait 3600 seconds and perform action K.
The initial rules may be preceded by an “H:” or another indicator to connote that these are initially provisioned in adaptive engine 200. The set of auto-scaling rules 500 may also be assigned a weighted value that is used to indicate the validity of the rules. The initial rules may be assigned an initial weight indicated as “Init_w”. Adaptive engine 200 may perform a defensive check on the initial rules to ensure that they will not have an adverse impact on the VNF.
Steps 404-405 of
L: If trigger A6 occurs, then perform action B;
L: If trigger A7 occurs, then wait 1800 seconds and perform action C.
The learned rules may be preceded by an “L:” or another indicator to connote that these are rules learned by adaptive engine 200. Adaptive engine 200 may also assign a weighted value indicated by “L_W” to the learned rules. The rank of the learned rules is expected to be higher than “Init_W”, since “L_W” is based on learning in network 320. At this point, the ordering of the set of auto-scaling rules 500 may change based on rank or weight of the rules. Also, the rules in category “H:” may maintain the same rank as before, or may be validated in the learning phase. The learning phase may continue to further refine and augment the set of auto-scaling rules 500 in time.
Further, a reaction time may be set while adaptive engine 200 observes behavior of the network operator in performing an action in response to a specific trigger. For instance, if condition “x” generates reaction “y” by the network operator within time “t”, then adaptive engine 200 may be considered a valid learned rule. If the reaction “y” occurs after time “t”, then there probably there is no correlation between trigger “x” and reaction “y”. Therefore, adaptive engine 200 may not generate a learned rule.
Step 406 of
V: If trigger A1 occurs, then perform action B followed by action C;
H: If trigger A2 occurs, then perform actions D, E, and F in sequence;
V: If trigger A3 occurs, then execute trigger A4 and perform actions G and H;
V: If trigger A4 occurs, then perform actions I and J and then execute trigger A5;
V: If trigger A5 occurs, then wait 3600 seconds and perform action K;
L: If trigger A6 occurs, then perform action B;
L: If trigger A7 occurs, then wait 1800 seconds and perform action C.
Assume for example that adaptive engine 200 determines (e.g., based on the KPI) that trigger A1 has occurred. If adaptive engine 200 identifies that the network operator performs action B followed by action C in response to trigger A1, then adaptive engine 200 is able to validate this initial rule. Validation is performed on the initial rules in the set of auto-scaling rules 500. The learned rules are considered auto-validated, as adaptive engine 200 has already observed triggers and associated operator actions.
Adaptive engine 200 may also modify the weighted values of the initial rules that are validated in step 406. Adaptive engine 200 may increase the weighted values of an initial rule based on the number of times that the initial rule was executed by the network operator. For example, adaptive engine 200 may take the initial weight, and add a delta (D) multiplied by the number of times (N1) that an initial rule was executed by the network operator (Init W+N1*D).
Step 408 of
V: If trigger A1 occurs, then perform action B followed by action C;
V: If trigger A3 occurs, then execute trigger A4 and perform actions G and H;
V: If trigger A4 occurs, then perform actions I and J and then execute trigger A5;
V: If trigger A5 occurs, then wait 3600 seconds and perform action K;
L: If trigger A7 occurs, then wait 1800 seconds and perform action C.
L: If trigger A6 occurs, then perform action B;
H: If trigger A2 occurs, then perform actions D, E, and F in sequence.
In this example, the priorities for triggers A6 and A7 are swapped. The sequence of the rules indicates the order in which adaptive engine 200 will evaluate their associated triggers. The evaluation of the set of auto-scaling rules 500 starts from the top. The initial rules that were not validated during the learning or validation phases drop to the end of the sequence. When validated, these initial rules may climb up in the ordered list of execution, as their associated weights increase for each occurrence.
Step 410 of
V: If trigger A1 occurs, then perform action B, followed by actions C, D, E, F.
Steps 412-413 of
Adaptive engine 200 continually scales-up or scales-down the VNFs 321-325 in network 320 as needed to provide the network functions. Because adaptive engine 200 is predicting when events occur, it is able to scale-up a VNF 321-325 before a critical situation is reached so that network functions are not hindered. Adaptive engine 200 is also able to scale-down a VNF 321-325 as needed to return resources to an available pool. Adaptive engine 200 also continually learns from the actions taken in response to certain triggers so that it can more effectively manage network 320.
Any of the various elements or modules shown in the figures or described herein may be implemented as hardware, software, firmware, or some combination of these. For example, an element may be implemented as dedicated hardware. Dedicated hardware elements may be referred to as “processors”, “controllers”, or some similar terminology. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Moreover, explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (DSP) hardware, a network processor, application specific integrated circuit (ASIC) or other circuitry, field programmable gate array (FPGA), read only memory (ROM) for storing software, random access memory (RAM), non-volatile storage, logic, or some other physical hardware component or module.
Also, an element may be implemented as instructions executable by a processor or a computer to perform the functions of the element. Some examples of instructions are software, program code, and firmware. The instructions are operational when executed by the processor to direct the processor to perform the functions of the element. The instructions may be stored on storage devices that are readable by the processor. Some examples of the storage devices are digital or solid-state memories, magnetic storage media such as a magnetic disks and magnetic tapes, hard drives, or optically readable digital data storage media.
Although specific embodiments were described herein, the scope of the disclosure is not limited to those specific embodiments. The scope of the disclosure is defined by the following claims and any equivalents thereof.