Various embodiments and methods of the invention relate generally to a multi-cloud fabric system and particularly to cloud migration.
Data centers refer to facilities used to house computer systems and associated components, such as telecommunications (networking equipment) and storage systems. They generally include redundancy, such as redundant data communications connections and power supplies. These computer systems and associated components generally make up the Internet. A metaphor for the Internet is cloud.
A large number of computers connected through a real-time communication network such as the Internet generally form a cloud. Cloud computing refers to distributed computing over a network, and the ability to run a program or application on many connected computers of one or more clouds at the same time.
The cloud has become one of the, or perhaps even the, most desirable platform for storage and networking. A data center with one or more clouds may have server, switch, storage systems, and other networking and storage hardware, but actually served up by virtual hardware, simulated by software running on one or more networking machines and storage systems. Therefore, virtual servers, storage systems, switches and other networking equipment are employed. Such virtual equipment do not physically exist and can therefore be moved around and scaled up or down on the fly without any difference to the end user, somewhat like a cloud becoming larger or smaller without being a physical object. Cloud bursting refers to a cloud, including networking equipment, becoming larger or smaller.
Clouds also focus on maximizing the effectiveness of shared resources, resources referring to machines or hardware such as storage systems and/or networking equipment. Sometimes, these resources are referred to as instances. Cloud resources are usually not only shared by multiple users but are also dynamically reallocated per demand. This can work for allocating resources to users. For example, a cloud computer facility, or a data center, that serves Australian users during Australian business hours with a specific application (e.g., email) may reallocate the same resources to serve North American users during North America's business hours with a different application (e.g., a web server). With cloud computing, multiple users can access a single server to retrieve and update their data without purchasing licenses for different applications.
Cloud computing allows companies to avoid upfront infrastructure costs, and focus on projects that differentiate their businesses, not their infrastructure. It further allows enterprises to get their applications up and running faster, with improved manageability and less maintenance, and that enable information technology (IT) to more rapidly adjust resources to meet fluctuating and unpredictable business demands.
Fabric computing or unified computing involves the creation of a computing fabric system consisting of interconnected nodes that look like a ‘weave’ or a ‘fabric’ when viewed collectively from a distance. Usually this refers to a consolidated high-performance computing system consisting of loosely coupled storage, networking and parallel processing functions linked by high bandwidth interconnects.
The fundamental components of fabrics are “nodes” (processor(s), memory, and/or peripherals) and “links” (functional connection between nodes). Manufacturers of fabrics (or fabric systems) include companies, such as IBM and Brocade. These companies are examples of fabrics made of hardware. Fabrics are also made of software or a combination of hardware and software.
A data center employed with a cloud currently has limitations relative to efficient usage of its resources and other clouds' resources resulting in latency and inefficiency.
Briefly, a method of cloud migration includes copying, by a cloud migration manager, a meta data and configuration associated with an application of an existing tier, bringing up, by the cloud migration manager, another tier, applying the copied metadata and configuration associated with the application of the existing tier to the another tier so that the another tier resembles the existing tier and re-directing traffic intended for the existing tier to the another tier.
A further understanding of the nature and the advantages of particular embodiments disclosed herein may be realized by reference of the remaining portions of the specification and the attached drawings.
a-c show exemplary data centers configured using various embodiments and methods of the invention.
The following description describes methods and apparatus for optimization of control and service planes in a data center. Optimization includes data center backups using software-defined networking (SDN) by determining the optimal paths and re-routing to the optimal paths by dynamically reprogramming layer 2 switches to re-route the traffic to those optimized paths.
Referring now to
The network 112 includes switches, router, and the like and the resources 114 includes networking and storage equipment, i.e. machines, such as without limitation, servers, storage systems, switches, servers, routers, or any combination thereof.
The application layers 110 are each shown to include applications 118, which may be similar or entirely different or a combination thereof.
The plug-in unit 108 is shown to include various plug-ins (orchestration). As an example, in the embodiment of
The multi-cloud fabric system 106 is shown to include various nodes 106a and links 106b connected together in a weave-like fashion. Nodes 106a are network, storage, or telecommunication or communications devices such as, without limitation, computers, hubs, bridges, routers, mobile units, or switches attached to computers or telecommunications network, or a point in the network topology of the multi-cloud fabric system 106 where lines intersect or terminate. Links 106b are typically data links.
In some embodiments of the invention, the plug-in unit 108 and the multi-cloud fabric system 106 do not span across clouds and the data center 100 includes a single cloud. In embodiments with the plug-in unit 108 and multi-cloud fabric system 106 spanning across clouds, such as that of
While two clouds are shown in the embodiment of
In an embodiment of the invention, the multi-cloud fabric system 106 is a Layer (L) 4-7 fabric system. Those skilled in the art appreciate data centers with various layers of networking. As earlier noted, multi-cloud fabric system 106 is made of nodes 106a and connections (or “links”) 106b. In an embodiment of the invention, the nodes 106a are devices, such as but not limited to L4-L7 devices. In some embodiments, the multi-cloud fabric system 106 is implemented in software and in other embodiments, it is made with hardware and in still others, it is made with hardware and software.
Some switches can use up to OSI layer 7 packet information; these may be called layer (L) 4-7 switches, content-switches, content services switches, web-switches or application-switches.
Content switches are typically used for load balancing among groups of servers. Load balancing can be performed on HTTP, HTTPS, VPN, or any TCP/IP traffic using a specific port. Load balancing often involves destination network address translation so that the client of the load balanced service is not fully aware of which server is handling its requests. Content switches can often be used to perform standard operations, such as SSL encryption/decryption to reduce the load on the servers receiving the traffic, or to centralize the management of digital certificates. Layer 7 switching is the base technology of a content delivery network.
The multi-cloud fabric system 106 sends one or more applications to the resources 114 through the networks 112.
In a service level agreement (SLA) engine, as will be discussed relative to a subsequent figure, data is acted upon in real-time. Further, the data center 100 dynamically and automatically delivers applications, virtually or in physical reality, in a single or multi-cloud of either the same or different types of clouds.
The data center 100, in accordance with some embodiments and methods of the invention, functions as a service (Software as a Service (SAAS) model, a software package through existing cloud management platforms, or a physical appliance for high scale requirements. Further, licensing can be throughput or flow-based and can be enabled with network services only, network services with SLA and elasticity engine (as will be further evident below), network service enablement engine, and/or multi-cloud engine.
As will be further discussed below, the data center 100 may be driven by representational state transfer (REST) application programming interface (API).
The data center 100, with the use of the multi-cloud fabric system 106, eliminates the need for an expensive infrastructure, manual and static configuration of resources, limitation of a single cloud, and delays in configuring the resources, among other advantages. Rather than a team of professionals configuring the resources for delivery of applications over months of time, the data center 100 automatically and dynamically does the same, in real-time. Additionally, more features and capabilities are realized with the data center 100 over that of prior art. For example, due to multi-cloud and virtual delivery capabilities, cloud bursting to existing clouds is possible and utilized only when required to save resources and therefore expenses.
Moreover, the data center 100 effectively has a feedback loop in the sense that results from monitoring traffic, performance, usage, time, resource limitations and the like, i.e. the configuration of the resources can be dynamically altered based on the monitored information. A log of information pertaining to configuration, resources, the environment, and the like allow the data center 100 to provide a user with pertinent information to enable the user to adjust and substantially optimize its usage of resources and clouds. Similarly, the data center 100 itself can optimize resources based on the foregoing information.
The applications unit 202 is shown to include a number of applications 206, for instance, for an enterprise. These applications are analyzed, monitored, searched, and otherwise crunched just like the applications from the plug-ins of the fabric system 106 for ultimate delivery to resources through the network 204.
The data center 100 is shown to include five units (or planes), the management unit 210, the value-added services (VAS) unit 214, the controller unit 212, the service unit 216 and the data unit (or network) 204. Accordingly and advantageously, control, data, VAS, network services and management are provided separately. Each of the planes is an agent and the data from each of the agents is crunched by the controller unit 212 and the VAS unit 214.
The fabric system 106 is shown to include the management unit 210, the VAS unit 214, the controller unit 212 and the service unit 216. The management unit 210 is shown to include a user interface (UI) plug-in 222, an orchestrator compatibility framework 224, and applications 226. The management unit 210 is analogous to the plug-in 108. The UI plug-in 222 and the applications 226 receive applications of various formats and the framework 224 translates the various formatted application into native-format applications. Examples of plug-ins 116, located in the applications 226, are VMware ICenter, by VMware, Inc. and System Center by Microsoft, Inc. While two plug-ins are shown in
The controller unit 212 serves as the master or brain of the data center 100 in that it controls the flow of data throughout the data center and timing of various events, to name a couple of many other functions it performs as the mastermind of the data center. It is shown to include a services controller 218 and a SDN controller 220. The services controller 218 is shown to include a multi-cloud master controller 232, an application delivery services stitching engine or network enablement engine 230, a SLA engine 228, and a controller compatibility abstraction 234.
Typically, one of the clouds of a multi-cloud network is the master of the clouds and includes a multi-cloud master controller that talks to local cloud controllers (or managers) to help configure the topology among other functions. The master cloud includes the SLA engine 228 whereas other clouds need not to but all clouds include a SLA agent and a SLA aggregator with the former typically being a part of the virtual services platform 244 and the latter being a part of the search and analytics 238.
The controller compatibility abstraction 234 provides abstraction to enable handling of different types of controllers (SDN controllers) in a uniform manner to offload traffic in the switches and routers of the network 204. This increases response time and performance as well as allowing more efficient use of the network.
The network enablement engine 230 performs stitching where an application or network services (such as configuring load balance) is automatically enabled. This eliminates the need for the user to work on meeting, for instance, a load balance policy. Moreover, it allows scaling out automatically when violating a policy.
The flex cloud engine 232 handles multi-cloud configurations such as determining, for instance, which cloud is less costly, or whether an application must go onto more than one cloud based on a particular policy, or the number and type of cloud that is best suited for a particular scenario.
The SLA engine 228 monitors various parameters in real-time and decides if policies are met. Exemplary parameters include different types of SLAs and application parameters. Examples of different types of SLAs include network SLAs and application SLAs. The SLA engine 228, besides monitoring allows for acting on the data, such as service plane (L4-L7), application, network data and the like, in real-time.
The practice of service assurance enables Data Centers (DCs) and (or) Cloud Service Providers (CSPs) to identify faults in the network and resolve these issues in a timely manner so as to minimize service downtime. The practice also includes policies and processes to proactively pinpoint, diagnose and resolve service quality degradations or device malfunctions before subscribers (users) are impacted.
Service assurance encompasses the following:
The structures shown included in the controller unit 212 are implemented using one or more processors executing software (or code) and in this sense, the controller unit 212 may be a processor. Alternatively, any other structures in
VAS unit 214 uses its search and analytics unit 238 to search analytics based on distributed large data engine and crunches data and displays analytics. The search and analytics unit 238 can filter all of the logs the distributed logging unit 240 of the VAS unit 214 logs, based on the customer's (user's) desires. Examples of analytics include events and logs. The VAS unit 214 also determines configurations such as who needs SLA, who is violating SLA, and the like.
The SDN controller 220, which includes software defined network programmability, such as those made by Floodlight, Open Daylight, PDX, and other manufacturers, receives all the data from the network 204 and allows for programmability of a network switch/router.
The service plane 216 is shown to include an API based, Network Function Virtualization (NFV), Application Delivery Network (ADN) 242 and on a Distributed virtual services platform 244. The service plane 216 activates the right components based on rules. It includes Application Delivery Controller (ADC), web-application firewall, DPI, VPN, DNS and other L4-L7 services and configures based on policy (it is completely distributed). It can also include any application or L4-L7 network services.
The distributed virtual services platform contains an Application Delivery Controller (ADC), Web Application Firewall (Firewall), L2-L3 Zonal Firewall (ZFW), Virtual Private Network (VPN), Deep Packet Inspection (DPI), and various other services that can be enabled as a single-pass architecture. The service plane contains a Configuration agent, Stats/Analytics reporting agent, Zero-copy driver to send and receive packets in a fast manner, Memory mapping engine that maps memory via TLB to any virtualized platform/hypervisor, SSL offload engine, etc.
The controller 306 is analogous to the controller unit 212 of
The flow-through orchestration 302 is analogous to the framework 224 of
The profiler 320 is a test engine. Service controller 322 is analogous to the controller 220 and SLA manager 324 is analogous to the SLA engine 228 of
In the exemplary embodiment of
The plug-ins 116 and the flow-through orchestration 302 are the clients 310 of the data center 300, the controller 306 is the infrastructure of the data center 300. Virtual machines and SLA agents 305 are a part of the clouds 308 and 310.
Each of the tiers 1-N is shown to include distributed elastic 1-N, 408-410, respectively, elastic applications 412, and storage 414. The distributed elastic 1-N 408-410 and elastic applications 412 communicate bidirectional with the underlying physical NW 416 and the latter unilaterally provides information to the SDN controller 220. A part of each of the tiers 1-N are included in the service plane 216 of
The cloud providers 402 are providers of the clouds shown and/or discussed herein. The distributed elastic controllers 1-N each service a cloud from the cloud providers 402, as discussed previously except that in
As previously discussed, the distributed elastic analytics engine 214 includes multiple VAS units, one for each of the clouds, and the analytics are provided to the controller 232 for various reasons, one of which is the feedback feature discussed earlier. The controllers 232 also provide information to the engine 214, as discussed above.
The distributed elastic services 1-N are analogous to the services 318, 316, and 314 of
The underlying physical NW 416 is analogous to the resources 114 of
The tiers 406 are deployed across multiple clouds and are enablement. Enablement refers to evaluation of applications for L4 through L7. An example of enablement is stitching.
In summary, the data center of an embodiment of the invention, is multi-cloud and capable of application deployment, application orchestration, and application delivery.
In operation, the user (or “client”) 401 interacts with the UI 404 and through the UI 404, with the plug-in unit 108. Alternatively, the user 401 interacts directly with the plug-in unit 108. The plug-in unit 108 receives applications from the user with perhaps certain specifications. Orchestration and discover take place between the plug-in unit 108, the controllers 232 and between the providers 402 and the controllers 232. A management interface (also known herein as “management unit” 210) manages the interactions between the controllers 232 and the plug-in unit 108.
The distributed elastic analytics engine 214 and the tiers 406 perform monitoring of various applications, application delivery services and network elements and the controllers 232 effectuate service change.
In accordance with various embodiments and methods of the invention, some of which are shown and discussed herein, an Multi-cloud fabric is disclosed. The Multi-cloud fabric includes an application management unit responsive to one or more applications from an application layer. The Multi-cloud fabric further includes a controller in communication with resources of a cloud, the controller is responsive to the received application and includes a processor operable to analyze the received application relative to the resources to cause delivery of the one or more applications to the resources dynamically and automatically.
The multi-cloud fabric, in some embodiments of the invention, is virtual. In some embodiments of the invention, the multi-cloud fabric is operable to deploy the one or more native-format applications automatically and/or dynamically. In still other embodiments of the invention, the controller is in communication with resources of more than one cloud.
The processor of the multi-cloud fabric is operable to analyze applications relative to resources of more than one cloud.
In an embodiment of the invention, the Value Added Services (VAS) unit is in communication with the controller and the application management unit and the VAS unit is operable to provide analytics to the controller. The VAS unit is operable to perform a search of data provided by the controller and filters the searched data based on the user's specifications (or desire).
In an embodiment of the invention, the multi-cloud fabric system 106 includes a service unit that is in communication with the controller and operative to configure data of a network based on rules from the user or otherwise.
In some embodiments, the controller includes a cloud engine that assesses multiple clouds relative to an application and resources. In an embodiment of the invention, the controller includes a network enablement engine.
In some embodiments of the invention, the application deployment fabric includes a plug-in unit responsive to applications with different format applications and operable to convert the different format applications to a native-format application. The application deployment fabric can report configuration and analytics related to the resources to the user. The application deployment fabric can have multiple clouds including one or more private clouds, one or more public clouds, or one or more hybrid clouds. A hybrid cloud is private and public.
The application deployment fabric configures the resources and monitors traffic of the resources, in real-time, and based at least on the monitored traffic, re-configure the resources, in real-time.
In an embodiment of the invention, the Multi-cloud fabric can stitch end-to-end, i.e. an application to the cloud, automatically.
In an embodiment of the invention, the SLA engine of the Multi-cloud fabric sets the parameters of different types of SLA in rea-time.
In some embodiments, the Multi-cloud fabric automatically scales in or scales out the resources. For example, upon an underestimation of resources or unforeseen circumstances requiring addition resources, such as during a super bowl game with subscribers exceeding an estimated and planned for number, the resources are scaled out and perhaps use existing resources, such as those offered by Amazon, Inc. Similarly, resources can be scaled down.
The following are some, but not all, various alternative embodiments. The multi-cloud fabric system is operable to stitch across the cloud and at least one more cloud and to stitch network services, in real-time.
The multi-cloud fabric is operable to burst across clouds other than the cloud and access existing resources.
The controller of the multi-cloud fabric receives test traffic and configures resources based on the test traffic.
Upon violation of a policy, the multi-cloud fabric automatically scales the resources.
The SLA engine of the controller monitors parameters of different types of SLA in real-time.
The SLA includes application SLA and networking SLA, among other types of SLA contemplated by those skilled in the art.
The multi-cloud fabric may be distributed and it may be capable of receiving more than one application with different formats and to generate native-format applications from the more than one application.
The resources may include storage systems, servers, routers, switches, or any combination thereof.
The analytics of the multi-cloud fabric include but not limited to traffic, response time, connections/sec, throughput, network characteristics, disk I/O or any combination thereof.
In accordance with various alternative methods, of delivering an application by the multi-cloud fabric, the multi-cloud fabric receives at least one application, determines resources of one or more clouds, and automatically and dynamically delivers the at least one application to the one or more clouds based on the determined resources. Analytics related to the resources are displayed on a dashboard or otherwise and the analytics help cause the Multi-cloud fabric to substantially optimally deliver the at least one application.
a-c show exemplary data centers configured using embodiments and methods of the invention.
At 420, development testing and production environment is shown. At 422, an optional deployment is shown with a firewall (FW), ADC, a web tier (such as the tier 404), another ADC, an application tier (such as the tier 406), and a virtual database (same as the database 428). ADC is essentially a load balancer. This deployment may not be optimal and actually far from it because it is an initial pass and without the use of some of the optimizations done by various methods and embodiments of the invention. The instances of this deployment are stitched together (or orchestrated).
At 424, another optional deployment is shown with perhaps greater optimization. A FW is followed by a web-application FW (WFW), which is followed by an ADC and so on. Accordingly, the instances shown at 424 are stitched together.
b shows an exemplary multi-cloud having a public, private, or hybrid cloud 460 and another public or private or hybrid cloud 464 communication through a secure access 464. The cloud 460 is shown to include the master controller whereas the cloud 462 is the slave or local cloud controller. Accordingly, the SLA engine resides in the cloud 460.
c shows a virtualized multi-cloud fabric system spanning across multiple clouds with a single point of control and management.
In accordance with embodiments and methods of the invention, load balancing is done across multiple clouds.
Although the description has been described with respect to particular embodiments thereof, these particular embodiments are merely illustrative, and not restrictive.
Disclosed herein are methods and apparatus for creating and publishing user interface (UI) for any cloud management platform with centralized monitoring, dynamic orchestration of applications with network services, with performance and service assurance capabilities across multi-clouds.
For example, client tier 502, UI tier 504 and networking functions 106 are shown where the client tier 502 includes a web browser 508 that is in communication with a jquery or D3 in the UI tier 504 through HTTP and an API clients 510 of the client tier 102 is shown in communication with a hateoas of the UI tier 104 through REST. The UI tier 104 is also shown to include a dashboard and widgets (desired graphics/data).
The network functions 506 is shown in communication with the UI tier 504 and includes functions such as orchestration, monitoring, troubleshooting, data API, and so forth, which are merely examples of many others.
In operation, projects start at client tier 502, such as the web server 508, resulting in applications in the UI tier 504 and multiple tiers.
The plugin, such as one of the plugins 603, is installed on the CMP during load up time, and fetches the definition files from the controller 604 describing the complete workflow compliant with the respective CMP thereby eliminating the need for any update in the CMP for any changes in the workflow.
Further details of the controller 604 of
The data model 702 is shown to be in communication with the complier 704. The compiler 704 is shown to be in communication with various components, such as the database model 710, which is transmitted to and from the database 712. Further shown to be in communication with the compiler 704 are the Java script artifact 706 and the Yang artifact 708. It should be noted that these are merely two examples of artifacts. The artifact 706 is also in communication with the Yang artifact 708, which is in turn in communication with the data base model 710.
The compiler 704 receives an input model, i.e. data model 702, and automatically creates both the client side (such as client tier 502) and server side artifacts (such as artifacts 706 and 708) in addition to the data base model 710, needed for creation and publishing of the User Interface (UI). The data base model 710 is saved and retrieved from the database 712. The database model 710 is used by the UI to retrieved and save inputs from users.
A unique model of deploying multi-tiered VM's working in conjunction to offer the characteristics desired from an application are realized by the methods and apparatus of the invention. The unique characteristics being: Automatic stitching of network services required for tier functioning; and service-level agreement (SLA)-based auto-scaling model in each of the tiers.
Accordingly, the compiler 704 of the multi-cloud fabric system 106 of the data center 100 uses one or more data model(s) 702 to generate artifacts for use by a (master or slave) controller of a cloud, such as the clouds 1002-1006, thereby automating the process of building an UI to be input to the UI tier 504. To this end, artifacts are generated for orchestrated infrastructures automatically and a data-driven, rather than a manual approach, is employed, which can also be done among numerous clouds and clouds of different types.
The output of the compiler 704 is the combination of artifacts 706 and 708, and the database model 710 which in turn are used for creating the UI. The UI is then uploaded to (or used by) the servers 1012, 1014 and/or 1016 is an image of the UI and provided to the UI tier 504 of
The UI of UI tier 504 may display a dashboard showing various information to a user. UI tier 504, as shown in
In an embodiment and method of the invention, the compiler 704 generates artifacts based on the (master or slave) controller of the servers 1012, 1014, and/or 1016.
In an embodiment and method of the invention, the compiler 704 generates different artifacts for different controllers, for example, controllers of different clouds and cloud types.
The data model 702 used by the compiler 704 is defined for the UI to be created, on an on-demand basis and typically when clouds are being added or removed or features and being added or removed and a host of other reasons. The data model may be in any desired format, such as without limitation, XML.
Each server of each cloud, in
Each of the clouds 1002-1006 is shown to include databases 1008 and switches 1010, both of which are communicatively coupled to at least one server, typically the server that is in the cloud in which the switches and databases reside. For instance, the databases 1008 and switches 1010 of the cloud 1002 are shown coupled to the server 1012, the databases 1008 and switches 1010 of cloud 1004 are shown coupled to the server 1014, and the databases 1008 and switches 1010 of cloud 1006 are shown coupled to the server 1016. The server 1012 is shown to include a multi-cloud master controller 1018, which is analogous to the multi-cloud master controller 232 of
Clouds may be public, private or a combination of public and private. In the example of
In the embodiment of
In
In some embodiments, the links 1024 and 1026 are each virtual personal network (VPN) tunnels or REST API communication over HTTPS, while others not listed herein are contemplated.
As earlier noted, the databases 1008 each maintain information such as the characteristics of a flow. The switches 1010 of each cloud cause routing of a communication route between the different clouds and the servers of each cloud provide or help provide network services upon a request across a computer network, such as upon a request from another cloud.
The controllers of each server of each of the clouds makes the system 1000 a smart network. The controller 1018 acts as the master controller with the controllers 1020 and 1022 each acting primarily under the guidance of the controller 1018. It is noteworthy that any of the clouds 1002-1006 may be selected as a master cloud, i.e. have a master controller. In fact, in some embodiments, the designation of master and slave controllers may be programmable and/or dynamic. But one of the clouds needs to be designated as a master cloud. Many of the structures discussed hereinabove, reside in the clouds of
In an exemplary embodiment, each of the links 1024 and 1026 use the same protocol for effectuating communication between the clouds, however, it is possible for these links to each use a different protocol. As noted above, the controller 1018 centralizes information thereby allowing multiple protocols to be supported in addition to improving the performance of clouds that have slave rather than a master controller.
While not shown in
The build server 700 sends the output of the complier 704 to the UI tier 504 of
The controller 604, of
The build server 700 may be externally located relative to the clouds and its output provided to a user for upload onto the UI tier 504, which would reside in the cloud, i.e. the servers 1012, 1014, and/or 1016.
In accordance with another embodiment of the invention, dynamic network access controlling is performed to allow selected peopled who are normally blocked from accessing certain resources. Policies are used to guide data packets' traffic flow in allowing such access. To this end, dynamic threat management and optimization are performed. In the event of much traffic, L7 ADC load balancers are offloaded to L4 ADC load balancers.
Referring now to
The services controller 1102 of
The type of communication between the switches 1116 and the services controller 1102, through the SDN controller, is primarily control information. The switches 1116 provide data to another layer of network equipment, such as servers and routers (not shown in
The SDN controller 1104 programs the SDN switches 1116 in a flow-based manner, either as shown in
Dynamic network access control is the process of determining whether to allow or deny access to the network by devices using authentication based on the application or subscriber information gleaned from the packet data. Further explanation of the functionality of some of the foregoing components is shown and discussed relative to subsequent figures.
Dynamic threat management is the process of detecting threats in real time and taking actions to dynamically redirect the traffic to nodes that can quarantine the flow of data traffic and learn more about the threat for the purpose of dealing with it in a more direct manner in the future. An example is detection of a similar threat in the future that would result in automatic redirection of traffic to a trusted application that replicates the actual application.
Various control and service plane optimizations that can be achieved using the dynamic programmability aspect of the SDN switches and real time learning of network traffic are discussed in subsequent paragraphs.
Optimization of server-backups in data centers that use SDN, such as the embodiment of
Via traffic steering, dynamic high availability (HA), load balancing and upgrades may be made advantageously through SDN as opposed to, for example, using Linux-based or customer-specified devices to perform load balancing, done currently by prior art systems, which results in inefficiency and unnecessary complexity.
Fully automated networks are created, in accordance with methods and embodiments of the invention, by dynamically expanding/shrinking with auto steering—dynamic HA for any services/applications such as a firewall. Accordingly, upgrades are made easy by using SDN via dynamic traffic steering, also referred to as “service chaining”.
Further, adaptive bit rate (ABR) is done for video using SDN by having multiple servers, such as ones for video and others for other type of traffic. Based on how congested the links are, determining which server is best to use based on link and number of flows (configuration) and bit rate. Based on this determination, changing the traffic flow so that the traffic is directed to the server that is determined to be the best server for the particular use at hand. This determination is continually changing where different servers are employed based on what they are well, or better, suited for given the conditions at hand. A practical example is to determine that the traffic is video traffic and using a video server accordingly, but that some time later, the traffic changes and is no longer video traffic, the traffic is then re-directed to another suitable server rather than the video server.
Thus, in accordance with an embodiment and method of the invention, an open flow switch between the services controller 1102 and the SDN controller 1104 receives a first and subsequent data packets. The services controller saves the flow entries in the flow database 1108. Upon the receipt of the first data packet, the open flow switch directs the first packet to the services controller 1102, and may or may not create a flow entry depending upon whether one already exists or not. The services controller 1102 makes authentication decisions based on authentication information. Based on authentication policies, the open flow controller determines to allow or deny access to a corporate network based on authentication policies and if the open flow controller determines to deny access, the first packet is re-directed to an authentication server for access. For instance, corporations typically allow access to information by employees and officers on a need-to-know basis. Highly sensitive data may not be accessible to applications of most employees' devices, such as hand-held tablets and iPhones. Additionally, access may change throughout time based on the employees' job functions. Most employees' access to sensitive information may need to be blocked whereas a smaller group of employees may be allowed access. To this end, applications running on the former employees' devices are denied access to certain information perhaps residing on servers whereas applications on the latter group of employees' devices are allowed access after authentication. The data center 1100 achieves the same by performing the foregoing process and those to be discussed blow and shown in figures herein, dynamically and in real-time.
The controller 1202 is shown to be in communication with the servers 1208 and 1210 through the switch 1206. As noted above, the controller 1202 can dynamically program the switch 1206, which is shown to be in communication with the client host 1204. An example of a client host is an iPad or a personal computer or any web site trying to access the network. Pro-active rules are used to program the switch 1206 based on apriori knowledge of traffic by, for example, a services controller. The switch 1206 is used as a L4 load balancer, which reduces costs. This is an example of the optimization performed by the services controller 1102.
In an exemplary embodiment of the invention, the server 1208 is any L7-based network server. If any of the servers 1208 or 1210 go down, traffic is re-directed to the other by the switch 1206, accordingly, traffic flow is not affected and appear seamless to the user/client.
The numbers appearing in
In some embodiments of the invention, meta-data is extracted from incoming packets (content) (of information or data), using L4-L7 service elements. Device (or “services controller”) is used to extract meta-data from any L4-L7 service, such as but not limited to HTTP, DPI, IDS, firewall (FW), and others too many to list herein but contemplated. The device or services controller 1102 applies network-based actions such as the following:
Blocking traffic
Re-routing traffic
Apply quality-of-service (QOS) policies, such as giving one application priority over another application
Bandwidth and any other network policy
In another embodiments of the invention, subscriber information (information about who is trying to access) is extracted from policy control and rule function (PCRF) and other policy servers and the extracted information, such as but not limited to analytics, is used to dynamically apply network actions to the subscriber traffic.
In yet another embodiment of the invention, extracted analytics information by using protocol in packets, i.e. source, destination, and the like, based on 5 tuple is used as the analytics engine output to apply network actions thereto.
In another embodiment, based on apriority information, that which has been learned, apply network actions and a suitable caching technique can be used to learn the traffic flow, subscriber information regarding the content and determine adaptive network actions accordingly.
In yet another embodiment and method of the invention, the meta data obtained from various L4-L7 services can be pushed to various VAS such as an analytics engine, PCRF, Radius, and the like, to generate advanced network actions (based on information from both L4-L7 actions and VAS. That is, meta-data obtained from various L4-L7 services can be passed to third parties and from third party rules, actions that need to be applied can be performed.
In yet another embodiment and method of the invention, load information and other information from any orchestration system can be used to determine not only compatibility issues of various network elements, VAS, but also services chains, network actions, optimizing traffic paths, and other relevant analytics. Examples of other information are how loaded net services are in the future, rate-limited traffic to avoid overload, and the like. Further, information from the network elements may be collected to determine optimal and dynamic service chains. The collection of information is based on L4-L7 information and learned optimal path based on load information, extracted meta-data, and other suitable information
A data packet comes in to the switch 1306, at 1, and the switch 1306 directs the packets to the open flow controller 1304. Thereafter, the packet is sent to the open flow controller 1304, at 2. Next, the services controller 1302 receives the packet at 3 and makes authentication decisions based on authentication policies, at 4. Also, a flow entry is created by the services controller if one does not exist and the services controller performs orchestration. Next at 5, the open flow controller 1304 programs actions to allow or to deny access based on the authentication policies from the services controller 1102. Accordingly, the flow of packets may be re-directed at 6. Subsequent packets arrive at the switch 1306, and at 7, actions are taken, such as, without limitation, dropping a packet is taken at 8. Accordingly, authenticated devices are allowed access to corporate network and un-authenticated devices can be re-directed to authentication server(s) to obtain access. Also, authorized devices reach a specific domain. Policies or rules, which may be used to make authentication decisions, are based on the application that is trying to gain access. To use the example above, an employee's device, i.e. iPad or smart phone, runs applications that may be denied access to certain corporate information residing on servers. This information is applied by way of authentication information.
In
Authentication decisions are made at “4” based on authentication information from the Radius. Orchestration is done and actions are programmed to allow or deny access based on an authentication policy, at “5” and “6”.
The open flow controller 1304 is programmed to send a copy of packets received from the switch 1306.
In the example of
The embodiment and method of
In accordance with various embodiments and methods of the invention, identification of which subscriber traffic is for is made and used as traffic characteristics for decision-making. For example, such subscriber-awareness, VoIP or video traffic, or pure traffic (traffic characteristics), are used to dynamically adjust characteristics of the network, such as programming the L2 switches accordingly.
The controller 1504 is shown to include a network enablement engine 1506, a service level agreement (SLA) and elasticity engine 1508, and a multi-cloud engine 1510. The network enablement engine 1506 is analogous to the network enablement engine 230 of
The cloud 1502 is shown to include VMs 1-4, or VM 1514, VM 1516, VM 1518 and VM 1520. VMs 1518 and 1520 are each applications. The VM 1516 is an L7 ADC with application and/or zonal firewall (FW) capabilities. The VM 1514 is shown to include a L4 Application Delivery Controller (ADC) and communicates with the VM 1516 and 1520. The VMs 1520 and 1518 communicate with the VM 1516. The VM 1520 further communicates with the VM 1514.
The VMs 1516, 1518 and 1520 are each shown to include a statistic/SLA/configure agent that are in communication with the VM 1514.
Among other functions performed by the service plane 1512 in conjunction with the controller 1504 is offloading the L7 ADC VM 1516 onto the L4 ADC 1522 of the VM 1514 in times of high traffic. This clearly, optimizes the performance of the cloud 1502.
The SLA and elasticity engine 1508, at least in part, cause the service plane 1512 to be elastic. The engines 1508 and 1510 contribute to the service plane 1512 being a distributed plane.
It is understood that the configuration shown in
The cloud balancing and burst module 1610 is shown to include an HTTP client 1614, an event manager 1622, a database manager 1624, a cloud migration manager 1628, and a policy manager 1632. The module 1610 is shown included in the cloud 1601, which may be a public, private, or hybrid cloud. The module 1610 serves to perform live migration for an entire service or individual instances with the following:
Exemplary embodiments of the storage nodes 1636 include service chains, service instances, location, proximity server, proximity rack, proximity dc, and proximity region.
Cloud migration manager 1628 enables substantially live migration of any applications, network services that are tied to the applications, or an entire development or test environment from the hosted cloud onto any other target cloud.
When a user makes an organizational decision to move its application from one cloud, such as the OpenStack kernel virtual machine (KVM)-based cloud 1602, to a public cloud, such as Amazon EC2 1604, the cloud migration manager 1628 provides procedures and apparatus to migrate the application. In environments, such as test or development environments seamless migration across homogenous and heterogeneous clouds is performed by use of the migration manager 1628.
Applications are typically executed on VMs, which may also be referred to as “instances”.
In
Automatic migration is performed without any manual intervention and based on the configuration SLA policies. Further, based on the metrics received from the operating instances which amounts to live feedback and also based on historical data and compliant to the configured SLA policies, the cloud migration manager 1528 allows for automatically migrating (moving) instances between clouds.
The cloud migration manager 1528 is a part of the master controller 232 of
When the SLA policies associated with an application is being violated, the migration algorithm automatically triggers the migration of the application from the hosted cloud to another cloud. The migration can also be based on policies such as hosting an application on one cloud during a certain time of the day and moving it to another cloud during another time of the day. For example, for an application supporting a 24-hours-a-day and seven days-a-week organizations with offices located in the United States and Japan, it is desirable to execute the application in data centers that are located in United States during certain number of hours and migrate the application to another data center that is located in Japan during another time of the day in an effort to reduce network latency. Migrating service instances to be geo-co-located near the traffic source substantially reduces network latency and improves quality of service.
Migration of instances can also be based on policies to reduce end-user costs. For example, an instance can be migrated between clouds that are on different time-zones in an effort to have the utilization of the instance be at lower night rates for use of compute/storage resources, to the extent possible. Accordingly, the cloud migration manager 1528 automatically moves the instances of an application from one cloud to another based on the rate of hosting a cloud. This is referred to as cost-based migration. Cost-based migration can result in substantial reduction in the cost of executing an application in cloud(s).
The cloud migration manager 1528 attempts to automatically select a target host (cloud) that best matches the host on which the application is currently being executed as well as having characteristics similar to those of the latter host in order to effectuate graceful migration. As a result, migration seems substantially invisible to the user since the target host behaves substantially the same as does the host on which the application is executed before migration.
The cloud migration manager 1528 attempts to seamlessly migrate an application between private and public clouds, or between private clouds, or between public clouds. To move applications seamlessly between private and public (heterogeneous or hybrid) clouds, the cloud migration manager 1528 triggers a cloud management platform to deploy a VM on the target host while trying to minimize the down-time associated with this effort.
The cloud migration manager 1528 provides support for commercially available migration tools such as, without limitation, VMware vMotion, KVM Live Migration, or Amazon EC2 EBS-backed instances with a single common Representational state transfer (RESTful) application programming interface (API).
Migrating an instance of an application from one cloud to another cloud substantially increases the east/west traffic, i.e. the traffic between clouds, because the migration manager has to access the instance images and bring up the instances. Migrating an instance further increases latency due to the delay associated with a new/migrated VM and its preparation for being ready to take on the traffic. The cloud migration manager 1528 employs the following to accelerate the instance:
1. VM Snapshot manager: to decrease the latency and migration, instances of the application, if possible, are pre-copied (snapshot taken) to reduce the migration time. The cloud migration manager 1528 keeps track of resource-intensive VMs, and pre-copies them to enable shorter bring-up and migration times.
2. Live VM cloner: running instances of the applications that are cloned to instantiate or move between clouds intelligently using live VM cloning. Cloning helps to reduce setup latency drastically and is ready with a warmed-up cache. That is, the cache is already prepared. In an embodiment of the invention, the cache resides in the cloud balancing and burst module 1610. Live VM cloning and migration also implicitly provides clustering/high availability (HA)/failover. Once a VM is up (or operational) on the target host, any operation that is being performed on the original VM is also sent to the target VM until the cloning migration is complete and then the application is moved to the new host.
3. Adding elastic VMs: Elastic VMs may be added to address short-lived bursts in the traffic to an application. Tiny flavors of the VMs are used in such cases to reduce the temporary overhead associated with migrating an entire instance of the application and bringing-up a new target VM and to avoid unnecessary resource reservations. When the cloud migration manager 1528 recognizes SLA violations as being a temporary burst in traffic and not long-lived, it elastically adds temporary VMs to address the burst and once traffic dies down and the VMs are no longer required, the migration manager 1528 removes them. Thus, migration is avoided, therefore, resources are not unnecessarily tied up, and overhead is accordingly reduced.
In an embodiment of the invention, instances of images are securely transferred between clouds by the cloud migration manager 1528 using a built-in secure connection. The cloud migration manager 1528 establishes a secure tunnel between the source cloud and the target cloud for migration of instances of an application.
In one embodiment of the invention, the cloud migration manager 1528 migrates an entire tier between clouds.
In another embodiment of the invention, the cloud migration manager 1528 clones the tier/topology configuration or metadata (for example, of a source cloud) and applies the cloned tier/topology configuration to a different tier. This is done either for cloud duplication or deploying a new tier with new VM instances but with the same configuration characteristics as an existing tier. The cloud migration manager 1528, relative to an existing tier, copies the meta data and configuration associated with the application of the existing tier and brings up another tier resembling the original tier using the meta data and configuration from the existing tier. The resemblance to the original tier is caused by applying the copied metadata and configuration associated with the application of the existing tier to the tier that is to resemble the existing tier (brought up by the cloud migration manager 1528).
The foregoing method is particularly effective when applications are stateless. In such cases, the cloud migration manager 1628, instead of migrating an entire database, deploys an application in the target host and applies the metadata and configuration file of the source host. It is believed that an effective method of migration in accordance with a method and embodiment of the invention is to launch a new VM, apply the metadata and configuration file of the source host to the new VM, and thereafter redirect the traffic to the new VM. There is no need to move the data in the memory, which resides in the RAM of the VM, over to the target host.
The NoSQL database manager 1624 is shown to include driver 1626. In
The HTTP Client is shown to include FlexCloud Restful API 1612 and drivers 1616, 1618, and 1620. The drivers 1616, 1618, and 1620 provide abstraction layers for migrating VMs across various heterogeneous public and private clouds. Examples of public and private migration tools are vMotion employed by VMware-based clouds, KVM live migration employed by clouds such as OpenStack and Rackspace, or EBS-backed instance employed by Amazon EC2 clouds. The drivers 1616, 1618, and 1620 can be easily extended to support any future clouds.
Restful-based APIs 1612 convert REST APIs to the appropriate drivers 1616, 1618, or 1620 for communications with the particular cloud.
Although the description has been described with respect to particular embodiments thereof, these particular embodiments are merely illustrative, and not restrictive.
As used in the description herein and throughout the claims that follow, “a”, “an”, and “the” includes plural references unless the context clearly dictates otherwise. Also, as used in the description herein and throughout the claims that follow, the meaning of “in” includes “in” and “on” unless the context clearly dictates otherwise.
Thus, while particular embodiments have been described herein, latitudes of modification, various changes, and substitutions are intended in the foregoing disclosures, and it will be appreciated that in some instances some features of particular embodiments will be employed without a corresponding use of other features without departing from the scope and spirit as set forth. Therefore, many modifications may be made to adapt a particular situation or material to the essential scope and spirit.
This application claims priority to U.S. Provisional Patent Application No. 61/994,093, filed on May 15, 2014, by Rohini Kumar Kasturi, et al., and entitled “METHOD AND APPARATUS TO MIGRATE APPLICATIONS AND NETWORK SERVICES TO ANY CLOUD”, and is a continuation-in-part of U.S. patent application Ser. No. 14/702,649, filed on May 1, 2015, by Rohini Kumar Kasturi, et al., and entitled “METHOD AND APPARATUS FOR APPLICATION AND L4-L7 PROTOCOL AWARE DYNAMIC NETWORK ACCESS CONTROL, THREAT MANAGEMENT AND OPTIMIZATIONS IN SDN BASED NETWORKS”, which is a continuation-in-part of U.S. patent application Ser. No. 14/681,057, filed on Apr. 7, 2015, by Rohini Kumar Kasturi, et al., and entitled “SMART NETWORK AND SERVICE ELEMENTS”, which is a continuation-in-part of U.S. patent application Ser. No. 14/214,682, filed on Mar. 17, 2014, by Kasturi et al. and entitled “METHOD AND APPARATUS FOR CLOUD BURSTING AND CLOUD BALANCING OF INSTANCES ACROSS CLOUDS”, which is a continuation-in-part of U.S. patent application Ser. No. 14/214,666, filed on Mar. 17, 2014, by Kasturi et al., and entitled “METHOD AND APPARATUS FOR AUTOMATIC ENABLEMENT OF NETWORK SERVICES FOR ENTERPRISES”, which is a continuation-in-part of U.S. patent application Ser. No. 14/214,612, filed on Mar. 14, 2014, by Kasturi et al., and entitled “METHOD AND APPARATUS FOR RAPID INSTANCE DEPLOYMENT ON A CLOUD USING A MULTI-CLOUD CONTROLLER”, which is a continuation-in-part of U.S. patent application Ser. No. 14/214,572, filed on Mar. 14, 2014, by Kasturi et al., and entitled “METHOD AND APPARATUS FOR ENSURING APPLICATION AND NETWORK SERVICE PERFORMANCE IN AN AUTOMATED MANNER”, which is a continuation-in-part of U.S. patent application Ser. No. 14/214,472, filed on Mar. 14, 2014, by Kasturi et al., and entitled, “PROCESSES FOR A HIGHLY SCALABLE, DISTRIBUTED, MULTI-CLOUD SERVICE DEPLYMENT, ORCHESTRATION AND DELIVERY FABRIC”, which is a continuation-in-part of U.S. patent application Ser. No. 14/214,326, filed on Mar. 14, 2014, by Kasturi et al., and entitled, “METHOD AND APPARATUS FOR HIGHLY SCALABLE, MULTI-CLOUD SERVICE DEVELOPMENT, ORCHESTRATION AND DELIVERY”, which are incorporated herein by reference as though set forth in full.
Number | Date | Country | |
---|---|---|---|
61994093 | May 2014 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14702649 | May 2015 | US |
Child | 14712880 | US | |
Parent | 14681057 | Apr 2015 | US |
Child | 14702649 | US | |
Parent | 14214682 | Mar 2014 | US |
Child | 14681057 | US | |
Parent | 14214666 | Mar 2014 | US |
Child | 14214682 | US | |
Parent | 14214612 | Mar 2014 | US |
Child | 14214666 | US | |
Parent | 14214572 | Mar 2014 | US |
Child | 14214612 | US | |
Parent | 14214472 | Mar 2014 | US |
Child | 14214572 | US | |
Parent | 14214326 | Mar 2014 | US |
Child | 14214472 | US |