Embodiments presented in this disclosure generally relate deployment and management of network fabrics and service meshes within the network fabrics. More specifically, embodiments disclosed herein include using network level performance insights to modulate health scores for service level applications, services, and/or micro-services.
In current network fabric designs and implementations, there is an increasing focus on using service meshes to provide Layer 7 (L7) services for applications on the network. A sidecar proxy is a proxy instance that is dedicated to a specific application instance (e.g., per application container pod) and communicates with other sidecar proxies executing on different nodes for information regarding the application.
Sidecar proxies are often used due to their utility and reliability in providing required services. For instance, middle proxies are not close enough to the workload to obtain information about the application instances without sending a query to each of the nodes. Additionally, middle proxies can be a single point of failure for a plurality of nodes. Sidecar proxies solve these issues by executing in tandem with the application instances and providing a distributed information source such that the failure of one does not condemn the whole system. However, sidecar proxies are inefficient in obtaining information regarding application/service instances executing on other nodes and the network wide service level performance of the application/service and updating a performance of the service instance based on the network wide service level performance.
So that the manner in which the above-recited features of the present disclosure can be understood in detail, a more particular description of the disclosure, briefly summarized above, may be had by reference to embodiments, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate typical embodiments and are therefore not to be considered limiting; other equally effective embodiments are contemplated.
To facilitate understanding, identical reference numerals have been used, where possible, to designate identical elements that are common to the figures. It is contemplated that elements disclosed in one embodiment may be beneficially used in other embodiments without specific recitation.
Overview
A system of one or more computers can be configured to perform particular operations or actions by virtue of having software, firmware, hardware, or a combination of them installed on the system that in operation causes or cause the system to perform the actions. One or more computer programs can be configured to perform particular operations or actions by virtue of including instructions that, when executed by data processing apparatus, cause the apparatus to perform the actions. One general aspect includes a method for service level performance updates based on network level factors including: receiving at a virtual network edge (VNE) instance on a node, one or more network health indicators, determining from the one or more network health indicators, a network level health-score, determining from the network level health-score a modulated health-score for a service instance executing on the node, and updating a health-score of the service instance at the service proxy. Other embodiments of this aspect include corresponding computer systems, apparatus, and computer programs recorded on one or more computer storage devices, each configured to perform the actions of the methods.
One example embodiment includes a system for service level performance updates based on network level factors, including: a processor; and a memory including instructions which, when executed on the processor, performs an operation. The operation includes: receiving at a virtual network edge (VNE) instance on a node, one or more network health indicators, determining from the one or more network health indicators, a network level health-score, determining from the network level health-score a modulated health-score for a service instance executing on the node, and updating a health-score of the service instance at the service proxy.
One example embodiment includes a computer program product for service level performance updates based on network level factors, the computer program product including a non-transitory computer-readable medium program having program instructions embodied therewith, the program instructions executable by a processor to perform an operation. The operation includes: receiving at a virtual network edge (VNE) instance on a node, one or more network health indicators, determining from the one or more network health indicators, a network level health-score, determining from the network level health-score a modulated health-score for a service instance executing on the node, and updating a health-score of the service instance at the service proxy.
Service mesh data plane implementations in networks include proxies such as sidecar proxies. While these sidecar proxies delegate certain functions to other network components, these proxies do not have detailed insight into network fabric-level events such as congestion or service-level agreement (SLA) violations at network edges. This lack of insight into network fabric-level events may have adverse effects on services that make application traffic flow decisions based only on layer 7 (L7) parameters (ex: application load balancers, etc.). For example, an application load balancer may continue routing traffic via network elements that are congested and/or have failed for a period of time before the load balancer can detect the issue.
In some examples, a service mesh manages service-level (i.e., L7) communication. The service meshes provide powerful primitives that can be used for at least three failure management strategies. For example, service meshes implement: dynamic routing, which can be used for different release and testing strategies such as canary routing, traffic shadowing, or blue/green deployments; resilience, which mitigates the impact of failures through strategies such as circuit breaking and rate limiting; and observability, which helps improve response time of application behavior by collecting metrics and adding context (e.g., tracing data) to service-to-service communication.
As discussed above, a service mesh is usually implemented as a collection of inter-connected sidecar proxies where sidecar proxies are typically deployed per container pod on nodes. Sidecar proxies often delegate responsibilities including service discovery and service health to external providers (such as a datacenter fabric) in order to conserve bandwidth at the container and node. As described herein, the sidecar proxies may delegate service discovery and service health to a virtual network edge (VNE) instance on the node.
The systems and methods described herein provide for service level performance updates based on network level factors by establishing a co-operative model between a network fabric and service proxies to enhance the service mesh failure management primitives as well as bring in network level intelligence in service (service instance) placement decisions in the fabric. The VNE instance, with delegated responsibilities, interacts with the network fabric including the next level switches (such as a top of rack switch) and a network controller in order to determine a network level health-score and a modulated health-score for a service instance executing on the node. The modulated health score causes actions such as an influence on load balancing, request routing, rolling upgrades, canary deployments, change in the utilization of network resources, a downgrade of service, etc., based on the network level health-score.
The compute nodes, nodes 102a-102d, may be any computer systems configured to communicate with a network, such as computer systems in a datacenter. In some examples, the nodes 102a-d may host one or more container environments, such as Docker or Kubernetes containers which may be hosted on server computers. Each container environment includes a stand-alone executable software package which provides particular services and includes every element needed to run the software package, including code, runtime, system libraries, settings, etc. The container environments may implement a Container Network Interface (CNI) networking model or a Container Network Model (CNM) to perform local programming.
The nodes 102a-d also execute one or more service instances 103a-103d. In some examples, the one or more service instances 103a-103d comprise an application instance for an application, where the application instance is executed in a container environment. The varying nodes may execute application instances of various services. In some examples, the service instances 103a-103d executing on the various nodes may differ in configuration and/or application type. The application instances may be executed as part of a service mesh. The service mesh comprises a configurable infrastructure layer for one or more applications.
In some examples, the nodes 102a-d each comprise a VNE instance (e.g., the VNEs 105a-105d) executing on the node. The VNEs 105a-d comprises executable software which identifies and authenticates end points and registers end-point identification information in a fabric host-tracking database. The VNEs may also contain a port in each container pod namespace and act as a bridge between the nodes 102 and the top of rack (ToR) switches (e.g., switches 110a-110b).
As described herein, each local VNE instance acts as the proxy for any delegated L7 service. A delegated L7 service is one in which the network fabric provides policy or statistics for enforcement to the service proxies. In some examples, the actual enforcement of any application behavior can still be done by the service proxy 104 (e.g., a sidecar proxy).
In some examples, service proxies 104a-d execute in the container environment on the respective nodes. The service proxies 103a-d may be configured to implement enforcement of policies relating to the respective service instances. For example, the service proxy 103a may function as an Envoy sidecar proxy to enforce policies for an Envoy service mesh. The service proxy 103a may be further configured to communicate with the VNE 105a to obtain information regarding the service/application. For example, the network fabric may be integrated with the service mesh by allowing the VNE 105a to provide key services to either the service instance 103a or the service proxy 104a.
In some examples, the service proxies 104a-d query the VNE instances for performing specific checks such as determining health of a service, endpoint discovery, rate limiting, etc. In response, the VNE instance delegates to the next-hop switch, e.g., the switches 110a-110b. The fabric controller, e.g., the controller 120, downloads the requested information into the switches 110a-110b and the response from switch is then cached at the VNE instance level. Thus allowing the node-level proxy to provide delegated service health statistics to the mesh proxies in a hub-and-spoke model.
In some examples, the VNE routes network traffic through the switches 110a-110b. The switches 110a-110b comprise network switches which interconnect nodes in a network (such as nodes in a datacenter). In some examples the switches 110a-110b may be connected to one or more spines based on a leaf/spine datacenter fabric, such as the CLOS network topology.
The network controller 120 is a central entity that manages operation of a network fabric, such as by providing overlay reachability information and end points-to-routing location mapping. An example of a fabric controller is the Cisco Application Policy Infrastructure Controller (APIC) which is a unifying point of automation and management for the Cisco Application Centric Infrastructure (ACI) fabric. The network controller 120 may be implemented on a computing system that is communicatively coupled to the nodes through the network fabric architecture.
In some examples, the network controller 120 also functions as a container orchestrator which acts as a resource manager for container environments operating on nodes. The container orchestrator may allocate and provide resources to container environments, provide instructions to systems instructions executing container environments, obtain and distribute attribute data regarding container environments, and otherwise manage deployment and implementation of container environments.
In some examples, the VNE 105a is also delegated service health monitoring functions. For example, at step 206, the service proxy 104a associated with the service instance 102a sends a service health request to the VNE 105a. When the VNE 105a receives the service health request from the service proxy 104a, the VNE 105a queries a next-level network element for the one or more network health indicators at step 208. For example, the VNE 105a will query and/or delegate the request for service health request to the switch 110a.
At step 210, the switch 110a queries the controller 120 for the network level network health indicators. At step 212, the fabric controller, controller 120, downloads the requested information into the switch 110a and at step 214 the response (which includes the network health indicators from the switch 110a) is cached at the VNE level in the VNE 105a from the switch 110a. For example, as also shown in block 252 of method 200 in
At step 216 in
In some examples, the VNE 105a uses the upstream/external/global signals received in the network health indicators 212 to determine the network level health-score and in turn, the VNE 105a modulates/generates/changes the health-score of a particular app/service, e.g., the service instance 103a that is located behind an affected leaf/spine. In some examples, the modulated health-score influences an upstream L7-based service (such as a load-balancer) to degrade the availability of a specific service instance and lower its priority to serve traffic. This results in a reduced level of service/application requests being routed to the degraded instance, (e.g., 103a) which can be temporarily degraded and/or migrated from the node 102a. For example, as described in relation to method 400 of
In some examples, the network health indicators include one or more congestion indicators, one or more node level indicators, and one or more network wide service level factors. For example, the congestion indicators may include indications of resource exhaustion or over-utilization at the leaf switch layer. These indicators may include signals related congestion-related resources such as buffer space, TCAM table space for ACLs, L2/L3 tables at a leaf that are needed to provide services to the service instance, etc. In some examples, the VNE 105a queries an upstream leaf for resource utilization metrics and determines the service utilization load on the leaf. For example, at block 302 of method 300 in
In some examples, the determination of the a network level health-score and the related modulation of the service instance health score is due to real or impending resource exhaustion or congestion being experienced by the upstream network as demonstrated by a queue buildup, such as in an L2/L3 table buildup etc. This congestion may be proactively triaged by the VNE instances by modulating/degrading health-scores to specific services instances hosted on the nodes in order to temporarily divert traffic.
In some examples, the determination of the a network level health-score and the related modulation of the service instance health score is related to a measure of the ratio of resource utilization of an service instance in the upstream switch, e.g., the switch 110a, to that service instance's usefulness to the service overall. For example, when the ratio is computed to be higher than a pre-defined threshold, then the VNE 105a modulates/degrades the service instance health score with an intent to evict the service instance 103a from the node 102a.
For example, the network fabric may track the percentage of resources consumed by a specific service/application instance as well as the percentage of overall traffic at the switch 110a consumed by the service instance 103a. In an example, where the percentage of resources used by the service instance is not proportional with the percentage of overall traffic for that service instance, then the affects the service instance are determined to be negatively affecting the switch and thus the associated VNE signals a an eviction in the modulated health-score, where the service instance is evicted/removed/transitioned from the node 102a.
In some examples, the one or more node level indicators include Node-level maintenance and/or high availability (HA) events. For example these indicators may include proactive notifications for nodes undergoing planned updates/maintenance. Returning to
In some examples, the one or more network wide service level factors include service-level insights from sensors in the network. This category of indicators relate to telemetry and other data gathered from various service/application level sensors which may include SLA violations and/or metrics that are customized for the specific service/application. At block 306 of
In some examples, AppD and/or other monitoring solutions are integrated with the controller 120 to provide visibility into issues/problems with services instances such as policy violations, anomalous network activity, maintenance (e.g., application needs to be updated), vulnerability alerts with application versions, etc. In general, these indicators relate to service health as measured by an external entity (e.g., the controller 120) which is integrated with the network fabric and not just service level health as computed by a service load balancer, etc.
In this example, the VNE instances 105a-d modulate/adjust the health-score in a spectrum based on the severity of the issue and the urgency to address it. For example, the health score for the service-instance may be downgraded severely for a critical issue and/or may also be downgraded to a lower degree for less critical issues in order for the service mesh to gradually adjust.
At step 220 in
Bus 550 represents one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. By way of example, and not limitation, such architectures include Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnects (PCI) bus.
Node 102 typically includes a variety of computer system readable media. Such media may be any available media that is accessible by Node 102, and it includes both volatile and non-volatile media, removable and non-removable media.
Memory 510 can include computer system readable media in the form of volatile memory, such as random access memory (RAM) and/or cache memory. Node 102 may further include other removable/non-removable, volatile/non-volatile computer system storage media. In some examples, 520 may be included as part of memory 510 and may typically provide a non-volatile memory for the networked computing devices, and may include one or more different storage elements such as Flash memory, a hard disk drive, a solid state drive, an optical storage device, and/or a magnetic storage device. For example, storage system, storage 520, can be provided for reading from and writing to a non-removable, non-volatile magnetic media (not shown and typically called a “hard drive”). Although not shown, a magnetic disk drive for reading from and writing to a removable, non-volatile magnetic disk (e.g., a “floppy disk”), and an optical disk drive for reading from or writing to a removable, non-volatile optical disk such as a CD-ROM, DVD-ROM or other optical media can be provided. In such instances, each can be connected to bus 550 by one or more data media interfaces. Storage 520 may include media for storing network health indicators 521, modulated health-scores 522 and other data described herein.
Memory 510 may include a plurality of program modules 515 for performing various functions related to the node described herein. The program modules 515 generally include program code that is executable by one or more of the processors 505. As shown, program modules 515 include the VNE module 511 and service modules 512 to perform the functions related to the VNE and service instances on a node. The program modules 515 may also interact with each other and storage system, storage 520, to perform certain functions as described herein.
In the current disclosure, reference is made to various embodiments. However, the scope of the present disclosure is not limited to specific described embodiments. Instead, any combination of the described features and elements, whether related to different embodiments or not, is contemplated to implement and practice contemplated embodiments. Additionally, when elements of the embodiments are described in the form of “at least one of A and B,” it will be understood that embodiments including element A exclusively, including element B exclusively, and including element A and B are each contemplated. Furthermore, although some embodiments disclosed herein may achieve advantages over other possible solutions or over the prior art, whether or not a particular advantage is achieved by a given embodiment is not limiting of the scope of the present disclosure. Thus, the aspects, features, embodiments and advantages disclosed herein are merely illustrative and are not considered elements or limitations of the appended claims except where explicitly recited in a claim(s). Likewise, reference to “the invention” shall not be construed as a generalization of any inventive subject matter disclosed herein and shall not be considered to be an element or limitation of the appended claims except where explicitly recited in a claim(s).
As will be appreciated by one skilled in the art, the embodiments disclosed herein may be embodied as a system, method or computer program product. Accordingly, embodiments may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit,” “module” or “system.” Furthermore, embodiments may take the form of a computer program product embodied in one or more computer readable medium(s) having computer readable program code embodied thereon.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for embodiments of the present disclosure may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, Smalltalk, C++ or the like and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider).
Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatuses (systems), and computer program products according to embodiments presented in this disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the block(s) of the flowchart illustrations and/or block diagrams.
These computer program instructions may also be stored in a computer readable medium that can direct a computer, other programmable data processing apparatus, or other device to function in a particular manner, such that the instructions stored in the computer readable medium produce an article of manufacture including instructions which implement the function/act specified in the block(s) of the flowchart illustrations and/or block diagrams.
The computer program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process such that the instructions which execute on the computer, other programmable data processing apparatus, or other device provide processes for implementing the functions/acts specified in the block(s) of the flowchart illustrations and/or block diagrams.
The flowchart illustrations and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments. In this regard, each block in the flowchart illustrations or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the Figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In view of the foregoing, the scope of the present disclosure is determined by the claims that follow.
Number | Name | Date | Kind |
---|---|---|---|
7823204 | Gupta et al. | Oct 2010 | B2 |
7843843 | Papp, III | Nov 2010 | B1 |
7979368 | Kapoor et al. | Jul 2011 | B2 |
8131838 | Bornhoevd | Mar 2012 | B2 |
8316438 | Bush | Nov 2012 | B1 |
8381297 | Touboul | Feb 2013 | B2 |
8489720 | Morford | Jul 2013 | B1 |
8762367 | Burger | Jun 2014 | B2 |
9043453 | Kielhofner | May 2015 | B1 |
9270559 | Raleigh et al. | Feb 2016 | B2 |
9407509 | Porras et al. | Aug 2016 | B2 |
9805345 | Dailianas | Oct 2017 | B1 |
9998339 | Brajkovic | Jun 2018 | B1 |
10033602 | Russell | Jul 2018 | B1 |
10044581 | Russell | Aug 2018 | B1 |
10142254 | Olofsson | Nov 2018 | B1 |
10230586 | Cordray | Mar 2019 | B2 |
10462061 | Lin | Oct 2019 | B1 |
10623390 | Rosenhouse | Apr 2020 | B1 |
10725825 | Sindhu | Jul 2020 | B2 |
10776091 | Wagner | Sep 2020 | B1 |
10819562 | Parra | Oct 2020 | B2 |
10846788 | Klein | Nov 2020 | B1 |
11202228 | Bartfai-Walcott | Dec 2021 | B2 |
20050232153 | Bishop | Oct 2005 | A1 |
20070104208 | Svensson | May 2007 | A1 |
20070274337 | Purpura | Nov 2007 | A1 |
20080025230 | Patel | Jan 2008 | A1 |
20080052387 | Heinz | Feb 2008 | A1 |
20100131639 | Narayana | May 2010 | A1 |
20100162348 | Narayanan | Jun 2010 | A1 |
20100177650 | Wittgreffe | Jul 2010 | A1 |
20100232370 | Jing | Sep 2010 | A1 |
20110176551 | Chawla | Jul 2011 | A1 |
20110185052 | Nakahira | Jul 2011 | A1 |
20130145375 | Kang | Jun 2013 | A1 |
20130173756 | Luna | Jul 2013 | A1 |
20130203399 | Gupta | Aug 2013 | A1 |
20140133293 | Ismail | May 2014 | A1 |
20140155043 | Gell | Jun 2014 | A1 |
20140237118 | Matthews | Aug 2014 | A1 |
20140269269 | Kovvali | Sep 2014 | A1 |
20150106502 | Shakhmetov | Apr 2015 | A1 |
20150117213 | Pinheiro | Apr 2015 | A1 |
20150195150 | Ritter | Jul 2015 | A1 |
20150215845 | Pinheiro | Jul 2015 | A1 |
20150222549 | Kakadia | Aug 2015 | A1 |
20150304409 | Steuer | Oct 2015 | A1 |
20150373588 | C K | Dec 2015 | A1 |
20160014632 | Siow | Jan 2016 | A1 |
20160043944 | Felstaine | Feb 2016 | A1 |
20160062795 | Hu | Mar 2016 | A1 |
20160065421 | Barghouthi | Mar 2016 | A1 |
20160080221 | Ramachandran | Mar 2016 | A1 |
20160080502 | Yadav | Mar 2016 | A1 |
20160094629 | Abushaban | Mar 2016 | A1 |
20160112269 | Singh | Apr 2016 | A1 |
20160182286 | Lunden | Jun 2016 | A1 |
20160262044 | Calin | Sep 2016 | A1 |
20160315912 | Mayya | Oct 2016 | A1 |
20160353461 | Vachuska | Dec 2016 | A1 |
20170006053 | Greenberg | Jan 2017 | A1 |
20170013501 | Kim | Jan 2017 | A1 |
20170063705 | Gilson | Mar 2017 | A1 |
20170078208 | Panin | Mar 2017 | A1 |
20170098086 | Hoernecke | Apr 2017 | A1 |
20170116016 | Kim | Apr 2017 | A1 |
20170126564 | Mayya | May 2017 | A1 |
20170237710 | Mayya | Aug 2017 | A1 |
20170244643 | Lawrence | Aug 2017 | A1 |
20170245170 | Henry | Aug 2017 | A1 |
20170351241 | Bowers | Dec 2017 | A1 |
20170353991 | Tapia | Dec 2017 | A1 |
20180006954 | Arora | Jan 2018 | A1 |
20180062928 | Beveridge | Mar 2018 | A1 |
20180063201 | Zhang | Mar 2018 | A1 |
20180091392 | Richards | Mar 2018 | A1 |
20180091394 | Richards | Mar 2018 | A1 |
20180091401 | Richards | Mar 2018 | A1 |
20180091413 | Richards | Mar 2018 | A1 |
20180115463 | Sinha | Apr 2018 | A1 |
20180123950 | Garg | May 2018 | A1 |
20190028407 | Perumal Vijayan | Jan 2019 | A1 |
20190042290 | Bailey | Feb 2019 | A1 |
20190045037 | Sukhomlinov | Feb 2019 | A1 |
20190052551 | Barczynski | Feb 2019 | A1 |
20190132211 | Yeung | May 2019 | A1 |
20190141561 | Altay | May 2019 | A1 |
20190280932 | Gazzetti | Sep 2019 | A1 |
20190306076 | Masputra | Oct 2019 | A1 |
20190312784 | Altay | Oct 2019 | A1 |
20200007413 | Kerpez | Jan 2020 | A1 |
20200007460 | Guim Bernat | Jan 2020 | A1 |
20200034353 | Innocenti | Jan 2020 | A1 |
20200036574 | Parra | Jan 2020 | A1 |
20200106706 | Mayya | Apr 2020 | A1 |
20200119992 | Yan | Apr 2020 | A1 |
20200267051 | Ranjbar | Aug 2020 | A1 |
20210194821 | Guim Bernat | Jun 2021 | A1 |
20230291674 | Richards | Sep 2023 | A1 |
Number | Date | Country | |
---|---|---|---|
20210036935 A1 | Feb 2021 | US |