Virtualization allows the abstraction and pooling of hardware resources to support virtual machines in a software-defined data center (SDDC). For example, through server virtualization, virtualization computing instances such as virtual machines (VMs) running different operating systems may be supported by the same physical machine (e.g., referred to as a “host”). Each VM is generally provisioned with virtual resources to run a guest operating system and applications. The virtual resources may include central processing unit (CPU) resources, memory resources, storage resources, network resources, etc. In practice, various network-related problems may occur, which adversely affects the performance of hosts and VMs.
In practice, a user (e.g., organization) may run various applications using “on-premise” data center infrastructure in a private cloud environment that is under the user's ownership and control. Alternatively or additionally, the user may run applications “in the cloud” using infrastructure under the ownership and control of a public cloud provider. In the latter case, it may be challenging to provide various services (e.g., firewall) for applications running in a public cloud environment.
In the following detailed description, reference is made to the accompanying drawings, which form a part hereof. In the drawings, similar symbols typically identify similar components, unless context dictates otherwise. The illustrative embodiments described in the detailed description, drawings, and claims are not meant to be limiting. Other embodiments may be utilized, and other changes may be made, without departing from the spirit or scope of the subject matter presented here. It will be readily understood that the aspects of the present disclosure, as generally described herein, and illustrated in the drawings, can be arranged, substituted, combined, and designed in a wide variety of different configurations, all of which are explicitly contemplated herein.
Challenges relating to service insertion in public cloud environments will now be explained in more detail using
In the example in
Throughout the present disclosure, the term “virtual network” in a public cloud environment may refer generally to a software-implemented network, such as a logical overlay network, that is logically isolated from at least one other virtual network in a public cloud environment. For example, virtual networks 101-102 may be Amazon Virtual Private Clouds (VPCs) provided by Amazon Web Services® (AWS). Amazon VPC and Amazon AWS are registered trademarks of Amazon Technologies, Inc. Using the AWS example in
VMs 110-120 will be explained in more detail using
Hosts 210A-B may each include virtualization software (e.g., hypervisor 214A/214B) that maintains a mapping between underlying hardware 212A/212B and virtual resources allocated to VMs 110-140. Hosts 210A-B may be interconnected via a physical network formed by various intermediate network devices, such as physical network devices (e.g., physical switches, physical routers, etc.) and/or logical network devices (e.g., logical switches, logical routers, etc.). Hardware 212A/212B includes suitable physical components, such as processor(s) 220A/220B; memory 222A/222B; physical network interface controller(s) or NIC(s) 224A/224B; and storage disk(s) 228A/228B accessible via storage controller(s) 226A/226B, etc.
Virtual resources are allocated to each VM to support a guest operating system (OS) and applications (see 112/122/132/142). Agent 114/124/134/144 may be configured on each VM 110/120/130/140 to perform any suitable processing to support packet handling (e.g., encapsulation and decapsulation), etc. Corresponding to hardware 212A/212B, the virtual resources may include virtual CPU, virtual memory, virtual disk, virtual network interface controller (VNIC), etc. Hardware resources may be emulated using virtual machine monitors (VMMs) 241-244, which may be considered as part of (or alternatively separated from) corresponding VMs 110-140. For example, VNICs 251-254 are virtual network adapters emulated by respective VMMs 241-244.
Although examples of the present disclosure refer to VMs, it should be understood that a “virtual machine” running on a host is merely one example of a “virtualized computing instance.” or “workload.” A virtualized computing instance may represent an addressable data compute node or isolated user space instance. In practice, any suitable technology may be used to provide isolated user space instances, not just hardware virtualization. Other virtualized computing instances may include containers (e.g., running within a VM or on top of a host operating system without the need for a hypervisor or separate operating system or implemented as an operating system level virtualization), virtual private servers, client computers, etc. Such container technology is available from, among others, Docker, Inc. The VMs may also be complete computational environments, containing virtual equivalents of the hardware and software components of a physical computing system. The term “hypervisor” may refer generally to a software layer or component that supports the execution of multiple virtualized computing instances, including system-level software in guest VMs that supports namespace containers such as Docker, etc.
Hypervisor 214A/214B further implements virtual switch 215A/215B to handle egress packets from, and ingress packets to, corresponding VMs 110-140. The term “packet” may refer generally to a group of bits that can be transported together from a source to a destination, such as message, segment, datagram, etc. The term “traffic” may refer generally to a flow of packets. The term “layer 2” may refer generally to a Media Access Control (MAC) layer; “layer 3” to a network or Internet Protocol (IP) layer; and “layer-4” to a transport layer (e.g., using transmission control protocol (TCP) or user datagram protocol (UDP)) in the Open System Interconnection (OSI) model, although the concepts described herein may be used with other networking models. Virtual switches 215A, 215B may be regarded as physical layer-2 switching devices implemented in software at the hypervisor layer. Collectively, a set a virtual switches may implement a logical switch distributed across multiple hosts. The logical switch is a conceptual abstraction that corresponds to the virtual network previously described.
Network manager 270, cloud service manager 280 and network controller 290 are example network management entities that facilitate management of various entities deployed in public cloud environment 100. An example network controller is the NSX controller component of VMware NSX® (available from VMware, Inc.) that resides on a central control plane. Network manager 270 (e.g., NSX manager) and cloud service manager 280 may be entities that reside on a management plane. Cloud service manager 280 may provide an interface for end users to configure their public cloud inventory (e.g., VMs 110-140) in public cloud environment 100. Management entity 270/280/290 may be implemented using physical machine(s), virtual machine(s), a combination thereof, etc.
Referring to
Conventionally, there are various challenges associated with east-west service insertion in public cloud environment 100, particularly for endpoints located within the same virtual network. In contrast with a private cloud environment with on-premise infrastructure, a user generally does not have any direct control over underlying hypervisors and hardware that support VMs 110-140. For example, a route table (see 103) is usually configured for VPC1101. Based on route (destination=CIDR1, target=local), any packet between VM1110 and VM2120 (i.e., same CIDR1=11.0.0.0/16) will be treated as local traffic within VPC1101. Since public cloud providers usually do not allow overriding of a CIDR block route in route table 103, it is challenging to steer traffic (originating in and destined for VPC1101) to service path 104 that is located outside of VPC1101.
East-West Service Insertion
According to examples of the present disclosure, east-west service insertion may be implemented for endpoints that are deployed in the same virtual network. For example in
In the example in
As used herein, the term “service path” may refer generally to a path between a source and a destination through which packets are steered to provide service(s) to the packets. A service path may include at least one “service virtualized computing instance” configured to provide a “service.” The term “service” may be any suitable networking or non-networking service, such as firewall, load balancing, NAT, intrusion detection system (IDS), intrusion prevention system (IPS), deep packet inspection (DPI), traffic shaping, traffic optimization, packet header enrichment or modification, packet tagging, content filtering, etc. It should be understood that the packet processing operation(s) associated with a service may or may not modify the content (i.e., header and/or payload) of the packets. The term “endpoint” may refer generally an originating node (“first endpoint” or “source endpoint”) or terminating node (“second endpoint” or “destination endpoint”) of a bi-directional inter-process communication flow.
In more detail,
At 310 in
At 330, 340 and 350 in
In one example (shown in
In the example in
It should be understood that service path 104 may include multiple SVMs (forming a service chain) that includes SVM1150. In practice, a service chain may represent an instantiation of an ordered set of service functions. Depending on the desired implementation, service path 104 may perform packet modification (i.e., decapsulated packet 172 is different to 174), or not (i.e., 172 same as 174). For example, SVM1150 implementing a firewall service usually does not modify the header and payload of a packet. In contrast, a NAT service will usually modify address information in a packet, such as by translating a private IP address to a public IP address, etc. Various examples will be discussed below using
Example Configuration
Using AWS as an example public cloud deployment, VMs 110-140 may be deployed in first virtual network=VPC1101 associated with CIDR1=11.0.0.0/16. Depending on the desired implementation, multiple subnets may be configured in VPC1101, each subnet being a subset of CIDR1=11.0.0.0/16. CGW1160 supports TIER-1 DR 520 and TIER-0 SR 530, which are deployed in second virtual network=VPC2102 associated CIDR2=12.0.0.0/16 in the example in
(a) High Availability (HA) Pairs
Referring first to 405-410 in
Similarly, SVM1150 may be deployed as a member of another HA pair. For example, SVM1150 may be assigned with role=primary (i.e., active), and SVM2 (not shown) with role=secondary (i.e., standby) using an active-standby configuration. When the active SVM fails, the standby SVM may take over as the active SVM. It should be understood that examples of the present disclosure may be implemented for active-active configuration, in which case all members are active at the same time.
To implement the active-standby configuration, each member of the HA pair is configured to detect the aliveness or failure of its peer. For example, a monitoring session may be established between members of the HA pair using any suitable fault detection or continuity check protocol, such as Border Gateway Protocol (BGP), etc. For example, using a monitoring session, members of each HA pair may monitor each other's status (i.e., alive or not) through control messages. HA members may also detect the aliveness by exchanging heartbeat messages.
(b) Tunnel Establishment
At 415 and 420 in
Using IPSec for example, encapsulated packets 640-650 in
(c) Route Information Exchange
At 425 and 430 in
Similarly, CGW1160 may generate and send a second route advertisement (see 542) via tunnel 105 to advertise default route information to SVM1150. In practice, a “default route” takes effect when no other route is available for an IP destination address according to a longest prefix match approach. For example, the default route is designated as “0.0.0.0/0” in IP version 4 (IPv4), and “::/0” in IP version 6 (IPv6). In response to receiving second route advertisement 520 via interface VT21142, SVM1150 updates its route information to store default route (destination=0.0.0.0/0, interface=VTI2). This way, SVM1150 may be configured to send packets to CGW1160 after performing packet processing. See 551 in
Any suitable inter-domain routing protocol (also known as gateway protocol) may be used for route advertisements 540-542, such as such as BGP, Open Shortest Path First (OSPF), Intermediate System to Intermediate System (IS-IS), etc. For example, BGP is an exterior gateway protocol that is used to exchange route information among routers in different autonomous systems.
(d) Hybrid Ports and Service Insertion Rules
At 435 in
Hybrid switch 520 is connected to logical router port 521 (labelled “LRP1”) of TIER-1 DR 520. See also routing information 553 that directs traffic destined for 11.0.0.0/16 via LRP1521. Hybrid switch 520 may be implemented collectively using agents 114-144 of respective VMs 110-140. Hybrid ports 511-514 may be implemented using respective agents 114-144. Using hybrid ports 511-154, a pair of VMs (e.g., VM1110 and VM2120) communicating within the same VPC 101 may retain their underlay IP address (usually assigned by the cloud provider). Example hybrid ports are discussed in related U.S. patent application Ser. No. 16/112,599 entitled “Intelligent Use of Peering in Public Cloud,” and U.S. patent application Ser. No. 16/112,597 entitled “Transitive Routing in Public Cloud.” These patent applications are incorporated herein by reference.
To facilitate east-west service insertion between VMs in the same VPC, service insertion rules may be configured to steer traffic to SVM1150 located outside of the VPC. In practice, a service insertion rule may be a (e.g., high priority) policy-based rule configured for VPC1101 with CIDR1=11.0.0.0/16, or a subnet within VPC1101. Each service insertion rule may specify a set of characteristic(s) to be matched to a packet, and an action to be performed when there is a match. The set may be include five-tuple information of a packet flow, such as source IP address, source port number (PN), destination IP address, destination PN, and protocol. Depending on the desired implementation, a group may be configured to represent a group of IP addresses, port numbers, protocols, etc. Additionally or alternatively, a characteristic may be derived from any suitable meta data associated with the packet.
In the example in
East-West Service Insertion
(a) Processing by Source Endpoint
Referring to
At 455 and 460 in
According to examples of the present disclosure, any suitable approach may be used to cause CGW1160 to steer “ENCAP1” 610 to SVM1150. In the example in
As discussed using
(b) Processing by Cloud Gateway
At 470 and 475 in
Further, at 480-482, CGW1160 performs route lookup to identify SVM1150, and generates and sends second encapsulated packet 620 (labelled “ENCAP2”) that includes inner packet (P1) and second outer header (O2). Outer header (O2) may be a tunnel header addressed from source tunnel IP address=IP-CGW to destination tunnel IP address=IP-Y, which is a routable IP address of SVM1150 having virtual service endpoint IP address=IP-SVM. Using IPSec for example, encapsulated packet 620 may be padded with encryption-related data (not shown for simplicity), such as ESP trailer data and ESP authentication data before being sent over tunnel 105. “ENCAP2” 620 may be sent over tunnel 105 via tunnel interface VTI1161 based on route information (destination=IP-SVM, interface=VTI1). See 552 in
(c) Processing by Service Path
At 483-484 in
At 485-486 in
(d) Forwarding Towards Destination (Directly)
At 487-488 in
In the example in
(e) Forwarding Towards Destination (Via Source)
Alternatively, “ENCAP4” 640 may be first sent to VM1110 to cause VM1110 to forward the processed packet to VM2120. An example is shown in
According to 489-490 in
It should be understood that example process 400 in
Variations
Another example is shown in
In contrast with the example in
On the return path, processed packet (P2*) 850 may be sent to VM1110 in an encapsulated form (see ENCAP4″ 840) before being forwarded to VM3130 via HP1511, hybrid logical switch 510 and HP2512. Processed packet (P*) 850 is then forwarded to APP3132, thereby completing the end-to-end packet forwarding process with east-west service insertion for VM3130 via VM1110.
Although examples of the present disclosure have been explained using tunnel 105, CGW1160 and SVM1150 may communicate natively (i.e., without any tunnel and encapsulation) in some scenarios. For example, a non-tunneling approach may be implemented by deploying both source endpoint (e.g., VM1110) and destination endpoint (e.g., VM2120 or VM3130) in VPC1101, while CGW1160 and SVM1150 are deployed in VPC2102 (i.e., different VPC). This way, from the perspective of CGW1160, CGW1160 may forward/receive packets in decapsulated form (i.e., natively) to/from SVM1150. From the perspective of SVM1150, SVM1150 may forward/receive decapsulated packets in decapsulated form to/from CGW1160. In this case, IPSec tunnels are not used between CGW1160 and SVM1150, which means it is not necessary to perform encryption and decryption operations, which may be resource-intensive.
Container Implementation
Although explained using VMs 110-140, it should be understood that public cloud environment 100 may include other virtual workloads, such as containers, etc. As used herein, the term “container” (also known as “container instance”) is used generally to describe an application that is encapsulated with all its dependencies (e.g., binaries, libraries, etc.). In the examples in
Computer System
The above examples can be implemented by hardware (including hardware logic circuitry), software or firmware or a combination thereof. The above examples may be implemented by any suitable computing device, computer system, etc. The computer system may include processor(s), memory unit(s) and physical NIC(s) that may communicate with each other via a communication bus, etc. The computer system may include a non-transitory computer-readable medium having stored thereon instructions or program code that, when executed by the processor, cause the processor to perform process(es) described herein with reference to
The techniques introduced above can be implemented in special-purpose hardwired circuitry, in software and/or firmware in conjunction with programmable circuitry, or in a combination thereof. Special-purpose hardwired circuitry may be in the form of, for example, one or more application-specific integrated circuits (ASICs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), and others. The term ‘processor’ is to be interpreted broadly to include a processing unit, ASIC, logic unit, or programmable gate array etc.
The foregoing detailed description has set forth various embodiments of the devices and/or processes via the use of block diagrams, flowcharts, and/or examples. Insofar as such block diagrams, flowcharts, and/or examples contain one or more functions and/or operations, it will be understood by those within the art that each function and/or operation within such block diagrams, flowcharts, or examples can be implemented, individually and/or collectively, by a wide range of hardware, software, firmware, or any combination thereof.
Those skilled in the art will recognize that some aspects of the embodiments disclosed herein, in whole or in part, can be equivalently implemented in integrated circuits, as one or more computer programs running on one or more computers (e.g., as one or more programs running on one or more computing systems), as one or more programs running on one or more processors (e.g., as one or more programs running on one or more microprocessors), as firmware, or as virtually any combination thereof, and that designing the circuitry and/or writing the code for the software and or firmware would be well within the skill of one of skill in the art in light of this disclosure.
Software and/or to implement the techniques introduced here may be stored on a non-transitory computer-readable storage medium and may be executed by one or more general-purpose or special-purpose programmable microprocessors. A “computer-readable storage medium”, as the term is used herein, includes any mechanism that provides (i.e., stores and/or transmits) information in a form accessible by a machine (e.g., a computer, network device, personal digital assistant (PDA), mobile device, manufacturing tool, any device with a set of one or more processors, etc.). A computer-readable storage medium may include recordable/non recordable media (e.g., read-only memory (ROM), random access memory (RAM), magnetic disk or optical storage media, flash memory devices, etc.).
The drawings are only illustrations of an example, wherein the units or procedure shown in the drawings are not necessarily essential for implementing the present disclosure. Those skilled in the art will understand that the units in the device in the examples can be arranged in the device in the examples as described, or can be alternatively located in one or more devices different from that in the examples. The units in the examples described can be combined into one module or further divided into a plurality of sub-units.
| Number | Name | Date | Kind |
|---|---|---|---|
| 8830834 | Sharma | Sep 2014 | B2 |
| 8989192 | Foo | Mar 2015 | B2 |
| 9130872 | Kumar | Sep 2015 | B2 |
| 9531850 | Nainar | Dec 2016 | B2 |
| 10021196 | Akers | Jul 2018 | B1 |
| 10148459 | Chiu | Dec 2018 | B2 |
| 10333822 | Jeuk et al. | Jun 2019 | B1 |
| 10536563 | Wang | Jan 2020 | B2 |
| 10560345 | Patel | Feb 2020 | B2 |
| 20100254385 | Sharma et al. | Oct 2010 | A1 |
| 20120082048 | Taft et al. | Apr 2012 | A1 |
| 20130163594 | Sharma et al. | Jun 2013 | A1 |
| 20130254762 | Cochran | Sep 2013 | A1 |
| 20140050223 | Foo et al. | Feb 2014 | A1 |
| 20140215010 | Liang et al. | Jul 2014 | A1 |
| 20160087940 | Miller | Mar 2016 | A1 |
| 20160119226 | Guichard et al. | Apr 2016 | A1 |
| 20160337234 | Duda et al. | Nov 2016 | A1 |
| 20160352538 | Chiu et al. | Dec 2016 | A1 |
| 20170126435 | Benny et al. | May 2017 | A1 |
| 20180115519 | Bonomi | Apr 2018 | A1 |
| 20180139073 | Han et al. | May 2018 | A1 |
| 20180241809 | Gandhi et al. | Aug 2018 | A1 |
| 20190123962 | Guo et al. | Apr 2019 | A1 |
| 20190245949 | Wang et al. | Aug 2019 | A1 |
| 20190391741 | Barinov | Dec 2019 | A1 |
| 20190392070 | Johnson et al. | Dec 2019 | A1 |
| 20200236046 | Jain | Jul 2020 | A1 |
| 20200236047 | Hira | Jul 2020 | A1 |
| Entry |
|---|
| International Search Report and Written Opinion of the International Searching Authority, International application No. PCT/US2019/036812, dated Oct. 9, 2019. |
| International Search Report and Written Opinion of the International Searching Authority, International application No. PCT/US2020/013954, dated Mar. 25, 2020. |
| Number | Date | Country | |
|---|---|---|---|
| 20210021486 A1 | Jan 2021 | US |