Graceful scaling in software driven networks

Description

TECHNICAL FIELD

This disclosure relates generally to data processing and, more specifically, to methods and system for scaling data networks.

BACKGROUND

The approaches described in this section could be pursued but are not necessarily approaches that have previously been conceived or pursued. Therefore, unless otherwise indicated, it should not be assumed that any of the approaches described in this section qualify as prior art merely by virtue of their inclusion in this section.

In a typical load balancing scenario, a service hosted by a group of servers is front-ended by a load balancer (LB) (also referred to herein as a LB device) which represents this service to clients as a virtual service. Clients needing the service can address their packets to the virtual service using a virtual Internet Protocol (IP) address and a virtual port. The LB will inspect incoming packets and, based on the policies/algorithms, will choose a particular server from the group of servers, modify the packet if needed, and forward the packet towards the server. On the way back from the server (optional), the LB will get the packet, modify the packet if needed, and forward the packet back towards the client.

The traditional approach for load balancing of network of servers has several drawbacks. For example, the network request load may stay lower than a maximum capacity of a LB device for a long time, which could lead to wasted resources. In another situation, a network request might exceed the maximum capacity a single LB device can handle. Generally speaking, the limitations of traditional load balancing of networks are due to there only being one or several static devices responsible for deciding how and where to send packets, which does not allow for dynamic changes in network configuration when one needs to scale down or scale up the resources.

Therefore, more efficient methods and systems for scaling data networks may be needed.

SUMMARY

This summary is provided to introduce a selection of concepts in a simplified form that are further described in the Detailed Description below. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.

The present disclosure is related to approaches for scaling in a data network (for example, a software driven network (SDN)). Specifically, a method for scaling in a data network may comprise receiving an indication of removing a first node from a plurality of nodes of the data network, generating a service policy, and sending the service policy to each of the plurality of nodes of the data network. The service policy reassigns service requests associated with the first node to a second node from the plurality of nodes of the data network. The first node is removed from the data network and the second node remains in the data network. The service policy can include a classification traffic map, wherein classes of service requests are assigned to nodes from the plurality of nodes of the data network.

In some embodiments, the method for scaling in a data network may include additional steps preventing interruption of heavy-duty connections served by the first node. For instance, before sending the service policy to each of the plurality of nodes of the data network, an indication of a presence of old connections associated with the first node can be received, and a redirection node can be created in a data network. A redirection policy can then be generated. The redirection policy contains an indication that service requests associated with the old connections should be sent from the second node via the redirection node to the first node. The redirection policy is sent to the second node and the redirection node. After sending the service policy to each of the plurality of nodes of the data network, the method for scaling in a data network can include waiting for an indication of finishing the old connection. Upon receiving the indication, the redirection policy is removed from the second node and the redirection node is removed from the data network.

In some embodiments, a method for scaling out a data network may comprise receiving an indication of a presence of a further node in the data network, generating a further node service policy, and sending the further node service policy to each of a plurality of nodes of the data network and to the further node. The further node service policy reassigns one or more of the service requests associated with any of the plurality of nodes of the data network to the further node.

In some embodiments, the method for scaling out a data network includes additional steps preventing interruption of heavy-duty connections served by one or more old nodes in plurality of the nodes of the data network. For example, before sending the further node service policy to each of the plurality of nodes of the data network, the method may include receiving an indication of a presence of old connections associated with an old node from the plurality of nodes of the data network. A redirection node can be created in the data network, and a redirection policy can be generated. The redirection policy indicates that service requests associated with the old connections should be sent from the further node via the redirection node to the old node. The redirection policy is sent to the redirection node and to the further node. After sending the further node service policy to each of the plurality of nodes of the data network and to the further node, the method may include waiting for an indication of finishing the old connections. Upon receiving the indication of finishing the old connections, the redirection policy is removed from the further node and the redirection node is removed from the data network.

According to another example embodiment, there is provided a system for scaling a data network. The system includes a cluster master and a plurality of nodes of a data network. The cluster master is configured to retrieve and analyze network data associated with the data network and service node data associated with one or more service nodes. The data network may be scaled in. For example, the cluster master can receive an indication of removing a first node from the plurality of nodes of the data network, generate a service policy, and send the service policy to each of the plurality of nodes of the data network. The service policy reassigns service requests associated with the first node to a second node from the plurality of nodes of the data network.

In other embodiments, the data network may be scaled out. For example, the cluster master can receive an indication of presence of a further node in the data network, generate a further node service policy, and send the further node service policy to each of the plurality of nodes of the data network and to the further node. The further node service policy reassigns one or more of the service requests associated with any of existing nodes of the plurality of nodes of the data network to the further node. In certain embodiments, the cluster master can be configured to carry out additional steps preventing interruption of heavy-duty connections served by the first node when the data network is scaled in or by one of the existing nodes when the data network scaled out.

In further example embodiments of the present disclosure, the steps of a method for scaling a data networks are stored on a machine-readable medium comprising instructions, which when implemented by one or more processors perform the recited steps. In yet further example embodiments, hardware systems or devices can be adapted to perform the recited steps. Other features, examples, and embodiments are described below.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments are illustrated by way of example, and not by limitation, in the figures of the accompanying drawings, in which like references indicate similar elements.

FIG. 1 shows an environment within which a method and a system for service load distribution in a data network can be implemented, according to an example embodiment.

FIG. 2 is a block diagram showing various modules of a system for service load distribution in a data network.

FIG. 3 is a scheme for service load distribution of a data network.

FIG. 4 is a scheme showing graceful scaling in a data network.

FIG. 5 is a scheme showing graceful scaling in a data network.

FIG. 6 is a scheme showing graceful scaling out a data network.

FIG. 7 is a process flow diagram showing a method for graceful scaling in a data network.

FIG. 8 is a process flow diagram showing a method for graceful scaling out a data network.

FIG. 9 shows a diagrammatic representation of a computing device for a machine in the example electronic form of a computer system, within which a set of instructions for causing the machine to perform any one or more of the methodologies discussed herein, can be executed.

DETAILED DESCRIPTION

The following detailed description includes references to the accompanying drawings, which form a part of the detailed description. The drawings show illustrations in accordance with example embodiments. These example embodiments, which are also referred to herein as “examples,” are described in enough detail to enable those skilled in the art to practice the present subject matter. The embodiments can be combined, other embodiments can be utilized, or structural, logical, and electrical changes can be made without departing from the scope of what is claimed. The following detailed description is therefore not to be taken in a limiting sense, and the scope is defined by the appended claims and their equivalents. In this document, the terms “a” and “an” are used, as is common in patent documents, to include one or more than one. In this document, the term “or” is used to refer to a nonexclusive “or,” such that “A or B” includes “A but not B,” “B but not A,” and “A and B,” unless otherwise indicated.

The present disclosure relates to scaling in a data network, such as an SDN. The techniques of the embodiments disclosed herein may be implemented using a variety of technologies. For example, the methods described herein are implemented in software executing on a computer system or in hardware utilizing either a combination of microprocessors or other specially designed application-specific integrated circuits (ASICs), programmable logic devices, or various combinations thereof. In particular, the methods described herein may be implemented by a series of computer-executable instructions residing on a storage medium such as a disk drive or computer-readable medium. It should be noted that methods disclosed herein can be implemented by a computer (e.g., a desktop computer, a tablet computer, a laptop computer, and a server), game console, handheld gaming device, cellular phone, smart phone, smart television system, and so forth.

According to an example embodiment, the method for graceful scaling a data network comprises receiving an indication of removing a first node from a plurality of nodes of the data network, generating a service policy, sending the service policy to each of the plurality of nodes of the data network, wherein the service policy reassigns service requests associated with the first node to a second node from the plurality of nodes of the data network. In some embodiments, the method for graceful scaling a data network comprises receiving an indication of a presence of a further node in the data network, generating a further service policy, and sending the further service policy to each of the plurality of nodes of the data network and to the further node, wherein the further service policy reassigns one or more of the service requests associated with any of the plurality of nodes of the data network to the further node.

Referring now to the drawings, FIG. 1 illustrates an environment 100 within which a method and a system for load distribution in an SDN can be implemented. The environment 100 includes a network 110, a client 120, a system 200 for load distribution, and servers 140. The client 120 includes a user or a host associated with the network 110.

The network 110 includes the Internet or any other network capable of communicating data between devices. Suitable networks include an interface with any one or more of, for instance, a local intranet, a PAN (Personal Area Network), a LAN (Local Area Network), a WAN (Wide Area Network), a MAN (Metropolitan Area Network), a virtual private network (VPN), a storage area network (SAN), a frame relay connection, an Advanced Intelligent Network (AIN) connection, a synchronous optical network (SONET) connection, a digital T1, T3, E1 or E3 line, Digital Data Service (DDS) connection, DSL (Digital Subscriber Line) connection, an Ethernet connection, an ISDN (Integrated Services Digital Network) line, a dial-up port such as a V.90, V.34 or V.34 bis analog modem connection, a cable modem, an ATM (Asynchronous Transfer Mode) connection, or an FDDI (Fiber Distributed Data Interface) or CDDI (Copper Distributed Data Interface) connection. Furthermore, communications may also include links to any of a variety of wireless networks, including WAP (Wireless Application Protocol), GPRS (General Packet Radio Service), GSM (Global System for Mobile Communication), CDMA (Code Division Multiple Access) or TDMA (Time Division Multiple Access), cellular phone networks, GPS (Global Positioning System), CDPD (cellular digital packet data), RIM (Research in Motion, Limited) duplex paging network, Bluetooth radio, or an IEEE 802.11-based radio frequency network. The network 110 can further include or interface with any one or more of an RS-232 serial connection, an IEEE-1394 (Firewire) connection, a Fiber Channel connection, an IrDA (infrared) port, a SCSI (Small Computer Systems Interface) connection, a USB (Universal Serial Bus) connection or other wired or wireless, digital or analog interface or connection, mesh or Digi® networking. The network 110 may include a network of data processing nodes that are interconnected for the purpose of data communication. The network 110 includes an SDN. The SDN includes one or more of the above network types. Generally, the network 110 includes a number of similar or dissimilar devices connected together by a transport medium enabling communication between the devices by using a predefined protocol. Those skilled in the art will recognize that the present disclosure can be practiced within a variety of network configuration environments and on a variety of computing devices.

As shown in FIG. 1, the client 120 sends service requests 150 to backend servers 140. The service requests 150 include an HTTP request, a video streaming request, a file download request, a transaction request, a conference request, and so forth. The servers 140 include a web server, a wireless application server, an interactive television server, and so forth. The system 200 for load distribution balance flow of the service requests 150 among traffic forwarding devices of the network 110. The system 200 for load distribution may analyze the flow of the service requests 150 and determine which and how many traffic forwarding devices of the network 110 are needed to deliver the service requests 150 to the servers 140.

FIG. 2 shows a block diagram illustrating various modules of a system 200 for service load distribution in an SDN, according to an example embodiment. The system 200 includes a cluster of devices eligible as a cluster master. The system 200 includes a cluster master 205 elected from these devices. The cluster master 205 is configured to keep track of the SDN and retrieve network data associated with the SDN. In an example embodiment, the network data include one or more of the following: a number of total connections, processing unit utilization, a memory status, a network connectivity, backend server capacity, and so forth. Furthermore, the cluster master 205 is configured to keep track of the service nodes and retrieve service node data associated with one or more service nodes. The service node data include one or more of the following: health, dynamic state, responsiveness of the one or more service nodes, and so forth. In other words, the cluster master 205 keeps track of the health of the network and each service node associated with the system 200. The cluster master 205 may analyze the retrieved network data and service node data. Based on the analysis, the cluster master 205 generates a service policy. The service policy includes a service address, a service node address, a service node load policy, a traffic distribution policy (also referred to as a traffic mapping), and so forth. The cluster master 205 may provide the generated service policy to the devices associated with the data network, such as service nodes and traffic classification engines.

In an example embodiment, the cluster master 205 is further configured to develop, based on the analysis, a further service policy. The further policy is associated with scaling out; scaling down; remedying; removing devices, such as service nodes, traffic classification engines, backend servers and so forth; and introducing new service nodes, traffic classification engines, backend servers, and so forth.

In an example embodiment, the cluster master 205 is further configured to facilitate an application programmable interface (not shown) for a network administrator to enable the network administrator to develop, based on the analysis, a further service policy using the retrieved network data and service node data and analytics. This approach allows application developers to write directly to the network without having to manage or understand all the underlying complexities and subsystems that compose the network.

In a further example embodiment, the cluster master 205 may include a backup unit (not shown) configured to replace the cluster master in case of a failure of the cluster master 205.

The system 200 may comprise a traffic classification engine 210. The traffic classification engine 210 may be implemented as one or more software modules, hardware modules, or a combination of hardware and software. The traffic classification engine 210 may include an engine configured to monitor data flows and classify the data flows based on one or more attributes associated with the data flows (e.g., uniform resource locators (URLs), IP addresses, port numbers, and so forth). Each resulting data flow class can be specifically designed to implement a certain service for a client. In an example embodiment, the cluster master 205 may send a service policy to the traffic classification engine 210. The traffic classification engine 210 may be configured to receive the service policy from the cluster master 205. Furthermore, the traffic classification engine 210 may be configured to receive one or more incoming service requests 215 (e.g. incoming data traffic from routers or switches (not shown)). Typically, the data traffic may be distributed from the routers or switches to each of the traffic classification engines 210 evenly. In an example embodiment, a router performs a simple equal-cost multi-path (ECMP) routing to distribute the traffic equally to all the traffic classification engines 210. The traffic classification engines 210 distribute the one or more service requests among one or more service nodes 220 according to the service policy. The traffic is distributed to the one or more service nodes 220 in an asymmetric fashion. The traffic to the service nodes 220 may be direct or through a tunnel (IP-in-IP or other overlay techniques). The traffic classification engine 210 may be stateless or stateful, may act on a per packet basis, and may direct each packet of the traffic to the corresponding service node 220. When there is a change in the state of the service nodes, the cluster master 205 sends a new service policy, such as a new traffic map, to the traffic classification engine 210.

The system 200 may comprise the one or more service nodes 220. The one or more service nodes 220 may include a virtual machine or a physical device that may serve a corresponding virtual service to which the traffic is directed. The cluster master 205 sends the service policy to the service nodes 220. The service nodes 220 may be configured to receive the service policy from the cluster master 205. Furthermore, the service nodes 220 receive, based on the service policy, the one or more service requests 215 from the traffic classification engine 210. The one or more service nodes 220 may process the received one or more service requests 215 according to the service policy. The processing of the one or more service requests 215 may include forwarding the one or more service requests 215 to one or more backend destination servers (not shown). Each service node 220 may serve one or more virtual services. The service nodes 220 may be configured to send the service node data to the cluster master 205.

According to further example embodiments, an existing service node may redirect packets for existing flows to another service node if it is the new owner of the flow based on the redistribution of flows to the service nodes. In addition, a service node taking over the flow may redirect packets to the service node that was the old owner for the flows under consideration, for cases where the flow state needs to be pinned down to the old owner to maintain continuity of service.

Furthermore, in an example embodiment, the cluster master 205 may perform a periodic health check on the service nodes 220 and update the service nodes 220 with a service policy, such as a traffic map. When there is a change in the traffic assignment and a packet of the data traffic in a flow reaches a service node 220, the service node 220 may redirect the packet to another service node. Redirection may be direct or through a tunnel (e.g., IP-in-IP or other overlay techniques).

It should be noted that if each of the devices of the cluster in the network performs the backend server health check, it may lead to a large number of health check packets sent to an individual device. In view of this, a few devices of the cluster may perform the backend server health check, and the result may be shared among the rest of the devices in the cluster. The health check may include a service check and a connectivity check. The service check may include determining whether the application or the backend server is still available. As already mentioned above, not every device in the cluster needs to perform this check. The check can be performed by a few devices, and the result propagated to the rest of the devices in the cluster. A connectivity check includes determining whether the service node can reach the backend server. The path to the backend server may be specific to the service node, so this may not be distributed across service nodes, and each device in the cluster may perform its own check.

In an example embodiment, the system 200 comprises an orchestrator 225. The orchestrator 225 may be configured to bring up and bring down the service nodes 220, the traffic classification engines 210, and backend servers. The orchestrator 225 may detect a presence of the one or more service nodes 220 and transmit data associated with the presence of the one or more service nodes 220 to the cluster master 205. Furthermore, the orchestrator 225 may inform the cluster master 205 of bringing up or bringing down the service nodes 220. The orchestrator 225 may communicate with the cluster master 205 and the service nodes 220 using one or more Application Programming Interfaces (APIs).

In an example embodiment, a centralized or distributed network database may be used and shared among all devices in the cluster of the system 200, such as the cluster master, the traffic classification engine, and other service nodes. Each device may connect to the network database and update tables according to its role. Relevant database records may be replicated to the devices that are part of the cluster. The distributed network database may be used to store configurations and states of the devices (e.g., to store data associated with the cluster master, the traffic classification engine, the one or more service nodes, and backend servers). The data stored in the distributed network database may include the network data and the service node data. The distributed network database may include tables with information concerning service types, availability of resources, traffic classification, network maps, and so forth. The cluster master 205 may be responsible for maintaining the distributed network database and replicating it to devices. The network database may be replicated to the traffic classification engines 210 and the service nodes 220. In an example embodiment, the network database may internally replicate data across the participant nodes.

In the embodiments described above, the system 200 comprises a dedicated cluster master 205, dedicated traffic classification engines 210, and dedicated service nodes 220. In other words, specific devices can be responsible for acting as the cluster master, the traffic classification engine, and the service node. In further example embodiments, the system 200 includes no dedicated devices acting as a cluster master. In this case, the cluster master functionality is provided by either the traffic classification engines or by the service nodes. Thus, one of the traffic classification engines or one of the service nodes is operable to act as the cluster master. In case the traffic classification engine or service node acting as the cluster master fails, another traffic classification engine or service node may be elected as the cluster master. The traffic classification engines and the service nodes not elected as the cluster master are configured as backup cluster masters and synchronized with the current cluster master. In an example embodiment, the cluster master consists of multiple active devices that can act as a single master by sharing duties among the devices.

In further example embodiments, the system 200 comprises a dedicated cluster master with no dedicated devices acting as traffic classification engines. In this case, the traffic classification may be performed by one of upstream routers or switches. Also, the service nodes may distribute the traffic among themselves. In an example embodiment, the cluster master and the service nodes are configured to act as a traffic classification engine.

In further example embodiments, the system 200 includes no devices acting as cluster masters and traffic classification engines. In this case, one of the service nodes is configured to also act as the cluster master. Upstream routers or switches can do the traffic classification. The cluster master programs the upstream routers with the traffic mapping. Additionally, the service nodes distribute the traffic among themselves.

It should be noted that bringing up new service nodes when the load increases and bringing down the service nodes when the load becomes normal can be performed gracefully, without affecting existing data traffic and connections. When the service node comes up, the distribution of traffic changes from distribution to n service nodes to distribution to (n+1) service nodes.

When a service node is about to be brought down, the traffic coming to this service node is redirected to other service nodes. For this purpose, a redirection policy associated with the service node about to be brought down may be created by the cluster muster and sent to the traffic distribution engine and/or the service nodes. Upon receiving the redirection policy, the traffic distribution engine directs the traffic to another service node.

In an example embodiment, the system 200 comprises, for example, a plurality of traffic distribution engines, each of which serves traffic to multiple services. Each of the traffic distribution engines may communicate with a different set of service nodes. In case one of the traffic distribution engines fails, another traffic distribution engine is configured to substitute the failed traffic distribution engine and to distribute the traffic of the failed traffic distribution engine to the corresponding service nodes. Therefore, each of the traffic distribution engines comprises addresses of all service nodes and not only addresses associated with the service nodes currently in communication with the traffic distribution engine.

FIG. 3 shows a block diagram 300 for load distribution of an SDN, according to an example embodiment. As shown, diagram 300 includes clients 120 (e.g., a computer connected to a network 110). The network 110 may include the SDN. The clients 120 may send one or more service requests for services provided by one or more servers of a virtual machine/server pool 305. These servers include web servers, wireless application servers, interactive television servers, and so forth. These service requests can be load balanced by a system for load distribution described above. In other words, the service requests of the clients 120 may be intelligently distributed among virtual machine/server pool 305 of the SDN.

The system for load distribution includes a service control plane 310. The service control plane 310 includes one or more data network applets 315 (for example, a real time data network applet). The data network applets 315 check the health and other data associated with the SDN 110 and the virtual machines 305. For example, the data network applets 315 may determine responsiveness of the virtual machine/server pool 305. Furthermore, the data network applets 315 monitor the total connections, central processing unit utilization, memory, network connectivity on the virtual machine/server pool 305, and so forth. Therefore, the data network applets 315 may retrieve fine-grained, comprehensive information concerning the SDN and virtual machine service infrastructure.

The retrieved health data may be transmitted to a service policy engine 320. In example embodiments, a cluster master 205 described above may act as the service policy engine 320. The service policy engine 320 may analyze the health data and, upon the analysis, generate a set of service policies 330 to scale up/down the services, to secure services, to introduce new services, to remove services, to remedy or repair failed devices, and so forth. The system for load distribution may further comprise an orchestrator (not shown) configured to bring up more virtual machines on demand. Therefore, in order to deliver a smooth client experience, the service requests may be load balanced across the virtual machine/server pool 305.

Furthermore, service policies 330 may be provided to an SDN controller 335. The SDN controller 335, in turn, may steer service requests, i.e., data traffic, across the network devices in the SDN. Effectively, these policies may control load balancing, availability, and functionality of the SDN network to scale up or scale down services.

Generally speaking, by unlocking the data associated with the network, service nodes, and the server/virtual machines from inside the network, transforming the data into relevant information and the service policies 330, and then presenting the service policies 330 to the SDN controller 335 for configuring the SDN 110, the described infrastructure may enable feedback loops between underlying infrastructure and applications to improve network optimization and application responsiveness.

The service control plane 310 working in conjunction with the controller 335 and the service policy engine 320 may create a number of deployment possibilities, which may offer an array of basic and advanced load distribution features. In particular, to provide a simple load balancing functionality, the SDN controller 335 and the service control plane 310 may provide some load balancing of their own by leveraging the capabilities of the SDN 110 or, alternatively, work in conjunction with an ADC 340, also referred to as a service data plane included in the SDN 110 to optionally provide advanced additional functionality.

In an example embodiment, the service control plane 310 may be standalone, i.e., without an ADC 340, virtual machines 305 or, when scaled up, may be programmed with a virtual Internet Protocol (VIP) address on a loopback interface of the virtual machines 305. Thus, for data traffic in need of simple service fulfillment, the service control plane 310 may establish simple policies for distributing service requests and instruct the SDN controller 335 to program network devices to distribute the service requests directly to different virtual machines/physical servers 305. This step may be performed over a physical or logical network.

In an example embodiment, while the service control plane 310 may work in cooperation with an ADC 340, the service control plane 310 may manage a set of service policy mapping service requests to one or more ADC devices for more sophisticated ADC functionality typically offered by a purpose built ADC device. The service control plane 310 may instruct the SDN controller 335 to program network devices such that the service requests, i.e., the traffic, may reach one or more ADCs 340. The ADC 340 then may relay the service request to a backend server over a physical or logical network.

In the described embodiment, several traffic flow scenarios may exist. In an example embodiment, only forward traffic may go through the ADC 340. If a simple functionality of the ADC 340 (e.g., rate limiting, bandwidth limiting, scripting policies) is needed, the forward traffic may traverse the ADC 340. The loopback interface on the servers may be programmed with the VIP address. Response traffic from the virtual machines 305 may bypass the ADC 340.

In a further example embodiment, forward and reverse traffic may traverse the ADC 340. If it is needed that the ADC 340 provide a more advanced functionality (e.g., transmission control protocol (TCP) flow optimization, secure sockets layer (SSL) decryption, compression, caching and so forth), the service control plane 310 may need to ensure that both the forward and reverse traffic traverses through the ADC 340 by appropriately programming the SDN 110.

FIG. 4 is a scheme 400 showing graceful scaling in a data network, according to an example embodiment. In example scheme 400, two service nodes 420 (B1) and 425 (B2) receive data traffic from node 410 (D1) belonging to traffic classification engine 210 (shown in FIG. 2). The service node 425 is being removed from the data network. The node 410 sends the data traffic according to a service policy described in traffic map 440 generated by a cluster master 205 (shown in FIG. 2).

In the example traffic map 440, the column A contains “active” nodes and the column S contains the “stand by” nodes for service requests belonging to class C1 and C2. According to the traffic map 440, the all service requests belonging to class C1 are sent first to the node B1 and, if node B1 cannot handle the received service requests, to the node B2. Correspondingly, all service requests classified to class C2 are sent first to the node B2, and, if node B2 cannot handle the received packet, to the node B1.

In order to remove the service node B2, the cluster master 205 first generates new traffic map 450, wherein the node B2 is removed from the “stand by” column. Another service node from the data network, for example Bk, may be assigned as the “stand by” node for the packets belonging to class C1.

Further, the cluster master generates another traffic map 460, wherein the node B2 is replaced as “active” node by the “stand by” node for the corresponding class of the service requests. In example scheme 400, the B2 node is replaced by the B1 node. The cluster master sends the copy of the new traffic map 460 to the nodes of the traffic classification engine and service nodes of the data network. The B2 node can be safely removed or shutdown from the data network.

It should be noted that example scheme 400 for graceful scaling in a data network is suitable for removing nodes involved in connections with light traffic flow.

FIG. 5 is a scheme 500 showing graceful scaling in a data network, according to another example embodiment. The example scheme 500 is suitable for removing nodes involved in heavy duty traffic connections (for example, SSL (Secure Sockets Layer) type connections.)

In example scheme 500, two service nodes 420 (B1) and 425 (B2) receive data traffic from node 410 (D1) belonging to traffic classification engine 210 (shown in FIG. 2). The node 410 sends the data traffic according to a service policy described in traffic map 440 generated by a cluster master 205 (shown in FIG. 2). The service node 425 is needed to be removed from the data network.

In order to remove the B2 node, the cluster master generates and sends to the traffic classification engine and service nodes new traffic map 460 as described in FIG. 4. Before generating and sending the new traffic map 460, an indication of a presence of and old connections associated with the B2 node is received, and a redirection node 520 (R1) is created. The cluster master 205 sends a redirection service policy to the nodes B1 and R1. According to the redirection service policy, all service requests belonging to the old connections should be sent from the node B1 through the redirection node R1 to the B2 node. All other service requests flow through the B1 node according to the new traffic map 460. After the cluster map receives an indication from the service node B2 that all old connections are finished, the redirection node R1 may be removed from data network and the redirection service policy can be cleared from the B1 node. The B2 service node can be safely removed from the data network.

FIG. 6 is a scheme 600 showing graceful scaling out a data network, according to an example embodiment. In example scheme 600 the service node 420 (B1) receives data traffic from node 410 (D1) belonging to traffic classification engine 210 (shown in FIG. 2). The node 410 sends the data traffic according to a service policy described in traffic map 640 generated by a cluster master 205 (shown in FIG. 2). The service node 425 (B2) is added to the data network.

In the example traffic map 640, the column A contains “active” nodes and the column S contains the “stand by” nodes for service requests belonging to class C1 and C2. According to the traffic map 640, all service requests belonging to classes C1 and C2 are send first to the node B1 and, if node B1 cannot handle the received service requests, then the packets of class C1 are sent to the node Bk and the packets of class C2 are sent to node Bn.

After service node B2 becomes active in the data network, the cluster master may generate new traffic map 660. According to new traffic map 660, all service requests classified to class C2 should be sent to the node B2. The cluster master sends the new traffic map to nodes of traffic classification engine and all service nodes in the data network.

If the service node B1 was handling a heavy traffic connection before adding new node B2, such as a SSL connection, then before the cluster master generates and sends the new traffic map 660, a redirection node 620 (R2) can be created. The cluster master sends a redirection service policy to the node B2 and R2. According to the redirection service policy, all service requests belonging to old connections that flowed through the B1 node before generating the new traffic map 460 should be sent from the B2 node through the redirection node R2 to the B1 node. All other service requests flow through the B2 node according to the service policy described in the new traffic map 460. After the cluster map receives an indication from the service node B1 that all old connections are finished, the redirection node R2 may be removed from data network and the redirection service policy can be cleared from the B2 node.

FIG. 7 is a process flow diagram showing a method 700 for graceful scaling in a data network, according to an example embodiment. The method 700 can be performed by processing logic that may comprise hardware (e.g., decision making logic, dedicated logic, programmable logic, and microcode), software (such as software running on a general-purpose computer system or a dedicated machine), or a combination of both. In particular, the method 700 can be implemented in example SDN shown in FIG. 2.

The method 700 may commence in step 702 with receiving an indication of removing a node from a plurality of nodes in a data network. In step 704, the method 700 proceeds further with generation of a service policy, and the service policy reassigns service requests associated with the first node to a second node from the plurality of nodes in the data network.

The method 700 may proceed further with following optional steps 706-712. In optional step 706, the method 700 continues with receiving an indication of a presence of old connections associated with the first node. In step 708, a redirection node is created in data network. In step 710, a redirection policy is generated. The redirection policy contains an indication that service requests belonging to old connections should be sent from the second node via the redirection node to the first node. In step 712, the redirection policy is sent to the second node and the redirection node.

In step 714, the service policy generated in step 704 is sent to each node in the plurality of nodes in the data network. The method 700 can further proceed with following optional steps 716-720. In optional 716, the method 700 proceeds with waiting for an indication that all old connections in the first node are finished. In step 718, the redirection policy is removed from the second node. In step 720, the redirection node is removed from the data network.

FIG. 8 is a process flow diagram showing a method 800 for graceful scaling out a data network, according to an example embodiment. The method 800 can be performed by processing logic that may comprise hardware (e.g., decision making logic, dedicated logic, programmable logic, and microcode), software (such as software running on a general-purpose computer system or a dedicated machine), or a combination of both. In particular, the method 800 can be implemented in example SDN shown in FIG. 2.

The method 800 may commence in step 802 with receiving an indication of a presence of a further node in a plurality of nodes of a data network. In step 804, a further node policy is generated. The further node policy reassigns to the further node one or more service requests associated earlier with any of plurality of nodes of the data network.

The method 800 may further proceed with optional steps 806-812. In optional step 806, the method 800 may proceed with receiving an indication of old connections associated with an old node of the plurality of the nodes of the data network. In step 808, a redirection node can be created in data network. In step 810, a redirection policy is generated. The redirection policy indicates that service requests associated with old connections should be sent to the further node via the redirection node to the old node. In step 812, the redirection policy is sent to the redirection node and to the further node.

In step 814, the further node policy is sent to all nodes in the plurality of nodes of the data network and to the further node.

The method 800 may further proceed with optional steps 816-820. In step 816, the method 800 continues with waiting for an indication that all old connections are finished. In step 818, the redirection policy is removed from the further node. In step 820, the redirection node is removed from the data network.

FIG. 9 shows a diagrammatic representation of a machine in the example electronic form of a computer system 900, within which a set of instructions for causing the machine to perform any one or more of the methodologies discussed herein may be executed. In various example embodiments, the machine operates as a standalone device or may be connected (e.g., networked) to other machines. In a networked deployment, the machine may operate in the capacity of a server or a client machine in a server-client network environment, or as a peer machine in a peer-to-peer (or distributed) network environment. The machine may be a personal computer (PC), a tablet PC, a set-top box (STB), a cellular telephone, a portable music player (e.g., a portable hard drive audio device such as an Moving Picture Experts Group Audio Layer 3 (MP3) player), a web appliance, a network router, switch or bridge, or any machine capable of executing a set of instructions (sequential or otherwise) that specify actions to be taken by that machine. Further, while only a single machine is illustrated, the term “machine” shall also be taken to include any collection of machines that individually or jointly execute a set (or multiple sets) of instructions to perform any one or more of the methodologies discussed herein.

The example computer system 900 includes a processor or multiple processors 902 (e.g., a central processing unit (CPU), a graphics processing unit (GPU), or both), a main memory 904, and a static memory 906, which communicate with each other via a bus 908. The computer system 900 may further include a video display unit 910 (e.g., a liquid crystal display (LCD) or a cathode ray tube (CRT)). The computer system 900 may also include an alphanumeric input device 912 (e.g., a keyboard), a cursor control device 914 (e.g., a mouse), a disk drive unit 916, a signal generation device 918 (e.g., a speaker), and a network interface device 920.

The disk drive unit 916 includes a non-transitory computer-readable medium 922, on which is stored one or more sets of instructions and data structures (e.g., instructions 924) embodying or utilized by any one or more of the methodologies or functions described herein. The instructions 924 may also reside, completely or at least partially, within the main memory 904 and/or within the processors 902 during execution thereof by the computer system 900. The main memory 904 and the processors 902 may also constitute machine-readable media.

The instructions 924 may further be transmitted or received over a network 926 via the network interface device 920 utilizing any one of a number of well-known transfer protocols (e.g., HTTP).

While the computer-readable medium 922 is shown in an example embodiment to be a single medium, the term “computer-readable medium” should be taken to include a single medium or multiple media (e.g., a centralized or distributed database and/or associated caches and servers) that store the one or more sets of instructions. The term “computer-readable medium” shall also be taken to include any medium that is capable of storing, encoding, or carrying a set of instructions for execution by the machine and that causes the machine to perform any one or more of the methodologies of the present application, or that is capable of storing, encoding, or carrying data structures utilized by or associated with such a set of instructions. The term “computer-readable medium” shall accordingly be taken to include, but not be limited to, solid-state memories, optical and magnetic media, and carrier wave signals. Such media may also include, without limitation, hard disks, floppy disks, flash memory cards, digital video disks (DVDs), random access memory (RAM), read only memory (ROM), and the like.

The example embodiments described herein can be implemented in an operating environment comprising computer-executable instructions (e.g., software) installed on a computer, in hardware, or in a combination of software and hardware. The computer-executable instructions can be written in a computer programming language or can be embodied in firmware logic. If written in a programming language conforming to a recognized standard, such instructions can be executed on a variety of hardware platforms and for interfaces to a variety of operating systems. Although not limited thereto, computer software programs for implementing the present method can be written in any number of suitable programming languages such as, for example, Hypertext Markup Language (HTML), Dynamic HTML, Extensible Markup Language (XML), Extensible Stylesheet Language (XSL), Document Style Semantics and Specification Language (DSSSL), Cascading Style Sheets (CSS), Synchronized Multimedia Integration Language (SMIL), Wireless Markup Language (WML), Java™, Jini™, C, C++, Perl, UNIX Shell, Visual Basic or Visual Basic Script, Virtual Reality Markup Language (VRML), ColdFusion™ or other compilers, assemblers, interpreters or other computer languages or platforms.

Thus, methods and systems for graceful scaling nodes in an SDN are disclosed. Although embodiments have been described with reference to specific example embodiments, it will be evident that various modifications and changes can be made to these example embodiments without departing from the broader spirit and scope of the present application. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.

Claims

1. A computer-implemented method for graceful scaling of a data network, the method comprising: receiving, by a cluster master, a first indication of a removal of a first node from a plurality of nodes of the data network;receiving, by the cluster master, a second indication of a presence of at least one old connection associated with the first node, the at least one old connection including at least one unfinished connection currently served by the first node;in response to receiving the first indication and the second indication, generating, by the cluster master, a service policy, the service policy being operable to reassign service requests associated with the first node to a second node selected from the plurality of nodes of the data network, the service policy indicating that one or more of the service requests associated with the at least one old connection are to be redirected by the second node to the first node, the service requests including requests sent by one or more clients for services provided by one or more of the plurality of nodes;sending, by the cluster master, the service policy to one or more of the plurality of nodes of the data network;receiving, by the cluster master, from the first node, an indication of a completion of the at least one old connection associated with the first node; andremoving the first node from the data network upon the completion of the at least one old connection associated with the first node.
2. The method of claim 1, wherein the plurality of nodes of the data network include a virtual machine and a physical device.
3. The method of claim 1, wherein the plurality of nodes include a plurality of cluster devices, the cluster master, a traffic classification engine, service nodes, and an orchestrator.
4. The method of claim 3, wherein the traffic classification engine is configured to distribute the service requests processed by the data network to the service nodes in accordance with the service policy.
5. The method of claim 1, wherein the first indication is generated based on one or more of the following: first node health, a first node service policy, analysis of a first node data, and absence of the first node data.
6. The method of claim 5, wherein the first node data includes a number of total connections, destination server address, destination server capacity, and destination server network connectivity.
7. The method of claim 1, wherein the service policy includes a traffic map table, wherein classes of the service requests are assigned to nodes in the plurality of nodes of the data network.
8. The method of claim 7, wherein the classes of the services requests are obtained by classification of the service request based on one of the following: a source address of the service request and a destination address of the service request.
9. The method of claim 1, further comprising, before sending the service policy to each of the plurality of nodes of the data network: creating, by the cluster master, a redirection node in the data network;generating, by the cluster master, a redirection policy, the redirection policy indicating that service requests associated with the old connections are to be sent from the second node to the first node via the redirection node; andsending, by the cluster master, the redirection policy to the redirection node and the second node; andafter sending the service policy to each of the plurality of nodes of the data network:waiting for an indication of finishing the old connections and upon receiving the indication of finishing the old connections: removing, by the cluster master, the redirection policy from the second node; andremoving, by the cluster master, the redirection node from the data network.
10. The method of claim 1, further comprising: receiving, by the cluster master, an indication of presence of a further node in the data network;generating, by the cluster master, a further node service policy, the further node service policy reassigning one or more of the service requests associated with any of the plurality of nodes of the data network to the further node; andsending, by the cluster master, the further node service policy to each of the plurality of nodes of the data network and to the further node.
11. The method of claim 10, further comprising before sending the further node service policy to each of the plurality of nodes of the data network: receiving, by the cluster master, an indication of presence of old connections, the old connections being associated with an old node from the plurality of nodes of the data network;creating, by the cluster master, a redirection node in the data network;generating, by the cluster master, a redirection policy, the redirection policy indicating that service requests associated with the old connections be sent from the further node via the redirection node to the old node; andsending, by the cluster master, the redirection policy to the redirection node and the further node; andafter sending the further node service policy to each of the plurality of nodes of the data network and to the further node:waiting for an indication of finishing the old connections and upon receiving the indication of finishing the old connections: removing, by the cluster master, the redirection policy from the further node; andremoving, by the cluster master, the redirection node from the data network.
12. A system for graceful scaling a data network, the system comprising: a plurality of nodes; anda cluster master, the cluster master operable to: receive a first indication of removing a first node from the plurality of nodes of the data network;receive a second indication of a presence of at least one old connection associated with the first node, the at least one old connection including at least one unfinished connection currently served by the first node;in response to the first indication and the second indication, generate a service policy, the service policy reassigning any service requests associated with the first node to a second node from the plurality of nodes of the data network, the service policy indicating that one or more of the service requests associated with the at least one old connection are to be redirected by the second node to the first node, the service requests including requests sent by one or more clients for services provided by one or more of the plurality of nodes;send the service policy to each of the plurality of nodes of the data network;receive from the first node, an indication of a completion of the at least one old connection associated with the first node; andremove the first node from the data network upon the completion of the at least one old connection associated with the first node.
13. The system of claim 12, wherein the plurality of nodes of the data network include a virtual machine and a physical device.
14. The system of claim 12, wherein the plurality of nodes include a plurality of cluster devices, a traffic classification engine, service nodes, and an orchestrator.
15. The system of claim 12, wherein the first indication is generated based on one or more of the following: a first node health, a first node service policy, an analysis of the first node data, and an absence of the first node data.
16. The system of claim 15, wherein the first node data includes a number of total connections, destination server address, destination server capacity, and destination server network connectivity.
17. The system of claim 12, wherein the service policy is a traffic map table, wherein classes of the services requests are assigned to nodes in the plurality of nodes of the data network.
18. The system of claim 17, wherein the classes of the service requests are obtained by classification of the service requests based on one of the following: a source address of the service request and a destination address of the service request.
19. The system of claim 12, wherein before sending the service policy to each of the plurality of nodes of the data network, the cluster master is further operable to: receive an indication of presence of old connections, the old connections being associated with the first node;generate an instruction to create a redirection node in the data network;generate a redirection policy, the redirection policy indicating that service requests associated with the old connections be sent from the second node to the first node via the redirection node; andsend the redirection policy to the redirection node and the second node; andafter sending the service policy to each of the plurality of nodes of the data network, the cluster master is further operable to:wait for an indication of finishing the old connections and upon receiving the indication of finishing the old connections: remove the redirection policy from the second node; andgenerate an instruction to remove the redirection node from the data network.
20. The system of claim 12, wherein the cluster master is further operable to: receive an indication of presence of a further node in the data network;generate a further node service policy, the further node service policy reassigning one or more of the service requests associated with any of the plurality of nodes of the data network to the further node; andsend the further node service policy to each of the plurality of nodes of the data network and to the further node.
21. The system of claim 20, wherein before sending the further node service policy to each of the plurality of nodes of the data network, the cluster master is further operable to: receive an indication of presence of old connections, the old connections being associated with an old node from the plurality of nodes of the data network;generate an instruction to create a redirection node in the data network;generate a redirection policy, the redirection policy indicating that service requests associated with the old connections be sent from the further node via the redirection node to the old node; andsend the redirection policy to the redirection node and the further node; andafter sending the further node service policy to each of the plurality of nodes of the data network and to the further node, the cluster master is further operable to:wait for an indication of finishing the old connections and upon receiving the indication of finishing the old connections: remove the redirection policy from the further node; andgenerate an instruction to remove the redirection node from the data network.
22. A non-transitory computer-readable storage medium having embodied thereon a program, the program being executable by a processor to perform a method for graceful scaling of a data network, the method comprising: receiving, by a cluster master, a first indication of removing a first node from a plurality of nodes of the data network;receiving, by the cluster master, a second indication of a presence of old connections, the old connections being associated with the first node, the at least one old connection including at least one unfinished connection currently served by the first node;in response to receiving the first indication and the second indication, generating, by the cluster master, a service policy, the service policy reassigning any service requests associated with the first node to a second node from the plurality of nodes of the data network, the service policy indicating that one or more of the service requests associated with the at least one old connection are to be redirected by the second node to the first node, the service requests including requests sent by one or more clients for services provided by one or more of the plurality of nodes;sending the service policy to each of the plurality of nodes of the data network;receiving, by the cluster master, from the first node, an indication of a completion of the at least one old connection associated with the first node; andremoving the first node from the data network upon the completion of the at least one old connection associated with the first node.
23. The non-transitory computer-readable storage medium of claim 22, wherein the method for graceful scaling of the data network further comprises before sending the service policy to each of the plurality of nodes of the data network: creating, by the cluster master, a redirection node in the data network;generating, by the cluster master, a redirection policy, the redirection policy indicating that service requests associated with the old connections be sent from the second node to the first node via the redirection node; andsending, by the cluster master, the redirection policy to the redirection node and the second node; andafter sending the service policy to each of the plurality of nodes of the data network:waiting for an indication of finishing the old connections and upon receiving the indication of finishing the old connections: removing, by the cluster master, the redirection policy from the second node; andremoving, by the cluster master, the redirection node from the data network.
24. The non-transitory computer-readable storage medium of claim 22, wherein the method for graceful scaling of the data network further comprises: receiving, by the cluster master, an indication of presence of a further node in the data network;generating, by the cluster master, a further node service policy, the further node service policy reassigning one or more of the service requests associated with any of the plurality of nodes of the data network to the further node; andsending, by the cluster master, the further node service policy to each of the plurality of nodes of the data network and to the further node.
25. The non-transitory computer-readable storage medium of claim 24, wherein the method for graceful scaling of the data network further comprises before sending the further node service policy to each of the plurality of nodes of the data network: receiving, by a cluster master, an indication of presence of old connections, the old connections being associated with an old node from the plurality of nodes of the data network;creating, by the cluster master, a redirection node in the data network;generating, by the cluster master, a redirection policy, the redirection policy indicating that service requests associated with the old connections be sent from the further node via the redirection node to the old node; andsending, by the cluster master, the redirection policy to the redirection node and the further node; andafter sending the further node service policy to each of the plurality of nodes of the data network and to the further node:waiting for an indication of finishing the old connections and upon receiving the indication of finishing the old connections: removing, by the cluster master, the redirection policy from the further node; andremoving, by the cluster master, the redirection node from the data network.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation of U.S. patent application Ser. No. 14/326,325, titled “Graceful Scaling in Software Driven Networks,” filed Jul. 8, 2014, which is a Continuation-in-Part of U.S. patent application Ser. No. 14/029,656, titled “Load Distribution in Data Networks,” filed Sep. 17, 2013, now U.S. Pat. No. 9,705,800 which claims the priority benefit of U.S. provisional patent application No. 61/705,618, filed Sep. 25, 2012. The disclosures of the above referenced patent applications are incorporated herein by reference for all purposes.

US Referenced Citations (393)

Number	Name	Date	Kind
5218602	Grant et al.	Jun 1993	A
5774660	Brendel et al.	Jun 1998	A
5935207	Logue et al.	Aug 1999	A
5958053	Denker	Sep 1999	A
5995981	Wikstrom	Nov 1999	A
6003069	Cavill	Dec 1999	A
6047268	Bartoli et al.	Apr 2000	A
6131163	Wiegel	Oct 2000	A
6219706	Fan et al.	Apr 2001	B1
6259705	Takahashi et al.	Jul 2001	B1
6321338	Porras et al.	Nov 2001	B1
6374300	Masters	Apr 2002	B2
6459682	Ellesson et al.	Oct 2002	B1
6587866	Modi et al.	Jul 2003	B1
6748414	Boumas	Jun 2004	B1
6772334	Glawitsch	Aug 2004	B1
6779017	Lamberton et al.	Aug 2004	B1
6779033	Watson et al.	Aug 2004	B1
6952728	Alles et al.	Oct 2005	B1
7010605	Dharmarajan	Mar 2006	B1
7013482	Krumel	Mar 2006	B1
7058718	Fontes et al.	Jun 2006	B2
7069438	Balabine et al.	Jun 2006	B2
7076555	Orman	Jul 2006	B1
7143087	Fairweather	Nov 2006	B2
7167927	Philbrick et al.	Jan 2007	B2
7181524	Lele	Feb 2007	B1
7218722	Turner et al.	May 2007	B1
7228359	Monteiro	Jun 2007	B1
7234161	Maufer et al.	Jun 2007	B1
7236457	Joe	Jun 2007	B2
7254133	Govindarajan et al.	Aug 2007	B2
7269850	Govindarajan et al.	Sep 2007	B2
7277963	Dolson et al.	Oct 2007	B2
7301899	Goldstone	Nov 2007	B2
7308499	Chavez	Dec 2007	B2
7310686	Uysal	Dec 2007	B2
7328267	Bashyam et al.	Feb 2008	B1
7334232	Jacobs et al.	Feb 2008	B2
7337241	Boucher et al.	Feb 2008	B2
7343399	Hayball et al.	Mar 2008	B2
7349970	Clement et al.	Mar 2008	B2
7370353	Yang	May 2008	B2
7391725	Huitema et al.	Jun 2008	B2
7398317	Chen et al.	Jul 2008	B2
7423977	Joshi	Sep 2008	B1
7430755	Hughes et al.	Sep 2008	B1
7463648	Eppstein et al.	Dec 2008	B1
7467202	Savchuk	Dec 2008	B2
7472190	Robinson	Dec 2008	B2
7492766	Cabeca et al.	Feb 2009	B2
7506360	Wilkinson et al.	Mar 2009	B1
7509369	Tormasov	Mar 2009	B1
7512980	Copeland et al.	Mar 2009	B2
7533409	Keane et al.	May 2009	B2
7552323	Shay	Jun 2009	B2
7584262	Wang et al.	Sep 2009	B1
7584301	Joshi	Sep 2009	B1
7590736	Hydrie et al.	Sep 2009	B2
7613193	Swami et al.	Nov 2009	B2
7613822	Joy et al.	Nov 2009	B2
7673072	Boucher et al.	Mar 2010	B2
7675854	Chen et al.	Mar 2010	B2
7703102	Eppstein et al.	Apr 2010	B1
7707295	Szeto et al.	Apr 2010	B1
7711790	Barrett et al.	May 2010	B1
7739395	Parlamas et al.	Jun 2010	B1
7747748	Allen	Jun 2010	B2
7751409	Carolan	Jul 2010	B1
7765328	Bryers et al.	Jul 2010	B2
7792113	Foschiano et al.	Sep 2010	B1
7808994	Vinokour et al.	Oct 2010	B1
7826487	Mukerji et al.	Nov 2010	B1
7881215	Daigle et al.	Feb 2011	B1
7948952	Hurtta et al.	May 2011	B2
7970934	Patel	Jun 2011	B1
7979585	Chen et al.	Jul 2011	B2
7983258	Ruben et al.	Jul 2011	B1
7990847	Leroy et al.	Aug 2011	B1
7991859	Miller et al.	Aug 2011	B1
8019870	Eppstein et al.	Sep 2011	B1
8032634	Eppstein et al.	Oct 2011	B1
8090866	Bashyam et al.	Jan 2012	B1
8122116	Matsunaga et al.	Feb 2012	B2
8179809	Eppstein et al.	May 2012	B1
8185651	Moran et al.	May 2012	B2
8191106	Choyi et al.	May 2012	B2
8224971	Miller et al.	Jul 2012	B1
8266235	Jalan et al.	Sep 2012	B2
8296434	Miller	Oct 2012	B1
8312507	Chen et al.	Nov 2012	B2
8379515	Mukerji	Feb 2013	B1
8499093	Grosser et al.	Jul 2013	B2
8539075	Bali et al.	Sep 2013	B2
8554929	Szeto et al.	Oct 2013	B1
8560693	Wang et al.	Oct 2013	B1
8584199	Chen et al.	Nov 2013	B1
8595791	Chen et al.	Nov 2013	B1
RE44701	Chen et al.	Jan 2014	E
8675488	Sidebottom et al.	Mar 2014	B1
8681610	Mukerji	Mar 2014	B1
8745221	Willbanks	Jun 2014	B1
8750164	Casado et al.	Jun 2014	B2
8782221	Han	Jul 2014	B2
8813180	Chen et al.	Aug 2014	B1
8826372	Chen et al.	Sep 2014	B1
8879427	Krumel	Nov 2014	B2
8885463	Medved et al.	Nov 2014	B1
8897154	Jalan et al.	Nov 2014	B2
8918673	Rangaiah	Dec 2014	B1
8965957	Barros	Feb 2015	B2
8977749	Han	Mar 2015	B1
8990262	Chen et al.	Mar 2015	B2
9094364	Jalan et al.	Jul 2015	B2
9106561	Jalan et al.	Aug 2015	B2
9154577	Jalan et al.	Oct 2015	B2
9154584	Han	Oct 2015	B1
9215275	Kannan et al.	Dec 2015	B2
9219751	Chen et al.	Dec 2015	B1
9253152	Chen et al.	Feb 2016	B1
9270705	Chen et al.	Feb 2016	B1
9270774	Jalan et al.	Feb 2016	B2
9338225	Jalan et al.	May 2016	B2
9350744	Chen et al.	May 2016	B2
9356910	Chen et al.	May 2016	B2
9386088	Zheng et al.	Jul 2016	B2
9497201	Chen et al.	Nov 2016	B2
9531846	Han et al.	Dec 2016	B2
9544364	Jalan et al.	Jan 2017	B2
9602442	Han	Mar 2017	B2
9609052	Jalan et al.	Mar 2017	B2
9705800	Sankar et al.	Jul 2017	B2
9843484	Sankar et al.	Dec 2017	B2
9900252	Chiang	Feb 2018	B2
9906422	Jalan et al.	Feb 2018	B2
9906591	Jalan et al.	Feb 2018	B2
20010049741	Skene et al.	Dec 2001	A1
20020032777	Kawata et al.	Mar 2002	A1
20020078164	Reinschmidt	Jun 2002	A1
20020091844	Craft et al.	Jul 2002	A1
20020103916	Chen et al.	Aug 2002	A1
20020133491	Sim et al.	Sep 2002	A1
20020138618	Szabo	Sep 2002	A1
20020143991	Chow et al.	Oct 2002	A1
20020178259	Doyle et al.	Nov 2002	A1
20020191575	Kalavade et al.	Dec 2002	A1
20020194335	Maynard	Dec 2002	A1
20020194350	Lu et al.	Dec 2002	A1
20030009591	Hayball et al.	Jan 2003	A1
20030014544	Pettey	Jan 2003	A1
20030023711	Parmar et al.	Jan 2003	A1
20030023873	Ben-Itzhak	Jan 2003	A1
20030035409	Wang et al.	Feb 2003	A1
20030035420	Niu	Feb 2003	A1
20030065762	Stolorz et al.	Apr 2003	A1
20030091028	Chang et al.	May 2003	A1
20030131245	Linderman	Jul 2003	A1
20030135625	Fontes et al.	Jul 2003	A1
20030195962	Kikuchi et al.	Oct 2003	A1
20040062246	Boucher et al.	Apr 2004	A1
20040073703	Boucher et al.	Apr 2004	A1
20040078419	Ferrari et al.	Apr 2004	A1
20040078480	Boucher et al.	Apr 2004	A1
20040111516	Cain	Jun 2004	A1
20040128312	Shalabi et al.	Jul 2004	A1
20040139057	Hirata et al.	Jul 2004	A1
20040139108	Tang et al.	Jul 2004	A1
20040141005	Banatwala et al.	Jul 2004	A1
20040143599	Shalabi et al.	Jul 2004	A1
20040187032	Gels et al.	Sep 2004	A1
20040199616	Karhu	Oct 2004	A1
20040199646	Susai et al.	Oct 2004	A1
20040202182	Lund et al.	Oct 2004	A1
20040210623	Hydrie et al.	Oct 2004	A1
20040210663	Phillips et al.	Oct 2004	A1
20040213158	Collett et al.	Oct 2004	A1
20040268358	Darling et al.	Dec 2004	A1
20050005207	Herneque	Jan 2005	A1
20050009520	Herrero et al.	Jan 2005	A1
20050021848	Jorgenson	Jan 2005	A1
20050027862	Nguyen et al.	Feb 2005	A1
20050036501	Chung et al.	Feb 2005	A1
20050036511	Baratakke et al.	Feb 2005	A1
20050044270	Grove et al.	Feb 2005	A1
20050074013	Hershey et al.	Apr 2005	A1
20050080890	Yang et al.	Apr 2005	A1
20050102400	Nakahara et al.	May 2005	A1
20050108593	Purushothaman	May 2005	A1
20050125276	Rusu	Jun 2005	A1
20050163073	Heller et al.	Jul 2005	A1
20050198335	Brown et al.	Sep 2005	A1
20050213586	Cyganski et al.	Sep 2005	A1
20050240989	Kim et al.	Oct 2005	A1
20050249225	Singhal	Nov 2005	A1
20050259586	Hafid et al.	Nov 2005	A1
20050289231	Harada et al.	Dec 2005	A1
20060023721	Miyake et al.	Feb 2006	A1
20060036610	Wang	Feb 2006	A1
20060036733	Fujimoto et al.	Feb 2006	A1
20060064478	Sirkin	Mar 2006	A1
20060069774	Chen et al.	Mar 2006	A1
20060069804	Miyake et al.	Mar 2006	A1
20060077926	Rune	Apr 2006	A1
20060092950	Arregoces et al.	May 2006	A1
20060098645	Walkin	May 2006	A1
20060112170	Sirkin	May 2006	A1
20060168319	Trossen	Jul 2006	A1
20060187901	Cortes et al.	Aug 2006	A1
20060190997	Mahajani et al.	Aug 2006	A1
20060209789	Gupta et al.	Sep 2006	A1
20060212453	Eshel	Sep 2006	A1
20060230129	Swami et al.	Oct 2006	A1
20060233100	Luft et al.	Oct 2006	A1
20060251057	Kwon et al.	Nov 2006	A1
20060277303	Hegde et al.	Dec 2006	A1
20060280121	Matoba	Dec 2006	A1
20070019543	Wei et al.	Jan 2007	A1
20070086382	Narayanan et al.	Apr 2007	A1
20070094396	Takano et al.	Apr 2007	A1
20070118881	Mitchell et al.	May 2007	A1
20070156919	Potti et al.	Jul 2007	A1
20070165622	O'Rourke et al.	Jul 2007	A1
20070174660	Peddada	Jul 2007	A1
20070185998	Touitou et al.	Aug 2007	A1
20070230337	Igarashi et al.	Oct 2007	A1
20070245090	King et al.	Oct 2007	A1
20070259673	Willars et al.	Nov 2007	A1
20070283429	Chen et al.	Dec 2007	A1
20070286077	Wu	Dec 2007	A1
20070288247	Mackay	Dec 2007	A1
20070294209	Strub et al.	Dec 2007	A1
20080031263	Ervin et al.	Feb 2008	A1
20080101396	Miyata	May 2008	A1
20080109452	Patterson	May 2008	A1
20080109870	Sherlock et al.	May 2008	A1
20080134332	Keohane et al.	Jun 2008	A1
20080162679	Maher et al.	Jul 2008	A1
20080228781	Chen et al.	Sep 2008	A1
20080250099	Shen et al.	Oct 2008	A1
20080263209	Pisharody et al.	Oct 2008	A1
20080271130	Ramamoorthy	Oct 2008	A1
20080282254	Blander et al.	Nov 2008	A1
20080291911	Lee et al.	Nov 2008	A1
20090049198	Blinn et al.	Feb 2009	A1
20090070470	Bauman et al.	Mar 2009	A1
20090077651	Poeluev	Mar 2009	A1
20090092124	Singhal et al.	Apr 2009	A1
20090106830	Maher	Apr 2009	A1
20090138606	Moran et al.	May 2009	A1
20090138945	Savchuk	May 2009	A1
20090141634	Rothstein et al.	Jun 2009	A1
20090164614	Christian et al.	Jun 2009	A1
20090172093	Matsubara	Jul 2009	A1
20090213858	Dolganow et al.	Aug 2009	A1
20090222583	Josefsberg et al.	Sep 2009	A1
20090227228	Hu et al.	Sep 2009	A1
20090228547	Miyaoka et al.	Sep 2009	A1
20090262741	Jungck et al.	Oct 2009	A1
20090271472	Scheifler et al.	Oct 2009	A1
20090313379	Rydnell et al.	Dec 2009	A1
20100004004	Browne-Swinburne et al.	Jan 2010	A1
20100008229	Bi et al.	Jan 2010	A1
20100023621	Ezolt et al.	Jan 2010	A1
20100036952	Hazlewood et al.	Feb 2010	A1
20100054139	Chun et al.	Mar 2010	A1
20100061319	Aso et al.	Mar 2010	A1
20100064008	Yan et al.	Mar 2010	A1
20100082787	Kommula et al.	Apr 2010	A1
20100083076	Ushiyama	Apr 2010	A1
20100094985	Abu-Samaha et al.	Apr 2010	A1
20100098417	Tse-Au	Apr 2010	A1
20100106833	Banerjee et al.	Apr 2010	A1
20100106854	Kim et al.	Apr 2010	A1
20100128606	Patel et al.	May 2010	A1
20100162378	Jayawardena et al.	Jun 2010	A1
20100210265	Borzsei et al.	Aug 2010	A1
20100211669	Dalgas et al.	Aug 2010	A1
20100217793	Preiss	Aug 2010	A1
20100223378	Wei	Sep 2010	A1
20100223630	Degenkolb et al.	Sep 2010	A1
20100228819	Wei	Sep 2010	A1
20100228878	Xu et al.	Sep 2010	A1
20100235507	Szeto et al.	Sep 2010	A1
20100235522	Chen et al.	Sep 2010	A1
20100238828	Russell	Sep 2010	A1
20100265824	Chao et al.	Oct 2010	A1
20100268814	Cross et al.	Oct 2010	A1
20100293296	Hsu et al.	Nov 2010	A1
20100312740	Clemm et al.	Dec 2010	A1
20100318631	Shukla	Dec 2010	A1
20100322252	Suganthi et al.	Dec 2010	A1
20100330971	Selitser et al.	Dec 2010	A1
20100332617	Goodwin	Dec 2010	A1
20100333101	Pope et al.	Dec 2010	A1
20110007652	Bai	Jan 2011	A1
20110019550	Bryers et al.	Jan 2011	A1
20110022882	Jaehde	Jan 2011	A1
20110023071	Li et al.	Jan 2011	A1
20110029599	Pulleyn et al.	Feb 2011	A1
20110032941	Quach et al.	Feb 2011	A1
20110040826	Chadzelek et al.	Feb 2011	A1
20110047294	Singh et al.	Feb 2011	A1
20110060831	Ishii et al.	Mar 2011	A1
20110060840	Susai et al.	Mar 2011	A1
20110093522	Chen et al.	Apr 2011	A1
20110099403	Miyata et al.	Apr 2011	A1
20110110294	Valluri et al.	May 2011	A1
20110145324	Reinart et al.	Jun 2011	A1
20110153834	Bharrat	Jun 2011	A1
20110178985	San Martin Arribas et al.	Jul 2011	A1
20110185073	Jagadeeswaran et al.	Jul 2011	A1
20110191773	Pavel et al.	Aug 2011	A1
20110196971	Reguraman et al.	Aug 2011	A1
20110226810	Wang	Sep 2011	A1
20110276695	Maldaner	Nov 2011	A1
20110276982	Nakayama et al.	Nov 2011	A1
20110289496	Steer	Nov 2011	A1
20110292939	Subramaian et al.	Dec 2011	A1
20110302256	Sureshehandra et al.	Dec 2011	A1
20110307541	Walsh et al.	Dec 2011	A1
20120008495	Shen et al.	Jan 2012	A1
20120023231	Ueno	Jan 2012	A1
20120026897	Guichard et al.	Feb 2012	A1
20120030341	Jensen et al.	Feb 2012	A1
20120066371	Patel et al.	Mar 2012	A1
20120084460	McGinnity et al.	Apr 2012	A1
20120101998	Cahill	Apr 2012	A1
20120106355	Ludwig	May 2012	A1
20120117571	Davis et al.	May 2012	A1
20120144014	Natham et al.	Jun 2012	A1
20120151353	Joanny	Jun 2012	A1
20120170548	Rajagopalan et al.	Jul 2012	A1
20120173759	Agarwal	Jul 2012	A1
20120191839	Maynard	Jul 2012	A1
20120224486	Battestilli	Sep 2012	A1
20120239792	Banerjee et al.	Sep 2012	A1
20120240185	Kapoor et al.	Sep 2012	A1
20120290727	Tivig	Nov 2012	A1
20120297046	Raja et al.	Nov 2012	A1
20130007505	Spear	Jan 2013	A1
20130046876	Narayana et al.	Feb 2013	A1
20130054806	Francis	Feb 2013	A1
20130054813	Bercovici	Feb 2013	A1
20130054817	Moen	Feb 2013	A1
20130058335	Koponen et al.	Mar 2013	A1
20130074177	Varadhan et al.	Mar 2013	A1
20130083725	Mallya et al.	Apr 2013	A1
20130103817	Koponen et al.	Apr 2013	A1
20130124713	Feinberg et al.	May 2013	A1
20130148500	Sonoda et al.	Jun 2013	A1
20130159487	Patel	Jun 2013	A1
20130173795	McPherson	Jul 2013	A1
20130176854	Chisu et al.	Jul 2013	A1
20130182558	Orten	Jul 2013	A1
20130191486	Someya et al.	Jul 2013	A1
20130198385	Han et al.	Aug 2013	A1
20130250765	Ehsan et al.	Sep 2013	A1
20130250770	Zou et al.	Sep 2013	A1
20130258846	Damola	Oct 2013	A1
20130268646	Doron	Oct 2013	A1
20130282791	Kruglick	Oct 2013	A1
20130336159	Previdi et al.	Dec 2013	A1
20140029423	Badhani	Jan 2014	A1
20140226658	Kakadia et al.	Aug 2014	A1
20140235249	Jeong et al.	Aug 2014	A1
20140248914	Aoyagi et al.	Sep 2014	A1
20140258465	Li	Sep 2014	A1
20140269728	Jalan et al.	Sep 2014	A1
20140286313	Fu et al.	Sep 2014	A1
20140298091	Carlen et al.	Oct 2014	A1
20140325649	Zhang	Oct 2014	A1
20140330982	Jalan et al.	Nov 2014	A1
20140334485	Jain et al.	Nov 2014	A1
20140359052	Joachimpillai et al.	Dec 2014	A1
20150003252	Badhani	Jan 2015	A1
20150098333	Lin et al.	Apr 2015	A1
20150156223	Xu et al.	Jun 2015	A1
20150215436	Kancherla	Jul 2015	A1
20150237173	Virkki et al.	Aug 2015	A1
20150244781	Agarwal	Aug 2015	A1
20150281087	Jalan et al.	Oct 2015	A1
20150281104	Golshan et al.	Oct 2015	A1
20150296058	Jalan et al.	Oct 2015	A1
20150312268	Ray	Oct 2015	A1
20150350048	Sampat et al.	Dec 2015	A1
20150350379	Jalan et al.	Dec 2015	A1
20160006836	LeFaucheur	Jan 2016	A1
20160042014	Jalan et al.	Feb 2016	A1
20160044095	Sankar et al.	Feb 2016	A1
20160088074	Kannan et al.	Mar 2016	A1
20160094470	Skog	Mar 2016	A1
20160139910	Ramanathan et al.	May 2016	A1
20160164792	Oran	Jun 2016	A1

Foreign Referenced Citations (115)

Number	Date	Country
1372662	Oct 2002	CN
1449618	Oct 2003	CN
1473300	Feb 2004	CN
1529460	Sep 2004	CN
1575582	Feb 2005	CN
1714545	Dec 2005	CN
1725702	Jan 2006	CN
1910869	Feb 2007	CN
101004740	Jul 2007	CN
101094225	Dec 2007	CN
101163336	Apr 2008	CN
101169785	Apr 2008	CN
101189598	May 2008	CN
101193089	Jun 2008	CN
101247349	Aug 2008	CN
101261644	Sep 2008	CN
101495993	Jul 2009	CN
101878663	Nov 2010	CN
102143075	Aug 2011	CN
102546590	Jul 2012	CN
102571742	Jul 2012	CN
102577252	Jul 2012	CN
102918801	Feb 2013	CN
103533018	Jan 2014	CN
103944954	Jul 2014	CN
104040990	Sep 2014	CN
104067569	Sep 2014	CN
104106241	Oct 2014	CN
104137491	Nov 2014	CN
104796396	Jul 2015	CN
102577252	Mar 2016	CN
102918801	May 2016	CN
102571742	Jul 2016	CN
104067569	Feb 2017	CN
1209876	May 2002	EP
1770915	Apr 2007	EP
1885096	Feb 2008	EP
2296313	Mar 2011	EP
2577910	Apr 2013	EP
2622795	Aug 2013	EP
2647174	Oct 2013	EP
2760170	Jul 2014	EP
2772026	Sep 2014	EP
2901308	Aug 2015	EP
2760170	Dec 2015	EP
2772026	Feb 2017	EP
1182560	Nov 2013	HK
1183569	Dec 2013	HK
1183996	Jan 2014	HK
1189438	Jan 2014	HK
1198565	May 2015	HK
1198848	Jun 2015	HK
1199153	Jun 2015	HK
1199779	Jul 2015	HK
1200617	Aug 2015	HK
3764CHN2014	Sep 2015	IN
261CHE2014	Jan 2016	IN
1668CHENP2015	Jul 2016	IN
H0997233	Apr 1997	JP
H1196128	Apr 1999	JP
H11338836	Dec 1999	JP
2000276432	Oct 2000	JP
2000307634	Nov 2000	JP
2001051859	Feb 2001	JP
2001298449	Oct 2001	JP
2002091936	Mar 2002	JP
2003141068	May 2003	JP
2003186776	Jul 2003	JP
2005141441	Jun 2005	JP
2006332825	Dec 2006	JP
2008040718	Feb 2008	JP
2009500731	Jan 2009	JP
2013528330	Jul 2013	JP
2014504484	Feb 2014	JP
2014143686	Aug 2014	JP
2015507380	Mar 2015	JP
5855663	Feb 2016	JP
5906263	Apr 2016	JP
5913609	Apr 2016	JP
5946189	Jul 2016	JP
5963766	Aug 2016	JP
20080008340	Jan 2008	KR
100830413	May 2008	KR
20130096624	Aug 2013	KR
101576585	Dec 2015	KR
101632187	Jun 2016	KR
101692751	Jan 2017	KR
WO2001013228	Feb 2001	WO
WO2001014990	Mar 2001	WO
WO2001045349	Jun 2001	WO
WO2003103237	Dec 2003	WO
WO2004084085	Sep 2004	WO
WO2006098033	Sep 2006	WO
WO2008053954	May 2008	WO
WO2008078593	Jul 2008	WO
WO2011049770	Apr 2011	WO
WO2011079381	Jul 2011	WO
WO2011149796	Dec 2011	WO
WO2012050747	Apr 2012	WO
WO2012075237	Jun 2012	WO
WO2012083264	Jun 2012	WO
WO2012097015	Jul 2012	WO
WO2013070391	May 2013	WO
WO2013081952	Jun 2013	WO
WO2013096019	Jun 2013	WO
WO2013112492	Aug 2013	WO
WO2013189024	Dec 2013	WO
WO2014031046	Feb 2014	WO
WO2014052099	Apr 2014	WO
WO2014088741	Jun 2014	WO
WO2014093829	Jun 2014	WO
WO2014138483	Sep 2014	WO
WO2014144837	Sep 2014	WO
WO2014179753	Nov 2014	WO
WO2015153020	Oct 2015	WO

Non-Patent Literature Citations (13)

Entry
Abe, et al., “Adaptive Split Connection Schemes in Advanced Relay Nodes,” IEICE Technical Report, 2010, vol. 109 (438), pp. 25-30.
Cardellini, et al., “Dynamic Load Balancing on Web-Server Systems,” IEEE Internet Computing, 1999, vol. 3 (3), pp. 28-39.
FreeBSD, “tcp—TCP Protocal,” Linux Programmen□s Manual [online], 2007, [retrieved on Apr. 13, 2016], Retreived from the Internet: <https://www.freebsd.org/cgi/man.cgi?query=tcp&apropos=0&sektion=7&manpath=SuSe+Linux%2Fi386+11.0&format=asci>.
Gite, “Linux Tune Network Stack (Buffers Size) to Increase Networking Performance,” nixCraft [online], 2009, [retreived on Apr. 13, 2016], Retreived from the Internet: <URL:http://www.cyberciti.biz/faq/linux-tcp-tuning/>.
Goldszmidt, et al., “NetDispatcher: A TCP Connection Router,” IBM Researc Report, RC 20853, 1997, pp. 1-31.
Kjaer, et al., “Resource Allocation and Disturbance Rejection in Web Servers Using SLAs and Virtualized Servers,” IEEE Transactions on Network Service Management, 2009, vol. 6 (4), pp. 226-239.
Koike, et al., “Transport Middleware for Network-Based Control,” IEICE Technical Report, 2000, vol. 100 (53), pp. 13-18.
Noguchi, “Realizing the Highest Level “Layer 7” Switch”= Totally Managing Network Resources, Applications, and Users =, Computer & Network LAN, 2000, vol. 18 (1), pp. 109-112.
Ohnuma, “AppSwitch: 7th Layer Switch Provided with Full Setup and Report Tools,” Interop Magazine, 2000, vol. 10 (6), pp. 148-150.
Sharifian, et al., “An Approximation-Based Load-Balancing Algorithm with Admission Control for Cluster Web Servers with Dynamic Workloads,” The Journal of Supercomputing, 2010, vol. 53 (3), pp. 440-463.
Spatscheck, et al., “Optimizing TCP Forwarder Performance,” IEEE/ACM Transactions on Networking, 2000, vol. 8 (2), pp. 146-157.
Takahashi, “The Fundamentals of the Windows Network: Understanding the Mystery of the Windows Network from the Basics,” Network Magazine, 2006, vol. 11 (7), pp. 32-35.
Yamamoto, et al., “Performance Evaluation of Window Size in Proxy-Based TCP for Multi-Hop Wireless Networks,” IPSJ SIG Technical Reports, 2008, vol. 2008 (44), pp. 109-114.

Related Publications (1)

	Number	Date	Country
	20180102945 A1	Apr 2018	US

Provisional Applications (1)

	Number	Date	Country
	61705618	Sep 2012	US

Continuations (1)

	Number	Date	Country
Parent	14326325	Jul 2014	US
Child	15838119		US

Continuation in Parts (1)

	Number	Date	Country
Parent	14029656	Sep 2013	US
Child	14326325		US

Graceful scaling in software driven networks

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

US

CPC

International Classifications

Term Extension

Abstract