Global server load balancing

Description

BACKGROUND OF THE INVENTION

1. Field of the Invention

The disclosure invention relates generally to load balancing among servers. More particularly but not exclusively, the present disclosure relates to achieving load balancing by, in response to resolving a DNS query by a client, providing the address of a server that is expected to serve the client with a high performance in a given application.

2. Description of the Related Art

Under the TCP/IP protocol, when a client provides a symbolic name (“URL”) to request access to an application program or another type of resource, the host name portion of the URL needs to be resolved into an IP address of a server for that application program or resource. For example, the URL (e.g., http://www.foundrynet.com/index.htm) includes a host name portion www.foundrynet.com that needs to be resolved into an IP address. The host name portion is first provided by the client to a local name resolver, which then queries a local DNS server to obtain a corresponding IP address. If a corresponding IP address is not locally cached at the time of the query, or if the “time-to-live” (TTL) of a corresponding IP address cached locally has expired, the DNS server then acts as a resolver and dispatches a recursive query to another DNS server. This process is repeated until an authoritative DNS server for the domain (e.g., foundrynet.com, in this example) is reached. The authoritative DNS server returns one or more IP addresses, each corresponding to an address at which a server hosting the application (“host server”) under the host name can be reached. These IP addresses are propagated back via the local DNS server to the original resolver. The application at the client then uses one of the IP addresses to establish a TCP connection with the corresponding host server. Each DNS server caches the list of IP addresses received from the authoritative DNS for responding to future queries regarding the same host name, until the TTL of the IP addresses expires.

To provide some load sharing among the host servers, many authoritative DNS servers use a simple round-robin algorithm to rotate the IP addresses in a list of responsive IP addresses, so as to distribute equally the requests for access among the host servers.

The conventional method described above for resolving a host name to its IP addresses has several shortcomings. First, the authoritative DNS does not detect a server that is down. Consequently, the authoritative DNS server continues to return a disabled host server's IP address until an external agent updates the authoritative DNS server's resource records. Second, when providing its list of IP addresses, the authoritative DNS sever does not take into consideration the host servers' locations relative to the client. The geographical distance between the server and a client is a factor affecting the response time for the client's access to the host server. For example, traffic conditions being equal, a client from Japan could receive better response time from a host server in Japan than from a host server in New York. Further, the conventional DNS algorithm allows invalid IP addresses (e.g., that corresponding to a downed server) to persist in a local DNS server until the TTL for the invalid IP address expires.

SUMMARY OF THE INVENTION

One aspect of the present invention provides an improved method and system for serving IP addresses to a client, based on a selected set of performance metrics. In accordance with this invention, a global server load-balancing (GSLB) switch is provided as a proxy for an authoritative DNS server, together with one or more site switches each associated with one or more host servers. Both the GSLB switch and the site switch can be implemented using the same type of switch hardware in one embodiment. Each site switch provides the GSLB switch with current site-specific information regarding the host servers associated with the site switch. Under one aspect of the present invention, when an authoritative DNS server resolves a host name in a query and returns one or more IP addresses, the GSLB switch filters the IP addresses using the performance metrics compiled from the site-specific information collected from the site switches. The GSLB switch then returns a ranked or weighted list of IP addresses to the inquirer. In one embodiment, the IP address that is estimated to provide the best-expected performance for the client is placed at the top of the list.

Examples of suitable performance metrics include availability metrics (e.g., a server's or an application's health), load metrics (e.g., a site switch's session capacity or a corresponding preset threshold), and proximity metrics (e.g., a round-trip time between the site switch and a requesting DNS server, the geographic location of the host server, the topological distance between the host server and the client program). (A topological distance is the number of hops between the server and the client). Another proximity metrics is the site switch's “flashback” speed (i.e., how quickly a switch receives a health check result). Yet another metric is a connection-load metric that is based on a measure of new connections-per-second at a site. The ordered list can also be governed by other policies, such as the least selected host server.

The present invention is better understood upon consideration of the detailed description of the embodiments below, in conjunction with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a global server load-balancing configuration in accordance with one embodiment of the invention.

FIGS. 2A-2D illustrate in a flow chart one embodiment of an algorithm for selecting the “best” address from the list of addresses supplied by an authoritative DNS, where FIG. 2D depicts the relative position of portions of the flow chart.

FIG. 3 is a block diagram showing the functional modules of a GSLB switch and a site switch relevant to the global server load balancing function in accordance with one embodiment of the invention.

DETAILED DESCRIPTION

Embodiments for global server load-balancing are described herein. In the following description, numerous specific details are given to provide a thorough understanding of embodiments of the invention. One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components, materials, etc. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of the invention.

Reference throughout this specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.

FIG. 1 illustrates one embodiment of the present invention that provides a global server load-balancing configuration. As shown in FIG. 1, global server load balancing (GSLB) switch 12 is connected to Internet 14 and acts as a proxy to an authoritative Domain Name System (DNS) server 16 for the domain “foundrynet.com” (for example). That is, while the actual DNS service is provided by DNS server 16, the IP address known to the rest of the Internet for the authoritative DNS sever of the domain “foundrynet.com” is a virtual IP (VIP) address configured on GSLB switch 12. Of course, DNS server 16 can also act simultaneously as an authoritative DNS for other domains. GSLB switch 12 communicates, via Internet 14, with site switches 18A and 18B at site 20, site switches 22A and 22B at site 24, and any other similarly configured site switches. Site switch 18A, 18B, 22A and 22B are shown, for example, connected to routers 19 and 21 respectively and to servers 26A, . . . , 26I, . . . 26N. Some or all of servers 26A, . . . , 26I, . . . , 26N may host application server programs (e.g., http and ftp) relevant to the present invention. These host servers are reached through site switches 18A, 18B, 22A and 22B using one or more virtual IP addresses configured at the site switches, which act as proxies to the host servers. A suitable switch for implementing either GSLB switch 12 or any of site switches 18A, 18B, 22A and 22B is the “ServerIron” product available from Foundry Networks, Inc.

FIG. 1 also shows client program 28 connected to Internet 14, and communicates with local DNS server 30. When a browser on client 28 requests a web page, for example, using a Universal Resource Locator (URL), such as http://www.foundrynet.com/index.htm, a query is sent to local DNS server 30 to resolve the symbolic host name www.foundrynet.com to an IP address of a host server. The client program receives from DNS server 30 a list of IP addresses corresponding to the resolved host name. This list of IP addresses is either retrieved from local DNS server 30's cache, if the TTL of the responsive IP addresses in the cache has not expired, or obtained from GSLB switch 12, as a result of a recursive query. Unlike the prior art, however, this list of IP addresses is ordered by GSLB switch 12 based on performance metrics described in further detail below.

In the remainder of this detailed description, for the purpose of illustrating embodiments of the present invention only, the list of IP addresses returned are assumed to be the virtual IP addresses configured on the proxy servers at switches 18A, 18B, 22A and 22B (sites 20 and 24). In one embodiment, GSLB switch 12 determines which site switch would provide the best expected performance (e.g., response time) for client 28 and returns the IP address list with a virtual IP address configured at that site switch placed at the top. (Within the scope of the present invention, other forms of ranking or weighting the IP addresses in the list can also be possible.) Client program 28 can receive the ordered list of IP addresses, and typically selects the first IP address on the list to access the corresponding host server.

FIG. 3 is a block diagram showing the functional modules of GSLB switch 12 and site switch 18A relevant to the global server load balancing function. As shown in FIG. 3, GSLB switch 12 includes a GSLB switch controller 401, health check module 402, DNS proxy module 403, metric agent 404, routing metric collector 405, and site-specific metric collector 406. GSLB switch controller 401 provides general control functions for the operation of GSLB switch 12. Health check module 402 is responsible for querying, either periodically or on demand, host servers and relevant applications hosted on the host servers to determine the “health” (e.g., whether or not it is available) of each host server and each relevant application. Site-specific metric collector 406 communicates with metric agents in site-specific switches (e.g., FIG. 3 shows site-specific metric collector 406 communicating with site-specific metric agent 407 of a site server load balancing ServerIron or “SLB SI”) to collect site-specific metrics (e.g., number of available sessions on a specific host server and/or connection-load data at that host server).

For example for a connection-load metric in one embodiment, site-specific metric agent(s) 407 can perform sampling to obtain connections-per-second at their respective site, and then obtains load averages from the samples or performs other calculations. The site-specific metric collector 406 of the GLSB switch 12 then obtains the load averages from the site-specific metric agent(s) 407 and provides these load averages to the switch controller 401, to allow the switch controller 401 to use the load averages to rank the IP addresses on the ordered list. Alternatively or in addition to the site-specific metric agent(s) 407, the switch controller 401 can perform at least some or most of the connection-load calculations from sampling data provided by the site-specific metric agent(s) 407.

Routing metric collector 405 collects routing information from routers (e.g., topological distances between nodes on the Internet). FIG. 3 shows, for example, router 408 providing routing metric collector 405 with routing metrics (e.g., topological distance between the load balancing switch and the router), using the Border Gateway Protocol (BGP). DNS proxy module 403 (a) receives incoming DNS requests, (b) provides the host names to be resolved to DNS server 16, (c) receives from DNS server 16 a list of responsive IP addresses, (d) orders the IP addresses on the list received from DNS server 16 according to an embodiment of the present invention, using the metrics collected by routing-metric collector 405 and site specific collector 406, and values of any other relevant parameter, and (e) provides the ordered list of IP addresses to the requesting DNS server. Since GSLB switch 12 can also act as a site switch, GSLB switch 12 is provided site-specific metric agent 404 for collecting metrics for a site-specific metric collector.

In one embodiment, the metrics used in a GSLB switch 12 includes (a) the health of each host server and selected applications, (b) each site switch's session capacity threshold, (c) the round trip time (RTT) between a site switch and a client in a previous access, (d) the geographical location of a host server, (e) the connection-load measure of new connections-per-second at a site switch, (f) the current available session capacity in each site switch, (g) the “flashback” speed between each site switch and the GSLB switch (i.e., how quickly each site switch responds to a health check from the GSLB switch), and (h) a policy called the “Least Response Selection” (LRS) which prefers the site least selected previously. Many of these performance metrics can be provided default values. Each individual metric can be used in any order and each metric can be disabled. In one embodiment, the LRS metric is always enabled.

FIGS. 2A-2D illustrate in a flow diagram one embodiment of an optimization algorithm utilized by GSLB switch 12 to process the IP address list received from DNS server 16, in response to a query resulting from client program 28, where FIG. 2D shows the relative position of portions of the flow diagram shown in FIGS. 2A-2C. At least some of the elements of the flow diagram can be embodied in software or other machine-readable instruction stored on one or more machine-readable storage media. For example, such software to perform portions of the algorithm may be present at the GSLB switch 12 in one embodiment and executed by the switch controller 401.

As shown in FIG. 2A, in act 100, upon receiving the IP address list from DNS server 16, GSLB switch 12 performs, for each IP address on the IP address list (e.g., host server 261 connected to site switch 18B), a layer 4 health check and a layer 7 check. Here, layers 4 and 7 refer respectively to the transport and application protocols in the Open System Interconnection (OSI) protocol layers. The layer 4 health check can be a Transmission Control Protocol (TCP) health check or a User Datagram Protocol (UDP) health check. Such a health check can be achieved, for example, by a “ping-like” operation defined under the relevant protocol. For example, under the TCP protocol, a TCP SYN packet can be sent, and the health of the target is established when a corresponding TCP ACK packet is received back from the target. In this embodiment, the layer 7 health check is provided for specified applications, such as the well-known HyperText Transport Protocol (HTTP) and the File Transfer Protocol (FTP) applications. If a host server or an associated application fails any of the health checks it is disqualified (act 100) from being the “best” site and may be excluded from the IP address list to be returned to client program 28. Since the health check indicates whether or not a host server or an associated application is available, the health check metric is suitable for use to eliminate an IP address from the candidates for the “best” IP address (i.e., the host server expected to provide the highest performance). After act 100, if the list of IP addresses has only one IP address (act 101), the list of IP addresses is returned to client program 28 at act 108.

After act 100, if the list of candidate IP addresses for the best site has multiple IP addresses, it is further assessed in act 102 based upon the capacity threshold of the site switch serving that IP address. Each site switch may have a different maximum number of TCP sessions it can serve. For example, the default number for the “ServerIron” product of Foundry Network is one million sessions, although it can be configured to a lower number. The virtual IP address configured at site switch 18B may be disqualified from being the “best” IP address if the number of sessions for switch 18B exceed a predetermined threshold percentage (e.g., 90%) of the maximum number of sessions. (Of course, the threshold value of 90% of the maximum capacity can be changed.) After act 102, if the list of IP addresses has only one IP address (act 103), the list of IP addresses is returned to client program 28 at act 108.

After act 102, if the IP address list has multiple IP addresses (act 103), the remaining IP addresses on the list can then be reordered in act 104 based upon a round-trip time (RTT) between the site switch for the IP address (e.g., site switch 18B) and the client (e.g., client 28). The RTT is computed for the interval between the time when a client machine requests a TCP connection to a proxy server configured on a site switch, sending the proxy server a TCP SYN packet, and the time a site switch receives from the client program a TCP ACK packet. (In response to the TCP SYN packet, a host server sends a TCP SYN ACK packet, to indicate acceptance of a TCP connection; the client machine returns a TCP ACK packet to complete the setting up of the TCP connection.) The GSLB switch (e.g., GSLB switch 12) maintains a database of RTT, which it creates and updates from data received periodically from the site switches (e.g., site switches 18A, 18B, 22A and 22B). Each site collects and stores RTT data for each TCP connection established with a client machine. In one embodiment, the GSLB switch favors one host server over another only if the difference in their RTTs with a client machine is greater than a specified percentage, the default specified percentage value being 10%, for example. To prevent bias, the GSLB switch ignores, by default, RTT values for 5% of client queries from each responding network, for example. After act 105, if the top entries on the list of IP addresses do not have equal RTTs, the list of IP addresses is returned to client program 28 at act 108.

If multiple sites have equal RTTs (act 105), then the list is reordered in act 106 based upon the location (geography) of the host server. The geographic location of a server is determined according to whether the IP address is a real address or a virtual IP address (“VIP”). For a real IP address, the geographical region for the host server can be determined from the IP address itself. Under IANA, regional registries RIPE (Europe), APNIC (Asia/Pacific Rim) and ARIN (the Americas and Africa) are each assigned different prefix blocks. In one embodiment, an IP address administered by one of these regional registries is assumed to correspond to a machine located inside the geographical area administered by the regional registry. For a VIP, the geographic region is determined from the management IP address of the corresponding site switch. Of course, a geographical region can be prescribed for any IP address to override the geographic region determined from the procedure above. The GSLB switch prefers an IP address that is in the same geographical region as the client machine in an embodiment. At act 107, if the top two entries on the IP list are not equally ranked, the IP list is sent to the client program 28 at act 108.

After act 107, if multiple sites are of equal rank for the best site, the IP addresses can then be reordered based upon site connection load (act 114). The connection-load metric feature allows comparison of sites based on the connection-load on their respective agent (e.g., at the metric agent 407 of the site ServerIron switch 18A in FIG. 3, for instance).

The connection-load is a measure of new connections-per-second on the agent 407 in one embodiment. An administrator can set a threshold limit for the connection-load to pass a given site; can select the number of load sampling intervals and duration of each interval; and can select the relative weight for each interval to calculate the average load for a period of time (i.e., new connections per the period of time).

The “connection load limit” value specifies the load limit for any site to pass the metric. The minimum value is 1, and a parser or other software component in the site switch 18A, for instance, limits the maximum value—there need not be a default value. By default, this connection-load metric is turned off and can be turned on when the load limit is specified. The average load for a given site is calculated using the user-defined weights and intervals, which will be explained later below. If the calculated average load is less than the load limit specified, the site is passed on to the next stage of the GSLB algorithm described herein—otherwise that site is eliminated/rejected from the set of potential candidates.

In one embodiment, the number of “load sampling intervals” and also the “sampling rate” can be configured. The sampling rate defines the duration of each sampling interval in multiples of the initial rate. For example, if 6 sampling intervals and a sampling rate of 5 seconds are chosen, the site will sample the average load at 5, 10, 15, 20, 25, and 30. At any instant, the site will have the average load for the previous 5 seconds, 10 seconds, 15 seconds, 20 seconds, 25 seconds, and 30 seconds. This is a “moving average” in that at the 35th second, for example, the average for the 5th to 35th seconds is calculated. Note that even though this is a moving average, the accuracy is limited by the initial sampling rate, meaning that since samples are taken after every 5 seconds, at the 7th second, the average for the 1 st to 5th second is available and not the 2nd to 7th second average.

The sampling rate also defines the update interval for the site (e.g., the site-specific metric agent 407) to upload the load averages to the metric collector 406 at the GSLB switch 12. A given site is capable of maintaining load-averages for any number of collectors at a time. Each collector is updated with the load information periodically, and the update interval is also specific to the collector in various example embodiments.

The minimum number of intervals is 1 and the max is 8 in one embodiment. The default number is 5, which is set when the connection load limit is configured. It is appreciated that these are merely illustrative examples and may be different based on the particular implementation.

For the load-sampling interval, the minimum value is 1 second and maximum value is 60 seconds. The default value is 5 seconds. So, the maximum range for load average calculation is 60*8 seconds=480 seconds=8 minutes. Thus, one can consider up to the previous 8-minute average for load analysis. Again, these are example settings.

Weights can be assigned to each interval to calculate the average load. By default in one embodiment, each interval is given an equal weight of 1. The average load for a site can be calculated using the following formula:

$\frac{\sum_{i = 0}^{N} (AvgLoad of interval i) * (Weight of interval i)}{\sum_{i = 0}^{N} (Weight of interval i)}$

where N=Number of sampling intervals and AvgLoad of interval i=new connections of interval i.

The contribution of any interval can be nullified by giving it a weight of zero. If every interval is given a weight of zero, the average load is zero. (We cannot divide by zero). In one embodiment, the site-specific metric agent 407 can calculate this average load and provide it to the metric collector 406 at the GSLB switch 12. In other embodiments, the metric collector 406 and/or the switch controller 401 can perform the average load calculation based on values collected and provided by the site-specific metric agent 407.

By default, the connection-load metric is not turned on in the GSLB algorithm. The metric is automatically turned on when the user specifies the connection-load limit, in an embodiment. The specific configuration needs for connection-load sampling and calculation can be configured on the switch controller 401, whether the switch 12 is used for GSLB or as a site-specific switch.

To configure the connection load limit (such as a connection load limit of 500), at the GSLB policy configuration level, the following example command can be used:

SW-GSLB-Controller (config-gslb-policy) #connection-load limit 500

Again, as described above, if the calculated average load is less than this limit, then the site is kept as a potential candidate.

To configure the number of sampling intervals and the sampling rate (e.g., sampling rate=5, interval=6), the following example command may be used:

SW-GSLB-Controller (config-gslb-policy) #connection-load intervals 6 5.

To configure the interval weights, the following example command can be used:

SW-GSLB-Controller (config-gslb-policy) #connection-load weights 1 2 3 4 5 6

The syntax of this command is: connection-load weights<weight of interval-1><weight of interval-2><weight of interval-3> . . . up to 8, for example.

All weights for all intervals need not be configured if not considering beyond a certain point. The configured weights will be assigned to intervals starting from the first and any non-configured interval will be assigned a weight of zero. For example, if only the 5-second average is desired, the following can be used:

SW-GSLB-Controller (config-gslb-policy) #connection-load intervals 6 5

SW-GSLB-Controller (config-gslb-policy) #connection-load weights 1

Thus, even though 6 intervals are configured in the above example, all the others are nullified due to zero weights.

By default the connection-load metric is not included in the GSLB algorithm. Once the connection-load limit is configured, the metric is included after the geographic-location metric in the metric order according to one embodiment, such as shown in FIG. 2B. It is understood that the metric order can be changed or customized.

At act 115, if there are no multiple candidates at the top of the IP list that have passed the connection-load metric (or there are none of equal rank), then the IP address list is sent to the client program 28 at act 108. After act 115, if multiple sites are of equal rank for the best site, the IP addresses can then be reordered based upon available session capacity (act 109). For example in one embodiment, if switch 18A has 1,000,000 sessions available and switch 22B has 800,000 sessions available, switch 18A is then preferred, if a tolerance limit, representing the difference in sessions available expressed as a percentage of capacity in the larger switch, is exceeded. For example, if the tolerance limit is 10%, switch 18A will have to have at a minimum 100,000 more sessions available than switch 22B to be preferred. If an IP address is preferred (act 110), the IP address will be placed at the top of the IP address list, and is then returned to the requesting entity at act 108. Otherwise, if the session capacity does not resolve the best IP address, act 111 then attempts to a resolution based upon a “flashback” speed. The flashback speed is a time required for a site switch to respond to layers 4 and 7 health checks by the GSLB switch. The flashback speed is thus a measure of the load on the host server. Again, the preferred IP address will correspond to a flashback speed exceeding the next one by a preset tolerance limit.

In one embodiment, flashback speeds are measured for well-known applications (layer 7) and their corresponding TCP ports (layer 4). For other applications, flashback speeds are measured for user selected TCP ports. Layer 7 (application-level) flashback speeds are compared first, if applicable. If the application flashbacks fail to provide a best IP address, layer 4 flashback speeds are compared. If a host server is associated with multiple applications, the GSLB switch selects the slowest response time among the applications for the comparison. At act 112, if a best IP address is resolved, the IP address list is sent to client program 28 at act 108. Otherwise, at act 113, an IP address in the site that is least often selected to be the “best” site is chosen. The IP address list is then sent to client program 28 (act 108).

Upon receipt of the IP address list, the client program 28 uses the best IP address selected (i.e., the top of the list) to establish a TCP connection with a host server. Even then, if there is a sudden traffic surge that causes a host server to be overloaded, or if the host servers or the applications at the site become unavailable in the mean time, the site switch can redirect the TCP connection request to another IP address using, for example, an existing HTTP redirection procedure.

To provide an RTT under an embodiment of the present invention described above, at the first time a client accesses an IP address, a site switch (e.g., site switch 22A of FIG. 2) monitors the RTT time—the time difference between receiving a TCP SYN and a TCP ACK for the TCP connection—and records it in an entry of the cache database. The RTT time measured this way corresponds to the natural traffic flow between the client machine and the host server specified, rather than an artificial RTT based on “pinging” the client machine under a standard network protocol. Periodically, the site switches report the RTT database to a GSLB switch along with load conditions (e.g., number of sessions available). The GSLB switch aggregates the RTTs reported into a proximity table indexed by network neighborhood. (A network neighborhood is the portion of a network sharing a prefix of an IP address.) The GSLB switch can thus look up the RTT for a client machine to any specific host server, based on the client's network neighborhood specified in the client's IP address. From the accesses to the host servers from a large number of network neighborhoods, the GSLB switch can build a comprehensive proximity knowledge database that enables smarter site selection. In order to keep the proximity table useful and up-to-date, the GSLB switch manages the proximity table with cache management policies (e.g., purging infrequently used entries in favor of recently obtained RTTs). The proximity data can be used for all IP addresses served by each site switch.

All of the above U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification and/or listed in the Application Data Sheet, are incorporated herein by reference, in their entirety.

The above description of illustrated embodiments of the invention, including what is described in the Abstract, is not intended to be exhaustive or to limit the invention to the precise forms disclosed. While specific embodiments of, and examples for, the invention are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the invention and can be made without deviating from the spirit and scope of the invention.

These and other modifications can be made to the invention in light of the above detailed description. The terms used in the following claims should not be construed to limit the invention to the specific embodiments disclosed in the specification and the claims. Rather, the scope of the invention is to be determined entirely by the following claims, which are to be construed in accordance with established doctrines of claim interpretation.

Claims

1. An apparatus, comprising: a load balance switch that includes: switch hardware; anda controller adapted to calculate an average load of new connections to each respective site switch of a plurality of site switches, and to rank virtual IP addresses of the site switches based on the calculated average load of new connections to each said site switch,wherein a number of said new connections is determined at least in part using a weighted sampling interval of a plurality of sampling intervals,wherein at least one sampling interval of the plurality of sampling intervals is configurable with a weight of one and at least another sampling interval of the plurality of sampling intervals is configurable with a non-zero weight other than one.
2. The apparatus of claim 1 wherein said number of said new connections is determined using a sampling rate and said weighted sampling interval.
3. The apparatus of claim 2 wherein said load balance switch is configurable with its own at least one virtual IP address, and said load balance switch further includes: a metric agent to sample a number of new connections to the load balance switch for the at least one virtual IP address configured on the load balance switch, using a sampling rate and a sampling interval, and to provide information resulting from the sampling to the controller, wherein the controller is adapted to calculate an average load of new connections to the load balance switch based on the provided information resulting from the sampling, andwherein the controller is adapted to perform said ranking of the virtual IP addresses based also on the calculated average load of new connections to the load balance switch.
4. The apparatus of claim 1 wherein the controller is adapted to rank the virtual IP addresses by accepting a virtual IP address of any of said site switches that has a calculated average load of new connections less than a connection load limit and by rejecting a virtual IP address of any of said site switches that has a calculated average load of new connections greater than the connection load limit.
5. The apparatus of claim 1 wherein said controller is adapted to rank said virtual IP addresses based on a plurality of performance metrics.
6. The apparatus of claim 1 wherein still another sampling interval of the plurality of sampling intervals is configurable with a weight of 0 to nullify a contribution of said still another sampling interval.
7. An apparatus, comprising: a load balance switch that includes: switch hardware; anda controller adapted to rank respective virtual IP addresses of respective site switches based on a calculated average load of new connections to each of said site switches,wherein a number of said new connections is determined at least in part using a sampling interval that is configurable with a weight and that is included amongst a plurality of sampling intervals,wherein intervals of said plurality of sampling intervals are configurable with non-zero weights that are different from each other.
8. The apparatus of claim 7 wherein the average load of new connections is calculated by the respective site switch using a sampling rate and said sampling interval that is configurable with the weight.
9. The apparatus of claim 7 wherein the controller is adapted to calculate the average load of new connections for the respective site switch using said number of new connections, which is received from the respective site switch.
10. The apparatus of claim 7 wherein the controller is adapted to rank the virtual IP addresses by accepting a virtual IP address of any of said site switches that has a calculated average load of new connections less than a connection load limit and by rejecting a virtual IP address of any of said site switches that has a calculated average load of new connections greater than the connection load limit.
11. The apparatus of claim 7 wherein said controller is adapted to rank said virtual IP addresses based on a plurality of performance metrics.
12. The apparatus of claim 7 wherein at least another sampling interval of the plurality of sampling intervals is configurable with a weight of 0 to nullify a contribution of said still another sampling interval.
13. The apparatus of claim 7 wherein said at least two sampling intervals of said plurality of sampling intervals that are configurable with non-zero weights that are different from each other include: a first sampling interval configurable with a weight of one; anda second sampling interval configurable with a non-zero weight other than one.
14. An apparatus, comprising: a load balance switch that includes: switch hardware;a DNS proxy module adapted to receive a list of virtual IP addresses; anda controller coupled to the DNS proxy module and adapted to arrange the received list of virtual IP addresses based on a calculated average load of new connections to each of said virtual IP addresses,wherein a number of said new connections is determined at least in part using a sampling interval that is configurable with a weight and that is included amongst a plurality of sampling intervals,wherein at least two sampling intervals of said plurality of sampling intervals are configurable with non-zero weights that are different from each other.
15. The apparatus of claim 14 wherein the load balance switch is also is configured with its own at least one virtual IP address, and said load balance switch further includes: a metric agent adapted to sample new connections to the load balance switch for the at least one virtual IP address using a sampling rate and a sampling interval so as to obtain a number of new connections to the at least one virtual IP address, and to provide the obtained number of new connections to the controller for calculation of an average load of new connections to the at least one virtual IP address.
16. The apparatus of claim 14 wherein the controller is adapted to receive the calculated average load of new connections from each respective site switch of a plurality of site switches.
17. The apparatus of claim 15 wherein the calculated average load of new connections is calculated from said number of new connections as sampled using a sampling rate and said sampling interval that is configurable with the weight.
18. The apparatus of claim 14 wherein said controller is adapted to arrange said virtual IP addresses based on a plurality of performance metrics.
19. The apparatus of claim 14 wherein still another sampling interval of the plurality of sampling intervals is configurable with a weight of 0 to nullify a contribution of said still another sampling interval.
20. The apparatus of claim 14 wherein said at least two sampling intervals of said plurality of sampling intervals that are configurable with non-zero weights that are different from each other include: a first sampling interval configurable with a weight of one; anda second sampling interval configurable with a non-zero weight other than one.
21. An apparatus, comprising: a load balance switch to receive a list of virtual IP addresses and that includes: switch hardware; andcontroller means for ranking the virtual IP addresses in the received list based on a calculated average load of new connections to each of said virtual IP addresses,wherein a number of said new connections is determined at least in part using a sampling interval that is configurable with a weight and that is included amongst a plurality of sampling intervals,wherein at least two sampling intervals of said plurality of sampling intervals are configurable with non-zero weights that are different from each other.
22. The apparatus of claim 21, wherein: at least one virtual IP address is configurable on the load balance switch; andsaid controller means for ranking ranks said virtual IP addresses based on a plurality of performance metrics.
23. The apparatus of claim 21 wherein the calculated average load of new connections is received from each respective site switch of a plurality of site switches.
24. The apparatus of claim 21 wherein the controller means for ranking calculates the average load of new connections from said number of new connections as sampled using a sampling rate and said sampling interval that is configurable with the weight.
25. The apparatus of claim 21 wherein still another sampling interval of the plurality of sampling intervals is configurable with a weight of 0 to nullify a contribution of said still another sampling interval.
26. The apparatus of claim 21 wherein said at least two sampling intervals of said plurality of sampling intervals that are configurable with non-zero weights that are different from each other include: a first sampling interval configurable with a weight of one; anda second sampling interval configurable with a non-zero weight other than one.
27. A method, comprising: receiving, by a load balance switch having switch hardware, a list of virtual IP addresses; andranking, by said load balance switch, the virtual IP addresses in the received list based on a calculated average load of new connections to each of said virtual IP addresses,wherein a number of said new connections is determined at least in part using a sampling interval that is configurable with a weight and that is included amongst a plurality of sampling intervals,wherein at least two sampling intervals of said plurality of sampling intervals are configurable with non-zero weights that are different from each other.
28. The method of claim 27 wherein: at least one virtual IP address is configurable on the load balance switch; andnew connections to the at least one virtual IP address on the load balance switch is obtained using a sampling rate and a sampling interval.
29. The method of claim 27 wherein the calculated average load of new connections is received by said load balance switch from each respective site switch of a plurality of site switches.
30. The method of claim 27, further comprising calculating, by said load balance switch, the average load of new connections from said number of new connections as sampled using a sampling rate and said sampling interval that is configurable with the weight.
31. The method of claim 27 wherein said at least two sampling intervals of said plurality of sampling intervals that are configurable with non-zero weights that are different from each other include: a first sampling interval configurable with a weight of one; anda second sampling interval configurable with a non-zero weight other than one.
32. A system, comprising: a plurality of site switches, each site switch being respectively configured with a virtual IP address; anda load balance switch configured to balance load, amongst the plurality of site switches, according to a set of performance metrics so as to select a preferred virtual IP address configured at one of the site switches,wherein the set of performance metrics includes a metric based on a calculated average load of new connections to each of the plurality of site switches,wherein a number of said new connections is determined at least in part using a weighted sampling interval of a plurality of sampling intervals,wherein at least one sampling interval of the plurality of sampling intervals is configurable with a weight of one and at least another sampling interval of the plurality of sampling intervals is configurable with a non-zero weight other than one.
33. The system of claim 32, further comprising an authoritative domain name server coupled to the load balance switch, wherein the load balance switch is further configured to be a proxy to said authoritative domain name server, and wherein the load balance switch is configured to said balance load by use of said set of performance metrics to arrange virtual IP addresses in a list received from the authoritative domain name server.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application is a continuation that claims the benefit under 35 U.S.C. §120 of U.S. patent application Ser. No. 10/206,580, entitled “GLOBAL SERVER LOAD BALANCING,” filed Jul. 25, 2002, which is a continuation-in-part of U.S. application Ser. No. 09/670,487, entitled “GLOBAL SERVER LOAD BALANCING,” filed Sep. 26, 2000, both of which are assigned to the same assignee as the present application, and which are incorporated herein by reference their entireties.

US Referenced Citations (211)

Number	Name	Date	Kind
5031094	Toegel et al.	Jul 1991	A
5359593	Derby et al.	Oct 1994	A
5867706	Martin et al.	Feb 1999	A
5948061	Merriman et al.	Sep 1999	A
5951634	Sitbon et al.	Sep 1999	A
6006269	Phaal	Dec 1999	A
6006333	Nielsen	Dec 1999	A
6078956	Bryant et al.	Jun 2000	A
6092178	Jindal et al.	Jul 2000	A
6112239	Kenner et al.	Aug 2000	A
6115752	Chauhan	Sep 2000	A
6119143	Dias et al.	Sep 2000	A
6128279	O'Neil et al.	Oct 2000	A
6128642	Doraswamy et al.	Oct 2000	A
6148410	Baskey et al.	Nov 2000	A
6157649	Peirce et al.	Dec 2000	A
6167445	Gai et al.	Dec 2000	A
6167446	Lister et al.	Dec 2000	A
6178160	Bolton et al.	Jan 2001	B1
6182139	Brendel	Jan 2001	B1
6195691	Brown	Feb 2001	B1
6233604	Van Horne et al.	May 2001	B1
6260070	Shah	Jul 2001	B1
6262976	McNamara	Jul 2001	B1
6286039	Van Horne et al.	Sep 2001	B1
6286047	Ramanathan et al.	Sep 2001	B1
6304913	Rune	Oct 2001	B1
6317775	Coile et al.	Nov 2001	B1
6324177	Howes et al.	Nov 2001	B1
6324580	Jindal et al.	Nov 2001	B1
6327622	Jindal et al.	Dec 2001	B1
6336137	Lee et al.	Jan 2002	B1
6381627	Kwan et al.	Apr 2002	B1
6389462	Cohen et al.	May 2002	B1
6393473	Chu	May 2002	B1
6405252	Gupta et al.	Jun 2002	B1
6411998	Bryant et al.	Jun 2002	B1
6427170	Sitaraman et al.	Jul 2002	B1
6434118	Kirschenbaum	Aug 2002	B1
6438652	Jordan et al.	Aug 2002	B1
6446121	Shah et al.	Sep 2002	B1
6449657	Stanbach, Jr. et al.	Sep 2002	B2
6470389	Chung et al.	Oct 2002	B1
6473802	Masters	Oct 2002	B2
6480508	Mwikalo et al.	Nov 2002	B1
6487555	Bharat	Nov 2002	B1
6490624	Sampson et al.	Dec 2002	B1
6513061	Ebata et al.	Jan 2003	B1
6542964	Scharber	Apr 2003	B1
6549944	Weinberg et al.	Apr 2003	B1
6578066	Logan et al.	Jun 2003	B1
6578077	Rakoshitz et al.	Jun 2003	B1
6606643	Emens et al.	Aug 2003	B1
6611861	Schairer et al.	Aug 2003	B1
6647009	Kubota et al.	Nov 2003	B1
6665702	Zisapel et al.	Dec 2003	B1
6681232	Sistanizadeh et al.	Jan 2004	B1
6681323	Fontanesi et al.	Jan 2004	B1
6691165	Bruck et al.	Feb 2004	B1
6725253	Okano et al.	Apr 2004	B1
6745241	French et al.	Jun 2004	B1
6748416	Carpenter et al.	Jun 2004	B2
6754699	Swildens et al.	Jun 2004	B2
6760775	Anerousis	Jul 2004	B1
6772211	Lu et al.	Aug 2004	B2
6779017	Lamberton et al.	Aug 2004	B1
6789125	Aviani et al.	Sep 2004	B1
6795434	Kumar et al.	Sep 2004	B1
6795860	Shah	Sep 2004	B1
6801949	Bruck et al.	Oct 2004	B1
6810411	Coughlin et al.	Oct 2004	B1
6826198	Turina et al.	Nov 2004	B2
6839700	Doyle et al.	Jan 2005	B2
6850984	Kalkunte et al.	Feb 2005	B1
6874152	Vermeire et al.	Mar 2005	B2
6879995	Chinta et al.	Apr 2005	B1
6880000	Tominaga et al.	Apr 2005	B1
6883028	Johnson et al.	Apr 2005	B1
6898633	Lyndersay et al.	May 2005	B1
6901081	Ludwig	May 2005	B1
6920498	Gourlay et al.	Jul 2005	B1
6928485	Krishnamurthy et al.	Aug 2005	B1
6950848	Yousefi'zadeh	Sep 2005	B1
6963914	Breitbart et al.	Nov 2005	B1
6963917	Callis et al.	Nov 2005	B1
6985956	Luke et al.	Jan 2006	B2
6987763	Rochberger et al.	Jan 2006	B2
6996615	McGuire	Feb 2006	B1
6996616	Leighton et al.	Feb 2006	B1
7000007	Valenti	Feb 2006	B1
7020698	Andrews et al.	Mar 2006	B2
7020714	Kalyanaraman et al.	Mar 2006	B2
7028083	Levine et al.	Apr 2006	B2
7032010	Swildens et al.	Apr 2006	B1
7032031	Jungck et al.	Apr 2006	B2
7036039	Holland	Apr 2006	B2
7047300	Oehrke et al.	May 2006	B1
7058706	Iyer et al.	Jun 2006	B1
7058717	Chao et al.	Jun 2006	B2
7062642	Langrind et al.	Jun 2006	B1
7082102	Wright	Jul 2006	B1
7086061	Joshi et al.	Aug 2006	B1
7089293	Grosner et al.	Aug 2006	B2
7099915	Tenereillo et al.	Aug 2006	B1
7114008	Jungck et al.	Sep 2006	B2
7117269	Lu et al.	Oct 2006	B2
7117530	Lin	Oct 2006	B1
7124188	Mangipudi et al.	Oct 2006	B2
7127713	Davis et al.	Oct 2006	B2
7136932	Schneider	Nov 2006	B1
7139242	Bays	Nov 2006	B2
7177933	Foth	Feb 2007	B2
7185052	Day	Feb 2007	B2
7197547	Miller et al.	Mar 2007	B1
7206806	Pineau	Apr 2007	B2
7213068	Kohli et al.	May 2007	B1
7225272	Kelley et al.	May 2007	B2
7240015	Karmouch et al.	Jul 2007	B1
7240100	Wein et al.	Jul 2007	B1
7254626	Kommula et al.	Aug 2007	B1
7257642	Bridger et al.	Aug 2007	B1
7260645	Bays	Aug 2007	B2
7277954	Stewart et al.	Oct 2007	B1
7296088	Padmanabhan et al.	Nov 2007	B1
7321926	Zhang et al.	Jan 2008	B1
7330908	Jungck	Feb 2008	B2
7383288	Miloushev et al.	Jun 2008	B2
7423977	Joshi et al.	Sep 2008	B1
7441045	Skene et al.	Oct 2008	B2
7454500	Hsu et al.	Nov 2008	B1
7496651	Joshi	Feb 2009	B1
7523181	Swildens et al.	Apr 2009	B2
7573886	Ono	Aug 2009	B1
7574508	Kommula	Aug 2009	B1
7581009	Hsu et al.	Aug 2009	B1
7584262	Wang et al.	Sep 2009	B1
7584301	Joshi	Sep 2009	B1
7657629	Kommula	Feb 2010	B1
7676576	Kommula	Mar 2010	B1
7756965	Joshi	Jul 2010	B2
7840678	Joshi	Nov 2010	B2
7885188	Joshi	Feb 2011	B2
7899899	Joshi	Mar 2011	B2
7949757	Joshi	May 2011	B2
20010049741	Skene et al.	Dec 2001	A1
20010052016	Skene et al.	Dec 2001	A1
20020026551	Kamimaki et al.	Feb 2002	A1
20020038360	Andrews et al.	Mar 2002	A1
20020055939	Nardone et al.	May 2002	A1
20020059170	Vange	May 2002	A1
20020059464	Hata et al.	May 2002	A1
20020062372	Hong et al.	May 2002	A1
20020078233	Biliris et al.	Jun 2002	A1
20020087722	Datta et al.	Jul 2002	A1
20020091840	Pulier et al.	Jul 2002	A1
20020112036	Bohannon et al.	Aug 2002	A1
20020120743	Shabtay et al.	Aug 2002	A1
20020120763	Miloushev et al.	Aug 2002	A1
20020124096	Loguinov et al.	Sep 2002	A1
20020133601	Kennamer et al.	Sep 2002	A1
20020150048	Ha et al.	Oct 2002	A1
20020154600	Ido et al.	Oct 2002	A1
20020188862	Trethewey et al.	Dec 2002	A1
20020194324	Guha	Dec 2002	A1
20020194335	Maynard	Dec 2002	A1
20030018796	Chou et al.	Jan 2003	A1
20030031185	Kikuchi et al.	Feb 2003	A1
20030035430	Islam et al.	Feb 2003	A1
20030065711	Acharya et al.	Apr 2003	A1
20030065763	Swildens et al.	Apr 2003	A1
20030105797	Dolev et al.	Jun 2003	A1
20030115283	Barbir et al.	Jun 2003	A1
20030135509	Davis et al.	Jul 2003	A1
20030154239	Davis et al.	Aug 2003	A1
20030210686	Terrell et al.	Nov 2003	A1
20030210694	Jayaraman et al.	Nov 2003	A1
20030229697	Borella	Dec 2003	A1
20040019680	Chao et al.	Jan 2004	A1
20040024872	Kelley et al.	Feb 2004	A1
20040039847	Persson et al.	Feb 2004	A1
20040064577	Dahlin et al.	Apr 2004	A1
20040194102	Neerdaels	Sep 2004	A1
20040249939	Amini et al.	Dec 2004	A1
20040249971	Klinker	Dec 2004	A1
20040259565	Lucidarme	Dec 2004	A1
20050002410	Chao et al.	Jan 2005	A1
20050021883	Shishizuka et al.	Jan 2005	A1
20050033858	Swildens et al.	Feb 2005	A1
20050086295	Cunningham et al.	Apr 2005	A1
20050149531	Srivastava	Jul 2005	A1
20050169180	Ludwig	Aug 2005	A1
20050286416	Shimonishi et al.	Dec 2005	A1
20060020715	Jungck	Jan 2006	A1
20060036743	Deng et al.	Feb 2006	A1
20060167894	Wunner	Jul 2006	A1
20060209689	Nakano et al.	Sep 2006	A1
20070168448	Garbow et al.	Jul 2007	A1
20070168547	Krywaniuk	Jul 2007	A1
20070180113	Van Bemmel	Aug 2007	A1
20080037420	Tang	Feb 2008	A1
20080123597	Arbol et al.	May 2008	A1
20080144784	Limberg	Jun 2008	A1
20080147866	Stolorz et al.	Jun 2008	A1
20100010991	Joshi	Jan 2010	A1
20100011120	Kommula	Jan 2010	A1
20100153558	Kommula	Jun 2010	A1
20100223621	Joshi	Sep 2010	A1
20100293296	Hsu et al.	Nov 2010	A1
20100299427	Joshi	Nov 2010	A1
20110099261	Joshi	Apr 2011	A1
20110122771	Joshi	May 2011	A1

Related Publications (1)

	Number	Date	Country
	20100082787 A1	Apr 2010	US

Continuations (1)

	Number	Date	Country
Parent	10206580	Jul 2002	US
Child	11707697		US

Continuation in Parts (1)

	Number	Date	Country
Parent	09670487	Sep 2000	US
Child	10206580		US

Global server load balancing

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

US Classifications

Field of Search

US

International Classifications

Term Extension