A technique according to the present invention relates to information processing using a load balancer.
In one conventionally known method for sharing session data, an application server, selected through load balancing performed by a load balancer, adds session data to a response packet and transmits the response packet as a reply to the load balancer, and the load balancer stores the session data added to a response packet. Then the load balancer transfers the response packet to a client terminal after deleting the session data therefrom (see Japanese Patent Application Laid-open No. 2010-79523).
Communication identification information such as a session ID has conventionally been used to identify communications between a client and a server. It is a common practice, in a case where a service is provided to the client by using a plurality of servers, to replicate the communication identification information among the servers to maintain communications.
Unfortunately, a method in which the communication identification information is replicated among all the servers involves a large load considering the amount of communications and the number of replicating processes. However, in a method in which the communication identification information is replicated among some of the servers, allocation of communication may be implemented to a server which does not have the communication identification information.
The present invention is made in view of the problems described above, and an object of the present invention is to reduce a load for maintaining communications when a service is provided to a client by using a plurality of servers.
An information processing system according to an aspect of the present invention is an information processing system including a load balancer that determines an allocation target of communication with a client, with the communication being allocated to any of a plurality of servers to provide a service to the client, the information processing system including: a server identification unit that identifies a first server to which the communication has been allocated by the load balancer, from among the plurality of servers; a replication target server determination unit that determines at least one server from among the plurality of servers as a second server which is a target of replication of communication identification information for identifying the communication; a replication unit that replicates to the second server the communication identification information to be stored in the second server; and a notification unit that notifies the second server of information with which the first server is able to be identified.
The replication target server determination unit determines the second server by processing a part or all of the communication identification information on a basis of a predetermined algorithm that has reproducibility with respect to an input.
The information processing system according to the present invention may further include a server selection unit that selects the second server by processing the communication identification information extracted from a content of the communication on the basis of the predetermined algorithm, when the load balancer allocates the communication to the server other than the first server.
The information processing system according to the present invention may further include a communication identification information generation unit that generates the communication identification information in the first server, when the load balancer allocates the communication to the first server.
The replication target server determination unit may further determine at least one of the plurality of servers as the second sever, when one of the second servers is the same as the first server.
The plurality of servers may each include a non-volatile storage device, and when multiple servers are determined as the replication target servers by the replication target server determination unit, at least one of the multiple servers may not store the communication identification information in the non-volatile storage device.
The replication target server determination unit may determine, as the second server, the server that the first server has determined as the target of replication of the communication identification information.
The present invention may be regarded as an information processing apparatus, an information processing system including one or more information processing apparatus, a method performed by a computer, or a program that causes the computer to perform the method. The present invention may further be regarded as a recording medium stored with the program, readable by a computer, a device, a machine, and the like. Here, the recording medium, readable by a computer and the like, is a recording medium that stores information such as data and a program electrically, magnetically, optically, mechanically, or through a chemical action, the information being readable by the computer and the like.
An embodiment of the present invention is described below with reference to the drawings. The embodiment described below is merely an example, and the present invention is not limited to specific configurations described below.
The present invention may be implemented as appropriate using specific configurations for respective embodiments.
<System configuration>
The servers 1 are information processing apparatuses each including: a controller 10 including a central processing unit (CPU) 11, a random access memory (RAM) 12, and a read only memory (ROM) 13; a non-volatile auxiliary storage 14; an input device 15; an output device 16; and a network interface 17. It is to be noted that the configuration of the information processing apparatus according to an embodiment of the present invention needs not to be the same as the configuration described above. Components in a specific hardware configuration of the information processing apparatus may be omitted, replaced, or added as appropriate, in accordance with an embodiment.
Although not elaborated in the figure, the load balancer 2 is also an information processing apparatus including: a controller including the CPU, the RAM, and the ROM; an auxiliary storage device; an input device; an output device; and a network interface, as is the case of the server 1. Although not elaborated in the figure, the client terminal 9 is also an information processing apparatus including: a controller including the CPU, the RAM, and the ROM; an auxiliary storage device; an input device; an output device; and a network interface, as is the case of the server 1.
In the present embodiment, the servers 1 are each a server that functions to manage a website and transmit an HTML content in response to a content acquisition request made with HTTP. The server 1 distributes the content in the website in response to a request from the client terminal 9. In the present embodiment, a web service is described as an example of a service provided by the server 1. However, in implementing the present invention, the server 1 is not limited to a server that provides the web service. The present invention may be applied to a group of servers for providing other services as well.
In the present embodiment, a communication from the client terminal 9 to the website is first received by the load balancer 2. This load balancer 2, which serves as a load balancing apparatus allocates, upon receiving the communication from the client terminal 9, the communication to one of the servers 1 thereby distributing the load, imposed on the website, among the servers 1. The load balancer 2 used in the present embodiment determines the server 1 as a load distribution target to which the communication from the client terminal 9 is allocated, based on a certain load balancing algorithm. Basically, the load balancer 2 performs the allocation of communication from one client terminal 9 to one server 1 in order to maintain a session between the client terminal 9 and the server 1. However, due to a certain situation on a side of the server 1 or the load balancer 2, the communication from the one client terminal 9 may be allocated to a server 1 that is different from the one server 1. For example, as the situation on the side of the server 1, the server 1 might be down. As the situation on the side of the load balancer 2, the load balancer 2 might change the server 1 as the allocation target in an attempt to increase a processing speed or there might be a problem in the performance of the load balancer 2.
The server 1, allocated with the communication from the client terminal 9 by the load balancer 2, issues a session ID to the client terminal 9. With the session ID, at least the client terminal 9 or a user can be uniquely identified. After this process, the server 1 identifies communications between the client terminal 9 and the server 1 by using the session ID, and manages the communications between the client terminal 9 and the server 1 by using session information identified by the session ID. Thus, after the issuance of the session ID, a packet transmitted between the server 1 and the client terminal 9 includes the session ID. The session ID is an identifier with which a communication with the client terminal 9 or the user is identified and a service is provided to each client terminal 9 or user. The session ID corresponds to a “communication identifier” of the present invention.
The client terminal 9 has a function of transmitting a packet to a website designated with a uniform resource locator (URL) and a function of receiving a content transmitted as a reply from the server 1 allocated with the packet by the load balancer 2. The client terminal 9 also stores a session ID received from a webserver and used for managing communications with the connected server 1.
In the present embodiment, the session information, including the session ID issued by any of the servers 1, is replicated to at least one server 1. The server 1 determined as a target of replication of the session information is referred to as “replication target server”. The server 1 as the session information replication target is determined by using an algorithm common to all the servers 1. The algorithm employed as the common algorithm may be any algorithm having reproducibility in the server 1 and being selected in accordance with an input (the session ID in the present embodiment). Because the server 1 is determined to be the replication target server by using such an algorithm, the algorithm can be used for selecting the replication target server after the replication target server is determined by using the session ID. In the present invention, a method for determining the replication target server with Consistent Hashing is employed as a method for determining the replication target server by using such an algorithm.
The server 1 uses a hashing operation algorithm, common to all the servers 1, and obtains a hash value unique to each session ID for determining and selecting the server 1 as the session information replication target server. The hash value may be obtained by performing a hashing operation with a part or all of the session IDs as input values. When a hash value within the range from 0 to n is used as the session ID, the session ID may be directly used as the hash value for determining a point on the continuum.
First of all, the replication target server determination unit 23 or the server selection unit 26 initiates search in the single direction (clockwise in the figure) in use of a point on the continuum corresponding to the obtained hash value as the starting point. The first server 1 to be found is determined as the session information replication target (or selected as an acquisition target). The search is terminated when a single server 1 is set to be the replication target, or when the session information is acquired from the selected server 1. The search in the single direction (clockwise in the figure) on the continuum continues when two or more servers 1 are set be the replication target, or when the session information is not acquired from the selected server 1. The server 1 found next is again determined as the target of replication of the session information (or selected as the acquisition target). When the value of the searched position exceeds the limit value n (2̂32−1 for example) while the search is still in process, the searched position returns to the starting point (that is, 0) and the search continues.
<Flow of Processing>
Processing in the present embodiment is described in detail with reference to
In step S101, a communication from the client terminal 9 is received. Specifically, the packet transmitted from the client terminal 9 is first received by the load balancer 2 and then is received by the server 1. Upon receiving the packet transmitted from a client terminal 9, the load balancer 2 determines one of the servers 1 as the allocation target of the packet on a basis of on a predetermined load balancing algorithm, and transfers the packet to the server 1 thus determined. Thus, the packet reaches one of the servers 1. In other words, the server 1 allocated with the communication by the load balancer 2 receives the packet transmitted from the client terminal 9. The server 1 that has received the packet is hereinafter referred to as “received server”. The processing then proceeds to step S102.
In step S102, whether the session ID is set to the received packet is determined. Specifically, the received server refers to the packet received in step S101 to determine whether the packet includes the session ID for identifying communications between the server 1 and the client terminal 9. The processing proceeds to step S110, when the packet includes the session ID, that is, when one of the servers 1 has already issued the session ID to the client terminal 9. The processing proceeds to step S103, when the packet includes no session ID, that is, when no session ID has been issued to the client terminal 9 as the transmission source of the packet.
Processing from step S103 to step 108, performed when no session ID has been issued to the client terminal 9 as the transmission source of the packet, is processing in which a session ID is issued, and session information related to the session ID is replicated to another server 1.
In step S103, an owner is identified. The server identification unit 21 of the received server identifies the host apparatus (received server) as the owner. In the present embodiment, the owner is one of the servers 1 that serves as the “owner” of the session information in the session information replication processing, and corresponds to a “first server” of the present invention. In the present embodiment, the received server is selected as the owner. The received server is the server 1 allocated with the communication by the load balancer 2, and is the server 1 that has generated the session information. However, the owner is not limited to the server 1 that has generated the session information or the received server, and any server 1 with the session information may be designated as the owner. The processing then proceeds to step S104.
In steps S104 and S105, the session information is generated and notified. The communication identification information generation unit 22 of the received server generates the session information including the session ID (step S104). The session information is information for identifying communications between the client terminal 9 as the transmission source of the packet and the server 1. The received server notifies the client terminal 9, as the transmission source of the packet, of the session information thus generated (step S105). The processing then proceeds to step S106.
In step S106, the replication target server is determined based on the session ID. The replication target server determination unit 23 of the received server determines the replication target server by using Consistent Hashing described above. More specifically, the received server performs the hashing operation with a part or all of the session IDs as an input value to obtain a hash value. The employed hashing operation has reproducibility with respect to an input. The server 1 performs the search in use of a point corresponding to the hash value thus obtained as the starting point on the continuum, and determines a predetermined number of servers 1 as the replication target servers in order these servers have been found. The replication target server corresponds to a “second server” of the present invention. The processing then proceeds to step S107.
In step S106, whether the owner is identical to the replication target server may be determined. The received server determines whether the owner is identical to the replication target server by comparing the server identifier of the owner determined in step S103 with the server identifier of at least one replication target server found in step S106. When the received server determines that the owner has been identical to the replication target server (one replication target server is the same as the owner), the replication target server determination unit 23 implements redetermination of the replication target server. Specifically, the replication target server determination unit 23 performs the search in use of a point corresponding to the obtained hash value as the starting point on the continuum. The replication target server determination unit 23 determines the predetermined number of servers 1, which are not identical to the owner, as the replication target servers in order these servers have been found. As described above, in the processing shown in the flowchart, the replication target server determination processing is repeated until a predetermined number of replication target server, different from the owner, are selected. When the predetermined number of replication target servers are determined, the processing proceeds to step S107.
In step S107 and step S108, the session information is replicated to the replication target server, and the replication target server is notified of the identifier of the owner. The replication unit 24 of the received server transmits the session information, generated in step S104, to the replication target server determined in step S106, so that the session information is replicated to and stored in the replication target server (step S107).
When a plurality of replication target servers are determined so that the session information is replicated to the replication target servers, only a part (for example, one) of the replication target servers may store the session information in the non-volatile auxiliary storage 14. When only the part of the replication target servers stores the session information in the non-volatile auxiliary storage 14, the remaining part of the replication target servers stores the session information in the RAM 12. Thus, an area used for the session information replication in the auxiliary storage 14 can be saved.
Then, the notification unit 25 of the received server, which is the owner, notifies the replication target server, which has been determined in step S106, of the server identifier of the notification unit 25 to be used in the continuum as the information with which the owner can be identified (step S108). The replication target server can recognize the server 1 as the owner of the session information (the server 1 that has generated the session information) with the server identifier notified from the owner. The replication of the session information in step S107 and the notification of the owner identifier in step S108 may be performed through different packets respectively or the same packet. The processing in step S107 and the processing in step S108 may be performed in the opposite order. The processing shown in this flowchart is then terminated.
In step S110, whether the session information is stored in the received server is determined. When the session ID is set to the received packet, the received server determines whether the session information, corresponding to the session ID set to the received packet, is stored in the received server. When the received server determines that the session information is stored in the received server, the received server manages the content of the communications with the client terminal 9 by using the session information. Then processing proceeds to step S117. The processing proceeds to step S111 when the session information is not stored in the received server.
Processing from step S111 to step S115 is processing for acquiring the session information from another server 1, when the session ID is set to the received packet (step S102: YES), but the session information is not stored in the received server (step S110: NO). In the system according to the present embodiment, even when the server 1 without the session ID is allocated with the communication by the load balancer 2, the server 1 allocated with the communication can select the replication target server based on the session ID to acquire the session information through the processing from step S111 to step S115.
In step S111, the server 1 is selected based on the session ID. The server selection unit 26 of the received server selects the server 1 through Consistent Hashing described above. More specifically, the received server performs the hashing operation with the session ID as an input value to obtain a hash value. The server 1 performs a search in use of a point corresponding to the hash value thus obtained as the starting point on the continuum, and selects the first server 1 found. The algorithm used for the selection in step S111 is the same as the one used for determining the replication target server in step S106 (Constant Hashing using the same continuum in the present embodiment). Thus, the replication target server determined in step S106 to which the session information has been replicated is selected with a higher priority in step S111. The processing then proceeds to step S112.
In step S112, an attempt to acquire the session information from the selected server 1 is made. Specifically, the received server requests the session information corresponding to the session ID set to the received packet from the server 1 selected in step S111. When the requested server 1 has the session information, the session information is successfully acquired. However, the session information might fail to be acquired due to a certain circumstance including a case where the requested server 1 is not the replication target server or a case where the requested server 1 is under failure.
In step S113 to step S115, the selection of the server 1 based on the session ID and the request for the session information are repeated until the session information is acquired. Thus, the received server determines whether the session information has been successfully acquired in step S112 (step S113). Upon determining that the session information has been successfully acquired, the received server manages the content of the communications performed with the client terminal 9 by using the acquired session information. The processing then proceeds to step S117 for updating the session information.
When the received server determines that the acquisition of the session information has failed, the server selection unit 26 of the received server selects the next server 1 based on the session ID (step S115), and requests from the selected server 1 the session information (step S112). Here, the “next server” to be selected is the next server 1 found after the server 1, found in the processing in immediately preceding step S112, by continuing the search with Consistent Hashing described with reference to
In step S113 to S115, the selection of the server 1 based on the session ID and the request for the session information are repeated until these session information is acquired, regardless of whether the server 1 selected each time is the replication target server. This processing is nothing more than auxiliary processing for increasing the reliability of the session information replication processing using Consistent Hashing, and thus can be omitted.
In step S117 to step S127, the session information stored in the owner and the replication target server is updated.
In step S117, whether the session information needs to be updated is determined. Specifically, the received server determines whether the session information used between the received server and the client terminal 9 needs to be updated in accordance with the communications with the client terminal 9. The session information is updated when the content to be managed in the communications is changed, added, and deleted in the course of the communications between the client terminal 9 and the received server. When the received server determines that the session information needs not to be updated, the processing shown in this flowchart is terminated. When the received server determines that the session information needs to be updated, the processing proceeds to step S118.
In steps S118 and step S119, the session information is updated, and the client terminal 9 is notified of the updated session information. The received server updates the session information (step S118) and transmits the response packet including the updated session information to the client terminal 9, whereby the client terminal 9 is notified of the updated session information (step S119). The updating and notification of the session information is processing that has conventionally been performed in the server/client communications, and thus will not be described in detail. The processing then proceeds to step S120.
In step S120 to step S122, the session information of the replication target server is updated when the received server is the owner. The received server determines whether the received server is the owner of the session information related to the session ID set to the received packet (step S120). When the received server determines that the received server is not the owner of the session information, the processing proceeds to step S123. When the received server is determined to be the owner of the session information, the server selection unit 26 of the received server selects the replication target server based on the session ID (step S121), and updates the session information stored in the replication target server (step S122). The session information stored in the owner has been updated in step S118. The processing in step S121 is substantially the same as the processing in step S111 described above, and thus the explanation thereof will be omitted. In step S122, the received server transmits the session information updated in step S118 to the replication target server selected in step S121. Thus, the session information stored in the replication target server is updated (step S122). When the received server is the owner, the session information held by the owner and the replication target server is updated through the processing in step S118 to step S122. The processing shown in this flowchart is then terminated.
In step S123 and step S124, the session information of the owner is updated when the identifier of the owner is stored in the received server. When the received server is not the owner of the session information (step 120: NO), the received server determines whether the identifier of the owner is stored in the received server (step S123). For example, when the received server is one of the replication target servers, the server identifier of the owner that has been notified through the processing in step S108 is stored in the received server. When the received server determines that the server identifier of the owner is not stored in the received server, the processing proceeds to step S125. When the server identifier of the owner is determined to have been stored in the received server, the received server transmits the session information updated in step S118 to the server 1 indicated by the server identifier of the owner. Thus, the session information stored in the owner is updated (step S124). The processing then proceeds to step S121.
Then, the server selection unit 26 of the received server selects the other replication target servers based on the session ID (step S121), and updates the session information stored in the other replication target servers (step S122). Then, the processing shown in this flowchart is terminated.
In step S125 to step S127, the replication target server is selected based on the session ID, and the owner is identified by the server identifier of the owner stored in the replication target server which has been selected secondary. When the received server is not the owner and the identifier of the owner is not stored in the received server (step S120: NO, step S123: NO), the server selection unit 26 of the received server selects the replication target server based on the session ID (step S125), and requests from the selected replication target server the owner information. Thus, the server identifier of the owner is acquired (step S126). The received server transmits the session information updated in step S118 to the server 1 indicated by the server identifier of the owner to update the session information stored in the owner (step S127).
Then, the received server transmits the session information updated in step S118 to the replication target server selected in step S126 to update the session information stored in the replication target server (step S122). The processing shown in this flowchart is then terminated.
This application is a continuation application of International Application PCT/JP2012/064934 filed on Jun. 11, 2012 and designated the U.S., the entire contents of which are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP2012/064934 | Jun 2012 | US |
Child | 14566192 | US |