The invention generally relates to the field of systems and methods related to the filling of a routing table of a distributed hash table (DHT) client node, and more particularly to a processing method, a proxy processing agent, a system and method for filling a routing table, a router and a DHT client node.
Distributed Hash Table (DHT) favours communication between strongly distributed nodes. This often leads to expensive communication between nodes, even though nodes that allow much cheaper communication may be available while not being explicitly known to the actually involved nodes.
A DHT node currently builds up its view on the available foreign nodes in the DHT network from nodes/peers that it finds in DHT response messages that it receives on DHT request messages (e.g. a DHT find-node or get-peers request message) that it sends to foreign nodes, and/or nodes that it finds in DHT request messages that it receives from foreign nodes, and/or peers that it finds in BitTorrent response messages that it receives on BitTorrent announce request messages that it sends to BitTorrent trackers, etc. The resulting view is semi-permanently stored in the DHT Routing Table of the DHT node. Once a foreign node is entered in the Routing Table of a DHT node, it typically remains there at least until the foreign node does not respond to DHT request messages any longer or until the DHT node is switched down, at which point it may clean its DHT Routing Table. As a DHT node exclusively uses the foreign nodes in its DHT Routing Table to further communicate within the DHT network, there is no control on what nodes will actually communicate in the DHT network, leading to a more expensive communication than needed using e.g. “far away” nodes, requiring transatlantic links and or multiple IP-hops where “nearby” nodes are available.
The object of the invention is to provide a processing method, a proxy processing agent, a system and method for filling a routing table, a router and a DHT client node leading to an improved communication between DHT nodes, in particular for BitTorrent applications.
According to an embodiment there is provided a processing method for replying to a distributed hash table (DHT) message from a DHT client in order to allow the DHT client to fill his DHT routing table. The method comprises a number of steps performed at a proxy processing agent having a plurality of predefined nodes, each node having an IP address and port number. In a first step one of those nodes receives from the DHT client a DHT request message. Next the proxy processing agent generates at least one node identifier and associates each node identifier with an IP address and a port number. Each IP address/port number combination corresponds with a node of the proxy processing agent. Next a DHT response message is sent to the DHT client containing said at least one node identifier and associated IP address and port. Typically the proxy processing agent will maintain a database storing in connection with each DHT client the generated node identifiers and associated IP addresses and ports.
Such a method allows for grabbing control of the routing table in a DHT client node. Embodiments of such a method can e.g. ensure that a DHT node residing in an ISP network fills its Routing Table with predefined nodes with which the communication cost is known to be low.
According to a preferred embodiment the DHT request message is a message for which the associated DHT response message includes at least one node. The DHT request message can e.g. be any one of the following: a DHT find-node request message, a DHT ping request message, a DHT get-peers request message, a DHT announce request message, etc.
According to an embodiment of the processing method of the invention, the first message received from a particular DHT client will typically be a first DHT request message addressed to a bootstrap server and deflected by a router to the proxy processing agent. The generating step then typically consists in generating a first group, typically eight, of first node identifiers for filling a first bucket of a node space. According to an embodiment of the processing method of the invention, a further DHT request message received from the same DHT client will be addressed to a node with an address/port associated with first node identifiers of said first group. The generating step then consists in generating a further group of further node identifiers for filling a further bucket. It will be understood that further DHT request messages may be received from the same DHT client, typically to keep the routing table of that DHT client sufficiently filled.
According to a preferred embodiment, for each received DHT find-node request message a plurality of node identifiers, typically eight node identifiers are generated. This embodiment is preferred for a node space with a size of 2160 where buckets are considered to be full when they contain eight node identifiers.
According to a preferred embodiment the at least one node identifier is generated according to a predetermined pattern taking into account the node identifier of the DHT client node. Preferably, for every received DHT request message a plurality of node identifiers (e.g. eight node identifiers) are generated at the proxy processing agent. Further preferably, the pattern is such that every node identifier of said plurality of generated node identifiers is in the same bucket and that the node identifier of the DHT client does not reside in said bucket. If e.g. the routing table uses a node space having a size of 2160, the pattern may consist in splitting the node space in two buckets, filling the bucket not containing the DHT identifier with a predefined number of, typically eight, generated node identifiers, and repeating said splitting and filling step for the other buckets until the bucket size is reduced to a predefined number, typically eight, and fill said last bucket with the last generated node identifiers.
According to a possible embodiment the method further comprises receiving at a node of said plurality of nodes of the proxy processing agent a further DHT request message (e.g. a DHT ping or repeated find-node request message) from the DHT client. In reply to this DHT request message the stored node identifier associated with said node is looked up for said DHT client and a DHT response message containing said looked up stored node identifier is sent to the DHT client.
The invention further relates to a proxy processing agent for replying to a distributed hash table (DHT) message from a DHT client in order to allow the DHT client to fill his DHT routing table. The agent comprises a plurality of nodes capable of receiving from the DHT client a DHT request message; a processing means for generating at least one node identifier and for associating with each node identifier an IP address and a port number; means for composing and sending in reply to said DHT request message a DHT response message to the DHT client, said DHT response message containing said at least one node identifier and associated IP address and port; and storage means for storing in connection with said DHT client the at least one generated node identifier and associated IP address and port.
According to a preferred embodiment the processing means are adapted to generate a group of node identifiers according to a predetermined pattern taking into account the node identifier of the DHT client. For possible patterns reference is made to the method set out above and to the examples given in the figure description below.
Further the invention relates to a system for filling a routing table of a distributed hash table (DHT) client node in a DHT infrastructure comprising a DHT bootstrap server. The system comprises a proxy processing agent with a plurality of proxy nodes; and a plurality of routers adapted to deflect a packet addressed to the bootstrap server towards said proxy processing agent. In that way the proxy processing agent will be able to reply to a DHT request message from a DHT client which is address to a bootstrap server. Hence the returned nodes can be chosen between the plurality of nodes of the proxy processing agent, introducing a certain degree of centralization in the otherwise decentralized DHT infrastructure.
The invention also relates to such a router for use in a network of a DHT client node, wherein the router is adapted to deflect a packet addressed to the DHT bootstrap server towards a first node of said plurality of proxy nodes of said proxy processing agent. Preferably the router contains a policy rule indicating that a packet directed to a particular IP address/UDP port associated with the DHT bootstrap server should be routed instead to a first IP address/UDP port associated with the first node of said plurality of proxy nodes.
The invention further relates to a method for filling a routing table of a distributed hash table (DHT) client node in a DHT infrastructure, wherein the following steps are performed at the DHT client node:
According to a preferred embodiment the method further comprising sending a further DHT request message to at least one node of said group and receiving from said at least one node at least one further group of nodes comprising for each node a node identifier and associated IP address and port. This further group was created and managed at the proxy processing agent, preferably according to a predetermined pattern as explained above. Upon receipt at the client node, the further group of node identifiers and associated IP addresses and ports is stored in its routing table. Typically the sending and receiving step will be repeated for a number of nodes of each further group of nodes until the routing table is sufficiently full.
According to another aspect of the invention there is provided a DHT client node having storage means storing a routing table, wherein said routing table contains a group of node identifiers obtained according to any of the above disclosed methods.
According to another aspect of the invention there is provided a computer program for generating for every received DHT request message a plurality of node identifiers using a pattern which is such that every node identifier of said plurality of generated node identifiers is in the same bucket and that the node identifier of the DHT client does not reside in said bucket. According to yet another aspect there is provided a computer program product for storing such a computer program.
According to a preferred aspect of the invention a node of the proxy processing agent, also called proxy node, is a predefined node of an ISP network characterized by a node identification number, an IP-address and a UDP-port with which it can be reached. The DHT client and routers disclosed above are then located in the same ISP network.
According to an aspect of the invention there is introduced a certain degree of centralization for the DHT request messages that a DHT node launches via grabbing main control over the Routing Table of the DHT node.
Preferably, the node identifier of a predefined proxy node is generated in such a way that a set of proxy nodes guarantees sufficient filling of the routing table of any DHT client node in the ISP network.
The accompanying drawings are used to illustrate presently preferred non-limiting exemplary embodiments of the present invention. The above and other advantages, features and objects of the invention will become more apparent, and the invention will be better understood from the following detailed description when read in conjunction with the accompanying drawings, in which:
According to the prior art, when a DHT client 100 wants to join the DHT network for the first time, it needs to go through a bootstrap process, see
An embodiment of the system and method of the invention is illustrated in
The proxy processing agent 108 is typically implemented as a set of server machines capable of fulfilling a number of proxy node activities. More in particular the proxy processing agent 108 is adapted to receive a DHT message from the DHT client. This can be an initial DHT find-node request message 101b deflected by the router 107 or a further DHT request message 103b, see further. In reply to such a message the proxy processing agent 108 generates a number of node identifiers, typically eight. This generating is preferably done according to a predetermined pattern, taking into account the node identifier of the client node 100 sending the DHT request message and the already generated node identifiers for that client, see further. Each node identifier represents a proxy node 105. Further the agent is capable of associating with each node identifier an IP address and a port number, and of sending a DHT response message containing the generated node identifiers and associated IP addresses and ports to the DHT client. Also, the proxy processing agent 108 is provided with a storage means for storing for each client 100 the generated number of node identifiers and associated IP addresses and ports.
In the example of
Now a more detailed embodiment of the system and method of the invention will be explained with reference to
In a next step the client node 200 will send further DHT find-node request messages to the nodes ID2-9. A first such further DHT find-node request message is sent in step 216 to the proxy node with IP address IP—1 and UDP port P—2. In response to this DHT find-node request message eight further proxy nodes are generated with nodeIDs ID10-17 and associated IP|UDP combinations, see step 217. Those nodeIDs are generated using the following pattern which is similar to the pattern used in step 212: split the bucket in which IDX resides further in two equally sized (smaller) “buckets”, see also step 219, and choose eight appropriate nodeIDs ID10-17 in the bucket in which IDX does not reside. In the present example it is assumed that IDX is in the first (smaller) bucket and ID10-17 are given the values 3.2158 to 3.2158+7.
Next further DHT request messages 221 may be sent until the routing table is completely filled—this is filled with 1280 entries for the example of a node space with size 2160—or sufficiently filled, depending on the behaviour of the client node. If it is possible/desirable to completely fill the routing table then steps 216-220 are repeated until the bucket size is 8 (in which case it cannot be split further) and the last bucket is filled with eight appropriate nodeIDs.
The skilled person will understand that this particular way of filling is only one possible pattern that may be used in embodiments of the invention and that many other patterns may be used, which pattern could possibly be made dependent on other parameters relevant for the DHT process. In the example of
The DHT node 100, 200 may launch successive DHT request messages, such as DHT ping request messages or repeated DHT find-node request messages, to the individual proxy nodes 105, 205 of which it has become aware to verify whether they are still alive. Usually, as long as those proxy nodes respond, the DHT node 100, 200 keeps the proxy nodes 105, 205 in its DHT Routing Table. This will typically depend on the BitTorrent client implementation. A proxy node 105, 205 will answer to such a DHT request message of the DHT node 100, 200 with the same nodeID that in the earlier DHT find-node response message to that DHT node 100, 200 has been associated with that proxy node 105, 205. This is possible thanks to the database that is maintained in the proxy processing agent, where for each client the generated nodeIDs and associated address|port combinations are stored. Note that since the data is stored coupled to a client identifier, it is possible to use the same address|port combinations for different clients.
Embodiments of the invention introduce a certain centralisation in the otherwise completely distributed DHT network. This centralisation can be exploited to make at least part of the DHT network behave in a way that is better tailored to the ISP's needs, without jeopardizing the end-users' experience of BitTorrent file exchange.
Further embodiments of the invention have the advantage that a modification to the DHT clients in the market is not required. However, note that a DHT client in the market using the invention may be recognised from the information stored in its DHT routing table. In embodiments of the invention, the DHT Routing table will typically be sufficiently filled. Also, typically the entries in the DHT Routing Table will follow a specific pattern as opposed to being random in the prior art.
While the principles of the invention have been set out above in connection with specific embodiments, it is to be clearly understood that this description is merely made by way of example and not as a limitation of the scope of protection which is determined by the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
10305904.4 | Aug 2010 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP11/64107 | 8/16/2011 | WO | 00 | 2/20/2013 |