The present invention relates generally to hierarchical flooding particularly but not exclusively of PUT and GET messages in overlay networks such as but not limited to peer-to-peer networks.
A peer-to-peer network is an example of a network (of a limited number of peer devices) that is overlaid on another network, in this case, the Internet. In such networks it is often the case that a piece of content or a service desired by one of the peers can be provided by more than one other node in the overlay network, and it is desirable to select the source node to optimize efficiency.
Distributed hash tables (DHTs) are a class of decentralized distributed systems that provide a lookup service similar to a hash table: (name, value) pairs are stored in the DHT, and any participating node can efficiently retrieve the value associated with a given name. Responsibility for maintaining the mapping from names to values is distributed among the nodes, in such a way that a change in the set of participants causes a minimal amount of disruption. This advantageously allows DHTs to scale to extremely large numbers of nodes and to handle continual node arrivals, departures, and failures. DHTs form an infrastructure that can be used to build more complex services, such as distributed file systems, peer-to-peer file sharing and content distribution systems, cooperative web caching, multicast, anycast, domain name services, and instant messaging.
The details of the present invention, both as to its structure and operation, can best be understood in reference to the accompanying drawings, in which like reference numerals refer to like parts, and in which:
As understood herein, peering among DHTs can be achieved by broadcasting Put and Get messages (respectively, messages seeking to place data and messages seeking to obtain data) among the peered DHTs. If all DHTs are directly connected to all other DHTs then broadcasting is straightforward, but as understood herein, if the relationship between peering DHTs is more topologically complex so that some DHTs do not connect directly to other DHTs, then flooding Put and Get messages is potentially expensive, and requires a loop prevention mechanism.
Techniques for flooding of messages are known in link-state routing protocols, but as understood herein important differences exist between link-state routing and DHT peering. First, the overhead of flooding Puts and Gets, which are expected to be broadcast frequently, is of greater concern in DHT peering than are the relatively infrequent updates performed by routing protocols. Second, message passing among networks as envisioned herein differs from route advertisement because the gateways do not retain state about previously seen messages, rendering transferring flooding solutions from the route advertisement problem to the problem of passing messages between networks problematic.
Present principles are directed to alleviate one or more of the above drawbacks.
Accordingly, an apparatus has a processor and a computer readable storage medium bearing instructions to cause the processor to receive, at a subject network in a system of networks, a distributed hash table (DHT) data transfer message from a sending network. The instructions also cause the processor to forward the message on to all other networks in the system except the customer network if the sending network is a customer network of the subject network. On the other hand, if the sending network is a peer network to the subject network or a provider network to the subject network, the processor forwards the message to only networks in the system that are customer networks of the subject network.
The data transfer message can be a DHT Put or Get message. The processor may be embodied in a gateway. If desired, the data transfer message may include information regarding data transfer policy.
In another embodiment a tangible computer readable medium bears instructions executable by a computer processor in a subject network for determining whether a DHT Put or Get message is received from a customer network of the subject network. Based on the determining act, the processor selectively sends the message only to other customer networks of the subject network.
In another embodiment a computer-implemented method includes receiving, at a subject network, from a sending network in a multi-tiered system of networks, a distributed hash table (DHT) data transfer message. If the sending network is at a tier below a subject tier at which the subject network is, the message is forwarded to all networks in the system other than the sending network. In contrast, if the sending network is at a tier equal to or above the subject tier, the message is forwarded only to networks in the system at a lower tier than the sending network.
The following acronyms and definitions are used herein:
Present principles apply to one or more usage scenarios. For example, in one scenario multiple Autonomous Systems are provided within a single provider. More specifically, for operational reasons, a single service provider may choose to operate a network as a set of autonomous systems (AS). Each AS may be run by a different organization. These AS do not necessarily have to be true AS in the routing sense. For example, an AS may be an “Autonomous DHT” (AD). An Autonomous DHT is a group of nodes that form their own independent DHT ring and operate largely independently of other ADs. Each AD has access to the complete DHT-ID space, but may or may not store content that is stored in other ADs. It is desirable in this case that content located in one AD can be selectively accessed from another. There are many variants of this scenario, such as a provider having one AD that hosts the provider's content and a number of ADs that serve different regions or different classes of customer (such as mobile, DSL, etc).
Another usage scenario is peering among providers, in which service providers who operate DHTs may wish to peer with each other. This scenario differs from the preceding case mainly in the fact that a high degree of co-operation or trust among competing providers cannot be assumed. Thus, this scenario requires an appropriate level of isolation and policy control between providers. Variants of this scenario include providers whose main function is to host content, who then peer with providers whose main function is to connect customers to the content. Other variants may include providers who provide connectivity between small providers and “backbone” providers.
In both of the above usage scenarios the graph of providers should not be assumed to have any particular structure.
Accordingly and turning now to
Each network in
In
Furthermore,
In example embodiments the gateway 24 includes a list of the DHTs with which it has peering relationships to enable a Get operation to be forwarded to the correct gateway to reach the origin DHT. This also allows Puts to be forwarded to appropriate DHTs in the case where a Put is to be published in some peer DHTs and not others.
It may now be appreciated that by applying the above logic, messages flow up towards the Tier 1 networks and then down to the lower tiers. By ensuring that the route of a message is in effect “valley-free”—that is, once a message starts going down the hierarchy, it can't start going back up again—
As understood herein, a case may arise in which a link cannot be categorized as either customer-provider or peer to peer, in which case such a link is always used to forward messages no matter where they were received from, and messages will be forwarded to all networks when they are received on such “other” links. In this way, there are no restrictions on the relationships that may exist among providers. Every relationship is either peer to peer, customer-provider, or other.
It is to be understood that the categorization of links into “customer-provider”, “peer to peer” and “other” does not necessarily imply anything about economic relationships. It simply implies that messages will be forwarded along these links according to the logic described above.
It is to be further understood that autonomous policies can be used with the above logic on a per-message and/or on per-message-type basis. For example, network X2a in
In example implementations, the “put” and “get” messages above make data that is stored in one DHT available to other DHTs. In one implementation broadcast Puts are used, in which a Put in one DHT is forwarded on to all other DHTs and the same data is Put in each DHT. Thus, a Get of the data in any DHT can be served by the node in the same DHT as the Get. In a second, presently preferred implementation, a broadcast Put is done of a DHT key only, such that a Put in one DHT is forwarded to all DHTs, but only the DHT-ID (key) and a pointer back to the original DHT is stored. A Get in any DHT can locate that pointer and then be forwarded on to the original DHT. In yet another implementation a broadcast Get is contemplated in which a Put in one DHT is stored only in that DHT. A get in any other DHT for that key must be broadcast to all DHTs; it will succeed in the original DHT.
If desired, each time a put occurs the particular option to use for that key could be specified as an additional parameter. Similarly a Get operation may be augmented with additional parameters to specify whether it should be forwarded or broadcast outside the local DHT.
DHT modes of operating, in simple -terms, include putting and getting-opaque data.
Below are example non-limiting examples of put ad get message processing.
Put Example:
Get Example:
Understanding that it may be desirable for a Put or Get message sent within a DHT to be identified as local or remote, e.g., to signal to a root node whether it should or should not issue a Get to a gateway for content that it currently does not have, a simple binary flag or policy tag may be appended to Put or Get messages. As also mentioned above, to enable directed gets to be routed correctly, a DHT Path can be stored with a descriptor, and this object may be provided by a gateway when it issues a Put to a root node, and thus must be carried in a Put message. Further, if an inter-DHT message requests a non-default timeout when Putting a descriptor, that information may also be conveyed to the DHT root node.
While the particular HIERARCHICAL FLOODING AMONG PEERING OVERLAY NETWORKS is herein shown and described in detail, it is to be understood that the subject matter which is encompassed by the present invention is limited only by the claims.
Number | Name | Date | Kind |
---|---|---|---|
20040064693 | Pabla et al. | Apr 2004 | A1 |
20040085329 | Xu et al. | May 2004 | A1 |
20040088646 | Yeager et al. | May 2004 | A1 |
20040249970 | Castro et al. | Dec 2004 | A1 |
20050004916 | Miller et al. | Jan 2005 | A1 |
20060168304 | Bauer et al. | Jul 2006 | A1 |
20060239275 | Zlateff et al. | Oct 2006 | A1 |
20070011267 | Overton et al. | Jan 2007 | A1 |
20070230482 | Shim et al. | Oct 2007 | A1 |
20070237152 | Zhu | Oct 2007 | A1 |
20080008178 | Tychon et al. | Jan 2008 | A1 |
20080016240 | Balandin | Jan 2008 | A1 |
20080181219 | Chen et al. | Jul 2008 | A1 |
20080225852 | Rasnuk et al. | Sep 2008 | A1 |
20090083390 | Abu-Ghazaleh et al. | Mar 2009 | A1 |
20090092124 | Singhal et al. | Apr 2009 | A1 |
20090234917 | Despotovic et al. | Sep 2009 | A1 |
20100064008 | Yan et al. | Mar 2010 | A1 |
20100162035 | Rancurel et al. | Jun 2010 | A1 |
20100172270 | Smith et al. | Jul 2010 | A1 |
20100293223 | Bhardwaj | Nov 2010 | A1 |
20100293295 | Tian et al. | Nov 2010 | A1 |
Number | Date | Country |
---|---|---|
102037711 | Apr 2011 | CN |
102037712 | Apr 2011 | CN |
10 2006 021591 | Apr 2007 | DE |
2 034 665 | Mar 2009 | EP |
WO 2005079020 | Aug 2005 | WO |
WO 2008110054 | Sep 2008 | WO |
WO 2010135251 | Nov 2010 | WO |
WO 2010135254 | Nov 2010 | WO |
Number | Date | Country | |
---|---|---|---|
20100226374 A1 | Sep 2010 | US |