The invention relates to the management of allocations of resources in a computer of the HPC or high performance computing type.
In a high performance computer, computations are generally run on data processing systems called clusters. A cluster comprises a set of interconnected computing nodes. The nodes are connected by Ethernet or InfiniBand communication links (Ethernet and InfiniBand are trademarks). These interconnection networks are generally arranged in a multi-stage pyramidal architecture (also known as a Clos network). When packets are routed, they may pass through several switches, and the routes are thereby spread out over the links. When a computation is started, the resource manager allocates resources from one computing node to another in a known manner according to different criteria, which may be:
A network is either non-blocking or blocking. If the network is blocking, the number of routes per physical link may differ from one place in the interconnection topology to another. If the network is non-blocking, there is exactly one route per physical link; in this case all the routes are equivalent and all the equipment can make full use of the interconnection network. A blocking factor may also appear in an initially non-blocking configuration, for example when a piece of interconnection equipment is lost. A non-blocking configuration is the most attractive in terms of the speed and responsiveness of the network, but it is extremely expensive to implement, and thus difficult to justify financially, because it would come down to having a dedicated link per pair of nodes. Most existing clusters are therefore blocking.
However, the information as to whether a network is blocking or not, and thus the number of routes per link, is not taken into account by the resource manager when allocating resources.
The invention aims to overcome all or part of the drawbacks of the prior art identified above, and in particular to propose a method making it possible to manage the blocking or non-blocking character of the network for the allocation of resources.
To this end, one aspect of the invention relates to a method for allocating, in order to carry out a first computation, at least one first resource of a plurality of resources, said first resource being connected to a first port of a switch, said method comprising a step of acquiring a first weighting of said first resource, said first weighting corresponding to the number of resources of the plurality of resources connected to the first port of the switch.
The resources are connected to each other and to the switches by means of communication links. The resources are interconnected within a cluster, which generally comprises a plurality of switches. Each resource is connected to at least one port of at least one switch of the cluster. The first weighting of the first resource may be acquired by searching in a database, for example a routing database. The weighting of a resource makes it possible to determine whether the resource is a blocking or non-blocking node.
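As an illustration only, the weighting defined above can be sketched as a count of the resources attached to the same switch port. The `ATTACHMENTS` mapping, the node and switch names, and the rule that a weighting of 1 denotes a non-blocking node are assumptions made for this sketch; they are not specified in the description.

```python
from collections import Counter

# Hypothetical extract of a routing database: resource -> (switch, port).
# Names and layout are illustrative assumptions, not taken from the description.
ATTACHMENTS = {
    "node01": ("sw1", 1),
    "node02": ("sw1", 1),
    "node03": ("sw1", 1),
    "node04": ("sw1", 2),
}


def weighting(resource, attachments):
    """Return the number of resources connected to the same switch port as `resource`."""
    per_port = Counter(attachments.values())
    return per_port[attachments[resource]]


def is_non_blocking(resource, attachments):
    """Assumption: a port that is not shared (weighting of 1) marks a non-blocking node."""
    return weighting(resource, attachments) == 1
```

With this sample data, `node01` shares its port with two other resources, whereas `node04` has its port to itself.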
Apart from the main characteristics that have been mentioned in the preceding paragraph, the method according to the invention may have one or more additional characteristics among the following, considered individually or according to any technically possible combinations thereof:
Other characteristics and advantages of the invention will become clear on reading the description that follows, with reference to the appended figures, which illustrate:
For greater clarity, identical or similar elements are marked by identical reference signs in all of the figures.
In
In this example, the topology of the network is designated all-to-all. Five InfiniBand switches 2 (InfiniBand being one type of interconnection network), each having 18 input and output ports, are each connected to 18 computing nodes.
Moreover, each of the InfiniBand switches is connected to each of the other switches by an InfiniBand link, i.e. in total four links come out of a switch.
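The figures of this example can be checked with a short calculation. The sketch below is illustrative: the function name and the `links_per_pair` parameter are hypothetical, and the last entry assumes that the traffic between two switches is spread evenly over the parallel links joining them, which the description does not state.

```python
def all_to_all_counts(num_switches, nodes_per_switch, links_per_pair=1):
    """Basic counts for an all-to-all switch topology.

    Assumption: node pairs between two switches are shared evenly
    among the `links_per_pair` links that join those switches.
    """
    node_pairs = nodes_per_switch * nodes_per_switch
    return {
        # Total number of computing nodes in the cluster.
        "nodes": num_switches * nodes_per_switch,
        # Each switch is linked to every other switch.
        "neighbour_switches": num_switches - 1,
        # Number of distinct pairs of switches.
        "switch_pairs": num_switches * (num_switches - 1) // 2,
        # Node pairs whose traffic crosses from one given switch to another.
        "node_pairs_per_switch_pair": node_pairs,
        # Node pairs per inter-switch link under the even-spread assumption.
        "node_pairs_per_link": node_pairs / links_per_pair,
    }


counts = all_to_all_counts(num_switches=5, nodes_per_switch=18)
```

For five switches with 18 nodes each, this gives 90 nodes, four neighbouring switches per switch, ten switch pairs, and 324 node pairs competing for the connection between any two switches. `links_per_pair` is left as a parameter so that configurations with several parallel links between two switches can be evaluated in the same way.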
In this configuration, it is chosen to connect each of the switches to each of the other switches of the topology by means of three links 3 (represented by a single line in
The number of routes per physical link is information available to the routing manager of the interconnection infrastructure: it is possible to know how many routes pass through a given physical link and to which “nodes” these routes correspond. It is thus possible to know precisely which nodes are “privileged” (i.e. have the smallest number of routes per physical link) and which are less privileged. This information is then passed up to the resource manager in charge of allocating resources (in general computing nodes) according to given criteria.
The objective is thus to pass on the privilege level of a node, expressed as a function of the number of routes present on the links of the interconnection network that give access to it. This criterion may also be used to define “privileges” between computing nodes and certain components of the computer, such as the part where the storage is located (inputs/outputs, I/O), and thus to favor the nodes that have to share the physical links the least in order to access the data.
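One possible way for a resource manager to use this privilege level is to rank the candidate nodes by the number of routes sharing their access link and to serve the least-shared nodes first. The sketch below is only an assumption about how such a ranking might look; the function names, the sample data and the tie-breaking by node name are hypothetical.

```python
# Hypothetical data: node name -> number of routes on the link giving access to it.
ROUTES_PER_LINK = {"node01": 3, "node02": 1, "node03": 2, "node04": 1}


def rank_by_privilege(routes_per_link):
    """Sort node names from most to least privileged (fewest routes first).

    Ties are broken by node name so that the order is deterministic.
    """
    return sorted(routes_per_link, key=lambda node: (routes_per_link[node], node))


def pick_nodes(routes_per_link, count):
    """Return the `count` most privileged nodes, or raise if too few exist."""
    ranked = rank_by_privilege(routes_per_link)
    if count > len(ranked):
        raise ValueError("not enough nodes available")
    return ranked[:count]
```

Here `pick_nodes(ROUTES_PER_LINK, 2)` selects `node02` and `node04`, the two nodes whose access links carry a single route.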
The method comprises a prior step E0 of updating the first weighting of the first resource. The first weighting corresponds to the number of resources connected to the first port of the switch to which the first resource is connected within an interconnection network. The updating takes place by rewriting, for example, in the routing table of the network, which takes an inventory of the number of routes per physical link of the network once said network has been traversed.
The method also comprises a step E1 of acquiring the first weighting of the first resource. This acquisition can take place by reading from a routing database of the network. The method then comprises a step E2 of comparing the first weighting with a predetermined value, which corresponds, for example, to a level of service.
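Steps E0 to E2 can be sketched as follows. This is a minimal illustration under stated assumptions: `RoutingDatabase` is a hypothetical in-memory stand-in for the routing database, and the three-way result of the comparison is a choice made for this sketch. What the method does with each outcome is deliberately not shown.

```python
class RoutingDatabase:
    """Hypothetical in-memory stand-in for the routing database of the network."""

    def __init__(self):
        self._weightings = {}

    def update_weighting(self, resource, resources_on_port):
        # Step E0: record how many resources share the port of `resource`.
        self._weightings[resource] = resources_on_port

    def read_weighting(self, resource):
        # Step E1: acquire the weighting by reading the database.
        return self._weightings[resource]


def compare_weighting(db, resource, predetermined_value):
    """Step E2: compare the weighting with a predetermined value.

    Returns -1, 0 or 1 when the weighting is respectively below,
    equal to or above the predetermined value.
    """
    weighting = db.read_weighting(resource)
    return (weighting > predetermined_value) - (weighting < predetermined_value)


db = RoutingDatabase()
db.update_weighting("node01", 3)  # E0, with an illustrative value
result = compare_weighting(db, "node01", predetermined_value=2)  # E1 then E2
```

Returning a three-way result keeps the sketch neutral about which side of the predetermined value is acceptable for a given level of service.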
Depending on the result of step E2 of comparing the first weighting with the predetermined value, the method comprises either:
The invention is not limited to the embodiments described previously with reference to the figures and variants could be envisaged without going beyond the scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
1458596 | Sep 2014 | FR | national |

Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/FR2015/052212 | 8/17/2015 | WO | 00 |