COMPUTING INFRASTRUCTURE RESOURCE-WORKLOAD MANAGEMENT METHODS AND APPARATUSES

Description

TECHNICAL FIELD

The present disclosure relates to the field of computing. More particularly, the present disclosure relates to computing infrastructure resource and workload management methods and apparatuses.

BACKGROUND

The background description provided herein is for the purpose of generally presenting the context of the disclosure. Unless otherwise indicated herein, the materials described in this section are not prior art to the claims in this application and are not admitted to be prior art by inclusion in this section.

By leveraging cloud computing service-models, service providers deploy and maintain multiple workloads across globally distributed data centers. Current approaches to workload scheduling and placement sacrifice either precision or scalability when presented with heterogeneous platform configurations, which limits the value of specialization in the infrastructure. In a heterogeneous environment, the challenge for the service provider is to enable the infrastructure resource manager (also referred to as the “orchestrator”) to properly decide amongst multiple placement options, the best-match in terms of compute, network and storage resources for instantiating the different components of a workload whilst meeting the SLA (Service Level Agreement). Another key challenge is the trade-off between the business objectives of the service customers who seeks best service performance vs the service provider who seeks to optimize the data center for total cost of ownership (TCO).

However, currently, there is no unified methodology for the calculation of the preference of a placement/rebalancing solution and its comparison from another. Thus, the technical problems that need to be solved to improve the operation of cloud computing infrastructure, in particular, the management of cloud computing infrastructure resources, are:

a) how to effectively quantify the parameters affecting placementsrebalancing of the cloud computing infrastructure resources;

b) how an orchestrator could efficiently compare and contrast potential placement/rebalancing solution; and

c) how the orchestrator can automatically and at scale reason and execute placement decisions to optimally deliver value from differentiating platform features.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the block structure and robot cooperation methods and apparatuses of the present disclosure will be readily understood by the following detailed description in conjunction with the accompanying drawings. To facilitate this description, like reference numerals designate like structural elements. Embodiments are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings.

FIG. 1 illustrates an overview of a computing infrastructure with resources to provide computing services to various client systems, having the computing infrastructure resource-workload management technology of the present disclosure, according to various embodiments.

FIG. 2 illustrates an example computing infrastructure resource-workload manager, according to various embodiments.

FIG. 3 illustrates example attributes for managing computing infrastructure resources and workloads, according to various embodiments.

FIG. 4 illustrates an example process for managing computing infrastructure resources and workloads, according to various embodiments.

FIG. 5 illustrates an example computing device suitable for use to practice aspects of the present disclosure, according to various embodiments.

FIG. 7 illustrates an example output of the computing infrastructure resource-workload manager, according to various embodiments.

DETAILED DESCRIPTION

The present disclosure may solve the above technical problems improve the operation of computing infrastructure, such as cloud computing infrastructure, in particular, the management of the computing infrastructure's resources and workload assignment. The present disclosure may allow the computing service provider as well as the customer, the ability to capture the benefits and shortcomings of each placement/rebalancing solution on their own choice of parameters.

More specifically, embodiments of the present disclosure may use utility theory to quantify a potential placement/rebalancing solution based on a k-dimensional parameter space to indicate the preference of one over the other. The present disclosure may formalize these various influencing parameters as ‘attributes’ and provide a unified perspective, considering both the context of the service provider and the customer. Thus, the attributes of the present disclosure may include definitions and mathematical formulizations for revenue and expenditure (for the service provider) while also incorporating the attributes that are derived from the SLA and take customer experience (throughput and latency) into consideration.

Embodiments of the present disclosure may then evaluate the preferences and priorities of the attribute values by using the multi-attribute utility theory (MAUT). Using the individual sub-utility functions for each attribute, embodiments of the present disclosure may formulate two multiplicative utility functions, one for the service provider and one for the customer, that examine the attributes as non-independent entities.

Using this methodology, an orchestrator of a computing infrastructure may use these two sets of utility values, and by comparison decide for the best placement or rebalancing solution considering the business model and goals. Because expenditure can incorporate all cost elements, the value proposition of differentiated or heterogeneous infrastructures can be exposed precisely.

In embodiments, an apparatus for managing resources and their assigned workloads of a computing infrastructure, may comprise a resource-workload manager having: a placement solution generator to receive resource data and workload data for the computing infrastructure, and generate a plurality of potential resource placement solutions for allocation of the various resources of the computing infrastructure to various components of the various workloads; one or more utility function calculators communicatively coupled with the placement solution generator to calculate one or more values for one or more provider-centric attributes, and one or more values for one or more customer-centric attributes, for the plurality of potential resource placement solutions; an analyzer communicatively coupled to the one or more utility function calculators to analyze the attributes and select one of the potential resource placement solutions; and a resource allocator communicatively coupled to analyzer to allocate the resources of the computing infrastructure, based at least in part on the selected resource placement solution.

In the following detailed description, reference is made to the accompanying drawings which form a part hereof wherein like numerals designate like parts throughout, and in which is shown by way of illustration embodiments that may be practiced. It is to be understood that other embodiments may be utilized and structural or logical changes may be made without departing from the scope of the present disclosure. Therefore, the following detailed description is not to be taken in a limiting sense, and the scope of embodiments is defined by the appended claims and their equivalents.

Aspects of the disclosure are disclosed in the accompanying description. Alternate embodiments of the present disclosure and their equivalents may be devised without parting from the spirit or scope of the present disclosure. It should be noted that like elements disclosed below are indicated by like reference numbers in the drawings.

Various operations may be described as multiple discrete actions or operations in turn, in a manner that is most helpful in understanding the claimed subject matter. However, the order of description should not be construed as to imply that these operations are necessarily order dependent. In particular, these operations may not be performed in the order of presentation. Operations described may be performed in a different order than the described embodiment. Various additional operations may be performed and/or described operations may be omitted in additional embodiments.

For the purposes of the present disclosure, the phrase “A and/or B” means (A), (B), or (A and B). For the purposes of the present disclosure, the phrase “A, B, and/or C” means (A), (B), (C), (A and B), (A and C), (B and C), or (A, B and C).

The description may use the phrases “in an embodiment,” or “in embodiments,” which may each refer to one or more of the same or different embodiments. Furthermore, the terms “comprising,” “including,” “having,” and the like, as used with respect to embodiments of the present disclosure, are synonymous.

As used herein, the term “module” may refer to, be part of, or include an Application Specific Integrated Circuit (ASIC), an electronic circuit, a processor (shared, dedicated, or group) and/or memory (shared, dedicated, or group) that execute one or more software or firmware programs, a combinational logic circuit, and/or other suitable components that provide the described functionality.

Referring now FIG. 1, wherein a block diagram illustrating an overview of a computing infrastructure with resources to provide computing services to various client systems, having the computing infrastructure resource and workload management technology of the present disclosure, according to various embodiments, is illustrated. As shown, computing infrastructure 102 may include a number of resources, such as computing resources 112, network resources 114, and storage resources 116, that may be allocated to provide computing services to various client systems 104. Client systems 104 may be communicatively coupled to computing infrastructure 102 via network 106. Resources of computing infrastructure 102 may be allocated to provide computing services to various client systems 104 under the control of computing infrastructure resource-workload manager 120, incorporated with the management technology of the present disclosure, to be described more fully later. In embodiments, computing infrastructure 102 may be a cloud computing infrastructure with resources to provide cloud computing services to client systems 104.

Compute resources 112, network resources 114, and storage resources 116 of computing infrastructure 102 may be homogeneous, of the same types, or heterogeneous, of different types. Compute resources 112, network resources 114, and storage resources 116 may be distributed/located in a plurality of different physical locations (data centers). Some resources may be located in the same physical location (data center). Computing resources 112 may include any number of computing devices of any type, virtual or real, such as, servers, desktops, laptops, set-top boxes, tablets, mobile devices, wearable devices, and virtual machines, that can be allocated to perform computing services for client systems 104. Network resources 114 of computing infrastructure 102 may include any number of networking resources of any type, virtual or real, to selectively couple computing resources 112, network resources 114 and storage resources 116, to one another, such as gateway, switches, routers, access points, base stations, and so forth. Storage resources 116 of computing infrastructure 102 may include any number of storage resources of any type, such as, volatile or non-volatile memory, solid state, magnetic or optical mass storages, that can be allocated to store working or persistent data of client systems 104.

Client systems 104 and network 106 may be any number of such elements known in the art. For examples, client systems 104 may be desktops, laptops, set-top boxes, game consoles, tablets, mobile devices, or wearable devices. Network 106 may be any number of wired or wireless networks, including private and/or public networks, such as the Internet. An example of wired networks may include, but are not limited to local Ethernet networks or wide area optical networks. Examples of wireless networks may include, but are not limited to, cellular, WiFi, Bluetooth®, and so forth.

Computing infrastructure resource-workload manager (or orchestrator) 120 may be implemented in hardware or software, or both. A hardware implementation may include various ASIC or FPGA programmed with the operation logic. A software implementation may include various modules/routines of instructions executable by computer processors. A hardware/software implementation may include elements of both types.

Current commercial and open-source orchestration solutions typically schedule pessimistically to avoid conflicts, and workload pattern induced performance issues. Various innovations in research have responded to aspects of this problem. Researchers have explored genetic algorithms, approximation algorithms, stochastic bin-packing methods, and multiple heuristics for efficient placement. However, the known approaches typically limit the number of attributes under investigation and do not study the trade-off between the benefits gained by the service provider vs the customer for a placement decision.

As will be described more fully below, the management or orchestration technology of the present disclosure may define these benefits as utility functions for improving overall infrastructure throughput. The management or orchestration technology of the present disclosure may implement utility functions for a wide array of attributes (including provider-centric and customer-centric attributes) to formally quantify a potential placement/rebalancing solution. In embodiments, non-additive utility functions may be employed, which give the advantage of considering the trade-offs between the attributes instead of considering them as independent attributes. Further, in embodiments, these functions may be implemented for the placement/rebalancing phase of the machines instead of the negotiation phase of the SLA for a better understanding of the way a placement affects the service provider and the customer.

Referring now to FIG. 2, wherein an example computing infrastructure resource-workload manager, according to various embodiments, is illustrated. As shown, in embodiments, computing infrastructure resource-workload manager (or orchestrator) 120 may include potential placement solution generator (or simply, placement solution generator) 202, one or more utility function calculators 204-206, an analyzer 208 and a resource allocator 210, communicatively coupled to each other as shown

Potential placement solution generator 202 may be configured to receive resource data 212 and workload data 214, and in response, generate potential placement solutions, i.e., placements or allocations of various resources for the various workloads. In embodiments, resource data 212 may set forth the resources of the computing infrastructure (i.e., resources-services landscape), and workload data 214 (service requests) may set forth the computing services to be provided to various customer client systems (optionally, including service level requirements). In embodiments, both the resources—services landscape and the service requests can be described as a graph. Nodes may be used to detail the entities (compute, network and storage physical and virtual elements) and edges may be used to denote their relationships. For the service requests, the graph may indicate a possible topology, the resources and services landscape may be an actual topology—which can even be enhanced with indicators from telemetry (such as Utilisation and Saturation values indicating hot and cold spots in the landscape).

In embodiments, one or more utility function calculators 204-206 may include utility function calculator A 204 and utility function calculator B 206. Utility function calculator A 204 may be configured to calculate values of various provider-centric attributes for the various potential placement solutions, and utility function calculator B 206 may be configured to calculate values of various customer-centric attributes for the various potential placement solutions.

In embodiments, analyzer 208 may be configured to process and compare the values of the provider-centric attributes and values of the customer-centric attributes, and select the best or optimal placement solution. In embodiments, analyzer 208 may output the selected placement solution for resource allocator 210. In embodiments, resource allocator 210 may be configured to allocate the various resources of the computing infrastructure to provide computing services to the various client systems, in accordance with at least the selected placement solution.

Referring now to FIG. 3, wherein various example attributes for managing computing infrastructure resources, according to various embodiments, is illustrated. As shown, in embodiments, provider-centric attributes 302 may include revenue attribute 312, expenditure attribute 314, data locality attribute 316 and data replication attribute 318. For customer-centric attributes 322, they may include service components locality/anti-locality attribute 332, availability attribute 334, network throughput attribute 336, response time attribute 338 and latency attribute 340.

In embodiments, revenue attribute 312 may detail an amount of credit ($) the provider would gain by hosting a service (with a particular placement solution). For example, revenue attribute 312 may be calculated based on type & amount of nodes in the graph of workload data 214 detailing the service request. In embodiments, expenditure attribute 314 may detail the total cost of ownership (TCO) in credits ($) of the resources (e.g. CPUs, disks) in the resource-service landscape which are (possibly) needed to handle the service request (with a particular placement solution). Together, revenue attribute 312 and expenditure attribute 314 may define a profit attribute (for a particular placement solution): custom-character (P).

In embodiments, data locality attribute 316 may detail the distance or access frequency between the infrastructure's compute resources and (possibly pre-existing) storage entities (for a particular placement solution). In embodiments, values of data locality attribute 316 may be calculated by looking at distances between compute and storage nodes in the graph of resource data 212. This attribute assumes moving data around incurs a cost. The attribute may be named: custom-character (). In embodiments, data replication attribute 318 may detail the number of possible data/storage replicas the infrastructure has to deal with for fault-tolerance/high availability/performance issues (for a particular placement solution). Values of data replication attribute 318 may be calculated based on the number of duplicate nodes in the resource-landscape graph of resource data 212. The attribute may be named: custom-character (R).

In embodiments, service components locality/anti-locality attribute 332 may detail the requirements of the service to be hosted in one or in separated locations (geo, rack, server etc.) (for a particular service request/workload). The requirements may be set forth e.g., in the service request graph of workload data 214. In embodiments, values of service components locality/anti-locality attribute 332 may be calculated based on the distance between nodes in the graph. The attribute may be named: custom-character (L). In embodiments, availability attribute 334 may detail the desired availability of the service (of a particular service request/workload). In embodiments, it may be defined as gold, silver, bronze level offerings, and hence available as attribute in the service request graph of workload data 214. In embodiments, values of availability attribute 334 may be calculated based on replicated nodes in the graph. The attribute may be named: custom-character (A).

In embodiments, network throughput attribute 336 may detail the desired minimum network throughput the service needs (for a particular service request/workload). The minimum network throughput the service needs may be located in the service request graph of workload 214. In embodiments, network throughput attribute 336 may be calculated based on the edge between two particular nodes. The attribute may be named: custom-character (T). In embodiments, response time attribute 338 may detail the time at which a task is to be completed—can include notions of runtime (for a particular service request/workload). The time at which a task is to be completed may be located in the service request graph of workload data 214. In embodiments, response time attribute 338 may be calculated based on the number of entities minimally involved in ensuring the service availability. The attribute may be named: custom-character (Rt).

In embodiments, latency attribute 340 may detail the maximum desired latency between two service components (for a particular service request/workload). The maximum desired latency between two service components may be located in the service request graph of workload data 214. In embodiments, latency attribute 340 may be calculated based on the nodes and edge attributes (possible bandwidth current usage) between nodes in the landscape graph of resource data 212. The attribute may be named: custom-character (Lt).

In embodiments, all attributes represent the context of the current running service and resources of the landscape. This includes notions of which physical resources are available and which service components are already running.

Embodiments of these attributes will be further described below, after the description of an example process for managing computing infrastructure resources and workloads.

Referring now to FIG. 4, wherein an example process for managing computing infrastructure resources, according to various embodiments, is illustrated. As shown, process 400 for managing computing infrastructure resources may include operations performed at blocks 402-412. In embodiments, the operations may be performed by potential placement solution generator 202, utility function calculators 204-206, analyzer 208 and resource allocator 210, of computing infrastructure resource-workload manager (or orchestrator) 120.

Process 400 may begin at blocks 402 and 404. At blocks 402 and 404, resource data denoting resources of the computing infrastructure, and workload data denoting service requests of the client systems for computing services may be received.

Next, at block 406, potential placement solutions of the resources of the computing infrastructure for the various services requests of the workload may be generated.

At blocks 408a and 408b, the values for the provider-centric attributes and the values for the customer-centric attributes may be computed for each of the placement solutions. The operations at blocks 408a and 408b may be repeated as many times as necessary to have the values of the provider-centric and customer-centric attributes computed for each of the placement solutions.

In embodiments, the calculation of the utility functions for one solution based on these attributes, may be represented by custom-character =₁∪ ₂={α₁. . . α_i. . . α_k}, with k≥2, associated with our placement/rebalancing decision problem. Here subset ₁may contain all provider-centric attributes and subset ₂may contain all customer-centric attributes. For each attribute, the service provider/customer may also provide a weight or a priority value to indicate the importance of the attribute (α_k) along with additive weight that stores dependence on other attributes (β_k). Additionally, the preferences for these attributes are not hard constraints, and they evolve with each request and incorporate the trade-offs or relational operations between the attributes.

In embodiments, once the sub-utility values for the attributes are available, a decomposed assessment may be used to calculate the two utilities of a placement/rebalancing solution, namely custom-character _swhich stands for the service provider-centric utility function and _ewhich stands for the customer-centric utility function. These in- turn may be calculated based on a multiplicative function (as attributes may be considered inter-independent) on the k conditional utilities for each attribute.

For the provider utility, custom-character _smay be defined as:

custom-character
_s=(α₁*(P)+β₁)*(α₂*()+β₂)*(α₃*(R)+β₃) (1)

And for the customer utility, custom-character _emay be defined as:

custom-character
_e=(α₄*(L)+β₄)*(α₅*(A)+β₅)*(α₆*(T)+β₆)*(α₇*(Rt)+β₇)*(α₈* (Lt)+β₈) (2)

Thus, the methodology efficiently captures and implements the preferences of the service provider as well as the customer as part of the orchestrator module. The formal representation and calculation of the impact of deployment for the service provider and the customer can effectively contribute towards efficient placement/rebalancing solution decision making. For example, for a set of service placement options the customer and utility can be mapped on a plane as detailed in the sample output shown in FIG. 7. This plane may be computed by the computing infrastructure resource-workload manager (orchestrator).

Still referring to FIG. 4, at block 410, the computed values of the provider-centric and customer-centric attributes computed for each of the placement solutions may be analyzed/compared, and a best/optimal placement solution may be selected based at least in part on the values of the provider-centric and customer-centric attributes of the various placement solutions. In embodiments, on selection, the selected placement solution may be outputted.

In embodiments, placement groups can be seen as groupings of possible placements based on criteria like “best effort” to “dedicated & overprovisioned” service delivery. The trade-off between customer and provider utility may be considered in the context of the resources and services already in place.

In embodiments, the decision (analyse, compare and select) may be made by choosing the option which allows a balanced trade-off dependent on the business model. For example, a higher value of the customer-centric utility may be considered as implying a lower profit for the service-provider leading to a low value of the provider-centric utility. Thus, a balance may be made and the focus may be on choosing a placement which provides a good value for both the provider-centric and the customer-centric utility function. By projecting the minimum/maximum thresholds for e.g., the provider and the customer utility and the desired placement group (e.g., best effort vs. random) on the analysis plane the possible “valid” candidates can be highlighted and hence actuated upon. In alternate embodiments, other options like multi-level optimizations can be applied to this analysis plane too.

Furthermore, the methodology may allow the orchestrator to adapt, as the business incentive changes, favour one attribute or one sub-utility function as desired. In addition, the quantification of these attributes and the formal expression of the utility functions for the provider and customer pave the way for the application of optimization algorithms and techniques which take into account business objectives of both the service provider and customer.

At block 412, resources of the computing infrastructure may be allocated to the various service requests of the workload, in accordance with the selected placement solution. Allocation of resources may include e.g., but are not limited to, pinning virtual machines, container, or task to a CPU core, block and or object storage object association with disks and virtual network elements to physical network entities (NIC, Switches, . . . ), and so forth.

Referring now to FIG. 3 again, embodiments of the provider-centric attributes 302 and customer-centric attributes 304 with further details will be described.

I. Provider-centric attributes 302.

a) Revenue 312. In embodiments, the service provider may be considered as having a rate for provisioning the request depending on its actual runtime T_r(in hrs) which depends on the placement of the request and can be predicted/is known from the SLA. There is an hourly rate related with each VM flavour (which defines the sizes for memory, disk, number of cores etc.) denoted as R_flavourand every SLA template denoted as R_SLA. Additionally, each fault tolerant procedure e ∈ E that can be implemented by the service provider may be represented by a tuple {Name of Procedure, R_e, λ}. E may be defined as a set of fault tolerant procedures which can be implemented. These can range from configuring space and time redundancy, enabling matrix or vector-level checksum, pre-emptive job migration as well as defined exception handling. Here R_emay be the rate at which this procedure is available, determined by calculated the reserved extra resources value and λ_sis a 0-1 value indicating whether this procedure has been requested by the customer or not. Thus, for these embodiments, revenue may be considered as composed of three parts:

- i. Service price per VM flavor=T_r*Σ_VMsR_flavor
- ii. SLA price per SLA template=T_r*Σ_SLAsR_SLA
- iii. Failure Tolerance Cost=T_r*Σ_|E|λ_s*R_e

b) Expenditure 314. In embodiments, the service provider may be considered as incurring cost when provisioning a request related to each requested resource. The hourly rate for provisioning the node may be denoted as R_nand its service/maintenance rate may be denoted as M_n. The number of cores on the VM v is indicated by cores_vand on the host server as cores_n. Additional costs may be related to powering and cooling the systems and the mean power cost is power $/kWh. For every SLA Violation, j ϵJ represents the cost paid for the violation and j represents the corresponding number of times this violation occurs. Thus, for these embodiments, expenditure may be considered as composed of:

- i. Server and

$Equipment Costs (depending on cores in VM and on the host server) = T_{r} * \langle _{compute} \rangle * \frac{{cores}_{v}}{{cores}_{n}} R_{n},$

- where n is in the compute resource in the infrastructure
- ii. Service and Maintenance Cost=T_r*M_n
- iii. Power Consumption and

$Cooling Costs = \frac{{cores}_{v}}{{cores}_{n}} * T_{r} * \overline{power} * R_{n}$

- iv. Network Connections Costs=T_r*Σ|_{networkElement}|*R_{networkElement}, where a network element can be a switch, a router etc. in the network plane represented by set N_{networkElement}with provisioning rate R_{networkElement}
- v. SLA Violations=Σ_j∈Jj*j.

Profit. In embodiments, using the above stated definitions, utility profit may be defined as custom-character =Revenute−Expenditure to represent both the Revenue and Expenditure. Its utility function may be modelled on

$p (%) = \frac{p}{Expenditure} * 100.$

When depicting this as a utility function, a minimum profit level might be set by the service provider, represented by custom-character (min) which must be attained. Thus, for these embodiments, the utility function may be mathematically formulated as below:

$ (P) = {\begin{matrix} p (%), & p > p_{(\min)} \\ 0, & p = p_{(\min)} \\ p (%), & p < p_{(\min)} \end{matrix}$

c) Data Locality 316. In embodiments, this attribute may define the degree to which the placement must consider data locality for node n over time τ and is denoted by custom-character _n. The information may be derived from these variables:

- i. The number of data packets to be transferred δ
- ii. Rate of transmission ƒ(n)
- iii. The one-hop packet transmission cost to/from node n over the entire time it takes to transmit. This includes transmission from node n₁(which is the data producing node) indicated by w(n₁, n) and transmission to node n₂(which is the data storing node) indicated by w(n, n₂).
- iv. The distance between the nodes indicated by the number of switches that lie on the shortest path indicated by d(n₁, n) and d(n, n₂), where n₁, n₂are the data producing and data storing nodes respectively. [Note: By using the number of switches, the nodes may be allowed to be in different ranks or in different data-centres.]
- v. Access Frequency of these data packets node n over time period τ given by α_n^τ.

The total cost of transferring the data to node n denoted by custom-character (_n) and the access frequency of the data α_n^τ, may determine whether the data should be localized on node n or not. As

$(_{n}) \propto \frac{δ}{f (n)} * (w (n, n_{1}) * d (n, n_{1}) + w (n, n_{2}) * d (n, n_{2})),$

the data locality for the node can be defined as a multiplicative function of access frequency and cost. Thus, for these embodiments, the utility function for data locality of a placement, depicted by custom-character () may be given by:

custom-character ()=Σ_n=Σα_n^τ**(_n)

In some embodiments, the various values can also be derived from the table below:

(

_n): Cost
a_n^τ: Access Frequency
D_n: Data Locality

Low
Low
Not required

Low
High
Not required

High
Low
Not required

High
High
Required

[Notes: 1) It is possible that n₁== n₂. 2) Partitioning of data is not considered for this formulation. 3) If n == n₁or n == n₂, then d (n, n) = 0 and w(n, n) = 0].

d) Data Replication 318. In embodiments, this attribute may define the number of replications that should ideally be maintained by service providers to ensure performance. Once the cumulative access frequency of the data increases for all nodes n in a particular data-centre x ∈ custom-character over time period τ and goes beyond a threshold θ, the data may be considered for replication. is the set of data centers and x defines one such data center. The value of θ may be decided by the service-provider and depend on the cost of replication. This can be stated as:

Σ_n∈xα_n^τ>θ⇒replication

II. Customer-centric attributes 322.

A) Service component locality/anti-locality 332. In embodiments, for service component locality requirements, including geo-location, hybrid computing and IoT scenarios, a modified request graph may be created by using the concepts of edge contraction. Embodiments may make use of Edmund's blossom as presented for matching algorithms in any graph. See e.g., J. Edmonds, “Paths, Trees, and Flowers,” in Classic Papers in Combinatorics, Springer, 2009, pp. 361-379. for further details. For example. if two nodes in the request graph need to be on the same service component, the edge between them on the graph may be contracted, so that the orchestrator now sees one node, ensuring that both original nodes are assigned to the same service component.

(b) Availability 334. In embodiments, availability of node n may be represented as A_n^τ and may be mapped to the capacity C_n^t of the node, as shown:

A
_n
^t
=K*C
_n
^t

- where K is a constant used to normalize and map the value of capacity to availability.

In embodiments, a linear utility function may be defined for availability of node n indicated by custom-character (A) over the run-time. The values of the normalization variables y₁and y₂used to normalize the utility values between [0, 1] may be derived by replacing best and worst values for the availability as acceptable by customer, as the boundary conditions will be _n(A_n)^worst=0 and _n(A_n)^best=1 at all time-instances.

$ (A) = \sum_{t \in T_{r}}^{} y_{1} * A_{n}^{t} + y_{2}$

(c) Network Throughput 336. In embodiments, the network utility links may be studied in context of the communication links between different resources. As stated before, l ∈ custom-character may be a communication link within communication link set with capacity c_l. Node n may transmit a flow over a set of the links, indicated by path P at the source rate of ƒ_n(the transmission rate) whilst ensuring capacity bounds and within the utilization parameters of each network interface card encountered in path P. Thus, a utility of the node may be represented as custom-character (T) which may be maximized over all nodes. This utility function may be defined as a log function, since it may be concave, differentiable, and non-decreasing.

custom-character (T)=y₃log(ƒ_n) s.t: y₃>0 and ƒ_n≤c_l, ∀ l ∈ P

(d) Response Time 338. In embodiments, response time t_rmay refer to the time at which a task is completed and T_ris the time taken for the same. Arrival time may be represented as t_a, expected completion time value as t_e. Also, some amount of delay may be acceptable to the customer beyond expected completion time of the job indicated by s. In an ideal situation, the relation between these time instances can be shown: t_l≤t_r=t_e. Thus, for these embodiments, the utility function for response time can then be presented as:

$ (Rt) = {\begin{matrix} 1, t_{r} \leq t_{e} \\ - y_{4} (t_{r} - s), t_{e} + s < t_{r} \\ - 1, otherwise \end{matrix}$

(e) Latency 340. In embodiments, the latency experienced for a workload may be mainly dependent on serialization, queueing delay, routing latencies and propagation delay. This may be modelled by using the factors below:

- i. Number of shared cores or ‘Degree of Core Isolation’ may be denoted by l_nfor node n as:

$I_{n} = \frac{{cores}_{n}}{Σ_{v} {cores}_{v}}$

- ii. Serialization Latency denoted by L_s, may be dependent on data sizes transferred (total denoted by ρ and ƒ(n) which is the transmission rate).

$L_{s} = σ_{1} * \frac{ρ}{f (n)}$

- iii. Queueing Latency denoted by L_qmay be a summation of queuing delay Q_nfor each node n over response time T_r(The delay experienced due to a resource or node n being saturated, indicated by its saturation value S_n^tat time t and unable to process further requests).

L_q=Σ_nQ_n=σ₂Σ_nΣ_t_a_≤t≤ϵ_yS_n^t

- iv. Propagation Delay denoted by L_pmay be dependent on the summation of the geographical length covered by the request (distance to user):

$L_{p} = σ_{3} \sum_{p}^{} len (l), \forall l \in P, \forall P$

- v. Routing Latency denoted by L_rmay be dependent on the latency due to the various switches and routers.

L_r=σ₄* custom-character

σ1, σ2, σ3, and σ4 are constants used to relate the terms in the definitions. custom-character _routersdefines the set of routers present as resources.

Thus, the utility function for latency, denoted by custom-character (Lt) given in SLA and penalty can be defined as l given in SLA and penaly can be defined as:

$ (Lt) = y_{5} * \sum_{n}^{} I_{n} * e^{(\frac{y_{6}}{L_{s} + L_{q} + L_{p} + L_{r}})}$

Variables y₁, y₂, y₃, y₄, y₅, and y₆are constants for normalization.

III. Provider and Customer Utility.

For these embodiments, the provider-centric utility function for a potential placement/rebalancing solution (eq. (1)) may instead be expressed as follows:

custom-character
_s=(α₁*(P)+β₁)*(α₂*(D)+β₂)*(α₃*(R)+β₃) s.t. α₁+α₂+α₃ (3)

The customer-centric utility function for a potential placement/rebalancing solution (eq. (2)) may instead be expressed as follows:

custom-character
_e=(α₄*ζ(L)+β₄)*(α₅*(A)+β₅)*(α₆*(T)+β₆)*(α₇*(Rt)+β₇)*(α₈* (Lt)+β₈) s.t. α₄+α₅+α₆+α₇+α₈=1 (4)

FIG. 5 illustrates an example computing device suitable for use to practice aspects of the present disclosure, according to various embodiments. As shown, computing device 500 may include one or more hardware processors 502, each having one or more hardware processor cores 514 and optionally, one or more hardware accelerators 516, and system memory 504. Additionally, computing device 500 may include input/output device interfaces 508 (for interfacing with I/O devices such as display, keyboard, cursor control and so forth) and communication interfaces 510 for communication devices (such as network interface cards, modems and so forth). The elements may be communicatively coupled to each other via system bus 512, which may represent one or more buses. In the case of multiple buses, they may be bridged by one or more bus bridges (not shown). Additionally, computing device 500 may include mass storage devices 506 (such as diskette, hard drive, compact disc read only memory (CD-ROM) and so forth).

Each of these elements may perform its conventional functions known in the art. In embodiments, system memory 504 and mass storage devices 506 may be employed to store a working copy and a permanent copy of the programming instructions implementing an operating system and various applications, in particular, the operating logic for potential placement solution generator 202, one or more utility function calculators 204-206, an analyzer 208 and a resource allocator 210, collectively referred to as computational logic 522. As shown, aspects of computational logic 522, e.g., some functions of utility function calculators 204-206, may be implemented in hardware accelerator 516. The software portion of computational logic 522 may be implemented by assembler instructions supported by hardware processor(s) 502 or high-level languages, such as, for example, C, that can be compiled into such instructions.

The number, capability and/or capacity of these elements 510-512 may vary, depending on whether computing device 500 is used as a compute resource 112, a client system 104 or computing infrastructure resource-workload manager 120. Otherwise, the constitutions of elements 510-512 are known, and accordingly will not be further described.

FIG. 6 illustrates an example storage medium having bit streams to configure a hardware accelerator or instructions to cause an apparatus to practice aspects of the present disclosure, according to various embodiments. As shown, non-transitory computer-readable storage medium 502 may include a number of bit streams or programming instructions 604. Bit streams 604 may be employed to configure hardware accelerators to perform e.g., various operations associated with some functions of utility function calculators 204-206. Programming instructions 604 may be configured to enable a computing device, e.g., computing device 500, in response to execution of the programming instructions, to perform, e.g., various operations associated with potential placement solution generator 202, one or more utility function calculators 204-206, an analyzer 208 and a resource allocator 210, described with references to FIGS. 1-4. In alternate embodiments, bit streams or programming instructions 604 may be disposed on multiple computer-readable non-transitory storage media 602 instead. In alternate embodiments, bit streams or programming instructions 904 may be disposed on computer-readable transitory storage media 602, such as, signals.

Referring back to FIG. 5, for one embodiment, at least one of hardware processors 502 may be packaged together with memory having computational logic 522 (in lieu of storing on memory 504 and storage 506). For one embodiment, at least one of hardware processors 502 may be packaged together with memory having aspects of computational logic 522 to form a System in Package (SiP). For one embodiment, at least one of hardware processors 502 may be integrated on the same die with memory having aspects of computational logic 522. For one embodiment, at least one of hardware processors 502 may be packaged together with memory having aspects of computational logic 522 to form a System on Chip (SoC).

Thus various example embodiments of the present disclosure have been described including, but are not limited to:

Example 1 may be an apparatus for managing resources and their assigned workloads of a computing infrastructure, comprising: a resource-workload manager having: a placement solution generator to receive resource data and workload data for the computing infrastructure, and generate a plurality of potential resource placement solutions for allocation of the various resources of the computing infrastructure to various components of the various workloads; one or more utility function calculators communicatively coupled with the placement solution generator to calculate one or more values for one or more provider-centric attributes, and one or more values for one or more customer-centric attributes, for the plurality of potential resource placement solutions; an analyzer communicatively coupled to the one or more utility function calculators to analyze the attributes and select one of the potential resource placement solutions; and a resource allocator communicatively coupled to analyzer to allocate the resources of the computing infrastructure, based at least in part on the selected resource placement solution.

Example 2 may be example 1, wherein the one or more utility function calculators may include an utility function calculator to calculate values for the one or more provider-centric attributes, for the plurality of potential resource placement solutions.

Example 3 may be example 2, wherein the one or more provider-centric attributes may comprise one or more of a revenue attribute, an expenditure attribute, a profit attribute custom-character (P), a data locality attribute (), and a data replication attribute (R).

Example 4 may be example 3, wherein the plurality of provider-centric attributes may comprise a revenue attribute that details an amount of credit ($) a provider of the computing infrastructure would gain by hosting a service with a particular resource placement solution, an expenditure attribute that details the total cost of ownership in credits ($) of the resources in a particular resource placement solution, or a profit attribute custom-character (P) of a particular resource placement solution derived from the revenue and expenditure attributes of the particular resource placement solution.

Example 5 may be example 3, wherein the plurality of provider-centric attributes may comprise a data locality attribute custom-character () that details distance or access frequency between compute resources and storage resources of the computing infrastructure a particular resource placement solution, or a data replication attribute (R) that details a number of possible data/storage replicas the computing infrastructure has to deal with for fault-tolerance/high availability/performance issues for a particular resource placement solution.

Example 6 may be example 3 wherein the utility function calculator may further calculate a provider utility custom-character _sfor each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the plurality of provider-centric attributes for the particular potential resource placement solution:

custom-character
_s=(α₁*(P)+β₁)*(α₂*()+β₂)*(α₃*(R)+β₃)

where α₁, α₂, and α₃are relative weights of profit attribute custom-character (P), data locality attribute (), and data replication attribute (R), and β₁, β₂, and β₃are additive weights that store dependence on other attributes.

Example 7 may be example 1, wherein the one or more utility function calculators may include an utility function calculator to calculate values for one or more customer-centric attributes, for the plurality of potential solutions.

Example 8 may be example 7, wherein the one or more customer-centric attributes may comprise at least two attributes selected from an attribute group that may include a service components locality or anti-locality attribute custom-character (L), an availability attribute (A) network throughput attribute (T), a response time attribute (Rt), and a latency attribute) (Lt).

Example 9 may be example 8, wherein the one or more customer-centric attributes may comprise a service components locality or anti-locality attribute custom-character (L)that details the requirements of a workload to be hosted by a resource placement solution in one or in separated locations of the computing infrastructure.

Example 10 may be example 8, wherein the one or more customer-centric attributes may comprise a service components an availability attribute ∥(A) that details the desired availability of a service for a particular workload, or a network throughput attribute ∥(T) that detail a desired minimum network throughput a service of a particular workload needs.

Example 11 may be example 8, wherein the one or more customer-centric attributes may comprise a service components a response time attribute custom-character (Rt) that details a time at which a task of a workload may be completed, and a latency attribute (Lt) that details a maximum desired latency between two service components of a particular workload.

Example 12 may be example 8, wherein the utility function calculator may further calculate a customer utility custom-character _efor each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the one or more provider-centric attributes for the particular potential resource placement solution:

custom-character
_e=(α₄*(L)+β₄)*(α₅*(A)+β₅)*(α₆*(T)+β₆)*(α₇*(Rt)+β₇)*(α₈* (Lt)+β₈)

where α₄, α₅, α₆, α₇, and α₈are relative weights of locality or anti-locality attribute custom-character (L), availability attribute (A), network throughput attribute (T), response time attribute (Rt) and latency attribute (Lt), and β₄, β₅, β₆, β₇, and β₈and (3₈are additive weights that store dependence on other attributes.

Example 13 may be any one of examples 1-12, further comprising a hardware processor to operate the computing infrastructure resource-workload manager.

Example 14 may be example 13, wherein the hardware processor may include one or more processor cores respectively dedicated to operate the one or more utility function calculators.

Example 15 may be example 13, wherein where the hardware processor may include one or more processor cores, and one or more hardware accelerators, wherein the one or more hardware accelerators are respectively dedicated to operate the one or more utility function calculators.

Example 16 may be a method for managing resources and their assigned workloads of a computing infrastructure, comprising: receiving, by a placement solution generator of a resource-workload manager operating on a computing device, resource data and workload data for the computing infrastructure; generating, by the placement solution generator, a plurality of potential resource placement solutions for allocation of the various resources of the computing infrastructure to various components of the various workloads; calculating, by one or more utility function calculators of the resource-workload manager, one or more values for one or more provider-centric attributes, and one or more values for one or more customer-centric attributes, for the plurality of potential resource placement solutions; analyzing, by an analyzer of the resource-workload manager, the attributes and selecting one of the potential resource placement solutions; and allocating, by a resource allocator of the resource-workload manager, the resources of the computing infrastructure, based at least in part on the selected resource placement solution.

Example 17 may be example 16, wherein calculating may comprise calculating values for the one or more provider-centric attributes, for the plurality of potential resource placement solutions.

Example 18 may be example 17, wherein the one or more provider-centric attributes may comprise one or more of a revenue attribute, an expenditure attribute, a profit attribute custom-character (P), a data locality attribute (), and a data replication attribute (R).

Example 19 may be example 18, wherein the plurality of provider-centric attributes may comprise a revenue attribute that details an amount of credit ($) a provider of the computing infrastructure would gain by hosting a service with a particular resource placement solution, an expenditure attribute that details the total cost of ownership in credits ($) of the resources in a particular resource placement solution, or a profit attribute custom-character (P) of a particular resource placement solution derived from the revenue and expenditure attributes of the particular resource placement solution.

Example 20 may be example 18, wherein the plurality of provider-centric attributes may comprise a data locality attribute custom-character (π) that details distance or access frequency between compute resources and storage resources of the computing infrastructure a particular resource placement solution, or a data replication attribute (R) that details a number of possible data/storage replicas the computing infrastructure has to deal with for fault-tolerance/high availability/performance issues for a particular resource placement solution.

Example 21 may be example 18 wherein calculating further may comprise calculating a provider utility custom-character _sfor each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the plurality of provider-centric attributes for the particular potential resource placement solution:

custom-character
_s=(α₁*(P)+β₁)*(α₂*()+β₂)*(α₃*(R)+β₃)

Example 22 may be any one of examples 16-21, wherein calculating may comprise calculating values for one or more customer-centric attributes, for the plurality of potential solutions.

Example 23 may be example 22, wherein the one or more customer-centric attributes may comprise at least two attributes selected from an attribute group that may include a service components locality or anti-locality attribute custom-character (L), an availability attribute (A), a network throughput attribute (T), a response time attribute (Rt), and a latency attribute (Lt).

Example 24 may be example 23, wherein the one or more customer-centric attributes may comprise a service components locality or anti-locality attribute custom-character (L) that details the requirements of a workload to be hosted by a resource placement solution in one or in separated locations of the computing infrastructure.

Example 25 may be example 23, wherein the one or more customer-centric attributes may comprise a service components an availability attribute custom-character (A) that details the desired availability of a service for a particular workload, or a network throughput attribute) (T) that detail a desired minimum network throughput a service of a particular workload needs.

Example 26 may be example 23, wherein the one or more customer-centric attributes may comprise a service components a response time attribute custom-character (Rt) that details a time at which a task of a workload may be completed, and a latency attribute (Lt) that details a maximum desired latency between two service components of a particular workload.

Example 27 may be example 23, wherein calculating further may comprise calculating a customer utility custom-character _efor each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the one or more provider-centric attributes for the particular potential resource placement solution:

custom-character
_e=(α₄*(L)+β₄)*(α₅*(A)+β₅)*(α₆*(T)+β₆)*(α₇*(Rt)+β₇)*(α₈* (Lt)+β₈)

where α₄, α₅, α₆, α₇, and α₈are relative weights of locality or anti-locality attribute) custom-character (L), availability attribute (A), network throughput attribute (T), response time attribute (Rt), and latency attribute (Lt), and β₄, β₅, β₆, β₇, and β₈are additive weights that store dependence on other attributes.

Example 28 may be one or more computer-readable media (CRM) comprising instructions that cause a computing device, in response to execution of the instructions by a hardware processor, to operate a resource-workload manager of a computing infrastructure to: receive resource data and workload data for the computing infrastructure; generate a plurality of potential resource placement solutions for allocation of the various resources of the computing infrastructure to various components of the various workloads; calculate one or more values for one or more provider-centric attributes, and one or more values for one or more customer-centric attributes, for the plurality of potential resource placement solutions; analyze the attributes and selecting one of the potential resource placement solutions; and allocating the resources of the computing infrastructure, based at least in part on the selected resource placement solution.

Example 29 may be example 28, wherein to calculate may comprise to calculate values for the one or more provider-centric attributes, for the plurality of potential resource placement solutions.

Example 30 may be example 29, wherein the one or more provider-centric attributes may comprise one or more of a revenue attribute, an expenditure attribute, a profit attribute custom-character (P), a data locality attribute () and a data replication attribute (R)

Example 31 may be example 30, wherein the plurality of provider-centric attributes may comprise a revenue attribute that details an amount of credit ($) a provider of the computing infrastructure would gain by hosting a service with a particular resource placement solution, an expenditure attribute that details the total cost of ownership in credits ($) of the resources in a particular resource placement solution, or a profit attribute custom-character (P) of a particular resource placement solution derived from the revenue and expenditure attributes of the particular resource placement solution.

Example 32 may be example 30, wherein the plurality of provider-centric attributes may comprise a data locality attribute custom-character () that details distance or access frequency between compute resources and storage resources of the computing infrastructure a particular resource placement solution, or a data replication attribute (R) that details a number of possible data/storage replicas the computing infrastructure has to deal with for fault-tolerance/high availability/performance issues for a particular resource placement solution.

Example 33 may be example 30 wherein to calculate further may comprise to calculate a provider utility custom-character _sfor each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the plurality of provider-centric attributes for the particular potential resource placement solution:

custom-character
_s=(α₁*(P)+β₁)*(α₂*z,98 ()+β₂)*(α₃*(R)+β₃)

Example 34 may be any one of examples 28-33, wherein the one or more utility function calculators may include an utility function calculator to calculate values for one or more customer-centric attributes, for the plurality of potential solutions.

Example 35 may be example 34, wherein the one or more customer-centric attributes may comprise at least two attributes selected from an attribute group that may include a service components locality or anti-locality attribute custom-character (L), an availability attribute (A), a network throughput attribute (T), a response time attribute (Rt), and a latency attribute (Lt).

Example 36 may be example 35, wherein the one or more customer-centric attributes may comprise a service components locality or anti-locality attribute custom-character (L)that details the requirements of a workload to be hosted by a resource placement solution in one or in separated locations of the computing infrastructure.

Example 37 may be example 36, wherein the one or more customer-centric attributes may comprise a service components an availability attribute custom-character (A) that details the desired availability of a service for a particular workload, or a network throughput attribute (T)that detail a desired minimum network throughput a service of a particular workload needs.

Example 38 may be example 35, wherein the one or more customer-centric attributes may comprise a service components a response time attribute custom-character (Rt) that details a time at which a task of a workload may be completed, and a latency attribute (Lt) that details a maximum desired latency between two service components of a particular workload.

Example 39 may be example 35, wherein to calculate further may comprise to calculate a customer utility custom-character _efor each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the one or more provider-centric attributes for the particular potential resource placement solution:

custom-character
_e=(α₄*(L)+β₄)*(α₅*(A)+β₅)*(α₆*(T)+β₆)*(α₇*(Rt)+β₇)*(α₈* (Lt)+β₈)

where α₄, α₅, α₆, α₇, and α₈are relative weights of locality or anti-locality attribute custom-character (L), availability attribute (A), network throughput attribute (T), response time attribute (Rt), and latency attribute (Lt), and β₄, β₅, β₆, β₇, and β₈are additive weights that store dependence on other attributes.

Example 40 may be an apparatus for managing resources and their assigned workloads of a computing infrastructure, comprising: means for receiving resource data and workload data for the computing infrastructure, and generating a plurality of potential resource placement solutions for allocating the various resources of the computing infrastructure to various components of the various workloads; means for calculating one or more values for one or more provider-centric attributes, and one or more values for one or more customer-centric attributes, for the plurality of potential resource placement solutions; means for analyzing the attributes, and selecting one of the potential resource placement solutions; and means for allocating the resources of the computing infrastructure, based at least in part on the selected resource placement solution.

Example 41 may be example 40, wherein means for calculating may comprise means for calculating values for the one or more provider-centric attributes, for the plurality of potential resource placement solutions.

Example 42 may be example 41, wherein the one or more provider-centric attributes may comprise one or more of a revenue attribute, an expenditure attribute, a profit attribute custom-character (P), a data locality attribute ω(), and a data replication attribute ω(R).

Example 43 may be example 42, wherein the plurality of provider-centric attributes may comprise a revenue attribute that details an amount of credit ($) a provider of the computing infrastructure would gain by hosting a service with a particular resource placement solution, an expenditure attribute that details the total cost of ownership in credits ($) of the resources in a particular resource placement solution, or a profit attribute ω(P) of a particular resource placement solution derived from the revenue and expenditure attributes of the particular resource placement solution.

Example 44 may be example 42, wherein the plurality of provider-centric attributes may comprise a data locality attribute ω(∠) that details distance or access frequency between compute resources and storage resources of the computing infrastructure a particular resource placement solution, or a data replication attribute ω(R) that details a number of possible data/storage replicas the computing infrastructure has to deal with for fault-tolerance/high availability/performance issues for a particular resource placement solution.

Example 45 may be example 42, wherein means for calculating further may comprise means for calculating a provider utility custom-character _sfor each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the plurality of provider-centric attributes for the particular potential resource placement solution:

custom-character
_s=(α₁*(P)+β₁)*(α₂*()+β₂)*(α₃*(R)+β₃)

where α₁, α₂, and α₃are relative weights of profit attribute custom-character (P), data locality attribute (), and data replication attribute (R), and β₁, β₂, and β₃, are additive weights that store dependence on other attributes.

Example 46 may be any one of examples 40-45, wherein means for calculating may comprise means for calculating values for one or more customer-centric attributes, for the plurality of potential solutions.

Example 47 may be example 46, wherein the one or more customer-centric attributes may comprise at least two attributes selected from an attribute group that may include a service components locality or anti-locality attribute custom-character (T), an availability attribute (A) network throughput attribute (T), a response time attribute (Rt) and a latency attribute (Lt).

Example 48 may be example 47, wherein the one or more customer-centric attributes may comprise a service components locality or anti-locality attribute custom-character (L)that details the requirements of a workload to be hosted by a resource placement solution in one or in separated locations of the computing infrastructure.

Example 49 may be example 47, wherein the one or more customer-centric attributes may comprise a service components an availability attribute custom-character (A) that details the desired availability of a service for a particular workload, or a network throughput attribute) (T)that detail a desired minimum network throughput a service of a particular workload needs.

Example 50 may be example 47, wherein the one or more customer-centric attributes may comprise a service components a response time attribute custom-character (Rt) that details a time at which a task of a workload may be completed, and a latency attribute (Lt) that details a maximum desired latency between two service components of a particular workload.

Example 51 may be example 47, wherein means for calculating further may comprise means for calculating a customer utility custom-character _efor each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the one or more provider-centric attributes for the particular potential resource placement solution:

custom-character
_e=(α₄*(L)+β₄)*(α₆*(A)+β₆)*(α₆*(T)+β₆)*(α₇z,114 (Rt)+β₇)*α₈* (Lt)+β₈)

where α₄, α₅, α₆, α₇, and α₈are relative weights of locality or anti-locality attribute custom-character (L), availability attribute (A), network throughput attribute (T), response time attribute (Rt), and latency attribute (Lt), and β₄, β₅, β₆, β₇, and β₈are additive weights that store dependence on other attributes.

It will be apparent to those skilled in the art that various modifications and variations can be made in the disclosed embodiments of the disclosed device and associated methods without departing from the spirit or scope of the disclosure. Thus, it is intended that the present disclosure covers the modifications and variations of the embodiments disclosed above provided that the modifications and variations come within the scope of any claims and their equivalents.

Claims

1. An apparatus for managing resources and their assigned workloads of a computing infrastructure, comprising: a resource-workload manager having: a placement solution generator to receive resource data and workload data for the computing infrastructure, and generate a plurality of potential resource placement solutions for allocation of the various resources of the computing infrastructure to various components of the various workloads;one or more utility function calculators communicatively coupled with the placement solution generator to calculate one or more values for one or more provider-centric attributes, and one or more values for one or more customer-centric attributes, for the plurality of potential resource placement solutions;an analyzer communicatively coupled to the one or more utility function calculators to analyze the attributes and select one of the potential resource placement solutions; anda resource allocator communicatively coupled to analyzer to allocate the resources of the computing infrastructure, based at least in part on the selected resource placement solution.
2. The apparatus of claim 1, wherein the one or more utility function calculators include an utility function calculator to calculate values for the one or more provider-centric attributes, for the plurality of potential resource placement solutions.
3. The apparatus of claim 2, wherein the one or more provider-centric attributes comprise one or more of a revenue attribute, an expenditure attribute, a profit attribute (P), a data locality attribute (), and a data replication attribute (R).
4. The apparatus of claim 3, wherein the plurality of provider-centric attributes comprise a revenue attribute that details an amount of credit ($) a provider of the computing infrastructure would gain by hosting a service with a particular resource placement solution, an expenditure attribute that details the total cost of ownership in credits ($) of the resources in a particular resource placement solution, or a profit attribute (P) of a particular resource placement solution derived from the revenue and expenditure attributes of the particular resource placement solution.
5. The apparatus of claim 3, wherein the plurality of provider-centric attributes comprise a data locality attribute () that details distance or access frequency between compute resources and storage resources of the computing infrastructure a particular resource placement solution, or a data replication attribute (R) that details a number of possible data/storage replicas the computing infrastructure has to deal with for fault-tolerance/high availability/performance issues for a particular resource placement solution.
6. The apparatus of claim 3 wherein the utility function calculator is to further calculate a provider utility s for each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the plurality of provider-centric attributes for the particular potential resource placement solution: s=(α1*(P)+β1)*(α2*()+β2)*(α3*(R)+β3)where α1, α2, and α3 are relative weights of profit attribute (P), data locality attribute (), and data replication attribute (R), and β1, β2, and β3 are additive weights that store dependence on other attributes.
7. The apparatus of claim 1, wherein the one or more utility function calculators include an utility function calculator to calculate values for one or more customer-centric attributes, for the plurality of potential solutions.
8. The apparatus of claim 7, wherein the one or more customer-centric attributes comprise at least two attributes selected from an attribute group that includes a service components locality or anti-locality attribute (L), an availability attribute (A), a network throughput attribute (T) a response time attribute (Rt), and a latency attribute (Lt).
9. The apparatus of claim 8, wherein the one or more customer-centric attributes comprise a service components locality or anti-locality attribute (L)that details the requirements of a workload to be hosted by a resource placement solution in one or in separated locations of the computing infrastructure.
10. The apparatus of claim 8, wherein the one or more customer-centric attributes comprise a service components an availability attribute (A) that details the desired availability of a service for a particular workload, or a network throughput attribute (T)that detail a desired minimum network throughput a service of a particular workload needs.
11. The apparatus of claim 8, wherein the one or more customer-centric attributes comprise a service components a response time attribute (R2) that details a time at which a task of a workload is to be completed, and a latency attribute (Lt) that details a maximum desired latency between two service components of a particular workload.
12. The apparatus of claim 8, wherein the utility function calculator is to further calculate a customer utility e for each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the one or more provider-centric attributes for the particular potential resource placement solution: e=(α4*(L)+β4)*(α5*(A)+5)*(α6*(T)+β6)*(α7*(Rt)+β7)*(α8* (Lt)+β8)where α4, α5, α6, α7, and α8 are relative weights of locality or anti-locality attribute) (L), availability attribute (A), network throughput attribute (T), response time attribute (Rt), and latency attribute (Lt), and β4, β5, β6, β7, and β8 are additive weights that store dependence on other attributes.
13. The apparatus of claim 1, further comprising a hardware processor to operate the computing infrastructure resource-workload manager.
14. The apparatus 13, wherein the hardware processor include one or more processor cores respectively dedicated to operate the one or more utility function calculators.
15. The apparatus of claim 13, wherein where the hardware processor include one or more processor cores, and one or more hardware accelerators, wherein the one or more hardware accelerators are respectively dedicated to operate the one or more utility function calculators.
16. A method for managing resources and their assigned workloads of a computing infrastructure, comprising: receiving, by a placement solution generator of a resource-workload manager operating on a computing device, resource data and workload data for the computing infrastructure;generating, by the placement solution generator, a plurality of potential resource placement solutions for allocation of the various resources of the computing infrastructure to various components of the various workloads;calculating, by one or more utility function calculators of the resource-workload manager, one or more values for one or more provider-centric attributes, and one or more values for one or more customer-centric attributes, for the plurality of potential resource placement solutions;analyzing, by an analyzer of the resource-workload manager, the attributes and selecting one of the potential resource placement solutions; andallocating, by a resource allocator of the resource-workload manager, the resources of the computing infrastructure, based at least in part on the selected resource placement solution.
17. The method of claim 16, wherein calculating comprises calculating values for the one or more provider-centric attributes, for the plurality of potential resource placement solutions; wherein the one or more provider-centric attributes comprise one or more of a revenue attribute, an expenditure attribute, a profit attribute (P), a data locality attribute (), and a data replication attribute (R); and wherein calculating comprises calculating values for one or more customer-centric attributes, for the plurality of potential solutions; wherein the one or more customer-centric attributes comprise at least two attributes selected from an attribute group that includes a service components locality or anti-locality attribute (L), an availability attribute (A), a network throughput attribute (T), a response time attribute (Rt), and a latency attribute (Lt).
18. The method of claim 17 wherein calculating further comprises calculating a provider utility s for each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the plurality of provider-centric attributes for the particular potential resource placement solution: s=(α1*(P)+β1)*(α2*()+β2)*(α3*(R)+β3)where α1, α2, and α3 are relative weights of profit attribute (P), data locality attribute (), and data replication attribute (R), and β1, β2, and β3 and (33 are additive weights that store dependence on other attributes.
19. The method of claim 18, wherein calculating further comprises calculating a customer utility e for each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the one or more provider-centric attributes for the particular potential resource placement solution: e=(α4*(L)+β4)*(α5*(A)+β6*(α6*(T)+β6)*(α7*(Rt)+β7)*(α8* (Lt)+β8)where α4, α5, α6, α7, and α8 are relative weights of locality or anti-locality attribute (L), availability attribute (A), network throughput attribute (T), response time attribute (Rt) and latency attribute (Lt), and β4, β5, β6, β7, and β8 are additive weights that store dependence on other attributes.
20. One or more computer-readable media (CRM) comprising instructions that cause a computing device, in response to execution of the instructions by a hardware processor, to operate a resource-workload manager of a computing infrastructure to: receive resource data and workload data for the computing infrastructure;generate a plurality of potential resource placement solutions for allocation of the various resources of the computing infrastructure to various components of the various workloads;calculate one or more values for one or more provider-centric attributes, and one or more values for one or more customer-centric attributes, for the plurality of potential resource placement solutions;analyze the attributes and selecting one of the potential resource placement solutions; andallocating the resources of the computing infrastructure, based at least in part on the selected resource placement solution.
21. The CRM of claim 20, wherein to calculate comprises to calculate values for the one or more provider-centric attributes, for the plurality of potential resource placement solutions.
22. The CRM of claim 21, wherein the one or more provider-centric attributes comprise one or more of a revenue attribute, an expenditure attribute, a profit attribute (P), a data locality attribute (), and a data replication attribute (R).
23. The CRM of claim 22, wherein to calculate further comprises to calculate a provider utility s for each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the plurality of provider-centric attributes for the particular potential resource placement solution: s=(α1*(P)+β1)*(α2*()+β2)*(α3*R)+β3)where α1, α2, and α3 are relative weights of profit attribute (P), data locality attribute (), and data replication attribute (R), and β1, β2, and β3 are additive weights that store dependence on other attributes.
24. The CRM of claim 20, wherein the one or more utility function calculators include an utility function calculator to calculate values for one or more customer-centric attributes, for the plurality of potential solutions; wherein the one or more customer-centric attributes comprise at least two attributes selected from an attribute group that includes a service components locality or anti-locality attribute an availability attribute (A), a network throughput attribute (T), a response time attribute (Rt) and a latency attribute (Lt).
25. The CRM of claim 24, wherein to calculate further comprises to calculate a customer utility e for each of the plurality of potential resource placement solutions, based at least in part on the corresponding calculated values for the one or more provider-centric attributes for the particular potential resource placement solution: e=(α4*(L)+β4)*(α5*(A)+β5)*(α6*(T)+β6)*(α7*(Rt)+β7)*(α8* (Lt)+β8)where α4, α5, α6, α7, and α8 are relative weights of locality or anti-locality attribute (L), availability attribute (A) network throughput attribute (T), response time attribute (Rt) and latency attribute (Lt), and β4, β5, β6, β7, and β8 are additive weights that store dependence on other attributes.

COMPUTING INFRASTRUCTURE RESOURCE-WORKLOAD MANAGEMENT METHODS AND APPARATUSES

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims