The present description relates to a framework and architecture for managing resources in a distributed system.
Resource allocation in a computing system is the assignment of available resources to various uses. Resource management is the scheduling of activities and the resources required by those activities while taking into consideration resource availability and timing. Resource management includes resource allocation and resource enforcement, which is ensuring that resource allocation is respected.
Resource management in systems of distributed resources is challenging and therefore, improvements are desired.
The present disclosure is drawn to methods and systems for resource enforcement in a distributed system. At least one first host of the system has an enforcement agent configured to trigger a master enforcement controller in response to the first host running a task requiring access to a distributed resource. The master enforcement controller obtains identification information regarding the distributed resource and transmits an enforcement command to other hosts of the distributed system involved in providing the distributed resource. The enforcement command is received by enforcement agents on each of the involved hosts and triggers a slave enforcement controller on each of the involved hosts. The slave enforcement controllers locally enforce a resource quota for the distributed resource.
In accordance with a broad aspect, there is provided a method for resource enforcement in a distributed system having a plurality of hosts. A master enforcement controller on a first host of the plurality of hosts is triggered in response to the first host accessing a distributed resource of the distributed system. Identification information regarding the distributed resource is obtained by the master enforcement controller. The master enforcement controller transmits an enforcement command including the identification information to at least one second host of the plurality of hosts, the at least one second host being associated with the distributed resource, the enforcement command configured for triggering a slave enforcement controller on the at least one second host for locally enforcing a resource quota.
In some embodiments, obtaining identification information comprises determining an accessing mechanism for the distributed resource. In some embodiments, obtaining identification information comprises identifying the at least one second host by determining which ones of the plurality of hosts are involved in providing the distributed resource.
In some embodiments, transmitting the enforcement command comprises transmitting to two or more hosts of the plurality of hosts.
In some embodiments, the enforcement command further comprises the resource quota for each of the two or more hosts.
In some embodiments, the method further comprises dynamically adjusting the resource quota based on access to the distributed resource by the first host. In some embodiments, dynamically adjusting the resource quota comprises setting different resource quotas for each of the two or more hosts.
In some embodiments, the method further comprises receiving the enforcement command from another one of the plurality of hosts of the distributed system; and triggering the slave enforcement controller on the first host.
In some embodiments, the enforcement command includes an adjusted resource quote.
In accordance with another aspect, there is provided a host in a distributed system having a plurality of hosts. The host comprises a processing unit and a non-transitory memory communicatively coupled to the processing unit. The memory comprises computer-readable program instructions executable by the processing unit for triggering a master enforcement controller on the host in response to the host accessing a distributed resource of the distributed system; obtaining, by the master enforcement controller, identification information regarding the distributed resource; and transmitting an enforcement command including the identification information to at least one other host of the plurality of hosts, the at least one other host being associated with the distributed resource, the enforcement command configured for triggering a slave enforcement controller on the at least one other host for locally enforcing a resource quota.
In some embodiments, obtaining identification information comprises determining an accessing mechanism for the distributed resource.
In some embodiments, obtaining identification information comprises identifying the at least one other host by determining which ones of the plurality of hosts are involved in providing the distributed resource.
In some embodiments, transmitting the enforcement command comprises transmitting to two or more hosts of the plurality of hosts.
In some embodiments, the enforcement command further comprises the resource quota for each of the two or more hosts.
In some embodiments, the processing unit is further executable for dynamically adjusting the resource quota based on access to the distributed resource by the host.
In some embodiments, dynamically adjusting the resource quota comprises setting different resource quotas for each of the two or more hosts.
In some embodiments, the processing unit is further executable for receiving the enforcement command from another one of the plurality of hosts of the distributed system; and triggering the slave enforcement controller on the host.
In some embodiments, the enforcement command includes an adjusted resource quote.
In accordance with another broad aspect, there is provided a distributed resource management system comprising a plurality of hosts, at least one of the plurality of hosts comprising an enforcement agent, the enforcement agent configured for locally triggering a master enforcement controller in response to access of a distributed resource by a corresponding host, the master enforcement controller configured for obtaining identification information regarding the distributed resource and transmitting an enforcement command with the identification information to at least one other host associated with the distributed resource, the enforcement agent also configured for locally triggering a slave enforcement controller upon receipt of the enforcement command from another one of the plurality of hosts, the slave enforcement controller configured for locally enforcing a resource quota.
In some embodiments, each one of the plurality of hosts comprise the enforcement configured for locally triggering the master enforcement controller and the slave enforcement controller.
Further features and advantages of the present invention will become apparent from the following detailed description, taken in combination with the appended drawings, in which:
It will be noted that throughout the appended drawings, like features are identified by like reference numerals.
Referring to
The system 100 can be arranged according to any one of the following architectures: host-based hierarchy, decentralized stand-alone, peer-to-peer Local Access Network (LAN)-based, hybrid enterprise-wide, client-server, and Internet-centric. At least one distributed service is provided across at least two of the hosts 102, and at least one of the hosts 102 can run a task. Examples of tasks are High Performance Computing (HPC) batch jobs, Message Passing Interface (MPI), serial batches, real-time analytics, elastic applications, long running services, virtual machines, and task containers. Examples of distributed resources are distributed file systems, virtualized file systems, distributed databases, virtualized networks, and distributed cache.
At least one first host 1021 of the plurality of hosts 102 in the distributed system 100 is configured for distributed resource enforcement across the system 100 when the first host 1021 runs a task that accesses a distributed resource. At least one second host 1022 of the plurality of hosts 102 in the distributed system 100 is configured for distributed resource enforcement across the system 100 when the first host 1021 accesses a distributed resource to which the second host 1022 is associated, i.e. the second host 1022 is involved in providing the distributed resource accessed by the first host 1021.
An enforcement agent 2061 is provided on first host 1021 and is operatively coupled to a master enforcement controller 2081. Enforcement agent 2061 may be running an application task and/or a distributed resource on first host 1021 and is configured to trigger master enforcement controller 2081 in response to the first host accessing a distributed resource of the system 100. Therefore, master enforcement controller 2081 is started by enforcement agent 2061 on first host 1021. Master enforcement controller 2081 obtains identification information regarding the distributed resource accessed by first host 1021. Master enforcement controller 2081 then transmits an enforcement command with the identification information to at least the second host 1022 in the distributed system 100. If additional ones of the plurality of hosts 102 are also associated with the accessed distributed resource, then the enforcement command is transmitted to all of the associated hosts 102.
An enforcement agent 2062 is provided on second host 1022 and is operatively coupled to a slave enforcement controller 3062. Enforcement agent 2062 runs the distributed resource accessed by first host 1021 and is configured to trigger slave enforcement controller 3062 in response to receipt of the enforcement command from first host 1021. Therefore, enforcement agent 2062 starts slave enforcement controller 3062 on second host 1022. Slave enforcement controller 3062 conducts local resource enforcement according to a resource quota.
The resource quota can be provided to second host 1022 by any one of the hosts 102 in the distributed system 100, or by another entity that acts as a master scheduler for setting an initial resource quota. In some embodiments, master enforcement controller 2081 is configured to dynamically adjust the resource quota based on access to the distributed resource by first host 1021. The adjusted resource quota is then transmitted from master enforcement controller 2081 to slave enforcement controller 3062, either as part of the identification information or separately therefrom. In some embodiments, the initial resource quota is transmitted from master enforcement controller 2081 to slave enforcement controller 3062, either as part of the identification information or separately therefrom, and the initial resource quota is iteratively adapted and retransmitted from master enforcement controller 2081 to slave enforcement controller 3062 as first host 1021 continues to perform the task involving access to the distributed resource.
When more than one of the hosts 102 is involved in providing the distributed resource, master enforcement controller 2081 transmits the enforcement command to each one of hosts 102 involved. The enforcement command will cause a local enforcement agent on each one of the involved hosts 102 to trigger a local slave enforcement controller. In some embodiments, master enforcement controller 2081 provides each one of the involved hosts 102 with a different resource quota, as a function of the specific needs of each involved host 102.
In some embodiments, at least one host 1023 has an enforcement agent 2063 configured for selectively triggering a local master enforcement controller 2083 and a local slave enforcement controller 3063, as illustrated in
Referring to
At step 504, identification information is obtained by master enforcement controller 2083 regarding the distributed resource. In some embodiments, obtaining the identification information involves identifying which distributed resource is to be accessed. For example, identification information such as Transmission Control Protocol (TCP) connections, Internet Protocol (IP) addresses, and Ports may be obtained. The access task may require access to multiple distributed resources. In some embodiments, obtaining the identification information also involves determining which ones of the hosts 102 are involved in providing the one or more distributed resources required by the access task. These may be identified by, for example, IP address. Identification information may include one or more of connections, flows, and/or requests between the distributed resource and the access task running on host 1023. In some embodiments, identification information includes process ID, connection socket ID, and port ID.
At step 506, the master enforcement controller 2083 transmits an enforcement command to other hosts 102 of the distributed system 100 in order to locally enforce, on each recipient host 102, a resource quota associated with the distributed resource. The enforcement command is sent to all hosts 102 involved in the one or more distributed resource required for the access task run by host 1023. The enforcement command includes the identification information obtained by the master enforcement controller 2083 in step 504, and is received by an enforcement agent of a corresponding host. The enforcement command is configured to trigger a slave enforcement controller on each corresponding host, for locally enforcing the resource quota. For example, local resource enforcement may involve limiting a corresponding TCP connection rate.
In some embodiments, the master enforcement controller 2083 continues to monitor the task performed by host 1023, as per step 508, and will send updated information to involved hosts 102 as required, as per step 510. For example, if the requirements regarding the distributed resource change, an adjusted resource quota is provided to each of the involved hosts 102.
The change in requirements may be detected by the master enforcement controller 2083 or by any of the slave enforcement controllers of corresponding hosts 102. Indeed, each slave enforcement controller can monitor connections, requests, flow status, and usage. For example, if host 1021 and host 1022 each enforce a connection flow rate of 45 MB/s for a target bandwidth of 90 MB/s, and the connection to host 1022 is terminated, slave enforcement controller 3062 detects the terminated connection and transmits the information to master enforcement controller 2083. Master enforcement controller 2083 can then adjust the resource quota of host 1021 to 90 MB/s. Alternatively, master enforcement controller 2083 detects the terminated connection and transmits an adjusted resource quota to host 1021. In some embodiments, an adjusted resource quota is transmitted from the master enforcement controller 2083 to one or more other hosts 102 for reasons other than a terminated connection.
In some embodiments, step 508 involves determining that a new distributed resource is needed for the access task. Sending updated information 510 then involves identifying any additional hosts 102 involved in the new distributed resource and triggering slave enforcement controllers on each of the additional hosts 102 by repeating steps 502, 504, 506.
When the access task is completed, the method proceeds to step 512 and ends.
Note that each one of the hosts 102 may have more than one master enforcement controller associated with an enforcement agent. The hosts 102 may also have more than one slave enforcement controller associated with an enforcement agent. For example, each task of host 1023 may be assigned one master enforcement controller and multiple slave controllers, so that when multiple tasks are running on host 1023, multiple enforcement controllers are also running on host 1023.
Each computer program described herein may be implemented in a high level procedural or object oriented programming or scripting language, or a combination thereof, to communicate with a computer system. Alternatively, the programs may be implemented in assembly or machine language. The language may be a compiled or interpreted language. Each such computer program may be stored on a storage media or a device, for example a ROM, a magnetic disk, an optical disc, a flash drive, or any other suitable storage media or device. The computer program may be readable by a general or special-purpose programmable computer for configuring and operating the computer when the storage media or device is read by the computer to perform the procedures described herein. Embodiments of the system may also be considered to be implemented by way of a non-transitory computer-readable storage medium having a computer program stored thereon. The computer program may comprise computer-readable instructions which cause a computer, or more specifically the at least one processing unit of the computer, to operate in a specific and predefined manner to perform the functions described herein.
Computer-executable instructions may be in many forms, including program modules, executed by one or more computers or other devices. Generally, program modules include routines, programs, objects, components, data structures, etc., that perform particular tasks or implement particular abstract data types. Typically the functionality of the program modules may be combined or distributed as desired in various embodiments.
Various aspects of the present distributed system 100 may be used alone, in combination, or in a variety of arrangements not specifically discussed in the embodiments described in the foregoing and is therefore not limited in its application to the details and arrangement of components set forth in the foregoing description or illustrated in the drawings. For example, aspects described in one embodiment may be combined in any manner with aspects described in other embodiments. Although particular embodiments have been shown and described, it will be obvious to those skilled in the art that changes and modifications may be made without departing from this invention in its broader aspects. The appended claims are to encompass within their scope all such changes and modifications.
Number | Name | Date | Kind |
---|---|---|---|
8706798 | Suchter | Apr 2014 | B1 |
8954584 | Subbarayan | Feb 2015 | B1 |
9374787 | de Lind van Wijngaarden | Jun 2016 | B2 |
20020002578 | Yamashita | Jan 2002 | A1 |
20050033844 | Andrzejak | Feb 2005 | A1 |
20050138175 | Kumar | Jun 2005 | A1 |
20050193023 | Ismail | Sep 2005 | A1 |
20055193023 | Ismail | Sep 2005 | |
20050235288 | Yamakabe | Oct 2005 | A1 |
20070106636 | Sridharan | May 2007 | A1 |
20070174839 | Takahashi | Jul 2007 | A1 |
20090157928 | Riegebauer | Jun 2009 | A1 |
20090234941 | Ammerlaan | Sep 2009 | A1 |
20090300348 | Aciicmez | Dec 2009 | A1 |
20090320029 | Kottomtharayil | Dec 2009 | A1 |
20100119277 | Nakamichi | May 2010 | A1 |
20100153542 | Arimilli | Jun 2010 | A1 |
20110002275 | Shousterman | Jan 2011 | A1 |
20110106950 | Schlack | May 2011 | A1 |
20110255526 | Kaneko | Oct 2011 | A1 |
20120084342 | Brown | Apr 2012 | A1 |
20120131367 | Kamijima | May 2012 | A1 |
20120174106 | Seo | Jul 2012 | A1 |
20120235823 | Trock | Sep 2012 | A1 |
20120272237 | Baron | Oct 2012 | A1 |
20130061167 | Rhodes | Mar 2013 | A1 |
20140029531 | Chang | Jan 2014 | A1 |
20140208328 | Chen | Jul 2014 | A1 |
20140282046 | Gonsalves | Sep 2014 | A1 |
20140304352 | Chaudhary | Oct 2014 | A1 |
20140325307 | Resch | Oct 2014 | A1 |
20150161157 | Ishizaki | Jun 2015 | A1 |
20150195303 | Holden | Jul 2015 | A1 |
20150220363 | Cypher | Aug 2015 | A1 |
20150222546 | Van Phan | Aug 2015 | A1 |
20150254108 | Kurtzman | Sep 2015 | A1 |
20150334696 | Gu | Nov 2015 | A1 |
20150347181 | Myrick | Dec 2015 | A1 |
20150363237 | Finnie | Dec 2015 | A1 |
20150378993 | Eisler | Dec 2015 | A1 |
20160055241 | Gower | Feb 2016 | A1 |
20160072917 | Huang | Mar 2016 | A1 |
20160165087 | Okuda | Jun 2016 | A1 |
20160226250 | Fukubayashi | Aug 2016 | A1 |
20160226713 | Dellinger | Aug 2016 | A1 |
20160283366 | Nguyen Tien | Sep 2016 | A1 |
20170303175 | Chen | Oct 2017 | A1 |
Number | Date | Country |
---|---|---|
102243598 | Nov 2011 | CN |
103703724 | Apr 2014 | CN |
105471950 | Apr 2016 | CN |
Entry |
---|
Mace et al., “Retro: Targeted Resource Management in Multi-tenant Distributed Systems”, Proceedings of the 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI '15), USENIX Association, May 4-6, 2015, pp. 589-603. |
Van Do et al., “A Framework for Supporting the Bandwidth Enforcement of Reading from HDFS”, Jul. 13, 2015, Analysis, Design and Development of ICT systems (AddICT) Laboratory, Budapest University of Technology and Economics, Hungary, 22 pages. http://www.hit.bme.hu/˜dohoai/documents/HdfsTrafficControl.pdf. |
Number | Date | Country | |
---|---|---|---|
20170295220 A1 | Oct 2017 | US |