The subject matter disclosed herein relates to software defined storage and more particularly relates to deploying a validated software defined solution.
Software Defined Storage (SDS) solutions are often manually selected, deployed, and tuned.
An apparatus for deploying a validated SDS solution is disclosed. The apparatus includes a processor and a memory that stores code that is executable by the processor. The processor generates one or more desired Software SDS parameters for an SDS deployment. In addition, the processor identifies a validated SDS solution from the SDS repository that satisfies a filter threshold for the SDS parameters. In response to identifying the validated SDS solution, the processor deploys the validated SDS solution. A method and computer program product also perform the functions of the apparatus.
In order that the advantages of the embodiments of the invention will be readily understood, a more particular description of the embodiments briefly described above will be rendered by reference to specific embodiments that are illustrated in the appended drawings. Understanding that these drawings depict only some embodiments and are not therefore to be considered to be limiting of scope, the embodiments will be described and explained with additional specificity and detail through the use of the accompanying drawings, in which:
Reference throughout this specification to “one embodiment,” “an embodiment,” or similar language means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment. Thus, appearances of the phrases “in one embodiment,” “in an embodiment,” and similar language throughout this specification may, but do not necessarily, all refer to the same embodiment, but mean “one or more but not all embodiments” unless expressly specified otherwise. The terms “including,” “comprising,” “having,” and variations thereof mean “including but not limited to” unless expressly specified otherwise. An enumerated listing of items does not imply that any or all of the items are mutually exclusive and/or mutually inclusive, unless expressly specified otherwise. The terms “a,” “an,” and “the” also refer to “one or more” unless expressly specified otherwise.
Furthermore, the described features, advantages, and characteristics of the embodiments may be combined in any suitable manner. One skilled in the relevant art will recognize that the embodiments may be practiced without one or more of the specific features or advantages of a particular embodiment. In other instances, additional features and advantages may be recognized in certain embodiments that may not be present in all embodiments.
The present invention may be a system, a method, and/or a computer program product. The computer program product may include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention.
The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium includes the following: a portable computer diskette, a hard disk, a random access memory (“RAM”), a read-only memory (“ROM”), an erasable programmable read-only memory (“EPROM” or Flash memory), a static random access memory (“SRAM”), a portable compact disc read-only memory (“CD-ROM”), a digital versatile disk (“DVD”), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.
Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network may comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device.
Computer readable program instructions for carrying out operations of the present invention may be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++ or the like, and conventional procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) may execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.
Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.
These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions may also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks.
The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational steps to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.
Many of the functional units described in this specification have been labeled as modules, in order to more particularly emphasize their implementation independence. For example, a module may be implemented as a hardware circuit comprising custom VLSI circuits or gate arrays, off-the-shelf semiconductors such as logic chips, transistors, or other discrete components. A module may also be implemented in programmable hardware devices such as field programmable gate arrays, programmable array logic, programmable logic devices or the like.
Modules may also be implemented in software for execution by various types of processors. An identified module of program instructions may, for instance, comprise one or more physical or logical blocks of computer instructions which may, for instance, be organized as an object, procedure, or function. Nevertheless, the executables of an identified module need not be physically located together, but may comprise disparate instructions stored in different locations which, when joined logically together, comprise the module and achieve the stated purpose for the module.
The computer program product may be deployed by manually loading directly in the client, server and proxy computers via loading a computer readable storage medium such as a CD, DVD, etc., the computer program product may be automatically or semi-automatically deployed into a computer system by sending the computer program product to a central server or a group of central servers. The computer program product is then downloaded into the client computers that will execute the computer program product. Alternatively, the computer program product is sent directly to the client system via e-mail. The computer program product is then either detached to a directory or loaded into a directory by a button on the e-mail that executes a program that detaches the computer program product into a directory. Another alternative is to send the computer program product directly to a directory on the client computer hard drive. When there are proxy servers, the process will, select the proxy server code, determine on which computers to place the proxy servers' code, transmit the proxy server code, then install the proxy server code on the proxy computer. The computer program product will be transmitted to the proxy server and then it will be stored on the proxy server.
The computer program product, in one embodiment, may be shared, simultaneously serving multiple customers in a flexible, automated fashion. The computer program product may be standardized, requiring little customization and scalable, providing capacity on demand in a pay-as-you-go model.
The computer program product may be stored on a shared file system accessible from one or more servers. The computer program product may be executed via transactions that contain data and server processing requests that use Central Processor Unit (CPU) units on the accessed server. CPU units may be units of time such as minutes, seconds, hours on the central processor of the server. Additionally, the accessed server may make requests of other servers that require CPU units. CPU units are an example that represents but one measurement of use. Other measurements of use include but are not limited to network bandwidth, memory usage, storage usage, packet transfers, complete transactions etc.
When multiple customers use the same computer program product via shared execution, transactions are differentiated by the parameters included in the transactions that identify the unique customer and the type of service for that customer. All of the CPU units and other measurements of use that are used for the services for each customer are recorded. When the number of transactions to any one server reaches a number that begins to affect the performance of that server, other servers are accessed to increase the capacity and to share the workload. Likewise, when other measurements of use such as network bandwidth, memory usage, storage usage, etc. approach a capacity so as to affect performance, additional network bandwidth, memory usage, storage etc. are added to share the workload.
The measurements of use used for each service and customer are sent to a collecting server that sums the measurements of use for each customer for each service that was processed anywhere in the network of servers that provide the shared execution of the computer program product. The summed measurements of use units are periodically multiplied by unit costs and the resulting total computer program product service costs are alternatively sent to the customer and or indicated on a web site accessed by the customer which then remits payment to the service provider.
In one embodiment, the service provider requests payment directly from a customer account at a banking or financial institution. In another embodiment, if the service provider is also a customer of the customer that uses the computer program product, the payment owed to the service provider is reconciled to the payment owed by the service provider to minimize the transfer of payments.
Furthermore, the described features, structures, or characteristics of the embodiments may be combined in any suitable manner. In the following description, numerous specific details are provided, such as examples of programming, software modules, user selections, network transactions, database queries, database structures, hardware modules, hardware circuits, hardware chips, etc., to provide a thorough understanding of embodiments. One skilled in the relevant art will recognize, however, that embodiments may be practiced without one or more of the specific details, or with other methods, components, materials, and so forth. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of an embodiment.
The description of elements in each figure may refer to elements of proceeding figures. Like numbers refer to like elements in all figures, including alternate embodiments of like elements.
An SDS solution may provide flexible, highly configurable data storage for the customer. Unfortunately, the flexibility and configurability of the SDS solution may result in the deployment of SDS solutions that are prone to operational problems and failures. The embodiments described herein validate SDS solutions and make the validated SDS solutions available for deployment to customers as will be described hereafter.
The system 100 may deploy SDS solutions from the deployment portion over the network 150 to the customer portion. The network 150 may comprise the Internet, a wide-area network, a local area network, a Wi-Fi network, a mobile telephone network, and combinations thereof. The software image manager 135 may receive an SDS solution. The SDS deployment system 130 may deploy the SDS solution as directed by the customer administrator 125. Hardware and software elements of the SDS solution may be provided by the xCAT provisioning manager 140. The one or more deployed SDS solutions 145 may provide data storage for the customer. Notifications, error messages, and the like relating to the operation of the deployed SDS solutions 145 may be stored in the log data 160.
The SDS test system 105 may validate SDS solutions. In one embodiment, the SDS test system 105 employs the test suite 165 to validate SDS solutions. Validated SDS solutions may be stored in the SDS repository 110.
The SDS risk analyzer 115 may evaluate the risks of deploying an SDS solution. In one embodiment, the SDS risk analyzer 115 evaluates the risks of deploying an SDS solution that has not been validated by the SDS test system 105 using the test suite 165. Based on the evaluation of the risks, and un-validated SDS solution may be deployed.
The log data 160 and other information from the operation of the deployed SDS solutions 145 may be communicated to the SDS field data 155. The SDS field data 155 may be employed by the SDS risk analyzer 115 to evaluate the risk of deploying an SDS solution.
The component identifier 210 may uniquely identify the SDS component 205. The component identifier 210 may be an index, an alphanumeric string, a key in a key-value store, or any other similar type of indexing method. The hardware identifier 215 may identify one or more hardware devices. The hardware devices may be integral to the SDS component 205. Alternatively, the hardware devices may be prerequisites required by the SDS component 205.
The software prerequisites 220 may identify one or more software instances that are required by the SDS component 205. In one embodiment, the software prerequisites 220 identify one or more combinations of software instances that could each be employed by the SDS component 205.
The operating system identifier 225 identifies an operating system that is required by the SDS component 205. The operating system version 230 identifies one or more required versions of the operating system.
The driver identifier 233 identifies a software and/or firmware driver for the SDS component 205. The driver version 235 identifies one or more required versions of the driver.
The hardware devices, software instances, operating systems, and device drivers of the SDS component 205 may be referred to generically as elements. Thus, the SDS component 205 comprises a plurality of elements. One or more of the hardware identifier 215, software prerequisites 220, operating system identifier 225, operating system version 230, driver identifier 233, and driver version identifier 235 may have a NULL value that indicates that the element is not used and/or not required.
The discrepancy data 240 may record discrepancies, errors, problem reports, failures, and the like associated with the SDS solution 200 identified by the SDS solution identifier 201 and/or the SDS component 205 identified by the component identifier 210. In one embodiment, the discrepancy data 240 is calculated as a function of the failure data 247 such as hard failures and soft failures in the failure data 247 and a hard failure threshold and the soft failure threshold. In a certain embodiment, hard failures are failures that exceed the hard failure threshold. In addition, soft failures may be failures that exceed the soft failure threshold but do not exceed the hard failure threshold.
The performance data 245 may record one or more performance metrics associated with the SDS solution 200 identified by the SDS solution identifier 201 and/or the SDS component 205 identified by the component identifier 210. The failure data 247 is described in more detail in
The SDS parameters 175 may be specified by the customer administrator 125, the SDS risk analyzer 115, and/or a computer to provide a preliminary description of the model SDS solution 200 and/or the desired SDS solution 200. A filter threshold 209 may be created for the SDS parameters 175 and used to identify a validated SDS solution 200 as will be described hereafter.
In one embodiment, a trade-off analytics filter 211 may be calculate a deployment risk for the SDS parameters 175 and/or an SDS solution 200. The trade-off analytics function 211 may be calculated as a function of SDS components 205, discrepancy data 240, and performance data 245. In a certain embodiment, the trade-off analytics function 211 is further calculated as a function of failure data 247. In one embodiment, the trade-off analytics function 211 is a trade-off analytics Application Program Interface (API) such as the WATSON® trade-off analytics API. The trade-off analytics function 211 may be trained using training data.
The element identifier 255 may identify an SDS element 207 that is used in one or more SDS components 205. The element identifier 255 may be a software identifier, hardware model number, or combinations thereof. The element version 260 may specify a unique version of the SDS element 207. The element availability 265 may specify whether or not the SDS element 207 is available. For example, if a hard disk drive identified by the element identifier 255 with a model number specified by the element identifier 255 cannot be procured and/or cannot be deployed, the element availability 265 may be set to “unavailable.” However, if the hard disk drive can be procured and/or can be deployed, the element availability 265 may be set to “available.”
The raw failures 308 may comprise uncategorized failures from the log data 160. In one embodiment, the hard failures 310 record a number of raw failures 308 that exceed the hard failure threshold for the SDS solution 200 identified by the SDS solution identifier 201 and/or the SDS component 205 identified by the component identifier 210. The soft failures 315 may record a number of raw failures 308 that exceed the soft failure threshold for the SDS solution 200 identified by the SDS solution identifier 201 and/or the SDS component 205 identified by the component identifier 210.
The hard failure threshold 355 may specify one or more of a type of failure and/or a quantity of failures. When the hard failure threshold 355 is exceeded, a hard failure 310 may be identified. The soft failure threshold 360 may specify one or more of a type of failure and a quantity of failures. When the soft failure threshold 360 is exceeded, a soft failure 315 may be identified. In one embodiment, if both the hard failure threshold 355 and the soft failure threshold 360 are exceeded, a hard failure 310 is identified.
The method 500 starts, and in one embodiment, the processor 405 generates 505 a model SDS solution 200. In one embodiment, the model SDS solution 200 is generated 505 based on one or more desired SDS parameters 175 for a desired SDS solution 200 supplied by the customer administrator 125. In addition, the SDS risk analyzer 115 analyze the one or more desired SDS parameters 175 to generate 505 the model SDS solution 200. In one embodiment, the SDS risk analyzer 115 employs the neural network 401 to analyze the one or more desired SDS parameters 175 and generate the model SDS Solution 200.
The processor 405 may validate 510 the model SDS solution 200 using the test suite 165. In one embodiment, the SDS test system 105 autonomously performs the test suite 165 on the model SDS solution 200. If the model SDS solution 200 fails the test suite 165, the model SDS solution 200 may be iteratively modified until the model SDS solution 200 passes the test suite 165.
In one embodiment, the processor 405 may validate 510 the model SDS solution 200 using the availability matrix 250. The processor 405 may determine that each element of each SDS component 205 of the model SDS solution 200 is available for deployment. In a certain embodiment, the model SDS solution 200 must be both validated by the test suite 165 and the availability matrix 250 to be considered fully validated.
In response to validating the model SDS solution 200, the processor 405 may store the validated SDS solution 200 in the SDS repository 110 and the method 500 ends.
The method 600 starts, and in one embodiment, the processor 405 queries 605 a deployed SDS solution 145 for performance data 245. The processor 405 may query 605 the deployed SDS solution 145 through the network 150. In one embodiment, a query request includes one or more authorization credentials. In a certain embodiment, the processor 405 also queries 605 the deployed SDS solution 145 for failure data 247. The failure data 247 may be embodied in the log data 160.
The processor 405 further receives 610 the performance data 245 from the deployed SDS solution 145. In a certain embodiment, the performance data 245 is included in the log data 160. The processor 405 may store 615 the performance data 245. In one embodiment, the performance data 245 is stored 615 in the SDS field data 155.
The processor 405 may receive 620 the failure data 247. The failure data 247 may be included in the log data 160. The processor 405 may further calculate 625 the discrepancy data 240 from the failure data 247. In one embodiment, the processor 405 generates a training data set from the log data 160 that includes outputs of an SDS solution identifier 201 and a component identifier 210 for previous failure data 247, hard failures 310 for the previous failure data 247, and soft failures 315 for the previous failure data 247. The processor 405 may further train the neural network 401 using the training data set. The processor 405 may calculate 625 the discrepancy data 240 from the failure data 247 of the log data 160 by encoding the log data 160 and introducing the encoded log data 160 to the neural network 401.
In an alternative embodiment, the processor 405 may identify a discrete error message within the failure data 247. The processor 405 may further identify the SDS solution identifier 201 and the component identifier 210 from the error message. In one embodiment, the processor 405 calculate 625 whether a failure of the error message exceeds the hard failure threshold 355. If the failure exceeds the hard failure threshold 355, the processor 405 may identify a hard failure 310. In one embodiment, the calculation 625 stops after identifying one hard failure 310. The processor 405 may further calculate 625 whether the failure of the error message exceeds the soft failure threshold 360. If the failure exceeds the soft failure threshold 360 and does not exceed the hard failure threshold 355, the processor 405 may identify a soft failure 315.
In one embodiment, the processor 405 employs a heuristic algorithm that analyzes the discrete error message and two to five preceding error messages to calculate 625 the hard failures 310 and the soft failures 315. The processor 405 may store 630 the discrepancy data 240 to the SDS field data 155 and the method 600 ends.
The method 700 starts, and in one embodiment, the processor 405 may generate 705 one or more desired SDS parameters 175. The desired SDS parameters 175 may be communicated from the customer administrator 125 to the processor 405 and/or SDS repository 110. Alternatively, the desired SDS parameters 175 may be generated 705 based on a desired SDS solution 200.
The processor 405 may interrogate 710 the SDS repository 110. The SDS repository 110 may be interrogated 710 using the filter threshold 209. The filter threshold 209 may be applied to the SDS solutions 200 in the SDS repository 110. Alternatively, the SDS repository 110 may be interrogated 710 with the desired SDS parameters 175. In one embodiment, the desired SDS parameters 175 are used as indices to the SDS repository 110.
The processor 405 further identifies 715 a validated SDS solution 200 from the SDS repository 110. In a certain embodiment, the validated SDS solution 200 comprises each of the desired SDS parameters 175. Alternatively, the validated SDS solution 200 may include a greatest number of desired SDS parameters 175.
In one embodiment, the processor identifies 715 a validated SDS solution 200 from the SDS repository 110 that has no hard failures 310 and a number of soft failures 315 that is less than a deployment threshold. For example, the deployment threshold may be in the range of one to five soft failures 315.
In one embodiment, the processor 405 identifies 715 the validated SDS solution 200 that satisfies the filter threshold 209. The identification 715 of the validated SDS solution 200 using the filter threshold is described in more detail in
The processor 405 may determine 720 if the validated SDS solution 200 is available. In one embodiment, the processor 405 uses the availability matrix 250 to determine that each element 207 of each SDS component 205 of the validated SDS solution 200 is available. If each SDS element 207 or each SDS component 205 of the validated SDS solution 200 is not available, the processor 405 may generate 705 one or more desired SDS parameters 175.
If each element 207 of each SDS component 205 of the validated SDS solution 200 is available, the processor 405 may download 725 each software image of the validated SDS solution 200 to a landing zone of the software image manager 135. In addition, the processor may deploy 730 the validated SDS solution 200 by deploying the software images and the method 700 ends. In one embodiment, the SDS deployment system 130 and/or the xCAT provisioning manager 140 deploy 730 the software images.
The method 800 starts, and in one embodiment, the processor 405 determines 805 the filter threshold 209 from filter training data. The filter training data may be based on log data 160 and the resulting failure data 247, performance data 245, and/or discrepancy data 240. In one embodiment, the filter training data comprises SDS components 205 and SDS elements 207 associated with the component identifier 210 as inputs and the discrepancy data 240 and performance data 245 as outputs. The filter training data may be used to train a trade-off analytics API. Alternatively, the filter training data may be used to train the neural network 401. The trained API and/or the trained neural network 401 may comprise the filter threshold 209.
The processor 405 may generate 810 a soft score for each SDS solution 200 in the SDS repository 110. The soft score may comprise one or more of a discrepancy data forecast 240 and a performance data forecast 245. In one embodiment, each soft score is generated 810 by analyzing the SDS components 205 and SDS elements 207 of each SDS solution 200 against the filter threshold 209.
In one embodiment, the processor 405 identifies 815 the validated SDS solution 200 with the highest soft score that satisfies the filter threshold and the method 800 ends. The processor 405 may sort each SDS solution 200 in the SDS repository 110 and identify 815 the SDS solution 200 with the highest soft score.
The embodiments validate SDS solutions 200 and store the validated SDS solutions 200 in the SDS repository 110. In addition, the embodiments generate desired SDS parameters 175 for an SDS deployment and identify the validated SDS solution 200 from the SDS repository 110 that satisfies the filter threshold. As a result, the validated SDS solution 200 is more likely to be robust and perform as desired. In addition, the embodiments deploy the validated SDS solution 200 in response to identifying the validated SDS solution 200.
The embodiments may be practiced in other specific forms. The described embodiments are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.