The present disclosure relates generally to a database calculation that utilizes parallel-computation of tasks in a directed acyclic graph.
In an enterprise or business system, an inventory management performed by a computerized inventory management system includes receiving of inventories such as raw materials, shipping orders, overhead costs, etc. These inventories may be in volumes and data relations between the raw materials, shipping orders, etc. may complicate a computation of even the simplest item that may be searched from the database.
For example, in manufacturing enterprises, there is a need to calculate production cost of their finished and semi-finished goods. Direct costs like raw materials and direct labors may be easily measured, because they are straightly posted to production orders. However, indirect overhead costs, such as storage cost, utility cost and administration cost, have to be calculated based on other factors such as, a percentage of the raw materials, a percentage of direct labor amounts, etc.
Disclosed herein are technologies related to database calculation that utilizes parallel-computation of tasks in a directed acyclic graph. In accordance with one aspect, dependency of tasks is converted into a directed acyclic graph that topologically orders the tasks into layers of tasks. A database calculation may be performed to compute in parallel the tasks in each layer of the layers of tasks.
This Summary is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
The Detailed Description references the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The same numbers are used throughout the drawings to reference like features and components.
Disclosed herein are technologies for a database calculation that utilizes parallel-computation of tasks in a directed acyclic graph. For example, a dependency of tasks in a particular datasheet is first identified. In this example, the identified dependency of tasks is converted or represented in the form of directed acyclic graph. The parallel-computation is then performed in accordance with a sequence that is defined by a topological ordering in the directed acyclic graph. Technologies herein may be applied to computing and mobile applications.
Although the inventory management 100 illustrates a relatively small number of variables on its datasheet, relations between the variables and relations between itemized contents (not shown) of the variables may increase the complexity of itemized calculations.
To illustrate a one rule by one rule calculation of the overhead cost per production order, the calculation logic may perform different processing for different row types on each row.
For example, for rows 202-2 to 202-6, the row 202-6 (i.e., material usage) utilizes the variables base row 204-2 and the overhead rate 206-2. The base row 204-2, for example, corresponds to a direct cost from a production order. The direct cost may be a sum or a total of one or more cost elements.
The overhead rate 206-2, for example, may be a percentage of the row 202-2. In this example, the overhead rate 206-2 has a range that is defined by the start-pointer row 210 and end-pointer row 212. As shown in the example costing sheet 200, the range of the overhead rate 206-2 includes the row 202-2 alone. To this end, the calculation of the material usage—row 202-6 is based on the overhead rate 206-2, which is further based upon the defined range row 202-2.
In another example, the calculation of the row 202-22 (i.e., cost of goods sold) utilizes the variables of the row 202-2 and the rows 202-8 to 202-20.
The base rows 204-4 and 204-6, for example, are amounts of wages and salaries, respectively. The base rows 204-4 and 204-6 may be derived directly from the production orders. In other words, the base rows 204-4 and 204-6 may be considered as independent variables.
On the other hand, the overhead rates 206-4 to 206-8, for example, are manufacturing overheads that are defined by the same range. For example, as shown, the calculation of the overhead rates 206-4 to 206-8 utilizes the range row 202-2 alone. As opposed to the overhead rate 206-12, the range is defined by the range row 202-16. The row 202-16 further depends upon the row 202-2 and as such, the calculation of the overhead rate 206-12 is now treated as multi-layered calculation. In other words, the overhead rate 206-12 is based on row 202-16, which is further based upon the row 202-2.
For a total row calculation, the sum of the row 202-6 and the row 202-22 may be summed up to arrive at the final result.
As further described below, the one rule by one rule calculation has a time complexity (e.g., Big O notation) of O (N*R), where N is a number of production orders, R is number of rows, and O characterizes, for example, processing time in relation to changes in input size.
Although the costing sheet 200 is illustrated as overhead cost calculation, the principle described herein may apply to different other group of calculation logics that need to process a set of data sequentially with embedded loops.
Suppose we have “N” production orders and the costing sheet 200 has “R” rules, then an imperative execution in the costing sheet 200 will have to loop “N*R” times. With the example directed acyclic graph 300, the “N*R” times represent the number of nodes while the dependency between the rules represent the edges. The possible number of edges in this example, following topological ordering feature of direct acyclic graphs, is about “N*R” squared.
For example, with two production orders P1 and P2 and five rules R1 to R5, the total number of nodes in the directed acyclic graph 300 is ten (i.e., 2*5). Furthermore, in this example, the number of possible edges or dependencies for the topological ordering in the directed acyclic graph 300 is about one hundred possible edges (i.e., ten squared).
With continuing reference to the example directed acyclic graph 300, the edges 321 to 346 represent the topological ordering from the nodes in each column to the nodes in each row. For example, the edge 321 represents the node P1R3306 (i.e., row node) to be dependent from the node P1R1302 (i.e., column node). In this example, the topological ordering dictates that the column node P1R1302 is first calculated prior to the calculation of the row node P1R3306.
In another example, the edge 322 represents the row node P1R3306 to be further dependent from the column node P1R2304. In other words, following again the topological ordering feature of the direct acyclic graph, the column node P1R2304 is first calculated prior to the calculation of the row node P1R3306. In this example and the preceding example above, the tasks in the nodes P1R1302 and P1R2304 may belong in the same layer and are capable of being processed at the same time or calculated in parallel.
In another example still, edge 324 represents the row node P1R4308 to be dependent from the column node P1R2304. In other words, following again the topological ordering feature of the directed acyclic graph, the column node P1R2304 is first calculated prior to the calculation of the row node P1R4308. Because the row nodes P1R3306 and P1R4308 are one depth away from the nodes in the topological ordering, then parallel-processing may be implemented on the row nodes P1R3306 and P1R4308. In other words, the row nodes P1R3306 and P1R4308 may be treated to belong in the same layer in the directed acyclic graph 300.
As further described below, the nodes and edges of the directed acyclic graph 300 may further be illustrated in circles and unidirectional lines, respectively.
In an implementation, a topological sorting algorithm may identify which node or nodes belong to a particular layer. For example, following the directed acyclic graph 300 in
After the first loop (i.e., first layer 402) is calculated, then the next second loop or second layer 404 is processed. The second layer 404 is defined by the node or nodes that become independent node or nodes after removing the calculated node or nodes from the directed acyclic graph 300.
For example, after calculating in parallel the nodes P1R1302, P1R2304, P2R1312, and P2R2314 in the first loop above, the nodes P1R3306, P1R4308, P2R3316, and P2R4318 become independent nodes. These new independent nodes P1R3306, P1R4308. P2R3316, and P2R4318 define the next layer (i.e., second layer 404) and are processed at the same time or computed in parallel. As opposed to the nodes P1R5310 and P2R5320, these two nodes are not independent after the calculation of the first layer 402 above.
Finally, after the first layer 402 and the second layer 404 are calculated, the third layer 406 (i.e., third loop) is defined by the node or nodes that become independent node or nodes after removing the calculated node or nodes from the directed acyclic graph 300.
For example, with the nodes in the first layer 402 and the nodes in the second layer 404 taken out of the picture, the remaining nodes P1R5310 and P2R5320 become independent nodes. In this example, the nodes P1R5310 and P2R5320 may be processed in parallel or at the same time.
As
For example, in a costing sheet with N production orders and R number of costing rules, an internal parallelism may be generated after deriving the directed acyclic graph of the costing sheet. The internal parallelism, for example, may focus on the costing rules that are independent rules at the start of the loop or after a certain loop has been calculated in parallel.
After the first layer or first loop (i.e., rules R1500, R2502, and R3504) is processed in parallel and the first layer or first loop is taken out of the picture of
At 602, identifying dependency of tasks is performed. For example, for rows 202-2 to 202-6 of the costing sheet 200, the row 202-6 (i.e., material usage) utilizes the variables base row 204-2 and the overhead rate 206-2. The base row 204-2, for example, corresponds to a direct cost from a production order while the overhead rate 206-2, for example, may be a percentage of the row 202-2.
At 604, the identified dependency of tasks into is converted into a directed acyclic graph. For example, the identified dependency of tasks is represented in a topological order. That is, a task in one node may be utilized as a dependent of another node but not symmetrically. The directed acyclic graph (e.g., directed acyclic graph 300) may facilitate the identification of the tasks by its depth of dependency.
At 606, a database calculation is performed. For example, the directed acyclic graph 300 is divided by layers (e.g., first layer 402, second layer 404, etc.) and each layer may be calculated in parallel. As an example, the database calculation may be implemented, by SAP High Performance Analytic Appliance (SAP HANA) software that utilizes an in-memory, column oriented, relational database management.
At 702, a first set of nodes is selected. For example, the first set of nodes are the node or nodes at outermost vertices (i.e., first layer 402) of the directed acyclic graph 300. In this example, the first set of nodes are independent nodes.
For example, as illustrated in
At 704, parallel-computation on the selected first set of nodes is performed. For example, referring again to the first layer 402 of
At 706, a second set of nodes is selected. For example, the second set of nodes are the independent node or nodes after taking out the calculated first set of nodes from the directed acyclic graph 300. In this example, the second set of nodes are independent by themselves.
For example, as shown in
At 708, parallel-computation on the second set of nodes is performed. The second set of nodes may refer to independent node or nodes in a subsequent layer after the preceding layer has been calculated. For example, after the second set of nodes is calculated and as a result, another set of node or nodes become independent node or nodes, then the new independent node or nodes are treated as the second set of nodes.
At 710, a determination is performed to determine if the second set of nodes is a root node. The root node, for example, is the last independent node or nodes in the directed acyclic graph 300.
If the second set of nodes is a root node, following the “YES” branch of 710, at 712, the database calculation is ended. If the next set of nodes is not the root node, then following the “NO” branch of 710, 706 is performed. As discussed in block 708, the second set of nodes may refer to the node or nodes that become independent as a result of the calculation of the previous second set of nodes.
Computer system 800 also includes a main memory 806, preferably Random Access Memory (RAM), containing possibly inter alia computer software and/or data 808.
Computer system 800 may also include a secondary memory 810. Secondary memory 810 may include, for example, a hard disk drive 812, a removable storage drive 814, a memory stick, etc. A removable storage drive 814 may comprise a floppy disk drive, a magnetic tape drive, an optical disk drive, a flash memory, or the like. A removable storage drive 814 reads from and/or writes to a removable storage unit 816 in a well-known manner. A removable storage unit 816 may comprise a floppy disk, a magnetic tape, an optical disk, etc. which is read by and written to by removable storage drive 814. As will be appreciated by persons skilled in the relevant art(s) removable storage unit 816 includes a computer usable storage medium 818 having stored therein possibly inter alia computer software and/or data 820.
In alternative implementations, secondary memory 810 may include other similar means for allowing computer programs or other instructions to be loaded into computer system 800. Such means may include, for example, a removable storage unit 824 and an interface 822. Examples of such means may include a program cartridge and cartridge interface (such as that found in video game devices), a removable memory chip (such as an Erasable Programmable Read-Only Memory (EPROM), or Programmable Read-Only Memory (PROM)) and associated socket, and other removable storage units 824 and interfaces 822 which allow software and data to be transferred from the removable storage unit 824 to computer system 800.
Computer system 800 may also include an input interface 826 and a range of input devices 828 such as, possibly inter alia, a keyboard, a mouse, etc.
Computer system 800 may also include an output interface 830 and a range of output devices 832 such as, possibly inter alia, a display, one or more speakers, etc.
Computer system 800 may also include a communications interface 834. Communications interface 834 allows software and/or data 838 to be transferred between computer system 800 and external devices. Communications interface 834 may include a modem, a network interface (such as an Ethernet card), a communications port, a Personal Computer Memory Card International Association (PCMCIA) slot and card, or the like. Software and/or data 838 transferred via communications interface 834 are in the form of signals which may be electronic, electromagnetic, optical, or other signals capable of being received by communications 834. These signals 836 are provided to communications interface 834 via a communications path 840. Communications path 840 carries signals and may be implemented using a wire or cable, fiber optics, a phone line, a cellular phone link, a Radio Frequency (RF) link or other communication channels.
As used in this document, the terms “computer-program medium,” “computer-usable medium,” and “computer-readable medium” generally refer to media such as removable storage unit 816, removable storage unit 824, and a hard disk installed in hard disk drive 812. Computer program medium and computer usable medium can also refer to memories, such as main memory 806 and secondary memory 810, which can be memory semiconductors (e.g. Dynamic Random Access Memory (DRAM) elements, etc.). These computer program products are means for providing software to computer system 800.
Computer programs (also called computer control logic) are stored in main memory 806 and/or secondary memory 810. Such computer programs, when executed, enable computer system 800 to implement the present technology described herein. In particular, the computer programs, when executed, enable processor 804 to implement the processes of aspects of the above. Accordingly, such computer programs represent controllers of the computer system 800. Where the technology described herein is implemented, at least in part, using software, the software may be stored in a computer program product and loaded into computer system 800 using removable storage drive 814, interface 822, hard disk drive 812 or communications interface 834.
The technology described herein may be implemented as computer program products comprising software stored on any computer useable medium. Such software, when executed in one or more data processing devices, causes data processing device(s) to operate as described herein. Embodiments of the technology described herein may employ any computer useable or readable medium, known now or in the future. Examples of computer useable mediums include, but are not limited to, primary storage devices (e.g., any type of random access memory), secondary storage devices (e.g., hard drives, floppy disks, Compact Disc Read-Only Memory (CD-ROM) disks, Zip disks, tapes, magnetic storage devices, optical storage devices, Microelectromechanical Systems (MEMS), and nanotechnological storage device, etc.).
A computing system may take the form of any combination of one or more of inter alia a wired device, a wireless device, a mobile phone, a feature phone, a smartphone, a tablet computer (such as for example an iPad™), a mobile computer, a handheld computer, a desktop computer, a laptop computer, a server computer, an in-vehicle (e.g., audio, navigation, etc.) device, an in-appliance device, a Personal Digital Assistant (PDA), a game console, a Digital Video Recorder (DVR) or Personal Video Recorder (PVR), a cable system or other set-top-box, an entertainment system component such as a television set, etc.
In the above description of exemplary implementations, for purposes of explanation, specific numbers, materials configurations, and other details are set forth in order to better explain the present invention, as claimed. However, it will be apparent to one skilled in the art that the claimed invention may be practiced using different details than the exemplary ones described herein. In other instances, well-known features are omitted or simplified to clarify the description of the exemplary implementations.
The inventors intend the described exemplary implementations to be primarily examples. The inventors do not intend these exemplary implementations to limit the scope of the appended claims. Rather, the inventor has contemplated that the claimed invention might also be embodied and implemented in other ways, in conjunction with other present or future technologies.
Moreover, the word “exemplary” is used herein to mean serving as an example, instance, or illustration. Any aspect or design described herein as exemplary is not necessarily to be construed as preferred or advantageous over other aspects or designs. Rather, use of the word “exemplary” is intended to present concepts and techniques in a concrete fashion. The term “technology,” for instance, may refer to one or more devices, apparatuses, systems, methods, articles of manufacture, and/or computer-readable instructions as indicated by the context described herein.
As used in this application, the term “or” is intended to mean an inclusive “or” rather than an exclusive “or.” That is, unless specified otherwise or clear from context, “X employs A or B” is intended to mean any of the natural inclusive permutations. That is, if X employs A; X employs B; or X employs both A and B, then “X employs A or B” is satisfied under any of the foregoing instances. In addition, the articles “a” and “an” as used in this application and the appended claims should generally be construed to mean “one or more,” unless specified otherwise or clear from context to be directed to a singular form.
Note that the order in which the processes are described is not intended to be construed as a limitation, and any number of the described process blocks can be combined in any order to implement the processes or an alternate process. Additionally, individual blocks may be deleted from the processes without departing from the spirit and scope of the subject matter described herein.
One or more embodiments described herein may be implemented fully or partially in software and/or firmware. This software and/or firmware may take the form of instructions contained in or on a non-transitory computer-readable storage medium. Those instructions may then be read and executed by one or more processors to enable performance of the operations described herein. The instructions may be in any suitable form, such as but not limited to source code, compiled code, interpreted code, executable code, static code, dynamic code, and the like. Such a computer-readable medium may include any tangible non-transitory medium for storing information in a form readable by one or more computers, such as but not limited to read only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; a flash memory, etc.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2014/072051 | 2/13/2014 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2015/120603 | 8/20/2015 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5546564 | Horie | Aug 1996 | A |
5574900 | Huang et al. | Nov 1996 | A |
6282708 | Augusteijn | Aug 2001 | B1 |
6308166 | Breuker et al. | Oct 2001 | B1 |
6499023 | Dong | Dec 2002 | B1 |
7324936 | Saldanha et al. | Jan 2008 | B2 |
7370325 | Hull | May 2008 | B1 |
7702482 | Graepel et al. | Apr 2010 | B2 |
7945489 | Weiss et al. | May 2011 | B2 |
8099383 | Naibo et al. | Jan 2012 | B2 |
8126870 | Chowdhuri et al. | Feb 2012 | B2 |
8185908 | Taniguchi et al. | May 2012 | B2 |
8280915 | Jagannath et al. | Oct 2012 | B2 |
8347250 | Gonzalez et al. | Jan 2013 | B2 |
8359292 | Morrison et al. | Jan 2013 | B2 |
8417732 | Rapp | Apr 2013 | B2 |
8504791 | Cheriton et al. | Aug 2013 | B2 |
8626740 | Harinarayan et al. | Jan 2014 | B1 |
8631416 | Rodriguez | Jan 2014 | B2 |
20020069103 | Puri et al. | Jun 2002 | A1 |
20030115545 | Hull | Jun 2003 | A1 |
20050120010 | Philpott et al. | Jun 2005 | A1 |
20080120129 | Seubert et al. | May 2008 | A1 |
20090254774 | Chamdani et al. | Oct 2009 | A1 |
20120109934 | Weyerhaeuser et al. | May 2012 | A1 |
20120323971 | Pasupuleti | Dec 2012 | A1 |
20120324472 | Rossbach | Dec 2012 | A1 |
20130024645 | Cheriton et al. | Jan 2013 | A1 |
20130046743 | Gruschko et al. | Feb 2013 | A1 |
20130086495 | Guzansky et al. | Apr 2013 | A1 |
20130152057 | Ke | Jun 2013 | A1 |
20130159285 | Dees et al. | Jun 2013 | A1 |
20130226318 | Procyk et al. | Aug 2013 | A1 |
20130290293 | Weyerhaeuser et al. | Oct 2013 | A1 |
20130290298 | Weyerhaeuser et al. | Oct 2013 | A1 |
20130298130 | Pienaar | Nov 2013 | A1 |
20140013318 | Rychikhin | Jan 2014 | A1 |
20140282572 | Kang | Sep 2014 | A1 |
20140359563 | Xie | Dec 2014 | A1 |
20150067088 | Guerin | Mar 2015 | A1 |
Number | Date | Country |
---|---|---|
102207883 | Oct 2011 | CN |
103077006 | May 2013 | CN |
Number | Date | Country | |
---|---|---|---|
20150234935 A1 | Aug 2015 | US |