This application claims the benefit of Taiwan application Serial No. 109129269, filed Aug. 27, 2020, the disclosure of which is incorporated by reference herein in its entirety.
The disclosure relates in general to a learning-based resource allocation method, a learning-based resource allocation system and a user interface.
Along with the rapid development in culture and economy, supply chains have become an inextricable part in the industries. The industries are facing problems of the logistics time being too lengthy, the outsourcing system lacking an efficient management mode, and the supply chains having scheduling difficulty due to the multi-plant or multi-equipment arrangement. Currently, the feedback of production progress in the supply chains still depends on manual control, and therefore is inaccurate and cannot be provided in a real-time manner. Besides, abnormalities are complicated and hard to resolve. Therefore, resource allocation is getting more and more important.
The allocation of production resources is a non-deterministic polynomial-time hardness (NP Hard) problem. Many research personnel used to resolve the above problems using one single algorithm, such as the multi-objective algorithm. However, the multi-objective algorithm has a low convergence speed, and takes more computing time to obtain an optimal solution.
The disclosure is directed to a learning-based resource allocation method, a learning-based resource allocation system and a user interface.
According to one embodiment, a learning-based resource allocation method is provided. The learning-based resource allocation method includes the following steps. Several setting contents of several resources applicable to several batch number products are obtained from an available resource database. Several resource allocation solutions are obtained. Each of the resource allocation solutions is a combination of the batch number products and the setting contents and is classified in an excellent group or an inferior group. The setting contents corresponding to a first part of the resource allocation solutions belonging to the inferior group are changed using a first algorithm. The setting contents corresponding to a second part of the resource allocation solutions belonging to the inferior group are changed using a second algorithm. The first algorithm is different from the second algorithm. An optimal resource allocation solution is obtained according to the resource allocation solutions which are updated.
According to another embodiment, a learning-based resource allocation system is provided. The learning-based resource allocation system includes a data acquisition device, a knowledge learning device and an output device. The data acquisition device includes an available resource database and an allocation unit. The available resource database records several setting contents of several resources applicable to several batch number products. The allocation unit is configured to obtain several resource allocation solutions. Each of the resource allocation solutions is a combination of the batch number products and the setting contents. Each of the resource allocation solutions is classified in an excellent group or an inferior group. The knowledge learning device includes a first calculation unit and a second calculation unit. The first calculation unit is configured to change the setting contents a first part of the resource allocation solutions belonging to the inferior group using a first algorithm. The second calculation unit is configured to change the setting contents corresponding to a second part of the resource allocation solutions belonging to the inferior group using a second algorithm. The first algorithm is different from the second algorithm. The output device is configured to obtain an optimal resource allocation solution is obtained according to the resource allocation solutions which are updated.
According to an alternative embodiment, a user interface is provided. The user interface includes a parameter setting window, a resource allocation result window and a resource allocation suggestion window. The parameter setting window is configured to select an available resource database, which records several setting contents of several resources applicable to several batch number products. The resource allocation result window is configured to output an optimal resource allocation solution, which is a combination of the batch number products and the setting contents. The resource allocation suggestion window is configured to output a heat map, which records the number of times of positive improvements of the resources when several resource allocation solutions are changed.
The above and other aspects of the invention will become better understood with regard to the following detailed description of the preferred but non-limiting embodiment(s). The following description is made with reference to the accompanying drawings.
In the following detailed description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the disclosed embodiments. It will be apparent, however, that one or more embodiments may be practiced without these specific details. In other instances, well-known structures and devices are schematically shown in order to simplify the drawing.
Referring to
Referring to
The learning-based resource allocation system 1000 can perform two machine learning algorithms through the knowledge learning device 200 to improve the efficiency of machine learning. Additionally, the learning-based resource allocation system 1000 can provide concrete information through the knowledge conversion device 500 as a reference for the operator to perform resource allocation. The calculations of each of the above elements are disclosed below with an accompanying flowchart.
Referring to
Next, the method proceeds to step S120, several resource allocation solutions (such as resource allocation solutions RA_1 to RA_10) are obtained by the allocation unit 120 of the data acquisition device 100. Each of the resource allocation solutions RA_1 to RA_10 is a combination of the batch number products BN and the setting contents SC. Refer to Table 2, setting contents corresponding to the resource allocation solution RA_1 are shown. In initial, the resource allocation solution RA_1, the setting content SC corresponding to each of the batch number products BN is randomly selected. In the resource allocation solution RA_1 of Table 2, a fifth setting content SC corresponding to the 1st batch number product BN is randomly selected, a 2nd setting content SC corresponding to the second batch number product BN is randomly selected, an 8-th setting content SC corresponding to the 3rd batch number product BN is randomly selected, and the rest can be obtained by the same analogy.
Referring to
After the resource allocation solutions RA_1 to RA_10 are obtained, the setting contents SC corresponding to the resource allocation solutions RA_5 to RA_10 belonging to the inferior group G2 are optimized.
Then, the method proceeds to step S130, the setting contents SC corresponding to a first part (such as resource allocation solutions RA_5 to RA_6) of the resource allocation solutions RA_5 to RA_10 belonging to the inferior group G2 are changed by a first calculation unit 210 of the knowledge learning device 200 using a first algorithm, and the setting contents SC corresponding to a second part (such as resource allocation solutions RA_7 to RA_10) of the resource allocation solutions RA_5 to RA_10 belonging to the inferior group G2 are changed by a second calculation unit 220 of the knowledge learning device 200 using a second algorithm. The first algorithm is different from the second algorithm. In the present step, the setting contents SC corresponding to all resource allocation solutions RA_5 to RA_10 belonging to the inferior group G2 are changed.
In the present embodiment, the knowledge learning device 200 performs the first algorithm and the second algorithm by way of collaborative learning.
Referring to
The Q value QV is obtained according to the following formula (1):
Wherein, wm represents an original setting content SC, w′m, represents an updated setting content SC, and F(w′m-wm) represents the degree of improvement.
In terms of the resource allocation solution RA_5, the largest Q value QV (marked by stars) corresponds to the resource allocation solution RA_1. That is, in terms of the resource allocation solution RA_5, the largest degree of improvement can be obtained if the setting contents are changed with reference to the resource allocation solution RA_1.
Then, the first calculation unit 210 randomly selects N batch number products BN (such as the 3rd batch number product BN, the 11-th batch number product BN, and the 22nd batch number product BN), and changes the setting contents SC corresponding to the resource allocation solution RA_5 with reference to the setting contents SC corresponding to the resource allocation solution RA_1.
Similarly, in terms of the resource allocation solutions RA_6, the largest degree of improvement can be obtained if the setting contents are changed with reference to the resource allocation solution RA_4.
Referring to
After the setting contents SC corresponding to the resource allocation solutions RA_5 to RA_10 are changed, the resource allocation solutions RA_1 to RA_10 are re-sorted. For example, the resource allocation solution RA_5 may ascend by an order and be classified in the excellent group G1, the resource allocation solutions RA_4 may descend by one order and be classified in the inferior group G2. In the next calculation, only the setting contents SC corresponding to the resource allocation solutions RA_4, RA_6 to RA_10 belonging to the inferior group G2 are changed.
The second algorithm is the evolutionary algorithm which is mainly for enabling the learning process to be converged to the global optimal solution but has a slow converging speed. The first algorithm is the re-enforce learning algorithm, which is capable of accumulating the optimization experience to increase the converging speed but may converge to a local optimal solution. The resource allocation method of the present disclosure uses both the first algorithm and the second algorithm, and therefore possesses the strengths of both algorithms, not only enabling the learning process to converge to the global optimal solution, but also increasing the converging speed.
Then, the method proceeds to step S140, the Q matrix QM in the improvement knowledge database 230 is updated so that the first algorithm can be performed again. Regardless of the setting contents corresponding to the resource allocation solutions RA_5 to RA_10 being changed using the first algorithm or the second algorithm, corresponding values in the Q matrix QM are updated. Referring to
In the above calculation, the resource allocation solutions RA_5 to RA_6 belonging to the inferior group G2 use the first algorithm, and the resource allocation solutions RA_7 to RA_10 belonging to the inferior group G2 use the second algorithm. That is, the ratio of the first part to the second part is 2:4. In an embodiment, the ratio of the first part to the second part can be gradually adjusted. The first part and the second part can be adjusted according to the first number a of positive improvement of the resource allocation solutions using the first algorithm and the second number b of positive improvement of the resource allocation solutions using the second algorithm. For example, the first part and the second part can be adjusted according to the ratio of 1/a:1/b. Given that the first number a of positive improvement is 1 and the second number b of positive improvement is 2, the ratio of the first part to the second part is adjusted to be 1/1:1/2=2:1. Next time when the first algorithm and the second algorithm are performed, the resource allocation solutions RA_5 to RA_8 belonging to the inferior group G2 will use the first algorithm, and the resource allocation solutions RA_9 to RA_10 belonging to the inferior group G2 will use the second algorithm.
Then, the method proceeds to step S150, whether the convergence condition is met is determined. The convergence condition is, for example, the cost reduction in the optimal resource allocation solution RA_1 is lower than a predetermined value. If the convergence condition is met, then the method proceeds to step S170; otherwise, the method proceeds to step S160 and returns to step S130, the calculation is performed again (in an embodiment, step S160 can be omitted and the method directly returns to step S130).
In step S160, after the setting contents SC corresponding to the resource allocation solutions RA_1 to RA_10 are changed, statistics of the number of times of positive improvements of the resources RS are collected by the knowledge conversion device 500 to obtain a heat map (such as the heat map MP of
As indicated in the heat map MP of
Then, the method proceeds to step S170, an optimal resource allocation solution is obtained by the output device 400 according to the resource allocation solutions RA_1 to RA_10 which are updated. After the setting contents SC corresponding to the resource allocation solutions RA_1 to RA_10 are changed, the performances of the resource allocation solutions are no longer ranked in a descending order from the resource allocation solution RA_1 to the resource allocation solution RA_10. Meanwhile, the outputted optimal resource allocation solution is the first resource allocation solution outputted according to the last ranking result.
Referring to
Referring to Table 3, cost changes in a steel factory using the present embodiment are shown. The cost changes show that the learning-based allocation method of the present disclosure significantly reduces the cost.
Referring to
According to the above embodiments, the learning-based allocation method and the learning-based resource allocation system 1000 using the same can perform two machine learning algorithms to increase the efficiency of machine learning. Besides, the heat map MP can provide concrete information as a reference for the operator to perform resource allocation.
It will be apparent to those skilled in the art that various modifications and variations can be made to the disclosed embodiments. It is intended that the specification and examples be considered as exemplary only, with a true scope of the disclosure being indicated by the following claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
109129269 | Aug 2020 | TW | national |