COMPUTER-READABLE RECORDING MEDIUM STORING TEMPERATURE ADJUSTMENT PROGRAM, DATA PROCESSING APPARATUS, AND DATA PROCESSING METHOD

CROSS-REFERENCE TO RELATED APPLICATION

This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2023-112457, filed on Jul. 7, 2023, the entire contents of which are incorporated herein by reference.

FIELD

The embodiments discussed herein are related to a temperature adjustment program, a data processing apparatus, and a data processing method.

BACKGROUND

When searching for a solution to a combinatorial optimization problem, there is a method of converting the combinatorial optimization problem into an Ising model representing a behavior of a spin of a magnetic substance. The Ising model is represented by an Ising-type evaluation function for evaluating the solution to the combinatorial optimization problem. The Ising-type evaluation function includes a plurality of state variables and a plurality of weight values. A state of the Ising model is represented by values of the plurality of state variables. In the Ising-type evaluation function, a state variable is a binary variable having a value of 0 or 1 (or −1 or +1). The state variable may be denoted by a bit. The value of the Ising-type evaluation function may also be referred to as energy of the Ising model.

Japanese Laid-open Patent Publication No. 2022-94510 and Japanese Laid-open Patent Publication No. 2020-46718 are disclosed as related art.

SUMMARY

According to an aspect of the embodiments, a non-transitory computer-readable recording medium stores a temperature adjustment program for causing a computer to execute a process including: acquiring an average value of values of an evaluation function obtained in a search processing by a replica circuit in which a first temperature value higher than a minimum temperature value by n (n is an integer of 1 or more) is set, among a plurality of replica circuits in which a plurality of temperature values different from each other are set during the search processing a plurality of times from the search processing that performs the search processing of a solution to a combinatorial optimization problem by a replica exchange method by using the plurality of replica circuits that correspond to a plurality of replicas of the evaluation function based on an Ising model obtained by converting the combinatorial optimization problem; changing the first temperature value based on a comparison result between a first average value among the average values acquired the plurality of times and a second average value acquired before the first average value; newly determining the plurality of temperature values by, while fixing a maximum temperature value and the changed first temperature value among the plurality of temperature values, changing other temperature values that include the minimum temperature value based on the maximum temperature value and the changed first temperature value; and setting the plurality of determined temperature values.

The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram illustrating an example of a data processing apparatus according to a first embodiment;

FIG. 2 is a flowchart illustrating an example of a procedure for adjusting a temperature value;

FIG. 3 is a schematic diagram illustrating an example of adjustment of T₀;

FIG. 4 is a diagram illustrating an example of a data processing apparatus according to a second embodiment;

FIG. 5 is a diagram illustrating an example of a temperature adjustment unit;

FIG. 6 is a flowchart illustrating a flow of an example of a procedure for adjusting a temperature value according to the second embodiment;

FIG. 7 is a diagram illustrating an example of calculation results of a combinatorial optimization problem; and

FIG. 8 is a diagram illustrating an example of hardware of a computer that is an example of the data processing apparatus.

DESCRIPTION OF EMBODIMENTS

A Markov-chain Monte Carlo (MCMC) method is used in the search for a solution. Hereinafter, the search for the solution by the MCMC method is referred to as MCMC processing. For example, in the MCMC processing, a state transition is accepted with an acceptance probability defined by a Metropolis method or a Gibbs method. At this time, a state transition that increases the value of the evaluation function is also stochastically permitted. The larger the increase in the value of the evaluation function, the lower the acceptance probability.

As one type of the MCMC method, there is a replica exchange method (also referred to as a parallel tempering method or the like). According to the replica exchange method, the MCMC processing using a plurality of temperature values is performed independently of each other by a plurality of replicas of the evaluation function based on the Ising model. For each certain number of trials, the values of the evaluation function obtained in each MCMC processing are compared with each other, and states for the two temperature values are exchanged with an appropriate exchange probability. Compared with a simulated annealing method in which the temperature value is gradually decreased, the replica exchange method reduces a possibility of being constrained to a local solution and enables an efficient search of the entire solution space.

Heretofore, there has been a method in which a minimum temperature value (hereafter, may also be referred to as a minimum temperature) and a maximum temperature value (hereafter, may also be referred to as a maximum temperature) among a plurality of temperature values in the replica exchange method are determined from information on a resolution of a value of an evaluation function.

At the time of solving the combinatorial optimization problem by using the replica exchange method, in a case where the minimum temperature is not appropriately set, the time until a solution is obtained may be very long. This is because, for example, when the minimum temperature is too low, a state transition accompanied by an increase in the value of the evaluation function hardly occurs, and once the search reaches a local solution, it may not be possible to escape from that local solution.

However, to determine an appropriate minimum temperature, a procedure of performing solution search processing by using a certain minimum temperature and adjusting the minimum temperature based on a result of the solution search processing is repeated. For this reason, there is a problem that it takes time to adjust the minimum temperature.

In one aspect, an object of the present disclosure is to provide a temperature adjustment program, a data processing apparatus, and a data processing method capable of shortening an adjustment time of a minimum temperature used in a replica exchange method.

Hereinafter, embodiments of the present disclosure will be described with reference to the drawings.

First Embodiment

FIG. 1 is a diagram illustrating an example of a data processing apparatus according to a first embodiment.

A data processing apparatus 10 includes a storage unit 11, a search unit 12, and a processing unit 13.

For example, the storage unit 11 is a volatile storage apparatus that is an electronic circuit such as a dynamic random-access memory (DRAM) or a non-volatile storage apparatus that is an electronic circuit such as a hard disk drive (HDD) or a flash memory. The storage unit 11 may include an electronic circuit such as a static random-access memory (SRAM) register.

For example, the storage unit 11 stores information on a combinatorial optimization problem to be calculated, calculation conditions, and the like. Various programs such as a temperature adjustment program may be stored in the storage unit 11.

For example, the search unit 12 may be implemented by using an electronic circuit such as an application-specific integrated circuit (ASIC) or a field-programmable gate array (FPGA). The search unit 12 may be implemented by software processing in which a processor that is hardware such as a central processing unit (CPU), a graphics processing unit (GPU), or a digital signal processor (DSP) executes a program.

The search unit 12 uses replica circuits 12a0, . . . , 12an, . . . , and 12aN corresponding to (N+1) replicas of an evaluation function based on an Ising model obtained by converting a combinatorial optimization problem to be calculated, and searches for a solution to the combinatorial optimization problem by a replica exchange method.

For example, the evaluation function based on the Ising model is defined by Expression (1) below.

$\begin{matrix} E(x) = - \sum_{\langle i, j \rangle}^{M} W_{ij} x_{i} x_{j} - \sum_{i = 1}^{M} b_{i} x_{i} & (1) \end{matrix}$

A first term on a right side is a sum obtained by adding up products of values of two state variables and a weight coefficient without missing and overlapping for all combinations of the two state variables selectable from all state variables included in the Ising model. x_i is an i-th state variable. x_j is a j-th state variable. W_ij is a weight coefficient indicating a weight (for example, strength of coupling) between the i-th state variable and the j-th state variable. M is a total number of state variables.

A second term on the right side is a total sum of products of a bias coefficient of each of all the state variables and a value of the state variable. b_i indicates a bias coefficient for the i-th state variable.

For example, “−1” of a spin in the Ising model corresponds to a value “0” of the state variable. “+1” of a spin in the Ising model corresponds to a value “1” of the state variable. Therefore, the state variable may also be referred to as a bit taking a value of 0 or 1.

A state when a minimum value among local minimum values of E(x) is obtained is an optimum solution. By changing a sign of each term on the right side of Expression (1), the search unit 12 may search for a state in which a value of E(x) is locally maximized (in this case, the state when the value of E(x) is the maximum value is the optimum solution). Hereinafter, the value of the evaluation function may also be referred to as energy.
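As an illustration of Expression (1), the following is a minimal sketch that evaluates E(x) for a small example. It assumes that the weight coefficients are given as a symmetric matrix with a zero diagonal and that each pair of state variables is counted once (i < j). The function name `ising_energy` and the example values of `W` and `b` are hypothetical and are not part of the embodiment.

```python
def ising_energy(W, b, x):
    """Evaluate E(x) = -sum_{i<j} W[i][j]*x[i]*x[j] - sum_i b[i]*x[i].

    W is assumed symmetric with a zero diagonal, so each pair (i, j)
    is counted exactly once by restricting the sum to i < j.
    """
    M = len(x)
    pair_term = sum(W[i][j] * x[i] * x[j]
                    for i in range(M) for j in range(i + 1, M))
    bias_term = sum(b[i] * x[i] for i in range(M))
    return -pair_term - bias_term


# Hypothetical 3-variable example (values chosen only for illustration).
W = [[0, 2, -1],
     [2, 0, 3],
     [-1, 3, 0]]
b = [1, -2, 0]

print(ising_energy(W, b, [1, 1, 0]))
print(ising_energy(W, b, [1, 1, 1]))
```

In this example, the state with all bits set to 1 has lower energy than the state [1, 1, 0], so it would be the preferred of the two candidates in a minimization search.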

Each of the replica circuits 12a0 to 12aN in which a plurality of temperature values different from each other are set repeats MCMC processing. For example, the MCMC processing includes processing of determining whether to permit a change of any bit of a plurality of bits, based on a comparison result between a change amount of the value of the evaluation function when a value of the bit is changed and a thermal noise value. The MCMC processing includes processing of causing, in a case where it is determined that the change in the value of a certain bit is permitted, a state transition by changing the value of the bit. Hereinafter, changing the value of the bit is referred to as a flip. The thermal noise value is obtained based on a temperature value and a random number value set for each of the replica circuits 12a0 to 12aN. As the temperature value increases, an amplitude of the thermal noise value increases.

The search unit 12 includes a temperature control unit 12b and an average energy calculation unit 12c.

The temperature control unit 12b sets (N+1) temperature values different from each other, which are obtained by the processing unit 13, for the replica circuits 12a0 to 12aN.

In the example illustrated in FIG. 1, T₀ is set in the replica circuit 12a0, T_n is set in the replica circuit 12an, and T_N is set in the replica circuit 12aN as a temperature value (T). T₀ < . . . < T_n < . . . < T_N holds. For example, T₀ is a minimum temperature, and T_N is a maximum temperature. Hereinafter, a replica corresponding to the replica circuit 12a0 to which the minimum temperature is set is referred to as a minimum temperature replica.

The temperature control unit 12b controls replica exchange in the replica circuits 12a0 to 12aN. Based on an exchange probability (p_ij) represented by Expression (2) below, the temperature control unit 12b determines whether to perform replica exchange for each pair of replica circuits having adjacent temperature values.

$\begin{matrix} p_{ij} = \exp \left( (E_{i} - E_{j}) \left( \frac{1}{k T_{i}} - \frac{1}{k T_{j}} \right) \right) & (2) \end{matrix}$

In Expression (2), E_i is energy corresponding to a state of a replica (hereafter, referred to as a replica i) corresponding to an i-th replica circuit among the replica circuits 12a0 to 12aN. E_j is energy corresponding to a state of a replica (hereafter, referred to as a replica j) corresponding to a j-th replica circuit. T_i is a temperature value set for the i-th replica circuit. T_j is a temperature value set for the j-th replica circuit. k is a Boltzmann constant. States of the replicas i and j are represented by M state variables x₁ to x_M.

With the probability of p_ij described above, the temperature control unit 12b causes the state of the replica i and the state of the replica j to be exchanged.

Instead of exchanging the states between the replicas i and j, the temperature control unit 12b may exchange T_i and T_j with the probability of p_ij described above.
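The exchange decision based on Expression (2) may be sketched as follows. This is only an illustration under stated assumptions: the value of Expression (2) is capped at 1 so that it can be used as a probability, the Boltzmann constant k defaults to 1, and the function names `exchange_probability` and `maybe_exchange` are hypothetical.

```python
import math
import random


def exchange_probability(E_i, E_j, T_i, T_j, k=1.0):
    """Value of Expression (2), capped at 1 (the cap is an assumption)."""
    exponent = (E_i - E_j) * (1.0 / (k * T_i) - 1.0 / (k * T_j))
    if exponent >= 0:
        return 1.0
    return math.exp(exponent)


def maybe_exchange(states, energies, temps, i, j, rng=random):
    """Exchange the states of replicas i and j with probability p_ij."""
    p = exchange_probability(energies[i], energies[j], temps[i], temps[j])
    if rng.random() < p:
        states[i], states[j] = states[j], states[i]
        energies[i], energies[j] = energies[j], energies[i]
        return True
    return False


# Hypothetical pair: the colder replica (T = 1.0) holds the higher energy,
# so the exponent is positive and the exchange always takes place.
states = [[1, 0, 1], [0, 1, 1]]
energies = [5.0, 3.0]
temps = [1.0, 2.0]
swapped = maybe_exchange(states, energies, temps, 0, 1)
print(swapped, states, energies)
```

When the colder replica already holds the lower energy, the exponent is negative and the exchange occurs only with probability exp(exponent), which keeps low-energy states at low temperature values most of the time.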

The average energy calculation unit 12c calculates an average value (hereafter, referred to as average energy) of values of the above-described evaluation function (E(x)) obtained in the search processing by the replica circuit 12an in which the temperature value (T_n) higher than the minimum temperature by n is set. n is an integer equal to or greater than 1. Although not particularly limited, for example, n may be a value of about 10% of the number of replicas N+1. An upper limit of T_n is T_(N−1). Accordingly, the upper limit of n is the number of replicas minus 2, that is, N−1.

For example, the average energy calculation unit 12c updates average energy every time the replica circuit 12an flips any value of x₁ to x_M.

For example, the processing unit 13 may be implemented by software processing in which a processor that is hardware such as a CPU, a GPU, or a DSP executes a program such as the temperature adjustment program stored in the storage unit 11. The processing unit 13 may be implemented by using an electronic circuit such as an ASIC or an FPGA.

The processing unit 13 has a function of adjusting a plurality of temperature values that are different from each other used for solution search by the replica exchange method.

FIG. 2 is a flowchart illustrating an example of a procedure for adjusting a temperature value.

Step S10: The processing unit 13 determines whether it is an adjustment cycle of a temperature value. For example, the processing unit 13 determines that it is an adjustment cycle of a temperature value every time replica exchange is performed a predetermined number of times in the search unit 12. By increasing the number of times of replica exchange serving as the adjustment cycle, adjustment may be performed in consideration of a wider search space, but an adjustment frequency decreases. For this reason, the adjustment cycle may be appropriately set in accordance with a difficulty level of the combinatorial optimization problem to be calculated, or the like.

Every time the MCMC processing is performed a predetermined number of times in the search unit 12, the processing unit 13 may determine that it is the adjustment cycle of the temperature value.

When the processing unit 13 determines that it is the adjustment cycle of the temperature value, the processing unit 13 performs processing in step S11. When the processing unit 13 determines that it is not the adjustment cycle of the temperature value, the processing unit 13 repeats the processing in step S10.

Step S11: The processing unit 13 acquires average energy from the search unit 12. The processing unit 13 holds the acquired average energy.

Step S12: The processing unit 13 compares average energy acquired last time with the average energy acquired this time.

Step S13: Based on the comparison result obtained in the processing of step S12, the processing unit 13 changes T_n. For example, the processing unit 13 changes T_n as follows.

First, for example, the processing unit 13 divides a range between an initial value of T_n input from a user and a maximum temperature determined in advance by a predetermined division number, and calculates a plurality of candidate values of T_n that have a plurality of steps of magnitude based on the division number. When the average energy acquired this time is equal to or smaller than the average energy acquired last time, the processing unit 13 changes T_n to a candidate value larger than the current T_n by one step. When the average energy acquired this time is larger than the average energy acquired last time, the processing unit 13 changes T_n to a candidate value smaller than the current T_n by one step.
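The processing of step S13 may be sketched as follows. The sketch makes several assumptions that are not stated in the embodiment: the candidate values are equally spaced, the maximum temperature itself is excluded from the candidates so that T_n stays below T_N, and the step is clamped at both ends of the candidate list. The names `candidate_values` and `next_index` are hypothetical.

```python
def candidate_values(t_n_init, t_max, divisions):
    """Equally spaced candidates for T_n from its initial value up to,
    but not including, the maximum temperature (assumption)."""
    step = (t_max - t_n_init) / divisions
    return [t_n_init + s * step for s in range(divisions)]


def next_index(idx, e_prev, e_now, n_candidates):
    """Move one step up when the average energy did not increase,
    otherwise one step down; clamp at the ends (assumption)."""
    if e_now <= e_prev:
        return min(idx + 1, n_candidates - 1)
    return max(idx - 1, 0)


# Hypothetical values: T_n starts at 1.0 and the maximum temperature is 5.0.
candidates = candidate_values(1.0, 5.0, 4)
idx = 0
idx = next_index(idx, e_prev=-10.0, e_now=-12.0, n_candidates=len(candidates))
print(candidates, candidates[idx])
```

Here the average energy decreased from −10.0 to −12.0, so T_n moves from the first candidate to the second one.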

Step S14: The processing unit 13 newly determines T₀ to T_N by, while fixing a maximum temperature value (T_N) and the changed T_n among T₀ to T_N, changing other temperature values that include T₀ based on T_N and the changed T_n.

For example, to cause replica exchange to occur with the same frequency at any temperature value, a range between T₀ and T_N may be divided by a ratio of an exponential function. In this case, a temperature value (T_m) other than T_n and T_N is given by Expression (3) below.

$\begin{matrix} T_{m} = T_{0} \exp (\frac{m}{N} \log \frac{T_{N}}{T_{0}}) (m = 0, \dots, N) & (3) \end{matrix}$

In Expression (3), T₀ is given by Expression (4) below.

$\begin{matrix} T_{0} = \exp [\frac{1}{N - n} (N \log T_{n} - n \log T_{N})] & (4) \end{matrix}$

As in Expression (4), T₀ is changed in accordance with T_N and T_n.
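Expressions (3) and (4) may be combined into a short routine such as the following sketch, which first obtains T₀ from the fixed T_n and T_N and then fills in the remaining temperature values. The function name `temperature_ladder` and the numerical values in the usage example are hypothetical.

```python
import math


def temperature_ladder(t_n, t_max, n, N):
    """Return [T_0, ..., T_N] from Expressions (3) and (4).

    T_N (= t_max) and T_n (= t_n) are fixed; T_0 follows from Expression (4)
    and every T_m is then placed on a geometric ladder by Expression (3).
    """
    if not 1 <= n <= N - 1:
        raise ValueError("n must satisfy 1 <= n <= N - 1")
    if not 0 < t_n < t_max:
        raise ValueError("temperatures must satisfy 0 < T_n < T_N")
    log_t0 = (N * math.log(t_n) - n * math.log(t_max)) / (N - n)
    t0 = math.exp(log_t0)
    log_ratio = math.log(t_max / t0)
    return [t0 * math.exp(m / N * log_ratio) for m in range(N + 1)]


# Hypothetical setting: 9 replicas (N = 8), n = 2, T_n = 0.9, T_N = 10.0.
ladder = temperature_ladder(t_n=0.9, t_max=10.0, n=2, N=8)
print([round(t, 4) for t in ladder])
```

Substituting m = n into Expression (3) with the T₀ of Expression (4) returns the fixed T_n, and m = N returns T_N, so the two fixed temperature values are reproduced by the ladder while T₀ and the other values move with T_n.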

A method of dividing the range between T₀ and T_N is not limited to the above-described example. When there is a more appropriate division method (for example, linear division or the like) from the problem information of the combinatorial optimization problem, the division method may be applied.

Step S15: The processing unit 13 sets the determined T₀ to T_N for the search unit 12.

With the above, one-time adjustment processing of the temperature value ends. Such adjustment processing is repeated until an end condition of the solution search processing is satisfied.

As described above, in the data processing apparatus 10, the search unit 12 uses the replica circuits 12a0 to 12aN corresponding to a plurality of replicas of an evaluation function based on an Ising model obtained by converting a combinatorial optimization problem, and performs search processing for a solution to the combinatorial optimization problem by the replica exchange method. During the search processing, the processing unit 13 acquires, from the search unit 12, average energy obtained in the search processing by the replica circuit 12an, in which a temperature value (T_n) higher than T₀ by n (n is an integer of 1 or more) is set, among the replica circuits 12a0 to 12aN, a plurality of times. Based on a result of comparison between first average energy among average energies acquired the plurality of times and second average energy acquired before the first average energy, the processing unit 13 changes T_n. The processing unit 13 newly determines T₀ to T_N by, while fixing the maximum temperature value (T_N) and the changed T_n among T₀ to T_N, changing other temperature values that include T₀ based on T_N and the changed T_n. The processing unit 13 sets the determined T₀ to T_N for the search unit 12.

To determine an appropriate T₀, in a case where a procedure of performing solution search processing using a certain T₀ and adjusting T₀ based on the result of the search processing is repeated, it takes time to adjust T₀. By contrast, in the data processing apparatus 10 according to the first embodiment, average energy to be used to determine new temperature values may be obtained during the solution search processing. For this reason, it is possible to determine new temperature values including T₀ during the solution search processing based on the average energy acquired a plurality of times. For example, since it is possible to adjust the temperature values including T₀ during the search processing without waiting for the end of the search processing, it is possible to shorten the adjustment time of T₀.

Replica exchange suppresses trapping of the search in a local solution. Thus, by setting the adjustment cycle to a predetermined number of times of replica exchange, it is possible to appropriately adjust T₀ in consideration of a wide search space.

A reason why the temperature values including T₀ are adjusted based on the average energy obtained in the search processing by the replica circuit 12an in which T_n is set will be described.

The manner in which the average energy of the minimum temperature replica changes with respect to T₀ is similar to that of the minimum energy. For this reason, there is a possibility that the adjustment of the optimum T₀ for obtaining smaller minimum energy may be performed based on the average energy of the minimum temperature replica. However, as a result of experiments, it has been found that, in the case of the method of adjusting T₀ based on the average energy of the minimum temperature replica, an initial value of T₀ takes a value closer to the optimum T₀ than T₀ after adjustment, depending on the combinatorial optimization problem.

FIG. 3 is a schematic diagram illustrating an example of adjustment of T₀. FIG. 3 illustrates an adjustment example of T₀ by a method (hereafter, referred to as a method of a comparative example) in which adjustment of T₀ is performed based on average energy of a minimum temperature replica, together with an adjustment example of T₀ in the method of the present embodiment.

In the example illustrated in FIG. 3, an initial value of T₀ is 0.1 in both of the methods. An optimum value of T₀ is 0.4. According to the method of the comparative example, the main T₀ after adjustment is 0.9. By contrast, in the method according to the present embodiment, the main T_n after adjustment is 0.9, and the main T₀ after adjustment is 0.6, and T₀ closer to the optimum value than that of the method of the comparative example is obtained.

As described above, even when T_n is adjusted to a value higher than the optimum T₀, (n−1) temperature values on the lower temperature side than T_n are set in (n−1) replica circuits, and search processing in the vicinity of the optimum T₀ is performed.

It may be expected that the data processing apparatus 10 as described above is useful as means for obtaining an accurate solution in a short period of time when solving various problems in modern society which may be converted into a combinatorial optimization problem.

Second Embodiment

FIG. 4 is a diagram illustrating an example of a data processing apparatus according to a second embodiment.

A data processing apparatus 20 according to the second embodiment includes a search unit 21, a temperature adjustment unit 22, and an overall control unit 23. The search unit 21 is an example of the search unit 12 illustrated in FIG. 1, and the temperature adjustment unit 22 and the overall control unit 23 are an example of the processing unit 13 illustrated in FIG. 1. An element corresponding to the storage unit 11 illustrated in FIG. 1 is not illustrated.

For example, the search unit 21 may be implemented by using an electronic circuit such as an ASIC or an FPGA. For example, the temperature adjustment unit 22 and the overall control unit 23 may be implemented by software processing in which a processor such as a CPU executes a program. The implementation is not limited to this, and a part or entirety of each of the search unit 21, the temperature adjustment unit 22, and the overall control unit 23 may be implemented by using the electronic circuit as described above. A part or entirety of each of the search unit 21, the temperature adjustment unit 22, and the overall control unit 23 may be implemented by software processing.

In the data processing apparatus 20 according to the second embodiment, the search unit 21 searches for a solution to a combinatorial optimization problem by a replica exchange method in which states are exchanged between replicas.

For example, as illustrated in FIG. 4, the search unit 21 includes the replica circuits 21a0, 21a1, . . . , 21an, . . . , and 21aN corresponding to (N+1) replicas. The search unit 21 includes a temperature control unit 21b.

Temperature values (T₀ to T_N) different from each other are set for the replica circuits 21a0 to 21aN. T₀ is set for the replica circuit 21a0, T₁ is set for the replica circuit 21a1, T_n is set for the replica circuit 21an, and T_N is set for the replica circuit 21aN. T₀ < T₁ < . . . < T_n < . . . < T_N holds. For this reason, a replica corresponding to the replica circuit 21a0 is a minimum temperature replica. Hereinafter, a replica corresponding to the replica circuit 21an is referred to as a replica n.

As an initial value of T_n, for example, a value input by the user may be used.

T_N, which is a maximum temperature, is calculated in advance. For example, the temperature adjustment unit 22 of the data processing apparatus 20 may calculate T_N in advance by the following method.

First, in a state where each temperature value from T₀ to T_N is fixed, the search unit 21 performs the MCMC processing a predetermined number of times using replica exchange. The temperature adjustment unit 22 acquires a state (hereafter, referred to as a local solution) corresponding to minimum energy obtained in each of the replica circuits 21a0 to 21aN. From the plurality of acquired local solutions, the temperature adjustment unit 22 selects two local solutions in ascending order of energy, for example. The temperature adjustment unit 22 calculates a change amount (ΔE) of a value of an evaluation function caused in a case where one value of a plurality of bits different from a bit string of another local solution among bit strings of one selected local solution is changed. This calculation processing is sequentially performed for each of the plurality of bits. In a case where an increase in energy occurs continuously a plurality of times, the temperature adjustment unit 22 calculates a total value of the energies increased in the plurality of times. Based on the total value, the temperature adjustment unit 22 calculates a maximum temperature (T_max) by, for example, Expression (5) below.

$\begin{matrix} T_{\max} = - \frac{Dsum}{\log (A)} & (5) \end{matrix}$

As described above, in a case where an increase in energy occurs continuously a plurality of times, Dsum is a total value of energies increased in the plurality of times. A is a parameter indicating a transition probability of accepting a state transition that causes a maximum energy increase, and is set in advance.

Even in a case where there is a large peak of energy that may not be crossed unless a plurality of continuous energy increases occur between the two local solutions, the use of T_max described above as T_N allows crossing of such an energy peak with a relatively high probability.
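The calculation of T_max by Expression (5) may be sketched as follows. The sketch rests on assumptions that go beyond the description: the differing bits are flipped in ascending index order, Dsum is taken as the largest total over any run of consecutive energy increases along that path, and the weight matrix is symmetric with a zero diagonal. The names `flip_delta` and `max_temperature` and the example problem are hypothetical.

```python
import math


def flip_delta(W, b, x, i):
    """Change of the evaluation function when bit i of state x is flipped."""
    h = sum(W[i][j] * x[j] for j in range(len(x))) + b[i]
    return -(1 - 2 * x[i]) * h


def max_temperature(W, b, sol_a, sol_b, A):
    """T_max = -Dsum / log(A), where Dsum is the largest total of
    consecutive energy increases on the path from sol_a to sol_b
    (path order and the use of the largest run are assumptions)."""
    x = list(sol_a)
    run = 0.0
    dsum = 0.0
    for i in range(len(x)):
        if x[i] == sol_b[i]:
            continue  # only bits that differ between the two solutions
        d = flip_delta(W, b, x, i)
        x[i] = 1 - x[i]
        run = run + d if d > 0 else 0.0
        dsum = max(dsum, run)
    if dsum == 0.0:
        raise ValueError("no energy increase on the path")
    return -dsum / math.log(A)


# Hypothetical problem with local solutions [0, 0, 0] and [1, 1, 1].
W = [[0, 1, 3],
     [1, 0, 3],
     [3, 3, 0]]
b = [-2, -2, -1]
t_max = max_temperature(W, b, [0, 0, 0], [1, 1, 1], A=0.25)
print(t_max)
```

In this example the path from [0, 0, 0] to [1, 1, 1] first raises the energy by 2 and then by 1 before it drops by 5, so Dsum is 3 and T_max is 3 / log(4).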

Initial values of temperature values other than T_n and T_N may be obtained from Expressions (3) and (4) described above.

Each of the replica circuits 21a0 to 21aN implements the solution search based on the evaluation function represented by Expression (1) by, for example, a circuit as illustrated in FIG. 4.

The replica circuit 21an corresponding to the replica n includes bit flip availability determination units 30a1, 30a2, . . . , and 30aM, a selector unit 30b, a holding unit 30c, and an average energy calculation unit 30d.

Processing of determining availability of flipping each bit included in the state of the replica n and flipping any bit determined to be flippable corresponds to one-time processing of the MCMC processing by the replica circuit 21an. The one-time processing is repeatedly executed.

Each of the bit flip availability determination units 30a1 to 30aM is, for example, an arithmetic processing circuit that determines availability of flipping related to one bit handled thereby. The bit flip availability determination units 30a1 to 30aM may perform the above-described determination processing in parallel.

When a value of a bit (state variable x_i) with index=i changes to 1−x_i, a change amount of x_i is represented as δx_i = (1−x_i) − x_i = 1−2x_i. Accordingly, a change amount (ΔE_i) of the value of the evaluation function accompanied by the change in the value of x_i may be represented by Expression (6) below from Expression (1).

$\begin{matrix} \begin{aligned} \Delta E_{i} &= E(x) \big|_{x_{i} \to 1 - x_{i}} - E(x) \\ &= - \delta x_{i} \left( \sum_{j} W_{ij} x_{j} + b_{i} \right) \\ &= - \delta x_{i} h_{i} \\ &= \begin{cases} - h_{i} & \text{for } x_{i} = 0 \to 1 \\ + h_{i} & \text{for } x_{i} = 1 \to 0 \end{cases} \end{aligned} & (6) \end{matrix}$

In Expression (6), h_i is referred to as a local field and may be represented by Expression (7) below.

$\begin{matrix} h_{i} = \sum_{j} W_{ij} x_{j} + b_{i} & (7) \end{matrix}$

Each of the bit flip availability determination units 30a1 to 30aM holds h_i for x_i, and obtains, from h_i, ΔE_i in a case where the value of x_i is changed, based on Expression (6).
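The relationship between Expressions (1), (6), and (7) may be checked numerically with a sketch such as the following, which compares ΔE_i obtained from the local field with the difference of E(x) before and after the flip. It assumes a symmetric weight matrix with a zero diagonal; the function names and the example values are hypothetical.

```python
from itertools import product


def energy(W, b, x):
    """Expression (1), each pair counted once (i < j)."""
    M = len(x)
    pairs = sum(W[i][j] * x[i] * x[j]
                for i in range(M) for j in range(i + 1, M))
    return -pairs - sum(b[i] * x[i] for i in range(M))


def local_field(W, b, x, i):
    """Expression (7): h_i = sum_j W_ij x_j + b_i."""
    return sum(W[i][j] * x[j] for j in range(len(x))) + b[i]


def delta_energy(W, b, x, i):
    """Expression (6): dE_i = -(1 - 2 x_i) * h_i."""
    return -(1 - 2 * x[i]) * local_field(W, b, x, i)


# Hypothetical 3-variable problem used only for the consistency check.
W = [[0, 2, -1],
     [2, 0, 3],
     [-1, 3, 0]]
b = [1, -2, 0]

mismatches = 0
for bits in product([0, 1], repeat=3):
    x = list(bits)
    for i in range(3):
        flipped = x[:i] + [1 - x[i]] + x[i + 1:]
        if delta_energy(W, b, x, i) != energy(W, b, flipped) - energy(W, b, x):
            mismatches += 1
print(mismatches)
```

Because ΔE_i follows from h_i alone, a determination unit does not need to re-evaluate the whole of Expression (1) for each candidate flip.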

Hereinafter, the bit flip availability determination unit 30a1 will be mainly described as an example. The bit flip availability determination units 30a2 to 30aM that are configurations with the same name have the same function.

The bit handled by the bit flip availability determination unit 30a1 is referred to as an own bit, and the bits handled by the bit flip availability determination units 30a2 to 30aM are referred to as other bits.

The bit flip availability determination unit 30a1 stores weight coefficients (W_1j (j=1 to M)) between the own bit and the other bits. A subscript “j” of W_1j indicates an index of one of the bits including the own bit (bit with index=1). W₁₁=0.

The bit flip availability determination unit 30a1 uses W_1j to calculate h₁ based on Expression (7).

By using h₁, the bit flip availability determination unit 30a1 generates ΔE₁ caused in a case where the own bit is flipped, based on Expression (6). For example, the bit flip availability determination unit 30a1 may determine whether the value of the own bit changes to 0 or 1 from a current value of the own bit supplied from the holding unit 30c. The bit flip availability determination unit 30a1 outputs the generated ΔE₁ to the selector unit 30b.

The bit flip availability determination unit 30a1 determines availability of flipping the own bit, based on a comparison result between ΔE₁ and a thermal noise value. As the thermal noise value, for example, T_n·log (u) may be used. T_n is a temperature value set for the replica circuit 21an, and u is a uniform random number taking a value from 0 to 1. The bit flip availability determination unit 30a1 permits the flip when, for example, −ΔE₁ ≥ T_n·log (u).
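The flip determination described above may be sketched as follows. The condition −ΔE ≥ T·log(u) is equivalent to u ≤ exp(−ΔE/T), that is, acceptance by the Metropolis method. The sketch assumes that u is drawn from the half-open interval (0, 1] so that log(u) is always defined; the function name `flip_allowed` is hypothetical.

```python
import math
import random


def flip_allowed(delta_e, temperature, u):
    """Permit the flip when -delta_e >= T * log(u), with 0 < u <= 1."""
    return -delta_e >= temperature * math.log(u)


def draw_uniform(rng=random):
    """Uniform random number in (0, 1]; avoids log(0) (assumption)."""
    return 1.0 - rng.random()


# A flip that lowers the energy is always permitted, because log(u) <= 0.
print(flip_allowed(-1.0, 0.5, draw_uniform()))

# A flip that raises the energy by 2.0 at T = 1.0 is permitted only when
# u <= exp(-2.0), which is about 0.135.
print(flip_allowed(2.0, 1.0, 0.5))
print(flip_allowed(2.0, 1.0, 0.1))
```

Raising the temperature value widens the range of u for which an energy-increasing flip is permitted, which corresponds to the larger amplitude of the thermal noise value at a higher temperature.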

The selector unit 30b receives the determination result of the flip availability output by each of the bit flip availability determination units 30a1 to 30aM. When there are a plurality of bits determined to be flippable, the selector unit 30b selects one of the plurality of bits randomly or according to a predetermined rule. When there is a bit determined to be flippable, the selector unit 30b outputs an index of the selected bit, an instruction signal for instructing flipping, and the change amount (ΔE) of the value of the evaluation function in a case where the bit is flipped. The index, the instruction signal, and ΔE are supplied to the holding unit 30c. The index is further supplied to each of the bit flip availability determination units 30a1 to 30aM.

For example, the holding unit 30c has a register and holds the state of the replica n and ΔE output by the selector unit 30b. In a case where the holding unit 30c receives an instruction signal for instructing flipping, the holding unit 30c flips the bit designated by the index output by the selector unit 30b. The holding unit 30c outputs the state of the replica n when the search processing in the replica circuit 21an is completed a predetermined number of times or for a predetermined period to the temperature adjustment unit 22 and the overall control unit 23.

The average energy calculation unit 30d calculates average energy (E_ave) of the replica n. E_ave is supplied to the temperature adjustment unit 22. For example, the average energy calculation unit 30d holds an initial value of the evaluation function represented by Expression (1), and updates E_ave by using ΔE output by the selector unit 30b and held by the holding unit 30c every time the MCMC processing is repeated. In a case where none of the bits are determined to be flippable, ΔE=0.
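For example, the update of E_ave by using ΔE may be sketched as follows. The sketch assumes that E_ave is the arithmetic mean of the values of the evaluation function obtained after each repetition of the MCMC processing. This averaging rule, the class name AverageEnergyTracker, and its method names are assumptions made for illustration.

```python
class AverageEnergyTracker:
    """Tracks the energy of one replica and its running average."""

    def __init__(self, initial_energy):
        self.energy = initial_energy  # initial value of the evaluation function
        self.total = 0.0
        self.count = 0

    def update(self, delta_e):
        # delta_e is 0 when no bit was determined to be flippable.
        self.energy += delta_e
        self.total += self.energy
        self.count += 1

    def average(self):
        # Before any update, fall back to the current energy.
        if self.count == 0:
            return self.energy
        return self.total / self.count
```

For example, starting from an initial value of 10 and applying ΔE of −2, 0, and −3 gives energies of 8, 8, and 5, so that average() returns 7.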

Although the replica circuits corresponding to the other replicas may also be implemented by a circuit configuration similar to that of the replica circuit 21an, those replica circuits may not include the average energy calculation unit 30d.

The temperature control unit 21b sets T₀ to T_N obtained by the temperature adjustment unit 22 in the bit flip availability determination units 30a1 to 30aM included in each of the replica circuits 21a0 to 21aN. The temperature control unit 21b controls the exchange of the states (X₀ to X_N) of each of the replicas in the replica circuits 21a0 to 21aN. Based on the exchange probability (p_ij) represented by Expression (2) described above, the temperature control unit 21b determines, on a pair-by-pair basis, whether to exchange states for a pair of replica circuits having adjacent temperature values.

To obtain E_i and E_j of Expression (2), the temperature control unit 21b holds, for example, an initial value of an evaluation function of each replica, and updates the value of the evaluation function of each replica based on ΔE held in the holding units 30c of the replica circuits 21a0 to 21aN. Each of the replica circuits 21a0 to 21aN may have an energy calculation unit that calculates an evaluation function (energy) of a replica for which the replica circuit itself is responsible for processing.

When exchanging the states of the replica i and the replica j, the temperature control unit 21b sets X_i, which is the state of the replica i, in the replica circuit corresponding to the replica j, and sets X_j, which is the state of the replica j, in the replica circuit corresponding to the replica i.
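For example, the exchange of states between replicas having adjacent temperature values may be sketched as follows. The sketch assumes that the exchange probability takes the form commonly used in the replica exchange method, min(1, exp((1/T_i − 1/T_j)(E_i − E_j))); whether this matches Expression (2) term by term is an assumption. The function names and the sequential sweep over the pairs are also hypothetical.

```python
import math
import random


def exchange_probability(e_i, e_j, t_i, t_j):
    """Commonly used replica exchange probability (assumed form)."""
    exponent = (1.0 / t_i - 1.0 / t_j) * (e_i - e_j)
    if exponent >= 0:
        return 1.0
    return math.exp(exponent)


def exchange_adjacent(states, energies, temps, rng=random):
    """Decide, pair by pair, whether adjacent replicas swap states."""
    for i in range(len(states) - 1):
        j = i + 1
        p = exchange_probability(energies[i], energies[j], temps[i], temps[j])
        if rng.random() < p:
            # The state and its energy move together to the other replica.
            states[i], states[j] = states[j], states[i]
            energies[i], energies[j] = energies[j], energies[i]
```

In this sketch, the temperature values stay with the replica circuits and only the states and their energies are moved.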

Every time replica exchange is performed b times, the temperature adjustment unit 22 acquires E_ave, which is the average energy of the replica n, and adjusts T_n based on a comparison result between E_ave and e_{ave, old}, which is E_ave acquired last time. An example of the temperature adjustment unit 22 will be described later.

The overall control unit 23 controls overall operations of the data processing apparatus 20. Upon receiving input of an activation signal from an outside of the data processing apparatus 20, the overall control unit 23 outputs the activation signal to the temperature control unit 21b, activates the search unit 21, and starts search processing for a solution to a combinatorial optimization problem. When the search processing by the search unit 21 ends, the overall control unit 23 acquires X₀ to X_N from the search unit 21 and obtains the solution for the combinatorial optimization problem.

For example, the overall control unit 23 sets, as a solution, a state in which the value of the evaluation function is the minimum among the acquired X₀ to X_N. The overall control unit 23 outputs an end signal indicating an end of the arithmetic operation to the outside of the data processing apparatus 20. The end signal may include information indicating the solution obtained by the arithmetic operation. For example, the overall control unit 23 may output image information indicating the solution to a display apparatus (not illustrated) coupled to the data processing apparatus 20 and cause the display apparatus to display the image information to present details of the obtained solution to the user.

The overall control unit 23 may receive problem information (W_ij or b_i) described above stored in a storage unit (not illustrated), an initial value of the state, and the like, and may set them in each unit of the search unit 21. The setting of these pieces of information may be performed by another control unit. Upon receiving input of a reset signal from the outside of the data processing apparatus 20, the overall control unit 23 clears information held by the search unit 21 and the temperature adjustment unit 22.

Example of Temperature Adjustment Unit 22

FIG. 5 is a diagram illustrating an example of a temperature adjustment unit. A configuration for calculating the maximum temperature described above is not illustrated in FIG. 5.

The temperature adjustment unit 22 includes a parameter acquisition unit 40, an average energy acquisition unit 41, an average energy holding unit 42, a comparison unit 43, a T_n adjustment unit 44, a temperature value dividing unit 45, and a temperature value setting unit 46.

The parameter acquisition unit 40 acquires, for example, a number n of redundant replica circuits, a division number a, an adjustment cycle b, and an initial value (T_{n, 0}) of T_n input from the user. The redundant replica circuits are n replica circuits in which temperature values of T₀ to T_(n−1) on a lower temperature side than T_n among T₀ to T_N are set. The division number a is the number of divisions of the range between T_{n, 0} and T_N (=maximum temperature), and is used for determining “a” candidate values of T_n. A lower limit of the “a” candidate values of T_n is T_{n, 0}, and an upper limit is a candidate value of a division point that is one immediately below T_N. The adjustment cycle b is represented by the number of times of replica exchange. For example, every time the replica exchange is performed b times, adjustment of T_n is performed.
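For example, the “a” candidate values of T_n may be calculated as sketched below. The sketch assumes that the range between T_{n, 0} and T_N is divided at equal intervals; the equal spacing and the function name candidate_values are assumptions, since the description above only fixes the lower limit (T_{n, 0}) and the upper limit (the division point immediately below T_N).

```python
def candidate_values(t_n0, t_max, a):
    """Return the "a" candidate values of T_n, lowest first."""
    if a < 1:
        raise ValueError("division number a must be 1 or more")
    if not t_n0 < t_max:
        raise ValueError("T_{n, 0} must be lower than the maximum temperature")
    # Equal spacing is an assumption made for this sketch.
    step = (t_max - t_n0) / a
    # k runs from 0 to a-1, so T_N itself is never a candidate.
    return [t_n0 + k * step for k in range(a)]
```

For example, with T_{n, 0}=1, T_N=5, and a=4, the candidate values are 1, 2, 3, and 4.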

Every time the replica exchange is performed b times, the average energy acquisition unit 41 acquires E_ave, which is average energy of the replica n. For example, the average energy acquisition unit 41 acquires information indicating whether the replica exchange has been performed from the temperature control unit 21b.

The average energy holding unit 42 holds E_ave acquired by the average energy acquisition unit 41 as e_{ave, old}.

The comparison unit 43 outputs a comparison result between E_ave acquired this time and e_{ave, old}, which is E_ave acquired last time, held in the average energy holding unit 42.

Based on the above-described comparison result, the T_n adjustment unit 44 adjusts T_n. At the time of initialization, the T_n adjustment unit 44 divides the range between T_{n, 0} and T_N to calculate the “a” candidate values described above. When E_ave≤e_{ave, old}, the T_n adjustment unit 44 changes T_n to a candidate value larger than the current T_n by one step among the “a” candidate values. When E_ave>e_{ave, old}, the T_n adjustment unit 44 changes T_n to a candidate value smaller than the current T_n by one step among the “a” candidate values.

However, when E_ave≤e_{ave, old}, the T_n adjustment unit 44 does not change T_n in a case where the current T_n is a candidate value closest to T_N (maximum temperature), for example, the upper limit of the “a” candidate values. When E_ave>e_{ave, old}, the T_n adjustment unit 44 does not change T_n in a case where the current T_n is T_{n, 0}, for example, the lower limit of the “a” candidate values.
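For example, the adjustment of T_n by one step, including the handling of the upper limit and the lower limit, may be sketched as follows. The function name adjust_index is hypothetical, and the candidate values are represented by an index from 0 (T_{n, 0}) to a−1 (the upper limit).

```python
def adjust_index(i, a, e_ave, e_ave_old):
    """Return the index of the candidate value to use next for T_n."""
    if e_ave <= e_ave_old:
        # Move one step up, but stay at the upper limit (index a-1).
        return min(i + 1, a - 1)
    # Move one step down, but stay at the lower limit (index 0).
    return max(i - 1, 0)
```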

According to the above-described adjustment method, adjustment may be performed with a small number of parameters, but the adjustment method is not limited to the above-described adjustment method. For example, after the T_n adjustment unit 44 changes T_n to the candidate value smaller than the current candidate value by one step, in a case where E_ave>e_{ave, old} also holds for E_ave acquired next, the T_n adjustment unit 44 may change T_n to a candidate value larger than the current candidate value by one step. This is because a state of lower energy may be obtained by increasing T_n.

The T_n adjustment unit 44 may set the initial value of T_n as the upper limit among the “a” candidate values.

When T_n is changed, the temperature value dividing unit 45 determines T₀ to T_N by dividing the range between T₀ and T_N to include the maximum temperature value (T_N). The temperature value dividing unit 45 calculates (changes) temperature values other than T_n and T_N based on, for example, Expressions (3) and (4) described above.
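For example, the determination of T₀ to T_N while T_n and T_N are fixed may be sketched as follows. The sketch assumes a geometric progression whose common ratio is determined from T_n and T_N; this spacing rule is an assumption made for illustration and is not a reproduction of Expressions (3) and (4). The function name divide_temperatures and the argument names are also hypothetical.

```python
def divide_temperatures(t_n, t_max, n, big_n):
    """Return [T_0, ..., T_N] with T_n and T_N kept fixed.

    Geometric spacing is an assumption made for this sketch.
    """
    if not 0 <= n < big_n:
        raise ValueError("n must satisfy 0 <= n < N")
    if not 0 < t_n < t_max:
        raise ValueError("temperatures must satisfy 0 < T_n < T_N")
    # Common ratio chosen so that T_n * ratio**(N - n) equals T_N.
    ratio = (t_max / t_n) ** (1.0 / (big_n - n))
    temps = [t_n * ratio ** (k - n) for k in range(big_n + 1)]
    temps[n] = t_n        # keep the fixed values exact
    temps[big_n] = t_max
    return temps
```

Under this assumption, with T_n=4, T_N=16, n=2, and N=4, the temperature values are 1, 2, 4, 8, and 16, so that the minimum temperature T₀ follows from T_n and T_N.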

The temperature value setting unit 46 sets the determined T₀to T_Nin the search unit 21.

FIG. 6 is a flowchart illustrating a flow of an example of a procedure for adjusting a temperature value according to the second embodiment.

For example, the following processing is performed under the control of the overall control unit 23.

As parameters for adjusting the temperature value, the temperature adjustment unit 22 acquires the number n of redundant replica circuits, the division number a, the adjustment cycle b, and T_{n, 0} (step S20).

At the time of initialization, the temperature adjustment unit 22 divides the range between T_{n, 0} and T_N to calculate the “a” candidate values described above (step S21). Hereinafter, the “a” candidate values are assumed to be T′₀, T′₁, . . . , and T′_(a−1).

The temperature adjustment unit 22 sets i=0 and T_n=T′₀ (step S22). The temperature adjustment unit 22 initializes j indicating the number of times of replica exchange to j=1 (step S23).

After that, the MCMC processing is performed in the search unit 21 (step S24). Accordingly, the state of each replica is updated in accordance with the acceptance probability of a predetermined state transition. Replica exchange is performed in the search unit 21 (step S25).

The temperature adjustment unit 22 sets j=j+1 (step S26). The temperature adjustment unit 22 determines whether j≥b+1 holds, for example, whether the replica exchange has been performed b times since j was initialized (step S27). When it is determined that j≥b+1 does not hold, the processing from step S24 is repeated.

When it is determined that j≥b+1 holds, the temperature adjustment unit 22 acquires E_ave, which is the average energy of the replica n, from the search unit 21 (step S28).

The temperature adjustment unit 22 determines whether there is e_{ave, old} in the average energy holding unit 42 (step S29).

When it is determined that there is no e_{ave, old} in the average energy holding unit 42, the temperature adjustment unit 22 causes the average energy holding unit 42 to hold the currently acquired E_ave as e_{ave, old} (step S30). After the processing of step S30, processing from step S23 is repeated.

When it is determined that there is e_{ave, old} in the average energy holding unit 42, the temperature adjustment unit 22 determines whether E_ave≤e_{ave, old} holds (step S31).

When it is determined that E_ave≤e_{ave, old} does not hold, the temperature adjustment unit 22 changes T_n to T_n=T′_(i−1) (step S32). T′_(i−1) is a candidate value that is smaller than T′_i, which is the current candidate value, by one step among the “a” candidate values. When T′_i=T_{n, 0}, T_n is not changed.

When it is determined that E_ave≤e_{ave, old} holds, the temperature adjustment unit 22 changes T_n to T_n=T′_(i+1) (step S33). T′_(i+1) is a candidate value that is larger than T′_i, which is the current candidate value, by one step among the “a” candidate values. When T′_i is the upper limit of the candidate values, T_n is not changed.

After the processing of step S32 or step S33, the temperature adjustment unit 22 determines T₀ to T_N by dividing the range between T₀ and T_N as described above (step S34). In the processing of step S34, the temperature adjustment unit 22 sets the determined T₀ to T_N in the search unit 21.

The temperature adjustment unit 22 causes the average energy holding unit 42 to hold E_ave acquired this time as e_{ave, old} (step S35).

After that, for example, the overall control unit 23 determines whether a predetermined end condition is satisfied (step S36). For example, when the number of times of MCMC processing reaches a predetermined number of times, the overall control unit 23 determines that the end condition is satisfied. Although not illustrated, for example, in a case where it is determined that the end condition is satisfied, the overall control unit 23 acquires X₀ to X_N from the search unit 21, outputs a state in which the value of the evaluation function is the minimum among X₀ to X_N as a solution, and ends the search processing.

When the overall control unit 23 determines that the predetermined end condition is not satisfied, the processing from step S23 is repeated.
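For example, the flow of steps S22 to S36 may be sketched in software as follows. The search processing itself is not implemented in the sketch; it is passed in as callables. The function name run_adjustment and the callables search_step (one repetition of the MCMC processing and the replica exchange), average_energy (acquisition of E_ave of the replica n), set_t_n (determination and setting of T₀ to T_N from the changed T_n), and finished (the end condition) are all hypothetical.

```python
def run_adjustment(candidates, b, search_step, average_energy, set_t_n, finished):
    """Adjust T_n every b replica exchanges; return the last T_n used."""
    i = 0
    set_t_n(candidates[i])               # step S22: T_n = T'_0
    e_ave_old = None                     # nothing is held at first
    while True:
        for _ in range(b):               # steps S23, S26, S27
            search_step()                # steps S24 and S25
        e_ave = average_energy()         # step S28
        if e_ave_old is None:            # step S29
            e_ave_old = e_ave            # step S30
            continue                     # back to step S23
        if e_ave <= e_ave_old:           # step S31
            i = min(i + 1, len(candidates) - 1)   # step S33
        else:
            i = max(i - 1, 0)            # step S32
        set_t_n(candidates[i])           # step S34
        e_ave_old = e_ave                # step S35
        if finished():                   # step S36
            return candidates[i]
```

For example, when E_ave acquired in four consecutive adjustment cycles is 10, 9, 8, and 9 and the candidate values are 1, 2, 3, and 4, T_n changes in the order of 1, 2, 3, and 2 in this sketch.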

The order of the processing described above is an example, and the order may be appropriately changed.

The data processing apparatus 20 according to the second embodiment described above also provides similar effects to those of the data processing apparatus 10 according to the first embodiment. For example, by determining a new temperature value including the minimum temperature based on E_ave acquired a plurality of times during the solution search processing, it is possible to shorten the adjustment time of the minimum temperature. The replica exchange suppresses trapping of the solution in a local solution. Thus, by setting the adjustment cycle to the predetermined number of times of replica exchange, it is possible to appropriately adjust the minimum temperature in consideration of a wide search space.

The data processing apparatus 20 according to the second embodiment adjusts T_n based on E_ave obtained in the search processing by the replica circuit 21an in which T_n is set, and determines temperature values including T₀ based on the adjustment result. Accordingly, even when T_n is adjusted to a value higher than the optimum minimum temperature, the n temperature values (T₀ to T_(n−1)) on the lower temperature side than T_n are set in the n redundant replica circuits, and search processing in the vicinity of the optimum minimum temperature may be performed.

Although the search unit 21 exchanges states between replicas in the replica exchange in the data processing apparatus 20 according to the second embodiment described above, temperature values may be exchanged between the replicas. In this case, the temperature values set for the replica circuits 21a0 to 21aN corresponding to the replicas 0 to N are not fixed. For this reason, the replica circuit that performs the processing of the replica n is not fixed to the replica circuit 21an. Accordingly, the average energy calculation unit 30d, which calculates the average energy (E_ave) of the replica n, may be provided in the temperature control unit 21b instead of the replica circuits 21a0 to 21aN.

By exchanging temperature values instead of exchanging states between replicas, an amount of data to be moved at the time of replica exchange may be reduced.

Even in a case where states are exchanged between replicas in the replica exchange, the average energy calculation unit 30d as described above may be provided in the temperature control unit 21b. The average energy calculation unit 30d may be provided in the overall control unit 23, or may be provided independently of the replica circuits 21a0 to 21aN, the temperature control unit 21b, the overall control unit 23, and the like.

Experimental Example

FIG. 7 is a diagram illustrating an example of calculation results of a combinatorial optimization problem.

FIG. 7 illustrates a calculation example of an average value of five seeds of minimum values of the evaluation function obtained in a case where 15 instances of Gset, which is a benchmark problem of a maximum cut problem, are calculated by the data processing apparatus 20 illustrated in FIG. 4. The maximum cut problem is an example of the combinatorial optimization problem. The calculation time is 30 minutes. The number of bits is the number of state variables x₁ to x_M included in the evaluation function. The number n of redundant replicas is set to 8. For comparison, a calculation result in a case where the number of redundant replicas n=0, for example, the minimum temperature is adjusted by using the average energy of the minimum temperature replica as in the comparative example described above is also indicated.

As illustrated in FIG. 7, the average value of the minimum values of the evaluation function is smaller in 14 problems of 15 problems than in the case where n=0. For example, it may be seen that a better solution (a solution close to the optimum solution) than the case of n=0 is obtained.

Example of Implementation by Computer

Details of the above-described processing (for example, FIG. 2 or 6) performed by the data processing apparatus 10 or 20 illustrated in FIG. 1 or 4 may be implemented by software by causing a computer as described below to execute a program.

The program may be recorded on a computer-readable recording medium. As the recording medium, for example, a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like may be used. Examples of the magnetic disk include a flexible disk (FD) and an HDD. Examples of the optical disk include a compact disc (CD), a CD-recordable (R)/rewritable (RW), a Digital Versatile Disc (DVD), and a DVD-R/RW. The program may be recorded on a portable-type recording medium and distributed. In such a case, the program may be copied from the portable-type recording medium to another recording medium and executed.

FIG. 8 is a diagram illustrating an example of hardware of a computer that is an example of the data processing apparatus.

A computer 50 includes a processor 51, a RAM 52, an HDD 53, a GPU 54, an input interface 55, a medium reader 56, and a communication interface 57. The units described above are coupled to a bus.

For example, the processor 51 may function as the search unit 12 and the processing unit 13 in FIG. 1, and as the search unit 21, the temperature adjustment unit 22, and the overall control unit 23 in FIG. 4. The processor 51 is a processor such as a GPU or a CPU including an arithmetic circuit for executing instructions of a program and a storage circuit such as a cache memory. The processor 51 loads at least a part of a program and data stored in the HDD 53 into the RAM 52 and executes the program. For example, to execute the functions of the replica circuits 21a0 to 21aN in parallel as illustrated in FIG. 4, the processor 51 may include a plurality of processor cores. The computer 50 may include a plurality of processors. A set of the plurality of processors (multiprocessor) may be referred to as a “processor”.

For example, the RAM 52 functions as the storage unit 11 illustrated in FIG. 1. The RAM 52 is a volatile semiconductor memory that temporarily stores the program to be executed by the processor 51 and data to be used for the arithmetic operation by the processor 51. The computer 50 may include a type of memory other than the RAM 52 and may include a plurality of memories.

The HDD 53 is a non-volatile storage apparatus that stores a software program such as an operating system (OS), middleware, or application software, and data. Examples of the program include a program for causing the computer 50 to execute the processing of searching for the solution to the combinatorial optimization problem and the adjustment processing of the temperature value as described above. The computer 50 may include another type of a storage apparatus such as a flash memory or a solid-state drive (SSD) and may include a plurality of non-volatile storage apparatuses.

According to an instruction from the processor 51, the GPU 54 outputs an image (for example, an image representing a search result or the like of the solution to the combinatorial optimization problem) to a display 54a coupled to the computer 50. As the display 54a, a cathode ray tube (CRT) display, a liquid crystal display (LCD), a plasma display panel (PDP), an organic electro-luminescence (OEL) display, or the like may be used.

The input interface 55 acquires an input signal from an input device 55a coupled to the computer 50 and outputs the input signal to the processor 51. As the input device 55a, a pointing device such as a mouse, a touch panel, a touch pad, or a trackball, a keyboard, a remote controller, a button switch, or the like may be used. A plurality of types of input devices may be coupled to the computer 50.

The medium reader 56 is a reading apparatus that reads a program or data recorded on a recording medium 56a. As the recording medium 56a, for example, a magnetic disk, an optical disk, a magneto-optical (MO) disk, a semiconductor memory, or the like may be used. Examples of the magnetic disk include an FD and an HDD. Examples of the optical disk include a CD and a DVD.

For example, the medium reader 56 copies a program or data read from the recording medium 56a to another recording medium such as the RAM 52 or the HDD 53. For example, the read program is executed by the processor 51. The recording medium 56a may be a portable-type recording medium and may be used to distribute the program or data. The recording medium 56a or the HDD 53 may be referred to as a computer-readable recording medium.

The communication interface 57 is an interface that is coupled to a network 57a and communicates with another information processing apparatus via the network 57a. The communication interface 57 may be a wired communication interface coupled to a communication apparatus such as a switch via a cable or may be a wireless communication interface coupled to a base station via a wireless link.

An accelerator card having an electronic circuit such as an FPGA or an ASIC may be coupled to the bus of the computer 50. The processing of the search units 12 and 21 may be executed by the accelerator card.

Although aspects of the temperature adjustment program, the data processing apparatus, and the data processing method of the present disclosure have been described above based on the embodiments, these are merely examples and the present disclosure is not limited to the above description.

All examples and conditional language provided herein are intended for the pedagogical purposes of aiding the reader in understanding the invention and the concepts contributed by the inventor to further the art, and are not to be construed as limitations to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although one or more embodiments of the present invention have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.
