The present application claims priority from Japanese application JP 2017-204990, filed on Oct. 24, 2017, the content of which is hereby incorporated by reference into this application.
The present invention relates to a computing technology which is able to perform computation at a high speed with respect to combinatorial optimization problems that need an exhaustive search.
As being representative as the word “IoT” (Internet of Things), various things are connected to the Internet in the present days; information is collected from the things; and the things are controlled by the collected information. In the control, an optimal solution is found from among many choices and is performed. Extremely speaking, the information technology in the present days can be said to search an optimal solution.
In this situation, a quantum annealing, or adiabatic quantum computing, has recently received attention. In this method, a problem is set such that the ground state of a certain physical system is a solution and the solution is obtained through finding the ground state. Let the Hamiltonian of a physical system in which a problem is set be Ĥp. At the beginning of computation, however, Hamiltonian is not Ĥp but Ĥ0 for which the ground state is easily prepared. Next, the Hamiltonian is transformed from Ĥ0 to Ĥp with a sufficient period of time. When the transformation takes a sufficient period of time, the system continues to stay in the ground state, and the ground state (solution state) for Hamiltonian Ĥp is finally obtained. This is a principle of the quantum annealing.
A ground state-searching method in which a physical system called Ising spin glass is used can be applicable even to a problem called NP-hard. Meanwhile, a highly difficult problem in combinatorial optimization problems belongs to the NP-hard. Moreover, problems classified into “P” and problems classified into “NP” in the computational complexity theory all can be transformed to an NP-hard problem. Therefore, if the quantum annealing is applied to the Ising spin glass system, almost all the combinatorial optimization problems can be solved, and the most important issue of the information technology is solved.
Another reason why the quantum annealing receives attention is robustness with respect to decoherence. In a quantum computer, quantum coherence must be preserved over a computing time. However, in the quantum annealing, the condition is relaxed. If the ground state is maintained, a correct solution is obtained. The quantum coherence is not necessarily to be maintained. In a current technology level, it is difficult to establish a pure quantum system. Therefore, the quantum coherence is hardly kept over the computing time. For this reason, the quantum annealing has received attention. However, there is a drawback even in the quantum annealing. The quantum annealing can only be realized in a superconducting magnetic flux qubit system as it stands, and thus, a cryogenic cooling apparatus is needed. The need for an extremely low temperature is an issue of achieving a practical computer.
To solve the issue, the local-field response method was proposed as described in the following (PTLs 1-3 and NPL 1). First, let us review the quantum annealing again. The concept of the annealing lies regardless of quantum or classical by its nature. The quantum annealing is devised to improve the performance of the classic annealing using quantum properties. The reason why the quantum annealing does not require the quantum coherence over the computing time and only requires the ground state being maintained comes from the wide concept of the annealing. As just described, the concept of the annealing is so wide that another method to use quantum properties may exist, different from the quantum annealing.
The local-field response method has been disclosed from such a point of view. In this method, similarly to the quantum annealing, a transverse field is applied to a spin system at time t=t0, which is a computing device, and the magnetic field is gradually decreased to obtain a solution at time t=τ. The computing device itself is classical, and quantum-mechanical information is added to the response of spins to the magnetic field. The method works on a classical machine at room temperature, and therefore, it can solve the issue of cryogenic cooling in the quantum annealing. In PTLs 1-3 and NPL 1, quantum effects were introduced in the response function, where the response function which has an average quantum effect was determined empirically or based on the results solved for similar problems. In NPL 2, solution accuracy was improved compared to the methods in PTLs 1-3 and NPL 1 by phenomenologically incorporating the properties of the linear superposition in quantum mechanics. However, the property of quantum entanglement which is another important property in quantum mechanics has not been sufficiently incorporated.
As described above, the quantum annealing requires a cryogenic cooling apparatus to use a superconducting magnetic flux qubit system. Meanwhile, the local-field response method operates at room temperature, but quantum entanglement which is an important property in quantum mechanics is not sufficiently incorporated, which limits the performance. Thus, an object of the invention is to provide a computing apparatus and a computing program which can operate at room temperature and have a sufficient performance for a difficult assignment such as an issue that needs an exhaustive search.
In a local-field response method in which spins being variables respond to local effective magnetic fields, a time axis is discretely treated; When spins respond to effective magnetic fields, the effective magnetic fields are determined sequentially from the site having the small magnitude of a spin, and the spins respond to the fields in order. When the sign of the spin is inverted, the information is reflected in the subsequent process of determining the effective magnetic fields for other sites. Thus, a many-body effect due to quantum entanglement is phenomenologically incorporated. More specific descriptions are provided in the following.
A specific figure is a computing apparatus which includes a computing unit, a storage unit, and a control unit, and performs computation under the control of the control unit while transferring data between the storage unit and the computing unit, or a computing method using the computing apparatus. N variables sjz (j=1, 2, . . . , N) take a range of −1≤sjz≤1, and an assignment is set with coefficients gj indicating local terms and coefficients Jkj (k, j=1, 2, . . . , N) indicating inter-variable interactions. Time is divided into m, and the computing unit discretely performs computation from t=t0 (t0=0) to tm (tm≤τ). Herein, N and m are natural numbers.
Variables Beff,jz(ti) and sjz(ti) at each time ti (i=1, 2, . . . , m) are determined in this order. Beff,jz(ti) is a function of skz(ti−1), Jkj, gj, and ti. sjz(ti) is a function of Beff,jz(ti) and ti. Initial values at time t0 are set as Bjz(t0)=0 and sjz(t0)=0. For determining Beff,jz(ti) and sjz(ti) at time ti (i=1, 2, . . . , m), first, sjz(ti−1) are put in descending order such that |sm1z(ti−1)|≤|sm2z(ti−1)|≤sm3z(ti−1)|≤ . . . ≤|smNz(ti−1)|. Then, Beff,m1z(ti) and sm1z(ti) at site m1 are determined at the first time, and sm1z(ti−1) is set to be sm1z(ti−1)=sgn(sm1z(ti))|sm1z(ti−1)|. Next, Beff,m2z(ti) and sm2z(ti) at site m2 are determined and sm2z(ti−1) is set to be sm2z(ti−1)=sgn(sm2z(ti))|sm2z(ti−1)|. The computation at site m3 is performed similarly, and the computation up to site mN is performed similarly for the computation at time ti. As the time step progresses from t=t0 to t=tm, the variable sjz approaches −1 or 1, and a solution is determined as sjzfd=−1 if sjz<0 and as sjzfd=1 if sjz>0.
As described, the same computation has been performed up to the site mN for the computation at time ti, but there is an option that the same computation is performed up to mx (herein, 0<x≤N), and that after site mx all remaining sites are processed in an individual and parallel manner similarly to the original local-field response method for the computation at time ti. Here, x is a natural number.
The present method operates on a classical machine. Extremely low temperature is not needed. In addition, there is no need to take quantum coherence into consideration. As a result, usable resources are expanded, and electric circuits can also be used. Moreover, solution accuracy is improved and a computation time is shortened through phenomenologically incorporating the effect of quantum entanglement. Thanks to these properties, it is possible to realize a practical computing apparatus which can solve difficult problems with high solution accuracy.
Embodiments will be described in detail using the drawings. However, the content of the embodiments described below is not interpreted in a way of limiting the invention. A person skilled in the art can easily understand that the specific configuration may vary in a scope not departing from the idea and the spirit of the invention.
Portions having the same or similar functions in the configuration of the invention described below will be represented with the same symbol and commonly used in different drawings, and the redundant description will be omitted.
In a case where there are elements having similar or corresponding functions, the elements may be attached to the same symbol with different suffixes. However, in a case where there is no need to distinguish plural elements, the suffixes may be omitted.
In the first embodiment, we will give the basic principle, starting from a quantum-mechanical description and transforming it into a classical form.
Let the problem Hamiltonian and the Hamiltonian at t=0 be Eqs. (1) and (2), respectively.
Let the Hamiltonian at time t be Eq. (3).
Herein, τ is a computation time. From an analogy with a one-spin system, the effective magnetic field at site j is given by B̂eff,j=−∂Ĥ/∂σ̂j.
The local-field response method of this embodiment operates on a classical machine in which expectation value <σ̂j> is regarded as a spin variable. As seen in Eqs. (1) and (2), <σ̂j> and <B̂eff,j> consists of only x and z components. Therefore, a response function rb(t) is defined only by the x and z components as described by Eq. (5).
{circumflex over (σ)}jz(t)/{circumflex over (σ)}jx(t)=rb(t)·{circumflex over (B)}eff,jz(t)/{circumflex over (B)}eff,jx(t) (5)
A spin direction is determined based on the response function. When the spin system is classical, the response of each spin is determined only by the effective magnetic field at each site, and the response function is rb(t)=1.
However, there is a non-local correlation (quantum entanglement) in quantum mechanics, and generally rb(t)≠1. As mentioned already, Eq. (5) has been transformed to a classic form by taking the expectation value, but a quantum effect is incorporated in rb(t)≠1. A value of rb(t) is determined empirically or through quantum-mechanical prior calculations for similar problems. The quantum effect herein is an average one because it is empirically determined or it is based on the results for similar problems. The local-field response method can incorporate quantum effects through various methods. The method through rb(t)≠1 is an example. Note that setting rb(t)=1 without the quantum effect is also allowed in the local-field response method. The local-field response method itself can operate even when the quantum effect is not incorporated.
Four variables <σ̂jz(ti)>, <σ̂jx(ti)>, <B̂eff,jz(ti)>, and <B̂eff,jx(ti)> in Eq. (5) are expectation values, and therefore, classical quantities. For this reason, let us change the quantum-mechanical description to a classical-physics one, i.e., <σ̂jx(ti)>→sjx(ti), <σ̂jz(ti)>→sjz(ti), <B̂eff,jx(ti)>→Beff,jx(ti), <B̂eff,jz(ti)>→Beff,jz(ti). With the description change, Eq. (5) is modified to Eq. (6).
(sjz(t)/sjx(t))=rb(t)·(Beff,jz(t)/Beff,jx(t)) (6)
The time-evolution in the local-field response method is discrete. Beff,jz(ti) at time ti is determined from skz(ti−1) at time ti−1 in accordance with Eq. (4). sjz(ti) at time ti is determined from Beff,jz(ti) at time ti in accordance with Eq. (6). This procedure is repeated.
Step 101 in
Step 103 can be generally written using a function f such as sjz(ti)=f(Beff,jz(ti), ti), and f(Beff,jz(ti), ti)=rs(t)·sin{arctan(rb(t)·Beff,jz(tk)/Beff,jx(tk))}. rs(t) is a parameter expressing the magnitude of the spin and satisfies 0≤rs(t)≤1. Thus, −1≤sjz(ti)≤1. Here, when rb(t)=1 and rs(t)=1, then the system is purely classical.
Hitherto, an example of the response function has been described in
Here, similarly to rb(t), rs(t) is also possible to set rs(t)=1. While the range of rb(t) is −∞<rb(t)<∞, rs(t) is 0≤rs(t)≤1. Therefore, the influence of assuming rs(t)=1 on a final solution is smaller than that of assuming rb(t)=1. Thus, it is effective to set rs(t)=1 when prior information for determining rs(t) is insufficient.
The first embodiment has been described that quantum effects can be averagely incorporated through rb(t)≠1. However, quantum effects depend on problems and vary with time. Quantum effects cannot be sufficiently incorporated only through averaged quantities. This embodiment describes a method of phenomenologically incorporating a quantum entanglement related-quantum effect in the formulation depending on the spin state at each time.
The influence of quantum entanglement appears as a many-body effect. When quantum entanglement is large, if a certain spin is inverted (its sign is inverted), another spin is simultaneously inverted with high probability. In the algorithm of
In the vicinity of the spin inversion, sjz≈0. Therefore, the spin that has a smaller value of |sjz| has higher probability of inversion. Thus, let us first put |sjz(ti−1)| at each time ti−1 in descending order.
Let m1, m2, m3, . . . be the site number of |sjz(ti−1)| in the descendent order as illustrated in Step 111 in
According to this procedure, the change for site m1 at time ti is immediately reflected on the time evolution for site m2. When sites m1 and m2 are entangled, if the spin at site m1 is inverted, the spin at site m2 is inverted with high probability. Such an effect is incorporated as the change of the sign described herein. In other words, the effect of quantum entanglement is incorporated phenomenologically.
The computed result of sm2z(ti) is also processed similarly to sm1z(ti) i.e., sm2z(ti−1)→sgn(sm2z(ti))·|sm2z(ti−1)| (Step 112). By repeating the similar process for site m3 and the following sites, we obtain N components of sjz(ti) (Step 113), and the process at time ti is completed.
Once the process at time ti is completed, the next procedure is the process at time ti+1, and the similar processes are repeated until time t=tm≤τ to obtain the final solution. The final solution is sjzfd=1 if sjz(tm)>0 and sjzfd=−1 if sjz(tm)<0 similarly to the case of the first embodiment (Steps 201 and 202).
The spin inversion is connected to a tunneling phenomenon in a viewpoint of quantum mechanics. According to this interpretation, the treatment in this embodiment is said to take multiple tunneling into consideration.
The energy value for the obtained solution is given by Eq. (7).
The local-field response method operates similarly to quantum annealing. The spin system in which the ground state is prepared at t=0 time-evolves and comes to the ground state of the problem-embedded system at t=τ ideally. The convergence of solutions is high according to the method of this embodiment. The lowest energy state during the computation is the state at t=tm with high probability (the ground state with high probability). However, in the method of the first embodiment, the convergence of the solution is low, and the lowest energy state might be the state at t<tm. When the first embodiment is used, the energy needs to be calculated at each time using Eq. (7) and the state corresponding to the lowest energy needs to be selected as the final solution. The amount of computation for that purpose is O(N2) for the first term in Eq. (7) and O(N) for the second term (here, O is the Landau symbol). The sum of both terms are summarized to O(N2). In contrast, if the method of this embodiment is used, because the convergence of the solution is high, the energy does not need to be calculated, and the amount O(N2) of computation can be saved.
On the other hand, the method of this embodiment rearranges the spins in Step 111. The amount of the processing is estimated as follows. If each spin is simply compared with the other spins in a repeated manner, the amount is O(N2). This is the upper limit for the processing. The lower limit is estimated as follows. Let us consider that an array of spins has already put in the descending order, and that only the adjacent spins are compared and exchanged. If there is no exchanging, the amount of computation is O(N) because each spin is compared with only one side of adjacent spins. Because a huge number of exchanges is generally rare, the actual amount of computation approaches O(N). Thus, the overhead of this embodiment is about O(N), which is smaller than the overhead O(N2) of the first embodiment.
The amount of computation for the entire local-field response method is dominated by the computation of the effective magnetic field in Step 102a. Because every N site amounts to O(N), the total amount is O(N2). Therefore, the overhead O(N) in this embodiment is negligible in a system in which N is sufficiently large.
As described above, in the example of
Quantum-mechanically, the effective magnetic field is determined based on Eq. (4). An eigenvalue of σ̂kz is ±1. However, because the local-field response method operates such that a spin variable skz takes an expectation value <σ̂kz>, |skz|≤1 is satisfied. For this reason, the term Σk(≠j)Jkjskz is generally underestimated compared with gj.
If the computation is performed while the term of Σk(≠j)Jkjskz is underestimated, the solution accuracy is degraded. Therefore, the value of gj is normalized with reference to the value of skz. A factor ci=(Σkskz(ti−1)2/N)1/2 is multiplied to gj to obtain gjnorm(ti)=cigj. If gjnorm(ti) is set as a local term, the contributions of the terms gjnorm(ti) and Σk(≠j)Jkjskz are almost equal, and the solution accuracy is improved. Here, let m (tm≤τ) be the number of divisions in the discrete time axis, and c1 is set as about c1=1/m. If c1 is simply determined in accordance with ci=(Σkskz(ti−1)2/N)1/2 and skz(t0)=0, then c1=0. The setting of c1=1/m is to cope with c1=0.
In quantum-mechanical spin systems, the spins always affect other spins with each other. That is, spin σ̂jz at a site j affects spine σ̂kz at another site k, and inversely σ̂kz affects σ̂jz. Therefore, the spin σ̂jz affects itself through the spin σ̂kz at site k. That is the reason why, in quantum mechanics, the state of a spin depends not only on the state of other spins interacting with the relevant spin but also on the state of itself. A magnitude of the influence on itself through the interaction is proportional to Σk(≠j)Jkj2. The first embodiment has described the averaged quantum effect. Because Σk(≠j)Jkj2 are squared terms, they are left even after averaged. That is the reason why the averaged response function becomes rb0(t)≠1. The open circles and the solid line in
Terms Σk(≠j)Jkj2 that are the origin of rb0(t)≠1 contain information Jkj for each problem. If the information is usable, the computation becomes more accurate. Thus, we replace the averaged response function rb0(t) with rb0mod(t), where rb0mod(t) is defined by 1−rb0mod(t)=(1−rb0(t))Σk(≠j)Jkj2/Σk(≠j)ave(Jkj2). Herein, ave(Jkj2) is the average of Jkj2 which are used to determine rb0(t). With the improvement, the response function reflects an actual problem, and the solution accuracy increases.
Quantum entanglement and linear superposition are representative characteristic natures in quantum mechanics. The effect of the former quantum entanglement has been phenomenologically incorporated in up to the fourth embodiment. The effect of the latter linear superposition is phenomenologically incorporated in NPL 2. In this embodiment, the both effects are incorporated at the same time.
In the embodiment illustrated in
The characteristic of linear superposition is prominent in a time region where the sign of a spin changes.
Quantum-mechanically, bands anti-cross in the vicinity of that region, and the state is linear superposition there. As a result, sjz(t)≈0, and correspondingly Beff,jz(t)≈0. In order to phenomenologically incorporate the effect, the effective magnetic fields at times ti and ti−1 are linearly combined to give the effective magnetic field Beff,jz(ti) at t=ti. Specifically, Bjz0(ti) defined by Eq. (8) is first determined using the spin values sjz(ti−1) at time ti−1.
Next, the term at prior time is taken into consideration, and Bjz(ti) defined by Eq. (9) is computed.
B
j
z(ti)=(1−u)Bjz0(ti)+uBjz(ti−1) (9)
Herein, u is appropriately adjusted within 0≤u≤1 to make the solution accuracy high, typically u≈0.1. The effective magnetic field, including a transverse field and its schedule, is given by Eq. (10).
When the effective magnetic field is obtained, sjz(t) is determined using the response function rb0mod(t) in accordance with Eq. (11).
s
j
z(ti)/sjx(ti)=rb mod0·Beff,jz(ti)/Beff,jx(ti) (11)
Up to the fifth embodiment, we have described the embodiments to improve the solution accuracy by adding various quantum effects, especially by adding the effect of quantum entanglement. However, the correct solution is not necessarily derived even if the above methods are used. For this reason, this embodiment will describe an auxiliary means.
Because the local-field response method is a deterministic method, the result is always the same if the same process is performed with the same initial spin values. If the initial values or the process is changed, the result may be different. Hence, we sweep the magnetic field several times while changing the initial values or the process, and we select the spin state giving the lowest energy in the process as the final solution to further improve the solution accuracy. FIG. 9 illustrates a flowchart for that case.
In
Process 10a just performs that in
This embodiment is described as an algorithm, which may be executed as a software on a general computer or on a dedicated hardware. The local-field response method in the embodiments has a feature that a relatively simple computation is repeated. Therefore, it is effective that the repeated computation part is established with a dedicated hardware, and the other parts are achieved with a general-purpose device.
The above configuration may be constructed as a single computer. Alternatively, the configuration may be constructed from different computers connected through a network, where arbitrary parts such as a main storage device 501, a general computation device 502, a control device 503, an auxiliary storage device 504, an input device 505, and an output device 506 are connected to the network.
General computation is performed in the same procedure as in an ordinary computing apparatus. The main storage device 501 (storage unit) and the general computation device 502 (computing unit) repeatedly transfer data between them so as to progress the computation. At that time, the control device 503 works as a control unit. A program executed with the general computation device 502 is stored in the main storage device 501 (storage unit). When the main storage device 501 has an insufficient memory capacity, the auxiliary storage device 504 that is similarly a storage unit is used. The input device 505 is used for inputting data, a program, and the like, and the output device 506 is used for outputting results. For the input device 505, not only a manual input device such as a keyboard but also an interface for a network connection may be used. In addition, the interface serves as the output device as well.
As described in the first to sixth embodiments, N spin variables sjz(t) and N effective magnetic field variables Beff,jz(t) are iteratively determined in the local-field response computation in the embodiments. The local-field response computing device 600 dedicatedly executes this iterative computation.
In the sixth embodiment, we performed the similar Processes 10a to 10d and obtained solutions sjzfd for the respective processes. The obtained solutions are transferred from the local-field response computing device 600 to the main storage device 501, and the energy Hpq(tm) and Ebest are computed using the general computation device 502. That is, individual computations not belonging to the iterated computation is performed using the general computation device 502, and the dedication rate in the local-field response computing device 600 is increased.
High-difficulty problems in combinatorial optimization problems belong to NP-hard. In addition, problems classified into “P” and problems classified into “NP” both can be transferred to an NP-hard problem. Therefore, if an NP-hard combinatorial optimization problem is solved, almost all the combinatorial optimization problems are solved. A ground state-searching problems described Eq. (1) include NP-hard problems. In this embodiment, we show how to treat an NP-hard problem in accordance with Eq. (1) by using a maximum cut (MAX-CUT) problem that is a representative NP-hard problem as an example.
The maximum cut problem is a problem of graph theory. In the graph theory, a graph G is constructed from an node set V and a vertex set E, and is written as G=(V, E). An edge e is written as e={i, j} using two nodes. When a graph is defined by taking the direction of edges e into consideration, the graph is called a directed graph. When a graph is defined without taking the direction into consideration, the graph is called an undirected graph. For an edge e, a weight is also defined, and it is written as wij and wji. For an undirected graph, wij=wji. Let us think of dividing nodes in a weighted undirected graph G=(V, E) into two groups. The MAX-CUT problem is to find a division method to maximize a total sum of weights of cut edges in the division problem. Let G1=(V1, E1) and G2=(V2, E2) be the divided two undirected graphs. Then, the MAX-CUT problem is to maximize Eq. (12).
If si=1 is set for edge i∈V1, and sj=−1 is set for edge j∈V2, Eq. (13) is derived.
Because the first term in the rightest side is a constant once a graph G is given, the MAX-CUT problem becomes a problem of minimizing Σi>jwijsisj. Because the Hamiltonian of the Ising spin glass model is given by Eq. (14),
the MAX-CUT problem is equivalent to the ground state-searching problem of Eq. (14) with Jij=−wij and gj=0.
In the second embodiment illustrated in
In this embodiment, the same function as that configured with software can be achieved with hardware such as an FPGA (Field Programmable Gate Array) and an ASIC (Application Specific Integrated Circuit).
The invention is not limited to the above embodiments, and various modifications can be made. For example, some configurations of a certain embodiment may be replaced with the configurations of another embodiment, and the configuration of the other embodiment may also be added to the configuration of a certain embodiment. In addition, part of the configurations of each embodiment may be added to or replaced with part of the configurations of other embodiments, and some of the configurations may be omitted.
Number | Date | Country | Kind |
---|---|---|---|
2017-204990 | Oct 2017 | JP | national |