The present invention relates to a computer which enables a high-speed computation for an inverse problem or a combinatorial optimization problem requiring an exhaustive search.
As represented by words such as big data, the present age is full of data. In information science, knowing how to analyze huge data and how to handle becomes one of the most important problems to be solved. Big data has many problems that need a complex analysis. For example, when a certain result is obtained, it may be desired to find cause of the result. This is referred to as an inverse problem. It becomes difficult to find the cause as a phenomenon becomes more complicated, and in general, efficient algorithm for obtaining an initial value from a result is not present. In the worst case, the exhaustive search should be conducted for the initial value. This is one of the difficult problems in big data. Alternatively, there are also many problems to select an optimal solution from among many choices on the basis of big data. Also, in this case, when all possibilities are taken into account, a need for the exhaustive search comes out. From this background, a computer which efficiently solves a problem which needs the exhaustive search is needed.
On the exhaustive search problem, expectations for a quantum computer are large. The quantum computer simultaneously realizes “0” and “1”, each of which is composed of a basic element called a quantum bit. For that reason, the quantum computer has a potential to simultaneously calculate all solution candidates as the initial value and certainly realize the exhaustive search. However, the quantum computer needs to maintain quantum coherence over the entire calculation time and there appears no prospect that this is realized.
In this situation, a method that has come to be noted is called adiabatic quantum computing (NPL 1). This method is one in which a problem is converted such that a ground state of a certain physical system becomes a solution and the solution is obtained by finding the ground state. The Hamiltonian of a physical system for which a problem is set is assumed as Ĥp. However, the Hamiltonian is not assumed as Ĥp at a time point of starting computation and is assumed as another Hamiltonian Ĥ0 with which a ground state is prepared easily and clearly, apart from ĤD. Next, the Hamiltonian is allowed to transition from Ĥ0 to Ĥp by spending enough time. When enough time is spent, a system remains in the ground state and a ground state of the Hamiltonian Ĥp is obtained. This is the principle of adiabatic quantum computing. When a calculation time is assumed as τ, the Hamiltonian becomes Equation (1).
The solution which is time-evolved is obtained based on the Schrodinger equation of Equation (2).
The adiabatic quantum computing is also applicable to the problem that needs the exhaustive search and reaches the solution in a unidirectional process. However, when a calculation process needs to follow the Schrodinger equation of Equation (2), it is necessary to maintain quantum coherence similar to the quantum computer. However, the quantum computer repeats a gate operation for 1 quantum bit or between 2 quantum bits, whereas adiabatic quantum computing is for simultaneously interacting over the entirety of a quantum bit system and the way of thinking for coherence is different. For example, the gate operation to a certain quantum bit is considered. At this time, if there is interaction between the quantum bit and other quantum bits, interaction is cause of decoherence, but in adiabatic quantum computing, all quantum bits are allowed to simultaneously interact and thus, decoherence is not caused in a case such as this example. Adiabatic quantum computing in which this difference is reflected is thought to be robust to decoherence as compared with the quantum computer.
However, there is also a problem to be solved in adiabatic quantum computing. Even though adiabatic quantum computing becomes more robust with respect to decoherence compared to the quantum computer, if a computation process follows the Schrodinger equation of Equation (2), sufficient coherence is needed as well. Matters that a system implement adiabatic quantum computing is a superconducting magnetic flux quantum bit system are a problem to be solved (PTL 1 and NPL 2). This is because in a case of using superconductivity, a cryogenic cooling device is needed. Matters that an extremely low temperature is needed are the problem to be solved for realizing a practical computer.
As described above, adiabatic quantum computing is effective against a challenge that needs an exhaustive search. However, quantum coherence is still needed and a cryogenic cooling device is also needed in a case of using a superconducting quantum bit. A problem to be solved is to provide a practical computer while eliminating these two necessary conditions.
In order to solve the above-described problem, an object of the present invention is to provide a computer does not need quantum coherence or a cryogenic cooling device and a computing program thereof.
Spin is used as a variable in the computation and a problem intended to be solved is set using interspin interaction and a local field acting on each spin. All spins are caused to orient toward one direction by an external magnetic field at time t=0 and the external magnetic field is gradually reduced such that the external magnetic field becomes zero at time t=τ. Each spin is time-evolved in such a way that the direction, which follows an effective magnetic field determined by all actions of interspin interaction and external magnetic fields of each site at time t, is determined. In this case, the direction of spin is not completely aligned in the effective magnetic field and is caused to be a quantum mechanically corrected direction such that the system is caused to maintain in an approximately ground state.
Additionally, a term (relaxation term) to maintain each spin in an original direction during time evolution is added to the effective magnetic field to improve convergence of a solution.
Although quantum mechanical correction is made in this method, the method operates in a classical system. For that reason, there is no need to take quantum coherence into account and resources in a wide range are available. When energy scale on the bit is set to be sufficiently larger than energy scale of a temperature, temperature fluctuations can also be ignored and a special apparatus such as a cryogenic apparatus and a special environment is not needed.
The relaxation term is added to thereby suppress vibration related to a spin direction in time evolution and improve convergence of a solution.
In the following, various examples of the present invention together with a principle of computation will be described with reference to the accompanying drawings. However, the present invention is not to be construed as being limited to the description in the following embodiment. Matters that change to a specific configuration of the present invention may be made without departing from the spirit or gist of the present invention is easily understood by a person skilled in the art.
In a configuration of an invention to be described below, the same portions or portions having similar functions are denoted by the same reference numerals to be commonly used in different drawings and redundant descriptions thereof will be omitted.
Notations such as “first”, “second”, and “third” in the present specification are intended to identify constituents and are not necessarily intended to limit the number or order. Furthermore, the number for identifying constituents is used for each context and the number used in one context does not necessarily indicate the same configuration also in other contexts. The constituent identified by a certain number is not precluded from functioning as the constituent identified by other numbers as well.
A position, size, shape, range or the like of each configuration indicated in the drawings and the like may not represent an actual position, size, shape, range or the like for easy understanding of an invention. For that reason, the present invention is not necessarily limited to the position, size, shape, range or the like disclosed in the drawings and the like.
Adiabatic quantum computing is also called quantum annealing as an alias and is one obtained by developing the concept of classical annealing to quantum mechanics. That is, adiabatic quantum computing can operate in a classical basis of behavior and can be interpreted that quantum mechanical effects are added in order to improve performance in terms of high-speed or a correct answer rate of a solution. In the present invention, a computing component itself is assumed to classical one and parameters determined quantum mechanically are introduced in the computation process to thereby realize a computing method and apparatus that are classical ones but include quantum mechanical effects.
Based on the concept described above, a classical algorithm to obtain a ground state as a solution and an apparatus for realizing will be described in the following description, while explaining relevance to adiabatic quantum computing.
A typical form to be described in the following example is a computer which includes a computing unit, a storing unit, and a control unit and performs computation while exchanging data between the storing unit and the computing unit by control of the control unit, and in which N variables sjz (j=1, 2, . . . , N) take a range of −1≦sjz≦1 and a problem to be solved is set using a local field gj and intervariable interaction Jij (i, j=1, 2, . . . , N). In the computing unit, computation is discretely performed from t=t0 (t0=0) to tm (tm=τ) by dividing time into m timepieces, Bjz(tk)={ΣiJijsiz(tk−1)+gj+sgn(sjz(tk−1))·gpina}·tk/τ or Bjz={ΣiJijsiz(tk−1)+gj+gpinb·sjz(tk−1)}·tk/τ is obtained by using a value of a variable siz(tk−1) (i=1, 2, . . . , N) of previous time tk−1 and a coefficient gpina or gpinb of a relaxation term each time when the variable sjz(tk) is obtained at each time tk, and a function f is determined so as to cause the range of the variable sjz(tk) to become −1≦sjz(tk)≦1 and results in sjz(tk)=f(Bjz(tk), tk), the variable sjz is caused to approach −1 or 1 by making a time step advance from t=t0 to t=tm, and finally determines a solution in such a way that if sjz<0, then sjzd=−1 and otherwise, if sjz>0, then sjzd=1.
The coefficient gpinb is, for example, a value from 50% to 200% of an average value of |Jij|. With respect to the local field gj for setting of a problem to be solved, it is possible to add a correction term δgj′ to gj′ only for a certain site j′ and increase the size of gj′ only for the site j′. The correction term δgj′ is, for example, a value from 10% to 100% of the average value of |Jij|.
In Example 1, a principle of the present invention will be described through migration to a classical form starting from quantum mechanical description.
Ising spin-Hamiltonian ground state search problem given by Equation (3) includes a classification problem called NP-hard and is known to be a useful problem (NPL 3).
Jij and gj are problem to be solved-setting parameters and σ̂jz takes eigenvalues of ±1 in the z-component of Pauli spin matrix. i and j represent sites of spin. Ising spin is a variable taking only ±1 as a value such that an Ising spin system is formed with the eigenvalues of ±1 of the σ̂jz in Equation (3). Ising spin of Equation (3) needs not be literally spin and may be anything physically as long as the Hamiltonian is described by Equation (3). For example, it is possible to associate high and low of a logic circuit to ±1 and is also possible to associate a vertically polarized wave and horizontally polarized wave of light to ±1 or associate 0 phase and π phase to ±1. In the method of the present example, similar to adiabatic quantum computing, a computation system is prepared in the ground state of the Hamiltonian given by Equation (4) at time t=0.
γ is a proportional constant determined by the magnitude of an external field uniformly applied to all sites j and σ̂jx is the x-component of Pauli spin matrix. If a computation system is spin itself, the external field means a magnetic field. Equation (4) is equivalent to the Hamiltonian obtained by applying a transverse magnetic field and a case where all spins are directed to the x-direction (γ>0) is the ground state. Although the Hamiltonian for setting of a problem is defined as the Ising spin system having only the z-component, the x-component of spin appears in Equation (4). Accordingly, spin in the computation process is a vector (Bloch vector) rather than Ising. Although the Hamiltonian of Equation (4) is started at t=0, the Hamiltonian gradually changes with the progress of time t and becomes the Hamiltonian described by Equation (3), and finally, the ground state of Equation (4) is obtained as a solution.
First, it considers how spin respond to the external field in a case of a 1-spin system. The Hamiltonian of the 1-spin system is given by Equation (5).
H=−B·{circumflex over (σ)} [Equation 5]
Here, σ̂ represents three components of the Pauli spin matrix as a vector. In a case where spin is directed to a magnetic field direction, the ground state is written as <σ̂>=B/|B| by using <•> as a quantum mechanical expected value. In the adiabatic process, the system always tries to keep the ground state and thus, the spin direction always follows the direction of magnetic field.
The above discussion can be extended to a multi-spin system. At t=0, the Hamiltonian is given by Equation (4). This means that the magnetic field Bjx=γ is uniformly applied to all spins. At t>0, the x-component of the magnetic field is gradually weakened and becomes Bjx=γ(1−t/τ). Regarding the z-component, interspin interaction exists and thus, the effective magnetic field is represented as Equation (6).
The spin direction is specified by a <σ̂jz/<σ̂jx> and thus, if spin direction follows the effective magnetic field, spin direction is determined by Equation (7).
{circumflex over (σ)}jz/{circumflex over (σ)}jx={circumflex over (B)}jz(t)/{circumflex over (B)}jx(t) [Equation 7]
Although Equation (7) is quantum mechanical description, but since Equation (7) takes an expected value, Equation (7) is a related equation related a classical quantity unlike Equations (1) to (6). Since non-local correlation (quantum entanglement) of quantum mechanics does not exist in the classical system, the spin direction is completely determined by the local field of each site and Equation (7) determines the behavior of the classical spin system. Although non-local correlation exists in the quantum system and thus, Equation (7) is deformed, matters regarding this will be described in and after Example 2, and a classical system determined by Equation (7) will be described in the present example in order to describe a basic form of the invention.
Since the z-component as well as the x-component is applied, spin according to the present invention is vectorial spin. The behavior as a vector can be understood also from
In a procedure 100 of
s
j
z(tk)/sjx(tk)=Bjz(tk)/Bjx(tk) [Equation 8]
Equation (8) is obtained by rewriting Equation (7) to the notation on the classical quantity and thus, the symbol of <•> is not added.
Next, an effective magnetic field at t=tk+1 will be obtained by using a value of spin at t=tk. If the effective magnetic field at each time is specifically written, the effective magnetic field becomes Equations (9) and (10).
In the following, spin and the effective magnetic field are alternately obtained in accordance with a procedure schematically illustrated in the procedure 100 of
In a classical system, the magnitude of a spin vector is 1. In this case, each component of spin vector is described as sjz(tk)=sin θ and sjx(tk)=cos θ by using the parameter θ defined in tan θ=Bjz(tk)/Bjx(tk). This is rewritten again like sjz(tk)=sin(arctan(Bjz(tk)/Bjx(tk))), sjx(tk)=cos(arctan(Bjz(tk)/Bjx(tk))).
As is evident from Equation (9), only tk is the variable of Bjx(tk) and τ and γ are constants. Accordingly, sjz(tk)=sin(arctan(Bjz(tk)/Bjx(tk))) and sjx(tk)=cos(arctan(Bjz(tk)/Bjx(tk))) may be generally represented to be sjz(tk)=f1(Bjz(tk),tk) and sjx(tk)=f2(Bjz(tk),tk) as a function having Bjz(tk) and tk as variables.
Although spin is described as the two-dimensional vector and thus two components of sjz(tk) and sjx(tk) appear, if Bjz(tk) is determined based on Equation (10), sjx(tk) is not needed. This responds to the matters that a spin state can be described by only sjz(tk) having [−1, 1] as a range. The final solution sjzd needs to become sjzd=−1 or 1 and it is assumed that if sjz(τ)>0, then sjzd=1, and otherwise, if sjz(τ)<0, then sjzd=−1.
Thus far, it is indicated how a problem to be solved is solved in a case where the problem to be solved is represented by Equation (3). Next, description will be made by enumerating a specific example as to how a specific problem to be solved is represented by Equation (3) including a local field gj and intervariable interaction Jij (i, j=1, 2, . . . , N). For example, a problem of electrical power supply management is considered as the specific problem to be solved. In this case, the local field is an amount of a natural phenomenon such as a temperature, or an electrical power use amount. That is, it is assumed that the temperature of each district is represented by the local field gj (j=1-10), the electrical power use amount of public facilities (library, theater, supermarket, and the like) in each district is represented by the local field gj (j=11-20), and the electrical power use amount of each household is represented by the local field gj (j=21-100).
σ̂jz (j=11-100) is a variable indicating where electrical power is to be distributed. However, j=1-10 is a subscript representing the temperature and thus, σ̂jz (j=1-10) does not represent electrical power distribution and considers the temperature as a variable which influences activities of public facilities or households. The temperature is determined by a natural phenomenon and is hardly influenced by artificial factors and thus, the local field gj (j=1-10) is set to a large value so that σ̂jz (j=1-10) is not influenced by other variables.
Correlation intensity between the temperature and activities of the public facilities and households is represented through intervariable interaction Jij. The correlation of the temperature and electrical power use is also influenced by the concept of electrical power sharing that is proposed in recent years. For example, the electrical power sharing is a movement that tries to reduce electrical power of each household in such a way that household members go to the public facilities without using an air conditioner at each household in a period of time at which air-conditioning is needed. The movement is represented by allowing a non-zero value of intervariable interaction Jij to be taken with respect to the subscript i=11-20 which represents the public facilities and the subscript j=21-100 which represent the house. However, interaction based on this concept is smaller compared to direct correlation on the temperature and activities of households and thus, the value of intervariable interaction Jij is relatively small. Respective households do not live independently and influence on each other and thus, intervariable interaction Jij (i, j=21-100) also becomes finite. The intervariable interaction Jij is specifically set through the discussion as described above and the optimum electrical power supply distribution (eigenvalue of σ̂jz=+1 or −1) is obtained through the ground state search of Equation (3)
In a case where it is unable to represent σ̂jz for each item by a single variable, a plurality of σ̂jz may be used and according to this, the plurality of the local fields gj and intervariable interaction Jij may also be used for each item. Although σ̂jz is the variable representing electrical power distribution, σ̂jz is correlated with human movement and an opening situation of public facilities. For that reason, it can be interpreted as “certain public facility is closed” by the obtained solution.
Description as above is a simple example that represents a specific problem to be solved by Equation (3). The specific problem to be solved to which the present example is applicable is not limited to the problem of electrical power supply management as exemplified as above and is applicable to a lot of problems to be solved, for example, travel route optimization, vehicle guidance for avoiding congestion, circuit design, product supply management, scheduling, and financial assets selection.
In Example 1, the transition to the classical quantity was made by taking the expected value based on a quantum mechanical equation, and the algorithm by the classical quantity was described using
There are a linear superposition state and quantum entanglement (non-local correlation) as the characteristics of quantum mechanics. For example, a quantum bit that takes two states of |0> and |1> is considered. The linear superposition state is one that is the sum of two states as |ψ>=α|0>+β|1>. The nature of the linear superposition state is already incorporated by vectorially treating spin in Example 1. That is, if sjz(tk)=1, then the state is |0>, and otherwise, if sjz(tk)=−1, then the state is |1>. |0> and |1> correspond to the state in a case where the z-axis is selected as an quantization axis of spin, and a case of sjx(t0)=1 which is directed to the x-axis is represented |ψ(t0)>=(|0>+|1>)/√2. If sjx(t)=−1, then the state is |ψ(t0)>=(|0>−|1>)/√2. Taking the x-axis into consideration means that linear superposition is considered.
In the present example, the quantum entanglement which is the quantum mechanical effect will be described. As an example, a case where a state of a 2-quantum bit system can be written as |ψ>=α|00>+β|11> is considered. It is expressed that |α|2+|β|2=1 by the standardization condition. The first variable and the second variable of |00> and |11> represent a first quantum bit and a second quantum bit, respectively. Since σ̂jz|0>=|0> and σ̂jz|1>=−11> as nature of the Pauli spin matrix, it becomes that σ̂1z|ψ>=α|00>−|11> and <ψ|σ̂1z|ψ>=|α|2−|β|2. Since σ̂1x|0>=∥> and σ̂1x|1>=|0>, it becomes that σ̂1x<|ψ>=α|10>+β|01> and <ψ|σ̂1x|ψ>=0. Since σ̂1y|0>=i|1> and σ̂1y|1>=−i|0>, it becomes that σ̂1y|ψ>=iα|10>−iβ|01> and <ψ|σ̂1y|ψ>=0. Accordingly, it is expressed that <σ̂1x(τ)>2+<σ̂1y(τ)2+σ̂1z(τ)>2=(|α|2−|β|2)2. As an extreme example, in a case of α=β at which quantum entanglement is maximized, it becomes that <σ̂1x(τ)>2+<σ̂1y(τ)>2+<σ̂1z(τ)>2=0 and the magnitude of the first spin vector becomes 0. Such a situation does not occur in the absence of quantum entanglement. For example, 1-spin system is considered and if a state is assumed to be |ψ>=α|0>+β|1>, then it becomes that <ψ|σ̂1z|ψ>=|α|2−|β|2, <ψ|σ̂1x|ψ>=α*β+αβ*, <ψ|σ̂1y|ψ>=iαβ*−iα*β, and <σ̂1x(τ)>2+<σ̂1y(τ)>2+<σ̂1z(τ)>2=(|α|2+|β|2)2=1, and accordingly, the magnitude is certainly saved to 1.
As described above, although it is one example, it was found out that in a case where quantum entanglement is present, the magnitude of spin vector is not saved to 1. In a classical system, the magnitude of spin vector is a fixed value of 1, but if quantum entanglement is present, the magnitude of spin vector is not 1. In Example 1, on the premise that the magnitude of spin vector is 1, it was set in such a way sjz(tk)=sin θ and sjx(tk)=cos θ by using θ defined by tan θ=<Bjz(t)>/<Bjx(t)> as a parameter. However, in this method, nature of quantum entanglement inherent in the system is not reflected. Here, the way to reflect nature of quantum entanglement is considered.
As described above, spin vector is not saved to 1. Here, a correction parameter rs (0<=rs<=1) representing the magnitude of spin is defined (“<=” means “greater than or equal to”). A proportional relationship of Equation (8) is not satisfied by being associated with matters that spin vector is not saved to 1. Here, the correction parameter rB is defined to deform Equation (8) to Equation (11).
s
j
z(tk)/sjx(tk)=rBBjz(tk)/Bjx(tk) [Equation 11]
Similar to the case of Example 1, an angle θ representing the spin direction is defined by tan θ=sjz(tk)/sjx(tk). If Equation (11) is applied to this, then tan θ=rBBjz(tk)/Bjx(tk). Considering that the magnitude of spin is rs, it becomes that sjz(tk)=rs·sin θ and sjx(tk)=rs·cos θ. By these related equations, effects of quantum entanglement are incorporated into the classical algorithm through the correction parameters rs and rB. If it is notated by not using θ, it becomes that sjz(tk)=rs·sin(arctan(rB·Bjz(tk)/Bjx(tk))) and sjx(tk)=rs·cos(arctan(rB·Bjz(tk)/Bjx(tk))). If rs and rB are incorporated into functions f1 and f2, then sjz(tk)=f1(Bjz(tk) tk) and sjx(tk)=f2(Bjz(tk),tk).
It is preferable that the correction parameters rs and rB are originated in quantum entanglement and finely controlled depending on tk, sjz(tk) and sjx(tk). However, it is difficult to accurately acquire information about quantum entanglement in principle and it is necessary to consider any coping method. Although it is actually determined semi-empirically depending on a problem, a coarse determining method becomes as follows. The rB is an amount of which the sign can be changed and is an amount obtained by most properly reflecting quantum entanglement. On the other hand, the rs is a correction factor satisfying 0<=rs<=1 and has a role smaller than the rB. Accordingly, the rs may be set to be rs=˜1 over the total computation time (“=˜” means “approximately equal to”) and mainly incorporates quantum effects by the rB. Since there is no quantum entanglement at the start of the computation, rB=1 at t=0 and rB gradually comes closer to 0 at t>0. If it comes closer to t=τ, many of spins converge to sjz=1 or −1, but some of spins behave subtly of whether to become sjz>0 or sjz<0. What ultimately determines success or failure of the computation is these spins having bad convergence. Accordingly, at t=˜τ, the rB is determined so as to be most suitable for these spins. Effects of quantum entanglement are incorporated as much as possible and thus, rB=˜0. The direction of spin converging to sjz=1 or −1 is stable such that there is little adverse effect according to the fact that rB=˜0.
Description as above is the method for setting the rB regarding time dependency. It is also effective to provide magnetic field dependency to rB. In a case of Bjz(tk)/Bjx(tk)=˜0, sjz(tk)/sjx(tk) becomes inevitably indefinite. Accordingly, in a case of Bjz(tk)/Bjx(tk)=˜0, it is effective to speed up a change in which rB becomes from rB=˜1 to rB=˜0 with the progress of time t, in comparison with a case of |Bjz(tk)/Bjx(tk)|>>0.
Although in a case where there is no special intersite features, the rs or rB is not allowed to have site dependency, in a case where the feature per site is known in advance, it may be needed to respond to each feature and if the rs and rB become site dependent, improvement of the correct answer rate of a solution can be expected.
In Examples 1 and 2, the Hamiltonian of the problem to be solved was given by Equation (3) and the effective magnetic field of each site was given by Equations (9) and (10). The local field gj was present in each site. If the local field gj is present, a direction of sjz determined by gj becomes a zero-approximation direction and the direction of sjz is corrected with application of interaction determined by Jij. However, in a case where all sites becomes gj=0, a concept of the zero-approximation direction is not present and a degeneration number is increased and thus, whether it becomes sjz>0 or sjz<0 is not determined and, sjz is not escaped from sjz=˜0 even if the computation time has elapsed. When it is not the correct answer even if sjz is escaped therefrom, a force, which reverses the direction between spins, is applied to each other as a result from interspin interaction, a vibration phenomenon in which the direction of spin is reversed occurs at each step of time, and the solution is not converged.
In order to solve these problems, a relaxation term (pin fixture term) is added to Equation (10) to allow the effective magnetic field to be set as Equation (12A).
The third term is the relaxation term. sgn(•) is a sign function and sgn(sjz)=1 for sjz>0, sgn(sjz)=0 for sjz=0, and sgn(sjz)=−1 for sjz<0. The relaxation term serves to keep based on spin direction to eliminate the above-described vibration phenomenon and improve convergence of a solution. The value of gpina is empirically determined. The relaxation term is an additional term for improving convergence of a solution and needs to be sufficiently smaller than |Jij|. On the other hand, if it is too small, enough work cannot be expected. If a range is specified, it is considered appropriate that the coefficient gpina is adjusted to be a value from 1% to 50% of the average value of |Jij|. As a guide, the coefficient gpina may be set to about 1/10 of the average value of |Jij|.
The relaxation term (third term) of Equation (12A) depends only on the sign of sjz and does not depend on the size of sjz. On the other hand, the first term depends on the size of sjz. There is a method in which the third term also depends on the size of sjz. Such a case corresponds to Equation (12B).
The size of the third term in a case of Equation (12B) is changed depending on the size of sjz and thus, the coefficient gpinb typically has a size of about an average value of |Jij| and is in the order of 50% to 200% in a range point of view.
In a case where the ground state is degenerated, it is necessary to induce computation into one solution. When the computation is not induced, it is not escaped from sjz=˜0 which is the average value of the solution. The relaxation term is also useful for this induction. It is necessary to appropriately set a zero-approximation solution in order to induce the computation to one solution. Here, it is assumed that a single site (assumed as j site) assumed to be a reference is selected and set as sjz(t0)=1 and the direction of other sites is determined based on the sign of Jij by using the j site as a reference at an early stage. In this way, appropriate setting of zero-approximation is made and the computation is converged to one correct answer through the computation of the local field response after the setting. In this case, the relaxation term contributes as in the following.
At t=t0, it is set as sjz(t0)=1 and sjz(t0)=˜0 (i≠j), Equation (12A) or Equation (12B) is time-evolved based on Equation (11). Here, siz(t0)=˜0 (i≠j) means that it is set to, for example, about siz(t0)= 1/1000, so that the siz hardly influences on other sites. By the time step t0→t1, siz>0 or siz<0 is obtained for an i site at which Jij becomes Jij≠0 based on Equation (12A) or Equation (12B) and Equation (11). Although initial setting of sjz(t0)=1 is not propagated at step of t0→t1 for the i site at which Jij becomes Jij=0, the number of sites at which siz becomes siz>0 or siz<0 is increased and thus, the initial setting of sjz(t0)=1 is indirectly propagated through the site in the next time step and almost all sites become siz>0 or siz<0 at an early stage of the computation. Since the relaxation term is present, sjz>0 of the j site is maintained at an early stage of the computation and information of sjz(t0)=1 is propagated to almost all sites while a history of sjz(t0)=1 remains in the j site itself. This is the appropriate setting of zero-approximation.
As described above, description was made on matters that setting of sjz(t0)=1 and siz(t0)=˜0 (i≠j) are made at t=t0 as a method for inducing the computation to one solution in a case where the ground state is degenerated. In this method, a mandatory factor is only sjz(t0)=1 at t=t0. At t>t0, only the history by the relaxation term is present and the mandatory factor is not present. It is effective to add the mandatory factor over all the time in order to further increase inductiveness to one solution. To do so, a method in which a local field term δgj′ is additionally added only to one site j′ is considered. That is, δgj′ is added to the local field term gj′ of Equations (12A) and (12B), which results in gj′→gj′+δgj′. Here, in two amounts of the right-hand side, the δgj′ is assigned the same sign as the sign of gj′ so as not to cancel each other. If it is originally gj′=0, the sign of δgj′ may be either positive or negative. The spin of j′ site is strongly induced in a certain direction by the additional term. Although it is proper that δgj′ is adjusted to be in a range of about from 10% to 100% of the average value of |Jij|, one guide may be set to a range of about 50% of the average value of |Jij|. Furthermore, since the mandatory factor is added through δgj′, in this case, initial setting of sj′z(t0)=1 is not needed and siz(t0)=˜0 may be set for all sites. A flowchart for this case is illustrated in
As described above, the relaxation term is added to thereby suppress vibration related to the direction of spin in time evolution and improve convergence of the solution. Furthermore, the relaxation term also has the following effects. In a case where the degeneration number of the ground state is large, there is a possibility that it is unable to determine whether which direction of the system orients is preferable in the ground state, the system falls into a state of sjz=0 which is the average value of a solution, and the system falls into a situation without escaping from the state. Here, only spin of one site (assumed as site j) is clearly determined (sjz=1 or −1) at an initial state and other spins are set as skz=˜0 (k≠j, skx=˜1), and a spin arrangement is determined by using the j site as a reference through spin interaction at an early stage of time evolution. Since the relaxation term is present, the direction of the j site is fixed and the reference is maintained at an early stage of the computation. For that reason, a good approximate solution corresponding to one of the degenerated solutions is implemented at an early stage of the computation and is induced to one of the degenerated solutions as it is. As such, the relaxation term increases the convergence of the solution and is converged to one of solutions in a case where the degeneration number is large. Furthermore, in a case where the degeneration number is large, there is also a second method as a method for causing the relaxation term to be converged to one of solutions. In Equations (12A) and (12B), the local field term δgj′ is additionally added to the site j′ of one solution to strongly induce only spin of j′ site in a certain direction. With this, it is strongly induced to one of the degenerated solutions relative to the j′ site set as a reference.
In Example 3, description was made on matters that the convergence of the solution is improved by introducing the relaxation term. The relaxation term mainly exerts power on a case where all sites become gj=0. If there is a term for which gj becomes gj≠0, convergence is relatively good even without adding the relaxation term. When determination is made as to whether the relaxation term is to be added based on whether it is gj=0 for all sites, it becomes an efficient computing method using only necessary terms.
As understood from Equation (1) or the like described above, although the computation time is assumed as τ, there are several methods as a final solution determination method. Various solution determination methods will be described by using Example 5.
In the first method, if sjz>0, then sjzd=1 and if sjz<0, then sjzd=−1 at t=τ(t=tm) (115), as illustrated in
In the second method, as illustrated in
In the third method, as illustrated in
That is, by assuming that if sjz(tk)<0 at each time tk, then sjzd(tk)=−1, and otherwise, if sjz(tk)>0, then sjzd(tk)=1 (119), Hp(tk)=−Σi>jJijsizd(tk)sjzd(tk)−Σjgjsjzd(tk) is calculated at each time tk (123), and it is assumed that sjzd(tk′) at time tk′, at which Hp(tk) became the minimum value, is the final solution (124).
In the fourth method, as illustrated in
The example in which the time axis is discretely set as illustrated in
Important time in the computation process is time when the sign of sjz changes. The change in the sign of sjz is relatively small near the start or end of the computation and the change in the sign of sjz is violent in an intermediate stage of the computation. As the first method, there is a setting method in which the time interval is programmatically set to be larger at the start of computation, the time interval is decreased over time, and then the time interval is increased.
In the second method, a possibility of spin reversal is evaluated at each time and the time interval is set based on the evaluation. For example, it becomes as follows. If the sizes of |sjz| are roughly equal at all spins, the possibility of occurrence of spin reversal is low. In this case, the time interval is increased. On the other hand, if a size of |sjz| of a specific spin is smaller than other spins, a probability that spin reversal occurs is high. In this case, the time interval is decreased. One specific example of a time interval determination is as follows. The minimum time interval is set as δtmin. A mean square of spins of all sites at time tk is set as save(tk)2, and the size of square of the minimum spin is set as smin(tk)2. That is, save(tk)2=Σj(sjz(tk))2/N, smin(tk)2=min sjz(tk)2. [x] is set to ΔTk+1,k=tk+1−tk=δtmin×max (1, [100×(smin(tk)2/save(tk)2)1/2]) as the maximum integer which is less than or equal to x. In a case of this example, the minimum value of the time interval becomes σtmin and the maximum value of the time interval becomes 100·δtmin.
Whether to take any of the methods will be decided by the user.
In Example 3, the site j was selected and sjz(t0) was set as sjz(t0)=1 and siz(t0)=˜0 (i≠j). Since the site j is optional, it is possible to solve the same problem by changing selection of the site j. In this way, if an optimum solution, which is attained by repeatedly solving the same problem, is selected, the correct answer rate is improved.
In Examples 1 to 7, description was made on a computation principle and computation algorithm. In Example 8, first, description will be made on a configuration example of a computer which causes algorithm to be operated as a program.
The program executed by the computing device 202 is stored in the main storage device 201 which is the storing unit. In a case where a storage capacity of the main storage device 201 is not enough, an auxiliary storage device 204 which is the same storing unit is used. An input device 205 is used for inputting data, the program, and the like and an output device 206 is used for outputting a result. The input device 205 includes an interface for network connection in addition to a manual input device such as a keyboard. The interface also serves as the output device. Although the algorithm described in Examples 1 to 7 as the program is applied to the configuration of
On the other hand, there is also a method in which the computation principle and algorithm described in Examples 1 to 7 are used, including execution of the program as well as an apparatus configuration.
As information that is needed, there are a time parameter tk, correction parameters rs and rB related to quantum entanglement, coefficients gpina and gpinb of the relaxation term, and the like, in addition to the problem to be set parameters called intervariable interaction Jij and the local field gj. Processing which takes, for example, synchronization is a role of the control device 203 similar to the computer of the configuration of
In the third and fourth method described in Example 5, as illustrated in
The local field response computing device 1000 described in Example 8 can be realized by various methods. In the present example, a method for efficiently using parallelism of light will be described first.
Although the variable sjz takes a value of [−1,1], an output of the light source is not able to take a negative value. Accordingly, it is assumed that sjz is represented in a pair of two LEDs in the LED array 1110. That is, it is assumed that for sjz>0, sjz is set as sjz+=sjz and sjz−=0 and for sjz<0, sjz is set as sjz+=0, and sjz+=sjz=−|sjz|, one of the LED output is set as sjz+, the other is set as |sjz−|, and thus, sjz is represented by a difference of both outputs sjz=sjz+−|sjz−|=sjz++sjz−. The detector array 1130 is also assumed to represent in a pair of two detectors in association with the light source side. With this, it is possible to correspond to the variable mask 1120 which is unable to take a negative value. A signal intended to be obtained in the detector array 1130 is bjz≡ΣiJijsiz. Similar to sjz, if Jij=Jij++Jij−=Jij+−|Jij−|, it becomes that bjz=ΣiJijsiz=Σi(Jij++Jij−)(siz++siz−)=Σi(Jij+siz++Jij−siz−)+Σi(Jij+siz−+Jij−siz+). Each of pairs of two detectors of the detector array 1130 detects bjz+=Σi(Jij+sjz++Jij−sjz−) and |bjz−|=Σi(Jij+|sjz−|+|J1j−|+|Jij−|sjz+) and bjz=bjz+−|bjz−|=bjz++bjz− obtained by taking a difference between the detectors becomes the signal. Furthermore, as described above, if Jii is set as Jii=gpinb it corresponds to Equation (12B).
If bjz is obtained, Bjz based on Equation (10) or Equations (12A) and (12B) is obtained by adding gj and gpina terms. This calculation is performed by the control unit 1100. A calculation to obtain sjz from Bjz is performed by the control unit 1100 and the value of sjz is sent to the LED array 1110. Like this, 1 step spanning from time tk to tk+1 is ended. Furthermore, the control unit 1100 is intended to repeat same processing and is a dedicated circuit for that. sjz at each time is transferred to the main storage device 201 to be used for analysis.
In the example of
In Example 9 and Example 10, sjz was represented in a pair of two light sources. If sjz+ and sjz− are represented by using polarization, a single light source can be used for sjz.
In the present example, sjz=√2A sin θ was set as sjz=˜√2Aθ by setting θ as θ<<1. Here, if θ is set as θ=arcsin φ, sjz becomes that sjz=√2Aφ without imposing a condition of θ<<1. That is, an input signal to the polarized wave modulator 1112 is adjusted to thereby make it possible to maintain linearity. In Example 2, description was made on matters that rB is shifted from rB=1 to rB=˜0 according to time variation from t=0 to t=τ. In this case, a functional form of θ=arcsin φ is further deformed.
The local field response computing device 1000 may also be realized by an electrical circuit as well as the method using light as in Examples 9 to 11.
siz is a continuous quantity and respective cells of the buffer arrays 1210 and 1230 with respect to siz are configured with multi-bits and are assumed as a pseudo-continuous quantity.
Effect of the temperature in the present invention is estimated as follows. Bit manipulation is performed in the LED (LD) array 1110, the polarized wave modulator 1112, and the buffer arrays 1210 and 1230. A voltage needed for bit inversion is about 1V. If it is assumed that e is the elementary charge and kB is the Boltzmann constant, a conversion temperature is T=˜1.2×104K by T=eV/kB. This value is sufficiently larger than a room temperature of 300K, an influence of the temperature can be ignored in the configuration as in Examples 9 to 12, and it can be operated in the room temperature.
The present invention is not limited to the embodiments described above and includes various modifications. For example, it is possible to replace a portion of a configuration of an example with a configuration of another example and also, it is possible to add a configuration of another example to a configuration of a certain example. Also, it is possible to add, delete, and replace of a configuration of another example, with respect to a portion of a configuration of a certain example.
For example, it is available to a field of a computer for dealing with a problem to be solved that needs exhaustive search.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2015/059765 | 3/27/2015 | WO | 00 |