The present invention relates to a computing apparatus that possibly performs high-speed computation on inverse problems or combinatorial optimization problems that need an exhaustive search.
The present age is overflowing with data, which is represented by the term “Big Data.” In information science, one of the most important assignments is how this Big Data is interpreted and is handled. In Big Data, there are quite a lot of problems that need complicated interpretation. For example, when a certain result is obtained, there is a case where the cause thereof is desired. This is called an inverse problem. The more complicated a phenomenon is, the more difficult it is to make clear the cause thereof. Generally, there is no efficient algorithm that obtains the initial value from a result. In a worst case, an exhaustive search has to be made to obtain the initial value. This is one of the difficult problems in Big Data. Alternatively, there are also many problems that an optimal solution is chosen from many choices based on Big Data. In this case, an exhaustive search is needed as well if all possibilities are taken into consideration. For this background, a computer that efficiently solves the problems that need an exhaustive search is required.
Quantum computers are placed expectations in for exhaustive-search problems. The quantum computers consist of basic elements called qubits, which simultaneously achieve “0” and “1”. For this reason, all candidates for the initial value can be simultaneously calculated, and there is a possibility that the exhaustive search can be done. However, the quantum computers need to maintain quantum coherence throughout calculation time, and a prospect for achieving it has not been made.
In this situation, a technique that is called adiabatic quantum computing is getting more attention (NPL 1). In this method, a problem is converted in such a manner that the ground state of a certain physical system is a solution and the solution is obtained through the discovery of the ground state. Let Ĥp be Hamiltonian of a physical system in which a problem is set. At the beginning of computation, however, Hamiltonian is not Ĥp but different Hamiltonian H0 for which the ground state is easily prepared. Next, the Hamiltonian is transformed from Ĥ0 to Ĥp with a sufficient period of time. When the transformation takes a sufficient period of time, the system continues to stay in the ground state, and the ground state for Hamiltonian Ĥp is finally obtained. This is a principle of the adiabatic quantum computing. Let the calculation time be τ. Hamiltonian is as expressed in Equation (1).
A solution is obtained by time-evolving the state with the Schroedinger's equation of Equation (2).
The adiabatic quantum computing is also applicable to a problem that needs an exhaustive search, and a solution is obtained in a one-way process. However, if the calculation process needs to be in accordance with the Schroedinger's equation of Equation (2), quantum coherence needs to be maintained as similar to that in the quantum computer. However, whereas the quantum computing repeats gate operations on one qubit or between two qubits, the adiabatic quantum computing makes the whole qubit system interact at the same time. There is a difference in the considered coherence between two methods. For example, let us consider a gate operation on a qubit. If there is an interaction between the qubit and a different qubit, the interaction causes decoherence. However, in the adiabatic quantum computing, all qubits interact with each other at the same time, and therefore, there is no decoherence in that example. For this reason, the adiabatic quantum computing is believed to be robust against decoherence compared with the quantum computing.
However, there is a problem associated with the adiabatic quantum computing as well. Even though the adiabatic quantum computing is more robust than the quantum computing about decoherence, if a computation process is in accordance with the Schroedinger's equation of Equation (2), sufficient coherence is needed as well. Furthermore, there is also a problem, i.e., the system which can achieve the adiabatic quantum computing is a superconducting magnetic flux qubit system (PTL 1 and NPL 2). The problem is that when the superconductivity is used, a cryogenic cooling apparatus is needed. The need for an extremely low temperature is an issue for achieving a practical computer.
JP-T-2009-524857
As described above, the adiabatic quantum computing is effective against difficult problems that need an exhaustive search, but there is still a need for quantum coherence, and there is also a need for the cryogenic cooling apparatus when the superconductive magnetic flux qubit is used. Thus, an assignment is that these two necessary conditions are removed and a practical computing apparatus is provided.
A target of the present invention is to provide a computing apparatus that solves the problems described above and does not need quantum coherence and a cryogenic cooling apparatus.
Spins work variables in computation, and a problem that is solved is set with inter-spin interactions and local fields applied to each spin. An external magnetic field is gradually decreased in such a manner that all spins are directed uniformly along one direction at time t=0 by the external magnetic field, and that the external magnetic field is reduced to zero at time t=τ, as an ideal treatment.
Each spin is time-evolved in such a manner that its orientation is determined in accordance with an effective magnetic field that consists of all actions, that is, the external magnetic field and inter-spin interactions at each site at time t. Here, the spins are not completely directed along the effective magnetic field, and the orientation is quantum-mechanically corrected, and thus the system maintains the ground state almost accurately.
The spin described above is introduced as a model, and it does not need a real spin. A computing apparatus according to the present invention can be achieved using means such as an electric circuit. Here, an important point is that it is desirable that all interactions in the N-spin system can be set arbitrarily.
In a specific example according to the present invention, a computing unit is separated into a cross computing unit and an individual-computing unit, and the cross computing unit performs pipeline processing. That enables us to arbitrarily set the interactions with the relatively small number of switches without decreasing computation speed.
According to another aspect of the present invention, a computing apparatus is provided with a general computing unit, a local-field response computing unit, a storage unit, and a control unit, in which under the control of the control unit, data is transmitted and received between the storage unit and the general computing unit or between the storage unit and the local-field response computing unit. Here, the local-field response computing unit includes a cross computing unit, an individual-computing unit, a buffer, and a switch unit. The cross computing unit determines N variables Bjz=Bjz(s1z, s2z, . . . , sNz) from N variables sjz (j=1, 2, . . . , N). The individual-computing unit determines sjz=sjz(Bjz) from each of the N variables Bjz. Computations are repeatedly performed using the components described above, i.e., the individual-computing unit, the buffer, the switch, the cross computing unit, and the individual-computing unit are repeatedly used in this order. Furthermore, information that is necessary in the switch, the cross computing unit, and the individual-computing unit is stored in the storage unit, and computation is performed under the control of the control unit.
The local-field response computing unit can be configured with a piece of dedicated or general-purpose hardware.
According to a preferable aspect of the present invention, N variables B1z, B2z, . . . , BNz are determined in this order through performing pipeline processing in the cross computing unit, using the N sjz that are stored in the buffer.
According to another preferable aspect of the present invention, the variables sjz and Bjz are configured with multi-bits.
According to another preferable aspect of the present invention, the variables sjz are configured with multi-bits; the N sjz's that are stored in the buffer are transferred beginning at the least significant bit (LSB); and the processing of sjz in the cross computing unit is performed beginning at the LSB.
According to another preferable aspect of the present invention, when the N sjz's that are stored in the buffer are transferred to the switch unit, sjz is set to be a time-direction analog value and is processed with analog format in the processing in the cross computing unit.
Let us describe a more specific example of a processing sequence according to the present invention. The N variables sjz (j=1, 2, . . . , N) take the range of −1≤sjz≤1, and an assignment is set using local fields gj and inter-variable interactions Jij (i, j=1, 2, . . . , N). In the local-field response computing unit, time is divided by m and computing is performed discretely from t=t0 (t0=0) to tm (tm=τ). In the computation, in obtaining a variable sjz(tk) at each time tk, Bjz(tk)={ΣiJijsiz(tk−1)+gj}·tk/τ is calculated using values of variables siz(tk−1) (i=1, 2, . . . , N) at time tk−1 that is earlier than time tk. Then, sjz(tk) is determined as sjz(tk)=f(Bjz(tk), tk), where the function f is defined in such a manner that the range of sjz(tk) is −1≤sjz(tk)≤1. As a time step proceeds from t=t0 to t=tm, sjz approaches −1 or 1, and a solution is determined as sjzd=−1 if finally sjz<0 and sjzd=1 if sjz>0.
Moreover, let us describe a specific example. At each time tk, if sjz(tk)<0, sjzd(tk)=−1 is set, and if sjz(tk)>0, sjzd(tk)=1 is set, and Hp(tk)=−Σi>jJijsizd(tk)−Σjgjsjzd(tk) is calculated. The final solution is sjzd(tk′) at time tk′ at which Hp(tk) took the minimum value.
These processing sequences are possibly achieved with software-based information processing, but can be typically achieved using an ordinary computer and a hardware resource that includes the local-field response computing unit. Alternatively, all processing sequences may be achieved with a dedicated hardware.
According to another aspect of the present invention, a computing apparatus is provided with a general computing unit, a local-field response computing unit, a storage unit, and a control unit, in which under the control of the control unit, data is transmitted and received between the storage unit and the general computing unit or between the storage unit and the local-field response computing unit. In the computing apparatus, N variables sjz (j=1, 2, . . . , N) take multi-bit values that are set in the range of −1≤sjz≤1.
An assignment is set by local fields gj and inter-variable interactions Jij (i, j=1, 2, . . . , N). In the local-field response computing unit, time is divided by m and computing is performed discretely from t=t0 (t0=0) to tm (tm=τ). In the computation in obtaining a variable sjz(tk) at each time tk, Bjz(tk)={ΣiJijsiz(tk−1)+gj}·tk/τ is calculated with values of variables siz(tk−1) (i=1, 2, . . . , N) at time tk−1 that is earlier than time tk, sjz(tk) is determined as sjz(tk)=f(Bjz(tk), tk), where the function f is defined in such a manner that the range of sjz(tk) is −1≤sjz(tk)≤1. As a time step proceeds from t=t0 to t=tm, sjz approaches −1 or 1. Calculation of Bjz(tk)={ΣiJijsiz(tk−1)+gj}tk/τ is performed using a multiplexed transmission path, and after the calculation, the transmission path is demultiplexed to separate every Bjz(tk). In a more specific example according to the present invention, the multiplexing is configured as time division multiplexing.
The present method is operational in a classical system although quantum-mechanical corrections are added. For this reason, we do not need to take quantum coherence into consideration; usuable resources are widened; and an electric circuit can also be used. If an energy scale related to a bit is set to be sufficiently larger than an energy scale of temperature, temperature fluctuations can be ignored and a special apparatus, such as a cryogenic apparatus or a special environment, is also unnecessary.
Various embodiments of the present invention will be described below together with the principle of computation with reference to the accompanying drawings. However, the present invention is not limited to descriptions in embodiments that will be described below. It is apparent that a person with the same expertise can modify a specific configuration of the present invention within a range that does not depart from the nature and gist of the present invention.
For the configuration of the invention that will be described below, the same portions or portions that have similar functions are given the same reference numerals in different drawings, and the same descriptions are frequently omitted.
The terms “first”, “second”, “third”, and so forth in the present specification are used for distinguishing constituents, and the terms do not necessarily restrict the number itself or the order itself. Furthermore, the number for distinguishing the constituents is used context-by-context, and the number that is used in one context does not necessarily refer to the same constituent in other contents. Furthermore, a constituent that is identified by a certain number is not prevented from being part of a constituent that is identified by a different number.
In some cases, to give an easy understanding of the invention, a position, size, shape, and range and the like of each constituent do not show their actual position, shape, range, and the like, respectively. For this reason, the present invention does not necessarily impose any limitation on a position, size, shape, range, and the like that are illustrated in the drawings and the like.
Adiabatic quantum computing is also called quantum annealing, and it is a method where the concept of classical annealing has been quantum-mechanically expanded. That is, the adiabatic quantum computing is interpreted as the one that it can classically operate in nature and quantum-mechanical effects are added to it in order to improve the performance in terms of speed and success probability of solutions. Hence, in the present embodiment, computing apparatus itself is produced classically, but a computing method or the apparatus is intended to include quantum-mechanical effects in the computation process through introducing parameters that are quantum-mechanically determined.
Based on the concept described above, the following embodiments describe a classical methodology (algorithm) for obtaining the ground state as the solution and a configuration of an apparatus for achieving it, referring to the adiabatic quantum computing.
Embodiment 1 describes the principle of the present embodiment, starting to describe it quantum-mechanically and transforming it to a classical form.
A problem for searching for the ground state of Ising spin Hamiltonian given by Equation (3) includes problems classified so-called NP-hard and is known to be a useful problem (NPL 3).
Jij and gj are parameters for setting the problem, and σ̂jz is the z component of the Pauli spin matrix and takes the eigenvalue of ±1. i and j represent spin sites. The Ising spin is a variable that takes only ±1 as a value, and Equation (3) expresses an Ising spin system because the eigenvalue of σ̂jz is ±1. The Ising spin in Equation (3) does not need to be a spin as the name implies, and it may be anything physical as long as the Hamiltonian is described by Equation (3). For example, high and low states of a logic circuit can be associated with +1 and −1; vertical and horizontal polarizations of light can be associated with +1 and −1; or 0 and π phases are associated with +1 and −1. In the method in the present embodiment, the computing system is prepared in the ground state for the Hamiltonian of Equation (4) at time t=0 as similar to that in the adiabatic quantum computing.
γ is a proportional constant that is determined in accordance with the magnitude of an external field that is uniformly applied to all sites j, and σ̂jx is the x component of the Pauli spin matrix. When the computing system consists of spins themselves, the external field means a magnetic field. Equation (4) corresponds to applying a transverse field, and the ground state is the case where all spins are directed to the x-direction (γ>0). The Hamiltonian for setting a problem is defined using an Ising spin system that has only the z components, but the x component of the spins appear in Equation (4). Therefore, the spins in the computation process are not the Ising but vectors (Block vectors). The computation starts with the Hamiltonian of Equation (4) at t=0. The Hamiltonian gradually changes with the passage of time t; it is finally transformed to the Hamiltonian described in Equation (3); and the ground state for the Hamiltonian is the solution.
Let us consider how the spin responds to the external field in the case of a one-spin system first. The Hamiltonian of the one-spin system is given by Equation (5).
Ĥ=−B·{circumflex over (σ)} [Equation 5]
Here, σ̂ represents the three components of the Pauli spin matrices as a vector. The ground state is the case where the spin is directed to the magnetic field direction. Let <•> be a quantum-mechanical expectation value. The ground state is written as <σ̂>=B/|B|. Because an adiabatic process continues to maintain the ground state, the direction of the spin always follows that of the magnetic field.
The description so far can be expanded to a multi-spin system. The Hamiltonian is given by Equation (4) at =0. This equation means that magnetic field Bjx=γ is applied to all spins. The x component of the magnetic field is gradually weakened in accordance with Bjx=γ(1−t/τ) at t>0. The z component of the effective magnetic field comprised of inter-spin interactions is given by Equation (6).
Because the spin direction can be prescribed with <σ̂jz>/<σ̂jx>, if the direction of the spin follows that of the effective magnetic field, the spin direction is determined by Equation (7).
{circumflex over (σ)}jz/{circumflex over (σ)}jx={circumflex over (B)}jz(t)/{circumflex over (B)}jx(t) [Equation 7]
Although Equation (7) is based on a quantum-mechanical description, it is an equation related to classical quantities because expectation values are taken, unlike Equations (1) to (6). Because there is no non-local correlation (quantum entanglement) of the quantum mechanics in classical systems, the orientation of the spin should be completely determined by the local field in each site, and Equation (7) determines the behavior of classical spin systems. Because there is the non-local correlation in quantum systems, Equation (7) will be modified. However, the modification will be described in embodiment 2 and subsequent embodiments. The present embodiment describes the classical system prescribed with Equation (7) in order to describe a basic form.
The spins in the present embodiment are vectorial because the x component is added in addition to the z component. A vectorial behavior can be also understood from
In the procedure 100 in
s
j
z(tk)/sjx(tk)Bjz(tk)/Bjx(tk) [Equation 8]
Because Equation (8) is the one rewritten from Equation (7) to express a relation between the classical quantities, the symbol <•> is not used.
Next, the effective magnetic field at t=tk+1 is calculated using values of the spins at t=tk. The effective magnetic field at each time is specifically written by Equations (9) and (10).
In the following, the spin and the effective magnetic field will be alternately determined in accordance with a procedure 100 schematically illustrated in
The magnitude of a spin vector is 1 in the classical system.
Each component of the spin vector in this case is described as sjz(tk)=sin θ, sjx(tk)=cos θ, using the parameter θ that is defined with tan θ=Bjz(tk)/Bjx(tk). These are rewritten to sjz(tk)=sin(arctan(Bjz(tk)/Bjx(tk))) and sjx(tk)=cos(arctan(Bjz(tk)/Bjx(tk))).
As apparent from Equation (9), the variable in Bjx(tk) is only tk, and τ and γ are constants. Therefore, sjz(tk)=sin(arctan(Bjz(tk)/Bjx(tk))) and sjx(tk)=cos(arctan(Bjz(tk)/Bjx(tk))) can also be generally expressed as functions like sjz(tk)=f1(Bjz(tk) tk) and sjx(tk)=f2(Bjz(tk), tk), arguments of which are Bjx(tk) and tk.
Because the spin is described as a two-dimensional vector, two components of sjz(tk) and sjx(tk) appear, but if Bjz(tk) is determined based on Equation (10), sjx(tk) is not necessary. This corresponds to the fact that a spin state can be described only with sjz(tk), the range of which is [−1,1]. Because the final solution sjzd needs to be sjzd=−1 or 1, if sjz(τ)>0, sjzd=1, and if sjz(T)<0, sjzd=−1.
It has so far been described how a solution is provided in a case where a problem is expressed with Equation (3). Next, let us describe how a specific problem is expressed with Equation (3) that includes local fields gj and inter-variable interactions Jij (i, j=1, 2, . . . , N), using a specific example. As an example, let us consider a problem of an electric power supply management as a specific issue. In this case, the local field is assigned to be a quantity of natural phenomenon, such as temperature, or electric power consumption. That is, let local field gj (j=1 to 10) be the temperature at each district; let local field gj (j=11 to 20) be electric power consumption at public facilities (a library, a theater, a supermarket, and the like) in each district; and let local field gj (j=21 to 100) be the electric power consumption at each household.
Let σ̂jz (j=11 to 100) be a variable representing where electric power is distributed. Here, because j=1 to 10 is a subscript representing a temperature, σ̂jz (j=1 to 10) does not represent an electric power distribution, and it is considered as a variable for expressing an influence of temperature on activities at the public facilities and households. Because the temperature is determined with natural phenomena and thus is hardly influenced by artificial factors, the local field gj (j=1 to 10) is set to such a high value with which σ̂jz (j=1 to 10) is not influenced by other variables.
The degree of correlation between the temperature and each of the public facilities and households is expressed through inter-variable interaction Jij. The correlation between the temperature and the electric power consumption is also influenced by the concept of electric power sharing that has been proposed in recent years. An example of the electric power sharing is a movement where electric power consumption is reduced in such a manner that a household member goes to the public facilities not to use an air conditioner at each household for a time span at which air conditioning is necessary. The movement is expressed through inter-variable interaction Jij, value of which is not zero for subscript i=11 to 20 representing public facilities and for subscript j=21 to 100 representing households. However, because the interaction based on this concept is smaller than direct correlations between the temperature and the activities in the households, the value of the inter-variable interaction Jij is relatively small. Furthermore, because households are not managed independently and exert influences on each other, the inter-variable interactions Jij (i, j=21 to 100) are finite as well. In accordance with the considerations described above, the inter-variable interactions Jij are specifically set, and an optimal electric power supply distribution (eigenvalue of σ
When each item cannot be expressed with one variable of σ
We have described a simple example that expresses a specific problem with Equation (3). A specific problem, to which the present embodiment can be applied, is not limited to the problem of the electric power supply management. The method in this embodiment is applicable to many problems, such as a tour course optimization, vehicle guidance for avoiding traffic congestion, circuit design, product supply management, scheduling, and financial asset selection.
In Embodiment 1, we have transferred quantum-mechanical quantities to classical quantities by taking expectation values using quantum-mechanical equations, and explained the algorithm for the classical quantities using
The characteristics of quantum mechanics include a linear superposition state and quantum entanglement (non-local correlation). For example, let us consider a qubit that takes the two states of |0> and |1>. A linear superposition state is a sum state like |Ψ>α|0>+β|1>. The attribute of the linear superposition state has already been incorporated through the vectorial treatment of spins in Embodiment 1. That is, if sjz(tk)=1, the state is |0>, and if sjz(tk)=−1, the state if |1>; |0> and |1> correspond to a state in a case where the z-axis is selected as the quantization axis for spins; for s (to)=1 corresponding to an x-directed spin, the state is expressed with |Ψ(t0)>=(|0>+∥>)/√{square root over ( )}2; and if s (t)=−1, the state is |Ψ(t0)>=(|0>−|1>)/√{square root over ( )}2. Considering the x-axis means considering the linear superposition.
In the present embodiment, we describe the quantum entanglement that is another quantum mechanical effect. Let us consider a state in a two-qubit system described with |Ψ>=α|00>+β|11> as an example. |α|2+|β|2=1 is satisfied due to the normalization condition. The first and second variables in |00> and |11> are the first and second qubits, respectively. Because of σ̂jz|0>=|0> and σ̂jz|1>=−|1> based on a property of the Pauli spin matrix, σ
As described above, although this is one example, it is understood that when the quantum entanglement is present, the magnitude of the spin vector is not retained as 1. Although the magnitude of the spin vector is a fixed value of 1 in classical systems, if there is quantum entanglement, the magnitude of the spin vector is not 1. In Embodiment 1, based on the assumption that the magnitude of the spin vector is 1, a parameter θ was defined with tan θ=<Bjz(t)>/<Bjx(t)>, and a spin was described with sjz(tk)=sine and sjx(tk)=cos θ. However, this method does not reflect the property of the quantum entanglement inherent in this system. Thus, let us consider how the quantum entanglement is reflected.
As described above, the spin vector is not retained as 1. Hence, let us define a correction parameter rs (0≤rs≤1) that represents the magnitude of the spin vector. Here, the proportional relationship in Equation (8) get unsatisfied in accordance with the fact that the spin vector is not retained as 1. For this reason, a correction parameter rB is defined, and Equation (8) is modified into Equation (11).
s
j
z(tk)sjx(tk)rBBjz(tk)/Bjx(tk) [Equation 11]
Let us define an angle θ representing the orientation of the spin with tan θ=sjz(tk)/sjx(tk) as similar to the case of Embodiment 1. When this is substituted into Equation (11), tan θ=rB·Bjz(tk)/Bjx(tk) is obtained. Because the magnitude of the spin is rs, sjz(tk)=rs·sin θ and sjx(tk)=rs·cos θ are obtained. With these relational equations, the effects of the quantum entanglement are incorporated into the classical algorithm through the correction parameters rs and rB. If the equations are written without using θ, then sjz(tk)=rs·sin(arctan(rB·Bjz(tk)/Bjx(tk))) and sjx(tk)=rs·cos(arctan(rB·Bjz(tk)/Bjx(tk))). Furthermore, if rs and rB are incorporated into functions f1 and f2, then sjz(tk)=f1(Bjz(tk), tk) and sjx(tk)=f2(Bjz(tk), tk).
These parameters rs and rB are originated in the quantum entanglement. It is preferable that they are finely controlled depending on tk, sjz(tk), and sjx(tk), or tk, Bjz(tk), and Bjx(tk). However, it is difficult to accurately acquire the information related to the quantum entanglement in principle, and we need to consider any method to cope with this difficulty. Actually, the parameters will be determined semi-empirically depending on the problem, but a general determination method is as follows.
rB can change its sign and reflects the quantum entanglement most effectively. On the other hand, rs is a correction factor satisfying 0≤rs≤1, and has a smaller role than rB. Therefore, rs may be set to be approximately equal to 1 over a total computation time, and the quantum effect is mainly incorporated through rB. Because there is no quantum entanglement at the beginning of the computation, rB=1 is set at t=0, and rB is set to gradually approach zero at t>0. Most of the spins converge to sjz=1 or −1 near t=τ, but some of the spins behave vaguely about whether sjz>0 or sjz<0. It is those poor-convergence spins that determine whether the computation succeeds or not. Therefore, when t is near to τ, rB is determined to be optimal for those spins. Because the effect of the quantum entanglement should be incorporated to the maximum, rB is set to be nearly zero. Because the orientation of the spins that converge to sjz=1 or −1 is stable, there is few adverse influence brought from rB being set to be nearly zero.
We have described the method to make rB time-dependent so far. It is also effective to make rB magnetic-field-dependent. When Bjz(tk)/Bjx(tk) is nearly equal to zero, sjz(tk)/sjx(tk) is necessarily indefinite. Therefore, let Bjz(tk)/Bjx(tk)=Bzx and let rB−rB(Bzx); it is effective to make rB magnetic-field-dependent as rB(“Bzx is nearly equal to zero”)<rB(“|Bzx|>>0”) at all times t.
When there is no specific feature between sites, rs and rB should not be site-dependent, but when site-dependent features are known in advance, rs and rB should be site-dependent in accordance with the features; the improvement in the success probability of solutions can be expected.
In Embodiments 1 and 2, we have described the principle of the computation and the computation algorithm. In Embodiments 3, we will describe a configuration example of running the algorithm on an apparatus.
The configuration described above can be constructed from a single computer, or it can be constructed from different computers connected through a network, where arbitrary parts, such as a main memory device 201, a general computing device 202, a control device 203, an auxiliary storage device 204, an input device 205, or an output device 206 are placed on those different computers.
General computation is performed in the same procedure as in an ordinary computing apparatus; data is transmitted and received between the main memory device 201 that is a storage unit and the general computing device 202 that is a computing unit; and the computation is executed by repeating the procedure. Here, the computation is controlled by the control device 203 that is a control unit. A program executed in the general computing device 202 is stored in the main memory device 201 that is the storage unit. When the main memory device 201 has an insufficient memory capacity, the auxiliary storage device 204 that is similarly a storage unit is used.
Furthermore, parameters that are necessary for a cross computing unit, an individual-computing unit, and a switch unit, which will be described below, are stored in the main memory device 201 and the auxiliary storage device 204 as well. The input device 205 is used for inputting data, a program, and the like, and the output device 206 is used for outputting results. For the input device 205, not only a manual input device such as a keyboard but also an interface for a network connection can be used. Furthermore, the interface serves as the output device as well.
The local-field response computation, as described in Embodiments 1 and 2 (
The computation of Bjz(tk)-to-sjz(tk) was expressed as sjz(tk)=f1(Bjz(tk) tk) in Embodiments 1 and 2. As described in Embodiments 1 and 2, the function f1 is a rather complicated function that includes trigonometric functions, but because the computation is executed independently site-by-site, the individual-computing unit 1030 can perform parallel processing about N sites. Parameters, such as rs and rB, that are necessary in the function f1 are stored in the main memory device 201. They are transferred to the individual-computing unit 1030 according to an instruction of the control device 203.
The computation of sjz(tk−1)-to-Bjz(tk) is based on Equation (10). In order to determine each Bjz(tk), sjz(tk−1) at every site i that satisfies Jij≠0 is used. In the cross computing unit 1020, calculations of B1z(tk), B2z(tk), . . . , BNz(tk) are performed in this order. siz(tk−1) which is needed in calculations, is temporarily stored in a buffer 1040, and only necessary s1z(tk−1) is transferred from the buffer 1040 to the cross computing unit 1020 through the switch unit 1010. Information on Jij is stored in the main memory device 201, and in accordance with this Jij-related-information, on/off switching and the computation in the cross computing unit 1020 is performed. The computation is controlled by the control device 203. Bjz(tk) obtained is transferred to the individual-computing unit 1030, and now one cycle of the repetitive computation has been accomplished.
In the present embodiment, it is ideal if we can arbitrarily set all interactions in N spin systems. If we intend to N spins to operate simultaneously to achieve high-speed computation, the number of switches for arbitrary setting is enormous. On the other hand, there is a need to reduce the number of switches in order to manufacture practical computing apparatuses. The need is achievable at the expense of the high speed. That is, there is a trade-off between the high speed and the reduction in the number of switches. Nevertheless, this issue is solved by separating the computing unit into the cross computing unit 1020 and the individual-computing unit 1030, and by performing pipeline processing in the cross computing unit.
The cross computing unit is configured to be able to perform pipeline processing, and the individual-computing unit is configured to be able to calculate independently N variables. Thus, both units can perform high-speed computing. Moreover, because the cross computing unit performs the pipeline processing, the number of switches is reduced. Thus, the high speed and the reduction in the number of switches are both achieved.
The local-field response method achieves the highest speed if N s1z's can be processed in parallel. Because the individual-computing unit 1030 independently performs the computation for each site j, parallel processing is possible. The computation inside the cross computing unit 1020 is also semi-parallel owing to the pipeline processing as apparent from
Moreover, if a plurality of cross computing units 1020 are arranged, the degree of parallelism further increases.
The present embodiment so far has described the cross computing unit 1020 and the individual-computing unit 1030 from the viewpoints of principal constituents. Now let us see the repetitive computation described above from the perspective of a transmission path (see
The local-field response computing device 1000 may be configured as a one-chip co-processor (sub-processing device), for example, and it is added to a configuration of an ordinary computer. It is achieved as a hardware configuration in
Because s1z is a continuous quantity of [−1, 1] and the computation inside the cross computing unit 1020 is simple computation consisting of only multiplication and summation, it is also effective to perform the computation inside the cross computing unit 1020 using analog computation on the time-axis.
The multiplication and summation for the time-direction analog values are easy (NPL 4). The computation inside the cross computing unit 1020 is all performed in analog format, then the time-direction analog values are converted into digital values in the TD conversion unit 1025, and the converted data are transferred to the individual-computing unit 1030.
As seen in Equation (1) and others described above, the computation time is assumed to be τ, but there are several methods of determining the final solution. Embodiment 5 describes the various methods of determining the solution.
In the first method, as illustrated in
In the second method, the convergence of sjz is checked as illustrated in
In the third method, as illustrated in
That is, at each time tk, if sjz(tk)<0, sjzd(tk)=−1, and if sjz(tk)>0, sjzd(tk)=1 (119); Hp(tk)=−Σi>jJijsizd(tk)−Σjgjsjzd(tk) is calculated at each time tk (123); and the final solution is sjzd(tm) at time tk′ at which Hp(tk) is minimum (124). Here, the calculation of the energy is performed with the general computing device 202. For this reason, the values of sjz are transferred from the buffer 1040 to the main memory device 201.
In the fourth method, as illustrated in
Again, the calculation of the energy is performed with the general computing device 202, and therefore, the values of sjz are transferred from the buffer 1040 to the main memory device 201 as well.
Which of the methods is used is determined by a user.
We have described the embodiments in which the time-axis is discretely treated as illustrated in
The important time in the computation process is the time at which the sign of sjz changes. The frequency of sjz's changing the sign is relatively low near the starting and ending time in the computation. It is greatly high in the intermediate stage of the computation. Therefore, as the first method, there is a setting method: the time interval is set to be large at the beginning of the computation as a program; next, the time interval is made to be small with the passage of time; and then the time interval is reversely made to be large.
The second method is a method in which the probability that the spin will be inverted is evaluated at each time and the time interval is set based on the result of the evaluation. An example is as follows. When the magnitudes of |sjz| are almost equal to each other in all spins, the probability of spin inversion is low. In this case, the time interval is set to be large. On the other hand, the magnitude of |sjz| of a specific spin is smaller than that of other spins, the probability with which the spin inverts is high. In this case, the time interval is set to be small. The following is a specific example of the method of determining the time interval. Let δtmin a minimum time interval. Let save(tk)2 be the mean square of spins of all sites at time tk, and let smin(tk)2 be the magnitude of the square of the minimum spin. That is, save(tk)2=Σj(sjz(tk))2/N and smin(tk)2=min (sjz(tk)2). Let [x] be the largest integer that is equal to or smaller than x. Let ΔTk+1, k=tk+1−tk=δtmin×max (1, [100×(smin(tk)2/save(tk)2)1/2]). In this case, the minimum value of the time interval is δtmin, and the maximum value thereof is 100·δtmin. The calculation for determining the time interval is performed using the general computing device 202.
Which of the methods is used is determined by a user.
In the present embodiments, the influence of temperature is estimated as follows. A voltage necessary for bit inversion is of the order of 1 V. Let e be the elementary charge and let kB be the Botzmann's constant. The reduced temperature T is about 1.2×104 K due to T=eV/kB. This value is sufficiently higher than a room temperature of 300 K. Thus, the influence of temperature can be ignored in a configuration like that in the present embodiments, and the apparatus can operate at room temperature.
The present invention is not limited to the embodiments described above, and includes various modified embodiments. For example, one or several configurations in a certain embodiment are possibly replaced by a configuration in other embodiments, and a configuration of other embodiments is possibly added to the configuration of the certain embodiment. Moreover, a configuration in other embodiments is possibly added to one portion of a configuration in each embodiment; the one portion is possibly deleted; and the one portion is possibly replaced with the configuration in other embodiments.
The present invention is applicable to analyzing various kinds of data such as Big Data.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/JP2015/066324 | 6/5/2015 | WO | 00 |