The present invention generally relates to computer-implemented optimization, and more particularly relates to methods and systems for optimization problems transformation for facilitated resolution.
Mathematical optimization problems arise in many fields and disciplines, including chemistry, logistics, and machine learning. In many cases, such problems can be mapped onto (i.e., represented as an instance of) Ising spin systems or quadratic unconstrained binary optimization (QUBO) problems.
A possible angle to tackle such optimization problems is to make use of Monte-Carlo algorithms, like the Metropolis-Hastings algorithm or simulated annealing, which allow one to sample from the space of all possible solutions and potentially find or get close to the best configuration(s) of variables. One issue with such methods is that the samples they produce are often correlated due to the locality of transitions considered when exploring the space, giving only the chance to visit a small part of the entire space of solutions.
To circumvent this issue, one can turn to cluster Monte-Carlo algorithms to allow non-local configurations updates and potentially improve exploration. One cluster Monte-Carlo method of interest that has been studied recently is the Houdayer algorithm. This Monte-Carlo move and its derivatives, exhibit the interesting property that they are able to perform isoenergetic moves, i.e., moves that preserve the total energy of the configurations given as input.
Although it is useful in many situations, in practice the Houdayer move and its derivatives have the downside that, to be successful, they require the QUBO formulation of the optimization problem to be such that the number of pairwise interactions between variables is small. Equivalently, the graph underlying the problem should not be too connected.
Thus, there is a need to provide a method and system for transforming an optimization problem in order to facilitate its resolution and provide a higher chance of successful optimization by, for example, enabling the use of the Houdayer move. Furthermore, other desirable features will become apparent from the subsequent detailed description and the appended claims, taken in conjunction with the accompanying drawings and this background of the disclosure.
According to at least one aspect of the present embodiments, a method for transformation of an optimization problem to facilitate its resolution is provided. The method includes casting the optimization problem into a quadratic unconstrained binary model and transforming the optimization problem into an optimization problem having the same quadratic unconstrained binary model with reduced connectivity.
Transforming the optimization problem into the reduced connectivity optimization problem includes partitioning decision variables in the quadratic unconstrained binary model into one or more groups, each of the one or more groups comprising more than one decision variable node and introducing a register variable node between adjacent pairs of the more than one decision variable node in the one or more groups to hold partial values of a sum in a linear constraint to form the optimization problem with reduced connectivity.
According to another aspect of the present embodiments, a system for transformation of an optimization problem to facilitate its resolution is provided. The system includes one or more processors and a storage means comprising instructions for controlling the one or more processors. The instructions include instructions for controlling the one or more processors to cast the optimization problem into a quadratic unconstrained binary model and transform the optimization problem into an optimization problem having the quadratic unconstrained binary model with reduced connectivity. The instructions to transform the optimization problem into an optimization problem with reduced connectivity include instructions to partition decision variables in the quadratic unconstrained binary model into two or more groups, each of the two or more groups comprising at least one decision variable node and instructions to introduce a register variable node between adjacent pairs of the at least one decision variable node in the two or more groups to hold partial values of a sum in a linear constraint to form the optimization problem with reduced connectivity.
The accompanying figures, where like reference numerals refer to identical or functionally similar elements throughout the separate views and which together with the detailed description below are incorporated in and form part of the specification, serve to illustrate various embodiments and to explain various principles and advantages in accordance with present embodiments.
And
Skilled artisans will appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been depicted in full detail.
The following detailed description is merely exemplary in nature and is not intended to limit the invention or the application and uses of the invention. Furthermore, there is no intention to be bound by any theory presented in the preceding background of the invention or the following detailed description. It is the intent of present embodiments to provide systems and methods for transforming optimization problems that can be formulated as Ising models or Quadratic Unconstrained Binary Optimization (QUBO) models into an equivalent formulation that allows a successful use of the Houdayer algorithm. Thus, in accordance with present embodiments, general optimization problems comprised of a quadratic objective function of binary variables together with linear equality constraints involving these variables are considered, as they can be easily cast into an Ising/QUBO formulation by turning linear constraints into quadratic penalties. In particular, the systems and methods in accordance with the present embodiments provide optimization solutions even for problems whose Ising/QUBO formulations have interactions between every pair of variables, the transformation reducing connectivity in the formulation by introducing new binary variables which allow rewriting quadratic penalties in a different manner, where “connectivity” refers to the structure of interactions in the Ising/QUBO formulation. To achieve this, the variables in the problem are partitioned into multiple groups which allow one to break down the computation of each sum involved in each linear constraint by saving intermediate values of those sums into registers. Thus, new binary variables that act as registers are introduced in order to hold partial values of intermediate sums in a linear constraint. With this augmented set of variables, the original constraints can be rewritten accordingly to make use of the new variables acting as registers, hence leading to a new Ising/QUBO formulation with a reduced connectivity overall. An advantage of considering problems in the form a quadratic objective function with linear constraints is that they encompass a large number of optimization problems such as Traveling Salesman problems, Maximum Cut problems, Number Partitioning problems, Knapsack problems, Minimum Vertex Cover problems and Graph Colouring problems.
Further, the following definitions, notation and conventions are used. When considering an optimization problem, a “state” is used to refer to any configuration of the underlying variables in a general sense (not necessarily in the context of an optimizer). For instance, a “state space” is referred to independently of any optimization procedure within it. In addition, a “configuration” is any combination of the underlying problem variables that might be visited during exploration of the state space by an optimizer and a “solution” is the final configuration returned by the optimizer, i.e., returned once the optimizer termination criteria have been satisfied. For an optimization problem with n binary variables (which take value in {0,1}), variables are represented with xi, where i=1, 2, . . . , n. When collecting all variables into a vector, the resulting object is denoted as x∈{0, 1}n. And for an optimization problem with n spin variables (which take value in {−1,1}), variables are represented with σi, where i=1, 2, . . . , n. When collecting all variables into a vector, the resulting object is denoted as σ∈{−1, 1}n.
The methods and systems in accordance with the present embodiments are particularly applicable to optimization problems which can be mapped onto a constrained optimization of the form
namely a quadratic objective function with m linear equality constraints, and n binary variables x1, . . . , xn. In fact, it is shown hereinafter that advantageously linear inequality constraints can also be taken into account. The variables xi of the original problem are referred to as “decision variables”. In addition, for simplicity in the discussion of the methods and systems in accordance with the present embodiments hereinafter, the case with m=1 linear constraint is mainly treated, but those skilled artisans will realize that the schemes discussed hereinafter can be generalized to quadratic objective functions with any number m linear constraints. While a technique to handle the general case of real numbers is discussed hereinafter, for simplicity in the general discussion herein, the coefficients bij and the numbers bi are assumed to be all positive integers.
Optimization problems that follow the formulation of Equation (1) can be readily cast into a QUBO formulation. A QUBO (short for “Quadratic Unconstrained Binary Optimization” as indicated above) problem is an optimization problem that can be formulated as the minimization of a function of the form
where xi∈{0, 1}∀i=1, . . . , n. As the name suggests, these optimization problems are only concerned with minimizing a quadratic objective function with binary variables since there are no constraint. To cast a problem formulated as in Equation (1) into a QUBO formulation only requires turning linear equality constraints into quadratic penalties, which is a technique that is used typically in certain approaches to solving optimization problems. Hence, the QUBO formulation corresponding to an optimization problem in the form of Equation (1) amounts to minimizing the function
In the QUBO model, the interest is in finding the assignment x* which minimizes the objective function , that is, compute
It should be further noted that a model equivalent to the QUBO one is the Ising model. The Ising model was initially presented as a mathematical model for representing physical systems, such as magnets. The Ising model consists of variables called spins, which can take values −1 or 1, and which interact with each other according to a given graph structure. Such a graph structure is shown in
The model assigns a value (often called “energy”) to each configuration of spins through a particular function (often called “Hamiltonian”) of the following form
where σi∈{−1, 1}∀i=1, . . . , n. In order to map an Ising formulation to a QUBO one, the range of the variables are changed from {−1, 1} to {0, 1}, which is achieved by substituting each variable σi with 2xi−1. Similarly, one can start from a QUBO problem and substitute each variable xi with (σi+1)/2 to retrieve the corresponding Ising formulation. Thus, hereinafter, a mention of an Ising/QUBO formulation can refer to any optimization problem which demands the minimization of a quadratic objective function with variables that can only take two arbitrary distinct values, and where there is no further constraint on the variables.
Similar to the QUBO model, in the Ising model, the interest is in finding the assignment which minimizes the function in Equation (4), that is, compute
One is often also interested in sampling configurations with low-energy. Note that for either of the two models, adding a constant term to the function to be minimized does not alter the structure of the energy spectrum and, in particular, the lowest energy configurations remain unchanged.
A convenient way to visualize an Ising/QUBO formulation is through a graph G=(V,E), which is a structure containing a set of objects V (called “nodes”) linked together by a set of “edges” E, where each edge links a pair of nodes. In the context of Ising/QUBO problems, the corresponding graph has the set of nodes V={1, . . . , n}, where each node i∈V corresponds to the variable xi in the problem. Each possible pair of nodes (i,j)∈V×V, which represent the term Qijxixj that appear in the sum of Equation (2), belongs to the set of edges only if Qij≠0. As an example of an Ising formulation viewed as a graph, one can consider an Ising lattice, where spins are placed on a rectangular grid and connected to their nearest neighbours as shown in the lattice illustration 100 of
The connectivity of an Ising/QUBO problem is of particular interest since when applying a Houdayer move, the connectivity plays a large role in how successful the move is. Ising/QUBO problems that are very connected, that is where most coefficients Qij are non-zero, tend to prevent Houdayer moves from yielding an output which differs from the input, rendering the move useless.
Since mentions of how connected a graph is are recurrent hereinafter, a commonly used measure called “graph density” is introduced. Let G=(V,E) be an undirected graph. The graph density d(G) of G is defined as the ratio of the number of edges |E| to the maximum possible number of edges, that is
In accordance with the methods and systems of the present embodiments, the optimization problems considered are those that can be formulated as those in Equation (1) since they can be further cast into Ising/QUBO problems. In such a case, it is then possible to use tools like the Houdayer move and its variants. Moreover, the optimization problems considered are not restricted to the study of spin systems in physics, meaning that general optimization problems may be considered where any pairwise interaction can happen. In particular, the methods and systems in accordance with the present embodiments work even for optimization problems which have interactions between every pair of variables. In order to do so, the approaches of the methods and systems in accordance with the present embodiments actively modify the input given to the Houdayer move itself, resulting in a higher chance of successful optimization.
As an example, the number partitioning problem, such as used in logistics for efficient cargo packing and shipping, can be formulated as an Ising modelC={c1, c2, . . . , cn}etbe a finite set of positive numbers. When one wishes to find a partition of C into two disjoint subsets A and C\A such that the sum of the elements in both sets is equal, i.e., Σa∈Aa=Σb∈C\Ab, one can fit this into an Ising formulation by defining the Hamiltonian function set out in Equation (4) as seen in Equation (5):
which is of the desired form −ΣijJijσiσj−Σihiσi of an Ising model, with Jij=−cicj and hi=0 and ∀i, j∈{1, 2, . . . , n}.
More generally, real-life optimization problems, such as logistics and job shop optimized scheduling, improved biological self-assembly via optimized protein folding, and optimized financial portfolio trading, can be expressed in the form of Equation (1), so that the corresponding Ising/QUBO formulation can be used for optimization and reducing connectivity in accordance with the methods and systems of the present embodiments.
As mentioned hereinabove, the Houdayer move and its derivatives exhibit the interesting property that they are able to perform isoenergetic moves, i.e., moves that preserve the total energy of the input configurations. Although useful in many situations, the Houdayer move has the downside that in practice, to be successful, it requires the number of pairwise interactions between variables to be small. Equivalently, the graph connectivity underlying the problem should not be too connected. To alleviate this issue for an important class of problems and to advantageously provide enhanced solutions for real-world optimization problems, a technique for reducing connectivity is provided in accordance with the methods and systems of the present embodiments where, given a certain Ising/QUBO problem, a new graph structure with reduced connectivity is generated while being equivalent to the original problem. In connection with the methods and systems of the present embodiments, this technique is referred to as “ReduCon”. ReduCon advantageously provides the skilled person the possibility to apply the Houdayer move to certain very strongly connected real-world problems to provide solutions in situations where other techniques would most likely fail, as the Houdayer move would most likely fail on the original underlying graph.
Although originally developed for two-dimensional lattices, the Houdayer move has been altered in various works in order to work on lattices in any space dimension. Such variations differ from the ReduCon techniques in accordance with the methods and systems of the present embodiments in several important application-constraining ways. The conventional Houdayer algorithm and its variations tend to yield poor performance in the general case where interactions happen between arbitrary pairs of variables in a system. In addition, application of the Houdayer algorithm and its variations are restricted to certain pre-defined conditions, which are typically known to be favourable for a successful outcome (i.e., the output is different to the input). However, this methodology does not improve the intrinsic success rate of the Houdayer move; it only makes use of the Houdayer move when the input is such that the technique is most likely to work.
In order to provide optimization solutions even for problems which have interactions between every pair of variables, the methods and systems in accordance with the present embodiments actively modify the input given to the Houdayer move itself, resulting in a higher chance to make it successful.
In order to make use of the methods and systems in accordance with the present embodiments, one should know how to formulate an optimization problem of interest in the form of Equation (1). As mentioned hereinabove, for simplicity, we consider in the following such optimization problems but with only one linear equality constraint, yielding the formulation
Moreover, the user should be able to represent a solution of the discrete problem through a configuration of the variables xi's in the binary formulation specified above. Note that if the problem statement has an inequality constraint Σi=1bixi≤b instead of an equality, it can be converted to an equality by adding t=└ log2(b)┘+1 “slack” variables s1, . . . , st∈{0,1}. The constraint then becomes Σi=1nbixi+Σi=1t2i−1si=b, which fits our formulation, but there are now n+t binary variables, namely x1, . . . , xn and s1, . . . , st. If the problem is specified using Ising variables (that is variables take values in {−1,1}) as
it can be converted to the formulation in Equation (6) through the substitution σi→2xi−1, yielding a new formulation with x∈{0, 1}∀i=1, . . . n as desired.
In order to input such a formulation in practice, one needs to specify the values ij, bi and b, which can be given in various ways. For example, the Qij's can be specified using an n-by-n matrix Q with elements (Q)ij=Qij, and a list of pairs (i,j) of indices with the corresponding weight
ij. The coefficients bi can be conveniently given as a sequence [b1, . . . , bn], while the coefficient b can be given as is.
To see when the methods and systems in accordance with the present embodiments can be used, we first convert the formulation to a QUBO formulation by turning the linear constraint in Equation (7) into a quadratic penalty, yielding the new objective function
As mentioned hereinabove, we visualize the QUBO problem as a graph G=(V,E) with vertices V={1, . . . , n}, and where there is an edge (i,j)∈E, the quadratic term xixj has a non-zero coefficient. The resulting graph visualizing the QUBO problem can thus be more or less connected depending on the number of distinct quadratic terms, and the goal of the ReduCon methods and systems in accordance with the present embodiments is to reduce this connectivity and thereby advantageously improve the success of the Houdayer move and its variants.
In Equation (10), the first part Σi=1nΣj=1nQijxixj is the initial quadratic objective function that appears in Equation (7) and already induces some edges in the underlying graph. This “initial” connectivity is a constraint in regards to the ReduCon methods and systems of the present embodiments in that it cannot be reduced in accordance with the present embodiments. Since the ReduCon methods and systems in accordance with the present embodiments cannot prevent variables in the “initial” connectivity from being connected, the Houdayer move is bound to perform poorly if the “initial” connectivity is too dense. Accordingly, in accordance with the present embodiments, it is assumed that this first quadratic double-sum Σi=1nΣj=1nQijxixj) induces a low number of pairwise interactions, or even none (which happens when Qij=0 for all i≠j).
The second part of Equation (10), (Σi=1nb; xi−b)2=Σi,j=1nbibjxixj−2bΣi=1nbixi+b2 accounts for the linear constraint and adds edges (i,j) to G whenever bi>0 and bj>0. In a worst case, all bi's are strictly positive, inducing a fully-connected topology. This is the connectivity that the ReduCon methods and systems in accordance with the present embodiments are able to reduce. And the ReduCon methods and systems in accordance with the present embodiments reduce this connectivity by introducing new variables to the QUBO problem, thereby advantageously enabling application of the Houdayer move and its variants.
In order to reduce the connectivity, the ReduCon methods and systems in accordance with the present embodiments introduce extra variables which act as intermediate registers to hold partial values of the sum Σi=1nbixi. This way, not all variables need to interact, since some values are stored into the intermediate registers. In the following, the resulting graph representing the problem is denoted as G′=(V′, E′).
Referring to
At step 220, the set {1, . . . , n} representing all decision variables are partitioned into a desired number k of subsets G1, . . . , Gk⊆{1, . . . , n}. With these subsets, one can define which intermediate values to store. In the present example, the partial sums S1=Σi∈G
At step 230, “register” variables are introduced to hold the values of the intermediate sums (i.e., the partial sums S1=Σi∈G
At step 240, new constraints are introduced so that the new formulation is equivalent to the original one. Indeed, some constraints need to be imposed to ensure that the register variables are indeed holding the values of the intermediate sums, which can be specified through the equalities of Equation (10) to finally rewrite the original constraint Σi=1nbixi=b of Equation (7) using the last register's variables Σi=1|R
Then, at step 250, the new formulation is converted to a QUBO formulation. By turning constraints into quadratic penalties this yields the new function to minimize as shown in Equation (12):
inducing a graph G′=(V′, E′) which, advantageously, is not fully connected—i.e., thereby reducing connectivity in accordance with the ReduCon methods and systems of the present embodiments.
The resulting connectivity can be determined as follows. As discussed hereinabove, it is assumed that Σi=1nΣj=1nQijxixj induces a low number of interactions between the decision variables. One can show that the new interactions through Equation (12) allow one to lower the connectivity overall. Indeed, the first quadratic penalty (Σi∈G
There are multiple ways in which ReduCon can be modified to handle more general constraints in accordance with the methods and systems of the present embodiments. For example, one can consider a problem of the general form of Equation (1) with m constraints. In such a case, after grouping, one needs registers to store partial sums corresponding to each constraint. That is, after choosing k ReduCon groups G1, . . . , Gk to partition the decision variables into, the lth partial sum for the jth constraint is now Si(j)=Σi∈G
Another way in which ReduCon can be modified to handle more general constraints in accordance with the methods and systems of the present embodiments is where one considers that the coefficients bij in the linear constraint(s) are not necessarily positive integers, but real numbers instead. In this case, one needs to change the representation of the registers' values since, in conventional binary representation, they can only handle positive integer values. The simplest way to do so is to use a floating-point representation for each register's value, such as that described in the IEEE 754 standard. Depending on what precision is needed, each register will need a certain number of bits defined by the format chosen. These variables could be binary 32 (i.e., 32 binary variables), binary 64 (i.e., 64 binary variables), or binary 128 (i.e., 128 binary variables).
In the formulation resulting from the ReduCon methods and systems in accordance with the present embodiments (i.e., the formulation formed at step 250 of the flowchart 200 (
The number of edges can be computed by looking at the illustration 300 of
As an example, consider the following formulation with five variables:
This example indeed fits the framework of the ReduCon methods and systems, with
Once it is converted into QUBO form, the function to optimize becomes Q(x)=Σi=15cixi+(Σi=15−1)2, giving rise to a fully connected topology due to the penalty term (second term), since all variables are part of it. The connectivity thus has a density d(G)=1. Since the interactions arise from the linear constraint, the ReduCon prescription can thus be used and, for example, create k=3 groups as follows: G1={1,2}, G2={3,4} and G3={5} as shown in the illustration 500 in
The new number of variables is now V′=5+2+3+3=13, and so the maximum number of edges is
Computing the number of edges using Equation (14) as follows:
yields a graph density of d(G′)=44/78≈0.564 for the new graph. Hence, thanks to the transformation induced by the ReduCon methods and systems in accordance with the present embodiments, the original fully-connected problem is turned into a much sparser one.
Hereinabove, it has been discussed how ReduCon needs to group decision variables into different sets (i.e., into the ReduCon groups) in order to build and store partial values of the sums involved in the linear constraints. The question remains whether there exists an optimal way to do so and, in this context, what optimal means. Ultimately, the goal of the ReduCon methods and systems in accordance with the present embodiments is to reduce graph connectivity to allow the use of Houdayer moves. This means that it is preferred that for most pairs of configurations, applying the Houdayer move results in a successful transition, one where the resulting pair of configurations is different from the starting one. Hence, what is defined as the optimal groupings of variables is one for which the resulting graph yields the highest chance of success for the Houdayer move, regardless of the pair of configurations given.
Generally, as discussed hereinabove, the Houdayer move tends to be more successful when the problem at hand has a weakly-connected underlying graph. To obtain a new graph in accordance with present embodiments, the ReduCon methods and systems perform a very specific choice of addition of nodes and how they relate to each other. This means that even though the graph density may be reduced, it could be that certain choices of groupings can only yield unsuccessful Houdayer moves.
This can be made explicit through simple experiments where, for a certain problem, how different groupings lead to different densities and different rates of success for the Houdayer move are compared. Experiments are performed as follows. For a desired choice of ReduCon groups, apply the ReduCon method in accordance with the present embodiments to an input problem with an underlying graph G to get a new formulation with graph G′, from which a new graph density d(G′) can be deduced. Secondly, repeat m times the following three steps (or, if the number of variables n in the problem is low, consider all 22n possible pairs): (a) sample uniformly at random a pair of configurations (σ(1), σ(2)) in the original problem; (b) convert the pair of configurations to a corresponding pair (σ′(1), σ″(2)) in the new formulation G′, and then try to perform a Houdayer move and record whether the Houdayer move is a success or not. And, thirdly, return the density d(G′) together with the “success rate”
{sample i was successful}.
Repeating this process for groupings of interest (potentially all groupings), one can collect all resulting densities and success rates and plot them on a two-dimensional plane to visualize the performance of different groupings. For example, assume a problem with six variables x1, . . . , x6 and linear constraint Σi=16xi=1, that is xi=1, ∀i∈{1, . . . , 6}. The process explained above is repeated for all possible groupings and, since the number of variables is low, all 22-6=4096 pairs of configurations are considered. A graph 600 in
Suppose instead of having coefficients being equal in the linear equality constraint, coefficients of different integers, such as b1=b2=3, b3=b4=2, b5=1, b6=8, are used.
To understand which grouping of variables is best, one should go back to the definition of the Houdayer move. The Houdayer move works with configurations of spins, that is when variables take values in {−1,1}. To use the Houdayer move with binary variables, one can use the conversion to the formulation in Equation (6) through the one-to-one transformation σi=2xi−1, yielding a new formulation with xi∈{0,1}∀i∈1, . . . , n. At the heart of the Houdayer move is the selection of multiple variables to be flipped, which is done by first computing the “local overlap” at each site between a pair of spin configurations (σ(1), σ(2)), which informs about the similarity of spins between the configurations. Formally, the local overlap at site i∈V between the two configurations is defined as qi=σi(1)σi(2)). After that, the Houdayer moves picks a group of variables which differ between the two configurations and which form a connected component. Finally, the values of the chosen variables are flipped in both configurations, possibly yielding a new pair of configurations. The process is visualized in illustration 800 in
If the number of connected components is equal to one, then the Houdayer move has no choice but to pick that component, resulting in a failure in the sense that the resulting pair is identical to the original one, where just the configurations are swapped. Otherwise, if there is more than one connected component, new pairs of configurations at the same total energy can be reached. Thus, the goal is to have more than one connected component in the local overlap for any pair of configurations.
To create more than one connected component in the local overlap 840 of two given configurations, the variables which are equal (the nodes 805 in the illustration 800) must form some kind of “barrier” to allow the connected components formed by the nodes 810 to not join together, hence forming multiple clusters. To create such “barriers” in the graph that results from the ReduCon methods and systems in accordance with the present embodiments, we look again at the example discussed in connection with Equation (15). Let (σ(1), σ(2)) be a pair of configurations in the original problem with underlying graph G, and suppose that the two matching configurations in the new graph structure G′ created by the ReduCon methods and systems in accordance with the present embodiments are (σ′(1), σ′(2)). Referring to
Knowing this, it can be deduced that a good grouping is one that for most pairs of configurations (σ(1), σ(2)), it is very likely that when converted to a pair (σ′(1), σ′(2)) in the new formulation with graph G′, at least one of the registers is going to have the same value for both configurations in the pair. To have more opportunities to create “register barriers”, a grouping which yields the greatest number of registers, which is the grouping formed by putting each decision variables xi in its own ReduCon group, should be used. An example of such grouping is shown in an illustration 1000 in
As has been described hereinabove, the creation of “register barriers” is what makes the Houdayer move successful. However, it is clear that the last register cannot act as a barrier since no group comes after it. Thus, the ReduCon methods and systems in accordance with the present embodiments can be slightly changed to simply drop the last register Rk and adjust the constraints depending on the last register accordingly. Specifically, instead of using the two constraints Σi=1|R
Even with the advantages provided by the ReduCon methods and systems in accordance with the present embodiments, time and storage space required to perform computations in the graph resulting from the ReduCon methods and systems, even though many nodes have been added, is not seriously impacted. The computation of connected components in the local overlap during the Houdayer move can be greatly simplified when the original objective function Σi=1nΣj=1nQijxixj induces no interaction between variables in different ReduCon groups (i.e., when =0 for two variables xi and xj that are in distinct ReduCon groups). In fact, since one already knows the structure of the resulting graph, the connected components can be determined once the registers that act as barriers are located. Hence, no graph structure needs to be stored in this case, reducing the storage space as well the running time of the Houdayer move.
Herein, a few applications of the ReduCon methods and systems in accordance with the present embodiments which show how they can be used together with the Houdayer move are presented. In all instances considered, the use of a Houdayer move on a pair of configurations (σ(1), σ(2)) is replaced by multiple steps. Denoting the underlying graph of the original optimization problem by G, the ReduCon methods and systems in accordance with the present embodiments allows one to retrieve the new graph G′ and to easily convert configurations between the two formulations.
Referring to
The first obvious application of the ReduCon Houdayer move in accordance with the present embodiments is in the context of Monte-Carlo methods, in particular in the context of Metropolis-Hastings algorithms and their derivatives such as simulated annealing or parallel tempering. These probabilistic algorithms aim to find global optimal solutions of an optimization problem by searching a large state space, and sometimes are content with sampling low-energy configurations.
The use of Houdayer moves combined with parallel tempering has been shown to speed-up convergence to minima, as it acts as a form of a tunnelling effect by helping to explore solutions with similar levels of energy as the current ones. As indicated hereinabove, the success of the Houdayer move is limited by the connectivity of the graph at hand. This limitation can advantageously be alleviated by performing a ReduCon Houdayer move when the graph becomes too connected. Problems such as Knapsack or MAX 2-SAT can benefit from the ReduCon Houdayer move, making it much easier to rapidly reach low-energy solutions when the ReduCon Houdayer move is combined with usual Monte-Carlo methods. Note also that ReduCon can be used in combination with other versions of the Houdayer move such as the generalization presented in “The Houdayer Algorithm: Overview, Extensions, and Applications” (https://arxiv.org/pdf/2211.11556.pdf).
Another class of methods which the ReduCon Houdayer move can enhance is one termed “hybrid”. For such methods, the main Monte-Carlo procedure is performed on a non-binary formulation of the optimization problem, but the (ReduCon) Houdayer move is performed on the equivalent Ising/QUBO formulation in accordance with methods and systems of the present embodiments. There is thus a conversion taking place to go from the non-binary formulation to the binary one. This is done because it is sometimes much easier to perform an optimization algorithm (like simulated annealing or parallel tempering) on a formulation of the problem where variables can take up many different values rather than converting it to a complex Ising/QUBO formulation before performing the optimization in the Ising/QUBO formulation. The Ising/QUBO formulation is only used to enable the application of the (ReduCon) Houdayer move, giving once again the opportunity to find pairs of configurations at the same energy level.
An optimization procedure (like simulated annealing) in this context starts from some pair of configurations of the variables in the non-binary formulation. One step of the optimization algorithm is performed in which a new pair of configurations is found. The pair of configurations is then converted to a corresponding one in the QUBO formulation. The Houdayer move (or the ReduCon Houdayer move if the underlying graph is too connected) is next performed to obtain a new pair of configurations at the same total energy level. The pair of configurations is then converted back to the corresponding pair in the non-binary formulation. Finally, the procedure goes back to performing a step of the optimization algorithm and this process is repeated unless some stopping criterion is met.
Another application where the methods and systems in accordance with the present embodiments provides beneficial efficiencies as discussed in “The Houdayer Algorithm: Overview, Extensions, and Applications” (https://arxiv.org/pdf/2211.11556.pdf) is improving sampling of low-energy configurations. It is known that use of the Houdayer improves the sampling of low-energy configurations, especially where ground states are the ultimate aim. Some conventional strategies make use of the principle that, since the Houdayer move preserves the total energy of the pair of configurations, it must be either that one of the two configurations decreases in energy or that both stay at the same energy level. In general, for this application the methods and systems in accordance with the present embodiments would start with generating some low-energy configurations through typical heuristic methods like simulated annealing to form a set P. The configurations in P are then paired, allowing one to use Houdayer moves to potentially generate more pairs at the same total energy level. If the Houdayer moves are successful, new configurations with lower energy levels are reached and these new configurations are stored in the set P with the previous configurations. As the set P has been augmented with the new configurations with lower energy levels, the configurations in P are paired and more pairs are generates using the Houdayer move unless some stopping criterion is met. The stopping criteria may include a predetermined number of iterations or a determination that no new pairs are being generated at the same total energy level. These steps can easily be customized and improved aimed at focusing the generation towards configurations with lower energy, yielding for example genetic algorithms when a selection process is added.
More generally, any algorithm that makes use of the Houdayer move benefits from the ReduCon methods and systems in accordance with the present embodiments as the ReduCon methods and systems offer access to more successful Houdayer moves by reducing connectivity of the original formulation. Additional examples of uses of the Houdayer move which can benefit from the ReduCon methods and systems in accordance with the present embodiments include genetic algorithms making use of the Houdayer during the reproduction step and multi-objective optimization such as those where one is interested in finding Pareto optimal solutions, or more generally, the Pareto front (i.e., the set of Pareto optimal outcomes). In regards to multi-objective optimization interested in finding Pareto optimal solutions, if all objectives are themselves in Ising/QUBO, one can combine them into a bigger Ising/QUBO problem by using a technique called linear scalarization. For example, if there are m objective functions (x), . . . ,
(x) to be optimized, one can consider the new unique objective function Q(x)=Σi=1mwiQi(x), where wi>0∀i∈{1, . . . , m}. This can then be solved with the help of Houdayer moves, and thus ReduCon Houdayer moves when beneficial. In particular, it can be shown that the solutions to the linear scalarization problem
(x) is on the Pareto front, no matter the choice of weights. This means that if one is able to get a few solutions to the linear scalarization problem, one could use them together with the ReduCon Houdayer move in accordance with the methods and systems of the present embodiments to potentially generate many more solutions at the Pareto front.
Those skilled in the art will realize that there are a number of practical applications which can benefit from the ReduCon methods and systems in accordance with the present embodiments. For example, applications in logistics and scheduling can benefit from the ReduCon methods and systems in accordance with the present embodiments. These include logistic planning for cargo shipping, car manufacturing, job shop scheduling or traffic flow optimization. As a shipping example, packages are usually shipped through cargo trucks or planes. While every package has a certain weight and dimension, a cargo vehicle has a fixed capacity and weight limitation. The cost to the shipper of sending a package usually does not depend on the weight, since dispatchers get paid on a contract. The dispatchers, therefore, try to maximize their profit by packing efficiently and shipping the maximum weight in a fixed volume. Improved solutions to this profit optimization problem can advantageously be obtained by the ReduCon methods and systems in accordance with the present embodiments. Another logistics application for the ReduCon methods and systems in accordance with the present embodiments is optimizing solutions for vehicle routing. The vehicle routing problem is an important combinatorial optimization problem in which the goal is to find an optimal set of routes for a fleet of vehicles that deliver goods from an origin (depot) to a set of destinations (customers).
In the area of biology and chemistry, protein folding and molecular similarity problems can benefit from the ReduCon methods and systems in accordance with the present embodiments. Finance, particularly financial investing, can benefit from the ReduCon methods and systems in accordance with the present embodiments. Examples include investment portfolio optimization and portfolio allocation.
Also, there are several software services, products or toolkits where ReduCon could potentially be added or built in to enhance the applicability and performance of the products and provide improved optimization capabilities. For example, Microsoft Azure QIO is a service offering CPU and FPGA implementations of algorithms such as simulated annealing, parallel tempering, population annealing, tabu search, quantum Monte Carlo, and substochastic Monte Carlo. As many of these algorithms are likely to incorporate Houdayer moves, the ReduCon methods and systems in accordance with the present embodiments could improve the algorithms capabilities. A similar Microsoft offering is 1 Qbit (1 Qloud).
Matlab Global Optimization Toolbox (not to be confused with the Matlab Optimization Toolbox), Google OR-Tools, Microsoft Excel, Anylogistix supply chain and logistics software, and Gurobi appear to target similar problems and, even if not presently capable of benefiting from ReduCon methods and systems, may evolve into products that will require the optimization problem solving benefits and advantages provide by the ReduCon methods and systems in accordance with the present embodiments.
To test the advantages of using the ReduCon methods and systems in accordance with the present embodiments to address such problems, a 0-1 Knapsack problem is studied. Given n items numbered 1 to n, each with associated value vi and weight wi, the test is to find which subset of the items to pack into a knapsack with weight capacity W such that the value of the knapsack is maximized. The 0-1 Knapsack problem can be formulated as a linear program as seen in Equation (17):
By adding slack variables, one can easily turn the inequality constraint into an equality constraint, thus recovering a problem of the form of Equation (1) or the form of Equations (6) and (7). In particular, the linear constraint creates a full-connectivity in the underlying graph, which is a perfect use-case for the ReduCon methods and systems in accordance with the present embodiments.
By comparing different variants of the parallel tempering procedure to understand the benefits of the ReduCon methods and systems in accordance with the present embodiments, the Knapsack problem can more thoroughly be investigated and the methods and systems in accordance with the present embodiments can methodically be tested. The variants are changes in the Monte-Carlo moves used within parallel tempering to explore the state space. The Monte-Carlo moves used are single-flips combined with Houdayer moves, and single-flips combined with ReduCon Houdayer moves, in order to understand how ReduCon is able to enhance the results obtained by only using the Houdayer move. Moreover, the parallel tempering algorithm is run for 2500 iterations using 30 temperatures spaced geometrically between 0.001 and 21, where at each temperature two configurations are simulated. The algorithm works as follows: at every iteration, the two configurations at each temperature are first altered by a single-flip independently. Then, for each temperature, the Houdayer (or ReduCon Houdayer) move is performed on the pair at that temperature. Also, only the energy of the two configurations at the lowest temperature are kept track of.
Ten instances with n=100 items were chosen with weights wi chosen uniformly at random in the set {1, . . . , 10}, values given by νi=i, and weight capacity given by W=20. For each instance, the optimal solution was computed via dynamic programming (since the problem size is small) in order to compare our results. Experiments were performed as described hereafter, where for each instance, the parallel tempering scheme described hereinabove was used.
Since the optimum value is known in this example, the relative error for each run was computed at each iteration to see its evolution, noting that given some target value ν and its approximation {tilde over (ν)}, the relative error is defined as
The mean and standard deviation of the relative errors computed over the 10 instances were then plotted in order to visualize the evolution of the average relative error. Results of the multiple aggregated parallel tempering procedures for different types of Monte-Carlo moves are shown in a graph 1300 in
Looking at the graph 1300, some interesting behaviours can be observed. As expected, since the Knapsack has an underlying fully-connected graph, the performance of the single-flips with the Houdayer move is the worst. It is in fact the same as if no Houdayer move was used at all. The most striking result is the outcome of the ReduCon Houdayer move which was able to reduce the relative error by orders of magnitude and even reach the optimal solution, showing the benefit of the ReduCon methods and systems in accordance with the present embodiments.
While the utility of the ReduCon strategy in the context of Houdayer Monte-Carlo moves has been discussed, at its core the ReduCon methods and systems in accordance with the present embodiments is a scheme which allows one to reduce the density of certain types of interaction in a system by introducing more variables (i.e., “register” variables). Thus, it is clear that the ReduCon methods and systems in accordance with the present embodiments can be applied in other situations where the ultimate objective is not to improve the success of Houdayer moves.
Specifically, the ReduCon methods and systems in accordance with the present embodiments may be applied when a high graph density is potentially induced by an optimization constraint, such as that of Equation (6). Optimization problems of this nature are very common (e.g., the Knapsack problem discussed above), and in recent years there has been much interest in solving them in the Ising/QUBO formulation. At the same time, there has emerged a generation of specialized CMOS classical hardware designed specifically for running highly parallel implementations of physics-based algorithms, such as simulated annealing and parallel tempering. Optics-based platforms may also be used to solve complex combinatorial optimization problems, leveraging the natural physical behaviour of the system to find its lowest energy state.
In constructing such hardware, it may be challenging to fully interconnect many thousands of nodes or qubits, resulting in limitations to the scalability of such devices to larger sizes. For the classes of problem where ReduCon may be applied, the ReduCon methods and systems in accordance with the present embodiments could be used to design specialized hardware with a lower density of connectivity nodes, potentially overcoming scalability issues. Since constrained optimization problems similar to the Knapsack problem are of significant economic value, there may be a compelling case for constructing hardware designed specifically to solve them. Thus, ReduCon systems in accordance with the present embodiments implemented in hardware products designed specifically for optimization resolution, and even being designed for specific optimization problem(s), can provide advantages and benefits as discussed herein.
Thus, it can be seen that the present embodiments provide systems and methods for a novel procedure designed to reduce the connectivity of graphs underlying Ising/QUBO problems. In doing so, new avenues are opened to use the Houdayer cluster Monte-Carlo algorithm, enabling one to reach lower energy levels in settings where it would previously fail. In accordance with the present embodiments, systems and methods are provided which enable improved reduction in the density of certain types of interaction in a system by introducing more variables (i.e., “register” variables), those variables, in some instances, forming barriers (i.e., “register barriers”) to allow connected components to not join together and thereby reducing connectivity. Since the ReduCon methods and systems in accordance with the present embodiments describe a relatively general procedure, there are multiple choices to be made when employing them. In particular, schemes in accordance with the present embodiments yield groupings of variables which empirically increase the chance of success of the Houdayer move.
Overall, the methods and systems in accordance with the present embodiments go hand in hand with the Houdayer move and, thus, can be used in any application where the Houdayer move is used, such as the behaviour of parallel tempering on a specific Knapsack instance, an optimization problem where the usual Houdayer move is inefficient. In addition, the ReduCon methods and systems in accordance with the present embodiments allows a variety of complex problems to enjoy the benefits of the Houdayer move, resulting in a better optimization.
While exemplary embodiments have been presented in the foregoing detailed description of the present embodiments, it should be appreciated that a vast number of variations exist. It should further be appreciated that the exemplary embodiments are only examples, and are not intended to limit the scope, applicability, operation, or configuration of the invention in any way. Rather, the foregoing detailed description will provide those skilled in the art with a convenient road map for implementing exemplary embodiments of the invention, it being understood that various changes may be made in the function and arrangement of steps and method of operation described in the exemplary embodiments without departing from the scope of the invention as set forth in the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
10202200507Y | Jan 2022 | SG | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/SG2023/050037 | 1/19/2023 | WO |