This invention relates to solving optimization problems using quantum computer.
Quantum computers have been applied to solution of binary optimization using adiabatic quantum computation, for instance, quantum annealing. An example of such an approach, and a physical realization of a quantum computer to host such an approach is described in US Pat. Pub. 2010/0306142, titled “Methods for Adiabatic Quantum Computation.” This publication, and below referenced patent publications and/or issued patents, are incorporated herein by reference.
A class of binary optimization problems referred to as Polynomial Unconstrained Binary Optimization (PUBO) seeks to find an assignment x ∈ N, where ={0,1}, such that ƒ(x)=min [ƒ(N)]. The function ƒ(x) has a unique representation as
The coefficients cS essentially specify ƒ. These coefficients also specify a Hamiltonian H(ƒ). At least some such Hamiltonians may be used to configure a quantum computer (i.e., a physical machine), yielding an electronic solution of the assignment problem corresponding to the Hamiltonian. In at least some implementations and configurations of a quantum computer, each problem variable xi corresponds to a physical element of the quantum computer. The physical (quantum) state of each element is represented as having two basis elements (corresponding to xi=0 and xi=1) and the (complex) coefficient characterize the physical state of the element. Therefore for N variables xi there are 2N complex variables that characterized the physical state of the corresponding elements of the quantum computer. The Hamiltonian represents an operator on the set of the quantum states of the computer elements, which through physical coupling of elements of the computer defines a physical time evolution of the states. When the operators qi of a Hamiltonian
are coupled with the same coefficients cS as the xi are coupled in ƒ( ), then the vector x that minimizes ƒ( ) corresponds to an eigenvector of the Hamiltonian and the physical computer physically maintains that state.
In practice, there may a number of constraints on the Hamiltonians that can be used to configure a quantum computer. First, the maximum degree of coupling may be limited, for instance, to two, such that there each subset S has at most two elements, such that
Such Hamiltonians are referred to as “2-local.” In some hardware implementations of quantum computer, a graph whose nodes represent variables xi and links represent pairs of variables that are constrained by non-zero coupling coefficients αij is embedded in a hardware based graphs that implements the constraints. An example of such an approach to graph embedding is described in U.S. Pat. No. 8,244,662, titled “Graph Embedding Techniques.” A second constraint of certain quantum computers is on the precision or range of values cS that may be represented in configuration of the hardware. For example, certain quantum computers are limited to a precision range of 16 values (e.g., ±1, . . . ,±8). A third constraint is that a quantum computer is generally limited in the number of variables xi, each corresponding to a quantum bit represented in a corresponding hardware element in the quantum computer.
In order to apply a quantum computer to a problem that is specified by a k-local form, where k>2 (e.g., k=3), one approach is to introduce ancilla variables. For example, one approach is to transform a 3-local specification on a set of variables x1 to xN, by introducing a set of ancilla variables xij, each corresponding to a pair of variables xi and xj. In this way, terms of the form xixjxk may be replaced with quadratic terms xij xk. In some examples, the problem variable pairs to combine to form the ancilla variables are selected at random.
However, in general, introduction of ancilla variables can easily cause the total number of variables and/or the required precision of the coupling coefficients to exceed the capacity of the quantum computer.
There is a need configure a quantum computer to solve a 3-local (or more generally k-local for k=3, 4, or >4) optimization within the hardware capacity for number of variables and/or available precision of coupling coefficients. For example, arbitrary introduction of ancilla variables may achieve the technical requirement of conversion of a problem to a 2-local form, but that converted form may exceed the number of variables that can be supported by the quantum computer hardware or require a precision that exceeds the available precision of the quantum computer.
In one aspect, in general, a method is for use in solution of a problem of determining values of a set of N problem variables xi by a quantum processor that has a limited number of hardware elements for representing quantum bits and/or limitations on coupling between quantum bits. The method includes accepting a specification of the problem that includes a specification of a set of terms where each term corresponds to a product of at least three variables and is associated with a non-zero coefficient. A set of ancilla variables, each ancilla variable corresponding to a pair of problem variables, is determined by applying an optimization procedure to the specification of the set of the terms. The accepted problem specification is then transformed according to the determined ancilla variables to form a modified problem specification for use in configuring the quantum processor and solution of problem.
Aspects may include one or more of the following features.
The modified problem is solved using the quantum processor using an adiabatic quantum annealing approach.
The optimization procedure includes a set covering procedure in which a minimal set of variable pairs cover all the terms of the set of terms. In some examples, applying the optimization procedure includes applying an Integer Linear Programming (ILP) procedure. In some examples, applying the optimization procedure includes applying a greedy procedure to determine successive ancilla variables.
Applying the optimization procedures includes selecting the ancilla variables to limit (e.g., to minimize) a required precision required for solution of the modified problem.
In some examples, multiple ancilla variables are determined for at least some of the problem variable pairs. For instance, three ancilla variables are determined for at least some problem variable pairs.
The problem comprises a Polynomial Binary Optimization Problem (PUBO).
In another aspect, a specification compiler (e.g., software including instructions for causing a computer to perform the transformation of the problem specification outlined above) is used to configure a programmable computer, which then implements the transformation procedure from a problem specification to a computing specification suitable for application on a quantum computer.
In another aspect, in general, the approach to forming the modified problem specification is applied to solution on the problem using a digital implementation of an annealing processor.
One or more of the approaches described in this application address a technical problem stemming from limitations on the form of problem specifications that may be handled by quantum computers that have limitations on the number of qubits, the coupling between qubits (e.g., a limitation that coupling is 2-local), and the precision of coupling. A solution provided herein addressed this technical problem by transforming an initial problem specification according to technical considerations regarding how this transformation is performed, for example, according to which subsets of variables should be represented as ancilla variables that are then physically realized as qubits in the quantum computer hardware elements. These technical considerations are incorporated into the optimization of the selection, for example, as represented in an Integer Linear Programming or a Greedy selection approach.
Other features and advantages of the invention are apparent from the following description, and from the claims.
Referring to
The approaches described below are based on transformations of the problem specification to form the computing specification prior to control of the quantum computer according to the computing specification. These transformations may be considered somewhat analogous to the compilation of a high-level computing language to form an assembly language specification, which can then be further processed before execution on a digital computer.
The transformation of the problem specification to form the computing specification is implemented on a digital computer having a processor 126 as well as program memory, referred to in
It should be understood that the output of the Specification Compiler is not necessarily directly in a form for control of the hardware in the quantum computer. For example, as discussed in the Background, further transformations, for example based on graph embedding, may be performed on the output of the Specification Compiler before the quantum computer controller can use the result for controlling the quantum computer hardware. As discussed above, the computing specification may be processed further to determine characteristics of physical coupling between hardware elements (i.e., the qubits) of the quantum computer so that the time evolution of the state of the hardware elements configured based on the computing specification is used to determine the problem solution. In the case in which the problem specification represents a physical problem (e.g., an assignment problem), the user or the system controller may used the problem solution provided by the quantum computer to implement a physical solution to based on the problem solution.
As outlined above, the general procedure implemented by the Specification Compiler takes a problem specification of an problem, for example, an binary assignment problem, in the form of a coefficients for a polynomial ƒ(x), which in this example, includes coefficients of terms of the form xixjxk. In at least some embodiments, M (e.g., M=1, M=3) ancilla variables xij(m) are introduced for each pair of a selected subset of variable pairs (xi, xj). This selection is not arbitrary or random. Rather, these pairs are selected according to one or both of the considerations of (a) the total number of variables, for example, so that the number does not exceed the number of qubit hardware elements in the quantum computer, and/or (b) limiting the range (“precision”) of the coefficients, which represent degrees of coupling between the qubit representations of the variables in the quantum computer.
More specifically, one or more of the embodiments apply exact classical gadgets. In order to efficiently reduce locality, many-body terms are collapsed in a systematic fashion that takes into account the appearance of specific pairs of qubits in multiple higher-order terms. For applications in which qubits are the limiting resource, we show below how to map the optimal reduction to set cover and 0-1 integer linear programming (ILP) so that conventional solvers can be leveraged to quickly find the best encoding. For control precision limited problems we formalize the optimal reduction problem and provide a greedy algorithm that significantly outperforms the status quo.
Referring to
In order to compile NP-hard optimization problems into an experimental Hamiltonian, the problem of interest is encoded into a graph of binary variables with physically realizable interactions. Perhaps the simplest model of interacting binary variables is Polynomial Unconstrained Binary Optimization (PUBO): given a pseudo-Boolean function ƒ:N→, find an assignment x∈N such that ƒ(x)=min [ƒ(N)], where ={0,1}. Every pseudo-Boolean ƒ has a unique multi-linear polynomial representation
where cS ∈. From this expression we can construct an optimization Hamiltonian that embeds the energy landscape of a given PUBO in its eigenspectrum,
acting on N qubits, where qi=(I−Zi) and Zi is the Pauli matrix σz acting on the i th qubit, i.e.
Z
i
=I
⊗(i−1)⊗σz⊗I⊗(N−i), (3)
where I is the one-qubit identity operator. Note that while we write H (ƒ) for convenience, in practice ƒ will be specified by its coefficients cS. Every element |x of the computational basis is an eigenstate of H(ƒ) with eigenvalue ƒ(x). Specifically, the ground state of H(ƒ) is spanned by the set of states |x such that ƒ(x)=min[ƒ (N)].
However, experimental interactions using a quantum computer are typically limited to pairwise couplings between qubits, allowing Hamiltonians of the form
where αij∈. The case in which the indices are equal is used to include 1-local terms: qiqi=qi. Such Hamiltonians correspond to a second-order pseudo-Boolean ƒ,
Thus, to encode a general instance of PUBO into an experimentally realizable Hamiltonian, one reduces the problem to Quadratic Unconstrained Binary Optimization (QUBO), defined analogously to PUBO with the restriction that the pseudo-Boolean function to be minimized is quadratic. In practice, many common optimization problems have been reduced to PUBO in such a way that the pseudo-Boolean function to be minimized is cubic, i.e. of the form
It is therefore desirable to have a general method for reducing a cubic function ƒ: N→ to a quadratic function ƒ′: N′→ in such a way that an assignment x∈N that minimizes ƒ can be efficiently computed given an assignment x′ that minimizes ƒ′, where N′ is a polynomial function of N. One family of methods employs a set of N′−N ancilla variables {y1, . . . , yN′−N} ∈ N′−N such that if (x1, . . . ,xN, y1, . . . yN′−N) minimizes ƒ′, then (x1, . . . ,xN) minimizes ƒ. That is, a minimizing assignment (x1, . . . ,xN) of ƒ is directly encoded in the N computational qubits of a ground state |x1 . . . xNy1 . . . yN′−N of H (ƒ′). In the methods examined here, each ancilla variable corresponds to a pair of computational variables (i, j) and so for convenience is denoted by xij or xij(m).
Integral to the exact gadget is the penalty function
s(x,y,z)=3z+xy−2xz−2yz, (6)
with the important property that s (x, y, z)=0 if xy=z and s (x, y, z)≥1 if xy≠z, as shown in Table 1, below. While s is not the only quadratic ternary pseudo-Boolean with this property, we show that it is optimal for our purposes.
In our reductions, we replace a part xixj of a 3-local term xixjxk with xij, where xij is an ancilla variable, thereby reducing locality, while simultaneously adding the penalty function s (xi, xj, xij), scaled by an appropriate factor to ensure that the value of the reduced form is greater if xij≠xixj than it is if xij=xixj, for any assignment of the computational variables. In this way, we ensure that if an assignment of the computational and ancilla variables minimizes the reduced form, then that assignment of the computational variables also minimizes the original form. Consider the reduction
αijkxixjxk→αijkxijxk+(1+|αijk|)s(xi,xj,xij). (7)
If xij=xixj, then s (xi,xj,xi)=0 and the reduced form simplifies to the unreduced form αijkxixjxk. If xij=1−xixj, then s(xi, xj,1−xixj)=3−2xi−2xj+2xixj and the reduced form always has a greater value than it does if xij=xixj. That is,
for all xi, xj, and xk.
To decrease the number of ancilla variables needed to reduce many 3-local terms, it is advantageous to use the same ancilla variable xij to reduce more than one 3-local term. Let Kij be the set of indices k such that the term xixjxk is reduced using the ancilla variable xij corresponding to the pair of variables {xi, xj}. Each non-zero 3-local term is reduced using exactly one ancilla, and so we must choose {Kij} such that for each αijk≠0, there is exactly one pair of indices {w, v} with {w, v, Kwv}={i, j, k}. (Note that the indices on the coefficients are unordered, e.g. αijk=αkji=αjki.) Then the entire set of 3-local terms can be reduced by
where the single term reduction in Eq. (7) is applied to every term in the rewritten original expression. The essential conditions (that, for any i and j for which an ancilla variable is used, the value of the reduced form is greater if xij≠xixj than the value thereof if xij=xixj and in the latter case the reduced form is equal to the original form) are preserved by linearity. We explain below a method for choosing which pair of variables to use to reduce each 3-local term (i.e. for choosing Kij with the constraints given) in a way that minimizes the total number of ancilla variables (the number of non-empty Kij). This strategy can be generalized as discussed below to minimize the number of ancilla required in 4-local to 2-local reductions.
It is often the case that the limiting factor in encoding a PUBO instance into experimentally realizable form is the control precision rather than the number of qubits available. Existing hardware is able to implement 2-local Hamiltonians of the form in Eq. (4) such that the coefficients are integral multiples of a fixed step size Δα with a maximum magnitude of NαΔα, where Nα is the control precision. An arbitrary 2-local Hamiltonian can be made to have coefficients that are integral multiples of Δα by dividing them all by their greatest common divisor and multiplying by Δα. The control precision needed for an arbitrary instance is thus the quotient of the greatest magnitude of the coefficients and their greatest common divisor. We assume without loss of generality that the coefficients of the PUBO to be reduced are integers and structure the reductions so that the reduced QUBO also has integral coefficients. The greatest common divisor of the coefficients of the reduced QUBO is thus one with high probability, and the control precision needed is the greatest magnitude of the coefficients. As a preliminary, we show that s as defined is optimal in that the greatest coefficient (3) cannot be reduced any further.
Suppose ƒ(x1, x2, x3) is a quadratic pseudo-Boolean function with integer coefficients (i.e. in the form of Eq. (5)) such that ƒ(x1, x2, x3)=0 if x3=x1x2 and is at least one otherwise. First note that ƒ(0, 0, 0)=0 and thus that ƒ(1,0,0)=α11=0 and ƒ(0,1,0)=α22=0. Because ƒ(1,1,1)=α33+α12+α13+α23=0, α33+α23=−α12−α13, and so ƒ(0,1,1)=α33+α23=−α12−α13≥1, which implies α13≤−α12−1. Because α12=ƒ(1,1,0)≥1, α13≤−2. Finally, ƒ(1,0,1)=α33+α13≥−1 and so α33≥1−α13≥3.
For each i and j in the reduction shown in Eq. (9), (1+|αijk|)s(xi,xj,xij) is added for each k ∈ Kij, and so the coefficients in s (xi, xj, xij) are multiplied by Σk∈K
for each i and j. For all xi, xj, and {xk|k ∈Kij},
That is, for any assignment of the computational variables, the value of the reduced form is greater if the ancilla variable xij≠xixj than it is if xij=xixj. The δij given in Eq. (10) is optimal in the sense that it requires the least control precision of all possibilities which satisfy the appropriate conditions. Consider the reduced form
for some δ ∈ to be determined. We guarantee that
for all xi, xj, and {xk|k ∈Kij}. For xi=1 and xj=0 or xi=0 and xj=1, this inequality simplifies to
for xi=xj=1 it simplifies to
and for xi=xj=0 it simplifies to
Eq. (16) is implied by Eq. (14) and so it is sufficient to ensure that Eq. (14) and Eq. (15) are satisfied. We see that the term −Σk∈K
and so if and only if
then Eq. (14) is satisfied for all {xk|k ∈Kij}. The term Σk∈K
then Eq. (15) is satisfied for all {xk|k ∈Kij}. Together, Eq. (18) and Eq. (19) and imply that
Note that the terms introduced in Eq. (10) only appear in the reduction for that pair (i, j), and so the coefficient for a term therein is the coefficient in the total reduced form, with the exception of xixj which may also appear in the original unreduced form, which is to be addressed later. The greatest term introduced in Eq. (10) is 3δij, which greatly increases the control precision needed.
Below, we introduce an alternative method that adds terms whose greatest coefficient is approximately a third of this. Because the complexity of the final form obscures the simplicity of the method, we begin with a special case and extend it gradually to the general case. To reduce a single term whose coefficient is divisible by three, we introduce three ancillary bits and penalty functions:
When xij(1)=xij(2)=xij(3)=xixj, the reduced form simplifies to αijkxixjxk. Otherwise, it is always greater than αijkxixjxk, and so the reduction is valid. Furthermore, the greatest coefficient introduced is 3+|αijk|. In general however, the coefficient will not be divisible by 3. In that case, we define a new coefficient βijk(m) for each ancilla variable xij(m) that depends on αijk mod3 such that each βijk(m) is an integer and Σm=1βijk(m)=αijk. This is elucidated by Table 2, below:
We now use the reduction
If xij(1)=xij(2)=xij(3)=xixj, then s(xi,xj,xij(m))=0 and this simplifies to αijkxixjxk. We can rewrite the replacement terms as,
In all cases and for each m
βijk(m)xijk(m)xk+(1+|βijk|)s(xi,xj,xij(m))≥βijk(m)xixjxk. (24)
If not xij(1)=xij(2)=xij(3)=xixj, strict inequality holds for at least one m and the replacement terms are greater than αijkxixjxk. Here, the greatest coefficient is
3+max{3|βijk(m)|,|αijk|}. (25)
Finally, we use the same set of ancilla variables {αij(m)} to reduce all of the 3-local terms:
and Kij is defined as above with the same constraints. In the reduced form, for every i, j, and m the coefficient of xij(m) is 3δij(m) and for every i and j the coefficient of xixj is Σm=13δij(m). The latter will be added to the coefficient αij of the corresponding quadratic term in the original expression. Thus the control precision needed is
Below we describe a greedy algorithm to find a set of Kij that greatly decreases the control precision needed.
Certain of the classical gadgets described above have already been characterized in the literature. However, such available characterization is not enough to efficiently encode a problem. Below we describe how to efficiently apply these gadgets so that the resulting Hamiltonian meets the demands of available hardware. For simplicity, and because it is the most frequently encountered situation, we will focus first on reductions from 3-local to 2-local, although the techniques are applicable to 4-local to 2-local reduction as well.
When working with a qubit limited encoding, the goal in applying these gadgets is to choose the smallest set of qubit pairs that collapses all 3-local terms. We explain how to cast this problem as canonical set cover and map to 0-1 ILP so that popular optimization software can be leveraged to find the optimal set of collapsing pairs. When working with a control precision limited encoding, the goal is to choose the set of qubits for which the sum of penalty functions contains the smallest maximum coefficient. We approach this problem with a greedy algorithm but later show numerics which validate the efficiency of our technique.
The qubit-optimized application of classical gadgets can be cast as set cover. In this context, the universe U that we seek to cover is the set of 3-local terms that we must collapse. For example, U={x1x2x3, x1x4x5,x2x3x5}. Treating each 3-local term as a set of single qubits, we define A as the union of all 2-subsets of each 3-local term. In the example given,
Next, we construct S by replacing each element Ai with the union of proper supersets of Ai in U,
In this way, A is the set of products of pairs of qubits xixj that can be used in the reduction, and each element Si is the set of 3-local terms that the corresponding Ai can be used to reduce. The problem is clearly setcover if we view the 3-local terms as elements (as opposed to sets themselves). Given U and S, find the minimal covering set, i.e. *argmin{C|C⊆S∧∪C=U} |C|. In this form, the problem is easily cast as 0-1 ILP. 0-1 ILP is the problem of finding a Boolean-valued vector v that minimizes the quantity cTv subject to Mv≥b. In set cover each element of v is a Boolean which says whether or not to include the associated element of S in the cover C. Thus, c is a vector of ones with length equal to the cardinality of S so that the cost function cTv represents the cardinality of C.
The matrix M multiplies v to set up a system of equations which guarantees that C covers U. Thus, the matrix element Mij is 1 if the Sj contains the Ui and 0 otherwise. Accordingly b is a vector of all ones with length equal to the cardinality of U. Both setcover and 0-1 ILP are well known to be NP-Complete. In fact, the exact problem of cubic to quadratic polynomial binary reduction has been shown to be NP-Complete by analogy with vertex cover.
In
For the case of 3-local to 2-local PUBO reduction, the complexity of a random problem instance is characterized by the number of logical qubits, n, and the number of 3-local clauses, λ. While ancilla requirements scale as 3λ for perturbative gadgets and λ for exact gadgets without optimized application, numerics from
Unfortunately, we should not expect to do better than a quadratic improvement for extremely large problem sizes because the constant scaling region appears to coincide with the most difficult to reduce problem instances as indicated by the computational time scaling in
To minimize the control precision, as expressed in Eq. (27), we develop a greedy procedure which chooses the collapsing pairs, {Kij}. Recall that Kij is the set of indices k such that the term xixjxk is reduced using the ancilla variable xij corresponding to the pair of variables (xi,xj). In the following pseudo-code we employ the convention that K({i, j})=Kij, α({i,j,k})=αijk, and α({i, j})=αij for ease of exposition.
The above procedure is initialized by setting K({i, j}) to the empty set for every pair of variable indices {i, j}, and by collecting the triplet of variable indices {i, j, k} for every 3-local term αijkxixjxk with a non-zero coefficient αijk into the set A. We also introduce the notation B(a) for the set of three pairs of indices contained by a triplet of indices a, e.g. B({i, j, k})={{i, j}, {i, k}, {j, k} }. The remainder of the procedure consists choosing a 3-local term (as represented by the set of indices of its variables d) and a pair of variables contained therein (also represented by their indices Δ(d)) with which to collapse it, which is repeated until such a choice has been made for every term that we wish to collapse. Throughout, the set A contains those terms for which the decision has not been made.
The repeated procedure is as follows: first, for every 3-local term α ∈ A for a which a pair has not been chosen with which to collapse it and for every pair therein b ∈ B(a), the cost of collapsing the term a using that pair b is calculated. The cost is defined as w(a,b)=α(b)+3+max {Σθ∈Θ+θ, Σθ∈Θ
*argminx∈Xƒ(x)={x∈X|ƒ(x)=*minx∈Xƒ(x)}.
If there is more than one such pair, we find which of those is contained in the fewest number of terms in A, those for which a choice has not yet been made. If there is then more than one such pair, a pair Δ(a) is chosen arbitrarily. Having found the minimum cost w(a, Δ(a)) of each term a e A, we find the set of terms with the minimum cost D and choose one d arbitrarily. Finally, we append the index in d that is not in the reduction pair Δ(d) to K(Δ(d)) and then remove the term d from the set A of terms for which a decision needs to be made. This procedure is repeated until a reduction pair has been chosen for every term, i.e. until A is empty.
Referring to
While we do not claim that this greedy algorithm is optimal, we present numerical evidence to show that it outperforms the default approach of selecting Kij in a non-systematic fashion.
Above we have expanded the definition of an exact classical gadget and formalized the difficult problem of efficiently applying these tools. We introduced a novel and useful form of classical gadgets that uses multiple ancilla qubits to decrease the required control precision of compiling arbitrary problems. Using this new gadget we derived Eq. (27), a general expression for the optimal control precision of a 3-local to 2-local reduction. While exactly solving this equation appears extremely difficult, we introduced a simple greedy algorithm which significantly outperforms the status quo. For the problem of minimizing ancilla qubit requirements during 3-local to 2-local reduction, we demonstrated how to map the problem to set cover which allowed us to find minimal ancilla encodings with the use of Integer Linear Programming. These techniques will be very useful to anyone wishing to compile classical problems into realizable Hamiltonians for adiabatic quantum computation. These techniques are applicable to problems in protein folding and related optimization problems of interest to chemistry and biophysics.
We show how the problem of reducing a quartic pseudo-Boolean to a quadratic one using the minimum number of ancilla bits can be recast as Weighted Max-SAT (WMAXSAT). An instance of WMAXSAT consists of a set of clauses, each of which is a disjunction of literals, and a function w that assigns a non-negative weight to each clause; the problem is to find an assignment that maximizes the sum of the weights of clauses satisfied thereby.
Consider an arbitrary 4-local term xixjxkxl. It can be reduced to 2-local in two ways, both of which require two ancilla bits. The first way is to use two ancilla bits that each correspond to the conjunction of two computational bits. For example, the term can be reduced using the ancilla bits xij and xkl, which entails replacing the term xixjxkxl with xijxkl and adding the penalty functions s(xi, xj, xij) and s(xk, xl, xkl), scaled by the appropriate factor. Similarly, the term can also be reduced using xik and xjl, or xil and xjk. The second way is to use an ancilla bit corresponding to the conjunction of three bits, which requires a second ancilla bit. (No quadratic pseudo-Boolean ƒ(x, y, z, a) exists such that ƒ(x, y, z, a)=0 if a=xyz and ƒ(x, y, z, a)≥1 otherwise, which can be shown in a similar manner to that of the proof that the minimum coefficient in the penalty function for the conjunction of two variables is three.) For example, the term xixjxkxl can reduced to 2-local using the ancilla bits xijk and xij, where xijk corresponds to the conjunction of xij and xk. (Accordingly, just as the indices of the ancilla bit xij were unordered, i.e. xij=xji, so are the subscript indices of the ancilla bit xijk, i.e. xijk=xjik, though the distinction between subscript and superscript indices must be made. Though in reducing a single term the choice of which pair of computational bits to use for the intermediary ancilla bit is unimportant, when reducing several the same ancilla bit may be used as an intermediary for several ancilla bits each corresponding to the conjunction of three computational bits.) The reduction entails replacing the term by xijkxl and adding the penalty functions s(xi, xj, xij) and s(xij, xk, xijk), scaled by the appropriate factor. There are twelve distinct ancilla bit pairs that can be used to reduce the term using the second way.
Now consider a quartic pseudo-Boolean
that we would like to reduce to quadratic. Let T3 and T4 be sets of the sets of indices of the variables in the 3-local and 4-local terms with non-zero coefficients, respectively, i.e.
T
3
={{i,j,k}⊂{1, . . . ,N}|αijk≠0} (31)
and T4={{i,j,k,l}⊂{1, . . . ,N}|αijkl≠0}. (32)
For each ancilla bit xij that represents a conjunction of two computational bits, we introduce a Boolean variable rij ∈ {true, false} that represents its actual use. For each triplet of computational bits {xi, xj, xk}, we introduce a Boolean variable rijk ∈ {true, false} corresponding to the use of an ancilla corresponding to their conjunction, regardless of which intermediate ancilla bit was used. While the choice of intermediate ancilla bit must be made when doing the reduction, the minimum set of ancilla bits used in a reduction cannot contain two distinct ancilla bits corresponding to the conjunction of the same three ancilla variables and so here there is no need to make the distinction. Let
There are three sets of clauses that must be included. First, the goal is to minimize the number of ancilla bits used in the reduction, and so for each variable representing the use of a unique ancilla bit we include the single-literal clause consisting of its negation, and assign to each such clause a weight of 1:
={(
and w(C)=1 for every C ∈ . This first set consists of so-called soft clauses. The remaining two sets of clauses and consist of hard clauses, those that must be satisfied. This is ensured by assigning to every hard clause a weight greater than the sum of the weights of a1 the soft clauses. Here, we set w(C)=||+1=|R|+1 for every C∈ U. Note that
Second, we must ensure that for each ancilla bit used that corresponds to the conjunction of three computational bits there is at least one intermediate ancilla bit that can be used in its construction, i.e.
(rijk→(rij∨rik∨rjk))≡(
Let
={(
Third, we must ensure that the set of ancilla bits used reduces all the cubic and quartic terms. A cubic term xixjxk can be reduced using xij, xik, or xjk, i.e. if (rij∨rij∨rjk). Note that while an ancilla bit corresponding to the term itself can be used to reduce it to 1-local, that ancilla bit can only be constructed using one of the three ancilla bits mentioned, and any one of those three is sufficient to reduce the term to quadratic. A quartic term xixjxkxl can be reduced using one of twelve ancilla bits (though each requires an intermediary). These twelve can be partitioned into four triplets by the triplet of variables whose conjunction they correspond to, i.e. by the Boolean variable that represents the use of any one. Thus the quartic term can be reduced to quadratic if (rijk ∨rijl∨rikl ∨rjkl). It can also be reduced using two ancilla bits that correspond to the conjunctions of disjoint pairs of computational bits, i.e. if ((rij∧rkl) ∨(rik∨rjl) ∨(ril ∧rjk)). These clauses must be written in conjunctive normal form:
Finally, let =++. The WMAXSAT instance is specified by and
We prove here that the minimum number of ancilla variables needed to reduce all 3-local terms over n variables to 2-local is
and therefore that the minimum number of ancilla variables needed to reduce any set of 3-local terms over n variables is upper-bounded by the same.
The basis of the proof is Mantel's Theorem: A triangle-free graph with n vertices can have at most
vertices. We identify a set of ancilla bits A used to reduce locality with the edge set E(A) of a graph G(A) whose vertices V={vi|1≤i≤N} correspond to the computational variables and in which there is an edge between any two vertices vi and vj if and only if the anncilla bit xij representing the conjunction of the corresponding computational bits xi and xj is used. (In reducing a cubic pseudo-Boolean to a quadratic, only ancilla bits of this type are needed.) The set of ancilla bits A can be used to reduce all possible 3-local terms if and only if for every set of three computational bits there is at least one ancilla bit in A corresponding to the conjunction of any two. In graph-theoretic terms, A can be used to reduce all 3-local terms if and only if every possible triangle in the complete graph with the same the vertex set V contains at least one edge in E(A), or equivalently if the complement EC(A) of E(A) is triangle-free. Suppose that the set of ancilla bits A reduces all 3-local terms. Then by Mantel's Theorem
this yields
Furthermore, by construction we show that the minimal set reaches this bound.
Let E={{vi, vj} (1≤i<j≤┌N/2┐)∨(┌N/2┌+1≤i≤j≤N}. That is, partition the vertices into sets of as equal size as possible and include an edge between every pair within each set. Let N=2m+b as above. The total number of edges constructed in this way is
It is to be understood that the foregoing description is intended to illustrate and not to limit the scope of the invention, which is defined by the scope of the appended claims. Other embodiments are within the scope of the following claims.
This application claims the benefit of U.S. Provisional Application No. 61/859,388, filed on Jul. 29, 2013, which is incorporated herein by reference in its entirety.
This invention was made with government support under contract M1144-201167-DS awarded by the United States Department of Defense. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
61859388 | Jul 2013 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14908260 | Jan 2016 | US |
Child | 16505366 | US |