Hypothesis and Theory ARTICLE
Exponential Enhancement of the Efficiency of Quantum Annealing by Non-Stoquastic Hamiltonians
- Department of Physics, Tokyo Institute of Technology, Tokyo, Japan
Non-stoquastic Hamiltonians have both positive and negative signs in off-diagonal elements in their matrix representation in the standard computational basis and thus cannot be simulated efficiently by the standard quantum Monte Carlo method due to the sign problem. We describe our analytical studies of this type of Hamiltonians with infinite-range non-random as well as random interactions from the perspective of possible enhancement of the efficiency of quantum annealing or adiabatic quantum computing. It is shown that multi-body transverse interactions like XX and XXXXX with positive coefficients appended to a stoquastic transverse-field Ising model render the Hamiltonian non-stoquastic and reduce a first-order quantum phase transition in the simple transverse-field case to a second-order transition. This implies that the efficiency of quantum annealing is exponentially enhanced, because a first-order transition has an exponentially small energy gap (and therefore exponentially long computation time) whereas a second-order transition has a polynomially decaying gap (polynomial computation time). The examples presented here represent rare instances where strong quantum effects, in the sense that they cannot be efficiently simulated in the standard quantum Monte Carlo, have analytically been shown to exponentially enhance the efficiency of quantum annealing for combinatorial optimization problems.
1. Introduction
Quantum annealing is a metaheuristic for combinatorial optimization problems (Kadowaki, 1998; Kadowaki and Nishimori, 1998; Brooke et al., 1999; Farhi et al., 2000, 2001; Santoro et al., 2002; Santoro and Tosatti, 2006; Das and Chakrabarti, 2008; Morita and Nishimori, 2008). A combinatorial optimization problem can generally be expressed as the minimization of an Ising Hamiltonian, i.e., the ground-state search of a classical Ising model (Lucas, 2014). Then, quantum fluctuations are appended, typically as a uniform transverse field, and the total Hamiltonian constitutes the transverse-field Ising model. The amplitude of the appended term for quantum fluctuations is gradually decreased from a very large value, large relative to the original classical Ising model, toward zero. If one starts from the ground state of the initial Hamiltonian and the rate of change of the amplitude is sufficiently slow, the system follows the instantaneous ground state according to the adiabatic theorem of quantum mechanics. This implies that the system eventually reaches the ground state of the original Ising model representing the solution to the combinatorial optimization problem. There exists a large body of analytical, numerical, and experimental studies on quantum annealing, and active debates are going on to compare quantum annealing with the corresponding classical heuristic, simulated annealing, recent examples of which include Matsuda et al. (2009), Young et al. (2010), Hen and Young (2011), Farhi et al. (2012), Boixo et al. (2014), Katzgraber et al. (2014, 2015), Rønnow et al. (2014), Albash et al. (2015), Heim et al. (2015), Hen et al. (2015), Isakov et al. (2016), Martin-Mayor and Hen (2015), Steiger et al. (2015), Venturelli et al. (2015), Crosson and Harrow (2016), Denchev et al. (2016), Kechedzhi and Smelyanskiy (2016), Mandrà et al. (2016a,b), Marshall et al. (2016), and Muthukrishnan et al. (2016).
To numerically test the performance of quantum annealing, one often uses quantum Monte Carlo simulation, which is a classical algorithm to sample the equilibrium distribution of the transverse-field Ising model. Although the quantum Monte Carlo simulation is designed to sample the equilibrium Boltzmann distribution, it has been found that some aspects of dynamics of quantum annealing can also be described by quantum Monte Carlo simulations (Isakov et al., 2016; Denchev et al., 2016; Jiang et al., 2017). Also remarkable are the generic convergence conditions for quantum annealing under quantum dynamics (Morita and Nishimori, 2007, 2008; Somma et al., 2007) and quantum Monte Carlo simulations (Morita and Nishimori, 2006, 2008), both of which have a very similar asymptotic polynomial decrease of the control parameter that is much quicker than the corresponding inverse-log law for simulated annealing (Geman and Geman, 1984). These observations suggest the possibility that quantum annealing might be efficiently simulated on classical computers even for its dynamical aspects, the latter being important to judge the performance of quantum annealing. If this is indeed the case, the role of dedicated hardware to run quantum annealing may have to be reconsidered.
Related to the above observation is the concept of stoquastic Hamiltonians (Bravyi et al., 2008). Loosely speaking, this is a class of Hamiltonians that can usually be simulated efficiently on classical computers because there is no sign problem in the standard classical implementation using the Suzuki–Trotter decomposition (Suzuki, 1976). More formally, a stoquastic Hamiltonian has off-diagonal elements that are all non-positive in the standard computational basis, i.e., the basis that diagonalizes the z component of the Pauli matrix $\sigma_i^z$ at each site i. The transverse-field Ising model belongs to this category. A non-stoquastic Hamiltonian, by contrast, has both signs in the off-diagonal elements, which causes negative signs in the effective Boltzmann factors when Trotter-decomposed to run simulations on classical computers. This means that it is practically impossible to classically simulate non-stoquastic Hamiltonians by the standard method. It may then be the case that a proper term added to a stoquastic Hamiltonian, which causes both signs in the matrix representation in the computational basis, represents strong quantum effects not to be classically simulated in a straightforward manner. Such a term might lead to enhanced performance of quantum annealing as compared to the conventional method with the stoquastic transverse-field Ising model. In this connection, note that the transverse-field Ising model with longitudinal fields can be universal in quantum computation if XX interactions are added with appropriate coefficients (Biamonte and Love, 2008).
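The sign criterion above can be checked directly on a small matrix. The following is a minimal sketch for two spins; the helper name `is_stoquastic` and the specific two-spin Hamiltonian are illustrative assumptions, not taken from the cited works.

```python
import numpy as np

# Single-spin operators in the computational (sigma^z) basis.
I2 = np.eye(2)
X = np.array([[0.0, 1.0], [1.0, 0.0]])
Z = np.array([[1.0, 0.0], [0.0, -1.0]])


def is_stoquastic(h, tol=1e-12):
    """Return True if every off-diagonal element of h is real and non-positive."""
    h = np.asarray(h)
    off_diag = h - np.diag(np.diag(h))
    return bool(np.all(np.abs(off_diag.imag) <= tol) and np.all(off_diag.real <= tol))


# Two-spin transverse-field Ising model: -Z1 Z2 - (X1 + X2).
h_tfim = -np.kron(Z, Z) - (np.kron(X, I2) + np.kron(I2, X))
xx = np.kron(X, X)

print(is_stoquastic(h_tfim))        # transverse-field Ising model
print(is_stoquastic(h_tfim + xx))   # XX term with a positive coefficient added
print(is_stoquastic(h_tfim - xx))   # XX term with a negative coefficient added
```

Only the XX term with a positive coefficient introduces positive off-diagonal elements, which is the situation studied in the rest of the article.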
There exist several studies related to this idea. Farhi et al. (2002) investigated the effects of randomly generated non-stoquastic Hamiltonians in a variant of the infinite-range Ising model and found that a finite fraction of examples showed enhancement of performance compared to the stoquastic case. Crosson et al. (2014) ran extensive numerical tests of hard MAX-2SAT problems by directly solving the Schrödinger equation for small-size systems. They concluded that additional terms, which make the Hamiltonian non-stoquastic, improve the success rate, although not decisively better than stoquastic cases. Hormozi et al. (2016) numerically studied the spin-glass problem to find that non-stoquastic Hamiltonians have improved success probabilities for hard instances, possibly not by increasing the energy gap for strict adiabatic evolution but by promoting diabatic transitions. Seki and Nishimori (2012, 2015) and Seoane and Nishimori (2012) used quantum statistical–mechanical techniques to analyze systematically the infinite-range Ising models with ferromagnetic as well as random interactions to conclude that additional terms, by which the Hamiltonian becomes non-stoquastic, sometimes reduce first-order quantum phase transitions in the stoquastic Hamiltonian to second-order transitions. This means an exponential enhancement of the efficiency, exponential in the system size, because second-order quantum phase transitions have the minimum energy gap that decreases polynomially as a function of the system size whereas first-order transitions have an exponentially small gap. Remember that the adiabatic theorem states that the time needed for a system to stay close to the instantaneous ground state is proportional to the inverse of a polynomial of the minimum energy gap (Jansen et al., 2007; Lidar et al., 2009; Elgart and Hagedorn, 2012).
The present article describes the findings in Seki and Nishimori (2012, 2015) and Seoane and Nishimori (2012) from the viewpoint of possible enhancement of the efficiency by non-stoquastic Hamiltonians, which was not mentioned explicitly in those papers. Also explained is the effect of interactions of the system with its environment.
2. Ferromagnetic p-Spin Model
Let us first consider the ferromagnetic p-spin model with infinite-range interactions, whose Hamiltonian is

$$H_0 = -N\left(\frac{1}{N}\sum_{i=1}^{N}\sigma_i^z\right)^p, \tag{1}$$

where $\sigma_i^z$ is the z component of the Pauli matrix at site i (= 1, 2, ⋯, N), and p (≥ 3) is an integer. The ground state of this Ising Hamiltonian is doubly degenerate for p even, $\sigma_i^z = 1\ (\forall i)$ and $\sigma_i^z = -1\ (\forall i)$, and non-degenerate for p odd, $\sigma_i^z = 1\ (\forall i)$. This Hamiltonian (the cost function for combinatorial optimization) is a simple polynomial of the order parameter $m = \frac{1}{N}\sum_i \sigma_i^z$, and the steepest descent method readily finds the ground state. In this sense, the problem is easily solved classically. Our focus, however, is on how quantum annealing compares with its classical counterpart, simulated annealing, according to the criterion of “limited quantum speedup” (Rønnow et al., 2014) as well as on how a non-stoquastic Hamiltonian compares with its stoquastic counterpart.
In the present section, we consider the case with p ≥ 3 since the p = 2 model in a transverse field has a second-order quantum phase transition and is therefore easy to solve already in the stoquastic case. We restrict ourselves to the subspace of m ≥ 0 without losing generality.
2.1. Simulated Annealing
Simulated annealing is a classical heuristic to sample the Boltzmann distribution with the temperature decreasing from a very high value to zero (Kirkpatrick et al., 1983). To understand the process theoretically for the present problem, it is convenient to see how the free-energy landscape behaves as a function of the order parameter at each given temperature. To this end, we write the partition function as

$$\begin{aligned}
Z &= \mathrm{Tr}\, e^{-\beta H_0} \\
  &= \mathrm{Tr}\int dm\, \delta\!\left(Nm - \sum_i \sigma_i^z\right) e^{\beta N m^p} \\
  &= \int dm\, d\tilde{m}\, \exp\!\left[N\left(\beta m^p - m\tilde{m} + \ln 2\cosh\tilde{m}\right)\right],
\end{aligned} \tag{2}$$

where β = 1/T is the inverse temperature with the Boltzmann constant chosen to be 1 for simplicity. We have dropped a trivial prefactor 1/2π in the above expression. The exponent in the integrand in the last line is, up to the factor −Nβ, the generalized free energy for given values of m and $\tilde{m}$. In the thermodynamic limit N → ∞, the integral is evaluated by the saddle point method. The extremum condition of the exponent with respect to m is $\tilde{m} = \beta p m^{p-1}$. By eliminating $\tilde{m}$ using this equation, we obtain the Landau-type free energy per site as a function of m,

$$f(m) = (p-1)m^p - T\ln\left[2\cosh\left(\beta p m^{p-1}\right)\right]. \tag{3}$$
As shown in Figure 1, there exists a jump in the minimum as the temperature changes and hence the transition is of first order. In simulated annealing, one should drive the system from a disordered state (m = 0) to an ordered state m > 0 over the free-energy barrier as the temperature is decreased. This takes an exponentially long time since the probability to go over the peak of the free-energy barrier is exponentially small, proportional to exp(−NβΔf), where Δf is the height of the barrier of the free energy per site at the transition temperature. Therefore, the present simple problem is hard to solve by simulated annealing. The existence of a first-order phase transition is the origin of the difficulty.
Figure 1. Free energy per site f (m) as a function of the order parameter m for p = 4 at four values of the temperature, T = 0.9, 1.5, 1.8, and 2.2, from top to bottom. The minimum jumps between m = 0 and m ≈ 1 as the temperature changes.
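The jump of the minimum can be reproduced with a few lines of code. This is a minimal sketch that assumes the Landau-type free energy has the standard mean-field form f(m) = (p − 1)m^p − T ln[2 cosh(p m^(p−1)/T)]; the function names and the grid resolution are illustrative choices.

```python
import numpy as np


def free_energy(m, temperature, p=4):
    """Assumed mean-field free energy per site of the classical p-spin model."""
    x = p * m ** (p - 1) / temperature
    # log(2 cosh x) evaluated stably as log(e^x + e^-x).
    return (p - 1) * m ** p - temperature * np.logaddexp(x, -x)


def global_minimum(temperature, p=4, points=2001):
    """Locate the global minimum of f(m) on a uniform grid over 0 <= m <= 1."""
    m = np.linspace(0.0, 1.0, points)
    return m[np.argmin(free_energy(m, temperature, p))]


for t in (0.9, 1.5, 1.8, 2.2):
    print(f"T = {t}: minimizing m = {global_minimum(t):.3f}")
```

Within this assumed form the minimizing m does not move continuously between the ordered and disordered values, which is the signature of the first-order transition discussed above.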
2.2. Quantum Annealing with Stoquastic Hamiltonian
What will happen if we apply quantum annealing to the same problem? The conventional choice of the transverse-field Ising model for quantum annealing has the Hamiltonian

$$H(s) = sH_0 - (1-s)\sum_{i=1}^{N}\sigma_i^x, \tag{4}$$

where s is the time-dependent parameter to control the dynamical evolution of the system running from the initial value s(t = 0) = 0 to the final s(t = τ) = 1 with τ being the computation time. A typical example is s = t/τ. Equation (4) is a stoquastic Hamiltonian.

For our problem Hamiltonian of equation (1), equation (4) reads

$$H(s) = -sN\left(\frac{1}{N}\sum_{i=1}^{N}\sigma_i^z\right)^p - (1-s)\sum_{i=1}^{N}\sigma_i^x, \tag{5}$$

which can be expressed in terms of the normalized total spin operators,

$$H(s) = -sN(m^z)^p - (1-s)N m^x, \tag{6}$$

$$m^\alpha = \frac{1}{N}\sum_{i=1}^{N}\sigma_i^\alpha \quad (\alpha = x, y, z). \tag{7}$$

The normalized total spin operators satisfy the commutation relation,

$$[m^x, m^y] = \frac{2i}{N}m^z. \tag{8}$$

Since the norm of those operators, defined as the largest absolute eigenvalue, is unity, the right-hand side of equation (8) vanishes in the thermodynamic limit N → ∞. The same is true for other commutators of $m^x$, $m^y$, and $m^z$. It is also useful to remember that the squared total spin operator commutes with the Hamiltonian, so that the magnitude of the total spin is conserved in the present p-spin model. We are interested in the subspace with the largest value of the total spin, since we start quantum annealing in this subspace. For these reasons, we may regard the operators $m^x$, $m^y$, and $m^z$ as x, y, and z components of a classical vector m of unit length written as

$$\mathbf{m} = (m^x, m^y, m^z) = (\cos\theta,\ \sin\theta\sin\phi,\ \sin\theta\cos\phi) \tag{9}$$

in polar coordinates. Equation (5) then reduces to a classical energy, whose value per site is

$$e = -s\sin^p\theta\cos^p\phi - (1-s)\cos\theta. \tag{10}$$

To minimize this energy, the angle ϕ is 0 if p is odd, and 0 or π for p even. We may thus drop cos ϕ and write

$$e = -s\sin^p\theta - (1-s)\cos\theta. \tag{11}$$
As one sees in Figure 2 for the case of p = 5, the minimum jumps from θ = 0 to θ > 0 at some s. The system has a first-order phase transition, and the energy gap between the ground state and the first excited state is expected to decrease exponentially as a function of the system size, which has indeed been shown explicitly for the present problem both analytically and numerically (Jörg et al., 2010). This is a difficult situation for quantum annealing in its adiabatic formulation, adiabatic quantum computation (Farhi et al., 2000, 2001), because one must spend an exponentially long computation time τ to reach the correct ground state of the target Hamiltonian $H_0$. Jörg et al. (2010) thus legitimately wrote that this is “a problem that quantum annealing cannot solve.”
Figure 2. Energy per site as a function of the polar angle θ for p = 5 at s = 0.2 (dashed), s = 0.47 (dotted), and s = 0.6 (full line). The minimum jumps between θ = 0 and θ > 0.
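The jump of the minimizing angle can be located numerically. The sketch below assumes the classical energy per site e(θ) = −s sin^p θ − (1 − s) cos θ with the polar axis along x, so that θ = 0 is the paramagnetic state; the function names and the grid are illustrative.

```python
import numpy as np


def energy(theta, s, p=5):
    """Assumed classical energy per site: e = -s sin^p(theta) - (1 - s) cos(theta)."""
    return -s * np.sin(theta) ** p - (1.0 - s) * np.cos(theta)


def minimizing_angle(s, p=5, points=20001):
    """Grid search for the polar angle that minimizes the energy on [0, pi/2]."""
    theta = np.linspace(0.0, np.pi / 2, points)
    return theta[np.argmin(energy(theta, s, p))]


for s in (0.2, 0.46, 0.48, 0.6):
    print(f"s = {s}: minimizing theta = {minimizing_angle(s):.3f}")
```

Under these assumptions the minimizing angle stays at θ = 0 up to s just below 0.47 and then jumps to a value above 1 rad, consistent with the dotted curve of Figure 2.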
One may wonder if the above analysis using a classical vector would properly describe the essential features of quantum annealing under the Hamiltonian equation (5). The answer is positive as far as the properties of phase transitions are concerned: Jörg et al. (2010) used full quantum statistical–mechanical tools to reach the same conclusion as above. Quantum effects should be carefully taken into account if one wishes to fully understand the behavior of the energy gap for finite-size systems, as was done by Jörg et al. (2010), and to describe more subtle properties of the system around the phase transition and within the ferromagnetic phase (Susa et al., 2016). However, the classical analysis is sufficient to predict the type of phase transitions in the thermodynamic limit. We take advantage of this observation in the next section for a non-stoquastic Hamiltonian.
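For finite systems, the gap can be examined by exact diagonalization in the subspace of maximal total spin, whose dimension is only N + 1. The following is a minimal sketch, not the method of Jörg et al. (2010); it assumes the Hamiltonian H(s) = −sN(m^z)^p − (1 − s)N m^x with m^α = 2S^α/N and odd p (so that the final ground state is non-degenerate), and all function names are illustrative.

```python
import numpy as np


def hamiltonian(n, s, p=5):
    """H(s) = -s N (m^z)^p - (1 - s) N m^x restricted to total spin J = N/2."""
    j = n / 2.0
    mz = np.arange(-j, j + 1.0)                  # S^z eigenvalues M = -J, ..., J
    # Matrix elements <M+1| S^+ |M> = sqrt(J(J+1) - M(M+1)).
    ladder = np.sqrt(j * (j + 1.0) - mz[:-1] * (mz[:-1] + 1.0))
    sx = 0.5 * (np.diag(ladder, 1) + np.diag(ladder, -1))
    # m^z = 2 S^z / N = S^z / J, and N m^x = 2 S^x.
    return -s * n * np.diag((mz / j) ** p) - (1.0 - s) * 2.0 * sx


def gap(n, s, p=5):
    """Energy difference between the two lowest levels in the J = N/2 sector."""
    levels = np.linalg.eigvalsh(hamiltonian(n, s, p))
    return levels[1] - levels[0]


def minimum_gap(n, p=5, points=401):
    """Smallest gap found on a uniform grid of s in [0, 1]."""
    return min(gap(n, s, p) for s in np.linspace(0.0, 1.0, points))


for n in (10, 20, 40):
    print(f"N = {n}: minimum gap on the grid = {minimum_gap(n):.4f}")
```

The grid in s only gives an upper estimate of the true minimum gap, so a finer search near the transition would be needed for a quantitative scaling analysis.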
It is also worth noticing that the performance of quantum annealing is comparable to that of simulated annealing discussed in the previous section, both of which should spend an exponentially long time to reach the ground state. In this sense, there is no ‘limited quantum speedup’ in the present case, according to the classification of Rønnow et al. (2014), although there may exist quantitative differences such as the difference in the coefficients of the exponent.
2.3. Quantum Annealing with Non-Stoquastic Hamiltonian
We next study the non-stoquastic case with the Hamiltonian

$$H(s,\lambda) = -s\lambda N\left(\frac{1}{N}\sum_{i=1}^{N}\sigma_i^z\right)^p + s(1-\lambda)N\left(\frac{1}{N}\sum_{i=1}^{N}\sigma_i^x\right)^k - (1-s)\sum_{i=1}^{N}\sigma_i^x, \tag{12}$$

where λ ∈ [0, 1] is a parameter to control the strength of the additional term, the second term on the right-hand side (to be called the antiferromagnetic multiple-X term), and k (≥ 2) is an integer. The parameter λ will later be chosen to be a function of s. Notice that the coefficient of the second term s(1 − λ) is positive so that this term makes the Hamiltonian non-stoquastic. For λ = 1, the above Hamiltonian reduces to the stoquastic equation (5).

Quantum annealing starts at s = 0 (λ arbitrary), in which case the Hamiltonian is the simple transverse field,

$$H(0,\lambda) = -\sum_{i=1}^{N}\sigma_i^x, \tag{13}$$

just as in the stoquastic case of the previous section. Then one increases s toward 1 and, at the same time, λ is increased toward 1 in an appropriate way as will be described later. The goal is at s = λ = 1, where the final Hamiltonian is the target cost function,

$$H(1,1) = -N\left(\frac{1}{N}\sum_{i=1}^{N}\sigma_i^z\right)^p. \tag{14}$$

The analysis proceeds as before by the replacement of the normalized total spin operator with a classical unit vector. The energy per site is

$$e = -s\lambda\sin^p\theta + s(1-\lambda)\cos^k\theta - (1-s)\cos\theta. \tag{15}$$

Two typical examples of the behavior of this energy at p = 5 and k = 2 are shown in Figure 3 for λ = 0.95 and Figure 4 for λ = 0.1. The former is essentially the same as the stoquastic case (λ = 1) of Figure 2 with a first-order phase transition at s = 0.47. The latter is drastically different with a second-order phase transition at s = 0.357. This second-order transition point can be understood by a Landau-type expansion of the energy near θ = 0,

$$e = s(1-\lambda) - (1-s) + \frac{1}{2}\left[1 - (3-2\lambda)s\right]\theta^2 + O(\theta^4). \tag{16}$$
A second-order phase transition takes place when the coefficient of the quadratic term vanishes according to the Landau theory (Nishimori and Ortiz, 2011), s = 1/(3 − 2λ), which gives s = 0.357 for λ = 0.1. This second-order transition is masked by a first-order transition if the latter happens at a smaller s, which is indeed the case for λ = 0.95.
Figure 3. Energy per site for the non-stoquastic Hamiltonian with p = 5, k = 2, and λ = 0.95 at s = 0.2 (dashed), 0.47 (dotted), and 0.6 (full line). A first-order phase transition happens at s = 0.47.
Figure 4. Energy per site for the non-stoquastic Hamiltonian with p = 5, k = 2, and λ = 0.1 at s = 0.1 (dashed), 0.357 (dotted), and 0.5 (full line). The phase transition at s = 0.357 is of second order.
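The contrast between Figures 3 and 4 can be checked numerically. This minimal sketch assumes the classical energy per site e(θ) = −sλ sin^p θ + s(1 − λ) cos^k θ − (1 − s) cos θ; for k = 2 its θ² coefficient is (1/2)[1 − (3 − 2λ)s], which vanishes at s = 1/(3 − 2λ). The function names and sample points are illustrative.

```python
import numpy as np


def energy(theta, s, lam, p=5, k=2):
    """Assumed classical energy per site of the non-stoquastic Hamiltonian."""
    return (-s * lam * np.sin(theta) ** p
            + s * (1.0 - lam) * np.cos(theta) ** k
            - (1.0 - s) * np.cos(theta))


def minimizing_angle(s, lam, p=5, k=2, points=20001):
    """Grid search for the minimizing polar angle on [0, pi]."""
    theta = np.linspace(0.0, np.pi, points)
    return theta[np.argmin(energy(theta, s, lam, p, k))]


def landau_critical_s(lam):
    """Value of s where the theta^2 coefficient vanishes (valid for k = 2)."""
    return 1.0 / (3.0 - 2.0 * lam)


print(f"Landau estimate for lambda = 0.1: s = {landau_critical_s(0.1):.3f}")
for lam, s_values in ((0.1, (0.35, 0.37, 0.40)), (0.95, (0.45, 0.50))):
    for s in s_values:
        print(f"lambda = {lam}, s = {s}: theta_min = {minimizing_angle(s, lam):.3f}")
```

For λ = 0.1 the minimizing angle grows continuously from zero just above the Landau estimate, whereas for λ = 0.95 it jumps to a large value, as expected for second-order and first-order transitions, respectively.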
Comparison of Figures 3 and 4 suggests that the antiferromagnetic multiple-X term in the Hamiltonian with a large amplitude (λ close to 0) would change a first-order phase transition (for large λ) to second order (small λ), thus reducing the computation time drastically from exponential to polynomial as a function of the system size. Exhaustive studies have been carried out along this line (Seki and Nishimori, 2012; Seoane and Nishimori, 2012). The results are positive for p ≥ 4. Figures 5 and 6 show typical examples of the λ–s phase diagrams. Figure 5 is for k = 2, i.e., with XX interactions. For a fixed p, the first-order transition at λ = 1 (stoquastic Hamiltonian) extends down to a smaller value of λ around the middle of the phase diagram and then is replaced by a line of second-order transitions that continues to λ = 0 and s = 0.33. The first-order transition persists even after the second-order transition line branches out, and there exists a line of first-order transitions within the ferromagnetic phase. We have denoted those two ferromagnetic phases as F and F′, though there is no qualitative difference between F and F′. In this way, it has been established that a first-order phase transition in the stoquastic Hamiltonian at λ = 1 has been reduced to second order by the effects of the antiferromagnetic multiple-X term with a relatively large amplitude, λ close to 0. Quantum annealing starts at s = 0 and λ arbitrary (anywhere on the line s = 0 at the bottom of the phase diagram) and ends at s = λ = 1 (the upper right corner of the phase diagram), and it is possible to choose a path connecting those points, which avoids a first-order transition (i.e., to go only across the boundary between QP and F′ phases). In other words, the antiferromagnetic multiple-X term, which represents strong quantum effects not to be simulated classically in the standard way, exponentially enhances the efficiency of quantum annealing.
Figure 5. λ–s phase diagram of the non-stoquastic Hamiltonian with k = 2 (antiferromagnetic XX interactions). The line separating QP (quantum paramagnetic) and F′ (ferromagnetic) phases represents second-order phase transitions, and all other lines are for first-order transitions. There is a line of first-order phase transitions within the ferromagnetic phase, and thus labels F and F′ are given to distinguish the two ferromagnetic phases although they have no qualitative difference. The axis λ = 1 on the right of the panel corresponds to the stoquastic case.
Figure 6. λ–s phase diagram of the non-stoquastic Hamiltonian with k = 5 (antiferromagnetic XXXXX interactions). The boundary between QP+ (quantum paramagnetic) and F′ (ferromagnetic) phases is for second-order phase transitions, and all other boundaries represent first-order transitions. A new phase QP− exists for k odd, where spins point to the −x direction whereas they point to the +x direction in the QP+ phase. The two ferromagnetic phases F and F′ are clearly separated for p = 21 but not for smaller p.
The case of p = 3 turns out to be an exception in that the first-order transition line persists down to λ = 0. This fact may be interpreted in terms of the Landau theory of phase transitions that there would appear a cubic term in the Landau free energy for the cubic Hamiltonian with p = 3, which strongly enhances the possibility of first-order transition.
The case of k = 5 in Figure 6 is similar with a few minor differences. There exist two paramagnetic phases, denoted as QP+ and QP−. The former has θ = 0 with magnetization vector m = (1, 0, 0) and the latter θ = π with m = (−1, 0, 0). The QP− phase appears at the top left corner of the phase diagram, where the antiferromagnetic multiple-X term $s(1-\lambda)(m^x)^k$ dominates. The transition is of second order only between the QP+ and F′ phases. The first-order transition line within the ferromagnetic phase between F and F′ extends toward QP− at the upper left part of the phase diagram. For larger p, this line reaches the phase boundary of the QP− phase as seen for p = 21 in Figure 6. The two ferromagnetic phases, F and F′, are then completely separated by a line of first-order transitions. It is therefore concluded for k = 5 that a proper choice of annealing path makes it possible to reduce the first-order phase transition for the stoquastic case (λ = 1) to second order (smaller λ) as long as p is not too large. These features are shared by other values of k: even k behaves as in Figure 5 and odd k as in Figure 6.
Although the above analyses use only classical variables, it has been shown that quantum statistical–mechanical computations reproduce those phase boundaries quantitatively very faithfully in the thermodynamic limit (Seki and Nishimori, 2012; Seoane and Nishimori, 2012). It has also been confirmed numerically that the energy gap as a function of the system size closes exponentially at first-order phase transitions and polynomially at second-order transitions (Seki and Nishimori, 2012). We therefore conclude with confidence that the antiferromagnetic multiple-X term in the Hamiltonian has the capacity to reduce the computational complexity drastically from exponential to polynomial for the present infinite-range ferromagnetic p-spin model.
2.4. Quantum Annealing under the Influence of the Environment
It is important to study how the environment affects the behavior of the system. One of the standard models to describe the interactions of the system with its environment is the following Hamiltonian (Leggett et al., 1987), in which the spin degrees of freedom are coupled linearly with harmonic oscillators representing the environment,
Here l runs over all possible modes of harmonic oscillators, $g_l$ denotes the coupling strength, and $\omega_l$ is the frequency of mode l. In this model, Hamiltonian equation (17), the interactions with the environment are assumed to apply uniformly over all sites i. Such a situation may exist when the correlation length of the environment is much larger than the linear size of the system (Breuer and Petruccione, 2002).
Following the previous analysis (see also Sinha and Dattagupta, 2013), we replace $\sum_i \sigma_i^\alpha$ by the classical variable $N m^\alpha$ and rewrite equation (17) as
Let us define the spectral density of couplings as
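and assume super-, normal-, or sub-Ohmic dissipation with cutoff frequency $\omega_c$ (Leggett et al., 1987)
Super-, normal-, and sub-Ohmic cases correspond, respectively, to s > 1, s = 1, and s < 1, where s here denotes the exponent of the spectral density and is not to be confused with the annealing parameter. We can then write the coefficient Λ in equation (19) as
where $\Gamma(s)$ is the Gamma function.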
Equation (18) reveals that the environment and the spin system are effectively decoupled and can be treated independently. Since we are interested in the ground state, the environment is simply in the vacuum. The effect of the environment on the spin system is taken into account through the term $-\Lambda N(m^\alpha)^2$. If we consider for simplicity the case of k = 2, the environment coupled with the x component of the system (α = x) effectively reduces the coefficient of the antiferromagnetic multiple-X term from s(1 − λ)N to s(1 − λ)N − ΛN. This is detrimental to the performance of quantum annealing for the reason discussed in the previous section. On the other hand, if α = z, the normalization of the vector m implies that the final coupling term in equation (18) makes a positive contribution to the antiferromagnetic multiple-X term. Therefore, the two types of couplings with the environment (α = x or z) give opposite contributions. More detailed analyses will be given in a forthcoming paper.
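The origin of the term −ΛN(m^α)² and of the two opposite effects can be sketched by completing the square for each oscillator mode. The sketch below is written under assumptions: the bosonic operators $a_l$, the 1/√N normalization of the coupling (which becomes √N after the classical replacement), and the resulting expression for Λ are illustrative choices, and the conventions of equations (17)–(19) may differ by constant factors.

```latex
% Assumed system-bath Hamiltonian after the replacement \sum_i \sigma_i^\alpha \to N m^\alpha
\begin{align*}
H &= H(s,\lambda) + \sum_l \omega_l\, a_l^\dagger a_l
     + \sqrt{N}\, m^\alpha \sum_l g_l \left(a_l + a_l^\dagger\right) \\
  &= H(s,\lambda)
     + \sum_l \omega_l
       \left(a_l^\dagger + \frac{\sqrt{N}\, g_l m^\alpha}{\omega_l}\right)
       \left(a_l + \frac{\sqrt{N}\, g_l m^\alpha}{\omega_l}\right)
     - \Lambda N (m^\alpha)^2,
  \qquad \Lambda = \sum_l \frac{g_l^2}{\omega_l}. \\[6pt]
% k = 2, coupling through the x component: the multiple-X coefficient is reduced
\alpha = x:&\quad s(1-\lambda) N (m^x)^2 - \Lambda N (m^x)^2
            = \left[s(1-\lambda) - \Lambda\right] N (m^x)^2, \\
% coupling through the z component, using (m^x)^2 + (m^z)^2 = 1 with m^y = 0
\alpha = z:&\quad -\Lambda N (m^z)^2 = -\Lambda N + \Lambda N (m^x)^2
            \;\Rightarrow\; \left[s(1-\lambda) + \Lambda\right] N (m^x)^2 + \text{const}.
\end{align*}
```

In the ground state the shifted oscillators are in their vacuum, so only the last term of the second line survives, which is why the bath acts as a quadratic correction to the spin Hamiltonian.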
3. Hopfield Model
One may wonder if the above results for the p-spin model would apply to more difficult problems. To answer this question, we have studied the Hopfield model (Seki and Nishimori, 2015), which has randomness in interactions, and the ground state is non-trivial (Amit et al., 1985a,b, 1987; Nishimori and Nonomura, 1996). In the present section, we compare quantum annealing strategies with and without an antiferromagnetic multiple-X term in the Hamiltonian, i.e., non-stoquastic and stoquastic Hamiltonians, for the Hopfield model.
3.1. Finite Patterns Embedded
The Hopfield model with p-body interactions has the Hamiltonian

$$H_0 = -N\sum_{\mu=1}^{r}\left(\frac{1}{N}\sum_{i=1}^{N}\xi_i^\mu\sigma_i^z\right)^p, \tag{23}$$

with each $\xi_i^\mu$ (representing the state of the ith site for the μth embedded pattern) being ±1 randomly with equal probability. The total non-stoquastic Hamiltonian has the same form as before,

$$H(s,\lambda) = s\lambda H_0 + s(1-\lambda)N(m^x)^2 - (1-s)N m^x, \tag{24}$$

$$m^x = \frac{1}{N}\sum_{i=1}^{N}\sigma_i^x. \tag{25}$$

The antiferromagnetic multiple-X term has been chosen to be quadratic (k = 2 in the notation of the previous section) for simplicity. We first discuss the case with the number of embedded patterns r finite.

It is impossible to apply the simple classical method used in the p-spin ferromagnetic model because of the complexity of interactions. Quantum statistical–mechanical techniques have been exploited in Seki and Nishimori (2015), by which the quantum system is reduced to a corresponding classical Ising model by the Suzuki–Trotter decomposition. We refer the reader to Seki and Nishimori (2015) for details and write the resulting energy per site as a function of the order parameters,

$$e = s\lambda(p-1)\sum_{\mu=1}^{r}(m_\mu)^p - s(1-\lambda)(m^x)^2 - \left[\sqrt{\left(s\lambda p\sum_{\mu=1}^{r}(m_\mu)^{p-1}\xi^\mu\right)^2 + \left(1 - s - 2s(1-\lambda)m^x\right)^2}\,\right], \tag{26}$$

where the square brackets stand for the average over the random variables $\xi^\mu$. Notice that the index i of $\xi_i^\mu$ has disappeared in the above equation due to the infinite-range (mean-field) characteristics of the Hopfield model.

The parameter $m^x$ has the same meaning as in the p-spin ferromagnetic model, the x component of the averaged spin operator. The other parameter $m_\mu$ represents the overlap (or similarity) of the z component of the Pauli matrix with the μth embedded pattern,

$$m_\mu = \frac{1}{N}\sum_{i=1}^{N}\xi_i^\mu\langle\sigma_i^z\rangle, \tag{27}$$

where the angular brackets denote the average by the ground-state wave function.

The energy of equation (26) is to be minimized with respect to the order parameters $m^x$ and $m_\mu$. There exist a large number of candidate states that are the solutions to the self-consistent equation obtained as the vanishing condition of the derivatives of the energy with respect to the order parameters. It is known in the classical Hopfield model (s = λ = 1) at finite temperatures that the simplest non-trivial solution, $m_1 > 0$ and $m_\mu = 0$ (μ ≥ 2), has the lowest free energy and is realized at low temperatures, in addition to the paramagnetic solution (all $m_\mu = 0$) valid at high temperatures (Amit et al., 1985b). Almost the same turns out to be the case in the quantum Hopfield model (Nishimori and Nonomura, 1996; Seki and Nishimori, 2015), the trivial differences being that the energy, not the free energy, is to be minimized and that the quantum paramagnetic state has $m^x > 0$.

When only $m_1$ is finite with all other $m_\mu$'s being zero, the energy equation (26) turns out to have exactly the same form as the corresponding energy of the p-spin ferromagnetic model analyzed by the quantum statistical–mechanical methods. Thus the analyses of the previous section apply directly. This is the same situation as in the classical Hopfield model (Amit et al., 1985b). We may then conclude that the antiferromagnetic multiple-X term helps the Hopfield model avoid first-order phase transitions in the process of quantum annealing exactly in the same way as in the p-spin ferromagnetic system. It has thus been established that the antiferromagnetic multiple-X term exponentially improves the efficiency of quantum annealing even in the presence of randomness in interactions.
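The equivalence with the p-spin model in the single-pattern sector can be tested numerically. This sketch assumes that, with one non-zero overlap m and k = 2, the mean-field energy takes the form e = sλ(p − 1)m^p − s(1 − λ)(m^x)² − sqrt[(sλp m^(p−1))² + (1 − s − 2s(1 − λ)m^x)²]. It then checks that the minimizer of the classical-vector energy, m = sin θ and m^x = cos θ, satisfies the stationarity conditions of this expression and gives the same energy. All function names are hypothetical.

```python
import numpy as np


def classical_energy(theta, s, lam, p=5):
    """Classical-vector energy with k = 2, using m = sin(theta), m^x = cos(theta)."""
    return (-s * lam * np.sin(theta) ** p
            + s * (1.0 - lam) * np.cos(theta) ** 2
            - (1.0 - s) * np.cos(theta))


def mean_field_energy(m, mx, s, lam, p=5):
    """Assumed single-pattern mean-field energy and its two effective fields."""
    a = s * lam * p * m ** (p - 1)              # effective longitudinal field
    b = 1.0 - s - 2.0 * s * (1.0 - lam) * mx    # effective transverse field
    energy = s * lam * (p - 1) * m ** p - s * (1.0 - lam) * mx ** 2 - np.hypot(a, b)
    return energy, a, b


def compare(s, lam, p=5, points=400001):
    """Minimize the classical energy, then test the mean-field stationarity conditions."""
    theta = np.linspace(0.0, np.pi / 2, points)
    best = theta[np.argmin(classical_energy(theta, s, lam, p))]
    m, mx = np.sin(best), np.cos(best)
    e_mf, a, b = mean_field_energy(m, mx, s, lam, p)
    norm = np.hypot(a, b)
    # Stationarity of the assumed mean-field energy: m = a / norm and m^x = b / norm.
    residual = max(abs(m - a / norm), abs(mx - b / norm))
    return classical_energy(best, s, lam, p), e_mf, residual


for s, lam in ((0.3, 0.5), (0.6, 1.0), (0.8, 0.5)):
    e_cl, e_mf, residual = compare(s, lam)
    print(f"s = {s}, lambda = {lam}: e_cl = {e_cl:.6f}, e_mf = {e_mf:.6f}, residual = {residual:.1e}")
```

The check is restricted to θ in [0, π/2], which covers the paramagnetic and ferromagnetic minima relevant for k = 2.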
3.2. Many Patterns Embedded (I)
When r, the number of patterns embedded, increases with the system size N, the situation becomes dependent on p. We discuss the case of p = 2 in this section.
When p = 2 and r is supposed to increase with N, r turns out to be proportional to N, r = αN, in order for the free energy to be extensive. Under this condition, the free energy for arbitrary temperature can be evaluated using the standard techniques from quantum statistical mechanics, the Suzuki–Trotter decomposition, the replica method under replica-symmetric ansatz, and the static approximation to drop the Trotter-number dependence of order parameters (Nishimori and Nonomura, 1996; Seki and Nishimori, 2015). The resulting free energy per site in the zero-temperature limit (the ground-state energy) is
with . The parameters m, q, and mx denote the overlap, the spin-glass order parameter, and the magnetization along the x axis, respectively,
It has been assumed that the solution with only one of the embedded patterns being retrieved is more stable than other possibilities, as in the case of the classical Hopfield model (Amit et al., 1985a,b, 1987).
Extremization conditions of e with respect to m, q, and $m^x$ and comparison of energy values among different solutions lead to the phase diagram of Figure 7. There exist three phases, quantum paramagnetic (QP) (m = q = 0), spin glass (SG) (m = 0, q > 0), and retrieval (R) (m > 0, q > 0). In the stoquastic case with λ = 1, it is known that the phase transition between the spin glass and retrieval phases is of first order (Nishimori and Nonomura, 1996). As shown in Figure 7, this first-order transition persists all the way down to λ = 0 even when an antiferromagnetic multiple-X term is introduced. It is impossible to reach the final state s = λ = 1 through a path that avoids a first-order transition starting from the initial state with s = 0. The spin-glass phase covering the middle of the phase diagram causes an essential difficulty in the present case.
Figure 7. Phase diagram of the Hopfield model with p = 2 and r = 0.04N. The boundary between the quantum paramagnetic phase (QP) and the spin-glass phase (SG) is for second-order transition, and the boundary between the SG and the retrieval phase (R) is of first order.
3.3. Many Numbers Embedded (II)
We next discuss the case of p ≥ 3. Again, the standard quantum statistical–mechanical method can be applied to the analysis of the model with p ≥ 3 (Seki and Nishimori, 2015). Since the computations are straightforward but quite lengthy, we refer the reader to Seki and Nishimori (2015) for details.3 The result for the ground-state energy as a function of order parameters is
The extremization condition of the energy leads to a set of self-consistent equations for the order parameters, the solutions to which indicate possible phases at each point in the λ–s phase diagram. It turns out that the spin-glass phase always has a higher energy than other phases, the retrieval phase and the paramagnetic phase, and is not realized as a stable phase for p ≥ 3. The transition between the retrieval and paramagnetic phases for the stoquastic model (λ = 1) is of first order. The introduction of the antiferromagnetic multiple-X term (λ < 1) leads to replacement of this first-order transition by a second-order transition below a threshold value of λ provided that p > 3. An example of p = 4 is depicted in Figure 8. For p = 3, the first-order transition continues to exist up to λ = 0 as was the case without randomness.
Figure 8. Phase diagram of the Hopfield model with p = 4 and r = 0.04N^3. The first-order transition in red is replaced by second-order transitions in blue for small λ.
We have therefore established that the antiferromagnetic multiple-X term has the effect of drastically enhancing the computational efficiency even in some cases with randomness.
We have shown that antiferromagnetic multiple-X terms reduce a first-order quantum phase transition to second order in the infinite-range ferromagnetic p-spin model as well as in the quantum Hopfield model. This means that the efficiency of quantum annealing in its formulation as adiabatic quantum computation is exponentially enhanced by the antiferromagnetic multiple-X term, which renders the Hamiltonian non-stoquastic. Although not shown explicitly in the present article, it has been confirmed numerically for the ferromagnetic p-spin model in Seki and Nishimori (2012) that the minimum gap at the phase transition indeed closes exponentially or polynomially according to the order of the phase transition. It is reasonable to expect that the same holds for the Hopfield model. Since a non-stoquastic Hamiltonian cannot be simulated efficiently on classical computers by the standard quantum Monte Carlo method, it may be interpreted as representing strong quantum effects. We may therefore conclude that the exponential enhancement of the efficiency of quantum annealing is achieved in the present models by strong quantum effects. These are the first cases, as far as the authors are aware, where such a conclusion has been drawn by analytical methods. Note in this connection that numerical evidence of a related nature was presented in Farhi et al. (2002), Crosson et al. (2014), and Hormozi et al. (2016). Our conclusion does not necessarily exclude the existence of other efficient numerical methods to study a given non-stoquastic Hamiltonian including, possibly, the spin-vector dynamics (Owerre and Paranjape, 2014; Smolin and Smith, 2014; Albash et al., 2015; Muthukrishnan et al., 2016) or even the simple steepest descent method.
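As a rough illustration of how such a gap computation can be set up, the following sketch diagonalizes a ferromagnetic p-spin Hamiltonian with an antiferromagnetic XX term. It is not the code used in the cited work. The Hamiltonian form and normalization, H(s, λ) = s[−λN(m_z)^p + (1 − λ)N(m_x)^2] − (1 − s)N m_x with m_{z,x} = (1/N)Σσ^{z,x}, are assumptions of this sketch, as is the restriction to the maximal total-spin sector S = N/2 (dimension N + 1). The function names and the choice p = 5 are hypothetical.

```python
import numpy as np

def spin_ops(N):
    """Spin-S operators S_z, S_x for S = N/2 in the basis m = S, S-1, ..., -S."""
    S = N / 2.0
    m = S - np.arange(N + 1)
    Sz = np.diag(m)
    # <m+1| S_+ |m> = sqrt(S(S+1) - m(m+1)) for m = S-1, ..., -S
    ladder = np.sqrt(S * (S + 1) - m[1:] * (m[1:] + 1))
    Sp = np.diag(ladder, k=1)
    Sx = 0.5 * (Sp + Sp.T)
    return Sz, Sx

def hamiltonian(N, p, s, lam):
    """Assumed H(s, lam) restricted to the S = N/2 sector (illustrative form)."""
    Sz, Sx = spin_ops(N)
    mz = 2.0 * Sz / N                  # (1/N) sum_i sigma_i^z
    mx = 2.0 * Sx / N                  # (1/N) sum_i sigma_i^x
    H0 = -N * np.linalg.matrix_power(mz, p)   # ferromagnetic p-spin term
    Vaff = N * (mx @ mx)                      # antiferromagnetic XX term
    Vtf = -N * mx                             # transverse field
    return s * (lam * H0 + (1.0 - lam) * Vaff) + (1.0 - s) * Vtf

def gap(N, p, s, lam):
    """Energy difference between the two lowest levels in the sector."""
    E = np.linalg.eigvalsh(hamiltonian(N, p, s, lam))  # ascending order
    return E[1] - E[0]

def min_gap(N, p, lam, n_grid=201):
    """Smallest gap on a uniform grid in s; a coarse grid can miss the true minimum."""
    return min(gap(N, p, s, lam) for s in np.linspace(0.0, 1.0, n_grid))

if __name__ == "__main__":
    p = 5  # odd p avoids the trivial ground-state degeneracy at s = 1
    for lam in (1.0, 0.2):
        for N in (10, 20, 40):
            print(f"lambda={lam}, N={N}, min gap ~ {min_gap(N, p, lam):.3e}")
```

Comparing how the printed minimum gap scales with N at λ = 1 and at a smaller λ gives a qualitative feel for the exponential versus polynomial closing discussed above. Whether a given fixed-λ line avoids every first-order transition depends on p and λ, so the values chosen here are only illustrative.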
We have also shown for the ferromagnetic p-spin model that certain types of system–environment couplings either enhance or reduce the effect of the antiferromagnetic multiple-X term depending on the component of spin operators appearing in the coupling term. The argument leading to this conclusion crucially depends on the special property of the ferromagnetic p-spin model that the spin Hamiltonian commutes with the total spin operator. It is an interesting question whether or not similar behavior can be observed in other cases.
It should be remembered that the antiferromagnetic multiple-X terms discussed in the present paper are far from a universal means of enhancing the efficiency. Indeed, the first-order transitions in the ferromagnet with p = 3 and the Hopfield model with p = 2 have been shown not to be relaxed to second order. It is also known that the first-order transition in the p-body interacting random-field Ising model persists in the presence of antiferromagnetic multiple-X terms if the distribution of the random field is bimodal (Ichikawa, 2014).
One may wonder if there is any other way to show that an antiferromagnetic multiple-X term indeed represents strong quantum effects by a more direct method, not via the impossibility of the standard quantum Monte Carlo technique. We are now investigating this problem, and the result will soon be published (Susa et al., 2016). One of the hints may lie in the sign of the coefficients of the ground-state wave function in the standard computational basis. For a stoquastic Hamiltonian, the coefficients can be chosen to be all non-negative, according to the Perron–Frobenius theorem. This leads to the natural interpretation of the (squared) magnitude of a coefficient as the probability. If, in contrast, the Hamiltonian is non-stoquastic, some of the coefficients can be negative or even complex, and the conventional interpretation of the squared absolute value of the coefficient as the probability does not necessarily fit very well with our (classical) intuition. Whether or not this fact suggests strong quantum effects needs further scrutiny.
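The sign structure described above can be seen in a toy two-qubit example. The sketch below is not one of the models analyzed in this article: the Hamiltonian H = −ZZ − Γ(XI + IX) + J·XX, the parameter values, and the function names are all assumptions chosen for illustration. With J = 0 every off-diagonal element is non-positive (stoquastic), and with J > 0 the XX term contributes positive off-diagonal elements.

```python
import numpy as np

I2 = np.eye(2)
X = np.array([[0.0, 1.0], [1.0, 0.0]])
Z = np.array([[1.0, 0.0], [0.0, -1.0]])

def toy_hamiltonian(gamma, J):
    """Two-qubit toy model in the basis |00>, |01>, |10>, |11> (illustrative only)."""
    ZZ = np.kron(Z, Z)
    X_sum = np.kron(X, I2) + np.kron(I2, X)   # transverse field on both qubits
    XX = np.kron(X, X)                        # two-body transverse interaction
    return -ZZ - gamma * X_sum + J * XX

def is_stoquastic(H, tol=1e-12):
    """True if all off-diagonal elements are non-positive in this basis."""
    off = H - np.diag(np.diag(H))
    return bool(np.all(off <= tol))

def ground_state(H):
    """Lowest eigenpair, with the global sign fixed so the largest component is positive."""
    E, V = np.linalg.eigh(H)                  # eigenvalues in ascending order
    v = V[:, 0]
    v = v * np.sign(v[np.argmax(np.abs(v))])
    return E[0], v

if __name__ == "__main__":
    for J in (0.0, 1.0):
        H = toy_hamiltonian(gamma=0.5, J=J)
        E0, v0 = ground_state(H)
        print(f"J={J}: stoquastic={is_stoquastic(H)}, E0={E0:.4f}, coefficients={np.round(v0, 4)}")
```

For Γ = 0.5, the J = 0 ground state has strictly positive coefficients, as the Perron–Frobenius theorem guarantees, whereas for J = 1 the ground state is (|00⟩ − |11⟩)/√2, whose coefficients have both signs. This only illustrates the sign pattern in the computational basis and says nothing by itself about computational hardness.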
It is an interesting question how far the present results for the infinite-range fully connected models apply to more realistic problems with relatively sparse connections, e.g., a problem on a finite-dimensional lattice with short-range interactions. It is of course difficult to say anything with confidence without explicit evidence. Nevertheless, our experience in the physics of phase transitions suggests that a mean-field analysis often provides reliable results also for finite-dimensional systems with a finite number of connections per site as far as qualitative descriptions are concerned (Nishimori and Ortiz, 2011). It would come as a surprise if this general rule of thumb did not apply to the present case. It is worth the effort to investigate if and how antiferromagnetic multiple-X terms widen the energy gap in systems with finite connections. We are studying a related problem, and the results will be published before too long.
HN played a major role in conducting the research described in this article. KT contributed to section 2.4.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Most of the technical results described in this article appeared in Seki and Nishimori (2012, 2015) and Seoane and Nishimori (2012), albeit from a slightly different viewpoint from that presented here. One of the authors (HN) sincerely thanks Yuya Seki and Beatriz Seoane for stimulating collaboration. Also acknowledged are useful comments by Tameem Albash, Jacob Biamonte, Eddie Farhi, Itay Hen, Layla Hormozi, Helmut Katzgraber, and Daniel Lidar. This work was funded by the ImPACT Program of Council for Science, Technology and Innovation, Cabinet Office, Government of Japan and by the JSPS KAKENHI Grant No. 26287086.
1. It is to be noticed that, in some cases, it is non-trivial to efficiently simulate a stoquastic Hamiltonian. See, for example, Hastings and Freedman (2013) and Jarret et al. (2016).
2. It has been pointed out in Knysh (2016) that a different type of difficulty exists within the spin glass phase of the Hopfield model when the random variables are Gaussian distributed, not binary as in the present paper.
3. Notice that the replica symmetric ansatz (Nishimori, 2001) is used in the calculations. Our experience in the simple quantum Hopfield model in a transverse field suggests that the replica symmetry breaking takes place only in a very limited region in the phase diagram (Nishimori and Nonomura, 1996), and we expect it to be reasonable to assume a similar situation in the present case as well.
Albash, T., Rønnow, T., Troyer, M., and Lidar, D. (2015). Reexamining classical and quantum models for the D-Wave One processor. Euro. Phys. J. Spec. Top. 224, 111–129. doi:10.1140/epjst/e2015-02346-0
Boixo, S., Rønnow, T. F., Isakov, S. V., Wang, Z., Wecker, D., Lidar, D. A., et al. (2014). Evidence for quantum annealing with more than one hundred qubits. Nat. Phys. 10, 218–224. doi:10.1038/nphys2900
Denchev, V. S., Boixo, S., Isakov, S. V., Ding, N., Babbush, R., Smelyanskiy, V., et al. (2016). What is the computational value of finite range tunneling? Phys. Rev. X 6, 031015. doi:10.1103/PhysRevX.6.031015
Farhi, E., Goldstone, J., Gutmann, S., Lapan, J., Lundgren, A., and Preda, D. (2001). A quantum adiabatic evolution algorithm applied to random instances of an NP-complete problem. Science 292, 472–475. doi:10.1126/science.1057726
Farhi, E., Gosset, D., Hen, I., Sandvik, A. W., Shor, P., Young, A. P., et al. (2012). Performance of the quantum adiabatic algorithm on random instances of two optimization problems on regular hypergraphs. Phys. Rev. A 86, 052334. doi:10.1103/PhysRevA.86.052334
Isakov, S. V., Mazzola, G., Smelyanskiy, V. N., Jiang, Z., Boixo, S., Neven, H., et al. (2016). Understanding quantum tunneling through quantum Monte Carlo simulations. Phys. Rev. Lett. 117, 180402. doi:10.1103/PhysRevLett.117.180402
Jiang, Z., Smelyanskiy, V. N., Isakov, S. V., Boixo, S., Mazzola, G., Troyer, M., et al. (2017). Scaling analysis and instantons for thermally-assisted tunneling and quantum Monte Carlo simulations. Phys. Rev. A 95, 012322. doi:10.1103/PhysRevA.95.012322
Jörg, T., Krzakala, F., Kurchan, J., Maggs, A. C., and Pujos, J. (2010). Energy gaps in quantum first-order mean-field-like transitions: the problems that quantum annealing cannot solve. EPL 89, 40004. doi:10.1209/0295-5075/89/40004
Katzgraber, H. G., Hamze, F., and Andrist, R. S. (2014). Glassy chimeras could be blind to quantum speedup: designing better benchmarks for quantum annealing machines. Phys. Rev. X 4, 021008. doi:10.1103/PhysRevX.4.021008
Katzgraber, H. G., Hamze, F., Zhu, Z., Ochoa, A. J., and Munoz-Bauza, H. (2015). Seeking quantum speedup through spin glasses: the good, the bad, and the ugly. Phys. Rev. X 5, 031026. doi:10.1103/PhysRevX.5.031026
Mandrà, S., Zhu, Z., Wang, W., Perdomo-Ortiz, A., and Katzgraber, H. G. (2016b). Strengths and weaknesses of weak-strong cluster problems: a detailed overview of state-of-the-art classical heuristics versus quantum approaches. Phys. Rev. A 94, 022337. doi:10.1103/PhysRevA.94.022337
Matsuda, Y., Nishimori, H., and Katzgraber, H. G. (2009). Ground-state statistics from annealing algorithms: quantum versus classical approaches. New J. Phys. 11, 073021. doi:10.1088/1367-2630/11/7/073021
Steiger, D. S., Rønnow, T. F., and Troyer, M. (2015). Heavy tails in the distribution of time to solution for classical and quantum annealing. Phys. Rev. Lett. 115, 230501. doi:10.1103/PhysRevLett.115.230501
Venturelli, D., Mandrà, S., Knysh, S., O’Gorman, B., Biswas, R., and Smelyanskiy, V. (2015). Quantum optimization of fully connected spin glasses. Phys. Rev. X 5, 031040. doi:10.1103/PhysRevX.5.031040
Keywords: quantum annealing, quantum adiabatic algorithm, stoquastic Hamiltonian, non-stoquastic Hamiltonian, exponential speedup
Citation: Nishimori H and Takada K (2017) Exponential Enhancement of the Efficiency of Quantum Annealing by Non-Stoquastic Hamiltonians. Front. ICT 4:2. doi: 10.3389/fict.2017.00002
Received: 23 September 2016; Accepted: 26 January 2017; Published: 17 February 2017
Edited by: Itay Hen, University of Southern California, USA
Reviewed by: Faisal Shah Khan, Khalifa University, United Arab Emirates
Evgeny Andriyash, D-Wave Systems Inc., Canada
Layla Hormozi, Massachusetts Institute of Technology, USA
Copyright: © 2017 Nishimori and Takada. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hidetoshi Nishimori, email@example.com