# On the Difference Between Additive and Subtractive QM/MM Calculations

- Department of Theoretical Chemistry, Chemical Centre, Lund University, Lund, Sweden

The combined quantum mechanical (QM) and molecular mechanical (MM) approach (QM/MM) is a popular method to study reactions in biochemical macromolecules. Even if the general procedure of using QM for a small, but interesting part of the system and MM for the rest is common to all approaches, the details of the implementations vary extensively, especially the treatment of the interface between the two systems. For example, QM/MM can use either additive or subtractive schemes, of which the former is often said to be preferable, although the two schemes are often mixed up with mechanical and electrostatic embedding. In this article, we clarify the similarities and differences of the two approaches. We show that inherently, the two approaches should be identical and in practice require the same sets of parameters. However, the subtractive scheme provides an opportunity to correct errors introduced by the truncation of the QM system, i.e., the link atoms, but such corrections require additional MM parameters for the QM system. We describe and test three types of link-atom correction, viz. for van der Waals, electrostatic, and bonded interactions. The calculations show that electrostatic and bonded link-atom corrections often give rise to problems in the geometries and energies. The van der Waals link-atom corrections are quite small and give results similar to a pure additive QM/MM scheme. Therefore, both approaches can be recommended.

## Introduction

Combined quantum mechanics and molecular mechanics (QM/MM) is a popular method to study biological macromolecules, as well as homogeneous catalysis and nanostructures, with computational methods (Balcells and Maseras, 2007; Lin and Truhlar, 2007; Ramos and Fernandes, 2008; Stoyanov et al., 2008; Senn and Thiel, 2009; Keal et al., 2011; Chung et al., 2015; Jover and Maseras, 2016; Ryde, 2016). In this approach, a small region of central interest (typically 20–300 atoms) is treated with quantum mechanical (QM) methods, whereas the remainder of the macromolecule, as well as a considerable amount of explicit solvent are treated by molecular mechanics (MM). This is supposed to combine the accuracy of QM methods with the speed of MM methods. Moreover, the entire macromolecule is included in the calculations [in contrast to the alternative QM-cluster approach (Blomberg et al., 2014), in which most parts of the macromolecule are omitted], reducing the risk of making a biased choice in the selection of the considered system and allowing for a detailed study of how the surroundings affect the properties of interest.

A problem with QM/MM approaches is that there exists so many variants and that the details of these are seldom discussed. For example, QM/MM approaches can use either subtractive or additive schemes (Senn and Thiel, 2009). In a subtractive scheme, three separate calculations are performed: One QM calculation with the QM region (system 1; ${E}_{1}^{\mathrm{\text{QM}}}$) and two MM calculations, one for the entire system (systems 1 and 2; ${E}_{12}^{\mathrm{\text{MM}}}$) and one for the QM region (${E}_{1}^{\mathrm{\text{MM}}}$) (Maseras and Morokuma, 1995; Ryde, 1996b; Svensson et al., 1996):

The advantage with this approach is the simplicity: It automatically ensures that no interactions are double-counted and it can be set up for any QM and MM software (provided that they can write out energies and forces), without the need of any modification of the code. Thereby, the QM/MM software is updated every time the underlying QM or MM software is updated. Moreover, it can be easily extended to more than two computational methods and regions (Svensson et al., 1996). The typical example of a subtractive scheme is ONIOM (Svensson et al., 1996), but other software use similar methods, e.g., ComQum (Ryde, 1996b).

In the additive scheme, only two calculations are performed: the same QM calculation for the QM region, but only a single MM calculation (${E}_{2-1}^{\mathrm{\text{MM}}}$) (Sherwood et al., 2003; Senn and Thiel, 2009):

although the latter is often formally divided into two terms, a MM energy of system 2 and a QM/MM interface energy (${E}_{2-1}^{\mathrm{\text{MM}}}=\text{}{E}_{2}^{\mathrm{\text{MM}}}+\text{}{E}_{12}^{\mathrm{\text{QM/MM}}}$). In this case, it is up to the developer to ensure that no interactions are omitted or double-counted. Therefore, an additive scheme requires a special MM software, in which the user or developer can select which MM terms to include. The advantage of the additive QM/MM scheme is that no MM parameters for the QM atoms are needed, because those energy terms are calculated by QM.

Further differences may arise if the QM region is covalently connected to the MM region. Then, the QM system needs to be properly truncated. This can be done by special localized orbitals (Levitt, 1976; Théry et al., 1994; Gao et al., 1998; Murphy et al., 2000; Senn and Thiel, 2009), but it is more common that the QM system is simply truncated by hydrogen atoms, the hydrogen link-atom approach (Singh and Kollman, 1986; Field et al., 1990; Reuter et al., 2000; Senn and Thiel, 2009; Ryde, 2016) In the subtractive scheme, MM parameters for the link atoms are needed (Senn and Thiel, 2009).

The interaction between the QM and MM regions is typically dominated by electrostatics. This interaction can also be treated at different levels of approximation (Senn and Thiel, 2009; Ryde, 2016). In mechanical embedding, it is calculated at the MM level (Maseras and Morokuma, 1995; Svensson et al., 1996). In electrostatic embedding, the electrostatic QM–MM interaction is instead treated at the QM level by including a point-charge model (i.e., atomic partial MM charges) of system 2 in the QM calculations (Singh and Kollman, 1986; Field et al., 1990; Ryde, 1996b; Dapprich et al., 1999). Thereby, system 1 is polarized by system 2, but not vice versa. In polarized embedding, both systems are mutually and self-consistently polarized in the QM calculations (Poulsen et al., 2001; Söderhjelm et al., 2009; Olsen et al., 2010). This requires a polarizable MM force field for system 2 (Lopes et al., 2009) and a QM software that can treat polarizabilities, which are still rather unusual. Therefore, such calculations are less common and typically restricted to single-point calculations of accurate properties. Mechanical embedding is normally considered to be less accurate than electrostatic embedding (Senn and Thiel, 2009), and the latter has therefore been the most widely used approximation, although it involves polarization of only parts of the system and is more sensitive to the treatment of the link atoms (Hu et al., 2011b). Strictly, Equations (1, 2) apply only to mechanical embedding, but they can easily be adapted to electrostatic embedding by including a point-charge model of system 2 in the QM term (*E*_{QM1 + ptch2}) and setting the charges of system 1 to zero in the MM calculations (Ryde, 1996b).

Unfortunately, the distinction between the subtractive and additive schemes in literature is often unclear and confused. In many cases, the subtractive scheme is equated with mechanical embedding and the additive scheme with electrostatic embedding (Senn and Thiel, 2009; Götz et al., 2014). In other cases, the subtractive scheme is equated with the ONIOM method (Roßbach and Ochsenfeld, 2017). We prefer the definition in Equations (1, 2), emphasizing that the subtractive scheme employs two MM calculations with an external MM program, whereas the additive scheme employs a single MM calculation with an internal MM program, allowing the developer to cherry-pick the MM terms actually needed. In particular, both additive and subtractive schemes may use either mechanical or electrostatic (or even polarized) embedding.

It is normally assumed that the subtractive scheme is harder to set up and requires accurate MM parameters for the QM region and link atoms. For example, Roßbach and Ochsenfeld state in a recent article comparing subtractive and additive QM/MM (Roßbach and Ochsenfeld, 2017): “The (additive) QM/MM approach has the advantage that parameters for QM and link atoms, saturating covalent bonds between QM and MM, are unnecessary, as these are never described by the force field. The subtractive ONIOM approach requires accurate parameters for all atoms, including link atoms, because an MM calculation of the QM region is also necessary to avoid double counting.” On the other hand, Sousa et al. present the opposite view that the subtractive scheme is more advantageous, because of the “lack of a requirement for a parameterized expression describing the interaction of the various regions, and the fact that all systematic errors in the treatment of the inner regions by the lower levels of theory are canceled out” (Sousa et al., 2017).

In this article, we aim at clarifying the difference between the two schemes and compare their performance. We will show that with a proper setup, additive, and subtractive schemes should give identical results with a similar effort and that they require the same set of MM parameters. However, the subtractive scheme may be tuned to correct errors introduced by the link atoms and then additional parameters are needed.

## Methods

### The ComQum QM/MM Software and Its Subtractive Scheme

A problem when comparing QM/MM methods is that there exist so many variants and that the details of the calculations are seldom discussed (Ryde, 1996b; Senn and Thiel, 2009). Therefore, we here give a thorough discussion of our QM/MM software and details of all QM/MM variants implemented. All QM/MM calculations in this article were performed with the ComQum software (Ryde, 1996b; Ryde and Olsson, 2001). ComQum is a modular program, combining the QM software Turbomole (Ahlrichs et al., 1989; Furche et al., 2014; TURBOMOLE version 7.1, 2016) and the MM software AMBER (Case et al., 2014). It consists of five small Fortran programs that read, write, manipulate, and transfer coordinates, energies, forces, and charges between the MM and QM software. It employs a subtractive scheme and it was developed in 1992–1995, concurrently and independently from the ONIOM software (Ryde, 1995, 1996a,b). It has always used electrostatic embedding, in contrast to ONIOM. Junctions are treated by the hydrogen link-atom approach (Ryde, 1996b; Reuter et al., 2000; Senn and Thiel, 2009).

To make the discussion clear, we use the following conventions (Senn and Thiel, 2009): The QM region is called system 1, whereas atoms in the MM region are called system 2. The QM region is terminated by hydrogen link atoms, called HL. They replace the corresponding carbon link (CL) atoms in the real system. HL and CL are different representations of the same atom and never appear in the same (MM or QM) calculation. This is illustrated in Figure 1. A superscript HL or CL show which representation is used. We will use XL to denote either HL or CL. The HL atom is covalently bound to a single Q_{1} atom the QM region, whereas the CL atom is connected to (typically several) M_{2} atoms in the MM region (we use this somewhat illogical notation, because several other QM/MM descriptions use M_{1} to denote CL; we prefer our notation because in ComQum, HL and CL are two different representations of the same atom, which belongs to the QM region). The Q_{1} atoms are covalently bound to Q_{2} atoms, which are covalently bound to Q_{3} atoms, and so on. Likewise, the M_{2} atoms are covalently bound to M_{3} atoms, and so on.

The HL atom is placed along the Q_{1}−CL bond, with the Q_{1}−HL bond length (*r*_{Q1−HL}) calculated from (Ryde, 1996b):

where r_{Q1−CL} is the current Q_{1}–CL bond length, ${\text{r}}_{{\text{Q}}_{1}-\text{CL}}^{0}$ is the optimum Q_{1}–CL bond length in the MM force field and ${\text{r}}_{{\text{Q}}_{1}-\text{HL}}^{0}$ is the Q_{1}–HL bond length in a model of the isolated truncated residue, optimized with the current QM method and basis set. ${\text{r}}_{{\text{Q}}_{1}-\text{CL}}^{0}$ can be found in the MM force field libraries and ${\text{r}}_{{\text{Q}}_{1}-\text{HL}}^{0}$is easily obtained by a simple QM geometry optimisation (which typically takes less than a minute). Equation (3) can also be used in reverse during the QM/MM geometry optimization to calculate the CL coordinates from the HL coordinates and it is also used together with the chain rule to obtain the MM forces on the HL atom (Ryde and Olsson, 2001). Thus, the HL atoms do not introduce any additional degrees of freedom.

The total QM/MM energy in standard ComQum (which uses a subtractive scheme with electrostatic embedding) is calculated from Equation (4) (Ryde, 1996b; Ryde and Olsson, 2001):

where ${E}_{\mathrm{\text{QM1 + ptch2}}}^{\mathrm{\text{HL}}}$ is the QM energy of the QM system truncated by HL atoms and embedded in the set of point charges modeling system 2 (but excluding the self-energy of the point charges). All atoms in system 2 are included in the point-charge model (but not the CL atoms, which do not belong to system 2 in our view). In our original development, charges of some additional atoms were excluded (Ryde, 1996b), but a comparison of several different charge distribution schemes did not show any advantage of more complicated schemes (Hu et al., 2011b). ${E}_{\mathrm{\text{MM1}},{\mathrm{\text{q}}}_{\mathrm{\text{1}}}=0}^{\mathrm{\text{HL}}}$ is the MM energy of the QM system 1, still truncated by HL atoms, but without any electrostatic interactions. Finally, ${E}_{\mathrm{\text{MM12}},{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{CL}}}$ is the MM energy of all atoms in the system with CL atoms and with the charges of the QM system set to zero (to avoid double counting of the electrostatic interactions).

In the original implementation of ComQum (Ryde, 1996b), it was necessary to set up two AMBER parameter and topology (prmtop) files for the MM calculations, one for the full system (${E}_{\mathrm{\text{MM}}12,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{CL}}}$) and one for the truncated QM region (${E}_{\mathrm{\text{MM}}1,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{HL}}}$). This involved development of MM parameters for all junctions, which is tedious and error prone, even if the same truncated residues can be used for several different proteins. Initially, some attempts were made to allow the calculations on the small system compensate for the truncations and the introduction of link atoms (i.e., for the conversion of CL atoms to HL atoms). However, we did not see any consistent improvement in the results; instead such a treatment often introduced instability in the calculations.

In particular, we soon realized that the parameters of the HL–Q_{1} bond cannot be freely selected. The deterministic relation between HL and CL in ComQum (Equation 3) implies that we should use ${\text{r}}_{{\text{Q}}_{1}-\text{HL}}^{0}$ as the equilibrium bond length and the force constant must be

where *k*_{Q1−CL} is the force constant of the Q_{1}−CL bond in the MM force field. Otherwise, a spurious force will be introduced. Moreover, for the Q_{2}−Q_{1}−HL angle and the Q_{3}−Q_{2}−Q_{1}−HL dihedral parameters we simply used the corresponding MM Q_{2}−Q_{1}−CL angle and the Q_{3}−Q_{2}−Q_{1}−CL parameters, obtained by copying these entries from the MM force field.

Even if this was a completely mechanical procedure, it was still somewhat tedious and error prone, because it had to be redone every time the MM force field, QM method, or basis set was changed. Therefore, we implemented in 2006 a program (changeparm) that performed this task automatically: Reading the AMBER prmtop file of the full system and a file with the ideal QM bond length (${\text{r}}_{\text{XL}-\text{HL}}^{0}$), it automatically generates the prmtop and coordinate files for the MM calculation of the QM system (${E}_{\mathrm{\text{MM}}1,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{HL}}}$) according to these rules. Thereby, only a single MM calculation needs to be set up (that of the full system, which typically is already done, because QM/MM studies of macromolecules normally start with an equilibration of the structure with molecular dynamics) and no special parameters needs to be developed for the truncated system or the link atoms. This removes one of the disadvantages with the subtractive scheme (but it also discards a potential advantage with the method, as will be discussed below). However, the changeparm program required procedures to read and write the prmtop file, as well as a complete understanding of the meaning of all entries in it, a significant programming effort. Still, it has been so valuable that we recently have implemented the corresponding program (Cao et al., 2018) also for the crystallography and NMR system (CNS; Brunger et al., 1998; Brunger, 2007) software for quantum refinement (Ryde et al., 2002; Ryde and Nilsson, 2003).

Thus, with our implementation, also the subtractive QM/MM scheme requires only a single prmtop for the entire system. However, this file must contain parameters for all atoms, including those in the QM region. This is so because (the *leap* module in) AMBER refuses print the file if any parameter is missing. It may be that other MM software is less restrictive. However, this is a rather minor restriction, because the parameters do not need to be accurate. On the contrary, for all interactions involving only QM atoms (except van der Waals interactions involving the link atoms), the MM energies cancel exactly (owing to the ${E}_{\mathrm{\text{MM}}12,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{CL}}}-{E}_{\mathrm{\text{MM}}1,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{HL}}}$ terms in Equation 4). Therefore, dummy (e.g., zeroed) parameters may be used. Moreover, with the general MM force fields available in most MM software (Vanommeslaeghe et al., 2010), including AMBER (Wang et al., 2004), most parameters already exist. The only problem may be metal sites, but if no explicit bonds are defined between the metals and any ligand atoms, only van der Waals parameters for the metal are needed and these are necessary also in the additive scheme. Thus, the need of parameters for the QM region is a very minor restriction of the subtractive scheme and the setup of dummy parameters can easily be automatized (although we have never felt such a need).

### Additive ComQum

Recently, we implemented a simple software (calcforce), which calculates MM energies and forces, based on AMBER coordinate and parameter–topology files. This was done to allow calculations without any cut-off for non-bonded interactions, even for very large systems (AMBER employs a non-bonded pair list that can become too large for the memory with more than ~10^{5} atoms) and to gain control over exactly what forces are written out by the program (for example, turning on the dumfrc option in the AMBER software changes the electrostatic energy). As a by-product, it also gives us full control over the energy function and allowed us to implement an additive QM/MM scheme.

In our additive approach, we use the energy function:

where the ${E}_{\mathrm{\text{QM}}1+\text{ptch}2}^{\mathrm{\text{HL}}}$ QM term is identical to that used in the subtractive scheme. As shown by the superscript, all terms in ${E}_{\mathrm{\text{MM}}\text{}2-1}^{\mathrm{\text{CL}}}$ employ CL atoms, coordinates and parameters, never any HL atoms. We employed the following rules to determine what MM terms to include in ${E}_{\mathrm{\text{MM}}\text{}2-1}^{\mathrm{\text{CL}}}$ (note again that in our notation, the CL atoms belong to the QM region):

• All interactions involving only QM atoms are excluded (already included in the QM term).

• All interactions involving only MM atoms are included.

• Bonded interactions involving at least one MM atom are included.

• Electrostatic interactions involving one MM and one QM atom are excluded (already included in the QM calculation).

• Van der Waals interactions involving one MM and one QM atom are included.

These rules are based on the simple philosophy that we should calculate by MM all terms that are not already considered in the QM calculations. This seems very natural and should represent a typical implementation of additive QM/MM (Sherwood et al., 2003), although the rules are seldom discussed explicitly. This selection is illustrated in Table 1 for the simple ethanol model in Figure 1 (with CB as the link atom). Note that the first four rules are identical to the rules we use for setting up the truncated prmtop file in the subtractive scheme, so that the two schemes should give identical bonded and electrostatic energies. In particular, the two schemes give identical energies for the XL–Q_{1} bond term (assuming the harmonic term ${E}_{{\mathrm{\text{Q}}}_{1}-\mathrm{\text{XL}}}^{\mathrm{\text{bond}}}={\mathrm{\text{k}}}_{{\mathrm{\text{Q}}}_{1}-\mathrm{\text{XL}}}{({\text{r}}_{{\mathrm{\text{Q}}}_{1}-\mathrm{\text{XL}}}-\text{}{\mathrm{\text{r}}}_{{\mathrm{\text{Q}}}_{1}-\mathrm{\text{XL}}}^{0})}^{2}$ employed in AMBER and most other macromolecular force fields) even if the subtractive scheme uses HL coordinates and the additive scheme CL coordinates, because of the relations between the coordinates in Equation (3) and the force constants in Equation (5) (the force constants in Equation 5 were constructed with this aim).

**Table 1**. Illustration of which terms are included in the various energies for the ethanol molecules in Figure 1 (atom names are shown in that figure, except that H1 indicates either H11 or H12 and H2 indicates H21, H22, or H23).

The only difference between the two schemes is the van der Waals interactions involving the link atoms and another atom in the QM system (possibly other link atoms): In the additive scheme, no such interactions are calculated by MM (because both atoms belong to the QM system). However, in the subtractive scheme, all van der Waals interactions involving link atoms are calculated twice: In ${E}_{\mathrm{\text{MM}}12,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{CL}}}$, they are obtained with MM parameters and coordinates for CL atoms, whereas in ${E}_{\mathrm{\text{MM}}1,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{HL}}}$, they are obtained with MM parameters and coordinates for HL atoms. In variance to all the other QM atoms, these two terms are not identical and therefore will not cancel. Instead, they provide a MM correction to the link atom, i.e., to the fact that the HL atoms in the QM region are H atoms and not the correct C atoms (i.e., they have smaller van der Waals radii) and that they are at incorrect positions. Thus, these van der Waals terms in ${E}_{\mathrm{\text{MM}}1,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{HL}}}$ can be seen as a correction to the corresponding energy in the QM calculation (${E}_{\mathrm{\text{QM}}1+\mathrm{\text{ptch}}2}^{\mathrm{\text{HL}}}$), which also involves the incorrect HL coordinates and atoms. We will call it the *van der Waals link-atom correction* (VLAC). MM van der Waals parameters are normally quite accurate, so this approach is used in most subtractive schemes, but it cannot be included in a strict additive scheme, which may be a disadvantage. It should be noted that these interactions are only within the QM region, so with a small QM region, it involves only a few interactions and the correction is small.

In fact, we can exactly reproduce the additive QM/MM calculations within a subtractive scheme by replacing the ${E}_{\mathrm{\text{MM}}1,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{HL}}}$ term in Equation (4) with a ${E}_{\mathrm{\text{MM}}1,{\mathrm{\text{q}}}_{1}=0}^{\mathrm{\text{CL}}}$ term, in which parameters and coordinates corresponding to the CL atoms, rather than the HL atoms, are used. This was done manually to confirm that the implementations are correct, but it has never been implemented for production calculations (because the additive scheme gives the same results).

### Mechanical Embedding

For comparison, we have also implemented mechanical embedding (ME) in ComQum [we have calculated single-point ME energies before (Hu et al., 2011a,b), but not done full geometry optimizations]. ME calculations can be run by simply deleting the point-charge model from the QM calculations (removing the $point_charges keyword from the Turbomole control file) and (re-)inserting charges of the QM system in the prmtop files for the two MM calculations in Equation (4). This gives the energy function:

Two issues need to be settled in this implementation. The first is how to treat the link atoms. If the charge of the HL atom is identical to that of the CL atom, the electrostatics within the QM system cancel exactly in the ${E}_{\mathrm{\text{MM}}12}^{\mathrm{\text{CL}}}-{E}_{\mathrm{\text{MM}}1}^{\mathrm{\text{HL}}}$ terms in Equation (7). However, then the total energy reflects electrostatics involving HL atoms, from the ${E}_{\mathrm{\text{QM1}}}^{\mathrm{\text{HL}}}$ term. In analogy with the van der Waals energy correction described in the previous section, we prefer to have different charges for the HL and CL atoms, with those of the HL atoms being representative for a H atom and those of the CL atoms being representative for the true (typically C) atoms.

This leads us to the second issue, viz. how the charges are calculated for the QM system, including the HL and CL atoms. Since QM calculations are done for the QM system, it is natural to use some sort of QM-derived charges. Originally, ComQum employed Mulliken charges (these charges are used also with electrostatic embedding when parts of system 2 is optimized by MM; Ryde, 1996b). However, it is well-known that charges fitted to the electrostatic potential (ESP) give (by construction) more accurate electrostatic interaction energies (Sigfridsson and Ryde, 1998). Therefore, we have used such ESP charges [obtained with the Merz–Kollman scheme (Besler et al., 1990)] ever since ESP charges were implemented in Turbomole (note that in Turbomole, the point-charge model needs to be removed before the charges are calculated, without reoptimizing the wavefunction). These charges can be directly used in the ${E}_{\mathrm{\text{MM}}1}^{\mathrm{\text{HL}}}$ term and therefore also for the HL atoms, since they were obtained for a QM system with HL atoms.

However, for the ${E}_{\mathrm{\text{MM}}12}^{\mathrm{\text{CL}}}$ term, these charges need to be adapted so that the charge of the total system remains integer, meaning that the HL charges need to be adapted to apply for CL atoms instead. It is not evident how this should be done and it is seldom discussed, although this needs to be done for essentially all QM/MM and MD simulations employing QM charges calculated for QM systems with link atoms.

We have selected to follow this procedure:

1. Start from the MM charges of the entire system with formal (integer) charges on metals and other inorganic groups.

2. Use the QM charges for all QM atoms, except the HL atoms.

3. Use the original MM charges for all MM atoms.

4. For each amino acid (or other biochemical building unit) with link atoms: Add a constant offset to the original MM charges of each CL atom in the unit so that the total charge of this unit is the same as the sum of the QM charge of these atoms, possibly with the addition of an integer charge from atoms outside the QM system.

The procedure is illustrated in Table 2. It is fully automatic, except that a possible integer charge outside the QM system needs to be specified. This way, charge transfer within the QM system is allowed, meaning that none of the QM residues has an integer charge. The modification of the charges on the CL atoms is also kept to a minimum. However, alternative approaches are conceivable, e.g., dividing the remaining charge equally over all link atom, after the charges of the non-HL QM atoms have been set. Thus, the CL charges are somewhat ambiguous. Importantly, the procedure keeps all charges of the QM system equal between ${E}_{\mathrm{\text{MM}}12}^{\mathrm{\text{CL}}}$ and ${E}_{\mathrm{\text{MM}}1}^{\mathrm{\text{HL}}}$, except for the HL/CL atoms, allowing for a proper cancelation of those electrostatic terms in the ME approach, in analogy with the VLAC correction for the subtractive scheme.

**Table 2**. Illustration of the method to determine charges for methanol and ethanol (shown in Figure 1 with atom names; H1 is H11 and H12; H2 is H21, H22, and H23).

### Electrostatic Link-Atom Corrections

In the electrostatic-embedding variant of ONIOM (Vreven et al., 2006), as well as in our QTCP approach (QM/MM thermodynamic cycle perturbation; Rod and Ryde, 2005), further attempts are made to correct errors introduced by the link atoms. In the QM calculations, the HL atoms are of the wrong element, located at the incorrect position (compared to the CL atom) and they may make Coulombic interactions with point charges of nearby atoms that are not included or are scaled down in normal MM calculations (viz. interactions with the M_{2}, M_{3}, and M_{4} atoms). These errors can be compensated by calculating exactly the same interactions in the ${E}_{\mathrm{\text{MM}}1}^{\mathrm{\text{HL}}}$ term and replace them with the corresponding interactions in the ${E}_{\mathrm{\text{MM}}12}^{\mathrm{\text{CL}}}$ term.

In practice, this is accomplished by using QM charges for the QM system in both MM calculations and including the same point-charge model of the surroundings in the ${E}_{\text{MM}1}^{\text{HL}}$ term. We call this approach electrostatic link-atom correction (ELAC) and it gives the following energy function:

For these calculations, we used the same QM charges for the QM system and the same MM charges for MM system as described for the mechanical-embedding calculations.

### Bonded Link-Atom Corrections

Finally, the subtractive scheme allows for a third type of link-atom corrections, viz. for the bonded terms. It is likely that a HL atom will give rise to slightly different bonded terms than the corresponding CL atom, e.g., smaller XL–Q_{1}−Q_{2} ideal angles. Again, we may try to use the ${E}_{\text{MM}1}^{\text{HL}}$ calculation to correct for these errors (i.e., so that the HL bonded terms would cancel between the ${E}_{\mathrm{\text{QM}}1+\mathrm{\text{ptch}}2}^{\mathrm{\text{HL}}}$ and ${E}_{\mathrm{\text{MM}}1}^{\mathrm{\text{HL}}}$ terms and the corresponding CL result in ${E}_{\mathrm{\text{MM}}12}^{\mathrm{\text{CL}}}$ would remain, instead of the exact cancelation of these terms between ${E}_{\mathrm{\text{MM}}1}^{\mathrm{\text{HL}}}$ and ${E}_{\mathrm{\text{MM}}12}^{\mathrm{\text{CL}}}$ as in both the standard additive and subtractive approaches). This is done by using different parameters for the Q_{2}−Q_{1}–HL and Q_{2}−Q_{1}–CL angles and the Q_{3}−Q_{2}–Q_{1}−HL and Q_{3}–Q_{2}−Q_{1}−CL dihedral parameters (and possibly also for the Q_{1}–HL and Q_{1}–CL bond parameters). We will call this bonded link-atom corrections (BLAC).

Of course, the parameters need to be accurate for there to be a hope of any improved results. In this paper, we tried two approaches. In the first (BLAC1), we used standard AMBER parameters for both the CL and HL terms, the latter typically coming from the GAFF (Wang et al., 2004) force field. These parameters involved also the bonded Q_{1}-XL terms.

In the second approach (BLAC2), we instead performed a parametrisation of both the full and truncated systems, based on a QM frequency calculation on each system. The bonded parameters were then extracted with the Seminario approach (Seminario, 1996), using the Hess2FF program (Nilsson et al., 2003; Hu and Ryde, 2011). We used the same atom types as in the AMBER files in BLAC1. This means that in principle all bonded parameters will differ in the two MM calculations, not only those involving the XL atom. Finally, BLAC1J and BLAC2J was obtained from BLAC1 and BLAC2 by changing the Q_{1}–HL force constants according to Equation (5), keeping everything else the same.

### Test Systems

All QM calculations were carried out using the Turbomole software (versions 7.1 and 7.2; Ahlrichs et al., 1989; Furche et al., 2014). They were performed using the TPSS (Tao et al., 2003) functional in combination with def2-SV(P) (Schäfer et al., 1992) basis set, including empirical dispersion corrections with the DFT-D3 approach (Grimme et al., 2010) with Becke–Johnson damping (Grimme et al., 2011), as implemented in Turbomole. The MM calculations were performed with the AMBER ff14SB (Maier et al., 2015) force field for protein residues, GAFF (Wang et al., 2004) for non-protein molecules and TIP3P (Jorgensen et al., 1983) for water. In all QM/MM calculations, the MM system was kept fixed to simplify the interpretation of the results.

The various approaches were tested on three systems. The first was an isolated ethanol molecule. The reference system was ethanol, optimized with QM. In the QM/MM calculations, the QM system was methanol with C2 converted to a HL atom (Figure 1). The terminal methyl group was in the MM system.

The second test system was sulfite oxidase. The calculations were taken from our recent study of this enzyme (Caldararu et al., 2018). The active site contains a Mo ion coordinated to a molybdopterin (MPT) molecule, as well as a Cys residue and two oxo groups. All these groups were included in the QM system (Cys modeled as CH_{3}S^{−}), as well as the sulfite substrate [note that this QM system is smaller than in our previous study (Caldararu et al., 2018), in which nine additional residues and five water molecules were also included]. In this paper we compared the effects of either including the full MPT residue (in the reduced and protonated form, called MPH in our previous paper) or truncating it to a dimethyldithiolene molecule [DMDT, (CH_{3}CS${(}_{2}^{2-}$; both shown in Figure 2]. Thus, the QM/MM calculations with the full MPT molecule were the reference structures and QM/MM calculations with DMDT in the QM system were run to compare the performance of the various QM/MM variants.

**Figure 2**. The MPT and DMDT ligand, as well as the RS, Im, and PS states in the reaction mechanism of sulfite oxidase. Atom names are indicated for the RS.

In DMDT two of the CL atoms are covalently bonded. This gives a complication when automatically setting up the prmtop file for the truncated system: This bond needs to be explicitly removed from the file (together with the corresponding angles and dihedrals); otherwise, spurious MM forces will cause the calculations to crash with distorted structures. Normally, we discourage from having covalently connected CL atoms, but in this study, it provides a hard test for the various methods to provide junction corrections.

Five states were studied in the reaction: The Mo^{VI} = O+${\text{SO}}_{3}^{2-}$ reactant state (RS) with sulfite in the second coordination sphere of Mo, the Mo^{IV}-${\text{SO}}_{4}^{2-}$ intermediate (Im) with sulfate coordinated to Mo, the Mo^{IV}+${\text{SO}}_{4}^{2-}$ product state (PS) with sulfate in the second sphere of Mo (shown in Figure 2), as well as the two transition states (TS1 and TS2) connecting these three states. The transition states were obtained from potential-energy scans along the S–O1 (TS1, 2.0 Å) and Mo–O1 (TS2, 3.7 Å) reaction coordinates. PS was also obtained with a restraint in the Mo–O2 distance of 3.85 Å, taken from calculations with an appreciably larger QM system (Caldararu et al., 2018). Besides the QM system, the setup was identical to that in our previous study (Caldararu et al., 2018).

The third test system was the conversion of oxophlorin to verdohaem by haem oxygenase. Again, the calculations were taken from a recent study of this enzyme (Alavi et al., 2017). The QM system consisted of the oxophlorin group (an oxidized haem molecule), as well as O_{2} and His (truncated to imidazole) as axial ligands of the Fe ion [again, this QM system is smaller than in our previous study (Alavi et al., 2017), in which two additional residues and six water molecules were also included]. We compared the effects of including either the full oxophlorin ring (OXF) with its eight peripheral propionate, vinyl and methyl substituents or truncating all substituents to HL atoms (OXT; both shown in Figure 3). Thus, the calculations with the full OXF were the reference structures and QM/MM calculations with OXT were run to compare the performance of the various QM/MM variants.

**Figure 3**. The OXF and OXT ligands, as well as the five states in the reaction mechanism of haem oxygenase. Atom names are indicated for the state **1**.

Nine states were studied in the reaction as is shown in Figure 3: The Fe–O_{2} reactant state (**1**), the first intermediate, in which O_{2} is bridging between Fe and one of the OXF carbon atoms (C4B; **2**), the second intermediate (**3**) in which the O1–O2 bond is cleaved and the C4B–O2 bond is formed, the third intermediate (**4**) in which the O2 atom has formed a bond with the C1C OXF atom, giving a four-membered C4B–O2–C1B–CMC ring, the verdohaem product (**5**), in which CO has dissociated, but the C4B–O2–C1B bonds are kept, as well as the four connecting transition states (**T1**–**T4**). The transition states were obtained from potential-energy scans along the O2–C4B (**T1**, 1.8 Å), O1–O2 (**T2**, 1.6 Å), O2–C1B (**T3**, 1.9 Å), and the C4B–C (**T4** 1.8 Å). Besides the QM system, the setup was identical to that in our previous study (Alavi et al., 2017).

## Result and Discussion

In this paper, we clarify the difference between additive and subtractive variants of the QM/MM approach. In principle, the two approaches can be tuned to give exactly the same results, as the additive approach can freely pick almost any energy in the ${E}_{2-1}^{\mathrm{\text{MM}}}$ term in Equation (2). However, in a typical implementation of the two approaches, the primary difference between the two approaches is that the additive scheme employs only a single MM term for each interaction, whereas in the subtractive scheme, there are two MM terms for the QM system, one in ${E}_{12}^{\mathrm{\text{MM}}}$ and one in ${E}_{1}^{\mathrm{\text{MM}}}$. Depending on the implementation, these duplicate terms can either be selected to be identical (and therefore canceling, which would give exactly the same results as in the additive scheme) or they can be different, in particular with the aim of correcting the errors introduced by the HL link atoms in the QM system. We have investigated three different levels of link-atom corrections, involving van der Waals terms (VLAC), electrostatic terms (ELAC), and bonded terms (BLAC). In the following, we test the performance of the various correction schemes for three different systems: ethanol, sulfite oxidase, and haem oxidase. The results are described in three separate sections. For each system, we study six different approaches: the additive scheme (Add, i.e., without any link-atom corrections), the subtractive scheme with van der Waals (VLAC), electrostatic (ELAC), and bonded link-atom corrections (BLAC), the latter in two variants (BLAC1 or BLAC2 and BLAC2J) and mechanical embedding (ME, using a subtractive scheme and VLAC). ELAC and the three variants of BLAC always also include VLAC. Therefore, we use the abbreviation Sub for the subtractive scheme involving only VLAC (emphasizing that this is the standard approach for the subtractive scheme in ComQum). For ethanol, we tested a few additional combinations.

### Ethanol

We first tested all methods on a very simple model system, viz. ethanol, in which methanol was used as the QM system in QM/MM calculations and the results were compared to a QM calculation on the full ethanol molecule. Nine different calculations were run for this system: Add, Sub, two variants of ELAC, BLAC1, BLAC1J, BLAC2, BLAC2J, and ME, as well as ELAC combined with BLAC2J. The two variants of ELAC used the same ESP charges for the QM system. However, for the MM system, they used either the ESP charges of ethanol (ELAC2) or the ESP charges for methanol for all methanol atoms except HL, the ethanol ESP charges for the H21–H23 atoms on the terminal methyl group, whereas the charge on the CL atom was adapted to give a vanishing net charge (ELAC1). The latter approach is similar to what is used for the two enzyme systems, for which ESP charges for the full MM system are not available. The two sets of charges are shown in Table 2.

The results of the QM/MM calculations on ethanol are collected in Table 3. The first column gives the total QM/MM energies, relative to the Add calculations. This is only to illustrate that the QM/MM energy depends on the MM force field and therefore give different results for the various calculations.

The second column gives the QM energy of the QM/MM optimized structure of ethanol. It can be seen that BLAC1J and ME give the lowest energy, 1.7 kJ/mol above the QM minimum. On the other hand, ELAC+BLAC1 gives the highest energy, 3.3 kJ/mol. Thus, the variation in energies is small, showing that all structures give excellent structures. A MM minimisation with the GAFF force field gives a slightly higher energy, 3.8 kJ/mol (row MM in Table 3).

The third column shows the root-mean-squared deviation (RMSD) of the coordinates from the optimized QM structure of ethanol. ELAC2 gives the lowest RMSD (0.012 Å), followed by the two BLAC1 variants (0.013 Å), as well as Add, Sub and ME (0.014–0.016 Å). The three variants involving BLAC2 give slightly higher values, 0.036–0.039 Å.

Finally, the three last columns give the mean absolute deviation (MAD) for the 8 bonds, the 13 angles and the 12 dihedrals in the molecule. For the bonds, the three BLAC2 variants give minimal errors (0.001–0.002 Å), whereas the other methods give slightly higher errors, 0.005–0.006 Å. The same applies for the angles and the dihedrals, the three BLAC2 variants are still the best (0.8 and 0.4°), whereas the other methods give slightly larger errors, 0.8–1.2° and 0.5–0.7°.

In conclusion, the test calculations show that the BLAC approaches give the best result, but it depends on the force field used. The best structures are obtained with the Hess2FF force field, which is tailored for the molecule and the QM method. ELAC sometimes improves the results, sometimes not. Sub and Add give similar results and in the differences among the various method are minimal for this small test molecule.

### Sulfite Oxidase

Next, we studied a more realistic enzyme system, viz. sulfite oxidase. Based on our recent QM-cluster (Van Severen et al., 2014) and QM/MM (Caldararu et al., 2018) studies, we considered the S → OMo mechanism, in which the S atom of the sulfite substrate attacks the equatorial oxo group of Mo^{VI}, directly forming a Mo^{IV}-sulfate intermediate (Im), via a first transition state TS1 (Figure 2). The sulfate product then dissociates into the second coordination sphere of the Mo ion via a second transition state (TS2).

For all five states, we tested six different methods: Add, Sub, ELAC, BLAC2, BLAC2J, and ME. For all methods, we compare the QM/MM results obtained with the small DMDT model of the molybdopterin ligand with those obtained with a standard (subtractive with VLAC) QM/MM calculation with the full MPT ligand (Figure 2). Initially, we tested also BLAC1, but it failed for all systems. The RS state with ME could be obtained only if the S_{Sub}−O2 distance was fixed at 2.43 Å (taken from the reference structure).

Table 4 shows the RMSD deviations of the small QM system between the MPT and DMDT calculations. It can be seen that all calculations give similar results, with a RMSD of 0.02–0.10 Å. The RMSD is typically lowest for the Im and TS1 states.

**Table 4**. RMS deviations (Å) of the various QM systems for sulfite oxidase, compared to the QM/MM structures optimized with the full MPT ligand.

The smallest RMSD is always found for Add and Sub methods, with an average RMSD of 0.05 Å for the five states. However, ELAC also gives a similar average, whereas that of the other three methods is somewhat larger, 0.06 and 0.08 Å for BLAC and ME, respectively. This increase in the RMSD is not caused by a single structure, but is seen for all structures.

In Table S1, the Mo–ligand and S_{Sub}−O distances for all structures are listed. It can be seen that these key distances are well preserved in the truncated calculations. The best results are again obtained for Add and Sub, for which the average difference for the nine distances and five sets of structures is only 0.008 Å. The maximum deviation is 0.14 Å for Add and 0.16 Å for Sub, in both cases obtained for the non-bonded S_{Sub}–O2 distance in the RS structure. Besides this distance, the largest deviation is only 0.02–0.03 Å. The results for the other four sets of calculations were slightly worse, with an average error of 0.017 Å for ME and 0.009 Å for the other three approaches. The maximum error is 0.14–0.17 Å, again for S_{Sub}–O2 distance in the RS structure, except for ME (Mo–O2 distance of the Im state).

Figure 4 shows the energies (relative to the RS state) in the seven sets of calculations. It can be seen that the Add, Sub, BLAC2, and BLAC2J methods give very similar results. In fact, the two sets of BLAC2 energies differ by only 0.1 kJ/mol for all states and these methods differ by 0–2 kJ/mol from Sub, and slightly more from Add (2 kJ/mol on average). The Add and Sub results agree within 2 kJ/mol. All these curves follow quite closely the reference with a systematic underestimation that increases from 6–8 kJ/mol for TS1 to 11–13 kJ/mol for PS. On average, all methods give a MAD of 10 kJ/mol, lowest for BLAC2J.

**Figure 4**. Relative energies (kJ/mol) of the five states in the sulfite oxidase reaction, obtained with the various QM/MM methods.

On the other hand, ME and ELAC give appreciably larger errors, with MADs of 29 and 32 kJ/mol, respectively and maximum errors of 47 and 65 kJ/mol. For ME, the problem is related to the failure to find the RS state—different restraints for this state may translate the curve upwards and therefore reduce the error, but it always remains worse than the other four methods.

Thus, we can conclude that all methods give reasonable structures for sulfite oxidase, although ME has problem with one of the states. However, ME and ELAC give quite large errors for the energies. In general, Add and Sub seem to give the best (and similar) results.

### Haem Oxygenase

The third test case is haem oxygenase, for which we studied the conversion of oxophlorin (OXF) to verdohaem (Alavi et al., 2017). As is shown in Figure 3, this involves five states and four transition states. It starts from the Fe^{III}-OXF–${\text{O}}_{2}^{-}$ complex in the doublet state (**1**), which has one unpaired electron on each of the three moieties, in analogy with previous studies (Alavi et al., 2017; Gheidi et al., 2017). In the first step of the reaction, the terminal oxygen atom in O_{2} (O2) reacts with the C4B atom of OXF, forming a bridging intermediate (**2**). In the next step, the O–O bond is cleaved, giving intermediate **3**. Next, the O2 atom reacts with another atom in the OXF ring (C1C), forming a four-membered ring in intermediate **4**. Finally, the C4B–CMC and C1C–CMC bonds are cleaved, and CO dissociates, giving rise to verdohaem (**5**). These states are separated by four transition states (**T1–T4**). We study the effect of moving the side chains of the OXF ring from the QM to the MM system.

This test case is challenging for at least two reasons. First, the electronic structure is more complicated than for the other two test cases, with several antiferromagnetically coupled open-shell moieties. Second, the OXF ring, for which we test the effect of truncation, is involved in the reaction. In fact, the C1C and C4B atoms are both only two bonds away from the HL atoms in the truncated OXT model.

As for sulfite oxidase, we tested six different methods with the truncated OXT model, Add, Sub, ELAC, BLAC1, BLAC2J, and ME. In contrast to sulfite oxidase, the simplest BLAC1 approach, with standard GAFF parameters for both OXF and OXT worked well. Considering the similar results for BLAC2 and BLAC2J for sulfite oxidase, we did not test BLAC2.

As for sulfite oxidase, we used the QM/MM calculations (subtractive with VLAC) with the full OXF ligand as the reference and study how the various QM/MM calculations with OXT reproduce these calculations in terms of the RMSD deviation for the entire (truncated) QM system, key distances and energies. The RMSD deviations of the nine different systems are shown in Table 5. It can be seen that most methods give similar results with an average RMSD of 0.06 Å. Add gives the lowest RMSD for most systems, but that of Sub is very similar and sometimes lower. The largest RMSD (0.09–0.10 Å) is typically found for state **3**, for which ME actually gives the best results and the latter method also gives the lowest maximum RMSD.

**Table 5**. RMS deviations (Å) of the various QM systems of haem oxygenase, compared to the QM/MM structures optimized with the full OXF ligand.

However, for the BLAC1 method, the RMSD is slightly larger, with an average of 0.07 Å and a maximum of 0.11 Å (still for **3**). For BLAC2J, the results are even worse (0.08 Å on average and a maximum of 0.13 Å for **T3**). In particular, BLAC2J failed to converge to any reasonable structure for the product (**5**). This most likely reflects that the force fields, especially that of BLAC2J, were determined for the starting structure **1** (note that BLAC2J gives the second-best structure for that state) and was then used unchanged for the other states. This is clearly suboptimal for structures later in the reaction mechanism. Of course, we could have determined new force fields for each intermediate and transition state in the mechanism, but this would have required much large computational and manual effort. Moreover, it would have given problems in the calculated energies, because the force field would be different for every state, making the energies not comparable. Thus, we do not recommend the BLAC approaches if the reaction is within three bonds of the XL atoms.

These trends in the RMSD are reflected also in the individual distances in the complexes. In Table S2, we examine the six Fe–N/O distances, as well as distances involving the reacting O2, C1C, C4B, and CMC atoms. It can be seen that all methods give similar MAD and maximum deviations from the reference distances (0.02–0.03 and 0.29–0.32 Å, respectively). However, Add and Sub still give the best results on average and BLAC2J the worst. All methods give large errors for the Fe–N_{His} distances in the **3, T4** and **4** states (0.26–0.32 Å too short). All methods also give a large error for the O2–C1C distance in state **3** (0.18–0.27 Å too short).

In Figure 5, the relative energies of the various states are shown. It can be seen that the Add and Sub methods still give similar results, with a MAD of only 4 kJ/mol. However, the Sub and BLAC1 methods give even more similar results with a MAD of only 1.4 kJ/mol. This reflects that GAFF parameters for OXT and OXF differ only for a few bonds, angles, and dihedrals around the periphery of the ring, which apparently do not affect the results much. These three methods also reproduce the reference calculations fairly well, with MADs of 14–15 kJ/mol, with Sub giving the smallest error. The maximum error, 24–25 kJ/mol, is obtained for **T2**.

**Figure 5**. Relative energies (kJ/mol) of the five states in the haem oxygenase reaction, obtained with the various QM/MM methods.

BLAC2J gives somewhat worse energies (MAD = 30 kJ/mol, with a maximum error of 47 kJ/mol for both **3** and **4**), especially for the later states in the reaction, reflecting that the force field is worse for these states. As for sulfite oxidase, the results for ELAC and ME are much worse, with MADs of 96–100 kJ/mol and maximum errors of 156–184 kJ/mol.

Finally, we checked also the electronic structures in the various calculations. However, they showed only a small variation between the various methods. For example, for the spin densities on Fe (shown in Table S3), Add, Sub, BLAC1 and BLAC2J give the same MAD from the reference calculations, 0.06 *e*, and the MADs of ELAC and ME are only slightly larger, 0.07 and 0.08 *e*, respectively. The spin density is lower for all calculations with OXT, except for the **1** state and the difference is largest for **3** and **T3** (0.09–0.12 *e*).

In conclusion, Sub and Add again give the best results, but those of BLAC1 are also good. Again, ELAC and ME give large errors in the energies and BLAC2J has problems with geometries of the later states in the reaction.

## Conclusions

In this paper, we have tried to clarify similarities and differences between the subtractive and additive QM/MM schemes. In our view, the primary difference is that the subtractive scheme allows for an attempt to correct for errors introduced by the link atoms. This correction can be introduced for three different types of interactions: van der Waals, electrostatic, and bonded interactions. Different software implements different corrections and it is also partly up to the user to define the level of correction (when setting up the force field for the QM system). For example, our ComQum software automatically implements the van der Waals correction, whereas ONIOM with electrostatic embedding implements also the electrostatic correction. Both software can implement the bonded correction, but typically do not do so.

Of course, the corrections come at some extra cost. For the van der Waals correction (VLAC), the cost is minimal: It requires van der Waals parameters of the HL atoms, which can almost always be taken from standard parameters for hydrogen atoms in the used MM force field (using the rules for atom types of the force field). Beside these parameters, the calculations can be automatically set up from a prmtop file for the full system with no extra effort, as in the ComQum implementation.

For the electrostatic link-atom corrections (ELAC), QM charges of the QM region are required, both with HL and CL atoms. This can be obtained from most QM software at a small extra cost and be automatized. Moreover, such charges are normally already available, because most QM/MM studies start with a MD equilibration of the full system (including solvent), for which a proper charge model of the QM system is needed. However, the charges on the CL atoms are ambiguous and ESP charges of buried atoms are poorly defined, which becomes a serious problem for large QM regions. Therefore, we have not seen any advantage for the electrostatic correction, at least not when the link atoms are rather close to the reactive atoms (Hu et al., 2011b). Moreover, the test calculations in this article indicate that ELAC can give rise to problems with energies in a reaction sequence. ELAC is implemented in the ONIOM software, but in practice it gives often severe convergence problems and is therefore seldom used. Instead, alternative approaches have been implemented, based on iterative calculations with mechanical embedding and updated charges (Kawatsu et al., 2011; Dutta and Mishra, 2014; Wójcik et al., 2014). It could therefore be recommended that ONIOM implements electrostatic embedding with only VLAC, which is a very stable approach in ComQum.

For the bonded link-atom correction (BLAC), an accurate MM force field for the QM region is required. Nowadays, the general MM force field for organic and drug-like molecules often provide the required parameters. However, it is unclear whether these (together with the MM parameters of the full system) are accurate enough to give any advantage of this approach. The alternative is to make a tailored force field for the QM region, both when truncated and in the full protein. In this article, we have tested both approaches. For the simple ethanol test case, the best results were actually obtained with BLAC2, i.e., with the optimized Hess2FF force field. However, for the two enzyme systems, the results were worse, especially if the reactive site is close to the link atoms. Therefore, we cannot recommend BLAC for general use.

Beside the parameters needed for these link-atom corrections, there is no difference in the requirement of MM parameters for the subtractive and additive QM/MM schemes; without any link-atom corrections, the two schemes should give identical results if correctly implemented and require exactly the same MM parameters. However, in practice there may be differences because the subtractive scheme is typically based on a standard (general-purpose) MM software, whereas the additive scheme is based on a software tailored for QM/MM. In particular, most MM software refuse to run if any MM parameters are missing. Therefore, subtractive calculations normally require a full set of MM parameters, also for the QM region. However, these can be dummy (zeroed) parameters, because they cancel in the QM/MM calculations. Moreover, also the additive scheme requires these parameters for an initial MD equilibration of the full solvated system. The same parameters can normally be used throughout a reaction mechanism (again because these MM terms cancel in Equation 4).

Thus, our conclusion is that intrinsically, the subtractive and additive QM/MM schemes are equivalent if properly implemented. The subtractive scheme allows the introduction of various link-atom corrections, at the expense of requiring more MM parameters. For van der Waals and electrostatic corrections, the extra cost is minimal and fully automatic, whereas for the bonded corrections, significant extra effort may be needed. Of course, the same corrections may be implemented also in an additive scheme, by picking proper terms, but this goes outside a standard implementation (and also a strict definition) of the additive scheme and therefore should be thoroughly specified.

In practice, the subtractive scheme is easier to implement and maintain (standard QM and MM software are used). On the other hand, the additive scheme may be somewhat easier to set up and can be tailored for QM/MM calculations. Moreover, if (a major part of) the MM system is fixed, calculations can be somewhat sped up by not calculating the MM energy and forces for the fixed atoms. In our test calculations, the additive and subtractive calculations with VLAC, but no other corrections (i.e., Add and Sub) give closely similar results, showing that the VLAC has only a minor influence on the results. Thus, both approaches can be recommended for QM/MM calculations.

We have also included mechanical embedding (within a subtractive scheme with VLAC) in the comparison. However, this approach gave rather poor structures, especially for sulfite oxidase. Moreover, the energies were quite poor, although this may be partly attributed to the fact that the reference energies employed electrostatic embedding and not mechanical embedding. For a more fair comparison, reference energies obtained with very large QM systems should be used, as in our previous study (Hu et al., 2011b).

Finally, we want to emphasize the importance of specifying exactly what is done in the QM/MM calculations, owing to the many different implementations. Obviously, it is not enough to say that a subtractive or additive scheme is used. Instead, for a subtractive scheme, it must be specified what type of link-atom corrections is applied (van der Waals, electrostatic, or bonded). In addition, the treatment of QM–MM electrostatics (mechanical or electrostatic embedding) must be specified, together with a detailed account of what charges are included in the point-charge model, if any charge redistribution scheme is employed and how charges on CL atoms are obtained. Finally, the treatment of link atoms need to be specified, as well as the relation between the coordinates of the HL and CL atoms.

## Author Contributions

LC: Performed most of the calculations; UR: Did some calculations, designed the project, and wrote the article.

## Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

## Acknowledgments

This investigation has been supported by grants from the Swedish research council (project 2014-5540), from Knut and Alice Wallenberg Foundation (KAW 2013.0022) and from COST through Action CM1305 (ECOSTBio). The computations were performed on computer resources provided by the Swedish National Infrastructure for Computing (SNIC) at Lunarc at Lund University.

## Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fchem.2018.00089/full#supplementary-material

## References

Ahlrichs, R., Bär, M., Häser, M., Horn, H., and Kölmel, C. (1989). Electronic structure calculations on workstation computers: the program system Turbomole. *Chem. Phys. Lett.* 162, 165–169. doi: 10.1016/0009-2614(89)85118-8

Alavi, F. S., Zahedi, M., Safari, N., and Ryde, U. (2017). QM/MM study of the conversion of oxophlorin into verdoheme by heme oxygenase. *J. Phys. Chem. B* 121, 11427–11436. doi: 10.1021/acs.jpcb.7b08332

Balcells, D., and Maseras, F. (2007). Computational approaches to asymmetric synthesis. *New J. Chem.* 31, 333–343. doi: 10.1039/b615528f

Besler, B. H., Merz, K. M., and Kollman, P. A. (1990). Atomic charges derived from semiempirical methods. *J. Comput. Chem.* 11, 431–439. doi: 10.1002/jcc.540110404

Blomberg, M. R. A., Borowski, T., Himo, F., Liao, R.-Z., and Siegbahn, P. E. M. (2014). Quantum chemical studies of mechanisms for metalloenzymes. *Chem. Rev.* 114, 3601–3658. doi: 10.1021/cr400388t

Brunger, A. T. (2007). Version 1.2 of the Crystallography and NMR system. *Nat. Protoc.* 2, 2728–2733. doi: 10.1038/nprot.2007.406

Brunger, A. T., Adams, P. D., Clore, G. M., Delano, W. L., Gros, P., Grosse-Kunstleve, R. W., et al. (1998). Crystallography & NMR system : a new software suite for macromolecular structure determination. *Acta Crystallogr. Sect. D Biol. Crystallogr.* 54, 905–921. doi: 10.1107/S0907444998003254

Caldararu, O., Feldt, M., Cioloboc, D., van Severen, M.-C., Starke, K., Mata, R. A., et al. (2018). QM/MM study of the reaction mechanism of sulfite oxidase. *Sci. Rep*. 8:4684. doi: 10.1038/s41598-018-22751-6

Cao, L., Caldararu, O., Rosenzweig, A. C., and Ryde, U. (2018). Quantum refinement does not support dinuclear copper sites in crystal structures of particulate methane monooxygenase. *Angew. Chem. Int. Ed.* 57, 162–166. doi: 10.1002/anie.201708977

Case, D. A., Berryman, J. T., Betz, R. M., Cerutti, D. S., Cheatham, T. E., Darden, T. A., et al. (2014). *AMBER 14. San Francisco, CA: University of California*. Available online at: http://ambermd.org

Chung, L. W., Sameera, W. M. C., Ramozzi, R., Page, A. J., Hatanaka, M., Petrova, G. P., et al. (2015). The ONIOM Method and its applications. *Chem. Rev.* 115, 5678–5796. doi: 10.1021/cr5004419

Dapprich, I. K., Komáromia, I., Morokuma, M. J., and Frisch, K. S. B. (1999). A New ONIOM Implementation in Gaussian 98. 1. The calculation of energies, gradients and vibrational frequencies and electric field derivatives. *J. Mol. Struct.* 462, 1–21. doi: 10.1016/S0166-1280(98)00475-8

Dutta, D., and Mishra, S. (2014). The structural and energetic aspects of substrate binding and the mechanism of action of the DapE-encoded N-succinyl-l{,}l-diaminopimelic acid desuccinylase (DapE) investigated using a hybrid QM/MM method. *Phys. Chem. Chem. Phys.* 16, 26348–26358. doi: 10.1039/C4CP03986F

Field, M. J., Bash, P. A., and Karplus, M. (1990). A combined quantum mechanical and molecular mechanical potential for molecular dynamics simulations. *J. Comput. Chem.* 11, 700–733. doi: 10.1002/jcc.540110605

Furche, F., Ahlrichs, R., Hättig, C., Klopper, W., Sierka, M., and Weigend, F. (2014). Turbomole. *Wiley Interdiscip. Rev. Comput. Mol. Sci.* 4, 91–100. doi: 10.1002/wcms.1162

Gao, J., Amara, P., Alhambra, C., and Field, M. J. (1998). A Generalized Hybrid Orbital (GHO) method for the treatment of boundary atoms in combined QM/MM calculations. *J. Phys. Chem. A* 102, 4714–4721. doi: 10.1021/jp9809890

Gheidi, M., Safari, N., and Zahedi, M. (2017). Density functional theory studies on the conversion of hydroxyheme to iron-verdoheme in the presence of dioxygen. *Dalt. Trans.* 46, 2146–2158. doi: 10.1039/C6DT04250C

Götz, A. W., Clark, M. A., and Walker, R. C. (2014). An extensible interface for QM/MM molecular dynamics simulations with AMBER. *J. Comput. Chem.* 35, 95–108. doi: 10.1002/jcc.23444

Grimme, S., Antony, J., Ehrlich, S., and Krieg, H. (2010). A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu. *J. Chem. Phys.* 132:154104. doi: 10.1063/1.3382344

Grimme, S., Ehrlich, S., and Goerigk, L. (2011). Effect of the damping function in dispersion corrected density functional theory. *J. Comput. Chem.* 32, 1456–1465. doi: 10.1002/jcc.21759

Hu, L., and Ryde, U. (2011). Comparison of methods to obtain force-field parameters for metal sites. *J. Chem. Theory Comput.* 7, 2452–2463. doi: 10.1021/ct100725a

Hu, L., Farrokhnia, M., Heimdal, J., Shleev, S., Rulíšek, L., and Ryde, U. (2011a). Reorganization energy for internal electron transfer in multicopper oxidases. *J. Phys. Chem. B* 115, 13111–13126. doi: 10.1021/jp205897z

Hu, L., Söderhjelm, P., and Ryde, U. (2011b). On the convergence of QM/MM energies. *J. Chem. Theory Comput.* 7, 761–777. doi: 10.1021/ct100530r

Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W., and Klein, M. L. (1983). Comparison of simple potential functions for simulating liquid water. *J. Chem. Phys.* 79, 926–935. doi: 10.1063/1.445869

Jover, J., and Maseras, F. (2016). QM/MM calculations on selectivity in homogeneous catalysis. *Struct. Bond.* 167, 59–79. doi: 10.1007/430_2015_188

Kawatsu, T., Lundberg, M., and Morokuma, K. (2011). Protein free energy corrections in ONIOM QM:MM modeling: a case study for isopenicillin N synthase (IPNS). *J. Chem. Theory Comput.* 7, 390–401. doi: 10.1021/ct1005592

Keal, T. W., Sherwood, P., Dutta, G., Sokol, A. A., and Catlow, C. R. A. (2011). Characterization of hydrogen dissociation over aluminium-doped zinc oxide using an efficient massively parallel framework for QM/MM calculations. *Proc. R. Soc. Lond. A Math. Phys. Eng. Sci.* 467, 1900–1924. doi: 10.1098/rspa.2010.0613

Levitt, M. (1976). A simplified representation of protein conformations for rapid simulation of protein folding. *J. Mol. Biol.* 104, 59–107. doi: 10.1016/0022-2836(76)90004-8

Lin, H., and Truhlar, D. G. (2007). QM/MM: what have we learned, where are we, and where do we go from here? *Theor. Chem. Acc.* 117, 185–199. doi: 10.1007/s00214-006-0143-z

Lopes, P. E. M., Roux, B., and MacKerell, A. D. (2009). Molecular modeling and dynamics studies with explicit inclusion of electronic polarizability: theory and applications. *Theor. Chem. Acc.* 124, 11–28. doi: 10.1007/s00214-009-0617-x

Maier, J. A., Martinez, C., Kasavajhala, K., Wickstrom, L., Hauser, K. E., and Simmerling, C. (2015). ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB. *J. Chem. Theory Comput.* 11, 3696–3713. doi: 10.1021/acs.jctc.5b00255

Maseras, F., and Morokuma, K. (1995). IMOMM: a new integrated ab initio + molecular mechanics geometry optimization scheme of equilibrium structures and transition states. *J. Comput. Chem.* 16:1170. doi: 10.1002/jcc.540160911

Murphy, R. B., Philipp, D. M., and Friesner, R. A. (2000). A mixed quantum mechanics/molecular mechanics (QM/MM) method for large-scale modeling of chemistry in protein environments. *J. Comput. Chem.* 21, 1442–1457. doi: 10.1002/1096-987X(200012)21:16<1442::AID-JCC3>3.0.CO;2-O

Nilsson, K., Lecerof, D., Sigfridsson, E., and Ryde, U. (2003). An automatic method to generate force-field parameters for hetero-compounds. *Acta Crystallogr. D Biol. Crystallogr.* 59, 274–289. doi: 10.1107/S0907444902021431

Olsen, J. M., Aidas, K., and Kongsted, J. (2010). Excited states in solution through polarizable embedding. *J. Chem. Theory Comput.* 6, 3721–3734. doi: 10.1021/ct1003803

Poulsen, T. D., Kongsted, J., Osted, A., Ogilby, P. R., and Mikkelsen, K. V. (2001). The combined multiconfigurational self-consistent-field/molecular mechanics wave function approach. *J. Chem. Phys.* 115, 2393–2400. doi: 10.1063/1.1374559

Ramos, M. J., and Fernandes, P. A. (2008). Computational enzymatic catalysis. *Acc. Chem. Res.* 41, 689–698. doi: 10.1021/ar7001045

Reuter, N., Dejaegere, A., Maigret, B., and Karplus, M. (2000). Frontier Bonds in QM/MM Methods: a comparison of different approaches. *J. Phys. Chem. A* 104, 1720–1735. doi: 10.1021/jp9924124

Rod, T. H., and Ryde, U. (2005). Accurate QM/MM free energy calculations of enzyme reactions: methylation by catechol O-methyltransferase. *J. Chem. Theory Comput.* 1, 1240–1251. doi: 10.1021/ct0501102

Roßbach, S., and Ochsenfeld, C. (2017). Influence of coupling and embedding schemes on QM size convergence in QM/MM approaches for the example of a proton transfer in DNA. *J. Chem. Theory Comput.* 13, 1102–1107. doi: 10.1021/acs.jctc.6b00727

Ryde, U. (1995). On the role of Glu-68 in alcohol dehydrogenase. *Protein Sci.* 4, 1124–1132. doi: 10.1002/pro.5560040611

Ryde, U. (1996a). The coordination chemistry of the structural zinc ion in alcohol dehydrogenase studied by ab initio quantum chemical calculations. *Eur. Biophys. J.* 24, 213–221. doi: 10.1007/BF00205102

Ryde, U. (1996b). The coordination of the catalytic zinc in alcohol dehydrogenase studied by combined quantum-chemical and molecular mechanics calculations. *J. Comput. Aided. Mol. Des.* 10, 153–164. doi: 10.1007/BF00402823

Ryde, U. (2016). QM/MM calculations on proteins. *Methods Enzymol.* 577, 119–158. doi: 10.1016/bs.mie.2016.05.014

Ryde, U., and Nilsson, K. (2003). Quantum chemistry can locally improve protein crystal structures. *J. Am. Chem. Soc.* 125, 14232–14233. doi: 10.1021/ja0365328

Ryde, U., and Olsson, M. H. M. (2001). Structure, strain, and reorganization energy of blue copper models in the protein. *Int. J. Quantum Chem.* 81, 335–347. doi: 10.1002/1097-461X(2001)81:5<335::AID-QUA1003>.30CO2-Q

Ryde, U., Olsen, L., and Nilsson, K. (2002). Quantum chemical geometry optimizations in proteins using crystallographic raw data. *J. Comput. Chem.* 23, 1058–1070. doi: 10.1002/jcc.10093

Schäfer, A., Horn, H., and Ahlrichs, R. (1992). Fully optimized contracted Gaussian basis sets for atoms Li to Kr. *J. Chem. Phys.* 97, 2571–2577. doi: 10.1063/1.463096

Seminario, J. M. (1996). Calculation of intramolecular force fields from second-derivative tensors. *Int. J. Quantum Chem.* 30, 1271–1277. doi: 10.1002/(SICI)1097-461X(1996)60:7<1271::AID-QUA8>3.0.CO;2-W

Senn, H. M., and Thiel, W. (2009). QM/MM methods for biomolecular systems. *Angew. Chemie* 48, 1198–1229. doi: 10.1002/anie.200802019

Sherwood, P., De Vries, A. H., Guest, M. F., Schreckenbach, G., Catlow, C. R. A., French, S. A., et al. (2003). QUASI: a general purpose implementation of the QM/MM approach and its application to problems in catalysis. *J. Mol. Struct. THEOCHEM* 632, 1–28. doi: 10.1016/S0166-1280(03)00285-9

Sigfridsson, E., and Ryde, U. (1998). Comparison of methods for deriving atomic charges from the electrostatic potential and moments. *J. Comput. Chem.* 19, 377–395. doi: 10.1002/(SICI)1096-987X(199803)19:4<377::AID-JCC1>3.0.CO;2-P

Singh, U. C., and Kollman, P. A. (1986). A combined ab initio quantum mechanical and molecular mechanical method for carrying out simulations on complex molecular systems: applications to the CH3Cl + Cl– exchange reaction and gas phase protonation of polyethers. *J. Comput. Chem.* 7, 718–730. doi: 10.1002/jcc.540070604

Söderhjelm, P., Husberg, C., Strambi, A., Olivucci, M., and Ryde, U. (2009). Protein influence on electronic spectra modeled by multipoles and polarizabilities. *J. Chem. Theory Comput.* 5, 649–658. doi: 10.1021/ct800459t

Sousa, S. F., Ribeiro, A. J. M., Neves, R. P. P., Brás, N. F., Cerqueira, N. M. F. S. A., Fernandes, P. A., et al. (2017). Application of quantum mechanics/molecular mechanics methods in the study of enzymatic reaction mechanisms. *Wiley Interdiscip. Rev. Comput. Mol. Sci.* 7:e1281. doi: 10.1002/wcms.1281

Stoyanov, S. R., Gusarov, S., Kuznicki, S. M., and Kovalenko, A. (2008). Theoretical modeling of zeolite nanoparticle surface acidity for heavy oil upgrading. *J. Phys. Chem. C* 112, 6794–6810. doi: 10.1021/jp075688h

Svensson, M., Humbel, S., Froese, R. D. J., Matsubara, T., Sieber, S., and Morokuma, K. (1996). ONIOM: a multilayered integrated MO + MM method for geometry optimizations and single point energy predictions. A test for diels-alder reactions and Pt(P(t-Bu) 3) 2 + H 2 oxidative addition. *J. Phys. Chem.* 100, 19357–19363. doi: 10.1021/jp962071j

Tao, J., Perdew, J. P., Staroverov, V. N., and Scuseria, G. E. (2003). Climbing the density functional ladder: non-empirical meta-generalized gradient approximation designed for molecules and solids. *Phys. Rev. Lett.* 91:146401. doi: 10.1103/PhysRevLett.91.146401

Théry, V., Rinaldi, D., Rivail, J.-L., Maigret, B., and Ferenczy, G. G. (1994). Quantum mechanical computations on very large molecular systems: the local self-consistent field method. *J. Comput. Chem.* 15, 269–282. doi: 10.1002/jcc.540150303

TURBOMOLE version 7.1 (2016). *University of Karlsruhe and Forschungszentrum Karlsruhe GmbH*. Available online at: http://www.turbomole.com/

Van Severen, M.-C., Andrejić, M., Li, J., Starke, K., Mata, R. A., Nordlander, E., et al. (2014). A quantum-mechanical study of the reaction mechanism of sulfite oxidase. *J. Biol. Inorg. Chem.* 19, 1165–1179. doi: 10.1007/s00775-014-1172-z

Vanommeslaeghe, K., Hatcher, E., Acharya, C., Kundu, S., Zhong, S., Shim, J., et al. (2010). CHARMM general force field: a force field for drug-like molecules compatible with the CHARMM all-atom additive biological force fields. *J. Comput. Chem.* 31, 671–690. doi: 10.1002/jcc.21367

Vreven, T., Byun, K. S., Komáromi, I., Dapprich, S., Montgomery, J. A. Jr, Morokuma, K., et al. (2006). Combining quantum mechanics methods with molecular mechanics methods in ONIOM. *J. Chem. Theory Comput.* 2, 815–826. doi: 10.1021/ct050289g

Wang, J. M., Wolf, R. M., Caldwell, J. W., Kollman, P. A., and Case, D. A. (2004). Development and testing of a general Amber force field. *J. Comput. Chem.* 25, 1157–1174. doi: 10.1002/jcc.20035

Keywords: QM/MM, haem oxygenase, sulfite oxidase, mechanical embedding, electrostatic embedding, additive QM/MM, subtractive QM/MM

Citation: Cao L and Ryde U (2018) On the Difference Between Additive and Subtractive QM/MM Calculations. *Front. Chem*. 6:89. doi: 10.3389/fchem.2018.00089

Received: 31 January 2018; Accepted: 14 March 2018;

Published: 03 April 2018.

Edited by:

Sam P. De Visser, University of Manchester, United KingdomReviewed by:

Albert Poater, University of Girona, SpainJiayun Pang, University of Greenwich, United Kingdom

Copyright © 2018 Cao and Ryde. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ulf Ryde, Ulf.Ryde@teokem.lu.se