# Crystal Structure Prediction of Binary Alloys via Deep Potential

^{1}School of Electronic Science and Applied Physics, Hefei University of Technology, Hefei, China^{2}Yuanpei College of Peking University, Beijing, China^{3}Beijing Institute of Big Data Research, Peking University, Beijing, China^{4}Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, United States^{5}Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Beijing, China

Predicting crystal structure has been a challenging problem in physics and materials science for a long time. A reliable energy calculation engine combined with an efficient global search algorithm, such as particle swarm optimization algorithm or genetic algorithm, is needed to conduct crystal structure prediction. In recent years, machine learning-based interatomic potential energy surface models have been proposed, potentially allowing us to perform crystal structure prediction for systems with the accuracy of density functional theory (DFT) and the speed of empirical force fields. In this paper, we employ a previously developed Deep Potential model to predict the intermetallic compound of the aluminum–magnesium system, and find six meta-stable phases with negative or nearly zero formation energy. In particular, Mg_{12}Al_{8} shows excellent ductility and Mg_{5}Al_{27} has a high Young's modulus. Based on our benchmark results, we propose a relatively robust structure screening criterion that selects potentially stable structures from the Deep Potential-based convex hull and performs DFT refinement. By using this criterion, the computational cost needed to construct the convex hull with *ab initio* accuracy can be dramatically reduced.

## Introduction

In recent years, crystal structure prediction has played an increasingly important role, not only for understanding the ground-state structure of matter, but also for designing materials and drug molecules with desired functionality (Oganov, 2018; He et al., 2019; Zhao et al., 2019; Xie et al., 2020). Generally speaking, a ground-state crystal structure prediction method involves three components: a model that generates the interatomic potential energy surface (PES) and forces, a sampling technique for exploring different conformations in the configuration space, and a relaxation procedure to find the local minima on the PES (Podryabinkin et al., 2019). While the relaxation procedure is relatively standard, different sampling techniques have been championed by different software packages. For instance, the genetic evolutionary algorithm, the particle swarm algorithm, and the firefly algorithm have been widely used in the USPEX software (Glass et al., 2006), the CALYPSO package (Wang et al., 2010), and the PyChemia library (Avendaño-Franco and Romero, 2016), respectively. Due to the high accuracy required by both sampling and relaxation, density functional theory (DFT) (Kohn and Sham, 1965) is typically used for generating the PES. Despite its widespread success, DFT has a high computational cost that typically scales cubically with the system size, which, to some extent, hinders routine applications to large and complex systems.

Many empirical PES models for popular solid-state systems have been proposed (Mendelev et al., 2009; Jelinek et al., 2012; Dickel et al., 2018) to address the efficiency issue of DFT. Due to the relatively simple and analytical expressions adopted by these empirical models, an acceleration of many orders of magnitude in terms of computational cost can be gained, but presumably at the price of accuracy and transferability. As such, trail-and-error processes are typically required for developing such models, yet challenges have remained for systems involving multiple elements, complex and exotic phases, or bond breaking and formation events.

In recent years, a few machine learning (ML) techniques have been proposed for representing the PES (Behler and Parrinello, 2007; Bartók et al., 2010; Artrith and Urban, 2016; Khorshidi and Peterson, 2016; Shapeev, 2016; Han et al., 2018; Zhang et al., 2018a,b). Unlike typically empirical PES models, representations coming from ML tasks, such as kernel functions and neural networks (NNs), have shown great promise to fit high-dimensional functions. When trained on a suitably generated dataset of atomic configurations and corresponding potential energies and forces, a good ML-based PES model can be used with an accuracy of the reference DFT model, and an efficiency comparable to that of empirical PES models. Not surprisingly, ML-based PES models have been employed in recent work for structure search tasks. For instance, boron has been studied by several groups: Podryabinkin et al. (2019) adopted the moment tensor potential (Shapeev, 2016) and the USPEX evolutionary algorithm; Huang et al. (2018) used the Behler–Parrinello potential (Behler and Parrinello, 2007) and the stochastic surface walking global optimization method; Tong et al. (2018) used the Gaussian Approximation Potential (Bartók et al., 2010) and the CALYPSO approach.

In this work, we target at using ML-based PES models for crystal structure prediction of alloys. We adopt the smooth version of the Deep Potential (DP) model (Zhang et al., 2018b), which employs NN architectures to parameterize two networks, the embedding network that defines a list of symmetry-preserving descriptors, and the fitting network that maps these descriptors to local energy contributions. The versatile architecture of DP makes it particularly suitable for multicomponent systems and those involving bond breaking and formation, for which most methodologies are hard to handle. The aluminum–magnesium (Al–Mg) binary alloy system is selected as an example based on the following reasons: First, Al–Mg binary alloys are important in real-life applications. They are widely used in automotive, aerospace, and electronic device industries (Gupta and Ling, 2011) due to their lightweight nature and excellent mechanical properties. However, only a limited number of intermetallic compounds of the Al–Mg binary system have been documented in well-known databases, such as the American Society for Metals (ASM) Alloy Phase Diagram Database^{1}, the Inorganic Crystal Structure Database (ICSD)^{2}, the Open Quantum Materials Database (OQMD)^{3}, and the Material Project database (MP)^{4}. Second, our previous study has established an Al–Mg DP model (Zhang et al., 2019), which describes well the basic physicochemical properties and has been carefully tested. As such, this DP model can be readily used for crystal structure prediction and can be download online^{5}.

Combining the particle swarm optimization (PSO) method and the DP model, potential intermetallic compounds of the Al–Mg system are systematically explored. Compared with a previous study (Zhuang et al., 2017), which only explored the Mg-rich phases, our simulation covers a much wider concentration range. Six new Al–Mg intermetallic compounds (Mg_{12}Al_{8}, Mg_{7}Al_{9}, Mg_{14}Al_{18}, Mg_{6}Al_{10}, Mg_{8}Al_{16}, and Mg_{5}Al_{27}) are found to be meta-stable. The mechanical properties of these new compounds are further investigated. To facilitate future investigations of more complicated tasks, special attention is given to the whole simulation protocol and the selection criterion for further DFT validations. Direct comparisons with popular empirical PES models and DFT show the advantage of DP in terms of both accuracy and efficiency.

## Computational Methods

We adopt the PSO method, as implemented in the CALYPSO package (Wang et al., 2010), to search potentially stable and meta-stable Mg–Al intermetallic structures. PSO is inspired by the choreography of a bird flock and can be seen as a distributed behavior algorithm that performs multidimensional search. In the CALYPSO package, there are three steps for a global structure prediction task. First, a group of structures called population is generated randomly with symmetric constraints to allow a diverse sampling of the PES. The number of structures employed here is defined by a parameter called population size (PopSize). Second, a local relaxation of the population is performed based on a PES engine, which is typically a DFT model, and here we replace it with a DP model. A procedure that eliminates similar structures by using the so-called bond characterization matrix is followed up to enhance the search efficiency. Third, a certain number of new structures (the best 60% of the population size) are generated by PSO. Within the PSO scheme, a velocity vector associated with each structure is updated using the information of the previously proposed and optimized structure, as well as the globally best structure, that is, the structure with the lowest enthalpy, at the current generation. The new structures are generated based on the current structures and the velocity vectors. The last two steps continue iteratively until the predefined largest number of generations (GenNumb) is reached. The parameter GenNumb is typically selected to be large enough so that the structure with the lowest energy can sustain for several generations. Generally speaking, the more atoms (Natom) in a structure, the larger PopSize and GenNumb are required. We refer to Yanchao Wang and Ma (2012) for more details of the CALYPSO code. We set the CALYPSO parameters according to the following criteria:

The DP model (Zhang et al., 2018b) used here employs NN functions to represent the PES. In short, the total energy of a system is described as the sum of atomic energies,

where ϵ_{i} is the *i*th atomic energy. The atomic energy is represented as

where *N*_{ω(i)} is called a sub-network that computes the atomic contribution to the total energy, and ω(*i*), which depends on the chemical species of atom *i*, denotes the weights used to parameterize the sub-network. The neighbors of atom *i* within the cut-off radius *R*_{c} are denoted by ${N}(i)$. *R*_{ij} is the position of atom *j* relative to *i* used to describe the local environment of atom *i*. To generate uniformly accurate DP models in a way that minimizes both human intervention and the computational cost for data generation and model training, a concurrent learning strategy called the Deep Potential GENerator (DP-GEN) (Zhang et al., 2019) has been proposed. In this strategy, an initial dataset (random Al–Mg alloy structures) labeled by DFT calculations is used to train an ensemble of DP models, and molecular dynamics is driven by one of the DP models to sample the configuration space. An error indicator serves to select a small fraction out of the new samples as candidates, which are labeled with *ab initio* energies and forces and added to the database. Such iterations are repeated until the configuration space has been explored sufficiently, and a decent DP model has been obtained with high accuracy and transferability. The training is performed using the DeePMD-kit package (Wang et al., 2018) and the concurrent learning strategy is realized by the DP-GEN software package (Zhang et al., 2020). In details, the DeepPot-SE model is used with a cutoff radius of 8.0 Å. The size of the embedding and fitting NNs are 25 × 50 × 100 and 240 × 240 × 240, respectively. During the training, the learning rate decreases exponentially with respect to the starting value of 0.0005. The decay rate and decay step are set to 0.95 and 128,000, respectively. In addition, the prefactors of loss functions are set to ${p}_{e}^{start}=0.02,{p}_{e}^{limit}=2,{p}_{f}^{start}=1000$, ${p}_{f}^{limit}=1,{p}_{v}^{start}=0.0,{p}_{v}^{limit}=0.0$. Both DeePMD-kit and DP-GEN are publicly available online^{6}. For more details, we refer the reader to Wang et al. (2018) and Zhang et al. (2019, 2020).

All DFT calculations are carried out with the Vienna Ab-Initio Simulation Package (VASP, version 5.4.4) (Kresse and Furthmüller, 1996). The generalized gradient approximation within the Perdew–Burke–Ernzerhof (Perdew et al., 1996) (PBE) functional is used to model the exchange-correlation energy. The plane wave basis sets with kinetic energy cutoff of 600 eV are used to expand the valence electron wave functions. For all structural relaxations, the convergence criterion for the energy in electronic SCF iterations and the Hellmann–Feynman forces in ionic step iterations are set to 1.0 × 10^{−6} eV and 1.0 × 10^{−2} eV/Å, respectively. The Brillouin zone is represented by Monkhorst–Pack (Pack and Monkhorst, 1977) special k-point mesh with a grid spacing of 0.08 Å^{−1}. The phonon spectra are obtained based on finite-difference method as implemented in the Atomic Simulation Environment (ASE) (Bahn and Jacobsen, 2002; Larsen et al., 2017) software, where the forces are calculated by the python interface of DeePMD-kit. To calculate the phonon density of states, the q-point mesh is set to 20 × 20 × 20. The local structure relaxation is carried out by the LAMMPS package (Plimpton, 1995), and the DP model used here has been reported and extensively tested in Zhang et al. (2019).

All structure data and convex hulls are analyzed by pymatgen software (Ong et al., 2013).

## Results and Discussions

To demonstrate the validity of CALYPSO+DP scheme, we perform some preliminary tests for several different stoichiometric proportions. Here, we take Mg_{12}Al_{8} as an example to show the evolution of the energies of all the structures (Figure 1A), as well as the lowest energy (Figure 1C), during the CALYPSO structure search process. According to the energy histogram in Figure 1B, it can be found that there are about 458 structures, out of a total number of 3,200 structures, within an energy range <20 meV/atom (compared with the ground state structure). The evolution of the lowest energy for all generations shows that the global optimization converged quickly. It is remarkable to find that one potentially stable structures can be readily obtained in the first few generations (labeled by red star in Figure 1A). When refined by DFT, it shows that this structure is the ground state structure of the corresponding combination. In the tests of some other stoichiometric proportions, that is, Mg_{3}Al_{3}, Mg_{2}Al_{2}, Mg_{1}Al_{1}, and Mg_{3}Al, the corresponding known structures in the materials project database, that is, mp-1038779, mp-1094987, mp-1038934, and the *L*1_{2} phase (Mendelev et al., 2009), are found by the CALYPSO+DP scheme, which further confirms the validity of our approach.

**Figure 1**. Schematic illustration of the DP+CALYPSO process for the Mg_{12}Al_{8} system. **(A)** Evolution of the energies of all structures during the structure prediction process. The red star indicates the global minimal structure. **(B)** Energy histogram. **(C)** Evolution of the lowest energy during the structure prediction process.

Next, we use the DP+CALYPSO scheme to construct the convex hull of the Al-Mg system systematically. We restrict the number of atom in the supercell to <32 atoms. In other words, we consider the systems Mg_{x}Al_{y}{*x* + *y* ≤ 32, *x* ≥ 1, *y* ≥ 1, *x, y* ∈ ℤ}, which represent 496 combinations, or 323 concentrations, in total. According to these parameter settings, the total number of local relaxations is up to 2 × 10^{7}. In the following, we consider three prerequisites that should be satisfied for a stable crystal structure: (i) thermodynamic stability, which is estimated by the formation energy and convex hull; (ii) dynamic stability, which can be assessed by phonon dispersions; and (iii) mechanical stability, which is evaluated via elastic constants (Xu et al., 2019).

According to the preliminary tests, we notice that although the DP model can generate energies and forces that are very close to the DFT reference model, small intrinsic error still exists. Therefore, if our goal is to calculate properties with the accuracy of the DFT-based PES landscape, an additional refinement step has to be adopted based on structures selected from a DP+CALYPSO process. Two concepts we pay particular attention to are the formation energy (*E*_{fa}) and the energy above convex hull (*E*_{abh}). The formation energy (Haastrup et al., 2018) of an alloy system is the energy required to produce the system from the most stable crystal structures of the individual components, which is defined as

where *E*(Mg_{x}Al_{y}) is the total energy of the material Mg_{x}Al_{y}, and *E*(Mg) and *E*(Al) are the average energies of the elements Mg and Al in their stable crystal at 0 K. *E*_{abh} measures the energy for a material to decompose into the set of most stable materials with the same chemical composition. A positive *E*_{abh} indicates that this material is unstable with respect to such decomposition. A zero *E*_{abh} indicates that this is the most stable material at its composition. To accurately determine these properties, we use two criteria for an additional DFT refinement: *E*_{fa} being less than 20 meV/atom, and *E*_{abh} being <20 meV/atom. We use these two criteria at the same time based on the following reasons. Our goal is to find potential stable or meta-stable structures. Due to the error of the DP model, some structures with positive DP-predicted *E*_{fa} may turn negative if we refine it by DFT and vice versa. At the same time, the thermodynamic stability is also controlled by *E*_{abh}. If *E*_{abh} is too high, this structure will decompose into other phases even if this structure has a negative formation energy. In the tested example, we will show that since in general DP exhibits a ~2 meV/atom average error. Compared with DFT, the criteria used here are fairly robust. In contrast, for a previously established empirical model, due to its large intrinsic error, the procedure introduced above is no longer practical, since the number of DFT refinements is so large that little efficiency can be gained.

As shown in Figure 2A, we first use the formation energy *E*_{fa} to screen the candidate structures, from which the number of structures is significantly reduced from 2 × 10^{7} to 5,169. Based on these 5,169 DP-optimized structures, a crude convex hull is constructed. Then the criterion for *E*_{abh} is introduced to remove the thermodynamically unstable structures, and, finally, 1,495 structures are selected for further DFT refinements. As shown in the inset of Figure 2B, for all DFT refined structures, the energies per atom calculated by DP and DFT are in excellent agreement. The largest and root mean square error (RMSE) of the total energy per atom are about 17 and 2 meV/atom, respectively. As for the formation energy in Figure 2B, the largest error and RMSE are about 15 and 6 meV/atom, respectively. Among them, the errors of the formation energies of experimentally stable phases (Zhuang et al., 2017) Mg_{17}Al_{12} and Mg_{23}Al_{30} (labeled by pentagon in Figure 2C) are 15 and 6 meV/atom, which confirms the validity of our 20 meV/atom selection criteria.

**Figure 2. (A)** The funnel used for screening candidate structures of aluminum–magnesium (Al–Mg), the intermetallic compound via Deep Potential (DP) model. *E*_{fa} stands for formation energy and *E*_{abh} labels energy above the convex hull. **(B)** Scatter plot of the formation energy calculated by density functional theory (DFT) and DP for potentially candidate structures. The inset shows relationship between average energy calculated by DFT and DP. **(C)** The formation energy as a function of molar fraction of Al atom for different Mg–Al phases where solid line denotes the convex hull constructed by DFT results. The formation energies calculated by DFT are marked by blue circle and DP ones are marked by red diamond. All of known structures from materials project are re-optimized and directly used to construct the convex hull. Pentagon indicates the stable experimental phases and star labels new stable phase.

The convex hull based on DFT results is then constructed and presented in Figure 2C, including two experimentally stable structures Mg_{17}Al_{12} and Mg_{23}Al_{30} labeled by green pentagon. One new stable structure with a formula of MgAl_{29} is discovered and denoted by red star. Based on the DFT-refined convex hull, we look for new structures that are potentially synthesizable by experiments. We use the following criteria: *E*_{abh} <20 meV/atom and *E*_{fa} <1 meV/atom, where *E*_{abh} and *E*_{fa} are DFT-calculated values, and obtain 31 potentially candidates, including 1 stable structure and 30 meta-stable structures. However, we may note that most of those newly proposed meta-stable structures locate at the boundary region of the concentration range. That is to say, most of these structures have very low concentration of Al or Mg. Compared with these phases, the phases in the middle region are of more interesting, from which we propose six new meta-stable structures, namely Mg_{12}Al_{8}, Mg_{7}Al_{9}, Mg_{14}Al_{18}, Mg_{6}Al_{10}, Mg_{8}Al_{16}, and Mg_{5}Al_{27}. The corresponding side view of the crystal structures are shown in Figures 3A–F and the geometric structure parameters are listed in Table 1.

**Figure 3. (A–F)** The side view of conventional crystal structures of Mg_{12}Al_{8}, Mg_{7}Al_{9}, Mg_{14}Al_{18}, Mg_{6}Al_{10}, Mg_{8}Al_{16}, and Mg_{5}Al_{27} is shown, respectively. The Mg and Al atoms are indicated by yellow and blue ball.

**Table 1**. Lattice parameters a(Å), b(Å), c(Å), density ρ (g/cm^{3}), space group symbol *S*_{m}, lattice type, formation energy *E*_{fa} (meV/atom), and energy above the convex hull *E*_{abh}(meV/atom) of Mg_{12}Al_{8}, Mg_{7}Al_{9}, Mg_{14}Al_{18}, Mg_{6}Al_{10}, Mg_{8}Al_{16}, and Mg_{5}Al_{27} calculated by DFT.

As listed in Table 1, the meta-stable structures can be divided into two groups according to their lattice types. Mg_{7}Al_{9}, Mg_{14}Al_{18}, and Mg_{6}Al_{10} have a tetragonal lattice, whereas Mg_{12}Al_{8}, Mg_{8}Al_{16}, and Mg_{5}Al_{27} have a cubic lattice. Moreover, all structures have nearly zero or negative formation and small energy above convex hull, which indicates that these structures may be synthesizable by experiments in future.

Given the encouraging stability metrics above, we proceed to study the dynamic and mechanical stability of these newly proposed intermetallic compounds via DP model. As shown in Figure 4, the phonon structures show no imaginary frequency, which indicates that all of those intermetallic compounds are dynamically stable. As for the mechanical aspect, the elastic stability conditions (Mouhat and Coudert, 2014) for cubic crystals are given as:

and those for tetragonal crystals are given as:

According to Table 2, both groups of structures meet the elastic stability conditions, which indicates that these 6 intermetallic compounds are mechanically stable. Further, we use the Pugh's ratio *B*_{v}/*G*_{v} to assess the expected average ductility (Pugh, 1954). According to Pugh, a larger *B*_{v}/*G*_{v} value implies a better ductility property. As shown in Table 2, the Pugh's ratio of both Mg_{8}Al_{16} and Mg_{12}Al_{8} are larger than that of hcp Mg (2.08). In particular, the Pugh's ratio of Mg_{12}Al_{8} (2.42) is comparable to that of Al (2.47), so it may have excellent ductility. Considering its lower density (2.10 g/cm^{3}) compared with Al (2.72 g/cm^{3}), this intermetallic compound may have potential applications in automotive, aerospace, electronic, and device industries if it can be synthesized. In addition, the Mg_{5}Al_{27} has a higher Young's modulus among all these structures, which indicates that this material may be applied to manufacture high strength devices.

**Figure 4. (A–F)** The phonon structures of Mg_{12}Al_{8}, Mg_{7}Al_{9}, Mg_{14}Al_{18}, Mg_{6}Al_{10}, Mg_{8}Al_{16}, and Mg_{5}Al_{27} calculated by Deep Potential (DP) model are shown.

**Table 2**. Elastic constants, bulk modulus *B*_{v} (GPa), shear modulus *G*_{v} (GPa), Young's modulus *E*_{v} (GPa), Pugh's ratio (*B*_{v}/*G*_{v}), and Poisson's ratio ν of Mg_{12}Al_{8}, Mg_{7}Al_{9}, Mg_{14}Al_{18}, Mg_{6}Al_{10}, Mg_{8}Al_{16} and Mg_{5}Al_{27} calculated by the DP model.

Finally, we test the accuracy of a recent version of the MEAM potential (Jelinek et al., 2012) and see whether it can be used for a similar task or not. For a direct comparison, we test it on all DFT-refined structures and report the results in Figure 5. As shown by the red diamonds and green pentagons in Figure 5A, MEAM exhibits much larger errors compared with DP for most of the structures, and there are MEAM predictions outside the range of the plot (±100 meV/atom) due to large errors. MEAM results show a large spread on the convex hull plot constructed by DFT results (Figure 5B). For the tested structures, the largest error of *E*_{fa} is 204 meV/atom, and the RMSE is 44 meV/atom. Moreover, the largest error of per-atom total energy is 465 meV/atom, and the mean error is 236 meV/atom. As such, if we use MEAM+CALYPSO to do a screening of the structures, the selection criteria for further DFT refinement would be *E*_{fa} and *E*_{abh} larger than at least 200 meV/atom. As a rough estimation, according to these criteria, ~1 × 10^{6} structures will need to be refined by DFT, which is not computationally feasible at all. Above all, we conclude that although MEAM is very efficient, it cannot be used to improve the efficiency of constructing a convex hull at the level of DFT accuracy.

**Figure 5. (A)** Scatter plot of the formation energy calculated by density functional theory (DFT) and Deep Potential (DP) for potentially candidate structures. **(B)** The formation energy as a function of molar fraction of Al atom for different Mg–Al phases where solid line denotes the convex hull constructed by DFT results. The formation energies calculated by DFT, DP, and MEAM are marked by blue circle, red diamond, and green pentagon, respectively.

To demonstrate the efficiency of our DP-based procedure, we use two groups of structures to compare the time performance. One group is composed of several Mg_{31}Al structures and the other group consists of MgAl_{31} structures. The test results shows that, compared with DFT relaxation, DP has an average speed-up ratio about 3,700 and 650 for Mg_{31}Al and MgAl_{31}, respectively, which indicates DP has better time scaling and can be used for larger systems. All tests are performed on Intel(R) Xeon(R) Gold 6248 CPU @ 2.50 GHz.

## Conclusions

In this paper, we demonstrate that the DP+CALYPSO scheme is reliable for crystal structure prediction for binary alloy system in a wide concentration range. As a concrete example, we use this scheme to predict potentially stable intermetallic compounds of the Al–Mg binary system. Six new meta-stable Al–Mg intermetallic compounds are successfully predicted, including Mg_{12}Al_{8}, Mg_{7}Al_{9}, Mg_{14}Al_{18}, Mg_{6}Al_{10}, Mg_{8}Al_{16}, and Mg_{5}Al_{27}. All the meta-stable structures are predicted to have thermodynamic stability, dynamic stability, and mechanical stability. In particular, Mg_{12}Al_{8} shows excellent ductility and Mg_{5}Al_{27} has high Young's modulus. We remark that the exploration strategy proposed in this work can be combined with the DP-GEN protocol (Zhang et al., 2019, 2020) to generate more training data and improve the DP potential. Moreover, to serve a larger community, DeePMD-kit can be interfaced with other popular general-purpose crystal structure prediction software such as CALYPSO, USPEX, and Pychemia. However, some disadvantages also exits for current scheme, such as expensive cost for training a model, low interface efficiency with CALYPSO, and so on, which limits its application to search complex multicomponent systems with larger number of atoms. We will leave these problems in our future work.

## Data Availability Statement

All datasets presented in this study are included in the article/Supplementary Material.

## Author Contributions

HaiW, HanW, and LZ conceived the idea. HaiW performed the calculations and wrote the script. All authors revised the manuscript.

## Funding

The work of HaiW was supported by the Fundamental Research Funds for the Central Universities. The work of LZ was supported in part by the Center Chemistry in Solution and at Interfaces (CSI) funded by the DOE Award DE-SC0019394. The work of HanW was supported by the National Science Foundation of China under Grant No. 11871110, the National Key Research and Development Program of China under Grants Nos. 2016YFB0201200 and 2016YFB0201203, and Beijing Academy of Artificial Intelligence (BAAI).

## Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

## Acknowledgments

We are grateful for computing resources provided by the High-performance Computing Platform of Peking University, and the Beijing Institute of Big Data Research, University of Science and Technology of China.

## Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fchem.2020.589795/full#supplementary-material

## Footnotes

1. ^ASM https://www.asminternational.org/phase-diagrams

2. ^ICSD https://icsd.fiz-karlsruhe.de/search/basic.xhtml

3. ^OQMD http://www.oqmd.org

4. ^MP https://www.materialsproject.org

5. ^deepmd http://www.deepmd.org

## References

Artrith, N., and Urban, A. (2016). An implementation of artificial neural-network potentials for atomistic materials simulations: performance for *TiO*_{2}. *Comput. Mater. Sci*. 114, 135–150. doi: 10.1016/j.commatsci.2015.11.047

Avenda no-Franco, G., and Romero, A. H. (2016). Firefly algorithm for structural search. *J. Chem. Theory Comput*. 12, 3416–3428. doi: 10.1021/acs.jctc.5b01157

Bahn, S. R., and Jacobsen, K. W. (2002). An object-oriented scripting interface to a legacy electronic structure code. *Comput. Sci. Eng*. 4, 56–66. doi: 10.1109/5992.998641

Bartók, A. P., Payne, M. C., Kondor, R., and Csányi, G. (2010). Gaussian approximation potentials: the accuracy of quantum mechanics, without the electrons. *Phys. Rev. Lett*. 104:136403. doi: 10.1103/PhysRevLett.104.136403

Behler, J., and Parrinello, M. (2007). Generalized neural-network representation of high-dimensional potential-energy surfaces. *Phys. Rev. Lett*. 98:146401. doi: 10.1103/PhysRevLett.98.146401

Dickel, D. E., Baskes, M. I., Aslam, I., and Barrett, C. D. (2018). New interatomic potential for Mg-Al-Zn alloys with specific application to dilute Mg-based alloys. *Model. Simul. Mater. Sci. Eng*. 26:045010. doi: 10.1088/1361-651X/aabaad

Glass, C. W., Oganov, A. R., and Hansen, N. (2006). USPEX-Evolutionary crystal structure prediction. *Comput. Phys. Commun*. 175, 713–720. doi: 10.1016/j.cpc.2006.07.020

Gupta, M., and Ling, S. N. M. (2011). *Magnesium, Magnesium Alloys, and Magnesium Composites*. Singapore: John Wiley & Sons. doi: 10.1002/9780470905098

Haastrup, S., Strange, M., Pandey, M., Deilmann, T., Schmidt, P. S., Hinsche, N. F., et al. (2018). The Computational 2D Materials Database: high-throughput modeling and discovery of atomically thin crystals. *2D Mater*. 5:042002. doi: 10.1088/2053-1583/aacfc1

Han, J., Zhang, L., Car, R., and E, W. (2018). Deep potential: a general representation of a many-body potential energy surface. *Commun. Comput. Phys*. 23, 629–639. doi: 10.4208/cicp.OA-2017-0213

He, J., Xia, Y., Naghavi, S. S., Ozolins, V., and Wolverton, C. (2019). Designing chemical analogs to PbTe with intrinsic high band degeneracy and low lattice thermal conductivity. *Nat. Commun*. 10, 1–8. doi: 10.1038/s41467-019-08542-1

Huang, S. D., Shang, C., Kang, P. L., and Liu, Z. P. (2018). Atomic structure of boron resolved using machine learning and global sampling. *Chem. Sci*. 9, 8644–8655. doi: 10.1039/C8SC03427C

Jelinek, B., Groh, S., Horstemeyer, M. F., Houze, J., Kim, S. G., Wagner, G. J., et al. (2012). Modified embedded atom method potential for Al, Si, Mg, Cu, and Fe alloys. *Phys. Rev. B* 85:245102. doi: 10.1103/PhysRevB.85.245102

Khorshidi, A., and Peterson, A. A. (2016). AMP: a modular approach to machine learning in atomistic simulations. *Comput. Phys. Commun*. 207, 310–324. doi: 10.1016/j.cpc.2016.05.010

Kohn, W., and Sham, L. J. (1965). Self-consistent equations including exchange and correlation effects. *Phys. Rev*. 140:A1133. doi: 10.1103/PhysRev.140.A1133

Kresse, G., and Furthmüller, J. (1996). Efficiency of *ab-initio* total energy calculations for metals and semiconductors using a plane-wave basis set. *Phys. Rev. B* 6, 15–50. doi: 10.1016/0927-0256(96)00008-0

Larsen, A. H., Mortensen, J. J., Blomqvist, J., Castelli, I. E., Christensen, R., Dułak, M., et al. (2017). The atomic simulation environment-a python library for working with atoms. *J. Phys*. 29:273002. doi: 10.1088/1361-648X/aa680e

Mendelev, M. I., Asta, M., Rahman, M. J., and Hoyt, J. J. (2009). Development of interatomic potentials appropriate for simulation of solid-liquid interface properties in Al-Mg alloys. *Philos. Mag*. 89, 3269–3285. doi: 10.1080/14786430903260727

Mouhat, F., and Coudert, F. X. (2014). Necessary and sufficient elastic stability conditions in various crystal systems. *Phys. Rev. B* 90:224104. doi: 10.1103/PhysRevB.90.224104

Oganov, A. R. (2018). Crystal structure prediction: reflections on present status and challenges. *Faraday Discuss*. 211, 643–660. doi: 10.1039/C8FD90033G

Ong, S. P., Richards, W. D., Jain, A., Hautier, G., Kocher, M., Cholia, S., et al. (2013). Python Materials Genomics (pymatgen): a robust, open-source python library for materials analysis. *Comput. Mater. Sci*. 68, 314–319. doi: 10.1016/j.commatsci.2012.10.028

Pack, J. D., and Monkhorst, H. J. (1977). Special points for Brillouin-zone integrations. *Phys. Rev. B* 16, 1748–1749. doi: 10.1103/PhysRevB.16.1748

Perdew, J. P., Burke, K., and Ernzerhof, M. (1996). Generalized gradient approximation made simple. *Phys. Rev. Lett*. 77, 3865–3868. doi: 10.1103/PhysRevLett.77.3865

Plimpton, S. (1995). Fast parallel algorithms for Short-Range Molecular Dynamics. *J. Comput. Phys*. 117, 1–19. doi: 10.1006/jcph.1995.1039

Podryabinkin, E. V., Tikhonov, E. V., Shapeev, A. V., and Oganov, A. R. (2019). Accelerating crystal structure prediction by machine-learning interatomic potentials with active learning. *Phys. Rev. B* 99:064114. doi: 10.1103/PhysRevB.99.064114

Pugh, S. (1954). XCII. Relations between the elastic moduli and the plastic properties of polycrystalline pure metals. *Lond. Edinburgh Dubl. Philos. Mag. J. Sci*. 45, 823–843. doi: 10.1080/14786440808520496

Shapeev, A. V. (2016). Moment tensor potentials: a class of systematically improvable interatomic potentials. *Multisc. Model. Simul*. 14, 1153–1173. doi: 10.1137/15M1054183

Tong, Q., Xue, L., Lv, J., Wang, Y., and Ma, Y. (2018). Accelerating CALYPSO structure prediction by data-driven learning of a potential energy surface. *Faraday Discuss*. 211, 31–43. doi: 10.1039/C8FD00055G

Wang, H., Zhang, L., Han, J., and E, W. (2018). DeePMD-kit: a deep learning package for many-body potential energy representation and molecular dynamics. *Comput. Phys. Commun*. 228, 178–184. doi: 10.1016/j.cpc.2018.03.016

Wang, Y., Lv, J., Zhu, L., and Ma, Y. (2010). Crystal structure prediction via particle-swarm optimization. *Phys. Rev. B* 82:094116. doi: 10.1103/PhysRevB.82.094116

Wang, Y., Lv, J., Zhu, L., and Ma, Y. (2012). CALYPSO: A method for crystal structure prediction. *Comput. Phys. Commun*. 183, 2063–2070. doi: 10.1016/j.cpc.2012.05.008

Xie, H., Zhang, W., Duan, D., Huang, X., Huang, Y., Song, H., et al. (2020). Superconducting zirconium polyhydrides at moderate pressures. *J. Phys. Chem. Lett*. 11, 646–651. doi: 10.1021/acs.jpclett.9b03632

Xu, W. W., Yu, Q., Xia, F., Yin, G., Huang, Y., and Chen, L. (2019). Exploration of crystal structure and the origin of unexpected intrinsic ductility of χ*Co*_{7}*Ta*_{2}. *J. Alloys Compounds* 797, 1198–1204. doi: 10.1016/j.jallcom.2019.05.223

Zhang, L., Han, J., Wang, H., Car, R., and E, W. (2018a). Deep potential molecular dynamics: a scalable model with the accuracy of quantum mechanics. *Phys. Rev. Lett*. 120:143001. doi: 10.1103/PhysRevLett.120.143001

Zhang, L., Han, J., Wang, H., Saidi, W., Car, R., and E, W. (2018b). End-to-end symmetry preserving inter-atomic potential energy model for finite and extended systems. *Adv. Neural Inform. Process. Syst*. 31, 4441–4451.

Zhang, L., Lin, D.-Y., Wang, H., Car, R., and E, W. (2019). Active learning of uniformly accurate interatomic potentials for materials simulation. *Phys. Rev. Mater*. 3:023804. doi: 10.1103/PhysRevMaterials.3.023804

Zhang, Y., Wang, H., Chen, W., Zeng, J., Zhang, L., Wang, H., et al. (2020). DP-GEN: a concurrent learning platform for the generation of reliable deep learning based potential energy models. *Comput. Phys. Commun*. 253:107206. doi: 10.1016/j.cpc.2020.107206

Zhao, Z., Zhang, S., Yu, T., Xu, H., Bergara, A., and Yang, G. (2019). Predicted pressure-induced superconducting transition in electride Li6 P. *Phys. Rev. Lett*. 122:97002. doi: 10.1103/PhysRevLett.122.097002

Keywords: many-body potential energy, deep learning, crystal structure prediction, Al-Mg, alloy

Citation: Wang H, Zhang Y, Zhang L and Wang H (2020) Crystal Structure Prediction of Binary Alloys via Deep Potential. *Front. Chem.* 8:589795. doi: 10.3389/fchem.2020.589795

Received: 31 July 2020; Accepted: 28 August 2020;

Published: 26 November 2020.

Edited by:

Wei Hu, Lawrence Berkeley National Laboratory, United StatesReviewed by:

Honghui Shang, Chinese Academy of Sciences (CAS), ChinaZp Hu, Nankai University, China

Copyright © 2020 Wang, Zhang, Zhang and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Linfeng Zhang, linfengz@princeton.edu; Han Wang, wang_han@iapcm.ac.cn