<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mater.</journal-id>
<journal-title>Frontiers in Materials</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mater.</abbrev-journal-title>
<issn pub-type="epub">2296-8016</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">679269</article-id>
<article-id pub-id-type="doi">10.3389/fmats.2021.679269</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Materials</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Data-Driven Discovery of 2D Materials for Solar Water Splitting</article-title>
<alt-title alt-title-type="left-running-head">Agarwal et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Inverse Design of 2D Photocatalysts</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Agarwal</surname>
<given-names>Abhishek</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1372856/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Goverapet Srinivasan</surname>
<given-names>Sriram</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1156459/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Rai</surname>
<given-names>Beena</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1468743/overview"/>
</contrib>
</contrib-group>
<aff>TCS Research, Tata Consultancy Services Ltd., <addr-line>Pune</addr-line>, <country>India</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/140188/overview">H&#xe9;lio Anderson Duarte</ext-link>, Federal University of Minas Gerais, Brazil</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/155755/overview">Kesong Yang</ext-link>, University of California, San Diego, United&#x20;States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/171691/overview">Tim Kowalczyk</ext-link>, Western Washington University, United&#x20;States</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Sriram Goverapet Srinivasan, <email>s.goverapet@tcs.com</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Computational Materials Science, a section of the journal Frontiers in Materials</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>16</day>
<month>09</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>8</volume>
<elocation-id>679269</elocation-id>
<history>
<date date-type="received">
<day>23</day>
<month>03</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>12</day>
<month>07</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Agarwal, Goverapet Srinivasan and Rai.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Agarwal, Goverapet Srinivasan and Rai</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>A hydrogen economy, wherein hydrogen is used as the fuel in the transport and energy sectors, holds significant promise for mitigating the deleterious effects of global warming. Photocatalytic water splitting using sunlight is perhaps the cleanest way of producing hydrogen fuel. Among other factors, widespread adoption of this technology has mainly been stymied by the lack of a high-efficiency catalyst material. 2D materials have shown significant promise as efficient photocatalysts for water splitting. The availability of open databases containing the &#x201c;computed&#x201d; properties of 2D materials, together with advancements in deep learning, now enables the &#x201c;inverse&#x201d; design of 2D photocatalysts for water splitting. We use one such database (Jain et&#x20;al., ACS Energy Lett. 2019, 4, 6, 1410&#x2013;1411) to build a generative model for the discovery of novel 2D photocatalysts. The structures of the materials were converted into a 3D image&#x2013;based representation that was used to train a cell autoencoder, a basis autoencoder, and a segmentation network so that the lattice parameters and the positions of atoms could be ascertained from the images. Subsequently, the cell and basis encodings were used to train a conditional variational autoencoder (CVAE) to learn a continuous representation of the materials in a latent space. The latent space of the CVAE was then sampled to generate several new 2D materials that were likely to be efficient photocatalysts for water splitting. The bandgap of the generated materials was predicted using a graph neural network model, while the band edge positions were obtained <italic>via</italic> empirical correlations. Although our generative modeling framework was used to discover novel 2D photocatalysts for the water splitting reaction, it is generic in nature and can be used directly to discover novel materials for other applications as&#x20;well.</p>
</abstract>
<kwd-group>
<kwd>generative modeling</kwd>
<kwd>variational autoencoder</kwd>
<kwd>inverse design</kwd>
<kwd>photocatalysts</kwd>
<kwd>water splitting reaction</kwd>
</kwd-group>
<contract-sponsor id="cn001">Tata Consultancy Services<named-content content-type="fundref-id">10.13039/100012913</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Hydrogen as an alternate fuel and energy carrier has the potential to substantially mitigate carbon emissions for a green and sustainable future (<xref ref-type="bibr" rid="B50">Turner, 2004</xref>). Since it is not naturally available in free form for large-scale applications, hydrogen is produced synthetically through a variety of processes (<xref ref-type="bibr" rid="B42">Sigfusson, 2007</xref>). Photocatalytic/photoelectrochemical splitting of water using sunlight, a suitable photocatalyst, and renewable electricity is perhaps the most environmentally benign method to produce hydrogen at scale (<xref ref-type="bibr" rid="B9">Edwards et&#x20;al., 2007</xref>). Ever since the demonstration of solar water splitting by <xref ref-type="bibr" rid="B11">Fujishima and Honda (1972</xref>) using TiO<sub>2</sub> electrodes, an enormous amount of effort has been put into identifying new photocatalysts. Various materials, such as metal oxides, nitrides, sulfides, oxysulfides, oxynitrides, and Z-scheme materials, have been developed with enhanced efficiencies for solar water splitting. A detailed overview of these developments and the progress made in the field has been documented in several excellent review articles (<xref ref-type="bibr" rid="B35">Osterloh, 2008</xref>; <xref ref-type="bibr" rid="B22">Kudo and Miseki, 2009</xref>; <xref ref-type="bibr" rid="B27">Maeda and Domen, 2010</xref>; <xref ref-type="bibr" rid="B36">Osterloh and Parkinson, 2011</xref>; <xref ref-type="bibr" rid="B48">Tachibana et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B14">Hisatomi et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B2">Ahmad et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B63">Zou and Zhang, 2015</xref>; <xref ref-type="bibr" rid="B30">Moniruddin et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B37">Prasad, 2020</xref>).
The emergence of 2D materials, heralded by the discovery of graphene (<xref ref-type="bibr" rid="B33">Novoselov et&#x20;al., 2004</xref>), has added a new dimension to the search for efficient photocatalysts. In addition to stability and a suitable electronic structure, these materials provide a large surface-to-volume ratio, higher charge carrier mobility, and reduced recombination rates, all of which aid in enhancing the reaction rates at the photocatalyst surface (<xref ref-type="bibr" rid="B24">Li et&#x20;al., 2017</xref>). Various 2D materials, mostly chalcogenides such as SnS and SnSe (<xref ref-type="bibr" rid="B47">Sun et&#x20;al., 2014</xref>), CdS (<xref ref-type="bibr" rid="B56">Xu et&#x20;al., 2013</xref>), WS<sub>2</sub> (<xref ref-type="bibr" rid="B53">Voiry et&#x20;al., 2013</xref>), SnS<sub>2</sub> (<xref ref-type="bibr" rid="B46">Sun et&#x20;al., 2012</xref>), and MoS<sub>2</sub> (<xref ref-type="bibr" rid="B28">Maitra et&#x20;al., 2013</xref>), have been synthesized and shown to have enhanced photocatalytic performance.</p>
<p>With rapid advancements in first principles methods and computational power, in-silico design/screening of materials has emerged as a promising alternative method to narrow the search space of novel functional materials (<xref ref-type="bibr" rid="B1">Agrawal and Choudhary, 2016</xref>). For instance, high-throughput density functional theory (DFT) calculations have been used to identify oxynitrides (<xref ref-type="bibr" rid="B54">Wu et&#x20;al., 2013</xref>), perovskites (<xref ref-type="bibr" rid="B6">Castelli et&#x20;al., 2012a</xref>; <xref ref-type="bibr" rid="B5">Castelli et&#x20;al., 2012b</xref>), and chalcogenides (<xref ref-type="bibr" rid="B61">Zhuang and Hennig, 2013a</xref>; <xref ref-type="bibr" rid="B62">Zhuang and Hennig, 2013b</xref>; <xref ref-type="bibr" rid="B43">Singh et&#x20;al., 2015</xref>) as potential photocatalysts for water splitting. Properties of a vast number of materials computed in such high-throughput fashion using accurate first principles methods have been made openly available in repositories such as the Materials Project (MP) (<xref ref-type="bibr" rid="B17">Jain et&#x20;al., 2013</xref>), the Open Quantum Materials Database (OQMD) (<xref ref-type="bibr" rid="B41">Saal et&#x20;al., 2013</xref>), Automatic FLOW for materials discovery (AFLOW) (<xref ref-type="bibr" rid="B8">Curtarolo et&#x20;al., 2012</xref>), and Novel Materials Discovery (NOMAD) (<xref ref-type="bibr" rid="B49">The NOMAD (Novel Materials Discovery) Center of Excellence (CoE), (2021)</xref>). While these repositories primarily contain data on bulk materials, two different datasets containing DFT-computed properties for 2D materials were also published recently (<xref ref-type="bibr" rid="B13">Haastrup et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B60">Zhou et&#x20;al., 2019</xref>). 
Knowledge stored in these repositories has since been mined to screen materials for diverse applications (<xref ref-type="bibr" rid="B58">Zhang et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B44">Singh et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B59">Zhang et&#x20;al., 2019</xref>). In addition, machine learning models have been trained using data from these repositories to predict properties of novel materials (<xref ref-type="bibr" rid="B3">Ahmad et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B55">Xie and Grossman, 2018</xref>; <xref ref-type="bibr" rid="B57">Ye et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B19">Joshi et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B25">Liu et&#x20;al., 2020</xref>). In a recent article, <xref ref-type="bibr" rid="B45">Sorkun et&#x20;al. (2020)</xref> identified several potential 2D materials for photocatalytic water splitting, CO<sub>2</sub> reduction, and N<sub>2</sub> reduction by training AI models on the computational 2D materials database and using the predictions from these models to screen a vast chemical space obtained by systematic elemental substitution in 2D material prototypes.</p>
<p>An alternative approach to high-throughput screening is to build unsupervised deep learning (DL) models that can learn the encodings of materials in a continuous latent space. This latent space can then be sampled to generate novel materials. When linked with one or more material properties, such techniques can enable the discovery of novel materials conditioned on certain properties (i.e.,&#x20;inverse design of functional materials). Variational autoencoders (VAEs) (<xref ref-type="bibr" rid="B21">Kingma and Welling, 2019</xref>) and generative adversarial networks (GANs) (<xref ref-type="bibr" rid="B12">Goodfellow et&#x20;al., 2014</xref>) are two of the most widely used generative models. VAEs use concepts of variational inference to learn the representation of input data by minimizing the reconstruction loss (formally, maximizing the log-likelihood of the observations) as well as the divergence of the learned distribution from an assumed prior distribution (formally, the Kullback-Leibler divergence) (<xref ref-type="bibr" rid="B21">Kingma and Welling, 2019</xref>). On the other hand, GANs use concepts from game theory to adversarially train a generative and a discriminative network. While the objective of the generative network is to fool the discriminator by generating realistic samples, the discriminator aims to correctly distinguish fake samples created by the generator from true samples (<xref ref-type="bibr" rid="B12">Goodfellow et&#x20;al., 2014</xref>). Recently, both VAEs and GANs have been used for the generation of novel inorganic materials. In their iMatGen framework, <xref ref-type="bibr" rid="B32">Noh et&#x20;al. (2019)</xref> used an image-based representation of crystal structures and trained a VAE to generate novel phases of vanadium oxides. While their model was restricted to only two element types (V and O), <xref ref-type="bibr" rid="B15">Hoffmann et&#x20;al. (2019)</xref> introduced a generalization of this concept, <italic>via</italic> inclusion of a segmentation network, to generate novel materials containing multiple types of elements. <xref ref-type="bibr" rid="B7">Court et&#x20;al. (2020)</xref> used these concepts to build a conditional VAE for the generation of novel binary alloys, ternary perovskites, and Heusler compounds, all in cubic symmetry. <xref ref-type="bibr" rid="B38">Ren et&#x20;al. (2020)</xref> used an invertible representation of crystal structures, combining descriptors in both real and reciprocal spaces, and trained a VAE to generate novel thermoelectric materials. <xref ref-type="bibr" rid="B26">Long et&#x20;al. (2020)</xref> and <xref ref-type="bibr" rid="B20">Kim et&#x20;al. (2020)</xref> used GANs to discover new crystal structures in the Bi-Se and Mg-Mn-O systems, respectively.</p>
<p>In this study, we have developed a generative modeling framework for the discovery of novel 2D materials as photocatalysts for water splitting. In comparison to prior works, our framework does not place any restriction on the structure or the stoichiometry of the materials. The bandgap of the generated materials was predicted using the CGCNN model (<xref ref-type="bibr" rid="B55">Xie and Grossman, 2018</xref>), while their band edge positions were computed using empirical correlations. Using this framework, we have discovered several novel 2D materials that are potentially good photocatalysts for water splitting. While we have demonstrated the discovery of 2D photocatalysts as an application, our framework is generic enough to be applied to the discovery of any kind of functional material.</p>
</sec>
<sec sec-type="methods" id="s2">
<title>Methods</title>
<sec id="s2-1">
<title>Data Preparation and Representation</title>
<p>The dataset of 2D materials used to train our hierarchical generative model was obtained from the earlier published study of <xref ref-type="bibr" rid="B18">Jain et&#x20;al. (2019)</xref>. This dataset included data for all the materials that were included in earlier 2D materials databases such as C2DB (<xref ref-type="bibr" rid="B13">Haastrup et&#x20;al., 2018</xref>) and 2DMatPedia (<xref ref-type="bibr" rid="B60">Zhou et&#x20;al., 2019</xref>) as well as the Materials Cloud (<xref ref-type="bibr" rid="B31">Mounet et&#x20;al., 2018</xref>). Furthermore, properties such as the bandgap and energy above hull for all the materials were reported using a uniform level of theory, thereby providing us with a consistent set of data to learn from. Around 7,500 unique 2D materials were present in the dataset, with their structures provided as <italic>cif</italic> files. These structures were converted to image-based representations, which were subsequently used to train all our models.</p>
<p>To represent the crystal structures as images, we followed the concept proposed by <xref ref-type="bibr" rid="B32">Noh et&#x20;al. (2019)</xref> in their iMatGen framework. Just as a crystal structure is construed as a &#x201c;basis&#x201d; of atoms in an underlying &#x201c;lattice&#x201d;, each structure in our dataset was represented using a &#x201c;cell&#x201d; and a &#x201c;basis&#x201d; 3D image. Both images had dimensions of (32 &#xd7; 32&#x20;&#xd7; 32) voxels. The voxel values of the cell image were obtained using a Gaussian function as:<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:msub>
<mml:mi>F</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>exp</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:msubsup>
<mml:mi>r</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msubsup>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:msup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>where <italic>r</italic>
<sub>
<italic>ijk</italic>
</sub> is the Euclidean distance between the center of the lattice and (<italic>i, j, k</italic>)<sup>
<italic>th</italic>
</sup> voxel. The basis image was generated using an atomic number weighted Gaussian transformation as described by <xref ref-type="bibr" rid="B15">Hoffmann et&#x20;al. (2019)</xref>. Concretely, the voxel values of the basis image were obtained as follows:<disp-formula id="e2">
<mml:math id="m2">
<mml:mrow>
<mml:msub>
<mml:mi>G</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mrow>
<mml:msup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mn>3</mml:mn>
</mml:msup>
<mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mn>2</mml:mn>
<mml:mi>&#x3c0;</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mn>1.5</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfrac>
<mml:munder>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
</mml:mstyle>
<mml:mi>l</mml:mi>
</mml:munder>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mi>exp</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>d</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:msup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>where <italic>Z</italic>
<sub>
<italic>l</italic>
</sub> is the atomic number at site &#x201c;<italic>l</italic>&#x201d; of the material, <italic>d</italic>(<italic>Z</italic>
<sub>
<italic>l</italic>
</sub>
<italic>,</italic> (<italic>i, j, k</italic>)) is the Euclidean distance between the site &#x201c;<italic>l</italic>&#x201d; and the (<italic>i, j, k</italic>)<sup>
<italic>th</italic>
</sup> voxel, and &#x201c;<italic>&#x3c3;</italic>&#x201d; is the width of the Gaussian. We used a value of <italic>&#x3c3;</italic> &#x3d; 1.0, consistent with earlier works by <xref ref-type="bibr" rid="B32">Noh et&#x20;al. (2019)</xref> and <xref ref-type="bibr" rid="B15">Hoffmann et&#x20;al. (2019)</xref>, since testing with lower values of <italic>&#x3c3;</italic> resulted in larger errors. In contrast, <xref ref-type="bibr" rid="B7">Court et&#x20;al. (2020)</xref> used the ionic radius of various elements for <italic>&#x3c3;</italic> instead of a constant value. Prior to generating the basis image, the atoms in a material were translated such that their center of geometry lay at the center of a cube of length 10&#xa0;&#xc5;. Together with the basis image, an elements matrix was also constructed to ascertain the positions and types of atoms from the basis image. The elements matrix had the same dimensionality as the basis image (i.e.,&#x20;32 &#xd7; 32&#x20;&#xd7; 32). The voxel values of the &#x201c;elements matrix&#x201d; were assigned as:<disp-formula id="e3">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>j</mml:mi>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mo>&#xa0;</mml:mo>
<mml:mtext>if</mml:mtext>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi>Z</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x2264;</mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mi mathvariant="italic">&#x212b;</mml:mi>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>&#xa0;</mml:mo>
<mml:mtext>otherwise</mml:mtext>
</mml:mrow>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:mrow>
</mml:math>
<label>(3)</label>
</disp-formula>Use of a larger value for the cutoff (larger than <inline-formula id="inf1">
<mml:math id="m4">
<mml:mrow>
<mml:mn>0.5</mml:mn>
<mml:mi mathvariant="italic">&#x212b;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>) would result in an overlap of nearby atoms, thereby rendering unique assignment of atomic numbers to voxels difficult. On the other hand, a smaller cutoff would leave too few of the 32&#x20;&#xd7; 32&#x20;&#xd7; 32 voxels with non-zero values, making it difficult for the segmentation network to correctly identify atoms. <xref ref-type="fig" rid="F1">Figure&#x20;1</xref> shows a representative crystal structure from our dataset, its cell and basis images, and the corresponding elements matrix. To ensure that the generated images had adequate resolution to faithfully represent a crystal structure while limiting the memory requirement, we considered only those materials from our dataset whose lattice dimensions along the basal plane directions, as well as the slab thickness, were not more than 10&#xa0;&#xc5; each. The resulting dataset had a total of about 6,300 structures. This dataset was augmented by creating supercells as well as applying random translations and rotations to the crystal structures to ensure that each element was represented in at least 3,000 structures. Overall, this augmentation resulted in a dataset containing about 0.2 million structures, which was split in a 90:10 ratio into training and test&#x20;sets.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>
<bold>(A)</bold> Crystal structure of a material in our dataset, <bold>(B)</bold> its cell image, <bold>(C)</bold> its basis image, and <bold>(D)</bold> the elements matrix.</p>
</caption>
<graphic xlink:href="fmats-08-679269-g001.tif"/>
</fig>
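<p>The basis image (Eq. 2) and the elements matrix (Eq. 3) can be illustrated with a minimal sketch. It is not the authors&#x2019; implementation: the function names, the voxel-center grid layout, the use of &#xc5; for <italic>&#x3c3;</italic> and all distances, and the two-atom toy geometry are assumptions made purely for illustration.</p>
<preformat preformat-type="code">
```python
import numpy as np

GRID = 32      # voxels per axis (from the text)
BOX = 10.0     # cube edge in angstrom (from the text)
SIGMA = 1.0    # Gaussian width (from the text; angstrom units are assumed)
CUTOFF = 0.5   # elements-matrix cutoff in angstrom (Eq. 3)


def voxel_centers(grid=GRID, box=BOX):
    """Voxel-center coordinates, shape (grid, grid, grid, 3). Assumed layout."""
    axis = (np.arange(grid) + 0.5) * (box / grid)
    return np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), axis=-1)


def center_in_box(positions, box=BOX):
    """Translate atoms so that their center of geometry sits at the cube center."""
    positions = np.asarray(positions, dtype=float)
    return positions - positions.mean(axis=0) + box / 2.0


def basis_image(positions, numbers, sigma=SIGMA):
    """Atomic-number-weighted sum of Gaussians evaluated at every voxel (Eq. 2)."""
    centers = voxel_centers()
    image = np.zeros(centers.shape[:3])
    for pos, z in zip(positions, numbers):
        d2 = np.sum((centers - pos) ** 2, axis=-1)
        image += z * np.exp(-d2 / (2.0 * sigma ** 2))
    return image / (sigma ** 3 * (2.0 * np.pi) ** 1.5)


def elements_matrix(positions, numbers, cutoff=CUTOFF):
    """Atomic number of the site lying within the cutoff of a voxel, else 0 (Eq. 3)."""
    centers = voxel_centers()
    matrix = np.zeros(centers.shape[:3], dtype=int)
    for pos, z in zip(positions, numbers):
        d = np.linalg.norm(centers - pos, axis=-1)
        matrix[cutoff >= d] = z
    return matrix


# Toy two-atom example (hypothetical geometry, not taken from the dataset).
atoms = center_in_box([[0.0, 0.0, 0.0], [1.6, 0.0, 0.0]])
Z = [42, 16]
G = basis_image(atoms, Z)
S = elements_matrix(atoms, Z)
```
</preformat>
<p>In this sketch the two atoms are 1.6&#xa0;&#xc5; apart, so their 0.5&#xa0;&#xc5; spheres do not overlap and every non-zero voxel of the elements matrix carries a single, unambiguous atomic number.</p>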
</sec>
<sec id="s2-2">
<title>Deep Learning Model and Network Architecture</title>
<p>We constructed a two-step hierarchical deep learning model, similar to the iMatGen framework (<xref ref-type="bibr" rid="B32">Noh et&#x20;al., 2019</xref>), to learn the representations of the 2D materials in our dataset and to generate novel materials by sampling from the learned continuous representations. The first step of the model consisted of training a cell autoencoder and a basis autoencoder, as well as a segmentation network for identification of atomic positions and corresponding element types from the basis image and the elements matrix. Both autoencoders were constructed as 3D convolutional neural networks (3D CNNs). The encoder of the cell autoencoder consisted of four 3D convolutional layers, while the decoder used four 3D convolution transpose layers (i.e.,&#x20;a mirror image of the encoder). Similarly, the encoder of the basis autoencoder consisted of four 3D convolutional layers followed by a fully connected layer. However, its decoder used upsampling instead of 3D convolution transpose. The dimensions of the cell and basis encoding vectors (i.e.,&#x20;the autoencoder bottleneck dimensions) were 128 and 256, respectively. During training, mean squared error (MSE) was used as the loss function. The detailed architectures of the cell and basis autoencoders are shown in <xref ref-type="fig" rid="F2">Figures 2</xref> and <xref ref-type="fig" rid="F3">3</xref>, respectively.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Architecture of the cell autoencoder.</p>
</caption>
<graphic xlink:href="fmats-08-679269-g002.tif"/>
</fig>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Architecture of the basis autoencoder.</p>
</caption>
<graphic xlink:href="fmats-08-679269-g003.tif"/>
</fig>
<p>After the basis autoencoder was trained, the segmentation network [a 3D attention U-net model (<xref ref-type="bibr" rid="B34">Oktay et&#x20;al., 2018</xref>)] was trained independently using the reconstructed basis images (i.e.,&#x20;images obtained as the output from the decoder of the basis autoencoder) to identify the locations of atoms, and the element types at those locations, as atomic clusters. This contrasts with the study of <xref ref-type="bibr" rid="B15">Hoffmann et&#x20;al. (2019)</xref>, who trained their segmentation network together with the basis autoencoder in an end-to-end fashion to identify the locations of atoms in a material. The elements matrix prepared earlier for each structure was converted into a species matrix <italic>via</italic> one hot encoding into 95 classes at each grid point. Of these 95 classes, one class represented the background (or vacuum) while the other 94 classes corresponded to different elements. If a particular element type was present at a grid point of the elements matrix, its corresponding class was set to 1 while the rest of the values of the one hot vector remained zero. Thus, for each material, the ground truth to train the segmentation network was a species matrix of dimension (32&#x20;&#xd7; 32&#x20;&#xd7; 32&#x20;&#xd7; 95). The binary cross entropy (BCE) loss was used while training the segmentation network.</p>
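<p>The conversion of an elements matrix into the one hot species matrix described above can be sketched as follows. The mapping of class index to atomic number (class 0 for vacuum, classes 1 to 94 for elements) and the function name are assumptions; the text does not state how the 94 element classes were indexed.</p>
<preformat preformat-type="code">
```python
import numpy as np

N_CLASSES = 95  # 1 background (vacuum) class + 94 element classes (from the text)


def species_matrix(elements, n_classes=N_CLASSES):
    """One hot encode an integer elements matrix into shape (..., n_classes).

    Assumption: the class index of an element equals its atomic number,
    and class 0 is the background.
    """
    elements = np.asarray(elements, dtype=int)
    if not (elements.min() >= 0 and n_classes > elements.max()):
        raise ValueError("entries must lie in the range 0 .. n_classes - 1")
    # Indexing an identity matrix by the class labels yields the one hot vectors.
    return np.eye(n_classes, dtype=np.float32)[elements]


# Toy elements matrix: all vacuum except a single voxel holding Z = 42.
elements = np.zeros((32, 32, 32), dtype=int)
elements[1, 2, 3] = 42
species = species_matrix(elements)
```
</preformat>
<p>Taking the argmax over the last axis of the species matrix (or of a network prediction with the same shape) recovers an elements matrix under the same indexing assumption.</p>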
<p>In the second step of our hierarchical model, we trained a generative model to obtain a continuous representation of the 2D materials that can be sampled to discover novel materials. Thermodynamic stability and the presence of a bandgap are two necessary conditions that any 2D material must satisfy to qualify as a potential photocatalyst for the water splitting reaction. As a rule of thumb, we considered a material in our database to be stable if its energy above the hull (e_hull) did not exceed 150&#xa0;meV per atom. Thus, the materials in our training dataset were classified into four categories as shown in <xref ref-type="table" rid="T1">Table&#x20;1</xref>.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Classification of the 2D materials in our dataset into four different classes based on their bandgap and energy above the hull values.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Condition</th>
<th align="center">One hot encoding</th>
<th align="center">Category</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">1) Gap &#x3e; 0&#xa0;eV, e_hull &#x2264; 0.15&#xa0;eV/atom</td>
<td align="center">(1,0,0,0)</td>
<td align="left">Non-metal, stable</td>
</tr>
<tr>
<td align="left">2) Gap &#x3d; 0&#xa0;eV, e_hull &#x2264; 0.15&#xa0;eV/atom</td>
<td align="center">(0,1,0,0)</td>
<td align="left">Metal, stable</td>
</tr>
<tr>
<td align="left">3) Gap &#x3e; 0&#xa0;eV, e_hull &#x3e; 0.15&#xa0;eV/atom</td>
<td align="center">(0,0,1,0)</td>
<td align="left">Non-metal, unstable</td>
</tr>
<tr>
<td align="left">4) Gap &#x3d; 0&#xa0;eV, e_hull &#x3e; 0.15&#xa0;eV/atom</td>
<td align="center">(0,0,0,1)</td>
<td align="left">Metal, unstable</td>
</tr>
</tbody>
</table>
</table-wrap>
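<p>The labeling rule of <xref ref-type="table" rid="T1">Table&#x20;1</xref> can be written as a small function. This is only a sketch; the function and constant names are hypothetical, and the bandgap and energy above the hull are assumed to be given in eV and eV/atom, respectively.</p>
<preformat preformat-type="code">
```python
E_HULL_MAX = 0.15  # eV/atom, stability threshold taken from Table 1


def class_vector(gap_ev, e_hull_ev):
    """Return the one hot class vector of Table 1 for a single material."""
    nonmetal = gap_ev > 0.0
    stable = E_HULL_MAX >= e_hull_ev
    # Index 0: non-metal/stable, 1: metal/stable,
    # index 2: non-metal/unstable, 3: metal/unstable.
    index = (0 if nonmetal else 1) + (0 if stable else 2)
    vector = [0, 0, 0, 0]
    vector[index] = 1
    return tuple(vector)
```
</preformat>
<p>For example, a hypothetical material with a bandgap of 1.2&#xa0;eV and an energy above the hull of 0.05&#xa0;eV/atom falls into class 1, the class targeted in this study.</p>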
<p>The objective of our study was to discover novel 2D materials belonging to class 1 (i.e.,&#x20;thermodynamically stable with a finite electronic bandgap) so that potential photocatalysts for the water splitting reaction could be identified. Accordingly, a conditional variational autoencoder (CVAE) was chosen as our generative model so that, while sampling the latent space for new materials, control could be exerted over the class of material to be generated (here, class 1 as described above). Our CVAE model was trained using the cell and basis encodings from the previous step (step 1) together with the one hot encoded class vectors. Cell encodings were padded with zeros such that both the cell and basis encodings were 256-dimensional vectors. Subsequently, these were scaled using the normal quantile transformer with 1,000 quantiles. The four-dimensional one hot encoded vector was connected to a 256-dimensional hidden layer so that the cell, basis, and class encodings were all 256-dimensional vectors. These vectors were then concatenated as &#x201c;channels&#x201d; so that each training sample was represented by a (256 &#xd7; 3) image. The CVAE network consisted of a probabilistic encoder and a probabilistic decoder. We represented both the encoder and the decoder <italic>via</italic> 2D CNNs. The detailed architecture of our CVAE model is shown in <xref ref-type="fig" rid="F4">Figure&#x20;4</xref>.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>CVAE network architecture.</p>
</caption>
<graphic xlink:href="fmats-08-679269-g004.tif"/>
</fig>
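<p>The assembly of one (256 &#xd7; 3) CVAE input from the cell encoding, the basis encoding, and the class vector can be sketched as below. Only the shapes follow the text; the function name is hypothetical, the hidden layer acting on the class vector is replaced by a fixed random weight matrix as a stand-in for learned weights, and the quantile scaling step is omitted.</p>
<preformat preformat-type="code">
```python
import numpy as np

CELL_DIM, BASIS_DIM, N_CLASSES = 128, 256, 4  # dimensions stated in the text


def assemble_input(cell_code, basis_code, class_onehot, class_weights):
    """Stack cell, basis, and class encodings as three channels of length 256.

    class_weights has shape (4, 256) and stands in for the learned hidden
    layer that lifts the one hot class vector to 256 dimensions (assumption).
    """
    cell_code = np.asarray(cell_code, dtype=float)
    basis_code = np.asarray(basis_code, dtype=float)
    # Zero-pad the 128-dimensional cell encoding up to 256 entries.
    padded_cell = np.pad(cell_code, (0, BASIS_DIM - cell_code.size))
    class_code = np.asarray(class_onehot, dtype=float) @ class_weights
    return np.stack([padded_cell, basis_code, class_code], axis=-1)


rng = np.random.default_rng(0)
cell = rng.normal(size=CELL_DIM)            # placeholder cell encoding
basis = rng.normal(size=BASIS_DIM)          # placeholder basis encoding
weights = rng.normal(size=(N_CLASSES, BASIS_DIM))
x = assemble_input(cell, basis, (1, 0, 0, 0), weights)
```
</preformat>
<p>Treating the three vectors as channels gives the 2D CNN encoder a single array per material in which the class information is aligned with the structural encodings.</p>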
<p>The probabilistic encoder encoded the input into a distribution with mean <italic>&#xb5;</italic> and standard deviation <italic>&#x3c3;</italic>. A latent vector was then sampled from this distribution using the reparameterization trick, z &#x3d; &#xb5; &#x2b; &#x3b5;&#x2a;&#x3c3;, where &#x3b5; is a random variable drawn from a standard normal distribution. This vector was passed through the probabilistic decoder to obtain the cell and basis encodings as the output. To train the CVAE, we implemented the optimal &#x3c3;-VAE variant, a simple and effective methodology suggested by <xref ref-type="bibr" rid="B40">Rybkin et&#x20;al. (2020)</xref> that does not require tuning the weight on the KL divergence term of the objective function as a hyperparameter. The implementation automatically balances the two terms of the CVAE objective function, namely, the reconstruction loss (or MSE) and the KL divergence. The objective function for our CVAE network was defined as:<disp-formula id="e4">
<mml:math id="m5">
<mml:mrow>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mi>C</mml:mi>
<mml:mi>V</mml:mi>
<mml:mi>A</mml:mi>
<mml:mi>E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x3d;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>D</mml:mi>
<mml:mi>ln</mml:mi>
<mml:mi>&#x3c3;</mml:mi>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x2b;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mfrac>
<mml:mi>D</mml:mi>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:msup>
<mml:mi>&#x3c3;</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
<mml:mi>MSE</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mi>x</mml:mi>
<mml:mo>&#x5e;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mo>,</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x2b;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:msub>
<mml:mi>D</mml:mi>
<mml:mrow>
<mml:mi>K</mml:mi>
<mml:mi>L</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>q</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>&#x7c;</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#x7c;</mml:mo>
<mml:mo>&#x7c;</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>z</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>where D is the dimensionality of the input (x), D<sub>KL</sub> is the KL divergence, q(z&#x7c;x) is the encoding distribution, p(z) is the prior distribution (chosen as a normal distribution with zero mean and unit standard deviation), and <italic>&#x3c3;</italic> is the weighting parameter that balances the KL divergence and MSE&#x20;terms (in the &#x3c3;-VAE formulation this is the standard deviation of the decoder distribution, not the encoder standard deviation introduced above).</p>
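<p>The reparameterization trick and the objective in Equation 4 can be sketched as follows. This is a minimal sketch, assuming a diagonal Gaussian encoder and the analytic choice of the decoder variance described by Rybkin et&#x20;al. (2020), in which the squared decoder standard deviation is set equal to the MSE. The function names and the lower bound <italic>min_sigma_dec</italic> are hypothetical and are not taken from the original implementation.</p>
<preformat><![CDATA[
```python
import math
import random


def reparameterize(mu, sigma_enc, rng):
    """z = mu + eps * sigma with eps ~ N(0, 1), applied element-wise."""
    return [m + rng.gauss(0.0, 1.0) * s for m, s in zip(mu, sigma_enc)]


def kl_to_standard_normal(mu, sigma_enc):
    """Closed-form KL( N(mu, diag(sigma^2)) || N(0, I) )."""
    return 0.5 * sum(m * m + s * s - 1.0 - 2.0 * math.log(s)
                     for m, s in zip(mu, sigma_enc))


def mse(x_hat, x):
    """Mean squared error over the D input dimensions."""
    return sum((a - b) ** 2 for a, b in zip(x_hat, x)) / len(x)


def sigma_vae_loss(x_hat, x, mu, sigma_enc, min_sigma_dec=1e-3):
    """Equation 4 with the decoder sigma set to its analytic optimum.

    sigma_dec**2 = MSE minimises D*ln(sigma) + D*MSE/(2*sigma**2);
    the floor min_sigma_dec (an assumption) avoids log(0).
    """
    d = len(x)
    err = mse(x_hat, x)
    sigma_dec = max(math.sqrt(err), min_sigma_dec)
    reconstruction = d * math.log(sigma_dec) + d * err / (2.0 * sigma_dec ** 2)
    return reconstruction + kl_to_standard_normal(mu, sigma_enc)


# Toy numbers chosen only to make the arithmetic easy to follow.
x = [0.0, 0.0, 0.0, 0.0]
x_hat = [0.1, 0.1, 0.1, 0.1]
loss = sigma_vae_loss(x_hat, x, mu=[0.0, 0.0], sigma_enc=[1.0, 1.0])
```
]]></preformat>
<p>With the decoder variance tied to the MSE, the second term of Equation 4 reduces to D/2, so the reconstruction term is governed by D ln <italic>&#x3c3;</italic> and no separate weight on the KL term has to be tuned.</p>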
</sec>
<sec id="s2-3">
<title>Bandgap and Band Edge Positions of 2D Materials</title>
<p>The bandgap of a material and its band edge positions must be of appropriate values for a material to be a potentially good photocatalyst for water splitting. While DFT has been the method of choice to compute these properties of a material, several DL models with good accuracy have been reported recently that are well suited for rapid screening of novel materials. We used the CGCNN model (<xref ref-type="bibr" rid="B55">Xie and Grossman, 2018</xref>) to predict the bandgaps of the materials obtained from our model. The weights of the CGCNN model were retrained using our 2D materials dataset. Since data augmentation of the aforementioned kind is irrelevant for graph-based models, we considered only those materials from the original dataset that had a non-zero bandgap. The bandgap predicted from the trained CGCNN model was used to compute the band edge positions using the empirical equations given below:<disp-formula id="e5">
<mml:math id="m6">
<mml:mrow>
<mml:msubsup>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mi>C</mml:mi>
<mml:mi>B</mml:mi>
</mml:mrow>
<mml:mn>0</mml:mn>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>&#x3c9;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>H</mml:mi>
<mml:mi>E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:mfrac>
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mi>g</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>
<disp-formula id="e6">
<mml:math id="m7">
<mml:mrow>
<mml:msubsup>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mi>B</mml:mi>
</mml:mrow>
<mml:mn>0</mml:mn>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>&#x3c9;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>H</mml:mi>
<mml:mi>E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mn>2</mml:mn>
</mml:mfrac>
<mml:msub>
<mml:mi>E</mml:mi>
<mml:mi>g</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(6)</label>
</disp-formula>
<disp-formula id="e7">
<mml:math id="m8">
<mml:mrow>
<mml:mi>&#x3c9;</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>X</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#x3d;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mroot>
<mml:mrow>
<mml:msubsup>
<mml:mi>X</mml:mi>
<mml:mn>1</mml:mn>
<mml:mi>a</mml:mi>
</mml:msubsup>
<mml:msubsup>
<mml:mi>X</mml:mi>
<mml:mn>2</mml:mn>
<mml:mi>b</mml:mi>
</mml:msubsup>
<mml:msubsup>
<mml:mi>X</mml:mi>
<mml:mn>3</mml:mn>
<mml:mi>c</mml:mi>
</mml:msubsup>
<mml:mo>&#x2026;</mml:mo>
<mml:msubsup>
<mml:mi>X</mml:mi>
<mml:mi>n</mml:mi>
<mml:mi>q</mml:mi>
</mml:msubsup>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:mroot>
<mml:mo>,</mml:mo>
</mml:mrow>
</mml:math>
<label>(7)</label>
</disp-formula>where <italic>E</italic>
<sub>
<italic>CB</italic>
</sub>
<sup>
<italic>0</italic>
</sup> and <italic>E</italic>
<sub>
<italic>VB</italic>
</sub>
<sup>
<italic>0</italic>
</sup> are the conduction and valence band edge energies, <italic>E</italic>
<sub>
<italic>g</italic>
</sub> is the bandgap predicted by the CGCNN model, <italic>E</italic>
<sub>SHE</sub> is the absolute electrode potential of the standard hydrogen electrode (&#x3d; 4.4&#xa0;V), and <italic>X</italic>
<sub>
<italic>i</italic>
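</sub> is the electronegativity of the <italic>i</italic>th constituent element of a material, while <italic>a, b, c, &#x2026;, q</italic> are the numbers of atoms of each of these elements in the material&#x2019;s unit cell and <italic>N</italic> &#x3d; <italic>a</italic> &#x2b; <italic>b</italic> &#x2b; &#x2026; &#x2b; <italic>q</italic> is their total. &#x201c;<italic>&#x3c9;</italic>&#x201d; is thus the geometric mean of the electronegativities of the constituent atoms in a material.</p>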
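<p>Equations 5&#x2013;7 can be evaluated with a few lines of code, sketched below. The electronegativity values in the example are placeholders chosen only to make the arithmetic transparent; they are not tabulated values, and the function names are hypothetical. The total atom count N is taken as the sum of the per-element counts.</p>
<preformat><![CDATA[
```python
import math

E_SHE = 4.4  # absolute potential of the SHE in V, the value quoted in the text


def geometric_mean_electronegativity(counts, electronegativity):
    """Equation 7: N-th root of the product of X_i raised to the atom counts."""
    n_atoms = sum(counts.values())
    log_sum = sum(n * math.log(electronegativity[el])
                  for el, n in counts.items())
    return math.exp(log_sum / n_atoms)


def band_edges(counts, electronegativity, bandgap):
    """Equations 5 and 6: conduction and valence band edge positions."""
    omega = geometric_mean_electronegativity(counts, electronegativity)
    e_cb = omega - E_SHE - 0.5 * bandgap
    e_vb = omega - E_SHE + 0.5 * bandgap
    return e_cb, e_vb


# Placeholder electronegativities for two fictitious elements "A" and "B".
toy_chi = {"A": 4.0, "B": 9.0}
omega = geometric_mean_electronegativity({"A": 1, "B": 1}, toy_chi)
e_cb, e_vb = band_edges({"A": 1, "B": 1}, toy_chi, bandgap=2.0)
```
]]></preformat>
<p>By construction the two edges are placed symmetrically about &#x3c9;(X) &#x2212; E<sub>SHE</sub> and are separated by exactly the predicted bandgap, so any error in the CGCNN bandgap propagates to each band edge with a factor of one half.</p>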
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>Deep Learning Model Training</title>
<p>A two-step hierarchical DL model using an image-based representation of materials was developed to discover novel 2D materials as potential photocatalysts for the water splitting reaction. The first step of the model consisted of autoencoders and a segmentation network to encode the cell and basis images and ascertain the location and types of atoms from the basis images. The subsequent step used the cell and basis encodings together with a conditional property vector to obtain a continuous latent space encoding of the 2D materials using a CVAE. This latent space could be sampled to generate novel 2D materials whose bandgaps and band edge positions were predicted using a reparameterized CGCNN model and empirical correlations, respectively. While the usual practice in training such DL models is to initialize the network weights to random values, we used a more &#x201c;informed&#x201d; initial guess by pretraining these networks on data from the Materials Project (MP) database (<xref ref-type="bibr" rid="B17">Jain et&#x20;al., 2013</xref>). Details of the dataset used for this pretraining as well as all the model hyperparameters are provided in <xref ref-type="sec" rid="s8">Supplementary Tables S1, S2</xref> of the supplementary material. We first present the training results for the individual models and then present the error metrics upon execution of the entire pipeline.</p>
<p>
<xref ref-type="table" rid="T2">Table&#x20;2</xref> provides a summary of the test set errors after training our individual DL models from the first step on the 2D materials dataset. For the cell and basis autoencoders, the MSE and MAE correspond to the error incurred in reconstructing the input images, while for the segmentation network, the MAE corresponds to the error in reproducing the species matrix. These low errors show that the networks were able to accurately reconstruct the cell and basis images. <xref ref-type="fig" rid="F5">Figure&#x20;5</xref> compares a 2D slice of the input 3D cell image for a material in our test set with the corresponding slice of the reconstructed image produced by the cell autoencoder.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Test set errors in the cell and basis autoencoder and the segmentation network after training these models on the 2D materials dataset.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left"/>
<th align="center">Mean squared error (MSE)</th>
<th align="center">Mean absolute error (MAE)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Cell autoencoder</td>
<td align="center">3.17 &#xd7; 10<sup>&#x2212;8</sup>
</td>
<td align="center">8.32 &#xd7; 10<sup>&#x2212;6</sup>
</td>
</tr>
<tr>
<td align="left">Basis autoencoder</td>
<td align="center">1.99 &#xd7; 10<sup>&#x2212;4</sup>
</td>
<td align="center">6.59 &#xd7; 10<sup>&#x2212;3</sup>
</td>
</tr>
<tr>
<td align="left">&#x2014;</td>
<td align="center">Binary cross entropy loss (BCE)</td>
<td align="center">Mean absolute error (MAE)</td>
</tr>
<tr>
<td align="left">Segmentation network</td>
<td align="center">3.60 &#xd7; 10<sup>&#x2212;5</sup>
</td>
<td align="center">2.17 &#xd7; 10<sup>&#x2212;5</sup>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>A 2D slice of the input cell image <bold>(A)</bold> and its comparison with the corresponding 2D slice from the reconstructed (output of cell autoencoder) cell image <bold>(B)</bold>.</p>
</caption>
<graphic xlink:href="fmats-08-679269-g005.tif"/>
</fig>
<p>While the autoencoders learned to reconstruct the images well, the cell parameters of the materials (i.e.,&#x20;the cell lengths and angles) themselves were obtained from the output of the cell autoencoder (i.e.,&#x20;the decoded cell image) by feeding the voxel values to the inverse of the Gaussian function that was used to construct the cell images originally. <xref ref-type="table" rid="T3">Table&#x20;3</xref> lists the reconstruction errors in the cell parameters. Firstly, we observed that the intrinsic error (i.e.,&#x20;the error in transforming the lattice parameters to the cell image and back calculating the lattice parameters from the constructed image) in the cell image representation was zero, suggesting that the lattice to image transformation was perfect. Secondly, we observed that the error in cell lengths and angles obtained upon inverting the output image from the cell autoencoder was also very small, suggesting that the learned cell encodings represented the cell images&#x20;well.</p>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Reconstruction error in the cell parameters for 2D materials.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left"/>
<th align="center">&#x394;a (&#xc5;)</th>
<th align="center">&#x394;b (&#xc5;)</th>
<th align="center">&#x394;&#x3b1; (&#xb0;)</th>
<th align="center">&#x394;&#x3b2; (&#xb0;)</th>
<th align="center">&#x394;&#x3b3; (&#xb0;)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Intrinsic</td>
<td align="char" char=".">0.00</td>
<td align="char" char=".">0.00</td>
<td align="char" char=".">0.00</td>
<td align="char" char=".">0.00</td>
<td align="char" char=".">0.00</td>
</tr>
<tr>
<td align="left">Test set</td>
<td align="char" char=".">0.04</td>
<td align="char" char=".">0.04</td>
<td align="char" char=".">0.70</td>
<td align="char" char=".">0.61</td>
<td align="char" char=".">0.87</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In comparison to the cell parameters, obtaining the atomic positions from the output of the basis autoencoder and segmentation network required multi-step post-processing. Firstly, the output of the segmentation network was converted to an elements matrix by applying the argmax function to the one-hot encoded species matrix. This assigned an atomic number to each site in the elements matrix. Then, clusters of atoms were identified in the elements matrix using the skimage package (<xref ref-type="bibr" rid="B52">Van der Walt et&#x20;al., 2014</xref>). Finally, the positions of the atoms were assigned as the centroids of the clusters, while the type of atom at each location (i.e.,&#x20;the atomic number) was assigned by majority voting among the sites belonging to that cluster. The error in the atomic position was obtained by computing the distance between the predicted atom &#x201c;<italic>i</italic>&#x201d; in the output elements matrix and the nearest true atom &#x201c;<italic>j</italic>&#x201d; in the original elements matrix (i.e.,&#x20;ground truth) of that material. <xref ref-type="fig" rid="F6">Figure&#x20;6A</xref> shows the test set predictions from the segmentation network as a parity plot between the predicted vs. true number of atoms, while <xref ref-type="fig" rid="F6">Figure&#x20;6B</xref> shows a parity plot between the predicted vs. true atom types for those materials predicted to have the correct number of atoms. From <xref ref-type="fig" rid="F6">Figure&#x20;6</xref> and the loss values in <xref ref-type="table" rid="T2">Table&#x20;2</xref>, it can be inferred that the networks were able to closely reconstruct the materials in the test&#x20;set.</p>
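<p>The post-processing steps described above can be sketched as follows, starting from the elements matrix produced by the argmax step. This is an illustrative sketch and not the code used in this work: it operates on a small 2D grid rather than the 3D image, it replaces the skimage cluster search with a hand-written breadth-first flood fill over 4-connected sites, and it treats a value of 0 as an empty site. All function names are hypothetical.</p>
<preformat><![CDATA[
```python
import math
from collections import Counter, deque


def find_atoms(elements):
    """Group 4-connected occupied sites; return (centroid, atomic number) per cluster."""
    rows, cols = len(elements), len(elements[0])
    seen = [[False] * cols for _ in range(rows)]
    atoms = []
    for r in range(rows):
        for c in range(cols):
            if elements[r][c] == 0 or seen[r][c]:
                continue
            # Breadth-first flood fill collects every site of this cluster.
            queue, sites = deque([(r, c)]), []
            seen[r][c] = True
            while queue:
                i, j = queue.popleft()
                sites.append((i, j))
                for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ni, nj = i + di, j + dj
                    if (0 <= ni < rows and 0 <= nj < cols
                            and elements[ni][nj] != 0 and not seen[ni][nj]):
                        seen[ni][nj] = True
                        queue.append((ni, nj))
            # Atom position: centroid of the cluster, in grid units.
            centroid = (sum(i for i, _ in sites) / len(sites),
                        sum(j for _, j in sites) / len(sites))
            # Atom type: majority vote over the sites of the cluster.
            z = Counter(elements[i][j] for i, j in sites).most_common(1)[0][0]
            atoms.append((centroid, z))
    return atoms


def position_errors(predicted, truth):
    """Distance from each predicted atom to the nearest true atom."""
    return [min(math.dist(p, t) for t, _ in truth) for p, _ in predicted]


# Toy elements matrix: one mislabelled site (1) inside an oxygen (8) cluster.
elements = [
    [8, 8, 0, 0, 0],
    [8, 1, 0, 0, 0],
    [0, 0, 0, 22, 22],
    [0, 0, 0, 22, 22],
]
atoms = find_atoms(elements)
truth = [((0.0, 0.5), 8), ((2.5, 3.5), 22)]
errors = position_errors(atoms, truth)
```
]]></preformat>
<p>The toy example shows why majority voting is useful: a single mislabelled site inside a cluster does not change the atom type assigned to that cluster.</p>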
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Performance of the segmentation network. <bold>(A)</bold> Predicted vs. true number of atoms in materials from the test set. <bold>(B)</bold> Predicted vs. true atomic numbers of atoms in those materials that were predicted to have the correct number of&#x20;atoms.</p>
</caption>
<graphic xlink:href="fmats-08-679269-g006.tif"/>
</fig>
<p>Further analysis of these predictions revealed that for 87.7% of materials in the test set, the basis autoencoder and segmentation network were able to predict the correct number of atoms as well as the material composition, with a small RMSE of 0.06&#xa0;&#xc5; in the atomic positions. This accuracy can also be seen in <xref ref-type="fig" rid="F7">Figure&#x20;7</xref>, which shows 2D slices of the input and reconstructed basis images of a material from the test set as well as the corresponding elements matrices.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Performance of the basis autoencoder and the segmentation network. <bold>(A)</bold> and <bold>(B)</bold> panels show a 2D slice of the input and reconstructed basis images for a material from the test set, respectively. <bold>(C)</bold> and <bold>(D)</bold> panels show the corresponding input and output elements matrices, respectively.</p>
</caption>
<graphic xlink:href="fmats-08-679269-g007.tif"/>
</fig>
<p>Having trained the cell and basis autoencoders and the segmentation network, we next trained the generative model (CVAE) with the cell and basis encodings and the one-hot property vector as inputs. Once again, pretrained weights from the MP dataset were taken as the initial guess for the CVAE model. Since the main objective of this study was the discovery of novel 2D materials for photocatalytic water splitting, it was essential that the learned latent space be smooth and continuous for generating realistic materials. The kernel density estimate (KDE) plot in <xref ref-type="fig" rid="F8">Figure&#x20;8</xref> shows that the 128-dimensional latent space was mostly smooth and continuous and approximately followed a unit Gaussian profile. This is further illustrated by the t-SNE plot (<xref ref-type="bibr" rid="B51">Van der Maaten and Hinton, 2008</xref>) shown in <xref ref-type="sec" rid="s8">Supplementary Figure S1</xref> of the supplementary material, which shows a uniform distribution of the latent space encodings. Note that unlike conventional autoencoders, the CVAE latent space is not expected to be segregated into different regions based on the class of material since every sampling produces an instance of a material of a particular class (<xref ref-type="bibr" rid="B4">Atienza, 2018</xref>). The average mean and variance of the latent vectors were found to be 10<sup>&#x2212;3</sup> and 0.99, respectively. The test set KL loss was 1.97, while the reconstruction loss was&#x20;0.014.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>Kernel density estimate plot for the 128-dimensional CVAE latent space. The learned latent space was mostly smooth and continuous and approximately followed a unit Gaussian.</p>
</caption>
<graphic xlink:href="fmats-08-679269-g008.tif"/>
</fig>
<p>After training the individual models, we ran the entire two-step generative model pipeline to obtain the errors in our test set predictions upon end-to-end execution. The cell and basis encodings of the test set materials obtained from the respective autoencoders, together with their one-hot encoded property values, were passed through the CVAE network. The output cell and basis encodings from the CVAE network were passed through the decoders of the respective autoencoders. The output cell images were then inverted to obtain the cell parameters, while the output basis images were segmented to obtain the positions of atoms in each material. Analysis of the results revealed that for 93.8% of the materials, the pipeline was able to predict the correct number of atoms. Among those materials predicted to have the correct number of atoms, 51% were predicted to have the correct stoichiometry. For 22.4% of materials, the largest deviation in the atomic number of any atom constituting the material was within &#xb1;2, while the remaining 26.6% had deviations larger than &#xb1;2 in the predicted atomic numbers. When all the test set materials&#x20;were included, the errors in the lattice parameters were 0.07&#xa0;&#xc5; in the &#x201c;<italic>a</italic>&#x201d; and &#x201c;<italic>b</italic>&#x201d; cell lengths and 0.85&#xb0;, 0.86&#xb0;, and 1.13&#xb0; in the <italic>&#x3b1;</italic>, <italic>&#x3b2;</italic>, and <italic>&#x3b3;</italic> cell angles, respectively. These values reduced to 0.05&#xa0;&#xc5; and 0.06&#xa0;&#xc5; for the &#x201c;<italic>a</italic>&#x201d; and &#x201c;<italic>b</italic>&#x201d; cell lengths and 0.73&#xb0;, 0.72&#xb0;, and 0.93&#xb0; for the <italic>&#x3b1;</italic>, <italic>&#x3b2;</italic>, and <italic>&#x3b3;</italic> cell angles, respectively, when only those materials in the test set with correctly reconstructed stoichiometry were considered.
While the accuracy in reconstructing the lattice and basis of the materials reduced slightly upon end-to-end execution of the pipeline, it remained comparable to that reported by <xref ref-type="bibr" rid="B15">Hoffmann et&#x20;al. (2019)</xref>.</p>
<p>Finally, to predict the bandgap of the generated materials, we reparameterized the CGCNN model with our 2D dataset. Use of the network weights directly from the original CGCNN model resulted in a large MAE in the bandgap of 0.727&#xa0;eV. To reduce the prediction error, we retrained the network beginning with the original CGCNN model weights as the initial guess. A dropout of 0.5 was introduced after the pooling layer of the model to prevent overfitting. The trained model gave a test set MAE of 0.567&#xa0;eV. While this error is smaller than the original model error of 0.727&#xa0;eV on our dataset, it is still larger than the CGCNN model error of 0.388&#xa0;eV reported for bulk materials (<xref ref-type="bibr" rid="B55">Xie and Grossman, 2018</xref>). Nevertheless, owing to the generalizability of the model as well as rapid prediction of bandgaps, we used this model to predict the bandgaps of the generated materials.</p>
</sec>
<sec id="s3-2">
<title>Generation of Novel 2D Materials</title>
<p>The trained DL models were used to generate novel 2D materials by sampling from the CVAE latent space. While the latent space can be sampled in several different ways, we chose to explore the space around the encodings of the materials belonging to class 1 in our training set (i.e.,&#x20;thermodynamically stable non-metals) so that the generated materials were likely to share the characteristics of the pivot material. Specifically, for each of 1000 randomly chosen materials belonging to class 1 in our model training set, we drew 100 samples from a normal distribution of the form <italic>N</italic>(<bold>
<italic>&#xb5;</italic>
</bold>
<sub>
<bold>
<italic>pivot</italic>
</bold>
</sub>
<italic>,</italic> <bold>
<italic>0.1</italic>
</bold>), where <bold>
<italic>&#xb5;</italic>
</bold>
<sub>
<bold>
<italic>pivot</italic>
</bold>
</sub> was the mean of the distribution learned by the CVAE for the pivot material (i.e.,&#x20;the material in the training set). Each sample drawn from the normal distribution was then passed through the probabilistic decoder of the CVAE and subsequently through the decoders of the cell and basis autoencoders to obtain the respective images. The lattice and atomic basis of the materials were obtained from these images as described earlier. After constructing the crystal structure, the materials were subjected to a set of post-processing steps to filter out improbable structures and to narrow our search space for photocatalysts. These steps were as follows:<list list-type="simple">
<list-item>
<p>1) Hydrogen atom position curation: Hydrogen atoms that were more than 1.8&#xa0;&#xc5; away from all other atoms in the generated crystal structure were deemed &#x201c;free&#x201d; hydrogen atoms in vacuum, arising from segmentation network errors. Such hydrogen atoms were deleted from the structure.</p>
</list-item>
<list-item>
<p>2) Bond distance&#x2013;based filtering: Materials in which the interatomic distance between any pair of non-hydrogen atoms was less than 1.2&#xa0;&#xc5; were discarded. If an atom pair contained a hydrogen atom, this distance threshold was set to 0.8&#xa0;&#xc5;.</p>
</list-item>
<list-item>
<p>3) Number of elements&#x2013;based filtering: We discarded those materials that contained more than four element&#x20;types.</p>
</list-item>
</list>
</p>
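<p>The sampling and the three screening steps above can be sketched as follows. This is a minimal sketch under stated assumptions and not the code used in this work: the value 0.1 is treated as a per-dimension standard deviation, atoms are given as (element, Cartesian position) pairs, interatomic distances ignore periodic images, and all function names are hypothetical.</p>
<preformat><![CDATA[
```python
import math
import random
from itertools import combinations


def sample_around_pivot(mu_pivot, n_samples, spread=0.1, seed=0):
    """Draw latent vectors from N(mu_pivot, spread), element-wise.

    Treating `spread` as a standard deviation is an assumption.
    """
    rng = random.Random(seed)
    return [[rng.gauss(m, spread) for m in mu_pivot] for _ in range(n_samples)]


def curate_hydrogens(atoms, cutoff=1.8):
    """Step 1: drop H atoms farther than `cutoff` (angstrom) from every other atom."""
    kept = []
    for i, (el, pos) in enumerate(atoms):
        if el == "H":
            others = [math.dist(pos, p)
                      for j, (_, p) in enumerate(atoms) if j != i]
            if others and min(others) > cutoff:
                continue  # "free" hydrogen in vacuum
        kept.append((el, pos))
    return kept


def passes_distance_filter(atoms, min_heavy=1.2, min_with_h=0.8):
    """Step 2: reject structures containing unphysically short contacts."""
    for (el_a, pos_a), (el_b, pos_b) in combinations(atoms, 2):
        limit = min_with_h if "H" in (el_a, el_b) else min_heavy
        if math.dist(pos_a, pos_b) < limit:
            return False
    return True


def passes_element_filter(atoms, max_elements=4):
    """Step 3: reject structures with more than four element types."""
    return len({el for el, _ in atoms}) <= max_elements


def screen(atoms):
    """Return the curated structure, or None if it is discarded."""
    atoms = curate_hydrogens(atoms)
    if passes_distance_filter(atoms) and passes_element_filter(atoms):
        return atoms
    return None


# Toy structures with made-up coordinates (angstrom).
latent_samples = sample_around_pivot([0.0] * 128, n_samples=100)
good = [("O", (0.0, 0.0, 0.0)), ("H", (0.96, 0.0, 0.0)),
        ("H", (5.0, 5.0, 5.0)), ("Ti", (2.0, 0.0, 0.0))]
bad = [("O", (0.0, 0.0, 0.0)), ("Ti", (1.0, 0.0, 0.0))]
screened_good = screen(good)
screened_bad = screen(bad)
```
]]></preformat>
<p>Running the hydrogen curation before the distance filter means that a stray hydrogen atom alone does not cause an otherwise reasonable structure to be discarded.</p>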
<p>In all, &#x223c;45% of the sampled materials were discarded after the above screening procedure. The crystal structures of the remaining materials were passed through the CGCNN model to obtain an estimate of their bandgaps. Finally, the obtained bandgap was used in the empirical <xref ref-type="disp-formula" rid="e5">equations 5</xref>&#x2013;<xref ref-type="disp-formula" rid="e7">7</xref> to obtain the positions of the valence and conduction band edges. Analysis of the filtered materials first revealed that the sampling generated 411 materials with 73 unique compositions that were present in the test set, but not in the training set. Of these, the crystal structures of 42 materials closely matched those in the test set. <xref ref-type="sec" rid="s8">Supplementary Table S3</xref> in the supplementary material lists these compositions as well as the absolute deviations in the predicted lattice parameters and bond lengths from their true values. The crystal structures of the predicted materials ranged from simple metal halide structures (such as MoI<sub>2</sub>, containing alternate layers of metal and halide ions) to more complicated structures containing molecular species such as carbonates (e.g., MnC<sub>2</sub>O<sub>6</sub>) and phosphates (e.g., Mo<sub>2</sub>P<sub>2</sub>O<sub>10</sub>). These results show that our model was able not only to generate realistic material compositions unseen during training but also to closely predict their crystal structures, further underscoring the accuracy of the model training and the reliability of its predictions. In addition, the model was also able to suggest different phases (i.e.,&#x20;crystal structures) for a given material composition.</p>
<p>Having established the reliability of the trained model, we analyzed the filtered materials to search for novel 2D materials as potential water splitting catalysts. Attention was paid to those materials that were present in neither our training set nor our test set, so that the generated materials were truly novel. In addition to the material composition, the bandgap and the band edges of the material had to be in suitable ranges to qualify as a potential photocatalyst. Specifically, the bandgap of the material had to be between 1.6 and 3&#xa0;eV, while the conduction band edge had to lie below 0&#xa0;eV and the valence band edge above 1.23&#xa0;eV. Such alignment of band edges ensures that the holes generated in the valence band upon photoexcitation are able to oxidize water [since they lie at a more positive potential than the water oxidation potential (&#x3d; 1.23&#xa0;V vs. SHE)] while the electrons populating the conduction band are able to reduce protons [since they lie at a more negative potential than the hydrogen evolution potential (&#x3d; 0&#xa0;V vs. SHE)]. Furthermore, we imposed a constraint of charge neutrality on the generated materials by assigning formal atomic charges corresponding to all the well-known oxidation states of each atom in a material. The charges on all the sites were then summed to ensure that at least one combination of oxidation states led to a net zero charge. Considering these aspects, our model generated about 150 new materials as potential photocatalysts for water splitting. A list of these materials, together with their bandgaps, band edge positions, and lattice parameters, is given in <xref ref-type="sec" rid="s10">Supplementary Table S4</xref> of the supplementary material.</p>
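<p>The band alignment and charge neutrality criteria above can be sketched as follows. This is an illustrative sketch, not the code used in this work: the oxidation state table is a small illustrative subset rather than a complete list, one oxidation state is assigned per element (so mixed-valence solutions are not considered), the treatment of the range boundaries is an assumption, and the function names are hypothetical.</p>
<preformat><![CDATA[
```python
from itertools import product

WATER_OXIDATION = 1.23    # V vs. SHE, as quoted in the text
HYDROGEN_EVOLUTION = 0.0  # V vs. SHE, as quoted in the text


def has_suitable_bands(bandgap, e_cb, e_vb, gap_range=(1.6, 3.0)):
    """Bandgap within range, CB edge below 0 and VB edge above 1.23."""
    return (gap_range[0] <= bandgap <= gap_range[1]
            and e_cb < HYDROGEN_EVOLUTION
            and e_vb > WATER_OXIDATION)


def can_be_charge_neutral(counts, oxidation_states):
    """True if some choice of one oxidation state per element sums to zero."""
    elements = list(counts)
    for states in product(*(oxidation_states[el] for el in elements)):
        if sum(q * counts[el] for q, el in zip(states, elements)) == 0:
            return True
    return False


# Illustrative subset of common oxidation states, not a complete table.
toy_states = {"Mo": [2, 3, 4, 5, 6], "S": [-2, 4, 6], "Na": [1], "Cl": [-1]}

mos2_neutral = can_be_charge_neutral({"Mo": 1, "S": 2}, toy_states)
nacl2_neutral = can_be_charge_neutral({"Na": 1, "Cl": 2}, toy_states)
candidate_ok = has_suitable_bands(bandgap=2.0, e_cb=-0.3, e_vb=1.7)
```
]]></preformat>
<p>The exhaustive enumeration is affordable here because the screened materials contain at most four element types, so the number of oxidation state combinations per material stays small.</p>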
<p>To further narrow this set down to a few tens of materials, we used a CGCNN-based model to classify the materials as stable vs. unstable using a more stringent criterion of e_above_hull &#x2264; 50&#xa0;meV/atom. As before, the CGCNN model was pretrained on the MP dataset followed by training on the 2D materials data. Details of the model training and hyperparameters are provided in <xref ref-type="sec" rid="s10">Supplementary Table S5</xref> of the supplementary material. The test accuracy of the model was 0.87, while the area under the receiver operating characteristic curve (AUC) was 0.924. Subsequently, the &#x223c;150 materials identified above were passed through the classification network, resulting in 19 materials that had a probability of &#x3e;0.99 of belonging to the stable class (i.e.,&#x20;e_above_hull &#x2264; 50&#xa0;meV/atom). These 19 materials are listed in bold at the beginning of <xref ref-type="sec" rid="s10">Supplementary Table S4</xref>, while their structures are provided as <italic>cif</italic> files. From <xref ref-type="sec" rid="s10">Supplementary Table S4</xref> we see that all the materials generated were either halides or oxides/chalcogenides apart from Ag<sub>2</sub>PdN<sub>2</sub>, LiRhN<sub>2</sub>, and InRhN<sub>2</sub>. This stems from the fact that halides were the dominant materials in the 2D materials dataset followed by oxides and chalcogenides. Furthermore, analysis of the e_above_hull values of the materials in class 1 of the dataset revealed that the mean value was 47&#xa0;meV/atom for halides while it was 69&#xa0;meV/atom for oxides. Consistent with this, all 19 shortlisted materials were halides.</p>
<p>Visualization of the structures of these 19 materials revealed that they belonged to a few different structural prototypes. Ce<sub>2</sub>N<sub>2</sub>I<sub>2</sub>, Dy<sub>2</sub>S<sub>2</sub>Cl<sub>2</sub>, Lu<sub>2</sub>S<sub>2</sub>Cl<sub>2</sub>, Tm<sub>2</sub>S<sub>2</sub>Br<sub>2</sub>, and W<sub>2</sub>P<sub>2</sub>Cl<sub>2</sub> had a BiOCl oxyhalide-like orthorhombic structure with each metal ion coordinated to four S/P/N atoms, which were in turn coordinated to four metal ions. The halide ion occupied the hollow site above the metal ions. Other halides such as LuClI, LuSeCl, LuSCl, and RePBr adopted the 2H-MoS<sub>2</sub>-like hexagonal structure while InSCl adopted the 1T-MoS<sub>2</sub>-like structure. In both cases, each metal ion was coordinated to six anions, and each anion was coordinated to three metal ions. It must be noted that both BiOCl (<xref ref-type="bibr" rid="B10">Faraji et&#x20;al., 2019</xref>) and MoS<sub>2</sub> (<xref ref-type="bibr" rid="B23">Li et&#x20;al., 2013</xref>) have themselves exhibited excellent photocatalytic activity for the water splitting reaction. Given that these newly generated materials display favorable bandgaps and band edge positions, are predicted with high confidence to be thermodynamically stable, and adopt structures similar to those of known photocatalysts, they could be considered as new targets for synthesis and evaluation.</p>
<p>Ce<sub>2</sub>Se<sub>2</sub>Br<sub>4</sub>, Nb<sub>2</sub>S<sub>4</sub>Cl<sub>2</sub>, ScTiCl<sub>6</sub>, CeNdBr<sub>6</sub>, NdTbBr<sub>6</sub>, and PrNdCl<sub>6</sub> had a metal trihalide-like structure, with the former three adopting a BiI<sub>3</sub>-like trigonal structure and the latter three adopting an NdBr<sub>3</sub>-like orthorhombic structure. Earlier reports have shown that metal trihalides exhibit interesting magnetic behavior and could potentially be used in magnetic and spintronic applications (<xref ref-type="bibr" rid="B29">McGuire, 2017</xref>; <xref ref-type="bibr" rid="B64">Tomar et&#x20;al., 2019</xref>). Thus, in addition to photocatalysis, these newly generated materials could be studied for other interesting applications as well. Finally, GaSCl adopted an HgI<sub>2</sub>-like structure with four-coordinated metal ions and two-coordinated anions, while W<sub>2</sub>CCl<sub>2</sub> adopted an MXene-like structure with chloride termination.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>With the rapid increase in computational power and advancements in AI algorithms, the application of generative models to synthesizing realistic data has widespread appeal in various fields. Application of these techniques in materials science holds significant promise for realizing in-silico design/discovery/screening of functional materials. In this study, we have demonstrated one such generative modeling approach for the discovery of novel 2D materials as photocatalysts for water splitting. Using an image-based representation of crystal structures, our two-step model first built cell and basis autoencoders to obtain a representation of these images in a lower dimensional space. The reconstructed images from the basis autoencoder were used to train a segmentation network so that the positions and types of atoms in a material could be ascertained. Next, a CVAE model was trained using the cell and basis encodings together with a conditional one-hot property vector to obtain a continuous latent space that can be sampled to generate new materials. The bandgap of the generated materials was predicted using a reparameterized CGCNN model and was then used to obtain their band edge positions <italic>via</italic> empirical relations. Evaluation of the model showed good accuracy in reconstructing materials from the test set. The latent space was then sampled to generate novel 2D materials by exploring the region around materials from the training set. An important metric of reliability for any generative model is its ability to produce realistic samples, which in our case means reproducing the crystal structures of known materials that were previously unseen by the model. To that end, our model was able to predict 73 different compositions that were present in the test set but not in the training set. Of these, the structures of 42 compounds matched closely with their true structures. 
Further analysis of the sampled materials gave several novel materials as potential photocatalysts for water splitting.</p>
<p>Our generative modeling framework is an advancement over other related models reported in the literature. While our model is conceptually similar to the iMatGen framework (<xref ref-type="bibr" rid="B32">Noh et&#x20;al., 2019</xref>), the latter was restricted to predicting novel phases of vanadium oxides only. The use of atomic-number-weighted Gaussians to construct the basis image together with segmentation allowed us to generalize the model to all crystal and atom types. While <xref ref-type="bibr" rid="B7">Court et&#x20;al. (2020)</xref> used a somewhat similar approach in their model, the lack of an explicit representation of the lattice precluded the application of their model to non-orthogonal systems. Furthermore, all these generative models hold an advantage over high-throughput virtual screening approaches such as those reported by <xref ref-type="bibr" rid="B45">Sorkun et&#x20;al. (2020)</xref>, since they can identify not only new material compositions but also new phases for known material compositions. However, this in no way undermines the importance of high-throughput screening approaches. A large amount of data is usually required to build accurate generative models. In cases where such data are absent (which often happens in materials science), building shallow models with the available data and using these models in high-throughput screening is perhaps the only viable approach to identifying novel materials.</p>
<p>Although our generative modeling framework showed good accuracy, there is admittedly scope for improvement. For instance, the cell and basis accuracies deteriorated upon end-to-end execution of the full model owing to the reconstruction error of the CVAE network. Better performance of the VAE network could perhaps be achieved by using deep feature consistent (DFC) VAEs (<xref ref-type="bibr" rid="B16">Hou et&#x20;al., 2017</xref>), as was demonstrated by <xref ref-type="bibr" rid="B7">Court et&#x20;al. (2020)</xref>. Instead of minimizing the pixel-to-pixel difference between the input and reconstructed images (<italic>via</italic> MSE loss), DFC-VAEs attempt to minimize the difference between the hidden representations of the two images (called feature perceptual loss), which eventually leads to a truer (less noisy) reconstruction of the input image. More accurate reconstruction of the cell and basis encodings would in turn yield better accuracies in cell parameters, atomic positions, and element types. Furthermore, while we suggested several novel 2D photocatalysts for the water splitting reaction by sampling from the thermodynamically stable class of non-metals, this screening was based entirely on bandgap and band edge positions. Beyond these, low aqueous solubility, small exciton binding energies, low recombination rates, and favorable surface reaction kinetics are further necessary conditions for a viable 2D water splitting photocatalyst (<xref ref-type="bibr" rid="B43">Singh et&#x20;al., 2015</xref>). Our study, like other reports based on high-throughput screening, does not currently incorporate these properties, primarily owing to the exorbitant cost of computing some of these quantities. Finally, any material designed in-silico gains relevance only when it is realized experimentally and displays the anticipated properties. In the current era of big data, this calls for automated laboratories that can rapidly synthesize new materials (or show that they cannot be synthesized) and characterize them. Such high-throughput experimentation, when combined with data-based predictive models, can significantly accelerate the discovery of novel functional materials. For instance, one could imagine a scenario in which our own generative modeling framework is trained in an active learning fashion by integrating it with automated experimentation <italic>via</italic> orchestration software such as ChemOS (<xref ref-type="bibr" rid="B39">Roch et&#x20;al., 2018</xref>). The generated materials could be rapidly evaluated in experiments and the outcomes fed back into the training set so that the model improves iteratively. In our view, implementing such frameworks could bring us significantly closer to realizing the vision of truly inverse design of materials.</p>
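<p>The distinction drawn above between a pixel-to-pixel loss and a feature perceptual loss can be made concrete with a small toy example. In an actual DFC-VAE the hidden representations come from layers of a pretrained convolutional network; in the sketch below, a fixed difference filter followed by a ReLU acting on a one-dimensional signal stands in for that network. The function names and the toy feature extractor are assumptions made purely for illustration and are not part of the model used in this study.</p>
<preformat><![CDATA[
```python
def mse(a, b):
    """Mean squared error between two equally sized sequences."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def toy_features(signal):
    """Toy stand-in for a hidden layer of a pretrained network.

    Applies a fixed difference filter followed by ReLU, so the output
    responds to rising edges rather than to absolute intensities.
    """
    return [max(0.0, signal[k + 1] - signal[k]) for k in range(len(signal) - 1)]


def pixel_loss(x, x_rec):
    """Pixel-to-pixel reconstruction loss (plain MSE on the raw values)."""
    return mse(x, x_rec)


def feature_perceptual_loss(x, x_rec, extractor=toy_features):
    """MSE between the extracted features of the input and the reconstruction."""
    return mse(extractor(x), extractor(x_rec))


# A sharp peak, a uniformly brightened copy, and a smeared-out copy.
x = [0.0, 0.0, 1.0, 1.0, 0.0, 0.0]
brightened = [0.5, 0.5, 1.5, 1.5, 0.5, 0.5]
smeared = [0.0, 0.5, 0.5, 0.5, 0.5, 0.0]
```
]]></preformat>
<p>With this toy extractor, the brightened copy incurs a pixel loss of 0.25 but a feature loss of zero, since its edges are unchanged, whereas the smeared copy incurs a non-zero feature loss because the edge structure itself is altered. The two losses therefore penalize different kinds of reconstruction error, which is the property that DFC-VAEs exploit.</p>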
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s8">Supplementary Material</xref>; further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>SGS and AA conceptualized the idea. AA implemented the generative modeling framework. SGS and AA jointly wrote the manuscript. All authors discussed the results, their implications, and reviewed the manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work was funded by the TCS-CTO organization under SWON number 1009292.</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>AA, SGS, and BR are employed by Tata Consultancy Services Ltd., India.</p>
</sec>
<sec id="s9" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ack>
<p>The authors wish to thank the TCS R&#x26;I infra team for providing necessary computational resources to train the deep learning models.</p>
</ack>
<sec id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmats.2021.679269/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmats.2021.679269/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.ZIP" id="SM1" mimetype="application/ZIP" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet2.docx" id="SM2" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Agrawal</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Choudhary</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Perspective: Materials Informatics and Big Data: Realization of the &#x201c;Fourth Paradigm&#x201d; of Science in Materials Science</article-title>. <source>APL Mater.</source> <volume>4</volume> (<issue>5</issue>), <fpage>053208</fpage>. <pub-id pub-id-type="doi">10.1063/1.4946894</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ahmad</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Kamarudin</surname>
<given-names>S. K.</given-names>
</name>
<name>
<surname>Minggu</surname>
<given-names>L. J.</given-names>
</name>
<name>
<surname>Kassim</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Hydrogen from Photo-Catalytic Water Splitting Process: A Review</article-title>. <source>Renew. Sustain. Energy Rev.</source> <volume>43</volume>, <fpage>599</fpage>&#x2013;<lpage>610</lpage>. <pub-id pub-id-type="doi">10.1016/j.rser.2014.10.101</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ahmad</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Maheshwari</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Grossman</surname>
<given-names>J.&#x20;C.</given-names>
</name>
<name>
<surname>Viswanathan</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Machine Learning Enabled Computational Screening of Inorganic Solid Electrolytes for Suppression of Dendrite Formation in Lithium Metal Anodes</article-title>. <source>ACS Cent. Sci.</source> <volume>4</volume> (<issue>8</issue>), <fpage>996</fpage>&#x2013;<lpage>1006</lpage>. <pub-id pub-id-type="doi">10.1021/acscentsci.8b00229</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Atienza</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2018</year>). <source>Advanced Deep Learning with Keras: Apply Deep Learning Techniques, Autoencoders, GANs, Variational Autoencoders, Deep Reinforcement Learning, Policy Gradients, and More</source>. <publisher-loc>Birmingham, United Kingdom</publisher-loc>: <publisher-name>Packt Publishing Ltd</publisher-name>.</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Castelli</surname>
<given-names>I. E.</given-names>
</name>
<name>
<surname>Landis</surname>
<given-names>D. D.</given-names>
</name>
<name>
<surname>Thygesen</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Dahl</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chorkendorff</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Jaramillo</surname>
<given-names>T. F.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>New Cubic Perovskites for One- and Two-Photon Water Splitting Using the Computational Materials Repository</article-title>. <source>Energy Environ. Sci.</source> <volume>5</volume> (<issue>10</issue>), <fpage>9034</fpage>&#x2013;<lpage>9043</lpage>. <pub-id pub-id-type="doi">10.1039/c2ee22341d</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Castelli</surname>
<given-names>I. E.</given-names>
</name>
<name>
<surname>Olsen</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Datta</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Landis</surname>
<given-names>D. D.</given-names>
</name>
<name>
<surname>Dahl</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Thygesen</surname>
<given-names>K. S.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Computational Screening of Perovskite Metal Oxides for Optimal Solar Light Capture</article-title>. <source>Energy Environ. Sci.</source> <volume>5</volume> (<issue>2</issue>), <fpage>5814</fpage>&#x2013;<lpage>5819</lpage>. <pub-id pub-id-type="doi">10.1039/c1ee02717d</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Court</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Yildirim</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Jain</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Cole</surname>
<given-names>J.&#x20;M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>3-D Inorganic Crystal Structure Generation and Property Prediction via Representation Learning</article-title>. <source>J.&#x20;Chem. Inf. Model.</source> <volume>60</volume> (<issue>10</issue>), <fpage>4518</fpage>&#x2013;<lpage>4535</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jcim.0c00464</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Curtarolo</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Setyawan</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>R. H.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>AFLOWLIB.ORG: A Distributed Materials Properties Repository from High-Throughput Ab Initio Calculations</article-title>. <source>Comput. Mater. Sci.</source> <volume>58</volume>, <fpage>227</fpage>&#x2013;<lpage>235</lpage>. <pub-id pub-id-type="doi">10.1016/j.commatsci.2012.02.002</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Edwards</surname>
<given-names>P. P.</given-names>
</name>
<name>
<surname>Kuznetsov</surname>
<given-names>V. L.</given-names>
</name>
<name>
<surname>David</surname>
<given-names>W. I. F.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Hydrogen Energy</article-title>. <source>Phil. Trans. R. Soc. A.</source> <volume>365</volume> (<issue>1853</issue>), <fpage>1043</fpage>&#x2013;<lpage>1056</lpage>. <pub-id pub-id-type="doi">10.1098/rsta.2006.1965</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Faraji</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Yousefi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Yousefzadeh</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zirak</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Naseri</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Jeon</surname>
<given-names>T. H.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Two-dimensional Materials in Semiconductor Photoelectrocatalytic Systems for Water Splitting</article-title>. <source>Energy Environ. Sci.</source> <volume>12</volume> (<issue>1</issue>), <fpage>59</fpage>&#x2013;<lpage>95</lpage>. <pub-id pub-id-type="doi">10.1039/c8ee00886h</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fujishima</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Honda</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>1972</year>). <article-title>Electrochemical Photolysis of Water at a Semiconductor Electrode</article-title>. <source>Nature</source> <volume>238</volume> (<issue>5358</issue>), <fpage>37</fpage>&#x2013;<lpage>38</lpage>. <pub-id pub-id-type="doi">10.1038/238037a0</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Goodfellow</surname>
<given-names>I. J.</given-names>
</name>
<name>
<surname>Pouget-Abadie</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Mirza</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Warde-Farley</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Ozair</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <source>Generative Adversarial Networks</source>. <comment>arXiv preprint arXiv:1406.2661</comment>.</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haastrup</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Strange</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Pandey</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Deilmann</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>P. S.</given-names>
</name>
<name>
<surname>Hinsche</surname>
<given-names>N. F.</given-names>
</name>
<name>
<surname>Thygesen</surname>
<given-names>K. S.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>The Computational 2D Materials Database: High-Throughput Modeling and Discovery of Atomically Thin Crystals</article-title>. <source>2D Mater.</source> <volume>5</volume> (<issue>4</issue>), <fpage>042002</fpage>. <pub-id pub-id-type="doi">10.1088/2053-1583/aacfc1</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hisatomi</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Kubota</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Domen</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Recent Advances in Semiconductors for Photocatalytic and Photoelectrochemical Water Splitting</article-title>. <source>Chem. Soc. Rev.</source> <volume>43</volume> (<issue>22</issue>), <fpage>7520</fpage>&#x2013;<lpage>7535</lpage>. <pub-id pub-id-type="doi">10.1039/c3cs60378d</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Hoffmann</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Maestrati</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Sawada</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Sellier</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Bengio</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2019</year>). <source>Data-driven Approach to Encoding and Decoding 3-D Crystal Structures</source>. <comment>arXiv preprint arXiv:1909.00949</comment>.</citation>
</ref>
<ref id="B16">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Hou</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Qiu</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Deep Feature Consistent Variational Autoencoder</article-title>. In <conf-name>2017 IEEE Winter Conference on Applications of Computer Vision (WACV)</conf-name>. <publisher-name>IEEE</publisher-name>, <conf-loc>Santa Rosa, CA</conf-loc>, <conf-date>March 24, 2017 &#x2013; March 31, 2017</conf-date>, <fpage>1133</fpage>&#x2013;<lpage>1141</lpage>. </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jain</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ong</surname>
<given-names>S. P.</given-names>
</name>
<name>
<surname>Hautier</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Richards</surname>
<given-names>W. D.</given-names>
</name>
<name>
<surname>Dacek</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Commentary: The Materials Project: A Materials Genome Approach to Accelerating Materials Innovation</article-title>. <source>APL Mater.</source> <volume>1</volume> (<issue>1</issue>), <fpage>011002</fpage>. <pub-id pub-id-type="doi">10.1063/1.4812323</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jain</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>N&#xf8;rskov</surname>
<given-names>J.&#x20;K.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Stable Two-Dimensional Materials for Oxygen Reduction and Oxygen Evolution Reactions</article-title>. <source>ACS Energy Lett.</source> <volume>4</volume> (<issue>6</issue>), <fpage>1410</fpage>&#x2013;<lpage>1411</lpage>. <pub-id pub-id-type="doi">10.1021/acsenergylett.9b00876</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Joshi</surname>
<given-names>R. P.</given-names>
</name>
<name>
<surname>Eickholt</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Fornari</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Barone</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Peralta</surname>
<given-names>J.&#x20;E.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Machine Learning the Voltage of Electrode Materials in Metal-Ion Batteries</article-title>. <source>ACS Appl. Mater. Inter.</source> <volume>11</volume> (<issue>20</issue>), <fpage>18494</fpage>&#x2013;<lpage>18503</lpage>. <pub-id pub-id-type="doi">10.1021/acsami.9b04933</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kim</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Noh</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Gu</surname>
<given-names>G. H.</given-names>
</name>
<name>
<surname>Aspuru-Guzik</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Jung</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Generative Adversarial Networks for Crystal Structure Prediction</article-title>. <source>ACS Cent. Sci.</source> <volume>6</volume> (<issue>8</issue>), <fpage>1412</fpage>&#x2013;<lpage>1420</lpage>. <pub-id pub-id-type="doi">10.1021/acscentsci.0c00426</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Kingma</surname>
<given-names>D. P.</given-names>
</name>
<name>
<surname>Welling</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2019</year>). <source>An Introduction to Variational Autoencoders</source>. <comment>arXiv preprint arXiv:1906.02691</comment>. <pub-id pub-id-type="doi">10.1561/9781680836233</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kudo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Miseki</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Heterogeneous Photocatalyst Materials for Water Splitting</article-title>. <source>Chem. Soc. Rev.</source> <volume>38</volume> (<issue>1</issue>), <fpage>253</fpage>&#x2013;<lpage>278</lpage>. <pub-id pub-id-type="doi">10.1039/b800489g</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.-L.</given-names>
</name>
<name>
<surname>Araujo</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Ahuja</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Single-layer MoS2 as an Efficient Photocatalyst</article-title>. <source>Catal. Sci. Technol.</source> <volume>3</volume> (<issue>9</issue>), <fpage>2214</fpage>&#x2013;<lpage>2220</lpage>. <pub-id pub-id-type="doi">10.1039/c3cy00207a</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.-L.</given-names>
</name>
<name>
<surname>Sa</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Ahuja</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Review of Two-Dimensional Materials for Photocatalytic Water Splitting from a Theoretical Perspective</article-title>. <source>Catal. Sci. Technol.</source> <volume>7</volume> (<issue>3</issue>), <fpage>545</fpage>&#x2013;<lpage>559</lpage>. <pub-id pub-id-type="doi">10.1039/c6cy02178f</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Pang</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Screening Stable and Metastable ABO3 Perovskites Using Machine Learning and the Materials Project</article-title>. <source>Comput. Mater. Sci.</source> <volume>177</volume>, <fpage>109614</fpage>. <pub-id pub-id-type="doi">10.1016/j.commatsci.2020.109614</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Long</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Fortunato</surname>
<given-names>N. M.</given-names>
</name>
<name>
<surname>Opahle</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Samathrakis</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <source>CCDCGAN: Inverse Design of Crystal Structures</source>. <comment>arXiv preprint arXiv:2007.11228</comment>.</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maeda</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Domen</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Photocatalytic Water Splitting: Recent Progress and Future Challenges</article-title>. <source>J.&#x20;Phys. Chem. Lett.</source> <volume>1</volume> (<issue>18</issue>), <fpage>2655</fpage>&#x2013;<lpage>2661</lpage>. <pub-id pub-id-type="doi">10.1021/jz1007966</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maitra</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Gupta</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>De</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Datta</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Govindaraj</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Rao</surname>
<given-names>C. N. R.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Highly Effective Visible-Light-Induced H2 Generation by Single-Layer 1T-MoS2 and a Nanocomposite of Few-Layer 2H-MoS2 with Heavily Nitrogenated Graphene</article-title>. <source>Angew. Chem. Int. Ed.</source> <volume>52</volume> (<issue>49</issue>), <fpage>13057</fpage>&#x2013;<lpage>13061</lpage>. <pub-id pub-id-type="doi">10.1002/anie.201306918</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>McGuire</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Crystal and Magnetic Structures in Layered, Transition Metal Dihalides and Trihalides</article-title>. <source>Crystals</source> <volume>7</volume> (<issue>5</issue>), <fpage>121</fpage>. <pub-id pub-id-type="doi">10.3390/cryst7050121</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Moniruddin</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ilyassov</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Serikov</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ibrayev</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Recent Progress on Perovskite Materials in Photovoltaic and Water Splitting Applications</article-title>. <source>Mater. Today Energy</source> <volume>7</volume>, <fpage>246</fpage>&#x2013;<lpage>259</lpage>. <pub-id pub-id-type="doi">10.1016/j.mtener.2017.10.005</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mounet</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Gibertini</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Schwaller</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Campi</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Merkys</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Marrazzo</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Two-dimensional Materials from High-Throughput Computational Exfoliation of Experimentally Known Compounds</article-title>. <source>Nat. Nanotechnol.</source> <volume>13</volume> (<issue>3</issue>), <fpage>246</fpage>&#x2013;<lpage>252</lpage>. <pub-id pub-id-type="doi">10.1038/s41565-017-0035-5</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Noh</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Stein</surname>
<given-names>H. S.</given-names>
</name>
<name>
<surname>Sanchez-Lengeling</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Gregoire</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Aspuru-Guzik</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Inverse Design of Solid-State Materials via a Continuous Representation</article-title>. <source>Matter</source> <volume>1</volume> (<issue>5</issue>), <fpage>1370</fpage>&#x2013;<lpage>1384</lpage>. <pub-id pub-id-type="doi">10.1016/j.matt.2019.08.017</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Novoselov</surname>
<given-names>K. S.</given-names>
</name>
<name>
<surname>Geim</surname>
<given-names>A. K.</given-names>
</name>
<name>
<surname>Morozov</surname>
<given-names>S. V.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Dubonos</surname>
<given-names>S. V.</given-names>
</name>
<etal/>
</person-group> (<year>2004</year>). <article-title>Electric Field Effect in Atomically Thin Carbon Films</article-title>. <source>Science</source> <volume>306</volume> (<issue>5696</issue>), <fpage>666</fpage>&#x2013;<lpage>669</lpage>. <pub-id pub-id-type="doi">10.1126/science.1102896</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Oktay</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Schlemper</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Folgoc</surname>
<given-names>L. L.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Heinrich</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Misawa</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Rueckert</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2018</year>). <source>Attention U-Net: Learning where to Look for the Pancreas</source>. <comment>arXiv preprint arXiv:1804.03999</comment>.</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Osterloh</surname>
<given-names>F. E.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Inorganic Materials as Catalysts for Photochemical Splitting of Water</article-title>. <source>Chem. Mater.</source> <volume>20</volume> (<issue>1</issue>), <fpage>35</fpage>&#x2013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1021/cm7024203</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Osterloh</surname>
<given-names>F. E.</given-names>
</name>
<name>
<surname>Parkinson</surname>
<given-names>B. A.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Recent Developments in Solar Water-Splitting Photocatalysis</article-title>. <source>MRS Bull.</source> <volume>36</volume> (<issue>1</issue>), <fpage>17</fpage>&#x2013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1557/mrs.2010.5</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Prasad</surname>
<given-names>U.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>BiVO4-Based Photoanodes for Photoelectrochemical Water Splitting</article-title>. <source>Clean Energy Materials</source> <volume>1364</volume>, <fpage>137</fpage>&#x2013;<lpage>167</lpage>. <pub-id pub-id-type="doi">10.1021/bk-2020-1364.ch005</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Ren</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Noh</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Oviedo</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Xing</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Liang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Buonassisi</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2020</year>). <source>Inverse Design of Crystals Using Generalized Invertible Crystallographic Representation</source>. <comment>arXiv preprint arXiv:2005.07609</comment>.</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Roch</surname>
<given-names>L. M.</given-names>
</name>
<name>
<surname>H&#xe4;se</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Kreisbeck</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Tamayo-Mendoza</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yunker</surname>
<given-names>L. P.</given-names>
</name>
<name>
<surname>Hein</surname>
<given-names>J.&#x20;E.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>ChemOS: Orchestrating Autonomous Experimentation</article-title>. <source>Sci. Robotics</source> <volume>3</volume> (<issue>19</issue>). <pub-id pub-id-type="doi">10.1126/scirobotics.aat5559</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Rybkin</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Daniilidis</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Levine</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2020</year>). <source>Simple and Effective VAE Training with Calibrated Decoders</source>. <comment>arXiv preprint arXiv:2006.13202</comment>.</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saal</surname>
<given-names>J.&#x20;E.</given-names>
</name>
<name>
<surname>Kirklin</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Aykol</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Meredig</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Wolverton</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Materials Design and Discovery with High-Throughput Density Functional Theory: the Open Quantum Materials Database (OQMD)</article-title>. <source>JOM</source> <volume>65</volume> (<issue>11</issue>), <fpage>1501</fpage>&#x2013;<lpage>1509</lpage>. <pub-id pub-id-type="doi">10.1007/s11837-013-0755-4</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sigfusson</surname>
<given-names>T. I.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Pathways to Hydrogen as an Energy Carrier</article-title>. <source>Phil. Trans. R. Soc. A.</source> <volume>365</volume> (<issue>1853</issue>), <fpage>1025</fpage>&#x2013;<lpage>1042</lpage>. <pub-id pub-id-type="doi">10.1098/rsta.2006.1960</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>A. K.</given-names>
</name>
<name>
<surname>Mathew</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Zhuang</surname>
<given-names>H. L.</given-names>
</name>
<name>
<surname>Hennig</surname>
<given-names>R. G.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Computational Screening of 2D Materials for Photocatalysis</article-title>. <source>J.&#x20;Phys. Chem. Lett.</source> <volume>6</volume> (<issue>6</issue>), <fpage>1087</fpage>&#x2013;<lpage>1098</lpage>. <pub-id pub-id-type="doi">10.1021/jz502646d</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>A. K.</given-names>
</name>
<name>
<surname>Montoya</surname>
<given-names>J.&#x20;H.</given-names>
</name>
<name>
<surname>Gregoire</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Persson</surname>
<given-names>K. A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Robust and Synthesizable Photocatalysts for CO2 Reduction: a Data-Driven Materials Discovery</article-title>. <source>Nat. Commun.</source> <volume>10</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-08356-1</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sorkun</surname>
<given-names>M. C.</given-names>
</name>
<name>
<surname>Astruc</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Koelman</surname>
<given-names>J.&#x20;V. A.</given-names>
</name>
<name>
<surname>Er</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>An Artificial Intelligence-Aided Virtual Screening Recipe for Two-Dimensional Materials Discovery</article-title>. <source>npj&#x20;Comput. Mater.</source> <volume>6</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1038/s41524-020-00375-7</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Q.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Freestanding Tin Disulfide Single-Layers Realizing Efficient Visible-Light Water Splitting</article-title>. <source>Angew. Chem. Int. Ed.</source> <volume>51</volume> (<issue>35</issue>), <fpage>8727</fpage>&#x2013;<lpage>8731</lpage>. <pub-id pub-id-type="doi">10.1002/anie.201204675</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Lei</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>All-Surface-Atomic-Metal Chalcogenide Sheets for High-Efficiency Visible-Light Photoelectrochemical Water Splitting</article-title>. <source>Adv. Energ. Mater.</source> <volume>4</volume> (<issue>1</issue>), <fpage>1300611</fpage>. <pub-id pub-id-type="doi">10.1002/aenm.201300611</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tachibana</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Vayssieres</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Durrant</surname>
<given-names>J.&#x20;R.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Artificial Photosynthesis for Solar Water-Splitting</article-title>. <source>Nat. Photon.</source> <volume>6</volume> (<issue>8</issue>), <fpage>511</fpage>&#x2013;<lpage>518</lpage>. <pub-id pub-id-type="doi">10.1038/nphoton.2012.175</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="web">
<collab>The NOMAD (Novel Materials Discovery) Center of Excellence (CoE)</collab> (<year>2021</year>). <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://nomad-coe.eu">https://nomad-coe.eu</ext-link> (last accessed March 9, 2021)</comment>.</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Turner</surname>
<given-names>J.&#x20;A.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Sustainable Hydrogen Production</article-title>. <source>Science</source> <volume>305</volume> (<issue>5686</issue>), <fpage>972</fpage>&#x2013;<lpage>974</lpage>. <pub-id pub-id-type="doi">10.1126/science.1103197</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van der Maaten</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Hinton</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Visualizing Data Using t-SNE</article-title>. <source>J.&#x20;Mach. Learn. Res.</source> <volume>9</volume> (<issue>11</issue>), <fpage>2579</fpage>&#x2013;<lpage>2605</lpage>. </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van der Walt</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sch&#xf6;nberger</surname>
<given-names>J.&#x20;L.</given-names>
</name>
<name>
<surname>Nunez-Iglesias</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Boulogne</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Warner</surname>
<given-names>J.&#x20;D.</given-names>
</name>
<name>
<surname>Yager</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Gouillart</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Scikit-Image: Image Processing in Python</article-title>. <source>PeerJ</source> <volume>2</volume>, <fpage>e453</fpage>. <pub-id pub-id-type="doi">10.7717/peerj.453</pub-id> </citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Voiry</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Yamaguchi</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Silva</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Alves</surname>
<given-names>D. C. B.</given-names>
</name>
<name>
<surname>Fujita</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Enhanced Catalytic Activity in Strained Chemically Exfoliated WS2 Nanosheets for Hydrogen Evolution</article-title>. <source>Nat. Mater.</source> <volume>12</volume> (<issue>9</issue>), <fpage>850</fpage>&#x2013;<lpage>855</lpage>. <pub-id pub-id-type="doi">10.1038/nmat3700</pub-id> </citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lazic</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Hautier</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Persson</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ceder</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>First Principles High Throughput Screening of Oxynitrides for Water-Splitting Photocatalysts</article-title>. <source>Energ. Environ. Sci.</source> <volume>6</volume> (<issue>1</issue>), <fpage>157</fpage>&#x2013;<lpage>168</lpage>. <pub-id pub-id-type="doi">10.1039/c2ee23482c</pub-id> </citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xie</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Grossman</surname>
<given-names>J.&#x20;C.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Crystal Graph Convolutional Neural Networks for an Accurate and Interpretable Prediction of Material Properties</article-title>. <source>Phys. Rev. Lett.</source> <volume>120</volume> (<issue>14</issue>), <fpage>145301</fpage>. <pub-id pub-id-type="doi">10.1103/physrevlett.120.145301</pub-id> </citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Synthesis of Ultrathin CdS Nanosheets as Efficient Visible-Light-Driven Water Splitting Photocatalysts for Hydrogen Evolution</article-title>. <source>Chem. Commun.</source> <volume>49</volume> (<issue>84</issue>), <fpage>9803</fpage>&#x2013;<lpage>9805</lpage>. <pub-id pub-id-type="doi">10.1039/c3cc46342g</pub-id> </citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ye</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Dwaraknath</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Jain</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ong</surname>
<given-names>S. P.</given-names>
</name>
<name>
<surname>Persson</surname>
<given-names>K. A.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Harnessing the Materials Project for Machine-Learning and Accelerated Discovery</article-title>. <source>MRS Bull.</source> <volume>43</volume> (<issue>9</issue>), <fpage>664</fpage>&#x2013;<lpage>669</lpage>. <pub-id pub-id-type="doi">10.1557/mrs.2018.202</pub-id> </citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Computational Screening of 2D Materials and Rational Design of Heterojunctions for Water Splitting Photocatalysts</article-title>. <source>Small Methods</source> <volume>2</volume> (<issue>5</issue>), <fpage>1700359</fpage>. <pub-id pub-id-type="doi">10.1002/smtd.201700359</pub-id> </citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Computational Screening of Layered Materials for Multivalent Ion Batteries</article-title>. <source>ACS Omega</source> <volume>4</volume> (<issue>4</issue>), <fpage>7822</fpage>&#x2013;<lpage>7828</lpage>. <pub-id pub-id-type="doi">10.1021/acsomega.9b00482</pub-id> </citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Costa</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Persson</surname>
<given-names>K. A.</given-names>
</name>
<name>
<surname>Ong</surname>
<given-names>S. P.</given-names>
</name>
<name>
<surname>Huck</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>2DMatPedia, an Open Computational Database of Two-Dimensional Materials from Top-Down and Bottom-Up Approaches</article-title>. <source>Sci. Data</source> <volume>6</volume> (<issue>1</issue>), <fpage>1</fpage>&#x2013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1038/s41597-019-0097-3</pub-id> </citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhuang</surname>
<given-names>H. L.</given-names>
</name>
<name>
<surname>Hennig</surname>
<given-names>R. G.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Single-Layer Group-III Monochalcogenide Photocatalysts for Water Splitting</article-title>. <source>Chem. Mater.</source> <volume>25</volume> (<issue>15</issue>), <fpage>3232</fpage>&#x2013;<lpage>3238</lpage>. <pub-id pub-id-type="doi">10.1021/cm401661x</pub-id> </citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhuang</surname>
<given-names>H. L.</given-names>
</name>
<name>
<surname>Hennig</surname>
<given-names>R. G.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Computational Search for Single-Layer Transition-Metal Dichalcogenide Photocatalysts</article-title>. <source>J.&#x20;Phys. Chem. C</source> <volume>117</volume> (<issue>40</issue>), <fpage>20440</fpage>&#x2013;<lpage>20445</lpage>. <pub-id pub-id-type="doi">10.1021/jp405808a</pub-id> </citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zou</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Noble Metal-Free Hydrogen Evolution Catalysts for Water Splitting</article-title>. <source>Chem. Soc. Rev.</source> <volume>44</volume> (<issue>15</issue>), <fpage>5148</fpage>&#x2013;<lpage>5180</lpage>. <pub-id pub-id-type="doi">10.1039/c4cs00448e</pub-id> </citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tomar</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ghosh</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Mardanya</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Rastogi</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Bhadoria</surname>
<given-names>B. S.</given-names>
</name>
<name>
<surname>Chauhan</surname>
<given-names>Y. S.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Intrinsic Magnetism in Monolayer Transition Metal Trihalides: A Comparative Study</article-title>. <source>J.&#x20;Magn. Magn. Mater.</source> <volume>489</volume>, <fpage>165384</fpage>. <pub-id pub-id-type="doi">10.1016/j.jmmm.2019.165384</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>