<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Chem.</journal-id>
<journal-title>Frontiers in Chemistry</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Chem.</abbrev-journal-title>
<issn pub-type="epub">2296-2646</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">908572</article-id>
<article-id pub-id-type="doi">10.3389/fchem.2022.908572</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Chemistry</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A Comprehensive Database for DNA Adductomics</article-title>
<alt-title alt-title-type="left-running-head">La Barbera et al.</alt-title>
<alt-title alt-title-type="right-running-head">A Comprehensive Database for DNA Adductomics</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>La Barbera</surname>
<given-names>Giorgia</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/1744677/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Nommesen</surname>
<given-names>Katrine Dalmo</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Cuparencu</surname>
<given-names>Catalina</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Stanstrup</surname>
<given-names>Jan</given-names>
</name>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Dragsted</surname>
<given-names>Lars Ove</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/687315/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>Department of Nutrition Exercise and Sports</institution>, <institution>University of Copenhagen</institution>, <addr-line>Frederiksberg</addr-line>, <country>Denmark</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/621228/overview">Chiung-Wen Hu</ext-link>, Chung Shan Medical University, Taiwan</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/659865/overview">Peter William Villalta</ext-link>, University of Minnesota Twin Cities, United States</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/143210/overview">Yukari Totsuka</ext-link>, National Cancer Centre, Japan</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Lars Ove Dragsted, <email>ldra@nexs.ku.dk</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Analytical Chemistry, a section of the journal Frontiers in Chemistry</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>27</day>
<month>05</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>10</volume>
<elocation-id>908572</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>03</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>22</day>
<month>04</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 La Barbera, Nommesen, Cuparencu, Stanstrup and Dragsted.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>La Barbera, Nommesen, Cuparencu, Stanstrup and Dragsted</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The exposure of human DNA to genotoxic compounds induces the formation of covalent DNA adducts, which may contribute to the initiation of carcinogenesis. Liquid chromatography (LC) coupled with high-resolution mass spectrometry (HRMS) is a powerful tool for DNA adductomics, a new research field aiming at screening known and unknown DNA adducts in biological samples. The lack of databases and bioinformatics tool in this field limits the applicability of DNA adductomics. Establishing a comprehensive database will make the identification process faster and more efficient and will provide new insight into the occurrence of DNA modification from a wide range of genotoxicants. In this paper, we present a four-step approach used to compile and curate a database for the annotation of DNA adducts in biological samples. The first step included a literature search, selecting only DNA adducts that were unequivocally identified by either comparison with reference standards or with nuclear magnetic resonance (NMR), and tentatively identified by tandem HRMS/MS. The second step consisted in harmonizing structures, molecular formulas, and names, for building a systematic database of 279 DNA adducts. The source, the study design and the technique used for DNA adduct identification were reported. The third step consisted in implementing the database with 303 new potential DNA adducts coming from different combinations of genotoxicants with nucleobases, and reporting monoisotopic masses, chemical formulas, .cdxml files, .mol files, SMILES, InChI, InChIKey and IUPAC nomenclature. In the fourth step, a preliminary spectral library was built by acquiring experimental MS/MS spectra of 15 reference standards, generating <italic>in silico</italic> MS/MS fragments for all the adducts, and reporting both experimental and predicted fragments into interactive web datatables. The database, including 582 entries, is publicly available (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database</ext-link>). This database is a powerful tool for the annotation of DNA adducts measured in (HR)MS. The inclusion of metadata indicating the source of DNA adducts, the study design and technique used, allows for prioritization of the DNA adducts of interests and/or to enhance the annotation confidence. DNA adducts identification can be further improved by integrating the present database with the generation of authentic MS/MS spectra, and with user-friendly bioinformatics tools.</p>
</abstract>
<kwd-group>
<kwd>DNA adduct</kwd>
<kwd>database</kwd>
<kwd>mass spectrometry</kwd>
<kwd>toxicology</kwd>
<kwd>carcinogenesis</kwd>
<kwd>identification</kwd>
</kwd-group>
<contract-sponsor id="cn001">H2020 Marie Sk&#x142;odowska-Curie Actions<named-content content-type="fundref-id">10.13039/100010665</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">Carlsbergfondet<named-content content-type="fundref-id">10.13039/501100002808</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Systems toxicology covers the totality of exposures negatively affecting a living organism. It integrates the exposure to toxic chemicals that enter the human body from exogenous sources such as pollutants, food, and drugs, with the exposure to harmful endogenous chemicals derived from cellular metabolism or other processes, including inflammation, oxidative stress, microbial metabolism, and infection. Exposures of many kinds and biological responses of an organism can nowadays be investigated in great detail thanks to the development of new powerful technologies such as metabolomics (<xref ref-type="bibr" rid="B31">Scalbert et al., 2014</xref>), proteomics and transcriptomics, (<xref ref-type="bibr" rid="B17">Heijne et al., 2005</xref>). The exposure of human DNA to genotoxic compounds induces the formation of covalent DNA adducts which, if not repaired, can lead to gene mutation, possibly initiating the process of carcinogenesis. Given the broadness of the chemical exposome, DNA adducts derive from a large number of compounds, with a wide variety of chemical properties, e.g., polycyclic aromatic hydrocarbons (PAH), nitrosamines and other alkylating agents (<xref ref-type="bibr" rid="B26">Ma et al., 2019</xref>), heterocyclic aromatic amines (HAA) (<xref ref-type="bibr" rid="B37">Turesky and Le Marchand, 2011</xref>), reactive oxygen species (ROS) (<xref ref-type="bibr" rid="B44">Yu et al., 2016</xref>) and many others (<xref ref-type="bibr" rid="B20">Hemeryck and Vanhaecke, 2016</xref>). However, many unknown DNA adducts are likely to exist and could help to identify new carcinogenic compounds and their sources. The measurement of DNA adducts is of fundamental importance in assessing the potential carcinogenic effects of toxic compounds from diet and environment and in understanding their mechanisms of action, ultimately leading to the development of new diagnostic, preventive and therapeutic approaches towards cancer. DNA adductomics is a new -omics science that covers the comprehensive measurement of DNA adducts, being a promising technique in systems toxicology (<xref ref-type="bibr" rid="B3">Balbo et al., 2014</xref>).</p>
<p>Over the past 30&#xa0;years, several analytical methods have been developed to measure DNA adducts. While immunochemical methods and <sup>32</sup>P-postlabeling techniques have been used extensively, liquid chromatography coupled with mass spectrometry (LC-MS) has become the gold standard, as it provides additional information about the chemical identity of the adducts. These analyses are usually targeting one or a few DNA adducts per assay, while failing to provide a global picture of the &#x201c;DNA-adductome,&#x201d; i.e., the totality of DNA adducts present in a biological sample. An untargeted approach, on the other hand, would allow the simultaneous profiling of thousands of adducts covering the adductome and in turn enabling the identification of unknown DNA adducts. The relatively recent development of routine high-resolution mass spectrometry (HRMS) allows the identification of unknown adducts with a high confidence level due to high mass accuracy. Therefore, ultra-high performance (UHP)LC-HRMS holds promise to open new horizons in the screening of both known and unknown DNA adducts (<xref ref-type="bibr" rid="B19">Hemeryck et al., 2016</xref>; <xref ref-type="bibr" rid="B38">Villalta and Balbo, 2017</xref>; <xref ref-type="bibr" rid="B14">Guo and Turesky, 2019</xref>).</p>
<p>A considerable bottleneck in the application of HRMS to detect DNA adducts is the assignment of identities to the thousands of features that are detected in a typical untargeted HRMS analysis. Although this has been a challenge also in the application of untargeted metabolomics, open-source MS-based databases and spectral libraries have been a great help for the community to annotate and interpret metabolomics data. Such resources include the Kyoto Encyclopedia of Genes and Genomes (KEGG) (<xref ref-type="bibr" rid="B23">Kanehisa and Goto, 2000</xref>), METLIN (<xref ref-type="bibr" rid="B13">Guijas et al., 2018</xref>), Human Metabolome Database (HMDB) (<xref ref-type="bibr" rid="B43">Wishart et al., 2022</xref>), and MassBank (<xref ref-type="bibr" rid="B21">Horai et al., 2010</xref>). However, these databases either do not include DNA adducts or report only a few of them. The application of DNA adductomics is therefore hindered by the lack of available databases, mass spectral libraries and software for identification of DNA adducts; such tools are highly needed for the application of DNA adductomics in future studies. In previous papers, a few databases including either diet-related DNA adducts (<xref ref-type="bibr" rid="B18">Hemeryck et al., 2015</xref>), DNA adducts deriving from alkylation, lipid peroxidation products (LPO) and ROS (<xref ref-type="bibr" rid="B4">Carr&#xe0; et al., 2019a</xref>), or bulky hydrophobic DNA adducts (<xref ref-type="bibr" rid="B16">Guo et al., 2017</xref>), have been reported. This represents substantial work, but some of these databases are limited to specific classes of genotoxicants, some lack details on the structures and analytical techniques used for identification, and some lack information about the design of the experiment performed. The lack of a comprehensive database including all the different classes of DNA adducts with uniform consensus on names and molecular formulas, limits the application of DNA adductomics, as already highlighted recently (<xref ref-type="bibr" rid="B15">Guo et al., 2020</xref>). In this context, the goal of our work is to start, develop and curate a freely available, comprehensive, DNA adduct database, which can be further developed by other contributors with new DNA adducts and completed with generated MS/MS spectra, ultimately resulting in the creation of a proper spectral library for the DNA adductomics community.</p>
<p>With this work, we start off the DNA adduct database by compiling existing databases and existing data retrieved from an extensive literature search. We performed data curation and standardization of the DNA adducts in terms of names, molecular formulas, structures and sources. We reported DNA adducts together with the technique used for identification, the experimental study design and relation to known genotoxicants and sources. We implemented the DNA adduct database with further possible DNA adducts deriving from potential genotoxicants and their combination with all nucleobases. Finally, we included <italic>in-silico</italic> MS/MS fragments for all the adducts and experimental MS/MS fragments for available reference standards. Such a database can be used to compare the measured mass with exact masses of known and suspected DNA adducts and to compare MS fragments with <italic>in silico</italic> generated fragments and experimental MS/MS spectra, representing a valuable tool for supporting the annotation of DNA adducts in untargeted approaches. In addition it provides metadata and information on the plausibility of DNA adducts in terms of source, detection technique and experiment, enhancing the annotation confidence of DNA adducts.</p>
</sec>
<sec id="s2">
<title>2 Materials and Methods</title>
<sec id="s2-1">
<title>2.1 Literature Search</title>
<p>Two preliminary searches were performed in PubMed and Scopus to retrieve: 1) existing DNA adduct databases and 2) reviews of genotoxicant classes that can form DNA adducts. For the first preliminary search a combination of the search terms (Database OR Screening) and (DNA adduct) were used, whereas for the latter the search terms used were (Cancer OR Carcinogenesis) AND (DNA adduct) AND (Review). The two searches were limited to articles published between 2010 and 2020, in English language. A systematic literature search was then performed addressing DNA adducts reported in connection to the following <italic>genotoxicant classes/sources</italic>: mycotoxins, pyrrolizidine alkaloids, aromatic amines (AA), furans, N-nitroso compounds (NOC), PAHs, acrylamide, aldehydes, alcohol, tobacco, pollution, heated foods, meat/red meat/processed meat, and herbs and spices. Individual searches were performed for each genotoxicant class/source among all types of publications, except reviews, in English language. The search terms included: (DNA adduct OR thymine OR thymidine OR cytosine OR deoxycytidine OR guanine OR deoxyguanosine OR adenine OR deoxyadenosine) AND <italic>genotoxicant class/source</italic> AND (Cancer OR Carcinogenesis) AND NOT (Review). The publications were screened for title and abstract, and if they fulfilled the following specific inclusion criteria, they were kept: a) <italic>in vitro</italic> studies, where carcinogens were incubated with either nucleobases, deoxynucleosides, DNA from Calf Thymus and others, or with cells; b) <italic>in vivo</italic> studies, where animals were dosed with carcinogens and DNA from target tissues analysed; c) human studies where both target tissue and surrogates were analysed. Publications including cyclic adducts (<xref ref-type="bibr" rid="B44">Yu et al., 2016</xref>), cross-linked adducts (<xref ref-type="bibr" rid="B35">Stornetta et al., 2015</xref>; <xref ref-type="bibr" rid="B22">Hu et al., 2019</xref>) and phosphate adducts (<xref ref-type="bibr" rid="B27">Ma et al., 2018</xref>) were excluded for this first version of the database. A manual secondary search investigating the reference lists of included publications was added. After removal of duplicates, the full text of the articles was read.</p>
</sec>
<sec id="s2-2">
<title>2.2 DNA Adduct Database Curation and Harmonization</title>
<p>The DNA adducts resulting from the literature search were included in the database if their structure was confirmed with either an authentic reference standard or with nuclear magnetic resonance (NMR). DNA adducts were also included if identified by HRMS coupled with tandem MS/MS, and were labeled in the database as &#x201c;tentatively identified&#x201d;. Not all references resulting from the literature search were referenced in the database. Instead, for each DNA adduct, we prioritized publications from c) human studies &#x3e; b) <italic>in vivo</italic> &#x3e; a) <italic>in vitro</italic>, and we prioritized MS-based techniques over others. For example, if one adduct was reported in all types of studies, we only referenced two to three papers from b) and c). The names, molecular formulas and structures of the DNA adducts were checked across referenced publications and harmonized. Each DNA adduct was reported with a unique name and abbreviation; for some adducts an alternate name was also provided as commonly used in the literature. A detailed description of rules applied here for the adducts names is given in the Supplementary Material (<xref ref-type="sec" rid="s11">Supplementary Table S1</xref>) The structure of each DNA adduct, was drawn and its corresponding molecular formula and monoisotopic mass calculated using ChemDraw v.19.1.0.8 (Perkin Elmer Informatics, Waltham, MA). All the DNA adducts were reported in their deoxynucleoside form (nucleobase plus deoxyribose -dR), regardless of how they were reported in the original paper. The structures of the DNA adducts in which the modification occurred on the N3 and N7 of dG, the N1, N3, or N7 of dA, and the N3 of dC were shown as positively charged. Positional isomers have been distinguished and considered as separate entries in the database. Diastereoisomers and enantiomers have not been distinguished in order to avoid an excessive number of interchangeable entries. Additional details per DNA adduct covered in the database include: 1) source and causative genotoxicant, 2) experimental conditions or biological sample analysed, 3) technique used for detection and identification, and 4) the corresponding publication. The source and causative genotoxicant were reported based on the corresponding publication and/or other publications, making sure that different literature sources were in agreement, whereas experimental conditions and technique used for identification are specific to the primary publication. One unique entry was associated to each DNA adduct and they were ordered according to their chemical class. The DNA adduct database with metadata (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Word format</ext-link>) was uploaded in.docx format in a freely accessible online repository (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database</ext-link>).</p>
</sec>
<sec id="s2-3">
<title>2.3 Implementation of the DNA Adduct Database</title>
<p>A simplified version of the DNA adduct database was converted to an .xlsx file. The file includes information on name, abbreviation, alternative name, source of the DNA adduct, molecular formula and monoisotopic mass of reported DNA adducts. Additional &#x201c;suspected&#x201d; DNA adducts deriving from all the possible theoretical combinations of potential genotoxicants with the four nucleobases were added to the database; their structures and molecular formula were drawn and calculated using ChemDraw v.19.1.0.8. The charged monoisotopic masses [M &#x2b; H]<sup>&#x2b;</sup> or [M]<sup>&#x2b;</sup> and [M &#x2b; H-dR]<sup>&#x2b;</sup> or [M-dR]<sup>&#x2b;</sup> were also calculated for each DNA adduct. The database was then implemented to provide searchable data with computer readable identifiers. In particular, the ChemDraw files (.cdxml) were converted into.mol files and IUPAC nomenclature using ChemScript (PerkinElmer Informatics). The.mol files were subsequently converted to .SDF, SMILES, InChI, InChIKey using Open Babel 3.1.1 (<xref ref-type="bibr" rid="B28">O&#x2019;Boyle et al., 2011</xref>). IUPAC nomenclature, SMILES, InChI, InChIKey were included in the .xlsx file. Hyperlinks to the .cdxml, the .mol files and the reference (for the adducts found in the literature), were included in the .xlsx file. Lastly, the .xlsx, .cdxml, and .mol, have been zipped and uploaded in the repository (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Excel format</ext-link>).</p>
</sec>
<sec id="s2-4">
<title>2.4 Building a Preliminary Spectral Library</title>
<sec id="s2-4-1">
<title>2.4.1 Chemicals and Materials</title>
<p>Milli-Q ultra-pure water (Merck Life Sciences, S&#xf8;borg, Denmark), methanol optima LC/MS grade from Thermo Fisher Scientific (Waltham, MA), and ammonium bicarbonate from Merck (St. Louis, MO.) were used for the UHPLC analysis. The following DNA adducts reference standards, were purchased from Toronto Research Chemicals: 2&#x2032;-deoxy-N<sup>6</sup>-methyladenosine (N<sup>6</sup>-Me-dA); 5-methyl-2&#x2032;-deoxycytidine (5-Me-dC); O<sup>6</sup>-methyl-2&#x2032;-deoxyguanosine (O<sup>6</sup>-Me-dG); N3-methylthymidine (3-Me-dT); N<sup>4</sup>,5-dimethyldeoxycytidine (N<sup>4</sup>,5-DiMe-dC); N<sup>2</sup>-ethyl-2&#x2032;-deoxyguanosine (N<sup>2</sup>-ethyl-dG); N<sup>6</sup>-(2-hydroxyethyl)-2&#x2032;-deoxyadenosine (N<sup>6</sup>-(2-OH-ethyl)-dA); 8-oxo-2&#x2032;deoxyguanosine (8-oxo-dG); etheno-2&#x2032;-deoxy-&#x3b2;-D-adenosine (1,N<sup>6</sup>-&#x3b5;-dA); 3,N<sup>4</sup>-etheno-2&#x2032;-deoxycytidine (3,N<sup>4</sup>-&#x3b5;-dC); 3-(2-deoxy-&#x3b2;-D-erythro-pentofuranosyl)pyrimido [1,2-a]purin-10(3H)-one (M1-dG); 3-(2-Deoxy-&#x3b2;-D-erythro-pentofuranosyl)-3,5-dihydropyrimido [1,2-a]purine-6,10-dione (6-Oxo-M1-dG); &#x3b3;-Hydroxy-1,N2-propano-2&#x2032;-deoxyguanosine (1,N<sup>2</sup>-&#x3b3;-OH-P-dG) (Acr-1I-dG); N-(2&#x2032;-deoxyguanosin-8-yl)-4-aminobiphenyl (8-ABP-dG); and N2-(deoxyguanosin-8-yl)-2-amino-3,8-dimethylimidazo [4,5-f] quinoxaline (8-MeIQx-dG). Stock solutions of the DNA adduct standards were dissolved at 1 or 0.5&#xa0;mg&#xa0;ml<sup>&#x2212;1</sup> in methanol, or a mixture of water and methanol. The working solutions were diluted with water to 100&#xa0;ng&#xa0;ml<sup>&#x2212;1</sup>.</p>
</sec>
<sec id="s2-4-2">
<title>2.4.2 Acquisition of Tandem Mass Spectrometry Spectra</title>
<p>The analysis of the DNA adduct standards was performed on an H class Acquity UHPLC coupled to a Vion-IMS-qTOF (Waters, Milford, MA) <italic>via</italic> ESI source. The UHPLC system was equipped with a quaternary pump and an autosampler thermostated at 10&#xb0;C. A C18 HSS T3 column (100 &#xd7; 2.1&#xa0;mm, 1.8&#xa0;&#x3bc;m particle size) (Waters) was used at 0.4&#xa0;ml&#xa0;min<sup>&#x2212;1</sup>, at 50&#xb0;C and with the following gradient: 0&#x2013;1&#xa0;min (5% B), 1&#x2013;21&#xa0;min (0&#x2013;99% B), followed by a 2&#xa0;min wash at 99% B and 2&#xa0;min equilibration at 5% B where A) was H<sub>2</sub>O with 10&#xa0;mM NH<sub>4</sub>HCO<sub>3</sub> and B) MeOH with 10&#xa0;mM NH<sub>4</sub>HCO<sub>3</sub>. The tuning parameters of the Vion-IMS-qTOF were: capillary voltage 0.5&#xa0;kV; sampling cone voltage 20&#xa0;V; source temperature 110&#xb0;C; desolvation temperature 600&#xb0;C; desolvation gas 800 (L/h); collision energy 6&#xa0;eV; cone gas 50 (L/h). The detector voltage was set to 3000&#xa0;V. The Vion-IMS-Q-TOF was operated in MS/MS acquisition mode with a scan time 0.4 s, in positive polarity. Three MS/MS spectra were acquired at 20, 40, and 60&#xa0;eV. The mass spectrometer was externally calibrated using the calibration solution Major Mix (Waters). Lock mass correction was applied continuously during the run by injecting 15&#xa0;&#x3bc;L&#xa0;min<sup>&#x2212;1</sup> of 100&#xa0;ng&#xa0;ml<sup>&#x2212;1</sup> leucine/enkephalin (Waters) every 5&#xa0;s. Raw data files were acquired by UNIFI software (version 1.9.4.053) (Waters) and they were exported as .xlsx files.</p>
</sec>
<sec id="s2-4-3">
<title>2.4.3 Creation of a Searchable Online DNA Adduct Database</title>
<p>The information contained in the .zip file (point 2.3) was converted into a user friendly web-based searchable database to facilitate an easy and straight forward search within all the categories (columns) of the database, i.e., name, short name, alternative name, source, monoisotopic masses, SMILES, InChI, InChIKey,.mol file and .cdxml file. For this, R Project for Statistical Computing software (<xref ref-type="bibr" rid="B30">R Core Team, 2020</xref>) version 4.1.1 together with the R package DT (<ext-link ext-link-type="uri" xlink:href="https://github.com/rstudio/DT">https://github.com/rstudio/DT</ext-link>) and a number of Tidyverse R packages (<xref ref-type="bibr" rid="B41">Wickham et al., 2019</xref>) were used to create the interactive and searchable datatable (<ext-link ext-link-type="uri" xlink:href="https://datatables.net/">https://datatables.net</ext-link>) that included all of the above mentioned information [<ext-link ext-link-type="uri" xlink:href="https://nexs-metabolomics.gitlab.io/projects/dna_adductomics_database/index.html">The DNA adduct database (online)</ext-link>]. <italic>In silico</italic> fragments were generated for all the adducts at three energy levels by using the docker image (<ext-link ext-link-type="uri" xlink:href="https://hub.docker.com/r/wishartlab/cfmid">https://hub.docker.com/r/wishartlab/cfmid</ext-link>) of CFM-ID (<xref ref-type="bibr" rid="B39">Wang et al., 2021</xref>), and the fragments above 5% intensity were included in a datatable [<ext-link ext-link-type="uri" xlink:href="https://nexs-metabolomics.gitlab.io/projects/dna_adductomics_database/predicted.html">The database of predicted fragments (online)</ext-link>]. Finally, a separate datatable with all experimental fragments [<ext-link ext-link-type="uri" xlink:href="https://nexs-metabolomics.gitlab.io/projects/dna_adductomics_database/experimental.html">The database of experimental fragments (online)</ext-link>] was created. A single entry was created for each fragment ion for improving searchability. An overview of the steps included in the creation of this database is provided in <xref ref-type="fig" rid="F1">Figure 1</xref>.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Overview of the steps included in the creation of the DNA adduct database.</p>
</caption>
<graphic xlink:href="fchem-10-908572-g001.tif"/>
</fig>
</sec>
</sec>
</sec>
<sec id="s3">
<title>3 Results</title>
<sec id="s3-1">
<title>3.1 Literature Search</title>
<p>The preliminary literature search aimed at retrieving existing DNA adduct databases resulted in 390 hits in Pubmed and 377 in Scopus, of which three articles were selected after reading through title, abstract and text. The three publications (<xref ref-type="bibr" rid="B18">Hemeryck et al., 2015</xref>; <xref ref-type="bibr" rid="B16">Guo et al., 2017</xref>; <xref ref-type="bibr" rid="B4">Carr&#xe0; et al., 2019a</xref>) were used to build a preliminary DNA adduct database. The second preliminary search, conducted to reveal genotoxicant classes that can form DNA adducts, resulted in 88 review articles in Pubmed and 423 in Scopus. After reading through title and abstract, many genotoxicant classes and sources were found, that were classified in 1) eight larger genotoxicant classes: mycotoxins, pyrrolizidine alkaloids, AA (incl. HAAs), furans (incl. cis-butene-dial, furfuryl alcohol and hydroxymethyl-furfural), NOC (incl. nitrosamines and N-nitroso-pyrrolidine), PAHs, acrylamide, aldehydes (incl. several &#x3b1;,&#x3b2;-unsaturated aldehydes, formaldehyde, acetaldehyde, malondialdehyde, crotonaldehyde, glyoxal, acrolein) and 2) and six larger sources: alcohol, tobacco, pollution (i.e., environmental, industrial and chemical contaminants), heated foods (i.e., charred or grilled meats, cereals and others), meat/red meat/processed meat, and herbs and spices. Additional adducts listed in database arose from secondary searches and include adducts deriving from additional eight genotoxicant classes (incl. ROS, LPO, reactive nitrogen species (RNS), estrogens, alkylating agents, halogenation products, aristolochic acid and alkenylbenzenes) and three sources or processes (incl. endogenous processes, UV irradiation, plant-based foods). An overview of the individual literature searches is given in <xref ref-type="sec" rid="s11">Supplementary Table S2</xref>. In total, 279 DNA adducts have been included in the database. A total of 132 publications have been cited in the database following our decision to include only up to three papers per adduct, prioritizing human studies over <italic>in vivo</italic> over <italic>in vitro</italic> studies.</p>
</sec>
<sec id="s3-2">
<title>3.2 The DNA Adduct Database</title>
<p>The names of the DNA adducts were found to be heterogeneous across different publications. In addition, many DNA adducts derive from multiple genotoxicants and/or exposures. Therefore, after removal of duplicates and standardization of identities and structures, the final DNA adduct database includes 279 structures. The DNA adduct database with metadata is available in.docx format both in the repository (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Word format</ext-link>) and in the Supplementary Material (<xref ref-type="sec" rid="s11">Supplementary Table S3</xref>). The mass range of the DNA adducts reported in the database is between 220 and 640&#xa0;Da, and the distribution did not change after implementation with suspected DNA adducts, as shown in <xref ref-type="fig" rid="F2">Figure 2A</xref>. Although priority was given to publications deriving from c) human studies &#x3e; b) <italic>in vivo</italic> animal studies &#x3e; a) <italic>in vitro</italic> studies, the studies were equally distributed among <italic>in vitro</italic> (38%), <italic>in vivo</italic> (35%) and human studies (27%) as shown in <xref ref-type="fig" rid="F2">Figure 2B</xref>. The majority of <italic>in vitro</italic> studies were performed on DNA from calf thymus and few others (26%). The animal tissues that were studied the most were liver (18%), lung (12%) and colorectal (7%) tissue. The majority of the DNA adducts in humans were found in target tissues (77%), i.e., related to the tissue where cancer develops, rather than surrogate samples (23%) (i.e., blood, urine, saliva). As shown in <xref ref-type="fig" rid="F2">Figure 2C</xref>, most of the adducts were analyzed by using LC (94%) rather than GC or TLC. The advanced 2D-LC, nano-LC, and capillary LC account for 20% of the chromatographic techniques. As shown in <xref ref-type="fig" rid="F2">Figure 2D</xref>, 14% of DNA adducts were identified by UV, fluorescence spectroscopy or immunoanalysis. The <sup>32</sup>P-postlabeling method was used for 5% of DNA adducts. 12% of DNA adducts were identified with NMR. The majority of DNA adducts (78%) were identified by MS. Only 16% were identified by HRMS coupled to tandem MS/MS. However, half of them were only tentatively identified since they were not confirmed with reference standards. As shown in <xref ref-type="fig" rid="F2">Figure 2E</xref>, the genotoxicants responsible of adduct formation were grouped in 16 different classes. The majority of DNA adducts derived from ROS (22%), alkylation (13%), and aldehydes (19%). As shown in <xref ref-type="fig" rid="F2">Figure 2F</xref>, the different sources of DNA adducts were grouped in nine large classes, however some overlap. The majority of DNA adducts derived from endogenous sources (24%), tobacco (24%) followed by heated, cooked and grilled food (15%).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>
<bold>(A)</bold> mass range distribution of the DNA adduct database before and after implementation with suspected DNA adducts; <bold>(B)</bold> samples where the DNA adducts have been analyzed; <bold>(C)</bold> chromatographic technique used for DNA adduct analysis; <bold>(D)</bold> technique used for DNA adduct identification; <bold>(E)</bold> DNA adduct causative genotoxicants grouped in 16 classes where &#x201c;aromatic amines&#x201d; includes heterocyclic aromatic amines, &#x201c;LPO&#x201d; excludes aldehydes, i.e., &#x3b1;,&#x3b2; unsaturated aldehydes, malondialdehyde, formaldehyde, glyoxal, acrolein, crotonaldehyde, &#x201c;NOC&#x201d; includes N-nitroso pyrrolidine and nitrosamine; <bold>(F)</bold> sources of the causative genotoxicants grouped in nine classes. Abbreviations: LC, liquid chromatography; MS, mass spectrometry; capLC, capillary LC; FLNS, fluorescence spectrometry ; GC, gas chromatography; HRMS, high resolution MS; MS/MS, tandem mass spectrometry; NMR, nuclear magnetic resonance; TLC, thin layer chromatography; HM, human; AM, animal; LPO, lipid peroxidation product; NOC, N-nitroso compounds; PAH, polycyclic aromatic hydrocarbons; RNS, reactive nitrogen species; ROS, reactive oxygen species.</p>
</caption>
<graphic xlink:href="fchem-10-908572-g002.tif"/>
</fig>
</sec>
<sec id="s3-3">
<title>3.3 Implementation of the DNA Adduct Database</title>
<p>The DNA adduct database has been posted on the online repository (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Excel format</ext-link>) as a downloadable .xlsx file in order to create a practical MS tool for DNA adduct identification by us and others. The updated database includes 582 entries, 279 of which have been previously found in the literature and 303 are suspected DNA adducts. It is possible to distinguish the two classes of adducts by noticing that the suspected ones are not linked to any reference. Furthermore, the structure of the suspected DNA adducts describe only a plausible position, and the position of the modifications on the nucleobase has not been specified in the name. The number of dT adducts among the suspected adducts is considerably smaller than those derived from dG, dA and dC, since many DNA adducts cannot be formed by conjugation or modification of dT (e.g., etheno-dT cannot be formed because of the lack of an amino group).</p>
</sec>
<sec id="s3-4">
<title>3.4 Preliminary Spectral Library and Searchable DNA Adduct Database</title>
<p>The DNA adduct database including the precursor ions [M &#x2b; H]<sup>&#x2b;</sup> or [M]<sup>&#x2b;</sup> and the fragment ions deriving from -dR loss has been transformed into an interactive online database, for allowing search of monoisotopic masses [<ext-link ext-link-type="uri" xlink:href="https://nexs-metabolomics.gitlab.io/projects/dna_adductomics_database/index.html">The DNA adduct database (online)</ext-link>]. In addition the spectra of 15 DNA adduct reference standards were acquired at three different collision energies and were used for retrieving the most significant fragments of the DNA adducts and uploading them in a searchable web datatable [<ext-link ext-link-type="uri" xlink:href="https://nexs-metabolomics.gitlab.io/projects/dna_adductomics_database/experimental.html">The database of experimental fragments (online)</ext-link>]. The experimental MS/MS spectra and the <italic>in silico</italic> fragmentation spectra obtained with CFM-ID were compared and reported in the Supplementary Material (<xref ref-type="sec" rid="s11">Supplementary Figures S1&#x2013;S15</xref>). The comparison showed considerable overlap, indicating that the predicted fragments can be used for supporting DNA adduct annotation. Therefore, the predicted fragments were also reported in a separate datatable [<ext-link ext-link-type="uri" xlink:href="https://nexs-metabolomics.gitlab.io/projects/dna_adductomics_database/predicted.html">The database of predicted fragments (online)</ext-link>].</p>
</sec>
</sec>
<sec id="s4">
<title>4 Discussion</title>
<p>In this paper we introduce a newly curated and freely available database for DNA adductomics starting from an extensive literature search, and aiming to ultimately create a proper spectral library for the DNA adductomics community.</p>
<sec id="s4-1">
<title>4.1 Data Curation and Harmonization</title>
<p>Following the results of the literature search, there was a clear necessity for data curation and harmonization of DNA adducts structures, names and sources as these were often found in disagreement across different publications.</p>
<p>The structures of DNA adducts have been mainly reported in their nucleobase form in the older papers, where acidic hydrolysis of DNA and other approaches led to the breakage of the bond between -dR and the nucleobase. However, since most recent approaches use enzymatic hydrolysis of DNA, the DNA adducts are reported here in their deoxyribonucleoside form (DNA adduct with -dR).</p>
<p>Different structures were proposed in the literature for adducts where the modification of the nucleobase occurred on an N sitting on a double bond of the heterocycle (e.g., N3 and N7 of dG; N1, N3, or N7 of dA; the N3 of dC). Some DNA adducts have been drawn with a positive charge (<xref ref-type="bibr" rid="B36">Tang et al., 2019</xref>), some in the zwitterionic form by adding a negative charge in the structure, and some in their neutral form by saturating the double bond (<xref ref-type="bibr" rid="B11">Essigmann et al., 1979</xref>; <xref ref-type="bibr" rid="B25">Li et al., 1992</xref>). Since both the zwitterionic and saturated forms are chemically unlikely, we have reported all such DNA adducts in their most likely form with a positive charge. This has to be taken into account when calculating and analyzing the charged adducts [M &#x2b; H]<sup>&#x2b;</sup> and [M]<sup>&#x2b;</sup> by MS.</p>
<p>As shown in <xref ref-type="sec" rid="s11">Supplementary Table S4</xref>, where a comparison of three previously reported DNA adduct databases (<xref ref-type="bibr" rid="B18">Hemeryck et al., 2015</xref>; <xref ref-type="bibr" rid="B16">Guo et al., 2017</xref>; <xref ref-type="bibr" rid="B4">Carr&#xe0; et al., 2019a</xref>) has been carried out, different names were used for the same DNA adduct. In this work, a set of systematic rules was proposed for harmonizing names of DNA adducts as reported in <xref ref-type="sec" rid="s11">Supplementary Table S1</xref>. However, when it was not practical to apply these rules, the name commonly accepted in the scientific community was kept together with an alternative name (e.g., Cro-1I-dG &#x3d; 1,N<sup>2</sup>-(Me-OH-P)-dG, Cro-2II-dG &#x3d; N<sup>2</sup>-Paraldol-dG). The complexity in finding an agreement between different names led us to also report the IUPAC nomenclature.</p>
<p>Besides names and structures, inconsistency was also found regarding the sources of DNA adducts (see <xref ref-type="sec" rid="s11">Supplementary Table S4</xref>). In this database, the source was associated to DNA adducts based on literature reviews (<xref ref-type="bibr" rid="B2">Bakhiya and Appel, 2010</xref>; <xref ref-type="bibr" rid="B37">Turesky and Le Marchand, 2011</xref>; <xref ref-type="bibr" rid="B40">Wei and Yin, 2015</xref>; <xref ref-type="bibr" rid="B20">Hemeryck and Vanhaecke, 2016</xref>; <xref ref-type="bibr" rid="B24">Lee and Ryu, 2016</xref>; <xref ref-type="bibr" rid="B34">Stegelmeier et al., 2016</xref>; <xref ref-type="bibr" rid="B44">Yu et al., 2016</xref>; <xref ref-type="bibr" rid="B1">Alshannaq and Yu, 2017</xref>; <xref ref-type="bibr" rid="B26">Ma et al., 2019</xref>; <xref ref-type="bibr" rid="B45">Zhang et al., 2021</xref>; <xref ref-type="bibr" rid="B10">Eisenreich et al., 2021</xref>), or based on the specific publication where the adduct was identified. Both the causative genotoxin and its source have been reported (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Word format</ext-link>). Multiple sources were reported for some DNA adducts, since they may derive from different biological processes, foods or environmental factors. Several DNA adducts deriving from endogenous metabolic processes such as alkylation, oxidation and LPO may also be related to intake of different foods, or to various environmental exposures. Furthermore, several exogenous compounds can derive from different sources, such as PAHs and HAAs from smoking, pollution or charcoal grilling of meat (<xref ref-type="bibr" rid="B26">Ma et al., 2019</xref>). Due to the complexity and multiplicity of DNA adduct sources, the .xlsx database (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Excel format</ext-link>) was simplified by reporting only the causative genotoxin. As a result, the DNA adduct database was harmonized in terms of structures, names and sources, finding consensus among the literature references, thereby establishing a new starting point for future DNA adductomics studies and for a community effort to complete the database.</p>
</sec>
<sec id="s4-2">
<title>4.2 Plausibility of DNA Adduct Identification</title>
<p>The chemical and biological plausibility of the DNA adducts reported in the literature are major determinants for addition into the current database. Some entries in the previous databases and in the literature include 1) a lack of structural characterization of the DNA adduct, 2) no specification of the technique used for detection, or 3) no specification of the biological sample where the DNA adduct was formed; all of these limitations may compromise the reliability of DNA adduct identifications. MS has gradually become the technique of choice in DNA adductomics, because of its sensitivity and selectivity, which facilitates the elucidation of the chemical structure, especially when MS/MS and/or HRMS are employed (<xref ref-type="bibr" rid="B38">Villalta and Balbo, 2017</xref>). In this database, the majority of DNA adducts was identified by MS-based techniques, including low resolution MS, MS/MS and HRMS. However, DNA adducts identified by NMR, and DNA adducts identified by fluorescence, UV, immunoassays, and <sup>32</sup>P-postlabelling, after comparison with reference standards, were included as unequivocally identified. The only included DNA adducts that are not confirmed with reference standards are those identified by HRMS/MS, since the accurate measure of both the precursor and fragment ions increases the reliability of identification. However, these DNA adducts have been marked as &#x201c;tentatively identified&#x201d;. Additional DNA adducts tentatively identified by low resolution MS/MS can be eventually added in the future, if detailed fragmentation patterns are provided (<xref ref-type="bibr" rid="B6">Chang et al., 2021</xref>).</p>
<p>The biological plausibility of the identified DNA adducts depends on the chosen experimental approach. The DNA adducts included in our database derive from: a) <italic>in vitro</italic> studies, b) <italic>in vivo</italic> animal studies, c) human studies, prioritizing c) &#x3e; b) &#x3e; a) as rating the relevance of the adduct for humans (i.e., quality of the evidence). The detection of DNA adducts <italic>in vitro</italic> does not necessarily reflect the <italic>in vivo</italic> DNA adductome, and not all DNA adducts found in animals are observable and pro-carcinogenic in humans (<xref ref-type="bibr" rid="B8">Cohen and Arnold, 2011</xref>). Therefore, we rated their biological plausibility as c) &#x3e; b) &#x3e; a) and it can be used as an extra degree of screening for DNA adduct identification. However, a) and b) can still be relevant for monitoring potential new DNA adducts in humans. Due to the difficulty in obtaining unlimited amounts of target tissues from humans, 73% of DNA adducts were found <italic>in vitro</italic> and in animals, rather than in humans. The DNA adducts reported in humans were mostly found in target tissues, however 22% of DNA adducts were found in surrogate samples, such as blood and urine. DNA adducts found in urine belong to the class of DNA adduct-repair products and they are both excreted in their deoxynucleoside (with -dR) form and nucleobase form (without -dR) (<xref ref-type="bibr" rid="B9">Cooke et al., 2018</xref>; <xref ref-type="bibr" rid="B7">Chao et al., 2021</xref>). However, in this database they were all reported in the form of their original DNA adduct (with -dR) for consistency. Future updates of the database should consider differentiation of DNA adducts and their related DNA adduct-repair products. In conclusion, the chemical and biological information provided in the database should be evaluated for the reliability and plausibility of the DNA adducts reported in the literature.</p>
</sec>
<sec id="s4-3">
<title>4.3 An MS Tool for Identification</title>
<p>The current DNA adduct database was created not only to give a comprehensive overview of existing DNA adducts, but to offer a tool for targeted and untargeted LC-MS analysis, screening and annotation of DNA adducts. The database was implemented with new potential DNA adducts, providing additional molecular formulas and masses to screen for identifying unknown DNA adducts. The.xlsx file containing the 582 DNA adducts can be uploaded to software for MS data handling and annotation such as MZmine (<xref ref-type="bibr" rid="B29">Pluskal et al., 2010</xref>) or XCMS (<xref ref-type="bibr" rid="B32">Smith et al., 2006</xref>), and can be also used as inclusion list in data dependent acquisition (DDA) approaches (<xref ref-type="bibr" rid="B4">Carr&#xe0; et al., 2019a</xref>). Both ions and formulas have been included to enable the user to overcome different requirements in the software packages in terms of entry formats. In addition, this is the first database that provides specific structural information summarized in InChI, InChI key, SMILES and with a direct link to .chemdraw and.mol files. This information can be uploaded into any software for annotation and <italic>in silico</italic> fragmentation. The database has been made available in a data repository (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database</ext-link>) under Creative Commons as CC-BY 4.0, and it represents the first publicly available MS tool for DNA adduct annotation.</p>
</sec>
<sec id="s4-4">
<title>4.4 Tandem Mass Spectrometry Spectra and <italic>in Silico</italic> Prediction</title>
<p>The number of known and possible DNA adducts is large. As a consequence, the annotation process can produce a large number of false positives. DNA adduct identification requires comparison of multiple orthogonal properties, including accurate mass and retention time in a well defined LC system, preferably matching with authentic reference standards. This process is costly and often impossible due to the lack of DNA adduct standards. However, including MS/MS mass spectra is one of the main approaches for improving the likelihood of correct feature annotation. In this database, we included the MS/MS spectra acquired from a total of 15 DNA adducts, with the purpose of building the first DNA adduct spectral library. However, due to the limited number of experimental MS/MS spectra available, <italic>in silico</italic> generation of MS/MS spectra is the second best option. For this, we chose to use CFM-ID as we provide here some evidence for a good fit with the included MS/MS spectra of 15 DNA adduct standards. However, due to a certain degree of uncertainty in all predictions, the approach proposed by Carra&#x2019; et al., could be used in the future for generating more accurate <italic>in silico</italic> spectra (<xref ref-type="bibr" rid="B5">Carr&#xe0; et al., 2019b</xref>). The incorporation of fragment ions produced with CFM-ID, or other approaches, can be used as a screening tool for selecting the features that are most likely DNA adducts. However, if/when DNA adduct reference standards become available, we encourage all researchers working in the field to upload them in the current DNA adduct database, by following the contact information in the GitLab repository (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database</ext-link>).</p>
</sec>
<sec id="s4-5">
<title>4.5 The New DNA Adduct Database</title>
<p>The current database fills a critical gap in the DNA adductomics field. Previous works have gathered significant information from the literature for building DNA adduct databases (<xref ref-type="bibr" rid="B18">Hemeryck et al., 2015</xref>; <xref ref-type="bibr" rid="B16">Guo et al., 2017</xref>; <xref ref-type="bibr" rid="B4">Carr&#xe0; et al., 2019a</xref>). Hemeryck et al., reported 121 different diet-related DNA adducts in their in-house database. These were mainly DNA adducts originating from DNA alkylation, oxidation and lipoxidation, listed along with their relevant origin (<xref ref-type="bibr" rid="B18">Hemeryck et al., 2015</xref>). Carra&#x2019; et al. created a database of known endogenous DNA adducts deriving from alkylation, LPO and ROS. A total of 122 DNA adducts were reported in their database, including information on adducts name, chemical formula, [M &#x2b; H]<sup>&#x2b;</sup> mass, origin, literature reference and chemical structure (<xref ref-type="bibr" rid="B4">Carr&#xe0; et al., 2019a</xref>). Finally, Guo et al. built a database including 102 known bulky hydrophobic DNA adducts, mainly deriving from HAAs, AAs, PAHs and others. The DNA adducts were reported with their structure, their formula, the monoisotopic mass of the precursor [M &#x2b; H]<sup>&#x2b;</sup> and the major fragment [M &#x2b; H-dR]<sup>&#x2b;</sup>, and literature reference (<xref ref-type="bibr" rid="B16">Guo et al., 2017</xref>). This represents substantial initial work, but these databases include only a few classes of DNA adducts. The comprehensive database reported here (<xref ref-type="sec" rid="s11">Supplementary Table S3</xref> and <ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Word format</ext-link>) builds on previous works and reviews to include as many of the previously observed DNA adducts as possible. The literature search led to the identification of a total of 279 DNA adducts coming from different exogenous and endogenous exposures and especially focused on, but not limited to, diet and life-style habits. A comparison of the three previously mentioned databases and the current database has been carried out and reported in <xref ref-type="fig" rid="F3">Figure 3A</xref> and in <xref ref-type="sec" rid="s11">Supplementary Table S4</xref> in the Supplementary materials.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Venn diagrams obtained by comparison of the DNA adduct databases of (<xref ref-type="bibr" rid="B4">Carr&#xe0; et al., 2019a</xref>) (<xref ref-type="bibr" rid="B18">Hemeryck et al., 2015</xref>), (<xref ref-type="bibr" rid="B16">Guo et al., 2017</xref>) and the current database, <bold>(A)</bold> built upon literature search and <bold>(B)</bold> after implementation with suspected DNA adducts.</p>
</caption>
<graphic xlink:href="fchem-10-908572-g003.tif"/>
</fig>
<p>Due to the fact that the different databases are focusing on different classes of DNA adducts, they show a relatively limited overlap with each other (<xref ref-type="fig" rid="F3">Figure 3A</xref>). The overlap of the current database with the other three DNA adduct databases is instead considerable, and 92 additional DNA adducts have been retrieved from the literature. Fifty-four DNA adducts present in the previous databases are not included in the current database, since several DNA adducts were not found or were not identified at a sufficient level of confidence. Although relatively strict criteria were established for building the initial database (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Word format</ext-link>), some of the DNA adducts found in the literature but not sufficiently characterized were still included as suspected DNA adducts in the .xlsx database (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">The DNA adduct database in Excel format</ext-link>). In addition, all the possible combinations of genotoxicants with the four nucleobases were generated, by taking into consideration chemical plausibility. After implementation, a comprehensive database including 582 DNA adducts was obtained, with more than 300 new DNA adducts (<xref ref-type="fig" rid="F3">Figure 3B</xref>), meaning that this database will help in the identification of new DNA adducts having masses not reported or predicted yet, ultimately expanding the current knowledge on DNA adductomics. Some of the adducts reported in the other three databases were still not reported since they belong to classes that we decided to exclude from the database. We believe that these classes deserve a separate systematic work in the future (<xref ref-type="bibr" rid="B35">Stornetta et al., 2015</xref>; <xref ref-type="bibr" rid="B27">Ma et al., 2018</xref>; <xref ref-type="bibr" rid="B22">Hu et al., 2019</xref>). There are very likely additional DNA adducts not retrieved from this literature search, or that were not yet included because of insufficiently characterization. These adducts along with cyclic adducts, crosslinks, phosphate conjugated DNA adducts (<xref ref-type="bibr" rid="B35">Stornetta et al., 2015</xref>; <xref ref-type="bibr" rid="B44">Yu et al., 2016</xref>; <xref ref-type="bibr" rid="B27">Ma et al., 2018</xref>; <xref ref-type="bibr" rid="B22">Hu et al., 2019</xref>) and DNA adduct-repair products (<xref ref-type="bibr" rid="B9">Cooke et al., 2018</xref>), can be future additions to the DNA adduct database by the authors or other contributors in the community. The DNA adductome changes over time due to our increasing understanding of DNA adducts and their importance in toxicity and cancer development. Also, the higher number of compounds in exposome databases compared to the lower number of DNA adducts suggests a need for systematic untargeted approaches by including more and more DNA adducts into the overall DNA adductome chemical space (<xref ref-type="bibr" rid="B16">Guo et al., 2017</xref>; <xref ref-type="bibr" rid="B9">Cooke et al., 2018</xref>; <xref ref-type="bibr" rid="B42">Wilson et al., 2019</xref>; <xref ref-type="bibr" rid="B12">Guidolin et al., 2021</xref>; <xref ref-type="bibr" rid="B33">Sousa et al., 2021</xref>). The database described here should be intended as a first version of the DNA adduct database and spectral library proposed by Guo et al., where thirty-six collaborators worldwide agreed to provide DNA standards to populate the DNA adductome database (<xref ref-type="bibr" rid="B15">Guo et al., 2020</xref>). The current database is open-source and is projected as a constantly developing tool where other authors can, upon authorization from the administrators, add new DNA adducts, or additional information on physico-chemical properties (e.g., recorded spectra) or biological information such as induced genomic mutations and disease relationship with associated references. Throughout its usage and further development, research groups should be able to benefit from this database as a major advantage for the investigation of DNA adducts.</p>
</sec>
</sec>
<sec id="s5">
<title>5 Conclusion</title>
<p>In this paper we established a database to be used for screening of DNA adducts in biological samples using untargeted HRMS, and thus providing a resource for chemical annotation of the DNA adductome. The database was built by aggregating and curating existing DNA adduct databases, and integrating with DNA adducts found through an extensive literature search. The current database contains a systematic collection of DNA adducts, where names, structures and sources have been harmonized and manually curated. The database includes 279 adducts coming from 16 genotoxicant classes and nine sources or processes. Information on the source of the DNA adducts, the samples where they have been detected and the technique that has been used for identification, provides useful metadata to verify the chemical and biological plausibility of annotations. Implementation of the database with new combinations of genotoxicants and nucleobases generated 303 new entries, thereby providing a comprehensive database of 582 DNA adducts to support identification of unknown DNA adducts. Information on structure, molecular formula, monoisotopic mass, and <italic>in silico</italic> predicted fragments for all the 582 entries are provided along with authentic MS/MS spectra for 15 DNA adduct reference standards. The database provides data with computer readable identifiers (SMILES, IUPAC, InChI, InChIKey) presented in publicly available interactive and searchable data tables, which can be easily updated with new entries, new spectra and more detailed metadata in the repository (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database</ext-link>). The full development of this database and its integration with MS/MS spectra and informatics tools will allow DNA adductomics to play a major role in systems toxicology, cancer genotoxicity, and cancer prevention.</p>
</sec>
</body>
<back>
<sec id="s6">
<title>Data Availability Statement</title>
<p>The datasets presented in this study can be found in the GitLab repository (<ext-link ext-link-type="uri" xlink:href="https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database">https://gitlab.com/nexs-metabolomics/projects/dna_adductomics_database</ext-link>).</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>GLB: Conceptualization, Methodology, Formal analysis, Investigation, Data curation, Validation, Writing&#x2013;original draft, review, and editing. Supervision, Project administration. KN: Methodology, Formal analysis, Investigation, Data curation, Writing&#x2013;original draft. CC: Data curation, Validation, Writing&#x2013;original draft, review, and editing. JS: Data curation, Validation, Writing&#x2013;original draft. LD: Conceptualization, Data curation, Supervision, Project administration. Writing&#x2013;review, and editing.</p>
</sec>
<sec id="s8">
<title>Funding</title>
<p>This work, and the fellowship of GLB, were funded by the European Commision for the H2020 Marie Curie Slodowska Individual Fellowships (grant no. 843892). LD was supported by a Semper Ardens grant (CF15-0574) from the Carlsberg Foundation.</p>
</sec>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s11">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fchem.2022.908572/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fchem.2022.908572/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Table1.xlsx" id="SM1" mimetype="application/xlsx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.docx" id="SM2" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alshannaq</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>J.-H.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Occurrence, Toxicity, and Analysis of Major Mycotoxins in Food</article-title>. <source>Ijerph</source> <volume>14</volume>, <fpage>632</fpage>. <pub-id pub-id-type="doi">10.3390/ijerph14060632</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bakhiya</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Appel</surname>
<given-names>K. E.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Toxicity and Carcinogenicity of Furan in Human Diet</article-title>. <source>Arch. Toxicol.</source> <volume>84</volume>, <fpage>563</fpage>&#x2013;<lpage>578</lpage>. <pub-id pub-id-type="doi">10.1007/s00204-010-0531-y</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Balbo</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Turesky</surname>
<given-names>R. J.</given-names>
</name>
<name>
<surname>Villalta</surname>
<given-names>P. W.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>DNA Adductomics</article-title>. <source>Chem. Res. Toxicol.</source> <volume>27</volume>, <fpage>356</fpage>&#x2013;<lpage>366</lpage>. <pub-id pub-id-type="doi">10.1021/tx4004352</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carr&#xe0;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Guidolin</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Dator</surname>
<given-names>R. P.</given-names>
</name>
<name>
<surname>Upadhyaya</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Kassie</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Villalta</surname>
<given-names>P. W.</given-names>
</name>
<etal/>
</person-group> (<year>2019a</year>). <article-title>Targeted High Resolution LC/MS3 Adductomics Method for the Characterization of Endogenous DNA Damage</article-title>. <source>Front. Chem.</source> <volume>7</volume>, <fpage>658</fpage>. <pub-id pub-id-type="doi">10.3389/fchem.2019.00658</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carr&#xe0;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Macaluso</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Villalta</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Spezia</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Balbo</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2019b</year>). <article-title>Fragmentation Spectra Prediction and DNA Adducts Structural Determination</article-title>. <source>J. Am. Soc. Mass Spectrom.</source> <volume>30</volume>, <fpage>2771</fpage>&#x2013;<lpage>2784</lpage>. <pub-id pub-id-type="doi">10.1007/S13361-019-02348-7</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chang</surname>
<given-names>Y.-J.</given-names>
</name>
<name>
<surname>Cooke</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.-R.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>S.-F.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>P.-S.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>C.-W.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Is High Resolution a Strict Requirement for Mass Spectrometry-Based Cellular DNA Adductomics?</article-title> <source>Chemosphere</source> <volume>274</volume>, <fpage>129991</fpage>. <pub-id pub-id-type="doi">10.1016/J.CHEMOSPHERE.2021.129991</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chao</surname>
<given-names>M.-R.</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>C.-W.</given-names>
</name>
<name>
<surname>Ji</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>M&#xf8;ller</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Rossner</surname>
<given-names>P.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Biomarkers of Nucleic Acid Oxidation - A Summary State-Of-The-Art</article-title>. <source>Redox Biol.</source> <volume>42</volume>, <fpage>101872</fpage>. <pub-id pub-id-type="doi">10.1016/J.REDOX.2021.101872</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cohen</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Arnold</surname>
<given-names>L. L.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Chemical Carcinogenesis</article-title>. <source>Toxicol. Sci.</source> <volume>120</volume>, <fpage>S76</fpage>&#x2013;<lpage>S92</lpage>. <pub-id pub-id-type="doi">10.1093/toxsci/kfq365</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cooke</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>C.-W.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>Y.-J.</given-names>
</name>
<name>
<surname>Chao</surname>
<given-names>M.-R.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Urinary DNA Adductomics - A Novel Approach for Exposomics</article-title>. <source>Environ. Int.</source> <volume>121</volume>, <fpage>1033</fpage>&#x2013;<lpage>1038</lpage>. <pub-id pub-id-type="doi">10.1016/j.envint.2018.10.041</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eisenreich</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>G&#xf6;tz</surname>
<given-names>M. E.</given-names>
</name>
<name>
<surname>Sachse</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Monien</surname>
<given-names>B. H.</given-names>
</name>
<name>
<surname>Herrmann</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Sch&#xe4;fer</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Alkenylbenzenes in Foods: Aspects Impeding the Evaluation of Adverse Health Effects</article-title>. <source>Foods</source> <volume>10</volume>, <fpage>2139</fpage>. <pub-id pub-id-type="doi">10.3390/FOODS10092139</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Essigmann</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Barker</surname>
<given-names>L. J.</given-names>
</name>
<name>
<surname>Fowler</surname>
<given-names>K. W.</given-names>
</name>
<name>
<surname>Francisco</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Reinhold</surname>
<given-names>V. N.</given-names>
</name>
<name>
<surname>Wogan</surname>
<given-names>G. N.</given-names>
</name>
</person-group> (<year>1979</year>). <article-title>Sterigmatocystin-DNA Interactions: Identification of a Major Adduct Formed after Metabolic Activation <italic>In Vitro</italic>
</article-title>. <source>Proc. Natl. Acad. Sci. U.S.A.</source> <volume>76</volume>, <fpage>179</fpage>&#x2013;<lpage>183</lpage>. <pub-id pub-id-type="doi">10.1073/PNAS.76.1.179</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guidolin</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Carlson</surname>
<given-names>E. S.</given-names>
</name>
<name>
<surname>Carr&#xe0;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Villalta</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Maertens</surname>
<given-names>L. A.</given-names>
</name>
<name>
<surname>Hecht</surname>
<given-names>S. S.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Identification of New Markers of Alcohol-Derived DNA Damage in Humans</article-title>. <source>Biomolecules</source> <volume>11</volume>, <fpage>366</fpage>. <pub-id pub-id-type="doi">10.3390/BIOM11030366</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guijas</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Montenegro-Burke</surname>
<given-names>J. R.</given-names>
</name>
<name>
<surname>Domingo-Almenara</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Palermo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Warth</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Hermann</surname>
<given-names>G.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>METLIN: A Technology Platform for Identifying Knowns and Unknowns</article-title>. <source>Anal. Chem.</source> <volume>90</volume>, <fpage>3156</fpage>&#x2013;<lpage>3164</lpage>. <pub-id pub-id-type="doi">10.1021/ACS.ANALCHEM.7B04424</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Turesky</surname>
<given-names>R. J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Emerging Technologies in Mass Spectrometry-Based DNA Adductomics</article-title>. <source>High-Throughput</source> <volume>8</volume>, <fpage>13</fpage>. <pub-id pub-id-type="doi">10.3390/HT8020013</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Turesky</surname>
<given-names>R. J.</given-names>
</name>
<name>
<surname>Tarifa</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>DeCaprio</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Cooke</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Walmsley</surname>
<given-names>S. J.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Development of a DNA Adductome Mass Spectral Database</article-title>. <source>Chem. Res. Toxicol.</source> <volume>33</volume>, <fpage>852</fpage>&#x2013;<lpage>854</lpage>. <pub-id pub-id-type="doi">10.1021/ACS.CHEMRESTOX.0C00031</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guo</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Villalta</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Turesky</surname>
<given-names>R. J.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Data-Independent Mass Spectrometry Approach for Screening and Identification of DNA Adducts</article-title>. <source>Anal. Chem.</source> <volume>89</volume>, <fpage>11728</fpage>&#x2013;<lpage>11736</lpage>. <pub-id pub-id-type="doi">10.1021/acs.analchem.7b03208</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heijne</surname>
<given-names>W. H.</given-names>
</name>
<name>
<surname>Kienhuis</surname>
<given-names>A. S.</given-names>
</name>
<name>
<surname>van Ommen</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Stierum</surname>
<given-names>R. H.</given-names>
</name>
<name>
<surname>Groten</surname>
<given-names>J. P.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Systems Toxicology: Applications of Toxicogenomics, Transcriptomics, Proteomics and Metabolomics in Toxicology</article-title>. <source>Expert Rev. Proteomics</source> <volume>2</volume>, <fpage>767</fpage>&#x2013;<lpage>780</lpage>. <pub-id pub-id-type="doi">10.1586/14789450.2.5.767</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hemeryck</surname>
<given-names>L. Y.</given-names>
</name>
<name>
<surname>Decloedt</surname>
<given-names>A. I.</given-names>
</name>
<name>
<surname>Vanden Bussche</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Geboes</surname>
<given-names>K. P.</given-names>
</name>
<name>
<surname>Vanhaecke</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>High Resolution Mass Spectrometry Based Profiling of Diet-Related Deoxyribonucleic Acid Adducts</article-title>. <source>Anal. Chim. Acta</source> <volume>892</volume>, <fpage>123</fpage>&#x2013;<lpage>131</lpage>. <pub-id pub-id-type="doi">10.1016/j.aca.2015.08.019</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hemeryck</surname>
<given-names>L. Y.</given-names>
</name>
<name>
<surname>Moore</surname>
<given-names>S. A.</given-names>
</name>
<name>
<surname>Vanhaecke</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Mass Spectrometric Mapping of the DNA Adductome as a Means to Study Genotoxin Exposure, Metabolism, and Effect</article-title>. <source>Anal. Chem.</source> <volume>88</volume>, <fpage>7436</fpage>&#x2013;<lpage>7446</lpage>. <pub-id pub-id-type="doi">10.1021/acs.analchem.6b00863</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hemeryck</surname>
<given-names>L. Y.</given-names>
</name>
<name>
<surname>Vanhaecke</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Diet-Related DNA Adduct Formation in Relation to Carcinogenesis</article-title>. <source>Nutr. Rev.</source> <volume>74</volume>, <fpage>475</fpage>&#x2013;<lpage>489</lpage>. <pub-id pub-id-type="doi">10.1093/nutrit/nuw017</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Horai</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Arita</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kanaya</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Nihei</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Ikeda</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Suwa</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>MassBank: A Public Repository for Sharing Mass Spectral Data for Life Sciences</article-title>. <source>J. Mass Spectrom.</source> <volume>45</volume>, <fpage>703</fpage>&#x2013;<lpage>714</lpage>. <pub-id pub-id-type="doi">10.1002/JMS.1777</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>C.-W.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>Y.-J.</given-names>
</name>
<name>
<surname>Cooke</surname>
<given-names>M. S.</given-names>
</name>
<name>
<surname>Chao</surname>
<given-names>M.-R.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>DNA Crosslinkomics: A Tool for the Comprehensive Assessment of Interstrand Crosslinks Using High Resolution Mass Spectrometry</article-title>. <source>Anal. Chem.</source> <volume>91</volume>, <fpage>15193</fpage>&#x2013;<lpage>15203</lpage>. <pub-id pub-id-type="doi">10.1021/acs.analchem.9b04068</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kanehisa</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Goto</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>KEGG: Kyoto Encyclopedia of Genes and Genomes</article-title>. <source>Nucleic Acids Res.</source> <volume>28</volume>, <fpage>27</fpage>&#x2013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1093/NAR/28.1.27</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>H. J.</given-names>
</name>
<name>
<surname>Ryu</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Worldwide Occurrence of Mycotoxins in Cereals and Cereal-Derived Food Products: Public Health Perspectives of Their Co-Occurrence</article-title>. <source>J. Agric. Food Chem.</source> <volume>65</volume>, <fpage>7034</fpage>&#x2013;<lpage>7051</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jafc.6b04847</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Segal</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Solomon</surname>
<given-names>J. J.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>
<italic>In Vitro</italic> Reaction of Ethylene Oxide with DNA and Characterization of DNA Adducts</article-title>. <source>Chemico-Biological Interact.</source> <volume>83</volume>, <fpage>35</fpage>&#x2013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1016/0009-2797(92)90090-8</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Stepanov</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Hecht</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Recent Studies on DNA Adducts Resulting from Human Exposure to Tobacco Smoke</article-title>. <source>Toxics</source> <volume>7</volume>, <fpage>16</fpage>. <pub-id pub-id-type="doi">10.3390/TOXICS7010016</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Zarth</surname>
<given-names>A. T.</given-names>
</name>
<name>
<surname>Carlson</surname>
<given-names>E. S.</given-names>
</name>
<name>
<surname>Villalta</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Upadhyaya</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Stepanov</surname>
<given-names>I.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Identification of More Than 100 Structurally Unique DNA-Phosphate Adducts Formed During Rat Lung Carcinogenesis by the Tobacco-Specific Nitrosamine 4-(methylnitrosamino)-1-(3-Pyridyl)-1-Butanone</article-title>. <source>Carcinogenesis</source> <volume>39</volume>, <fpage>232</fpage>&#x2013;<lpage>241</lpage>. <pub-id pub-id-type="doi">10.1093/carcin/bgx135</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>O&#x27;Boyle</surname>
<given-names>N. M.</given-names>
</name>
<name>
<surname>Banck</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>James</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Morley</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Vandermeersch</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Hutchison</surname>
<given-names>G. R.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Open Babel: An Open Chemical Toolbox</article-title>. <source>J. Cheminform.</source> <volume>3</volume>, <fpage>1</fpage>&#x2013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1186/1758-2946-3-33</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pluskal</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Castillo</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Villar-Briones</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Ore&#x161;i&#x10d;</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>MZmine 2: Modular Framework for Processing, Visualizing, and Analyzing Mass Spectrometry-Based Molecular Profile Data</article-title>. <source>BMC Bioinforma.</source> <volume>11</volume>. <pub-id pub-id-type="doi">10.1186/1471-2105-11-395</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="book">
<collab>R Core Team</collab> (<year>2020</year>). <source>R: A Language and Environment for Statistical Computing</source>. <publisher-loc>Vienna, Austria</publisher-loc>: <publisher-name>R Core Team</publisher-name>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.r-project.org/">https://www.r-project.org/</ext-link>
</comment>. </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Scalbert</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Brennan</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Manach</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Andres-Lacueva</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Dragsted</surname>
<given-names>L. O.</given-names>
</name>
<name>
<surname>Draper</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>The Food Metabolome: A Window over Dietary Exposure</article-title>. <source>Amer. J. Clin. Nutr.</source> <volume>99</volume> (<issue>6</issue>), <fpage>1286</fpage>&#x2013;<lpage>1308</lpage>. <pub-id pub-id-type="doi">10.3945/ajcn.113.076133</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smith</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Want</surname>
<given-names>E. J.</given-names>
</name>
<name>
<surname>O&#x27;Maille</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Abagyan</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Siuzdak</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>XCMS: Processing Mass Spectrometry Data for Metabolite Profiling Using Nonlinear Peak Alignment, Matching, and Identification</article-title>. <source>Anal. Chem.</source> <volume>78</volume>, <fpage>779</fpage>&#x2013;<lpage>787</lpage>. <pub-id pub-id-type="doi">10.1021/ac051437y</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sousa</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Martella</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>&#xc5;berg</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Esfahani</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Motwani</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>nLossFinder-A Graphical User Interface Program for the Nontargeted Detection of DNA Adducts</article-title>. <source>Toxics</source> <volume>9</volume>, <fpage>78</fpage>. <pub-id pub-id-type="doi">10.3390/TOXICS9040078</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stegelmeier</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Colegate</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Brown</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Dehydropyrrolizidine Alkaloid Toxicity, Cytotoxicity, and Carcinogenicity</article-title>. <source>Toxins</source> <volume>8</volume>, <fpage>356</fpage>. <pub-id pub-id-type="doi">10.3390/TOXINS8120356</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stornetta</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Villalta</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Hecht</surname>
<given-names>S. S.</given-names>
</name>
<name>
<surname>Sturla</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Balbo</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Screening for DNA Alkylation Mono and Cross-Linked Adducts with a Comprehensive LC-MS3 Adductomic Approach</article-title>. <source>Anal. Chem.</source> <volume>87</volume>, <fpage>11706</fpage>&#x2013;<lpage>11713</lpage>. <pub-id pub-id-type="doi">10.1021/acs.analchem.5b02759</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Simultaneous Quantitation of 14 DNA Alkylation Adducts in Human Liver and Kidney Cells by UHPLC-MS/MS: Application to Profiling DNA Adducts of Genotoxic Reagents</article-title>. <source>J. Pharm. Biomed. Analysis</source> <volume>166</volume>, <fpage>387</fpage>&#x2013;<lpage>397</lpage>. <pub-id pub-id-type="doi">10.1016/J.JPBA.2019.01.034</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Turesky</surname>
<given-names>R. J.</given-names>
</name>
<name>
<surname>Le Marchand</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Metabolism and Biomarkers of Heterocyclic Aromatic Amines in Molecular Epidemiology Studies: Lessons Learned from Aromatic Amines</article-title>. <source>Chem. Res. Toxicol.</source> <volume>24</volume>, <fpage>1169</fpage>&#x2013;<lpage>1214</lpage>. <pub-id pub-id-type="doi">10.1021/tx200135s</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Villalta</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Balbo</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>The Future of DNA Adductomic Analysis</article-title>. <source>Ijms</source> <volume>18</volume>, <fpage>1870</fpage>. <pub-id pub-id-type="doi">10.3390/ijms18091870</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Liigand</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Arndt</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Greiner</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Wishart</surname>
<given-names>D. S.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>CFM-ID 4.0: More Accurate ESI-MS/MS Spectral Prediction and Compound Identification</article-title>. <source>Anal. Chem.</source> <volume>93</volume>, <fpage>11692</fpage>&#x2013;<lpage>11700</lpage>. <pub-id pub-id-type="doi">10.1021/ACS.ANALCHEM.1C01465</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wei</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Yin</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Covalent Modification of DNA by &#x3b1;, &#x3b2;-Unsaturated Aldehydes Derived from Lipid Peroxidation: Recent Progress and Challenges</article-title>. <source>Free Radic. Res.</source> <volume>49</volume>, <fpage>905</fpage>&#x2013;<lpage>917</lpage>. <pub-id pub-id-type="doi">10.3109/10715762.2015.1040009</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wickham</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Averick</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bryan</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>McGowan</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Fran&#xe7;ois</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Welcome to the Tidyverse</article-title>. <source>Joss</source> <volume>4</volume>, <fpage>1686</fpage>. <pub-id pub-id-type="doi">10.21105/JOSS.01686</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wilson</surname>
<given-names>M. R.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Villalta</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Stornetta</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Boudreau</surname>
<given-names>P. D.</given-names>
</name>
<name>
<surname>Carr&#xe1;</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>The Human Gut Bacterial Genotoxin Colibactin Alkylates DNA</article-title>. <source>Science</source> <volume>363</volume>, <fpage>709</fpage>. <pub-id pub-id-type="doi">10.1126/science.aar7785</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wishart</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Oler</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Anjum</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>H.</given-names>
</name>
<etal/>
</person-group> (<year>2022</year>). <article-title>HMDB 5.0: The Human Metabolome Database for 2022</article-title>. <source>Nucleic Acids Res.</source> <volume>50</volume>, <fpage>D622</fpage>&#x2013;<lpage>D631</lpage>. <pub-id pub-id-type="doi">10.1093/NAR/GKAB1062</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Cui</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Niedernhofer</surname>
<given-names>L. J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Occurrence, Biological Consequences, and Human Health Relevance of Oxidative Stress-Induced DNA Damage</article-title>. <source>Chem. Res. Toxicol.</source> <volume>29</volume>, <fpage>2008</fpage>&#x2013;<lpage>2039</lpage>. <pub-id pub-id-type="doi">10.1021/ACS.CHEMRESTOX.6B00265</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Bai</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A Review on the Pharmacological Properties, Toxicological Characteristics, Pathogenic Mechanism and Analytical Methods of Aristolochic Acids</article-title>. <source>Toxin Rev.</source> <volume>40</volume>, <fpage>606</fpage>&#x2013;<lpage>615</lpage>. <pub-id pub-id-type="doi">10.1080/15569543.2020.1811344</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>