<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Earth Sci.</journal-id>
<journal-title>Frontiers in Earth Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Earth Sci.</abbrev-journal-title>
<issn pub-type="epub">2296-6463</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">931157</article-id>
<article-id pub-id-type="doi">10.3389/feart.2022.931157</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Earth Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Hidden molecular clues in marine sediments revealed by untargeted mass spectrometry imaging</article-title>
<alt-title alt-title-type="left-running-head">Liu et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/feart.2022.931157">10.3389/feart.2022.931157</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Liu</surname>
<given-names>Weimin</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1776746/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Alfken</surname>
<given-names>Susanne</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>W&#x00F6;rmer</surname>
<given-names>Lars</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/200833/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lipp</surname>
<given-names>Julius S.</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/30401/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Hinrichs</surname>
<given-names>Kai-Uwe</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/31868/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>Organic Geochemistry Group</institution>, <institution>MARUM&#x2014;Center for Marine Environmental Sciences</institution>, <institution>University of Bremen</institution>, <addr-line>Bremen</addr-line>, <country>Germany</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/824089/overview">David C. Podgorski</ext-link>, University of New Orleans, United States</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/94388/overview">Fengfeng Zheng</ext-link>, Tongji University, China</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/334612/overview">Jeffrey Alistair Hawkes</ext-link>, Uppsala University, Sweden</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Weimin Liu, <email>wliu@marum.de</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Biogeoscience, a section of the journal Frontiers in Earth Science</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>08</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>10</volume>
<elocation-id>931157</elocation-id>
<history>
<date date-type="received">
<day>28</day>
<month>04</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>06</day>
<month>07</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Liu, Alfken, W&#x00F6;rmer, Lipp and Hinrichs.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Liu, Alfken, W&#x00F6;rmer, Lipp and Hinrichs</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Mass spectrometry imaging (MSI) in sedimentary archives can produce records of molecular proxies at &#x3bc;m-scale resolution. For example, in annually varved sediments of the Santa Barbara Basin, such a fine resolution allows deciphering sub-annual distributions of archaeal tetraether lipids, haptophyte-derived alkenones, and sterols. Herein, we reported the establishment of an untargeted data processing workflow aimed at dissecting the MSI datasets and extracting information beyond that obtained by targeted analysis of known molecular proxies. The combination of MSI and the untargeted workflow not only increases the spatial resolution for molecular stratigraphy but also dramatically broadens the number and diversity of molecular signals evaluated, enabling us to discover unique molecular signatures imprinted by various biogeochemical processes. We applied the proposed workflow to two MSI datasets that were both measured on the uppermost &#x223c;10&#xa0;cm of the Santa Barbara Basin sediments while covering different mass ranges. Two matrices of <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:mn>18625</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>293</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf2">
<mml:math id="m2">
<mml:mrow>
<mml:mn>18963</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>323</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> (number of spectra <inline-formula id="inf3">
<mml:math id="m3">
<mml:mo>&#xd7;</mml:mo>
</mml:math>
</inline-formula> number of peaks) were, respectively, extracted after peak alignment using bin-wise kernel density estimation and subsequent peak picking by peak prominence filtering combined with geochemical context-based filtering. Feature extraction by non-negative matrix factorization revealed in total 15 stable molecular clusters with distinct spatial distributions in the sediments. Each cluster typically comprised several to dozens of compounds, with the majority of compounds in each cluster likely belonging to similar chemical taxonomies. Some of these clusters can be linked to specific biogeochemical processes. For example, chlorin-like compounds are possibly related to diatom production, alkenones are related to coccolithophorid production, and steranes and long-chain fatty acids likely represent terrigenous input. Supervised learning from these data mining results further extracted molecular signatures with proxy potential that appear to be linked to specific environmental conditions inferred from historical oceanographic data. However, generalizability to other sedimentary settings will require further investigation.</p>
</abstract>
<kwd-group>
<kwd>marine sediments</kwd>
<kwd>supervised learning</kwd>
<kwd>mass spectrometry imaging</kwd>
<kwd>untargeted data mining</kwd>
<kwd>molecular signature</kwd>
<kwd>biomarker discovery</kwd>
<kwd>paleoclimate</kwd>
</kwd-group>
<contract-num rid="cn001">670115</contract-num>
<contract-num rid="cn002">390741603</contract-num>
<contract-sponsor id="cn001">HORIZON EUROPE European Research Council<named-content content-type="fundref-id">10.13039/100019180</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">Deutsche Forschungsgemeinschaft<named-content content-type="fundref-id">10.13039/501100001659</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Biomarkers in geological samples encode information about past environments and ecosystems (<xref ref-type="bibr" rid="B24">Hayes et al., 1990</xref>; <xref ref-type="bibr" rid="B46">Peters et al., 2005</xref>). Biomarker-based molecular proxies have been widely applied in reconstructing paleoecological and paleoclimatic history (<xref ref-type="bibr" rid="B15">Damst&#xe9; et al., 1990</xref>; <xref ref-type="bibr" rid="B27">Hinrichs et al., 1999</xref>; <xref ref-type="bibr" rid="B28">Huang et al., 1999</xref>; <xref ref-type="bibr" rid="B56">Summons et al., 2022</xref>). The initial proposal of molecular stratigraphy occurred more than three&#xa0;decades ago (<xref ref-type="bibr" rid="B8">Brassell et al., 1986</xref>), when the degree of unsaturation of long-chain methyl and ethyl ketones (i.e., alkenones) produced by ubiquitous coccolithophores was suggested to indicate variations in sea surface temperatures (SST). Another example of a biomarker-based proxy is the stanol/stenol ratio, which is sensitive to redox conditions in depositional settings (<xref ref-type="bibr" rid="B40">Nishimura and Koyama, 1977</xref>; <xref ref-type="bibr" rid="B69">Wakeham, 1989</xref>) as <inline-formula id="inf4">
<mml:math id="m4">
<mml:mrow>
<mml:msup>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:mn>5</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>-sterols are microbiologically reduced to <inline-formula id="inf5">
<mml:math id="m5">
<mml:mrow>
<mml:mn>5</mml:mn>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> (H)-stanols under anoxic conditions (<xref ref-type="bibr" rid="B51">Rosenfeld and Hellman, 1971</xref>; <xref ref-type="bibr" rid="B19">Eyssen et al., 1973</xref>). Conventional biomarker analysis includes the solvent extraction of geological samples, the subsequent separation of the extract into varying fractions, and the instrumental analysis of the crude extract or selected fractions. Due to the requirement of at least gram-sized (cm-scale) sample amounts in such analytical protocols, the resulting biomarker records usually indicate the average state of past environments and ecosystems over decades to millennia and are not fine enough for identifying abrupt environmental changes and high-frequency climate oscillations.</p>
<p>Recently, we have increased the resolution of biomarker records in sediments to &#x3bc;m-scale by an extraction-free, mass spectrometry imaging (MSI)-based approach <italic>via</italic> matrix-assisted laser desorption/ionization coupled to Fourier transformation-ion cyclotron resonance-mass spectrometry (MALDI-FT-ICR-MS; <xref ref-type="bibr" rid="B72">W&#xf6;rmer et al., 2014</xref>, <xref ref-type="bibr" rid="B74">W&#xf6;rmer et al., 2019</xref>; <xref ref-type="bibr" rid="B5">Alfken et al., 2019</xref>). In the annually varved sediment of the Santa Barbara Basin (SBB), such a fine resolution allows deciphering sub-annual distributions of archaeal tetraether lipids, haptophyte-derived alkenones, and sterols, which are sensitive to changes in upwelling intensity, sea surface temperature, and water column redox conditions (<xref ref-type="bibr" rid="B4">Alfken et al., 2020</xref>, <xref ref-type="bibr" rid="B3">2021</xref>). The MSI-based protocol has also successfully revealed astonishingly diverse spatial signatures of lipid biomarkers along a &#x223c;1&#xa0;cm thick microbial mat (<xref ref-type="bibr" rid="B73">W&#xf6;rmer et al., 2020</xref>), demonstrating its capability in resolving fine spatial distributions of biomarker records beyond rather coarse concentration profiles. Nevertheless, our MSI-based studies of sedimentary archives so far have only focused on certain groups of biomarkers, such as alkenones and sterols. Thousands of ions are generated in a single spectrum of each &#x3bc;m-sized laser spot in addition to these targeted biomarkers, resulting in massive amounts of spectrometry data that are not readily mineable but could contain novel, biogeochemically relevant biomarkers. An untargeted data processing protocol is thus in need for the comprehensive understanding of sedimentary MSI datasets.</p>
<p>Over the last decade, advances in bioinformatics have shed light on hidden molecular clues in MSI datasets obtained from biological tissues, providing automated tools/protocols for data cleaning, as well as unsupervised and supervised data mining (reviews in <xref ref-type="bibr" rid="B1">Alexandrov, 2012</xref>; <xref ref-type="bibr" rid="B60">Trede et al., 2012</xref>; <xref ref-type="bibr" rid="B58">Thiele et al., 2014</xref>; <xref ref-type="bibr" rid="B62">Verbeeck et al., 2020</xref>). For example, <xref ref-type="bibr" rid="B17">Eriksson et al. (2019)</xref> proposed a sensitive peak detection method based on cluster-wise kernel density estimation (KDE), an algorithm for estimating the probability density function of random variables (<xref ref-type="bibr" rid="B7">Botev et al., 2010</xref>), allowing the discovery of both faint and localized peaks in MSI datasets. Gray level co-occurrence matrices (GLCM; <xref ref-type="bibr" rid="B22">Hall-Beyer, 2017</xref>) that compute the distribution of co-occurring pixel values at specified offsets in an image were utilized in <xref ref-type="bibr" rid="B71">Wijetunge et al. (2015)</xref> for unsupervised peak picking, which enables identifying molecules with structured spatial distributions. Various matrix factorization techniques, such as principal component analysis (PCA; <xref ref-type="bibr" rid="B30">Jolliffe, 2002</xref>), independent component analysis (ICA; <xref ref-type="bibr" rid="B31">Jutten and Herault, 1991</xref>; <xref ref-type="bibr" rid="B13">Comon, 1994</xref>), and non-negative matrix factorization (NMF; <xref ref-type="bibr" rid="B9">Brunet et al., 2004</xref>; <xref ref-type="bibr" rid="B34">Kim and Park, 2007</xref>) have been successfully applied to MSI datasets for dimensionality reduction and feature extraction. Among these, NMF is specifically useful in extracting easy-to-interpret biologically relevant information (e.g., <xref ref-type="bibr" rid="B77">Siy et al., 2008</xref>; <xref ref-type="bibr" rid="B78">Gut et al., 2015</xref>; <xref ref-type="bibr" rid="B79">Nijs et al., 2021</xref>). In addition to these unsupervised approaches, supervised machine learning has also been reported for identifying tumor tissues and discovering tumor-related biomarkers (e.g., <xref ref-type="bibr" rid="B48">Quanico et al., 2017</xref>; <xref ref-type="bibr" rid="B42">Ovchinnikova et al., 2020</xref>; <xref ref-type="bibr" rid="B39">Mittal et al., 2021</xref>).</p>
<p>Data mining approaches designed for biological MSI datasets often take advantage of the spatial resolution of MSI and search for biomarkers that co-localize with specific biological structures, e.g., tumors. Although geological samples are generally not as clearly structured as biological tissues, the laminated samples from the Santa Barbara Basin should be imprinted with distinct spatial molecular signatures caused by seasonally recurring biogeochemical processes. For instance, sediments in the center of SBB are deposited in oxygen-depleted settings, and they are characterized by annual varve couplets of terrigenous mineral-rich laminae deposited during the rainy fall&#x2013;winter season alternating with biogenic-rich laminae deposited during the highly productive spring&#x2013;summer season (<xref ref-type="bibr" rid="B29">H&#xfc;lsemann and Emery, 1961</xref>; <xref ref-type="bibr" rid="B50">Reimers et al., 1990</xref>; <xref ref-type="bibr" rid="B59">Thunell et al., 1995</xref>; <xref ref-type="bibr" rid="B52">Schimmelmann and Lange, 1996</xref>). Such distinct sources should be reflected in biomarker signatures that are, to some extent, correlated with environmental conditions, such as salinity and nutrient concentrations.</p>
<p>In this study, we reported an untargeted data processing workflow for dissecting sedimentary MSI datasets and extracting information beyond conventional molecular proxies, enabling us to discover unique molecular signatures imprinted by varying biogeochemical processes in sediments. The proposed workflow is validated by re-analyzing the MSI datasets obtained from the SBB sediments measured by <xref ref-type="bibr" rid="B4">Alfken et al. (2020</xref>, <xref ref-type="bibr" rid="B3">2021)</xref>. Untargeted peak detection, selection of informative peaks whose distributions mirror the lamination of the sediment, pattern recognition for molecular signature discovery, and relating molecular signatures to environmental variables are all part of the re-analysis. In addition, conventional molecular proxies are often established in a bottom-up manner, starting at characterizing specific biomarkers in geological samples, then identifying their roles in biogeochemical processes, and, in the end, building geochemically meaningful proxies that stand the test of &#x201c;time&#x201d; and &#x201c;space&#x201d;. In this study, based on the proposed data processing workflow, we tried to demonstrate the potential of supervised learning for discovering novel, biogeochemically relevant molecular proxies in a top-down manner, i.e., building indicative proxies without prior knowledge of their biogeochemical implications.</p>
</sec>
<sec id="s2">
<title>2 Samples and methods</title>
<sec id="s2-1">
<title>2.1 Mass spectrometry imaging experiments and datasets</title>
<p>Four MSI datasets (<xref ref-type="table" rid="T1">Table 1</xref>) acquired from the uppermost &#x223c;10&#xa0;cm of box core SPR0901-05BC collected from the center of SBB were employed to evaluate the data processing workflow proposed in this study. The sediment samples and MALDI-FT-ICR-MS analysis protocol have been described in detail in <xref ref-type="bibr" rid="B4">Alfken et al. (2020</xref>, <xref ref-type="bibr" rid="B3">2021)</xref>. Datasets A1 and A2 were measured in <xref ref-type="bibr" rid="B4">Alfken et al. (2020)</xref> with a mass window of 520&#x2013;580&#xa0;Da for mapping the <inline-formula id="inf6">
<mml:math id="m6">
<mml:mrow>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mn>37</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>K</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> index, and datasets B1 and B2 were measured in <xref ref-type="bibr" rid="B3">Alfken et al. (2021)</xref> with a wider mass window of 375&#x2013;525&#xa0;Da for mapping the stanol/stenol ratios. A1 and B1 were obtained from the uppermost &#x223c;5&#xa0;cm of the sediment, and A2 and B2 were obtained from 5&#x2013;10&#xa0;cm intervals on the sediment. For the sake of brevity, A1 and A2, and B1 and B2 were merged, respectively, in this study, and the resulting new datasets were coded as datasets A and B. In addition, although datasets resulting from analysis of the m/z range 1280&#x2013;1360 are available from <xref ref-type="bibr" rid="B3">Alfken et al. (2021)</xref>, they were deemed not to be practical for evaluating the untargeted workflow due to the low abundance of known biomarkers in this mass range.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Basic information of the MSI datasets employed in this study.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Label</th>
<th align="left">m/z window (Da)</th>
<th align="left">Depth</th>
<th align="left">Number of data points</th>
<th align="left">Merged label</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">A1</td>
<td align="left">520&#x2013;580</td>
<td align="left">Upper &#x223c;5&#xa0;cm</td>
<td align="left">9071</td>
<td rowspan="2" align="left">A</td>
</tr>
<tr>
<td align="left">A2</td>
<td align="left">520&#x2013;580</td>
<td align="left">&#x223c;5&#x2014;10&#xa0;cm</td>
<td align="left">9554</td>
</tr>
<tr>
<td align="left">B1</td>
<td align="left">375&#x2013;525</td>
<td align="left">Upper &#x223c;5&#xa0;cm</td>
<td align="left">9475</td>
<td rowspan="2" align="left">B</td>
</tr>
<tr>
<td align="left">B2</td>
<td align="left">375&#x2013;525</td>
<td align="left">&#x223c;5&#x2014;10&#xa0;cm</td>
<td align="left">9488</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s2-2">
<title>2.2 Data processing workflow</title>
<p>The MSI datasets were first exported to plain text files (&#x2a;.txt) from Data Analysis version 4.4 software (Bruker Daltonics). These plain text files were taken as inputs by the proposed workflow (<xref ref-type="fig" rid="F1">Figure 1</xref>) in this study, which is implemented in an in-house developed python package that is publicly available (<ext-link ext-link-type="uri" xlink:href="https://github.com/weimin-liu/msi_feature_extraction">https://github.com/weimin-liu/msi_feature_extraction</ext-link>). In brief, the workflow includes a data cleaning module for mass calibration, peak normalization, peak alignment, and peak picking, and a data mining module for molecular signature extraction and clustering of molecules. In addition, ElasticNet was employed to train multivariate linear regression models for indicating environmental conditions using the data mining results, in order to demonstrate the potential of supervised learning for molecular proxy discovery. The individual steps and the rationale of the proposed workflow are described in detail in the following sections.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>
<bold>(A)</bold> Proposed workflow including data cleaning and data mining modules for the untargeted analysis of the sedimentary MSI dataset, supplemented by a supervised learning example to demonstrate its potential for the discovery of novel biomarker proxies on the basis of the proposed workflow. &#x2a; Molecular signatures refer to the small sets of hidden features extracted by repeated NMF, and they denote the commonality of co-occurring molecules. By contrast, ElasticNet tries to find the variability in these co-occurring molecules that correlate with certain environmental parameters. <bold>(B)</bold> Graph shows an example of the workflow, including the detection of reference peaks between <italic>m/z</italic> values 551.0 and 552.0, the calculation of the gray level co-occurrence matrix (GLCM), and the unsupervised data mining with non-negative matrix factorization (NMF).</p>
</caption>
<graphic xlink:href="feart-10-931157-g001.tif"/>
</fig>
<sec id="s2-2-1">
<title>2.2.1 Data cleaning</title>
<p>In this step, the spectra were first calibrated, normalized by median peak intensity, aligned onto common mass-to-charge (<italic>m/z</italic>) ratios detected by bin-wise KDE, and cleaned by a two-step peak picking method to extract robust and informative signals in the spectra, including a peak prominence filter and a geochemical-context-based filter based on the varying deposits visualized by the X-Ray image of the sediment.</p>
<sec id="s2-2-1-1">
<title>2.2.1.1 Mass calibration and peak intensity normalization</title>
<p>Raw spectra were calibrated with lock-mass calibration in data analysis before being exported (<xref ref-type="bibr" rid="B4">Alfken et al., 2020</xref>, <xref ref-type="bibr" rid="B3">2021</xref>). Subsequently, a quadratic mass error function was fitted from the mass deviations of calibrants in each spectrum for further calibration. The calibrants used in datasets A and B are listed in <xref ref-type="sec" rid="s8">Supplementary Table S1</xref>. They were the most ubiquitous compounds in each dataset whose chemical formulas can be assigned with great certainty. In addition, peak intensities in each spectrum were normalized to the median peak intensity of the spectrum as it was found to be robust against artifact generation such as ion images showing inaccurate ion distributions (<xref ref-type="bibr" rid="B16">Deininger et al., 2011</xref>; <xref ref-type="bibr" rid="B20">Fonville et al., 2012</xref>).</p>
</sec>
<sec id="s2-2-1-2">
<title>2.2.1.2 Peak alignment with bin-wise KDE and peak prominence filtering</title>
<p>For a molecule whose theoretical <italic>m/z</italic> is <inline-formula id="inf7">
<mml:math id="m7">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, its measured <italic>m/z</italic> in the spectrum <inline-formula id="inf8">
<mml:math id="m8">
<mml:mrow>
<mml:msub>
<mml:mi>S</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is <inline-formula id="inf9">
<mml:math id="m9">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> that has a random deviation from <inline-formula id="inf10">
<mml:math id="m10">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>. We assume <inline-formula id="inf11">
<mml:math id="m11">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> is normally distributed around <inline-formula id="inf12">
<mml:math id="m12">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and determined <inline-formula id="inf13">
<mml:math id="m13">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> from the distribution of <inline-formula id="inf14">
<mml:math id="m14">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> with KDE. The so-determined <inline-formula id="inf15">
<mml:math id="m15">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mi>z</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> ratios are referred to as reference <italic>m/z</italic> ratios in this study since they do not necessarily represent the exact theoretical <italic>m/z</italic> ratios.</p>
<p>Concretely, we adapted the cluster-wise KDE approach proposed in <xref ref-type="bibr" rid="B17">Eriksson et al. (2019)</xref> by omitting peak clustering and detecting peaks directly in mass bins. We used mass bins with an interval of 1&#xa0;Da starting at every integer mass (e.g., 375.0&#x2014;376.0&#xa0;Da) to collect <italic>m/z</italic> from all spectra and got an array of bins [<inline-formula id="inf16">
<mml:math id="m16">
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf17">
<mml:math id="m17">
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf18">
<mml:math id="m18">
<mml:mrow>
<mml:mi>b</mml:mi>
<mml:mi>i</mml:mi>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, &#x2026;]. We then fitted KDE to the distribution of <italic>m/z</italic> within each sorted bin using a Gaussian kernel function, the bandwidth of which is automatically selected by the improved Sheather Jones algorithm implemented in the python package KDEpy (<xref ref-type="bibr" rid="B41">Odland, 2018</xref>), which is robust in fitting multimodal distributions (<xref ref-type="bibr" rid="B7">Botev et al., 2010</xref>). Reference <italic>m/z</italic> ratios were detected on the KDE curve by first normalizing its height to [0, 1] and then picking local maxima with peak prominence (defined in <xref ref-type="bibr" rid="B63">Virtanen et al., 2020</xref>) greater than a pre-defined threshold <italic>p</italic>
<sub>
<italic>th</italic>
</sub> (<italic>p</italic>
<sub>
<italic>th</italic>
</sub> <inline-formula id="inf19">
<mml:math id="m19">
<mml:mo>&#x2208;</mml:mo>
</mml:math>
</inline-formula> (0, 1))<italic>.</italic> How the choice of <italic>p</italic>
<sub>
<italic>th</italic>
</sub> influences the peak picking results, including the quality and the coverage of resulting peaks, is evaluated in <xref ref-type="sec" rid="s8">Supplementary Text S1</xref>. Finally, all spectra were aligned onto the reference <italic>m/z</italic> ratios by picking the nearest neighbor (i.e., the closest peak) with a maximum drift threshold of 10&#xa0;ppm.</p>
</sec>
<sec id="s2-2-1-3">
<title>2.2.1.3 Peak picking with GLCM features: Geochemical-context-based peak picking</title>
<p>The reference peaks detected by bin-wise KDE likely contained signals of background noise and non-informative peaks that cannot be reliably removed by peak prominence filtering. In this study, we defined informative peaks as those having certain spatial structures instead of being randomly distributed in the sediments. The two datasets were obtained from the varved SBB sediments consisting of seasonally alternating deposits, which allow the visualization of the associated density differences on an X-ray image (positive image). Since the dense, terrestrial deposits are displayed as dark, and the more porous, diatom-rich biogenic deposits as light laminae (<xref ref-type="bibr" rid="B29">H&#xfc;lsemann and Emery, 1961</xref>; <xref ref-type="bibr" rid="B55">Soutar and Crill, 1977</xref>; <xref ref-type="bibr" rid="B50">Reimers et al., 1990</xref>; <xref ref-type="bibr" rid="B59">Thunell et al., 1995</xref>), it is reasonable to assume that geochemically informative peaks should inherit similar spatial patterns. Based on this assumption, a geochemical-context-based peak picking method was employed in the workflow as a supplementary approach to peak prominence filtering.</p>
<p>Concretely, ion images were first generated using normalized ion intensity maps across the sediment slide and were then preprocessed using a similar approach, as described in <xref ref-type="bibr" rid="B71">Wijetunge et al. (2015)</xref>. In brief, each ion image was first rescaled to include all intensities that fall between the 2nd and 98th percentiles to remove hotpots and was subsequently smoothed by a mean filter with a 3 <inline-formula id="inf20">
<mml:math id="m20">
<mml:mo>&#xd7;</mml:mo>
</mml:math>
</inline-formula> 3 neighborhood (raster) to reduce technical artifacts. All intensity values excluding 0 in each image were then quantized to 11 integer intensity levels based on the respective intensity histogram. The resulting ion images had 12 shades of gray, in which 0 denotes the absence of the peak, while 1&#x2014;11 denote increasing peak intensities. The X-ray photograph of the uppermost &#x223c;10&#xa0;cm sediment slide exhibits the characteristic varved structure of SBB sediments, and it was thus targeted for spatial similarity comparison. Instead of computing pixel-to-pixel correlations or the GLCM scores proposed in <xref ref-type="bibr" rid="B71">Wijetunge et al. (2015)</xref>, the features extracted using GLCM from the ion images and X-ray photographs were compared (<xref ref-type="bibr" rid="B23">Haralick et al., 1973</xref>). We used the approach implemented in the python package scikit-image (<xref ref-type="bibr" rid="B61">van der Walt et al., 2014</xref>) to calculate GLCM features, including contrast, dissimilarity, correlation, energy, and homogeneity, for the quantized ion images and X-ray photographs in 0&#xb0;, 15&#xb0;, 45&#xb0;, 90&#xb0;, 105&#xb0;, 135&#xb0;, and 270&#xb0; angles, with pixel pair distance offsets from 1&#x2014;5. The output for each image was a 175-dimension GLCM feature vector. Principal component analysis (PCA) was performed on the standardized GLCM feature vectors to visualize their variability in GLCM features. The cosine similarities between the GLCM feature vector of ion images and those of X-ray photographs were computed for ranking the ion images, and the cutoff for peak picking was determined by manually examining every fifth percentile of the ranked ion images.</p>
</sec>
</sec>
<sec id="s2-2-2">
<title>2.2.2 Data mining with NMF</title>
<sec id="s2-2-2-1">
<title>2.2.2.1 Spatial molecular signatures discovered with repeated NMF</title>
<p>The MSI dataset <inline-formula id="inf21">
<mml:math id="m21">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula> after data cleaning is a <inline-formula id="inf22">
<mml:math id="m22">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xd7;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> matrix that has <inline-formula id="inf23">
<mml:math id="m23">
<mml:mi>m</mml:mi>
</mml:math>
</inline-formula> spectra, each of which has <inline-formula id="inf24">
<mml:math id="m24">
<mml:mi>n</mml:mi>
</mml:math>
</inline-formula> peaks. We assume molecules that are produced by the same living organisms or related to the same biogeochemical processes should have similar spatial distribution patterns (signatures) in the sediments. Mathematically, assuming <inline-formula id="inf25">
<mml:math id="m25">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula> has a total of <italic>k</italic> unique signatures, we can discover a matrix <inline-formula id="inf26">
<mml:math id="m26">
<mml:mi>W</mml:mi>
</mml:math>
</inline-formula> that denotes these spatial signatures and a matrix <inline-formula id="inf27">
<mml:math id="m27">
<mml:mi>H</mml:mi>
</mml:math>
</inline-formula> that denotes the associated molecule clusters, given the factorization <inline-formula id="inf28">
<mml:math id="m28">
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mo>&#x223c;</mml:mo>
<mml:mi>W</mml:mi>
<mml:mi>H</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> (<xref ref-type="bibr" rid="B9">Brunet et al., 2004</xref>). The resulting matrices <inline-formula id="inf29">
<mml:math id="m29">
<mml:mi>W</mml:mi>
</mml:math>
</inline-formula> and <inline-formula id="inf30">
<mml:math id="m30">
<mml:mi>H</mml:mi>
</mml:math>
</inline-formula> have a dimension of <inline-formula id="inf31">
<mml:math id="m31">
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xd7;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf32">
<mml:math id="m32">
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#xa0;</mml:mo>
<mml:mo>&#xd7;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, respectively, and both contain only non-negative entries (i.e., NMF). The NMF algorithm implemented in the python package scikit-learn (<xref ref-type="bibr" rid="B45">Pedregosa et al., 2011</xref>) was employed in this study.</p>
<p>Since NMF does not necessarily converge to the same solution over multiple runs, its rank <inline-formula id="inf33">
<mml:math id="m33">
<mml:mi>k</mml:mi>
</mml:math>
</inline-formula>, i.e., the number of clusters/signatures, needs to be properly estimated in order to get the most stable results. Therefore, we factorized <inline-formula id="inf34">
<mml:math id="m34">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula> 30 times with random initial conditions at each rank between 3&#x2014;20 and computed the corresponding consensus matrix <inline-formula id="inf79">
<mml:math id="m79">
<mml:mrow>
<mml:mover accent="true">
<mml:mi mathvariant="normal">C</mml:mi>
<mml:mo>&#x304;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula>, following the approach described in <xref ref-type="bibr" rid="B9">Brunet et al. (2004)</xref>. The stability of <inline-formula id="inf80">
<mml:math id="m80">
<mml:mrow>
<mml:mover accent="true">
<mml:mi mathvariant="normal">C</mml:mi>
<mml:mo>&#x304;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> denotes how strong the clustering of <inline-formula id="inf35">
<mml:math id="m35">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula> into <italic>k</italic> classes is and can be measured by the cophenetic correlation coefficient (<xref ref-type="bibr" rid="B9">Brunet et al., 2004</xref>) and dispersion coefficient (<xref ref-type="bibr" rid="B34">Kim and Park, 2007</xref>), in both of which 1 denotes the most stable clustering, while 0 denotes the least stable clustering. In addition, given the apparent fact that sedimentary MSI datasets are not as well-structured as datasets obtained from biological tissues, similar <inline-formula id="inf36">
<mml:math id="m36">
<mml:mi>W</mml:mi>
</mml:math>
</inline-formula> and <inline-formula id="inf37">
<mml:math id="m37">
<mml:mi>H</mml:mi>
</mml:math>
</inline-formula> derived over multiple NMF runs were accumulated to highlight the stable signatures. In other words, for the <inline-formula id="inf38">
<mml:math id="m38">
<mml:mrow>
<mml:msub>
<mml:mi>i</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> spatial signature discovered in the <inline-formula id="inf39">
<mml:math id="m39">
<mml:mrow>
<mml:msub>
<mml:mi>j</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> NMF runs <inline-formula id="inf40">
<mml:math id="m40">
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf41">
<mml:math id="m41">
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, if they were strongly correlated (Pearson&#x2019;s r &#x3e; 0.9) with signatures <inline-formula id="inf42">
<mml:math id="m42">
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf43">
<mml:math id="m43">
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> (<inline-formula id="inf44">
<mml:math id="m44">
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x3c;</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> discovered in previous NMF runs, <inline-formula id="inf45">
<mml:math id="m45">
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf46">
<mml:math id="m46">
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> were, respectively, added onto <inline-formula id="inf47">
<mml:math id="m47">
<mml:mrow>
<mml:msub>
<mml:mi>W</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> and <inline-formula id="inf48">
<mml:math id="m48">
<mml:mrow>
<mml:msub>
<mml:mi>H</mml:mi>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>In addition, a co-occurrence molecular network was constructed based on the consensus matrix C&#x304;. Nodes in the network represent the column and row indices in C&#x304;, and the thickness of edges between the nodes denotes the corresponding entries in C&#x304;.</p>
</sec>
<sec id="s2-2-2-2">
<title>2.2.2.2 Linking quarterly averaged signatures with environmental parameters</title>
<p>Each obtained signature is a matrix that has the same dimension as the ion images in the spectra, and its entry <inline-formula id="inf49">
<mml:math id="m49">
<mml:mrow>
<mml:msub>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> denotes the abundance (median normalized peak intensity) of the signature at the location <inline-formula id="inf50">
<mml:math id="m50">
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> on the sediment slide. In this step, in order to find the geochemical implications and the potential drivers of these mathematically derived signatures, we converted them to quarterly averaged time series using the same approach described in <xref ref-type="bibr" rid="B3">Alfken et al. (2021)</xref>, including correcting tilted laminae, and transforming the spot coordinates to depth and age using the tie points of the age model. The age model of SPR0901-05BC was established by varve counting and additional identification of specific marker layers on the basis of the X-ray image, yielding an accuracy of &#xb1;1&#xa0;year for the data reported in this study (<xref ref-type="bibr" rid="B52">Schimmelmann &#x26; Lange, 1996</xref>). The resulting temporal sequences were compared with seasonally measured water column data, including temperature, salinity, oxygen concentrations, and nutrient concentrations, at the CalCOFI (California Cooperative Oceanic Fisheries Investigation) station 81.8 46.9 (<xref ref-type="bibr" rid="B12">CalCOFI, 2018</xref>), situated in the center of the SBB and in close proximity to the core location, using dynamic time warping (DTW) constrained by a maximum shift of 12&#xa0;months. The water column data were treated by the same approach described in <xref ref-type="bibr" rid="B4">Alfken et al. (2020)</xref>. The rationale behind the use of DTW is that it takes varying amounts of time for the deposition of biomarkers into sediments. For example, biogenic materials usually deposit rapidly, while most terrigenous materials usually arrive at the topmost sediment in the center of the SBB basin within 1&#xa0;year (<xref ref-type="bibr" rid="B52">Schimmelmann and Lange, 1996</xref>). DTW is particularly useful in measuring the similarity between time series that may have varying lags (<xref ref-type="bibr" rid="B75">Zhang et al., 2011</xref>; <xref ref-type="bibr" rid="B70">Wannes et al., 2022</xref>). Furthermore, although the terms &#x201c;positive&#x201d; and &#x201c;negative&#x201d; correlation are used in the section below to characterize the link between signatures and environmental variables, they have no statistical significance. In fact, the words imply that the temporal sequences of quarterly averaged signatures mirror the temporal sequences and inverted temporal sequences of environmental variables, respectively.</p>
</sec>
</sec>
</sec>
</sec>
<sec sec-type="results|discussion" id="s3">
<title>3 Results and discussion</title>
<p>We applied the data cleaning and data mining workflow described earlier to MSI datasets A and B obtained from the uppermost &#x223c;10&#xa0;cm of the SBB box core SPR0901-05BC. In the following part, we report the output of the workflow at each step, with emphasis on the accuracy of reference <italic>m/z</italic> ratios detected by bin-wise KDE, the performance of the geochemical context-based filter, and the geochemical implications of extracted molecular signatures. In addition, the potential of supervised learning based on these untargeted data mining results for molecular proxy discovery is also discussed.</p>
<sec id="s3-1">
<title>3.1 Peak alignment with bin-wise KDE</title>
<p>In this study, since peak prominence filtering was further supplemented by a geochemical-context-based filter, and data reduction has already been applied to both MSI datasets employed to remove peaks with low signal-to-noise ratios (<xref ref-type="bibr" rid="B4">Alfken et al., 2020</xref>, <xref ref-type="bibr" rid="B3">Alfken et al., 2021</xref>), we chose a low <inline-formula id="inf51">
<mml:math id="m51">
<mml:mrow>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> of 0.1 that favored the detection of more reference peaks for both datasets. As a result, 1472 and 1724 reference peaks were detected in A and B, respectively. After aligning spectra onto these reference peaks, the TIC of the resulting spectra accounted for 70.70% and 51.24% of the TIC of original spectra A and B, respectively. TIC recovery percentages per laser spot are shown in <xref ref-type="sec" rid="s8">Supplementary Figures S2A,B</xref>, and they were mostly homogenous across the sediment slides. We examined the areas where the TIC recovery percentages were specifically low and found that most of them are in fact fissures on the sediment slides. Nevertheless, in order to avoid possible bias introduced by the differences in TIC recovery percentages, peak intensities of each spectrum were normalized by the median peak intensity of the spectrum. <xref ref-type="sec" rid="s8">Supplementary Figures S2C,D</xref> shows the mean absolute mass drifts for each reference peak in datasets A and B. Most reference peaks detected in A have mean absolute mass drifts between 0 and 4&#xa0;ppm, and the reference peaks detected in B have a slightly larger mean absolute mass drift between 1 and 5&#xa0;ppm.</p>
<p>In addition, differences between the theoretical mass of some biomarkers that are detectable by MSI, as reported by <xref ref-type="bibr" rid="B74">W&#xf6;rmer et al. (2019)</xref>, and the mass of the nearest reference peaks detected in A and B are compared in <xref ref-type="sec" rid="s8">Supplementary Table S2</xref>. The 18 compounds listed were detected with absolute mass deviations ranging from 0.00 to 2.67&#xa0;ppm, with an average of 0.84&#xa0;ppm. This suggests that reference <italic>m/z</italic> ratios estimated by bin-wise KDE provide a reliable approximation of the theoretical mass and can thus be used for assessing possible chemical formulas, although the precise structures cannot be determined due to the complexity of spectra without prior separation and the general difficulty of obtaining spatially resolved MS/MS spectra in an untargeted way (<xref ref-type="bibr" rid="B2">Alexandrov, 2020</xref>).</p>
</sec>
<sec id="s3-2">
<title>3.2 Peak picking with GLCM features</title>
<p>The GLCM features computed for the ion images of the reference peaks picked by bin-wise KDE and peak prominence filtering are deposited in Metabolights (<xref ref-type="bibr" rid="B80">Haug et al., 2020</xref>). The reference peaks were ranked by the similarities of their GLCM features to the X-ray photograph of the sediment slide (i.e., geochemical-context-based peak picking). We examined the ion images at every fifth percentile (<xref ref-type="sec" rid="s8">Supplementary Figure S3</xref>) from the top of the ranked lists to determine the cut-off points for peak picking and found that the ion images at the upper 20th percentile in datasets A and B begin to show relatively uniform spatial distribution on the sediment slides. All other lower-ranking peaks below the upper 20th percentile were removed, resulting in a total of 293 and 323 peaks (i.e., the top 20% of the ranked peaks) to be picked in datasets A and B, respectively.</p>
<p>To visualize the performance of geochemical-context-based peak picking, variabilities in GLCM features among all reference peaks detected by bin-wise KDE and peak-prominence filtering are shown in the biplots (<xref ref-type="fig" rid="F2">Figure 2</xref>) obtained by PCA of the standardized GLCM features&#x2019; table. The definition and the computation of the five GLCM properties were described in detail in <xref ref-type="bibr" rid="B22">Hall-Beyer (2017)</xref>. Briefly, <italic>correlation</italic> measures how correlated the specified pixel pairs are, and constant pixel pairs have no contribution to the correlation of the whole ion image. Both <italic>contrast</italic> and <italic>dissimilarity</italic> measure how different the specified pixel pairs are, and both <italic>energy</italic> and <italic>homogeneity</italic> measure how uniform the pixel intensities are over the whole ion image. The first two principal components (PC1 and PC2) explain 98.05% and 98.63% of total GLCM features&#x2019; variation in A and B, respectively. On the PC1-PC2 plane, the reference peaks detected in both datasets exhibit U-shaped distributions, and the reference peaks on the arm in the direction of <italic>correlation</italic> are closer to the X-ray photograph than the others. In other words, the reference peaks on the arm in the direction of <italic>correlation</italic> are geochemically more informative than the others. Most variations in GLCM features occur among varying GLCM properties rather than varying pixel pairs at different angles and distances. <xref ref-type="fig" rid="F2">Figure 2</xref> suggests that the ion images of the most informative peaks are characterized by high <italic>correlation</italic> but low <italic>energy</italic>, <italic>homogeneity</italic>, <italic>dissimilarity</italic>, and <italic>contrast</italic>. This is to be expected since ion images that show laminations should have many co-occurring light pixels (i.e., high intensities) and co-occurring dark pixels (i.e., low intensities). In comparison, the ion images of the least informative peaks are characterized by low <italic>correlation</italic> and either high <italic>contrast</italic> and <italic>dissimilarity</italic> or high <italic>homogeneity</italic> and <italic>energy</italic>, which suggests that these ion images are either rather scattered, i.e., with a large number of zeros, rather uniform, i.e., of low pixel intensity variations, or have too large local pixel intensity variations to show laminae.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Biplots obtained by principal component analysis (PCA) of the standardized GLCM features&#x2019; table illustrate variabilities in GLCM features extracted among reference peaks in datasets (A) and (B). The colors of the dots denote their distance from the X-ray photograph of the sediment slide. GLCM features are denoted by arrows in the biplots, and the arrows with the same color denote the same GLCM property derived from different pixel pairs. <italic>cont</italic> &#x3d; <italic>contrast</italic>; <italic>homo</italic> &#x3d; <italic>homogeneity</italic>; <italic>corr</italic> &#x3d; <italic>correlation</italic>; <italic>diss</italic> &#x3d; <italic>dissimilarity</italic>; <italic>ener</italic> &#x3d; <italic>energy</italic>.</p>
</caption>
<graphic xlink:href="feart-10-931157-g002.tif"/>
</fig>
<p>The geochemical-context-based peak picking allows us to select peaks that are potentially linked to biogeochemical processes without prior knowledge of their identity, which is of key importance to untargeted data mining. <xref ref-type="sec" rid="s8">Supplementary Figure S4</xref> shows the ion images of the five most informative peaks (most similar to the X-ray photograph) and the five least informative ones (least similar to the X-ray photograph) in datasets A and B. In dataset A, the two most informative peaks have <italic>m/z</italic> of 557.252 and 558.255. They likely represent pyropheophorbide <italic>a</italic> and its <sup>13</sup>C-isotopologue as Na<sup>&#x2b;</sup>-adducts, respectively. The C<sub>37</sub> di-unsaturated alkenone, which was originally the targeted compound of dataset A in <xref ref-type="bibr" rid="B4">Alfken et al. (2020)</xref>, is also among the top ranked molecules (fourth, <italic>m/z</italic> &#x3d; 553.532) and serves as proof of concept for the ability to detect meaningful compounds with this workflow. In dataset B, the spatial patterns of the five most informative peaks show less laminae-like structure than those in dataset A. The two top ranked molecules can be attributed to chemical formulas C<sub>29</sub>H<sub>48</sub>O(Na<sup>&#x2b;</sup>) and C<sub>29</sub>H<sub>50</sub>O(Na<sup>&#x2b;</sup>) and likely represent sterols, although the specific structures are yet to be determined. Notably, the third&#x2013;fifth ion images are characterized by the presence of hotspots in addition to the laminae; these hotspots might represent lacustrine debris. In contrast to these most informative molecules, the ion images of the least informative peaks in datasets A and B are rather scattered in the sediment, which agrees with the biplots in <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<p>In addition, <xref ref-type="sec" rid="s8">Supplementary Table S2</xref> shows the rank of some MSI-detectable putatively identified biomarkers after ranking all reference peaks from most informative to least informative. Most of these biomarkers are ranked relatively high. Notably, C<sub>37</sub> and C<sub>38</sub> alkenones are closely ranked with each other, indicating that they have similar spatial patterns. Interestingly, the two di-unsaturated ones and the two tri-unsaturated ones are, respectively, close to each other, indicating that alkenones with the same unsaturation degree have more similar spatial patterns than those with the same carbon chain length. This agrees with the fact that the degree of unsaturation is susceptible to changing environmental conditions, i.e., SST (<xref ref-type="bibr" rid="B66">Volkman et al., 1980a</xref>, <xref ref-type="bibr" rid="B67">Volkman et al., 1980b</xref>; <xref ref-type="bibr" rid="B37">Marlowe et al., 1984a</xref>, <xref ref-type="bibr" rid="B38">Marlowe et al., 1984b</xref>; <xref ref-type="bibr" rid="B8">Brassell et al., 1986</xref>). Such consistency demonstrates the power of MSI in revealing the fine spatial patterns of biomarkers in sediments and the capability of geochemical-context-based peak picking in selecting geochemically informative peaks from sedimentary MSI datasets without any prior knowledge of their corresponding molecules. Moreover, the application of geochemical-context-based peak picking is not limited to the MSI datasets obtained from varved SBB sediments as it should also be possible to target geological structures other than sediment laminae.</p>
</sec>
<sec id="s3-3">
<title>3.3 Spatial signature identification with NMF</title>
<p>The data cleaning procedures mentioned previously extracted a <inline-formula id="inf52">
<mml:math id="m52">
<mml:mrow>
<mml:mn>18625</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>293</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> data matrix (number of spectra <inline-formula id="inf53">
<mml:math id="m53">
<mml:mo>&#xd7;</mml:mo>
</mml:math>
</inline-formula> number of peaks) from dataset A and a <inline-formula id="inf54">
<mml:math id="m54">
<mml:mrow>
<mml:mn>18963</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>323</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> data matrix from dataset B, which constitutes an unimaginable information density for a &#x223c;10&#xa0;cm long sediment section. Repeated NMF was employed to factorize the two resulting matrices in order to extract unique molecular signatures with the associated molecule clusters, in analogy to its most popular use in bioinformatics, i.e., clustering gene expression data and finding the most representative genes of the clusters (<xref ref-type="bibr" rid="B9">Brunet et al., 2004</xref>).</p>
<sec id="s3-3-1">
<title>3.3.1 Stable spatial signatures and their environmental drivers</title>
<p>The performances of NMF at varying ranks between 3 and 20 are compared in <xref ref-type="sec" rid="s8">Supplementary Text S2</xref>. As a result, an NMF rank of 10 was chosen for both dataset A and dataset B, resulting in the extraction of in total 17 and 14 distinct spatial signatures, respectively, over 30 NMF runs (<xref ref-type="sec" rid="s8">Supplementary Figures S6, S7</xref>). From these, 5 and 10 spatial signatures (<xref ref-type="fig" rid="F3">Figure 3</xref>) are stable because they were reached in at least 27 out of 30 (i.e., &#x3e;90 percent) NMF runs. The unstable signatures likely represent the local outliers among these stable signatures, but their differences are too trivial to be stably separated. For example, in dataset B, the unstable signature B-NMF12 is in fact similar to the stable signature B-NMF6. Although they are not well-separated from the rank selected in this study, they are in fact separated from each other in the consensus-matrix-based molecular network.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Stable spatial signatures discovered in datasets A <bold>(A)</bold> and B <bold>(B)</bold>. The left figures show the ion images of spatial signatures, i.e., the columns of <inline-formula id="inf55">
<mml:math id="m55">
<mml:mi>W</mml:mi>
</mml:math>
</inline-formula>, the weighted average of all ion images in the cluster, with a color scale in &#x201c;viridis&#x201d;, where the lighter colors denote a higher abundance of the signatures. The right figures show the pseudo mass spectrum constructed using the contributions of molecules, i.e., in which peak intensities denote the entries in the rows <inline-formula id="inf56">
<mml:math id="m56">
<mml:mi>H</mml:mi>
</mml:math>
</inline-formula>.</p>
</caption>
<graphic xlink:href="feart-10-931157-g003.tif"/>
</fig>
<p>In the following, we compared the spatial signatures with the varved structures of the sediment revealed by its X-ray photograph, and their quarterly averages after applying an age model with the environmental parameters determined in the water column to show that these mathematically derived signatures are indeed relevant to biogeochemical processes. However, one should keep in mind that the inferred ecological and geochemical implications hereafter are speculative and could be oversimplified due to the numerous stereochemical possibilities of the molecular formulas detected, along with other factors. The spatial signatures are coded in the following way: assuming dataset <inline-formula id="inf57">
<mml:math id="m57">
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mi>A</mml:mi>
<mml:mo>,</mml:mo>
<mml:mo>&#xa0;</mml:mo>
<mml:mi>B</mml:mi>
<mml:mo>}</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, given <inline-formula id="inf58">
<mml:math id="m58">
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mo>&#x223c;</mml:mo>
<mml:mi>W</mml:mi>
<mml:mi>H</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> for dataset <inline-formula id="inf59">
<mml:math id="m59">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula>, we refer to the columns in <inline-formula id="inf60">
<mml:math id="m60">
<mml:mrow>
<mml:mi>W</mml:mi>
<mml:mo>
</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> as signatures and the <inline-formula id="inf61">
<mml:math id="m61">
<mml:mrow>
<mml:msub>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> column as a signature <inline-formula id="inf62">
<mml:math id="m62">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula>-NMF<italic>k</italic>, whose ion images are denoted by its entries. The contribution of each molecule to a signature <inline-formula id="inf63">
<mml:math id="m63">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula>-NMF<italic>k</italic> is denoted by the entries in the <inline-formula id="inf64">
<mml:math id="m64">
<mml:mrow>
<mml:msub>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> row of <inline-formula id="inf65">
<mml:math id="m65">
<mml:mi>H</mml:mi>
</mml:math>
</inline-formula>. Although the unstable signatures are worth investigating, for example, B-NMF10 and B-NMF12 show almost complementary ion images (<xref ref-type="sec" rid="s8">Supplementary Figure S7</xref>) that likely indicate distinct sources, only the stable signatures are discussed in the following for succinctly demonstrating the capability of MSI in conjunction with untargeted data mining in extracting biogeochemically relevant molecular signatures from the sediments. For stable signature <inline-formula id="inf66">
<mml:math id="m66">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula>-NMF<italic>k</italic>, we show<list list-type="simple">
<list-item>
<p>(1) its ion image that represents the cluster center (the weighted average of all ion images in the cluster; <xref ref-type="fig" rid="F3">Figure 3</xref>, left), constructed using the entries in the <inline-formula id="inf67">
<mml:math id="m67">
<mml:mrow>
<mml:msub>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> column of <inline-formula id="inf68">
<mml:math id="m68">
<mml:mi>W</mml:mi>
</mml:math>
</inline-formula>;</p>
</list-item>
<list-item>
<p>(2) the pseudo mass spectrum of each molecule&#x2019;s contribution (<xref ref-type="fig" rid="F3">Figure 3</xref>, right), i.e., the peak intensities denote the entries in the <inline-formula id="inf69">
<mml:math id="m69">
<mml:mrow>
<mml:msub>
<mml:mi>k</mml:mi>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi>h</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> row of <inline-formula id="inf70">
<mml:math id="m70">
<mml:mi>H</mml:mi>
</mml:math>
</inline-formula>;</p>
</list-item>
<list-item>
<p>(3) the associated co-occurrence molecular network constructed using the consensus matrix over multiple NMF runs (<xref ref-type="fig" rid="F4">Figure 4</xref>);</p>
</list-item>
<list-item>
<p>(4) the DTW distance between the quarterly averaged abundance of <inline-formula id="inf71">
<mml:math id="m71">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula>-NMF<italic>k</italic> and environmental parameters (<xref ref-type="fig" rid="F5">Figure 5A</xref>), where closer distance suggests a stronger positive correlation, and the DTW distance between the quarterly averaged abundance of <inline-formula id="inf72">
<mml:math id="m72">
<mml:mi>V</mml:mi>
</mml:math>
</inline-formula>-NMF<italic>k</italic> multiplied by <inline-formula id="inf73">
<mml:math id="m73">
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> (i.e., flipping the temporal sequence upside down) and environmental parameters (<xref ref-type="fig" rid="F5">Figure 5B</xref>), where closer distance suggests stronger negative correlation. Examples of selected best matched signatures and environmental parameters are displayed in <xref ref-type="sec" rid="s8">Supplementary Figure S8</xref>.</p>
</list-item>
</list>
</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Examples of spatially co-occurring molecular networks discovered in datasets (A) and (B). Each node represents a molecule, with the labels being the name of biomarkers, possible chemical formulas, or its <italic>m/z</italic> ratios. The size of the node denotes the average intensity of the molecule in the dataset. Isotopologues are represented by the green nodes. Edges (lines between the nodes) denote that the two connected nodes are classified into the same cluster for at least 90% of total NMF runs, and thicker edges denote the two connected nodes are clustered together more frequently.</p>
</caption>
<graphic xlink:href="feart-10-931157-g004.tif"/>
</fig>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>
<bold>(A)</bold> DTW distance between the quarterly averaged abundance of the signatures and the environmental parameters integrated over varying depths of water parcel (above the dashed lines: 0&#x2014;50&#xa0;m in 10&#xa0;m increments; below the dashed lines: 50 m&#x2014;bottom in 100&#xa0;m increments), where smaller distance indicates stronger positive correlation; <bold>(B)</bold> DTW distance between the quarterly averaged abundance of the signatures multiplied by <inline-formula id="inf74">
<mml:math id="m74">
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> (i.e., flipping the temporal sequence upside down) and the environmental parameters integrated over varying depths of water parcel (from surface to bottom), where smaller distance indicates stronger negative correlation.</p>
</caption>
<graphic xlink:href="feart-10-931157-g005.tif"/>
</fig>
<p>Among all signatures discovered in the two datasets, A-NMF3 shows the most distinct laminations (<xref ref-type="fig" rid="F3">Figure 3</xref>). <xref ref-type="fig" rid="F6">Figure 6</xref> further shows that the light laminae of A-NMF3 match well with the light diatom ooze laminae of the SBB sediments, which result from enhanced primary production (<xref ref-type="bibr" rid="B52">Schimmelmann and Lange, 1996</xref>; <xref ref-type="bibr" rid="B10">Bull et al., 2000</xref>). The corresponding molecular network in <xref ref-type="fig" rid="F4">Figure 4</xref> indicates that A-NMF3 consists of chlorin-like compounds, and the most abundant molecule is putatively pyropheophorbide <italic>a</italic>. We speculated that pyropheophorbide <italic>a</italic> and the other chlorin-like compounds in A-NMF3 derive from zooplankton and zoobenthos grazing on diatoms (<xref ref-type="bibr" rid="B25">Head et al., 1994</xref>; <xref ref-type="bibr" rid="B57">Szymczak-&#x17b;y&#x142;a et al., 2011</xref>), and the distinct light/dark laminae of A-NMF3 may result from the opportunistic bloom-and-bust life cycle of diatoms (<xref ref-type="bibr" rid="B11">Butterfield, 1997</xref>). In fact, by comparing the quarterly averaged abundance of A-NMF3 to environmental parameters (<xref ref-type="fig" rid="F5">Figure 5</xref>), we found that the abundance of A-NMF3 is positively correlated with salinity and nutrient concentrations including the concentrations of nitrite, phosphate, and nitrate, while negatively correlated with bottom water oxygen concentrations. The positive correlation could indicate the increase in diatom populations and consequently A-NMF3 associated with increases in nutrient supply (<xref ref-type="bibr" rid="B36">Lange et al., 1997</xref>), while the negative correlation could suggest the depletion of oxygen in bottom waters, resulting from the subsequent organic matter remineralization (<xref ref-type="bibr" rid="B50">Reimers et al., 1990</xref>; <xref ref-type="bibr" rid="B3">Alfken et al., 2021</xref>).</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Comparison between selected stable spatial signatures and the X-ray photograph of the sediments. The color scale of the ion images is in &#x201c;viridis&#x201d;, where the lighter colors denote higher abundance of the signatures.</p>
</caption>
<graphic xlink:href="feart-10-931157-g006.tif"/>
</fig>
<p>A-NMF6 consists of alkenones produced by the ubiquitous coccolithophores (<xref ref-type="bibr" rid="B67">Volkman et al., 1980b</xref>, <xref ref-type="bibr" rid="B66">1980a</xref>; <xref ref-type="bibr" rid="B37">Marlowe et al., 1984a</xref>, <xref ref-type="bibr" rid="B38">Marlowe et al., 1984b</xref>), i.e., C<sub>37</sub>- and C<sub>38</sub>-, di- and tri-unsaturated alkenones, together with their corresponding isotopologues and a compound with a tentative chemical formula of C<sub>37</sub>H<sub>70</sub>O<sub>2</sub>, possibly a coccolith derived di-unsaturated alkenoate, e.g., methyl hexatriacontadienoate (<xref ref-type="bibr" rid="B37">Marlowe et al., 1984a</xref>; <xref ref-type="fig" rid="F4">Figure 4</xref>). Compared to A-NMF3, A-NMF6 shows less distinct laminations (<xref ref-type="fig" rid="F6">Figure 6</xref>), and its light laminae do not match very well with each other. This agrees with previous observations that coccolithophores usually do not bloom along with diatoms (<xref ref-type="bibr" rid="B76">Zhao et al., 2000</xref>). In addition, the quarterly averaged A-NMF6 has a slightly different correlation with environmental parameters compared to A-NMF3 (<xref ref-type="fig" rid="F5">Figure 5</xref>). It is positively correlated with nutrient concentrations, including the concentrations of silicate, phosphate, and nitrate, while negatively correlated with the temperature of the water column. Moreover, although both A-NMF6 and A-NMF3 are negatively correlated with oxygen concentrations in the water column, A-NMF6 is best matched with the oxygen concentrations in the shallower water column, while A-NMF3 is best matched with the oxygen concentrations in the bottom water column.</p>
<p>Although B-NMF6 also shows distinct laminations in its ion image, in contrast to A-NMF3 and A-NMF6, it is more likely to be a terrigenous signal as its light laminae match with the dark laminae of the sediment (<xref ref-type="fig" rid="F6">Figure 6</xref>) formed by seasonal runoff during heavy winter rains (<xref ref-type="bibr" rid="B52">Schimmelmann and Lange, 1996</xref>; <xref ref-type="bibr" rid="B10">Bull et al., 2000</xref>). MSI also reveals scattered hotspots in B-NMF6 in addition to the laminae couplets, which could indicate the presence of terrigenous particles. The corresponding molecular network of B-NMF2 shows that it consists of steranes and unspecified oxygen-containing compounds (<xref ref-type="fig" rid="F4">Figure 4</xref>). As steranes are not formed from autochthonous organic matter in immature sediments, they are likely derived from eroded source rocks of the Monterey Formation or related oil seeps (<xref ref-type="bibr" rid="B26">Hinrichs et al., 1995</xref>); the oxygen-containing compounds with even-numbered carbons, i.e., C<sub>24</sub>H<sub>48</sub>O<sub>2</sub>, C<sub>26</sub>H<sub>52</sub>O<sub>2</sub>, and C<sub>28</sub>H<sub>56</sub>O<sub>2</sub> could be derived from higher plants (<xref ref-type="bibr" rid="B49">Ratnayake et al., 2005</xref>; <xref ref-type="bibr" rid="B35">Kusch et al., 2010</xref>). Moreover, the quarterly averaged B-NMF6 is positively correlated with the bottom water oxygen concentrations, while negatively correlated to salinity and nutrient concentrations (<xref ref-type="fig" rid="F5">Figure 5</xref>), which is consistent with its assignment to wintertime terrigenous input during periods of weakened upwelling.</p>
<p>The ion image of B-NMF2 is rather complicated: it is ubiquitous in the sediment, while some of its light laminae (more abundant) match quite well with the dark laminae of the sediment (<xref ref-type="fig" rid="F6">Figure 6</xref>). Hierarchical clustering of the most representative compounds in B-NMF2 displayed in <xref ref-type="sec" rid="s8">Supplementary Figure S9</xref> further reveals the internal structure of the molecular cluster, which consists of sterols and compounds that have one to three oxygen atoms and double bond equivalents typically found in steroids. Sterols and steroids are known to be produced by diverse species of eukaryotes such as zoo- and phytoplankton and terrestrial plants (<xref ref-type="bibr" rid="B64">Volkman et al., 1998</xref>; <xref ref-type="bibr" rid="B68">Volkman, 2003</xref>). The mechanism causing the colocalization of these compounds in the sediment deserves further investigation. It is possible that B-NMF2 captures the superimposed signal of steroid-like compounds from varying sources. For example, one of the most abundant molecules in B-NMF2, C<sub>30</sub>H<sub>52</sub>O, can be assigned to dinosterol, which is mainly produced by dinoflagellates (<xref ref-type="bibr" rid="B65">Volkman et al., 1993</xref>), the algae which often blooms during relaxed upwelling (<xref ref-type="bibr" rid="B32">Kennedy and Brassell, 1992</xref>; <xref ref-type="bibr" rid="B53">Smayda and Trainer, 2010</xref>). However, the formula is also consistent with terrigenous biomarkers such as lupanol (<xref ref-type="bibr" rid="B43">Pancost et al., 2002</xref>) or the bacterial biomarker hopanol (<xref ref-type="bibr" rid="B44">Pearson et al., 2001</xref>). Alternatively, B-NMF2 could reflect benthic biogeochemical processes because of its strong positive correlation to bottom water oxygen concentration and strong negative correlation to bottom water salinity and phosphate concentrations (<xref ref-type="fig" rid="F5">Figure 5</xref>).</p>
<p>The other signatures exhibit distinctive spatial patterns as well, although not all chemical formulas of the associated molecules can be assigned with certainty, partly due to the absence of isotopologue peaks. For example, signatures A-NMF5 and B-NMF7 are both characterized by the &#x201c;intrusion&#x201d; structure in the uppermost &#x223c;5&#xa0;cm sediment in addition to the dark and light laminae (<xref ref-type="fig" rid="F3">Figure 3</xref>); such a structure is also visible on the sediment slide. A-NMF7 is omnipresent in the sediment, with distinct laminations only visible in the topmost sediment, and it consists of a large number of molecules, with the most abundant one likely to be bacteriohopanetetrol (<xref ref-type="fig" rid="F4">Figure 4</xref>). Quarterly averaged A-NMF7 is positively correlated with silicate concentration but negatively correlated with temperature and nitrite concentration (<xref ref-type="fig" rid="F5">Figure 5</xref>). A-NMF9 is of low abundance in the topmost surface sediment, and it is associated with three compounds that are apparently homologs, together with their <sup>13</sup>C isotopologues. Notably, A-NMF9 does not show a strong correlation with any environmental parameters. In addition, B-NMF1 is characterized by curved dark/light boundaries in the sediment (<xref ref-type="fig" rid="F3">Figure 3</xref>). We speculated that they are polyethylene glycol contaminants (PEG; H<sup>&#x2b;</sup> and Na<sup>&#x2b;</sup> adducts) because of the characteristic mass difference of 44&#xa0;Da between individual molecules (<xref ref-type="fig" rid="F4">Figure 4</xref>). Moreover, it should be noted that all the signatures discovered in this study spatially overlap with each other to some extent, causing image segmentation to be difficult. This is due to either a blurred zonation of biomarkers or the relatively small mass window analyzed, which limits the number of diagnostic biomarkers.</p>
</sec>
<sec id="s3-3-2">
<title>3.3.2 Potential of supervised learning for novel molecular proxy discovery</title>
<p>The proposed data processing workflow in this study is explorative and mostly unsupervised as except in the geochemical-context-based peak picking step, it only utilizes the information carried by sedimentary MSI datasets themselves. In order to demonstrate the potential of supervised learning approaches for novel molecular proxy discovery on the basis of data mining results, we trained linear models for indicating SST and sediment-water interface redox conditions, using the abundances of co-occurring molecules associated with clusters A-NMF6 (alkenones and derivatives) and B-NMF2 (incl. steroid-like compounds), respectively, based on the assumption that residuals among these co-occurring molecules are not random and result from biogeochemical influences such as changing environmental conditions or selectivity in biogeochemical processes. ElasticNet, a popular regularized linear regression algorithm implemented in the python package scikit-learn (<xref ref-type="bibr" rid="B45">Pedregosa et al., 2011</xref>), was employed to tackle multicollinearity (<xref ref-type="bibr" rid="B21">Gunst and Webster, 1975</xref>; <xref ref-type="bibr" rid="B6">Alin, 2010</xref>). Time-based cross-validation implemented in scikit-learn (<xref ref-type="bibr" rid="B45">Pedregosa et al., 2011</xref>) was employed to reduce overfitting and autocorrelation; whether the derived proxies are applicable in longer time series or at other sites is beyond the scope of this study.</p>
<p>Since it has been established that in dataset A, the <inline-formula id="inf75">
<mml:math id="m75">
<mml:mrow>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mn>37</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>K</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> index has a strong correlation with SST (<xref ref-type="bibr" rid="B4">Alfken et al., 2020</xref>), we first validated the use of ElasticNet on the alkenone cluster A-NMF6 and its relationship to SST. Although A-NMF6 contains four <sup>13</sup>C isotopologues and an alkenoate in addition to the monoisotopic alkenones, the whole cluster is used for training without removing any peaks in order to show the reliability of ElasticNet with redundant variables. <xref ref-type="fig" rid="F7">Figure 7A</xref> compares the SST measured in the water column and the SST indicated by the trained model and the <inline-formula id="inf76">
<mml:math id="m76">
<mml:mrow>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mn>37</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>K</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> index, suggesting that the model trained in this study indicates changes in SST. <xref ref-type="sec" rid="s8">Supplementary Table 3</xref> shows the resulting coefficients assigned to the nine molecules in A-NMF6 after cross-validated ElasticNet training and indicates that the two di-unsaturated alkenones and their isotopologues are positively correlated with SST, while the two tri-unsaturated alkenones and their isotopologues are negatively correlated with SST. This is in agreement with the definition of the <inline-formula id="inf77">
<mml:math id="m77">
<mml:mrow>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mn>37</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>K</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> index proposed in the previous study (<xref ref-type="bibr" rid="B47">Prahl and Wakeham, 1987</xref>). In addition, the putative alkenoate was assigned with a negative coefficient after training, although it weighed less than the two tri-unsaturated alkenones. This may suggest that the molecule is also regulated by coccolithophores to adapt to changing SST, which agrees with the observation by <xref ref-type="bibr" rid="B14">Conte et al. (1998)</xref> that the cellular ratio of alkenoates to alkenones in coccolithophores decreases with increasing water temperature.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>
<bold>(A)</bold> Comparison between the ElasticNet model trained by A-NMF6, SST, and <inline-formula id="inf78">
<mml:math id="m78">
<mml:mrow>
<mml:msubsup>
<mml:mi>U</mml:mi>
<mml:mrow>
<mml:mn>37</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mi>K</mml:mi>
<mml:mo>&#x2032;</mml:mo>
</mml:msup>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:math>
</inline-formula> ratio (<xref ref-type="bibr" rid="B4">Alfken et al., 2020</xref>); <bold>(B)</bold> comparison between the ElasticNet model trained by B-NMF2, bottom water oxygen concentrations, and the C<sub>29</sub> stanol/stenol ratio (<xref ref-type="bibr" rid="B3">Alfken et al., 2021</xref>). In both figures, the vertical axes of the time series are staggered (i.e., adjusted along the vertical axes to avoid overlap) for better visualization.</p>
</caption>
<graphic xlink:href="feart-10-931157-g007.tif"/>
</fig>
<p>B-NMF2 consists of diverse steroid-like compounds with one or more oxygen-containing functional groups, which might be sensitive to redox conditions. We further applied ElasticNet to train a model for indicating bottom water redox conditions using the abundances of co-localized molecules in B-NMF2 against bottom water oxygen concentrations. <xref ref-type="fig" rid="F7">Figure 7B</xref> compares the bottom water oxygen concentrations with the trained model and the redox-sensitive C<sub>29</sub> stanol/stenol ratio. In contrast to the model for indicating SST (cf. in <xref ref-type="sec" rid="s8">Supplementary Table S3</xref>), the resulting coefficients for indicating redox conditions are all equal or greater than 0, with smaller absolute values (<xref ref-type="sec" rid="s8">Supplementary Table S4</xref>). The non-negative coefficients of the compounds suggest that variations in the model could be driven by 1) the relative abundances amongst these steroid-like compounds in B-NMF2 or 2) the relative abundance of these steroid-like compounds as a group compared to all other compounds in the spectrum. Although all non-negative coefficients could make the model less favorable as a paleoenvironmental proxy and more sophisticated nonlinear models are likely required to characterize the link between these compounds and the redox condition, such rather complicated supervised learning approaches are beyond the scope of this study and will be evaluated in our future work. Since standard scaling was applied before model training, the magnitudes of resulting coefficients depend largely on the number of dependent variables. The smaller absolute coefficients assigned in B-NMF2 compared to those assigned in A-NMF6 results from the much larger numbers of variables (molecules) in B-NMF2. Nevertheless, the resulting coefficients agree with the observation in the aforementioned section that B-NMF2 is positively correlated with bottom water oxygen concentrations, and the ElasticNet model, here, further extracted the most redox-related biomarkers from B-NMF2. It is interesting that the top three molecules are all C<sub>30</sub> pentacyclic triterpenoids with varying unsaturation degrees. This may suggest that they are the most recalcitrant components of B-NMF2 in the sediment. However, other factors cannot be ruled out, for example, it is also possible that the three putative C<sub>30</sub> pentacyclic triterpenoids have the strongest contribution from terrestrial input during the winter season as compounds with the same chemical formulas have been reported in higher plants (e.g., <xref ref-type="bibr" rid="B18">Escobedo et al., 2012</xref>; <xref ref-type="bibr" rid="B33">Kennedy, 2012</xref>).</p>
</sec>
</sec>
</sec>
<sec id="s4">
<title>4 Conclusion</title>
<p>This study introduced an untargeted data processing workflow, including data cleaning and untargeted data mining, for sedimentary MSI datasets and evaluated the workflow by re-analyzing two existing MSI datasets obtained from &#x223c;10&#xa0;cm SBB sediment sections. Bin-wise KDE employed for peak detection and alignment extracted more than a thousand peaks from each dataset and achieved an average mass deviation of 0.84&#xa0;ppm between the resulting reference <italic>m/z</italic> and the theoretical <italic>m/z</italic> of 18 established biomarkers detected by MSI in this study. The detected peaks were then evaluated by the peak-prominence filter that measures the sparsity of these peaks in sediments, in conjunction with the geochemical-context-based filter that measures the distance of the ion images of these peaks from the X-ray photograph of the sediment, allowing the selection of hundreds of biogeochemically informative peaks that exhibit spatial patterns reminiscent of the sediment laminae. The subsequent untargeted data mining using repeated NMF further extracted a total of 15 stable molecular signatures from these hundreds of informative peaks. The relevance between these mathematically derived signatures and historical oceanographic data proves the capability of the proposed workflow in extracting biogeochemically relevant molecular signatures from the sedimentary MSI datasets, broadening the number and diversity of available candidate compounds utilizable for molecular stratigraphy. On the basis of these molecular signatures, ElasticNet, a supervised learning algorithm, combined with cross-validation was able to train easy-to-interpret multivariate linear regression models using the residuals among co-occurring molecules against historical oceanographic data for paleoenvironmental and paleoceanographic reconstruction, and it holds great potential for novel biomarker discovery in a top-down manner and unleashing the full power of MSI in the field of organic geochemistry.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: <ext-link ext-link-type="uri" xlink:href="https://www.ebi.ac.uk/metabolights/MTBLS4609">https://www.ebi.ac.uk/metabolights/MTBLS4609</ext-link>.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>WL, SA, LW, JL, and KH contributed to the conception and design of the study. SA collected the samples and performed the experiment. WL compiled the data processing workflow and wrote the corresponding codes. WL wrote the first draft of the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version.</p>
</sec>
<ack>
<p>This research was funded by the European Research Council under the European Union&#x2019;s Horizon 2020 Research and Innovation Programme (grant agreement No. 670115 ZOOMECULAR; PI Kai-Uwe Hinrichs). Additional support has been provided by Germany&#x2019;s Excellence Strategy&#x2014;EXC-2077&#x2014;390741603. The authors also thank the developers of Python, scikit-learn, scipy, and all other open-source libraries employed in the data processing workflow in this study.</p>
</ack>
<sec sec-type="COI-statement" id="s7">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s8">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s9">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/feart.2022.931157/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/feart.2022.931157/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="Table1.docx" id="SM1" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="DataSheet1.docx" id="SM2" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Image1.pdf" id="SM3" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alexandrov</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>MALDI imaging mass spectrometry: Statistical data analysis and current computational challenges</article-title>. <source>BMC Bioinforma.</source> <volume>13</volume>, <fpage>S11</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-13-S16-S11</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/23176142/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1186/1471-2105-13-S16-S11">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=MALDI+imaging+mass+spectrometry:+Statistical+data+analysis+and+current+computational+challenges&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alexandrov</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Spatial metabolomics and imaging mass spectrometry in the age of artificial intelligence</article-title>. <source>Annu. Rev. Biomed. Data Sci.</source> <volume>3</volume>, <fpage>61</fpage>&#x2013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-biodatasci-011420-031537</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/34056560/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1146/annurev-biodatasci-011420-031537">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Spatial+metabolomics+and+imaging+mass+spectrometry+in+the+age+of+artificial+intelligence&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alfken</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>W&#xf6;rmer</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Lipp</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Napier</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Elvert</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Wendt</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Disrupted coherence between upwelling strength and redox conditions reflects source water change in Santa Barbara Basin during the 20th century</article-title>. <source>Paleoceanogr. Paleoclimatol.</source> <volume>36</volume>, <fpage>e2021PA004354</fpage>. <pub-id pub-id-type="doi">10.1029/2021PA004354</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1029/2021PA004354">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Disrupted+coherence+between+upwelling+strength+and+redox+conditions+reflects+source+water+change+in+Santa+Barbara+Basin+during+the+20th+century&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alfken</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>W&#xf6;rmer</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Lipp</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Wendt</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Schimmelmann</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Hinrichs</surname>
<given-names>K.-U.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Mechanistic insights into molecular proxies through comparison of subannually resolved sedimentary records with instrumental water column data in the Santa Barbara Basin, Southern California</article-title>. <source>Paleoceanogr. Paleoclimatol.</source> <volume>35</volume>, <fpage>e2020PA004076</fpage>. <pub-id pub-id-type="doi">10.1029/2020PA004076</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1029/2020PA004076">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Mechanistic+insights+into+molecular+proxies+through+comparison+of+subannually+resolved+sedimentary+records+with+instrumental+water+column+data+in+the+Santa+Barbara+Basin,+Southern+California&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alfken</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>W&#xf6;rmer</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Lipp</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Wendt</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Taubner</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Schimmelmann</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Micrometer scale imaging of sedimentary climate archives &#x2013; sample preparation for combined elemental and lipid biomarker analysis</article-title>. <source>Org. Geochem.</source> <volume>127</volume>, <fpage>81</fpage>&#x2013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.1016/j.orggeochem.2018.11.002</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.orggeochem.2018.11.002">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Micrometer+scale+imaging+of+sedimentary+climate+archives+&#x2013;+sample+preparation+for+combined+elemental+and+lipid+biomarker+analysis&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Alin</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Multicollinearity</article-title>. <source>WIREs Comput. Stat.</source> <volume>2</volume>, <fpage>370</fpage>&#x2013;<lpage>374</lpage>. <pub-id pub-id-type="doi">10.1002/wics.84</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/wics.84">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Multicollinearity&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Botev</surname>
<given-names>Z. I.</given-names>
</name>
<name>
<surname>Grotowski</surname>
<given-names>J. F.</given-names>
</name>
<name>
<surname>Kroese</surname>
<given-names>D. P.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Kernel density estimation via diffusion</article-title>. <source>Ann. Stat.</source> <volume>38</volume>. <pub-id pub-id-type="doi">10.1214/10-AOS799</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1214/10-AOS799">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Kernel+density+estimation+via+diffusion&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brassell</surname>
<given-names>S. C.</given-names>
</name>
<name>
<surname>Eglinton</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Marlowe</surname>
<given-names>I. T.</given-names>
</name>
<name>
<surname>Pflaumann</surname>
<given-names>U.</given-names>
</name>
<name>
<surname>Sarnthein</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>1986</year>). <article-title>Molecular stratigraphy: A new tool for climatic assessment</article-title>. <source>Nature</source> <volume>320</volume>, <fpage>129</fpage>&#x2013;<lpage>133</lpage>. <pub-id pub-id-type="doi">10.1038/320129a0</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/320129a0">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Molecular+stratigraphy:+A+new+tool+for+climatic+assessment&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brunet</surname>
<given-names>J.-P.</given-names>
</name>
<name>
<surname>Tamayo</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Golub</surname>
<given-names>T. R.</given-names>
</name>
<name>
<surname>Mesirov</surname>
<given-names>J. P.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Metagenes and molecular pattern discovery using matrix factorization</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>101</volume>, <fpage>4164</fpage>&#x2013;<lpage>4169</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.0308531101</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/15016911/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1073/pnas.0308531101">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Metagenes+and+molecular+pattern+discovery+using+matrix+factorization&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bull</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kemp</surname>
<given-names>A. E. S.</given-names>
</name>
<name>
<surname>Weedon</surname>
<given-names>G. P.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>A 160-k.y.-old record of el ni&#xf1;o&#x2013;southern oscillation in marine production and coastal runoff from Santa Barbara Basin, California, USA</article-title>. <source>Geol.</source> <volume>28</volume>, <fpage>1007</fpage>. <pub-id pub-id-type="doi">10.1130/0091-7613(2000)28&#x3c;1007:akroen&#x3e;2.0.co;2</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1130/0091-7613(2000)28&#x3c;1007:akroen&#x3e;2.0.co;2">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=A+160-k.y.-old+record+of+el+ni&#xf1;o&#x2013;southern+oscillation+in+marine+production+and+coastal+runoff+from+Santa+Barbara+Basin,+California,+USA&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Butterfield</surname>
<given-names>N. J.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Plankton ecology and the Proterozoic-phanerozoic transition</article-title>. <source>Paleobiology</source> <volume>23</volume>, <fpage>247</fpage>&#x2013;<lpage>262</lpage>. <pub-id pub-id-type="doi">10.1017/S009483730001681X</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1017/S009483730001681X">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Plankton+ecology+and+the+Proterozoic-phanerozoic+transition&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B12">
<citation citation-type="web">
<collab>CalCOFI</collab> (<year>2018</year>). <article-title>California cooperative oceanic Fisheries investigation (CalCOFI), march 2018</article-title>. <comment>AvaliableAt: <ext-link ext-link-type="uri" xlink:href="http://www.calcofi.org">http://www.calcofi.org</ext-link>
</comment>. <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=California+cooperative+oceanic+Fisheries+investigation+(CalCOFI),+march+2018&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Comon</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Independent component analysis, a new concept?</article-title> <source>Signal Process.</source> <volume>36</volume>, <fpage>287</fpage>&#x2013;<lpage>314</lpage>. <pub-id pub-id-type="doi">10.1016/0165-1684(94)90029-9</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/0165-1684(94)90029-9">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Independent+component+analysis,+a+new+concept?&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Conte</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Thompson</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Lesley</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Harris</surname>
<given-names>R. P.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Genetic and physiological influences on the alkenone/alkenoate versus growth temperature relationship in Emiliania Huxleyi and Gephyrocapsa Oceanica</article-title>. <source>Geochim. Cosmochim. Acta</source> <volume>62</volume>, <fpage>51</fpage>&#x2013;<lpage>68</lpage>. <pub-id pub-id-type="doi">10.1016/S0016-7037(97)00327-X</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0016-7037(97)00327-X">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Genetic+and+physiological+influences+on+the+alkenone/alkenoate+versus+growth+temperature+relationship+in+Emiliania+Huxleyi+and+Gephyrocapsa+Oceanica&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Damst&#xe9;</surname>
<given-names>J. S. S.</given-names>
</name>
<name>
<surname>Kohnen</surname>
<given-names>M. E. L.</given-names>
</name>
<name>
<surname>de Leeuw</surname>
<given-names>J. W.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Thiophenic biomarkers for palaeoenvironmental assessment and molecular stratigraphy</article-title>. <source>Nature</source> <volume>345</volume>, <fpage>609</fpage>&#x2013;<lpage>611</lpage>. <pub-id pub-id-type="doi">10.1038/345609a0</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/345609a0">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Thiophenic+biomarkers+for+palaeoenvironmental+assessment+and+molecular+stratigraphy&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Deininger</surname>
<given-names>S.-O.</given-names>
</name>
<name>
<surname>Cornett</surname>
<given-names>D. S.</given-names>
</name>
<name>
<surname>Paape</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Becker</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Pineau</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Rauser</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>Normalization in MALDI-TOF imaging datasets of proteins: Practical considerations</article-title>. <source>Anal. Bioanal. Chem.</source> <volume>401</volume>, <fpage>167</fpage>&#x2013;<lpage>181</lpage>. <pub-id pub-id-type="doi">10.1007/s00216-011-4929-z</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/21479971/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00216-011-4929-z">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Normalization+in+MALDI-TOF+imaging+datasets+of+proteins:+Practical+considerations&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eriksson</surname>
<given-names>J. O.</given-names>
</name>
<name>
<surname>Rezeli</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Hefner</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Marko-Varga</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Horvatovich</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Clusterwise peak detection and filtering based on spatial distribution to efficiently mine mass spectrometry imaging data</article-title>. <source>Anal. Chem.</source> <volume>91</volume>, <fpage>11888</fpage>&#x2013;<lpage>11896</lpage>. <pub-id pub-id-type="doi">10.1021/acs.analchem.9b02637</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/31403280/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1021/acs.analchem.9b02637">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Clusterwise+peak+detection+and+filtering+based+on+spatial+distribution+to+efficiently+mine+mass+spectrometry+imaging+data&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Escobedo</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Lozada</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Hern&#xe1;ndez-Ortega</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Villarreal</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Gnecco</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Enriquez</surname>
<given-names>R.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>108. <sup>1</sup>H and <sup>13</sup>C NMR characterization of new cycloartane triterpenes from Mangifera indica</article-title>. <source>Magn. Reson. Chem.</source> <volume>50</volume>, <fpage>52</fpage>&#x2013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1002/mrc.2836</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/22271257/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/mrc.2836">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=108.+1H+and+13C+NMR+characterization+of+new+cycloartane+triterpenes+from+Mangifera+indica&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eyssen</surname>
<given-names>H. J.</given-names>
</name>
<name>
<surname>Parmentier</surname>
<given-names>G. G.</given-names>
</name>
<name>
<surname>Compernolle</surname>
<given-names>F. C.</given-names>
</name>
<name>
<surname>de Pauw</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Piessens-Denef</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>1973</year>). <article-title>Biohydrogenation of sterols by eubacterium ATCC 21, 408&#x2014;nova species</article-title>. <source>Eur. J. Biochem.</source> <volume>36</volume>, <fpage>411</fpage>&#x2013;<lpage>421</lpage>. <pub-id pub-id-type="doi">10.1111/j.1432-1033.1973.tb02926.x</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/4730962/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1111/j.1432-1033.1973.tb02926.x">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Biohydrogenation+of+sterols+by+eubacterium+ATCC+21,+408&#x2014;nova+species&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fonville</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Carter</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Cloarec</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Nicholson</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Lindon</surname>
<given-names>J. C.</given-names>
</name>
<name>
<surname>Bunch</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>Robust data processing and normalization strategy for MALDI mass spectrometric imaging</article-title>. <source>Anal. Chem.</source> <volume>84</volume>, <fpage>1310</fpage>&#x2013;<lpage>1319</lpage>. <pub-id pub-id-type="doi">10.1021/ac201767g</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/22148759/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1021/ac201767g">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Robust+data+processing+and+normalization+strategy+for+MALDI+mass+spectrometric+imaging&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B78">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gut</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Boiret</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bultel</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Renaud</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Chetouani</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Hafiane</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Application of chemometric algorithms to MALDI mass spectrometry imaging of pharmaceutical tablets</article-title>. <source>J. Pharm. Biomed.</source> <volume>105</volume>, <fpage>91</fpage>&#x2013;<lpage>100</lpage>. <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/25543287/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.jpba.2014.11.047">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Application+of+chemometric+algorithms+to+MALDI+mass+spectrometry+imaging+of+pharmaceutical+tablets&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gunst</surname>
<given-names>R. F.</given-names>
</name>
<name>
<surname>Webster</surname>
<given-names>J. T.</given-names>
</name>
</person-group> (<year>1975</year>). <article-title>Regression analysis and problems of multicollinearity</article-title>. <source>Commun. Stat. Simul. Comput.</source> <volume>4</volume>, <fpage>277</fpage>&#x2013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.1080/03610927308827246</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/03610927308827246">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Regression+analysis+and+problems+of+multicollinearity&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B22">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Hall-Beyer</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>GLCM texture: A tutorial v.3.0 march 2017</article-title>. <comment>AvaliableAt: <ext-link ext-link-type="uri" xlink:href="http://www.ucalgary.ca/UofC/nasdev/mhallbey/research.htm">http://www.ucalgary.ca/UofC/nasdev/mhallbey/research.htm</ext-link>
</comment>. <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=GLCM+texture:+A+tutorial+v.3.0+march+2017&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haralick</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Shanmugam</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Dinstein</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>1973</year>). <article-title>Textural features for image classification</article-title>. <source>IEEE Trans. Syst. Man. Cybern.</source> <volume>3</volume>, <fpage>610</fpage>&#x2013;<lpage>621</lpage>. <pub-id pub-id-type="doi">10.1109/TSMC.1973.4309314</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/TSMC.1973.4309314">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Textural+features+for+image+classification&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B80">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Haug</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Cochrane</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Nainala</surname>
<given-names>V. C.</given-names>
</name>
<name>
<surname>Williams</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jayaseelan</surname>
<given-names>K. V.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>MetaboLights: A resource evolving in response to the needs of its scientific community</article-title>. <source>Nucleic Acids Res.</source> <volume>48</volume> (<issue>D1</issue>), <fpage>D440</fpage>&#x2013;<lpage>D444</lpage>. <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/31691833/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/nar/gkz1019">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=MetaboLights:+A+resource+evolving+in+response+to+the+needs+of+its+scientific+community&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hayes</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Freeman</surname>
<given-names>K. H.</given-names>
</name>
<name>
<surname>Popp</surname>
<given-names>B. N.</given-names>
</name>
<name>
<surname>Hoham</surname>
<given-names>C. H.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Compound-specific isotopic analyses: A novel tool for reconstruction of ancient biogeochemical processes</article-title>. <source>Org. Geochem.</source> <volume>16</volume>, <fpage>1115</fpage>&#x2013;<lpage>1128</lpage>. <pub-id pub-id-type="doi">10.1016/0146-6380(90)90147-R</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/11540919/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/0146-6380(90)90147-R">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Compound-specific+isotopic+analyses:+A+novel+tool+for+reconstruction+of+ancient+biogeochemical+processes&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Head</surname>
<given-names>E. J. H.</given-names>
</name>
<name>
<surname>Hargrave</surname>
<given-names>B. T.</given-names>
</name>
<name>
<surname>Subba Rao</surname>
<given-names>D. V.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Accumulation of a pheophorbide a-like pigment in sediment traps during late stages of a spring bloom: A product of dying algae?</article-title> <source>Limnol. Oceanogr.</source> <volume>39</volume>, <fpage>176</fpage>&#x2013;<lpage>181</lpage>. <pub-id pub-id-type="doi">10.4319/lo.1994.39.1.0176</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.4319/lo.1994.39.1.0176">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Accumulation+of+a+pheophorbide+a-like+pigment+in+sediment+traps+during+late+stages+of+a+spring+bloom:+A+product+of+dying+algae?&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hinrichs</surname>
<given-names>K.-U.</given-names>
</name>
<name>
<surname>Rullk&#xf6;tter</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Stein</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Preliminary assessment of organic geochemical signals in sediments from hole 893A, Santa Barbara Basin, offshore California</article-title>. <source>Proc. Odp. Sci. Results</source> <volume>146</volume>, <fpage>201</fpage>&#x2013;<lpage>211</lpage>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.2973/odp.proc.sr.146-2.307.1995">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Preliminary+assessment+of+organic+geochemical+signals+in+sediments+from+hole+893A,+Santa+Barbara+Basin,+offshore+California&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hinrichs</surname>
<given-names>K.-U.</given-names>
</name>
<name>
<surname>Schneider</surname>
<given-names>R. R.</given-names>
</name>
<name>
<surname>M&#xfc;ller</surname>
<given-names>P. J.</given-names>
</name>
<name>
<surname>Rullk&#xf6;tter</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>A biomarker perspective on paleoproductivity variations in two Late Quaternary sediment sections from the Southeast Atlantic Ocean</article-title>. <source>Org. Geochem.</source> <volume>30</volume>, <fpage>341</fpage>&#x2013;<lpage>366</lpage>. <pub-id pub-id-type="doi">10.1016/S0146-6380(99)00007-8</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0146-6380(99)00007-8">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=A+biomarker+perspective+on+paleoproductivity+variations+in+two+Late+Quaternary+sediment+sections+from+the+Southeast+Atlantic+Ocean&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Street-Perrott</surname>
<given-names>F. A.</given-names>
</name>
<name>
<surname>Perrott</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Metzger</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Eglinton</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Glacial&#x2013;interglacial environmental changes inferred from molecular and compound-specific &#x3b4;<sup>13</sup>C analyses of sediments from Sacred Lake, Mt. Kenya</article-title>. <source>Geochim. Cosmochim. Acta</source> <volume>63</volume>, <fpage>1383</fpage>&#x2013;<lpage>1404</lpage>. <pub-id pub-id-type="doi">10.1016/S0016-7037(99)00074-5</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0016-7037(99)00074-5">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Glacial&#x2013;interglacial+environmental+changes+inferred+from+molecular+and+compound-specific+&#x3b4;13C+analyses+of+sediments+from+Sacred+Lake,+Mt.+Kenya&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>H&#xfc;lsemann</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Emery</surname>
<given-names>K. O.</given-names>
</name>
</person-group> (<year>1961</year>). <article-title>Stratification in recent sediments of Santa Barbara Basin as controlled by organisms and water character</article-title>. <source>J. Geol.</source> <volume>69</volume>, <fpage>279</fpage>&#x2013;<lpage>290</lpage>. <pub-id pub-id-type="doi">10.1086/626742</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1086/626742">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Stratification+in+recent+sediments+of+Santa+Barbara+Basin+as+controlled+by+organisms+and+water+character&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B30">
<citation citation-type="book">
<person-group person-group-type="editor">
<name>
<surname>I. T.</surname>
<given-names>Jolliffe</given-names>
</name>
</person-group> (Editor) (<year>2002</year>). &#x201c;<article-title>Principal component analysis for special types of data</article-title>,&#x201d; <source>
<italic>Principal component analysis</italic> springer series in statistics</source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>338</fpage>&#x2013;<lpage>372</lpage>. <pub-id pub-id-type="doi">10.1007/0-387-22440-8_13</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/0-387-22440-8_13">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Principal+component+analysis+for+special+types+of+data&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jutten</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Herault</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>1991</year>). <article-title>Blind separation of sources, part I: An adaptive algorithm based on neuromimetic architecture</article-title>. <source>Signal Process.</source> <volume>24</volume>, <fpage>1</fpage>&#x2013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1016/0165-1684(91)90079-X</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/0165-1684(91)90079-X">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Blind+separation+of+sources,+part+I:+An+adaptive+algorithm+based+on+neuromimetic+architecture&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kennedy</surname>
<given-names>J. A.</given-names>
</name>
<name>
<surname>Brassell</surname>
<given-names>S. C.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>Molecular stratigraphy of the Santa Barbara Basin: Comparison with historical records of annual climate change</article-title>. <source>Org. Geochem.</source> <volume>19</volume>, <fpage>235</fpage>&#x2013;<lpage>244</lpage>. <pub-id pub-id-type="doi">10.1016/0146-6380(92)90040-5</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/0146-6380(92)90040-5">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Molecular+stratigraphy+of+the+Santa+Barbara+Basin:+Comparison+with+historical+records+of+annual+climate+change&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kennedy</surname>
<given-names>M. L.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Phytochemical profile of the stems of Aeonium lindleyi</article-title>. <source>Rev. Bras. Farmacogn.</source> <volume>22</volume>, <fpage>676</fpage>&#x2013;<lpage>679</lpage>. <pub-id pub-id-type="doi">10.1590/S0102-695X2012005000037</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1590/S0102-695X2012005000037">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Phytochemical+profile+of+the+stems+of+Aeonium+lindleyi&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kim</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis</article-title>. <source>Bioinformatics</source> <volume>23</volume>, <fpage>1495</fpage>&#x2013;<lpage>1502</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btm134</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/17483501/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bioinformatics/btm134">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Sparse+non-negative+matrix+factorizations+via+alternating+non-negativity-constrained+least+squares+for+microarray+data+analysis&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kusch</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Rethemeyer</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Schefu&#xdf;</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Mollenhauer</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Controls on the age of vascular plant biomarkers in Black Sea sediments</article-title>. <source>Geochim. Cosmochim. Acta</source> <volume>74</volume>, <fpage>7031</fpage>&#x2013;<lpage>7047</lpage>. <pub-id pub-id-type="doi">10.1016/j.gca.2010.09.005</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.gca.2010.09.005">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Controls+on+the+age+of+vascular+plant+biomarkers+in+Black+Sea+sediments&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lange</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Weinheimer</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Reid</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Thunell</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>Sedimentation patterns of diatoms, radiolarians, and silicoflagellates in Santa Barbara Basin, California</article-title>. <source>Calif. Coop. Ocean. Fish. Investig. Rep.</source> <volume>38</volume>, <fpage>161</fpage>&#x2013;<lpage>170</lpage>. <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Sedimentation+patterns+of+diatoms,+radiolarians,+and+silicoflagellates+in+Santa+Barbara+Basin,+California&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marlowe</surname>
<given-names>I. T.</given-names>
</name>
<name>
<surname>Brassell</surname>
<given-names>S. C.</given-names>
</name>
<name>
<surname>Eglinton</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Green</surname>
<given-names>J. C.</given-names>
</name>
</person-group> (<year>1984a</year>). <article-title>Long chain unsaturated ketones and esters in living algae and marine sediments</article-title>. <source>Org. Geochem.</source> <volume>6</volume>, <fpage>135</fpage>&#x2013;<lpage>141</lpage>. <pub-id pub-id-type="doi">10.1016/0146-6380(84)90034-2</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/0146-6380(84)90034-2">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Long+chain+unsaturated+ketones+and+esters+in+living+algae+and+marine+sediments&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marlowe</surname>
<given-names>I. T.</given-names>
</name>
<name>
<surname>Green</surname>
<given-names>J. C.</given-names>
</name>
<name>
<surname>Neal</surname>
<given-names>A. C.</given-names>
</name>
<name>
<surname>Brassell</surname>
<given-names>S. C.</given-names>
</name>
<name>
<surname>Eglinton</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Course</surname>
<given-names>P. A.</given-names>
</name>
<etal/>
</person-group> (<year>1984b</year>). <article-title>Long chain (<italic>n</italic>C<sub>37</sub>&#x2013;C<sub>39</sub>) alkenones in the Prymnesiophyceae. Distribution of alkenones and other lipids and their taxonomic significance</article-title>. <source>Br. Phycol. J.</source> <volume>19</volume>, <fpage>203</fpage>&#x2013;<lpage>216</lpage>. <pub-id pub-id-type="doi">10.1080/00071618400650221</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1080/00071618400650221">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Long+chain+(nC37&#x2013;C39)+alkenones+in+the+Prymnesiophyceae.+Distribution+of+alkenones+and+other+lipids+and+their+taxonomic+significance&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mittal</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Condina</surname>
<given-names>M. R.</given-names>
</name>
<name>
<surname>Klingler-Hoffmann</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kaur</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Oehler</surname>
<given-names>M. K.</given-names>
</name>
<name>
<surname>Sieber</surname>
<given-names>O. M.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Cancer tissue classification using supervised machine learning applied to MALDI mass spectrometry imaging</article-title>. <source>Cancers</source> <volume>13</volume>, <fpage>5388</fpage>. <pub-id pub-id-type="doi">10.3390/cancers13215388</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/34771551/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3390/cancers13215388">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Cancer+tissue+classification+using+supervised+machine+learning+applied+to+MALDI+mass+spectrometry+imaging&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B79">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nijs</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Smets</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Waelkens</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>De Moor</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A mathematical comparison of non-negative matrix factorization related methods with practical implications for the analysis of mass spectrometry imaging data</article-title>. <source>Rapid Commun. Mass Spectrom.</source> <volume>35</volume> (<issue>21</issue>), <fpage>e9181</fpage>. <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/34374141/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/rcm.9181">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=A+mathematical+comparison+of+non-negative+matrix+factorization+related+methods+with+practical+implications+for+the+analysis+of+mass+spectrometry+imaging+data&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nishimura</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Koyama</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>1977</year>). <article-title>The occurrence of stanols in various living organisms and the behavior of sterols in contemporary sediments</article-title>. <source>Geochim. Cosmochim. Acta</source> <volume>41</volume>, <fpage>379</fpage>&#x2013;<lpage>385</lpage>. <pub-id pub-id-type="doi">10.1016/0016-7037(77)90265-4</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/0016-7037(77)90265-4">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=The+occurrence+of+stanols+in+various+living+organisms+and+the+behavior+of+sterols+in+contemporary+sediments&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Odland</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>tommyod/KDEpy: Kernel density estimation in Python</article-title>. <source>Zenodo</source>. <pub-id pub-id-type="doi">10.5281/zenodo.2392268</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.2392268">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=tommyod/KDEpy:+Kernel+density+estimation+in+Python&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ovchinnikova</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Stuart</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Rakhlin</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Nikolenko</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Alexandrov</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>ColocML: Machine learning quantifies co-localization between mass spectrometry images</article-title>. <source>Bioinformatics</source> <volume>36</volume>, <fpage>3215</fpage>&#x2013;<lpage>3224</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btaa085</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/32049317/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bioinformatics/btaa085">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=ColocML:+Machine+learning+quantifies+co-localization+between+mass+spectrometry+images&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pancost</surname>
<given-names>R. D.</given-names>
</name>
<name>
<surname>Baas</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>van Geel</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Sinninghe Damst&#xe9;</surname>
<given-names>J. S.</given-names>
</name>
</person-group> (<year>2002</year>). <article-title>Biomarkers as proxies for plant inputs to peats: An example from a sub-boreal ombrotrophic bog</article-title>. <source>Org. Geochem.</source> <volume>33</volume>, <fpage>675</fpage>&#x2013;<lpage>690</lpage>. <pub-id pub-id-type="doi">10.1016/S0146-6380(02)00048-7</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0146-6380(02)00048-7">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Biomarkers+as+proxies+for+plant+inputs+to+peats:+An+example+from+a+sub-boreal+ombrotrophic+bog&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pearson</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>McNichol</surname>
<given-names>A. P.</given-names>
</name>
<name>
<surname>Benitez-Nelson</surname>
<given-names>B. C.</given-names>
</name>
<name>
<surname>Hayes</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Eglinton</surname>
<given-names>T. I.</given-names>
</name>
</person-group> (<year>2001</year>). <article-title>Origins of lipid biomarkers in Santa monica basin surface sediment: A case study using compound-specific &#x394;<sup>14</sup>C analysis</article-title>. <source>Geochim. Cosmochim. Acta</source> <volume>65</volume>, <fpage>3123</fpage>&#x2013;<lpage>3137</lpage>. <pub-id pub-id-type="doi">10.1016/S0016-7037(01)00657-3</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0016-7037(01)00657-3">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Origins+of+lipid+biomarkers+in+Santa+monica+basin+surface+sediment:+A+case+study+using+compound-specific+&#x394;14C+analysis&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pedregosa</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Varoquaux</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Gramfort</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Michel</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Thirion</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Grisel</surname>
<given-names>O.</given-names>
</name>
<etal/>
</person-group> (<year>2011</year>). <article-title>Scikit-learn: Machine learning in Python</article-title>. <source>J. Mach. Learn. Res.</source> <volume>12</volume>, <fpage>2825</fpage>&#x2013;<lpage>2830</lpage>. <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Scikit-learn:+Machine+learning+in+Python&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B46">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Peters</surname>
<given-names>K. E.</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>K. E.</given-names>
</name>
<name>
<surname>Walters</surname>
<given-names>C. C.</given-names>
</name>
<name>
<surname>Moldowan</surname>
<given-names>J. M.</given-names>
</name>
</person-group> (<year>2005</year>). <source>The biomarker guide</source>. <publisher-loc>Cambridge, United Kingdom</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>. <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=The+biomarker+guide&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Prahl</surname>
<given-names>F. G.</given-names>
</name>
<name>
<surname>Wakeham</surname>
<given-names>S. G.</given-names>
</name>
</person-group> (<year>1987</year>). <article-title>Calibration of unsaturation patterns in long-chain ketone compositions for palaeotemperature assessment</article-title>. <source>Nature</source> <volume>330</volume>, <fpage>367</fpage>&#x2013;<lpage>369</lpage>. <pub-id pub-id-type="doi">10.1038/330367a0</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/330367a0">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Calibration+of+unsaturation+patterns+in+long-chain+ketone+compositions+for+palaeotemperature+assessment&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B48">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Quanico</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Franck</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wisztorski</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Salzet</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Fournier</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Progress and potential of imaging mass spectrometry applied to biomarker discovery</article-title>,&#x201d; in <source>
<italic>Neuroproteomics: Methods and protocols</italic> methods in molecular biology</source>. Editors <person-group person-group-type="editor">
<name>
<surname>Kobeissy</surname>
<given-names>F. H.</given-names>
</name>
<name>
<surname>Stevens Stanley</surname>
<given-names>M.</given-names>
</name>
</person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>21</fpage>&#x2013;<lpage>43</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-4939-6952-4_2</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/978-1-4939-6952-4_2">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Progress+and+potential+of+imaging+mass+spectrometry+applied+to+biomarker+discovery&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ratnayake</surname>
<given-names>N. P.</given-names>
</name>
<name>
<surname>Suzuki</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Matsubara</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Sources of long chain fatty acids in deep sea sediments from the Bering Sea and the North Pacific Ocean</article-title>. <source>Org. Geochem.</source> <volume>36</volume>, <fpage>531</fpage>&#x2013;<lpage>541</lpage>. <pub-id pub-id-type="doi">10.1016/j.orggeochem.2004.11.004</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.orggeochem.2004.11.004">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Sources+of+long+chain+fatty+acids+in+deep+sea+sediments+from+the+Bering+Sea+and+the+North+Pacific+Ocean&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reimers</surname>
<given-names>C. E.</given-names>
</name>
<name>
<surname>Lange</surname>
<given-names>C. B.</given-names>
</name>
<name>
<surname>Tabak</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bernhard</surname>
<given-names>J. M.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Seasonal spillover and varve formation in the Santa Barbara Basin, California</article-title>. <source>Limnol. Oceanogr.</source> <volume>35</volume>, <fpage>1577</fpage>&#x2013;<lpage>1585</lpage>. <pub-id pub-id-type="doi">10.4319/lo.1990.35.7.1577</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.4319/lo.1990.35.7.1577">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Seasonal+spillover+and+varve+formation+in+the+Santa+Barbara+Basin,+California&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rosenfeld</surname>
<given-names>R. S.</given-names>
</name>
<name>
<surname>Hellman</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>1971</year>). <article-title>Reduction and esterification of cholesterol and sitosterol by homogenates of feces</article-title>. <source>J. Lipid Res.</source> <volume>12</volume>, <fpage>192</fpage>&#x2013;<lpage>197</lpage>. <pub-id pub-id-type="doi">10.1016/S0022-2275(20)39529-8</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/5554107/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0022-2275(20)39529-8">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Reduction+and+esterification+of+cholesterol+and+sitosterol+by+homogenates+of+feces&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schimmelmann</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Lange</surname>
<given-names>C. B.</given-names>
</name>
</person-group> (<year>1996</year>). <article-title>Tales of 1001 varves: A review of Santa Barbara Basin sediment studies</article-title>. <source>Geol. Soc. Lond. Spec. Publ.</source> <volume>116</volume>, <fpage>121</fpage>&#x2013;<lpage>141</lpage>. <pub-id pub-id-type="doi">10.1144/GSL.SP.1996.116.01.12</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1144/GSL.SP.1996.116.01.12">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Tales+of+1001+varves:+A+review+of+Santa+Barbara+Basin+sediment+studies&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B77">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Siy</surname>
<given-names>P. W.</given-names>
</name>
<name>
<surname>Moffitt</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Parry</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Sullards</surname>
<given-names>M. C.</given-names>
</name>
<etal/>
</person-group> (<year>2008</year>). &#x201c;<article-title>Matrix factorization techniques for analysis of imaging mass spectrometry data</article-title>,&#x201d; in <conf-name>8th IEEE International Conference on BioInformatics and BioEngineering</conf-name>, <fpage>1</fpage>&#x2013;<lpage>6</lpage>. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1109/bibe.2008.4696797">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Matrix+factorization+techniques+for+analysis+of+imaging+mass+spectrometry+data&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B53">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Smayda</surname>
<given-names>T. J.</given-names>
</name>
<name>
<surname>Trainer</surname>
<given-names>V. L.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Dinoflagellate blooms in upwelling systems: Seeding, variability, and contrasts with diatom bloom behaviour</article-title>. <source>Prog. Oceanogr.</source> <volume>85</volume>, <fpage>92</fpage>&#x2013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.1016/j.pocean.2010.02.006</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.pocean.2010.02.006">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Dinoflagellate+blooms+in+upwelling+systems:+Seeding,+variability,+and+contrasts+with+diatom+bloom+behaviour&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B55">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Soutar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Crill</surname>
<given-names>P. A.</given-names>
</name>
</person-group> (<year>1977</year>). <article-title>Sedimentation and climatic patterns in the Santa Barbara Basin during the 19th and 20th centuries</article-title>. <source>GSA Bull.</source> <volume>88</volume>, <fpage>1161</fpage>&#x2013;<lpage>1172</lpage>. <pub-id pub-id-type="doi">10.1130/0016-7606(1977)88&#x3c;1161:SACPIT&#x3e;2.0.CO;2</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1130/0016-7606(1977)88&#x3c;1161:SACPIT&#x3e;2.0.CO;2">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Sedimentation+and+climatic+patterns+in+the+Santa+Barbara+Basin+during+the+19th+and+20th+centuries&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Summons</surname>
<given-names>R. E.</given-names>
</name>
<name>
<surname>Welander</surname>
<given-names>P. V.</given-names>
</name>
<name>
<surname>Gold</surname>
<given-names>D. A.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Lipid biomarkers: Molecular tools for illuminating the history of microbial life</article-title>. <source>Nat. Rev. Microbiol.</source> <volume>20</volume>, <fpage>174</fpage>&#x2013;<lpage>185</lpage>. <pub-id pub-id-type="doi">10.1038/s41579-021-00636-2</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/34635851/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/s41579-021-00636-2">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Lipid+biomarkers:+Molecular+tools+for+illuminating+the+history+of+microbial+life&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B57">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Szymczak-&#x17b;y&#x142;a</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kowalewska</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Louda</surname>
<given-names>J. W.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Chlorophyll-<italic>a</italic> and derivatives in recent sediments as indicators of productivity and depositional conditions</article-title>. <source>Mar. Chem.</source> <volume>125</volume>, <fpage>39</fpage>&#x2013;<lpage>48</lpage>. <pub-id pub-id-type="doi">10.1016/j.marchem.2011.02.002</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.marchem.2011.02.002">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Chlorophyll-a+and+derivatives+in+recent+sediments+as+indicators+of+productivity+and+depositional+conditions&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B58">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thiele</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Heldmann</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Trede</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Strehlow</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wirtz</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Dreher</surname>
<given-names>W.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>2D and 3D MALDI-imaging: Conceptual strategies for visualization and data mining</article-title>. <source>Biochimica Biophysica Acta - Proteins Proteomics</source> <volume>1844</volume>, <fpage>117</fpage>&#x2013;<lpage>137</lpage>. <pub-id pub-id-type="doi">10.1016/j.bbapap.2013.01.040</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/23467008/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.bbapap.2013.01.040">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=2D+and+3D+MALDI-imaging:+Conceptual+strategies+for+visualization+and+data+mining&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thunell</surname>
<given-names>R. C.</given-names>
</name>
<name>
<surname>Tappa</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Anderson</surname>
<given-names>D. M.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Sediment fluxes and varve formation in Santa Barbara Basin, offshore California</article-title>. <source>Geol.</source> <volume>23</volume>, <fpage>1083</fpage>. <pub-id pub-id-type="doi">10.1130/0091-7613(1995)023&#x3c;1083:SFAVFI&#x3e;2.3.CO;2</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1130/0091-7613(1995)023&#x3c;1083:SFAVFI&#x3e;2.3.CO;2">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Sediment+fluxes+and+varve+formation+in+Santa+Barbara+Basin,+offshore+California&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Trede</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kobarg</surname>
<given-names>J. H.</given-names>
</name>
<name>
<surname>Oetjen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Thiele</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Maass</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Alexandrov</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2012</year>). <article-title>On the importance of mathematical methods for analysis of MALDI-imaging mass spectrometry data</article-title>. <source>J. Integr. Bioinform.</source> <volume>9</volume>, <fpage>1</fpage>&#x2013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1515/jib-2012-189</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/22433312/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1515/jib-2012-189">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=On+the+importance+of+mathematical+methods+for+analysis+of+MALDI-imaging+mass+spectrometry+data&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>van der Walt</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Sch&#xf6;nberger</surname>
<given-names>J. L.</given-names>
</name>
<name>
<surname>Nunez-Iglesias</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Boulogne</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Warner</surname>
<given-names>J. D.</given-names>
</name>
<name>
<surname>Yager</surname>
<given-names>N.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Scikit-image: Image processing in Python</article-title>. <source>PeerJ</source> <volume>2</volume>, <fpage>e453</fpage>. <pub-id pub-id-type="doi">10.7717/peerj.453</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/25024921/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.7717/peerj.453">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Scikit-image:+Image+processing+in+Python&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Verbeeck</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Caprioli</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Van de Plas</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Unsupervised machine learning for exploratory data analysis in imaging mass spectrometry</article-title>. <source>Mass Spectrom. Rev.</source> <volume>39</volume>, <fpage>245</fpage>&#x2013;<lpage>291</lpage>. <pub-id pub-id-type="doi">10.1002/mas.21602</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/31602691/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1002/mas.21602">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Unsupervised+machine+learning+for+exploratory+data+analysis+in+imaging+mass+spectrometry&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Virtanen</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Gommers</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Oliphant</surname>
<given-names>T. E.</given-names>
</name>
<name>
<surname>Haberland</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Reddy</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Cournapeau</surname>
<given-names>D.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>SciPy 1.0: Fundamental algorithms for scientific computing in Python</article-title>. <source>Nat. Methods</source> <volume>17</volume>, <fpage>261</fpage>&#x2013;<lpage>272</lpage>. <pub-id pub-id-type="doi">10.1038/s41592-019-0686-2</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/32015543/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/s41592-019-0686-2">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=SciPy+1.0:+Fundamental+algorithms+for+scientific+computing+in+Python&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Volkman</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Barrett</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Blackburn</surname>
<given-names>S. I.</given-names>
</name>
<name>
<surname>Mansour</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Sikes</surname>
<given-names>E. L.</given-names>
</name>
<name>
<surname>Gelin</surname>
<given-names>F.</given-names>
</name>
<etal/>
</person-group> (<year>1998</year>). <article-title>Microalgal biomarkers: A review of recent research developments</article-title>. <source>Org. Geochem.</source> <volume>29</volume>, <fpage>1163</fpage>&#x2013;<lpage>1179</lpage>. <pub-id pub-id-type="doi">10.1016/S0146-6380(98)00062-X</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0146-6380(98)00062-X">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Microalgal+biomarkers:+A+review+of+recent+research+developments&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Volkman</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Barrett</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Dunstan</surname>
<given-names>G. A.</given-names>
</name>
<name>
<surname>Jeffrey</surname>
<given-names>S. W.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Geochemical significance of the occurrence of dinosterol and other 4-methyl sterols in a marine diatom</article-title>. <source>Org. Geochem.</source> <volume>20</volume>, <fpage>7</fpage>&#x2013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1016/0146-6380(93)90076-N</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/0146-6380(93)90076-N">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Geochemical+significance+of+the+occurrence+of+dinosterol+and+other+4-methyl+sterols+in+a+marine+diatom&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Volkman</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Eglinton</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Corner</surname>
<given-names>E. D. S.</given-names>
</name>
<name>
<surname>Forsberg</surname>
<given-names>T. E. V.</given-names>
</name>
</person-group> (<year>1980a</year>). <article-title>Long-chain alkenes and alkenones in the marine coccolithophorid Emiliania huxleyi</article-title>. <source>Phytochemistry</source> <volume>19</volume>, <fpage>2619</fpage>&#x2013;<lpage>2622</lpage>. <pub-id pub-id-type="doi">10.1016/S0031-9422(00)83930-8</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0031-9422(00)83930-8">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Long-chain+alkenes+and+alkenones+in+the+marine+coccolithophorid+Emiliania+huxleyi&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Volkman</surname>
<given-names>J. K.</given-names>
</name>
<name>
<surname>Eglinton</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Corner</surname>
<given-names>E. D. S.</given-names>
</name>
<name>
<surname>Sargent</surname>
<given-names>J. R.</given-names>
</name>
</person-group> (<year>1980b</year>). <article-title>Novel unsaturated straight-chain C<sub>37</sub>&#x2013;C<sub>39</sub> methyl and ethyl ketones in marine sediments and a coccolithophore Emiliania huxleyi</article-title>. <source>Phys. Chem. Earth.</source> <volume>12</volume>, <fpage>219</fpage>&#x2013;<lpage>227</lpage>. <pub-id pub-id-type="doi">10.1016/0079-1946(79)90106-X</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/0079-1946(79)90106-X">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Novel+unsaturated+straight-chain+C37&#x2013;C39+methyl+and+ethyl+ketones+in+marine+sediments+and+a+coccolithophore+Emiliania+huxleyi&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B68">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Volkman</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Sterols in microorganisms</article-title>. <source>Appl. Microbiol. Biotechnol.</source> <volume>60</volume>, <fpage>495</fpage>&#x2013;<lpage>506</lpage>. <pub-id pub-id-type="doi">10.1007/s00253-002-1172-8</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/12536248/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1007/s00253-002-1172-8">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Sterols+in+microorganisms&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wakeham</surname>
<given-names>S. G.</given-names>
</name>
</person-group> (<year>1989</year>). <article-title>Reduction of stenols to stanols in particulate matter at oxic&#x2013;anoxic boundaries in sea water</article-title>. <source>Nature</source> <volume>342</volume>, <fpage>787</fpage>&#x2013;<lpage>790</lpage>. <pub-id pub-id-type="doi">10.1038/342787a0</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1038/342787a0">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Reduction+of+stenols+to+stanols+in+particulate+matter+at+oxic&#x2013;anoxic+boundaries+in+sea+water&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B70">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wannesm</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Yurtman</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Robberechts</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Vohl</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>wannesm/dtaidistance: v2.3.5</article-title>. <source>Zenodo</source>. <pub-id pub-id-type="doi">10.5281/ZENODO.5901139</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/ZENODO.5901139">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=wannesm/dtaidistance:+v2.3.5&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B71">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wijetunge</surname>
<given-names>C. D.</given-names>
</name>
<name>
<surname>Saeed</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Boughton</surname>
<given-names>B. A.</given-names>
</name>
<name>
<surname>Spraggins</surname>
<given-names>J. M.</given-names>
</name>
<name>
<surname>Caprioli</surname>
<given-names>R. M.</given-names>
</name>
<name>
<surname>Bacic</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2015</year>). <article-title>Exims: An improved data analysis pipeline based on a new peak picking method for EXploring Imaging Mass Spectrometry data</article-title>. <source>Bioinformatics</source> <volume>31</volume>, <fpage>3198</fpage>&#x2013;<lpage>3206</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btv356</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/26063840/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1093/bioinformatics/btv356">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Exims:+An+improved+data+analysis+pipeline+based+on+a+new+peak+picking+method+for+EXploring+Imaging+Mass+Spectrometry+data&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B72">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>W&#xf6;rmer</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Elvert</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Fuchser</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lipp</surname>
<given-names>J. S.</given-names>
</name>
<name>
<surname>Buttigieg</surname>
<given-names>P. L.</given-names>
</name>
<name>
<surname>Zabel</surname>
<given-names>M.</given-names>
</name>
<etal/>
</person-group> (<year>2014</year>). <article-title>Ultra-high-resolution paleoenvironmental records via direct laser-based analysis of lipid biomarkers in sediment core samples</article-title>. <source>Proc. Natl. Acad. Sci. U. S. A.</source> <volume>111</volume>, <fpage>15669</fpage>&#x2013;<lpage>15674</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1405237111</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/25331871/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1073/pnas.1405237111">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Ultra-high-resolution+paleoenvironmental+records+via+direct+laser-based+analysis+of+lipid+biomarkers+in+sediment+core+samples&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B73">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>W&#xf6;rmer</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Gajendra</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Schubotz</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Matys</surname>
<given-names>E. D.</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>T. W.</given-names>
</name>
<name>
<surname>Summons</surname>
<given-names>R. E.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>A micrometer-scale snapshot on phototroph spatial distributions: Mass spectrometry imaging of microbial mats in Octopus spring, yellowstone national Park</article-title>. <source>Geobiology</source> <volume>18</volume>, <fpage>742</fpage>&#x2013;<lpage>759</lpage>. <pub-id pub-id-type="doi">10.1111/gbi.12411</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/32936514/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1111/gbi.12411">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=A+micrometer-scale+snapshot+on+phototroph+spatial+distributions:+Mass+spectrometry+imaging+of+microbial+mats+in+Octopus+spring,+yellowstone+national+Park&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B74">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>W&#xf6;rmer</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wendt</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Alfken</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J.-X.</given-names>
</name>
<name>
<surname>Elvert</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Heuer</surname>
<given-names>V. B.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Towards multiproxy, ultra-high resolution molecular stratigraphy: Enabling laser-induced mass spectrometry imaging of diverse molecular biomarkers in sediments</article-title>. <source>Org. Geochem.</source> <volume>127</volume>, <fpage>136</fpage>&#x2013;<lpage>145</lpage>. <pub-id pub-id-type="doi">10.1016/j.orggeochem.2018.11.009</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/j.orggeochem.2018.11.009">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Towards+multiproxy,+ultra-high+resolution+molecular+stratigraphy:+Enabling+laser-induced+mass+spectrometry+imaging+of+diverse+molecular+biomarkers+in+sediments&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B75">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wen</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Washburn</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Florens</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Improving proteomics mass accuracy by dynamic offline lock mass</article-title>. <source>Anal. Chem.</source> <volume>83</volume>, <fpage>9344</fpage>&#x2013;<lpage>9351</lpage>. <pub-id pub-id-type="doi">10.1021/ac201867h</pub-id> <ext-link ext-link-type="uri" xlink:href="https://pubmed.ncbi.nlm.nih.gov/22044264/">PubMed Abstract</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1021/ac201867h">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=Improving+proteomics+mass+accuracy+by+dynamic+offline+lock+mass&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
<ref id="B76">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Eglinton</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Read</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Schimmelmann</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>An alkenone () quasi-annual sea surface temperature record (A.D. 1440 to 1940) using varved sediments from the Santa Barbara Basin</article-title>. <source>Org. Geochem.</source> <volume>31</volume>, <fpage>903</fpage>&#x2013;<lpage>917</lpage>. <pub-id pub-id-type="doi">10.1016/S0146-6380(00)00034-6</pub-id> <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.1016/S0146-6380(00)00034-6">CrossRef Full Text</ext-link> &#x7c; <ext-link ext-link-type="uri" xlink:href="https://scholar.google.com/scholar?hl=en&#x0026;as_sdt=0%2C5&#x0026;q=An+alkenone+quasi-annual+sea+surface+temperature+record+(A.D.+1440+to+1940)+using+varved+sediments+from+the+Santa+Barbara+Basin&#x0026;btnG=">Google Scholar</ext-link>
</citation>
</ref>
</ref-list>
</back>
</article>