Predicting Dissolved Lignin Phenol Concentrations in the Coastal Ocean from Chromophoric Dissolved Organic Matter (CDOM) Absorption Coefficients

Dissolved lignin is a well-established biomarker of terrigenous dissolved organic matter (DOM) in the ocean, and a chromophoric component of DOM. Although evidence suggests there is a strong linkage between lignin concentrations and chromophoric DOM (CDOM) absorption coefficients in coastal waters, the characteristics of this linkage and the existence of a relationship that is applicable across coastal oceans remain unclear. Here, 421 paired measurements of dissolved lignin concentrations (sum of 9 lignin phenols) and CDOM absorption coefficients (ag(λ)) were used to examine their relationship along the river-ocean continuum (0-37 salinity) and across contrasting coastal oceans (sub-tropical, temperate, high-latitude). Overall, lignin concentrations spanned four orders of magnitude and revealed a strong, non-linear relationship with ag(λ). The characteristics of the relationship (shape, wavelength dependency, lignin-composition dependency) and evidence from degradation indicators were all consistent with lignin being an important driver of CDOM variability in coastal oceans, and suggested physical mixing and long-term photodegradation were important in shaping the relationship. These observations were used to develop two simple empirical models for estimating lignin concentrations from ag(λ) with a +/- 20% error relative to measured values. The models are expected to be applicable in most coastal oceans influenced by terrigenous inputs.


INTRODUCTION
Lignin is a major biochemical component of vascular plant tissues, and a well-established biomarker of terrigenous organic matter in the ocean. Lignin is a complex aromatic heteropolymer that binds cellulose and hemicellulose microfibrils in the secondary cell wall of xylem tissues, and provides biomechanical support to stems in vascular plants. Lignin is exclusively produced on land by vascular plants, with the exception of an intertidal red seaweed Calliarthron cheilosporioides (Martone et al., 2009). During decomposition of plant matter by microorganisms in litter and soils, some lignin is mobilized in particulate or dissolved form to nearby streams and rivers (Aiken et al., 1985;Aiken and Cotsaris, 1995;Eriksson, 2010). Lignin eventually reaches the ocean, where its vascular-plant origin makes it an unambiguous biomarker of terrigenous organic matter (Hedges et al., 1997). As a result, lignin extracted from seawater and marine sediments has long been used to derive qualitative and quantitative information about the origins, transformations, and fates of terrigenous organic matter in the ocean (Opsahl and Benner, 1997;Hernes and Benner, 2006;Goni et al., 2008;Amon et al., 2012;Tesi et al., 2014;Medeiros et al., 2015).
Dissolved lignin molecules are also chromophores, and evidence suggests they contribute significantly to the optical properties of chromophoric dissolved organic matter (CDOM) in natural waters. Lignin is an aromatic macromolecule, and is therefore an efficient absorber of ultraviolet radiation in natural waters (Chin et al., 1994;Weishaar and Aiken, 2001;Schmidt, 2010). The complex macromolecular structure of lignin is also thought to promote a continuum of intramolecular chargetransfer interactions that extend the absorption properties of lignin well into the visible region of the solar spectrum (Furman and Lonsky, 1988a,b;Del Vecchio and Blough, 2004;Boyle et al., 2009). As a result, strong linear relationships between dissolved lignin concentrations and absorption and fluorescence properties of CDOM have been observed in various estuarine and coastal environments (Hernes and Benner, 2003;Spencer et al., 2008;Hernes et al., 2009;Walker et al., 2009;Osburn and Stedmon, 2011;Fichot and Benner, 2012;Fichot et al., 2013;Yamashita et al., 2015). The ubiquity of correlations between CDOM and dissolved lignin in river-influenced ocean margins suggests lignin could be an important driver of the variability of CDOM optical properties in such systems. This strong linkage also suggests CDOM optical properties could serve as practical optical proxies for the concentration of this terrigenous biomarker in the ocean.
The utility of CDOM as a proxy for lignin concentration hinges strongly on the existence of relationships that are applicable to most river-influenced environments, and on an understanding of the factors driving and impacting these relationships. Despite the various reports of linkages between CDOM and dissolved lignin, the general characteristics of the CDOM-lignin relationship and of its drivers along the riverocean continuums of the coastal ocean remains poorly known. Differences in the origin and alteration of CDOM and lignin, or the occurrence of non-lignin and non-covarying components of CDOM are examples of factors that can impact the lignin-CDOM relationship and jeopardize the utility of the proxy. Here, we investigated the relationship between dissolved lignin and CDOM absorption coefficients in the ultraviolet range (250-400 nm) using a large and fairly comprehensive data set (n = 421) that spanned the full salinity gradient and multiple types of coastal systems in the global ocean. The objectives of this study were to reveal the characteristics of the relationship between CDOM absorption and dissolved lignin concentration in the ocean, provide insights about the factors driving and affecting the relationship, and provide simple models for predicting dissolved lignin concentration from CDOM absorption coefficients that are widely applicable in terrestrially-influenced coastal systems.

Field Sampling Overview and Study Regions
Field samples were collected from a wide variety of rivers, estuaries, and coastal waters across the northern hemisphere (Figure 1 and Table 1). A total of 421 paired water samples for CDOM and dissolved lignin analyses were collected as part of different research projects between September 2005 and October 2013. Nine different coastal-ocean areas off North America and Asia were sampled and encompassed subtropical, temperate, and arctic environments. A majority of samples was collected in waters from or influenced by rivers that contrast in terms of discharge dynamics and drainage-basin vegetation (e.g., Mississippi, Atchafalaya, Delaware, Susquehanna, Hudson, Connecticut, Mackenzie, Ob, Yukon, Noatak, Lena, and Yenisey). Two study regions stood out: (1) the Chukchi Sea was an area of high biological productivity that received less riverine inputs compared to the other regions (Cota et al., 1996;Grebmeier, 2012;Shen et al., 2012a;Palmer et al., 2013), and (2) the Otsuchi Bay was influenced by rivers with small drainage basins, steep slopes, and short residence times (Fukuda et al., 2007). Some samples in this data set (e.g., northern Gulf of Mexico, South Carolina Coast, Middle Atlantic Bight) were also collected in waters influenced by coastal wetlands and marshes. Finally, a significant number of samples were also representative FIGURE 1 | The nine ocean regions investigated in this study, along with their corresponding sampling sites. A total of 421 sample sets were collected between 2005 and 2013. Sampling details are provided in Table 1.
Frontiers in Marine Science | www.frontiersin.org The corresponding sampling areas and sites are shown in Figure 1. A total of 160 measurements were acquired below the surface (i.e., sample depth >20 m). *CFL, Circumpolar Flaw Lead; NABOS, Nansen and Amundsen Basin Observational System; ICESCAPE, Impacts of Climate on the Eco-Systems and Chemistry of the Arctic Pacific Environment; TEAMS, Tohoku Ecosystem-Associated Marine Science; UNOLS, University-National Oceanographic Laboratory System. † Peer-reviewed publication(s) in which part or all of the research project's data [TDLP 9 and ag(λ)] has previously been used.
of oligotrophic, subsurface (e.g., Arctic halocline), and deepwater environments (e.g., North Atlantic Deep Water). Overall, samples were collected during different seasons from waters spanning salinities ranging from 0 to 37, depths ranging from 0 to 2800 m, and water types ranging from nutrient-rich riverine waters to oligotrophic marine waters.

CDOM Sampling and Analysis
Most samples for CDOM analysis were gravity filtered from Niskin bottles using Whatman Polycap Aqueous Solution (AS) cartridges (0.2-µm pore size), collected in pre-combusted borosilicate glass vials, and stored immediately at 4 • C until analysis in the laboratory. A few samples were collected using a bucket and filtered using a peristaltic pump through the Whatman Polycap Aqueous Solution (AS) cartridges. Absorbance of the samples was measured from λ = 250-800 nm using an ultraviolet (UV)-visible dual-beam spectrophotometer (Shimadzu UV-1601 or UV-2401PC/2501PC) and 10-cm quartz cells. For riverine and low-salinity samples (salinity < 10), 5-cm quartz cells or 1-cm quartz cuvettes were used. An exponential fit of the absorbance spectrum over an optimal spectral range was used to derive an offset value that was subtracted from the absorbance spectrum (Fichot and Benner, 2011). The optimal spectral range for the fit was typically 400-700 nm, but was adjusted for each sample (e.g., 350-700, 450-700 nm) as to provide the best fit of the absorbance spectrum possible over that spectral range. Absorbance corrected for offset was then converted to Napierian absorption coefficients, a g (λ) (m −1 ). The dependence of a g (λ) on λ is described using Equation (1): where λ 0 < λ and S is the spectral slope coefficient in the λ 0 -λ nm spectral range. A careful examination of each a g (λ) spectrum demonstrated the a g (λ)-values (especially for λ > 350 nm) were well above the uncertainty of the measurement (±0.005 m −1 at λ = 400 nm, for a 10-cm pathlength). The spectral slope coefficient of CDOM between 275 and 295 nm, S 275−295 , was calculated as the slope of the linear regression of ln[a g (λ)] on λ, between λ = 275 and 295 nm (Helms et al., 2008;Fichot and Benner, 2011). The S 275−295 -values are reported in nm −1 .

Lignin Sampling and Analysis
Samples for lignin analysis (1-10 L) were gravity filtered from Niskin bottles using Whatman Polycap AS cartridges (0.2-µm pore size), acidified to pH = 2.5-3 with sulfuric acid, and extracted within a few hours using C-18 cartridges (Louchouarn et al., 2000). Cartridges were stored at 4 • C until elution with 30 mL of methanol (HPLC-grade). The eluent was stored in sealed, pre-combusted glass vials at -20 • C until analysis. Lignin phenols were analyzed using the CuO oxidation method of Kaiser and Benner (2012). Concentrations of lignin phenols were measured as trimethylsilyl derivatives using an Agilent 7890 gas chromatograph equipped with a Varian DB5-MS capillary column and an Agilent 5975 mass selective detector. The sum of nine lignin phenols was calculated as the total lignin phenol concentration: p-hydroxybenzaldehyde (PAL), p-hydroxyacetophenone (PON), p-hydroxybenzoic acid (PAD), vanillin (VAL), acetovanillone (VON), vanillic acid (VAD), syringaldehyde (SAL), acetosyringone (SON), and syringic acid (SAD). The sum of nine p-hydroxy, vanillyl, and syringyl lignin phenols is denoted as TDLP 9 and is reported in units of nmol L −1 . The sums of concentrations of the three p-hydroxy phenols (PAL+PON+PAD), three vanillyl phenols (VAL+VON+VAD), and three syringyl phenols (SAL+SON+SAD) were also calculated, and are denoted here as P, V, and S phenols. All values of P, V, and S phenols are reported in units of nmol L −1 . The P/V and S/V ratios were also calculated and are unitless.

Models
The models presented in this work for predicting dissolved lignin concentration from CDOM absorption coefficients were developed using the R environment for statistical computing and graphics and the "lm {stats}" function, obtained from the Comprehensive R Archive Network (https://cran.r-project.org).

The Relationship between Lignin and CDOM Absorption Coefficients Across Coastal Oceans
A strong, non-linear relationship between dissolved lignin concentrations and the ultraviolet (UV) absorption coefficients of CDOM was observed over the full range of lignin concentrations measured in this study (Figures 2A,C). Dissolved lignin concentrations, TDLP 9 , ranged more than 500-fold from <1 to 491 nmol L −1 , and the CDOM absorption coefficient, a g (λ), ranged 85-fold at λ = 250 nm (1.1-84.5 m −1 ), and 300-fold at λ = 400 nm (0.025-7.8 m −1 ). The relationship between TDLP 9 and a g (λ) followed a relatively well-behaved exponential over the wide range of lignin concentrations and coastal environments sampled, thereby indicating that the observed variations in CDOM UV absorption are tightly linked to changes in lignin along the river-ocean continuum (Figures 2A,C). The concavedown shape of the exponential (Figures 2A,C) further indicated that the rate of decrease of a g (λ) relative to that of TDLP 9 (i.e., the derivative of the fitted line) increased substantially as lignin concentration decreased along the river-ocean continuum. At 250 nm, this rate increased about nine-fold from 0.09 to 0.75 L nmol −1 m −1 , meaning a g (250) decreased 9 times faster in highsalinity waters than in low-salinity waters for the same 1 nmol L −1 decrease in TDLP 9 along the river-ocean continuum. The exponential behavior was less accentuated at 400 nm, for which the rate increased only 3-fold from ∼0.01 to 0.035 L nmol −1 m −1 . Linear regressions of the form shown in Equation (2) provided adequate fits for the exponential relationship between TDLP 9 and a g (λ) at any wavelength in the 250-400 nm range ( Figures 2B,D, 3A).
The relationship was stronger with a g (250) than with a g (400), as indicated by the coefficient of determination (R 2 ) of Equation (2) decreasing from 0.946 at 250 nm to 0.910 at 400 nm (Figures 2B,D). A comparison of the R 2 for every wavelength between 250 and 400 nm further indicated the overall relationship was strongest (R 2 > 0.95) at ∼285 nm ( Figure 3A). Despite the goodness of the regression, significant scatter around the fitted lines was evident. A close examination of the relationship between TDLP 9 and a g (250) using log-scale axes ( Figure 2B) revealed that the scatter was relatively uniform (i.e., homoscedastic) over the entire range of lignin concentration. In contrast, the scatter around the fit was much less uniform (i.e., heteroscedastic) for a g (400), and increased substantially and progressively with decreasing lignin concentration ( Figure 2D). The linear relationship between TDLP 9 and a g (400) was marginal (R 2 < 0.4) when lignin concentration was <6 nmol L −1 . The strength of the relationship between dissolved lignin concentration and CDOM absorption was maximal in the UV-C region at low lignin concentrations, and shifted to the UV-A region at higher lignin concentrations ( Figure 3B). Additional insights about the relationship were gained by comparing the R 2 of Equation (2) for two subsets of the entire data set: 1) a "low-CDOM" subset (n = 263) where a g (250) < 4 m −1 , and 2) a "high-CDOM" subset (n = 158) where a g (250) ≥ 4 m −1 . The R 2 analysis using the "low-CDOM" subset ( Figure 3B) revealed the relationship was optimal (R 2 ∼ 0.7) at 250-275 nm (UV-C region), but weakened progressively and rapidly with wavelength to reach a minimal value (R 2 ∼ 0.35) at 400 nm. The a g (263) was the optimal predictor of TDLP 9 for the "low-CDOM" subset. However, the a g (λ) in the UV-A region (λ > 320 nm) was generally a poor predictor of lignin concentration in waters less influenced by terrigenous inputs. Interestingly, the reverse trend was observed when examining the "high-CDOM" subset. The R 2 analysis using the "high-CDOM" subset ( Figure 3B) indicated the relationship was good at all wavelengths, but improved significantly from a minimum R 2 = 0.87 at 250 nm to a R 2 ∼ 0.92 in the UV-A region (320-400 nm). As will be explained later, this shift from UV-A to UV-C is likely related to the progressive change in the molecular structure of lignin, and potentially, of other terrigenous chromophores co-varying with lignin. This shift also indicates that a g (λ) at a single wavelength is not an optimal predictor for lignin concentration over the range of lignin concentration found in coastal oceans. The p-hydroxy phenols (P), vanillyl phenols (V), and syringyl phenols (S) that make up TDLP 9 exhibited varying degrees of correlation with the CDOM absorption coefficient (Figure 4). As explained earlier, the relationship between a g (λ) and TDLP 9 was optimal at ∼285 nm for the entire range of lignin concentrations (R 2 = 0.951). The separation of TDLP 9 into P, V, and S phenols indicated the relationship between a g (285) and P phenols across the full range of values was even stronger (R 2 = 0.959) than with TDLP 9 . Only samples from the Ob estuary deviated significantly from the general relationship, and were enriched in P phenols relative to a g (285). However, this enrichment in P phenols was apparent and most likely originated from elevated levels of peatderived sphagnum acid typically found in the Ob watershed, and their subsequent conversion into P phenols during the oxidation of lignin (Amon et al., 2012;Philben et al., 2014). The relationship became progressively less robust with V phenols (R 2 > 0.925) and S phenols (R 2 > 0.893). The trend is consistent with P phenols being more resistant to degradation in natural waters than V phenols, and with S phenols being the most susceptible to degradation (Opsahl and Benner, 1995;Benner and Opsahl, 2001;Benner and Kaiser, 2011). Furthermore, lignin produced by gymnosperms is devoid of S phenols, whereas P and V phenols are ubiquitous in lignin (Hedges and Mann, 1979;Benner and Opsahl, 2001;Shen et al., 2012b). This most likely impacted the relationship between a g (285) and S phenols by making it sensitive to variations in watershed vegetation.

Physical Mixing and Degradation Of Terrigenous CDOM
The decreasing trends in TDLP 9 concentrations and a g (285) along the salinity gradient confirmed the importance of physical mixing in the distribution of these two variables along the river-ocean continuum ( Figure 5). As expected from the strong relationship observed between TDLP 9 and ag(285), the relationships between salinity and TDLP 9 (Figures 5A,B) and between salinity and a g (285) (Figures 5C,D) were strikingly similar. Although a large scatter in TDLP 9 and a g (285) at salinities <5 and >35 indicated important differences among riverine sources and ocean end-members, the trends observed for each region clearly indicated physical mixing of riverine and oceanic end-members played a dominant role in regulating the variability of TDLP 9 and a g (285) concentrations. Two South Carolina samples strongly influenced by salt-marsh inputs (Ashley River and Hunting Island, SC) stood out as being relatively enriched in CDOM and lignin, in agreement with previous observations that marsh-derived CDOM can be chemically and optically distinct (e.g., stronger absorption) from other estuarine CDOM (Tzortziou et al., 2008). In contrast, samples with salinities ∼28-33 from the Western Arctic Ocean (Chukchi Sea, Beaufort Sea, Amundsen Gulf) stood out as being relatively depleted in CDOM and lignin. Considering most of these samples were collected in regions experiencing extensive sea-ice melting, this relative depletion in CDOM and lignin was likely due to the presence of ice-melt water (low salinity and depleted in DOM).
Several degradation indicators further indicated the general decrease in lignin concentrations from the rivers to the oligotrophic regions of coastal oceans was accompanied by a progressive change in the degradation state of terrigenous CDOM (Figure 6). The TDLP 9 -values decreased by almost four orders-of-magnitude along the river-ocean continuum. This decrease exhibited a weak, but significant and positive correlation (r = 0.51) with the ratio of S to V phenols (S/V), and was accompanied by exponential increases in the ratio of P to V phenols (P/V) and the spectral slope coefficient of CDOM absorption (S 275−295 ). These trends are consistent with similar observations along the salinity gradient of river-influenced ocean margins in previous studies, and with the physical mixing of two terrigenous CDOM end-members with different degradation states (Opsahl and Benner, 1998;Hernes and Benner, 2003;Helms et al., 2008;Fichot and Benner, 2012).
More specifically, the trends in S/V, P/V, and S 275−295 observed herein are consistent with the physical mixing of two FIGURE 5 | Relationships between salinity and TDLP 9 using a linear-scale y-axis (A) and a log-scale y-axis (B), and relationships between salinity and a g (285) using a linear-scale y-axis (C) and a log-scale y-axis (D). end-members: (1) a relatively unaltered terrigenous CDOM end-member from rivers, and (2) a photochemically altered terrigenous CDOM end-member from the oligotrophic regions of the coastal oceans (Benner and Opsahl, 2001;Hernes and Benner, 2003;Fichot and Benner, 2012). The S/V has been shown to decrease during natural sunlight photodegradation experiments and during coupled photochemical-biological degradation experiments under simulated sunlight (Opsahl and Benner, 1998;Spencer et al., 2009;Benner and Kaiser, 2011). Note that Spencer et al. (2009) actually showed an increase in S/V after extreme exposure to solar radiation, but only after an initial decrease in S/V. The P/V has also been shown to increase during photodegradation experiments (Benner and Kaiser, 2011). In addition, the S 275−295 has been shown to increase with decreasing molecular weight of CDOM and during exposure to solar radiation (Helms et al., 2008;Fichot and Benner, 2012;Yamashita et al., 2013). The trends in S/V, P/V, and S 275−295 observed in this data set are consistent with the effects of photodegradation, and suggest the progressive change in the degradation state of terrigenous CDOM played an important role shaping the relationship between lignin and CDOM absorption coefficients.
The relatively weak correlation (r = 0.51) between the S/V and TDLP 9 was not unexpected ( Figure 6A). The S/V is not only affected by solar exposure, but is also a well-established indicator of the source vegetation of lignin and terrigenous DOM in natural waters (Hedges and Mann, 1979;Hedges and Ertel, 1982;Opsahl et al., 1999;Dittmar and Kattner, 2003;Shen et al., 2012b). As mentioned earlier, the S/V reflects the relative contributions of gymnosperm and angiosperm vegetation in lignin and terrigenous DOM because S phenols are not produced by gymnosperms. Lower S/V are thus typically measured in lignin that originates from gymnosperm-dominated watersheds (e.g., boreal forests), whereas higher S/V are indicative of significant contributions from angiosperms (e.g., tundra, deciduous forests). The effect of vegetation source on lignin phenol composition was evident in this data set, with low S/V values (< 0.5) measured in Arctic waters. The drainage basins of large Arctic rivers are dominated by gymnosperms, and the S/V of Arctic rivers typically range from ∼0.2 to 0.45 (when calculated from nmol L −1 concentrations) and exhibit some moderate variations with flow regime (Amon et al., 2012). The effect of vegetation was also evident from the high S/V values (>0.5) measured in the sub-tropical and temperate coastal oceans, which receive substantial contributions from angiosperms. Some samples from the Chukchi Sea had relatively high S/V values, but these samples were strongly influenced by Pacific Ocean water or small rivers draining tundra. Finally, a large number of samples from the Middle Atlantic Bight had comparatively low S/V values. This was likely observed because most of the terrigenous DOM on the shelf of the Middle Atlantic Bight shelf originates from gymnosperm-dominated environments further north, and is transported southward by currents (Chapman and Beardsley, 1989;Mountain, 1991).
Evidence suggests the P/V ratio is a useful indicator of the extent of lignin degradation. Here, the P/V increased severalfold and exponentially with decreasing lignin concentrations ( Figure 6B). The P/V has been shown to increase substantially after continuous or intermittent solar exposure (Benner and Kaiser, 2011), and unlike the S/V, the P/V is not as sensitive to changes in source vegetation. The P phenols are widely distributed among vascular plants. However, as mentioned earlier, inputs from peatlands can increase the P/V, most likely as a result of the conversion of sphagnum acid into P phenols during CuO oxidation. Elevated concentrations of P phenols were observed in the peatland-influenced samples from the Ob River estuary, and to a much lesser degree, from the Mackenzie River estuary. Samples influenced by sub-tropical salt marshes (e.g., Terrebonne Bay, South Carolina Coast) also have slightly elevated values of P/V. Note the P/V can also reflect changes in the flow regime of rivers and be indicative of the 14 C age of terrigenous DOM immobilized (Amon et al., 2012). High P/V and lower lignin concentrations have thus been observed during base flow in Arctic Rivers, and the reverse observed during peak flow (Amon et al., 2012), suggesting that changes in river flow could contribute to the trends observed here between TDLP 9 and P/V. Finally, the highest P/V-values (>4) were observed in samples from oligotrophic eddies shed from the Loop Current and impinging on the continental shelf in the northern Gulf of Mexico (Schiller et al., 2011;. These high P/V were likely attributed to the long-term solar exposure and degradation in the optically clear Loop-Current waters, which originate in the Atlantic Ocean. The exponential increase in S 275−295 with decreasing lignin concentrations was also consistent with physical mixing and the effects of photochemical degradation ( Figure 6C). S 275−295 has been shown to increase during photodegradation and is a sensitive indicator of CDOM photodegradation in natural waters (Helms et al., 2008(Helms et al., , 2013Ortega-Retuerta et al., 2009;Fichot and Benner, 2012;Yamashita et al., 2013). Overall, the S 275−295 displayed a clear increasing trend with decreasing TDLP 9 concentrations in the present study. With the exception of a few outliers, S 275−295 increased monotonically with decreasing TDLP 9 -values ranging from 500 to 10 nmol L −1 . When TDLP 9 was <10 nmol L −1 , the relationship between S 275−295 and TDLP 9 split into two groups of data: one that continued to show a strong relationship between S 275−295 and TDLP 9 , and one that showed little correlation between the two. The first group consisted essentially of surface samples, whereas the second group corresponded primarily of sub-surface and deep waters, and of productive, high-latitude waters that were moderately influenced by terrigenous inputs (e.g., Chukchi Sea, Canada Basin, Amundsen Gulf). Limited solar exposure, a greater relative contribution of microbial degradation, or a more prominent influence of non-terrigenous sources of CDOM (in situ production) can be expected in these water types, and are consistent with the relatively low S 275−295 -values (∼0.025 nm −1 ) observed. The limited solar exposure in sub-surface and deep water likely contributed to the lower S 275−295 . Furthermore, previous studies reported S 275−295 -values of ∼0.025 nm −1 for plankton-derived CDOM, and have provided evidence that S 275−295 decreases slightly during microbial degradation (Ortega-Retuerta et al., 2009;Fichot and Benner, 2012).
It is important to understand that the progressive changes in the degradation state of terrigenous CDOM observed along the river-ocean continuum are largely driven by the physical mixing of riverine and ocean end-members (Fichot and Benner, 2012), and that the observed change largely reflects long-term effects of photochemical degradation on the ocean end-member of terrigenous CDOM. Terrigenous CDOM, which remains relatively unaltered in rivers, mixes with photochemically altered terrigenous CDOM from oligotrophic regions of ocean margins (Benner and Opsahl, 2001;Hernes and Benner, 2003;Fichot and Benner, 2012). Solar exposure in surface waters and the extent of photochemical degradation occurring over typical residence times of river water in ocean margins (e.g., months) has been shown to be relatively moderate, even in subtropical riverinfluenced ocean margins . Much of the photochemical degradation occurs beyond the shelf where the residence time of the water is much longer and solar exposure in surface waters is strongly enhanced by a sharp increase in UV penetration relative to the mixed layer depth. In some cases, the terrigenous CDOM found in the oligotrophic regions of the coastal oceans can have a very different and more distant origin, as was the case in this data set when Loop-Current oligotrophic eddies impinged on the Northern Gulf of Mexico shelf . In the Arctic Ocean, solar exposure and photodegradation are less than in subtropical regions, but the ocean end-member can have a more distant origin and the terrigenous CDOM can be more chemically altered (e.g., Pacific Water in Chukchi or Beaufort Seas, or Atlantic water off Svalbard).
The concave-down shape of the exponential relationship between TDLP 9 and a g (λ) was consistent with mixing and long-term photodegradation playing important roles in shaping the relationship between lignin concentration and CDOM absorption coefficients. Photodegradation has been shown to lead to a decrease in the molecular weight and aromaticity of terrigenous CDOM (Kieber et al., 1989(Kieber et al., , 1990Opsahl and Benner, 1998;Miller et al., 2002;Helms et al., 2008). The molecular weight and aromaticity of a compound is, in turn, strongly linked to its molar absorptivity (Chin et al., 1994;Weishaar and Aiken, 2001), such that a given concentration of lignin in lowmolecular-weight molecules absorbs much less efficiently than the same concentration of lignin within high-molecular weight molecules. The concave-down shape of the relationship indicated there was an acceleration in the loss of a g (λ) relative to that of TDLP 9 as lignin concentrations decreased from riverine to oligotrophic waters. A loss of absorption strictly due to the loss of chromophoric material (i.e., without a loss of molar absorptivity) would tend to produce a linear rather than a concave-down exponential relationship. Here, the acceleration in the loss of a g (λ) was consistent with a progressive loss of molar absorptivity superimposed on a loss of chromophoric material, and is consistent with previous observations of a decrease in molecular weight of both CDOM and lignin along the river-ocean continuum in ocean margins (Hernes and Benner, 2003;Helms et al., 2008).
The characteristics of the relationship between TDLP 9 and a g (400) were consistent with a progressive loss of intramolecular charge-transfer interactions in CDOM along the river-ocean continuum. Intramolecular charge-transfer interactions among chromophores in aromatic molecules such as lignin have been proposed as a mechanism to explain the exponential shape of the CDOM absorption spectrum and its absorption at wavelengths >350 nm (Del Vecchio and Blough, 2004;Boyle et al., 2009;Sharpless and Blough, 2014). In this model, charge-transfer interactions among electron donors (e.g., phenols and/or methoxylated phenols in lignin) and electron acceptors (e.g., quinones or aromatic ketones/aldehydes in lignin) are thought to produce lower-energy optical transitions and enhance CDOM absorption at wavelengths >350 nm. The contribution of electronic interactions to the CDOM absorption is linked to the molecular weight of terrigenous molecules like lignin, because compounds with higher molecular weight have a greater potential for intramolecular chargetransfer interactions. A sharp decrease in the molecular weight of lignin along the river-ocean continuum would therefore be associated with a loss of intramolecular charge-transfer interactions, and with a weakening contribution of lignin to the CDOM absorption at wavelengths >350 nm. The worsening of the relationship between TDLP 9 and a g (400) with decreasing lignin concentrations, which eventually leads to an insignificant relationship between TDLP 9 and a g (400) in the more oligotrophic regions of the coastal oceans, is consistent with the effects of photodegradation.

On the Contribution of Lignin and other Components to CDOM Absorption
The existence of a well-behaved relationship between TDLP 9 and a g (λ) from the rivers to the oligotrophic regions of several contrasting coastal oceans was a strong indication that lignin is generally an important driver of the variability of CDOM absorption coefficients in coastal waters. However, the contribution of lignin to the absorption coefficient of CDOM was not uniform across the 250-400 nm spectrum. Lignin appeared an important contributor to a g (λ) in the UV-C (λ < 280 nm) and UV-B (280 < λ < 320 nm) regions over the entire range of lignin concentrations observed in this study. However, for λ > 350 nm, lignin only seemed to contribute significantly to CDOM absorption for the upper range of TDLP 9 concentrations and a g (λ) values (e.g., TDLP 9 > 10 nmol L −1 or a g (250) > 4 m −1 ). For this upper range, lignin appeared an even greater contributor to a g (λ) in the UV-A region than in the UV-B and UV-C regions. As described earlier, the spectral dependency of the relationship likely results from the change in the degradation state of terrigenous CDOM. The P phenols in lignin appeared to be the greater contributor to the absorption coefficients of CDOM, followed by the V phenols and the S phenols. A greater contribution of the P phenols to CDOM absorption is consistent FIGURE 7 | Scatter plots of the relationship between a g (285) and TDLP 9 (log-scale axes) highlighting the data from each of the nine study regions.
with their greater resistance to photodegradation (Benner and Kaiser, 2011), and their selective preservation relative to the V and S phenols as lignin concentrations decrease along the river-ocean continuum. Although this is only speculation, it is possible the greater photochemical resistance of P phenols contributes to their preservation in higher molecular weight and more aromatic structures that have higher molar absorptivities and higher potential for intramolecular electronic interactions. The selective preservation of the P phenols is consistent with the strong relationship observed between P phenols and a g (λ) in the more oligotrophic regions of the coastal oceans.
A variety of other chromophoric constituents in terrigenous DOM also likely contribute to CDOM absorption and impact the relationship between lignin and CDOM. Lignin has the chemical moieties necessary to make it an important chromophore (Boyle et al., 2009;Sharpless and Blough, 2014). It is important to recognize, however, that a variety of other chromophoric components in terrigenous DOM likely co-vary strongly with lignin, and can contribute significantly to the CDOM absorption despite the strong linkage observed between CDOM and lignin. Tannins, a class of polyphenols, are components of many terrestrial plant tissues (e.g., foliage, bark) and are readily released to aquatic systems (Benner et al., 1990;Hernes et al., 2001;Kraus et al., 2003;Maie et al., 2008). Dissolved tannins also absorb very strongly in the ultraviolet and visible region (Lawrence, 1980;Alberts, 1982;Kraus et al., 2003). However, previous studies have reported that tannin concentrations are very low in natural waters (Maie et al., 2006), and that microbial degradation, precipitation, sorption to sediments and potentially photodegradation removes tannins from aquatic systems (Benner et al., 1986;Hernes et al., 2001;Maie et al., 2008). The contribution of tannins to the CDOM absorption in coastal waters remains unclear, and more work is needed to directly assess the linkage between tannins and CDOM in coastal oceans.
Dissolved black carbon is a component of terrigenous DOM, and several studies indicate it is a potentially important component of DOC in natural waters (Kim et al., 2004;Mannino and Harvey, 2004;Ziolkowski and Druffel, 2010;Dittmar et al., 2012). Dissolved black carbon is thought to contribute up to 10% of the global riverine DOC flux to the ocean (Jaffé et al., 2013), and it contains condensed aromatic compounds that absorb strongly in the ultraviolet region (Russo et al., 2012). Stubbins et al. (2015) showed that dissolved black carbon concentration was more strongly correlated with a g (250) than with a g (400) for high-CDOM samples [a g (250)∼10-150 m −1 ]. Interestingly, our study revealed the reverse trend for lignin, with lignin concentration being more strongly correlated with a g (400) than with a g (250) for samples from a similar range of CDOM absorption [a g (250) ∼ 4-100 m −1 ]. These opposing trends are possibly linked to differences in the molecular structures of dissolved black carbon and dissolved lignin. Black carbon contains condensed aromatic compounds that typically have very high molar absorptivities in the UV (Chin et al., 1994;Weishaar and Aiken, 2001;Russo et al., 2012), whereas partially oxidized lignin contains the moieties responsible for intramolecular charge-transfer interactions, which are particularly effective at absorbing at wavelengths >350 nm (Del Vecchio and Blough, 2004;Boyle et al., 2009;Sharpless and Blough, 2014). Soils are a major source of black carbon and lignin, suggesting these two components likely correlate strongly in river-influenced coastal areas. More work is needed to better understand the potential contribution of black carbon to the optical properties of CDOM along the river-ocean continuum.
An analysis of the percent residuals from the linear fit of the a g (285)-TDLP 9 relationship revealed some interesting variations among the different coastal oceans of this study (Figures 7, 8). Although deviations from the fitted line can occur for a variety of reasons, independent variations in the relative contributions of other, non-lignin components are likely a dominant factor. Plankton and microbes can also produce chromophoric compounds (e.g., amino acids, nucleic acids) that have variable effects on the CDOM dynamics of natural waters and will tend to contribute to the scatter in the relationship between TDLP 9 and a g (λ) (Rochelle-Newall and Fisher, 2002;Nelson et al., 2004;Yamashita and Tanoue, 2008;Ortega-Retuerta et al., 2009;Zhang et al., 2009;Fichot and Benner, 2012). Most of the samples from the Northern FIGURE 9 | Performance overview of dissolved-lignin-concentration prediction models proposed in this study. Two separate models are used to predict TDLP 9 from a g (λ), and the cutoff a g(250) = 4 m −1 is used to separate the two models. For the low-CDOM model where a g (250) < 4 m −1 , a simple linear regression between a g (263) and TDLP 9 is used. In contrast, a multiple-linear-regression model of ln(TDLP 9 ) on ln[a g (λ)] (λ = 275, 295, and 350 nm) is used for the high-CDOM model when a g (250) ≥ 4 m −1 . The parameters used to implement the models are provided in Equations (3) and (4).
Gulf of Mexico, Middle Atlantic Bight, Eurasian shelf, and Otsuchi Bay, exhibited elevated TDLP 9 -values relative to a g (285) (i.e., negative residuals), thereby suggesting lignin was a more dominant contributor to a g (285) in these waters. The deviation was particularly strong in the Otsuchi Bay waters, which receives inputs from short and steep rivers with short water residence times. In contrast, most samples from the Amundsen Gulf, Beaufort Sea, Chukchi Sea, Ob estuary, and South Carolina coast exhibited relatively low TDLP 9 -values compared to a g (285) (i.e., positive residuals), suggesting a greater contribution of nonlignin chromophores. The positive residuals in the Chukchi Sea (all collected in summer) were potentially related to a higher background of autochthonous CDOM in these productive waters (Shen et al., 2012a), whereas those of the Ob estuary are consistent with significant contribution of non-lignin CDOM from Sphagnum sp. peats (Amon et al., 2012;Philben et al., 2014). The positive residuals in the South Carolina coast samples are likely related to contributions from the tidal marshes, which have been shown to have distinctive CDOM chemical and optical properties (Tzortziou et al., 2008). Finally, many of the positive residuals were from samples collected in or near rivers during summer, and could be related to the in situ production of CDOM, or be related to depletion of lignin observed after the peak river flow during spring (Amon et al., 2012;Shen et al., 2012b).
Empirical Models For Estimating TDLP 9 from a g (λ) Exploration of lignin and CDOM relationships provided useful information for the development of two simple empirical models for the retrieval of TDLP 9 from a g (λ) in coastal oceans. The a g (285) was, on average, the best single-wavelength predictor of TDLP 9 over the range of absorption coefficients and lignin concentrations observed in this study. However, the spectral dependency of the relationship indicated a g (263) was the optimal single-wavelength predictor of TDLP 9 for samples where a g (250) < 4 m −1 , and a g (350) was the optimal singlewavelength predictor for TDLP 9 for samples where a g (250) ≥ 4 m −1 . Furthermore, photodegradation had a strong effect on the relationship, and S 275−295 helped factor in this effect and improve the TDLP 9 predictions for high-CDOM samples, where S 275−295 and TDLP 9 were most correlated [TDLP 9 ≥ 10 nmol L −1 or a g (250) ≥ 4 m −1 ]. These observations guided the development of two sub-models separated by the a g (250) = 4 m −1 cutoff.
When a g (250) < 4 m −1 , a "low-CDOM" sub-model based on a simple linear regression was used, ln (TDLP 9 ) = 0.7672 · a g (263) − 0.3987 (3) When a g (250) ≥ 4 m −1 , a "high-CDOM" sub-model based on a multiple linear regression was used, ln (TDLP 9 ) = −2.282 · ln a g (350) − 8.209 · ln a g (275) +11.365 · ln a g (295) + 2.909 (4) In the case of the multiple linear regression of Equation (4), all derived coefficients were significant by the t-test (p < 0.0005), thereby confirming a g (275) and a g (295) represented meaningful additions to the model. The addition of a g (275) and a g (295) also improved the adjusted R 2 of the regression (adj R 2 = 0.936) relative to a single regression model using only a g (350) (adj R 2 = 0.918). The use of a g (275) and a g (295) in Equation (4) was somewhat similar to using S 275−295 as one of the terms, but did not require the extra step of calculating S 275−295 . Note the use of additional predictors in either Equation (3) or (4), or the use of partial-least-square regressions did not significantly improve the adjusted R 2 of the regressions nor the % error of the estimates relative to these simple empirical models. This indicated the useful information contained in the spectral shape of CDOM was mostly exploited. Note here, that even though the predictor terms in Equation (4) are strongly correlated, multi-collinearity does not represent an issue for predictive models. Multi-collinearity is problematic when trying to interpret the meaning of the derived parameters in the equations.
Overall, these simple empirical models provide a practical means to derive reasonably accurate estimates of TDLP 9 from CDOM absorption coefficients in coastal oceans (Figures 9, 10). A comparison of the measured and estimated TDLP 9 indicated the model retrieved TDLP 9 -values that were on average within ±20% of the measured values (Figure 9). The % error was <17% for half of all retrieved TDLP 9 -values, and it very rarely exceeded 60%. As it is currently parameterized, the model might slightly underestimate some of the very high lignin concentrations found in rivers (>200 nmol L −1 ). The capacity of the model to reproduce realistic distributions of TDLP 9 was also assessed for a cross-shelf section of the Middle Atlantic Bight (Figure 10). This subset of the data set was chosen because it encompassed a coastal region with a moderate gradient in TDLP 9 , and included data from both surface and sub-surface waters. The FIGURE 10 | Comparison of the measured distribution of TDLP 9 to that predicted from a g (λ) using the proposed models [ Figure 9 and Equations (3) and (4)] along a cross-shelf transect in the Middle Atlantic Bight. The white contour lines shown in the cross-shelf sections correspond to salinity. model adequately reproduced the most important features of the measured-TDLP 9 distribution along the salinity gradient. For instance, the cross-shelf gradient from the lignin-rich mouth of the Hudson River to the depleted open waters of the Middle Atlantic Bight was well-reproduced, just like the subsurface TDLP 9 maximum measured at the entrance of Hudson canyon. The more subtle features of the TDLP 9 distribution were less well-reproduced. The average error of ±20% associated with the retrievals implies that the model can only reproduce significant gradients in lignin.
The model is expected to perform similarly well in most river-influenced coastal oceans of the world, as long as the TDLP 9 predictions are made within the ranges of a g (λ) and TDLP 9 -values used to train the model. The variety of coastal environments represented in this data set suggests the model will be suitable for most sub-tropical, temperate, and Arctic coastal oceans that receive notable inputs of terrigenous DOM. However, the performance of the model remains uncertain in coastal regions influenced by tropical rivers, as these were not represented in the training data set. More importantly, the use of the model is not recommended in major coastal upwelling systems that typically receive little terrigenous DOM inputs (e.g., California current, Benguela current). In these coastal water types, autochthonous sources of DOM are likely to exert a more dominant control on the dynamics of a g (λ), and the link between TDLP 9 and a g (λ) is likely to deviate significantly from the relationship established. Naturally, this simple model is not final, and can be very easily re-trained to improve and expand its applicability as more data become available.

AUTHOR CONTRIBUTIONS
CF, RB, KK, CL, and HO collected the in situ samples for CDOM and lignin analysis. CF, KK, YS, and CL processed the lignin samples, and CF, YS, CL, and RA processed the CDOM samples. CF wrote the manuscript with important discussion with RB, KK, YS, and RA and with comments from all authors.