Hundred Fifty Years of Herbarium Collections Provide a Reliable Resource of Volatile Terpenoid Profiles Showing Strong Species Effect in Four Medicinal Species of Salvia Across the Mediterranean

Herbarium samples are increasingly being recognized for their potential in answering a wide range of research questions. However, the suitability of herbarium samples for chemical analysis is largely unexplored as they are thought to be too degraded. The aim of this study was to explore terpenoid profiles across time and geographic space for four medicinal species of Salvia across the Mediterranean to assess the suitability of using herbarium specimens in chemical analyses. Herbarium samples of Salvia aethiopis, S. multicaulis, S. officinalis, and S. sclarea collected over 150 years across the Mediterranean were compared to modern samples using both targeted and untargeted gas chromatography-mass spectrometry analysis of terpene profiles. There was no effect of collection year on chemical composition, although the total concentration of the 20 assessed standards and the concentrations of two individual standards significantly decreased over time. Instead, chemical profiles were defined by species, with strong species effects identified on both the targeted and untargeted chemical composition. Geographic variation was a factor in regulating the untargeted chemical compositions, suggesting some underlying environmental effects. However, there was no effect of sample altitude on either the targeted or untargeted chemical compositions. The chemical composition of the four Salvia species is predominantly defined by species, with a substantially smaller effect of year of sampling. Given these results, herbarium collections may well represent a considerably underused resource for chemical analyses that can benefit biodiversity and other studies.

Keywords: age, altitude, collections, GC-MS, geography, herbarium, Salvia, terpene

INTRODUCTION
Herbarium collections are increasingly being recognized as a unique, verifiable, and underused resource of big data across time and space for a variety of research questions (Lavoie, 2012; Funk, 2014; Rønsted et al., 2017; Cardoso et al., 2018; James et al., 2018). Herbarium collections offer an easily accessible source of specimens for a plethora of research questions compared to field expeditions (Bebber et al., 2010; Hardion et al., 2014; Xu et al., 2015). Additionally, herbarium specimens are records in time and space and represent several hundred years of collection history across the globe, including rare and extinct species and species from habitats that no longer exist (Silva et al., 2017), which provides possibilities simply not available using only modern samples.
Collections provide a time window into the past allowing exploration of changes in composition of floras (Calinger, 2015), distribution of invasive weeds or threatened species (Rivers et al., 2011;Martin et al., 2016), changes in flowering times (Davis et al., 2015) or leaf-out times (Everill et al., 2014) in response to climate change as well as to model predictions of future trends (James et al., 2018).
As the technical difficulties of extracting high quality DNA from historical materials are being overcome (Bieker and Martin, 2018), herbarium materials are also increasingly being used in genomic studies at all scales, from populations to phylogenies (Kuzmina et al., 2017;Bieker and Martin, 2018) as well as to study domestication history (da Fonseca et al., 2015) or plant pathogens (Yoshida et al., 2014).
However, the exploration of herbarium material for plant metabolite data is comparatively understudied, and the reliability of chemical data from herbarium samples is uncertain due to the expectation that plant specialized metabolites may not be well preserved over longer time scales. Whereas alkaloids, for example, are generally considered highly stable (Cook et al., 2009; Dewick, 2009; Yilmaz et al., 2012), low molecular weight terpenoids tend to be both volatile and thermolabile and may be easily oxidized or hydrolyzed, altering the chemical composition of the plant material depending on the conditions during processing and storage (Turek and Stinzing, 2013; Tasca et al., 2018).
It is therefore essential for any study using chemical data extracted from herbarium materials to test for potential age effects on results in order to verify interpretations (Jafari et al., 2018). Only a few studies to date have systematically tested the stability of different compound classes in historical herbarium samples. An early study by Eloff (1999) showed no sample age effect on antibacterial activity of herbarium samples of Combretum erythrophyllum (Burch.) Sond. and Helichrysum pedunculatum Hilliard & B.L. Burtt over almost 100 years, and only minor differences were observed in chemical composition of flavonoids and terpenoids using thin layer chromatography (TLC). Using proton nuclear magnetic resonance (1H NMR) spectroscopic metabolite screening, Jafari et al. (2018) found no significant difference in metabolite profiles of recently dried and up to 25-year-old fungi specimens. Cook et al. (2009) found no age effect on toxic diterpenoid alkaloid composition of Delphinium occidentale S. Watson from up to 100-year-old herbarium specimens, and Yilmaz et al. (2012) found no significant degradation of quinine alkaloids in 50- to 150-year-old historical Cinchonae bark samples. Likewise, a study by Zangerl and Berenbaum (2005) found no sampling age effect on furanocoumarins in the phototoxic invasive Pastinaca sativa L. over 150 years. Consequently, these previous studies support the idea that herbarium specimens provide a significant untapped resource of chemical data in addition to data on distribution and morphological traits over time and space.
Plant specialized metabolites are selected through evolution for their biological activities and express some degree of phylogenetic clustering of different compound classes (e.g., Hegnauer, 1962-1973; Ehrlich and Raven, 1964; Rønsted et al., 2012). However, natural variation in plant chemical composition is common and attributed to both biotic (e.g., herbivory and microorganisms) and abiotic environmental conditions as well as potentially differential local chemotypes (Wink, 2003; Moore et al., 2014). The relative importance of different drivers of chemical diversity is an outstanding puzzle, but recent attention has focused on altitude as an explanatory parameter (Russo et al., 2013; Mahzooni-Kachapi et al., 2014; Senica et al., 2017; Pandey et al., 2018). In addition to abiotic components, correlation of chemical variation with altitude is hypothesized to be related to variable biotic pressure from herbivory and microbial communities (Abdala-Roberts et al., 2016; Maldonado et al., 2017; Pandey et al., 2018). Herbarium collections along with their associated data can provide specimens of verified geographical and altitudinal origin for further studies.
In this work, we tested the suitability of using herbarium specimens in plant chemical studies by assessing targeted and untargeted chemical variation of four Salvia species. Samples ranged from modern collections to 150 year old herbarium collections. We had three main objectives. (1) Our first objective was to investigate whether we could observe significant chemical variation between Salvia species using herbarium specimens, and compare the results to other studies that have used modern material.
(2) Our second objective was to investigate age effects on Salvia chemical variation, testing for changes in chemical composition and diversity associated with increasing sample age. (3) Our final objective was to assess whether herbarium specimens could be used in answering ecological questions on Salvia chemical composition variation and diversity across altitudes and geographical regions in the four Salvia species.

Plant Material Sampling Strategy and Study Design
As a case study, we selected three Iranian medicinal Salvia species, which are all locally rare and exhibit a diversity of distributional and altitudinal ranges (Jafari Foutami and Akbarlou, 2017; Figure 1). For each of the four species, we attempted to sample across the geographical distribution of the species. Salvia sclarea L. (clary sage; N = 10) is native to the Northern Mediterranean but has also become a weed outside its native range including North America, where it is called European sage (Ghani et al., 2010; Dickinson and Royer, 2014). Salvia multicaulis Vahl (syn. S. acetabulosa L.; N = 8) is native to Turkey and bordering countries (Tepe et al., 2004). Salvia aethiopis L. (N = 4) occurs naturally in the Northern Mediterranean but has become a noxious weed in North America, where it is referred to as Mediterranean sage (Chalchat et al., 2001; Dickinson and Royer, 2014). Additionally, we included S. officinalis (N = 12), allowing for comparison with extensive literature and verified reference material adhering to the European Pharmacopeia standards (Council of Europe, 2014).
Salvia officinalis L. Ph.Eur 8.3. quality leaf reference material was obtained from Alfred Galke GmbH, Germany. Fresh samples of S. aethiopis, S. multicaulis and S. sclarea were collected in Iran in 2017 and air-dried at room temperature. Herbarium material of S. aethiopis, S. multicaulis, S. officinalis, and S. sclarea was obtained from Herbarium C of the Natural History Museum of Denmark, University of Copenhagen spanning 150 years of collecting across the Mediterranean (Figure 2).
Taxonomic identity of the specimens was confirmed to species morphologically by Isa Jafari Foutami. Collection years were recorded from labels and are listed in Table 1 together with altitude and GPS coordinates. However, in most cases GPS coordinates, and in a few cases altitude, had to be inferred from the locality description on the labels. Details of all plant materials are listed in Table 1.

Terpenoid Extraction and Analysis
Twenty-five milligrams of leaf material was ground using a mortar and pestle under liquid N2 and transferred to a vial (Supelco, Pennsylvania, USA). Seven hundred microliters of analytical grade hexane (Sigma-Aldrich, Denmark) was added, and the sample was vortexed for 10 s and then incubated on a shaker at 37°C for 2 h. Samples were then vortexed again for 10 s and left for 24 h to allow tissue to settle. Approximately 200 µL of the hexane extract was transferred into a weighed glass GC vial with a limited volume insert and kept at −20°C until GC-MS analysis. The remainder of the sample was left uncapped at 65°C until the solvent had evaporated. Vials were reweighed to obtain the dry mass of leaf material. Samples were analyzed in triplicate.
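The quantities above are enough to express an extract concentration per unit of dry leaf mass. The following is a minimal sketch of that arithmetic, not the authors' stated procedure: the function name, the choice to use the full 700 µL extraction volume, and the example concentration are all assumptions made for illustration.

```python
def extract_to_dry_mass_concentration(conc_ng_per_ul, extract_volume_ul, dry_mass_mg):
    """Convert an extract concentration (ng/µL) to µg per g of dry leaf.

    ng/µL * µL gives the ng of compound in the extract; dividing by the dry
    mass in mg gives ng/mg, which is numerically equal to µg/g.
    """
    if dry_mass_mg <= 0:
        raise ValueError("dry mass must be positive")
    total_ng = conc_ng_per_ul * extract_volume_ul
    return total_ng / dry_mass_mg


# Hypothetical example: 100 ng/µL in 700 µL of hexane from 25 mg of dry leaf.
example = extract_to_dry_mass_concentration(100.0, 700.0, 25.0)
print(example)  # 2800.0 (µg/g)
```

Because ng/mg and µg/g are the same ratio, no further scaling factor is needed once the dry mass is known.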
GC-MS analyses were carried out using an Agilent 6890N Gas Chromatograph equipped with a split/splitless injector (200°C), an HP-5MS capillary column (30 m × 0.25 mm; film thickness 0.25 µm), and coupled with an Agilent 5975 MS Detector (MSD), operating in the electron impact (EI) mode at 70 eV. The carrier gas was helium (1.0 mL/min), and the oven temperature was programmed to increase from 60°C to 280°C at a rate of 3°C/min. The injected volume was 2 µL.
Chromatograms were analyzed both as untargeted data using tentative identifications based on the mass spectra in the NIST 8.0 library (National Institute of Standards and Technology, Gaithersburg, MD, USA) and as targeted data using 20 standard compounds.
Commercially available pure chemical certified reference material standards of common Salvia constituents (Russo et al., 2013; Hatipoglu et al., 2016; Craft et al., 2017) were obtained from Sigma-Aldrich, Germany (Table 2). Purity of the commercial standards was not investigated experimentally, so we cannot rule out that some degradation of the standards had occurred. However, for the targeted approach, all samples were analyzed using the same standards.
GC-MS chromatograms were processed using PARADISe, a PARAFAC2 based deconvolution and identification system for direct analysis of complex raw GC-MS data (Johnsen et al., 2017). The targeted and untargeted data matrices are provided as Supplementary Material online (Supplementary Tables S1, S2). Triplicates were averaged; four samples (samples 1, 2, 28, and 30) did not have triplicates due to sample failure. Compounds with retention times between α-pinene (8.3 min) and camphor (28.4 min) were retained for further analysis, and compounds with zero or negative values were excluded.
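The filtering steps described above can be sketched as follows. This is an illustrative outline only, not the PARADISe workflow: the data layout, the function names, the inclusive window bounds, and the choice to apply the zero-or-negative rule to the replicate mean are assumptions, and the peak values are made up.

```python
from statistics import mean

RT_MIN, RT_MAX = 8.3, 28.4  # retention-time window from the text (minutes)


def average_replicates(replicates):
    """Average whatever replicate measurements are available for one compound."""
    return mean(replicates)


def filter_compounds(compounds):
    """Keep compounds inside the retention-time window with a positive mean.

    `compounds` maps a compound name to (retention_time_min, replicate_values).
    """
    kept = {}
    for name, (rt, replicates) in compounds.items():
        if not replicates:
            continue  # nothing measured for this compound
        avg = average_replicates(replicates)
        if RT_MIN <= rt <= RT_MAX and avg > 0:
            kept[name] = avg
    return kept


# Hypothetical peak table for one sample (values are invented).
sample = {
    "early_peak": (5.1, [10.0, 12.0, 11.0]),  # elutes before the window
    "compound_a": (12.0, [4.0, 5.0, 6.0]),    # kept
    "compound_b": (20.5, [0.0, 0.0, 0.0]),    # zero signal, dropped
    "compound_c": (27.9, [2.0, 4.0]),         # only two replicates, still averaged
}
print(filter_compounds(sample))
```

Averaging over the replicates that exist, rather than requiring exactly three, mirrors the situation of the samples that lacked full triplicates.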

Statistical Analysis
All statistical analyses were performed using R statistical software (version 2.14.9). Initially, untargeted data was converted into a Bray-Curtis similarity matrix and non-metric multidimensional scaling (nMDS) was performed using the Vegan package (Oksanen et al., 2013), which was visualized using ggplot2 (Wickham, 2016). A significant species effect on the untargeted chemical composition was tested using multivariate generalized linear modeling followed by analysis of variance (MGLM-ANOVA), which was performed using the manyglm and anova functions within the MVABUND package (Wang et al., 2012). Given the strong expected effects of species on plant chemistry (e.g., Hegnauer, 1962-1973; Rønsted et al., 2012), PERMANOVAs were run to test for the effects of collection year, altitude and geography on untargeted chemical composition, using the adonis function within the Vegan package. Due to incomplete datasets, the effect of each was tested individually in a PERMANOVA. Furthermore, sample GPS coordinates were converted into principal coordinates of neighbor matrices (PCNM), with the resulting first two principal components (PCNM1 and PCNM2) used within the PERMANOVA testing for geographical effects on the untargeted chemical composition.
FIGURE 2 | Map of collections with shape and color representing species (note four samples for which we had no geographic information are excluded from the map). Maps were constructed using ggmap (Kahle and Wickham, 2013).
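For readers unfamiliar with the Bray-Curtis measure used as the basis for the ordination, the sketch below computes it directly. It is a plain-Python illustration of the standard formula rather than the R code used in the study; the function names and the example abundances are hypothetical. The value computed is the dissimilarity, and the corresponding similarity is one minus that value.

```python
def bray_curtis(x, y):
    """Bray-Curtis dissimilarity between two non-negative abundance profiles."""
    if len(x) != len(y):
        raise ValueError("profiles must have the same length")
    total = sum(x) + sum(y)
    if total == 0:
        raise ValueError("both profiles are empty")
    return sum(abs(a - b) for a, b in zip(x, y)) / total


def dissimilarity_matrix(profiles):
    """Pairwise Bray-Curtis dissimilarities for a list of profiles."""
    return [[bray_curtis(p, q) for q in profiles] for p in profiles]


# Hypothetical abundances of three compounds in three samples.
profiles = [
    [10.0, 0.0, 5.0],
    [10.0, 0.0, 5.0],
    [0.0, 15.0, 0.0],
]
matrix = dissimilarity_matrix(profiles)
print(matrix[0][1], matrix[0][2])  # identical profiles vs. no shared compounds
```

Identical profiles give a dissimilarity of 0 and profiles with no compounds in common give 1, which is why the measure suits compositional data with many absent compounds.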
One-way analysis of variance (ANOVA) was performed to test for significant differences in untargeted chemical richness between species. Meanwhile, mixed linear modeling (MLM) was performed using the lmer function from the package lme4 (Bates et al., 2014). In these models, the significance of the fixed effects (altitude and geography, as PCNM1 and PCNM2) was determined using the drop1 function, which performed likelihood ratio tests (chi-square) whilst accounting for variation associated with the random effect (species).
The targeted dataset consisted of 20 compounds for which standards were obtained as described above (Table 2). The targeted chemical composition was analyzed as before, undergoing nMDS (for visualization), MGLM-ANOVA and PERMANOVAs to assess compositional effects, and ANOVA and MLM to assess differences in chemical richness. Given that standards allowed for reliable quantification, each individual compound also underwent generalized linear modeling (GLM) to test for species effects, with the compound serving as the dependent variable and species serving as the independent variable, using the glm function of the native stats package of R. MLM was performed to test for the effects of year of collection, altitude and geography (as PCNM1 and PCNM2) whilst accounting for potential species effects. Individual compounds from the targeted data served as the dependent variable, species served as the random effect, and either year of collection, altitude, or PCNM1 and PCNM2 (together) served as the fixed effect.

Species Effects
After quality filtering a total of 285 compounds were within the untargeted dataset (Supplementary Table S2). Nearly all samples contained detectable levels of the majority of compounds, with untargeted chemical richness varying from 249 to 283 between samples, with no clear effect of species on the richness (ANOVA; df = 3, F-value = 1.16, P-value = 0.338). However, the untargeted chemical composition demonstrated clear clustering in the nMDS plot (Figure 3A), which proved highly significant (MGLM-ANOVA; Table 3).
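Chemical richness is reported above as a count of compounds per sample. A minimal sketch of one way to compute such a count is given below; treating any value above zero as "detected" is an assumption, as are the function name and the example profile.

```python
def chemical_richness(profile, detection_threshold=0.0):
    """Number of compounds whose abundance exceeds the detection threshold."""
    return sum(1 for value in profile if value > detection_threshold)


# Hypothetical sample with five compounds, two of them undetected.
sample_profile = [12.5, 0.0, 3.1, 0.0, 0.4]
print(chemical_richness(sample_profile))  # 3
```

Richness ignores how much of each compound is present, which is why two samples can have near-identical richness yet clearly different compositions in an ordination.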
Within the targeted dataset, S. officinalis and S. multicaulis consistently contained significantly higher concentrations when compared to the other two species (Figure 4). For example, borneol was the most abundant compound, ranging from 172 to 96,234 ng/µL, and was significantly higher in S. officinalis (31,954 µg/g) and S. multicaulis (21,964 µg/g) than the averages in S. sclarea (436 µg/g) and S. aethiopis (675 µg/g) (Table 2). p-Cymene was the second most abundant compound and was also concentrated in S. officinalis (11,559 µg/g) and S. multicaulis (6,774 µg/g) when compared to S. sclarea (51 ng/µL) and S. aethiopis (47 µg/g). Whilst camphor was present in lower abundance than borneol and p-cymene in S. officinalis (8,291 µg/g) and S. multicaulis (5,273 µg/g), it was also present in comparable quantities in S. sclarea (3,246 µg/g) and in two of the five S. aethiopis samples (average 2,696 µg/g). Terpinolene was also abundant in S. officinalis (2,387 µg/g) whilst being almost absent from the other species. The other 15 compounds were present in relatively low amounts across all samples.

Sampling Age Effects
TABLE 2 | Average percentage (standard deviation) of standard compounds in targeted dataset (monoterpene hydrocarbons, oxygenated monoterpenes, and sesquiterpenes) compared with Hatipoglu et al. (2016) and Raal et al. (2007).
TABLE 3 | Initially, species effects were tested for using multivariate generalized linear modeling coupled with analysis of variance (MGLM-ANOVA). Secondly, PERMANOVAs were performed individually for year of collection, altitude and geography (as principal coordinates of neighbor matrix 1 and 2). Bold text represents significant effects (P < 0.050).
Given the confirmation of species effects on plant chemical composition, we next investigated age effects on the chemical composition. PERMANOVAs for both the targeted and untargeted datasets did not demonstrate a significant age effect on chemistry (PERMANOVA; Table 3). There were, however, two compounds that significantly declined with sample age (MLM; Table 4): α-phellandrene (Figure 5A), which was a very minor constituent, and camphor, which was present in all four species (Figure 5B). The total amount of the compounds in the targeted dataset also significantly declined over time (MLM; χ2 = 4.11, P-value = 0.042) (Figure 5C).
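The chi-square statistic reported for the total targeted concentration comes from a likelihood ratio test. The sketch below shows how such a statistic maps to a P-value, assuming a single dropped fixed effect and therefore one degree of freedom (an assumption; the degrees of freedom are not stated in the text). The function names are illustrative and this is not the R code used in the study.

```python
import math


def likelihood_ratio_statistic(loglik_full, loglik_reduced):
    """Twice the log-likelihood gain of the full model over the reduced one."""
    return 2.0 * (loglik_full - loglik_reduced)


def chi2_sf_df1(statistic):
    """Upper-tail probability of a chi-square variable with 1 degree of freedom.

    For 1 df, P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)) for standard normal Z.
    """
    if statistic < 0:
        raise ValueError("statistic must be non-negative")
    return math.erfc(math.sqrt(statistic / 2.0))


# Statistic reported in the text for the total targeted concentration.
p_value = chi2_sf_df1(4.11)
print(round(p_value, 3))
```

Under the one-degree-of-freedom assumption this gives a value close to the reported P = 0.042, just below the 0.05 threshold used for significance throughout.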

Terpenoid Composition of Salvia Species
The biological properties of essential oil of Salvia officinalis are attributed mainly to α-thujone and β-thujone, camphor, and 1,8-cineole as reviewed by Raal et al. (2007). In the present study of herbarium samples, we also found monoterpenes to be the main fraction of compounds, mainly consisting of borneol, p-cymene, camphor, and to a lesser extent camphene and terpinolene (Table 2). Whereas high levels of borneol and camphor correspond to the findings of Raal et al. (2007), the high levels of p-cymene may reflect some degree of degradation of terpenoids, as p-cymene is often identified in aged oils (Turek and Stinzing, 2013). However, no age effect was observed for p-cymene. The monoterpenes borneol, α-pinene, p-cymene, camphor, and camphene were the main compounds found in S. multicaulis in the present study (Table 2). α-Pinene, borneol, camphor, and camphene are also main compounds reported in other studies of S. multicaulis (e.g., Rustaiyan et al., 1999; Tepe et al., 2004; Morteza-Semnani et al., 2005; Bagci and Kocak, 2008; Karamian et al., 2013; Hatipoglu et al., 2016) in addition to 1,8-cineole, eucalyptol, bornyl acetate, and sesquiterpenes such as β-caryophyllene and germacrene-D.
Among 108 volatile compounds across 45 Turkish Salvia species (Hatipoglu et al., 2016), both total yield and percentage of individual compounds were highly variable between species. α-Pinene, camphene, β-pinene, 1,8-cineole, camphor, and borneol were the main chemical markers of monoterpene-rich species, whereas β-caryophyllene, germacrene-D, β-bisabolene, bicyclogermacrene, caryophyllene oxide, and spathulenol were the main chemical markers in sesquiterpene-rich species. In summary, several studies of essential oil components of different Salvia species have reported the presence of a wide range of compounds, many in trace amounts and others consistently present in higher amounts in specific species.
In the present study of herbarium specimens, we observed a significant species effect using both a targeted and an untargeted approach. Variation in study design, including the number of compounds, geographical origin of samples, samples per species, and plant parts analyzed, makes it difficult to directly compare our findings with the literature. However, in general the number and amount of compounds as well as the main individual chemical compounds found in the present study are also reflected in the literature, confirming that the species profiles we observe across 150 years of herbarium specimens are comparable to modern samples.

Suitability of Herbarium Specimens for Extraction of Plant Metabolite Data
We observed a strong species effect and a weak geographical effect on terpenoid chemical composition of four Salvia species across the Mediterranean and no overall effect of sampling age, although two individual compounds did change with age of samples. These findings suggest that chemical composition is well preserved in herbarium samples over at least 100-150 years and that herbarium samples can be used for chemical screening as well as for preliminary studies of pharmacological activity, which is also supported by previous studies reporting little effect of sampling age on overall chemical profiles (Eloff, 1999; Zangerl and Berenbaum, 2005; Cook et al., 2009; Yilmaz et al., 2012; Jafari et al., 2018). Herbarium samples may therefore provide an untapped resource for preliminary studies as well as for large-scale comparative surveys across taxonomical and geographical ranges. Whereas overall chemical composition appears to be well preserved over 100-150 years, the concentrations of individual compounds may be affected. We would therefore suggest that herbarium samples are well suited for qualitative studies, whereas caution is needed in their use for quantitative studies, depending also on the type of compounds of interest (Rønsted et al., 2017). At a larger scale, a wide and thorough phytochemical screening of herbarium samples from all over the world could give us a better understanding also of chemotaxonomic issues, and comparison with climatic data would allow for testing of long-term environmental impact.
TABLE 4 | Column headers: PCNM2; T-value, P-value; Chi-Square, P-value. Additionally, mixed linear modeling was performed individually for sampling year, altitude and geography (as principal coordinates of neighbor matrix 1 and 2). Bold text represents significant effects (P < 0.050).
Frontiers in Plant Science | www.frontiersin.org

Environmental Effect on Terpenoid Profiles
We observed a geographical effect on chemical richness in the untargeted but not in the targeted dataset of Salvia species across the Mediterranean. Geographic variation had no effect on chemical composition or on individual compounds. Russo et al. (2013) found chemical composition of S. officinalis essential oils to vary depending on environmental factors such as altitude, water availability and soil conditions in Italy. We did not observe an effect of altitude in the present study. However, our sampling covered a wide geographical range and therefore the potential effects of altitude alone may have been confounded by sampling across multiple ranges across such large scales as the Mediterranean. Future studies may disentangle geographical and environmental effects by more intensive sampling across single or multiple altitudinal gradients. However, it should be noted that sampling from herbaria is limited by the number of specimens available in the collection and herbarium specimens are mostly collected as individual specimens rather than representing populations, which limits the prospects of comparative environmental studies.
Many other studies have shown that the yield and chemical composition of essential oils vary with ecological factors and geographical areas (e.g., Uribe-Hernández et al., 1992; Salgueiro et al., 1997; Viljoen et al., 2006; Liu et al., 2015; Rezende et al., 2015; Jaradat et al., 2017). Essential oils serve important biological functions related to environmental adaptation, protection against biotic and abiotic stresses, and pollinator attraction (Jassim and Naji, 2003; Zangerl and Berenbaum, 2005). Therefore, plants of the same species growing under different environmental conditions may differ in the composition of their essential oils in response to different environmental pressures (Weiss and Edwards, 1980; Salimpour et al., 2011). Over evolutionary time scales, unique chemotypes may eventually develop into separate genotypes adapted to different environmental conditions (Verpoorte et al., 2000; Heywood, 2002; Wink, 2003; Beccera, 2015). In addition to further comparison of environmental parameters, future studies may benefit from comparing genetic profiles of samples.
Salvia comprises nearly 1,000 species with distributions spread across the globe (Drew et al., 2017). Several Salvia species are used as food flavoring owing to their content of essential oils as well as in folk medicine for a range of conditions including microbial infections, inflammation, cancer, and malaria (Lu and Foo, 2002;Kamatou et al., 2008;Russo et al., 2013;European Medicines Agency, 2016;Hatipoglu et al., 2016).
Some species like Salvia officinalis are common and widespread, whereas others are narrow and potentially threatened endemics (Wood and Harley, 1989; Viney et al., 2006; Kahraman et al., 2012). A better understanding of the chemical composition of different Salvia species and how their chemical diversity is affected by environmental factors such as altitude can help inform both their local medicinal use and conservation policies and efforts for the threatened species.

CONCLUSIONS
In the present study, freshly collected and herbarium samples of four species of Salvia were used to explore the effect of sampling age on terpenoid chemical profiles. Our results show that chemical profiles are primarily driven by a species effect and to a lesser extent geography. Salvia multicaulis and S. officinalis displayed higher abundance of most compounds than S. aethiopis and S. sclarea. Sampling age had no effect on overall chemical composition in the untargeted approach, and only a slight effect in the targeted dataset. Two of the twenty targeted compounds, α-phellandrene and camphor, significantly declined with sample age.

AUTHOR CONTRIBUTIONS
NR conceived the idea together with IF, CB, and RR. IF sampled the specimens. IF conducted the chromatography-mass spectrometry analysis together with TM and RR. IF analyzed the data together with CB. IF drafted the manuscript together with NR, CB, and RR. All authors read and approved the final manuscript.

ACKNOWLEDGMENTS
We thank Tao Li for advice on analyzing GC-MS data with PARADISe.

SUPPLEMENTARY MATERIAL
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2018.01877/full#supplementary-material
Table S1 | Filtered GC-MS data for all samples of the targeted dataset including occurrence of the 20 standard compounds.
Table S2 | GC-MS results for all specimens in the untargeted dataset of 285 compounds identified tentatively by comparison with the NIST 8.0 library.