DATA REPORT article

Front. Environ. Sci.

Sec. Interdisciplinary Climate Studies

Volume 13 - 2025 | doi: 10.3389/fenvs.2025.1599188

This article is part of the Research TopicNew Challenges for Baltic Sea Earth System ResearchView all 6 articles

The BALTEX dataset of GRDC.the Global Runoff Data Centre (GRDC). A dataset of river discharge draining into the Baltic Sea

Provisionally accepted
Simon  MischelSimon Mischel*Claudia  FärberClaudia FärberHenning  PlessowHenning Plessow
  • Federal Institute of Hydrology (BfG), Koblenz, Germany

The final, formatted version of the article will be published soon.

Due to its separation from the Atlantic Ocean, the Baltic Sea is a special marine environment, with a unique brackish water regime (Meier et al. 2022). Its salinity setting is an important driver for the biogeochemical functioning and the ecological status of the Baltic Sea (Kulinski et al. 2022). Global and climate change, however, alter the hydrological cycle including river discharge, which is responsible for most of the freshwater input into the Baltic Sea (Meier et al. 2022).One of the major studies in the Baltic Sea was the Baltic Sea Experiment (BALTEX) as part of the Global Energy and Water Cycle Experiment (GEWEX), initiated in the 1980s under the World Climate Research Programme (WCRP). A key objective of GEWEX was to improve the understanding of heat exchange, energy, and water processes within the Earth's climate system at global, regional, and local scales (Raschke et al., 2001, Reckermann et al., 2011). For this purpose, the first river discharge dataset of BALTEX was compiled and made available by the Swedish Meteorological and Hydrological Institute (SMHI) through the BALTEX Hydrological Data Centre (BHDC, BALTEX, 1994). In 2015, the BALTEX database, comprising data from 592 monitoring stations, was transferred to the Global Runoff Data Centre (GRDC) to ensure long-term support and regular updates as part of GRDC's ongoing responsibilities providing river discharge time series, even beyond the BALTEX active phase.The aim of this data report is to provide an overview about the BALTEX dataset and its extension in order to provide a complete as possible hydrological river discharge database for the Baltic Sea. Similar databases have been collected by Mikulski (1970), who reported the inflow of river water to the Baltic Sea in the period 1951-1960. A more comprehensive database was compiled by Bergström and Carlson (1994), who provide more than 200 stations with monthly data draining into the Baltic Sea, covering the period of 1950-1990 and around 86 % of the total drainage area. They also analyzed the major 10 rivers and divided the data into 6 major sub-basins of the Baltic Sea. We compare the mean discharges for 30 large rivers in the research area with values presented in the Baltic Sea Model Intercomparison Project (BMIP, Väli et al., 2019).The data collected and the update provided by GRDC is useful for estimating and analyzing the various components of the water balance in the Baltic Sea, particularly the runoff and discharge contributions from surrounding landmasses. Large-scale hydrological models of river flow to the Baltic Sea exist and the data contributed to advances in research on the regional hydrology of the Baltic Sea, as well as developing different climate change scenarios of the Baltic Sea (Reckermann et al. 2011). With the BALTEX dataset models like E-HYPE (Donelly et al., 2016) can be further verified and evaluated.The Baltic Sea has a total surface area of 377,400 km² with a volume of 21,200 km³ (Figure 1). The five major sub-basins are the Bothnian Bay, the Bothnian Sea, the Gulf of Finland, the Gulf of Riga and the Baltic Proper. Following the definition of the Baltic Marine Environment Protection Commission (HELCOM, 2014) and Bergström and Carlson (1994), stations draining to the Danish Sound and Kattegat are included due to its significance for the water exchange of the Baltic Sea.Depending on the literature the total drainage area varies between 1,729.000 and 1,739.000 km² (Bergström and Carlson, 1994;Nilsson, 2006). The Baltic Sea basin stretches over 14 countries and different climatic zones, from subarctic climate in northern Finland to the temperate, continental climate in southern Poland (Raschke et al. 2001). During winter, the Baltic Sea freezes to various extend (HELCOM, 2018). At the Baltic drainage basin, river discharge is culminating at the Danish straits, with about 450 km³ per year, while 50% of the discharge is measured during the melting season only.Figure 1: The Map showing the catchment of the Baltic Sea, and its sub-basins and(see Table 1 for more details). Black dots mark the stations of the BALTEX dataset of GRDC., the red triangles mark stations with time series before 1900 and yellow dots represent 30 large rivers (see Table 2 for more details).The GRDCfoot_0 , operating under the auspices of the World Meteorological Organization (WMO), has the aim of supporting and promoting research in the field of global and climate change as well as integrated water resources management. GRDC provides in situ river discharge timeseriestime series, with daily or monthly resolution. To date, the total number of available stations in the database of GRDC has increased to nearly 11,000.Since the integration of the BALTEX database into GRDC in 2015, the curation and update for the BALTEX dataset is embedded in the regular work of GRDC. Most of data is obtained from the National Meteorological and Hydrological Services (NMHS) of the WMO member states. The time series data are verified,main quality-assuredcheck is the responsibility of the NMHS, who provide the data. GRDC performs an outlier test and internally checkeda visual quality-check prior to the inclusion into the GRDC database. (Färber et al., 2024). If the NMHS provide only daily data, the mean monthly values are calculated by GRDC. Routinely, the data is requested each year. Where possible, data provision service via application programming interfaces (API) are used. With this approach, the number of stations in the Baltic Sea catchment area of the BALTEX dataset was expanded by 126 stations, receiving regular, up-to-date time series updates. Since not all countries have productive APIs, data 3 provision is partly still delayed between 2-5 years, due to the time-consuming verification process. As soon as the data Iis available to GRDC, we provide the data is provided to the user. An example of a daily time series available for download is presented in Figure 2. The time series data are available in various formats from the GRDC data portalfoot_1 . The user can request the data for each station individually or by downloading the whole BALTEX dataset using the BALTEX catalogue provided at the GRDC websitefoot_2 or using the metadata catalogue provided in the Supplementary Material (SuppTable 1).supplementary material (Supplement Table 1). Together with the time series data, extensive metadata are attached to the files and catchment polygons for each of the stations are available. The BALTEX dataset is published under the GRDC data policy and WMO Resolution 25 (Cg-XIII-1999). Individual data licenses are reported in Supplement Table 2.To allocate the GRDC stations to the Baltic Sea catchment, the HELCOM GIS layer 4 available at the HELCOM metadata Cataloguecatalogue, was used, which includes the Baltic Sea's sub-basins drainage areas, aligning with Bergström and Carlson (1994), resulting in a total of 718 stations.Figure 1 shows the location of all GRDC stations that are included in the BALTEX dataset. Table 1 provides an overview of the number of stations in each sub-basin and their spatial and temporal coverage. There are 718 stations distributed across all sub-basins, which cover a total area of 1,737,019 km² and fallsfall within the range of the drainage area of the Baltic Sea mentioned above (see Chapter 2.1). The stations are equally distributed over the entire sub-basins upon availability. The GRDC station catchments cover in total 84% of the HELCOM basin layer. Important to note is that coastal areas are largely excluded and catchment areas are calculated for the location of the station.One limiting factor for the spatial coverage of the dataset is that GRDC avoids to include stations that are influenced by tides. Therefore, stations directly at the river mouth are often excluded, aswhen they contain negative discharge or unreliable discharge values. Another factor is the inability to determine the station catchment area for 34 of the 718 stations, as certain catchments cannot be accurately represented by the elevation models used for catchment delineation. The methodology used to generate the GRDC station catchments is described in Färber et al. (2024).Table 1 shows that the Baltic Proper has the highest station density, with a total of 287 stations, while in contrast, the Gulf of Riga has only 66 stations. All sub-basins have time series data available until 2024, except for the Gulf of Riga, which has records up to 2023. The oldest timeseriestime series is in service beginning insince 1807, with a total of ten stations in the whole Baltic Sea catchment with data before 1900. The average daily time series length for all sub-basins is 59 years, while the average monthly time series length is 49 years. Therefore, most of the data covers the BALTEX active phase and beyond.Table 1: Area of the sub-basins covered by stations of the BALTEX dataset and average time series length as well as the range, e.g., the maximum start and end years of the available time series.Figure 23 illustrates the availability of mean daily and monthly averages, represented in blue and light blue, respectively. As shown, a few time series reach back to the 19 th century. These exceptionally long time series are also geographically well distributed, as indicated in the penultimate column of Table 1.Overall, the number of available time series increases until the mid-20th century, with a noticeable increase in the 1950s, largely due to the incorporation and data provision for many Polish time series during this period. Data availability peaks towards the late 20th century but steadily declines after 2000. The two observed peaks are due to time series from Lithuania and Belarus, covering short periods of 2 and 6 years, respectively.Figure 23: The number of stations in the BALTEX dataset, separated into daily and monthly time series.The BALTEX database provides a viable source for river discharge data in the catchment area of the Baltic Sea. In the BALTEX dataset all available data is provided, even small catchments and short timeseriestime series are included. In total, there are 718 stations with daily and monthly data available, ranging between 1 and 215 years, the catchment size ranges vary from sub-square kilometer up to 218.000 km². Monthly timeseriestime series are separated between original timeseriestime series and calculated timeseriestime series. The latter is calculated using daily averages. All of the sub-basins draining into the Baltic Sea are represented with different numbers of total stations. To date, this dataset is the most comprehensive dataset of river discharge data draining into the Baltic Sea. It is freely and unrestricted available.Comparing the BALTEX dataset with available data is shown in Table 2. The datasets are in good agreement. As the original datasets of BMIP and BHDC are not available, small differences might occur due to a different selection of stations.Using the values from the BALTEX dataset and the 30 longest rivers allows the calculation of loads when combining the databases from other datasets, for example from the Global Freshwater Quality Database GEMStatfoot_3 .According to the data policy of GRDC, the dataset is freely and unrestricted available, with identified access. Commercial usage is prohibited and the user need to acknowledge the source of the data. The complete data policy can be found at the website of GRDC.The whole dataset is available for download in the framework of the GRDC regular data provision via the download portal of GRDCfoot_4 . The data is covered by the data policy of GRDC, the data is freely and unrestricted available, with identified access. The commercial usage is prohibited and the user need to acknowledge the source of the data. The complete data policy can be found at the website of GRDC 7

Keywords: Baltic Sea, river discharge, Catchment areas, time series, in situ stations

Received: 24 Mar 2025; Accepted: 13 Jun 2025.

Copyright: © 2025 Mischel, Färber and Plessow. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Simon Mischel, Federal Institute of Hydrology (BfG), Koblenz, Germany

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.