Toward Specialized or Integrated Systems in Northwest Europe: On-Farm Eco-Efficiency of Dairy Farming in Germany

Intensive confinement (IC) systems for dairying have become widespread during the last decades. However, potential advantages of alternative systems such as full-grazing (FG) or integrated dairy/cash-crop (IFG) systems with regards to better provision of ecosystem services are widely discussed. To investigate performance and environmental impacts, we compared four prevailing dairy systems using an on-farm research study. The farm types differed in their share of pasture access and quantity of resource inputs: (i) an IC with a high import of supplements and mineral fertilizers; (ii) a semi-confinement (SC) with daytime pasture access during summer and moderate import of supplementary feeds representing the base-line scenario; (iii) a FG based on grazed seeded grass-clover swards with no purchased N-fertilizers and low quantities of supplementary feeds; and (iv) an IFG comparable to FG but based on grass-clover leys integrated in a cash-crop rotation. Results revealed highest milk productivity (16 t energy-corrected-milk (ECM) ha−1) and farm-N-balance (230 kg N ha−1) in IC; however, the highest product carbon footprint (PCF; 1.2 CO2eq kg ECM−1) and highest N-footprint (13 g N kg ECM−1) were found in the baseline system SC. The FG and IFG revealed on average similar forage dry matter yields (10 – 11 t DM ha−1) at similar crude protein and net-energy-lactation ratios per kg DM-intake compared to the IC and SC. The PCF in FG were comparable to IC (0.9 vs. 1.1 kg CO2eq kg ECM−1) but at a lower N-footprint (9 vs. 12 g N kg ECM−1). However, despite low measured N-losses in the FG system, the farm-N-surplus was exceeded by 90 kg N ha−1. A further reduction was only possible in the IFG (50 kg N ha−1) by accounting for a potential N-carry-over from N-rich plant residues to the cash-crop unit, leading to the lowest PCF (0.6 kg CO2eq kg ECM−1) for the IFG, with still moderate milk yield levels (~10,500 kg ECM ha−1). According to this bottom-up approach based on field data, improved integrated grazing systems could provide an important opportunity to increase the ecosystem services from dairy farming, operating with land use efficiencies similar to IC.


INTRODUCTION
Ongoing intensification in agriculture has led in many developed countries worldwide to highly specialized dairy production systems, with declining numbers of farms, larger herd sizes, and increasing milk yields per hectare . In recent years, the spatial distribution of dairy farms has continued to shift into areas with lower land prices and unsuitable conditions for arable crop production, or regions that continue to specialize on animal husbandry due to the poor competition with other cash crop producers on the global market. For instance, in the European Union (EU) half of the livestock units are located on one-third of the agricultural area (Leterme et al., 2019). In the South Island Regions of New Zealand, where historically there were significant shares of oat and wheat production together with animal husbandry, the arable area for cash-crop production declined by 80%; at the same time the stocking rates increased by 150%. This was enabled by converting rain fed to irrigated grassland systems, which are now mainly used for dairying (Ledgard, 2013). As a result, the diversity of agricultural commodities produced on farms in these areas declined and undesired environmental impacts increased, including increases in nitrate concentrations in drinking water (Vogeler et al., 2014), and in ammonia (NH 3 ) volatilization (Fowler et al., 2013), as well as loss of plant, insect and bird species diversity (Kleijn et al., 2009;Ledgard, 2013;Allan et al., 2014), and a decline in natural forest (Ledgard, 2013).
The concept of sustainable intensification has evolved as a response to the environmental challenges associated with agriculture (Davies et al., 2009) and has been a topic of interest in recent years (Garnett et al., 2013;Barnes et al., 2016;Reheul et al., 2017;Struik and Kuyper, 2017). Sustainable intensification involves simultaneously improving the productivity and environmental management of agricultural land (Buckwell, 2014). Closely linked to sustainable intensification is the concept of resource-use efficiency or eco-efficiency (Keating et al., 2010;Taube et al., 2014;Tittonell, 2014;Cook et al., 2015), in which the quantity of resource input, environmental loads, or ecosystem services provided is related to the unit of product (Wilkins, 2008). Ecosystem services can be manifold, however, according to the current common agricultural policy (CAP) of the European union (EU), greenhouse gas (GHG) mitigation and nutrient cycling are of foremost importance in order to tackle the most relevant environmental impact categories (i.e., climate change, eutrophication, biodiversity loss). Critical voices question the applicability of sustainable intensification in Europe by arguing that agricultural systems are already operating at a high production intensity and the concept would be an appropriate strategy for regions characterized by a large yield gap, as for instance in developing countries (Mueller et al., 2012). In regions with highly intensive crop and livestock production, however, sustainable intensification would appear less likely to provide benefits in terms of yield progress when considering the high risk of environmental threats (Garnett et al., 2013;van Grinsven et al., 2015). Some authors are therefore proposing the term "ecological intensification" instead of "sustainable intensification" for use in OECD countries (Godfray and Garnett, 2014). A study by Schiefer et al. (2016), classifying soil biochemical and physical properties as indicators of soil resilience, found only 40% of soils in the European Union (EU-25) to be suitable for sustainable intensification, while on the remaining land de-intensification or conversion from arable to grassland may be warranted. In this context, the term "sustainable extensification" was coined (Bluwstein et al., 2015;van Grinsven et al., 2015).
As one strategy toward ecological intensification, several authors have recommended a paradigm change from highly specialized production systems back to integrated crop livestock systems (ICLS) in order to increase diversity of land use and resource efficiency (Rockström et al., 2009;Godfray and Garnett, 2014). Different levels of ICLS production are currently discussed: (i) integration of crop and animal production by exchanging materials, (ii) complementary exchange of materials with each system taking the production requirements of the cooperatives into account, (iii) temporal and spatial integration on farm-level using fully or partly the same territory, and (iv) the extrapolation of (v) on the regional level (Moraine et al., 2014). A meta-analysis conducted by Peterson et al. (2020) covering data from Australia, North America and South America found that ICLS systems provide higher cash crop yields on fertile sandy loamy soils in comparison to non-integrated systems. This effect was, however, lower, if dual-purpose crops and other soil types were considered. According to Ryschawy et al. (2012) the economic benefit of ICLS depends on the production level and management, as a wide range of management options exists in comparison with more specialized systems. The majority of studies indicated positive environmental effects of ICLS systems (Ryschawy et al., 2012;Moraine et al., 2014;Peterson et al., 2020) due to improved C-and N-cycling among the systems (Lemaire et al., 2015) and consequently a lower demand for external resources. Thus, lower N and phosphate surpluses can be attained. Furthermore, most of the studies found a positive effect on soil organic carbon (SOC) with increased rates of C-sequestration and enhanced soil functioning properties. The latter has mainly been observed when grass or grass-clover was included within the crop rotation (Lemaire et al., 2015;Loges et al., 2018a) often referred to as the ley-phase. The increments of soil C stocks increase with the duration of the ley-phase (Lemaire et al., 2015). For leys, grass-clover swards (usually white clover) are most commonly recommended, as the clover as a forage legume provides additional N through biological nitrogen fixation (BNF). Greenhouse gas emissions, in particular those from the potent GHG nitrous oxide (N 2 O), as well as N-leaching losses, are low from grass-clover leys, even though high amounts of BNF (193-319 kg N ha −1 year −1 ) can be reached (Høgh-Jensen et al., 2004;Reinsch et al., 2020). This indicates there is an effective N-cycling in such systems (Schmeer et al., 2014;Reinsch et al., 2020). Beyond the mentioned ecosystems services from ley systems, there are additional questions on biodiversity that can be addressed with multispecies swards at low rates of mineral N are used. For instance, Ebeling et al. (2008) found a linear trend for pollinator visits with increasing numbers of flowering plant species. Moreover, the introduction of leys in arable-cropping systems reduces the pressure of undesired weeds (Connolly et al., 2018;MacLaren et al., 2019) and consequently the use of agrochemicals, which may thereby allow a further increases in the biodiversity of agricultural land (Holzschuh et al., 2007). However, even greater biodiversity increase can be expected by increasing crop diversity and from the increased landscape heterogeneity of integrated crop livestock systems (Sirami et al., 2019).
Research on alternative production systems for dairying in northwest Europe currently attracts a lot of attention from policy makers and from society in general, because of the environmental effects of mainstream intensive systems as well as the questionable economic performance of dairying since the abolition of the milk quota system. Moreover, producers are facing the challenge on how to re-direct their systems in order to reach the multidisciplinary aims (e.g., compliance with EU-Nitrate Directive, EU-Water Framework Directive, EU-NEC Directive) and to be in line with the EU climate target plan (Green deal) to become climate friendly by 2050 (EC, 2020). Nowadays, dairy husbandry in northwest Europe is the second largest sector in terms of output value from agriculture (Augère-Granier, 2018). The average stocking rate in the intensive dairy producing countries (Netherlands, Germany and Denmark) is >1.4 LU per ha of UAA , and average milk yields per cow are 9,247 litre per year (Eurostat, 2020). To maintain high milk yields per ha for the market, silage production and intensive supplementary feeding has increased in the last decades, at the cost of grazing and low-cost feeding . Additionally, high amounts of mineral fertilizers are purchased to maintain high forage yields and yield stability, despite high amounts of organic manures being available. Consequently, the total amounts of nutrient supplies from organic manures and purchased fertilizers on dairy farms often exceed the demand requirements for on-farm forage production, particularly from seasonal surpluses of manure. This necessitates a high share of exportation of manures to cash crop producers (Oenema et al., 2014). The N in animal manures also has a high potential for gaseous N-emissions, during both storage and application depending on time and technique used (Misselbrook et al., 2000). The farm-N surplus can exceed 200 kg N ha −1 year −1 particularly if there is substantial supplementary feeding (Akert et al., 2020). van Grinsven et al. (2013) calculated a range of social costs from the impact of reactive N on human health, ecosystems and climate of 10-30, 5-20, and 4-17 e per each kg N emitted in the EU-27 states. As one of the consequences, the European Nitrates Directive restricts the total amount of manure N to 170 kg ha −1 year −1 and ban slurry application from autumn and winter in order to reduce the N-pollution of groundwater bodies. This, in combination with the current spatial distribution of highly specialized systems, has led to logistic challenges, with long transport distances for manures, long manure storage durations, and high capacity manure storage facilities needed (Kuhn et al., 2018). This handling of manures increases the potential for NH 3 volatilization, which decreases the N-use efficiency (NUE) of dairy farms to below 30% (Löw et al., 2020). Alternative solutions for dairy farms to use adapted forage crops, such as legumes, are limited as they provide additional N. This biologically fixed N has to be included in fertilizer planning and, as a consequence, more N has to be exported. In addition, farmers are often concerned that considerable yield losses could occur after the adoption of forage legumes, even though several studies have confirmed an effective N-fertilizer replacement value across a wide range of legume-based ley-mixtures (Suter et al., 2015). This, and the high inter-annual yield resilience under current and future climate change make such legume-based ley mixtures even more attractive (Lorenz et al., 2020).
Within the hierarchy of current developments in the dairy sector, there are discussions about whether a trend back to pasture-based production systems would contribute to reduced environmental impacts and more resilient farming systems in northwest Europe (Schils et al., 2019). The revenue of a dairy farm depends on the market price for milk, and the share of home-grown feed and use of external resources, like supplements. Several studies have revealed that the lowest costs per unit dry-matter intake (DM-intake) can be obtained through lowcost full-grazing strategies ensuring grazing almost year-round at a low share of supplementary feeding, as is common in Ireland, Australia and New Zealand. Such grazing strategies are best suited in regions with adequate rainfall and fertile soils, as found in parts of the Netherlands, northwest Germany and Denmark. The lowest feeding cost per liter of milk unit can be generated by a high home-grown forage use efficiency and low rates of supplementary feeding (Dillon et al., 2008). This can be achieved through a high frequency of short grazing intervals (i.e., rotational grazing), as this ensures that forage is offered with high energy and protein contents. Further improvements can be gained by including forage legumes in grass-clover swards, as this reduces costs further, and also can reduce GHG emissions (Li et al., 2011;Nyameasem et al., 2021). In comparison with confinement systems, these low-cost full-grazing strategies need adjustments in the choice of the animal breeds and calving interval. High yielding Holsteins often cannot realize a sufficient dry matter intake under grazing conditions and might be sensitive to variations in the herbage on offer and weather conditions (Heublein et al., 2017). Smaller cows, such as crossbreeds and Jerseys, are often judged as better grazers. Optimal grazing efficiency is often reached by synchronization of high (spring) pasture growth with the peak of lactation. To achieve this, seasonal spring calving with short calving intervals is the predominant calving pattern in pasture-based milk production.
However, such management results in lower productivity per animal, as well as per hectare of farmland as long as "imported land" is not included. This makes the overall environmental efficiency questionable, as it is often highlighted that high milk yields per cow result in a lower product carbon footprint (PCF). A negative relationship between the PCF and milk yield was found in a variety of studies (Christie et al., 2012;O'Brien et al., 2014;Zehetmeier et al., 2014). For example, Christie et al. (2012) explained 64% of the variation of the PCF by milk yield per cow, and Zehetmeier et al. (2014) explained 55% (Holstein Friesian cows) and 30% (Fleckvieh cows) of the milk PCF by the production system. Gerber et al. (2011) found that at energy corrected milk (ECM) yields below 2,000 kg per cow and year, an increase of milk yield provides a strong reduction of emissions per unit product, while above 6,000 kg the emissions stabilize. While the total energy requirement per cow increases with an increase in milk yield, the amount required for maintenance remains unchanged. This results in a decline of the proportion of total energy used for maintenance and the total energy requirement per kg of milk produced (Capper et al., 2009) in a ceteris paribus scenario (e.g., same number of lactations per cow). However, since it is very likely that higher milk performance per cow coincides with a reduced number of lactations, optimum levels of PCF do not increase linearly with milk performance. Considering more intensive systems with milk yields >5,000 kg energy corrected milk (ECM) per cow and year, Lorenz et al. (2019) found in a meta-analysis that low-cost full grazing systems show no disadvantages in the PCF in comparison with intensive confinement systems.
The suitability of ICLS systems for dairy production has, so far, been poorly evaluated on a regional level without taking high production intensity, such as for dairying in northwest Europe, into account. The insertion of leys into the crop rotation will also provide benefits for cash crop producers (e.g., nutrient provision, soil water retention, and control of pests) and also help ensure mutual outputs such as climate regulation and biodiversity, regardless of whether the ley is used for silage production, cut and carry, or grazing (Martin et al., 2020). However, where soil conditions allow grazing as a low resource-use practice the ley-phase should be sought to reach the maximum potential of ICLS.
Thus, as part of the debate on the development of more sustainable dairy systems within the frame of ecological intensification approaches that ensure appropriate production levels on the one hand, and additional ecosystem services on the other, four model scenarios along the hierarchy of current and possible systems in northwest Europe can be defined: intensive-confinement (IC), semi-confinement (SC) with a limited grazing period, full-grazing systems (FG) conducted on seeded permanent grassland, and an integrated full grazing system (IFG) predominately conducted on leys (Figure 1). In accordance with the meta-analysis from Lorenz et al. (2019) the thresholds between IC, SC, and FG systems are set by the forage intake from pasture. In the IC systems animals have no allowance for pasture, while in SC <50% of the feed intake is from pastures and a minimum 25% is from supplementary feeding, whereas in FG and IFG systems a minimum 50% of feed intake is from pastures with a maximum share of 25% from supplements. The main advantages of an IFG is the possibility of the inclusion of leys in a cash-crop arable system to accelerate C and N accumulation in the soil as a result of BNF from grass-clover leys and animal-N excreta; and in terms of forage provision the benefits of continuous progress in forage plant breeding, as expressed as high energy and protein values similar to concentrate feed, can be gained by frequent renovation of the seeded leys. Moreover, leys offer the opportunity to include alternative species such as forage herbs that may benefit the digestibility and forage intake of grazing animals (Loza et al., 2021) at low N 2 O emissions from pasture land (Nyameasem et al., 2021). Lorenz et al. (2019), moreover, noticed that the efficiency of the proposed dairy systems depends on the farm management and that the evaluation of such systems needs accurate estimates of the forage efficiency. Accordingly, we decided to choose a set of representative farms together with the regional commercial dairy extension service, which fulfilled the above-mentioned thresholds of feed allowance, and to investigate them over 2 years using a bottom-up approach. Even though on-farm research approaches often lack the statistical accuracy provided in a randomized experimental design, they offer valuable insights into practical management approaches, which may be shown to be economical resilient over successive years. Moreover, to gain a representative picture on the effects on C-and N-cycles, systems have to be established over years before the long-term impacts can be investigated. This situation is difficult to duplicate in randomized plot experiments for dairy systems, as (i) dairy cattle have a large requirement for land, especially if fundamentally different production systems are compared, and (ii) long-term investigation would be necessary in order to take account of altered chemical soil properties that occur as a result of the management. Thus, all farms in our study had already been established for several years and were located on the same soil type. Productivity and environmental impacts such GHGemissions and N-leaching from forage crop production were measured for two years and the PCF, NH 3, and farm N-balance were calculated. The following hypotheses have been made: • Increasing intensification in dairy causes higher emissions and high N-surpluses per ha and per kg of milk in comparison with semi-intensive systems that include grazing. • Full grazing is more advantageous in terms of the mentioned ecosystem services, in particular low-cost systems without high use of external resources. • The latter can be further improved by ICLS.

Research Area
The study was conducted in the eastern part of Schleswig-Holstein, northern Germany. The soil type in this region is predominantly sandy loam, and highly suitable for arable production with high yields of both cereals and forages Struck et al., 2019;Biernat et al., 2020b;Reinsch et al., 2020). Among the federal states of Germany, Schleswig-Holstein has the highest stocking density of dairy cattle (at 69 per km 2 ; compared to average of 27 per km 2 ) which may be attributed to its climate being well-suited to forage production. Moreover, farming practices are highly specialized toward either cashcrop production or dairying, with low interaction among these systems. This specialization results in high farm-N surpluses and high nitrate loads to the groundwater, particularly in areas where the highest numbers of dairy stock are located Biernat et al., 2020). The region has a maritime climate with moderate long-term average temperatures of 8.9 • C and annual rainfall of 737 mm. From the 1970s to 2019 the number of dairy farms in the region dropped from 8,000 to below 4,000 (Destatis, 2017). During the same period, the average milk yield per cow has increased from ∼6.000 to ∼9.000 kg ECM, with high variability in milk yields depending on system and specialization.
Nowadays, most cows in the region are kept inside year-round, receiving together with grass, maize-silage (at 6 kg DM cow −1 day −1 ) and additional high inputs of supplementary concentrate feed (2.7 t DM cow −1 year −1 ) (LKSH, 2019). The average herd size is 150 cows and average annual milk yield is 8 920 kg ECM per cow. Access to full-pasture land is low as a proportion of the total number of dairy cows (<9%). Within this region and yield levels, we identified four prevailing farm types: (i) intensiveconfinement (IC), (ii) semi-confinement (SC), (iii) full-grazed (FG), and (iv) integrated-full-grazed (IFG), with a gradient in the share of grazing and resource inputs ( Table 1). The SC farm type represents the average production conditions in this region, and its production data are comparable to data from officially published farm surveys. The annual released farm survey data cover 430 representative farms across the country (LKSH, 2019). In SC the animals had, in addition to grass and maize silage, access to pasture during daytime from May-September on the land surrounding the cowshed. The IC system was chosen from the top 64 farms in this area according to the milk yield records (>10,000 kg ECM cow −1 year −1 ). The cows were confined yearround. Forage in IC was provided from grass and maize silage only. The FG and IFG systems were identified on the same soil type and these systems are not commonly practiced in this region. Thus, these farms acted as an alternative in comparison with the business-as-usual scenarios (IC and SC). The proposed grazing management conducted in FG and IFG followed the principles of rotational grazing adopted from systems as used in Ireland and New Zealand. For instance, the farm manager of FG was part of an international knowledge transfer network working on improved rotational grazing management. The IFG collaborated intensively with the Irish research center Teagasc . Both the FG and IFG implemented rotational grazing on diverse grass-clover swards. Stocking rates for dairy cows on pastures in the FG and IFG were adjusted in accordance with non-destructive aboveground biomass measurements (AGB) obtained with a yield-plate-meter (Trott et al., 2002) to ensure an optimal forage allowance. The access to pasture ranged from 0 to 292 days year −1 among the systems. The N application from mineral fertilizer and slurry was on average 247 (IC) and 182 kg N ha −1 (SC). In FG and IFG the N-fertilization was provided by BNF and N-excreta from grazing animals. The farms IC, SC, and FG were exclusively specialized on dairy and forage production, whereas IFG was part of an ICLS. In detail, forage was offered as grazed grass-clover leys. These leys were plowed after two full grazing years (spring of the third year), and were followed by cash crops to benefit from the carry-over effect of N (N carry−over ) from the grass-clover leys. As a result, the share of permanent grassland was lowest in IFG and highest in FG. New leys in IFG were established as an understorey in winter wheat prior to the first production year, with a seeding rate of 29 kg ha −1 (70% Lolium perenne, 20% Trifolium pratense, 10% Trifolium repens). In the FG system, arable grass was renovated every 5 years, of which 4 ha were plowed in late summer followed by a catch crop for winter grazing for heifers. The remaining arable grass was renovated in spring and reseeded at a rate of 30 kg ha −1 (80% Lolium perenne and 20% Trifolium repens). Permanent grassland management and silage production was conducted according to the local recommended practice comprising 3-5 silage cuts on grassland  and maize seeding at the beginning of May and harvest at silage maturity at the beginning of October (Komainda et al., 2018). Where swards were grazed, the residual biomass after each of 8-10 grazing cycles per year was clipped when necessary and the clippings left as a mulch.

On-Farm Research Design
Each of the four dairy farms (representing systems IC, SC, FG, and IFG) was observed for 2 years during the period 2011-2019. The farms were selected on the basis of the above-mentioned production data representing for SC and IC the average and best 25% with regards to management and milk yield in the state. The grazing systems represent systems that are not common for the time being, but proposed promising alternatives, in comparison with confined systems. The IC and FG systems were observed during the years 2010 and 2011, and farm SC from 2014 to 2015. The IFG was observed from 2017 to 2019; however, because of heavy droughts during 2018 the data for this year were discarded as they were not comparable to the other study years. Weather data were recorded at weather stations from Germany's National Meteorological Service (DWD) close to the experimental sites. Average rainfall and daily temperature during the vegetation period (April-Sept.) and experimental years were 434 mm (SD 82) and 14 • C (SD 0.5). According to simulation runs for a 4 cut silage system with the grass growth model FOPROQ (Torssell and Kornher, 1983;Herrmann et al., 2005), used by national recommendation services to estimate the optimal cutting date for grass silage in the state, the average grass silage yields within the climatic region of the analyzed farms showed only small differences for the simulated yields [∼11 t DM ha −1 (SD 0.7)], indicating comparable climatic conditions during the different vegetation periods and experimental years. Soils on the investigated farms were classified as sandy loams, with the texture ranging from 56 to 76% sand, 16 to 29% silt, and 7 to 16% clay. For IC and SC the most common breed in the research area Holstein-Friesian was used. In the FG and IFG systems farm managers chose cows with a lower weight to ensure early and late grazing could take place with a minimum of sward damage from trampling.

Farm Management Data
Common farm management and herd data were collected throughout the study period. These data contained herd information (e.g., replacement rate, weeks of lactation, milk yield), farm management data (e.g., area of land, land-use, soil tillage intervals, and grassland management) as well as purchased resources by the farm manager (e.g., supplementary feeds, fertilizer, agro-chemicals).

Forage Productivity
The aboveground biomass (AGB) on grassland was measured by hand clipping with shears prior to the silage cut or grazing event.
The sampling area was 0.25 m 2 (with ten replicates per paddock) and the sward was cut leaving a stubble height of 50 mm. For all mown grassland there were 3-4 cuts, and for the pastures 3-11 rotations including silage cuts per experimental year according to the conducted grassland management of the different systems.
The AGB for the pastures were further defined as AGB access . In addition, enclosures were installed on the pastures to measure the biomass growth during grazing events (AGB regrowth ). After grazing the AGB residues (AGB residues ) were quantified by hand clipping as described above, but leaving 40 mm stubbles. The pasture-intake by grazing animals was calculated as: For maize, ten plants in a maize row were harvested at silage maturity and cut at a stubble height of 20 cm. This procedure was replicated three times on each ha of maize. A subsample of ∼100 g for grass and 1,000 g for maize, respectively, was taken to determine plant dry-matter (DM) content after drying in an oven at 58 • C until constant weight. In addition, subsamples of the grass-clover swards were separated into the fractions grass and clover. Prior to forage quality analysis, dried subsamples were ground in a two-step procedure: first passing a 5-mm (Retsch, GmbH, Haan, Germany) and subsequently a 1-mm screen (FOSS, GmbH, Rellingen, Germany). Forage N content and net-energy-lactation (NEL) were estimated by Near-Infrared-Reflection-Spectroscopy with a NIRsystems 5000 scanning monochromator (FOSS, Silver Spring, Maryland, USA). The total crude protein (CP) was calculated by multiplying a factor of x 6.25 to the respective N-content. The average daily DM-intake of feed by cows was estimated according to the equation from Gruber et al. (2004), which takes into account the milk yield, the share of different forage types, the MJ NEL kg DM −1 and CP-content of feeds as variables. The average milk urea nitrogen (MUN) was calculated as a function of the average CP in forage (Spek et al., 2013) and DM-intake, whereas the daily N-excretion (N ex ) was estimated by daily DMintake, its CP-content and live-weight following the approach by Nennich et al. (2005).

N 2 O-Emissions and N-Leaching
Fluxes of N 2 O on each field and dairy system were measured with the static closed chamber method. The minimum sampling frequency was once a week in all crops, taken between 10 a.m. and 12 p.m. In a pre-treatment, four collars for maize and grassland cut for silage and ten collars for pastures (d = 60 cm, h = 15 cm), made from polyvinyl chloride (PVC), were installed into the soil to a depth of 10 cm. The chambers were placed at uniform distances between the replicates in order to capture a representative area across each forage crop (avg. investigated field size 0.75 ha). The collars were removed during maize and grass-cut harvesting and tillage operations but remained in the grazed fields. During the flux measurements collars were closed gas tight with white PVC chambers (d = 60 cm, h = 35 cm). Gas samples were taken at 0, 20, 40, and 60 min after closure through a gas-tight septum on the top of the chamber using a 30 ml syringe. Samples were directly transferred into 12 ml preevacuated septum-capped vials (Labco, High Wycombe, UK). In the first 2 weeks after fertilizer application and grazing events, measurements were conducted more frequently at irregular intervals but at a minimum of two times per week. Gas samples were analyzed for N 2 O and CH 4 through a gas chromatograph (SCION 456-GC, Bruker, Leiderdorp, Netherlands). The change of gas concentration in the chamber headspace during the measurement period was calculated by linear regression.
To determine N leaching to the groundwater over the winter period, soil water samples were taken using ceramic suction cups (Mullit, pore size 1 µm, length 54 mm, diameter 20 mm., ecoTech. Bonn, Germany). Sixteen ceramic suction cups were installed during the two experimental years and per crop at a depth of 75 cm and at an angle of 60 • to minimize preferential flow and sampled from October to March. Prior to the first sampling date the suction cups were installed 6-months before the first sample was taken to ensure adequate time for soil settlement. To collect free drainage water a vacuum of 0.4 bars was applied to all suction cups. Soil water samples were obtained weekly until April. During sampling, four of the respective sixteen suction cup samples were mixed leading to four samples in total each week from every field. Leachate samples were stored at −20 • C until analysis. Concentrations of total-N were determined photometrically using a dual channel continuous flow analyzer (SKALAR Analytical Instrument, Breda, the Netherlands). The amount of percolating water was calculated by a climatic water balance model, using weather and soil data gathered from the experimental site, actual evapotranspiration (Mohrlok, 2009), and specific crop coefficients (Löpmeier, 1994;Häckel, 1999) to correct evaporation.

Product Carbon Footprint (PCF)
The PCF for milk production of the four contrasting dairy farms was calculated using measured data for N 2 O as direct emissions and N-leaching as an indirect source for N 2 O-emissions from land-use. Additional indirect N 2 O emissions from NH 3 volatilization in the cowshed were calculated according to Burgos et al. (2010). The emission factor (EF) for NH 3 volatilization from grazing animals were based on a review analysis of Sommer et al. (2019). Other gaseous N-emissions during manure application followed the methodology of the IPCC guidelines (IPCC, 2006;Mogensen et al., 2014). Methane emissions from ruminal digestion was calculated according to Schils et al. (2007).
In order to account for the on-farm soil carbon changes (SOC) of the tested production systems a simple approach developed by Petersen et al. (2013) was used. In this approach the different crops and management systems are compared to a reference system to estimate potential soil carbon changes. For the reference system a continuous long-term experiment with winter-rye cultivation, without any manure and fertilizer amendments, was chosen (for details see Merbach et al., 2000). Carbon inputs from roots and exudates were calculated as a ratio of AGB according to Taghizadeh-Toosi et al. (2014). For the various crops, different land-use factors (grassland:1, arable grass: 0.93, cropland: 0.8) and soil tillage factors (grassland:1.1, arable grass: 1.1, cropland: 1) were utilized, according to IPCC (2006). Emissions from external resources were adapted from the ecoinvent (vers. 3.3) data basis (econinvent, 2016). Estimates for the required off-farm cropland were based on the purchased supplements by the farm manager. Land requirement for soybean meal were calculated according to 10-year-average yield values (2007-2017) of soybean in Argentina (FAOSTAT, 2017), assuming an oil content of 20% and losses during the milling process of 2%. Supplementary feed is made from coproducts from the production of plant-based oils, starch and sugar. Imported feeds with relevant by-products, such as oils and sugar were considered on the basis of the mass allocation for soy, canola, sunflowers and sugar beet (Dalgaard et al., 2008).
For the slurry exported from dairy farms for cash crop production on other land, it was assumed that every exported kg N would replace the equivalent of one kg mineral fertilizer. To follow the concept of consequential LCA production, costs for ammonium nitrate were used (ecoinvent 3.3) as the GHG savings for exported nitrogen in organic manures. Transport distances to cash crop producers and related GHG emissions were assumed to be negligible and ignored in this calculation. In the IFG, grass-clover was used as the forage base and the residual N in stubble and roots were considered as an export. The N from both above-and belowground crop residues was considered, with an equivalent of 0.51. This factor is based on observations from Huss-Danell et al. (2007) from a N-partitioning experiment on red-clover grassland. To calculate the global warming potential (GWP) per ha, the respective value for each trace gas over a lifespan of 100 years was used (CO 2 = 1, N 2 O = 265, CH 4 = 28) and expressed in CO 2 -equivalents (CO 2eq ). The efficiency of the systems with regards to climate change was calculated relatively, on the basis of the functional unit energy corrected milk (ECM), according to Sjaunja (1990).

Ammonia Emissions and Farm-N-Balance
Ammonia emissions per kg energy-corrected-milk (ECM) were calculated in accordance with the emission factors described above [section Product Carbon Footprint (PCF)]. The onfarm N-balance was calculated by the sum of nitrogen inputs and deduction of nitrogen outputs at farm-gate. For nitrogen inputs all purchased mineral fertilizers, straw, seeds and their respective N-contents, as well as biological nitrogen fixation (BNF) were considered. The BNF was calculated using the fraction of AGB derived from clover yield together with the modeling approach of Høgh-Jensen et al. (2004) and the Ncycling on pastures derived from N ex . The N deposition from rainfall was used in the N-balance calculation, with a long-term annual average N deposition (1989)(1990)(1991)(1992)(1993)(1994)(1995)(1996)(1997)(1998)(1999)(2000)(2001)(2002)(2003)(2004)(2005) in this area of 12.5 kg N ha −1 year −1 .
Nitrogen outputs consisted of milk and meat. The nitrogen export of milk was estimated by dividing the protein yield by 6.38 (ISO, 2014). For dairy cows the live-weight with a carcass weight of 46% was used. The protein content of carcass was assumed as 19% and converted to nitrogen using the factor 6.25. The annual meat export was calculated on the basis of the replacement rate. This also included the export of calves from the farm. After deduction of slurry demand on farm and gaseous-N losses during storage the surplus of slurry-N was taken as export-N. Fertilization management on the farm was considered to be in accordance with the farm management records.

Forage Productivity
Measured average forage yields on farm were comparable at 10-11 t DM ha −1 year −1 . The average CP contents differed, with 16, 15, 21, and 21% of DM in the IC, SC, FG, and IFG. However, when considering forage imports CP in the feed ratio increased to 20 and 16% CP for IC and SC, whereas CP slightly decreased in FG and IFG, due to the lower CP contents in the supplementary feed. Average net-energy lactation in forage was 7 MJ NEL kg DM −1 in all systems. NEL was mainly provided by silage maize in IC and SC and highly digestible grazed grassclover in FG and IFG. Feed imports provided additional energy in total, in particular in SC, IC, and IFG, but without changing the average energy content of 7 MJ NEL kg DM-intake −1 . Calculated daily DM-intake showed a range of 17-24 kg day −1 for dairy cows. Lowest values were found in IFG and FG and highest in IC ( Table 2). 2 | Measured average DM-yields, energy-(MJ NEL kg DM −1 ) and crude-protein (CP) contents from forage production and DM-intake (forage + supplements) from the two experimental years, and the predicted milk urea (MUN) contents and annual N-excretion (N ex ) per cow among the different systems (IC, intensive-confinement; SC, semi-confinement; FG, full-grazed; IFG, integrated-full-grazed).  Gruber et al., 2004, d Spek et al., 2013, e Nennich et al., 2005.

N 2 O-Emissions and N-Leaching
Measured annual N 2 O emissions were in the ranges of 0.7-4.7 in IC, 1.1-11.9 in SC, 0.9-2.6 in FG, and 0.1-0.7 kg N 2 O-N ha −1 year −1 in IFG (Figure 2). Highest mean annual N 2 O emissions were measured in the SC system, which comprises N-inputs from mineral and organic fertilizers as well as N-excretion from grazed pastures. The N 2 O-emissions from the grazed grass-clover swards that received no additional N fertilization were generally low, showing maximum annual emissions of 1.6 kg N 2 O-N ha −1 year −1 in FG. The IFG system showed similar emissions on permanent grassland, but had lower emissions on grazed arable grass-clover leys in comparison with FG. Higher emissions in the grazed arable grass systems in FG were mainly a result of the grazed catch crops during winter prior to grassland sowing in spring (data not shown). Annual N 2 O emissions from silage maize were elevated in comparison with grassland in the IC and SC systems, with average emissions of 4.3 kg N 2 O-N ha −1 year −1 . In comparison, grassland used for silage revealed 1.7 kg N 2 O-N ha −1 year −1 . The total N-fertilization from mineral fertilizer and cattle slurry applied was on average 191 kg N ha −1 year −1 for silage maize and 284 kg N ha −1 year −1 for the permanent grassland utilized by mowing in IC and SC (Figure 2). Thus, emission factors for N-applied, calculated from the measured N 2 O emissions, were higher for silage maize (2.3% of each kg Napplied) and slightly lower for mown grassland (0.6%) compared to the current EF of the IPCC guidelines. For grazing systems, the IPCC takes an EF of 2% of excreted-N into account. Annual N-deposits excreted by cows during grazing were calculated as 131 in FG and 141 kg N ha −1 year −1 in the IFG. The calculated EF values were 1.2 and 0.6% in FG and IFG and thus higher for FG; however, they are in accordance with the IFG as currently shown in the IPCC refinement report for other N-inputs in wet climates. Nitrogen leaching was on average over the two experimental years 50, 25, 15, and 10 kg N ha −1 year −1 for SC, IC, FG and IFG. High N-leaching losses were observed in particular for Nfertilized arable red-clover grass swards (73 kg N) and silage maize (37 kg N) in SC. For N-fertilized permanent grassland and arable grass in the SC and IC system, leaching losses of 31 kg N ha −1 were measured. Non-N-fertilized and mown permanent grass-clover revealed 16 kg N ha −1 ; the grazed permanent grassclover showed leaching losses of 24 kg N ha −1 , and grazed arable grass-clover swards revealed slightly lower losses of 21 kg N ha −1 . However, N losses in the latter were dependent on sward age with increasing increments of N-losses for each additional year of grazing management (data not shown).

Product Carbon Footprint (PCF)
The largest contribution to the total global warming potential (GWP) per ha in all systems was from enteric CH 4 emissions, as a result of ruminal digestion. In addition, with energy expenditure for milking and animal housing, the share was 45 and 44% in the IC and SC system ( Table 3). Due to lower milk yields and lower share of other emission sources, the enteric fermentation had a larger contribution in the grazing systems, FG and IFG, and accounted, on average, for 66% in both systems. Taking the feed imports as well as the inputs of fertilizers and seeds into account, forage production showed the second largest share of emissions with 26, 28, 14, and 14% of GWP in the IC, SC, FG, and IFG systems. The third largest contributor in the IC and SC was manure management at the farm facilities. With increasing share of grazing (0% IC, 22% SC, 70% FG, and 80 IFG% expressed as a percentage of days on which they were grazed per year) the importance of manure management on the total GWP declined from 23 (IC) to 9% (IFG). Greenhouse gas emitted from the young stock showed a range of 6-14% of total GWP, with lowest shares in IC and IFG due to lowest replacement rates (see Table 1). Soil carbon sequestration reduced the GWP in all systems, as the high share of grass in the forage crop rotation, together with slurry inputs, led to positive soil carbon balances. Highest sequestration rates were observed in the IFG system due to the high acreage of 2-year grazed grass-clover leys ( Table 3). Calculated annual sequestration potential was 0.24, 0.36, 0.47, and 0.56 t C ha −1 across the systems IC, SC, FG, and IFG. With the exception of SC all farms are eligible to receive credits for the substitution of mineral fertilizers by slurry exports. However, there were large differences among the systems with highest exports in IC ( Table 3). The accounting of credits for BNF as a byproduct was only applicable in the IFG, as leys were part of ICLS. On the other farms N circulated only in the forage crop system leading to a potential excess of N (see section Farm-N-Surplus and Losses of Reactive Nitrogen). The PCF for each kg ECM produced was in the range of 0.9-1.3 kg CO 2 eq with lowest values in IFG and highest in SC. Taking soil carbon sequestration and consequential impacts due to greenhouse gas (GHG) crediting into account, the PCF was reduced in all systems but was still lowest in IFG (Table 3).  (Huss-Danell et al., 2007). a Aerial deposition included (12.5 kg N ha −1 year −1 ), b Høgh-Jensen et al., 2004, c Huss-Danell et al., 2007

Farm-N-Surplus and Losses of Reactive Nitrogen
The farm N-balance declined from 229 to 50 kg N ha −1 year −1 in the order of IC, SC, FG and IFG. In the IC and SC systems, highest imports at farm gate were provided by mineral fertilizer and feed imports, whereas N-imports by supplements exceeded 200 kg N ha −1 year −1 in the IC due to a high share of soybean meal. In the FG and IFG, biological nitrogen fixation (BNF) had, with 130 and 121 kg N ha −1 year −1 , the largest contribution to N-inputs, leading to comparable total average N-inputs per ha of 121 kg N ha −1 year −1 . Milk provided the largest N-export from the farm in IC (69%), SC (91%), and FG (60%) systems. However, these figures were different for the IFG, showing a share of 40%. Exports from BNF in the form of plant residual matter was only applicable in the IFG system, and this influenced the farm-N-balance positively ( Table 4). Total predicted and measured losses of reactive N from the farm were generally in accordance with the calculated farm-N-balance, showing slightly lower values ( Table 5). NH 3 emissions from manure storage and N-application were the largest contributors with a share of 66-85% to the total N-losses. The total NH 3 -emissions amounted to 160 kg N ha −1 year −1 in IC, whereas in SC, FG, and IFG these losses were reduced by 37, 72, and 70%. The range of N-leaching on total losses was 13-31%, with highest losses in IC and SC. Measured N 2 O-emission were with 2-3% of minor relevance in total. The N-footprint per kg milk produced was highest for SC and lowest for IFG (Table 5).
Considering the calculated N-balance (Table 4) and leaving the N-carry-over in IFG aside, 17, 23, 32, and 50% of N-losses of the farm N balance were not accounted for in the IC, SC, FG, and IFG, respectively. However, when accounting for soil C sequestration, with the site-specific C/N ratio, N accumulation in the soil amounted to 24, 36, 47, and 57 kg N ha −1 . This reduced the percentage of N not accounted for to +7, +5, −17, and −1%. The relative differences of environmental impacts compared to IC indicated a 18 and 43% lower PCF; 59 and 78% lower farm-N-balance, and 72 and 71% lower NH 3 -emissions per ha in FG and IFG. SC showed a 11% higher PCF and 13 and 38% lower N-balance and NH 3 volatilization.
Milk yield per ha on the farm was 15,817, 11,512, 7,420, and 10,485 kg ECM ha −1 , resulting in a land requirement of 0.6, 0.9, 1.3, and 1.0 m 2 kg ECM −1 in IC, SC, FG, and IFG. However, taking imported land from purchased supplementary feeds into account, this relative perspective changed to 1.2, 1.2, 1.4, and 1.3 m 2 kg ECM −1 . With increasing milk yields per ha the farm Nsurplus increased but did not show a clear linear trend among the systems. In contrast, the GHG emissions per ha correlated clearly positively with the N surplus on farm ( Figure 3A). However, lowest N-surplus does not necessarily provide the lowest PCF ( Figure 3B).

Forage Productivity
On sandy loamy soil, high productivity of forage crops can be achieved, particularly in areas with temperate maritime climate and evenly distributed rainfall Struck et al., 2019). The most important driver is the nutrient availability, which is mainly constrained by N (Biernat et al., 2020b). In high-input dairy systems, N is provided by slurry and purchased mineral fertilizers, whereas in low-cost systems the use of external N resources can be avoided by using forage legumes and their ability for BNF . The N available on farm, and consequently the total N in slurry, is further increased by imported N-rich feeds. Consequently, the forage DM yield in the different observed dairy systems did not differ significantly, as there was sufficient N available either through artificial Nfertilizer imports (IC and SC) or BNF (FG and IFG) on grassland. Nevens and Reheul (2003) reported from a 31-year ley-arable trial, on the same soil type as used in our study, that grazed leys provide similar amounts of forage to those of old permanent grassland swards, even at lower rates of fertilizer application. Low yield responses of pastures to N-fertilization were also reported elsewhere and can be explained by the N-return from grazing animals (Viljoen et al., 2020). This effect is more likely in grassclover swards, where forages with high N contents are offered, leading to a higher N status in the soil during the grazing period. Moreover, the high yields from the observed leys, without any disadvantage to permanent grassland, can be explained by the chosen forage legume-based seed mixture, which can lead to benefits in herbage growth and subsequent yield (Nyfeler et al., 2011;Lüscher et al., 2014). Environmental factors are the largest explanatory variables that affect yield differences between years. Changes in the climate in northwest Europe are anticipated in the coming decades, and in response to these changes grassland productivity may increase by 11% by 2050 (Höglind et al., 2013). However, severe droughts will also become more likely, reducing the yield in those years considerably below the long-term average. In the IFG system, the leys were established as an understorey prior to the first production year. The rooting system of the established red-clover mixture allowed a higher yield resilience to droughts, in comparison with that of pure grass or grasswhite clover swards (Lorenz et al., 2020). This is particularly important in this research area, where periods with lower rainfall often occur between May and June, increasing the inter-annual yield volatility.
In contrast to the very similar DM yield of the various systems, the observed forage systems differed in their ratio of CP and energy. The forage production in the IC and SC systems were dominated by grass and maize silage providing a ratio of 20-22 (g CP MJ NEL −1 ), whereas FG and IFG showed an outstanding ratio of 31. However, with addition of the supplementary feeds, the ratio increased to 28 in IC but remained constant in SC as a result of a disadvantageous feeding strategy. Thus, the low-cost grazing systems (FG and IFG) produced a high share of CP on their farm itself, mainly as white and red-clover having a CP content, which is slightly lower but with a high potential for supplementary feed substitution at moderate milk yield losses (Schulz et al., 2018). However, improvements in feeding with balanced CP/NEL ratios across the grazing season become even more important in FG and IFG, as the forage quality is determined by the cultivation strategy of grass-clover leys and the typical annual growth patterns. Seeded grass-clover leys can show a high biomass accumulation by the forage legume components, which are determined by a higher proportion of grass in the total yield during spring, and a lower proportion of grass and more legume in late summer and autumn (Lorenz et al., 2020). Typically, this can lead to a surplus of ruminal-N and elevated milk urea contents in the second half of the year, therefore requiring use of energy-rich supplements necessary to maintain an optimal ruminal-N-balance and to reduce the risk of high urea nitrogen contents (Selbie et al., 2015), and also making NH 3 -volatilization (Burgos et al., 2010) and N-leaching more likely (Cichota et al., 2012). In the IFG system, this was balanced by a moderate import of starch-rich components like cereals, which increases the milk yield per cow and ha compared to FG. In the FG system the swards were renovated every 5 years but were not part of an integrated crop system. Thus, the control of the desired sward composition was more challenging as renovated permanent white-clover grassland swards can show a high share of legumes 2-5 years after renovation, thereby producing a surplus of BNF and CP . This was present by highest CP-contents in offered forage and highest annual average milk urea nitrogen (MUN) in FG ( Table 2) as result of a lack of supplementation of energy. Thus, with regard to the feeding strategy rotational grazed grass-clover swards can generate high yields, which can compete with intensively fertilized forage systems in terms of energy-and protein yields. However, the absence of moderate supplementation may cause undesired feed-backs on animal health and milk yields, and therefore a balanced CP to energy ration should be sought.

N 2 O-Emissions and N-Leaching
The measured N 2 O-emissions on the different farms showed typical patterns throughout the year, with high emission peaks during winter and after N-application (data not shown). Thus, in the N-fertilized arable and permanent grassland sites the calculated EF was close to the recommended value of the IPCC guidelines (IPCC, 2006), due to the dominance of annual emissions from peaks shortly after application. Higher emissions from silage maize in comparison with grass have already been reported elsewhere; this is due to high application rates of slurry during spring, which is incorporated into the soil prior to maize sowing (Struck et al., 2020). Under these conditions, with high amounts of N and easily decomposable organic matter, high N 2 O-fluxes from soil heterotrophic denitrifies are likely. These can further be accelerated by additional N from soil organic matter mineralization induced by soil tillage activities (Struck et al., 2019(Struck et al., , 2020. In the grazed systems, the annual emissions were significantly lower than the former IPPC default factor advised (IPCC, 2006); however, they are in accordance with the refined factors currently released for wet areas (IPCC, 2019). Such lower emission factors have also been reported from other grazing experiments in South Africa (Smit et al., 2020) and New Zealand (van der Weerden et al., 2020) as well as on the same experimental site over a long gradient of plant diversity and environments (Nyameasem et al., 2021). In addition to N-excreted by grazing animals, BNF was a major N-input in the IF and IFG systems. Even without the use of mineral Nfertilizers positive N-balances were achieved, particularly if the proportion of forage legumes within the ley is high . However, symbiotically fixed N is mostly captured in plant tissues, such as leaves and roots, but it becomes available erratically, depending on soil moisture, temperature, and soil and residue management. At the same time grasslands can show a large quantity of C and N sequestration in the soil, thereby avoiding the majority of N-losses Reinsch et al., 2018a). Moreover, the plant uptake of decomposed N is efficient in low-input systems characterized by low N 2 Oemissions (Schmeer et al., 2014) and N-leaching . Nevertheless, if BNF is not accounted for in N-fertilizer planning, N-leaching losses over the drainage period are likely to be increased, due to mineralization of the plant residues, in particular in temperate grassland areas where air temperatures often remain above 5 • C during winter. This might explain the high measured N-leaching losses under red clover-grass swards in the SC system in both experimental years, where in addition to the predicted BNF of 305 kg N ha −1 there was also a N-fertilizer rate of 80 kg N ha −1 . However, in the grass-clover swards of FG and IFG only minor leaching losses were measured, which is in line with several other studies elsewhere on comparable soil types Biernat et al., 2020b). Under grazing situations N is frequently returned in the excreta. The higher availability of N in grazed swards changes the sward composition in comparison with mown red clover-grass toward one with a higher share of grasses with the consequence of a higher root length density, measured on the same site (Chen et al., 2016). This accelerates the N-uptake by the sward, which may have improved the N-efficiency of the grazed grass-clover system in FG and IFG in comparison with SC. Moreover, the SC system showed the highest N-footprint of all farm types leading to an acceleration of N in the system, which increased the level of total measured N-losses in all forage crops in comparison with IC, FG, and IFG. In this context, the farm management is of critical importance, as N-losses on the farm are exacerbated by disadvantageous feeding strategies, manure management and management of forage production.

Product Carbon Footprint (PCF)
Several studies have proposed that GHG emissions per product unit are negatively correlated with increasing milk yield per cow (Lesschen et al., 2011). Other studies mentioned that this has to be further differentiated by the production system used (Rotz et al., 2010;Zehetmeier et al., 2014;Lorenz et al., 2019). In confinement systems, large amounts of resources, usually imported, lead at some point to a compensation of reduced GHGemissions per product unit on farm. In comparison, low-cost systems generate their milk yield exclusively from the on-farm produced forage, which in turn leads to lower milk yields per ha on-farm but also to a lower use of resources, thus contributing negatively to the PCF. Lorenz et al. (2019) found that a milk yield of 6,000 kg ECM per cow −1 and year −1 provided by lowcost grazing is not disadvantageous in comparison to that of a confinement system, which uses a high share of imported supplementary feeds in order to achieve milk yields of 10,000 kg ECM per cow −1 and year −1 . This can be achieved by lower GHG emissions at the same level of feed-efficiency (kg of ECM produced per kg of DMI) (Drews et al., 2020). This was also demonstrated in our study, where IC and FG showed a similar PCF despite the large differences in milk yields per cow and the good herd performance in IC with a low replacement rate. Lorenz et al. (2019) further found that an increase in milk yield in grazing systems would show a higher GHG mitigation per kg ECM in comparison with additional milk yield increases in confinement systems, as the latter are already operating on a high production level. In our study the difference in PCF between FG and IFG accounted for 200 g CO 2 eq per kg ECM milk, which is in line with the values reported by Lorenz et al. (2019), who predicted a GHG mitigation of 120 g CO 2 eq due to an increase of animal performance by ∼1t ECM cow −1 year −1 for low-cost grazing systems. With regards to the supplementary feed intake, the FGI showed a higher share compared to FG with 18 vs. 3%. Thus, it can be assumed that the FGI system, within its defined thresholds, is almost producing on its lowest level the potential PCF. In contrast, the mitigation potential by milk yield increases in IC was estimated to be 60 g CO 2 eq kg ECM −1 , which provided already 35% of the daily DM-intake by supplementary feeds. The SC system showed the highest PCF in our study as a result of the poor feeding strategy and high N-losses. Using the production parameters on farm as an indicator, the SC system represents a business-as-usual scenario, with a milk yield and replacement rate comparable to a larger farm survey in this area (Drews et al., 2020). In comparison, the IC, FG and IFG represent more specialized systems of either intensive confinement or intensive grazing. These farms showed lower replacement rates and higher forage quality, indicating that specialized systems have higher management skills ensuring a higher resource efficiency and hence a lower PCF.
Further GHG mitigations were achieved on all farms when soil C-sequestration was also considered. Permanent grassland and leys offer the opportunity to sequester carbon, which on sandy loam soils has been reported to be as high as ∼4.8 t CO 2 eq ha −1 year −1 . However, frequent soil tillage due to grassland renovation or land-use change to other forage crops can substantially reduce the potential Csequestration to a lower level Reinsch et al., 2018a). Thus, after 2 years of ley farming most of the sequestered carbon can be lost during the following arable phase, if non-appropriate soil tillage practices are used. This somehow increases the uncertainty in evaluation ICLS  and potentially overestimates the sequestration rate in IFG as a high share of leys were present. However, the same uncertainty is given for permanent grassland in IC and SC as old swards with a high external N-input are likely to represent a C source rather than a considerable sink (Poyda et al., 2021). Additional C-sequestration was achieved by the return of animal excreta, regardless of whether slurry was applied or the deposition of dung from grazing animals were considered, the recycling of C and nutrients increases biomass yields and thus higher allocation of C plant residues. Loges et al. (2018a) found that young permanent grass-clover swards, when managed under cutting and fertilized with a moderate application rate of cattle slurry, will increase the C-sequestration by about 500 kg C ha −1 in comparison with management without slurry-N . This rate is in line with a maximum predicted sequestration rate of 560 kg C ha −1 in IFG. Even though for permanent systems the duration of highly positive sequestration rates is limited, even after 20 years no equilibrium is reached on sandy loamy soils (Reinsch et al., 2018. The soil Csequestration on external farms due to the exported slurry was ignored in this study, thereby likely underestimating the GHG savings from manure exports, which was only accounted for by crediting the substitution of mineral fertilizer using the nutrient contents in the slurry as a reference. Further crediting was applied for the IFG as plant mineralizable-N can be transferred to the integrated cropping system by using the same piece of land. The efficiency of plant removals is dependent on the soil tillage management and the post-cropping systems, whereas removal of leys in spring guarantees a higher N-use efficiency (Biernat et al., 2020b). However, plowing of grass-clover leys can cause distinct N 2 O-peaks as N can become enriched in the upper soil layer at warmer temperatures that coincide with low rainfall during spring . Consequently, such credits have to be taken with care, as the potential substitution of mineral fertilizers, which we accounted for in the subsequent cropping system, relies on minimal N-losses. Costa et al. (2020) reviewed 3,180 articles on the effect of integrating legumes within crop rotations and found considerable knowledge gaps in taking the legume-N carry-over effect into account in lifecycle-assessment (LCA) studies. They recommend that full crop rotations should be evaluated rather than to focus on only one crop. This applies also to ICLS and further research is needed to apply evidence-based results of cash crop producers (Biernat et al., 2020a,b) to our evaluated dairy systems. However, we argue that this credit gives the maximum threshold of a best-practice approach achieving a PCF+soil carbon+credits of 0.6 kg CO 2 eq kg ECM −1 , indicating a mitigation potential of 600 g CO 2 eq kg ECM −1 along the gradient of SC>IC>FG>IFG on sandy loamy soils. It has to be noted that the PCF calculation did not include emissions from produced infrastructure (e.g., machinery, cowshed). In grazing systems there is lower requirement for capital goods, of which the FG and IFG systems would further benefit in terms of their resource use. However, these investments need only to be considered for a specific duration of PCF calculation and therefore it depends on the age of the investments and their lifespan. Therefore, estimates on capital goods are difficult and are not influenced by the management strategy in the short term, and for these reasons are often not considered in LCA studies (Yan et al., 2011).

Farm-N-Balance and Losses of Reactive Nitrogen
The region where the study was conducted currently accounts for around 1 Mio. ha of agricultural land, which is dominated by specialized intensive arable cropping and dairy systems. Taube et al. (2015) estimated that in districts in this region, where dairy units are typically located, the average farm-N-balance exceeds 150 kg N ha −1 . Several programmes were released to counteract such high N-surpluses. The main foci during the last years have included fertilizer planning recommendations, legal adjustments in the fertilizer regulation, and efforts to strengthen the N-exports from animal husbandries to the regions with intensive cash-crop production. However, latest reports show that these attempts are not successful in reducing the N-surplus in the dairy sector significantly (Henning and Taube, 2020). The main reason for this is that animal manures are prone to high N-losses by NH 3 -volatilization. Thus, the majority of Nlosses in the IC and SC systems occurred due to the manure management. Mitigation can be achieved on the farm by manure storage covers and low emission spreading techniques (Maris et al., 2021) or slurry acidification (Seidel et al., 2017). However, mitigation of NH 3 -would increase the N-content in applied slurry, which in turn decreases the possible application amount of manure on the farm itself and thus requires export, which is often limited by large transport distances. Another option is to reduce the amount of mineral fertilizer-N. This can be sought by the mentioned efficient use of animal manures (which in the EU is restricted to 170 kg N ha −1 ) and the use of legumes. Even though BNF is an efficient way to capture N in crops, it represents an N-input in the system, which has to be considered in the fertilizer planning. Otherwise, the farm-N-balance cannot be improved significantly. The Baltic Marine Environment Protection Commission (HELCOM) further suggests that the share of grazing should be increased to reduce NH 3 pollutants to marine bodies. Due to these explained causalities, the farm-Nbalance and total N-losses decreased with the share of grazing and reduced fertilizer inputs. Accordingly, the nitrogen-use efficiency (NUE) for IC, SC, FG and IFG is 34, 24, 40, and 72%. These figures are in the range found by other authors (Löw et al., 2020), although Löw et al. (2020) also found slightly increasing NUE for confined systems in comparison with grazing. Their finding mainly relies on their particular circumstances, as lowcost grazing systems using forage legumes as the N source were not taken into account, making mineral N-imports of ∼178 kg N ha −1 necessary. In contrast, in our investigated system FG and IFG showed self-sufficiency of N as a result of BNF. Despite the absence of mineral fertilizer use and high share of grazing the FG systems still revealed a farm N-balance of 94 kg N ha −1 . However, due to the high share of pastures a high share of the farm surplus is related to N stock changes in organic matter, rather than to losses of reactive N to the environment, which were quantified as <65 kg N ha −1 year −1 in FG. With regards to the calculated NH 3 -losses on pastures the current IPCC factor of 0.2 for Nexcretion and slurry was applied. The majority of NH 4 -N, which after its deposition is prone to NH 3 -voltalization to the air, is present in the excreted urine. The DM content of urine (∼1% DM) is low in comparison to slurry (∼8% DM) ensuring a fast infiltration into the soil, where further NH 3 -losses are avoided. However, using the milk urea content as a proxy, highest values were found for the FG system, making a potential overestimate of NH 3 questionable. Further reductions could be achieved by an optimized feeding strategy (see section Forage Productivity) or urease inhibitors. Nevertheless, limits in further reduction are reached as N-residuals enriched by BNF accumulate, if high shares of legumes are sought. Further mitigation can be achieved in the ICLS, where the carry-over effect of legume-N can be used by subsequent crops as illustrated in the IFG.

Pros and Cons of the Examined Systems
In our comparison of four potential dairy production systems based on farm management data, measurement results and empirical methods, the IFG systems as examples for and ICLS (level iii) approach performed best with regards to the environmental impacts per ha and per product unit but also on a very high-level regarding land use efficiency. At the same time adequate milk yields and forage yields were generated, which were comparable to SC systems. This was mainly provided by the low resource use of external inputs and high forage yields from grazed grass-clover leys. The additional credit of N-carry-over to a potential cash-crop system increased these benefits further and might have positive effects on the cashcrop system as well, if the N status in the soil as a result of the advantageous pre-crop (2 years of leys) are taken into account in the fertilizer planning of the cash crop producers. In comparison, farms that are focused exclusively on dairy, such as IC, SC, and FG, are not capable of reducing their N-surplus below 90 kg N ha −1 because of the continued N-accumulation in the soil and volatile N-losses. Comparing the environmental impacts from specialist pure confinement or full-grazing, the IC system showed on the one hand a higher N-surplus and N-footprint per ECM milk compared to the low-cost FG, but only slight differences in the PCF. This implies a higher negative impact on a regional scale (e.g., on groundwater and surface water quality) in the IC but the differences considered on a global scale (i.e., potential influences on climate change) are negligible. However, several uncertainties have to be considered as the environmental impacts from external resources, e.g., imported supplementary feeds, can show wide differences depending on its origin (e.g., deforestation issues), which cannot be controlled at the regional level or by the producers themselves, indicating a higher uncertainty of results for IC and SC compared to FG. Nevertheless, according to the results obtained here and in the context of sandy loam soils, further improvements at regional and global levels can only achieved if N and C-cycles are coupled better by integrating dairy farming into an ICLS (Soussana and Lemaire, 2014).
Highlighting the disadvantages of an ICLS in combination with low-cost grazing, it has to be considered that the milk yield per ha was considerably lower compared with that of the IC. This can lead to income losses, though this will depend on the milk price and feed costs. At present, specialist IC systems in combination with large herd sizes (economy of scale) may have a higher potential to compete financially with the world milk market. In comparison, average dairy farms such as SC currently having a negative turn-over (after deducing of feed costs) (LKSH, 2019). However, low cost-grazing systems provide forage at low costs resulting in high revenue per kg milk solid (White et al., 2002). Continued technical innovation increasing animal performance from grazed grass, increasing herd genetic potential and developing labor efficient lower fixedcost systems are essential in this context (Dillon et al., 2008). Moreover, dairy production in the EU also relies on subsidies.
With regards to the current CAP, environmental indicators (e.g., GHG emissions, farm N-surplus, agro-biodiversity, and animal welfare) are becoming of increasing importance in order to achieve the additional public payments (EU, 2021). This may, in turn, increase the farm revenue considerably, if environmental services linked to dairy production are fulfilled. Beyond these policy developments, there is also a continuous increase in consumer acceptance of sustainably produced dairy products, and increased readiness by consumers to pay a higher price for labeled produce that reflects this (Kühl et al., 2017). However, it also has to be highlighted that specialized confinement systems are less dependent on soil physical and soil chemical properties, as the utilization of grass and maize by cutting is generally easier than managing intensive rotational grazing. In addition, high grazing yields can be only maintained if soils and swards are not easily damaged by overgrazing or trampling, even when high stocking rates are necessary to fulfill the economic requirements for productivity per ha. Moreover, water availability during summer plays a crucial role as yield losses due to drought would make undesired feed purchases necessary, and thereby reduce the efficiency of the proposed grazing systems. Irrigation as an alternative solution would require investment and processing cost as well as questionable resource efficiency in areas where access to groundwater supplies is limited (Emadodin and Reinsch, 2018). Moreover, irrigation in grazing systems can increase undesired GHG emissions from pasture land considerably (Smit et al., 2020). Therefore, specialized IC systems will continue to play an important role in the future on the export markets, and structural change is expected to continue to disadvantage the SC system. There are additional constraints, if ICLS systems for dairy are sought at FIGURE 4 | Overview of different ecosystem services and productivity provided by the different land-use types and resource efforts in the investigated prevailing dairy systems.
high use intensities. The claimed high milk yields at balanced CP/NEL ratios can make supplementary feeding necessary. On the one hand they could be produced locally; for instance, in the IFG system cereals for energy supplementation were provided in the cash-crop unit. On the other hand, however, increasing land consumption would accelerate local competition for food vs. feed. In comparison, most confinement systems in northwest Europe import a high share canola meal, which is a by-product of the oil industry (Broderick et al., 2015). van Hal et al. (2019) mentioned that the use of by-products, which are unsuitable for direct human consumption, increases the overall efficiency of a system even in the LCA perspective. Thus, the evaluated IFG system in our study needs further improvements as more then 80% of the supplementary feeds were provided from locally grown cereals. More maize grown for silage instead of cereals as an additional crop rotation segment might be an appropriate option regarding land use efficiency; however, on the basis of this study integrated dairy systems in the cash-crop sector may provide several positive opportunities (Figure 4) and should be implemented into the local policy.

CONCLUSION
Considering the environmental goals of the EU (e.g., "Green Deal") for the agricultural sector, there is a need to identify best strategies for agricultural land use linked to dairy systems.
An important question concerns the current dominant high input/high out models, and whether they are appropriate for the land-use challenges of minimizing carbon and nitrogen footprints and for maintaining appropriate levels of biodiversity. The on-farm research study on four farms representing different strategies for land use and milk production on sandy loam soils in northwest Germany has confirmed that ongoing intensification in the dairy sector is not in line with the need for reductions in GHG and nitrogen emissions per kg ECM produced on the regional level and thus not in line with ecological intensification. Full-grazing systems in areas with adequate rainfall and appropriate soil conditions offer the opportunity to improve the overall efficiency but only when low input systems are sought. However, the farm-N-balance as well as impacts on climate change can be improved further if integrated systems are favored. This requires exchanging by-products and soil fertility in the context of a local spatial distribution and by making all arable systems more resilient. This study provides a first step toward further analysis and extrapolation regarding the economic resilience of the proposed system in the future.

DATA AVAILABILITY STATEMENT
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.

AUTHOR CONTRIBUTIONS
FT, RL, and TR: conceptualization. TR and CL: writing original draft preparation. CK and TR: data analysis. IV, FT, CSM, and RL: writing-review and editing. CK: visualization. FT: supervision. FT and RL: funding acquisition. All authors contributed to the article and approved the submitted version.

FUNDING
This project was partly financed by the European Union Interreg IVA programme, Resource Efficiency and Management Optimization in Dairy Farming (47-2.2-09) and from SusAn, an ERA-Net co-funded under European Union's Horizon 2020 research and innovation program, Grant Agreement no. 696231.